CN113651892A - Tlr8基因人源化的非人动物及其构建方法和应用 - Google Patents
Tlr8基因人源化的非人动物及其构建方法和应用 Download PDFInfo
- Publication number
- CN113651892A CN113651892A CN202110802477.1A CN202110802477A CN113651892A CN 113651892 A CN113651892 A CN 113651892A CN 202110802477 A CN202110802477 A CN 202110802477A CN 113651892 A CN113651892 A CN 113651892A
- Authority
- CN
- China
- Prior art keywords
- tlr8
- human
- leu
- gene
- seq
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 101100260751 Homo sapiens TLR8 gene Proteins 0.000 title claims abstract description 47
- 101150100822 TLR8 gene Proteins 0.000 title claims abstract description 47
- 238000010276 construction Methods 0.000 title claims abstract description 29
- 101000800483 Homo sapiens Toll-like receptor 8 Proteins 0.000 claims abstract description 327
- 102100033110 Toll-like receptor 8 Human genes 0.000 claims abstract description 180
- 239000002773 nucleotide Substances 0.000 claims abstract description 135
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 135
- 102000045720 human TLR8 Human genes 0.000 claims abstract description 107
- 239000013598 vector Substances 0.000 claims abstract description 29
- 239000003814 drug Substances 0.000 claims abstract description 21
- 229940079593 drug Drugs 0.000 claims abstract description 20
- 241001465754 Metazoa Species 0.000 claims abstract description 17
- 238000012216 screening Methods 0.000 claims abstract description 13
- 230000007246 mechanism Effects 0.000 claims abstract description 3
- 108090000623 proteins and genes Proteins 0.000 claims description 88
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 67
- 108010076504 Protein Sorting Signals Proteins 0.000 claims description 51
- 150000001413 amino acids Chemical class 0.000 claims description 50
- 210000004027 cell Anatomy 0.000 claims description 46
- 108020004414 DNA Proteins 0.000 claims description 40
- 238000000034 method Methods 0.000 claims description 33
- 230000008685 targeting Effects 0.000 claims description 29
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 17
- 201000010099 disease Diseases 0.000 claims description 15
- 210000001519 tissue Anatomy 0.000 claims description 15
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 13
- 238000003780 insertion Methods 0.000 claims description 12
- 230000037431 insertion Effects 0.000 claims description 12
- 230000001086 cytosolic effect Effects 0.000 claims description 10
- 238000012217 deletion Methods 0.000 claims description 10
- 230000037430 deletion Effects 0.000 claims description 10
- 108700024394 Exon Proteins 0.000 claims description 9
- 239000002299 complementary DNA Substances 0.000 claims description 8
- 210000000056 organ Anatomy 0.000 claims description 8
- 238000006467 substitution reaction Methods 0.000 claims description 8
- 230000000857 drug effect Effects 0.000 claims description 7
- 238000011161 development Methods 0.000 claims description 5
- 238000004519 manufacturing process Methods 0.000 claims description 5
- 230000008569 process Effects 0.000 claims description 5
- 238000011160 research Methods 0.000 claims description 5
- 230000001225 therapeutic effect Effects 0.000 claims description 5
- 125000000539 amino acid group Chemical group 0.000 claims description 4
- 108020004999 messenger RNA Proteins 0.000 claims description 4
- 108010074708 B7-H1 Antigen Proteins 0.000 claims description 3
- 102000008096 B7-H1 Antigen Human genes 0.000 claims description 3
- 238000012356 Product development Methods 0.000 claims description 2
- 239000002260 anti-inflammatory agent Substances 0.000 claims description 2
- 229940124599 anti-inflammatory drug Drugs 0.000 claims description 2
- 230000000259 anti-tumor effect Effects 0.000 claims description 2
- 239000002246 antineoplastic agent Substances 0.000 claims description 2
- 229940041181 antineoplastic drug Drugs 0.000 claims description 2
- 210000005260 human cell Anatomy 0.000 claims description 2
- 230000001900 immune effect Effects 0.000 claims description 2
- 230000002906 microbiologic effect Effects 0.000 claims description 2
- 230000000144 pharmacologic effect Effects 0.000 claims description 2
- 102000017420 CD3 protein, epsilon/gamma/delta subunit Human genes 0.000 claims 1
- 108050005493 CD3 protein, epsilon/gamma/delta subunit Proteins 0.000 claims 1
- 101710089372 Programmed cell death protein 1 Proteins 0.000 claims 1
- 206010028980 Neoplasm Diseases 0.000 abstract description 26
- 238000010171 animal model Methods 0.000 abstract description 21
- 238000010363 gene targeting Methods 0.000 abstract description 3
- 238000012827 research and development Methods 0.000 abstract description 3
- 208000026278 immune system disease Diseases 0.000 abstract description 2
- 239000002547 new drug Substances 0.000 abstract description 2
- 230000006801 homologous recombination Effects 0.000 abstract 1
- 238000002744 homologous recombination Methods 0.000 abstract 1
- 108700026220 vif Genes Proteins 0.000 abstract 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 44
- 241000283984 Rodentia Species 0.000 description 27
- 241000699670 Mus sp. Species 0.000 description 20
- 102000002689 Toll-like receptor Human genes 0.000 description 17
- 108020000411 Toll-like receptor Proteins 0.000 description 17
- 241000700159 Rattus Species 0.000 description 16
- 101100260752 Mus musculus Tlr8 gene Proteins 0.000 description 14
- 206010061218 Inflammation Diseases 0.000 description 13
- 239000012634 fragment Substances 0.000 description 12
- 230000004054 inflammatory process Effects 0.000 description 12
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 11
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 10
- 238000011577 humanized mouse model Methods 0.000 description 10
- 108010034529 leucyl-lysine Proteins 0.000 description 10
- 108010057821 leucylproline Proteins 0.000 description 10
- 238000005215 recombination Methods 0.000 description 10
- 230000006798 recombination Effects 0.000 description 10
- 239000000523 sample Substances 0.000 description 9
- 241000282693 Cercopithecidae Species 0.000 description 8
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 8
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 8
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 8
- 241000283973 Oryctolagus cuniculus Species 0.000 description 8
- 210000004899 c-terminal region Anatomy 0.000 description 8
- 239000003550 marker Substances 0.000 description 8
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 7
- 238000001514 detection method Methods 0.000 description 7
- 238000010362 genome editing Methods 0.000 description 7
- 210000001616 monocyte Anatomy 0.000 description 7
- 210000004881 tumor cell Anatomy 0.000 description 7
- -1 at least 10 Chemical class 0.000 description 6
- 230000014509 gene expression Effects 0.000 description 6
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical group NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 5
- 241000880493 Leptailurus serval Species 0.000 description 5
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 5
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 5
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 5
- 210000000349 chromosome Anatomy 0.000 description 5
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 5
- 102000004169 proteins and genes Human genes 0.000 description 5
- 230000001105 regulatory effect Effects 0.000 description 5
- 210000000952 spleen Anatomy 0.000 description 5
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 4
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 4
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 4
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 4
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 4
- 238000011740 C57BL/6 mouse Methods 0.000 description 4
- RONJIBWTGKVKFY-HTUGSXCWSA-N Gln-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O RONJIBWTGKVKFY-HTUGSXCWSA-N 0.000 description 4
- 241000282412 Homo Species 0.000 description 4
- 101000669402 Homo sapiens Toll-like receptor 7 Proteins 0.000 description 4
- KMBPQYKVZBMRMH-PEFMBERDSA-N Ile-Gln-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KMBPQYKVZBMRMH-PEFMBERDSA-N 0.000 description 4
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 4
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 4
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 4
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 4
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 4
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 4
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 4
- 241001529936 Murinae Species 0.000 description 4
- 241000699660 Mus musculus Species 0.000 description 4
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 4
- 238000011579 SCID mouse model Methods 0.000 description 4
- 238000002105 Southern blotting Methods 0.000 description 4
- 241000282898 Sus scrofa Species 0.000 description 4
- NKANXQFJJICGDU-QPLCGJKRSA-N Tamoxifen Chemical compound C=1C=CC=CC=1C(/CC)=C(C=1C=CC(OCCN(C)C)=CC=1)/C1=CC=CC=C1 NKANXQFJJICGDU-QPLCGJKRSA-N 0.000 description 4
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 4
- 102100039390 Toll-like receptor 7 Human genes 0.000 description 4
- 230000004913 activation Effects 0.000 description 4
- 108010005233 alanylglutamic acid Proteins 0.000 description 4
- 108010008355 arginyl-glutamine Proteins 0.000 description 4
- 108010077245 asparaginyl-proline Proteins 0.000 description 4
- 108010047857 aspartylglycine Proteins 0.000 description 4
- 238000004113 cell culture Methods 0.000 description 4
- 210000001671 embryonic stem cell Anatomy 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 238000012239 gene modification Methods 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 4
- 108010064235 lysylglycine Proteins 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000011580 nude mouse model Methods 0.000 description 4
- 238000012163 sequencing technique Methods 0.000 description 4
- 108010048818 seryl-histidine Proteins 0.000 description 4
- 230000007704 transition Effects 0.000 description 4
- 238000012795 verification Methods 0.000 description 4
- KYDYGANDJHFBCW-DRZSPHRISA-N Ala-Phe-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KYDYGANDJHFBCW-DRZSPHRISA-N 0.000 description 3
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 3
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 3
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 3
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 3
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 3
- GSNRZJNHMVMOFV-ACZMJKKPSA-N Cys-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N GSNRZJNHMVMOFV-ACZMJKKPSA-N 0.000 description 3
- WDQXKVCQXRNOSI-GHCJXIJMSA-N Cys-Asp-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WDQXKVCQXRNOSI-GHCJXIJMSA-N 0.000 description 3
- SSNJZBGOMNLSLA-CIUDSAMLSA-N Cys-Leu-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O SSNJZBGOMNLSLA-CIUDSAMLSA-N 0.000 description 3
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 3
- 101100364969 Dictyostelium discoideum scai gene Proteins 0.000 description 3
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 3
- KPNWAJMEMRCLAL-GUBZILKMSA-N Gln-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KPNWAJMEMRCLAL-GUBZILKMSA-N 0.000 description 3
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 description 3
- LJLPOZGRPLORTF-CIUDSAMLSA-N Glu-Asn-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LJLPOZGRPLORTF-CIUDSAMLSA-N 0.000 description 3
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 3
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 3
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 3
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 3
- ZRSJXIKQXUGKRB-TUBUOCAGSA-N His-Ile-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZRSJXIKQXUGKRB-TUBUOCAGSA-N 0.000 description 3
- 101001046686 Homo sapiens Integrin alpha-M Proteins 0.000 description 3
- 101000738771 Homo sapiens Receptor-type tyrosine-protein phosphatase C Proteins 0.000 description 3
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 3
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 3
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 3
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 3
- 102100022338 Integrin alpha-M Human genes 0.000 description 3
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 3
- YSKSXVKQLLBVEX-SZMVWBNQSA-N Leu-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 YSKSXVKQLLBVEX-SZMVWBNQSA-N 0.000 description 3
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 3
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 3
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 3
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 3
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 3
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 3
- 241000124008 Mammalia Species 0.000 description 3
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 3
- 101100364971 Mus musculus Scai gene Proteins 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 3
- 208000015914 Non-Hodgkin lymphomas Diseases 0.000 description 3
- JOXIIFVCSATTDH-IHPCNDPISA-N Phe-Asn-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JOXIIFVCSATTDH-IHPCNDPISA-N 0.000 description 3
- MQVFHOPCKNTHGT-MELADBBJSA-N Phe-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O MQVFHOPCKNTHGT-MELADBBJSA-N 0.000 description 3
- JQLQUPIYYJXZLJ-ZEWNOJEFSA-N Phe-Ile-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 JQLQUPIYYJXZLJ-ZEWNOJEFSA-N 0.000 description 3
- 206010035226 Plasma cell myeloma Diseases 0.000 description 3
- SNIPWBQKOPCJRG-CIUDSAMLSA-N Pro-Gln-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O SNIPWBQKOPCJRG-CIUDSAMLSA-N 0.000 description 3
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 3
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 3
- 102100037422 Receptor-type tyrosine-protein phosphatase C Human genes 0.000 description 3
- 206010039491 Sarcoma Diseases 0.000 description 3
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 3
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 3
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 3
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 3
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 3
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 3
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 3
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 3
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 3
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 3
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 3
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 3
- 102100036011 T-cell surface glycoprotein CD4 Human genes 0.000 description 3
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 3
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 3
- UTQBQJNSNXJNIH-IHPCNDPISA-N Trp-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N UTQBQJNSNXJNIH-IHPCNDPISA-N 0.000 description 3
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 3
- QSFJHIRIHOJRKS-ULQDDVLXSA-N Tyr-Leu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QSFJHIRIHOJRKS-ULQDDVLXSA-N 0.000 description 3
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 3
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 3
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 3
- 241000700605 Viruses Species 0.000 description 3
- 108010062796 arginyllysine Proteins 0.000 description 3
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 3
- 210000003719 b-lymphocyte Anatomy 0.000 description 3
- 238000013373 clone screening Methods 0.000 description 3
- 210000004443 dendritic cell Anatomy 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 3
- 108010025306 histidylleucine Proteins 0.000 description 3
- 239000000411 inducer Substances 0.000 description 3
- 230000002757 inflammatory effect Effects 0.000 description 3
- 108010012058 leucyltyrosine Proteins 0.000 description 3
- 208000032839 leukemia Diseases 0.000 description 3
- 108010054155 lysyllysine Proteins 0.000 description 3
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 3
- 108010051242 phenylalanylserine Proteins 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 206010039073 rheumatoid arthritis Diseases 0.000 description 3
- 208000024891 symptom Diseases 0.000 description 3
- 201000000596 systemic lupus erythematosus Diseases 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 108010084932 tryptophyl-proline Proteins 0.000 description 3
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 3
- 238000011144 upstream manufacturing Methods 0.000 description 3
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 3
- OFHXPCLWHLXQHT-JKQORVJESA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2,6-diaminohexanoyl]amino]-3-methylbutanoyl]amino]-4-methylpentanoyl]amino]butanedioic acid Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN OFHXPCLWHLXQHT-JKQORVJESA-N 0.000 description 2
- 208000031261 Acute myeloid leukaemia Diseases 0.000 description 2
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 2
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 2
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 2
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 2
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 2
- VQBULXOHAZSTQY-GKCIPKSASA-N Ala-Trp-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VQBULXOHAZSTQY-GKCIPKSASA-N 0.000 description 2
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 2
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 2
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 2
- ZJEDSBGPBXVBMP-PYJNHQTQSA-N Arg-His-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZJEDSBGPBXVBMP-PYJNHQTQSA-N 0.000 description 2
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 2
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 2
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 2
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 2
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 2
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 2
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 2
- BXLDDWZOTGGNOJ-SZMVWBNQSA-N Arg-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCN=C(N)N)N BXLDDWZOTGGNOJ-SZMVWBNQSA-N 0.000 description 2
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 2
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 2
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 2
- CZIXHXIJJZLYRJ-SRVKXCTJSA-N Asn-Cys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CZIXHXIJJZLYRJ-SRVKXCTJSA-N 0.000 description 2
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 2
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 2
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 2
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 2
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 2
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 2
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 description 2
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 2
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 2
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 2
- DOURAOODTFJRIC-CIUDSAMLSA-N Asn-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N DOURAOODTFJRIC-CIUDSAMLSA-N 0.000 description 2
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 2
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 2
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 2
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 2
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 2
- WKGJGVGTEZGFSW-FXQIFTODSA-N Asp-Asn-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O WKGJGVGTEZGFSW-FXQIFTODSA-N 0.000 description 2
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 2
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 2
- QCLHLXDWRKOHRR-GUBZILKMSA-N Asp-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N QCLHLXDWRKOHRR-GUBZILKMSA-N 0.000 description 2
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 2
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 2
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 2
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 2
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 2
- GYNUXDMCDILYIQ-QRTARXTBSA-N Asp-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N GYNUXDMCDILYIQ-QRTARXTBSA-N 0.000 description 2
- 208000023275 Autoimmune disease Diseases 0.000 description 2
- 108091033409 CRISPR Proteins 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 241000699800 Cricetinae Species 0.000 description 2
- KKUVRYLJEXJSGX-MXAVVETBSA-N Cys-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N KKUVRYLJEXJSGX-MXAVVETBSA-N 0.000 description 2
- WTEACWBAULENKE-SRVKXCTJSA-N Cys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N WTEACWBAULENKE-SRVKXCTJSA-N 0.000 description 2
- ZOKPRHVIFAUJPV-GUBZILKMSA-N Cys-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O ZOKPRHVIFAUJPV-GUBZILKMSA-N 0.000 description 2
- 102000016607 Diphtheria Toxin Human genes 0.000 description 2
- 108010053187 Diphtheria Toxin Proteins 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 2
- LVNILKSSFHCSJZ-IHRRRGAJSA-N Gln-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LVNILKSSFHCSJZ-IHRRRGAJSA-N 0.000 description 2
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 2
- BZULIEARJFRINC-IHRRRGAJSA-N Gln-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BZULIEARJFRINC-IHRRRGAJSA-N 0.000 description 2
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 2
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 2
- GHAXJVNBAKGWEJ-AVGNSLFASA-N Gln-Ser-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GHAXJVNBAKGWEJ-AVGNSLFASA-N 0.000 description 2
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 2
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 2
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 2
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 2
- UENPHLAAKDPZQY-XKBZYTNZSA-N Glu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O UENPHLAAKDPZQY-XKBZYTNZSA-N 0.000 description 2
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 2
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 2
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 2
- LZMQSTPFYJLVJB-GUBZILKMSA-N Glu-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LZMQSTPFYJLVJB-GUBZILKMSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 2
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 2
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 2
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 2
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 2
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 2
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 2
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 2
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 2
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 2
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 2
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 2
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 2
- QZAFGJNKLMNDEM-DCAQKATOSA-N His-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 QZAFGJNKLMNDEM-DCAQKATOSA-N 0.000 description 2
- AVQOSMRPITVTRB-CIUDSAMLSA-N His-Asn-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AVQOSMRPITVTRB-CIUDSAMLSA-N 0.000 description 2
- PMWSGVRIMIFXQH-KKUMJFAQSA-N His-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 PMWSGVRIMIFXQH-KKUMJFAQSA-N 0.000 description 2
- VJJSDSNFXCWCEJ-DJFWLOJKSA-N His-Ile-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O VJJSDSNFXCWCEJ-DJFWLOJKSA-N 0.000 description 2
- DYKZGTLPSNOFHU-DEQVHRJGSA-N His-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N DYKZGTLPSNOFHU-DEQVHRJGSA-N 0.000 description 2
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 2
- BMZLDCQIWUHVRS-DCAQKATOSA-N His-Met-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CN=CN1 BMZLDCQIWUHVRS-DCAQKATOSA-N 0.000 description 2
- FBOMZVOKCZMDIG-XQQFMLRXSA-N His-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N FBOMZVOKCZMDIG-XQQFMLRXSA-N 0.000 description 2
- 101001057747 Human cytomegalovirus (strain AD169) Uncharacterized protein IRL8 Proteins 0.000 description 2
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 2
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 2
- AWTDTFXPVCTHAK-BJDJZHNGSA-N Ile-Cys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N AWTDTFXPVCTHAK-BJDJZHNGSA-N 0.000 description 2
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 2
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 2
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 2
- FFJQAEYLAQMGDL-MGHWNKPDSA-N Ile-Lys-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FFJQAEYLAQMGDL-MGHWNKPDSA-N 0.000 description 2
- WYUHAXJAMDTOAU-IAVJCBSLSA-N Ile-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WYUHAXJAMDTOAU-IAVJCBSLSA-N 0.000 description 2
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 2
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 2
- BJECXJHLUJXPJQ-PYJNHQTQSA-N Ile-Pro-His Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N BJECXJHLUJXPJQ-PYJNHQTQSA-N 0.000 description 2
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 2
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 2
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 2
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 2
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 2
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 2
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 2
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 2
- 102100022297 Integrin alpha-X Human genes 0.000 description 2
- 108010025815 Kanamycin Kinase Proteins 0.000 description 2
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 2
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 2
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 2
- CUXRXAIAVYLVFD-ULQDDVLXSA-N Leu-Arg-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUXRXAIAVYLVFD-ULQDDVLXSA-N 0.000 description 2
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 2
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 2
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 2
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 2
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 2
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 2
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 2
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 2
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 2
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 2
- XQXGNBFMAXWIGI-MXAVVETBSA-N Leu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 XQXGNBFMAXWIGI-MXAVVETBSA-N 0.000 description 2
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 2
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 2
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 2
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 2
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 2
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 2
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 2
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 2
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 2
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 2
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 2
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 2
- 206010025323 Lymphomas Diseases 0.000 description 2
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 2
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 2
- HGZHSNBZDOLMLH-DCAQKATOSA-N Lys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N HGZHSNBZDOLMLH-DCAQKATOSA-N 0.000 description 2
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 2
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 2
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 2
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 2
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 2
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 2
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 2
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 2
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 2
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 2
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 2
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 2
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 2
- XDGFFEZAZHRZFR-RHYQMDGZSA-N Met-Leu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDGFFEZAZHRZFR-RHYQMDGZSA-N 0.000 description 2
- OIFHHODAXVWKJN-ULQDDVLXSA-N Met-Phe-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 OIFHHODAXVWKJN-ULQDDVLXSA-N 0.000 description 2
- 208000034578 Multiple myelomas Diseases 0.000 description 2
- 101001065556 Mus musculus Lymphocyte antigen 6G Proteins 0.000 description 2
- 102000010168 Myeloid Differentiation Factor 88 Human genes 0.000 description 2
- 108010077432 Myeloid Differentiation Factor 88 Proteins 0.000 description 2
- 206010061306 Nasopharyngeal cancer Diseases 0.000 description 2
- 108091092724 Noncoding DNA Proteins 0.000 description 2
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 2
- LXVFHIBXOWJTKZ-BZSNNMDCSA-N Phe-Asn-Tyr Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O LXVFHIBXOWJTKZ-BZSNNMDCSA-N 0.000 description 2
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 2
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 2
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 2
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 2
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 2
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 2
- IPVPGAADZXRZSH-RNXOBYDBSA-N Phe-Tyr-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O IPVPGAADZXRZSH-RNXOBYDBSA-N 0.000 description 2
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 2
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 2
- WWAQEUOYCYMGHB-FXQIFTODSA-N Pro-Asn-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 WWAQEUOYCYMGHB-FXQIFTODSA-N 0.000 description 2
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 2
- QXNSKJLSLYCTMT-FXQIFTODSA-N Pro-Cys-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O QXNSKJLSLYCTMT-FXQIFTODSA-N 0.000 description 2
- OZAPWFHRPINHND-GUBZILKMSA-N Pro-Cys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OZAPWFHRPINHND-GUBZILKMSA-N 0.000 description 2
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 2
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 2
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 2
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 2
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 2
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 2
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 2
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 2
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 2
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 2
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 2
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 2
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 2
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 2
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 2
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 2
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 2
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 2
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 2
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 2
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 2
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 2
- 241000121210 Sigmodontinae Species 0.000 description 2
- 210000001744 T-lymphocyte Anatomy 0.000 description 2
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 2
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 2
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- TZQWJCGVCIJDMU-HEIBUPTGSA-N Thr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N)O TZQWJCGVCIJDMU-HEIBUPTGSA-N 0.000 description 2
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 2
- ARSHSYUZHSIYKR-ACRUOGEOSA-N Tyr-His-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ARSHSYUZHSIYKR-ACRUOGEOSA-N 0.000 description 2
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 2
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 2
- IGXLNVIYDYONFB-UFYCRDLUSA-N Tyr-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 IGXLNVIYDYONFB-UFYCRDLUSA-N 0.000 description 2
- ARMNWLJYHCOSHE-KKUMJFAQSA-N Tyr-Pro-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O ARMNWLJYHCOSHE-KKUMJFAQSA-N 0.000 description 2
- IEWKKXZRJLTIOV-AVGNSLFASA-N Tyr-Ser-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O IEWKKXZRJLTIOV-AVGNSLFASA-N 0.000 description 2
- KOPBYUSPXBQIHD-NRPADANISA-N Val-Cys-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KOPBYUSPXBQIHD-NRPADANISA-N 0.000 description 2
- IWZYXFRGWKEKBJ-GVXVVHGQSA-N Val-Gln-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IWZYXFRGWKEKBJ-GVXVVHGQSA-N 0.000 description 2
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 2
- PYPZMFDMCCWNST-NAKRPEOUSA-N Val-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N PYPZMFDMCCWNST-NAKRPEOUSA-N 0.000 description 2
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 2
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 2
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 2
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 2
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 2
- 239000000556 agonist Substances 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 108010011559 alanylphenylalanine Proteins 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 210000001185 bone marrow Anatomy 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 208000035475 disorder Diseases 0.000 description 2
- 239000006274 endogenous ligand Substances 0.000 description 2
- 238000001976 enzyme digestion Methods 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 238000000684 flow cytometry Methods 0.000 description 2
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 2
- 238000003209 gene knockout Methods 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 230000015788 innate immune response Effects 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010010679 lysyl-valyl-leucyl-aspartic acid Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 210000002540 macrophage Anatomy 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 244000052769 pathogen Species 0.000 description 2
- 230000001717 pathogenic effect Effects 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 230000003234 polygenic effect Effects 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 229960004641 rituximab Drugs 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 230000011664 signaling Effects 0.000 description 2
- 229960001603 tamoxifen Drugs 0.000 description 2
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- NHBKXEKEPDILRR-UHFFFAOYSA-N 2,3-bis(butanoylsulfanyl)propyl butanoate Chemical compound CCCC(=O)OCC(SC(=O)CCC)CSC(=O)CCC NHBKXEKEPDILRR-UHFFFAOYSA-N 0.000 description 1
- QSPOQCXMGPDIHI-UHFFFAOYSA-N 2-amino-n,n-dipropyl-8-[4-(pyrrolidine-1-carbonyl)phenyl]-3h-1-benzazepine-4-carboxamide Chemical compound C1=C2N=C(N)CC(C(=O)N(CCC)CCC)=CC2=CC=C1C(C=C1)=CC=C1C(=O)N1CCCC1 QSPOQCXMGPDIHI-UHFFFAOYSA-N 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- 208000024893 Acute lymphoblastic leukemia Diseases 0.000 description 1
- 208000014697 Acute lymphocytic leukaemia Diseases 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- 201000003076 Angiosarcoma Diseases 0.000 description 1
- 208000031873 Animal Disease Models Diseases 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- ASQYTJJWAMDISW-BPUTZDHNSA-N Arg-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N ASQYTJJWAMDISW-BPUTZDHNSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- HCIUUZGFTDTEGM-NAKRPEOUSA-N Arg-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HCIUUZGFTDTEGM-NAKRPEOUSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- NOZYDJOPOGKUSR-AVGNSLFASA-N Arg-Leu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O NOZYDJOPOGKUSR-AVGNSLFASA-N 0.000 description 1
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 description 1
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 1
- FBXMCPLCVYUWBO-BPUTZDHNSA-N Arg-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N FBXMCPLCVYUWBO-BPUTZDHNSA-N 0.000 description 1
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 1
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 1
- XRLOBFSLPCHYLQ-ULQDDVLXSA-N Arg-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O XRLOBFSLPCHYLQ-ULQDDVLXSA-N 0.000 description 1
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- NUHQMYUWLUSRJX-BIIVOSGPSA-N Asn-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N NUHQMYUWLUSRJX-BIIVOSGPSA-N 0.000 description 1
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 1
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 1
- DXZNJWFECGJCQR-FXQIFTODSA-N Asn-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N DXZNJWFECGJCQR-FXQIFTODSA-N 0.000 description 1
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 1
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 1
- UBGGJTMETLEXJD-DCAQKATOSA-N Asn-Leu-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O UBGGJTMETLEXJD-DCAQKATOSA-N 0.000 description 1
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 1
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 1
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 1
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 1
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 1
- AECPDLSSUMDUAA-ZKWXMUAHSA-N Asn-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N AECPDLSSUMDUAA-ZKWXMUAHSA-N 0.000 description 1
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- 206010003571 Astrocytoma Diseases 0.000 description 1
- 208000036170 B-Cell Marginal Zone Lymphoma Diseases 0.000 description 1
- 208000025324 B-cell acute lymphoblastic leukemia Diseases 0.000 description 1
- 208000010839 B-cell chronic lymphocytic leukemia Diseases 0.000 description 1
- 208000003950 B-cell lymphoma Diseases 0.000 description 1
- 208000032791 BCR-ABL1 positive chronic myelogenous leukemia Diseases 0.000 description 1
- 206010004593 Bile duct cancer Diseases 0.000 description 1
- 206010005003 Bladder cancer Diseases 0.000 description 1
- 206010005949 Bone cancer Diseases 0.000 description 1
- 208000018084 Bone neoplasm Diseases 0.000 description 1
- 206010006187 Breast cancer Diseases 0.000 description 1
- 208000026310 Breast neoplasm Diseases 0.000 description 1
- 238000011752 CBA/J (JAX™ mouse strain) Methods 0.000 description 1
- 238000010354 CRISPR gene editing Methods 0.000 description 1
- 208000005243 Chondrosarcoma Diseases 0.000 description 1
- 208000010833 Chronic myeloid leukaemia Diseases 0.000 description 1
- 206010009900 Colitis ulcerative Diseases 0.000 description 1
- 206010009944 Colon cancer Diseases 0.000 description 1
- YRJICXCOIBUCRP-CIUDSAMLSA-N Cys-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N YRJICXCOIBUCRP-CIUDSAMLSA-N 0.000 description 1
- ZHCCYSDALWJITB-SRVKXCTJSA-N Cys-Phe-Cys Chemical compound N[C@@H](CS)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O ZHCCYSDALWJITB-SRVKXCTJSA-N 0.000 description 1
- HPZAJRPYUIHDIN-BZSNNMDCSA-N Cys-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CS)N HPZAJRPYUIHDIN-BZSNNMDCSA-N 0.000 description 1
- LPBUBIHAVKXUOT-FXQIFTODSA-N Cys-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N LPBUBIHAVKXUOT-FXQIFTODSA-N 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 238000011765 DBA/2 mouse Methods 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 206010014733 Endometrial cancer Diseases 0.000 description 1
- 206010014759 Endometrial neoplasm Diseases 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 241001433703 Escherichia coli O111:B4 Species 0.000 description 1
- 208000000461 Esophageal Neoplasms Diseases 0.000 description 1
- 208000006168 Ewing Sarcoma Diseases 0.000 description 1
- 102100027280 Fanconi anemia group A protein Human genes 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 201000008808 Fibrosarcoma Diseases 0.000 description 1
- 241000699694 Gerbillinae Species 0.000 description 1
- SOBBAYVQSNXYPQ-ACZMJKKPSA-N Gln-Asn-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SOBBAYVQSNXYPQ-ACZMJKKPSA-N 0.000 description 1
- PONUFVLSGMQFAI-AVGNSLFASA-N Gln-Asn-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PONUFVLSGMQFAI-AVGNSLFASA-N 0.000 description 1
- GPISLLFQNHELLK-DCAQKATOSA-N Gln-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N GPISLLFQNHELLK-DCAQKATOSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 1
- QENSHQJGWGRPQS-QEJZJMRPSA-N Gln-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)N)C(O)=O)=CNC2=C1 QENSHQJGWGRPQS-QEJZJMRPSA-N 0.000 description 1
- QXQDADBVIBLBHN-FHWLQOOXSA-N Gln-Tyr-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QXQDADBVIBLBHN-FHWLQOOXSA-N 0.000 description 1
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 1
- UERORLSAFUHDGU-AVGNSLFASA-N Glu-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UERORLSAFUHDGU-AVGNSLFASA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- DJTXYXZNNDDEOU-WHFBIAKZSA-N Gly-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)C(=O)N DJTXYXZNNDDEOU-WHFBIAKZSA-N 0.000 description 1
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 1
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 1
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 1
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- ICUTTWWCDIIIEE-BQBZGAKWSA-N Gly-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN ICUTTWWCDIIIEE-BQBZGAKWSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- 208000001258 Hemangiosarcoma Diseases 0.000 description 1
- 208000035186 Hemolytic Autoimmune Anemia Diseases 0.000 description 1
- 208000027761 Hepatic autoimmune disease Diseases 0.000 description 1
- MAABHGXCIBEYQR-XVYDVKMFSA-N His-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MAABHGXCIBEYQR-XVYDVKMFSA-N 0.000 description 1
- WMKXFMUJRCEGRP-SRVKXCTJSA-N His-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N WMKXFMUJRCEGRP-SRVKXCTJSA-N 0.000 description 1
- UOAVQQRILDGZEN-SRVKXCTJSA-N His-Asp-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UOAVQQRILDGZEN-SRVKXCTJSA-N 0.000 description 1
- STWGDDDFLUFCCA-GVXVVHGQSA-N His-Glu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O STWGDDDFLUFCCA-GVXVVHGQSA-N 0.000 description 1
- QMUHTRISZMFKAY-MXAVVETBSA-N His-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N QMUHTRISZMFKAY-MXAVVETBSA-N 0.000 description 1
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 1
- PGRPSOUCWRBWKZ-DLOVCJGASA-N His-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 PGRPSOUCWRBWKZ-DLOVCJGASA-N 0.000 description 1
- YAEKRYQASVCDLK-JYJNAYRXSA-N His-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N YAEKRYQASVCDLK-JYJNAYRXSA-N 0.000 description 1
- PLCAEMGSYOYIPP-GUBZILKMSA-N His-Ser-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 PLCAEMGSYOYIPP-GUBZILKMSA-N 0.000 description 1
- MRVZCDSYLJXKKX-ACRUOGEOSA-N His-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CN=CN3)N MRVZCDSYLJXKKX-ACRUOGEOSA-N 0.000 description 1
- GYXDQXPCPASCNR-NHCYSSNCSA-N His-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N GYXDQXPCPASCNR-NHCYSSNCSA-N 0.000 description 1
- 208000017604 Hodgkin disease Diseases 0.000 description 1
- 208000021519 Hodgkin lymphoma Diseases 0.000 description 1
- 208000010747 Hodgkins lymphoma Diseases 0.000 description 1
- 101000914673 Homo sapiens Fanconi anemia group A protein Proteins 0.000 description 1
- 101000831567 Homo sapiens Toll-like receptor 2 Proteins 0.000 description 1
- 206010020751 Hypersensitivity Diseases 0.000 description 1
- 206010020850 Hyperthyroidism Diseases 0.000 description 1
- 206010021245 Idiopathic thrombocytopenic purpura Diseases 0.000 description 1
- LVQDUPQUJZWKSU-PYJNHQTQSA-N Ile-Arg-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LVQDUPQUJZWKSU-PYJNHQTQSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 1
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 1
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 1
- LNJLOZYNZFGJMM-DEQVHRJGSA-N Ile-His-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N LNJLOZYNZFGJMM-DEQVHRJGSA-N 0.000 description 1
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 1
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 1
- VOCZPDONPURUHV-QEWYBTABSA-N Ile-Phe-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VOCZPDONPURUHV-QEWYBTABSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 208000008839 Kidney Neoplasms Diseases 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- 102220470475 L-seryl-tRNA(Sec) kinase_C57L_mutation Human genes 0.000 description 1
- 208000031671 Large B-Cell Diffuse Lymphoma Diseases 0.000 description 1
- 208000018142 Leiomyosarcoma Diseases 0.000 description 1
- 206010024229 Leprosy Diseases 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- VKOAHIRLIUESLU-ULQDDVLXSA-N Leu-Arg-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VKOAHIRLIUESLU-ULQDDVLXSA-N 0.000 description 1
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- 208000031422 Lymphocytic Chronic B-Cell Leukemia Diseases 0.000 description 1
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 1
- XTONYTDATVADQH-CIUDSAMLSA-N Lys-Cys-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O XTONYTDATVADQH-CIUDSAMLSA-N 0.000 description 1
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 1
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 1
- CBNMHRCLYBJIIZ-XUXIUFHCSA-N Lys-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N CBNMHRCLYBJIIZ-XUXIUFHCSA-N 0.000 description 1
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- WWEWGPOLIJXGNX-XUXIUFHCSA-N Lys-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N WWEWGPOLIJXGNX-XUXIUFHCSA-N 0.000 description 1
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- USPJSTBDIGJPFK-PMVMPFDFSA-N Lys-Tyr-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O USPJSTBDIGJPFK-PMVMPFDFSA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- 208000025205 Mantle-Cell Lymphoma Diseases 0.000 description 1
- XIGAHPDZLAYQOS-SRVKXCTJSA-N Met-Pro-Pro Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 XIGAHPDZLAYQOS-SRVKXCTJSA-N 0.000 description 1
- 101100120552 Mus musculus Foxp3 gene Proteins 0.000 description 1
- 241000699669 Mus saxicola Species 0.000 description 1
- 201000003793 Myelodysplastic syndrome Diseases 0.000 description 1
- 208000033761 Myelogenous Chronic BCR-ABL Positive Leukemia Diseases 0.000 description 1
- 208000033776 Myeloid Acute Leukemia Diseases 0.000 description 1
- 208000009525 Myocarditis Diseases 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- 208000002454 Nasopharyngeal Carcinoma Diseases 0.000 description 1
- 208000001894 Nasopharyngeal Neoplasms Diseases 0.000 description 1
- 208000012902 Nervous system disease Diseases 0.000 description 1
- 101100290374 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) mcd-4 gene Proteins 0.000 description 1
- 206010030155 Oesophageal carcinoma Diseases 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 206010033128 Ovarian cancer Diseases 0.000 description 1
- 206010061535 Ovarian neoplasm Diseases 0.000 description 1
- 238000002944 PCR assay Methods 0.000 description 1
- 206010061902 Pancreatic neoplasm Diseases 0.000 description 1
- YRKFKTQRVBJYLT-CQDKDKBSSA-N Phe-Ala-His Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 YRKFKTQRVBJYLT-CQDKDKBSSA-N 0.000 description 1
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 1
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 1
- UEXCHCYDPAIVDE-SRVKXCTJSA-N Phe-Asp-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEXCHCYDPAIVDE-SRVKXCTJSA-N 0.000 description 1
- ABQFNJAFONNUTH-FHWLQOOXSA-N Phe-Gln-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N ABQFNJAFONNUTH-FHWLQOOXSA-N 0.000 description 1
- MIICYIIBVYQNKE-QEWYBTABSA-N Phe-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N MIICYIIBVYQNKE-QEWYBTABSA-N 0.000 description 1
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 1
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 1
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 1
- YVXPUUOTMVBKDO-IHRRRGAJSA-N Phe-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CS)C(=O)O YVXPUUOTMVBKDO-IHRRRGAJSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- ILGCZYGFYQLSDZ-KKUMJFAQSA-N Phe-Ser-His Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ILGCZYGFYQLSDZ-KKUMJFAQSA-N 0.000 description 1
- UMIHVJQSXFWWMW-JBACZVJFSA-N Phe-Trp-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UMIHVJQSXFWWMW-JBACZVJFSA-N 0.000 description 1
- NHHZWPNMYQUNEH-ACRUOGEOSA-N Phe-Tyr-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N NHHZWPNMYQUNEH-ACRUOGEOSA-N 0.000 description 1
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 description 1
- ZOGICTVLQDWPER-UFYCRDLUSA-N Phe-Tyr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O ZOGICTVLQDWPER-UFYCRDLUSA-N 0.000 description 1
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 1
- 208000006664 Precursor Cell Lymphoblastic Leukemia-Lymphoma Diseases 0.000 description 1
- OLHDPZMYUSBGDE-GUBZILKMSA-N Pro-Arg-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O OLHDPZMYUSBGDE-GUBZILKMSA-N 0.000 description 1
- WECYCNFPGZLOOU-FXQIFTODSA-N Pro-Asn-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O WECYCNFPGZLOOU-FXQIFTODSA-N 0.000 description 1
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 description 1
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 1
- XZONQWUEBAFQPO-HJGDQZAQSA-N Pro-Gln-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZONQWUEBAFQPO-HJGDQZAQSA-N 0.000 description 1
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 1
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 1
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- 206010060862 Prostate cancer Diseases 0.000 description 1
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 1
- 201000004681 Psoriasis Diseases 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 208000015634 Rectal Neoplasms Diseases 0.000 description 1
- 206010038389 Renal cancer Diseases 0.000 description 1
- 206010039085 Rhinitis allergic Diseases 0.000 description 1
- 206010039710 Scleroderma Diseases 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- GWMXFEMMBHOKDX-AVGNSLFASA-N Ser-Gln-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GWMXFEMMBHOKDX-AVGNSLFASA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 1
- YMDNFPNTIPQMJP-NAKRPEOUSA-N Ser-Ile-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O YMDNFPNTIPQMJP-NAKRPEOUSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- ZSLFCBHEINFXRS-LPEHRKFASA-N Ser-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ZSLFCBHEINFXRS-LPEHRKFASA-N 0.000 description 1
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 1
- WMZVVNLPHFSUPA-BPUTZDHNSA-N Ser-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 WMZVVNLPHFSUPA-BPUTZDHNSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- 108020004688 Small Nuclear RNA Proteins 0.000 description 1
- 102000039471 Small Nuclear RNA Human genes 0.000 description 1
- 108020004459 Small interfering RNA Proteins 0.000 description 1
- 208000021712 Soft tissue sarcoma Diseases 0.000 description 1
- 208000027520 Somatoform disease Diseases 0.000 description 1
- 208000005718 Stomach Neoplasms Diseases 0.000 description 1
- 108091027544 Subgenomic mRNA Proteins 0.000 description 1
- 208000029052 T-cell acute lymphoblastic leukemia Diseases 0.000 description 1
- 206010042971 T-cell lymphoma Diseases 0.000 description 1
- 208000027585 T-cell non-Hodgkin lymphoma Diseases 0.000 description 1
- 238000010459 TALEN Methods 0.000 description 1
- 229940124614 TLR 8 agonist Drugs 0.000 description 1
- 208000024313 Testicular Neoplasms Diseases 0.000 description 1
- 206010057644 Testis cancer Diseases 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 1
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 1
- DCRHJDRLCFMEBI-RHYQMDGZSA-N Thr-Lys-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O DCRHJDRLCFMEBI-RHYQMDGZSA-N 0.000 description 1
- DNCUODYZAMHLCV-XGEHTFHBSA-N Thr-Pro-Cys Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N)O DNCUODYZAMHLCV-XGEHTFHBSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- 208000031981 Thrombocytopenic Idiopathic Purpura Diseases 0.000 description 1
- 208000024770 Thyroid neoplasm Diseases 0.000 description 1
- 108010060818 Toll-Like Receptor 9 Proteins 0.000 description 1
- 102100024333 Toll-like receptor 2 Human genes 0.000 description 1
- 102100033117 Toll-like receptor 9 Human genes 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 108020004566 Transfer RNA Proteins 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- RNFZZCMCRDFNAE-WFBYXXMGSA-N Trp-Asn-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O RNFZZCMCRDFNAE-WFBYXXMGSA-N 0.000 description 1
- WACMTVIJWRNVSO-CWRNSKLLSA-N Trp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O WACMTVIJWRNVSO-CWRNSKLLSA-N 0.000 description 1
- AWYXDHQQFPZJNE-QEJZJMRPSA-N Trp-Gln-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N AWYXDHQQFPZJNE-QEJZJMRPSA-N 0.000 description 1
- GWQUSADRQCTMHN-NWLDYVSISA-N Trp-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O GWQUSADRQCTMHN-NWLDYVSISA-N 0.000 description 1
- YTVJTXJTNRWJCR-JBACZVJFSA-N Trp-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N YTVJTXJTNRWJCR-JBACZVJFSA-N 0.000 description 1
- SWSUXOKZKQRADK-FDARSICLSA-N Trp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SWSUXOKZKQRADK-FDARSICLSA-N 0.000 description 1
- 206010067584 Type 1 diabetes mellitus Diseases 0.000 description 1
- 108091005906 Type I transmembrane proteins Proteins 0.000 description 1
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 1
- SGFIXFAHVWJKTD-KJEVXHAQSA-N Tyr-Arg-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SGFIXFAHVWJKTD-KJEVXHAQSA-N 0.000 description 1
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 1
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 1
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 1
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 1
- 201000006704 Ulcerative Colitis Diseases 0.000 description 1
- 208000007097 Urinary Bladder Neoplasms Diseases 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 1
- BZWUSZGQOILYEU-STECZYCISA-N Val-Ile-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BZWUSZGQOILYEU-STECZYCISA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 1
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 1
- 108020000999 Viral RNA Proteins 0.000 description 1
- 208000033559 Waldenström macroglobulinemia Diseases 0.000 description 1
- 108010017070 Zinc Finger Nucleases Proteins 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 208000038016 acute inflammation Diseases 0.000 description 1
- 230000006022 acute inflammation Effects 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 208000026935 allergic disease Diseases 0.000 description 1
- 201000010105 allergic rhinitis Diseases 0.000 description 1
- 230000007815 allergy Effects 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 230000005809 anti-tumor immunity Effects 0.000 description 1
- 230000010056 antibody-dependent cellular cytotoxicity Effects 0.000 description 1
- 102000025171 antigen binding proteins Human genes 0.000 description 1
- 108091000831 antigen binding proteins Proteins 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 208000006673 asthma Diseases 0.000 description 1
- 201000000448 autoimmune hemolytic anemia Diseases 0.000 description 1
- 201000003710 autoimmune thrombocytopenic purpura Diseases 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 208000026900 bile duct neoplasm Diseases 0.000 description 1
- 102000023732 binding proteins Human genes 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 210000002459 blastocyst Anatomy 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000007248 cellular mechanism Effects 0.000 description 1
- 208000006990 cholangiocarcinoma Diseases 0.000 description 1
- 208000037976 chronic inflammation Diseases 0.000 description 1
- 230000006020 chronic inflammation Effects 0.000 description 1
- 208000032852 chronic lymphocytic leukemia Diseases 0.000 description 1
- 210000001728 clone cell Anatomy 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 208000029742 colonic neoplasm Diseases 0.000 description 1
- 238000002648 combination therapy Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 108010069495 cysteinyltyrosine Proteins 0.000 description 1
- 230000001461 cytolytic effect Effects 0.000 description 1
- 230000005860 defense response to virus Effects 0.000 description 1
- 230000003412 degenerative effect Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 206010012601 diabetes mellitus Diseases 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 206010012818 diffuse large B-cell lymphoma Diseases 0.000 description 1
- 238000007877 drug screening Methods 0.000 description 1
- 238000002651 drug therapy Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 239000002158 endotoxin Substances 0.000 description 1
- 201000004101 esophageal cancer Diseases 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 230000004720 fertilization Effects 0.000 description 1
- 201000003444 follicular lymphoma Diseases 0.000 description 1
- 235000012631 food intake Nutrition 0.000 description 1
- 231100000221 frame shift mutation induction Toxicity 0.000 description 1
- 230000037433 frameshift Effects 0.000 description 1
- 206010017758 gastric cancer Diseases 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 208000005017 glioblastoma Diseases 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 201000010536 head and neck cancer Diseases 0.000 description 1
- 208000014829 head and neck neoplasm Diseases 0.000 description 1
- 230000002008 hemorrhagic effect Effects 0.000 description 1
- 208000006454 hepatitis Diseases 0.000 description 1
- 231100000283 hepatitis Toxicity 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 210000001822 immobilized cell Anatomy 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 230000037451 immune surveillance Effects 0.000 description 1
- 230000000984 immunochemical effect Effects 0.000 description 1
- 229960001438 immunostimulant agent Drugs 0.000 description 1
- 239000003022 immunostimulating agent Substances 0.000 description 1
- 230000003308 immunostimulating effect Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000011503 in vivo imaging Methods 0.000 description 1
- 238000011065 in-situ storage Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 206010022000 influenza Diseases 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 201000010982 kidney cancer Diseases 0.000 description 1
- 210000004901 leucine-rich repeat Anatomy 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 229920006008 lipopolysaccharide Polymers 0.000 description 1
- 206010024627 liposarcoma Diseases 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 201000007270 liver cancer Diseases 0.000 description 1
- 208000014018 liver neoplasm Diseases 0.000 description 1
- 201000005202 lung cancer Diseases 0.000 description 1
- 201000005296 lung carcinoma Diseases 0.000 description 1
- 208000020816 lung neoplasm Diseases 0.000 description 1
- 230000000527 lymphocytic effect Effects 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 208000015486 malignant pancreatic neoplasm Diseases 0.000 description 1
- 210000001161 mammalian embryo Anatomy 0.000 description 1
- 201000007924 marginal zone B-cell lymphoma Diseases 0.000 description 1
- 208000021937 marginal zone lymphoma Diseases 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000013011 mating Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 230000004066 metabolic change Effects 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 108091070501 miRNA Proteins 0.000 description 1
- 239000002679 microRNA Substances 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 238000001823 molecular biology technique Methods 0.000 description 1
- 230000009456 molecular mechanism Effects 0.000 description 1
- 238000010172 mouse model Methods 0.000 description 1
- 201000000050 myeloid neoplasm Diseases 0.000 description 1
- 201000011216 nasopharynx carcinoma Diseases 0.000 description 1
- 230000002956 necrotizing effect Effects 0.000 description 1
- 201000008383 nephritis Diseases 0.000 description 1
- 230000000324 neuroprotective effect Effects 0.000 description 1
- 208000002154 non-small cell lung carcinoma Diseases 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 201000008968 osteosarcoma Diseases 0.000 description 1
- 208000027753 pain disease Diseases 0.000 description 1
- 201000002528 pancreatic cancer Diseases 0.000 description 1
- 208000008443 pancreatic carcinoma Diseases 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 230000001575 pathological effect Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 208000031223 plasma cell leukemia Diseases 0.000 description 1
- 210000005134 plasmacytoid dendritic cell Anatomy 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 230000002062 proliferating effect Effects 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 108700019620 rat Foxp3 Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 206010038038 rectal cancer Diseases 0.000 description 1
- 201000001275 rectum cancer Diseases 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 201000009410 rhabdomyosarcoma Diseases 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 230000000392 somatic effect Effects 0.000 description 1
- 210000004989 spleen cell Anatomy 0.000 description 1
- 241000114864 ssRNA viruses Species 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 201000011549 stomach cancer Diseases 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 206010042863 synovial sarcoma Diseases 0.000 description 1
- 208000006379 syphilis Diseases 0.000 description 1
- 238000010809 targeting technique Methods 0.000 description 1
- 201000003120 testicular cancer Diseases 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 201000002510 thyroid cancer Diseases 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 201000008827 tuberculosis Diseases 0.000 description 1
- 208000029729 tumor suppressor gene on chromosome 11 Diseases 0.000 description 1
- 230000005760 tumorsuppression Effects 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 201000005112 urinary bladder cancer Diseases 0.000 description 1
- 238000011816 wild-type C57Bl6 mouse Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
- C07K14/70596—Molecules with a "CD"-designation not provided for elsewhere
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K67/00—Rearing or breeding animals, not otherwise provided for; New or modified breeds of animals
- A01K67/027—New or modified breeds of vertebrates
- A01K67/0275—Genetically modified vertebrates, e.g. transgenic
- A01K67/0278—Knock-in vertebrates, e.g. humanised vertebrates
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K49/00—Preparations for testing in vivo
- A61K49/0004—Screening or testing of compounds for diagnosis of disorders, assessment of conditions, e.g. renal clearance, gastric emptying, testing for diabetes, allergy, rheuma, pancreas functions
- A61K49/0008—Screening agents using (non-human) animal models or transgenic animal models or chimeric hosts, e.g. Alzheimer disease animal model, transgenic model for heart failure
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/8509—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells for producing genetically modified animals, e.g. transgenic
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2207/00—Modified animals
- A01K2207/15—Humanized animals
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2217/00—Genetically modified animals
- A01K2217/05—Animals comprising random inserted nucleic acids (transgenic)
- A01K2217/052—Animals comprising random inserted nucleic acids (transgenic) inducing gain of function
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2227/00—Animals characterised by species
- A01K2227/10—Mammal
- A01K2227/105—Murine
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2267/00—Animals characterised by purpose
- A01K2267/03—Animal model, e.g. for test or diseases
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
- C07K2319/02—Fusion polypeptide containing a localisation/targetting motif containing a signal sequence
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
- C07K2319/03—Fusion polypeptide containing a localisation/targetting motif containing a transmembrane segment
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/001—Vector systems having a special element relevant for transcription controllable enhancer/promoter combination
- C12N2830/005—Vector systems having a special element relevant for transcription controllable enhancer/promoter combination repressible enhancer/promoter combination, e.g. KRAB
- C12N2830/006—Vector systems having a special element relevant for transcription controllable enhancer/promoter combination repressible enhancer/promoter combination, e.g. KRAB tet repressible
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Veterinary Medicine (AREA)
- Toxicology (AREA)
- Molecular Biology (AREA)
- Animal Behavior & Ethology (AREA)
- General Engineering & Computer Science (AREA)
- Biophysics (AREA)
- Gastroenterology & Hepatology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Biochemistry (AREA)
- Environmental Sciences (AREA)
- Animal Husbandry (AREA)
- Immunology (AREA)
- Cell Biology (AREA)
- Biodiversity & Conservation Biology (AREA)
- Public Health (AREA)
- Epidemiology (AREA)
- Medicinal Chemistry (AREA)
- Urology & Nephrology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Rheumatology (AREA)
- Physics & Mathematics (AREA)
- Pathology (AREA)
- Plant Pathology (AREA)
- Endocrinology (AREA)
- Microbiology (AREA)
- Diabetes (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
本发明提供了一种TLR8基因人源化的非人动物及其构建方法、一种嵌合TLR8蛋白、一种TLR8嵌合基因、一种TLR8基因的靶向载体和其在生物医药领域的应用,利用同源重组的方式将部分编码人TLR8蛋白的核苷酸序列导入非人动物基因组中,该动物体内能正常表达嵌合TLR8蛋白,可以作为人TLR8信号机理研究、肿瘤及免疫性疾病药物筛选的动物模型,对免疫靶点的新药研发具有重要的应用价值。
Description
技术领域
本发明属于动物基因工程和基因遗传修饰领域,具体地说,涉及一种TLR8基因修饰人源化动物模型的构建方法及其在生物医药领域的应用。
背景技术
Toll样受体(Toll-like receptors,TLR)能结合机体自身产生的一些内源性分子(即内源性配体)。免疫佐剂可增强抗肿瘤免疫,其分子和细胞机制得到进一步阐明,TLR也在其中扮演重要角色。由于肿瘤在发生发展过程中可以产生一些能被TLR识别的内源性配体,所以TLR在肿瘤免疫监视中可能发挥了一定作用。TLR家族在病原体识别和先天免疫的激活中起着重要作用,其中TLR8基因主要在单核、巨噬细胞和骨髓树突状细胞中表达,并且位于染色体X上的另一个家族成员TLR7附近,TLR8可以与TLR7结合形成异源复合物,两者都能识别单链RNA和短双链RNA,TLR7主要在类浆细胞的树突状细胞中表达,在B细胞和单核、巨噬细胞中也可表达。
TLR8是一种识别单链RNA(ssRNA)的内体小泡(endosomal)受体,可识别ssRNA病毒,如流感、仙台和柯萨奇B病毒(Coxsackie B viruses)。与病毒RNA的结合会募集MyD88并导致转录因子NF-κB的激活和抗病毒反应。同时TLR8可以识别病毒的单链RNA,例如HIV和HCV,TLR8可以识别富含GU的单链RNA。然而,单链RNA中富含GU的序列的存在不足以刺激TLR8。TLR8识别富含G的寡核苷酸。TLR8是I型跨膜蛋白,包括由25个富含亮氨酸的重复序列组成的胞外结构域,一个跨膜区,和胞内的TIR结构域(MyD88招募在TIR结构域)。TLR8似乎在人类和小鼠中的功能不同。例如,人TLR8能够被单链RNA(ssRNA)激活,但是小鼠的Tlr8不能被ssRNA和激动剂激活,但是鼠Tlr8能够被dsDNA牛痘病毒激活。人的TLR8不仅能下调TLR7和TLR9的信号,还可以上调TLR2表达。
TLR8基因激动剂(例如VTX-2337)已经在一些癌症的联合治疗中作为免疫刺激剂进行了临床试验,增强利妥昔单抗(rituximab)介导的抗体依赖性细胞毒性,另一种TLR8激动剂(VTX-1463)目前正在开发中,用于过敏性鼻炎的治疗。通过激活TLR8信号,细胞因子反应可以激活抗原细胞,从而诱导Th1型免疫反应。TLR8在自身免疫紊乱中起作用,如系统性红斑狼疮(SLE),还有X-linked inflammatory bowel disease(IBD)易感基因,其也与类风湿关节炎、牛皮癣、I型糖尿病有关。
实验动物疾病模型对于研究人类疾病发生的病因、发病机制、开发防治技术和开发药物是不可缺少的研究工具。但由于动物与人类的生理结构和代谢系统本身的差异,例如人TLR8和小鼠TLR8蛋白序列的一致性为66.58%,传统的动物模型并不能很好的反映人体的真实状况,在动物体内建立更接近人类的生理特征的疾病模型是生物医药行业的迫切需求。
鉴于TLR8在病原体识别和先天免疫的激活中起着重要作用,为进一步的研究相关的生物学特性,提高临床前期的药效试验的有效性,提高研发成功率,使临床前期的试验更有效并使研发失败最小化,本领域亟需开发涉及TLR8信号通路的非人动物模型。
发明内容
本发明的第一方面,提供了一种嵌合TLR8蛋白,所述的嵌合TLR8蛋白包含人TLR8蛋白的全部或部分。
优选的,所述的嵌合TLR8蛋白包含人TLR8蛋白的信号肽、跨膜区、胞质区和/或胞外区的全部或部分。
进一步优选的,所述的嵌合TLR8蛋白包含人TLR8蛋白的信号肽和/或胞外区的全部或部分。
优选的,所述的嵌合TLR8蛋白还包含非人动物TLR8蛋白的部分,优选为非人动物TLR8蛋白的跨膜区、胞质区。
进一步优选的,所述的嵌合TLR8蛋白包含信号肽、跨膜区、胞质区和胞外区,其中,所述的胞外区包含人TLR8蛋白胞外区的全部或部分,优选的,所述的人TLR8蛋白胞外区的部分包含至少200、250、300或350个氨基酸,例如至少200、250、300、350、400、500、600、700、750、770、790、791、791、793、794、795、780、800、801个氨基酸的与人TLR8蛋白胞外区一致,进一步优选的,包含792个氨基酸的人TLR8蛋白胞外区;优选的,包含C端去除0-15(例如0、1、2、3、4、5、6、7、8、9、10、11、12、13、14、15)个氨基酸的人TLR8蛋白胞外区,进一步优选的,包含C端去除9、5或2个氨基酸的人TLR8蛋白胞外区;更进一步优选的,胞外区包含与SEQ IDNO:4的第27-818位、第27-822位或第27-825位具有至少60%、65%、70%、80%、85%、90%、95%或至少99%同一性的氨基酸序列或者包含SEQ ID NO:4的第27-818位、第27-822位或第27-825位所示氨基酸序列;所述的信号肽包含人TLR8蛋白信号肽的全部或部分,优选的,所述的人TLR8蛋白信号肽的部分包含至少10、15或20个氨基酸,例如至少10、15、17、20、21、22、23、24、25、26个氨基酸的与人TLR8蛋白信号肽一致,进一步优选的,包含22个氨基酸的人TLR8蛋白信号肽;优选的,所述的信号肽的部分包含N端去除0-5(例如0、1、2、3、4、5)个氨基酸的人TLR8蛋白信号肽,进一步优选的,信号肽包含与SEQ ID NO:4的第5-26位具有至少60%、65%、70%、80%、85%、90%、95%或至少99%同一性的氨基酸序列或者包含SEQ IDNO:4的第5-26位所示的氨基酸序列。
优选的,所述的跨膜区、胞质区来源于非人动物,所述的胞外区至多0-15(例如0、1、2、3、4、5、6、7、8、9、10、11、12、13、14、15)个氨基酸序列来源于非人动物TLR8蛋白胞外区,信号肽至多0-5(例如0、1、2、3、4、5)个氨基酸序列来源于非人动物,其中,所述的胞外区包含C端0-15(例如0、1、2、3、4、5、6、7、8、9、10、11、12、13、14、15)个氨基酸序列来源于非人动物TLR8蛋白胞外区,进一步优选的,包含C端9、5或2个氨基酸序列来源于非人动物TLR8蛋白胞外区;优选的,所述的信号肽的部分包含N端0-5(例如0、1、2、3、4、5)个氨基酸序列来源于非人动物TLR8蛋白信号肽,进一步优选的,包含N端4个氨基酸序列来源于非人动物TLR8蛋白信号肽。
优选的,所述的嵌合TLR8蛋白包含人TLR8基因的1号外显子至2号外显子的全部或部分编码的氨基酸序列的全部或部分,优选的,所述的嵌合TLR8蛋白包含人TLR8基因的2号外显子的全部或部分编码的氨基酸序列的全部或部分;其中,2号外显子的部分至少包含500bp的核苷酸序列,例如至少包含500、1000、1500、2000、2200、2400、2440、2441、2442、2443、2444、2445、2450、2500、3000、3500、4000、4100、4162bp的核苷酸序列,进一步优选的,包含2442bp的核苷酸序列;优选的,所述人TLR8基因的2号外显子的部分至少包含从编码2号外显子第1-5(例如1、2、3、4、5)个氨基酸的核苷酸序列开始至编码胞外区C端1-15(例如1、2、3、4、5、6、7、8、9、10、11、12、13、14、15)个氨基酸的核苷酸序列为止,进一步优选的,从编码2号外显子第4个氨基酸的核苷酸序列开始。
优选的,所述的嵌合TLR8蛋白至少包含SEQ ID NO:7编码的氨基酸序列,或包含与SEQ ID NO:7编码的氨基酸序列具有至少75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%或至少99%同一性的氨基酸序列。
在本发明的一个具体实施方式中,所述的嵌合TLR8蛋白中包含的人TLR8蛋白的部分氨基酸序列选自下列组中的一种:
A)包含SEQ ID NO:4第5-818、1-818、1-822、1-825、5-822或5-825位氨基酸序列的全部或部分;
B)包含与SEQ ID NO:4的第5-818、1-818、1-822、1-825、5-822或5-825位氨基酸序列同一性至少为75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%或至少99%的氨基酸序列;
C)包含与SEQ ID NO:4的第5-818、1-818、1-822、1-825、5-822或5-825位所示氨基酸序列差异不超过10、9、8、7、6、5、4、3、2或不超过1个氨基酸的氨基酸序列;或
D)包含SEQ ID NO:4的第5-818、1-818、1-822、1-825、5-822或5-825位所示的,包括取代、缺失和/或插入一个或多个氨基酸残基的氨基酸序列。
在本发明的一个具体实施方式中,所述的嵌合TLR8蛋白的氨基酸序列选自下列组中的一种:
a)包含SEQ ID NO:13氨基酸序列的全部或部分;
b)包含与SEQ ID NO:13氨基酸序列同一性至少为75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%或至少99%的氨基酸序列;
c)包含与SEQ ID NO:13所示氨基酸序列差异不超过10、9、8、7、6、5、4、3、2或不超过1个氨基酸的氨基酸序列;或
d)包含SEQ ID NO:13所示的,包括取代、缺失和/或插入一个或多个氨基酸残基的氨基酸序列。
优选的,所述的非人动物可以选自啮齿类动物、猪、兔子、猴子等任何可以进行基因编辑制备基因人源化的非人动物。
优选的,所述的非人动物为非人哺乳动物,进一步优选的,所述的非人哺乳动物为啮齿类动物,更进一步优选的,所述的啮齿类动物为大鼠或小鼠。
优选的,所述的非人动物是免疫缺陷的非人哺乳动物,进一步优选的,所述的免疫缺陷的非人哺乳动物为免疫缺陷的啮齿类动物、免疫缺陷的猪、免疫缺陷的兔子或免疫缺陷的猴子,更优选的,所述的免疫缺陷的啮齿类动物为免疫缺陷的小鼠或大鼠,更进一步优选的,所述免疫缺陷鼠是NOD-Prkdcscid IL-2rγnull小鼠、NOD-Rag 1-/--IL2rg-/-(NRG)小鼠、Rag 2-/--IL2rg-/-(RG)小鼠、NOD/SCID小鼠或者裸鼠。
本发明的第二方面,提供了一种TLR8嵌合基因,所述的TLR8嵌合基因包含人TLR8基因的部分。
优选的,所述的TLR8嵌合基因包含编码人TLR8蛋白的信号肽、跨膜区、胞质区和/或胞外区的全部或部分核苷酸序列;优选的,所述的TLR8嵌合基因包含编码人TLR8蛋白的信号肽和/或胞外区的全部或部分核苷酸序列,其中,所述的胞外区包含人TLR8蛋白胞外区的全部或部分,进一步优选的,所述的人TLR8蛋白胞外区的部分包含至少200、250、300或350个氨基酸,例如至少200、250、300、350、400、500、600、700、750、770、790、791、791、793、794、795、780、800、801个氨基酸的与人TLR8蛋白胞外区一致,进一步优选的,包含792个氨基酸的人TLR8蛋白胞外区;优选的,包含C端去除0-15(例如0、1、2、3、4、5、6、7、8、9、10、11、12、13、14、15)个氨基酸的人TLR8蛋白胞外区,更优选的,包含C端去除9、5或2个氨基酸的人TLR8蛋白胞外区;更进一步优选的,胞外区包含与SEQ ID NO:4的第27-818位、第27-822位或第27-825位具有至少60%、65%、70%、80%、85%、90%、95%或至少99%同一性的氨基酸序列或者包含SEQ ID NO:4的第27-818位、第27-822位或第27-825位所示的氨基酸序列;所述的信号肽包含人TLR8蛋白信号肽的全部或部分,优选的,所述的人TLR8蛋白信号肽的部分包含至少10、15或20个氨基酸,例如至少10、15、17、20、21、22、23、24、25、26个氨基酸的与人TLR8蛋白信号肽一致,进一步优选的,包含22个氨基酸的人TLR8蛋白信号肽;优选的,所述的信号肽的部分包含N端去除0-5(例如0、1、2、3、4、5)个氨基酸的人TLR8蛋白信号肽,进一步优选的,信号肽包含与SEQ ID NO:4的第5-26位具有至少60%、65%、70%、80%、85%、90%、95%或至少99%同一性的氨基酸序列或者包含与SEQ ID NO:4的第5-26位所示的氨基酸序列。
优选的,所述的TLR8嵌合基因包含编码非人动物的跨膜区、胞质区的核苷酸序列,编码非人动物的胞外区至多0-15(例如0、1、2、3、4、5、6、7、8、9、10、11、12、13、14、15)个氨基酸的核苷酸序列,编码非人动物的信号肽至多0-5(例如0、1、2、3、4、5)个氨基酸的核苷酸序列,其中,所述的胞外区包含C端0-15(例如0、1、2、3、4、5、6、7、8、9、10、11、12、13、14、15)个氨基酸序列来源于非人动物TLR8蛋白胞外区,进一步优选的,包含C端前9、5或2个氨基酸序列来源于非人动物TLR8蛋白胞外区;优选的,所述的信号肽的部分包含N端后0-5(例如0、1、2、3、4、5)个氨基酸序列来源于非人动物TLR8蛋白信号肽,进一步优选的,包含N端后4个氨基酸序列来源于非人动物TLR8蛋白信号肽。
优选的,所述的TLR8嵌合基因编码上述的嵌合TLR8蛋白。
优选的,所述的TLR8嵌合基因包含人TLR8基因的1号外显子至2号外显子的全部或部分,优选的,所述的TLR8嵌合基因包含人TLR8基因的2号外显子的全部或部分;其中,2号外显子的部分至少包含500bp的核苷酸序列,例如至少包含500、1000、1500、2000、2200、2400、2440、2441、2442、2443、2444、2445、2450、2500、3000、3500、4000、4100、4162bp的核苷酸序列,进一步优选的,包含2442bp的核苷酸序列;优选的,所述人TLR8基因的2号外显子的部分至少包含从编码2号外显子第1-5(例如1、2、3、4、5)个氨基酸的核苷酸序列开始至编码胞外区C端1-15(例如1、2、3、4、5、6、7、8、9、10、11、12、13、14、15)个氨基酸的核苷酸序列为止,更优选的,从编码2号外显子第4个氨基酸的核苷酸序列开始。
优选的,所述的TLR8嵌合基因中至少包含SEQ ID NO:8、9、10或11所示核苷酸序列,或包含与SEQ ID NO:8、9、10或11具有至少75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%或至少99%同一性的核苷酸序列。
在本发明的一个具体实施方式中,所述的TLR8嵌合基因中包含的人TLR8核苷酸序列的部分选自下列组中的一种:
(A)包含SEQ ID NO:7所示核苷酸序列的全部或部分;
(B)包含与SEQ ID NO:7所示核苷酸序列的同一性至少为75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%或至少99%的核苷酸序列;
(C)包含与SEQ ID NO:7所示核苷酸序列差异不超过10、9、8、7、6、5、4、3、2或不超过1个核苷酸的核苷酸序列;
(D)具有SEQ ID NO:7所示核苷酸序列的,包括取代、缺失和/或插入一个或多个核苷酸的核苷酸序列。
优选的,所述的TLR8嵌合基因还包括非人动物TLR8基因的1号外显子、2号外显子和/或3号外显子的部分;优选的,所述的非人动物TLR8基因的1号外显子、2号外显子和/或3号外显子的部分与NC_000086.7相应的1号至3号外显子至少具有70%、80%、90%或至少95%的同一性;进一步优选的,所述的非人动物TLR8基因的1号外显子、2号外显子和/或3号外显子的部分与SEQ ID NO:1相应的1号外显子、2号外显子、3号外显子一致。
在本发明的一个具体实施方式中,所述的TLR8嵌合基因的核苷酸序列转录的mRNA选自下列组中的一种:
(a)包含SEQ ID NO:12所示核苷酸序列的全部或部分;
(b)包含与SEQ ID NO:12所示核苷酸序列的同一性至少为75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%或至少99%的核苷酸序列;
(c)包含与SEQ ID NO:12所示的核苷酸序列差异不超过10、9、8、7、6、5、4、3、2或不超过1个核苷酸的核苷酸序列;或
(d)包含SEQ ID NO:12所示的核苷酸序列所示的,包括取代、缺失和/或插入一个或多个核苷酸的核苷酸序列。
优选的,所述的TLR8嵌合基因包含编码人TLR8蛋白的cDNA。
优选的,所述的TLR8嵌合基因包含人TLR8 2号外显子的cDNA序列。
优选的,所述的TLR8嵌合基因还包含人TLR8非编码区。
任选地,所述的TLR8嵌合基因对于被替换的内源TLR8基因座是纯合或者杂合。
优选的,所述的TLR8嵌合基因还包括特异性诱导物或阻遏物,进一步优选的,所述的特异性诱导物或阻遏物可以为常规可以诱导或阻遏的物质。
在本发明的一个具体实施方式中,所述的特异性诱导物选自四环素系统(Tet-OffSystem/Tet-On System)或他莫昔芬系统(Tamoxifen System)。
优选的,所述的非人动物可以选自啮齿类动物、猪、兔子、猴子等任何可以进行基因编辑制备基因人源化的非人动物。
优选的,所述的非人动物为非人哺乳动物,进一步优选的,所述的非人哺乳动物为啮齿类动物,更进一步优选的,所述的啮齿类动物为大鼠或小鼠。
优选的,所述的非人动物是免疫缺陷的非人哺乳动物,进一步优选的,所述的免疫缺陷的非人哺乳动物为免疫缺陷的啮齿类动物、免疫缺陷的猪、免疫缺陷的兔子或免疫缺陷的猴子,更优选的,所述的免疫缺陷的啮齿类动物为免疫缺陷的小鼠或大鼠,更进一步优选的,所述免疫缺陷鼠是NOD-Prkdcscid IL-2rγnull小鼠、NOD-Rag 1-/--IL2rg-/-(NRG)小鼠、Rag 2-/--IL2rg-/-(RG)小鼠、NOD/SCID小鼠或者裸鼠。
本发明的第三方面,提供了一种靶向载体,所述的靶向载体包含人TLR8基因的部分。
优选的,所述的人TLR8基因的部分包含人TLR8基因的1号至2号外显子的全部或部分;优选的,所述的人TLR8基因的部分包含2号外显子的全部或部分;其中,2号外显子的部分至少包含500bp的核苷酸序列,例如至少包含500、1000、1500、2000、2200、2400、2440、2441、2442、2443、2444、2445、2450、2500、3000、3500、4000、4100、4162bp的核苷酸序列,进一步优选的,包含2442bp的核苷酸序列;优选的,所述人TLR8基因的2号外显子的部分至少包含从编码第1-5(例如1、2、3、4、5)个氨基酸的核苷酸序列开始至编码胞外区C端1-15(例如1、2、3、4、5、6、7、8、9、10、11、12、13、14、15)个氨基酸的核苷酸序列为止,进一步优选的,从编码2号外显子第4个氨基酸的核苷酸序列开始,更优选的,所述的人TLR8核苷酸序列的部分至少包含SEQ ID NO:7所示核苷酸序列。
优选的,所述的靶向载体包含编码人TLR8蛋白的cDNA。
优选的,所述的靶向载体包含人TLR8 2号外显子的cDNA序列。
优选的,所述的靶向载体还包含人TLR8非编码区。
优选的,所述的靶向载体还包含与待改变的转换区5’端同源的DNA片段,即5’臂,其选自非人动物TLR8基因基因组DNA的100-10000个长度的核苷酸;优选的,所述的5’臂与NCBI登录号为NC_000086.7至少具有90%同源性的核苷酸;进一步优选的,所述5’臂序列与SEQ ID NO:5至少具有90%同源性,或者如SEQ ID NO:5所示。
优选的,所述的靶向载体还包含与待改变的转换区3’端同源的DNA片段,即3’臂,其选自非人动物TLR8基因基因组DNA的100-10000个长度的核苷酸;优选的,所述的3’臂与NCBI登录号为NC_000086.7至少具有90%同源性的核苷酸;进一步优选的,所述的3’臂序列与SEQ ID NO:6至少具有90%同源性,或者如SEQ ID NO:6所示。
优选的,所述的待改变的转换区位于非人动物TRL8基因座上,进一步优选的,所述的待改变的转换区位于非人动物TRL8基因3号外显子上。
优选的,所述的靶向载体还包含SEQ ID NO:8、9、10和/或11。
优选的,所述的靶向载体还包含标记基因,进一步优选的,所述标记基因为负筛选标记的编码基因,更进一步优选的,所述负筛选标记的编码基因为白喉毒素A亚基的编码基因(DTA)。
在本发明的一个具体实施方式中,所述的靶向载体中还包括阳性克隆筛选的抗性基因,进一步优选的,所述阳性克隆筛选的抗性基因为新霉素磷酸转移酶编码序列Neo。
在本发明的一个具体实施方式中,所述的靶向载体中还包括特异性重组系统,进一步优选的,所述特异性重组系统为Frt重组位点(也可选择常规的LoxP重组系统),所述的特异性重组系统为具有两个Frt重组位点,分别连接在抗性基因的两侧。
优选的,所述的非人动物可以选自啮齿类动物、猪、兔子、猴子等任何可以进行基因编辑制备基因人源化的非人动物。
优选的,所述的非人动物为非人哺乳动物,进一步优选的,所述的非人哺乳动物为啮齿类动物,更进一步优选的,所述的啮齿类动物为大鼠或小鼠。
优选的,所述的非人动物是免疫缺陷的非人哺乳动物,进一步优选的,所述的免疫缺陷的非人哺乳动物为免疫缺陷的啮齿类动物、免疫缺陷的猪、免疫缺陷的兔子或免疫缺陷的猴子,更优选的,所述的免疫缺陷的啮齿类动物为免疫缺陷的小鼠或大鼠,更进一步优选的,所述免疫缺陷鼠是NOD-Prkdcscid IL-2rγnull小鼠、NOD-Rag 1-/--IL2rg-/-(NRG)小鼠、Rag 2-/--IL2rg-/-(RG)小鼠、NOD/SCID小鼠或者裸鼠。
本发明的第四方面,提供了一种包含上述靶向载体的细胞。
本发明的第五方面,提供了上述靶向载体,或者上述的细胞在TLR8基因修饰中的应用,优选的,所述的应用包括但不限于敲除、插入或替换。
本发明的第六方面,提供了一种TLR8基因人源化的非人动物,所述的非人动物体内表达人TLR8蛋白或嵌合TLR8蛋白。
优选的,所述的非人动物的内源TLR8蛋白表达降低或缺失。
优选的,所述的嵌合TLR8蛋白选自上述的嵌合TLR8蛋白。
优选的,所述的非人动物体内包含人TLR8基因的部分,进一步优选的,所述的非人动物体内包含上述的TLR8嵌合基因。
优选的,所述的人TLR8基因的部分或TLR8嵌合基因的核苷酸序列可操作的连接至内源调控元件后。
优选的,所述的非人动物可以选自啮齿类动物、猪、兔子、猴子等任何可以进行基因编辑制备基因人源化的非人动物。
优选的,所述的非人动物为非人哺乳动物,进一步优选的,所述的非人哺乳动物为啮齿类动物,更进一步优选的,所述的啮齿类动物为大鼠或小鼠。
优选的,所述的非人动物是免疫缺陷的非人哺乳动物,进一步优选的,所述的免疫缺陷的非人哺乳动物为免疫缺陷的啮齿类动物、免疫缺陷的猪、免疫缺陷的兔子或免疫缺陷的猴子,更优选的,所述的免疫缺陷的啮齿类动物为免疫缺陷的小鼠或大鼠,更进一步优选的,所述免疫缺陷鼠是NOD-Prkdcscid IL-2rγnull小鼠、NOD-Rag 1-/--IL2rg-/-(NRG)小鼠、Rag 2-/--IL2rg-/-(RG)小鼠、NOD/SCID小鼠或者裸鼠。
本发明的第七方面,提供了一种TLR8基因人源化的非人动物的构建方法,所述的非人动物体内表达人TLR8蛋白或嵌合TLR8蛋白。
优选的,所述的嵌合TLR8蛋白如上述的嵌合TLR8蛋白。
优选的,所述的非人动物的基因组中还包含人TLR8基因或TLR8嵌合基因,所述的TLR8嵌合基因如上述的TLR8嵌合基因。
优选的,所述的非人动物为上述的TLR8基因人源化的非人动物。
优选的,所述的人TLR8基因的部分或TLR8嵌合基因在非人动物体内通过内源或外源调控元件进行调控。进一步优选的,所述的调控元件为启动子。
优选的,所述的构建方法包括用包含人TLR8基因的部分核苷酸序列导入非人动物TLR8基因座;优选的,用包含人TLR8基因的1号至2号外显子的全部或部分核苷酸序列导入非人动物TLR8基因座,优选的,用包含人TLR8基因的2号外显子的部分核苷酸序列导入非人动物TLR8基因座;其中,2号外显子的部分至少包含500bp的核苷酸序列,例如至少包含500、1000、1500、2000、2200、2400、2440、2441、2442、2443、2444、2445、2450、2500、3000、3500、4000、4100、4162bp的核苷酸序列,进一步优选的,包含2442bp的核苷酸序列;优选的,所述人TLR8基因的2号外显子的部分至少包含从编码2号外显子第1-5(例如1、2、3、4、5)个氨基酸的核苷酸序列开始至编码胞外区C端1-15(例如1、2、3、4、5、6、7、8、9、10、11、12、13、14、15)个氨基酸的核苷酸序列为止,进一步优选的,从编码2号外显子第4个氨基酸的核苷酸序列开始,更优选的,用包含SEQ ID NO:7的核苷酸序列导入非人动物TLR8基因座。
优选的,所述的构建方法包括用包含人TLR8基因的2号外显子全部或部分核苷酸序列导入非人动物TLR8基因的3号外显子的全部或部分;所述非人动物TLR8核苷酸序列的3号外显子的部分至少包含从编码非人动物3号外显子第1-5(例如1、2、3、4、5)个氨基酸的核苷酸序列开始至编码胞外区C端1-15(例如1、2、3、4、5、6、7、8、9、10、11、12、13、14、15)个氨基酸的核苷酸序列为止,更进一步优选的,从编码非人动物3号外显子第4个氨基酸的核苷酸序列开始。
优选的,所述的构建方法包括用包含编码人TLR8蛋白的信号肽、跨膜区、胞质区和/或胞外区的全部或部分核苷酸序列导入非人动物TLR8基因座;进一步优选的,用包含编码人TLR8蛋白的信号肽和/或胞外区的全部或部分核苷酸序列导入非人动物TLR8基因座,其中,所述的胞外区包含人TLR8蛋白胞外区的全部或部分,优选的,所述的人TLR8蛋白胞外区的部分包含至少200、250、300或350个氨基酸,例如至少200、250、300、350、400、500、600、700、750、770、790、791、791、793、794、795、780、800、801个氨基酸的与人TLR8蛋白胞外区一致,进一步优选的,包含792个氨基酸的人TLR8蛋白胞外区;优选的,包含C端去除0-15(例如0、1、2、3、4、5、6、7、8、9、10、11、12、13、14、15)个氨基酸的人TLR8蛋白胞外区,进一步优选的,包含C端去除9、5或2个氨基酸的人TLR8蛋白胞外区;更进一步优选的,胞外区包含与SEQID NO:4的第27-818位、第27-822位或第27-825位具有至少60%、65%、70%、80%、85%、90%、95%或至少99%同一性的氨基酸序列或者包含与SEQ ID NO:4的第27-818位、第27-822位或第27-825位所示的氨基酸序列;所述的信号肽包含人TLR8蛋白信号肽的全部或部分,优选的,所述的人TLR8蛋白信号肽的部分包含至少10、15或20个氨基酸,例如至少10、15、17、20、21、22、23、24、25、26个氨基酸的与人TLR8蛋白信号肽一致,进一步优选的,包含22个氨基酸的人TLR8蛋白信号肽;优选的,所述的信号肽的部分包含N端去除0-5(例如0、1、2、3、4、5)个氨基酸的人TLR8蛋白信号肽,进一步优选的,信号肽包含与SEQ ID NO:4的第5-26位具有至少60%、65%、70%、80%、85%、90%、95%或至少99%同一性的氨基酸序列或者包含与SEQ ID NO:4的第5-26位所示的氨基酸序列。
在本发明的一个具体实施方式中,用包含编码人TLR8蛋白的cDNA序列导入非人动物TLR8基因座。
优选的,所述的构建方法包括用包含所述TLR8嵌合基因的核苷酸序列导入非人动物TLR8基因座。
优选的,所述的构建方法包括用包含编码所述嵌合TLR8蛋白的核苷酸序列导入非人动物TLR8基因座。
优选的,本申请中所述的导入包括但不限于插入、替换或转基因,所述的替换优选为原位替换。
优选的,所述的插入或替换位点为TLR8基因的内源调控元件之后。
优选的,所述的插入为首先破坏非人动物内源TLR8基因的编码框,随后进行插入操作,或者所述的插入步骤既可给内源TLR8基因造成移码突变又可以实现插入人源序列的步骤。
优选的,所述的非人动物是纯合或者杂合的。
优选的,所述非人动物的基因组中至少一个染色体上包含TLR8嵌合基因。
优选的,所述的非人动物中至少一个细胞表达人TLR8蛋白或嵌合TLR8蛋白。
优选的,使用基因编辑技术进行非人动物的构建,所述的基因编辑技术包括利用胚胎干细胞的基因打靶技术、CRISPR/Cas9技术、锌指核酸酶技术、转录激活子样效应因子核酸酶技术、归巢核酸内切酶或其他分子生物学技术。
优选的,使用上述的靶向载体进行非人动物的构建。
本发明的第八方面,提供了一种多基因修饰的非人动物的构建方法,包括如下步骤:
I)提供上述的TLR8基因人源化的非人动物或TLR8基因敲除的非人动物,或者采用上述的构建方法获得的TLR8基因人源化的非人动物;
II)将步骤I)提供的非人动物与其他基因修饰的非人动物交配、体外受精或直接进行基因编辑,并进行筛选,得到多基因修饰的非人动物。
优选的,所述的其他基因修饰的非人动物包括但不限于基因CD3、PD-L1、PD-1等人源化的非人动物。
优选的,所述的多基因修饰的非人动物为双基因人源化非人动物、三基因人源化非人动物、四基因人源化非人动物、五基因人源化非人动物、六基因人源化非人动物、七基因人源化非人动物、八基因人源化非人动物或九基因人源化非人动物。
优选的,所述的多基因修饰的非人动物的基因组中人源化的多个基因中的每一个基因均可以是纯合或杂合的。
本发明的第九方面,提供了一种上述构建方法获得的非人动物或其子代,所述的非人动物或其子代选自TLR8基因人源化的非人动物、TLR8基因敲除的非人动物或者多基因修饰的非人动物。
本发明的第十方面,提供了一种动物模型,所述的动物模型来源于上述的非人动物、上述的构建方法获得的非人动物,或者,上述的非人动物或其子代。优选的,所述的动物模型为荷瘤或炎症动物模型。
本发明的第十一方面,提供了一种动物模型的制备方法,所述的制备方法包括上述构建TLR8基因人源化的非人动物、TLR8基因敲除的非人动物或多基因修饰的非人动物的步骤;优选的,还包括植入肿瘤细胞的步骤。优选的,所述的动物模型为荷瘤或炎症动物模型。
本发明的第十二方面,提供了上述TLR8基因人源化的非人动物、上述TLR8基因敲除的非人动物、上述构建方法获得的TLR8基因人源化的非人动物、上述TLR8基因敲除的非人动物或多基因修饰的非人动物或其子代在制备动物模型中的应用。优选的,所述的动物模型为荷瘤或炎症动物模型。
本发明的第十三方面,提供了一种细胞或细胞系或原代细胞培养物,所述细胞或细胞系或原代细胞培养物来源于上述的非人动物、上述的构建方法获得的非人动物、上述的非人动物或其子代,或者上述的动物模型。优选的,所述的细胞或细胞系或原代细胞培养物不能发育为动物个体。
本发明的第十四方面,提供了一种组织或器官或其培养物,所述组织或器官或其培养物来源于上述的非人动物、上述的构建方法获得的非人动物、上述的非人动物或其子代,或者上述的动物模型。优选的,所述的组织或器官或其培养物不能发育为动物个体。
本发明的第十五方面,提供了一种荷瘤后的瘤组织,所述的瘤组织来源于上述的非人动物、上述的构建方法获得的非人动物、上述的非人动物或其子代,或者上述的动物模型。优选的,所述的荷瘤后的瘤组织不能发育为动物个体。
本发明的第十六方面,提供了一种TLR8基因人源化的细胞,所述的细胞表达人TLR8蛋白或嵌合TLR8蛋白。
优选的,所述的细胞中内源TLR8蛋白的表达降低或缺失。
优选的,所述的细胞的基因组中包含人TLR8核苷酸序列的部分。进一步优选的,所述的细胞包含上述的TLR8嵌合基因。
优选的,所述的细胞不能发育为动物个体。
本发明的第十七方面,提供了一种TLR8基因敲除的细胞,所述的细胞中缺失TLR8基因的全部或部分核苷酸序列。
优选的,所述的细胞缺失TLR8基因的3号外显子的全部或部分核苷酸序列。
本发明的第十八方面,提供了一种表达上述的嵌合TLR8蛋白的构建体。优选的,所述的构建体中包含上述TLR8嵌合基因。
优选的,所述的细胞不能发育为动物个体。
本发明的第十九方面,提供了一种包含上述构建体的细胞。优选的,所述的细胞不能发育为动物个体。
本发明的第二十方面,提供了一种包含上述细胞的组织。优选的,所述的组织不能发育为动物个体。
本发明的第二十一方面,提供了来源于上述的嵌合TLR8蛋白、上述的TLR8嵌合基因、上述的非人动物、上述的构建方法获得的非人动物、上述的非人动物或其子代、上述的动物模型、上述的细胞或细胞系或原代细胞培养物、上述的组织或器官或其培养物、上述的荷瘤后的瘤组织、上述的细胞、上述的构建体、上述的细胞或上述的组织在需要涉及人类细胞的免疫过程的产品开发,制造抗体,或者作为药理学、免疫学、微生物学、医学研究的模型系统中的应用;或者在生产和利用动物实验疾病模型,用于开发新的诊断策略和/或治疗策略中的应用;或者在筛选、验证、评价或研究TLR8功能、人TLR8信号机理、靶向人的抗体、靶向人的药物、药效,免疫相关疾病药物以及抗肿瘤或抗炎症药物,筛选和评估人用药及药效研究方面的应用。
优选的,所述应用不是疾病的治疗和/或诊断方法。
本发明的第二十二方面,提供了上述TLR8基因人源化的非人动物、上述TLR8基因敲除的非人动物、上述构建方法获得的TLR8基因人源化的非人动物、上述TLR8基因敲除的非人动物或多基因修饰的非人动物或其子代在制备人TLR8特异性调节剂或者筛选人TLR8特异性调节剂的产品中的应用。
本发明的第二十三方面,提供了一种人TLR8特异性调节剂的筛选方法,所述的筛选方法包括向植入肿瘤细胞的个体施加调节剂,检测肿瘤抑制性;其中,所述的个体选自上述的非人动物、上述的构建方法获得的非人动物、上述的非人动物或其子代,或者上述的动物模型。
优选的,所述的调节剂选自CAR-T、药物。进一步优选的,所述的药物为抗体。
优选的,所述的调节剂为单抗或双特异性抗体或两种及两种以上药物的联合使用。
优选的,所述检测包括测定肿瘤细胞的大小和/或增殖速率。
优选的,所述检测的方法包括游标卡尺测量、流式细胞检测和/或动物活体成像检测。
优选的,所述的检测包括评估个体体重、脂肪量、活化途径、神经保护活性或代谢变化,所述的代谢变化包括食物消耗或水消耗的变化。
优选的,所述的肿瘤细胞来源于人或非人动物。
优选的,所述人TLR8特异性调节剂的筛选方法不是治疗方法。该方法用来筛选或评价药物,对候选药物的药效进行检测和比较,以确定哪些候选药物可以作为药物,哪些不能作为药物,或者,比较不同药物的药效敏感程度,即治疗效果不是必然的,只是一种可能性。
本发明的第二十四方面,提供了一种干预方案的评价方法,所述的评价方法包括向个体植入肿瘤细胞,向植入肿瘤细胞的个体施加干预方案,对施加干预方案后的个体进行肿瘤抑制效果检测和评价;其中,所述的个体选自上述的非人动物,上述的构建方法获得的非人动物,上述的非人动物或其子代,或者上述的动物模型。
优选的,所述的干预方案选自CAR-T、药物治疗。进一步优选的,所述的药物为抗原结合蛋白。所述的抗体结合蛋白为抗体。
优选的,所述的肿瘤细胞来源于人或非人动物。
优选的,所述干预方案的评价方法不是治疗方法。该评价方法对干预方案的效果进行检测和评价,以确定该干预方案是否有治疗效果,即治疗效果不是必然的,只是一种可能性。
本发明的第二十五方面,提供了一种来源于上述的非人动物、上述的构建方法获得的非人动物、上述的非人动物或其子代、上述的肿瘤或炎症模型在制备治疗肿瘤、炎症或自身免疫疾病的药物中的用途。
本发明所述的“肿瘤”包括但不限于淋巴瘤、B细胞肿瘤、T细胞肿瘤、骨髓/单核细胞肿瘤、非小细胞肺癌、白血病、卵巢癌、鼻咽癌、乳腺癌、子宫内膜癌、结肠癌、直肠癌、胃癌、膀胱癌、肺癌、支气管癌、骨癌、前列腺癌、胰腺癌、肝和胆管癌、食管癌、肾癌、甲状腺癌、头颈部癌、睾丸癌、胶质母细胞瘤、星形细胞瘤、黑色素瘤、骨髓增生异常综合征、以及肉瘤。其中,所述的白血病选自急性淋巴细胞性(成淋巴细胞性)白血病、急性骨髓性白血病、髓性白血病、慢性淋巴细胞性白血病、多发性骨髓瘤、浆细胞白血病、以及慢性骨髓性白血病;所述淋巴瘤选自霍奇金淋巴瘤和非霍奇金淋巴瘤,包括B细胞淋巴瘤、弥漫性大B细胞淋巴瘤、滤泡性淋巴瘤、套细胞淋巴瘤、边缘区B细胞淋巴瘤、T细胞淋巴瘤、和瓦尔登斯特伦巨球蛋白血症;所述肉瘤选自骨肉瘤、尤文肉瘤、平滑肌肉瘤、滑膜肉瘤、软组织肉瘤、血管肉瘤、脂肪肉瘤、纤维肉瘤、横纹肌肉瘤、及软骨肉瘤。在本发明的一个具体实施方式中,所述的肿瘤选自B细胞肿瘤、T细胞肿瘤、骨髓/单核细胞肿瘤。优选包括B或T细胞急性淋巴细胞白血病(ALL)、急性髓细胞白血病(AML)、非霍奇金淋巴瘤(NHL)和多发性骨髓瘤(MM)、鼻咽癌、肺癌。
本发明所述的“免疫相关疾病”包括但不限于过敏、哮喘、心肌炎、肾炎、肝炎、系统性红斑狼疮、类风湿性关节炎、硬皮病、甲状腺功能亢进、原发性血小板减少性紫癜、自身免疫性溶血性贫血、溃疡性结肠炎、自身免疫性肝病、糖尿病、疼痛或神经障碍等。在本发明的一个具体实施方式中。所述的免疫相关疾病为类风湿性关节炎。
本发明所述的“炎症”包括急性炎症,也包括慢性炎症。具体的,包括但不限于变质性炎症、渗出性炎症(浆液性炎、纤维素性炎、化脓性炎、出血性炎、坏死性炎、卡他性炎)、增生性炎症、特异性炎症(结核、梅毒、麻疯、淋巴肉芽肿等)。
本发明所述的TLR8基因人源化的非人动物体内可以正常表达人TLR8蛋白或嵌合TLR8蛋白。可用于针对人TLR8靶位点的药物筛选、药效评估、免疫疾病和肿瘤治疗,可以加快新药研发过程、节约时间和成本。对于研究TLR8蛋白功能及相关疾病药物筛选提供了有效的保障。
本发明所述的“全部或部分”,“全部”为整体,“部分”为整体中的局部,或者组成整体的个体。
本发明所述的“嵌合TLR8蛋白”,包含来源于人TLR8蛋白的部分和非人TLR8蛋白的部分。其中,所述的“人TLR8蛋白”同“人TLR8蛋白的全部”,即其氨基酸序列与人TLR8蛋白的全长氨基酸序列一致。所述的“人TLR8蛋白的部分”,为连续或间隔的5-1041个氨基酸序列与人TLR8蛋白的氨基酸序列一致。优选为连续或间隔10-814,10-818,10-821,10-822,10-825,更优选为连续5、10、20、30、40、50、60、70、80、90、100、200、300、400、500、600、700、800、814、818、821、822、825、900、1000、1010、1020、1030、1040、1041个氨基酸序列与人TLR8蛋白的氨基酸序列一致。
本发明所述的“人TLR8蛋白的跨膜区的全部”、“人TLR8蛋白的胞质区的全部”或“人TLR8蛋白的胞外区的全部”,代表其氨基酸序列分别与人TLR8蛋白的跨膜区、胞质区或胞外区的全长氨基酸序列一致。
本发明所述的“人TLR8蛋白的信号肽的部分”,为连续或间隔5-26个氨基酸序列与人TLR8蛋白的信号肽氨基酸序列一致,优选为连续或间隔5-22个,更优选为连续5、10、20、21、22、23、24、25、26个氨基酸序列与人TLR8蛋白的信号肽氨基酸序列一致。
本发明所述的“人TLR8蛋白的胞外区的部分”,为连续或间隔5-801个氨基酸序列与人TLR8蛋白的胞外区氨基酸序列一致,优选为连续或间隔5-792个,5-796个,5-799个,更优选为连续5、10、20、30、40、50、60、70、80、90、100、200、300、350、400、500、600、700、750、760、770、780、790、792、796、799、800、801个氨基酸序列与人TLR8蛋白的胞外区氨基酸序列一致。
本发明所述的“TLR8嵌合基因”,包含来源于人TLR8核苷酸序列的部分和非人TLR8基因的部分。其中,所述的“人TLR8核苷酸序列”同“人TLR8核苷酸序列的全部”,即其核苷酸序列与人TLR8核苷酸序列的全长核苷酸序列一致。所述的“人TLR8核苷酸序列的部分”为连续或间隔的20-16549bp个核苷酸序列与人TLR8核苷酸序列一致,优选为20-2442个,更优选为20、50、100、200、300、400、500、600、700、800、900、1000、2000、2442、3000、4000、5000、6000、7000、8000、9000、10000、11000、12000、13000、14000、15000、16000、16549bp个核苷酸序列与人TLR8核苷酸序列一致。
本发明所述的“xx号至xxx号外显子”或“xx号至xxx号外显子的全部”包含外显子及其期间的内含子的核苷酸序列,例如所述的“1号至3号外显子”包含1号外显子、1-2号内含子、2号外显子、2-3号内含子、3号外显子的全部核苷酸序列。
本发明所述的“x-xx号内含子”表示x号外显子与xx号外显子之间的内含子。例如“1-2号内含子”表示1号外显子与2号外显子之间的内含子。
本发明所述的“外显子的部分”表示连续或间隔几个、几十个或几百个核苷酸序列与全部的外显子核苷酸序列一致。例如人TLR8核苷酸序列的2号外显子的部分,包含连续或间隔的5-4125bp个,优选10-2441bp个核苷酸序列与人TLR8核苷酸序列的2号外显子核苷酸序列一致。在本发明的一个具体实施方式中,所述的“TLR8嵌合基因”中包含的“2号外显子的部分”至少包括从2号外显子第2个核苷酸开始至2号外显子的最后一个核苷酸序列为止。
本发明所述的“基因座”广义上讲代表基因在染色体上所占的位置,狭义上讲代表某一基因上的一段DNA片段,即可以是一个基因也可以是一个基因的一部分。例如所述的“TLR8基因座”表示TLR8基因1号至3号外显子上的任选一段的DNA片段。优选为1号外显子、2号外显子、3号外显子的任一个或两个或多个的组合,或一个或两个或多个的全部或部分。
本发明所述的“核苷酸序列”包含天然的或经过修饰的核糖核苷酸序列、脱氧核糖核苷酸序列。优选为DNA、cDNA、pre-mRNA、mRNA、rRNA、hnRNA、miRNAs、scRNA、snRNA、siRNA、sgRNA、tRNA。
本发明所述的“治疗”表示减缓、中断、阻止、控制、停止、减轻、或逆转一种体征、症状、失调、病症、或疾病的进展或严重性,但不一定涉及所有疾病相关体征、症状、病症、或失调的完全消除,且是指在疾病已开始发展后改善疾病或病理状态的体征、症状等等的治疗干预。
本发明所述的“同源性”,是指在使用氨基酸序列或核苷酸序列的方面,本领域技术人员在保证与已知序列相似结构或功能的前提下,可以根据实际工作需要对序列进行调整,使使用序列与现有技术获得的序列相比,具有(包括但不限于)1%,2%,3%,4%,5%,6%,7%,8%,9%,10%,11%,12%,13%,14%,15%,16%,17%,18%,19%,20%,21%,22%,23%,24%,25%,26%,27%,28%,29%,30%,31%,32%,33%,34%,35%,36%,37%,38%,39%,40%,41%,42%,43%,44%,45%,46%,47%,48%,49%,50%,51%,52%,53%,54%,55%,56%,57%,58%,59%,60%,70%,80%,81%,82%,83%,84%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%,99.1%,99.2%,99.3%,99.4%,99.5%,99.6%,99.7%,99.8%,99.9%的同一性。
本领域的技术人员能够确定并比较序列元件或同一性程度,以区分另外的小鼠和人序列。
除非特别说明,本发明的实践将采取细胞生物学、细胞培养、分子生物学、转基因生物学、微生物学、重组DNA和免疫学的传统技术。这些技术在以下文献中进行了详细的解释。例如:Molecular Cloning A Laboratory Manual,2ndEd.,ed.By Sambrook,FritschandManiatis(Cold Spring Harbor Laboratory Press:1989);DNA Cloning,Volumes Iand II(D.N.Glovered.,1985);Oligonucleotide Synthesis(M.J.Gaited.,1984);Mullisetal.U.S.Pat.No.4,683,195;Nucleic Acid Hybridization(B.D.Hames&S.J.Higginseds.1984);Transcription And Translation(B.D.Hames&S.J.Higginseds.1984);Culture Of Animal Cells(R.I.Freshney,AlanR.Liss,Inc.,1987);Immobilized Cells And Enzymes(IRL Press,1986);B.Perbal,A PracticalGuide To Molecular Cloning(1984);the series,Methods In ENZYMOLOGY(J.Abelsonand M.Simon,eds.inchief,Academic Press,Inc.,New York),specifically,Vols.154and 155(Wuetal.eds.)and Vol.185,″Gene ExpressionTechnology″(D.Goeddel,ed.);Gene Transfer Vectors For Mammalian Cells(J.H.Miller andM.P.Caloseds.,1987,Cold Spring Harbor Laboratory);Immunochemical Methods InCell And Molecular Biology(Mayer and Walker,eds.,Academic Press,London,1987);Handbook Of Experimental Immunology,Volumes V(D.M.Weir and C.C.Blackwell,eds.,1986);and Manipulating the Mouse Embryo,(Cold Spring Harbor LaboratoryPress,Cold Spring Harbor,N.Y.,1986)。
在一个方面,所述非人动物是哺乳动物。优选的,所述非人动物是小型哺乳动物,例如跳鼠科。在一个实施方式中,所述非人动物是啮齿动物。在一个实施方式中,所述啮齿动物选自小鼠、大鼠和仓鼠。在一个实施方式中,所述啮齿动物选自鼠家族。在一个实施方式中,所述基因修饰的动物来自选自丽仓鼠科(例如小鼠样仓鼠)、仓鼠科(例如仓鼠、新世界大鼠和小鼠、田鼠)、鼠总科(真小鼠和大鼠、沙鼠、刺毛鼠、冠毛大鼠)、马岛鼠科(登山小鼠、岩小鼠、有尾大鼠、马达加斯加大鼠和小鼠)、刺睡鼠科(例如多刺睡鼠)和鼹形鼠科(例如摩尔大鼠、竹大鼠和鼢鼠)家族。在一个特定实施方式中,所述基因修饰的啮齿动物选自真小鼠或大鼠(鼠总科)、沙鼠、刺毛鼠和冠毛大鼠。在一个实施方式中,所述基因修饰的小鼠来自鼠科家族成员。在一个实施方式中,所述动物是啮齿动物。在一个特定实施方式中,所述啮齿动物选自小鼠和大鼠。在一个实施方式中,所述非人动物是小鼠。
在一个特定实施方式中,所述非人动物是啮齿动物,其为选自BALB/c、A、A/He、A/J、A/WySN、AKR、AKR/A、AKR/J、AKR/N、TA1、TA2、RF、SWR、C3H、C57BR、SJL、C57L、DBA/2、KM、NIH、ICR、CFW、FACA、C57BL/A、C57BL/An、C57BL/GrFa、C57BL/KaLwN、C57BL/6、C57BL/6J、C57BL/6ByJ、C57BL/6NJ、C57BL/10、C57BL/10ScSn、C57BL/10Cr和C57BL/Ola的C57BL、C58、CBA/Br、CBA/Ca、CBA/J、CBA/st、CBA/H品系的小鼠及NOD、NOD/SCID、NOD-Prkdcscid IL-2rgnull背景的小鼠。
以上只是概括了本发明的一些方面,不是也不应该认为是在任何方面限制本发明。
本说明书提到的所有专利和出版物都是通过参考文献作为整体而引入本发明的。本领域的技术人员应认识到,对本发明可作某些改变并不偏离本发明的构思或范围。
下面的实施例进一步详细说明本发明,不能认为是限制本发明或本发明所说明的具体方法的范围。
附图说明
以下,结合附图来详细说明本发明的实施例,其中:
图1:小鼠TLR8基因座和人TLR8基因座对比示意图(非按比例);
图2:小鼠TLR8基因人源化改造示意图(非按比例);
图3:TLR8基因打靶策略及靶向A片段设计示意图(非按比例);
图4:TLR8重组后细胞Southern blot结果,其中WT为野生型对照;
图5:TLR8小鼠人源化FRT重组过程示意图(非按比例);
图6:TLR8人源化小鼠F1代基因型鉴定结果,其中,WT为野生型,H2O为水对照,PC为阳性对照。
具体实施方式
下面结合具体实施例来进一步描述本发明,本发明的优点和特点将会随着描述而更为清楚。但这些实施例仅是范例性的,并不对本发明的范围构成任何限制。本领域技术人员应该理解的是,在不偏离本发明的精神和范围下可以对本发明技术方案的细节和形式进行修改或替换,但这些修改和替换均落入本发明的保护范围内。
在下述每一实施例中,设备和材料是从以下所指出的几家公司获得:
EcoNI、ScaI、DraIII酶购自NEB,货号分别为R0521、R3122、R3510;
Lipopolysaccharides from Escherichia coli O111:B4购自Sigma,货号:L2630;
Attune Nxt Acoustic Focusing Cytometer购自ThermoFisher,型号AttuneNxt;
PrimeScript 1stStrandcDNASynthesisKit购自TAKARA,型号6110A;
HeraeusTM FrescoTM 21Microcentrifuge购自ThermoFisher,型号Fresco 21;
Brilliant Violet 711TM anti-mouse TCRβchain购自Biolegend,货号:109243;
Brilliant Violet 510TM anti-mouse CD45购自Biolegend,货号:103138;
PE/CyTM 7anti-mouse/rat Foxp3购自eBioscienceTM,货号:25-5773-82;
Brilliant Violet 421TM anti-mouse CD4购自Biolegend,货号:100438;
PerCP anti-mouse Ly-6G/Ly-6C(Gr-1)Antibody购自Biolegend,货号:108426;
FITC anti-mouse F4/80购自Biolegend,货号:123108;
V450 Rat Anti-mouse CD11b购自Biolegend,货号:560455;
APC anti-human CD288(TLR8)Antibody购自Biolegend,货号:395505;
TLR8 Monoclonal Antibody(44C143),PE购自Invitrogen,货号:MA5-16194;
Brilliant Violet 605TManti-mouse CD11c购自Biolegend,货号:117334;
PE Mouse IgG1,κIsotype Ctrl购自Biolegend,货号:400112;
APC Mouse IgG2a,K Isotype Ctrl(FC)购自Biolegend,货号:400222。
实施例1TLR8基因人源化小鼠
本实施例对非人动物(如小鼠)进行改造,使该非人动物体内包含编码嵌合TLR8蛋白的核苷酸序列,得到经遗传修饰的非人动物体内可表达嵌合TLR8蛋白(即人源化TLR8蛋白)。小鼠TLR8基因(NCBI Gene ID:170744,Primary source:MGI:2176887,UniProt ID:P58682,位于X染色体NC_000086.7的第167241123至167264329位,基于转录本NM_133212.3(SEQ ID NO:1)及其编码蛋白NP_57345.2(SEQ ID NO:2)和人TLR8基因(NCBI Gene ID:51311,Primary source:HGNC:15632,UniProt ID:Q9NR97,位于X染色体NC_000023.11的第12906620至12923169位,基于转录本NM_138636.5(SEQ ID NO:3)及其编码蛋白NP_619542.1(SEQ ID NO:4),小鼠TLR8基因座与人TLR8基因座对比示意图如图1所示。
为了达到本发明的目的,可在小鼠内源TLR8基因座引入部分编码人TLR8蛋白的核苷酸序列,使得该小鼠表达人或人源化TLR8蛋白。具体来说,可以通过基因编辑技术将小鼠TLR8基因的3号外显子2415bp核苷酸序列用对应的人TLR8基因的2号外显子2442bp核苷酸序列替换,得到人源化TLR8基因(即TLR8嵌合基因)序列(示意图如图2所示),实现对小鼠TLR8基因的人源化改造。
如图3所示的打靶策略示意图中,显示了靶向载体上含有小鼠TLR8基因上游和下游的同源臂序列,以及包含人TLR8基因序列的A片段。其中,上游同源臂序列(5’同源臂,SEQID NO:5)与NCBI登录号为NC_000086.7的第167245841至167252904位核苷酸序列相同,下游同源臂序列(3’同源臂,SEQ ID NO:6)与NCBI登录号为NC_000086.7的第167237001至167240744位核苷酸序列相同;A片段上包含人TLR8基因的2号外显子部分序列的基因组DNA序列(SEQ ID NO:7),该DNA序列与NCBI登录号为NC_0000023.11的第12919053至12921494位核苷酸序列相同;A片段中人TLR8序列上游与小鼠TLR8基因的连接设计为5’-tacaatactttcattgttggtattccttaggaaaacatg tcagtcgtcaatgctgacctgcattttcctgctaat-3’(SEQID NO:8),其中序列“acatg”中的最后一个“g”是小鼠的最后一个核苷酸,序列“ttcct”中的第一个“t”是人的第一个核苷酸;人TLR8序列下游与小鼠TLR8基因的连接设计为5’-cctggggatcaaagagggaagagtattgtgagtctggag gacttgtgtatcggataccactgcagctgt-3’(SEQID NO:9),其中序列“tggag”中的最后一个“g”是人的最后一个核苷酸,序列中的第一个“c”是小鼠的第一个核苷酸。
靶向载体上还包括用于阳性克隆筛选的抗性基因,即新霉素磷酸转移酶编码序列Neo,并在抗性基因的两侧装上两个同向排列的位点特异性重组系统Frt重组位点,组成Neo盒(Neo cassette)。其中Neo盒5’端与小鼠基因的连接设计为5’-gaccaagctacaccaccatatatgcagatggtcttgctc CGGTATCGATAAGCTTGATATCGAATTCCGAAGTTCCTATTCTCTAGAAAGTATAGGA-3’(SEQ ID NO:10),其中序列“tgctc”中的最后一个“c”是小鼠的最后一个核苷酸,序列“GTCGA”中的第一个“G”是Neo盒的第一个核苷酸;Neo盒3’端与小鼠基因的连接设计为5’-AGTATAGGAACTTCATCAGTCAGGTACATAATGGTG aggctccctggttagtggttcagtctctgtgagcccctg-3’(SEQ ID NO:11),其中序列中的最后一个“C”是Neo盒的最后一个核苷酸,序列“aggct”中的“a”是小鼠的第一个核苷酸。此外,还在靶向载体3’同源臂下游构建了具有负筛选标记的编码基因(白喉毒素A亚基的编码基因(DTA))。改造后的人源化小鼠TLR8的mRNA序列如SEQ ID NO:12所示,表达的蛋白序列如SEQ ID NO:13所示。
靶向载体构建可采用常规方法进行,如酶切连接等。构建好的靶向载体通过酶切进行初步验证后,再送测序公司进行测序验证。将测序验证正确的靶向载体电穿孔转染入C57BL/6小鼠的胚胎干细胞中,利用阳性克隆筛选标记基因对得到的细胞进行筛选,并利用PCR和Southern Blot技术进行检测确认外源基因的整合情况,筛选出正确的阳性克隆细胞,经PCR鉴定为阳性的克隆再进行Southern Blot(分别用EcoNI或DraIII或ScaI消化细胞DNA并使用3个探针进行杂交,探针及目的片段长度如表1所示)检测,结果见如图4所示,检测结果表明3个经PCR验证为阳性的克隆,经测序发现除3-G01外,其余3个均为阳性克隆且无随机插入,具体编号为3-F05、4-D08、4-E07。
表1:具体探针及目的片段长度
限制性内切酶 | 探针 | 野生型片段大小 | 重组序列片段大小 |
EcoNI | 3’Probe | 9.7kb | 11.6kb |
DraIII | 5’Probe-A | 22.5kb | 17.1kb |
ScaI | Neo Probe(3’) | — | 9.0kb |
其中,PCR测定包括下述引物:
Southern Blot检测包括如下探针引物:
5’探针(5’Probe-A):
5’Probe-A-F:5’-TCCTCAGGAGAACTGAAGGCCATGT-3’(SEQ ID NO:20),
5’Probe-A-R:5’-GCACCCACAAAAGATTCAAGTCTGCC-3’(SEQ ID NO:21);
3’探针(3’Probe):
3’Probe-F:5’-ATTTTCCACAGCATTTGAGTCTTGC-3’(SEQ ID NO:22),
3’Probe-R:5’-GTATGTGGTAAATTTGAGGATGCCC-3’(SEQ ID NO:23);
Neo探针(Neo Probe):
Neo Probe(3’)-F:5’-GGATCGGCCATTGAACAAGAT-3’(SEQ ID NO:24),
Neo Probe(3’)-R:5’-CAGAAGAACTCGTCAAGAAGGC-3’(SEQ ID NO:25)。
将筛选出的正确阳性克隆细胞(黑色鼠)按照本领域已知的技术导入已分离好的囊胚中(白色鼠),得到的嵌合囊胚转移至培养液中短暂培养后移植至受体母鼠(白色鼠)的输卵管,可生产F0代嵌合体鼠(黑白相间)。将F0代嵌合鼠与野生型鼠回交获得F1代鼠,再将F1代杂合小鼠互相交配即可获得F2代纯合子鼠。还可将阳性鼠与Flp工具鼠交配去除阳性克隆筛选标记基因(该过程示意图见图5)后,再通过互相交配即可得到TLR8基因人源化纯合子小鼠。可通过PCR鉴定子代小鼠体细胞的基因型(引物如表2所示),示例性的F1代小鼠(已去除Neo标记基因)的鉴定结果见图6的A-D,其中,编号为F1-01、F1-02、F1-03、F1-06、F1-07、F1-08和F1-09的7只小鼠均为阳性杂合小鼠。这表明使用本方法能构建出可稳定传代且无随机插入的TLR8人源化基因工程小鼠。
表2:引物名称及具体序列
进一步采用流式细胞术检测TLR8基因人源化小鼠体内TLR8蛋白的表达情况。选取9周龄野生型C57BL/6小鼠和TLR8基因人源化纯合子小鼠各1只,安乐死后取脾脏细胞,分别用抗鼠CD45抗体Brilliant Violet 510TM anti-mouse CD45(mCD45)、抗鼠CD4抗体Brilliant Violet 605TM anti-mouse CD4 Antibody(mCD4)、抗鼠CD8a抗体BrilliantViolet 711TM anti-mouse CD8a(mCD8a),抗鼠Gr-1抗体PerCP anti-mouse Ly-6G/Ly-6C(Gr-1)Antibody(mGr-1)、抗鼠F4/80抗体FITC anti-mouse F4/80Antibody(m F4/80)、抗鼠CD11b抗体V450 Rat Anti-mouse CD11b(mCD11b)、抗人CD288抗体APC anti-humanCD288(hTLR8)Antibody(hTLR8)、人鼠TLR8蛋白交叉识别抗体PE TLR8 MonoclonalAntibody(44C143)(mTLR8)、抗鼠CD11c抗体Brilliant Violet 605TManti-mouse CD11c(mCD11c)识别染色后进行流式检测。
结果表明,C57BL/6小鼠脾脏中树突细胞(DC细胞)(特征为mCD45+mTCRβ-mCD11c+)有35.8%mTLR8阳性细胞(特征为mCD45+mTCRβ-mCD11c+mTLR8+),有0.86%hTLR8阳性细胞(特征为mCD45+mTCRβ-mCD11c+hTLR8+),TLR8基因人源化纯合子小鼠脾脏中树突细胞(DC细胞)有24.5%mTLR8阳性细胞,有25.8%hTLR8阳性细胞;
C57BL/6小鼠脾脏中单核细胞(特征为mCD45+mGr-1-mCD11b+mF4/80-)有2.37%mTLR8阳性细胞(特征为mCD45+mGr-1-mCD11b+mF4/80-mTLR8+),有0.39%hTLR8阳性细胞(特征为mCD45+mGr-1-mCD11b+mF4/80-hTLR8+),TLR8基因人源化小鼠脾脏中单核细胞有3.27%mTLR8阳性细胞,有3.15%hTLR8阳性细胞。
在上述实验中,人鼠TLR8蛋白交叉识别抗体PE TLR8 Monoclonal Antibody(44C143)(mTLR8)可交叉识别鼠TLR8蛋白和人TLR8蛋白,因此在TLR8基因人源化小鼠DC细胞和单核细胞中均检测到阳性信号;抗人CD288抗体APC anti-human CD288(hTLR8)Antibody(hTLR8)为特异性结合人TLR8蛋白的抗体,在TLR8基因人源化小鼠DC细胞和单核细胞中均检测到hTLR8信号,而未检测到mTLR8。综上结果表明,仅在TLR8基因人源化纯合子小鼠的脾脏中检测到人源化TLR8蛋白表达。
实施例2双重人源化或多重双人源化小鼠的制备
利用本方法或制得的TLR8小鼠还可以制备双人源化或多人源化小鼠模型。如,前述实施例1中,囊胚显微注射使用的胚胎干细胞可选择来源于含有CD3、PD-L1、PD-1等其它基因修饰的小鼠,或者,也可在TLR8人源化小鼠的基础上,利用分离小鼠ES胚胎干细胞和基因重组打靶技术,获得TLR8与其它基因修饰的双基因或多基因修饰的小鼠模型。也可将本方法得到的TLR8小鼠纯合子或杂合子与其它基因修饰的纯合或杂合小鼠交配,对其后代进行筛选,根据孟德尔遗传规律,可有一定机率得到TLR8与其它基因修饰的双基因或多基因修饰的人源化杂合小鼠,再将杂合子相互交配可以得到双基因或多基因修饰的纯合子,利用这些双基因或多基因修饰的小鼠可以进行靶向人TLR8和其它基因调节剂的体内药效验证等。
以上详细描述了本发明的优选实施方式,但是,本发明并不限于上述实施方式中的具体细节,在本发明的技术构思范围内,可以对本发明的技术方案进行多种简单变型,这些简单变型均属于本发明的保护范围。
另外需要说明的是,在上述具体实施方式中所描述的各个具体技术特征,在不矛盾的情况下,可以通过任何合适的方式进行组合,为了避免不必要的重复,本发明对各种可能的组合方式不再另行说明。
此外,本发明的各种不同的实施方式之间也可以进行任意组合,只要其不违背本发明的思想,其同样应当视为本发明所公开的内容。
序列表
<110> 百奥赛图(北京)医药科技股份有限公司
<120> TLR8基因人源化的非人动物及其构建方法和应用
<130> 1
<160> 31
<170> SIPOSequenceListing 1.0
<210> 1
<211> 5047
<212> DNA/RNA
<213> 小鼠(Mouse)
<400> 1
ctgtcttcca acataaactt ctgcaaatcc attgctagat tttgctgatg ctaaagcgac 60
cagatttcag aagtttgcca aagtctgctc tctgcacagc tactgtacac agaggaccac 120
tgcactgcta ctacaagcta ttgaaagaga accttgaagg gatcccagcc agaaagggga 180
aagcaagagc cttccaagaa agaatttggg ctgttctatt attttctggt ccagctatag 240
agcacatctt cttccagaaa aacagagttg gatgttaaga gagaaacaaa cgttttacct 300
tcctttgtct atagaacatg gaaaacatgc cccctcagtc atggattctg acgtgctttt 360
gtctgctgtc ctctggaacc agtgccatct tccataaagc gaactattcc agaagctatc 420
cttgtgacga gataaggcac aactcccttg tgattgcaga atgcaaccat cgtcaactgc 480
atgaagttcc ccaaactata ggcaagtatg tgacaaacat agacttgtca gacaatgcca 540
ttacacatat aacgaaagag tcctttcaaa agctgcaaaa cctcactaaa atcgatctga 600
accacaatgc caaacaacag cacccaaatg aaaataaaaa tggtatgaat attacagaag 660
gggcacttct cagcctaaga aatctaacag ttttactgct ggaagacaac cagttatata 720
ctatacctgc tgggttgcct gagtctttga aagaacttag cctaattcaa aacaatatat 780
ttcaggtaac taaaaacaac acttttgggc ttaggaactt ggaaagactc tatttgggct 840
ggaactgcta ttttaaatgt aatcaaacct ttaaggtaga agatggggca tttaaaaatc 900
ttatacactt gaaggtactc tcattatctt tcaataacct tttctatgtg ccccccaaac 960
taccaagttc tctaaggaaa ctttttctga gtaatgccaa aatcatgaac atcactcagg 1020
aagacttcaa aggactggaa aatttaacat tactagatct gagtggaaac tgtccaaggt 1080
gttacaatgc tccatttcct tgcacacctt gcaaggaaaa ctcatccatc cacatacatc 1140
ctctggcttt tcaaagtctc acccaacttc tctatctaaa cctttccagc acttccctca 1200
ggacgattcc ttctacctgg tttgaaaatc tgtcaaatct gaaggaactc catcttgaat 1260
tcaactattt agttcaagaa attgcctcgg gggcattttt aacaaaacta cccagtttac 1320
aaatccttga tttgtccttc aactttcaat ataaggaata tttacaattt attaatattt 1380
cctcaaattt ctctaagctt cgttctctca agaagttgca cttaagaggc tatgtgttcc 1440
gagaacttaa aaagaagcat ttcgagcatc tccagagtct tccaaacttg gcaaccatca 1500
acttgggcat taactttatt gagaaaattg atttcaaagc tttccagaat ttttccaaac 1560
tcgacgttat ctatttatca ggaaatcgca tagcatctgt attagatggt acagattatt 1620
cctcttggcg aaatcgtctt cggaaacctc tctcaacaga cgatgatgag tttgatccac 1680
acgtgaattt ttaccatagc accaaacctt taataaagcc acagtgtact gcttatggca 1740
aggccttgga tttaagtttg aacaatattt tcattattgg gaaaagccaa tttgaaggtt 1800
ttcaggatat cgcctgctta aatctgtcct tcaatgccaa tactcaagtg tttaatggca 1860
cagaattctc ctccatgccc cacattaaat atttggattt aaccaacaac agactagact 1920
ttgatgataa caatgctttc agtgatcttc acgatctaga agtgctggac ctgagccaca 1980
atgcacacta tttcagtata gcaggggtaa cgcaccgtct aggatttatc cagaacttaa 2040
taaacctcag ggtgttaaac ctgagccaca atggcattta caccctcaca gaggaaagtg 2100
agctgaaaag catctcactg aaagaattgg ttttcagtgg aaatcgtctt gaccgtttgt 2160
ggaatgcaaa tgatggcaaa tactggtcca tttttaaaag tctccagaat ttgatacgcc 2220
tggacttatc atacaataac cttcaacaaa tcccaaatgg agcattcctc aatttgcctc 2280
agagcctcca agagttactt atcagtggta acaaattacg tttctttaat tggacattac 2340
tccagtattt tcctcacctt cacttgctgg atttatcgag aaatgagctg tattttctac 2400
ccaattgcct atctaagttt gcacattccc tggagacact gctactgagc cataatcatt 2460
tctctcacct accctctggc ttcctctccg aagccaggaa tctggtgcac ctggatctaa 2520
gtttcaacac aataaagatg atcaataaat cctccctgca aaccaagatg aaaacgaact 2580
tgtctattct ggagctacat gggaactatt ttgactgcac gtgtgacata agtgattttc 2640
gaagctggct agatgaaaat ctgaatatca caattcctaa attggtaaat gttatatgtt 2700
ccaatcctgg ggatcaaaaa tcaaagagta tcatgagcct agatctcacg acttgtgtat 2760
cggataccac tgcagctgtc ctgtttttcc tcacattcct taccacctcc atggttatgt 2820
tggctgctct ggttcaccac ctgttttact gggatgtttg gtttatctat cacatgtgct 2880
ctgctaagtt aaaaggctac aggacttcat ccacatccca aactttctat gatgcttata 2940
tttcttatga caccaaagat gcatctgtta ctgactgggt aatcaatgaa ctgcgctacc 3000
accttgaaga gagtgaagac aaaagtgtcc tcctttgttt agaggagagg gattgggatc 3060
caggattacc catcattgat aacctcatgc agagcataaa ccagagcaag aaaacaatct 3120
ttgttttaac caagaaatat gccaagagct ggaactttaa aacagctttc tacttggcct 3180
tgcagaggct aatggatgag aacatggatg tgattatttt catcctcctg gaaccagtgt 3240
tacagtactc acagtacctg aggcttcggc agaggatctg taagagctcc atcctccagt 3300
ggcccaacaa tcccaaagca gaaaacttgt tttggcaaag tctgaaaaat gtggtcttga 3360
ctgaaaatga ttcacggtat gacgatttgt acattgattc cattaggcaa tactagtgat 3420
gggaagtcac gactctgcca tcataaaaac acacagcttc tccttacaat gaactgtggc 3480
aaatgaaagg gcatgtggct gcttgacgtg ctgagaataa gtgataattg ggtgcacagc 3540
cctaaacagg acaattatac cagcctctct aaggctaggg taactttgtg caagagggag 3600
ccagaaaaat gtatgagcta ggaaacagga agaagggctg gaaaatgctg tcttctgggc 3660
ataacagcca ttgcaactgt aatcacacag cagctacggt tgcctgtcgt gagcctgcac 3720
aagactgggc ctcttatcag tcaattacag atgggggcag gtgtataaaa ctatacttca 3780
tctttctagg gggcaggtcc cctgtactcc atccttctgc actatcaact attagtggat 3840
ccaggaaaga gactatcgtt atctttagct ttgaacccat tgatgagcct agattcaata 3900
gacggttcaa aactcatggt cacacagaaa gagttggtaa agtcagtggc cagaagacaa 3960
aacaaaaggg cctgaacatg agaaatggtt ttttaaggcg gtgggtaggt tgtccgcagt 4020
gagagaggag ataggaaaag gtgggcatga gagtaaccag aatgcatggg tctctctcag 4080
agatttcact ctgggatcca ctaccaagct cacccccatg cagttgtcct caagattcag 4140
cttctcttgt gtgttagcca aaatagaaaa gctcatctcc ctgctatgtg aactttcccc 4200
aaggcagctt gtttcatcaa ggccagcgag aggggacgca gtcagtcttt tggtagtgaa 4260
tcagaaagtg acagcctacc actgcagttg tattctattt ggtaaaagtc aaccacagat 4320
cctaccagct ccaggcacca agtatctcag gttcagggaa agctattgaa aaccagggta 4380
atagatctag gggcctctca gctaagagga aagagttcga gataagtgat ctgatcgtct 4440
gagttaatcg ttatccattt ctttggactg aaattaagag acggcttgcc atcttggtca 4500
cagatcatca atactgggaa agtcaaatca catgggcagt cttagcaggc agcttattat 4560
gcctactttt caatccctaa gaacatttgc cactgtatct aatgatcttg atattaagat 4620
gctggtgaag ttgatacatc tatgactgtg gggttatatt acattgtgat tgcattaatt 4680
tttcctttac aatgacattt taaaatctta agttttgaga ctataactac atcattccct 4740
cttttatcct ctctccaaaa cttctcacat gccccactat gacttttata aatgctgcct 4800
gcaatatttg ctttttaagc ttttgatgtt gcttaaacac tgaaaaggat atttttaaaa 4860
agccttttaa tcctaaatca ttttaaaggt attcgtttaa atgaaaagct tttgggggta 4920
tttgttattt cccattggtg tatattttca aatgaatgat taaaacgtct tcattttaca 4980
tgacaaatgt gtgataaatc tgtagtagaa aacagaactc tgatataaaa gttgtcattt 5040
gcactga 5047
<210> 2
<211> 1032
<212> PRT
<213> 小鼠(Mouse)
<400> 2
Met Glu Asn Met Pro Pro Gln Ser Trp Ile Leu Thr Cys Phe Cys Leu
1 5 10 15
Leu Ser Ser Gly Thr Ser Ala Ile Phe His Lys Ala Asn Tyr Ser Arg
20 25 30
Ser Tyr Pro Cys Asp Glu Ile Arg His Asn Ser Leu Val Ile Ala Glu
35 40 45
Cys Asn His Arg Gln Leu His Glu Val Pro Gln Thr Ile Gly Lys Tyr
50 55 60
Val Thr Asn Ile Asp Leu Ser Asp Asn Ala Ile Thr His Ile Thr Lys
65 70 75 80
Glu Ser Phe Gln Lys Leu Gln Asn Leu Thr Lys Ile Asp Leu Asn His
85 90 95
Asn Ala Lys Gln Gln His Pro Asn Glu Asn Lys Asn Gly Met Asn Ile
100 105 110
Thr Glu Gly Ala Leu Leu Ser Leu Arg Asn Leu Thr Val Leu Leu Leu
115 120 125
Glu Asp Asn Gln Leu Tyr Thr Ile Pro Ala Gly Leu Pro Glu Ser Leu
130 135 140
Lys Glu Leu Ser Leu Ile Gln Asn Asn Ile Phe Gln Val Thr Lys Asn
145 150 155 160
Asn Thr Phe Gly Leu Arg Asn Leu Glu Arg Leu Tyr Leu Gly Trp Asn
165 170 175
Cys Tyr Phe Lys Cys Asn Gln Thr Phe Lys Val Glu Asp Gly Ala Phe
180 185 190
Lys Asn Leu Ile His Leu Lys Val Leu Ser Leu Ser Phe Asn Asn Leu
195 200 205
Phe Tyr Val Pro Pro Lys Leu Pro Ser Ser Leu Arg Lys Leu Phe Leu
210 215 220
Ser Asn Ala Lys Ile Met Asn Ile Thr Gln Glu Asp Phe Lys Gly Leu
225 230 235 240
Glu Asn Leu Thr Leu Leu Asp Leu Ser Gly Asn Cys Pro Arg Cys Tyr
245 250 255
Asn Ala Pro Phe Pro Cys Thr Pro Cys Lys Glu Asn Ser Ser Ile His
260 265 270
Ile His Pro Leu Ala Phe Gln Ser Leu Thr Gln Leu Leu Tyr Leu Asn
275 280 285
Leu Ser Ser Thr Ser Leu Arg Thr Ile Pro Ser Thr Trp Phe Glu Asn
290 295 300
Leu Ser Asn Leu Lys Glu Leu His Leu Glu Phe Asn Tyr Leu Val Gln
305 310 315 320
Glu Ile Ala Ser Gly Ala Phe Leu Thr Lys Leu Pro Ser Leu Gln Ile
325 330 335
Leu Asp Leu Ser Phe Asn Phe Gln Tyr Lys Glu Tyr Leu Gln Phe Ile
340 345 350
Asn Ile Ser Ser Asn Phe Ser Lys Leu Arg Ser Leu Lys Lys Leu His
355 360 365
Leu Arg Gly Tyr Val Phe Arg Glu Leu Lys Lys Lys His Phe Glu His
370 375 380
Leu Gln Ser Leu Pro Asn Leu Ala Thr Ile Asn Leu Gly Ile Asn Phe
385 390 395 400
Ile Glu Lys Ile Asp Phe Lys Ala Phe Gln Asn Phe Ser Lys Leu Asp
405 410 415
Val Ile Tyr Leu Ser Gly Asn Arg Ile Ala Ser Val Leu Asp Gly Thr
420 425 430
Asp Tyr Ser Ser Trp Arg Asn Arg Leu Arg Lys Pro Leu Ser Thr Asp
435 440 445
Asp Asp Glu Phe Asp Pro His Val Asn Phe Tyr His Ser Thr Lys Pro
450 455 460
Leu Ile Lys Pro Gln Cys Thr Ala Tyr Gly Lys Ala Leu Asp Leu Ser
465 470 475 480
Leu Asn Asn Ile Phe Ile Ile Gly Lys Ser Gln Phe Glu Gly Phe Gln
485 490 495
Asp Ile Ala Cys Leu Asn Leu Ser Phe Asn Ala Asn Thr Gln Val Phe
500 505 510
Asn Gly Thr Glu Phe Ser Ser Met Pro His Ile Lys Tyr Leu Asp Leu
515 520 525
Thr Asn Asn Arg Leu Asp Phe Asp Asp Asn Asn Ala Phe Ser Asp Leu
530 535 540
His Asp Leu Glu Val Leu Asp Leu Ser His Asn Ala His Tyr Phe Ser
545 550 555 560
Ile Ala Gly Val Thr His Arg Leu Gly Phe Ile Gln Asn Leu Ile Asn
565 570 575
Leu Arg Val Leu Asn Leu Ser His Asn Gly Ile Tyr Thr Leu Thr Glu
580 585 590
Glu Ser Glu Leu Lys Ser Ile Ser Leu Lys Glu Leu Val Phe Ser Gly
595 600 605
Asn Arg Leu Asp Arg Leu Trp Asn Ala Asn Asp Gly Lys Tyr Trp Ser
610 615 620
Ile Phe Lys Ser Leu Gln Asn Leu Ile Arg Leu Asp Leu Ser Tyr Asn
625 630 635 640
Asn Leu Gln Gln Ile Pro Asn Gly Ala Phe Leu Asn Leu Pro Gln Ser
645 650 655
Leu Gln Glu Leu Leu Ile Ser Gly Asn Lys Leu Arg Phe Phe Asn Trp
660 665 670
Thr Leu Leu Gln Tyr Phe Pro His Leu His Leu Leu Asp Leu Ser Arg
675 680 685
Asn Glu Leu Tyr Phe Leu Pro Asn Cys Leu Ser Lys Phe Ala His Ser
690 695 700
Leu Glu Thr Leu Leu Leu Ser His Asn His Phe Ser His Leu Pro Ser
705 710 715 720
Gly Phe Leu Ser Glu Ala Arg Asn Leu Val His Leu Asp Leu Ser Phe
725 730 735
Asn Thr Ile Lys Met Ile Asn Lys Ser Ser Leu Gln Thr Lys Met Lys
740 745 750
Thr Asn Leu Ser Ile Leu Glu Leu His Gly Asn Tyr Phe Asp Cys Thr
755 760 765
Cys Asp Ile Ser Asp Phe Arg Ser Trp Leu Asp Glu Asn Leu Asn Ile
770 775 780
Thr Ile Pro Lys Leu Val Asn Val Ile Cys Ser Asn Pro Gly Asp Gln
785 790 795 800
Lys Ser Lys Ser Ile Met Ser Leu Asp Leu Thr Thr Cys Val Ser Asp
805 810 815
Thr Thr Ala Ala Val Leu Phe Phe Leu Thr Phe Leu Thr Thr Ser Met
820 825 830
Val Met Leu Ala Ala Leu Val His His Leu Phe Tyr Trp Asp Val Trp
835 840 845
Phe Ile Tyr His Met Cys Ser Ala Lys Leu Lys Gly Tyr Arg Thr Ser
850 855 860
Ser Thr Ser Gln Thr Phe Tyr Asp Ala Tyr Ile Ser Tyr Asp Thr Lys
865 870 875 880
Asp Ala Ser Val Thr Asp Trp Val Ile Asn Glu Leu Arg Tyr His Leu
885 890 895
Glu Glu Ser Glu Asp Lys Ser Val Leu Leu Cys Leu Glu Glu Arg Asp
900 905 910
Trp Asp Pro Gly Leu Pro Ile Ile Asp Asn Leu Met Gln Ser Ile Asn
915 920 925
Gln Ser Lys Lys Thr Ile Phe Val Leu Thr Lys Lys Tyr Ala Lys Ser
930 935 940
Trp Asn Phe Lys Thr Ala Phe Tyr Leu Ala Leu Gln Arg Leu Met Asp
945 950 955 960
Glu Asn Met Asp Val Ile Ile Phe Ile Leu Leu Glu Pro Val Leu Gln
965 970 975
Tyr Ser Gln Tyr Leu Arg Leu Arg Gln Arg Ile Cys Lys Ser Ser Ile
980 985 990
Leu Gln Trp Pro Asn Asn Pro Lys Ala Glu Asn Leu Phe Trp Gln Ser
995 1000 1005
Leu Lys Asn Val Val Leu Thr Glu Asn Asp Ser Arg Tyr Asp Asp Leu
1010 1015 1020
Tyr Ile Asp Ser Ile Arg Gln Tyr
1025 1030
<210> 3
<211> 4216
<212> DNA/RNA
<213> 人(human)
<400> 3
agtttctctt ctcggccacc tcctgcatag agggtaccat tctgcgctgc tgcaagttac 60
ggaatgaaaa attagaacaa cagaaacatg gaaaacatgt tccttcagtc gtcaatgctg 120
acctgcattt tcctgctaat atctggttcc tgtgagttat gcgccgaaga aaatttttct 180
agaagctatc cttgtgatga gaaaaagcaa aatgactcag ttattgcaga gtgcagcaat 240
cgtcgactac aggaagttcc ccaaacggtg ggcaaatatg tgacagaact agacctgtct 300
gataatttca tcacacacat aacgaatgaa tcatttcaag ggctgcaaaa tctcactaaa 360
ataaatctaa accacaaccc caatgtacag caccagaacg gaaatcccgg tatacaatca 420
aatggcttga atatcacaga cggggcattc ctcaacctaa aaaacctaag ggagttactg 480
cttgaagaca accagttacc ccaaataccc tctggtttgc cagagtcttt gacagaactt 540
agtctaattc aaaacaatat atacaacata actaaagagg gcatttcaag acttataaac 600
ttgaaaaatc tctatttggc ctggaactgc tattttaaca aagtttgcga gaaaactaac 660
atagaagatg gagtatttga aacgctgaca aatttggagt tgctatcact atctttcaat 720
tctctttcac acgtgccacc caaactgcca agctccctac gcaaactttt tctgagcaac 780
acccagatca aatacattag tgaagaagat ttcaagggat tgataaattt aacattacta 840
gatttaagcg ggaactgtcc gaggtgcttc aatgccccat ttccatgcgt gccttgtgat 900
ggtggtgctt caattaatat agatcgtttt gcttttcaaa acttgaccca acttcgatac 960
ctaaacctct ctagcacttc cctcaggaag attaatgctg cctggtttaa aaatatgcct 1020
catctgaagg tgctggatct tgaattcaac tatttagtgg gagaaatagc ctctggggca 1080
tttttaacga tgctgccccg cttagaaata cttgacttgt cttttaacta tataaagggg 1140
agttatccac agcatattaa tatttccaga aacttctcta aacttttgtc tctacgggca 1200
ttgcatttaa gaggttatgt gttccaggaa ctcagagaag atgatttcca gcccctgatg 1260
cagcttccaa acttatcgac tatcaacttg ggtattaatt ttattaagca aatcgatttc 1320
aaacttttcc aaaatttctc caatctggaa attatttact tgtcagaaaa cagaatatca 1380
ccgttggtaa aagatacccg gcagagttat gcaaatagtt cctcttttca acgtcatatc 1440
cggaaacgac gctcaacaga ttttgagttt gacccacatt cgaactttta tcatttcacc 1500
cgtcctttaa taaagccaca atgtgctgct tatggaaaag ccttagattt aagcctcaac 1560
agtattttct tcattgggcc aaaccaattt gaaaatcttc ctgacattgc ctgtttaaat 1620
ctgtctgcaa atagcaatgc tcaagtgtta agtggaactg aattttcagc cattcctcat 1680
gtcaaatatt tggatttgac aaacaataga ctagactttg ataatgctag tgctcttact 1740
gaattgtccg acttggaagt tctagatctc agctataatt cacactattt cagaatagca 1800
ggcgtaacac atcatctaga atttattcaa aatttcacaa atctaaaagt tttaaacttg 1860
agccacaaca acatttatac tttaacagat aagtataacc tggaaagcaa gtccctggta 1920
gaattagttt tcagtggcaa tcgccttgac attttgtgga atgatgatga caacaggtat 1980
atctccattt tcaaaggtct caagaatctg acacgtctgg atttatccct taataggctg 2040
aagcacatcc caaatgaagc attccttaat ttgccagcga gtctcactga actacatata 2100
aatgataata tgttaaagtt ttttaactgg acattactcc agcagtttcc tcgtctcgag 2160
ttgcttgact tacgtggaaa caaactactc tttttaactg atagcctatc tgactttaca 2220
tcttcccttc ggacactgct gctgagtcat aacaggattt cccacctacc ctctggcttt 2280
ctttctgaag tcagtagtct gaagcacctc gatttaagtt ccaatctgct aaaaacaatc 2340
aacaaatccg cacttgaaac taagaccacc accaaattat ctatgttgga actacacgga 2400
aacccctttg aatgcacctg tgacattgga gatttccgaa gatggatgga tgaacatctg 2460
aatgtcaaaa ttcccagact ggtagatgtc atttgtgcca gtcctgggga tcaaagaggg 2520
aagagtattg tgagtctgga gctaacaact tgtgtttcag atgtcactgc agtgatatta 2580
tttttcttca cgttctttat caccaccatg gttatgttgg ctgccctggc tcaccatttg 2640
ttttactggg atgtttggtt tatatataat gtgtgtttag ctaaggtaaa aggctacagg 2700
tctctttcca catcccaaac tttctatgat gcttacattt cttatgacac caaagatgcc 2760
tctgttactg actgggtgat aaatgagctg cgctaccacc ttgaagagag ccgagacaaa 2820
aacgttctcc tttgtctaga ggagagggat tgggacccgg gattggccat catcgacaac 2880
ctcatgcaga gcatcaacca aagcaagaaa acagtatttg ttttaaccaa aaaatatgca 2940
aaaagctgga actttaaaac agctttttac ttggctttgc agaggctaat ggatgagaac 3000
atggatgtga ttatatttat cctgctggag ccagtgttac agcattctca gtatttgagg 3060
ctacggcagc ggatctgtaa gagctccatc ctccagtggc ctgacaaccc gaaggcagaa 3120
ggcttgtttt ggcaaactct gagaaatgtg gtcttgactg aaaatgattc acggtataac 3180
aatatgtatg tcgattccat taagcaatac taactgacgt taagtcatga tttcgcgcca 3240
taataaagat gcaaaggaat gacatttctg tattagttat ctattgctat gtaacaaatt 3300
atcccaaaac ttagtggttt aaaacaacac atttgctggc ccacagtttt tgagggtcag 3360
gagtccaggc ccagcataac tgggtcctct gctcagggtg tctcagaggc tgcaatgtag 3420
gtgttcacca gagacatagg catcactggg gtcacactca tgtggttgtt ttctggattc 3480
aattcctcct gggctattgg ccaaaggcta tactcatgta agccatgcga gcctctccca 3540
caaggcagct tgcttcatca gagctagcaa aaaagagagg ttgctagcaa gatgaagtca 3600
caatcttttg taatcgaatc aaaaaagtga tatctcatca ctttggccat attctatttg 3660
ttagaagtaa accacaggtc ccaccagctc catgggagtg accacctcag tccagggaaa 3720
acagctgaag accaagatgg tgagctctga ttgcttcagt tggtcatcaa ctattttccc 3780
ttgactgctg tcctgggatg gcctgctatc ttgatgatag attgtgaata tcaggaggca 3840
gggatcactg tggaccatct tagcagttga cctaacacat cttcttttca atatctaaga 3900
acttttgcca ctgtgactaa tggtcctaat attaagctgt tgtttatatt tatcatatat 3960
ctatggctac atggttatat tatgctgtgg ttgcgttcgg ttttatttac agttgctttt 4020
acaaatattt gctgtaacat ttgacttcta aggtttagat gccatttaag aactgagatg 4080
gatagctttt aaagcatctt ttacttctta ccatttttta aaagtatgca gctaaattcg 4140
aagcttttgg tctatattgt taattgccat tgctgtaaat cttaaaatga atgaataaaa 4200
atgtttcatt ttacaa 4216
<210> 4
<211> 1041
<212> PRT
<213> 人(human)
<400> 4
Met Glu Asn Met Phe Leu Gln Ser Ser Met Leu Thr Cys Ile Phe Leu
1 5 10 15
Leu Ile Ser Gly Ser Cys Glu Leu Cys Ala Glu Glu Asn Phe Ser Arg
20 25 30
Ser Tyr Pro Cys Asp Glu Lys Lys Gln Asn Asp Ser Val Ile Ala Glu
35 40 45
Cys Ser Asn Arg Arg Leu Gln Glu Val Pro Gln Thr Val Gly Lys Tyr
50 55 60
Val Thr Glu Leu Asp Leu Ser Asp Asn Phe Ile Thr His Ile Thr Asn
65 70 75 80
Glu Ser Phe Gln Gly Leu Gln Asn Leu Thr Lys Ile Asn Leu Asn His
85 90 95
Asn Pro Asn Val Gln His Gln Asn Gly Asn Pro Gly Ile Gln Ser Asn
100 105 110
Gly Leu Asn Ile Thr Asp Gly Ala Phe Leu Asn Leu Lys Asn Leu Arg
115 120 125
Glu Leu Leu Leu Glu Asp Asn Gln Leu Pro Gln Ile Pro Ser Gly Leu
130 135 140
Pro Glu Ser Leu Thr Glu Leu Ser Leu Ile Gln Asn Asn Ile Tyr Asn
145 150 155 160
Ile Thr Lys Glu Gly Ile Ser Arg Leu Ile Asn Leu Lys Asn Leu Tyr
165 170 175
Leu Ala Trp Asn Cys Tyr Phe Asn Lys Val Cys Glu Lys Thr Asn Ile
180 185 190
Glu Asp Gly Val Phe Glu Thr Leu Thr Asn Leu Glu Leu Leu Ser Leu
195 200 205
Ser Phe Asn Ser Leu Ser His Val Pro Pro Lys Leu Pro Ser Ser Leu
210 215 220
Arg Lys Leu Phe Leu Ser Asn Thr Gln Ile Lys Tyr Ile Ser Glu Glu
225 230 235 240
Asp Phe Lys Gly Leu Ile Asn Leu Thr Leu Leu Asp Leu Ser Gly Asn
245 250 255
Cys Pro Arg Cys Phe Asn Ala Pro Phe Pro Cys Val Pro Cys Asp Gly
260 265 270
Gly Ala Ser Ile Asn Ile Asp Arg Phe Ala Phe Gln Asn Leu Thr Gln
275 280 285
Leu Arg Tyr Leu Asn Leu Ser Ser Thr Ser Leu Arg Lys Ile Asn Ala
290 295 300
Ala Trp Phe Lys Asn Met Pro His Leu Lys Val Leu Asp Leu Glu Phe
305 310 315 320
Asn Tyr Leu Val Gly Glu Ile Ala Ser Gly Ala Phe Leu Thr Met Leu
325 330 335
Pro Arg Leu Glu Ile Leu Asp Leu Ser Phe Asn Tyr Ile Lys Gly Ser
340 345 350
Tyr Pro Gln His Ile Asn Ile Ser Arg Asn Phe Ser Lys Leu Leu Ser
355 360 365
Leu Arg Ala Leu His Leu Arg Gly Tyr Val Phe Gln Glu Leu Arg Glu
370 375 380
Asp Asp Phe Gln Pro Leu Met Gln Leu Pro Asn Leu Ser Thr Ile Asn
385 390 395 400
Leu Gly Ile Asn Phe Ile Lys Gln Ile Asp Phe Lys Leu Phe Gln Asn
405 410 415
Phe Ser Asn Leu Glu Ile Ile Tyr Leu Ser Glu Asn Arg Ile Ser Pro
420 425 430
Leu Val Lys Asp Thr Arg Gln Ser Tyr Ala Asn Ser Ser Ser Phe Gln
435 440 445
Arg His Ile Arg Lys Arg Arg Ser Thr Asp Phe Glu Phe Asp Pro His
450 455 460
Ser Asn Phe Tyr His Phe Thr Arg Pro Leu Ile Lys Pro Gln Cys Ala
465 470 475 480
Ala Tyr Gly Lys Ala Leu Asp Leu Ser Leu Asn Ser Ile Phe Phe Ile
485 490 495
Gly Pro Asn Gln Phe Glu Asn Leu Pro Asp Ile Ala Cys Leu Asn Leu
500 505 510
Ser Ala Asn Ser Asn Ala Gln Val Leu Ser Gly Thr Glu Phe Ser Ala
515 520 525
Ile Pro His Val Lys Tyr Leu Asp Leu Thr Asn Asn Arg Leu Asp Phe
530 535 540
Asp Asn Ala Ser Ala Leu Thr Glu Leu Ser Asp Leu Glu Val Leu Asp
545 550 555 560
Leu Ser Tyr Asn Ser His Tyr Phe Arg Ile Ala Gly Val Thr His His
565 570 575
Leu Glu Phe Ile Gln Asn Phe Thr Asn Leu Lys Val Leu Asn Leu Ser
580 585 590
His Asn Asn Ile Tyr Thr Leu Thr Asp Lys Tyr Asn Leu Glu Ser Lys
595 600 605
Ser Leu Val Glu Leu Val Phe Ser Gly Asn Arg Leu Asp Ile Leu Trp
610 615 620
Asn Asp Asp Asp Asn Arg Tyr Ile Ser Ile Phe Lys Gly Leu Lys Asn
625 630 635 640
Leu Thr Arg Leu Asp Leu Ser Leu Asn Arg Leu Lys His Ile Pro Asn
645 650 655
Glu Ala Phe Leu Asn Leu Pro Ala Ser Leu Thr Glu Leu His Ile Asn
660 665 670
Asp Asn Met Leu Lys Phe Phe Asn Trp Thr Leu Leu Gln Gln Phe Pro
675 680 685
Arg Leu Glu Leu Leu Asp Leu Arg Gly Asn Lys Leu Leu Phe Leu Thr
690 695 700
Asp Ser Leu Ser Asp Phe Thr Ser Ser Leu Arg Thr Leu Leu Leu Ser
705 710 715 720
His Asn Arg Ile Ser His Leu Pro Ser Gly Phe Leu Ser Glu Val Ser
725 730 735
Ser Leu Lys His Leu Asp Leu Ser Ser Asn Leu Leu Lys Thr Ile Asn
740 745 750
Lys Ser Ala Leu Glu Thr Lys Thr Thr Thr Lys Leu Ser Met Leu Glu
755 760 765
Leu His Gly Asn Pro Phe Glu Cys Thr Cys Asp Ile Gly Asp Phe Arg
770 775 780
Arg Trp Met Asp Glu His Leu Asn Val Lys Ile Pro Arg Leu Val Asp
785 790 795 800
Val Ile Cys Ala Ser Pro Gly Asp Gln Arg Gly Lys Ser Ile Val Ser
805 810 815
Leu Glu Leu Thr Thr Cys Val Ser Asp Val Thr Ala Val Ile Leu Phe
820 825 830
Phe Phe Thr Phe Phe Ile Thr Thr Met Val Met Leu Ala Ala Leu Ala
835 840 845
His His Leu Phe Tyr Trp Asp Val Trp Phe Ile Tyr Asn Val Cys Leu
850 855 860
Ala Lys Val Lys Gly Tyr Arg Ser Leu Ser Thr Ser Gln Thr Phe Tyr
865 870 875 880
Asp Ala Tyr Ile Ser Tyr Asp Thr Lys Asp Ala Ser Val Thr Asp Trp
885 890 895
Val Ile Asn Glu Leu Arg Tyr His Leu Glu Glu Ser Arg Asp Lys Asn
900 905 910
Val Leu Leu Cys Leu Glu Glu Arg Asp Trp Asp Pro Gly Leu Ala Ile
915 920 925
Ile Asp Asn Leu Met Gln Ser Ile Asn Gln Ser Lys Lys Thr Val Phe
930 935 940
Val Leu Thr Lys Lys Tyr Ala Lys Ser Trp Asn Phe Lys Thr Ala Phe
945 950 955 960
Tyr Leu Ala Leu Gln Arg Leu Met Asp Glu Asn Met Asp Val Ile Ile
965 970 975
Phe Ile Leu Leu Glu Pro Val Leu Gln His Ser Gln Tyr Leu Arg Leu
980 985 990
Arg Gln Arg Ile Cys Lys Ser Ser Ile Leu Gln Trp Pro Asp Asn Pro
995 1000 1005
Lys Ala Glu Gly Leu Phe Trp Gln Thr Leu Arg Asn Val Val Leu Thr
1010 1015 1020
Glu Asn Asp Ser Arg Tyr Asn Asn Met Tyr Val Asp Ser Ile Lys Gln
1025 1030 1035 1040
Tyr
<210> 5
<211> 7064
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 5
ctgatatgta tcatcgccac agagaaacag aaaagcagac gaaataattg tgagaactat 60
ggaaagcaag tgatgacttt gatttaagtg tcttggatga tactacaatg gctgcatatt 120
cttgggagac acatcctaaa atgagctctt tttggttgat ggctgaagtt ttatcaagag 180
tcatgccagg aagggctggg gaaacaactc aatggataaa gcacttgctg tgaggtttga 240
tgactcaagt tcagagctac cagcacctac acatgagcct ggaaaaagtg gtggctgcct 300
gtaaccccag ctcttaggag acaaacacac tgggtctctg ggggcaagct ggctatctag 360
actggccaaa ttggcaagct ctttatctca tgaaagagat cctacctcga tcaatacaat 420
gaggagtgtt taaggaagag aatttgtgtc actgtctggc ttccacatgc agacacctgc 480
acacatgcat gcatgctaca cttgcaaaag aatatttata caagtagtat gttgctaaaa 540
atatgtgtca cattatgcta tacacttatt tgacccacac ccatgcttca tcttcatcac 600
ccatgttaat atagtggtcg ggcatcttga ttaatttctt tttttttttc tttccatttt 660
ttattaggta tttagctcat ttacatttcc aatgctatac caaaagtccc ccatacccac 720
ccacccccac tcccctatcc gcccactccc cctttttggc cctggcgttc ccctgttctg 780
gggcatataa agtttgcatg tccaatgggc ctctctttcc agtgatggcc gactaggcca 840
tcttttgata catatgcagc tagagtcaag agctcctggg tactggttag ttcataatgt 900
tgatccacct atagggttgc agatcccttt agctccttgg gtactttctc tagctcctcc 960
attgggagcc ctgtgatcca tccattagct gactgtgagc atccacttct gtgtttgcta 1020
ggccccggca tagtctcaca agagacagct acatctgggt cctttcaata aaatcttgct 1080
agtgtatgca atggtgtgag cgtttggatg ctgattatgg ggtggatccc tggatatgac 1140
agtctctaca tggtccatcc tttcatctca gctccaaact ttgtctctat aactccttcc 1200
aagggtgttt tgttcccact tctaaggagg ggcatagtgt ccacacttca gtcttcattt 1260
ttcttgagtt tcatgtgttt aggaaattgt atcttatatc ttgggtatcc taggttttgg 1320
gcgaatatcc acttatcagt gagtacatat tgtgtgagtt cctttgtgaa tgtgttacct 1380
cactcaggat gatgccctcc aggtccatcc atttggctag gaatttcata aattcattct 1440
ttttaatagc tgagtagtac tccattgtgt agatgtacca cattttctgt atccattcct 1500
ctgttgaggg gcatctgggt tctttccagc ttctggctat tataaataag gctgctatga 1560
acatagtgga gcatgtgtcc ttcttaccag ttggggcatc ttctggatat atgcccagga 1620
gaggtattgc tggatcctcc ggtagtacta tgtccaattt tctgaggaac cgccagacgg 1680
atttccagag tggttgtaca agcctgcaat cccaccaaca atggaggagt gttcctcttt 1740
ctccacatcc tcgccagcat ctgctgtcac ctgaattttt gttcttagac attctgactg 1800
gtgtgaggtg gaatctcagg gttgttttga tttgcatttc cctgatgatt aaggatgttg 1860
aacatttttt caggtgcttc tctgccattc ggtattcctc aggtgagaat tctttgttca 1920
gttctgagcc ccatttttaa tggggttatt tgattttctg aagtccacct tcttgagttc 1980
tttatatatg ttggatatta gtcccctatc tgatttagga taggtaaaga tcctttccca 2040
atctgttggt ggtctctttg tcttattgac ggtgtctttt gccttgcaga aactttggag 2100
tttcattagg tcccatttgt caattctcga tcttacagca caagtcattg ctgttctgtt 2160
caggaatttt tcccctgtgc ccatatcttc aaggcttttc cccactttct cctctataag 2220
tttcagtgtc tctggtttta tgtgaagttc tttgatccat ttagatttga ccttagtaca 2280
aggagataag tatggatcga ttcgcattct tctacatgat aacaaccagt tgtgccagca 2340
ccatttgttg aaaatgctgt ctttcttcca ctggatggtt ttagctccct tgtcgaagat 2400
caagtgacca taggtgtgtg ggttcatttc tgggtcttca attctattcc attggtctac 2460
ttgtctatct ctataccagt accatgcagt ttttaccaca attgctctgt agtaaagctt 2520
taggtcaggc atggtgattc caccagagga attttatcct tgagaagagt ttttgctatc 2580
ctaggttttt tgttattcca gatgaatttg caaattgctc cttctaattt gttgaagaat 2640
tgagttggaa ttttgatggg gattgcattg aatctgtaga ttgcttttgg caagatagcc 2700
atttttacaa tgttgatcct gccaatccat gagcatggga gatctttcca tcttctgaga 2760
tcttctttaa tttcttactt cagagacttg aagttcttgt catagagatc tttcacttcc 2820
ttagttagag tcacgccggg atattttata ttatttgtga ctattaagaa gggtattgtt 2880
tccctaattt ctttctcagc ctgtttattc tttgtgtaga gaaaggccat tgacttgttt 2940
gagttaattt tatatccagc tacttcaccg aagctgttta tcaggtttag gagttctctg 3000
gtggaatttt tagggttact tatatatact atcatatcat ctgcaaaaag tgatattttg 3060
acttcctcct ttccaatttg tatccccttg atctcctttt gttgtcgaat tgctctggct 3120
aatacttcaa gtactatgtt gaaaaggtag ggagaaagtg ggcagccttg tctagtccct 3180
gattttagtg ggattgcttc cagcttctct ccatttactt tgatgttggc tactggtttg 3240
ctgtagattg cttttatcat gtttaggtat gggccttgaa ttcctgatct ttccagaact 3300
tttatcatga atgggtgttg gatcttgtca aatgcttttt ctgcatctaa cgagatgatc 3360
atgtggtttt tgtctttgag tttgtttata tagtggatta cattgatgga ttttcgtata 3420
ttaaaccata cctgcatccc tggaataaaa cctacttggt caggatggat gattgcttta 3480
atgtgttctt ggattcggtt agcgagaatt ttattgagga tttttgcatc gctattcata 3540
agagaaattg gtctgaagtt ctctatcttt gttgtatctt tctgtggttt aggtatcaga 3600
gtaatagtgg cttcataaaa tgagttgggt agagtacctt ctacttctat cttgtgaaaa 3660
agtttgtgca gaactggaat tagatcttct ttgaaggtct gatagaactc tgcactaaac 3720
ccatctggtc ctgggctttt ttttggctgg gagactattt ataactgctt ctatttcttt 3780
aggggatatg ggactgttta gaaggtcaac ttgatcctga ttcaactttg gtacctggta 3840
tctgtccaga aatttgtcca tttcgtccag gttttccagt tttgttgagt atagcctttt 3900
gtagaaggat ctgatggtgt tttggatttc ttcaggatct gttgttatgt ctcccttttc 3960
agttctgatt ttgttaatta ggattttgtc tctgtgccct ctagtgagtc tagctaaggg 4020
tttatctatc ttgttgattt tctcaaagaa ccaactcctc gtttggttaa ttctttgaat 4080
agttcttctt gtttccactt ggttgatttc acccctgagt ttgattattt cctgccgtct 4140
actcctcttg ggtgaatttg cttccttttt ttctagagct tttagatgtg ttgtcaagct 4200
gctagtatgt gctctctccc gtttcttcat ggaggcactc agagctatga gtttccctct 4260
tagaaatgct ttcattgtgt cccaaaggtt tgggtacgtt gtggcttcat tttcattaaa 4320
ctctaaaaag tctttaattt ctttctttat tccttccttg accaaggtat cattgagaag 4380
agtgttgttc agtttccacg tgagtgttgg ctttctgtta tttttttttg ttattgaaga 4440
tcagccttag tgcatggtga tctgatagga tacatgggac aatttcaata tttttgaatc 4500
tgttgaggcc tgttttgtga cctattatgt ggtcaatttt ggagaaggta ccatgaggtg 4560
ctgagaagaa ggtatatcct tttgttttag gataaaatgt tctgtagata tctgtcagat 4620
ccatttgttt catcacttct gttagtttca gtgtgtccct gtttagtttc tgtttccatg 4680
atctgtccat tggtgaaagt ggtgtgttga agtctcccac tattattgtg tgaggtgcaa 4740
tgtgtgcttt gagctttact aaagtttctt taatgaatgt ggctgccctt gtatttcgag 4800
catagatatt cagaattgag cgttcctctt ggaggatttt acctttgatg agaacgaagt 4860
gcccctcctt gtcttttttg atgactttgg gttcgaagtc aatcttatca gatattagga 4920
tggctactcc agcttgtttc ttcataccat ttgcttggaa aattgttttc cagcctttta 4980
ttctgaggta gtgtctatct ttttctctga gatgtgtctc ctgtaaacag caaaatgttg 5040
ggtcttgttt gtgtagccag tttgttagtc tatgtctttt tattggggag ttgagaccat 5100
tgatgttaag agatattaag gaaaagtaat tgttgcttcc tgttattttt gttgttaaag 5160
ttggcattct gttcttctgg ctgtcttctt ttaggtttgt tgagggatta ccttcttgtt 5220
ttttctaggg cattgttccc gttcttgtat tggttttttt ctgttattaa cctttgaagg 5280
gctggattcg tggagagata atgtgtgaat tttgttttgt cgtggaatac tttggtttat 5340
ccatctatgg taattgagag tttggctggg tatagtagcc tgggctggaa tttgtgttct 5400
cttagtgtct gtataacatc tgtccaggct cttctggctt tcatagtctc tggtgaaaaa 5460
tctggtgtaa ttctgatagg cttgcctttg tatgttactt gacctttttc ccttactgct 5520
tttagtattc tatctttatt tagtgcattt gttgttctga ttattatgtg tcgggaggaa 5580
tttcttttct ggtccagtct atttggagtt ctgtaggctt cttgtatgtt cataggtatc 5640
tctttcttta gatttgggaa gttttcttca ataattttgt tgaagatgtt tgctggtcct 5700
ttgagttgaa aatcttcatt ctcatccact cctattatcc gtaggtttgg tcttctcatt 5760
gtgtcctgga tttcctggat attttgagtt aggatctttt tgcattttcc attttctttg 5820
attgttgtgc cgatgttctc tatggaatct tctgcacctg agattctctc ttccatctct 5880
tgtattctgt tgctgatgct caaatctatg gttccagatt tctttcctag ggtttctatc 5940
tccagtgttg cctcactttg agttttcttt attgtgtcta cttccctttt taggtctagt 6000
atggttttgt tcatttctat cacctgtttg tatgtttttt cctctttttc tgtaaggact 6060
tctacctgtt tgattgtgtt ttcctgtttt tctttaagga cttgtaactc tttagcagtg 6120
ttctcctgta tttctttaag tgatttatta aagtccttct tgatgtcctc taccatcatc 6180
atgagatatg cttttaaatc taggtctagg ttttcgcgtg tgttggggtg ccctggactg 6240
ggcgaagtgg gagggctggg ttctgatgat ggtgagtggt cttggttcct gttagtagga 6300
ttcctacatt tacctttcgc catctggtaa tctctggagt tagtagttat agttgactct 6360
gtttagagat tgttcttctg gtgattctgt tacagtctct cagcagacct gggagacaga 6420
ttctctcctc tgagtttcag tgctcagagc actctctgct ggcaagctct cttacaggga 6480
aggtgcgcag atatcttgtt tttggacctc ctcctggtcg aagaagaagg cccaaaacag 6540
ggcctctctc agaagctgtg ttgctttggc agttcccaga agctgtcagc ttctgtggtg 6600
cagactctca cctgtgcaga ctaaattcct aagttccagg gagtcctgga accaagatgg 6660
cgaccgctgc tcctgaggct gaggccgcct cccaagccag gcggacacct gtcctctggt 6720
ccggacggtg gccggctgtc tgcggcccgc caagggtgct gcctcagcgg ctctgtgctt 6780
ccgcccttcc cagaagctgt ctggttctct cgattaattt cttttcaaaa gctttctatc 6840
atcatgtctc agagttttat ctttaccttt ccagctccct aaaagctaaa aacatatgag 6900
tattctgaaa ctcatttgat gcaacatatt aaataattca tagcatttca aatctcaaaa 6960
ctcataagtg tggtggaaca ctgttaagtt tgccaagaaa catccatgaa atattgctac 7020
tctaatacaa tactttcatt gttggtattc cttaggaaaa catg 7064
<210> 6
<211> 3744
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 6
aggctccctg gttagtggtt cagtctctgt gagcccctgt gagcccaggt tcgttgactc 60
catgagtttt cttatgatgt tcttgacttc tccagctctc caatccttcc tccctttctt 120
ttgcaggatt tcacatcttc tttgaccact gtattaattt catctatcgt tatcttctgc 180
acctgagatc ctctcttcta tctcttgtat tctgttggtg atgcttgagt ctgaatcaga 240
aaatgatagc ctaccactgc agctgtatta tatttagtaa aaaaaaaaaa aaaatcaacc 300
acagatccta ccagctccag gctttgagat tataactaca tcatttcctc cttacttact 360
ggcttgcttc tcctgccttg ctcagcttgc tttcctatag aacccagaac caccagccca 420
gggatgcccc cacccacaat gggccctccc accttgatca ctaattgaga aaatgcctta 480
cagttgtatt tcacggaggc atttcctcaa ttgaagctcc tttctctgtg ataactccag 540
cttgtgtcaa gttgacacac aaaaccagcc agtacaacaa gcaacacatg ctttaaacca 600
ctgagccatc tctctacccc ctcaaagtgt atacttttca aaaatataaa gcatttcatt 660
aaagaaatac accctgaggt attttcccgg tgttttactg ctgaacattt ggattgcttc 720
caatattttg ccattggaag taatcatata ataactatgt ctttgtgttt atatgcacat 780
tttgataaaa taaagtaaga ttagaacaat gatttgcaaa ggatcgtctg aagcactggt 840
ctgtgaactg agtatgcatg ttctatcctt gggggtggag gggtggggtt tcagctgcag 900
atattgggtt ctcacggctc ttctaggcat tactctaaca gggactccca ctggtagctc 960
cagctctgca tcaggtttcc tcgtggtctc ccaagcagtg aacaagatcc tttgtaacct 1020
aagaaggggc tgccacgcct ccatggctct agtattatac acacctgcag agtgagtacc 1080
acatgggcat tgctagggct aattgcttgt gatggttgaa ttgcgtgatg gtgaattgag 1140
tgatggttga attgcgtttg aacccacttg aaccacagct gaaaacaagc gagcagaggc 1200
caaggtggtc ctgggcagtg agcctattct gccctttaag gcttgccctt taatgttgct 1260
gggatgggag gggcgacctt aatattctct gaaatgcctt cgttctgatg cagagcacct 1320
ggctttattt atgctgatct ctataactag ccagttgctt gcaaatactc ttaacagaac 1380
atttcattct tttacatgta ctggctgcaa atttcccatt ttatgctcta tttatgtctt 1440
aattataatg ttatttacat agtatgtcag tcctttaccc actcacctta ccataaatgg 1500
ctgtattagt tacttttctc tttgctgtga ctaaatcact aacaagaaac agcttagagg 1560
aggggggact tgcactgcat tctaaaacca tcaaaaggga taaaattaaa atagcttagg 1620
taagttttag tcattacctt aaaccagtca taagaaacta ttccttaaac cattcattct 1680
cagggtgctg gagaaatggc tcagaggtta agaacactga ctgctcttcc agaggtcatc 1740
agttcagttt ccagcaacca catggtggct cacaaccatc tgtaataggc ttcgatgcct 1800
tcttctggtg tgtctgaaga cagcaacagt gtactcacat atataaaatg aatacataaa 1860
taaataaata atttttttaa aaagaaatca aggttttaag atacacactt aggatgctct 1920
gagaactcct ggttcagcaa tatgtttagg tcttcattgt caataccctt gaacttaaag 1980
ccataacatc atccactaat tccatgaata ccagagaaag tcccaaagca ttctctttaa 2040
ctctgacctt gagataaaag aaaaggggat tgagaaaccc tcaaaatgta cttagcaatc 2100
cactgtctcc ctccttattc tcagtggcca gaaggttgat tgtgtctagg cactatagaa 2160
tgtcaacaga cttggatgct ataatgtcga tcccacaagg ctcagcatct gggcaagaat 2220
tacaagcatt cttatctgac tcttgggcaa atgctagaag taaaagaaaa aagaaagaac 2280
ttgtaaaagc accctttcac gtgctcagac ttgcattgaa agtcaccttg agaagtgatg 2340
caggtaatta ttaagcttta tttagcctcc tgctttgtat tggctacatg tacaatttgg 2400
aacagataac tagaaatgga catagattgt ctaacccagt atacttagta ggcccctgcc 2460
cttcaaacct gtcagagttc tactgaatac agcatttaaa agcttaaatt tgagattctt 2520
aaaatgatag agactcccac atcccagcag tgacaccctc aaggtctcca agaagatacg 2580
gacacaagaa gaaagggttc cacatggact gtagtatgat aactactgag aaaccattga 2640
aaaatactgc tccattgcct tgcctgatgc tgaggcctgt cctgaactgt ggacaaacag 2700
ggacactgga aatgccttga gttaagacaa agcaaggtcg gtcttccaag ttcctcatct 2760
acaggaaggc ttctcagatt ggccaggcct gggaaccgaa gtttcagtct accctggagc 2820
ctgtgtcccc aatgaagcca tcaacatcac aggaacctag tgaagtgact aagtgatggg 2880
taagtctgtc attttaatgg gcacataggc catttagatt atgatatatc cttatcagat 2940
ctctgagagg gtttgaagcc agtgatcaac aaactgtgtc tcatggtagt aattggatag 3000
gaaaagttta tgtcaggtct gaggatatga aggcttctaa gaaagtattt gaaaatctag 3060
gaaggtcttt tgaggttgga aaatacaagt tgtggaggat ttatcttttt ttttgtttgt 3120
ttgttttcca aaacagagtt tctctgtata gccctggctg tctttttgta aacttaaaaa 3180
agactcttgt aagctagagg aaatccactc aaatccattt cccactgtca aatagactca 3240
agataagcac tgtcaattat cagcctttgt ccctccccac ctgcccattc ctgacagtgg 3300
gatctgtttc tctttccctg gtcctggaat ccccaagcct cctactacta ccaagaccgt 3360
gggcctaaga gttttgaatg tacacccaag aaagtttttc tccctaaact ttatcttctg 3420
tccctaacac ccaggtgttc tctcagcatc tgcatccaag aaagttccaa aacagtcatt 3480
cacaggtgct ctggcccagc cttacagacc gctttaatag gacctaaaca ttcctactcg 3540
gagaaaatcc cacaacctga atagtgatga aacaaccagc aattcctgga ttggctgaca 3600
ttcagctttt tgtccacacc ccataacaat ggcatcccaa tgtcagcgag aagtaatcat 3660
agatgatttc atcgcccatt attcatttat cataataata ataatgagag gctgggatgt 3720
ttgatttcag accctggtac aagg 3744
<210> 7
<211> 2442
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 7
ttccttcagt cgtcaatgct gacctgcatt ttcctgctaa tatctggttc ctgtgagtta 60
tgcgccgaag aaaatttttc tagaagctat ccttgtgatg agaaaaagca aaatgactca 120
gttattgcag agtgcagcaa tcgtcgacta caggaagttc cccaaacggt gggcaaatat 180
gtgacagaac tagacctgtc tgataatttc atcacacaca taacgaatga atcatttcaa 240
gggctgcaaa atctcactaa aataaatcta aaccacaacc ccaatgtaca gcaccagaac 300
ggaaatcccg gtatacaatc aaatggcttg aatatcacag acggggcatt cctcaaccta 360
aaaaacctaa gggagttact gcttgaagac aaccagttac cccaaatacc ctctggtttg 420
ccagagtctt tgacagaact tagtctaatt caaaacaata tatacaacat aactaaagag 480
ggcatttcaa gacttataaa cttgaaaaat ctctatttgg cctggaactg ctattttaac 540
aaagtttgcg agaaaactaa catagaagat ggagtatttg aaacgctgac aaatttggag 600
ttgctatcac tatctttcaa ttctctttca cacgtgccac ccaaactgcc aagctcccta 660
cgcaaacttt ttctgagcaa cacccagatc aaatacatta gtgaagaaga tttcaaggga 720
ttgataaatt taacattact agatttaagc gggaactgtc cgaggtgctt caatgcccca 780
tttccatgcg tgccttgtga tggtggtgct tcaattaata tagatcgttt tgcttttcaa 840
aacttgaccc aacttcgata cctaaacctc tctagcactt ccctcaggaa gattaatgct 900
gcctggttta aaaatatgcc tcatctgaag gtgctggatc ttgaattcaa ctatttagtg 960
ggagaaatag cctctggggc atttttaacg atgctgcccc gcttagaaat acttgacttg 1020
tcttttaact atataaaggg gagttatcca cagcatatta atatttccag aaacttctct 1080
aaacttttgt ctctacgggc attgcattta agaggttatg tgttccagga actcagagaa 1140
gatgatttcc agcccctgat gcagcttcca aacttatcga ctatcaactt gggtattaat 1200
tttattaagc aaatcgattt caaacttttc caaaatttct ccaatctgga aattatttac 1260
ttgtcagaaa acagaatatc accgttggta aaagataccc ggcagagtta tgcaaatagt 1320
tcctcttttc aacgtcatat ccggaaacga cgctcaacag attttgagtt tgacccacat 1380
tcgaactttt atcatttcac ccgtccttta ataaagccac aatgtgctgc ttatggaaaa 1440
gccttagatt taagcctcaa cagtattttc ttcattgggc caaaccaatt tgaaaatctt 1500
cctgacattg cctgtttaaa tctgtctgca aatagcaatg ctcaagtgtt aagtggaact 1560
gaattttcag ccattcctca tgtcaaatat ttggatttga caaacaatag actagacttt 1620
gataatgcta gtgctcttac tgaattgtcc gacttggaag ttctagatct cagctataat 1680
tcacactatt tcagaatagc aggcgtaaca catcatctag aatttattca aaatttcaca 1740
aatctaaaag ttttaaactt gagccacaac aacatttata ctttaacaga taagtataac 1800
ctggaaagca agtccctggt agaattagtt ttcagtggca atcgccttga cattttgtgg 1860
aatgatgatg acaacaggta tatctccatt ttcaaaggtc tcaagaatct gacacgtctg 1920
gatttatccc ttaataggct gaagcacatc ccaaatgaag cattccttaa tttgccagcg 1980
agtctcactg aactacatat aaatgataat atgttaaagt tttttaactg gacattactc 2040
cagcagtttc ctcgtctcga gttgcttgac ttacgtggaa acaaactact ctttttaact 2100
gatagcctat ctgactttac atcttccctt cggacactgc tgctgagtca taacaggatt 2160
tcccacctac cctctggctt tctttctgaa gtcagtagtc tgaagcacct cgatttaagt 2220
tccaatctgc taaaaacaat caacaaatcc gcacttgaaa ctaagaccac caccaaatta 2280
tctatgttgg aactacacgg aaaccccttt gaatgcacct gtgacattgg agatttccga 2340
agatggatgg atgaacatct gaatgtcaaa attcccagac tggtagatgt catttgtgcc 2400
agtcctgggg atcaaagagg gaagagtatt gtgagtctgg ag 2442
<210> 8
<211> 80
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 8
tacaatactt tcattgttgg tattccttag gaaaacatgt tccttcagtc gtcaatgctg 60
acctgcattt tcctgctaat 80
<210> 9
<211> 74
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 9
cctggggatc aaagagggaa gagtattgtg agtctggagc tcacgacttg tgtatcggat 60
accactgcag ctgt 74
<210> 10
<211> 102
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 10
gaccaagcta caccaccata tatgcagatg gtcttgctcg tcgacggtat cgataagctt 60
gatatcgaat tccgaagttc ctattctcta gaaagtatag ga 102
<210> 11
<211> 80
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 11
agtataggaa cttcatcagt caggtacata atggtggatc caggctccct ggttagtggt 60
tcagtctctg tgagcccctg 80
<210> 12
<211> 5074
<212> DNA/RNA
<213> 人工序列(Artificial Sequence)
<400> 12
ctgtcttcca acataaactt ctgcaaatcc attgctagat tttgctgatg ctaaagcgac 60
cagatttcag aagtttgcca aagtctgctc tctgcacagc tactgtacac agaggaccac 120
tgcactgcta ctacaagcta ttgaaagaga accttgaagg gatcccagcc agaaagggga 180
aagcaagagc cttccaagaa agaatttggg ctgttctatt attttctggt ccagctatag 240
agcacatctt cttccagaaa aacagagttg gatgttaaga gagaaacaaa cgttttacct 300
tcctttgtct atagaacatg gaaaacatgt tccttcagtc gtcaatgctg acctgcattt 360
tcctgctaat atctggttcc tgtgagttat gcgccgaaga aaatttttct agaagctatc 420
cttgtgatga gaaaaagcaa aatgactcag ttattgcaga gtgcagcaat cgtcgactac 480
aggaagttcc ccaaacggtg ggcaaatatg tgacagaact agacctgtct gataatttca 540
tcacacacat aacgaatgaa tcatttcaag ggctgcaaaa tctcactaaa ataaatctaa 600
accacaaccc caatgtacag caccagaacg gaaatcccgg tatacaatca aatggcttga 660
atatcacaga cggggcattc ctcaacctaa aaaacctaag ggagttactg cttgaagaca 720
accagttacc ccaaataccc tctggtttgc cagagtcttt gacagaactt agtctaattc 780
aaaacaatat atacaacata actaaagagg gcatttcaag acttataaac ttgaaaaatc 840
tctatttggc ctggaactgc tattttaaca aagtttgcga gaaaactaac atagaagatg 900
gagtatttga aacgctgaca aatttggagt tgctatcact atctttcaat tctctttcac 960
acgtgccacc caaactgcca agctccctac gcaaactttt tctgagcaac acccagatca 1020
aatacattag tgaagaagat ttcaagggat tgataaattt aacattacta gatttaagcg 1080
ggaactgtcc gaggtgcttc aatgccccat ttccatgcgt gccttgtgat ggtggtgctt 1140
caattaatat agatcgtttt gcttttcaaa acttgaccca acttcgatac ctaaacctct 1200
ctagcacttc cctcaggaag attaatgctg cctggtttaa aaatatgcct catctgaagg 1260
tgctggatct tgaattcaac tatttagtgg gagaaatagc ctctggggca tttttaacga 1320
tgctgccccg cttagaaata cttgacttgt cttttaacta tataaagggg agttatccac 1380
agcatattaa tatttccaga aacttctcta aacttttgtc tctacgggca ttgcatttaa 1440
gaggttatgt gttccaggaa ctcagagaag atgatttcca gcccctgatg cagcttccaa 1500
acttatcgac tatcaacttg ggtattaatt ttattaagca aatcgatttc aaacttttcc 1560
aaaatttctc caatctggaa attatttact tgtcagaaaa cagaatatca ccgttggtaa 1620
aagatacccg gcagagttat gcaaatagtt cctcttttca acgtcatatc cggaaacgac 1680
gctcaacaga ttttgagttt gacccacatt cgaactttta tcatttcacc cgtcctttaa 1740
taaagccaca atgtgctgct tatggaaaag ccttagattt aagcctcaac agtattttct 1800
tcattgggcc aaaccaattt gaaaatcttc ctgacattgc ctgtttaaat ctgtctgcaa 1860
atagcaatgc tcaagtgtta agtggaactg aattttcagc cattcctcat gtcaaatatt 1920
tggatttgac aaacaataga ctagactttg ataatgctag tgctcttact gaattgtccg 1980
acttggaagt tctagatctc agctataatt cacactattt cagaatagca ggcgtaacac 2040
atcatctaga atttattcaa aatttcacaa atctaaaagt tttaaacttg agccacaaca 2100
acatttatac tttaacagat aagtataacc tggaaagcaa gtccctggta gaattagttt 2160
tcagtggcaa tcgccttgac attttgtgga atgatgatga caacaggtat atctccattt 2220
tcaaaggtct caagaatctg acacgtctgg atttatccct taataggctg aagcacatcc 2280
caaatgaagc attccttaat ttgccagcga gtctcactga actacatata aatgataata 2340
tgttaaagtt ttttaactgg acattactcc agcagtttcc tcgtctcgag ttgcttgact 2400
tacgtggaaa caaactactc tttttaactg atagcctatc tgactttaca tcttcccttc 2460
ggacactgct gctgagtcat aacaggattt cccacctacc ctctggcttt ctttctgaag 2520
tcagtagtct gaagcacctc gatttaagtt ccaatctgct aaaaacaatc aacaaatccg 2580
cacttgaaac taagaccacc accaaattat ctatgttgga actacacgga aacccctttg 2640
aatgcacctg tgacattgga gatttccgaa gatggatgga tgaacatctg aatgtcaaaa 2700
ttcccagact ggtagatgtc atttgtgcca gtcctgggga tcaaagaggg aagagtattg 2760
tgagtctgga gctcacgact tgtgtatcgg ataccactgc agctgtcctg tttttcctca 2820
cattccttac cacctccatg gttatgttgg ctgctctggt tcaccacctg ttttactggg 2880
atgtttggtt tatctatcac atgtgctctg ctaagttaaa aggctacagg acttcatcca 2940
catcccaaac tttctatgat gcttatattt cttatgacac caaagatgca tctgttactg 3000
actgggtaat caatgaactg cgctaccacc ttgaagagag tgaagacaaa agtgtcctcc 3060
tttgtttaga ggagagggat tgggatccag gattacccat cattgataac ctcatgcaga 3120
gcataaacca gagcaagaaa acaatctttg ttttaaccaa gaaatatgcc aagagctgga 3180
actttaaaac agctttctac ttggccttgc agaggctaat ggatgagaac atggatgtga 3240
ttattttcat cctcctggaa ccagtgttac agtactcaca gtacctgagg cttcggcaga 3300
ggatctgtaa gagctccatc ctccagtggc ccaacaatcc caaagcagaa aacttgtttt 3360
ggcaaagtct gaaaaatgtg gtcttgactg aaaatgattc acggtatgac gatttgtaca 3420
ttgattccat taggcaatac tagtgatggg aagtcacgac tctgccatca taaaaacaca 3480
cagcttctcc ttacaatgaa ctgtggcaaa tgaaagggca tgtggctgct tgacgtgctg 3540
agaataagtg ataattgggt gcacagccct aaacaggaca attataccag cctctctaag 3600
gctagggtaa ctttgtgcaa gagggagcca gaaaaatgta tgagctagga aacaggaaga 3660
agggctggaa aatgctgtct tctgggcata acagccattg caactgtaat cacacagcag 3720
ctacggttgc ctgtcgtgag cctgcacaag actgggcctc ttatcagtca attacagatg 3780
ggggcaggtg tataaaacta tacttcatct ttctaggggg caggtcccct gtactccatc 3840
cttctgcact atcaactatt agtggatcca ggaaagagac tatcgttatc tttagctttg 3900
aacccattga tgagcctaga ttcaatagac ggttcaaaac tcatggtcac acagaaagag 3960
ttggtaaagt cagtggccag aagacaaaac aaaagggcct gaacatgaga aatggttttt 4020
taaggcggtg ggtaggttgt ccgcagtgag agaggagata ggaaaaggtg ggcatgagag 4080
taaccagaat gcatgggtct ctctcagaga tttcactctg ggatccacta ccaagctcac 4140
ccccatgcag ttgtcctcaa gattcagctt ctcttgtgtg ttagccaaaa tagaaaagct 4200
catctccctg ctatgtgaac tttccccaag gcagcttgtt tcatcaaggc cagcgagagg 4260
ggacgcagtc agtcttttgg tagtgaatca gaaagtgaca gcctaccact gcagttgtat 4320
tctatttggt aaaagtcaac cacagatcct accagctcca ggcaccaagt atctcaggtt 4380
cagggaaagc tattgaaaac cagggtaata gatctagggg cctctcagct aagaggaaag 4440
agttcgagat aagtgatctg atcgtctgag ttaatcgtta tccatttctt tggactgaaa 4500
ttaagagacg gcttgccatc ttggtcacag atcatcaata ctgggaaagt caaatcacat 4560
gggcagtctt agcaggcagc ttattatgcc tacttttcaa tccctaagaa catttgccac 4620
tgtatctaat gatcttgata ttaagatgct ggtgaagttg atacatctat gactgtgggg 4680
ttatattaca ttgtgattgc attaattttt cctttacaat gacattttaa aatcttaagt 4740
tttgagacta taactacatc attccctctt ttatcctctc tccaaaactt ctcacatgcc 4800
ccactatgac ttttataaat gctgcctgca atatttgctt tttaagcttt tgatgttgct 4860
taaacactga aaaggatatt tttaaaaagc cttttaatcc taaatcattt taaaggtatt 4920
cgtttaaatg aaaagctttt gggggtattt gttatttccc attggtgtat attttcaaat 4980
gaatgattaa aacgtcttca ttttacatga caaatgtgtg ataaatctgt agtagaaaac 5040
agaactctga tataaaagtt gtcatttgca ctga 5074
<210> 13
<211> 1041
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 13
Met Glu Asn Met Phe Leu Gln Ser Ser Met Leu Thr Cys Ile Phe Leu
1 5 10 15
Leu Ile Ser Gly Ser Cys Glu Leu Cys Ala Glu Glu Asn Phe Ser Arg
20 25 30
Ser Tyr Pro Cys Asp Glu Lys Lys Gln Asn Asp Ser Val Ile Ala Glu
35 40 45
Cys Ser Asn Arg Arg Leu Gln Glu Val Pro Gln Thr Val Gly Lys Tyr
50 55 60
Val Thr Glu Leu Asp Leu Ser Asp Asn Phe Ile Thr His Ile Thr Asn
65 70 75 80
Glu Ser Phe Gln Gly Leu Gln Asn Leu Thr Lys Ile Asn Leu Asn His
85 90 95
Asn Pro Asn Val Gln His Gln Asn Gly Asn Pro Gly Ile Gln Ser Asn
100 105 110
Gly Leu Asn Ile Thr Asp Gly Ala Phe Leu Asn Leu Lys Asn Leu Arg
115 120 125
Glu Leu Leu Leu Glu Asp Asn Gln Leu Pro Gln Ile Pro Ser Gly Leu
130 135 140
Pro Glu Ser Leu Thr Glu Leu Ser Leu Ile Gln Asn Asn Ile Tyr Asn
145 150 155 160
Ile Thr Lys Glu Gly Ile Ser Arg Leu Ile Asn Leu Lys Asn Leu Tyr
165 170 175
Leu Ala Trp Asn Cys Tyr Phe Asn Lys Val Cys Glu Lys Thr Asn Ile
180 185 190
Glu Asp Gly Val Phe Glu Thr Leu Thr Asn Leu Glu Leu Leu Ser Leu
195 200 205
Ser Phe Asn Ser Leu Ser His Val Pro Pro Lys Leu Pro Ser Ser Leu
210 215 220
Arg Lys Leu Phe Leu Ser Asn Thr Gln Ile Lys Tyr Ile Ser Glu Glu
225 230 235 240
Asp Phe Lys Gly Leu Ile Asn Leu Thr Leu Leu Asp Leu Ser Gly Asn
245 250 255
Cys Pro Arg Cys Phe Asn Ala Pro Phe Pro Cys Val Pro Cys Asp Gly
260 265 270
Gly Ala Ser Ile Asn Ile Asp Arg Phe Ala Phe Gln Asn Leu Thr Gln
275 280 285
Leu Arg Tyr Leu Asn Leu Ser Ser Thr Ser Leu Arg Lys Ile Asn Ala
290 295 300
Ala Trp Phe Lys Asn Met Pro His Leu Lys Val Leu Asp Leu Glu Phe
305 310 315 320
Asn Tyr Leu Val Gly Glu Ile Ala Ser Gly Ala Phe Leu Thr Met Leu
325 330 335
Pro Arg Leu Glu Ile Leu Asp Leu Ser Phe Asn Tyr Ile Lys Gly Ser
340 345 350
Tyr Pro Gln His Ile Asn Ile Ser Arg Asn Phe Ser Lys Leu Leu Ser
355 360 365
Leu Arg Ala Leu His Leu Arg Gly Tyr Val Phe Gln Glu Leu Arg Glu
370 375 380
Asp Asp Phe Gln Pro Leu Met Gln Leu Pro Asn Leu Ser Thr Ile Asn
385 390 395 400
Leu Gly Ile Asn Phe Ile Lys Gln Ile Asp Phe Lys Leu Phe Gln Asn
405 410 415
Phe Ser Asn Leu Glu Ile Ile Tyr Leu Ser Glu Asn Arg Ile Ser Pro
420 425 430
Leu Val Lys Asp Thr Arg Gln Ser Tyr Ala Asn Ser Ser Ser Phe Gln
435 440 445
Arg His Ile Arg Lys Arg Arg Ser Thr Asp Phe Glu Phe Asp Pro His
450 455 460
Ser Asn Phe Tyr His Phe Thr Arg Pro Leu Ile Lys Pro Gln Cys Ala
465 470 475 480
Ala Tyr Gly Lys Ala Leu Asp Leu Ser Leu Asn Ser Ile Phe Phe Ile
485 490 495
Gly Pro Asn Gln Phe Glu Asn Leu Pro Asp Ile Ala Cys Leu Asn Leu
500 505 510
Ser Ala Asn Ser Asn Ala Gln Val Leu Ser Gly Thr Glu Phe Ser Ala
515 520 525
Ile Pro His Val Lys Tyr Leu Asp Leu Thr Asn Asn Arg Leu Asp Phe
530 535 540
Asp Asn Ala Ser Ala Leu Thr Glu Leu Ser Asp Leu Glu Val Leu Asp
545 550 555 560
Leu Ser Tyr Asn Ser His Tyr Phe Arg Ile Ala Gly Val Thr His His
565 570 575
Leu Glu Phe Ile Gln Asn Phe Thr Asn Leu Lys Val Leu Asn Leu Ser
580 585 590
His Asn Asn Ile Tyr Thr Leu Thr Asp Lys Tyr Asn Leu Glu Ser Lys
595 600 605
Ser Leu Val Glu Leu Val Phe Ser Gly Asn Arg Leu Asp Ile Leu Trp
610 615 620
Asn Asp Asp Asp Asn Arg Tyr Ile Ser Ile Phe Lys Gly Leu Lys Asn
625 630 635 640
Leu Thr Arg Leu Asp Leu Ser Leu Asn Arg Leu Lys His Ile Pro Asn
645 650 655
Glu Ala Phe Leu Asn Leu Pro Ala Ser Leu Thr Glu Leu His Ile Asn
660 665 670
Asp Asn Met Leu Lys Phe Phe Asn Trp Thr Leu Leu Gln Gln Phe Pro
675 680 685
Arg Leu Glu Leu Leu Asp Leu Arg Gly Asn Lys Leu Leu Phe Leu Thr
690 695 700
Asp Ser Leu Ser Asp Phe Thr Ser Ser Leu Arg Thr Leu Leu Leu Ser
705 710 715 720
His Asn Arg Ile Ser His Leu Pro Ser Gly Phe Leu Ser Glu Val Ser
725 730 735
Ser Leu Lys His Leu Asp Leu Ser Ser Asn Leu Leu Lys Thr Ile Asn
740 745 750
Lys Ser Ala Leu Glu Thr Lys Thr Thr Thr Lys Leu Ser Met Leu Glu
755 760 765
Leu His Gly Asn Pro Phe Glu Cys Thr Cys Asp Ile Gly Asp Phe Arg
770 775 780
Arg Trp Met Asp Glu His Leu Asn Val Lys Ile Pro Arg Leu Val Asp
785 790 795 800
Val Ile Cys Ala Ser Pro Gly Asp Gln Arg Gly Lys Ser Ile Val Ser
805 810 815
Leu Glu Leu Thr Thr Cys Val Ser Asp Thr Thr Ala Ala Val Leu Phe
820 825 830
Phe Leu Thr Phe Leu Thr Thr Ser Met Val Met Leu Ala Ala Leu Val
835 840 845
His His Leu Phe Tyr Trp Asp Val Trp Phe Ile Tyr His Met Cys Ser
850 855 860
Ala Lys Leu Lys Gly Tyr Arg Thr Ser Ser Thr Ser Gln Thr Phe Tyr
865 870 875 880
Asp Ala Tyr Ile Ser Tyr Asp Thr Lys Asp Ala Ser Val Thr Asp Trp
885 890 895
Val Ile Asn Glu Leu Arg Tyr His Leu Glu Glu Ser Glu Asp Lys Ser
900 905 910
Val Leu Leu Cys Leu Glu Glu Arg Asp Trp Asp Pro Gly Leu Pro Ile
915 920 925
Ile Asp Asn Leu Met Gln Ser Ile Asn Gln Ser Lys Lys Thr Ile Phe
930 935 940
Val Leu Thr Lys Lys Tyr Ala Lys Ser Trp Asn Phe Lys Thr Ala Phe
945 950 955 960
Tyr Leu Ala Leu Gln Arg Leu Met Asp Glu Asn Met Asp Val Ile Ile
965 970 975
Phe Ile Leu Leu Glu Pro Val Leu Gln Tyr Ser Gln Tyr Leu Arg Leu
980 985 990
Arg Gln Arg Ile Cys Lys Ser Ser Ile Leu Gln Trp Pro Asn Asn Pro
995 1000 1005
Lys Ala Glu Asn Leu Phe Trp Gln Ser Leu Lys Asn Val Val Leu Thr
1010 1015 1020
Glu Asn Asp Ser Arg Tyr Asp Asp Leu Tyr Ile Asp Ser Ile Arg Gln
1025 1030 1035 1040
Tyr
<210> 14
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 14
ttgtagcact taacacagta cctga 25
<210> 15
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 15
ttcttcggcg cataactcac aggaa 25
<210> 16
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 16
gctcgactag agcttgcgga 20
<210> 17
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 17
acctggaatg gtatatgtgg attgt 25
<210> 18
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 18
gattagctta aggcccagca caaaa 25
<210> 19
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 19
ctggtcgaga gaggacgcga cagtt 25
<210> 20
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 20
tcctcaggag aactgaaggc catgt 25
<210> 21
<211> 26
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 21
gcacccacaa aagattcaag tctgcc 26
<210> 22
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 22
attttccaca gcatttgagt cttgc 25
<210> 23
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 23
gtatgtggta aatttgagga tgccc 25
<210> 24
<211> 21
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 24
ggatcggcca ttgaacaaga t 21
<210> 25
<211> 22
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 25
cagaagaact cgtcaagaag gc 22
<210> 26
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 26
tttatcttta cctttccagc tccct 25
<210> 27
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 27
caggcaaccc agcaggtata gtata 25
<210> 28
<211> 22
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 28
acactttctg ccttaacctt cc 22
<210> 29
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 29
tggagagctg gagaagtcaa 20
<210> 30
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 30
gacaagcgtt agtaggcaca tatac 25
<210> 31
<211> 24
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 31
gctccaattt cccacaacat tagt 24
Claims (23)
1.一种嵌合TLR8蛋白,其特征在于,所述的嵌合TLR8蛋白包含人TLR8蛋白的全部或部分。
2.根据权利要求1所述的嵌合TLR8蛋白,其特征在于,所述的嵌合TLR8蛋白包含人TLR8蛋白的信号肽、跨膜区、胞质区和/或胞外区的全部或部分,优选的,所述的嵌合TLR8蛋白包含人TLR8蛋白的信号肽和/或胞外区的全部或部分,优选的,所述的人TLR8蛋白胞外区的部分包含至少200、250、300或350个氨基酸,进一步优选的,所述的胞外区包含与SEQ ID NO:4的第27-818位、第27-822位或第27-825位具有至少60%、65%、70%、80%、85%、90%、95%或至少99%同一性的氨基酸序列或者包含SEQ ID NO:4的第27-818位、第27-822位或第27-825位所示的氨基酸序列;所述的人TLR8蛋白信号肽的部分包含至少10、15或20个氨基酸,优选的,所述的信号肽包含与SEQ ID NO:4的第5-26位具有至少60%、65%、70%、80%、85%、90%、95%或至少99%同一性的氨基酸序列或者包含与SEQ ID NO:4的第5-26位所示的氨基酸序列。
3.根据权利要求1或2所述的嵌合TLR8蛋白,其特征在于,所述的嵌合TLR8蛋白中包含的人TLR8蛋白的部分氨基酸序列选自下列组中的一种:
A)包含SEQ ID NO:4第5-818、1-818、1-822、1-825、5-822或5-825位氨基酸序列的全部或部分;
B)包含与SEQ ID NO:4的第5-818、1-818、1-822、1-825、5-822或5-825位氨基酸序列同一性至少为75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%或至少99%的氨基酸序列;
C)包含与SEQ ID NO:4的第5-818、1-818、1-822、1-825、5-822或5-825位所示氨基酸序列差异不超过10、9、8、7、6、5、4、3、2或不超过1个氨基酸的氨基酸序列;或
D)包含SEQ ID NO:4的第5-818、1-818、1-822、1-825、5-822或5-825位所示的,包括取代、缺失和/或插入一个或多个氨基酸残基的氨基酸序列。
4.根据权利要求1-3任一所述的嵌合TLR8蛋白,其特征在于,所述的嵌合TLR8蛋白的氨基酸序列选自下列组中的一种:
a)包含SEQ ID NO:13氨基酸序列的全部或部分;
b)包含与SEQ ID NO:13氨基酸序列同一性至少为60%、70%、75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%或至少99%的氨基酸序列;
c)包含与SEQ ID NO:13所示氨基酸序列差异不超过10、9、8、7、6、5、4、3、2或不超过1个氨基酸的氨基酸序列;或
d)包含SEQ ID NO:13所示的,包括取代、缺失和/或插入一个或多个氨基酸残基的氨基酸序列。
5.一种TLR8嵌合基因,其特征在于,所述的TLR8嵌合基因包含人TLR8基因的部分。
6.根据权利要求5所述的TLR8嵌合基因,其特征在于,所述的TLR8嵌合基因编码权利要求1-4任一所述的嵌合TLR8蛋白。
7.根据权利要求5或6所述的TLR8嵌合基因,其特征在于,所述的TLR8嵌合基因包含人TLR8基因的1号外显子至2号外显子的全部或部分;优选的,所述的人TLR8基因的部分包含2号外显子的全部或部分,其中,2号外显子的部分至少包含500bp的核苷酸序列,优选的,所述的人TLR8基因的部分至少包含SEQ ID NO:7所示核苷酸序列。
8.根据权利要求5-7任一所述的TLR8嵌合基因,其特征在于,所述的TLR8嵌合基因中包含的人TLR8基因的部分选自下列组中的一种:
(A)包含SEQ ID NO:7所示核苷酸序列的全部或部分;
(B)包含与SEQ ID NO:7所示核苷酸序列的同一性至少为75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%或至少99%的核苷酸序列;
(C)包含与SEQ ID NO:7所示核苷酸序列差异不超过10、9、8、7、6、5、4、3、2或不超过1个核苷酸的核苷酸序列;
(D)具有SEQ ID NO:7所示核苷酸序列的,包括取代、缺失和/或插入一个或多个核苷酸的核苷酸序列。
9.根据权利要求5-8任一所述的TLR8嵌合基因,其特征在于,所述的TLR8嵌合基因的核苷酸序列转录的mRNA选自下列组中的一种:
(a)包含SEQ ID NO:12所示核苷酸序列的全部或部分;
(b)包含与SEQ ID NO:12所示核苷酸序列的同一性至少为75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%或至少99%的核苷酸序列;
(c)包含与SEQ ID NO:12所示的核苷酸序列差异不超过10、9、8、7、6、5、4、3、2或不超过1个核苷酸的核苷酸序列;或
(d)包含与SEQ ID NO:12所示的核苷酸序列所示的,包括取代、缺失和/或插入一个或多个核苷酸的核苷酸序列。
10.根据权利要求5-9任一所述的TLR8嵌合基因,其特征在于,所述的TLR8嵌合基因包含编码人TLR8蛋白的cDNA。
11.根据权利要求5-10任一所述的TLR8嵌合基因,其特征在于,所述的TLR8嵌合基因对于被替换的内源TLR8基因座是纯合或者杂合。
12.一种靶向载体,其特征在于,所述的靶向载体包含人TLR8基因的部分,优选的,所述的人TLR8基因的部分包含1号至2号外显子的全部或部分;进一步优选的,所述的人TLR8基因的部分包含2号外显子的全部或部分,其中,2号外显子的部分至少包含500bp的核苷酸序列,优选的,所述的人TLR8基因的部分至少包含SEQ ID NO:7所示核苷酸序列。
13.根据权利要求12所述的靶向载体,其特征在于,所述的靶向载体包含编码人TLR8蛋白的cDNA的至少部分。
14.根据权利要求12-13任一所述的靶向载体,其特征在于,所述的靶向载体还包含5’臂,其选自非人动物TLR8基因基因组DNA的100-10000个长度的核苷酸;优选的,所述的5’臂与NCBI登录号为NC_000086.7至少具有90%同源性的核苷酸;进一步优选的,所述5’臂序列与SEQ ID NO:5至少具有90%同源性,或者如SEQ ID NO:5所示;和/或,所述的靶向载体还包含3’臂,其选自非人动物TLR8基因基因组DNA的100-10000个长度的核苷酸;优选的,所述的3’臂与NCBI登录号为NC_000086.7至少具有90%同源性的核苷酸;进一步优选的,所述的3’臂序列与SEQ ID NO:6至少具有90%同源性,或者如SEQ ID NO:6所示。
15.一种TLR8基因人源化的非人动物的构建方法,其特征在于,所述的构建方法包括用包含人TLR8基因的部分导入非人动物TLR8基因座,所述的非人动物体内表达人TLR8蛋白或嵌合TLR8蛋白。
16.根据权利要求15所述的构建方法,其特征在于,所述的嵌合TLR8蛋白如权利要求1-4任一所述的嵌合TLR8蛋白。
17.根据权利要求15或16所述的构建方法,其特征在于,所述的非人动物的基因组中还包含人TLR8基因或TLR8嵌合基因,所述的TLR8嵌合基因如权利要求5-11任一所述的TLR8嵌合基因。
18.根据权利要求15-17任一所述的构建方法,其特征在于,所述的导入的人TLR8基因的部分包含人TLR8基因1号至2号外显子的全部或部分;优选的,包含人TLR8基因2号外显子的全部或部分,其中,所述人TLR8基因2号外显子的部分至少包含500bp的核苷酸序列,优选的,所述人TLR8基因2号外显子的部分包含SEQ ID NO:7所示的核苷酸序列。
19.根据权利要求15-18任一所述的构建方法,其特征在于,所述的构建方法包括将人TLR8基因的部分导入非人动物3号外显子。
20.根据权利要求15-19任一所述的构建方法,其特征在于,使用权利要求12-14任一所述的靶向载体进行非人动物的构建。
21.根据权利要求15-20任一所述的构建方法,其特征在于,所述非人动物为至少包含人或人源化CD3、PD-L1、PD-1基因中至少一种的人源化的非人动物。
22.一种包含TLR8基因人源化改造的细胞、组织或者器官,所述的细胞、组织或者器官包含权利要求5-11任一所述的TLR8嵌合基因,所述的细胞、组织或者器官表达权利要求1-4任一所述的嵌合TLR8蛋白。
23.来源于权利要求1-4任一所述的嵌合TLR8蛋白、权利要求5-11任一所述的TLR8嵌合基因、权利要求15-21任一所述的构建方法获得的非人动物、权利要求22所述的细胞、组织或者器官在需要涉及人类细胞的免疫过程的产品开发,制造抗体,或者作为药理学、免疫学、微生物学、医学研究的模型系统中的应用;或者在生产和利用动物实验疾病模型,用于开发新的诊断策略和/或治疗策略中的应用;或者在筛选、验证、评价或研究TLR8功能、人TLR8信号机理、靶向人的抗体、靶向人的药物、药效,免疫相关疾病药物以及抗肿瘤或抗炎症药物,筛选和评估人用药及药效研究方面的应用。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2020106821708 | 2020-07-15 | ||
CN202010682170 | 2020-07-15 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113651892A true CN113651892A (zh) | 2021-11-16 |
Family
ID=78489395
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110802477.1A Withdrawn CN113651892A (zh) | 2020-07-15 | 2021-07-15 | Tlr8基因人源化的非人动物及其构建方法和应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113651892A (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2024002259A1 (zh) * | 2022-06-29 | 2024-01-04 | 百奥赛图(北京)医药科技股份有限公司 | 一种osm、osmr、il31ra和/或il31基因修饰的非人动物 |
WO2024012578A1 (zh) * | 2022-07-15 | 2024-01-18 | 百奥赛图(北京)医药科技股份有限公司 | 一种tlr7和/或tlr8基因人源化修饰的非人动物 |
-
2021
- 2021-07-15 CN CN202110802477.1A patent/CN113651892A/zh not_active Withdrawn
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2024002259A1 (zh) * | 2022-06-29 | 2024-01-04 | 百奥赛图(北京)医药科技股份有限公司 | 一种osm、osmr、il31ra和/或il31基因修饰的非人动物 |
WO2024012578A1 (zh) * | 2022-07-15 | 2024-01-18 | 百奥赛图(北京)医药科技股份有限公司 | 一种tlr7和/或tlr8基因人源化修饰的非人动物 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108588126B (zh) | Cd47基因人源化改造动物模型的制备方法及应用 | |
US10912287B2 (en) | Genetically modified mice expressing humanized PD-1 | |
CN111793646B (zh) | Il1r1基因人源化改造的非人动物的构建方法及其应用 | |
CN109136275B (zh) | 人源化gitr基因改造动物模型的制备方法及应用 | |
CN113388640B (zh) | Ccr4基因人源化的非人动物及其构建方法和应用 | |
CN113046389B (zh) | 一种ccr2基因人源化的非人动物及其构建方法和应用 | |
CN111793647B (zh) | Cd226基因人源化非人动物的构建方法及应用 | |
CN113651892A (zh) | Tlr8基因人源化的非人动物及其构建方法和应用 | |
CN107955817B (zh) | 人源化基因改造动物模型的制备方法及应用 | |
CN114751973B (zh) | Siglec15基因人源化非人动物的构建方法和应用 | |
CN113881681B (zh) | Ccr8基因人源化非人动物及其构建方法和应用 | |
CN112501206B (zh) | Psma基因人源化非人动物的构建方法及应用 | |
CN112553252B (zh) | Tnfr2基因人源化的非人动物的构建方法和应用 | |
CN112553213B (zh) | Cx3cr1基因人源化的非人动物及其构建方法和应用 | |
CN115011606A (zh) | Cd37基因人源化非人动物的构建方法及应用 | |
CN113831403A (zh) | Sting基因人源化非人动物的构建方法及应用 | |
CN111304247A (zh) | 人源化lag-3基因改造动物模型的制备方法及应用 | |
CN115010800B (zh) | Pvrig基因人源化非人动物的构建方法及应用 | |
CN117384960B (zh) | 一种tl1a基因人源化非人动物及其构建方法和应用 | |
CN115161326B (zh) | Sost基因人源化非人动物及其构建方法和应用 | |
WO2024012578A1 (zh) | 一种tlr7和/或tlr8基因人源化修饰的非人动物 | |
CN114990128A (zh) | Cd20基因人源化非人动物的构建方法及应用 | |
CN115010799A (zh) | Bcma基因人源化非人动物的构建方法及应用 | |
CN116420680A (zh) | Igf1r基因人源化非人动物及其构建方法和应用 | |
CN115918611A (zh) | Tgfbr2基因人源化非人动物及其构建方法和应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WW01 | Invention patent application withdrawn after publication |
Application publication date: 20211116 |
|
WW01 | Invention patent application withdrawn after publication |