CN1721443A - 人受体蛋白;相关的试剂和方法 - Google Patents
人受体蛋白;相关的试剂和方法 Download PDFInfo
- Publication number
- CN1721443A CN1721443A CNA2005100885947A CN200510088594A CN1721443A CN 1721443 A CN1721443 A CN 1721443A CN A2005100885947 A CNA2005100885947 A CN A2005100885947A CN 200510088594 A CN200510088594 A CN 200510088594A CN 1721443 A CN1721443 A CN 1721443A
- Authority
- CN
- China
- Prior art keywords
- leu
- ser
- asn
- phe
- ile
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 163
- 102000005962 receptors Human genes 0.000 title abstract description 53
- 108020003175 receptors Proteins 0.000 title abstract description 53
- 241000282414 Homo sapiens Species 0.000 title abstract description 21
- 239000003153 chemical reaction reagent Substances 0.000 title description 29
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 60
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 55
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 55
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 117
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 53
- 229920001184 polypeptide Polymers 0.000 claims description 52
- 230000004927 fusion Effects 0.000 claims description 30
- 239000013604 expression vector Substances 0.000 claims description 12
- 125000000539 amino acid group Chemical group 0.000 claims description 10
- 230000000890 antigenic effect Effects 0.000 claims description 8
- 238000004519 manufacturing process Methods 0.000 claims description 5
- 108010021625 Immunoglobulin Fragments Proteins 0.000 claims description 2
- 102000008394 Immunoglobulin Fragments Human genes 0.000 claims description 2
- 239000012634 fragment Substances 0.000 abstract description 75
- 239000000203 mixture Substances 0.000 abstract description 43
- 230000001225 therapeutic effect Effects 0.000 abstract description 8
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 244
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 243
- 108090000623 proteins and genes Proteins 0.000 description 113
- 239000002773 nucleotide Substances 0.000 description 96
- 125000003729 nucleotide group Chemical group 0.000 description 96
- 210000004027 cell Anatomy 0.000 description 95
- 239000000370 acceptor Substances 0.000 description 88
- 241000282326 Felis catus Species 0.000 description 62
- 102000004169 proteins and genes Human genes 0.000 description 61
- 229940024606 amino acid Drugs 0.000 description 56
- 235000001014 amino acid Nutrition 0.000 description 56
- 150000001413 amino acids Chemical class 0.000 description 55
- 235000018102 proteins Nutrition 0.000 description 55
- 239000000523 sample Substances 0.000 description 50
- 241000288906 Primates Species 0.000 description 44
- 125000003275 alpha amino acid group Chemical group 0.000 description 39
- 108020004414 DNA Proteins 0.000 description 37
- 230000014509 gene expression Effects 0.000 description 35
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 33
- 108020004635 Complementary DNA Proteins 0.000 description 32
- 150000001875 compounds Chemical class 0.000 description 32
- 108010034529 leucyl-lysine Proteins 0.000 description 32
- 230000027455 binding Effects 0.000 description 30
- 238000010804 cDNA synthesis Methods 0.000 description 28
- 239000002299 complementary DNA Substances 0.000 description 28
- 238000012360 testing method Methods 0.000 description 28
- 230000000694 effects Effects 0.000 description 26
- 230000000875 corresponding effect Effects 0.000 description 24
- 238000002360 preparation method Methods 0.000 description 24
- 230000008521 reorganization Effects 0.000 description 24
- 241000880493 Leptailurus serval Species 0.000 description 21
- 108700026244 Open Reading Frames Proteins 0.000 description 21
- 238000005516 engineering process Methods 0.000 description 21
- 230000006870 function Effects 0.000 description 21
- 108010025306 histidylleucine Proteins 0.000 description 21
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 20
- 102000000589 Interleukin-1 Human genes 0.000 description 20
- 108010002352 Interleukin-1 Proteins 0.000 description 20
- 108010050848 glycylleucine Proteins 0.000 description 20
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 19
- 108010057821 leucylproline Proteins 0.000 description 19
- 241000699666 Mus <mouse, genus> Species 0.000 description 18
- 239000005557 antagonist Substances 0.000 description 18
- 235000013399 edible fruits Nutrition 0.000 description 18
- 241000124008 Mammalia Species 0.000 description 17
- 238000004422 calculation algorithm Methods 0.000 description 17
- 239000000758 substrate Substances 0.000 description 17
- 239000000556 agonist Substances 0.000 description 16
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 16
- 108010092114 histidylphenylalanine Proteins 0.000 description 16
- 238000009396 hybridization Methods 0.000 description 16
- 239000000047 product Substances 0.000 description 16
- 108010048818 seryl-histidine Proteins 0.000 description 16
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 15
- 230000008859 change Effects 0.000 description 15
- 239000003446 ligand Substances 0.000 description 15
- 150000003839 salts Chemical class 0.000 description 15
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 14
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 14
- 230000036039 immunity Effects 0.000 description 14
- 238000003018 immunoassay Methods 0.000 description 14
- 230000001105 regulatory effect Effects 0.000 description 14
- 238000012216 screening Methods 0.000 description 14
- 241000894007 species Species 0.000 description 14
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 13
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 13
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 13
- 239000003795 chemical substances by application Substances 0.000 description 13
- 230000002163 immunogen Effects 0.000 description 13
- 108020004999 messenger RNA Proteins 0.000 description 13
- 210000001519 tissue Anatomy 0.000 description 13
- 238000011282 treatment Methods 0.000 description 13
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 12
- 108010011559 alanylphenylalanine Proteins 0.000 description 12
- 230000004071 biological effect Effects 0.000 description 12
- 238000013016 damping Methods 0.000 description 12
- 239000003814 drug Substances 0.000 description 12
- 239000012530 fluid Substances 0.000 description 12
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 12
- 230000036961 partial effect Effects 0.000 description 12
- 102000004190 Enzymes Human genes 0.000 description 11
- 108090000790 Enzymes Proteins 0.000 description 11
- 241000238631 Hexapoda Species 0.000 description 11
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 11
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 11
- 239000000427 antigen Substances 0.000 description 11
- 108091007433 antigens Proteins 0.000 description 11
- 102000036639 antigens Human genes 0.000 description 11
- 238000006243 chemical reaction Methods 0.000 description 11
- 230000000052 comparative effect Effects 0.000 description 11
- 238000002474 experimental method Methods 0.000 description 11
- 102000014909 interleukin-1 receptor activity proteins Human genes 0.000 description 11
- 108040006732 interleukin-1 receptor activity proteins Proteins 0.000 description 11
- 108010054155 lysyllysine Proteins 0.000 description 11
- 108010051242 phenylalanylserine Proteins 0.000 description 11
- 230000008569 process Effects 0.000 description 11
- 239000012266 salt solution Substances 0.000 description 11
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 11
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 10
- 108010077245 asparaginyl-proline Proteins 0.000 description 10
- 230000008878 coupling Effects 0.000 description 10
- 238000010168 coupling process Methods 0.000 description 10
- 238000005859 coupling reaction Methods 0.000 description 10
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 10
- 238000003752 polymerase chain reaction Methods 0.000 description 10
- 230000000699 topical effect Effects 0.000 description 10
- 238000013518 transcription Methods 0.000 description 10
- 230000035897 transcription Effects 0.000 description 10
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 9
- 241000196324 Embryophyta Species 0.000 description 9
- 241001465754 Metazoa Species 0.000 description 9
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 9
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 9
- 108010087924 alanylproline Proteins 0.000 description 9
- 238000004458 analytical method Methods 0.000 description 9
- 239000000969 carrier Substances 0.000 description 9
- 239000000306 component Substances 0.000 description 9
- 238000001514 detection method Methods 0.000 description 9
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 9
- 230000004481 post-translational protein modification Effects 0.000 description 9
- 108010029020 prolylglycine Proteins 0.000 description 9
- 108010026333 seryl-proline Proteins 0.000 description 9
- 239000000126 substance Substances 0.000 description 9
- 108010073969 valyllysine Proteins 0.000 description 9
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 8
- 108091026890 Coding region Proteins 0.000 description 8
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 8
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 8
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 8
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 8
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 8
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 8
- 108010005233 alanylglutamic acid Proteins 0.000 description 8
- 238000003556 assay Methods 0.000 description 8
- 238000010367 cloning Methods 0.000 description 8
- 201000010099 disease Diseases 0.000 description 8
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 8
- 230000002068 genetic effect Effects 0.000 description 8
- 108010078144 glutaminyl-glycine Proteins 0.000 description 8
- 230000001900 immune effect Effects 0.000 description 8
- 108010027338 isoleucylcysteine Proteins 0.000 description 8
- 238000002372 labelling Methods 0.000 description 8
- 108010000761 leucylarginine Proteins 0.000 description 8
- 238000012545 processing Methods 0.000 description 8
- 108010015796 prolylisoleucine Proteins 0.000 description 8
- 239000007790 solid phase Substances 0.000 description 8
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 7
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 7
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 7
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 7
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 7
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 7
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 7
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 7
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 7
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 7
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 7
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 7
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 7
- 230000000295 complement effect Effects 0.000 description 7
- 230000009260 cross reactivity Effects 0.000 description 7
- 230000013595 glycosylation Effects 0.000 description 7
- 238000006206 glycosylation reaction Methods 0.000 description 7
- 230000003993 interaction Effects 0.000 description 7
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 7
- 108010003700 lysyl aspartic acid Proteins 0.000 description 7
- 108010009298 lysylglutamic acid Proteins 0.000 description 7
- 230000004048 modification Effects 0.000 description 7
- 238000012986 modification Methods 0.000 description 7
- 238000010369 molecular cloning Methods 0.000 description 7
- 108010070643 prolylglutamic acid Proteins 0.000 description 7
- 238000011160 research Methods 0.000 description 7
- -1 sequence Substances 0.000 description 7
- 239000007787 solid Substances 0.000 description 7
- 108010080629 tryptophan-leucine Proteins 0.000 description 7
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 6
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 6
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 6
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 6
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 6
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 6
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 6
- 108050006617 Interleukin-1 receptor Proteins 0.000 description 6
- 102000019223 Interleukin-1 receptor Human genes 0.000 description 6
- 102100036342 Interleukin-1 receptor-associated kinase 1 Human genes 0.000 description 6
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 6
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 6
- FEHQLKKBVJHSEC-SZMVWBNQSA-N Leu-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FEHQLKKBVJHSEC-SZMVWBNQSA-N 0.000 description 6
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 6
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 6
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 6
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 6
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 6
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 6
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 6
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 6
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 6
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 6
- 108010070944 alanylhistidine Proteins 0.000 description 6
- 238000013459 approach Methods 0.000 description 6
- 230000008034 disappearance Effects 0.000 description 6
- 239000002552 dosage form Substances 0.000 description 6
- 210000003527 eukaryotic cell Anatomy 0.000 description 6
- 108010037850 glycylvaline Proteins 0.000 description 6
- 108010018006 histidylserine Proteins 0.000 description 6
- 238000003780 insertion Methods 0.000 description 6
- 230000037431 insertion Effects 0.000 description 6
- 230000003834 intracellular effect Effects 0.000 description 6
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 6
- 239000003550 marker Substances 0.000 description 6
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 6
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 6
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- 108010031719 prolyl-serine Proteins 0.000 description 6
- 230000004044 response Effects 0.000 description 6
- 238000010561 standard procedure Methods 0.000 description 6
- 238000013519 translation Methods 0.000 description 6
- 230000014616 translation Effects 0.000 description 6
- 108010076441 Ala-His-His Proteins 0.000 description 5
- NIUDXSFNLBIWOB-DCAQKATOSA-N Arg-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NIUDXSFNLBIWOB-DCAQKATOSA-N 0.000 description 5
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 5
- 241000894006 Bacteria Species 0.000 description 5
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 5
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 5
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 5
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 5
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 5
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 5
- 102000039996 IL-1 family Human genes 0.000 description 5
- 108091069196 IL-1 family Proteins 0.000 description 5
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 5
- WYUHAXJAMDTOAU-IAVJCBSLSA-N Ile-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WYUHAXJAMDTOAU-IAVJCBSLSA-N 0.000 description 5
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 5
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 5
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 5
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 5
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 5
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 5
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 5
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 5
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 5
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 5
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 5
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 5
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 5
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 5
- 108020005091 Replication Origin Proteins 0.000 description 5
- 241000283984 Rodentia Species 0.000 description 5
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 5
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 5
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 5
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 5
- 239000002253 acid Substances 0.000 description 5
- 108010013835 arginine glutamate Proteins 0.000 description 5
- 230000001276 controlling effect Effects 0.000 description 5
- 108010016616 cysteinylglycine Proteins 0.000 description 5
- 108010069495 cysteinyltyrosine Proteins 0.000 description 5
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 5
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 5
- 108010036413 histidylglycine Proteins 0.000 description 5
- 108010028295 histidylhistidine Proteins 0.000 description 5
- 108020001756 ligand binding domains Proteins 0.000 description 5
- 108010064235 lysylglycine Proteins 0.000 description 5
- 230000007246 mechanism Effects 0.000 description 5
- 210000002826 placenta Anatomy 0.000 description 5
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 5
- 210000001236 prokaryotic cell Anatomy 0.000 description 5
- 108010053725 prolylvaline Proteins 0.000 description 5
- 239000011347 resin Substances 0.000 description 5
- 229920005989 resin Polymers 0.000 description 5
- 108010084932 tryptophyl-proline Proteins 0.000 description 5
- 108010051110 tyrosyl-lysine Proteins 0.000 description 5
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 4
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 4
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 4
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 4
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 4
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 4
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 4
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 4
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 4
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 4
- 206010010356 Congenital anomaly Diseases 0.000 description 4
- DIHCYBRLTVEPBW-SRVKXCTJSA-N Cys-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N DIHCYBRLTVEPBW-SRVKXCTJSA-N 0.000 description 4
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 4
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 4
- UVAOVENCIONMJP-GUBZILKMSA-N Gln-Cys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O UVAOVENCIONMJP-GUBZILKMSA-N 0.000 description 4
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 4
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 4
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 4
- SBYVDRJAXWSXQL-AVGNSLFASA-N Glu-Asn-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SBYVDRJAXWSXQL-AVGNSLFASA-N 0.000 description 4
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 4
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 4
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 4
- WXONSNSSBYQGNN-AVGNSLFASA-N Glu-Ser-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WXONSNSSBYQGNN-AVGNSLFASA-N 0.000 description 4
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 4
- GWKBAXRZPLSWJS-QEJZJMRPSA-N Glu-Trp-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GWKBAXRZPLSWJS-QEJZJMRPSA-N 0.000 description 4
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 4
- UXSATKFPUVZVDK-KKUMJFAQSA-N His-Lys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N UXSATKFPUVZVDK-KKUMJFAQSA-N 0.000 description 4
- ZHMZWSFQRUGLEC-JYJNAYRXSA-N His-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZHMZWSFQRUGLEC-JYJNAYRXSA-N 0.000 description 4
- 101000852483 Homo sapiens Interleukin-1 receptor-associated kinase 1 Proteins 0.000 description 4
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 4
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 4
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 4
- 108060003951 Immunoglobulin Proteins 0.000 description 4
- 102000015696 Interleukins Human genes 0.000 description 4
- 108010063738 Interleukins Proteins 0.000 description 4
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 4
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 4
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 4
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 4
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 4
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 4
- YSKSXVKQLLBVEX-SZMVWBNQSA-N Leu-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 YSKSXVKQLLBVEX-SZMVWBNQSA-N 0.000 description 4
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 4
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 4
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 4
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 4
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 4
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 4
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 4
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 4
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 4
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 4
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 4
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 4
- 102000010168 Myeloid Differentiation Factor 88 Human genes 0.000 description 4
- 108010077432 Myeloid Differentiation Factor 88 Proteins 0.000 description 4
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 4
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 4
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 4
- 206010028980 Neoplasm Diseases 0.000 description 4
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 4
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 4
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 4
- 108091000080 Phosphotransferase Proteins 0.000 description 4
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 4
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 4
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 4
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 4
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 4
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 4
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 4
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 4
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 4
- XVWDJUROVRQKAE-KKUMJFAQSA-N Ser-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=CC=C1 XVWDJUROVRQKAE-KKUMJFAQSA-N 0.000 description 4
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 4
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 4
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 4
- 108091023040 Transcription factor Proteins 0.000 description 4
- 102000040945 Transcription factor Human genes 0.000 description 4
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 4
- GZUIDWDVMWZSMI-KKUMJFAQSA-N Tyr-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GZUIDWDVMWZSMI-KKUMJFAQSA-N 0.000 description 4
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 4
- 210000001015 abdomen Anatomy 0.000 description 4
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 4
- 108010062796 arginyllysine Proteins 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 108010047857 aspartylglycine Proteins 0.000 description 4
- 230000033228 biological regulation Effects 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 4
- 230000009137 competitive binding Effects 0.000 description 4
- 230000002860 competitive effect Effects 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 150000002148 esters Chemical class 0.000 description 4
- 238000013467 fragmentation Methods 0.000 description 4
- 238000006062 fragmentation reaction Methods 0.000 description 4
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 4
- 108010089804 glycyl-threonine Proteins 0.000 description 4
- 108010010147 glycylglutamine Proteins 0.000 description 4
- 108010015792 glycyllysine Proteins 0.000 description 4
- 108010084389 glycyltryptophan Proteins 0.000 description 4
- 230000012010 growth Effects 0.000 description 4
- 210000003630 histaminocyte Anatomy 0.000 description 4
- 102000018358 immunoglobulin Human genes 0.000 description 4
- 108010078274 isoleucylvaline Proteins 0.000 description 4
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 4
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 4
- 210000004072 lung Anatomy 0.000 description 4
- 210000004698 lymphocyte Anatomy 0.000 description 4
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 4
- 108010038320 lysylphenylalanine Proteins 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 239000011159 matrix material Substances 0.000 description 4
- 210000001672 ovary Anatomy 0.000 description 4
- 239000002245 particle Substances 0.000 description 4
- 108010084572 phenylalanyl-valine Proteins 0.000 description 4
- 108010012581 phenylalanylglutamate Proteins 0.000 description 4
- 102000020233 phosphotransferase Human genes 0.000 description 4
- 108010079317 prolyl-tyrosine Proteins 0.000 description 4
- 108010090894 prolylleucine Proteins 0.000 description 4
- 238000001742 protein purification Methods 0.000 description 4
- 238000000746 purification Methods 0.000 description 4
- 238000000163 radioactive labelling Methods 0.000 description 4
- 230000028327 secretion Effects 0.000 description 4
- 238000000926 separation method Methods 0.000 description 4
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 4
- 108010061238 threonyl-glycine Proteins 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 3
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 3
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 3
- ATAKEVCGTRZKLI-UWJYBYFXSA-N Ala-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 ATAKEVCGTRZKLI-UWJYBYFXSA-N 0.000 description 3
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 3
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 3
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 3
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 3
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 3
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 3
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 3
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 3
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 3
- IGFJVXOATGZTHD-UHFFFAOYSA-N Arg-Phe-His Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccccc1)C(=O)NC(Cc2c[nH]cn2)C(=O)O IGFJVXOATGZTHD-UHFFFAOYSA-N 0.000 description 3
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 3
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 3
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 3
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 3
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 3
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 3
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 3
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 description 3
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 3
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 3
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 3
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 3
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 3
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 3
- UBGGJTMETLEXJD-DCAQKATOSA-N Asn-Leu-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O UBGGJTMETLEXJD-DCAQKATOSA-N 0.000 description 3
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 3
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 3
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 3
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 3
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 3
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 3
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 3
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 3
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 3
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 3
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 3
- SCQIQCWLOMOEFP-DCAQKATOSA-N Asp-Leu-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SCQIQCWLOMOEFP-DCAQKATOSA-N 0.000 description 3
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 3
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 3
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 3
- 241000020089 Atacta Species 0.000 description 3
- 108090001008 Avidin Proteins 0.000 description 3
- ZWNFOZNJYNDNGM-UBHSHLNASA-N Cys-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N ZWNFOZNJYNDNGM-UBHSHLNASA-N 0.000 description 3
- KABHAOSDMIYXTR-GUBZILKMSA-N Cys-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N KABHAOSDMIYXTR-GUBZILKMSA-N 0.000 description 3
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 3
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 3
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 3
- 238000002965 ELISA Methods 0.000 description 3
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 3
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 3
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 3
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 3
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 3
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 3
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 3
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 3
- PMSDOVISAARGAV-FHWLQOOXSA-N Glu-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 PMSDOVISAARGAV-FHWLQOOXSA-N 0.000 description 3
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 3
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 3
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 3
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 3
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 3
- QZAFGJNKLMNDEM-DCAQKATOSA-N His-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 QZAFGJNKLMNDEM-DCAQKATOSA-N 0.000 description 3
- CTJHHEQNUNIYNN-SRVKXCTJSA-N His-His-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O CTJHHEQNUNIYNN-SRVKXCTJSA-N 0.000 description 3
- JENKOCSDMSVWPY-SRVKXCTJSA-N His-Leu-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JENKOCSDMSVWPY-SRVKXCTJSA-N 0.000 description 3
- CWSZWFILCNSNEX-CIUDSAMLSA-N His-Ser-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CWSZWFILCNSNEX-CIUDSAMLSA-N 0.000 description 3
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 3
- HUWYGQOISIJNMK-SIGLWIIPSA-N Ile-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HUWYGQOISIJNMK-SIGLWIIPSA-N 0.000 description 3
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 3
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 3
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 3
- FFJQAEYLAQMGDL-MGHWNKPDSA-N Ile-Lys-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FFJQAEYLAQMGDL-MGHWNKPDSA-N 0.000 description 3
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 3
- XHBYEMIUENPZLY-GMOBBJLQSA-N Ile-Pro-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O XHBYEMIUENPZLY-GMOBBJLQSA-N 0.000 description 3
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 3
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 3
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 3
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 3
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 3
- 206010061218 Inflammation Diseases 0.000 description 3
- 108010065920 Insulin Lispro Proteins 0.000 description 3
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 3
- LEVWYRKDKASIDU-IMJSIDKUSA-N L-cystine Chemical compound [O-]C(=O)[C@@H]([NH3+])CSSC[C@H]([NH3+])C([O-])=O LEVWYRKDKASIDU-IMJSIDKUSA-N 0.000 description 3
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 3
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 3
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 3
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 3
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 description 3
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 3
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 3
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 3
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 3
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 3
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 3
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 3
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 3
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 3
- XQXGNBFMAXWIGI-MXAVVETBSA-N Leu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 XQXGNBFMAXWIGI-MXAVVETBSA-N 0.000 description 3
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 3
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 3
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 3
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 3
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 3
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 3
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 3
- KTOIECMYZZGVSI-BZSNNMDCSA-N Leu-Phe-His Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KTOIECMYZZGVSI-BZSNNMDCSA-N 0.000 description 3
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 3
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 3
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 3
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 3
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 3
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 3
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 3
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 3
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 3
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 3
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 3
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 3
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 3
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 3
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 3
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 3
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 3
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 3
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 3
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 3
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 3
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 3
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 3
- 101150053046 MYD88 gene Proteins 0.000 description 3
- UKUMISIRZAVYOG-CIUDSAMLSA-N Met-Glu-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O UKUMISIRZAVYOG-CIUDSAMLSA-N 0.000 description 3
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 3
- 108010057466 NF-kappa B Proteins 0.000 description 3
- 102000003945 NF-kappa B Human genes 0.000 description 3
- 244000061176 Nicotiana tabacum Species 0.000 description 3
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 3
- PMKIMKUGCSVFSV-CQDKDKBSSA-N Phe-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PMKIMKUGCSVFSV-CQDKDKBSSA-N 0.000 description 3
- MIICYIIBVYQNKE-QEWYBTABSA-N Phe-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N MIICYIIBVYQNKE-QEWYBTABSA-N 0.000 description 3
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 3
- ROOQMPCUFLDOSB-FHWLQOOXSA-N Phe-Phe-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ROOQMPCUFLDOSB-FHWLQOOXSA-N 0.000 description 3
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 3
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 3
- ILGCZYGFYQLSDZ-KKUMJFAQSA-N Phe-Ser-His Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ILGCZYGFYQLSDZ-KKUMJFAQSA-N 0.000 description 3
- BPIFSOUEUYDJRM-DCPHZVHLSA-N Phe-Trp-Ala Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C)C(O)=O)C1=CC=CC=C1 BPIFSOUEUYDJRM-DCPHZVHLSA-N 0.000 description 3
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 description 3
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 3
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 3
- VOHFZDSRPZLXLH-IHRRRGAJSA-N Pro-Asn-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VOHFZDSRPZLXLH-IHRRRGAJSA-N 0.000 description 3
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 3
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 3
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 3
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 3
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 3
- 108010076504 Protein Sorting Signals Proteins 0.000 description 3
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 3
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 3
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 3
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 3
- COAHUSQNSVFYBW-FXQIFTODSA-N Ser-Asn-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O COAHUSQNSVFYBW-FXQIFTODSA-N 0.000 description 3
- ZHYMUFQVKGJNRM-ZLUOBGJFSA-N Ser-Cys-Asn Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O ZHYMUFQVKGJNRM-ZLUOBGJFSA-N 0.000 description 3
- RJHJPZQOMKCSTP-CIUDSAMLSA-N Ser-His-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O RJHJPZQOMKCSTP-CIUDSAMLSA-N 0.000 description 3
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 3
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 3
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 3
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 3
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 3
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 3
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 3
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 3
- FZEUTKVQGMVGHW-AVGNSLFASA-N Ser-Phe-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZEUTKVQGMVGHW-AVGNSLFASA-N 0.000 description 3
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 3
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 3
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 3
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 3
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 3
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 3
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 3
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 3
- 210000001744 T-lymphocyte Anatomy 0.000 description 3
- DHPPWTOLRWYIDS-XKBZYTNZSA-N Thr-Cys-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O DHPPWTOLRWYIDS-XKBZYTNZSA-N 0.000 description 3
- UOXPLPBMEPLZBW-WDSOQIARSA-N Trp-Val-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 UOXPLPBMEPLZBW-WDSOQIARSA-N 0.000 description 3
- WVGKPKDWYQXWLU-BZSNNMDCSA-N Tyr-His-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WVGKPKDWYQXWLU-BZSNNMDCSA-N 0.000 description 3
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 3
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 3
- 108010064997 VPY tripeptide Proteins 0.000 description 3
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 3
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 3
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 3
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 3
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 3
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 3
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 3
- 230000009471 action Effects 0.000 description 3
- 238000001042 affinity chromatography Methods 0.000 description 3
- 230000002776 aggregation Effects 0.000 description 3
- 238000004220 aggregation Methods 0.000 description 3
- 108010044940 alanylglutamine Proteins 0.000 description 3
- 108010047495 alanylglycine Proteins 0.000 description 3
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 3
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 3
- 210000003719 b-lymphocyte Anatomy 0.000 description 3
- 239000008280 blood Substances 0.000 description 3
- 210000004204 blood vessel Anatomy 0.000 description 3
- 210000004899 c-terminal region Anatomy 0.000 description 3
- 201000011510 cancer Diseases 0.000 description 3
- 238000003745 diagnosis Methods 0.000 description 3
- 238000009826 distribution Methods 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 210000002257 embryonic structure Anatomy 0.000 description 3
- 230000004034 genetic regulation Effects 0.000 description 3
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 3
- 108010020688 glycylhistidine Proteins 0.000 description 3
- 108010081551 glycylphenylalanine Proteins 0.000 description 3
- 210000004408 hybridoma Anatomy 0.000 description 3
- 208000026278 immune system disease Diseases 0.000 description 3
- 238000007901 in situ hybridization Methods 0.000 description 3
- 230000004054 inflammatory process Effects 0.000 description 3
- 239000003112 inhibitor Substances 0.000 description 3
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 3
- 108010091871 leucylmethionine Proteins 0.000 description 3
- 210000004185 liver Anatomy 0.000 description 3
- 239000006166 lysate Substances 0.000 description 3
- 108010010679 lysyl-valyl-leucyl-aspartic acid Proteins 0.000 description 3
- 108010017391 lysylvaline Proteins 0.000 description 3
- 239000012528 membrane Substances 0.000 description 3
- 108010068488 methionylphenylalanine Proteins 0.000 description 3
- 108010034507 methionyltryptophan Proteins 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 230000007935 neutral effect Effects 0.000 description 3
- 230000003472 neutralizing effect Effects 0.000 description 3
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 3
- 230000026731 phosphorylation Effects 0.000 description 3
- 238000006366 phosphorylation reaction Methods 0.000 description 3
- 229920000642 polymer Polymers 0.000 description 3
- 102000040430 polynucleotide Human genes 0.000 description 3
- 108091033319 polynucleotide Proteins 0.000 description 3
- 239000002157 polynucleotide Substances 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 3
- PXFBZOLANLWPMH-UHFFFAOYSA-N 16-Epiaffinine Natural products C1C(C2=CC=CC=C2N2)=C2C(=O)CC2C(=CC)CN(C)C1C2CO PXFBZOLANLWPMH-UHFFFAOYSA-N 0.000 description 2
- PPINMSZPTPRQQB-NHCYSSNCSA-N 2-[[(2s)-1-[(2s)-2-[[(2s)-2-amino-3-methylbutanoyl]amino]propanoyl]pyrrolidine-2-carbonyl]amino]acetic acid Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PPINMSZPTPRQQB-NHCYSSNCSA-N 0.000 description 2
- 108010013043 Acetylesterase Proteins 0.000 description 2
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 2
- UHMQKOBNPRAZGB-CIUDSAMLSA-N Ala-Glu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N UHMQKOBNPRAZGB-CIUDSAMLSA-N 0.000 description 2
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 2
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 2
- NJWJSLCQEDMGNC-MBLNEYKQSA-N Ala-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N)O NJWJSLCQEDMGNC-MBLNEYKQSA-N 0.000 description 2
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 2
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 2
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 2
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- 206010002198 Anaphylactic reaction Diseases 0.000 description 2
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 2
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 2
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 2
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 2
- TTXYKSADPSNOIF-IHRRRGAJSA-N Arg-Asp-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O TTXYKSADPSNOIF-IHRRRGAJSA-N 0.000 description 2
- ASQYTJJWAMDISW-BPUTZDHNSA-N Arg-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N ASQYTJJWAMDISW-BPUTZDHNSA-N 0.000 description 2
- OANWAFQRNQEDSY-DCAQKATOSA-N Arg-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N OANWAFQRNQEDSY-DCAQKATOSA-N 0.000 description 2
- IGULQRCJLQQPSM-DCAQKATOSA-N Arg-Cys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IGULQRCJLQQPSM-DCAQKATOSA-N 0.000 description 2
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 2
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 2
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 2
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 2
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 2
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 2
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 2
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 2
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 2
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 2
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 2
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 2
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 2
- OGZBJJLRKQZRHL-KJEVXHAQSA-N Arg-Thr-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OGZBJJLRKQZRHL-KJEVXHAQSA-N 0.000 description 2
- NZQFXJKVNUZYAG-BPUTZDHNSA-N Arg-Trp-Cys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CS)C(O)=O)=CNC2=C1 NZQFXJKVNUZYAG-BPUTZDHNSA-N 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 2
- HAJWYALLJIATCX-FXQIFTODSA-N Asn-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N HAJWYALLJIATCX-FXQIFTODSA-N 0.000 description 2
- DXZNJWFECGJCQR-FXQIFTODSA-N Asn-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N DXZNJWFECGJCQR-FXQIFTODSA-N 0.000 description 2
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 2
- ZMWDUIIACVLIHK-GHCJXIJMSA-N Asn-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N ZMWDUIIACVLIHK-GHCJXIJMSA-N 0.000 description 2
- NKTLGLBAGUJEGA-BIIVOSGPSA-N Asn-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N)C(=O)O NKTLGLBAGUJEGA-BIIVOSGPSA-N 0.000 description 2
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 2
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 2
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 2
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 2
- IKLAUGBIDCDFOY-SRVKXCTJSA-N Asn-His-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IKLAUGBIDCDFOY-SRVKXCTJSA-N 0.000 description 2
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 description 2
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 2
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 2
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 2
- NUCUBYIUPVYGPP-XIRDDKMYSA-N Asn-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CC(N)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O NUCUBYIUPVYGPP-XIRDDKMYSA-N 0.000 description 2
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 2
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 2
- KEUNWIXNKVWCFL-FXQIFTODSA-N Asn-Met-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O KEUNWIXNKVWCFL-FXQIFTODSA-N 0.000 description 2
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 2
- BSBNNPICFPXDNH-SRVKXCTJSA-N Asn-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N BSBNNPICFPXDNH-SRVKXCTJSA-N 0.000 description 2
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 2
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 2
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 2
- RBOBTTLFPRSXKZ-BZSNNMDCSA-N Asn-Phe-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RBOBTTLFPRSXKZ-BZSNNMDCSA-N 0.000 description 2
- OSZBYGVKAFZWKC-FXQIFTODSA-N Asn-Pro-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(O)=O OSZBYGVKAFZWKC-FXQIFTODSA-N 0.000 description 2
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 2
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 2
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 2
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 2
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 2
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 2
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 2
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 2
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 2
- VBVKSAFJPVXMFJ-CIUDSAMLSA-N Asp-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N VBVKSAFJPVXMFJ-CIUDSAMLSA-N 0.000 description 2
- RYEWQKQXRJCHIO-SRVKXCTJSA-N Asp-Asn-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RYEWQKQXRJCHIO-SRVKXCTJSA-N 0.000 description 2
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 2
- TVIZQBFURPLQDV-DJFWLOJKSA-N Asp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N TVIZQBFURPLQDV-DJFWLOJKSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 2
- YRZIYQGXTSBRLT-AVGNSLFASA-N Asp-Phe-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YRZIYQGXTSBRLT-AVGNSLFASA-N 0.000 description 2
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 2
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 2
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 2
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 2
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 2
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 2
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 2
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 2
- 208000011691 Burkitt lymphomas Diseases 0.000 description 2
- 241000219357 Cactaceae Species 0.000 description 2
- 108010078791 Carrier Proteins Proteins 0.000 description 2
- CVOZXIPULQQFNY-ZLUOBGJFSA-N Cys-Ala-Cys Chemical compound C[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O CVOZXIPULQQFNY-ZLUOBGJFSA-N 0.000 description 2
- OJQJUQUBJGTCRY-WFBYXXMGSA-N Cys-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CS)N OJQJUQUBJGTCRY-WFBYXXMGSA-N 0.000 description 2
- UPJGYXRAPJWIHD-CIUDSAMLSA-N Cys-Asn-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UPJGYXRAPJWIHD-CIUDSAMLSA-N 0.000 description 2
- UUOYKFNULIOCGJ-GUBZILKMSA-N Cys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N UUOYKFNULIOCGJ-GUBZILKMSA-N 0.000 description 2
- HBHMVBGGHDMPBF-GARJFASQSA-N Cys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N HBHMVBGGHDMPBF-GARJFASQSA-N 0.000 description 2
- XZKJEOMFLDVXJG-KATARQTJSA-N Cys-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)N)O XZKJEOMFLDVXJG-KATARQTJSA-N 0.000 description 2
- JXVFJOMFOLFPMP-KKUMJFAQSA-N Cys-Leu-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JXVFJOMFOLFPMP-KKUMJFAQSA-N 0.000 description 2
- BNCKELUXXUYRNY-GUBZILKMSA-N Cys-Lys-Glu Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BNCKELUXXUYRNY-GUBZILKMSA-N 0.000 description 2
- DQGIAOGALAQBGK-BWBBJGPYSA-N Cys-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O DQGIAOGALAQBGK-BWBBJGPYSA-N 0.000 description 2
- LLUXQOVDMQZMPJ-KKUMJFAQSA-N Cys-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CS)CC1=CC=C(O)C=C1 LLUXQOVDMQZMPJ-KKUMJFAQSA-N 0.000 description 2
- 102000004127 Cytokines Human genes 0.000 description 2
- 108090000695 Cytokines Proteins 0.000 description 2
- 108010090461 DFG peptide Proteins 0.000 description 2
- 238000007399 DNA isolation Methods 0.000 description 2
- 230000004543 DNA replication Effects 0.000 description 2
- 102000010170 Death domains Human genes 0.000 description 2
- 108050001718 Death domains Proteins 0.000 description 2
- 238000009007 Diagnostic Kit Methods 0.000 description 2
- 241000224495 Dictyostelium Species 0.000 description 2
- QOSSAOTZNIDXMA-UHFFFAOYSA-N Dicylcohexylcarbodiimide Chemical compound C1CCCCC1N=C=NC1CCCCC1 QOSSAOTZNIDXMA-UHFFFAOYSA-N 0.000 description 2
- 108010082495 Dietary Plant Proteins Proteins 0.000 description 2
- 108700034637 EC 3.2.-.- Proteins 0.000 description 2
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 2
- RRYLMJWPWBJFPZ-ACZMJKKPSA-N Gln-Asn-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RRYLMJWPWBJFPZ-ACZMJKKPSA-N 0.000 description 2
- RMOCFPBLHAOTDU-ACZMJKKPSA-N Gln-Asn-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RMOCFPBLHAOTDU-ACZMJKKPSA-N 0.000 description 2
- CXFUMJQFZVCETK-FXQIFTODSA-N Gln-Cys-Gln Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O CXFUMJQFZVCETK-FXQIFTODSA-N 0.000 description 2
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 2
- KQOPMGBHNQBCEL-HVTMNAMFSA-N Gln-His-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KQOPMGBHNQBCEL-HVTMNAMFSA-N 0.000 description 2
- LKVCNGLNTAPMSZ-JYJNAYRXSA-N Gln-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N LKVCNGLNTAPMSZ-JYJNAYRXSA-N 0.000 description 2
- TWTWUBHEWQPMQW-ZPFDUUQYSA-N Gln-Ile-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWTWUBHEWQPMQW-ZPFDUUQYSA-N 0.000 description 2
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 2
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 2
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 2
- XZUUUKNKNWVPHQ-JYJNAYRXSA-N Gln-Phe-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O XZUUUKNKNWVPHQ-JYJNAYRXSA-N 0.000 description 2
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 2
- UTOQQOMEJDPDMX-ACZMJKKPSA-N Gln-Ser-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O UTOQQOMEJDPDMX-ACZMJKKPSA-N 0.000 description 2
- KPNWAJMEMRCLAL-GUBZILKMSA-N Gln-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KPNWAJMEMRCLAL-GUBZILKMSA-N 0.000 description 2
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 2
- RONJIBWTGKVKFY-HTUGSXCWSA-N Gln-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O RONJIBWTGKVKFY-HTUGSXCWSA-N 0.000 description 2
- YJCZUTXLPXBNIO-BHYGNILZSA-N Gln-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)N)N)C(=O)O YJCZUTXLPXBNIO-BHYGNILZSA-N 0.000 description 2
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 2
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 description 2
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 2
- LJLPOZGRPLORTF-CIUDSAMLSA-N Glu-Asn-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LJLPOZGRPLORTF-CIUDSAMLSA-N 0.000 description 2
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 description 2
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 2
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 2
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 2
- LZMQSTPFYJLVJB-GUBZILKMSA-N Glu-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LZMQSTPFYJLVJB-GUBZILKMSA-N 0.000 description 2
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 2
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 2
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 2
- BPCLDCNZBUYGOD-BPUTZDHNSA-N Glu-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 BPCLDCNZBUYGOD-BPUTZDHNSA-N 0.000 description 2
- OLTHVCNYJAALPL-BHYGNILZSA-N Glu-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)O)N)C(=O)O OLTHVCNYJAALPL-BHYGNILZSA-N 0.000 description 2
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 2
- 102000005720 Glutathione transferase Human genes 0.000 description 2
- 108010070675 Glutathione transferase Proteins 0.000 description 2
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 2
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 2
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 2
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 2
- DJTXYXZNNDDEOU-WHFBIAKZSA-N Gly-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)C(=O)N DJTXYXZNNDDEOU-WHFBIAKZSA-N 0.000 description 2
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 2
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 2
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 2
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 2
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 2
- IFHJOBKVXBESRE-YUMQZZPRSA-N Gly-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN IFHJOBKVXBESRE-YUMQZZPRSA-N 0.000 description 2
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 2
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 2
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 2
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 2
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 2
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 2
- JWTKVPMQCCRPQY-SRVKXCTJSA-N His-Asn-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JWTKVPMQCCRPQY-SRVKXCTJSA-N 0.000 description 2
- XJQDHFMUUBRCGA-KKUMJFAQSA-N His-Asn-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XJQDHFMUUBRCGA-KKUMJFAQSA-N 0.000 description 2
- HRGGKHFHRSFSDE-CIUDSAMLSA-N His-Asn-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N HRGGKHFHRSFSDE-CIUDSAMLSA-N 0.000 description 2
- ZZLWLWSUIBSMNP-CIUDSAMLSA-N His-Asp-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZZLWLWSUIBSMNP-CIUDSAMLSA-N 0.000 description 2
- IMCHNUANCIGUKS-SRVKXCTJSA-N His-Glu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IMCHNUANCIGUKS-SRVKXCTJSA-N 0.000 description 2
- PMWSGVRIMIFXQH-KKUMJFAQSA-N His-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 PMWSGVRIMIFXQH-KKUMJFAQSA-N 0.000 description 2
- OZBDSFBWIDPVDA-BZSNNMDCSA-N His-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CN=CN3)N OZBDSFBWIDPVDA-BZSNNMDCSA-N 0.000 description 2
- UROVZOUMHNXPLZ-AVGNSLFASA-N His-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 UROVZOUMHNXPLZ-AVGNSLFASA-N 0.000 description 2
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 2
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 2
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 2
- RXKFKJVJVHLRIE-XIRDDKMYSA-N His-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC3=CN=CN3)N RXKFKJVJVHLRIE-XIRDDKMYSA-N 0.000 description 2
- NTYJJOPFIAHURM-UHFFFAOYSA-N Histamine Chemical compound NCCC1=CN=CN1 NTYJJOPFIAHURM-UHFFFAOYSA-N 0.000 description 2
- 108010021699 I-kappa B Proteins Proteins 0.000 description 2
- 102000008379 I-kappa B Proteins Human genes 0.000 description 2
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 2
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 2
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 2
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 2
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 2
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 2
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 2
- KIAOPHMUNPPGEN-PEXQALLHSA-N Ile-Gly-His Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KIAOPHMUNPPGEN-PEXQALLHSA-N 0.000 description 2
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 2
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 2
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 2
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 2
- BKPPWVSPSIUXHZ-OSUNSFLBSA-N Ile-Met-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N BKPPWVSPSIUXHZ-OSUNSFLBSA-N 0.000 description 2
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 2
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 2
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 description 2
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 2
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 2
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 2
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 2
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 2
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 2
- JJQQGCMKLOEGAV-OSUNSFLBSA-N Ile-Thr-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)O)N JJQQGCMKLOEGAV-OSUNSFLBSA-N 0.000 description 2
- 102100025458 Inosine triphosphate pyrophosphatase Human genes 0.000 description 2
- 108090000723 Insulin-Like Growth Factor I Proteins 0.000 description 2
- 102100020881 Interleukin-1 alpha Human genes 0.000 description 2
- 102000003777 Interleukin-1 beta Human genes 0.000 description 2
- 108090000193 Interleukin-1 beta Proteins 0.000 description 2
- 101710199015 Interleukin-1 receptor-associated kinase 1 Proteins 0.000 description 2
- 102100039898 Interleukin-18 Human genes 0.000 description 2
- 108010082786 Interleukin-1alpha Proteins 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 2
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 2
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 2
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 2
- 125000002842 L-seryl group Chemical group O=C([*])[C@](N([H])[H])([H])C([H])([H])O[H] 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 2
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 2
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 2
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 2
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 2
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 2
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 2
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 2
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 2
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 2
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 2
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 2
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 2
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 2
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 2
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 2
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 2
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 2
- ZDSNOSQHMJBRQN-SRVKXCTJSA-N Leu-Asp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZDSNOSQHMJBRQN-SRVKXCTJSA-N 0.000 description 2
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 2
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 2
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 2
- BOFAFKVZQUMTID-AVGNSLFASA-N Leu-Gln-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BOFAFKVZQUMTID-AVGNSLFASA-N 0.000 description 2
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 2
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 2
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 2
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 2
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 2
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 2
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 2
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 2
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 2
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 2
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 2
- VVQJGYPTIYOFBR-IHRRRGAJSA-N Leu-Lys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N VVQJGYPTIYOFBR-IHRRRGAJSA-N 0.000 description 2
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 2
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 2
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 2
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 2
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 2
- JLYUZRKPDKHUTC-WDSOQIARSA-N Leu-Pro-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JLYUZRKPDKHUTC-WDSOQIARSA-N 0.000 description 2
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 2
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 2
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 2
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 2
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 2
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 2
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 2
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 2
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 description 2
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 2
- 240000006240 Linum usitatissimum Species 0.000 description 2
- 108010074338 Lymphokines Proteins 0.000 description 2
- 102000008072 Lymphokines Human genes 0.000 description 2
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 2
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 2
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 2
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 2
- SSYOBDBNBQBSQE-SRVKXCTJSA-N Lys-Cys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O SSYOBDBNBQBSQE-SRVKXCTJSA-N 0.000 description 2
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 2
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 2
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 2
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 2
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 2
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 2
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 2
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 2
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 2
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 2
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 2
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 2
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 2
- NKDSBBBPGIVWEI-RCWTZXSCSA-N Met-Arg-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NKDSBBBPGIVWEI-RCWTZXSCSA-N 0.000 description 2
- UAPZLLPGGOOCRO-IHRRRGAJSA-N Met-Asn-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N UAPZLLPGGOOCRO-IHRRRGAJSA-N 0.000 description 2
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 2
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 2
- XDGFFEZAZHRZFR-RHYQMDGZSA-N Met-Leu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDGFFEZAZHRZFR-RHYQMDGZSA-N 0.000 description 2
- CAEZLMGDJMEBKP-AVGNSLFASA-N Met-Pro-His Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC=N1 CAEZLMGDJMEBKP-AVGNSLFASA-N 0.000 description 2
- YLDSJJOGQNEQJK-AVGNSLFASA-N Met-Pro-Leu Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YLDSJJOGQNEQJK-AVGNSLFASA-N 0.000 description 2
- XIGAHPDZLAYQOS-SRVKXCTJSA-N Met-Pro-Pro Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 XIGAHPDZLAYQOS-SRVKXCTJSA-N 0.000 description 2
- LHXFNWBNRBWMNV-DCAQKATOSA-N Met-Ser-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LHXFNWBNRBWMNV-DCAQKATOSA-N 0.000 description 2
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 2
- XUYPXLNMDZIRQH-LURJTMIESA-N N-acetyl-L-methionine Chemical compound CSCC[C@@H](C(O)=O)NC(C)=O XUYPXLNMDZIRQH-LURJTMIESA-N 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- 108010075285 Nucleoside-Triphosphatase Proteins 0.000 description 2
- WSXKXSBOJXEZDV-DLOVCJGASA-N Phe-Ala-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@H](C)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 WSXKXSBOJXEZDV-DLOVCJGASA-N 0.000 description 2
- YMORXCKTSSGYIG-IHRRRGAJSA-N Phe-Arg-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N YMORXCKTSSGYIG-IHRRRGAJSA-N 0.000 description 2
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 2
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 2
- LXVFHIBXOWJTKZ-BZSNNMDCSA-N Phe-Asn-Tyr Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O LXVFHIBXOWJTKZ-BZSNNMDCSA-N 0.000 description 2
- MQVFHOPCKNTHGT-MELADBBJSA-N Phe-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O MQVFHOPCKNTHGT-MELADBBJSA-N 0.000 description 2
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 2
- OYQBFWWQSVIHBN-FHWLQOOXSA-N Phe-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OYQBFWWQSVIHBN-FHWLQOOXSA-N 0.000 description 2
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 2
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 2
- VADLTGVIOIOKGM-BZSNNMDCSA-N Phe-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 VADLTGVIOIOKGM-BZSNNMDCSA-N 0.000 description 2
- PBXYXOAEQQUVMM-ULQDDVLXSA-N Phe-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PBXYXOAEQQUVMM-ULQDDVLXSA-N 0.000 description 2
- FINLZXKJWTYYLC-ACRUOGEOSA-N Phe-His-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FINLZXKJWTYYLC-ACRUOGEOSA-N 0.000 description 2
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 2
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 2
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 2
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 2
- TXKWKTWYTIAZSV-KKUMJFAQSA-N Phe-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N TXKWKTWYTIAZSV-KKUMJFAQSA-N 0.000 description 2
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 2
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 2
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 2
- OAOLATANIHTNCZ-IHRRRGAJSA-N Phe-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N OAOLATANIHTNCZ-IHRRRGAJSA-N 0.000 description 2
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 2
- CZQZSMJXFGGBHM-KKUMJFAQSA-N Phe-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O CZQZSMJXFGGBHM-KKUMJFAQSA-N 0.000 description 2
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 2
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 2
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 2
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 2
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 2
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 2
- UMIHVJQSXFWWMW-JBACZVJFSA-N Phe-Trp-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UMIHVJQSXFWWMW-JBACZVJFSA-N 0.000 description 2
- FRMKIPSIZSFTTE-HJOGWXRNSA-N Phe-Tyr-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FRMKIPSIZSFTTE-HJOGWXRNSA-N 0.000 description 2
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 2
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 2
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 2
- XBCOOBCTVMMQSC-BVSLBCMMSA-N Phe-Val-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 XBCOOBCTVMMQSC-BVSLBCMMSA-N 0.000 description 2
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 2
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 2
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 2
- QXNSKJLSLYCTMT-FXQIFTODSA-N Pro-Cys-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O QXNSKJLSLYCTMT-FXQIFTODSA-N 0.000 description 2
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 2
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 2
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 2
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 2
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 2
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 2
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 2
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 2
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 2
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 2
- INDVYIOKMXFQFM-SRVKXCTJSA-N Pro-Lys-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O INDVYIOKMXFQFM-SRVKXCTJSA-N 0.000 description 2
- NTXFLJULRHQMDC-GUBZILKMSA-N Pro-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 NTXFLJULRHQMDC-GUBZILKMSA-N 0.000 description 2
- RPLMFKUKFZOTER-AVGNSLFASA-N Pro-Met-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 RPLMFKUKFZOTER-AVGNSLFASA-N 0.000 description 2
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 2
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 2
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 2
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 2
- PKHDJFHFMGQMPS-RCWTZXSCSA-N Pro-Thr-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKHDJFHFMGQMPS-RCWTZXSCSA-N 0.000 description 2
- 102000001253 Protein Kinase Human genes 0.000 description 2
- 108010025216 RVF peptide Proteins 0.000 description 2
- 108091035242 Sequence-tagged site Proteins 0.000 description 2
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 2
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 2
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 2
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 2
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 2
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 2
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 2
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 2
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 2
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 2
- GWMXFEMMBHOKDX-AVGNSLFASA-N Ser-Gln-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GWMXFEMMBHOKDX-AVGNSLFASA-N 0.000 description 2
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 2
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 2
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 2
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 2
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 2
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 2
- WEQAYODCJHZSJZ-KKUMJFAQSA-N Ser-His-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 WEQAYODCJHZSJZ-KKUMJFAQSA-N 0.000 description 2
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 2
- CJINPXGSKSZQNE-KBIXCLLPSA-N Ser-Ile-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O CJINPXGSKSZQNE-KBIXCLLPSA-N 0.000 description 2
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 2
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 2
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 2
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 2
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 2
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 2
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 2
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 2
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 2
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 2
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 2
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 2
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 2
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 2
- 102000013275 Somatomedins Human genes 0.000 description 2
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 2
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 2
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 2
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 2
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 2
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 2
- UTCFSBBXPWKLTG-XKBZYTNZSA-N Thr-Cys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O UTCFSBBXPWKLTG-XKBZYTNZSA-N 0.000 description 2
- ZLNWJMRLHLGKFX-SVSWQMSJSA-N Thr-Cys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZLNWJMRLHLGKFX-SVSWQMSJSA-N 0.000 description 2
- AYCQVUUPIJHJTA-IXOXFDKPSA-N Thr-His-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O AYCQVUUPIJHJTA-IXOXFDKPSA-N 0.000 description 2
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 2
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 2
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 2
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 2
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 2
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 2
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 2
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 2
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 2
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 2
- JMBRNXUOLJFURW-BEAPCOKYSA-N Thr-Phe-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N)O JMBRNXUOLJFURW-BEAPCOKYSA-N 0.000 description 2
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 2
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 2
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 2
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 2
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 2
- GUWJWCHZNGDKBG-UBHSHLNASA-N Trp-Asn-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N GUWJWCHZNGDKBG-UBHSHLNASA-N 0.000 description 2
- UTQBQJNSNXJNIH-IHPCNDPISA-N Trp-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N UTQBQJNSNXJNIH-IHPCNDPISA-N 0.000 description 2
- NBHGNEJMBNQQKZ-UBHSHLNASA-N Trp-Asp-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N NBHGNEJMBNQQKZ-UBHSHLNASA-N 0.000 description 2
- WACMTVIJWRNVSO-CWRNSKLLSA-N Trp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O WACMTVIJWRNVSO-CWRNSKLLSA-N 0.000 description 2
- RERIQEJUYCLJQI-QRTARXTBSA-N Trp-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RERIQEJUYCLJQI-QRTARXTBSA-N 0.000 description 2
- CPZTZWFFGVKHEA-SZMVWBNQSA-N Trp-Gln-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N CPZTZWFFGVKHEA-SZMVWBNQSA-N 0.000 description 2
- XGFGVFMXDXALEV-XIRDDKMYSA-N Trp-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N XGFGVFMXDXALEV-XIRDDKMYSA-N 0.000 description 2
- GWBWCGITOYODER-YTQUADARSA-N Trp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GWBWCGITOYODER-YTQUADARSA-N 0.000 description 2
- JZSLIZLZGWOJBJ-PMVMPFDFSA-N Trp-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N JZSLIZLZGWOJBJ-PMVMPFDFSA-N 0.000 description 2
- XDQGKIMTRSVSBC-WDSOQIARSA-N Trp-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CNC2=CC=CC=C12 XDQGKIMTRSVSBC-WDSOQIARSA-N 0.000 description 2
- STJXERBCEWQLKS-IHPCNDPISA-N Trp-Tyr-Cys Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(=O)N[C@@H](CS)C(O)=O)C1=CC=C(O)C=C1 STJXERBCEWQLKS-IHPCNDPISA-N 0.000 description 2
- MBLJBGZWLHTJBH-SZMVWBNQSA-N Trp-Val-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 MBLJBGZWLHTJBH-SZMVWBNQSA-N 0.000 description 2
- XKTWZYNTLXITCY-QRTARXTBSA-N Trp-Val-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 XKTWZYNTLXITCY-QRTARXTBSA-N 0.000 description 2
- SWSUXOKZKQRADK-FDARSICLSA-N Trp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SWSUXOKZKQRADK-FDARSICLSA-N 0.000 description 2
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 2
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 2
- PZXUIGWOEWWFQM-SRVKXCTJSA-N Tyr-Asn-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O PZXUIGWOEWWFQM-SRVKXCTJSA-N 0.000 description 2
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 2
- UMXSDHPSMROQRB-YJRXYDGGSA-N Tyr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UMXSDHPSMROQRB-YJRXYDGGSA-N 0.000 description 2
- HZZKQZDUIKVFDZ-AVGNSLFASA-N Tyr-Gln-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)O HZZKQZDUIKVFDZ-AVGNSLFASA-N 0.000 description 2
- IMXAAEFAIBRCQF-SIUGBPQLSA-N Tyr-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N IMXAAEFAIBRCQF-SIUGBPQLSA-N 0.000 description 2
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 2
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 2
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 2
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 2
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 2
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 2
- WTTRJMAZPDHPGS-KKXDTOCCSA-N Tyr-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(O)=O WTTRJMAZPDHPGS-KKXDTOCCSA-N 0.000 description 2
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 2
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 2
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 2
- DCOOGDCRFXXQNW-ZKWXMUAHSA-N Val-Asn-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N DCOOGDCRFXXQNW-ZKWXMUAHSA-N 0.000 description 2
- OUUBKKIJQIAPRI-LAEOZQHASA-N Val-Gln-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OUUBKKIJQIAPRI-LAEOZQHASA-N 0.000 description 2
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 2
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 2
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 2
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 2
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 2
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 2
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 2
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 2
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 2
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 2
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 2
- QPPZEDOTPZOSEC-RCWTZXSCSA-N Val-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N)O QPPZEDOTPZOSEC-RCWTZXSCSA-N 0.000 description 2
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 2
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 2
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 2
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 2
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 2
- VBTFUDNTMCHPII-UHFFFAOYSA-N Val-Trp-Tyr Natural products C=1NC2=CC=CC=C2C=1CC(NC(=O)C(N)C(C)C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 VBTFUDNTMCHPII-UHFFFAOYSA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 108010081404 acein-2 Proteins 0.000 description 2
- 230000002378 acidificating effect Effects 0.000 description 2
- 125000002252 acyl group Chemical group 0.000 description 2
- 239000000654 additive Substances 0.000 description 2
- 239000002671 adjuvant Substances 0.000 description 2
- 108010070783 alanyltyrosine Proteins 0.000 description 2
- 125000000217 alkyl group Chemical group 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 230000036783 anaphylactic response Effects 0.000 description 2
- 208000003455 anaphylaxis Diseases 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 125000003118 aryl group Chemical group 0.000 description 2
- 235000003704 aspartic acid Nutrition 0.000 description 2
- 238000003149 assay kit Methods 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 2
- 230000008827 biological function Effects 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 235000020958 biotin Nutrition 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 230000006287 biotinylation Effects 0.000 description 2
- 238000007413 biotinylation Methods 0.000 description 2
- 210000004556 brain Anatomy 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 230000036755 cellular response Effects 0.000 description 2
- 125000003636 chemical group Chemical group 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 230000014107 chromosome localization Effects 0.000 description 2
- 239000013599 cloning vector Substances 0.000 description 2
- 239000002131 composite material Substances 0.000 description 2
- 238000009833 condensation Methods 0.000 description 2
- 230000005494 condensation Effects 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 230000002596 correlated effect Effects 0.000 description 2
- 238000004132 cross linking Methods 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 230000001086 cytosolic effect Effects 0.000 description 2
- 230000002950 deficient Effects 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 239000000539 dimer Substances 0.000 description 2
- 239000012636 effector Substances 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 230000013020 embryo development Effects 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 210000004907 gland Anatomy 0.000 description 2
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 2
- 108010046775 glutamyl-isoleucyl-leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 210000003958 hematopoietic stem cell Anatomy 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 230000028993 immune response Effects 0.000 description 2
- 210000000987 immune system Anatomy 0.000 description 2
- 239000012535 impurity Substances 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 230000008611 intercellular interaction Effects 0.000 description 2
- 230000000968 intestinal effect Effects 0.000 description 2
- 238000007918 intramuscular administration Methods 0.000 description 2
- 210000003734 kidney Anatomy 0.000 description 2
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 2
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 2
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 108010012988 lysyl-glutamyl-aspartyl-glycine Proteins 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- 238000013507 mapping Methods 0.000 description 2
- 230000035800 maturation Effects 0.000 description 2
- 230000010534 mechanism of action Effects 0.000 description 2
- 230000004060 metabolic process Effects 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 210000003205 muscle Anatomy 0.000 description 2
- 238000002703 mutagenesis Methods 0.000 description 2
- 231100000350 mutagenesis Toxicity 0.000 description 2
- 231100000219 mutagenic Toxicity 0.000 description 2
- 230000003505 mutagenic effect Effects 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 239000003960 organic solvent Substances 0.000 description 2
- 210000000496 pancreas Anatomy 0.000 description 2
- 239000011049 pearl Substances 0.000 description 2
- 210000005105 peripheral blood lymphocyte Anatomy 0.000 description 2
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 2
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 150000008300 phosphoramidites Chemical class 0.000 description 2
- 230000035790 physiological processes and functions Effects 0.000 description 2
- 239000004033 plastic Substances 0.000 description 2
- 229920003023 plastic Polymers 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 230000000750 progressive effect Effects 0.000 description 2
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 2
- 108060006633 protein kinase Proteins 0.000 description 2
- 238000004451 qualitative analysis Methods 0.000 description 2
- 238000003127 radioimmunoassay Methods 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 238000012882 sequential analysis Methods 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 230000019491 signal transduction Effects 0.000 description 2
- 210000003491 skin Anatomy 0.000 description 2
- 238000010532 solid phase synthesis reaction Methods 0.000 description 2
- 210000000952 spleen Anatomy 0.000 description 2
- 210000000130 stem cell Anatomy 0.000 description 2
- 230000004936 stimulating effect Effects 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 238000010189 synthetic method Methods 0.000 description 2
- 238000010998 test method Methods 0.000 description 2
- 238000004448 titration Methods 0.000 description 2
- 230000005030 transcription termination Effects 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- 230000001960 triggered effect Effects 0.000 description 2
- 108010044292 tryptophyltyrosine Proteins 0.000 description 2
- 230000007306 turnover Effects 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 241000701447 unidentified baculovirus Species 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 108010072644 valyl-alanyl-prolyl-glycine Proteins 0.000 description 2
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 description 1
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- SBKVPJHMSUXZTA-MEJXFZFPSA-N (2S)-2-[[(2S)-2-[[(2S)-1-[(2S)-5-amino-2-[[2-[[(2S)-1-[(2S)-6-amino-2-[[(2S)-2-[[(2S)-5-amino-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-amino-3-(1H-indol-3-yl)propanoyl]amino]-3-(1H-imidazol-4-yl)propanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-4-methylpentanoyl]amino]-5-oxopentanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]pyrrolidine-2-carbonyl]amino]acetyl]amino]-5-oxopentanoyl]pyrrolidine-2-carbonyl]amino]-4-methylsulfanylbutanoyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@@H](C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CNC=N1 SBKVPJHMSUXZTA-MEJXFZFPSA-N 0.000 description 1
- OFHXPCLWHLXQHT-JKQORVJESA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2,6-diaminohexanoyl]amino]-3-methylbutanoyl]amino]-4-methylpentanoyl]amino]butanedioic acid Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN OFHXPCLWHLXQHT-JKQORVJESA-N 0.000 description 1
- SADYNMDJGAWAEW-JKQORVJESA-N (2s)-2-[[(2s)-3-carboxy-2-[[(2s)-2-[[(2s)-2,6-diaminohexanoyl]amino]-3-methylbutanoyl]amino]propanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN SADYNMDJGAWAEW-JKQORVJESA-N 0.000 description 1
- IFMZMDAHXSSNLT-QAETUUGQSA-N (2s)-2-[[(2s)-4-amino-2-[[(2s)-6-amino-2-[[(2r)-2-amino-3-sulfanylpropanoyl]amino]hexanoyl]amino]-4-oxobutanoyl]amino]-3-phenylpropanoic acid Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IFMZMDAHXSSNLT-QAETUUGQSA-N 0.000 description 1
- VUDQSRFCCHQIIU-UHFFFAOYSA-N 1-(3,5-dichloro-2,6-dihydroxy-4-methoxyphenyl)hexan-1-one Chemical compound CCCCCC(=O)C1=C(O)C(Cl)=C(OC)C(Cl)=C1O VUDQSRFCCHQIIU-UHFFFAOYSA-N 0.000 description 1
- SCPRYBYMKVYVND-UHFFFAOYSA-N 2-[[2-[[1-(2-amino-4-methylpentanoyl)pyrrolidine-2-carbonyl]amino]-4-methylpentanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(N)C(=O)N1CCCC1C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(O)=O SCPRYBYMKVYVND-UHFFFAOYSA-N 0.000 description 1
- KXGFMDJXCMQABM-UHFFFAOYSA-N 2-methoxy-6-methylphenol Chemical compound [CH]OC1=CC=CC([CH])=C1O KXGFMDJXCMQABM-UHFFFAOYSA-N 0.000 description 1
- 238000005084 2D-nuclear magnetic resonance Methods 0.000 description 1
- 108020005065 3' Flanking Region Proteins 0.000 description 1
- FWBHETKCLVMNFS-UHFFFAOYSA-N 4',6-Diamino-2-phenylindol Chemical compound C1=CC(C(=N)N)=CC=C1C1=CC2=CC=C(C(N)=N)C=C2N1 FWBHETKCLVMNFS-UHFFFAOYSA-N 0.000 description 1
- 108020005029 5' Flanking Region Proteins 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 241001209435 Actus Species 0.000 description 1
- 208000036762 Acute promyelocytic leukaemia Diseases 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 1
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 1
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 1
- DWINFPQUSSHSFS-UVBJJODRSA-N Ala-Arg-Trp Chemical compound N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O DWINFPQUSSHSFS-UVBJJODRSA-N 0.000 description 1
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- NFDVJAKFMXHJEQ-HERUPUMHSA-N Ala-Asp-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NFDVJAKFMXHJEQ-HERUPUMHSA-N 0.000 description 1
- WCBVQNZTOKJWJS-ACZMJKKPSA-N Ala-Cys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O WCBVQNZTOKJWJS-ACZMJKKPSA-N 0.000 description 1
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 1
- UQJUGHFKNKGHFQ-VZFHVOOUSA-N Ala-Cys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UQJUGHFKNKGHFQ-VZFHVOOUSA-N 0.000 description 1
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 1
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 1
- PWYFCPCBOYMOGB-LKTVYLICSA-N Ala-Gln-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N PWYFCPCBOYMOGB-LKTVYLICSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 1
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 1
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 1
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- OMFMCIVBKCEMAK-CYDGBPFRSA-N Ala-Leu-Val-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O OMFMCIVBKCEMAK-CYDGBPFRSA-N 0.000 description 1
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 1
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- KYDYGANDJHFBCW-DRZSPHRISA-N Ala-Phe-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KYDYGANDJHFBCW-DRZSPHRISA-N 0.000 description 1
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 1
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 1
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 1
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 1
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- UCDOXFBTMLKASE-HERUPUMHSA-N Ala-Ser-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N UCDOXFBTMLKASE-HERUPUMHSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- ISCYZXFOCXWUJU-KZVJFYERSA-N Ala-Thr-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O ISCYZXFOCXWUJU-KZVJFYERSA-N 0.000 description 1
- XMIAMUXIMWREBJ-HERUPUMHSA-N Ala-Trp-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XMIAMUXIMWREBJ-HERUPUMHSA-N 0.000 description 1
- YXXPVUOMPSZURS-ZLIFDBKOSA-N Ala-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 YXXPVUOMPSZURS-ZLIFDBKOSA-N 0.000 description 1
- VQBULXOHAZSTQY-GKCIPKSASA-N Ala-Trp-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VQBULXOHAZSTQY-GKCIPKSASA-N 0.000 description 1
- RIPMDCIXRYWXSH-KNXALSJPSA-N Ala-Trp-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N RIPMDCIXRYWXSH-KNXALSJPSA-N 0.000 description 1
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 1
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- SSQHYGLFYWZWDV-UVBJJODRSA-N Ala-Val-Trp Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O SSQHYGLFYWZWDV-UVBJJODRSA-N 0.000 description 1
- 101710153593 Albumin A Proteins 0.000 description 1
- 101710187573 Alcohol dehydrogenase 2 Proteins 0.000 description 1
- 101710133776 Alcohol dehydrogenase class-3 Proteins 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 102000008102 Ankyrins Human genes 0.000 description 1
- 108010049777 Ankyrins Proteins 0.000 description 1
- 241000024287 Areas Species 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- KJGNDQCYBNBXDA-GUBZILKMSA-N Arg-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N KJGNDQCYBNBXDA-GUBZILKMSA-N 0.000 description 1
- XEPSCVXTCUUHDT-AVGNSLFASA-N Arg-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N XEPSCVXTCUUHDT-AVGNSLFASA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- JTKLCCFLSLCCST-SZMVWBNQSA-N Arg-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JTKLCCFLSLCCST-SZMVWBNQSA-N 0.000 description 1
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 1
- GHNDBBVSWOWYII-LPEHRKFASA-N Arg-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GHNDBBVSWOWYII-LPEHRKFASA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- NAARDJBSSPUGCF-FXQIFTODSA-N Arg-Cys-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N NAARDJBSSPUGCF-FXQIFTODSA-N 0.000 description 1
- BBYTXXRNSFUOOX-IHRRRGAJSA-N Arg-Cys-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BBYTXXRNSFUOOX-IHRRRGAJSA-N 0.000 description 1
- RWDVGVPHEWOZMO-GUBZILKMSA-N Arg-Cys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCNC(N)=N)C(O)=O RWDVGVPHEWOZMO-GUBZILKMSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 1
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 1
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 1
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 1
- SYAUZLVLXCDRSH-IUCAKERBSA-N Arg-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N SYAUZLVLXCDRSH-IUCAKERBSA-N 0.000 description 1
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 1
- ZJEDSBGPBXVBMP-PYJNHQTQSA-N Arg-His-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZJEDSBGPBXVBMP-PYJNHQTQSA-N 0.000 description 1
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 1
- RKQRHMKFNBYOTN-IHRRRGAJSA-N Arg-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N RKQRHMKFNBYOTN-IHRRRGAJSA-N 0.000 description 1
- IRRMIGDCPOPZJW-ULQDDVLXSA-N Arg-His-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IRRMIGDCPOPZJW-ULQDDVLXSA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- HCIUUZGFTDTEGM-NAKRPEOUSA-N Arg-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HCIUUZGFTDTEGM-NAKRPEOUSA-N 0.000 description 1
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 1
- FLYANDHDFRGGTM-PYJNHQTQSA-N Arg-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FLYANDHDFRGGTM-PYJNHQTQSA-N 0.000 description 1
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 1
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 1
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- NOZYDJOPOGKUSR-AVGNSLFASA-N Arg-Leu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O NOZYDJOPOGKUSR-AVGNSLFASA-N 0.000 description 1
- IIAXFBUTKIDDIP-ULQDDVLXSA-N Arg-Leu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IIAXFBUTKIDDIP-ULQDDVLXSA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- RTDZQOFEGPWSJD-AVGNSLFASA-N Arg-Leu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O RTDZQOFEGPWSJD-AVGNSLFASA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 1
- DQBFZVFOKCEJMC-SRVKXCTJSA-N Arg-Lys-Cys-Gly Chemical compound OC(=O)CNC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N DQBFZVFOKCEJMC-SRVKXCTJSA-N 0.000 description 1
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- LCBSSOCDWUTQQV-SDDRHHMPSA-N Arg-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LCBSSOCDWUTQQV-SDDRHHMPSA-N 0.000 description 1
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 1
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 1
- FIQKRDXFTANIEJ-ULQDDVLXSA-N Arg-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FIQKRDXFTANIEJ-ULQDDVLXSA-N 0.000 description 1
- UIUXXFIKWQVMEX-UFYCRDLUSA-N Arg-Phe-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UIUXXFIKWQVMEX-UFYCRDLUSA-N 0.000 description 1
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- AMIQZQAAYGYKOP-FXQIFTODSA-N Arg-Ser-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O AMIQZQAAYGYKOP-FXQIFTODSA-N 0.000 description 1
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 1
- JQHASVQBAKRJKD-GUBZILKMSA-N Arg-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JQHASVQBAKRJKD-GUBZILKMSA-N 0.000 description 1
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 1
- WCZXPVPHUMYLMS-VEVYYDQMSA-N Arg-Thr-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O WCZXPVPHUMYLMS-VEVYYDQMSA-N 0.000 description 1
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 1
- DRDWXKWUSIKKOB-PJODQICGSA-N Arg-Trp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O DRDWXKWUSIKKOB-PJODQICGSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- XRLOBFSLPCHYLQ-ULQDDVLXSA-N Arg-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O XRLOBFSLPCHYLQ-ULQDDVLXSA-N 0.000 description 1
- QJWLLRZTJFPCHA-STECZYCISA-N Arg-Tyr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QJWLLRZTJFPCHA-STECZYCISA-N 0.000 description 1
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 1
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 1
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 1
- QQEWINYJRFBLNN-DLOVCJGASA-N Asn-Ala-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QQEWINYJRFBLNN-DLOVCJGASA-N 0.000 description 1
- NUHQMYUWLUSRJX-BIIVOSGPSA-N Asn-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N NUHQMYUWLUSRJX-BIIVOSGPSA-N 0.000 description 1
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 1
- GMRGSBAMMMVDGG-GUBZILKMSA-N Asn-Arg-Arg Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N GMRGSBAMMMVDGG-GUBZILKMSA-N 0.000 description 1
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 1
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 1
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 1
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 1
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 1
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 1
- PIWWUBYJNONVTJ-ZLUOBGJFSA-N Asn-Asp-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N PIWWUBYJNONVTJ-ZLUOBGJFSA-N 0.000 description 1
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 1
- JZRLLSOWDYUKOK-SRVKXCTJSA-N Asn-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N JZRLLSOWDYUKOK-SRVKXCTJSA-N 0.000 description 1
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 1
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 1
- VWJFQGXPYOPXJH-ZLUOBGJFSA-N Asn-Cys-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)C(=O)N VWJFQGXPYOPXJH-ZLUOBGJFSA-N 0.000 description 1
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 1
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 1
- QPTAGIPWARILES-AVGNSLFASA-N Asn-Gln-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QPTAGIPWARILES-AVGNSLFASA-N 0.000 description 1
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 1
- GYOHQKJEQQJBOY-QEJZJMRPSA-N Asn-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N GYOHQKJEQQJBOY-QEJZJMRPSA-N 0.000 description 1
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- ODBSSLHUFPJRED-CIUDSAMLSA-N Asn-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N ODBSSLHUFPJRED-CIUDSAMLSA-N 0.000 description 1
- ZTRJUKDEALVRMW-SRVKXCTJSA-N Asn-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZTRJUKDEALVRMW-SRVKXCTJSA-N 0.000 description 1
- YGHCVNQOZZMHRZ-DJFWLOJKSA-N Asn-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N YGHCVNQOZZMHRZ-DJFWLOJKSA-N 0.000 description 1
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 1
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 1
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 1
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 1
- KMCRKVOLRCOMBG-DJFWLOJKSA-N Asn-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KMCRKVOLRCOMBG-DJFWLOJKSA-N 0.000 description 1
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 1
- LVHMEJJWEXBMKK-GMOBBJLQSA-N Asn-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N LVHMEJJWEXBMKK-GMOBBJLQSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 1
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 1
- FODVBOKTYKYRFJ-CIUDSAMLSA-N Asn-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N FODVBOKTYKYRFJ-CIUDSAMLSA-N 0.000 description 1
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 1
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 1
- NTWOPSIUJBMNRI-KKUMJFAQSA-N Asn-Lys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTWOPSIUJBMNRI-KKUMJFAQSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- KNENKKKUYGEZIO-FXQIFTODSA-N Asn-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N KNENKKKUYGEZIO-FXQIFTODSA-N 0.000 description 1
- MDDXKBHIMYYJLW-FXQIFTODSA-N Asn-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N MDDXKBHIMYYJLW-FXQIFTODSA-N 0.000 description 1
- HGGIYWURFPGLIU-FXQIFTODSA-N Asn-Met-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(N)=O HGGIYWURFPGLIU-FXQIFTODSA-N 0.000 description 1
- KSGAFDTYQPKUAP-GMOBBJLQSA-N Asn-Met-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KSGAFDTYQPKUAP-GMOBBJLQSA-N 0.000 description 1
- AEZCCDMZZJOGII-DCAQKATOSA-N Asn-Met-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O AEZCCDMZZJOGII-DCAQKATOSA-N 0.000 description 1
- WCRQQIPFSXFIRN-LPEHRKFASA-N Asn-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N WCRQQIPFSXFIRN-LPEHRKFASA-N 0.000 description 1
- RAUPFUCUDBQYHE-AVGNSLFASA-N Asn-Phe-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RAUPFUCUDBQYHE-AVGNSLFASA-N 0.000 description 1
- UYCPJVYQYARFGB-YDHLFZDLSA-N Asn-Phe-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UYCPJVYQYARFGB-YDHLFZDLSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 1
- UWFOMGUWGPRVBW-GUBZILKMSA-N Asn-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N UWFOMGUWGPRVBW-GUBZILKMSA-N 0.000 description 1
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 1
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 1
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 1
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 1
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 1
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 1
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 1
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 1
- AECPDLSSUMDUAA-ZKWXMUAHSA-N Asn-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N AECPDLSSUMDUAA-ZKWXMUAHSA-N 0.000 description 1
- LTDGPJKGJDIBQD-LAEOZQHASA-N Asn-Val-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LTDGPJKGJDIBQD-LAEOZQHASA-N 0.000 description 1
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 1
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- VGRHZPNRCLAHQA-IMJSIDKUSA-N Asp-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O VGRHZPNRCLAHQA-IMJSIDKUSA-N 0.000 description 1
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- AKPLMZMNJGNUKT-ZLUOBGJFSA-N Asp-Asp-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O AKPLMZMNJGNUKT-ZLUOBGJFSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- AAIUGNSRQDGCDC-ZLUOBGJFSA-N Asp-Cys-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)O AAIUGNSRQDGCDC-ZLUOBGJFSA-N 0.000 description 1
- FTNVLGCFIJEMQT-CIUDSAMLSA-N Asp-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N FTNVLGCFIJEMQT-CIUDSAMLSA-N 0.000 description 1
- NURJSGZGBVJFAD-ZLUOBGJFSA-N Asp-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O NURJSGZGBVJFAD-ZLUOBGJFSA-N 0.000 description 1
- WLKVEEODTPQPLI-ACZMJKKPSA-N Asp-Gln-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WLKVEEODTPQPLI-ACZMJKKPSA-N 0.000 description 1
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- LDGUZSIPGSPBJP-XVYDVKMFSA-N Asp-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LDGUZSIPGSPBJP-XVYDVKMFSA-N 0.000 description 1
- UBPMOJLRVMGTOQ-GARJFASQSA-N Asp-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)C(=O)O UBPMOJLRVMGTOQ-GARJFASQSA-N 0.000 description 1
- ICZWAZVKLACMKR-CIUDSAMLSA-N Asp-His-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 ICZWAZVKLACMKR-CIUDSAMLSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- WNGZKSVJFDZICU-XIRDDKMYSA-N Asp-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N WNGZKSVJFDZICU-XIRDDKMYSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 1
- QNIACYURSSCLRP-GUBZILKMSA-N Asp-Lys-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O QNIACYURSSCLRP-GUBZILKMSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- AKKUDRZKFZWPBH-SRVKXCTJSA-N Asp-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N AKKUDRZKFZWPBH-SRVKXCTJSA-N 0.000 description 1
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 1
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 1
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 1
- YTXCCDCOHIYQFC-GUBZILKMSA-N Asp-Met-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTXCCDCOHIYQFC-GUBZILKMSA-N 0.000 description 1
- WQSXAPPYLGNMQL-IHRRRGAJSA-N Asp-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N WQSXAPPYLGNMQL-IHRRRGAJSA-N 0.000 description 1
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 1
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 1
- KRQFMDNIUOVRIF-KKUMJFAQSA-N Asp-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N KRQFMDNIUOVRIF-KKUMJFAQSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- QTIZKMMLNUMHHU-DCAQKATOSA-N Asp-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QTIZKMMLNUMHHU-DCAQKATOSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 1
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 1
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 1
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 1
- ZVYYMCXVPZEAPU-CWRNSKLLSA-N Asp-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZVYYMCXVPZEAPU-CWRNSKLLSA-N 0.000 description 1
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 1
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 1
- XQFLFQWOBXPMHW-NHCYSSNCSA-N Asp-Val-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XQFLFQWOBXPMHW-NHCYSSNCSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- 208000032791 BCR-ABL1 positive chronic myelogenous leukemia Diseases 0.000 description 1
- 102100026189 Beta-galactosidase Human genes 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 244000025254 Cannabis sativa Species 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 206010008342 Cervix carcinoma Diseases 0.000 description 1
- 241000288673 Chiroptera Species 0.000 description 1
- VEXZGXHMUGYJMC-UHFFFAOYSA-M Chloride anion Chemical compound [Cl-] VEXZGXHMUGYJMC-UHFFFAOYSA-M 0.000 description 1
- 208000010833 Chronic myeloid leukaemia Diseases 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 1
- PKNIZMPLMSKROD-BIIVOSGPSA-N Cys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N PKNIZMPLMSKROD-BIIVOSGPSA-N 0.000 description 1
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 1
- BUIYOWKUSCTBRE-CIUDSAMLSA-N Cys-Arg-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O BUIYOWKUSCTBRE-CIUDSAMLSA-N 0.000 description 1
- CEZSLNCYQUFOSL-BQBZGAKWSA-N Cys-Arg-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O CEZSLNCYQUFOSL-BQBZGAKWSA-N 0.000 description 1
- JTNKVWLMDHIUOG-IHRRRGAJSA-N Cys-Arg-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JTNKVWLMDHIUOG-IHRRRGAJSA-N 0.000 description 1
- GEEXORWTBTUOHC-FXQIFTODSA-N Cys-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N GEEXORWTBTUOHC-FXQIFTODSA-N 0.000 description 1
- CPTUXCUWQIBZIF-ZLUOBGJFSA-N Cys-Asn-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CPTUXCUWQIBZIF-ZLUOBGJFSA-N 0.000 description 1
- VNLYIYOYUNGURO-ZLUOBGJFSA-N Cys-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N VNLYIYOYUNGURO-ZLUOBGJFSA-N 0.000 description 1
- VZKXOWRNJDEGLZ-WHFBIAKZSA-N Cys-Asp-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O VZKXOWRNJDEGLZ-WHFBIAKZSA-N 0.000 description 1
- WDQXKVCQXRNOSI-GHCJXIJMSA-N Cys-Asp-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WDQXKVCQXRNOSI-GHCJXIJMSA-N 0.000 description 1
- YZFCGHIBLBDZDA-ZLUOBGJFSA-N Cys-Asp-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YZFCGHIBLBDZDA-ZLUOBGJFSA-N 0.000 description 1
- ATPDEYTYWVMINF-ZLUOBGJFSA-N Cys-Cys-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O ATPDEYTYWVMINF-ZLUOBGJFSA-N 0.000 description 1
- MBILEVLLOHJZMG-FXQIFTODSA-N Cys-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MBILEVLLOHJZMG-FXQIFTODSA-N 0.000 description 1
- KEBJBKIASQVRJS-WDSKDSINSA-N Cys-Gln-Gly Chemical compound C(CC(=O)N)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N KEBJBKIASQVRJS-WDSKDSINSA-N 0.000 description 1
- VKAWJBQTFCBHQY-GUBZILKMSA-N Cys-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N VKAWJBQTFCBHQY-GUBZILKMSA-N 0.000 description 1
- SFRQEQGPRTVDPO-NRPADANISA-N Cys-Gln-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O SFRQEQGPRTVDPO-NRPADANISA-N 0.000 description 1
- BCSYBBMFGLHCOA-ACZMJKKPSA-N Cys-Glu-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BCSYBBMFGLHCOA-ACZMJKKPSA-N 0.000 description 1
- MUZAUPFGPMMZSS-GUBZILKMSA-N Cys-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N MUZAUPFGPMMZSS-GUBZILKMSA-N 0.000 description 1
- AOZBJZBKFHOYHL-AVGNSLFASA-N Cys-Glu-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O AOZBJZBKFHOYHL-AVGNSLFASA-N 0.000 description 1
- LBOLGUYQEPZSKM-YUMQZZPRSA-N Cys-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N LBOLGUYQEPZSKM-YUMQZZPRSA-N 0.000 description 1
- JDHMXPSXWMPYQZ-AAEUAGOBSA-N Cys-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N JDHMXPSXWMPYQZ-AAEUAGOBSA-N 0.000 description 1
- ANPADMNVVOOYKW-DCAQKATOSA-N Cys-His-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ANPADMNVVOOYKW-DCAQKATOSA-N 0.000 description 1
- WTNLLMQAFPOCTJ-GARJFASQSA-N Cys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N)C(=O)O WTNLLMQAFPOCTJ-GARJFASQSA-N 0.000 description 1
- DIUBVGXMXONJCF-KKUMJFAQSA-N Cys-His-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DIUBVGXMXONJCF-KKUMJFAQSA-N 0.000 description 1
- ZLHPWFSAUJEEAN-KBIXCLLPSA-N Cys-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N ZLHPWFSAUJEEAN-KBIXCLLPSA-N 0.000 description 1
- OTXLNICGSXPGQF-KBIXCLLPSA-N Cys-Ile-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTXLNICGSXPGQF-KBIXCLLPSA-N 0.000 description 1
- KKUVRYLJEXJSGX-MXAVVETBSA-N Cys-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N KKUVRYLJEXJSGX-MXAVVETBSA-N 0.000 description 1
- ODDOYXKAHLKKQY-MMWGEVLESA-N Cys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N ODDOYXKAHLKKQY-MMWGEVLESA-N 0.000 description 1
- MRVSLWQRNWEROS-SVSWQMSJSA-N Cys-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CS)N MRVSLWQRNWEROS-SVSWQMSJSA-N 0.000 description 1
- YFAFBAPQHGULQT-HJPIBITLSA-N Cys-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N YFAFBAPQHGULQT-HJPIBITLSA-N 0.000 description 1
- NXTYATMDWQYLGJ-BQBZGAKWSA-N Cys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CS NXTYATMDWQYLGJ-BQBZGAKWSA-N 0.000 description 1
- KXUKWRVYDYIPSQ-CIUDSAMLSA-N Cys-Leu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUKWRVYDYIPSQ-CIUDSAMLSA-N 0.000 description 1
- ABLJDBFJPUWQQB-DCAQKATOSA-N Cys-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N ABLJDBFJPUWQQB-DCAQKATOSA-N 0.000 description 1
- SSNJZBGOMNLSLA-CIUDSAMLSA-N Cys-Leu-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O SSNJZBGOMNLSLA-CIUDSAMLSA-N 0.000 description 1
- BLGNLNRBABWDST-CIUDSAMLSA-N Cys-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BLGNLNRBABWDST-CIUDSAMLSA-N 0.000 description 1
- XLLSMEFANRROJE-GUBZILKMSA-N Cys-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XLLSMEFANRROJE-GUBZILKMSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 1
- LHMSYHSAAJOEBL-CIUDSAMLSA-N Cys-Lys-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O LHMSYHSAAJOEBL-CIUDSAMLSA-N 0.000 description 1
- CIVXDCMSSFGWAL-YUMQZZPRSA-N Cys-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N CIVXDCMSSFGWAL-YUMQZZPRSA-N 0.000 description 1
- UIKLEGZPIOXFHJ-DLOVCJGASA-N Cys-Phe-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O UIKLEGZPIOXFHJ-DLOVCJGASA-N 0.000 description 1
- CHRCKSPMGYDLIA-SRVKXCTJSA-N Cys-Phe-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O CHRCKSPMGYDLIA-SRVKXCTJSA-N 0.000 description 1
- ZOKPRHVIFAUJPV-GUBZILKMSA-N Cys-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O ZOKPRHVIFAUJPV-GUBZILKMSA-N 0.000 description 1
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 1
- TXGDWPBLUFQODU-XGEHTFHBSA-N Cys-Pro-Thr Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O TXGDWPBLUFQODU-XGEHTFHBSA-N 0.000 description 1
- BCFXQBXXDSEHRS-FXQIFTODSA-N Cys-Ser-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BCFXQBXXDSEHRS-FXQIFTODSA-N 0.000 description 1
- WZJLBUPPZRZNTO-CIUDSAMLSA-N Cys-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N WZJLBUPPZRZNTO-CIUDSAMLSA-N 0.000 description 1
- JIVJQYNNAYFXDG-LKXGYXEUSA-N Cys-Thr-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JIVJQYNNAYFXDG-LKXGYXEUSA-N 0.000 description 1
- FTTZLFIEUQHLHH-BWBBJGPYSA-N Cys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O FTTZLFIEUQHLHH-BWBBJGPYSA-N 0.000 description 1
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 1
- FANFRJOFTYCNRG-JYBASQMISA-N Cys-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CS)N)O FANFRJOFTYCNRG-JYBASQMISA-N 0.000 description 1
- XSELZJJGSKZZDO-UBHSHLNASA-N Cys-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XSELZJJGSKZZDO-UBHSHLNASA-N 0.000 description 1
- XAHWYEYOMSGKDA-CWRNSKLLSA-N Cys-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CS)N)C(=O)O XAHWYEYOMSGKDA-CWRNSKLLSA-N 0.000 description 1
- HPZAJRPYUIHDIN-BZSNNMDCSA-N Cys-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CS)N HPZAJRPYUIHDIN-BZSNNMDCSA-N 0.000 description 1
- AZDQAZRURQMSQD-XPUUQOCRSA-N Cys-Val-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AZDQAZRURQMSQD-XPUUQOCRSA-N 0.000 description 1
- QQAYIVHVRFJICE-AEJSXWLSSA-N Cys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N QQAYIVHVRFJICE-AEJSXWLSSA-N 0.000 description 1
- LPBUBIHAVKXUOT-FXQIFTODSA-N Cys-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N LPBUBIHAVKXUOT-FXQIFTODSA-N 0.000 description 1
- 241000701022 Cytomegalovirus Species 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 101710088194 Dehydrogenase Proteins 0.000 description 1
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 1
- 208000035240 Disease Resistance Diseases 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 208000031637 Erythroblastic Acute Leukemia Diseases 0.000 description 1
- 208000036566 Erythroleukaemia Diseases 0.000 description 1
- 108010046649 GDNP peptide Proteins 0.000 description 1
- 241001200922 Gagata Species 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- UWZLBXOBVKRUFE-HGNGGELXSA-N Gln-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N UWZLBXOBVKRUFE-HGNGGELXSA-N 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- WOACHWLUOFZLGJ-GUBZILKMSA-N Gln-Arg-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O WOACHWLUOFZLGJ-GUBZILKMSA-N 0.000 description 1
- RGRMOYQUIJVQQD-SRVKXCTJSA-N Gln-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N RGRMOYQUIJVQQD-SRVKXCTJSA-N 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 1
- DTMLKCYOQKZXKZ-HJGDQZAQSA-N Gln-Arg-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DTMLKCYOQKZXKZ-HJGDQZAQSA-N 0.000 description 1
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 1
- MINZLORERLNSPP-ACZMJKKPSA-N Gln-Asn-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N MINZLORERLNSPP-ACZMJKKPSA-N 0.000 description 1
- UZMWDBOHAOSCCH-ACZMJKKPSA-N Gln-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O UZMWDBOHAOSCCH-ACZMJKKPSA-N 0.000 description 1
- COYGBRTZEVWZBW-XKBZYTNZSA-N Gln-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O COYGBRTZEVWZBW-XKBZYTNZSA-N 0.000 description 1
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 1
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 1
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 1
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 1
- DQPOBSRQNWOBNA-GUBZILKMSA-N Gln-His-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O DQPOBSRQNWOBNA-GUBZILKMSA-N 0.000 description 1
- IWUFOVSLWADEJC-AVGNSLFASA-N Gln-His-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IWUFOVSLWADEJC-AVGNSLFASA-N 0.000 description 1
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 1
- GQZDDFRXSDGUNG-YVNDNENWSA-N Gln-Ile-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O GQZDDFRXSDGUNG-YVNDNENWSA-N 0.000 description 1
- MTCXQQINVAFZKW-MNXVOIDGSA-N Gln-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MTCXQQINVAFZKW-MNXVOIDGSA-N 0.000 description 1
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 1
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 1
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 1
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 1
- GURIQZQSTBBHRV-SRVKXCTJSA-N Gln-Lys-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GURIQZQSTBBHRV-SRVKXCTJSA-N 0.000 description 1
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- DOMHVQBSRJNNKD-ZPFDUUQYSA-N Gln-Met-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DOMHVQBSRJNNKD-ZPFDUUQYSA-N 0.000 description 1
- DFRYZTUPVZNRLG-KKUMJFAQSA-N Gln-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DFRYZTUPVZNRLG-KKUMJFAQSA-N 0.000 description 1
- BZULIEARJFRINC-IHRRRGAJSA-N Gln-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BZULIEARJFRINC-IHRRRGAJSA-N 0.000 description 1
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 1
- QFXNFFZTMFHPST-DZKIICNBSA-N Gln-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)N)N QFXNFFZTMFHPST-DZKIICNBSA-N 0.000 description 1
- NPMFDZGLKBNFOO-SRVKXCTJSA-N Gln-Pro-His Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NPMFDZGLKBNFOO-SRVKXCTJSA-N 0.000 description 1
- MQJDLNRXBOELJW-KKUMJFAQSA-N Gln-Pro-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O MQJDLNRXBOELJW-KKUMJFAQSA-N 0.000 description 1
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- RWQCWSGOOOEGPB-FXQIFTODSA-N Gln-Ser-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O RWQCWSGOOOEGPB-FXQIFTODSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 1
- DYVMTEWCGAVKSE-HJGDQZAQSA-N Gln-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O DYVMTEWCGAVKSE-HJGDQZAQSA-N 0.000 description 1
- UXXIVIQGOODKQC-NUMRIWBASA-N Gln-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UXXIVIQGOODKQC-NUMRIWBASA-N 0.000 description 1
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 1
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 1
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 1
- XMWNHGKDDIFXQJ-NWLDYVSISA-N Gln-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XMWNHGKDDIFXQJ-NWLDYVSISA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- UBRQJXFDVZNYJP-AVGNSLFASA-N Gln-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UBRQJXFDVZNYJP-AVGNSLFASA-N 0.000 description 1
- ZFBBMCKQSNJZSN-AUTRQRHGSA-N Gln-Val-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFBBMCKQSNJZSN-AUTRQRHGSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 1
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 1
- NKSGKPWXSWBRRX-ACZMJKKPSA-N Glu-Asn-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N NKSGKPWXSWBRRX-ACZMJKKPSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 1
- RQNYYRHRKSVKAB-GUBZILKMSA-N Glu-Cys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O RQNYYRHRKSVKAB-GUBZILKMSA-N 0.000 description 1
- KVBPDJIFRQUQFY-ACZMJKKPSA-N Glu-Cys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O KVBPDJIFRQUQFY-ACZMJKKPSA-N 0.000 description 1
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- NUSWUSKZRCGFEX-FXQIFTODSA-N Glu-Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O NUSWUSKZRCGFEX-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- GGJOGFJIPPGNRK-JSGCOSHPSA-N Glu-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 GGJOGFJIPPGNRK-JSGCOSHPSA-N 0.000 description 1
- ZJFNRQHUIHKZJF-GUBZILKMSA-N Glu-His-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O ZJFNRQHUIHKZJF-GUBZILKMSA-N 0.000 description 1
- BIHMNDPWRUROFZ-JYJNAYRXSA-N Glu-His-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BIHMNDPWRUROFZ-JYJNAYRXSA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- ZGEJRLJEAMPEDV-SRVKXCTJSA-N Glu-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N ZGEJRLJEAMPEDV-SRVKXCTJSA-N 0.000 description 1
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- XEKAJTCACGEBOK-KKUMJFAQSA-N Glu-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XEKAJTCACGEBOK-KKUMJFAQSA-N 0.000 description 1
- ZTVGZOIBLRPQNR-KKUMJFAQSA-N Glu-Met-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZTVGZOIBLRPQNR-KKUMJFAQSA-N 0.000 description 1
- UERORLSAFUHDGU-AVGNSLFASA-N Glu-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UERORLSAFUHDGU-AVGNSLFASA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 1
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 1
- HLYCMRDRWGSTPZ-CIUDSAMLSA-N Glu-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CS)C(=O)O HLYCMRDRWGSTPZ-CIUDSAMLSA-N 0.000 description 1
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 1
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- TZXOPHFCAATANZ-QEJZJMRPSA-N Glu-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N TZXOPHFCAATANZ-QEJZJMRPSA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- DDXZHOHEABQXSE-NKIYYHGXSA-N Glu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O DDXZHOHEABQXSE-NKIYYHGXSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- VHPVBPCCWVDGJL-IRIUXVKKSA-N Glu-Thr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VHPVBPCCWVDGJL-IRIUXVKKSA-N 0.000 description 1
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 1
- KXRORHJIRAOQPG-SOUVJXGZSA-N Glu-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KXRORHJIRAOQPG-SOUVJXGZSA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 1
- SXRSQZLOMIGNAQ-UHFFFAOYSA-N Glutaraldehyde Chemical compound O=CCCCC=O SXRSQZLOMIGNAQ-UHFFFAOYSA-N 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- RQZGFWKQLPJOEQ-YUMQZZPRSA-N Gly-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN)CN=C(N)N RQZGFWKQLPJOEQ-YUMQZZPRSA-N 0.000 description 1
- JPXNYFOHTHSREU-UWVGGRQHSA-N Gly-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN JPXNYFOHTHSREU-UWVGGRQHSA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- MXXXVOYFNVJHMA-IUCAKERBSA-N Gly-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN MXXXVOYFNVJHMA-IUCAKERBSA-N 0.000 description 1
- XZRZILPOZBVTDB-GJZGRUSLSA-N Gly-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)CN)C(O)=O)=CNC2=C1 XZRZILPOZBVTDB-GJZGRUSLSA-N 0.000 description 1
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 1
- WJZLEENECIOOSA-WDSKDSINSA-N Gly-Asn-Gln Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)O WJZLEENECIOOSA-WDSKDSINSA-N 0.000 description 1
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 1
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 1
- GZBZACMXFIPIDX-WHFBIAKZSA-N Gly-Cys-Asp Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)C(=O)O GZBZACMXFIPIDX-WHFBIAKZSA-N 0.000 description 1
- NMROINAYXCACKF-WHFBIAKZSA-N Gly-Cys-Cys Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O NMROINAYXCACKF-WHFBIAKZSA-N 0.000 description 1
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 1
- SABZDFAAOJATBR-QWRGUYRKSA-N Gly-Cys-Phe Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SABZDFAAOJATBR-QWRGUYRKSA-N 0.000 description 1
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 1
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 1
- VUUOMYFPWDYETE-WDSKDSINSA-N Gly-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN VUUOMYFPWDYETE-WDSKDSINSA-N 0.000 description 1
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 1
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 1
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- NTOWAXLMQFKJPT-YUMQZZPRSA-N Gly-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN NTOWAXLMQFKJPT-YUMQZZPRSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 1
- UUWOBINZFGTFMS-UWVGGRQHSA-N Gly-His-Met Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O UUWOBINZFGTFMS-UWVGGRQHSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- YKJUITHASJAGHO-HOTGVXAUSA-N Gly-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN YKJUITHASJAGHO-HOTGVXAUSA-N 0.000 description 1
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 1
- QLQDIJBYJZKQPR-BQBZGAKWSA-N Gly-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN QLQDIJBYJZKQPR-BQBZGAKWSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- QVDGHDFFYHKJPN-QWRGUYRKSA-N Gly-Phe-Cys Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O QVDGHDFFYHKJPN-QWRGUYRKSA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 1
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 1
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 1
- FXTUGWXZTFMTIV-GJZGRUSLSA-N Gly-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN FXTUGWXZTFMTIV-GJZGRUSLSA-N 0.000 description 1
- RZEDHGORCKRINR-STQMWFEESA-N Gly-Trp-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN RZEDHGORCKRINR-STQMWFEESA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- HTTJABKRGRZYRN-UHFFFAOYSA-N Heparin Chemical compound OC1C(NC(=O)C)C(O)OC(COS(O)(=O)=O)C1OC1C(OS(O)(=O)=O)C(O)C(OC2C(C(OS(O)(=O)=O)C(OC3C(C(O)C(O)C(O3)C(O)=O)OS(O)(=O)=O)C(CO)O2)NS(O)(=O)=O)C(C(O)=O)O1 HTTJABKRGRZYRN-UHFFFAOYSA-N 0.000 description 1
- MJNWEIMBXKKCSF-XVYDVKMFSA-N His-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N MJNWEIMBXKKCSF-XVYDVKMFSA-N 0.000 description 1
- XINDHUAGVGCNSF-QSFUFRPTSA-N His-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XINDHUAGVGCNSF-QSFUFRPTSA-N 0.000 description 1
- FLUVGKKRRMLNPU-CQDKDKBSSA-N His-Ala-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FLUVGKKRRMLNPU-CQDKDKBSSA-N 0.000 description 1
- SOFSRBYHDINIRG-QTKMDUPCSA-N His-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N)O SOFSRBYHDINIRG-QTKMDUPCSA-N 0.000 description 1
- TTZAWSKKNCEINZ-AVGNSLFASA-N His-Arg-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O TTZAWSKKNCEINZ-AVGNSLFASA-N 0.000 description 1
- FPNWKONEZAVQJF-GUBZILKMSA-N His-Asn-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N FPNWKONEZAVQJF-GUBZILKMSA-N 0.000 description 1
- OBTMRGFRLJBSFI-GARJFASQSA-N His-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O OBTMRGFRLJBSFI-GARJFASQSA-N 0.000 description 1
- BQYZXYCEKYJKAM-VGDYDELISA-N His-Cys-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQYZXYCEKYJKAM-VGDYDELISA-N 0.000 description 1
- UPGJWSUYENXOPV-HGNGGELXSA-N His-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N UPGJWSUYENXOPV-HGNGGELXSA-N 0.000 description 1
- VYMGAXSNYUFVCK-GUBZILKMSA-N His-Gln-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N VYMGAXSNYUFVCK-GUBZILKMSA-N 0.000 description 1
- NELVFWFDOKRTOR-SDDRHHMPSA-N His-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O NELVFWFDOKRTOR-SDDRHHMPSA-N 0.000 description 1
- IIVZNQCUUMBBKF-GVXVVHGQSA-N His-Gln-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 IIVZNQCUUMBBKF-GVXVVHGQSA-N 0.000 description 1
- TVRMJKNELJKNRS-GUBZILKMSA-N His-Glu-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N TVRMJKNELJKNRS-GUBZILKMSA-N 0.000 description 1
- AKEDPWJFQULLPE-IUCAKERBSA-N His-Glu-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O AKEDPWJFQULLPE-IUCAKERBSA-N 0.000 description 1
- XMENRVZYPBKBIL-AVGNSLFASA-N His-Glu-His Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XMENRVZYPBKBIL-AVGNSLFASA-N 0.000 description 1
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 1
- KWBISLAEQZUYIC-UWJYBYFXSA-N His-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N KWBISLAEQZUYIC-UWJYBYFXSA-N 0.000 description 1
- LBQAHBIVXQSBIR-HVTMNAMFSA-N His-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LBQAHBIVXQSBIR-HVTMNAMFSA-N 0.000 description 1
- MLZVJIREOKTDAR-SIGLWIIPSA-N His-Ile-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MLZVJIREOKTDAR-SIGLWIIPSA-N 0.000 description 1
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 1
- MFQVZYSPCIZFMR-MGHWNKPDSA-N His-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N MFQVZYSPCIZFMR-MGHWNKPDSA-N 0.000 description 1
- DYKZGTLPSNOFHU-DEQVHRJGSA-N His-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N DYKZGTLPSNOFHU-DEQVHRJGSA-N 0.000 description 1
- ZRSJXIKQXUGKRB-TUBUOCAGSA-N His-Ile-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZRSJXIKQXUGKRB-TUBUOCAGSA-N 0.000 description 1
- IWXMHXYOACDSIA-PYJNHQTQSA-N His-Ile-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O IWXMHXYOACDSIA-PYJNHQTQSA-N 0.000 description 1
- SKYULSWNBYAQMG-IHRRRGAJSA-N His-Leu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SKYULSWNBYAQMG-IHRRRGAJSA-N 0.000 description 1
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 1
- LJUIEESLIAZSFR-SRVKXCTJSA-N His-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LJUIEESLIAZSFR-SRVKXCTJSA-N 0.000 description 1
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 1
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 1
- MJUUWJJEUOBDGW-IHRRRGAJSA-N His-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MJUUWJJEUOBDGW-IHRRRGAJSA-N 0.000 description 1
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 1
- LVWIJITYHRZHBO-IXOXFDKPSA-N His-Leu-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LVWIJITYHRZHBO-IXOXFDKPSA-N 0.000 description 1
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 1
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 1
- CKRJBQJIGOEKMC-SRVKXCTJSA-N His-Lys-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CKRJBQJIGOEKMC-SRVKXCTJSA-N 0.000 description 1
- RNAYRCNHRYEBTH-IHRRRGAJSA-N His-Met-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RNAYRCNHRYEBTH-IHRRRGAJSA-N 0.000 description 1
- KYFGGRHWLFZXPU-KKUMJFAQSA-N His-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N KYFGGRHWLFZXPU-KKUMJFAQSA-N 0.000 description 1
- WPUAVVXYEJAWIV-KKUMJFAQSA-N His-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WPUAVVXYEJAWIV-KKUMJFAQSA-N 0.000 description 1
- ULRFSEJGSHYLQI-YESZJQIVSA-N His-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O ULRFSEJGSHYLQI-YESZJQIVSA-N 0.000 description 1
- HYWZHNUGAYVEEW-KKUMJFAQSA-N His-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HYWZHNUGAYVEEW-KKUMJFAQSA-N 0.000 description 1
- JSQIXEHORHLQEE-MEYUZBJRSA-N His-Phe-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JSQIXEHORHLQEE-MEYUZBJRSA-N 0.000 description 1
- ZVKDCQVQTGYBQT-LSJOCFKGSA-N His-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O ZVKDCQVQTGYBQT-LSJOCFKGSA-N 0.000 description 1
- CHIAUHSHDARFBD-ULQDDVLXSA-N His-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 CHIAUHSHDARFBD-ULQDDVLXSA-N 0.000 description 1
- KAXZXLSXFWSNNZ-XVYDVKMFSA-N His-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KAXZXLSXFWSNNZ-XVYDVKMFSA-N 0.000 description 1
- BFOGZWSSGMLYKV-DCAQKATOSA-N His-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N BFOGZWSSGMLYKV-DCAQKATOSA-N 0.000 description 1
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 1
- ILUVWFTXAUYOBW-CUJWVEQBSA-N His-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N)O ILUVWFTXAUYOBW-CUJWVEQBSA-N 0.000 description 1
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 1
- BRQKGRLDDDQWQJ-MBLNEYKQSA-N His-Thr-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O BRQKGRLDDDQWQJ-MBLNEYKQSA-N 0.000 description 1
- FCPSGEVYIVXPPO-QTKMDUPCSA-N His-Thr-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FCPSGEVYIVXPPO-QTKMDUPCSA-N 0.000 description 1
- FBVHRDXSCYELMI-PBCZWWQYSA-N His-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O FBVHRDXSCYELMI-PBCZWWQYSA-N 0.000 description 1
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 1
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 1
- QTMKFZAYZKBFRC-BZSNNMDCSA-N His-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N)O QTMKFZAYZKBFRC-BZSNNMDCSA-N 0.000 description 1
- QLBXWYXMLHAREM-PYJNHQTQSA-N His-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N QLBXWYXMLHAREM-PYJNHQTQSA-N 0.000 description 1
- PUFNQIPSRXVLQJ-IHRRRGAJSA-N His-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N PUFNQIPSRXVLQJ-IHRRRGAJSA-N 0.000 description 1
- FBOMZVOKCZMDIG-XQQFMLRXSA-N His-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N FBOMZVOKCZMDIG-XQQFMLRXSA-N 0.000 description 1
- 108010033040 Histones Proteins 0.000 description 1
- 102000009331 Homeodomain Proteins Human genes 0.000 description 1
- 108010048671 Homeodomain Proteins Proteins 0.000 description 1
- 101000960954 Homo sapiens Interleukin-18 Proteins 0.000 description 1
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 1
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 1
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 1
- HLYBGMZJVDHJEO-CYDGBPFRSA-N Ile-Arg-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HLYBGMZJVDHJEO-CYDGBPFRSA-N 0.000 description 1
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 1
- ATXGFMOBVKSOMK-PEDHHIEDSA-N Ile-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N ATXGFMOBVKSOMK-PEDHHIEDSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- HZYHBDVRCBDJJV-HAFWLYHUSA-N Ile-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O HZYHBDVRCBDJJV-HAFWLYHUSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 1
- ZZHGKECPZXPXJF-PCBIJLKTSA-N Ile-Asn-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZZHGKECPZXPXJF-PCBIJLKTSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- JQLFYZMEXFNRFS-DJFWLOJKSA-N Ile-Asp-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N JQLFYZMEXFNRFS-DJFWLOJKSA-N 0.000 description 1
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 1
- LDRALPZEVHVXEK-KBIXCLLPSA-N Ile-Cys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N LDRALPZEVHVXEK-KBIXCLLPSA-N 0.000 description 1
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 1
- KMBPQYKVZBMRMH-PEFMBERDSA-N Ile-Gln-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KMBPQYKVZBMRMH-PEFMBERDSA-N 0.000 description 1
- LJKDGRWXYUTRSH-YVNDNENWSA-N Ile-Gln-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LJKDGRWXYUTRSH-YVNDNENWSA-N 0.000 description 1
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 1
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 description 1
- HTDRTKMNJRRYOJ-SIUGBPQLSA-N Ile-Gln-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HTDRTKMNJRRYOJ-SIUGBPQLSA-N 0.000 description 1
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 1
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- IXEFKXAGHRQFAF-HVTMNAMFSA-N Ile-Glu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IXEFKXAGHRQFAF-HVTMNAMFSA-N 0.000 description 1
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 1
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 1
- TVSPLSZTKTUYLV-ZPFDUUQYSA-N Ile-Glu-Met Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O TVSPLSZTKTUYLV-ZPFDUUQYSA-N 0.000 description 1
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 1
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 1
- UQXADIGYEYBJEI-DJFWLOJKSA-N Ile-His-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N UQXADIGYEYBJEI-DJFWLOJKSA-N 0.000 description 1
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 1
- VUEXLJFLDONGKQ-PYJNHQTQSA-N Ile-His-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N VUEXLJFLDONGKQ-PYJNHQTQSA-N 0.000 description 1
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 1
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 1
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 1
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 1
- KBAPKNDWAGVGTH-IGISWZIWSA-N Ile-Ile-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KBAPKNDWAGVGTH-IGISWZIWSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 1
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 1
- RFMDODRWJZHZCR-BJDJZHNGSA-N Ile-Lys-Cys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(O)=O RFMDODRWJZHZCR-BJDJZHNGSA-N 0.000 description 1
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 1
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 1
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 1
- IDMNOFVUXYYZPF-DKIMLUQUSA-N Ile-Lys-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IDMNOFVUXYYZPF-DKIMLUQUSA-N 0.000 description 1
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 1
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 1
- SNHYFFQZRFIRHO-CYDGBPFRSA-N Ile-Met-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N SNHYFFQZRFIRHO-CYDGBPFRSA-N 0.000 description 1
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 1
- UYNXBNHVWFNVIN-HJWJTTGWSA-N Ile-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 UYNXBNHVWFNVIN-HJWJTTGWSA-N 0.000 description 1
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 1
- KTTMFLSBTNBAHL-MXAVVETBSA-N Ile-Phe-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N KTTMFLSBTNBAHL-MXAVVETBSA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- USXAYNCLFSUSBA-MGHWNKPDSA-N Ile-Phe-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N USXAYNCLFSUSBA-MGHWNKPDSA-N 0.000 description 1
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 1
- RENBRDSDKPSRIH-HJWJTTGWSA-N Ile-Phe-Met Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O RENBRDSDKPSRIH-HJWJTTGWSA-N 0.000 description 1
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 1
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 1
- AKQFLPNANHNTLP-VKOGCVSHSA-N Ile-Pro-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N AKQFLPNANHNTLP-VKOGCVSHSA-N 0.000 description 1
- XOZOSAUOGRPCES-STECZYCISA-N Ile-Pro-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XOZOSAUOGRPCES-STECZYCISA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- ZDNNDIJTUHQCAM-MXAVVETBSA-N Ile-Ser-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ZDNNDIJTUHQCAM-MXAVVETBSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- GMUYXHHJAGQHGB-TUBUOCAGSA-N Ile-Thr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMUYXHHJAGQHGB-TUBUOCAGSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 1
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 1
- NUEHSWNAFIEBCQ-NAKRPEOUSA-N Ile-Val-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N NUEHSWNAFIEBCQ-NAKRPEOUSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
- ZSESFIFAYQEKRD-CYDGBPFRSA-N Ile-Val-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N ZSESFIFAYQEKRD-CYDGBPFRSA-N 0.000 description 1
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 1
- 108010074328 Interferon-gamma Proteins 0.000 description 1
- 102000008070 Interferon-gamma Human genes 0.000 description 1
- 108090000171 Interleukin-18 Proteins 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- 125000000899 L-alpha-glutamyl group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C([H])([H])C([H])([H])C(O[H])=O 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- 241000446313 Lamella Species 0.000 description 1
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 1
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 1
- NTRAGDHVSGKUSF-AVGNSLFASA-N Leu-Arg-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NTRAGDHVSGKUSF-AVGNSLFASA-N 0.000 description 1
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 1
- UILIPCLTHRPCRB-XUXIUFHCSA-N Leu-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(C)C)N UILIPCLTHRPCRB-XUXIUFHCSA-N 0.000 description 1
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 1
- XYUBOFCTGPZFSA-WDSOQIARSA-N Leu-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 XYUBOFCTGPZFSA-WDSOQIARSA-N 0.000 description 1
- CUXRXAIAVYLVFD-ULQDDVLXSA-N Leu-Arg-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUXRXAIAVYLVFD-ULQDDVLXSA-N 0.000 description 1
- DUBAVOVZNZKEQQ-AVGNSLFASA-N Leu-Arg-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCN=C(N)N DUBAVOVZNZKEQQ-AVGNSLFASA-N 0.000 description 1
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 1
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- USTCFDAQCLDPBD-XIRDDKMYSA-N Leu-Asn-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N USTCFDAQCLDPBD-XIRDDKMYSA-N 0.000 description 1
- XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- IIKJNQWOQIWWMR-CIUDSAMLSA-N Leu-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N IIKJNQWOQIWWMR-CIUDSAMLSA-N 0.000 description 1
- NFHJQETXTSDZSI-DCAQKATOSA-N Leu-Cys-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NFHJQETXTSDZSI-DCAQKATOSA-N 0.000 description 1
- DKEZVKFLETVJFY-CIUDSAMLSA-N Leu-Cys-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DKEZVKFLETVJFY-CIUDSAMLSA-N 0.000 description 1
- IASQBRJGRVXNJI-YUMQZZPRSA-N Leu-Cys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)NCC(O)=O IASQBRJGRVXNJI-YUMQZZPRSA-N 0.000 description 1
- LJKJVTCIRDCITR-SRVKXCTJSA-N Leu-Cys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LJKJVTCIRDCITR-SRVKXCTJSA-N 0.000 description 1
- PPTAQBNUFKTJKA-BJDJZHNGSA-N Leu-Cys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PPTAQBNUFKTJKA-BJDJZHNGSA-N 0.000 description 1
- NHHKSOGJYNQENP-SRVKXCTJSA-N Leu-Cys-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N NHHKSOGJYNQENP-SRVKXCTJSA-N 0.000 description 1
- YORLGJINWYYIMX-KKUMJFAQSA-N Leu-Cys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YORLGJINWYYIMX-KKUMJFAQSA-N 0.000 description 1
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 1
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 1
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 1
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 1
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- DDEMUMVXNFPDKC-SRVKXCTJSA-N Leu-His-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N DDEMUMVXNFPDKC-SRVKXCTJSA-N 0.000 description 1
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 1
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 1
- AOFYPTOHESIBFZ-KKUMJFAQSA-N Leu-His-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O AOFYPTOHESIBFZ-KKUMJFAQSA-N 0.000 description 1
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 1
- MPSBSKHOWJQHBS-IHRRRGAJSA-N Leu-His-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N MPSBSKHOWJQHBS-IHRRRGAJSA-N 0.000 description 1
- XBCWOTOCBXXJDG-BZSNNMDCSA-N Leu-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XBCWOTOCBXXJDG-BZSNNMDCSA-N 0.000 description 1
- WRLPVDVHNWSSCL-MELADBBJSA-N Leu-His-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N WRLPVDVHNWSSCL-MELADBBJSA-N 0.000 description 1
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- TVEOVCYCYGKVPP-HSCHXYMDSA-N Leu-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N TVEOVCYCYGKVPP-HSCHXYMDSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 1
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- FOBUGKUBUJOWAD-IHPCNDPISA-N Leu-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FOBUGKUBUJOWAD-IHPCNDPISA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 1
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 1
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 1
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- WXDRGWBQZIMJDE-ULQDDVLXSA-N Leu-Phe-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O WXDRGWBQZIMJDE-ULQDDVLXSA-N 0.000 description 1
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 1
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 1
- MAXILRZVORNXBE-PMVMPFDFSA-N Leu-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MAXILRZVORNXBE-PMVMPFDFSA-N 0.000 description 1
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 1
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 1
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 1
- GOFJOGXGMPHOGL-DCAQKATOSA-N Leu-Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C GOFJOGXGMPHOGL-DCAQKATOSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 1
- URHJPNHRQMQGOZ-RHYQMDGZSA-N Leu-Thr-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O URHJPNHRQMQGOZ-RHYQMDGZSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 1
- LSLUTXRANSUGFY-XIRDDKMYSA-N Leu-Trp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O LSLUTXRANSUGFY-XIRDDKMYSA-N 0.000 description 1
- ZGGVHTQAPHVMKM-IHPCNDPISA-N Leu-Trp-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N ZGGVHTQAPHVMKM-IHPCNDPISA-N 0.000 description 1
- ONHCDMBHPQIPAI-YTQUADARSA-N Leu-Trp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N ONHCDMBHPQIPAI-YTQUADARSA-N 0.000 description 1
- LXGSOEPHQJONMG-PMVMPFDFSA-N Leu-Trp-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N LXGSOEPHQJONMG-PMVMPFDFSA-N 0.000 description 1
- HQBOMRTVKVKFMN-WDSOQIARSA-N Leu-Trp-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O HQBOMRTVKVKFMN-WDSOQIARSA-N 0.000 description 1
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 1
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 1
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 1
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- 235000004431 Linum usitatissimum Nutrition 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- GQUDMNDPQTXZRV-DCAQKATOSA-N Lys-Arg-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GQUDMNDPQTXZRV-DCAQKATOSA-N 0.000 description 1
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 1
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 1
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 1
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 1
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 1
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 1
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 1
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 1
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 1
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 1
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 1
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 1
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 1
- RLZDUFRBMQNYIJ-YUMQZZPRSA-N Lys-Cys-Gly Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N RLZDUFRBMQNYIJ-YUMQZZPRSA-N 0.000 description 1
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 1
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 1
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 1
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 1
- ZMMDPRTXLAEMOD-BZSNNMDCSA-N Lys-His-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZMMDPRTXLAEMOD-BZSNNMDCSA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 1
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- NVGBPTNZLWRQSY-UWVGGRQHSA-N Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN NVGBPTNZLWRQSY-UWVGGRQHSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- URBJRJKWSUFCKS-AVGNSLFASA-N Lys-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N URBJRJKWSUFCKS-AVGNSLFASA-N 0.000 description 1
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 1
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 1
- JPYPRVHMKRFTAT-KKUMJFAQSA-N Lys-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N JPYPRVHMKRFTAT-KKUMJFAQSA-N 0.000 description 1
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 1
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 1
- UDXSLGLHFUBRRM-OEAJRASXSA-N Lys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCCN)N)O UDXSLGLHFUBRRM-OEAJRASXSA-N 0.000 description 1
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 1
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 1
- LKDXINHHSWFFJC-SRVKXCTJSA-N Lys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N LKDXINHHSWFFJC-SRVKXCTJSA-N 0.000 description 1
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 1
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 1
- GVKINWYYLOLEFQ-XIRDDKMYSA-N Lys-Trp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O GVKINWYYLOLEFQ-XIRDDKMYSA-N 0.000 description 1
- OPJRECCCQSDDCZ-TUSQITKMSA-N Lys-Trp-Trp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OPJRECCCQSDDCZ-TUSQITKMSA-N 0.000 description 1
- SUZVLFWOCKHWET-CQDKDKBSSA-N Lys-Tyr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O SUZVLFWOCKHWET-CQDKDKBSSA-N 0.000 description 1
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 1
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 1
- XYLSGAWRCZECIQ-JYJNAYRXSA-N Lys-Tyr-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 XYLSGAWRCZECIQ-JYJNAYRXSA-N 0.000 description 1
- PSVAVKGDUAKZKU-BZSNNMDCSA-N Lys-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCCN)N)O PSVAVKGDUAKZKU-BZSNNMDCSA-N 0.000 description 1
- IMDJSVBFQKDDEQ-MGHWNKPDSA-N Lys-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N IMDJSVBFQKDDEQ-MGHWNKPDSA-N 0.000 description 1
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 1
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- XBAJINCXDBTJRH-WDSOQIARSA-N Lys-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N XBAJINCXDBTJRH-WDSOQIARSA-N 0.000 description 1
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 1
- PEEHTFAAVSWFBL-UHFFFAOYSA-N Maleimide Chemical compound O=C1NC(=O)C=C1 PEEHTFAAVSWFBL-UHFFFAOYSA-N 0.000 description 1
- 108010038049 Mating Factor Proteins 0.000 description 1
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- WYEXWKAWMNJKPN-UBHSHLNASA-N Met-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCSC)N WYEXWKAWMNJKPN-UBHSHLNASA-N 0.000 description 1
- IIPHCNKHEZYSNE-DCAQKATOSA-N Met-Arg-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O IIPHCNKHEZYSNE-DCAQKATOSA-N 0.000 description 1
- AHZNUGRZHMZGFL-GUBZILKMSA-N Met-Arg-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCNC(N)=N AHZNUGRZHMZGFL-GUBZILKMSA-N 0.000 description 1
- IVCPHARVJUYDPA-FXQIFTODSA-N Met-Asn-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IVCPHARVJUYDPA-FXQIFTODSA-N 0.000 description 1
- NSGXXVIHCIAISP-CIUDSAMLSA-N Met-Asn-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O NSGXXVIHCIAISP-CIUDSAMLSA-N 0.000 description 1
- QXEVZBXTDTVPCP-GMOBBJLQSA-N Met-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCSC)N QXEVZBXTDTVPCP-GMOBBJLQSA-N 0.000 description 1
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 1
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 1
- HDNOQCZWJGGHSS-VEVYYDQMSA-N Met-Asn-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HDNOQCZWJGGHSS-VEVYYDQMSA-N 0.000 description 1
- SQUTUWHAAWJYES-GUBZILKMSA-N Met-Asp-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SQUTUWHAAWJYES-GUBZILKMSA-N 0.000 description 1
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 1
- XMMWDTUFTZMQFD-GMOBBJLQSA-N Met-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCSC XMMWDTUFTZMQFD-GMOBBJLQSA-N 0.000 description 1
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 1
- XOMXAVJBLRROMC-IHRRRGAJSA-N Met-Asp-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOMXAVJBLRROMC-IHRRRGAJSA-N 0.000 description 1
- OXHSZBRPUGNMKW-DCAQKATOSA-N Met-Gln-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OXHSZBRPUGNMKW-DCAQKATOSA-N 0.000 description 1
- MTBVQFFQMXHCPC-CIUDSAMLSA-N Met-Glu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MTBVQFFQMXHCPC-CIUDSAMLSA-N 0.000 description 1
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 1
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 1
- WXJXYMFUTRXRGO-UWVGGRQHSA-N Met-His-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 WXJXYMFUTRXRGO-UWVGGRQHSA-N 0.000 description 1
- XPCLRYNQMZOOFB-ULQDDVLXSA-N Met-His-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N XPCLRYNQMZOOFB-ULQDDVLXSA-N 0.000 description 1
- MVMNUCOHQGYYKB-PEDHHIEDSA-N Met-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCSC)N MVMNUCOHQGYYKB-PEDHHIEDSA-N 0.000 description 1
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 1
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 1
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 1
- UFOWQBYMUILSRK-IHRRRGAJSA-N Met-Lys-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 UFOWQBYMUILSRK-IHRRRGAJSA-N 0.000 description 1
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 1
- VBGGTAPDGFQMKF-AVGNSLFASA-N Met-Lys-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O VBGGTAPDGFQMKF-AVGNSLFASA-N 0.000 description 1
- CGUYGMFQZCYJSG-DCAQKATOSA-N Met-Lys-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CGUYGMFQZCYJSG-DCAQKATOSA-N 0.000 description 1
- ILKCLLLOGPDNIP-RCWTZXSCSA-N Met-Met-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ILKCLLLOGPDNIP-RCWTZXSCSA-N 0.000 description 1
- XGIQKEAKUSPCBU-SRVKXCTJSA-N Met-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCSC)N XGIQKEAKUSPCBU-SRVKXCTJSA-N 0.000 description 1
- FNYBIOGBMWFQRJ-SRVKXCTJSA-N Met-Pro-Met Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N FNYBIOGBMWFQRJ-SRVKXCTJSA-N 0.000 description 1
- NHXXGBXJTLRGJI-GUBZILKMSA-N Met-Pro-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O NHXXGBXJTLRGJI-GUBZILKMSA-N 0.000 description 1
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 1
- ZDJICAUBMUKVEJ-CIUDSAMLSA-N Met-Ser-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O ZDJICAUBMUKVEJ-CIUDSAMLSA-N 0.000 description 1
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 1
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 1
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 1
- GMMLGMFBYCFCCX-KZVJFYERSA-N Met-Thr-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMMLGMFBYCFCCX-KZVJFYERSA-N 0.000 description 1
- RIIFMEBFDDXGCV-VEVYYDQMSA-N Met-Thr-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O RIIFMEBFDDXGCV-VEVYYDQMSA-N 0.000 description 1
- SPSSJSICDYYTQN-HJGDQZAQSA-N Met-Thr-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O SPSSJSICDYYTQN-HJGDQZAQSA-N 0.000 description 1
- WXJLBSXNUHIGSS-OSUNSFLBSA-N Met-Thr-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WXJLBSXNUHIGSS-OSUNSFLBSA-N 0.000 description 1
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 1
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 1
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 1
- TUZSWDCTCGTVDJ-PJODQICGSA-N Met-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 TUZSWDCTCGTVDJ-PJODQICGSA-N 0.000 description 1
- VVWQHJUYBPJCNS-UMPQAUOISA-N Met-Trp-Thr Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)=CNC2=C1 VVWQHJUYBPJCNS-UMPQAUOISA-N 0.000 description 1
- XTSBLBXAUIBMLW-KKUMJFAQSA-N Met-Tyr-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N XTSBLBXAUIBMLW-KKUMJFAQSA-N 0.000 description 1
- FSTWDRPCQQUJIT-NHCYSSNCSA-N Met-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N FSTWDRPCQQUJIT-NHCYSSNCSA-N 0.000 description 1
- OTKQHDPECKUDSB-SZMVWBNQSA-N Met-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 OTKQHDPECKUDSB-SZMVWBNQSA-N 0.000 description 1
- 102000003792 Metallothionein Human genes 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 241000699660 Mus musculus Species 0.000 description 1
- 108010021466 Mutant Proteins Proteins 0.000 description 1
- 102000008300 Mutant Proteins Human genes 0.000 description 1
- 208000033761 Myelogenous Chronic BCR-ABL Positive Leukemia Diseases 0.000 description 1
- SNIXRMIHFOIVBB-UHFFFAOYSA-N N-Hydroxyl-tryptamine Chemical compound C1=CC=C2C(CCNO)=CNC2=C1 SNIXRMIHFOIVBB-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- 108010010875 NKISK peptide Proteins 0.000 description 1
- BZQFBWGGLXLEPQ-UHFFFAOYSA-N O-phosphoryl-L-serine Natural products OC(=O)C(N)COP(O)(O)=O BZQFBWGGLXLEPQ-UHFFFAOYSA-N 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 1
- 108700020796 Oncogene Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 241000237988 Patellidae Species 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- 108010071384 Peptide T Proteins 0.000 description 1
- MDHZEOMXGNBSIL-DLOVCJGASA-N Phe-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N MDHZEOMXGNBSIL-DLOVCJGASA-N 0.000 description 1
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 1
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 1
- YRKFKTQRVBJYLT-CQDKDKBSSA-N Phe-Ala-His Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 YRKFKTQRVBJYLT-CQDKDKBSSA-N 0.000 description 1
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 1
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 1
- LBSARGIQACMGDF-WBAXXEDZSA-N Phe-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 LBSARGIQACMGDF-WBAXXEDZSA-N 0.000 description 1
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 1
- DPUOLKQSMYLRDR-UBHSHLNASA-N Phe-Arg-Ala Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 DPUOLKQSMYLRDR-UBHSHLNASA-N 0.000 description 1
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 1
- VHWOBXIWBDWZHK-IHRRRGAJSA-N Phe-Arg-Asp Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 VHWOBXIWBDWZHK-IHRRRGAJSA-N 0.000 description 1
- MQWISMJKHOUEMW-ULQDDVLXSA-N Phe-Arg-His Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 MQWISMJKHOUEMW-ULQDDVLXSA-N 0.000 description 1
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 1
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 1
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 1
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 1
- HTKNPQZCMLBOTQ-XVSYOHENSA-N Phe-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O HTKNPQZCMLBOTQ-XVSYOHENSA-N 0.000 description 1
- JOXIIFVCSATTDH-IHPCNDPISA-N Phe-Asn-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JOXIIFVCSATTDH-IHPCNDPISA-N 0.000 description 1
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 1
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 1
- UEXCHCYDPAIVDE-SRVKXCTJSA-N Phe-Asp-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEXCHCYDPAIVDE-SRVKXCTJSA-N 0.000 description 1
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 1
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 1
- PDUVELWDJZOUEI-IHRRRGAJSA-N Phe-Cys-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PDUVELWDJZOUEI-IHRRRGAJSA-N 0.000 description 1
- FSPGBMWPNMRWDB-AVGNSLFASA-N Phe-Cys-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N FSPGBMWPNMRWDB-AVGNSLFASA-N 0.000 description 1
- FGXIJNMDRCZVDE-KKUMJFAQSA-N Phe-Cys-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N FGXIJNMDRCZVDE-KKUMJFAQSA-N 0.000 description 1
- HPECNYCQLSVCHH-BZSNNMDCSA-N Phe-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N HPECNYCQLSVCHH-BZSNNMDCSA-N 0.000 description 1
- UMKYAYXCMYYNHI-AVGNSLFASA-N Phe-Gln-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N UMKYAYXCMYYNHI-AVGNSLFASA-N 0.000 description 1
- UNLYPPYNDXHGDG-IHRRRGAJSA-N Phe-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UNLYPPYNDXHGDG-IHRRRGAJSA-N 0.000 description 1
- CTNODEMQIKCZGQ-JYJNAYRXSA-N Phe-Gln-His Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 CTNODEMQIKCZGQ-JYJNAYRXSA-N 0.000 description 1
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 1
- IDUCUXTUHHIQIP-SOUVJXGZSA-N Phe-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O IDUCUXTUHHIQIP-SOUVJXGZSA-N 0.000 description 1
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 description 1
- UEADQPLTYBWWTG-AVGNSLFASA-N Phe-Glu-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEADQPLTYBWWTG-AVGNSLFASA-N 0.000 description 1
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 1
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 1
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 1
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 1
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 1
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- VJLLEKDQJSMHRU-STQMWFEESA-N Phe-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O VJLLEKDQJSMHRU-STQMWFEESA-N 0.000 description 1
- SWCOXQLDICUYOL-ULQDDVLXSA-N Phe-His-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SWCOXQLDICUYOL-ULQDDVLXSA-N 0.000 description 1
- SFKOEHXABNPLRT-KBPBESRZSA-N Phe-His-Gly Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)NCC(O)=O SFKOEHXABNPLRT-KBPBESRZSA-N 0.000 description 1
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 description 1
- BVHFFNYBKRTSIU-MEYUZBJRSA-N Phe-His-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BVHFFNYBKRTSIU-MEYUZBJRSA-N 0.000 description 1
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 description 1
- RGZYXNFHYRFNNS-MXAVVETBSA-N Phe-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGZYXNFHYRFNNS-MXAVVETBSA-N 0.000 description 1
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 1
- ONORAGIFHNAADN-LLLHUVSDSA-N Phe-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N ONORAGIFHNAADN-LLLHUVSDSA-N 0.000 description 1
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 1
- JQLQUPIYYJXZLJ-ZEWNOJEFSA-N Phe-Ile-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 JQLQUPIYYJXZLJ-ZEWNOJEFSA-N 0.000 description 1
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 1
- XMQSOOJRRVEHRO-ULQDDVLXSA-N Phe-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMQSOOJRRVEHRO-ULQDDVLXSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 1
- KPEIBEPEUAZWNS-ULQDDVLXSA-N Phe-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KPEIBEPEUAZWNS-ULQDDVLXSA-N 0.000 description 1
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 1
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 1
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 1
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 1
- MJAYDXWQQUOURZ-JYJNAYRXSA-N Phe-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MJAYDXWQQUOURZ-JYJNAYRXSA-N 0.000 description 1
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 1
- OXKJSGGTHFMGDT-UFYCRDLUSA-N Phe-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C1=CC=CC=C1 OXKJSGGTHFMGDT-UFYCRDLUSA-N 0.000 description 1
- OWSLLRKCHLTUND-BZSNNMDCSA-N Phe-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OWSLLRKCHLTUND-BZSNNMDCSA-N 0.000 description 1
- DEZCWWXTRAKZKJ-UFYCRDLUSA-N Phe-Phe-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DEZCWWXTRAKZKJ-UFYCRDLUSA-N 0.000 description 1
- CBENHWCORLVGEQ-HJOGWXRNSA-N Phe-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CBENHWCORLVGEQ-HJOGWXRNSA-N 0.000 description 1
- ZVRJWDUPIDMHDN-ULQDDVLXSA-N Phe-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 ZVRJWDUPIDMHDN-ULQDDVLXSA-N 0.000 description 1
- HBXAOEBRGLCLIW-AVGNSLFASA-N Phe-Ser-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HBXAOEBRGLCLIW-AVGNSLFASA-N 0.000 description 1
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 1
- GKRCCTYAGQPMMP-IHRRRGAJSA-N Phe-Ser-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GKRCCTYAGQPMMP-IHRRRGAJSA-N 0.000 description 1
- GLJZDMZJHFXJQG-BZSNNMDCSA-N Phe-Ser-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLJZDMZJHFXJQG-BZSNNMDCSA-N 0.000 description 1
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- BSTPNLNKHKBONJ-HTUGSXCWSA-N Phe-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O BSTPNLNKHKBONJ-HTUGSXCWSA-N 0.000 description 1
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 1
- PTDAGKJHZBGDKD-OEAJRASXSA-N Phe-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O PTDAGKJHZBGDKD-OEAJRASXSA-N 0.000 description 1
- ZVJGAXNBBKPYOE-HKUYNNGSSA-N Phe-Trp-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 ZVJGAXNBBKPYOE-HKUYNNGSSA-N 0.000 description 1
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 1
- CVAUVSOFHJKCHN-BZSNNMDCSA-N Phe-Tyr-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=CC=C1 CVAUVSOFHJKCHN-BZSNNMDCSA-N 0.000 description 1
- NHHZWPNMYQUNEH-ACRUOGEOSA-N Phe-Tyr-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N NHHZWPNMYQUNEH-ACRUOGEOSA-N 0.000 description 1
- BQMFWUKNOCJDNV-HJWJTTGWSA-N Phe-Val-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQMFWUKNOCJDNV-HJWJTTGWSA-N 0.000 description 1
- GNZCMRRSXOBHLC-JYJNAYRXSA-N Phe-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N GNZCMRRSXOBHLC-JYJNAYRXSA-N 0.000 description 1
- 241000235648 Pichia Species 0.000 description 1
- 206010035226 Plasma cell myeloma Diseases 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 1
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- OLHDPZMYUSBGDE-GUBZILKMSA-N Pro-Arg-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O OLHDPZMYUSBGDE-GUBZILKMSA-N 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 1
- QBFONMUYNSNKIX-AVGNSLFASA-N Pro-Arg-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QBFONMUYNSNKIX-AVGNSLFASA-N 0.000 description 1
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 1
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 1
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 1
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 1
- LCWXSALTPTZKNM-CIUDSAMLSA-N Pro-Cys-Glu Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O LCWXSALTPTZKNM-CIUDSAMLSA-N 0.000 description 1
- OGRYXQOUFHAMPI-DCAQKATOSA-N Pro-Cys-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O OGRYXQOUFHAMPI-DCAQKATOSA-N 0.000 description 1
- YKQNVTOIYFQMLW-IHRRRGAJSA-N Pro-Cys-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 YKQNVTOIYFQMLW-IHRRRGAJSA-N 0.000 description 1
- OZAPWFHRPINHND-GUBZILKMSA-N Pro-Cys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OZAPWFHRPINHND-GUBZILKMSA-N 0.000 description 1
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 1
- SNIPWBQKOPCJRG-CIUDSAMLSA-N Pro-Gln-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O SNIPWBQKOPCJRG-CIUDSAMLSA-N 0.000 description 1
- FISHYTLIMUYTQY-GUBZILKMSA-N Pro-Gln-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 FISHYTLIMUYTQY-GUBZILKMSA-N 0.000 description 1
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 1
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 1
- XZONQWUEBAFQPO-HJGDQZAQSA-N Pro-Gln-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZONQWUEBAFQPO-HJGDQZAQSA-N 0.000 description 1
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 1
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 1
- QNZLIVROMORQFH-BQBZGAKWSA-N Pro-Gly-Cys Chemical compound C1C[C@H](NC1)C(=O)NCC(=O)N[C@@H](CS)C(=O)O QNZLIVROMORQFH-BQBZGAKWSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- SSWJYJHXQOYTSP-SRVKXCTJSA-N Pro-His-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O SSWJYJHXQOYTSP-SRVKXCTJSA-N 0.000 description 1
- JRQCDSNPRNGWRG-AVGNSLFASA-N Pro-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2 JRQCDSNPRNGWRG-AVGNSLFASA-N 0.000 description 1
- XQHGISDMVBTGAL-ULQDDVLXSA-N Pro-His-Phe Chemical compound C([C@@H](C(=O)[O-])NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H]1[NH2+]CCC1)C1=CC=CC=C1 XQHGISDMVBTGAL-ULQDDVLXSA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- RYJRPPUATSKNAY-STECZYCISA-N Pro-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@@H]2CCCN2 RYJRPPUATSKNAY-STECZYCISA-N 0.000 description 1
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 1
- HATVCTYBNCNMAA-AVGNSLFASA-N Pro-Leu-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O HATVCTYBNCNMAA-AVGNSLFASA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- SXMSEHDMNIUTSP-DCAQKATOSA-N Pro-Lys-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SXMSEHDMNIUTSP-DCAQKATOSA-N 0.000 description 1
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 1
- BLJMJZOMZRCESA-GUBZILKMSA-N Pro-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BLJMJZOMZRCESA-GUBZILKMSA-N 0.000 description 1
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 1
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 1
- SWRNSCMUXRLHCR-ULQDDVLXSA-N Pro-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 SWRNSCMUXRLHCR-ULQDDVLXSA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 1
- RNEFESSBTOQSAC-DCAQKATOSA-N Pro-Ser-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O RNEFESSBTOQSAC-DCAQKATOSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- XSXABUHLKPUVLX-JYJNAYRXSA-N Pro-Ser-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O XSXABUHLKPUVLX-JYJNAYRXSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 1
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 1
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 1
- VPBQDHMASPJHGY-JYJNAYRXSA-N Pro-Trp-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CO)C(=O)O VPBQDHMASPJHGY-JYJNAYRXSA-N 0.000 description 1
- CWZUFLWPEFHWEI-IHRRRGAJSA-N Pro-Tyr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O CWZUFLWPEFHWEI-IHRRRGAJSA-N 0.000 description 1
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 1
- UIUWGMRJTWHIJZ-ULQDDVLXSA-N Pro-Tyr-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O UIUWGMRJTWHIJZ-ULQDDVLXSA-N 0.000 description 1
- DYJTXTCEXMCPBF-UFYCRDLUSA-N Pro-Tyr-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O DYJTXTCEXMCPBF-UFYCRDLUSA-N 0.000 description 1
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 241000125945 Protoparvovirus Species 0.000 description 1
- 101100408135 Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) phnA gene Proteins 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 108020005067 RNA Splice Sites Proteins 0.000 description 1
- 101000852966 Rattus norvegicus Interleukin-1 receptor-like 1 Proteins 0.000 description 1
- 241000244173 Rhabditis Species 0.000 description 1
- 101710141795 Ribonuclease inhibitor Proteins 0.000 description 1
- 229940122208 Ribonuclease inhibitor Drugs 0.000 description 1
- 102100037968 Ribonuclease inhibitor Human genes 0.000 description 1
- 229920005654 Sephadex Polymers 0.000 description 1
- 239000012507 Sephadex™ Substances 0.000 description 1
- 229920002684 Sepharose Polymers 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- IDCKUIWEIZYVSO-WFBYXXMGSA-N Ser-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C)C(O)=O)=CNC2=C1 IDCKUIWEIZYVSO-WFBYXXMGSA-N 0.000 description 1
- JJKSSJVYOVRJMZ-FXQIFTODSA-N Ser-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)CN=C(N)N JJKSSJVYOVRJMZ-FXQIFTODSA-N 0.000 description 1
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 1
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 1
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 1
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- KNCJWSPMTFFJII-ZLUOBGJFSA-N Ser-Cys-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KNCJWSPMTFFJII-ZLUOBGJFSA-N 0.000 description 1
- TUYBIWUZWJUZDD-ACZMJKKPSA-N Ser-Cys-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(N)=O TUYBIWUZWJUZDD-ACZMJKKPSA-N 0.000 description 1
- WTPKKLMBNBCCNL-ACZMJKKPSA-N Ser-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N WTPKKLMBNBCCNL-ACZMJKKPSA-N 0.000 description 1
- COLJZWUVZIXSSS-CIUDSAMLSA-N Ser-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N COLJZWUVZIXSSS-CIUDSAMLSA-N 0.000 description 1
- KMWFXJCGRXBQAC-CIUDSAMLSA-N Ser-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N KMWFXJCGRXBQAC-CIUDSAMLSA-N 0.000 description 1
- DGHFNYXVIXNNMC-GUBZILKMSA-N Ser-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N DGHFNYXVIXNNMC-GUBZILKMSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 1
- GYXVUTAOICLGKJ-ACZMJKKPSA-N Ser-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N GYXVUTAOICLGKJ-ACZMJKKPSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- UICKAKRRRBTILH-GUBZILKMSA-N Ser-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N UICKAKRRRBTILH-GUBZILKMSA-N 0.000 description 1
- BRIZMMZEYSAKJX-QEJZJMRPSA-N Ser-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N BRIZMMZEYSAKJX-QEJZJMRPSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- ZFVFHHZBCVNLGD-GUBZILKMSA-N Ser-His-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFVFHHZBCVNLGD-GUBZILKMSA-N 0.000 description 1
- MLSQXWSRHURDMF-GARJFASQSA-N Ser-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N)C(=O)O MLSQXWSRHURDMF-GARJFASQSA-N 0.000 description 1
- ZUDXUJSYCCNZQJ-DCAQKATOSA-N Ser-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N ZUDXUJSYCCNZQJ-DCAQKATOSA-N 0.000 description 1
- BXLYSRPHVMCOPS-ACZMJKKPSA-N Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO BXLYSRPHVMCOPS-ACZMJKKPSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- WGDYNRCOQRERLZ-KKUMJFAQSA-N Ser-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N WGDYNRCOQRERLZ-KKUMJFAQSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 1
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- MHVXPTAMDHLTHB-IHPCNDPISA-N Ser-Phe-Trp Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MHVXPTAMDHLTHB-IHPCNDPISA-N 0.000 description 1
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 1
- AABIBDJHSKIMJK-FXQIFTODSA-N Ser-Ser-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O AABIBDJHSKIMJK-FXQIFTODSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 1
- WMZVVNLPHFSUPA-BPUTZDHNSA-N Ser-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 WMZVVNLPHFSUPA-BPUTZDHNSA-N 0.000 description 1
- AXKJPUBALUNJEO-UBHSHLNASA-N Ser-Trp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O AXKJPUBALUNJEO-UBHSHLNASA-N 0.000 description 1
- UQGAAZXSCGWMFU-UBHSHLNASA-N Ser-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N UQGAAZXSCGWMFU-UBHSHLNASA-N 0.000 description 1
- NERYDXBVARJIQS-JYBASQMISA-N Ser-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N)O NERYDXBVARJIQS-JYBASQMISA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 1
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 1
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 1
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 1
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 1
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 1
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 1
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 1
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 108010071390 Serum Albumin Proteins 0.000 description 1
- 102000007562 Serum Albumin Human genes 0.000 description 1
- 108010043943 Starch Phosphorylase Proteins 0.000 description 1
- 206010043376 Tetanus Diseases 0.000 description 1
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 1
- LHUBVKCLOVALIA-HJGDQZAQSA-N Thr-Arg-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LHUBVKCLOVALIA-HJGDQZAQSA-N 0.000 description 1
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 1
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 1
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 1
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 1
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 1
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 1
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 1
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 1
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 1
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 1
- DSLHSTIUAPKERR-XGEHTFHBSA-N Thr-Cys-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O DSLHSTIUAPKERR-XGEHTFHBSA-N 0.000 description 1
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 1
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 1
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 description 1
- VULNJDORNLBPNG-SWRJLBSHSA-N Thr-Glu-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VULNJDORNLBPNG-SWRJLBSHSA-N 0.000 description 1
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 1
- WYKJENSCCRJLRC-ZDLURKLDSA-N Thr-Gly-Cys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O WYKJENSCCRJLRC-ZDLURKLDSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- VUSAEKOXGNEYNE-PBCZWWQYSA-N Thr-His-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VUSAEKOXGNEYNE-PBCZWWQYSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 1
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- ISLDRLHVPXABBC-IEGACIPQSA-N Thr-Leu-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISLDRLHVPXABBC-IEGACIPQSA-N 0.000 description 1
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 1
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 description 1
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 1
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 1
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 1
- SIEZEMFJLYRUMK-YTWAJWBKSA-N Thr-Met-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N)O SIEZEMFJLYRUMK-YTWAJWBKSA-N 0.000 description 1
- IQHUITKNHOKGFC-MIMYLULJSA-N Thr-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IQHUITKNHOKGFC-MIMYLULJSA-N 0.000 description 1
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 1
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 1
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 1
- VEIKMWOMUYMMMK-FCLVOEFKSA-N Thr-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VEIKMWOMUYMMMK-FCLVOEFKSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- BCYUHPXBHCUYBA-CUJWVEQBSA-N Thr-Ser-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BCYUHPXBHCUYBA-CUJWVEQBSA-N 0.000 description 1
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- VGNLMPBYWWNQFS-ZEILLAHLSA-N Thr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O VGNLMPBYWWNQFS-ZEILLAHLSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 1
- CSZFFQBUTMGHAH-UAXMHLISSA-N Thr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O CSZFFQBUTMGHAH-UAXMHLISSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- BJJRNAVDQGREGC-HOUAVDHOSA-N Thr-Trp-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O BJJRNAVDQGREGC-HOUAVDHOSA-N 0.000 description 1
- JNKAYADBODLPMQ-HSHDSVGOSA-N Thr-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)=CNC2=C1 JNKAYADBODLPMQ-HSHDSVGOSA-N 0.000 description 1
- RPECVQBNONKZAT-WZLNRYEVSA-N Thr-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H]([C@@H](C)O)N RPECVQBNONKZAT-WZLNRYEVSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 1
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 1
- SBYQHZCMVSPQCS-RCWTZXSCSA-N Thr-Val-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O SBYQHZCMVSPQCS-RCWTZXSCSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 102000002689 Toll-like receptor Human genes 0.000 description 1
- 108020000411 Toll-like receptor Proteins 0.000 description 1
- BDWDMRSGCXEDMR-WFBYXXMGSA-N Trp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BDWDMRSGCXEDMR-WFBYXXMGSA-N 0.000 description 1
- WFZYXGSAPWKTHR-XEGUGMAKSA-N Trp-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WFZYXGSAPWKTHR-XEGUGMAKSA-N 0.000 description 1
- MQVGIFJSFFVGFW-XEGUGMAKSA-N Trp-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MQVGIFJSFFVGFW-XEGUGMAKSA-N 0.000 description 1
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 1
- AVYVKJMBNLPWRX-WFBYXXMGSA-N Trp-Ala-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 AVYVKJMBNLPWRX-WFBYXXMGSA-N 0.000 description 1
- HYNAKPYFEYJMAS-XIRDDKMYSA-N Trp-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HYNAKPYFEYJMAS-XIRDDKMYSA-N 0.000 description 1
- SCQBNMKLZVCXNX-ZFWWWQNUSA-N Trp-Arg-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N SCQBNMKLZVCXNX-ZFWWWQNUSA-N 0.000 description 1
- NXAPHBHZCMQORW-FDARSICLSA-N Trp-Arg-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NXAPHBHZCMQORW-FDARSICLSA-N 0.000 description 1
- DXDMNBJJEXYMLA-UBHSHLNASA-N Trp-Asn-Asp Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 DXDMNBJJEXYMLA-UBHSHLNASA-N 0.000 description 1
- XZSJDSBPEJBEFZ-QRTARXTBSA-N Trp-Asn-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O XZSJDSBPEJBEFZ-QRTARXTBSA-N 0.000 description 1
- GKUROEIXVURAAO-BPUTZDHNSA-N Trp-Asp-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GKUROEIXVURAAO-BPUTZDHNSA-N 0.000 description 1
- FKAPNDWDLDWZNF-QEJZJMRPSA-N Trp-Asp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FKAPNDWDLDWZNF-QEJZJMRPSA-N 0.000 description 1
- GTNCSPKYWCJZAC-XIRDDKMYSA-N Trp-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GTNCSPKYWCJZAC-XIRDDKMYSA-N 0.000 description 1
- PMIJXCLOQFMOKZ-BPUTZDHNSA-N Trp-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N PMIJXCLOQFMOKZ-BPUTZDHNSA-N 0.000 description 1
- GZTKZDGIEBKZAH-XIRDDKMYSA-N Trp-Cys-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N GZTKZDGIEBKZAH-XIRDDKMYSA-N 0.000 description 1
- NLYCSLWTDMPLSX-QEJZJMRPSA-N Trp-Gln-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N NLYCSLWTDMPLSX-QEJZJMRPSA-N 0.000 description 1
- AFSYEUHJBVCPEL-JBACZVJFSA-N Trp-Gln-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 AFSYEUHJBVCPEL-JBACZVJFSA-N 0.000 description 1
- GWQUSADRQCTMHN-NWLDYVSISA-N Trp-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O GWQUSADRQCTMHN-NWLDYVSISA-N 0.000 description 1
- YHRCLOURJWJABF-WDSOQIARSA-N Trp-His-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N YHRCLOURJWJABF-WDSOQIARSA-N 0.000 description 1
- NOBINHCGDUHOBV-NAZCDGGXSA-N Trp-His-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NOBINHCGDUHOBV-NAZCDGGXSA-N 0.000 description 1
- AZBIIKDSDLVJAK-VHWLVUOQSA-N Trp-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N AZBIIKDSDLVJAK-VHWLVUOQSA-N 0.000 description 1
- UJRIVCPPPMYCNA-HOCLYGCPSA-N Trp-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UJRIVCPPPMYCNA-HOCLYGCPSA-N 0.000 description 1
- RRVUOLRWIZXBRQ-IHPCNDPISA-N Trp-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RRVUOLRWIZXBRQ-IHPCNDPISA-N 0.000 description 1
- NLLARHRWSFNEMH-NUTKFTJISA-N Trp-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NLLARHRWSFNEMH-NUTKFTJISA-N 0.000 description 1
- GQNCRIFNDVFRNF-BPUTZDHNSA-N Trp-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O GQNCRIFNDVFRNF-BPUTZDHNSA-N 0.000 description 1
- IVBJBFSWJDNQFW-XIRDDKMYSA-N Trp-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IVBJBFSWJDNQFW-XIRDDKMYSA-N 0.000 description 1
- RNDWCRUOGGQDKN-UBHSHLNASA-N Trp-Ser-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RNDWCRUOGGQDKN-UBHSHLNASA-N 0.000 description 1
- HWCBFXAWVTXXHZ-NYVOZVTQSA-N Trp-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N HWCBFXAWVTXXHZ-NYVOZVTQSA-N 0.000 description 1
- HHPSUFUXXBOFQY-AQZXSJQPSA-N Trp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O HHPSUFUXXBOFQY-AQZXSJQPSA-N 0.000 description 1
- VMXLNDRJXVAJFT-JYBASQMISA-N Trp-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O VMXLNDRJXVAJFT-JYBASQMISA-N 0.000 description 1
- ZPZNQAZHMCLTOA-PXDAIIFMSA-N Trp-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 ZPZNQAZHMCLTOA-PXDAIIFMSA-N 0.000 description 1
- NMOIRIIIUVELLY-WDSOQIARSA-N Trp-Val-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)C(C)C)=CNC2=C1 NMOIRIIIUVELLY-WDSOQIARSA-N 0.000 description 1
- FFWCYWZIVFIUDM-OYDLWJJNSA-N Trp-Val-Trp Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O FFWCYWZIVFIUDM-OYDLWJJNSA-N 0.000 description 1
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 1
- QJBWZNTWJSZUOY-UWJYBYFXSA-N Tyr-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QJBWZNTWJSZUOY-UWJYBYFXSA-N 0.000 description 1
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 1
- IIJWXEUNETVJPV-IHRRRGAJSA-N Tyr-Arg-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N)O IIJWXEUNETVJPV-IHRRRGAJSA-N 0.000 description 1
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 1
- CKKFTIQYURNSEI-IHRRRGAJSA-N Tyr-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CKKFTIQYURNSEI-IHRRRGAJSA-N 0.000 description 1
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 1
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 1
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 1
- WEFIPBYPXZYPHD-HJPIBITLSA-N Tyr-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WEFIPBYPXZYPHD-HJPIBITLSA-N 0.000 description 1
- BVDHHLMIZFCAAU-BZSNNMDCSA-N Tyr-Cys-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BVDHHLMIZFCAAU-BZSNNMDCSA-N 0.000 description 1
- QOEZFICGUZTRFX-IHRRRGAJSA-N Tyr-Cys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O QOEZFICGUZTRFX-IHRRRGAJSA-N 0.000 description 1
- ARPONUQDNWLXOZ-KKUMJFAQSA-N Tyr-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ARPONUQDNWLXOZ-KKUMJFAQSA-N 0.000 description 1
- CWQZAUYFWRLITN-AVGNSLFASA-N Tyr-Gln-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O CWQZAUYFWRLITN-AVGNSLFASA-N 0.000 description 1
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 1
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 1
- HDSKHCBAVVWPCQ-FHWLQOOXSA-N Tyr-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HDSKHCBAVVWPCQ-FHWLQOOXSA-N 0.000 description 1
- LHTGRUZSZOIAKM-SOUVJXGZSA-N Tyr-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O LHTGRUZSZOIAKM-SOUVJXGZSA-N 0.000 description 1
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 1
- JWGXUKHIKXZWNG-RYUDHWBXSA-N Tyr-Gly-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JWGXUKHIKXZWNG-RYUDHWBXSA-N 0.000 description 1
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 1
- QJKMCQRFHJRIPU-XDTLVQLUSA-N Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QJKMCQRFHJRIPU-XDTLVQLUSA-N 0.000 description 1
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 1
- YMUQBRQQCPQEQN-CXTHYWKRSA-N Tyr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YMUQBRQQCPQEQN-CXTHYWKRSA-N 0.000 description 1
- QSFJHIRIHOJRKS-ULQDDVLXSA-N Tyr-Leu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QSFJHIRIHOJRKS-ULQDDVLXSA-N 0.000 description 1
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 1
- IGXLNVIYDYONFB-UFYCRDLUSA-N Tyr-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 IGXLNVIYDYONFB-UFYCRDLUSA-N 0.000 description 1
- BGFCXQXETBDEHP-BZSNNMDCSA-N Tyr-Phe-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O BGFCXQXETBDEHP-BZSNNMDCSA-N 0.000 description 1
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 1
- PLXQRTXVLZUNMU-RNXOBYDBSA-N Tyr-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC4=CC=C(C=C4)O)N PLXQRTXVLZUNMU-RNXOBYDBSA-N 0.000 description 1
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 1
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 1
- VYTUETMEZZLJFU-IHRRRGAJSA-N Tyr-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)N[C@@H](CS)C(=O)O VYTUETMEZZLJFU-IHRRRGAJSA-N 0.000 description 1
- ARMNWLJYHCOSHE-KKUMJFAQSA-N Tyr-Pro-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O ARMNWLJYHCOSHE-KKUMJFAQSA-N 0.000 description 1
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 1
- IEWKKXZRJLTIOV-AVGNSLFASA-N Tyr-Ser-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O IEWKKXZRJLTIOV-AVGNSLFASA-N 0.000 description 1
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 1
- QFHRUCJIRVILCK-YJRXYDGGSA-N Tyr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O QFHRUCJIRVILCK-YJRXYDGGSA-N 0.000 description 1
- JHDZONWZTCKTJR-KJEVXHAQSA-N Tyr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JHDZONWZTCKTJR-KJEVXHAQSA-N 0.000 description 1
- JRMCISZDVLOTLR-BVSLBCMMSA-N Tyr-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N JRMCISZDVLOTLR-BVSLBCMMSA-N 0.000 description 1
- GPLTZEMVOCZVAV-UFYCRDLUSA-N Tyr-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 GPLTZEMVOCZVAV-UFYCRDLUSA-N 0.000 description 1
- KRXFXDCNKLANCP-CXTHYWKRSA-N Tyr-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 KRXFXDCNKLANCP-CXTHYWKRSA-N 0.000 description 1
- QVYFTFIBKCDHIE-ACRUOGEOSA-N Tyr-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O QVYFTFIBKCDHIE-ACRUOGEOSA-N 0.000 description 1
- TYGHOWWWMTWVKM-HJOGWXRNSA-N Tyr-Tyr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 TYGHOWWWMTWVKM-HJOGWXRNSA-N 0.000 description 1
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 1
- ABSXSJZNRAQDDI-KJEVXHAQSA-N Tyr-Val-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ABSXSJZNRAQDDI-KJEVXHAQSA-N 0.000 description 1
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 1
- JYVKKBDANPZIAW-AVGNSLFASA-N Val-Arg-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N JYVKKBDANPZIAW-AVGNSLFASA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 1
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 1
- WKWJJQZZZBBWKV-JYJNAYRXSA-N Val-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WKWJJQZZZBBWKV-JYJNAYRXSA-N 0.000 description 1
- UBTBGUDNDFZLGP-SRVKXCTJSA-N Val-Arg-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UBTBGUDNDFZLGP-SRVKXCTJSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- GNWUWQAVVJQREM-NHCYSSNCSA-N Val-Asn-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GNWUWQAVVJQREM-NHCYSSNCSA-N 0.000 description 1
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 1
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 1
- KXUKIBHIVRYOIP-ZKWXMUAHSA-N Val-Asp-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KXUKIBHIVRYOIP-ZKWXMUAHSA-N 0.000 description 1
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 1
- KOPBYUSPXBQIHD-NRPADANISA-N Val-Cys-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KOPBYUSPXBQIHD-NRPADANISA-N 0.000 description 1
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 1
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 1
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 1
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- NYTKXWLZSNRILS-IFFSRLJSSA-N Val-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N)O NYTKXWLZSNRILS-IFFSRLJSSA-N 0.000 description 1
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- JVYIGCARISMLMV-HOCLYGCPSA-N Val-Gly-Trp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JVYIGCARISMLMV-HOCLYGCPSA-N 0.000 description 1
- OPGWZDIYEYJVRX-AVGNSLFASA-N Val-His-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OPGWZDIYEYJVRX-AVGNSLFASA-N 0.000 description 1
- KVRLNEILGGVBJX-IHRRRGAJSA-N Val-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CN=CN1 KVRLNEILGGVBJX-IHRRRGAJSA-N 0.000 description 1
- ZTKGDWOUYRRAOQ-ULQDDVLXSA-N Val-His-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N ZTKGDWOUYRRAOQ-ULQDDVLXSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 1
- BZWUSZGQOILYEU-STECZYCISA-N Val-Ile-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BZWUSZGQOILYEU-STECZYCISA-N 0.000 description 1
- WDIWOIRFNMLNKO-ULQDDVLXSA-N Val-Leu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WDIWOIRFNMLNKO-ULQDDVLXSA-N 0.000 description 1
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- WHVSJHJTMUHYBT-SRVKXCTJSA-N Val-Met-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N WHVSJHJTMUHYBT-SRVKXCTJSA-N 0.000 description 1
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 1
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 1
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 1
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 1
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 1
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 1
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 1
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 1
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 1
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 1
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 1
- QWCZXKIFPWPQHR-JYJNAYRXSA-N Val-Pro-Tyr Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QWCZXKIFPWPQHR-JYJNAYRXSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 1
- UEXPMFIAZZHEAD-HSHDSVGOSA-N Val-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](C(C)C)N)O UEXPMFIAZZHEAD-HSHDSVGOSA-N 0.000 description 1
- LZRWTJSPTJSWDN-FKBYEOEOSA-N Val-Trp-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N LZRWTJSPTJSWDN-FKBYEOEOSA-N 0.000 description 1
- HOZAIQIEJTWWDG-HJOGWXRNSA-N Val-Trp-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N HOZAIQIEJTWWDG-HJOGWXRNSA-N 0.000 description 1
- VBTFUDNTMCHPII-FKBYEOEOSA-N Val-Trp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O VBTFUDNTMCHPII-FKBYEOEOSA-N 0.000 description 1
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 1
- 229930003756 Vitamin B7 Natural products 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 150000008065 acid anhydrides Chemical class 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 239000004480 active ingredient Substances 0.000 description 1
- 208000021841 acute erythroid leukemia Diseases 0.000 description 1
- 238000001261 affinity purification Methods 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- 150000001336 alkenes Chemical class 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 150000001408 amides Chemical class 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 238000003453 ammonium sulfate precipitation method Methods 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- 150000008064 anhydrides Chemical class 0.000 description 1
- 238000003975 animal breeding Methods 0.000 description 1
- 230000003042 antagnostic effect Effects 0.000 description 1
- 238000011091 antibody purification Methods 0.000 description 1
- 239000004599 antimicrobial Substances 0.000 description 1
- 239000012736 aqueous medium Substances 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 238000000429 assembly Methods 0.000 description 1
- 230000000712 assembly Effects 0.000 description 1
- 210000003050 axon Anatomy 0.000 description 1
- 150000001540 azides Chemical class 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 238000004166 bioassay Methods 0.000 description 1
- 230000007321 biological mechanism Effects 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 210000003995 blood forming stem cell Anatomy 0.000 description 1
- 229940098773 bovine serum albumin Drugs 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 150000001718 carbodiimides Chemical class 0.000 description 1
- 125000000837 carbohydrate group Chemical group 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 239000003054 catalyst Substances 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 230000024245 cell differentiation Effects 0.000 description 1
- 230000003915 cell function Effects 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 239000012560 cell impurity Substances 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000008614 cellular interaction Effects 0.000 description 1
- 230000007248 cellular mechanism Effects 0.000 description 1
- 201000010881 cervical cancer Diseases 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000012412 chemical coupling Methods 0.000 description 1
- 230000003399 chemotactic effect Effects 0.000 description 1
- VDQQXEISLMTGAB-UHFFFAOYSA-N chloramine T Chemical compound [Na+].CC1=CC=C(S(=O)(=O)[N-]Cl)C=C1 VDQQXEISLMTGAB-UHFFFAOYSA-N 0.000 description 1
- 125000004218 chloromethyl group Chemical group [H]C([H])(Cl)* 0.000 description 1
- 229910001179 chromel Inorganic materials 0.000 description 1
- 238000003200 chromosome mapping Methods 0.000 description 1
- 210000001072 colon Anatomy 0.000 description 1
- 238000004440 column chromatography Methods 0.000 description 1
- 238000012875 competitive assay Methods 0.000 description 1
- 230000006957 competitive inhibition Effects 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000003750 conditioning effect Effects 0.000 description 1
- 108091036078 conserved sequence Proteins 0.000 description 1
- 230000037011 constitutive activity Effects 0.000 description 1
- 239000012050 conventional carrier Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 239000008358 core component Substances 0.000 description 1
- 230000037029 cross reaction Effects 0.000 description 1
- 238000002425 crystallisation Methods 0.000 description 1
- 230000008025 crystallization Effects 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 1
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 108010057085 cytokine receptors Proteins 0.000 description 1
- 102000003675 cytokine receptors Human genes 0.000 description 1
- 210000003104 cytoplasmic structure Anatomy 0.000 description 1
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 1
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 239000003405 delayed action preparation Substances 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 229950006137 dexfosfoserine Drugs 0.000 description 1
- 238000002405 diagnostic procedure Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 238000009792 diffusion process Methods 0.000 description 1
- 239000006185 dispersion Substances 0.000 description 1
- 238000007878 drug screening assay Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 230000009088 enzymatic function Effects 0.000 description 1
- 210000000981 epithelium Anatomy 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 210000003754 fetus Anatomy 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- 238000002875 fluorescence polarization Methods 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 238000004108 freeze drying Methods 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 229960002989 glutamic acid Drugs 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 1
- 230000002414 glycolytic effect Effects 0.000 description 1
- 102000035122 glycosylated proteins Human genes 0.000 description 1
- 108091005608 glycosylated proteins Proteins 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010081985 glycyl-cystinyl-aspartic acid Proteins 0.000 description 1
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 1
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 210000003714 granulocyte Anatomy 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- 229910052736 halogen Inorganic materials 0.000 description 1
- 150000002367 halogens Chemical class 0.000 description 1
- 229960002897 heparin Drugs 0.000 description 1
- 229920000669 heparin Polymers 0.000 description 1
- 229960001340 histamine Drugs 0.000 description 1
- 108010009253 histidyl-asparaginyl-glutamyl-leucine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 210000000003 hoof Anatomy 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 210000003917 human chromosome Anatomy 0.000 description 1
- 210000004754 hybrid cell Anatomy 0.000 description 1
- OAKJQQAXSVQMHS-UHFFFAOYSA-N hydrazine Substances NN OAKJQQAXSVQMHS-UHFFFAOYSA-N 0.000 description 1
- 125000001165 hydrophobic group Chemical group 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 description 1
- 125000004029 hydroxymethyl group Chemical group [H]OC([H])([H])* 0.000 description 1
- 230000007124 immune defense Effects 0.000 description 1
- 230000003053 immunization Effects 0.000 description 1
- 238000002649 immunization Methods 0.000 description 1
- 230000000984 immunochemical effect Effects 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 230000015788 innate immune response Effects 0.000 description 1
- 229960003130 interferon gamma Drugs 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 1
- 208000032839 leukemia Diseases 0.000 description 1
- 210000000265 leukocyte Anatomy 0.000 description 1
- 239000007791 liquid phase Substances 0.000 description 1
- 230000033001 locomotion Effects 0.000 description 1
- 201000005202 lung cancer Diseases 0.000 description 1
- 208000020816 lung neoplasm Diseases 0.000 description 1
- 230000000527 lymphocytic effect Effects 0.000 description 1
- 239000008176 lyophilized powder Substances 0.000 description 1
- 108010053062 lysyl-arginyl-phenylalanyl-lysine Proteins 0.000 description 1
- 108010075702 lysyl-valyl-aspartyl-leucine Proteins 0.000 description 1
- 239000006249 magnetic particle Substances 0.000 description 1
- 210000001161 mammalian embryo Anatomy 0.000 description 1
- 230000013011 mating Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 238000002483 medication Methods 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 1
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010090114 methionyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 108010085203 methionylmethionine Proteins 0.000 description 1
- 210000001616 monocyte Anatomy 0.000 description 1
- 230000001002 morphogenetic effect Effects 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 201000006417 multiple sclerosis Diseases 0.000 description 1
- 230000000869 mutational effect Effects 0.000 description 1
- 201000000050 myeloid neoplasm Diseases 0.000 description 1
- HOGDNTQCSIKEEV-UHFFFAOYSA-N n'-hydroxybutanediamide Chemical compound NC(=O)CCC(=O)NO HOGDNTQCSIKEEV-UHFFFAOYSA-N 0.000 description 1
- 229930014626 natural product Natural products 0.000 description 1
- 238000011392 neighbor-joining method Methods 0.000 description 1
- 208000015122 neurodegenerative disease Diseases 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 235000016709 nutrition Nutrition 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 230000033116 oxidation-reduction process Effects 0.000 description 1
- 238000005192 partition Methods 0.000 description 1
- 238000010647 peptide synthesis reaction Methods 0.000 description 1
- 108040007629 peroxidase activity proteins Proteins 0.000 description 1
- 102000013415 peroxidase activity proteins Human genes 0.000 description 1
- 239000005011 phenolic resin Substances 0.000 description 1
- 229920001568 phenolic resin Polymers 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 108010069184 phenylalanyl-leucyl-glutamyl-glutamyl-isoleucine Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- DCWXELXMIBXGTH-QMMMGPOBSA-N phosphonotyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(OP(O)(O)=O)C=C1 DCWXELXMIBXGTH-QMMMGPOBSA-N 0.000 description 1
- BZQFBWGGLXLEPQ-REOHCLBHSA-N phosphoserine Chemical compound OC(=O)[C@@H](N)COP(O)(O)=O BZQFBWGGLXLEPQ-REOHCLBHSA-N 0.000 description 1
- USRGIUJOYOXOQJ-GBXIJSLDSA-N phosphothreonine Chemical compound OP(=O)(O)O[C@H](C)[C@H](N)C(O)=O USRGIUJOYOXOQJ-GBXIJSLDSA-N 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 230000006461 physiological response Effects 0.000 description 1
- 108010025488 pinealon Proteins 0.000 description 1
- 239000002985 plastic film Substances 0.000 description 1
- 229920006255 plastic film Polymers 0.000 description 1
- 229920000098 polyolefin Polymers 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 238000004393 prognosis Methods 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010065320 prolyl-lysyl-glutamyl-lysine Proteins 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 150000003180 prostaglandins Chemical class 0.000 description 1
- 210000002307 prostate Anatomy 0.000 description 1
- 230000012846 protein folding Effects 0.000 description 1
- 238000004445 quantitative analysis Methods 0.000 description 1
- 238000012113 quantitative test Methods 0.000 description 1
- 239000002516 radical scavenger Substances 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 229910052761 rare earth metal Inorganic materials 0.000 description 1
- 150000002910 rare earth metals Chemical class 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000002787 reinforcement Effects 0.000 description 1
- 239000003161 ribonuclease inhibitor Substances 0.000 description 1
- 239000011435 rock Substances 0.000 description 1
- 230000002000 scavenging effect Effects 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 230000001953 sensory effect Effects 0.000 description 1
- 210000002186 septum of brain Anatomy 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 238000010008 shearing Methods 0.000 description 1
- 230000007727 signaling mechanism Effects 0.000 description 1
- 210000000813 small intestine Anatomy 0.000 description 1
- 210000002460 smooth muscle Anatomy 0.000 description 1
- 108010080244 somatostatin(3-6) Proteins 0.000 description 1
- 238000001179 sorption measurement Methods 0.000 description 1
- 210000004988 splenocyte Anatomy 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 238000006277 sulfonation reaction Methods 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
- 208000011580 syndromic disease Diseases 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 108091035539 telomere Proteins 0.000 description 1
- 102000055501 telomere Human genes 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 210000001550 testis Anatomy 0.000 description 1
- 238000011287 therapeutic dose Methods 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 150000003568 thioethers Chemical class 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 210000003813 thumb Anatomy 0.000 description 1
- 210000001541 thymus gland Anatomy 0.000 description 1
- 229960001479 tosylchloramide sodium Drugs 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 238000011830 transgenic mouse model Methods 0.000 description 1
- 230000010474 transient expression Effects 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 102000027257 transmembrane receptors Human genes 0.000 description 1
- 108091008578 transmembrane receptors Proteins 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- 101150044170 trpE gene Proteins 0.000 description 1
- 108010029384 tryptophyl-histidine Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 108010079202 tyrosyl-alanyl-cysteine Proteins 0.000 description 1
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 230000001196 vasorelaxation Effects 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- FEPMHVLSLDOMQC-UHFFFAOYSA-N virginiamycin-S1 Natural products CC1OC(=O)C(C=2C=CC=CC=2)NC(=O)C2CC(=O)CCN2C(=O)C(CC=2C=CC=CC=2)N(C)C(=O)C2CCCN2C(=O)C(CC)NC(=O)C1NC(=O)C1=NC=CC=C1O FEPMHVLSLDOMQC-UHFFFAOYSA-N 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 239000011735 vitamin B7 Substances 0.000 description 1
- 235000011912 vitamin B7 Nutrition 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
- 230000003313 weakening effect Effects 0.000 description 1
- 230000003245 working effect Effects 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Images
Classifications
-
- F—MECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
- F02—COMBUSTION ENGINES; HOT-GAS OR COMBUSTION-PRODUCT ENGINE PLANTS
- F02D—CONTROLLING COMBUSTION ENGINES
- F02D11/00—Arrangements for, or adaptations to, non-automatic engine control initiation means, e.g. operator initiated
- F02D11/06—Arrangements for, or adaptations to, non-automatic engine control initiation means, e.g. operator initiated characterised by non-mechanical control linkages, e.g. fluid control linkages or by control linkages with power drive or assistance
- F02D11/10—Arrangements for, or adaptations to, non-automatic engine control initiation means, e.g. operator initiated characterised by non-mechanical control linkages, e.g. fluid control linkages or by control linkages with power drive or assistance of the electric type
- F02D11/106—Detection of demand or actuation
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P37/00—Drugs for immunological or allergic disorders
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P43/00—Drugs for specific purposes, not provided for in groups A61P1/00-A61P41/00
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
-
- F—MECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
- F02—COMBUSTION ENGINES; HOT-GAS OR COMBUSTION-PRODUCT ENGINE PLANTS
- F02D—CONTROLLING COMBUSTION ENGINES
- F02D11/00—Arrangements for, or adaptations to, non-automatic engine control initiation means, e.g. operator initiated
- F02D11/06—Arrangements for, or adaptations to, non-automatic engine control initiation means, e.g. operator initiated characterised by non-mechanical control linkages, e.g. fluid control linkages or by control linkages with power drive or assistance
- F02D11/10—Arrangements for, or adaptations to, non-automatic engine control initiation means, e.g. operator initiated characterised by non-mechanical control linkages, e.g. fluid control linkages or by control linkages with power drive or assistance of the electric type
- F02D11/107—Safety-related aspects
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Immunology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- General Engineering & Computer Science (AREA)
- Pharmacology & Pharmacy (AREA)
- Mechanical Engineering (AREA)
- Animal Behavior & Ethology (AREA)
- Combustion & Propulsion (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Cell Biology (AREA)
- Toxicology (AREA)
- General Chemical & Material Sciences (AREA)
- Gastroenterology & Hepatology (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Genetics & Genomics (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Peptides Or Proteins (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
编码人受体等哺乳动物受体的核酸、纯化受体蛋白,及其片段。此外还提供抗体,包括多克隆抗体和单克隆抗体。本发明还提供将这些组合物应用于诊断和治疗的方法。
Description
本申请是申请日为2001年5月23日,申请号为01813453.X,发明名称为“人受体蛋白;相关的试剂和方法”的发明专利申请的分案申请。
发明领域
本发明涉及一些组合物及方法,用于影响哺乳动物的生理机能,包括形态发生或免疫系统功能。详细地说,本发明提供一些能够调节发育和/或免疫系统的核酸、蛋白及抗体。此外还公开这些物质的诊断性和治疗性应用。
发明背景
DNA重组技术通常是指把来自源供体的遗传信息整合到载体中用于后续处理,如引入到宿主中,从而在新环境下复制和/或表达被转移的遗传信息的技术。这些遗传信息通常是以互补DNA(cDNA)的形式存在,这些cDNA来源于编码所需蛋白产物的信使RNA(mRNA)。载体通常是一种质粒,这种质粒能够整合cDNA,并随后在宿主中复制,在某些情况下还能够调控cDNA的表达,从而指导其编码产物在宿主中的合成。
此前已认识到哺乳动物免疫应答的基础是一系列复杂的细胞间相互作用,被称为“免疫网络”。近期的研究则使人们对该网络的内部工作方式有了新的理解。事实上,许多免疫应答都围绕着淋巴细胞、巨噬细胞、粒细胞和其他细胞之间的网状相互作用而产生,尽管对此还不完全了解,但免疫学家目前的普遍观点是,被称为淋巴因子、细胞因子或单核因子的可溶性蛋白在对这些细胞间相互作用的调控过程中发挥着关键的作用。因此,分离和鉴定细胞调节因子并研究其作用机理是一大热点,对这些知识的了解有助于显著改进免疫系统失调等众多医学异常疾病的诊断和治疗方法。
显然,淋巴因子可通过多种方式来调节细胞的活动。研究显示,这些淋巴因子可支持多能造血干细胞的增殖、生长和/或分化成大量祖细胞,这些祖细胞包括不同的细胞谱系,从而构成复杂的免疫系统。对正常的免疫应答而言,这些细胞组分之间的相互作用必须适当,并且要保持平衡。将淋巴因子与其他试剂一同给药时,不同的细胞谱系通常以不同的方式产生应答。
对免疫应答尤为重要的细胞谱系包括两类淋巴细胞:产生和分泌免疫球蛋白(能够识别并结合异物从而实现其清除作用的蛋白)的B细胞,以及分泌淋巴因子并诱导或抑制B细胞和构成免疫网络的其他多种细胞(包括其他T细胞)的不同的T细胞亚类。这些淋巴细胞可与其他多种类型的细胞相互作用。
另一种重要的细胞谱系是肥大细胞(目前还未能在所有种类的哺乳动物中明确鉴定出此类细胞),它是一种位于全身毛细血管附近的含有颗粒的结缔组织细胞。此类细胞在肺、皮肤以及胃肠道和泌尿生殖道中出现的浓度特别高。肥大细胞在变态反应相关性病症中起主要作用,尤其是在下述过敏反应中:当特定抗原与结合在肥大细胞表面受体上的一类免疫球蛋白交联时,该肥大细胞可脱粒并释放介体,如组胺、5羟色胺、肝素和前列腺素,从而导致变态反应,如过敏反应。
一般而言,目前还无法在体外培养免疫系统细胞,这使进一步了解并治疗多种免疫功能障碍的研究受阻。免疫学家发现,利用T细胞及其他细胞的上清液可以对多种免疫系统细胞进行培养,这些上清液含有多种生长因子,包括多种淋巴因子。
白介素-1蛋白家族包括IL-1α、IL-1β、IL-1RA和最近发现的IL-1γ(也被命名为干扰素γ诱导因子,IGIF)。研究显示,相关的该基因家族可能具有广泛的生物学功能。参阅Dinarello(1994)
FASEB J.8:1314-1325;Dinarello(1991)
Blood 77:1627-1652;和Okamura,et al.(1995)
Nature 378:88-91。
此外还有多种生长因子和调节因子可以调节形态发生的发展过程。包括,例如,通过与受体结合来产生信号的Toll配体,其结合的受体具有与IL-1受体相同的结构特性和作用机理。可参阅,例如,Lemaitre,et al.(1996)
Cell 86:973-983;和Anderson(1996)Ann.Rev.Cell & Devel.Biol.12:393-416。
由此可知,发现和开发新的可溶性蛋白及其受体,包括与淋巴因子类似的可溶性蛋白及其受体,将有助于开发新的疗法,以治疗那些直接或间接涉及诸如免疫系统和/或造血细胞发育、分化或功能的广泛的变性疾病或异常病症。特别是有些淋巴因子样分子可提高或加强其他淋巴因子的有利活性,发现并了解这些分子的新受体将会带来巨大利益。本发明将提供类似于白介素-1样成分的配体的一些新受体,以及相关的化合物及其用途。
附图简述
图1显示果蝇、新小杆线虫(Caenorabditis)和人DTLRs的蛋白结构比较及其与脊椎动物IL-1受体和植物抗病蛋白的关系示意图。图中将3种果蝇(Dm)DTLRs(Toll、18w和Mst ORF片段)(Morisatoand Anderson(1995)
Ann.Rev.Genet.29:371-399;Chiang andBeachy(1994)
Mech.Develop.47:225-239;Mitcham,et al.(1996)J.Biol.Chem.271:5777-5783;和Eldon,et al.(1994)
Develop.120:885-899)与4种完整人(Hu)受体(DTLRs 1-4)和一种部分人(Hu)受体(DTLR5)进行排列比较。受体胞外域中被PRINTS标记(Attwood,et al.,(1997)
Nucleic Acids Res.25:212-217)的各LRRs均用方框标明;位于LRR阵列C末端或N末端侧翼的“顶部”及“底部”的Cys富集簇分别描绘成对立的半圆。与Toll和18w的784和977个氨基酸残基的延伸部分相比,DTLRs 1-5的胞外域较小(分别为558、570、690和652个氨基酸残基),这主要是因为其内部Cys富集区域缺失。DmMst和HuDTLRs的不完全链(胞外域分别约为519和153个氨基酸残基)用虚线表示。在膜下标明的是DTLRs、IL-1型受体(IL-1Rs)、细胞内蛋白Myd88和烟草抗病基因N产物(DRgN)所共有的胞内信号组件。可参阅,例如,Hardiman,et al.(1996)
Oncogene 13:2467-2475:和Rock.et al.(1998)
Proc. Nat’l Acad.Sci.USA 95:588-。其他结构域还包括IL-1Rs中的Ig样组件三聚体(二硫键连接的环);DRgN蛋白的特征是含有NTPase结构域(方框),而Myd88具有一种死亡结构域(黑色椭圆)。
图2A-2C显示Toll样和IL-1样细胞因子受体以及两种趋异模块化蛋白的信号结构域保守结构模式。图2A-2B显示TH共有结构域的序列比对。DTLRs的标记方法与图1相同;人(Hu)或小鼠(Mo)IL-1家族受体(IL-1R 1-6)是按照前人的建议依次编号(Hardiman,et al.(1996)
Oncogene 13:2467-2475);Myd88以及来自烟草(To)和亚麻L.usitatissimum(Lu)的序列分别代表较大的多结构域分子的C-端和N-端结构域。序列的无空位区(编号1-10)加框表示。有害的突变用三角标明,而将箭头所指部位的N-端截断会破坏人IL-1R1的生物活性(Heguy,et al.(1992)
J.Biol.Chem.267:2605-2609)。PHD(Rost and Sander(1994)
Proteins 19:55-72)和DSC(Kingand Sternbery(1996)
Protein Sci.5:2298-2310)二级结构预测结果中的α-螺旋(H)、β-链(E)或卷曲(L)均标出。氨基酸阴影图描述出化学性质类似的残基:疏水性残基、酸性残基、碱性残基、Cys、芳香族残基、结构破坏性残基以及微小残基。IL-1Rs、DTLRs以及全排列(ALL)的序列分析图式均来自Consensus,严格度为75%。用于标明氨基酸亚类的符号为(详情见互联网站点):o,醇;1,脂肪族;·,任意氨基酸;a,芳香族;c,带电荷的;h,疏水性;-,负电性;p,极性;+,正电性;s,小;u,微小;t,转角样。图2C显示预测的TH β/α结构域折叠的拓扑结构图。平行β-折叠(β-链A-E用黄色三角标示)是从其C-末端观测;α-螺旋(标号1-5的圆圈)与β-链相联。链的联接处可位于前部(可见)或后部(隐藏)。位于β-折叠C-末端的保守带电残基用灰色标注(Asp),或标成单独的黑色残基(Arg)(见正文)。
图3显示信号结构域超家族的进化。利用Neighbor-Joining方法(Thompson.et al.(1994)
Nucleic Acids Res.22:4673-4680)可由图2A-2B的多重TH组件比对结果产生系统树。比对中标记了蛋白;提供的系统树是由TreeView产生。
图4A-4D描述人DTLR基因的FISH染色体作图。将用于定位的生物素标记DTLR cDNA探针与来自人淋巴细胞同步培养物的变性染色体杂交。FISH作图结果(左侧,图4A,DTLR2;4B,DTLR3;4C,DTLR4;4D,DTLR5)与染色体带的比对是将FISH信号与DAPI显带的染色体相叠加而完成的(中间图)。Heng and Tsui(1994)
Meth.Molec. Biol.33:109-122。分析结果被总结成染色体模式图的形式(右侧图)。
图5A-5F显示人DTLRs的mRNA印迹分析结果。用每道约含有2μgpoly(A)+RNA的多种人类组织印迹(He,心脏;Br,脑;P1,胎盘;Lu,肺;Li,肝脏;Mu,肌肉;Ki,肾;Pn,胰腺;Sp,脾;Th,胸腺;Pr,前列腺;Te,睾丸;Ov,卵巢;SI,小肠;Co,结肠;PBL,外周血淋巴细胞)和癌细胞系印迹(早幼粒细胞白血病,HL60;子宫颈癌,HELAS3;慢性髓细胞性白血病,K562;淋巴母细胞白血病,Molt4;结肠直肠腺癌,SW480;黑素瘤,G361;伯基特淋巴瘤;结肠直肠腺癌,SW480;肺癌,A549)与编码上述DTLR1(图5A-5C)、DTLR2(图5D),DTLR3(图5E)和DTLR4(图5F)的放射性标记cDNAs杂交。利用增感屏使印迹于-70℃暴露于X射线胶片2天(图5A-5C)或1周(图5D-5F)。某些道中出现了一种0.3kB的反常核素;杂交实验排除了DTLR胞质片段的编码信息。
发明概述
本发明涉及被命名为DTLR2、DTLR3、DTLR4、DTLR5、DTLR7、DTLR8、DTLR9和DTLR10的9种相关的哺乳动物新受体,如灵长类动物或人的DNAX Toll受体样分子结构,以及这些受体的生物活性。本发明包括这些多肽自身的编码核酸及其产生方法和用途。本发明所述核酸的部分特征是与文中附带的克隆化互补DNA(cDNA)序列具有同源性。在一些实施方案中,本发明提供一种物质组合物,该物质选自:一种基本纯的或重组的DTLR2蛋白或肽,在其至少约12个氨基酸的长度上表现出与SEQ ID NO:4的同一性;一种SEQ ID NO:4的天然序列DTLR2;一种含有DTLR2序列的融合蛋白;一种基本纯的或重组的DTLR3蛋白或肽,在其至少约12个氨基酸的长度上表现出与SEQ IDNO:6的同一性;一种SEQ ID NO:6的天然序列DTLR3;一种含有DTLR3序列的融合蛋白;一种基本纯的或重组的DTLR4蛋白或肽,在其至少约12个氨基酸的长度上表现出与SEQ ID NO:8的同一性;一种SEQ ID NO:8的天然序列DTLR4;一种含有DTLR4序列的融合蛋白;一种基本纯的或重组的DTLR5蛋白或肽,在其至少约12个氨基酸的长度上表现出与SEQ ID NO:10的同一性;一种SEQ ID NO:10的天然序列DTLR5;一种含有DTLR5序列的融合蛋白;一种基本纯的或重组的DTLR6蛋白或肽,在其至少约12个氨基酸的长度上表现出与SEQ ID NO:12、28或30的同一性;一种SEQ ID NO:12、28或30的天然序列DTLR6;一种含有DTLR6序列的融合蛋白;一种基本纯的或重组的DTLR7蛋白或肽,在其至少约12个氨基酸的长度上表现出与SEQ ID NO:16、18或37的同一性;一种SEQ ID NO:16、18或37的天然序列DTLR7;一种含有DTLR7序列的融合蛋白;一种基本纯的或重组的DTLR8蛋白或肽,在其至少约12个氨基酸的长度上表现出与SEQ ID NO:32或39的同一性;一种SEQ ID NO:32或39的天然序列DTLR8;一种含有DTLR8序列的融合蛋白;一种基本纯的或重组的DTLR9蛋白或肽,在其至少约12个氨基酸的长度上表现出与SEQ ID NO:22或41的同一性;一种SEQ ID NO:22或41的天然序列DTLR9;一种含有DTLR9序列的融合蛋白;一种基本纯的或重组的DTLR10蛋白或肽,在其至少约12个氨基酸的长度上表现出与SEQID NO:34、43或45的同一性;一种SEQ ID NO:34、43或45的天然序列DTLR10;一种含有DTLR10序列的融合蛋白。优选的是这种基本纯的或分离的蛋白含有一段与DTLR2、DTLR3、DTLR4、DTLR5、DTLR6、DTLR7、DTLR8、DTLR9和DTLR10的相应部分表现出同一性的序列段,其中,所述同一性至少表现于约15个氨基酸;优选约19个氨基酸;更优选约25个氨基酸。在特别实施方案中,该物质组合物:是DTLR2,它含有表2所示成熟序列;或缺乏翻译后修饰;是DTLR3,它含有表3所示成熟序列;或缺乏翻译后修饰;是DTLR4,它含有表4所示成熟序列;或缺乏翻译后修饰;是DTLR5,它含有表5所示成熟序列;或缺乏翻译后修饰;是DTLR6,它含有表6所示成熟序列;或缺乏翻译后修饰;是DTLR7,它含有表7所示成熟序列;或缺乏翻译后修饰;是DTLR8,它含有表8所示成熟序列;或缺乏翻译后修饰;是DTLR9,它含有表9所示成熟序列;或缺乏翻译后修饰;是DTLR10,它含有表10所示成熟序列;或缺乏翻译后修饰;该组合物还可以是一种蛋白或肽,这种蛋白或肽:来源于一种选自哺乳动物的温血动物,包括灵长类动物,如人;至少含有SEQ ID NO:4、6、26、10、12、28、30、16、18、32、22或34的一种多肽片段;带有多个具有所述同一性的片段;是DTLR2、DTLR3、DTLR4、DTLR5、DTLR6、DTLR7、DTLR8、DTLR9或DTLR10的一种天然等位变体;至少长约30个氨基酸;至少有两种对灵长类动物的DTLR2、DTLR3、DTLR4、DTLR5、DTLR6、DTLR7、DTLR8、DTLR9或DTLR10具有特异性的非重叠表位;在至少约35个氨基酸的长度上与灵长类动物的DTLR2、DTLR3、DTLR4、DTLR5、DTLR6、DTLR7、DTLR8、DTLR9或DTLR10表现出序列同一性;另外具有至少两种对灵长类动物的DTLR2、DTLR3、DTLR4、DTLR5、DTLR6、DTLR7、DTLR8、DTLR9或DTLR10具有特异性的非重叠表位;在至少约20个氨基酸的长度上与啮齿类动物的DTLR6表现出同一性;被糖基化;经天然糖基化作用后的分子量至少为100kD:是一种合成多肽;附着于一种固体基质;与另一种化学部分偶联;是天然序列的一种5倍或5倍以下的取代物;或是天然序列的一种缺失变体或插入变体。
其他实施方案还包括一种组合物,该组合物含有:一种无菌的DTLR2蛋白或肽;或DTLR2蛋白或肽以及一种载体,其中该载体:是一种水性化合物,包括水、盐水和/或缓冲液;并且/或被配制用于口服给药、直肠给药、鼻部给药、局部给药或胃肠外给药;一种无菌的DTLR3蛋白或肽;或DTLR3蛋白或肽以及一种载体,其中该载体:是一种水性化合物,包括水、盐水和/或缓冲液;并且/或被配制用于口服给药、直肠给药、鼻部给药、局部给药或胃肠外给药;一种无菌的DTLR4蛋白或肽;或DTLR4蛋白或肽以及一种载体,其中该载体:是一种水性化合物,包括水、盐水和/或缓冲液;并且/或被配制用于口服给药、直肠给药、鼻部给药、局部给药或胃肠外给药;一种无菌的DTLR5蛋白或肽;或DTLR5蛋白或肽以及一种载体,其中该载体:是一种水性化合物,包括水、盐水和/或缓冲液;并且/或被配制用于口服给药、直肠给药、鼻部给药、局部给药或胃肠外给药;一种无菌的DTLR6蛋白或肽;或DTLR6蛋白或肽以及一种载体,其中该载体:是一种水性化合物,包括水、盐水和/或缓冲液;并且/或被配制用于口服给药、直肠给药、鼻部给药、局部给药或胃肠外给药;一种无菌的DTLR7蛋白或肽;或DTLR7蛋白或肽以及一种载体,其中该载体:是一种水性化合物,包括水、盐水和/或缓冲液;并且/或被配制用于口服给药、直肠给药、鼻部给药、局部给药或胃肠外给药;一种无菌的DTLR8蛋白或肽;或DTLR8蛋白或肽以及一种载体,其中该载体:是一种水性化合物,包括水、盐水和/或缓冲液;并且/或被配制用于口服给药、直肠给药、鼻部给药、局部给药或胃肠外给药;一种无菌的DTLR9蛋白或肽;或DTLR9蛋白或肽以及一种载体,其中该载体:是一种水性化合物,包括水、盐水和/或缓冲液;并且/或被配制用于口服给药、直肠给药、鼻部给药、局部给药或胃肠外给药;一种无菌的DTLR10蛋白或肽;或DTLR10蛋白或肽以及一种载体,其中该载体:是一种水性化合物,包括水、盐水和/或缓冲液;并且/或被配制用于口服给药、直肠给药、鼻部给药、局部给药或胃肠外给药。
在一些融合蛋白实施方案中,本发明提供一种融合蛋白,该融合蛋白含有:包含表2、3、4、5、6、7、8、9或10的序列的成熟蛋白;一种检测标记或纯化标记,包括FLAG、His6或Ig序列;或另一种受体蛋白的序列。
在多种试剂盒实施方案中包括一种试剂盒,该试剂盒含有一种DTLR蛋白或多肽,以及:装有该蛋白或多肽的一种隔室;和/或所述试剂盒中所含试剂的使用或处理说明书。结合化合物的实施方案包括那些含有一种抗体的抗原结合位点的化合物,这种抗体可特异性结合权利要求1的DTLR2、DTLR3、DTLR4、DTLR5、DTLR6、DTLR7、DTLR8、DTLR9或DTLR10蛋白,其中:所述蛋白是一种灵长类动物蛋白;该结合化合物是一种Fv、Fab或Fab2片段;该结合化合物与另一种化学部分偶联;或所述抗体:针对一种具有表2、3、4、5、6、7、8、9或10所示成熟多肽的肽序列;针对成熟的DTLR2、DTLR3、DTLR4、DTLR5、DTLR6、DTLR7、DTLR8、DTLR9或DTLR10;针对纯化的人DTLR2、DTLR3、DTLR4、DTLR5、DTLR6、DTLR7、DTLR8、DTLR9或DTLR10;经过免疫选择;是一种多克隆抗体;与变性的DTLR2、DTLR3、DTLR4、DTLR5、DTLR6、DTLR7、DTLR8、DTLR9或DTLR10结合;与抗原结合的Kd至少为30μM;附着于一种固体基质、包括珠或塑料膜;包含在一种无菌组合物中;或带有可检测标记,包括放射性标记或荧光标记。结合组合物试剂盒通常含有所述结合化合物,以及:装有所述结合化合物的一种隔室;和/或所述试剂盒中所含试剂的使用或处理说明书。该试剂盒通常可以进行定量或定性分析。
本发明提供一些方法,如制备抗体的方法,该方法包括,用免疫原性剂量的一种灵长类动物DTLR2、DTLR 3、DTLR4、DTLR5、DTLR6、DTLR7、DTLR8、DTLR9或DTLR10对一种免疫系统进行免疫,从而使该抗体能够产生;或产生抗原:抗体复合物的方法,该方法包括,使上述抗体与哺乳动物DTLR2、DTLR3、DTLR4、DTLR5、DTLR6、DTLR7、DTLR8、DTLR9或DTLR10蛋白相接触,从而使该复合物能够形成。
其他组合物还包括一种组合物,该组合物含有:一种无菌的结合化合物,或这种结合化合物与一种载体,其中该载体:是一种水性化合物,包括水、盐水和/或缓冲液;并且/或被配制用于口服给药、直肠给药、鼻部给药、局部给药或胃肠外给药。
核酸的实施方案包括编码DTLR 2-10蛋白或肽或融合蛋白的一种分离的或重组的核酸,其中,所述DTLR来源于一种哺乳动物;或所述核酸:编码表2、3、4、5、6、7、8、9或10的一种抗原性肽序列;编码表2、3、4、5、6、7、8、9或10的多种抗原性肽序列;含有表2、3、4、5、6、7、8、9或10的至少17个连续核苷酸;与编码所述片段的天然cDNA至少约有80%的同一性;是一种表达载体;另含有一种复制起始点;来自一种天然来源;含有一种可检测标记;含有合成的核苷酸序列;小于6kb,优选小于3kb;来自一种哺乳动物,包括灵长类动物;含有一种天然的全长编码序列;是所述DTLR的编码基因的一种杂交探针;或是一种PCR引物、PCR产物或诱变引物。此外还提供含有这种重组核酸的细胞、组织或器官。该细胞优选为:一种原核细胞;一种真核细胞;一种细菌细胞;一种酵母细胞;一种昆虫细胞;一种哺乳动物细胞;一种小鼠细胞;一种灵长类动物细胞;或一种人类细胞。提供的试剂盒含有这些核酸,以及:装有该核酸的一种隔室;另装有灵长类动物DTLR2、DTLR3、DTLR4或DTLR5蛋白或多肽的一种隔室;和/或所述试剂盒中所含试剂的使用或处理说明书。该试剂盒通常可用于进行定性或定量分析。
其他实施方案还包括一种核酸,该核酸:能够在30℃和盐浓度小于2M的洗涤条件下与SEQ ID NO:3杂交;能够在30℃和盐浓度小于2M的洗涤条件下与SEQ ID NO:5杂交;能够在30℃和盐浓度小于2M的洗涤条件下与SEQ ID NO:7杂交;能够在30℃和盐浓度小于2M的洗涤条件下与SEQ ID NO:9杂交;能够在30℃和盐浓度小于2M的洗涤条件下与SEQ ID NO:11、13、27或29杂交;能够在30C和盐浓度小于2M的洗涤条件下与SEQ ID NO:15、17或36杂交;能够在30℃和盐浓度小于2M的洗涤条件下与SEQ ID NO:19、31或38杂交;能够在30℃和盐浓度小于2M的洗涤条件下与SEQ IDNO:21或40杂交;能够在30℃和盐浓度小于2M的洗涤条件下与SEQ ID NO:23、33、42或44杂交;在至少约30个核苷酸的片段上表现出与灵长类动物DTLR2的至少约85%的同一性;在至少约30个核苷酸的片段上表现出与灵长类动物DTLR3的至少约85%的同一性;在至少约30个核苷酸的片段上表现出与灵长类动物DTLR4的至少约85%的同一性;在至少约30个核苷酸的片段上表现出与灵长类动物DTLR5的至少约85%的同一性;该核酸优选具有这些性质,其中,洗涤条件为45℃和/或500mM的盐浓度;或所述同一性至少为90%,并且/或所述片段至少为55个核苷酸。
更优选的是洗涤条件为55℃和/或150mM的盐浓度;或所述同一性至少为95%,并且/或所述片段至少为75个核苷酸。
此外还提供一些产生配体:受体复合物的方法,这些方法包括将基本纯的灵长类动物DTLR2、DTLR3、DTLR4、DTLR5、DTLR6、DTLR7、DTLR8、DTLR9或DTLR10,包括重组或合成产生的蛋白,与候选的Toll配体相接触;从而使所述复合物能够形成。
本发明还提供一种用于调节细胞或组织培养细胞的生理机能或发育的方法,该方法包括将所述细胞与哺乳动物DTLR2、DTLR3、DTLR4、DTLR5、DTLR6、DTLR7、DTLR8、DTLR9或DTLR10的一种激动剂或拮抗剂相接触。优选的是将pDC 2细胞与DTLR10的激动剂或拮抗剂相接触。
发明详述
提纲
I.概要
II.活性
III.核酸
A.编码片段、序列、探针
B.突变、嵌合体、融合体
C.制备核酸
D.含有核酸的载体、细胞
IV.蛋白、肽
A.片段、序列、免疫原、抗原
B.突变蛋白
C.激动剂/拮抗剂、功能等价物
D.制备蛋白
V.核酸、蛋白的制备
A.合成
B.重组
C.天然表源
VI.抗体
A.多克隆抗体
B.单克隆抗体
C.片段;Kd
D.抗独特型抗体
E.杂交瘤细胞系
VII.用于DTLRs 1-10定量的试剂盒及方法
A.ELISA
B.编码mRNA的测定
C.定性/定量
D.试剂盒
VIII.治疗性组合物、方法
A.联合组合物
B.单位剂量
C.给药方法
IX.配体
I.概要
本发明提供具有特定结构和生物学性质的哺乳动物DNAX Toll样受体分子(DTLR)的氨基酸序列及DNA序列,本文的DTLR来源于灵长类动物。这些分子在文中分别被命名为DTLR2、DTLR3、DTLR4、DTLR5、DTLR6、DTLR7、DTLR8、DTLR9和DTLR10,从而使人Toll样受体家族成员的数量从1增加到10。编码这些分子的不同cDNAs均来自灵长类动物的cDNA序列库,例如人的cDNA序列库。其他灵长类动物或哺乳类动物的对应物也应考虑在内。
一些可使用的标准方法是来自,例如,Maniatis,et al.(1982)《分子克隆实验手册》,Cold Spring Harbor Laboratory,ColdSpring Harbor Press;Sambrook,et al.(1989)
《分子克隆实验 手册》,(第二版)vols.1-3,CSH Press,NY;Ausubel,et al.,《生物学》,Greene Publishing Associates,Brooklyn,NY;或Ausubel,et al.(1987及定期增刊)《
最新分子生物学实验方法》,Greene/Wiley,New York的描述或其参考文献;这些内容均在此引入作为参考。
表1显示人DTLR1编码区的完整核苷酸序列(SEQ ID NO:1)和相应的氨基酸序列(SEQ ID NO:2)。还可参考Nomura,et al.(1994)DNA Res.1:27-35。表2显示人DTLR2编码区的完整核苷酸序列(SEQID NO:3)和相应的氨基酸序列(SEQ ID NO:4)。表3显示人DTLR3编码区的完整核苷酸序列(SEQ ID NO:5)和相应的氨基酸序列(SEQID NO:6)。表4显示人DTLR4编码区的完整核苷酸序列(SEQ ID NO:7)和相应的氨基酸序列(SEQ ID NO:8);还可参考SEQ ID NO:25和SEQ ID NO:26。表5显示人DTLR5编码区的部分核苷酸序列(SEQID NO:9)和相应的氨基酸序列(SEQ ID NO:10)。表6显示人DTLR6编码区的完整核苷酸序列(SEQ ID NO:11)和相应的氨基酸序列(SEQID NO:12),以及鼠DTLR6的部分序列(SEQ ID NO:13、14、27、28、29和30)。表7显示人DTLR7编码区的部分核苷酸序列(SEQ IDNO:15和17)和相应的氨基酸序列(SEQ ID NO:16和18);SEQ IDNO:36和37则提供全长序列。表8显示人DTLR8编码区的部分核苷酸序列(SEQ ID NO:19)和相应的氨基酸序列(SEQ ID NO:20),以及附加序列(SEQ ID NO:31、32、38和39)。表9显示人DTLR9编码区的部分核苷酸序列(SEQ ID NO:21)和相应的氨基酸序列(SEQID NO:22);还可参考SEQ ID NO:40和41。表10显示人DTLR10编码区的部分核苷酸序列(SEQ ID NO:23)和相应的氨基酸序列(SEQID NO:24),以及附加序列(SEQ ID NO:33、34、42和43)和小鼠等啮齿动物的序列(SEQ ID NO:35、44和45)。
表1:诸如人等灵长类动物的DNAX Toll样受体1(DTLR1)的核苷酸序列和氨基酸序列(见SEQ ID NO:1和2)。
ATG ACT AGC ATC TTC CAT TTT GCC ATT ATC TTC ATG TTA ATA CTT CAG 48
Met Thr Ser Ile Phe His Phe Ala Ile Ile Phe Met Leu Ile Leu Gln
-22 -20 -15 -10
ATC AGA ATA CAA TTA TCT GAA GAA AGT GAA TTT TTA GTT GAT AGG TCA 96
Ile Arg Ile Gln Leu Ser Glu Glu Ser Glu Phe Leu Val Asp Arg Ser
-5 1 5 10
AAA AAC GGT CTC ATC CAC GTT CCT AAA GAC CTA TCC CAG AAA ACA ACA 144
Lys Asn Gly Leu Ile His Val Pro Lys Asp Leu Ser Gln Lys Thr Thr
15 20 25
ATC TTA AAT ATA TCG CAA AAT TAT ATA TCT GAG CTT TGG ACT TCT GAC 192
Ile Leu Asn Ile Ser Gln Asn Tyr Ile Ser Glu Leu Trp Thr Ser Asp
30 35 40
ATC TTA TCA CTG TCA AAA CTG AGG ATT TTG ATA ATT TCT CAT AAT AGA 240
Ile Leu Ser Leu Ser Lys Leu Arg Ile Leu Ile Ile Ser His Asn Arg
45 50 55
ATC CAG TAT CTT GAT ATC AGT GTT TTC AAA TTC AAC CAG GAA TTG GAA 288
Ile Gln Tyr Leu Asp Ile Ser Val Phe Lys Phe Asn Gln Glu Leu Glu
60 65 70
TAC TTG GAT TTG TCC CAC AAC AAG TTG GTG AAG ATT TCT TGC CAC CCT 336
Tyr Leu Asp Leu Ser His Asn Lys Leu Val Lys Ile Ser Cys His Pro
75 80 85 90
ACT GTG AAC CTC AAG CAC TTG GAC CTG TCA TTT AAT GCA TTT GAT GCC 384
Thr Val Asn Leu Lys His Leu Asp Leu Ser Phe Asn Ala Phe Asp Ala
95 100 105
CTG CCT ATA TGC AAA GAG TTT GGC AAT ATG TCT CAA CTA AAA TTT CTG 432
Leu Pro Ile Cys Lys Glu Phe Gly Asn Met Ser Gln Leu Lys Phe Leu
110 115 120
GGG TTG AGC ACC ACA CAC TTA GAA AAA TCT AGT GTG CTG CCA ATT GCT 480
Gly Leu Ser Thr Thr His Leu Glu Lys Set Set val Leu Pro Ile Ala
125 130 135
CAT TTG AAT ATC AGC AAG GTC TTG CTG GTC TTA GGA GAG ACT TAT GGG 528
His Leu Asn Ile Set Lys Val Leu Leu Val Leu Gly Glu Thr Tyr Gly
140 145 150
GAA AAA GAA GAC CCT GAG GGC CTT CAA GAC TTT AAC ACT GAG AGT CTG 576
Glu Lys Glu Asp Pro Glu Gly Leu Gln Asp Phe Asn Thr Giu Set Leu
155 160 165 170
CAC ATT GTG TTC CCC ACA AAC AAA GAA TTC CAT TTT ATT TTG GAT GTG 624
His Ile Val Phe Pro Thr Asn Lys Glu Phe His Phe Ile Leu Asp Val
175 180 185
TCA GTC AAG ACT GTA GCA AAT CTG GAA CTA TCT AAT ATC AAA TGT GTG 672
Ser Val Lys Thr Val Ala Asn Leu Glu Leu Ser Asn Ile Lys Cys Val
190 195 200
CTA GAA GAT AAC AAA TGT TCT TAC TTC CTA AGT ATT CTG GCG AAA CTT 720
Leu Glu Asp Asn Lys Cys Ser Tyr Phe Leu Ser Ile Leu Ala Lys Leu
205 210 215
CAA ACA AAT CCA AAG TTA TCA AGT CTT ACC TTA AAC AAC ATT GAA ACA 768
Gln Thr Asn Pro Lys Leu Ser Ser Leu Thr Leu Asn Asn Ile Glu Thr
220 225 230
ACT TGG AAT TCT TTC ATT AGG ATC CTC CAA CTA GTT TGG CAT ACA ACT 816
Thr Trp Asn Ser Phe Ile Arg Ile Leu Gln Leu Val Trp His Thr Thr
235 240 245 250
GTA TGG TAT TTC TCA ATT TCA AAC GTG AAG CTA CAG GGT CAG CTG GAC 864
Val Trp Tyr Phe Ser Ile Ser Asn Val Lys Leu Gln Gly Gln Leu Asp
255 260 265
TTC AGA GAT TTT GAT TAT TCT GGC ACT TCC TTG AAG GCC TTG TCT ATA 912
Phe Arg Asp Phe Asp Tyr Ser Gly Thr Ser Leu Lys Ala Leu Ser Ile
270 275 280
CAC CAA GTT GTC AGC GAT GTG TTC GGT TTT CCG CAA AGT TAT ATC TAT 960
His Gln Val Val Ser Asp Val Phe Gly Phe Pro Gln Ser Tyr Ile Tyr
285 290 295
GAA ATC TTT TCG AAT ATG AAC ATC AAA AAT TTC ACA GTG TCT GGT ACA 1008
Glu Ile Phe Ser Asn Met Asn Ile Lys Asn Phe Thr Val Ser Gly Thr
300 305 310
CGC ATG GTC CAC ATG CTT TGC CCA TCC AAA ATT AGC CCG TTC CTG CAT 1056
Arg Met Val His Met Leu Cys Pro Ser Lys Ile Ser Pro Phe Leu His
315 320 325 330
TTG GAT TTT TCC AAT AAT CTC TTA ACA GAC ACG GTT TTT GAA AAT TGT 1104
Leu Asp Phe Ser Asn Asn Leu Leu Thr Asp Thr Val Phe Glu Asn Cys
335 340 345
GGG CAC CTT ACT GAG TTG GAG ACA CTT ATT TTA CAA ATG AAT CAA TTA 1152
Gly His Leu Thr Glu Leu Glu Thr Leu Ile Leu Gln Met Asn Gln Leu
350 355 360
AAA GAA CTT TCA AAA ATA GCT GAA ATG ACT ACA CAG ATG AAG TCT CTG 1200
Lys Glu Leu Ser Lys Ile Ala Glu Met Thr Thr Gln Met Lys Ser Leu
365 370 375
CAA CAA TTG GAT ATT AGC CAG AAT TCT GTA AGC TAT GAT GAA AAG AAA 1248
Gln Gln Leu Asp Ile Ser Gln Asn Ser Val Ser Tyr Asp Glu Lys Lys
380 385 390
GGA GAC TGT TCT TGG ACT AAA AGT TTA TTA AGT TTA AAT ATG TCT TCA 1296
Gly Asp Cys Ser Trp Thr Lys Ser Leu Leu Ser Leu Asn Met Ser Ser
395 400 405 410
AAT ATA CTT ACT GAC ACT ATT TTC AGA TGT TTA CCT CCC AGG ATC AAG 1344
Asn Ile Leu Thr Asp Thr Ile Phe Arg Cys Leu Pro Pro Arg Ile Lys
415 420 425
GTA CTT GAT CTT CAC AGC AAT AAA ATA AAG AGC ATT CCT AAA CAA GTC 1392
Val Leu Asp Leu His Ser Asn Lys Ile Lys Ser Ile Pro Lys Gln Val
430 435 440
GTA AAA CTG GAA GCT TTG CAA GAA CTC AAT GTT GCT TTC AAT TCT TTA 1440
Val Lys Leu Glu Ala Leu Gln Glu Leu Asn Val Ala Phe Asn Ser Leu
445 450 455
ACT GAC CTT CCT GGA TGT GGC AGC TTT AGC AGC CTT TCT GTA TTG ATC 1488
Thr Asp Leu Pro Gly Cys Gly Ser Phe Ser Ser Leu Ser Val Leu Ile
460 465 470
ATT GAT CAC AAT TCA GTT TCC CAC CCA TCA GCT GAT TTC TTC CAG AGC 1536
Ile Asp His Asn Ser Val Ser His Pro Ser Ala Asp Phe Phe Gln Ser
475 480 485 490
TGC CAG AAG ATG AGG TCA ATA AAA GCA GGG GAC AAT CCA TTC CAA TGT 1584
Cys Gln Lys Met Arg Ser Ile Lys Ala Gly Asp Asn Pro Phe Gln Cys
495 500 505
ACC TGT GAG CTC GGA GAA TTT GTC AAA AAT ATA GAC CAA GTA TCA AGT 1632
Thr Cys Glu Leu Gly Glu Phe Val Lys Asn Ile Asp Gln Val Ser Ser
510 515 520
GAA GTG TTA GAG GGC TGG CCT GAT TCT TAT AAG TGT GAC TAC CCG GAA 1680
Glu Val Leu Glu Gly Trp Pro Asp Ser Tyr Lys Cys Asp Tyr Pro Glu
525 530 535
AGT TAT AGA GGA ACC CTA CTA AAG GAC TTT CAC ATG TCT GAA TTA TCC 1728
Ser Tyr Arg Gly Thr Leu Leu Lys Asp Phe His Met Ser Glu Leu Ser
540 545 550
TGC AAC ATA ACT CTG CTG ATC GTC ACC ATC GTT GCC ACC ATG CTG GTG 1776
Cys Asn Ile Thr Leu Leu Ile Val Thr Ile Val Ala Thr Met Leu Val
555 560 565 570
TTG GCT GTG ACT GTG ACC TCC CTC TGC ATC TAC TTG GAT CTG CCC TGG 1824
Leu Ala Val Thr Val Thr Ser Leu Cys Ile Tyr Leu Asp Leu Pro Trp
575 580 585
TAT CTC AGG ATG GTG TGC CAG TGG ACC CAG ACC CGG CGC AGG GCC AGG 1872
Tyr Leu Arg Met Val Cys Gln Trp Thr Gln Thr Arg Arg Arg Ala Arg
590 595 600
AAC ATA CCC TTA GAA GAA CTC CAA AGA AAT CTC CAG TTT CAT GCA TTT 1920
Asn Ile Pro Leu Glu Glu Leu Gln Arg Asn Leu Gln Phe His Ala Phe
605 610 615
ATT TCA TAT AGT GGG CAC GAT TCT TTC TGG GTG AAG AAT GAA TTA TTG 1968
Ile Ser Tyr Ser Gly His Asp Ser Phe Trp Val Lys Asn Glu Leu Leu
620 625 630
CCA AAC CTA GAG AAA GAA GGT ATG CAG ATT TGC CTT CAT GAG AGA AAC 2016
Pro Asn Leu Glu Lys Glu Gly Met Gln Ile Cys Leu His Glu Arg Asn
635 640 645 650
TTT GTT CCT GGC AAG AGC ATT GTG GAA AAT ATC ATC ACC TGC ATT GAG 2064
Phe Val Pro Gly Lys Ser Ile Val Glu Asn Ile Ile Thr Cys Ile Glu
655 660 665
AAG AGT TAC AAG TCC ATC TTT GTT TTG TCT CCC AAC TTT GTC CAG AGT 2112
Lys Ser Tyr Lys Ser Ile Phe Val Leu Ser Pro Asn Phe Val Gln Ser
670 675 680
GAA TGG TGC CAT TAT GAA CTC TAC TTT GCC CAT CAC AAT CTC TTT CAT 2160
Glu Trp Cys His Tyr Glu Leu Tyr Phe Ala His His Asn Leu Phe His
685 690 695
GAA GGA TCT AAT AGC TTA ATC CTG ATC TTG CTG GAA CCC ATT CCG CAG 2208
Glu Gly Ser Asn Ser Leu Ile Leu Ile Leu Leu Glu Pro Ile Pro Gln
700 705 710
TAC TCC ATT CCT AGC AGT TAT CAC AAG CTC AAA AGT CTC ATG GCC AGG 2256
Tyr Ser Ile Pro Ser Ser Tyr His Lys Leu Lys Ser Leu Met Ala Arg
715 720 725 730
AGG ACT TAT TTG GAA TGG CCC AAG GAA AAG AGC AAA CGT GGC CTT TTT 2304
Arg Thr Tyr Leu Glu Trp Pro Lys Glu Lys Ser Lys Arg Gly Leu Phe
735 740 745
TGG GCT AAC TTA AGG GCA GCC ATT AAT ATT AAG CTG ACA GAG CAA GCA 2352
Trp Ala Asn Leu Arg Ala Ala Ile Asn Ile Lys Leu Thr Glu Gln Ala
750 755 760
AAG AAA TAGTCTAGA 2367
Lys Lys
MTSIFHFAIIFMLILQIRIQLSEESEFLVDRSKNGLIHVPKDLSQKTTILNISQNYISELWTSDILSLSKLRILI
ISHNRIQYLDISVFKFNQELEYLDLSHNKLVKISCHPTVNLKHLDLSFNAFDALPICKEFGNMSQLKFLGLSTTH
LEKSSVLPIAHLNISKVLLVLGETYGEKEDPEGLQDFNTESLHIVFPTNKEFHFILDVSVKTVANLELSNIKCVL
EDNKCSYFLSILAKLQTNPKLSSLTLNNIETTWNSFIRILQLVWHTTVWYFSISNVKLQGQLDFRDFDYSGTSLK
ALSIHQVVSDVFGFPQSYIYEIFSNMNIKNFTVSGTRMVHMLCPSKISPFLHLDFSNNLLTDTVFENCGHLTELE
TLILQMNQLKELSKIAEMTTQMKSLQQLDISQNSVSYDEKKGDCSWTKSLLSLNMSSNILTDTIFRCLPPRIKVL
DLHSNKIKSIPKQVVKLEALQELNVAFNSLTDLPGCGSFSSLSVLIIDHNSVSHPSADFFQSCQKMRSIKAGDNP
FQCTCELGEFVKNIDQVSSEVLEGWPDSYKCDYPESYRGTLLKIFHMSELSCNITLLIVTIVATMLVLAVTVTSL
CIYLDLPWYLRMVCQWTQTRRRARNIPLEELQRNLQFHAFISYSGHDSFWVKNELLPNLEKEGMQICLHERNFVP
GKSIVENIITCIEKSYKSIFVLSPNFVQSEWCHYELYFAHHNLFHEGSNSLILILLEPIPQYSIPSSYHKLKSLM
ARRTYLEWPKEKSKRGLFWANLRAAINIKLTEQAKK
表2:诸如人等灵长类动物的DNAX Toll样受体2(DTLR2)的核苷酸序列和氨基酸序列(见SEQ ID NO:3和4)。
ATG CCA CAT ACT TTG TGG ATG GTG TGG GTC TTG GGG GTC ATC ATC AGC 48
Met Pro His Thr Leu Trp Met Val Trp Val Leu Gly Val Ile Ile Ser
-22 -20 -15 -10
CTC TCC AAG GAA GAA TCC TCC AAT CAG GCT TCT CTG TCT TGT GAC CGC 96
Leu Ser Lys Glu Glu Ser Ser Asn Gln Ala Ser Leu Ser Cys Asp Arg
-5 1 5 10
AAT GGT ATC TGC AAG GGC AGC TCA GGA TCT TTA AAC TCC ATT CCC TCA 144
Asn Gly Ile Cys Lys Gly Ser Ser Gly Ser Leu Asn Ser Ile Pro Ser
15 20 25
GGG CTC ACA GAA GCT GTA AAA AGC CTT GAC CTG TCC AAC AAC AGG ATC 192
Gly Leu Thr Glu Ala Val Lys Ser Leu Asp Leu Ser Asn Asn Arg Ile
30 35 40
ACC TAC ATT AGC AAC AGT GAC CTA CAG AGG TGT GTG AAC CTC CAG GCT 240
Thr Tyr Ile Ser Asn Ser Asp Leu Gln Arg Cys Val Asn Leu Gln Ala
45 50 55
CTG GTG CTG ACA TCC AAT GGA ATT AAC ACA ATA GAG GAA GAT TCT TTT 288
Leu Val Leu Thr Ser Asn Gly Ile Asn Thr Ile Glu Glu Asp Ser Phe
60 65 70
TCT TCC CTG GGC AGT CTT GAA CAT TTA GAC TTA TCC TAT AAT TAC TTA 336
Ser Ser Leu Gly Ser Leu Glu His Leu Asp Leu Ser Tyr Asn Tyr Leu
75 80 85 90
TCT AAT TTA TCG TCT TCC TGG TTC AAG CCC CTT TCT TCT TTA ACA TTC 384
Ser Asn Leu Ser Ser Ser Trp Phe Lys Pro Leu Ser Ser Leu Thr Phe
95 100 105
TTA AAC TTA CTG GGA AAT CCT TAC AAA ACC CTA GGG GAA ACA TCT CTT 432
Leu Asn Leu Leu Gly Asn Pro Tyr Lys Thr Leu Gly Glu Thr Ser Leu
110 115 120
TTT TCT CAT CTC ACA AAA TTG CAA ATC CTG AGA GTG GGA AAT ATG GAC 480
Phe Ser His Leu Thr Lys Leu Gln Ile Leu Arg Val Gly Asn Met Asp
125 130 135
ACC TTC ACT AAG ATT CAA AGA AAA GAT TTT GCT GGA CTT ACC TTC CTT 528
Thr Phe Thr Lys Ile Gln Arg Lys Asp Phe Ala Gly Leu Thr Phe Leu
140 145 150
GAG GAA CTT GAG ATT GAT GCT TCA GAT CTA CAG AGC TAT GAG CCA AAA 576
Glu Glu Leu Glu Ile Asp Ala Ser Asp Leu Gln Ser Tyr Glu Pro Lys
155 160 165 170
AGT TTG AAG TCA ATT CAG AAC GTA AGT CAT CTG ATC CTT CAT ATG AAG 624
Ser Leu Lys Ser Ile Gln Asn Val Ser His Leu Ile Leu His Met Lys
175 180 185
CAG CAT ATT TTA CTG CTG GAG ATT TTT GTA GAT GTT ACA AGT TCC GTG 672
Gln His Ile Leu Leu Leu Glu Ile Phe Val Asp Val Thr Ser Ser Val
190 195 200
GAA TGT TTG GAA CTG CGA GAT ACT GAT TTG GAC ACT TTC CAT TTT TCA 720
Glu Cys Leu Glu Leu Arg Asp Thr Asp Leu Asp Thr Phe His Phe Ser
205 210 215
GAA CTA TCC ACT GGT GAA ACA AAT TCA TTG ATT AAA AAG TTT ACA TTT 768
Glu Leu Ser Thr Gly Glu Thr Asn Ser Leu Ile Lys Lys Phe Thr Phe
220 225 230
AGA AAT GTG AAA ATC ACC GAT GAA AGT TTG TTT CAG GTT ATG AAA CTT 816
Arg Asn Val Lys Ile Thr Asp Glu Ser Leu Phe Gln Val Met Lys Leu
235 240 245 250
TTG AAT CAG ATT TCT GGA TTG TTA GAA TTA GAG TTT GAT GAC TGT ACC 864
Leu Asn Gln Ile Ser Gly Leu Leu Glu Leu Glu Phe Asp Asp Cys Thr
255 260 265
CTT AAT GGA GTT GGT AAT TTT AGA GCA TCT GAT AAT GAC AGA GTT ATA 912
Leu Asn Gly Val Gly Asn Phe Arg Ala Ser Asp Asn Asp Arg Val Ile
270 275 280
GAT CCA GGT AAA GTG GAA ACG TTA ACA ATC CGG AGG CTG CAT ATT CCA 960
Asp Pro Gly Lys Val Glu Thr Leu Thr Ile Arg Arg Leu His Ile Pro
285 290 295
AGG TTT TAC TTA TTT TAT GAT CTG AGC ACT TTA TAT TCA CTT ACA GAA 1008
Arg Phe Tyr Leu Phe Tyr Asp Leu Ser Thr Leu Tyr Ser Leu Thr Glu
300 305 310
AGA GTT AAA AGA ATC ACA GTA GAA AAC AGT AAA GTT TTT CTG GTT CCT 1056
Arg Val Lys Arg Ile Thr Val Glu Asn Ser Lys Val Phe Leu Val Pro
315 320 325 330
TGT TTA CTT TCA CAA CAT TTA AAA TCA TTA GAA TAC TTG GAT CTC AGT 1104
Cys Leu Leu Ser Gln His Leu Lys Ser Leu Glu Tyr Leu Asp Leu Ser
335 340 345
GAA AAT TTG ATG GTT GAA GAA TAC TTG AAA AAT TCA GCC TGT GAG GAT 1152
Glu Asn Leu Met Val Glu Glu Tyr Leu Lys Asn Ser Ala Cys Glu Asp
350 355 360
GCC TGG CCC TCT CTA CAA ACT TTA ATT TTA AGG CAA AAT CAT TTG GCA 1200
Ala Trp Pro Ser Leu Gln Thr Leu Ile Leu Arg Gln Asn His Leu Ala
365 370 375
TCA TTG GAA AAA ACC GGA GAG ACT TTG CTC ACT CTG AAA AAC TTG ACT 1248
Ser Leu Glu Lys Thr Gly Glu Thr Leu Leu Thr Leu Lys Asn Leu Thr
380 385 390
AAC ATT GAT ATC AGT AAG AAT AGT TTT CAT TCT ATG CCT GAA ACT TGT 1296
Asn Ile Asp Ile Ser Lys Asn Ser Phe His Ser Met Pro Glu Thr Cys
395 400 405 410
CAG TGG CCA GAA AAG ATG AAA TAT TTG AAC TTA TCC AGC ACA CGA ATA 1344
Gln Trp Pro Glu Lys Met Lys Tyr Leu Asn Leu Ser Ser Thr Arg Ile
415 420 425
CAC AGT GTA ACA GGC TGC ATT CCC AAG ACA CTG GAA ATT TTA GAT GTT 1392
His Ser Val Thr Gly Cys Ile Pro Lys Thr Leu Glu Ile Leu Asp Val
430 435 440
AGC AAC AAC AAT CTC AAT TTA TTT TCT TTG AAT TTG CCG CAA CTC AAA 1440
Ser Asn Asn Asn Leu Asn Leu Phe Ser Leu Asn Leu Pro Gln Leu Lys
445 450 455
GAA CTT TAT ATT TCC AGA AAT AAG TTG ATG ACT CTA CCA GAT GCC TCC 1488
Glu Leu Tyr Ile Ser Arg Asn Lys Leu Met Thr Leu Pro Asp Ala Ser
460 465 470
CTC TTA CCC ATG TTA CTA GTA TTG AAA ATC AGT AGG AAT GCA ATA ACT 1536
Leu Leu Pro Met Leu Leu Val Leu Lys Ile Ser Arg Asn Ala Ile Thr
475 480 485 490
ACG TTT TCT AAG GAG CAA CTT GAC TCA TTT CAC ACA CTG AAG ACT TTG 1584
Thr Phe Sar Lys Glu Gln Leu Asp Ser Phe His Thr Leu Lys Thr Leu
495 500 505
GAA GCT GGT GGC AAT AAC TTC ATT TGC TCC TGT GAA TTC CTC TCC TTC 1632
Glu Ala Gly Gly Asn Asn Phe Ile Cys Ser Cys Glu Phe Leu Ser Phe
510 515 520
ACT CAG GAG CAG CAA GCA CTG GCC AAA GTC TTG ATT GAT TGG CCA GCA 1680
Thr Gln Glu Gln Gln Ala Leu Ala Lys Val Leu Ile Asp Trp Pro Ala
525 530 535
AAT TAC CTG TGT GAC TCT CCA TCC CAT GTG CGT GGC CAG CAG GTT CAG 1728
Asn Tyr Leu Cys Asp Ser Pro Ser His Val Arg Gly Gln Gln Val Gln
540 545 550
GAT GTC CGC CTC TCG GTG TCG GAA TGT CAC AGG ACA GCA CTG GTG TCT 1776
Asp Val Arg Leu Ser Val Ser Glu Cys His Arg Thr Ala Leu val Ser
555 560 565 570
GGC ATG TGC TGT GCT CTG TTC CTG CTG ATC CTG CTC ACG GGG GTC CTG 1824
Gly Met Cys Cys Ala Leu Phe Leu Leu Ile Leu Leu Thr Gly Val Leu
575 580 585
TGC CAC CGT TTC CAT GGC CTG TGG TAT ATG AAA ATG ATG TGG GCC TGG 1872
Cys His Arg Phe His Gly Leu Trp Tyr Met Lys Met Met Trp Ala Trp
590 595 600
CTC CAG GCC AAA AGG AAG CCC AGG AAA GCT CCC AGC AGG AAC ATC TGC 1920
Leu Gln Ala Lys Arg Lys Pro Arg Lys Ala Pro Ser Arg Asn Ile Cys
605 610 615
TAT GAT GCA TTT GTT TCT TAC AGT GAG CGG GAT GCC TAC TGG GTG GAG 1968
Tyr Asp Ala Phe Val Ser Tyr Ser Glu Arg Asp Ala Tyr Trp Val Glu
620 625 630
AAC CTT ATG GTC CAG GAG CTG GAG AAC TTC AAT CCC CCC TTC AAG TTG 2016
Asn Leu Met Val Gln Glu Leu Glu Asn Phe Asn Pro Pro Phe Lys Leu
635 640 645 650
TGT CTT CAT AAG CGG GAC TTC ATT CCT GGC AAG TGG ATC ATT GAC AAT 2064
Cys Leu His Lys Arg Asp Phe Ile Pro Gly Lys Trp Ile Ile Asp Asn
655 660 665
ATC ATT GAC TCC ATT GAA AAG AGC CAC AAA ACT GTC TTT GTG CTT TCT 2112
Ile Ile Asp Ser Ile Glu Lys Ser His Lys Thr Val Phe Val Leu Ser
670 675 680
GAA AAC TTT GTG AAG AGT GAG TGG TGC AAG TAT GAA CTG GAC TTC TCC 2160
Glu Asn Phe Val Lys Ser Glu Trp Cys Lys Tyr Glu Leu Asp Phe Ser
685 690 695
CAT TTC CGT CTT TTT GAA GAG AAC AAT GAT GCT GCC ATT CTC ATT CTT 2208
His Phe Arg Leu Phe Glu Glu Asn Asn Asp Ala Ala Ile Leu Ile Leu
700 705 710
CTG GAG CCC ATT GAG AAA AAA GCC ATT CCC CAG CGC TTC TGC AAG CTG 2256
Leu Glu Pro Ile Glu Lys Lys Ala Ile Pro Gln Arg Phe Cys Lys Leu
715 720 725 730
CGG AAG ATA ATG AAC ACC AAG ACC TAC CTG GAG TGG CCC ATG GAC GAG 2304
Arg Lys Ile Met Asn Thr Lys Thr Tyr Leu Glu Trp Pro Met Asp Glu
735 740 745
GCT CAG CGG GAA GGA TTT TGG GTA AAT CTG AGA GCT GCG ATA AAG TCC 2352
Ala Gln Arg Glu Gly Phe Trp Val Asn Leu Arg Ala Ala Ile Lys Ser
750 755 760
TAG 2355
MPHTLWMVWVLGVIISLSKEESSNQASLSCDRNGICKGSSGSLNSIPSGLTEAVKSLDLSNNRITYISNSDLQRC
VNLQALVLTSNGINTIEEDSFSSLGSLEHLDLSYNYLSNLSSSWFKPLSSLTFLNLLGNPYKTLGETSLFSHLTK
LQILRVGNMDTFTKIQRKDFAGLTFLEELEIDASDLQSYEPKSLKSIQNVSHLILHMKQHILLLEIFVDVTSSVE
CLELRDTDLDTFHFSELSTGETNSLIKKFTFRNVKITDESLFQVMKLLNQISGLLELEFDDCTLNGVGNFRASDN
DRVIDPGKVETLTIRRLHIPRFYLFYDLSTLYSLTEEVKRITVENSKVFLVPCLLSQHLKSLEYLDLSENLMVEE
YLKNSACEDAWPSLQTLILRQNHLASLEKTGETLLTLKNLTNIDISKNSFHSMPETCQWPEKMKYLNLSSIRIHS
VTGCIPKTLEILDVSNNNLNLFSLNLPQLKELYISRNKLMTLPDASLLPMLLVLKISRNAITTFSKEQLDSFHTL
KTLEAGGNNFICSCEFLSFTQEQQALAKVLIDWPANYLCDSPSHVRGQQVQDVRLSVSECHRTALVSGMCCALFL
LILLTGVLCHRFHGLWYMKMMWAWLQAKRKPRKAPSRNICYDAFVSYSERDAYWVENLMVQELENFNPPFKLCLH
KRDFIPGKWIIDNIIDSIEKSHKTVFVLSENFVKSEWCKYELDFSHFRLFEENNDAAILILLEPIEKKAIPQRFC
KLRKIMNTKTYLEWPMDEAQREGFWVNLRAAIKS
表3:诸如人等灵长类动物的DNAX Toll样受体3(DTLR3)的核苷酸序列和氨基酸序列(见SEQ ID NO:5和6)。
ATG AGA CAG ACT TTG CCT TGT ATC TAC TTT TGG GGG GGC CTT TTG CCC 48
Met Arg Gln Thr Leu Pro Cys Ile Tyr Phe Trp Gly Gly Leu Leu Pro
-21 -20 -15 -10
TTT GGG ATG CTG TGT GCA TCC TCC ACC ACC AAG TGC ACT GTT AGC CAT 96
Phe Gly Met Leu Cys Ala Ser Ser Thr Thr Lys Cys Thr Val Ser His
-5 1 5 10
GAA GTT GCT GAC TGC AGC CAC CTG AAG TTG ACT CAG GTA CCC GAT GAT 144
Glu Val Ala Asp Cys Ser His Leu Lys Leu Thr Gln Val Pro Asp Asp
15 20 25
CTA CCC ACA AAC ATA ACA GTG TTG AAC CTT ACC CAT AAT CAA CTC AGA 192
Leu Pro Thr Asn Ile Thr Val Leu Asn Leu Thr His Asn Gln Leu Arg
30 35 40
AGA TTA CCA GCC GCC AAC TTC ACA AGG TAT AGC CAG CTA ACT AGC TTG 240
Arg Leu Pro Ala Ala Asn Phe Thr Arg Tyr Ser Gln Leu Thr Ser Leu
45 50 55
GAT GTA GGA TTT AAC ACC ATC TCA AAA CTG GAG CCA GAA TTG TGC CAG 288
Asp Val Gly Phe Asn Thr Ile Ser Lys Leu Glu Pro Glu Leu Cys Gln
60 65 70 75
AAA CTT CCC ATG TTA AAA GTT TTG AAC CTC CAG CAC AAT GAG CTA TCT 336
Lys Leu Pro Met Leu Lys Val Leu Asn Leu Gln His Asn Glu Leu Ser
80 85 90
CAA CTT TCT GAT AAA ACC TTT GCC TTC TGC ACG AAT TTG ACT GAA CTC 384
Gln Leu Ser Asp Lys Thr Phe Ala Phe Cys Thr Asn Leu Thr Glu Leu
95 100 105
CAT CTC ATG TCC AAC TCA ATC CAG AAA ATT AAA AAT AAT CCC TTT GTC 432
His Leu Met Ser Asn Ser Ile Gln Lys Ile Lys Asn Asn Pro Phe Val
110 115 120
AAG CAG AAG AAT TTA ATC ACA TTA GAT CTG TCT CAT AAT GGC TTG TCA 480
Lys Gln Lys Asn Leu Ile Thr Leu Asp Leu Ser His Asn Gly Leu Ser
125 130 135
TCT ACA AAA TTA GGA ACT CAG GTT CAG CTG GAA AAT CTC CAA GAG CTT 528
Ser Thr Lys Leu Gly Thr Gln Val Gln Leu Glu Asn Leu Gln Glu Leu
140 145 150 155
CTA TTA TCA AAC AAT AAA ATT CAA GCG CTA AAA AGT GAA GAA CTG GAT 576
Leu Leu Ser Asn Asn Lys Ile Gln Ala Leu Lys Ser Glu Glu Leu Asp
160 165 170
ATC TTT GCC AAT TCA TCT TTA AAA AAA TTA GAG TTG TCA TCG AAT CAA 624
Ile Phe Ala Asn Ser Ser Leu Lys Lys Leu Glu Leu Ser Ser Asn Gln
175 180 185
ATT AAA GAG TTT TCT CCA GGG TGT TTT CAC GCA ATT GGA AGA TTA TTT 672
Ile Lys Glu Phe Ser Pro Gly Cys Phe His Ala Ile Gly Arg Leu Phe
190 195 200
GGC CTC TTT CTG AAC AAT GTC CAG CTG GGT CCC AGC CTT ACA GAG AAG 720
Gly Leu Phe Leu Asn Asn Val Gln Leu Gly Pro Ser Leu Thr Glu Lys
205 210 215
CTA TGT TTG GAA TTA GCA AAC ACA AGC ATT CGG AAT CTG TCT CTG AGT 768
Leu Cys Leu Glu Leu Ala Asn Thr Ser Ile Arg Asn Leu Ser Leu Ser
220 225 230 235
AAC AGC CAG CTG TCC ACC ACC AGC AAT ACA ACT TTC TTG GGA CTA AAG 816
Asn Ser Gln Leu Ser Thr Thr Ser Asn Thr Thr Phe Leu Gly Leu Lys
240 245 250
TGG ACA AAT CTC ACT ATG CTC GAT CTT TCC TAC AAC AAC TTA AAT GTG 864
Trp Thr Asn Leu Thr Met Leu Asp Leu Ser Tyr Asn Asn Leu Asn Val
255 260 265
GTT GGT AAC GAT TCC TTT GCT TGG CTT CCA CAA CTA GAA TAT TTC TTC 912
Val Gly Asn Asp Ser Phe Ala Trp Leu Pro Gln Leu Glu Tyr Phe Phe
270 275 280
CTA GAG TAT AAT AAT ATA CAG CAT TTG TTT TCT CAC TCT TTG CAC GGG 960
Leu Glu Tyr Asn Asn Ile Gln His Leu Phe Ser His Ser Leu His Gly
285 290 295
CTT TTC AAT GTG AGG TAC CTG AAT TTG AAA CGG TCT TTT ACT AAA CAA 1008
Leu Phe Asn Val Arg Tyr Leu Asn Leu Lys Arg Ser Phe Thr Lys Gln
300 305 310 315
AGT ATT TCC CTT GCC TCA CTC CCC AAG ATT GAT GAT TTT TCT TTT CAG 1056
Ser Ile Ser Leu Ala Ser Leu Pro Lys Ile Asp Asp Phe Ser Phe Gln
320 325 330
TGG CTA AAA TGT TTG GAG CAC CTT AAC ATG GAA GAT AAT GAT ATT CCA 1104
Trp Leu Lys Cys Leu Glu His Leu Asn Met Glu Asp Asn Asp Ile Pro
335 340 345
GGC ATA AAA AGC AAT ATG TTC ACA GGA TTG ATA AAC CTG AAA TAC TTA 1152
Gly Ile Lys Ser Asn Met Phe Thr Gly Leu Ile Asn Leu Lys Tyr Leu
350 355 360
AGT CTA TCC AAC TCC TTT ACA AGT TTG CGA ACT TTG ACA AAT GAA ACA 1200
Ser Leu Ser Asn Ser Phe Thr Ser Leu Arg Thr Leu Thr Asn Glu Thr
365 370 375
TTT GTA TGA CTT GCT CAT TCT CCC TTA CAC ATA CTC AAC CTA ACC AAG 1248
Phe Val Ser Leu Ala His Ser Pro Leu His Ile Leu Asn Leu Thr Lys
380 385 390 395
AAT AAA ATC TCA AAA ATA GAG AGT GAT GCT TTC TCT TGG TTG GGC CAC 1296
Asn Lys Ile Ser Lys Ile Glu Ser Asp Ala Phe Ser Trp Leu Gly His
400 405 410
CTA GAA GTA CTT GAC CTG GGC CTT AAT GAA ATT GGG CAA GAA CTC ACA 1344
Leu Glu Val Leu Asp Leu Gly Leu Asn Glu Ile Gly Gln Glu Leu Thr
415 420 425
GGC CAG GAA TGG AGA GGT CTA GAA AAT ATT TTC GAA ATC TAT CTT TCC 1392
Gly Gln Glu Trp Arg Gly Leu Glu Asn Ile Phe Glu Ile Tyr Leu Ser
430 435 440
TAC AAC AAG TAC CTG CAG CTG ACT AGG AAC TCC TTT GCC TTG GTC CCA 1440
Tyr Asn Lys Tyr Leu Gln Leu Thr Arg Asn Ser Phe Ala Leu Val Pro
445 450 455
AGC CTT CAA CGA CTG ATG CTC CGA AGG GTG GCC CTT AAA AAT GTG GAT 1488
Ser Leu Gln Arg Leu Met Leu Arg Arg Val Ala Leu Lys Asn Val Asp
460 465 470 475
AGC TCT CCT TCA CCA TTC CAG CCT CTT CGT AAC TTG ACC ATT CTG GAT 1536
Ser Ser Pro Ser Pro Phe Gln Pro Leu Arg Ash Leu Thr Ile Leu Asp
480 485 490
CTA AGC AAC AAC AAC ATA GCC AAC ATA AAT GAT GAC ATG TTG GAG GGT 1584
Leu Ser Asn Asn Asn Ile Ala Asn Ile Asn Asp Asp Met Leu Glu Gly
495 500 505
CTT GAG AAA CTA GAA ATT CTC GAT TTG CAG CAT AAC AAC TTA GCA CGG 1632
Leu Glu Lys Leu Glu Ile Leu Asp Leu Gln His Asn Asn Leu Ala Arg
510 515 520
CTC TGG AAA CAC GCA AAC CCT GGT GGT CCC ATT TAT TTC CTA AAG GGT 1680
Leu Trp Lys His Ala Asn Pro Gly Gly Pro Ile Tyr Phe Leu Lys Gly
525 530 535
CTG TCT CAC CTC CAC ATC CTT AAC TTG GAG TCC AAC GGC TTT GAC GAG 1728
Leu Ser His Leu His Ile Leu Asn Leu Glu Ser Asn Gly Phe Asp Glu
540 545 550 555
ATC CCA GTT GAG GTC TTC AAG GAT TTA TTT GAA CTA AAG ATC ATC GAT 1776
Ile Pro Val Glu Val Phe Lys Asp Leu Phe Glu Leu Lys Ile Ile Asp
560 555 570
TTA GGA TTG AAT AAT TTA AAC ACA CTT CCA GCA TCT GTC TTT AAT AAT 1824
Leu Gly Leu Asn Asn Leu Asn Thr Leu Pro Ala Ser Val Phe Asn Asn
575 580 585
CAG GTG TCT CTA AAG TCA TTG AAC CTT CAG AAG AAT CTC ATA ACA TCC 1872
Gln Val Ser Leu Lys Ser Leu Asn Leu Gln Lys Asn Leu Ile Thr Ser
590 595 600
GTT GAG AAG AAG GTT TTC GGG CCA GCT TTC AGG AAC CTG ACT GAG TTA 1920
Val Glu Lys Lys Val Phe Gly Pro Ala Phe Arg Asn Leu Thr Glu Leu
605 610 615
GAT ATG CGC TTT AAT CCC TTT GAT TGC ACG TGT GAA AGT ATT GCC TGG 1968
Asp Met Arg Phe Asn Pro Phe Asp Cys Thr Cys Glu Ser Ile Ala Trp
620 625 630 635
TTT GTT AAT TGG ATT AAC GAG ACC CAT ACC AAC ATC CCT GAG CTG TCA 2016
Phe Val Asn Trp Ile Asn Glu Thr His Thr Asn Ile Pro Glu Leu Ser
640 645 650
AGC CAC TAC CTT TGC AAC ACT CCA CCT CAC TAT CAT GGG TTC CCA GTG 2064
Ser His Tyr Leu Cys Asn Thr Pro Pro His Tyr His Gly Phe Pro Val
655 660 665
AGA CTT TTT GAT ACA TCA TCT TGC AAA GAC AGT GCC CCC TTT GAA CTC 2112
Arg Leu Phe Asp Thr Ser Ser Cys Lys Asp Ser Ala Pro Phe Glu Leu
670 675 680
TTT TTC ATG ATC AAT ACC AGT ATC CTG TTG ATT TTT ATC TTT ATT GTA 2160
Phe Phe Met Ile Asn Thr Ser Ile Leu Leu Ile Phe Ile Phe Ile Val
685 690 695
CTT CTC ATC CAC TTT GAG GGC TGG AGG ATA TCT TTT TAT TGG AAT GTT 2208
Leu Leu Ile His Phe Glu Gly Trp Arg Ile Ser Phe Tyr Trp Asn Val
700 705 710 715
TCA GTA CAT CGA GTT CTT GGT TTC AAA GAA ATA GAC AGA CAG ACA GAA 2256
Ser Val His Arg Val Leu Gly Phe Lys Glu Ile Asp Arg Gln Thr Glu
720 725 730
CAG TTT GAA TAT GCA GCA TAT ATA ATT CAT GCC TAT AAA GAT AAG GAT 2304
Gln Phe Glu Tyr Ala Ala Tyr Ile Ile His Ala Tyr Lys Asp Lys Asp
735 740 745
TGG GTC TGG GAA CAT TTC TCT TCA ATG GAA AAG GAA GAC CAA TCT CTC 2352
Trp Val Trp Glu His Phe Ser Ser Met Glu Lys Glu Asp Gln Ser Leu
750 755 760
AAA TTT TGT CTG GAA GAA AGG GAC TTT GAG GCG GGT GTT TTT GAA CTA 2400
Lys Phe Cys Leu Glu Glu Arg Asp Phe Glu Ala Gly Val Phe Glu Leu
765 770 775
GAA GCA ATT GTT AAC AGC ATC AAA AGA AGC AGA AAA ATT ATT TTT GTT 2448
Glu Ala Ile Val Asn Ser Ile Lys Arg Ser Arg Lys Ile Ile Phe Val
780 785 790 795
ATA ACA CAC CAT CTA TTA AAA GAC CCA TTA TGC AAA AGA TTC AAG GTA 2496
Ile Thr His His Leu Leu Lys Asp Pro Leu Cys Lys Arg Phe Lys Val
800 805 810
CAT CAT GCA GTT CAA CAA GCT ATT GAA CAA AAT CTG GAT TCC ATT ATA 2544
His His Ala Val Gln Gln Ala Ile Glu Gln Asn Leu Asp Ser Ile Ile
815 820 825
TTG GTT TTC CTT GAG GAG ATT CCA GAT TAT AAA CTG AAC CAT GCA CTC 2592
Leu Val Phe Leu Glu Glu Ile Pro Asp Tyr Lys Leu Asn His Ala Leu
830 835 840
TGT TTG CGA AGA GGA ATG TTT AAA TCT CAC TGC ATC TTG AAC TGG CCA 2640
Cys Leu Arg Arg Gly Met Phe Lys Ser His Cys Ile Leu Asn Trp Pro
845 850 855
GTT CAG AAA GAA CGG ATA GGT GCC TTT CGT CAT AAA TTG CAA GTA GCA 2688
Val Gln Lys Glu Arg Ile Gly Ala Phe Arg His Lys Leu Gln Val Ala
860 865 870 875
CTT GGA TCC AAA AAC TCT GTA CAT TAA 2715
Leu Gly Ser Lys Asn Ser Val His
880
MRQTLPCIYFWGGLLPFGMLCASSTTKCTVSHEVADCSHLKLTQVPDDLPTNITVLNLTHNQLRRLPAANFTRYS
QLTSLDVGFNTISKLEPELCQKLPMLKVLNLQHNELSQLSDKTFAFCTNLTELHLMSNSIQKIKNNPFVKQKNLI
TLDLSHMGLSSTKLGTQVQLENLQELLLSNNKIQALKSEELDIFANSSLKKLELSSNQIKEFSPGCFHAIGRLFG
LFLNNVQLGPSLTEKLCLELANTSIRNLSLSNSQLSTTSNTTFLGLKWTNLTMLDLSYNNLNVVGNDSFAWLPQL
EYFFLEYNNIQHLPSHSLHGLFNVRYLNLKRSFTKQSISLASLPKIDDFSFQWLKCLEHLNMEDNDIPGIKSNMF
TGLINLKYLSLSNSFTSLRTLTNETFVSLAHSPLHILNLTKNKISKIESDAFSWLGHLEVLDLGLNEIGQELTGQ
EWRGLENIFEIYLSYNKYLQLTRNSFALVPSLQRLMLRRVALKNVDSSPSPFQPLRNLTILDLSNNNIANINDDM
LEGLEKLEILDLQHNNLARLWKHANPGGPIYFLKGLSHLHILNLESNGFDEIPVEVFKDLFELKIIDLGLNNLNT
LPASVFNNQVSLKSLNLQKNLITSVEKKVFGPAFRNLTELDMRFNPFDCTCESIAWFVNWINETHTNIPELSSHY
LCNTPPHYHGFPVRLFDTSSCKDSAPFELFFMINTSILLIFIFIVLLIHFEGWRISFYWNVSVHRVLGFKEIDRQ
TEQFEYAAYIIHAYKDKDWVWEHFSSMEKEDQSLKFCLEERDPEAGVFELEAIVNSIKRSRKIIFVITHHLLKDP
LCKRFKVHHAVQQAIEQNLDSIILVFLEEIPDYKLNHALCLRRGMFKSHCILNWPVQKERIGAFRHKLQVALGSK
NSVH
表4:诸如灵长类动物或人等哺乳动物的DNAX Toll样受体4(DTLR4)的核苷酸序列和氨基酸序列(见SEQ ID NO:7和8)。
ATG GAG CTG AAT TTC TAC AAA ATC CCC GAC AAC CTC CCC TTC TCA ACC 48
Met Glu Leu Asn Phe Tyr Lys Ile Pro Asp Asn Leu Pro Phe Ser Thr
1 5 10 15
AAG AAC CTG GAC CTG AGC TTT AAT CCC CTG AGG CAT TTA GGC AGC TAT 96
Lys Asn Leu Asp Leu Ser Phe Asn Pro Leu Arg His Leu Gly Ser Tyr
20 25 30
AGC TTC TTC AGT TTC CCA GAA CTG CAG GTG CTG GAT TTA TCC AGG TGT 144
Ser Phe Phe Ser Phe Pro Glu Leu Gln Val Leu Asp Leu Ser Arg Cys
35 40 45
GAA ATC CAG ACA ATT GAA GAT GGG GCA TAT CAG AGC CTA AGC CAC CTC 192
Glu Ile Gln Thr Ile Glu Asp Gly Ala Tyr Gln Ser Leu Ser His Leu
50 55 60
TCT ACC TTA ATA TTG ACA GGA AAC CCC ATC CAG AGT TTA GCC CTG GGA 240
Ser Thr Leu Ile Leu Thr Gly Asn Pro Ile Gln Ser Leu Ala Leu Gly
65 70 75 80
GCC TTT TCT GGA CTA TCA AGT TTA CAG AAG CTG GTG GCT GTG GAG ACA 288
Ala Phe Ser Gly Leu Ser Ser Leu Gln Lys Leu Val Ala Val Glu Thr
85 90 95
AAT CTA GCA TCT CTA GAG AAC TTC CCC ATT GGA CAT CTC AAA ACT TTG 336
Asn Leu Ala Ser Leu Glu Asn Phe Pro Ile Gly His Leu Lys Thr Leu
100 105 110
AAA GAA CTT AAT GTG GCT CAC AAT CTT ATC CAA TCT TTC AAA TTA CCT 384
Lys Glu Leu Asn Val Ala His Asn Leu Ile Gln Ser Phe Lys Leu Pro
115 120 125
GAG TAT TTT TCT AAT CTG ACC AAT CTA GAG CAC TTG GAC CTT TCC AGC 432
Glu Tyr Phe Ser Asn Leu Thr Asn Leu Glu His Leu Asp Leu Ser Ser
130 135 140
AAC AAG ATT CAA AGT ATT TAT TGC ACA GAC TTG CGG GTT CTA CAT CAA 480
Asn Lys Ile Gln Ser Ile Tyr Cys Thr Asp Leu Arg Val Leu His Gln
145 150 155 160
ATG CCC CTA CTC AAT CTC TCT TTA GAC CTG TCC CTG AAC CCT ATG AAC 528
Met Pro Leu Leu Asn Leu Ser Leu Asp Leu Ser Leu Asn Pro Met Asn
165 170 175
TTT ATC CAA CCA GGT GCA TTT AAA GAA ATT AGG CTT CAT AAG CTG ACT 576
Phe Ile Gln Pro Gly Ala Phe Lys Glu Ile Arg Leu His Lys Leu Thr
180 185 190
TTA AGA AAT AAT TTT GAT AGT TTA AAT GTA ATG AAA ACT TGT ATT CAA 624
Leu Arg Asn Asn Phe Asp Ser Leu Asn Val Met Lys Thr Cys Ile Gln
195 200 205
GGT CTG GCT GGT TTA GAA GTC CAT CGT TTG GIT CTG GGA GAA TTT AGA 672
Gly Leu Ala Gly Leu Glu Val His Arg Leu Val Leu Gly Glu Phe Arg
210 215 220
AAT GAA GGA AAC TTG GAA AAG TTT GAC AAA TCT GCT CTA GAG GGC CTG 720
Asn Glu Gly Asn Lau Glu Lys Phe Asp Lys Ser Ala Leu Glu Gly Leu
225 230 235 240
TGC AAT TTG ACC ATT GAA GAA TTC CGA TTA GCA TAC TTA GAC TAC TAC 768
Cys Asn Leu Thr Ile Glu Glu Phe Arg Leu Ala Tyr Leu Asp Tyr Tyr
245 250 255
CTC GAT GAT ATT ATT GAC TTA TTT AAT TGT TTG ACA AAT GTT TCT TCA 816
Leu Asp Asp Ile Ile Asp Leu Phe Asn Cys Leu Thr Asn Val Ser Ser
260 265 270
TTT TCC CTG GTG AGT GTG ACT ATT GAA AGG GTA AAA GAC TTT TCT TAT 864
Phe Ser Leu Val Ser Val Thr Ile Glu Arg Val Lys Asp Phe Ser Tyr
275 280 285
AAT TTC GGA TGG CAA CAT TTA GAA TTA GTT AAC TGT AAA TTT GGA CAG 912
Asn Phe Gly Trp Gln His Leu Glu Leu Val Asn Cys Lys Phe Gly Gln
290 295 300
TTT CCC ACA TTG AAA CTC AAA TCT CTC AAA AGG CTT ACT TTC ACT TCC 960
Phe Pro Thr Leu Lys Leu Lys Ser Leu Lys Arg Leu Thr Phe Thr Ser
305 310 315 320
AAC AAA GGT GGG AAT GCT TTT TCA GAA GTT GAT CTA CCA AGC CTT GAG 1008
Asn Lys Gly Gly Asn Ala Phe Ser Glu Val Asp Leu Pro Ser Leu Glu
325 330 335
TTT CTA GAT CTC AGT AGA AAT GGC TTG AGT TTC AAA GGT TGC TGT TCT 1056
Phe Leu Asp Leu Ser Arg Asn Gly Leu Ser Phe Lys Gly Cys Cys Ser
340 345 350
CAA AGT GAT TTT GGG ACA ACC AGC CTA AAG TAT TTA GAT CTG AGC TTC 1104
Gln Ser Asp Phe Gly Thr Thr Ser Leu Lys Tyr Leu Asp Leu Ser Phe
355 360 365
AAT GGT GTT ATT ACC ATG AGT TCA AAC TTC TTG GGC TTA GAA CAA CTA 1152
Asn Gly Val Ile Thr Met Ser Ser Asn Phe Leu Gly Leu Glu Gln Leu
370 375 380
GAA CAT CTG GAT TTC CAG CAT TCC AAT TTG AAA CAA ATG AGT GAG TTT 1200
Glu His Leu Asp Phe Gln His Ser Asn Leu Lys Gln Met Ser Glu Phe
385 390 395 400
TCA GTA TTC CTA TCA CTC AGA AAC CTC ATT TAC CTT GAC ATT TCT CAT 1248
Ser Val Phe Leu Ser Leu Arg Asn Leu Ile Tyr Leu Asp Ile Ser His
405 410 415
ACT CAC ACC AGA GTT GCT TTC AAT GGC ATC TTC AAT GGC TTG TCC AGT 1296
Thr His Thr Arg Val Ala Phe Asn Gly Ile Phe Asn Gly Leu Ser Ser
420 426 430
CTC GAA GTC TTG AAA ATG GCT GGC AAT TCT TTC CAG GAA AAC TTC CTT 1344
Leu Glu Val Leu Lys Met Ala Gly Asn Ser Phe Gln Glu Asn Phe Leu
435 440 445
CCA GAT ATC TTC ACA GAG CTG AGA AAC TTG ACC TTC CTG GAC CTC TCT 1392
Pro Asp Ile Phe Thr Glu Leu Arg Asn Leu Thr Phe Leu Asp Leu Ser
450 455 460
CAG TGT CAA CTG GAG CAG TTG TCT CCA ACA GCA TTT AAC TCA CTC TCC 1440
Gln Cys Gln Leu Glu Gln Leu Ser Pro Thr Ala Phe Asn Ser Leu Ser
465 470 475 480
AGT CTT CAG GTA CTA AAT ATG AGC CAC AAC AAC TTC TTT TCA TTG GAT 1488
Ser Leu Gln Val Leu Asn Met Ser His Asn Asn Phe Phe Ser Leu Asp
485 490 495
ACG TTT CCT TAT AAG TGT CTG AAC TCC CTC CAG GTT CTT GAT TAC AGT 1536
Thr Phe Pro Tyr Lys Cys Leu Asn Ser Leu Gln Val Leu Asp Tyr Ser
500 505 510
CTC AAT CAC ATA ATG ACT TCC AAA AAA CAG GAA CTA CAG CAT TTT CCA 1584
Leu Asn His Ile Met Thr Ser Lys Lys Gln Glu Leu Gln His Phe Pro
515 520 525
AGT AGT CTA GCT TTC TTA AAT CTT ACT CAG AAT GAC TTT GCT TGT ACT 1632
Ser Ser Leu Ala Phe Leu Asn Leu Thr Gln Asn Asp Phe Ala Cys Thr
530 535 540
TGT GAA CAC CAG AGT TTC CTG CAA TGG ATC AAG GAC CAG AGG CAG CTC 1680
Cys Glu His Gln Ser Phe Leu Gln Trp Ile Lys Asp Gln Arg Gln Leu
545 550 555 560
TTG GTG GAA GTT GAA CGA ATG GAA TGT GCA ACA CCT TCA GAT AAG CAG 1728
Leu Val Glu Val Glu Arg Met Glu Cys Ala Thr Pro Ser Asp Lys Gln
565 570 575
GGC ATG CCT GTG CTG AGT TTG AAT ATC ACC TGT CAG ATG AAT AAG ACC 1776
Gly Met Pro Val Leu Ser Leu Asn Ile Thr Cys Gln Met Asn Lys Thr
580 585 590
ATC ATT GGT GTG TCG GTC CTC AGT GTG CTT GTA GTA TCT GTT GTA GCA 1824
Ile Ile Gly Val Ser Val Leu Ser Val Leu Val Val Ser Val Val Ala
595 600 605
GTT CTG GTC TAT AAG TTC TAT TTT CAC CTG ATG CTT CTT GCT GGC TGC 1872
Val Leu Val Tyr Lys Phe Tyr Phe His Leu Met Leu Leu Ala Gly Cys
610 615 620
ATA AAG TAT GGT AGA GGT GAA AAC ATC TAT GAT GCC TTT GTT ATC TAC 1920
Ile Lys Tyr Gly Arg Gly Glu Asn Ile Tyr Asp Ala Phe Val Ile Tyr
625 630 635 640
TCA AGC CAG GAT GAG GAC TGG GTA AGG AAT GAG CTA GTA AAG AAT TTA 1968
Ser Ser Gln Asp Glu Asp Trp Val Arg Asn Glu Leu Val Lys Asn Leu
645 650 655
GAA GAA GGG GTG CCT CCA TTT CAG CTC TGC CTT CAC TAC AGA GAC TTT 2016
Glu Glu Gly Val Pro Pro Phe Gln Leu Cys Leu His Tyr Arg Asp Phe
660 665 670
ATT CCC GGT GTG GCC ATT GCT GCC AAC ATC ATC CAT GAA GGT TTC CAT 2064
Ile Pro Gly Val Ala Ile Ala Ala Asn Ile Ile His Glu Gly Phe His
675 680 685
AAA AGC CGA AAG GTG ATT GTT GTG GTG TCC CAG CAC TTC ATC CAG AGC 2112
Lys Ser Arg Lys Val Ile Val Val Val Ser Gln His Phe Ile Gln Ser
690 695 700
CGC TGG TGT ATC TTT GAA TAT GAG ATT GCT CAG ACC TGG CAG TTT CTG 2160
Arg Trp Cys Ile Phe Glu Tyr Glu Ile Ala Gln Thr Trp Gln Phe Leu
705 710 715 720
AGC AGT CGT GCT GGT ATC ATC TTC ATT GTC CTG CAG AAG GTG GAG AAG 2208
Ser Ser Arg Ala Gly Ile Ile Phe Ile Val Leu Gln Lys Val Glu Lys
725 730 735
ACC CTG CTC AGG CAG CAG GTG GAG CTG TAC CGC CTT CTC AGC AGG AAC 2256
Thr Leu Leu Arg Gln Gln Val Glu Leu Tyr Arg Leu Leu Ser Arg Asn
740 745 750
ACT TAC CTG GAG TGG GAG GAC AGT GTC CTG GGG CGG CAC ATC TTC TGG 2304
Thr Tyr Leu Glu Trp Glu Asp Ser Val Leu Gly Arg His Ile Phe Trp
755 760 765
AGA CGA CTC AGA AAA GCC CTG CTG GAT GGT AAA TCA TGG AAT CCA GAA 2352
Arg Arg Leu Arg Lys Ala Leu Leu Asp Gly Lys Ser Trp Asn Pro Glu
770 775 780
GGA ACA GTG GGT ACA GGA TGC AAT TGG CAG GAA GCA ACA TCT ATC 2397
Gly Thr Val Gly Thr Gly Cys Asn Trp Gln Glu Ala Thr Ser Ile
785 790 795
TGA 2400
MELNFYKIPDNLPFSTKNLDLSFNPLRHLGSYSFFSFPELQVLDLSRCEIQTIEDGAYQSLSHLSTLILTGNP
IQSLALGAFSGLSSLQKLVAVETNLASLENFPIGHLKTLKELNVAHNLIQSFKLPEYFSNLTNLEHLDLSSNK
IQSIYCTDLRVLHQMPLLNLSLDLSLNPMNFIQPGAFKEIRLHKLTLRNNFDSLNVMKTCIQGLAGLEVHRLV
LGEFRNEGNLEKFDKSALEGLCNLTIEEFRLAYLDYYLDDIIDLFNCLTNVSSFSLVSVTIERVKDESYNFGW
QHLELVNCKFGQFPTLKLKSLKRLTFTSNKGGNAFSEVDLPSLEFLDLSRNGLSFKGCCSQSDFGTTSLKYLD
LSFNGVITMSSNFLGLEQLEHLDFQHSNLKQMSEFSVFLSLRNLIYLDISHTHTRYAFNGIFNGLSSLEVLKM
AGNSFQENFLPDIFTELRNLTFLDLSQCQLEQLSPTAFNSLSSLQVLNMSHNNFFSLDTFPYKCLNSLQVLDY
SLNHIMTSKKQELQHFPSSLAFLNLTQNDFACTCEHQSFLQWIKDQRQLLVEVERMECATPSDKQGMPVLSLN
ITCQMNKTIIGVSVLSVLVVSVVAVLVYKFYFHLMLLAGCIKYGRGENIYDAFVIYSSQDEDWVRNELVKNLE
EGVPPFQLCLHYRDFIPGVAIAANIIHEGFHKSRKVIVVVSQHFIQSRWCIFEYEIAQTWQFLSSRAGIIFIV
LQKVEKTLLRQQVELYRLLSRNTYLEWEDSVLGRHIFWRRLRKALLDGKSWNPEGTVGTGCNWQEATSI
诸如人等灵长类动物的DTLR4的附加序列(SEQ ID NQ:25和26);注意用A表示的第81、3144、3205和3563位核苷酸的每个都可以是A、C、G或T;用G表叔的第3132、3532、3538和3553位核苷酸的每个都可以是G或T;用A表示的第3638位核苷酸可以是A或T;并且用C表示的第3677、3685和3736位核苷酸的每个都可以是A或C:
AAAATACTCC CTTGCCTCAA AAACTGCTCG GTCAAACGGT GATAGCAAAC CACGCATTCA 60
CAGGGCCACT GCTGCTCACA AAACCAGTGA GGATGATGCC AGGATG ATG TCT GCC 115
Met Ser Ala
-22 -20
TCG CGC CTG GCT GGG ACT CTG ATC CCA GCC ATG GCC TTC CTC TCC TGC 163
Ser Arg Leu Ala Gly Thr Leu Ile Pro Ala Met Ala Phe Leu Ser Cys
-15 -10 -5
GTG AGA CCA GAA AGC TGG GAG CCC TGC GTG GAG GTT CCT AAT ATT ACT 211
Val Arg Pro Glu Ser Trp Glu Pro Cys Val Glu Val Pro Asn Ile Thr
1 5 10
TAT CAA TGC ATG GAG CTG AAT TTC TAC AAA ATC CCC GAC AAC CTC CCC 259
Tyr Gln Cys Met Glu Leu Asn Phe Tyr Lys Ile Pro Asp Asn Leu Pro
15 20 25
TTC TCA ACC AAG AAC CTG GAC CTG AGC TTT AAT CCC CTG AGG CAT TTA 307
Phe Ser Thr Lys Asn Leu Asp Leu Ser Phe Asn Pro Leu Arg His Leu
30 35 40 45
GGC AGC TAT AGC TTC TTC AGT TTC CCA GAA CTG CAG GTG CTG GAT TTA 355
Gly Ser Tyr Ser Phe Phe Ser Phe Pro Glu Leu Gln Val Leu Asp Leu
50 55 60
TCC AGG TGT GAA ATC CAG ACA ATT GAA GAT GGG GCA TAT CAG AGC CTA 403
Ser Arg Cys Glu Ile Gln Thr Ile Glu Asp Gly Ala Tyr Gln Ser Leu
65 70 75
AGC CAC CTC TCT ACC TTA ATA TTG ACA GGA AAC CCC ATC CAG AGT TTA 451
Ser His Leu Ser Thr Leu Ile Leu Thr Gly Asn Pro Ile Gln Ser Leu
80 85 90
GCC CTG GGA GCC TTT TCT GGA CTA TCA AGT TTA CAG AAG CTG GTG GCT 499
Ala Leu Gly Ala Phe Ser Gly Leu Ser Ser Leu Gln Lys Leu Val Ala
95 100 105
GTG GAG ACA AAT CTA GCA TCT CTA GAG AAC TTC CCC ATT GGA CAT CTC 547
Val Glu Thr Asn Leu Ala Ser Leu Glu Asn Phe Pro Ile Gly His Leu
110 115 220 125
AAA ACT TTG AAA GAA CTT AAT GTG GCT CAC AAT CTT ATC CAA TCT TTC 595
Lys Thr Leu Lys Glu Leu Asn Val Ala His Asn Leu Ile Gln Ser Phe
130 135 140
AAA TTA CCT GAG TAT TTT TCT AAT CTG ACC AAT CTA GAG CAC TTG GAC 643
Lys Leu Pro Glu Tyr Phe Ser Asn Leu Thr Asn Leu Glu His Leu Asp
145 150 155
CTT TCC AGC AAC AAG ATT CAA AGT ATT TAT TGC ACA GAC TTG CGG GTT 691
Leu Ser Ser Asn Lys Ile Gln Ser Ile Tyr Cys Thr Asp Leu Arg Val
160 165 170
CTA CAT CAA ATG CCC CTA CTC AAT CTC TCT TTA GAC CTG TCC CTG AAC 739
Leu His Gln Met Pro Leu Leu Asn Leu Ser Leu Asp Leu Ser Leu Asn
175 180 185
CCT ATG AAC TTT ATC CAA CCA GGT GCA TTT AAA GAA ATT AGG CTT CAT 787
Pro Met Asn Phe Ile Gln Pro Gly Ala Phe Lys Glu Ile Arg Leu His
190 195 200 205
AAG CTG ACT TTA AGA AAT AAT TTT GAT AGT TTA AAT GTA ATG AAA ACT 835
Lys Leu Thr Leu Arg Asn Asn Phe Asp Ser Leu Asn Val Met Lys Thr
210 215 220
TGT ATT CAA GGT CTG GCT GGT TTA GAA GTC CAT CGT TTG GTT CTG GGA 883
Cys Ile Gln Gly Leu Ala Gly Leu Glu Val His Arg Leu Val Leu Gly
225 230 235
GAA TTT AGA AAT GAA GGA AAC TTG GAA AAG TTT GAC AAA TCT GCT CTA 931
Glu Phe Arg Asn Glu Gly Asn Leu Glu Lys Phe Asp Lys Ser Ala Leu
240 245 250
GAG GGC CTG TGC AAT TTG ACC ATT GAA GAA TTC CGA TTA GCA TAC TTA 979
Glu Gly Leu Cys Asn Leu Thr Ile Glu Glu Phe Arg Leu Ala Tyr Leu
255 260 265
GAC TAC TAC CTC GAT GAT ATT ATT GAC TTA TTT AAT TGT TTG ACA AAT 1027
Asp Tyr Tyr Leu Asp Asp Ile Ile Asp Leu Phe Asn Cys Leu Thr Asn
270 275 280 285
GTT TCT TCA TTT TCC CTG GTG AGT GTG ACT ATT GAA AGG GTA AAA GAC 1075
Val Ser Ser Phe Ser Leu Val Ser Val Thr Ile Glu Arg Val Lys Asp
290 295 300
TTT TCT TAT AAT TTC GGA TGG CAA CAT TTA GAA TTA GTT AAC TGT AAA 1123
Phe Ser Tyr Asn Phe Gly Trp Gln His Leu Glu Leu Val Asn Cys Lys
305 310 315
TTT GGA CAG TTT CCC ACA TTG AAA CTC AAA TCT CTC AAA AGG CTT ACT 1171
Phe Gly Gln Phe Pro Thr Leu Lys Leu Lys Ser Leu Lys Arg Leu Thr
320 325 330
TTC ACT TCC AAC AAA GGT GGG AAT GCT TTT TCA GAA GTT GAT CTA CCA 1219
Phe Thr Ser Asn Lys Gly Gly Asn Ala Phe Ser Glu Val Asp Leu Pro
335 340 345
AGC CTT GAG TTT CTA GAT CTC AGT AGA AAT GGC TTG AGT TTC AAA GGT 1267
Ser Leu Glu Phe Leu Asp Leu Ser Arg Asn Gly Leu Ser Phe Lys Gly
350 355 360 365
TGC TGT TCT CAA AGT GAT TTT GGG ACA ACC AGC CTA AAG TAT TTA GAT 1315
Cys Cys Ser Gln Ser Asp Phe Gly Thr Thr Ser Leu Lys Tyr Leu Asp
370 375 380
CTG AGC TTC AAT GGT GTT ATT ACC ATG AGT TCA AAC TTC TTG GGC TTA 1363
Leu Ser Phe Asn Gly Val Ile Thr Met Ser Ser Asn Phe Leu Gly Leu
385 390 395
GAA CAA CTA GAA CAT CTG GAT TTC CAG CAT TCC AAT TTG AAA CAA ATG 1411
Glu Gln Leu Glu His Leu Asp Phe Gln His Ser Asn Leu Lys Gln Met
400 405 410
AGT GAG TTT TCA GTA TTC CTA TCA CTC AGA AAC CTC ATT TAC CTT GAC 1459
Ser Glu Phe Ser Val Phe Leu Ser Leu Arg Asn Leu Ile Tyr Leu Asp
415 420 425
ATT TCT CAT ACT CAC ACC AGA GTT GCT TTC AAT GGC ATC TTC AAT GGC 1507
Ile Ser His Thr His Thr Arg Val Als Phe Asn Gly Ile Phe Asn Gly
430 435 440 445
TTG TCC AGT CTC GAA GTC TTG AAA ATG GCT GGC AAT TCT TTC CAG GAA 1555
Leu Ser Ser Leu Glu Val Leu Lys Met Ala Gly Asn Ser Phe Gln Glu
450 455 460
AAC TTC CTT CCA GAT ATC TTC ACA GAG CTG AGA AAC TTG ACC TTC CTG 1603
Asn Phe Leu Pro Asp Ile Phe Thr Glu Leu Arg Asn Leu Thr Phe Leu
465 470 475
GAC CTC TCT CAG TGT CAA CTG GAG CAG TTG TCT CCA ACA GCA TTT AAC 1651
Asp Leu Ser Gln Cys Gln Leu Glu Gln Leu Ser Pro Thr Ala Phe Asn
480 485 490
TCA CTC TCC AGT CTT CAG GTA CTA AAT ATG AGC CAC AAC AAC TTC TTT 1699
Ser Leu Ser Ser Leu Gln Val Leu Asn Met Ser His Asn Asn Phe Phe
495 500 505
TCA TTG GAT ACG TTT CCT TAT AAG TGT CTG AAC TCC CTC CAG GTT CTT 1747
Ser Leu Asp Thr Phe Pro Tyr Lys Cys Leu Asn Ser Leu Gln Val Leu
510 515 520 525
GAT TAC AGT CTC AAT CAC ATA ATG ACT TCC AAA AAA CAG GAA CTA CAG 1795
Asp Tyr Ser Leu Asn His Ile Met Thr Ser Lys Lys Gln Glu Leu Gln
530 535 540
CAT TTT CCA AGT AGT CTA GCT TTC TTA AAT CTT ACT CAG AAT GAC TTT 1843
His Phe Pro Ser Ser Leu Ala Phe Leu Asn Leu Thr Gln Asn Asp Phe
545 550 555
GCT TGT ACT TGT GAA CAC CAG AGT TTC CTG CAA TGG ATC AAG GAC CAG 1891
Ala Cys Thr Cys Glu His Gln Ser Phe Leu Gln Trp Ile Lys Asp Gln
560 565 570
AGG CAG CTC TTG GTG GAA GTT GAA CGA ATG GAA TGT GCA ACA CCT TCA 1939
Arg Gln Leu Leu Val Glu Val Glu Arg Met Glu Cys Ala Thr Pro Ser
575 580 585
GAT AAG CAG GGC ATG CCT GTG CTG AGT TTG AAT ATC ACC TGT CAG ATG 1987
Asp Lys Gln Gly Met Pro Val Leu Ser Leu Asn Ile Thr Cys Gln Met
590 595 600 605
AAT AAG ACC ATC ATT GGT GTG TCG GTC CTC AGT GTG CTT GTA GTA TCT 2035
Asn Lys Thr Ile Ile Gly Val Ser Val Leu Ser Val Leu Val Val Ser
610 615 620
GTT GTA GCA GTT CTG GTC TAT AAG TTC TAT TTT CAC CTG ATG CTT CTT 2083
Val Val Ala Val Leu Val Tyr Lys Phe Tyr Phe His Leu Met Leu Leu
625 630 635
GCT GGC TGC ATA AAG TAT GGT AGA GGT GAA AAC ATC TAT GAT GCC TTT 2131
Ala Gly Cys Ile Lys Tyr Gly Arg Gly Glu Asn Ile Tyr Asp Ala Phe
640 645 650
GTT ATC TAC TCA AGC CAG GAT GAG GAC TGG GTA AGG AAT GAG CTA GTA 2179
Val Ile Tyr Ser Ser Gln Asp Glu Asp Trp Val Arg Asn Glu Leu Val
655 660 665
AAG AAT TTA GAA GAA GGG GTG CCT CCA TTT CAG CTC TGC CTT CAC TAC 2227
Lys Asn Leu Glu Glu Gly Val Pro Pro Phe Gln Leu Cys Leu His Tyr
670 675 680 685
AGA GAC TTT ATT CCC GGT GTG GCC ATT GCT GCC AAC ATC ATC CAT GAA 2275
Arg Asp Phe Ile Pro Gly Val Ala Ile Ala Ala Asn Ile Ile His Glu
690 695 700
GGT TTC CAT AAA AGC CGA AAG GTG ATT GTT GTG GTG TCC CAG CAC TTC 2323
Gly Phs His Lys Ser Arg Lys Val Ile Val Val Val Ser Gln His Phe
705 710 715
ATC CAG AGC CGC TGG TGT ATC TTT GAA TAT GAG ATT GCT CAG ACC TGG 2371
Ile Gln Ser Arg Trp Cys Ile Phe Glu Tyr Glu Ile Ala Gln Thr Trp
720 725 730
CAG TTT CTG AGC AGT CGT GCT GGT ATC ATC TTC ATT GTC CTG CAG AAG 2419
Gln Phe Leu Ser Ser Arg Ala Gly Ile Ile Phe Ile Val Leu Gln Lys
735 740 745
GTG GAG AAG ACC CTG CTC AGG CAG CAG GTG GAG CTG TAC CGC CTT CTC 2467
Val Glu Lys Thr Leu Leu Arg Gln Gln Val Glu Leu Tyr Arg Leu Leu
750 755 760 765
AGC AGG AAC ACT TAC CTG GAG TGG GAG GAC AGT GTC CTG GGG CGG CAC 2515
Ser Arg Asn Thr Tyr Leu Glu Trp Glu Asp Ser Val Leu Gly Arg His
770 775 780
ATC TTC TGG AGA CGA CTC AGA AAA GCC CTG CTG GAT GGT AAA TCA TGG 2563
Ile Phe Trp Arg Arg Leu Arg Lys Ala Leu Leu Asp Gly Lys Ser Trp
785 790 795
AAT CCA GAA GGA ACA GTG GGT ACA GGA TGC AAT TGG CAG GAA GCA ACA 2611
Asn Pro Glu Gly Thr Val Gly Thr Gly Cys Asn Trp Gln Glu Ala Thr
800 805 810
TCT ATC TGAAGAGGAA AAATAAAAAC CTCCTGAGGC ATTTCTTGCC CAGCTGGGTC 2667
Ser Ile
815
CAACACTTGT TCAGTTAATA AGTATTAAAT GCTGCCACAT GTCAGGCCTT ATGCTAAGGG 2727
TGAGTAATTC CATGGTGCAC TAGATATGCA GGGCTGCTAA TCTCAAGGAG CTTCCAGTGC 2787
AGAGGGAATA AATGCTAGAC TAAAATACAG AGTCTTCCAG GTGGGCATTT CAACCAACTC 2847
AGTCAAGGAA CCCATGACAA AGAAAGTCAT TTCAACTCTT ACCTCATCAA GTTGAATAAA 2907
GACAGAGAAA ACAGAAAGAG ACATTGTTCT TTTCCTGAGT CTTTTGAATG GAAATTGTAT 2967
TATGTTATAG CCATCATAAA ACCATTTTGG TAGTTTTGAC TGAACTGGGT GTTCACTTTT 3027
TCCTTTTTGA TTGAATACAA TTTAAATTCT ACTTGATGAC TGCAGTCGTC AAGGGGCTCC 3087
TGATGCAAGA TGCCCCTTCC ATTTTAAGTC TGTCTCCTTA CAGAGGTTAA AGTCTAATGG 3147
CTAATTCCTA AGGAAACCTG ATTAACACAT GCTCACAACC ATCCTGGTCA TTCTCGAACA 3207
TGTTCTATTT TTTAACTAAT CACCCCTGAT ATATTTTTAT TTTTATATAT CCAGTTTTCA 3267
TTTTTTTACG TCTTGCCTAT AAGCTAATAT CATAAATAAG GTTGTTTAAG ACGTGCTTCA 3327
AATATCCATA TTAACCACTA TTTTTCAAGG AAGTATGGAA AAGTACACTC TGTCACTTTG 3387
TCACTCGATG TCATTCCAAA GTTATTGCCT ACTAAGTAAT GACTGTCATG AAAGCAGCAT 3447
TGAAATAATT TGTTTAAAGG GGGCACTCTT TTAAACGGGA AGAAAATTTC CGCTTCCTGG 3507
TCTTATCATG GACAATTTGG GCTAGAGGCA GGAAGGAAGT GGGATGACCT CAGGAAGTCA 3567
CCTTTTCTTG ATTCCAGAAA CATATGGGCT GATAAACCCG GGGTGACCTC ATGAAATGAG 3627
TTGCAGCAGA AGTTTATTTT TTTCAGAACA AGTGATGTTT GATGGACCTC TGAATCTCTT 3687
TAGGGAGACA CAGATGGCTG GGATCCCTCC CCTGTACCCT TCTCACTGCC AGGAGAACTA 3747
CGTGTGAAGG TATTCAAGGC AGGGAGTATA CATTGCTGTT TCCTGTTGGG CAATGCTCCT 3807
TGACCACATT TTGGGAAGAG TGGATGTTAT CATTGAGAAA ACAATGTGTC TGGAATTAAT 3867
GGGGTTCTTA TAAAGAAGGT TCCCAGAAAA GAATGTTCAT TCCAGCTTCT TCAGGAAACA 3927
GGAACATTCA AGGAAAAGGA CAATCAGGAT GTCATCAGGG AAATGAAAAT AAAAACCACA 3987
ATGAGATATC ACCTTATACC AGGTAGATGG CTACTATAAA AAAATGAAGT GTCATCAAGG 4047
ATATAGAGAA ATTGGAACCC TTCTTCACTG CTGGAGGGAA TGGAAAATGG TGTAGCCGTT 4107
ATGAAAAACA GTACGGAGGT TTCTCAAAAA TTAAAAATAG AACTGCTATA TGATCCAGCA 4167
ATCTCACTTC TGTATATATA CCCAAAATAA TTGAAATCAG AATTTCAAGA AAATATTTAC 4227
ACTCCCATGT TCATTGTGGC ACTCTTCACA ATCACTGTTT CCAAAGTTAT GGAAACAACC 4287
CAAATTTCCA TTGGAAAATA AATGGACAAA GGAAATGTGC ATATAACGTA CAATGGGGAT 4347
ATTATTCAGC CTAAAAAAAG GGGGGATCCT GTTATTTATG ACAACATGAA TAAACCCGGA 4407
GGCCATTATG CTATGTAAAA TGAGCAAGTA ACAGAAAGAC AAATACTGCC TGATTTCATT 4467
TATATGAGGT TCTAAAATAG TCAAACTCAT AGAAGCAGAG AATAGAACAG TGGTTCCTAG 4527
GGAAAAGGAG GAAGGGAGAA ATGAGGAAAT AGGGAGTTGT CTAATTGGTA TAAAATTATA 4587
GTATGCAAGA TGAATTAGCT CTAAAGATCA GCTGTATAGC AGAGTTCGTA TAATGAACAA 4647
TACTGTATTA TGCACTTAAC ATTTTGTTAA GAGGGTACCT CTCATGTTAA GTGTTCTTAC 4707
CATATACATA TACACAAGGA AGCTTTTGGA GGTGATGGAT ATATTTATTA CCTTGATTGT 4767
GGTGATGGTT TGACAGGTAT GTGACTATGT CTAAACTCAT CAAATTGTAT ACATTAAATA 4827
TATGCAGTTT TATAATATCA AAAAAAAAAA AAAAAAAAAA 4865
MSASRLAGTLIPAMAFLSCVRPESWEPCVEVPNITYQCMELNFYKIPDNLPFSTKNLDLSFNPLRHLGSYSFFSF
PELQVLDLSRCEIQTIEDGAYQSLSHLSTLILTGNPIQSLALGAFSGLSSLQKLVAVETNLASLENFPIGHLKTL
KELNVAHNLIQSFKLPEYFSNLTNLEHLDLSSNKIQSIYCTDLRVLHQMPLLNLSLDLSLNPMNFIQPGAFKEIR
LHKLTLRNNFDSLNVMKTCIQGLAGLEVHRLVLGEFRNEGNLEKFDKSALEGLCNLTIEEFRLAYLDYYLDDIID
LFNCLTNVSSFSLVSVTIERVKDFSYNFGWQHLELVNCKFGQFPTLKLKSLKRLTFTSNKGGNAFSEVDLPSLEF
LDLSRNGLSFKGCCSQSDFGTTSLKYLDLSFNGVITMSSNFLGLEQLEHLDFQHSNLKQMSEFSVFLSLRNLIYL
DISHTHTRVAFNGIFNGLSSLEVLKMAGNSFQENFLPDIFTELRNLTFLDLSQCQLEQLSPTAFNSLSSLQVLNM
SHNNFFSLDTFPYKCLNSLQVLDYSLNHIMTSKKQELQHFPSSLAFLNLTQNDFACTCEHQSFLQWIKDQRQLLV
EVERMECATPSDKQGMPVLSLNITCQMNKTIIGVSVLSVLVVSVVAVLVYKFYEHLMLLAGCIKYGRGENIYDAF
VIYSSQDEDWVRNELVKNLEEGVPPFQLCLHYRDFIPGVAIAANIIHEGFHKSRKVIVVVSQHFIQSRWCIFEYE
IAQTWQFLSSRAGIIFIVLQKVEKTLLRQQVELYRLLSRNTYLEWEDSVLGRHIFWRRLRKALLDGKSWNPEGTV
GTGCNWQEATSI
表5:诸如灵长类动物或人等哺乳动物的DNAX Toll样受体5(DTLR5)的部分核苷酸序列和氨基酸序列(见SEQ ID NO:9和10)。
TGT TGG GAT GTT TTT GAG GGA CTT TCT CAT CTT CAA GTT CTG TAT TTG 48
Cys Trp Asp Val Phe Glu Gly Leu Ser His Leu Gln Val Leu Tyr Leu
1 5 10 15
AAT CAT AAC TAT CTT AAT TCC CTT CCA CCA GGA GTA TTT AGC CAT CTG 96
Asn His Asn Tyr Leu Asn Ser Leu Pro Pro Gly Val Phe Ser His Leu
20 25 30
ACT GCA TTA AGG GGA CTA AGC CTC AAC TCC AAC AGG CTG ACA GTT CTT 144
Thr Ala Leu Arg Gly Leu Ser Leu Asn Ser Asn Arg Leu Thr Val Leu
35 40 45
TCT CAC AAT GAT TTA CCT GCT AAT TTA GAG ATC CTG GAC ATA TCC AGG 192
Ser His Asn Asp Leu Pro Ala Asn Leu Glu Ile Leu Asp Ile Ser Arg
50 55 60
AAC CAG CTC CTA GCT CCT AAT CCT GAT GTA TTT GTA TCA CTT AGT GTC 240
Asn Gln Leu Leu Ala Pro Asn Pro Asp Val Phe Val Ser Leu Ser Val
65 70 75 80
TTG GAT ATA ACT CAT AAC AAG TTC ATT TGT GAA TGT GAA CTT AGC ACT 288
Leu Asp Ile Thr His Asn Lys Phe Ile Cys Glu Cys Glu Leu Ser Thr
85 90 95
TTT ATC AAT TGG CTT AAT CAC ACC AAT GTC ACT ATA GCT GGG CCT CCT 336
Phe Ile Asn Trp Leu Asn His Thr Asn Val Thr Ile Ala Gly Pro Pro
100 105 110
GCA GAC ATA TAT TGT GTG TAC CCT GAC TCG TTC TCT GGG GTT TCC CTC 384
Ala Asp Ile Tyr Cys Val Tyr Pro Asp Ser Phe Ser Gly Val Ser Leu
115 120 125
TTC TCT CTT TCC ACG GAA GGT TGT GAT GAA GAG GAA GTC TTA AAG TCC 432
Phe Ser Leu Ser Thr Glu Gly Cys Asp Glu Glu Glu Val Leu Lys Ser
130 135 140
CTA AAG TTC TCC CTT TTC ATT GTA TGC ACT GTC ACT CTG ACT CTG TTC 480
Leu Lys Phe Ser Leu Phe Ile Val Cys Thr Val Thr Leu Thr Leu Phe
145 150 155 160
CTC ATG ACC ATC CTC ACA GTC ACA AAG TTC CGG GGC TTC TGT TTT ATC 528
Leu Met Thr Ile Leu Thr Val Thr Lys Phe Arg Gly Phe Cys Phe Ile
165 170 175
TGT TAT AAG ACA GCC CAG AGA CTG GTG TTC AAG GAC CAT CCC CAG GGC 576
Cys Tyr Lys Thr Ala Gln Arg Leu Val Phe Lys Asp His Pro Gln Gly
180 185 190
ACA GAA CCT GAT ATG TAC AAA TAT GAT GCC TAT TTG TGC TTC AGC AGC 624
Thr Glu Pro Asp Met Tyr Lys Tyr Asp Ala Tyr Leu Cys Phe Ser Ser
195 200 205
AAA GAC TTC ACA TGG GTG CAG AAT GCT TTG CTC AAA CAC CTG GAC ACT 672
Lys Asp Phe Thr Trp Val Gln Asn Ala Leu Leu Lys His Leu Asp Thr
210 215 220
CAA TAC AGT GAC CAA AAC AGA TTC AAC CTG TGC TTT GAA GAA AGA GAC 720
Gln Tyr Ser Asp Gln Asn Arg Phe Asn Leu Cys Phe Glu Glu Arg Asp
225 230 235 240
TTT GTC CCA GGA GAA AAC CGC ATT GCC AAT ATC CAG GAT GCC ATC TGG 768
Phe Val Pro Gly Glu Asn Arg Ile Ala Asn Ile Gln Asp Ala Ile Trp
245 250 255
AAC AGT AGA AAG ATC GTT TGT CTT GTG AGC AGA CAC TTC CTT AGA GAT 816
Asn Ser Arg Lys Ile Val Cys Leu Val Ser Arg His Phe Leu Arg Asp
260 265 270
GGC TGG TGC CTT GAA GCC TTC AGT TAT GCC CAG GGC AGG TGC TTA TCT 864
Gly Trp Cys Leu Glu Ala Phe Ser Tyr Ala Gln Gly Arg Cys Leu Ser
275 280 285
GAC CTT AAC AGT GCT CTC ATC ATG GTG GTG GTT GGG TCC TTG TCC CAG 912
Asp Leu Asn Ser Ala Leu Ile Met Val Val Val Gly Ser Leu Ser Gln
290 295 300
TAC CAG TTG ATG AAA CAT CAA TCC ATC AGA GGC TTT GTA CAG AAA CAG 960
Tyr Gln Leu Met Lys His Gln Ser Ile Arg Gly Phe Val Gln Lys Gln
305 310 315 320
CAG TAT TTG AGG TGG CCT GAG GAT CTC CAG GAT GTT GGC TGG TTT CTT 1008
Gln Tyr Leu Arg Trp Pro Glu Asp Leu Gln Asp Val Gly Trp Phe Leu
323 330 335
CAT AAA CTC TCT CAA CAG ATA CTA AAG AAA GAA AAG GAA AAG AAG AAA 1056
His Lys Leu Ser Gln Gln Ile Leu Lys Lys Glu Lys Glu Lys Lys Lys
340 345 350
GAC AAT AAC ATT CCG TTG CAA ACT GTA GCA ACC ATC TCC TAATCAAAGG 1105
Asp Asn Asn Ile Pro Leu Gln Thr Val Ala Thr Ile Ser
355 360 365
AGCAATTTCC AACTTATCTC AAGCCACAAA TAACTCTTCA CTTTGTATTT GCACCAAGTT 1165
ATCATTTTGG GGTCCTCTCT GGAGGTTTTT TTTTTCTTTT TGCTACTATG AAAACAACAT 1225
AAATCTCTCA ATTTTCGTAT CAAAAAAAAA AAAAAAAAAA TGGCGGCCGC 1275
CWDVFEGLSHLQVLYLNHNYLNSLPPGVFSHLTALRGLSLNSNRLTVLSHNDLPANLEILDISRNQLLAPNPDVF
VSLSVLDITHNKFICECELSTFINWLNHTNVTIAGPPADIYCVYPDSFSGVSLFSLSTEGCDEEEVLKSLKFSLF
IVCTVTLTLFLMTILTVTKFRGFCFICYKTAQRLVFKDHPQGTEPDMYKYDAYLCESSKDFTWVQNALLKHLDTQ
YSDQNRFNLCFEERDFVPGENRIANIQDAIWNSRKIVCLVSRHFLRDGWCLEAFSYAQGRCLSDLNSALIMVVVG
SLSQYQLMKHQSIRGFVQKQQYLRWPEDLQDVGWFLHKLSQQILKKEKEKKKDNNIPLQTVATIS
表6:诸如灵长类动物或啮齿动物等哺乳动物的DNAX Toll样受体6(DTLR6)的核苷酸序列和氨基酸序列。SEQ ID NO:11和l2来自人等灵长类动物;SEQ ID NO:13和l4来自小鼠等啮齿动物。灵长类动物:
ATG TGG ACA CTG AAG AGA CTA ATT CTT ATC CTT TTT AAC ATA ATC CTA 48
Met Trp Thr Leu Lys Arg Leu Ile Leu Ile Leu Phe Asn Ile Ile Leu
-22 -20 -15 -10
ATT TCC AAA CTC CTT GGG GCT AGA TGG TTT CCT AAA ACT CTG CCC TGT 96
Ile Ser Lys Leu Leu Gly Ala Arg Trp Phe Pro Lys Thr Leu Pro Cys
-5 1 5 10
GAT GTC ACT CTG GAT GTT CCA AAG AAC CAT GTG ATC GTG GAC TGC ACA 144
Asp Val Thr Leu Asp Val Pro Lys Asn His Val Ile Val Asp Cys Thr
15 20 25
GAC AAG CAT TTG ACA GAA ATT CCT GGA GGT ATT CCC ACG AAC ACC ACG 192
Asp Lys His Leu Thr Glu Ile Pro Gly Gly Ile Pro Thr Asn Thr Thr
30 35 40
AAC CTC ACC CTC ACC ATT AAC CAC ATA CCA GAC ATC TCC CCA GCG TCC 240
Asn Leu Thr Leu Thr Ile Asn His Ile Pro Asp Ile Ser Pro Ala Ser
45 50 55
TTT CAC AGA CTG GAC CAT CTG GTA GAG ATC GAT TTC AGA TGC AAC TGT 288
Phe His Arg Leu Asp His Leu Val Glu Ile Asp Phe Arg Cys Asn Cys
60 65 70
GTA CCT ATT CCA CTG GGG TCA AAA AAC AAC ATG TGC ATC AAG AGG CTG 336
Val Pro Ile Pro Leu Gly Ser Lys Asn Asn Met Cys Ile Lys Arg Leu
75 80 85 90
CAG ATT AAA CCC AGA AGC TTT AGT GGA CTC ACT TAT TTA AAA TCC CTT 384
Gln Ile Lys Pro Arg Ser Phe Ser Gly Leu Thr Tyr Leu Lys Ser Leu
95 100 105
TAC CTG GAT GGA AAC CAG CTA CTA GAG ATA CCG CAG GGC CTC CCG CCT 432
Tyr Leu Asp Gly Asn Gln Leu Leu Glu Ile Pro Gln Gly Leu Pro Pro
110 115 120
AGC TTA CAG CTT CTC AGC CTT GAG GCC AAC AAC ATC TTT TCC ATC AGA 480
Ser Leu Gln Leu Leu Ser Leu Glu Ala Asn Asn Ile Phe Ser Ile Arg
125 130 135
AAA GAG AAT CTA ACA GAA CTG GCC AAC ATA GAA ATA CTC TAC CTG GGC 528
Lys Glu Asn Leu Thr Glu Leu Ala Asn Ile Glu Ile Leu Tyr Leu Gly
140 145 150
CAA AAC TGT TAT TAT CGA AAT CCT TGT TAT GTT TCA TAT TCA ATA GAG 576
Gln Asn Cys Tyr Tyr Arg Asn Pro Cys Tyr Val Ser Tyr Ser Ile Glu
155 160 165 170
AAA GAT GCC TTC CTA AAC TTG ACA AAG TTA AAA GTG CTC TCC CTG AAA 624
Lys Asp Ala Phe Leu Asn Leu Thr Lys Leu Lys Val Leu Ser Leu Lys
175 180 185
GAT AAC AAT GTC ACA GCC GTC CCT ACT GTT TTG CCA TCT ACT TTA ACA 672
Asp Asn Asn Val Thr Ala Val Pro Thr Val Leu Pro Ser Thr Leu Thr
190 195 200
GAA CTA TAT CTC TAC AAC AAC ATG ATT GCA AAA ATC CAA GAA GAT GAT 720
Glu Leu Tyr Leu Tyr Asn Asn Met Ile Ala Lys Ile Gln Glu Asp Asp
205 210 215
TTT AAT AAC CTC AAC CAA TTA CAA ATT CTT GAC CTA AGT GGA AAT TGC 768
Phe Asn Asn Leu Asn Gln Leu Gln Ile Leu Asp Leu Ser Gly Asn Cys
220 225 230
CCT CGT TGT TAT AAT GCC CCA TTT CCT TGT GCG CCG TGT AAA AAT AAT 816
Pro Arg Cys Tyr Asn Ala Pro Phe Pro Cys Ala Pro Cys Lys Asn Asn
235 240 245 250
TCT CCC CTA CAG ATC CCT GTA AAT GCT TTT GAT GCG CTG ACA GAA TTA 864
Ser Pro Leu Gln Ile Pro Val Asn Ala Phe Asp Ala Leu Thr Glu Leu
255 260 265
AAA GTT TTA CGT CTA CAC AGT AAC TCT CTT CAG CAT GTG CCC CCA AGA 912
Lys Val Leu Arg Leu His Ser Asn Ser Leu Gin His Val Pro Pro Arg
270 275 280
TGG TTT AAG AAC ATC AAC AAA CTC CAG GAA CTG GAT CTG TCC CAA AAC 960
Trp Phe Lys Asn Ile Asn Lys Leu Gln Glu Leu Asp Leu Ser Gln Asn
285 290 295
TTC TTG GCC AAA GAA ATT GGG GAT GCT AAA TTT CTG CAT TTT CTC CCC 1008
Phe Leu Ala Lys Glu Ile Gly Asp Ala Lys Phe Leu His Phe Leu Pro
300 305 310
AGC CTC ATC CAA TTG GAT CTG TCT TTC AAT TTT GAA CTT CAG GTC TAT 1056
Ser Leu Ile Gln Leu Asp Leu Ser Phe Asn Phe Glu Leu Gln Val Tyr
315 320 325 330
CGT GCA TCT ATG AAT CTA TCA CAA GCA TTT TCT TCA CTG AAA AGC CTG 1104
Arg Ala Ser Met Asn Leu Ser Gln Ala Phe Ser Ser Leu Lys Ser Leu
335 340 345
AAA ATT CTG CGG ATC AGA GGA TAT GTC TTT AAA GAG TTG AAA AGC TTT 1152
Lys Ile Leu Arg Ile Arg Gly Tyr Val Phe Lys Glu Leu Lys Ser Phe
350 355 360
AAC CTC TCG CCA TTA CAT AAT CTT CAA AAT CTT GAA GTT CTT GAT CTT 1200
Asn Leu Ser Pro Leu His Asn Leu Gln Asn Leu Glu Val Leu Asp Leu
365 370 375
GGC ACT AAC TTT ATA AAA ATT GCT AAC CTC AGC ATG TTT AAA CAA TTT 1248
Gly Thr Asn Phe Ile Lys Ile Ala Asn Leu Ser Met Phe Lys Gln Phe
380 385 390
AAA AGA CTG AAA GTC ATA GAT CTT TCA GTG AAT AAA ATA TCA CCT TCA 1296
Lys Arg Leu Lys Val Ile Asp Leu Ser Val Asn Lys Ile Ser Pro Ser
395 400 405 410
GGA GAT TCA AGT GAA GTT GGC TTC TGC TCA AAT GCC AGA ACT TCT GTA 1344
Gly Asp Ser Ser Glu Val Gly Phe Cys Ser Asn Ala Arg Thr Ser Val
415 420 425
GAA AGT TAT GAA CCC CAG GTC CTG GAA CAA TTA CAT TAT TTC AGA TAT 1392
Glu Ser Tyr Glu Pro Gln Val Leu Glu Gln Leu His Tyr Phe Arg Tyr
430 435 440
GAT AAG TAT GCA AGG AGT TGC AGA TTC AAA AAC AAA GAG GCT TCT TTC 1440
Asp Lys Tyr Ala Arg Ser Cys Arg Phe Lys Asn Lys Glu Ala Ser Phe
445 450 455
ATG TCT GTT AAT GAA AGC TGC TAC AAG TAT GGG CAG ACC TTG GAT CTA 1488
Met Ser Val Asn Glu Ser Cys Tyr Lys Tyr Gly Gln Thr Leu Asp Leu
460 465 470
AGT AAA AAT AGT ATA TTT TTT GTC AAG TCC TCT GAT TTT CAG CAT CTT 1536
Ser Lys Asn Ser Ile Phe Phe Val Lys Ser Ser Asp Phe Gln His Leu
475 480 485 490
TCT TTC CTC AAA TGC CTG AAT CTG TCA GGA AAT CTC ATT AGC CAA ACT 1584
Ser Phe Leu Lys Cys Leu Asn Leu Ser Gly Asn Leu Ile Ser Gln Thr
495 500 505
CTT AAT GGC AGT GAA TTC CAA CCT TTA GCA GAG CTG AGA TAT TTG GAC 1632
Leu Asn Gly Ser Glu Phe Gln Pro Leu Ala Glu Leu Arg Tyr Leu Asp
510 515 520
TTC TCC AAC AAC CGG CTT GAT TTA CTC CAT TCA ACA GCA TTT GAA GAG 1680
Phe Ser Asn Asn Arg Leu Asp Leu Leu His Ser Thr Ala Phe Glu Glu
525 530 535
CTT CAC AAA CTG GAA GTT CTG GAT ATA AGC AGT AAT AGC CAT TAT TTT 1728
Leu His Lys Leu Glu Val Leu Asp Ile Ser Ser Asn Ser His Tyr Phe
540 545 550
CAA TCA GAA GGA ATT ACT CAT ATG CTA AAC TTT ACC AAG AAC CTA AAG 1776
Gln Ser Glu Gly Ile Thr His Mer Leu Asn Phe Thr Lys Asn Leu Lys
555 560 565 570
GTT CTG CAG AAA CTG ATG ATG AAC GAC AAT GAC ATC TCT TCC TCC ACC 1824
Val Leu Gln Lys Leu Met Met Asn Asp Asn Asp Ile Ser Ser Ser Thr
575 580 585
AGC AGG ACC ATG GAG AGT GAG TCT CTT AGA ACT CTG GAA TTC AGA GGA 1872
Ser Arg Thr Met Glu Ser Glu Ser Leu Arg Thr Leu Glu Phe Arg Gly
590 595 600
AAT CAC TTA GAT GTT TTA TGG AGA GAA GGT GAT AAC AGA TAC TTA CAA 1920
Asn His Leu Asp Val Leu Trp Arg Glu Gly Asp Asn Arg Tyr Leu Gln
605 610 615
TTA TTC AAG AAT CTG CTA AAA TTA GAG GAA TTA GAC ATC TCT AAA AAT 1968
Leu Phe Lys Asn Leu Leu Lys Leu Glu Glu Leu Asp Ile Ser Lys Asn
620 625 630
TCC CTA AGT TTC TTG CCT TCT GGA GTT TTT GAT GGT ATG CCT CCA AAT 2016
Ser Leu Ser Phe Leu Pro Ser Gly Val Phe Asp Gly Met Pro Pro Asn
635 640 645 650
CTA AAG AAT CTC TCT TTG GCC AAA AAT GGG CTC AAA TCT TTC AGT TGG 2064
Leu Lys Asn Leu Ser Leu Ala Lys Asn Gly Leu Lys Ser Phe Ser Trp
655 660 665
AAG AAA CTC CAG TGT CTA AAG AAC CTG GAA ACT TTG GAC CTC AGC CAC 2112
Lys Lys Leu Gln Cys Leu Lys Asn Leu Glu Thr Leu Asp Leu Ser His
670 675 680
AAC CAA CTG ACC ACT GTC CCT GAG AGA TTA TCC AAC TGT TCC AGA AGC 2160
Asn Gln Leu Thr Thr Val Pro Glu Arg Leu Ser Asn Cys Ser Arg Ser
685 690 695
CTC AAG AAT CTG ATT CTT AAG AAT AAT CAA ATC AGG AGT CTG ACG AAG 2208
Leu Lys Asn Leu Ile Leu Lys Asn Asn Gln Ile Arg Ser Leu Thr Lys
700 705 710
TAT TTT CTA CAA GAT GCC TTC CAG TTG CGA TAT CTG GAT CTC AGC TCA 2256
Tyr Phe Leu Gln Asp Ala Phe Gln Leu Arg Tyr Leu Asp Leu Ser Ser
715 720 725 730
AAT AAA ATC CAG ATG ATC CAA AAG ACC AGC TTC CCA GAA AAT GTC CTC 2304
Asn Lys Ile Gln Met Ile Gln Lys Thr Ser Phe Pro Glu Asn Val Leu
735 740 745
AAC AAT CTG AAG ATG TTG CTT TTG CAT CAT AAT CGG TTT CTG TGC ACC 2352
Asn Asn Leu Lys Met Leu Leu Leu His His Asn Arg Phe Leu Cys Thr
750 755 760
TGT GAT GCT GTG TGG TTT GTC TGG TGG GTT AAC CAT ACG GAG GTG ACT 2400
Cys Asp Ala Val Trp Phe Val Trp Trp Val Asn His Thr Glu Val Thr
765 770 775
ATT CCT TAC CTG GCC ACA GAT GTG ACT TGT GTG GGG CCA GGA GCA CAC 2448
Ile Pro Tyr Leu Ala Thr Asp Val Thr Cys Val Gly Pro Gly Ala His
780 785 790
AAG GGC CAA AGT GTG ATC TCC CTG GAT CTG TAC ACC TGT GAG TTA GAT 2496
Lys Gly Gln Ser Val Ile Ser Leu Asp Leu Tyr Thr Cys Glu Leu Asp
795 800 805 810
CTG ACT AAC CTG ATT CTG TTC TCA CTT TCC ATA TCT GTA TCT CTC TTT 2544
Leu Thr Asn Leu Ile Leu Phe Ser Leu Ser Ile Ser Val Ser Leu Phe
815 820 825
CTC ATG GTG ATG ATG ACA GCA AGT CAC CTC TAT TTC TGG GAT GTG TGG 2592
Leu Met Val Met Met Thr Ala Ser His Leu Tyr Phe Trp Asp Val Trp
830 835 840
TAT ATT TAC CAT TTC TGT AAG GCC AAG ATA AAG GGG TAT CAG CGT CTA 2640
Tyr Ile Tyr His Phe Cys Lys Ala Lys Ile Lys Gly Tyr Gln Arg Leu
845 850 855
ATA TCA CCA GAC TGT TGC TAT GAT GCT TTT ATT GTG TAT GAC ACT AAA 2688
Ile Ser Pro Asp Cys Cys Tyr Asp Ala Phe Ile Val Tyr Asp Thr Lys
860 865 870
GAC CCA GCT GTG ACC GAG TGG GTT TTG GCT GAG CTG GTG GCC AAA CTG 2736
Asp Pro Ala Val Thr Glu Trp Val Leu Ala Glu Leu Val Ala Lys Leu
875 880 885 890
GAA GAC CCA AGA GAG AAA CAT TTT AAT TTA TGT CTC GAG GAA AGG GAC 2784
Glu Asp Pro Arg Glu Lys His Phe ASn Leu Cys Leu Glu Glu Arg Asp
895 900 905
TGG TTA CCA GGG CAG CCA GTT CTG GAA AAC CTT TCC CAG AGC ATA CAG 2832
Trp Leu Pro Gly Gln Pro Val Leu Glu Asn Leu Ger Gln Ser Ile Gln
910 915 920
CTT AGC AAA AAG ACA GTG TTT GTG ATG ACA GAC AAG TAT GCA AAG ACT 2880
Leu Ser Lys Lys Thr Val Phe Val Met Thr Asp Lys Tyr Ala Lys Thr
925 930 935
GAA AAT TTT AAG ATA GCA TTT TAC TTG TCC CAT CAG AGG CTC ATG GAT 2928
Glu Asn Phe Lys Ile Ala Phe Tyr Leu Ser His Gln Arg Leu Met Asp
940 945 950
GAA AAA GTT GAT GTG ATT ATC TTG ATA TTT CTT GAG AAG CCC TTT CAG 2976
Glu Lys Val Asp Val Ile Ile Leu Ile Phe Leu Glu Lys Pro Phe Gln
955 960 965 970
AAG TCC AAG TTC CTC CAG CTC CGG AAA AGG CTC TGT GGG AGT TCT GTC 3024
Lys Ser Lys Phe Leu Gln Leu Arg Lys Arg Leu Cys Gly Ser Ser Val
975 980 985
CTT GAG TGG CCA ACA AAC CCG CAA GCT CAC CCA TAC TTC TGG CAG TGT 3072
Leu Glu Trp Pro Thr Asn Pro Gln Ala His Pro Tyr Phe Trp Gln Cys
990 995 1000
CTA AAG AAC GCC CTG GCC ACA GAC AAT CAT GTG GCC TAT AGT CAG GTG 3120
Leu Lys Asn Ala Leu Ala Thr Asp Asn His Val Ala Tyr Ser Gln Val
1005 1010 1015
TTC AAG GAA ACG GTC TAG 3138
Phe Lys Glu Thr Val
1020
MWTLKRLILILFNIILISKLLGARWFPKTLPCDVTLDVPKNHVIVDCTDKHLTEIPGGIPTNTTNLTLTINHIP
DISPASFHRLDHLVEIDFRCNCVPIPLGSKNNMCIKRLQIKPRSFSGLTYLKSLYLDGNQLLEIPQGLPPSLQL
LSLEANNIFSIRKENLTELANIEILYLGQNCYYRNPCYVSYSIEKDAFLNLTKLKVLSLKDNNVTAVPTVLPST
LTELYLYNNMIAKIQEDDFNNLNQLQILDLSGNCPRCYNAPFPCAPCKNNSPLQIPVNAFDALTELKVLRLHSN
SLQHVPPRWFKNINKLQELDLSQNFLAKEIGDAKFLHFLPSLIQLDLSFNFELQVYRASMNLSQAFSSLKSLKI
LRIRGYVFKELKSFNLSPLHNLQNLEVLDLGTNFIKIANLSMFKQFKRLKVIDLSVNKISPSGDSSEVGFCSNA
RTSVESYEPQVLEQLHYFRYDKYARSCRFKNKEASFMSVNESCYKYGQTLDLSKNSIFFVKSSDFQHLSFLKCL
NLSGNLISQTLNGSEFQPLAELRYLDFSNNRLDLLHSTAFEELHKLEVLDISSNSHYFQSEGITHMLNFTKNLK
VLQKLMMNDNDISSSTSRTMESESLRTLEFRGNHLDVLWREGDNRYLQLFKNLLKLEELDISKNSLSFLPSGVF
DGMPPNLKNLSLAKNGLKSFSWKKLQCLKNLETLDLSHNQLTTVPERLSNCSRSLKNLILKNNQIRSLTKYFLQ
DAFQLRYLDLSSNKIQMIQKTSFPENVLNNLKMLLLHHNRFLCTCDAVWFVWWVNHTEVTIPYLATDVTCVGPG
AHKGQSVISLDLYTCELDLTNLILFSLSISVSLFLMVMMTASHLYFWDVWYIYHFCKAKIKGYQRLISPDCCYD
AFIVYDTKDPAVTEWVLAELVAKLEDPREKHFNLCLEERDWLPGQPVLENLSQSIQLSKKTVFVMTDKYAKTEN
FKIAFYLSHQRLMDEKVDVIILIFLEKPFQKSKFLQLRKRLCGSSVLEWPTNPQAHPYFWQCLKNALATDNHVA
YSQVFKETV
啮齿动物(SEQ ID NO:13和14)
CTT GGA AAA CCT CTT CAG AAG TCT AAG TTT CTT CAG CTC AGG AAG AGA 48
Leu Gly Lys Pro Leu Gln Lys Ser Lys Phe Leu Gln Leu Arg Lys Arg
1 5 10 15
CTC TGC AGG AGC TCT GTC CTT GAG TGG CCT GCA AAT CCA CAG GCT CAC 96
Leu Cys Arg Ser Ser Val Leu Glu Trp Pro Ala Asn Pro Gln Ala His
20 25 30
CCA TAC TTC TGG CAG TGC CTG AAA AAT GCC CTG ACC ACA GAC AAT CAT 144
Pro Tyr Phe Trp Gln Cys Leu Lys Asn Ala Leu Thr Thr Asp Asn His
35 40 45
GTG GCT TAT AGT CAA ATG TTC AAG GAA ACA GTC TAG 180
Val Ala Tyr Ser Gln Met Phe Lys Glu Thr Val
50 55
LGGKPLQKSKFLQLRKRLCRSSVLEWPANPQAHPYFWQCLKALTTDNHVAYSQMFKETV
诸如小鼠等啮齿动物的附加序列:
上游(SEQ ID NO:27和28);用C表示的第186、196、217、276和300位核苷酸的每个都可以是A、C、G或T:
TCC TAT TCT ATG GAA AAA GAT GCT TTC CTA TTT ATG AGA AAT TTG AAG 48
Ser Tyr Ser Met Glu Lys Asp Ala Phe Leu Phe Met Arg Asn Leu Lys
1 5 10 15
GTT CTC TCA CTA AAA GAT AAC AAT GTC ACA GCT GTC CCC ACC ACT TTG 96
Val Leu Ser Leu Lys Asp Asn Asn Val Thr Ala Val Pro Thr Thr Leu
20 25 30
CCA CCT AAT TTA CTA GAG CTC TAT CTT TAT AAC AAT ATC ATT AAG AAA 144
Pro Pro Asn Leu Leu Glu Leu Tyr Leu Tyr Asn Asn Ile Ile Lys Lys
35 40 45
ATC CAA GAA AAT GAT TTC AAT AAC CTC AAT GAG TTG CAA GTC CTT GAC 192
Ile Gln Glu Asn Asp Phe Asn Asn Leu Asn Glu Leu Gln Val Leu Asp
50 55 60
CTA CGT GGA AAT TGC CCT CGA TGT CAT AAT GTC CCA TAT CCG TGT ACA 240
Leu Arg Gly Asn Cys Pro Arg Cys His Asn Val Pro Tyr Pro Cys Thr
65 70 75 80
CCG TGT GAA AAT AAT TCC CCC TTA CAG ATC CAT GAC AAT GCT TTC AAT 288
Pro Cys Glu Asn Asn Ser Pro Leu Gln Ile His Asp Asn Ala Phe Asn
85 90 95
TCA TCG ACA GAC 300
Ser Ser Thr Asp
100
SYSMEKDAFLFMRNLKVLSLKDNNVTAVPTTLPPNLLELYLYNNIIKKIQENDFNNLNELQXLDLXGNCPRCXNV
PYPCTPCENNSPLQIHXNAFNSSTX
下游(SEQ ID NO:29和30);用A表示的第1643位核苷酸可以是A或G;用C表示的第1664位核苷酸可以是A、C、G或T;用G表示的第1680和1735位核苷酸可以是G或T;用C表示的第1719位核苷酸可以是C或T;并且用A表示的第1727位核苷酸可以是A、G或T:
TCT CCA GAA ATT CCC TGG AAT TCC TTG CCT CCT GAG GTT TTT GAG GGT 48
Ser Pro Glu Ile Pro Trp Asn Ser Leu Pro Pro Glu Val Phe Glu Gly
1 5 10 15
ATG CCG CCA AAT CTA AAG AAT CTC TCC TTG GCC AAA AAT GGG CTC AAA 96
Met Pro Pro Asn Leu Lys Asn Leu Ser Leu Ala Lys Asn Gly Leu Lys
20 25 30
TCT TTC TTT TGG GAC AGA CTC CAG TTA CTG AAG CAT TTG GAA ATT TTG 144
Ser Phe Phe Trp Asp Arg Leu Gln Leu Leu Lys His Leu Glu Ile Leu
35 40 45
GAC CTC AGC CAT AAC CAG CTG ACA AAA GTA CCT GAG AGA TTG GCC AAC 192
Asp Leu Ser His Asn Gln Leu Thr Lys Val Pro Glu Arg Leu Ala Asn
50 55 60
TGT TCC AAA AGT CTC ACA ACA CTG ATT CTT AAG CAT AAT CAA ATC AGG 240
Cys Ser Lys Ser Leu Thr Thr Leu Ile Leu Lys His Asn Gln Ile Arg
65 70 75 80
CAA TTG ACA AAA TAT TTT CTA GAA GAT GCT TTG CAA TTG CGC TAT CTA 288
Gln Leu Thr Lys Tyr Phe Leu Glu Asp Ala Leu Gln Leu Arg Tyr Leu
85 90 95
GAC ATC AGT TCA AAT AAA ATC CAG GTC ATT CAG AAG ACT AGC TTC CCA 336
Asp Ile Ser Ser Asn Lys Ile Gln Val Ile Gln Lys Thr Ser Phe Pro
100 105 110
GAA AAT GTC CTC AAC AAT CTG GAG ATG TTG GTT TTA CAT CAC AAT CGC 384
Glu Asn Val Leu Asn Asn Leu Glu Met Leu Val Leu His His Asn Arg
115 120 125
TTT CTT TGC AAC TGT GAT GCT GTG TGG TTT GTC TGG TGG GTT AAC CAT 432
Phe Leu Cys Asn Cys Asp Ala Val Trp Phe Val Trp Trp Val Asn His
130 135 140
ACA GAT GTT ACT ATT CCA TAC CTG GCC ACT GAT GTG ACT TGT GTA GGT 480
Thr Asp Val Thr Ile Pro Tyr Leu Ala Thr Asp Val Thr Cys Val Gly
145 150 155 160
CCA GGA GCA CAC AAA GGT CAA AGT GTC ATA TCC CTT GAT CTG TAT ACG 528
Pro Gly Ala His Lys Gly Gln Ser Val Ile Ser Leu Asp Leu Tyr Thr
165 170 175
TGT GAG TTA GAT CTC ACA AAC CTG ATT CTG TTC TCA GTT TCC ATA TCA 576
Cys Glu Leu Asp Leu Thr Asn Leu Ile Leu Phe Ser Val Ser Ile Ser
180 185 190
TCA GTC CTC TTT CTT ATG GTA GTT ATG ACA ACA AGT CAC CTC TTT TTC 624
Ser Val Leu Phe Leu Met Val Val Met Thr Thr Ser His Leu Phe Phe
195 200 205
TGG GAT ATG TGG TAC ATT TAT TAT TTT TGG AAA GCA AAG ATA AAG GGG 672
Trp Asp Met Trp Tyr Ile Tyr Tyr Phe Trp Lys Ala Lys Ile Lys Gly
210 215 220
TAT CCA GCA TCT GCA ATC CCA TGG AGT CCT TGT TAT GAT GCT TTT ATT 720
Tyr Pro Ala Ser Ala Ile Pro Trp Ser Pro Cys Tyr Asp Ala Phe Ile
225 230 235 240
GTG TAT GAC ACT AAA AAC TCA GCT GTG ACA GAA TGG GTT TTG CAG GAG 768
Val Tyr Asp Thr Lys Asn Ser Ala Val Thr Glu Trp Val Leu Gln Glu
245 250 255
CTG GTG GCA AAA TTG GAA GAT CCA AGA GAA AAA CAC TTC AAT TTG TGT 816
Leu Val Ala Lys Leu Glu Asp Pro Arg Glu Lys His Phe Asn Leu Cys
260 265 270
CTA GAA GAA AGA GAC TGG CTA CCA GGA CAG CCA GTT CTA GAA AAC CTT 864
Leu Glu Glu Arg Asp Trp Leu Pro Gly Gln Pro Val Leu Glu Asn Leu
275 280 285
TCC CAG AGC ATA CAG CTC AGC AAA AAG ACA GTG TTT GTG ATG ACA CAG 912
Ser Gln Ser Ile Gln Leu Ser Lys Lys Thr Val Phe val Met Thr Gln
290 295 300
AAA TAT GCT AAG ACT GAG AGT TTT AAG ATG GCA TTT TAT TTG TCT CAT 960
Lys Tyr Ala Lys Thr Glu Ser Phe Lys Met Ala Phe Tyr Leu Ser His
305 310 315 320
CAG AGG CTC CTG GAT GAA AAA GTG GAT GTG ATT ATC TTG ATA TTC TTG 1008
Gln Arg Leu Leu Asp Glu Lys Val Asp Val Ile Ile Leu Ile Phe Leu
325 330 335
GAA AGA CCT CTT CAG AAG TCT AAG TTT CTT CAG CTC AGG AAG AGA CTC 1056
Glu Arg Pro Leu Gln Lys Ser Lys Phe Leu Gln Leu Arg Lys Arg Leu
340 345 350
TGC AGG AGC TCT GTC CTT GAG TGG CCT GCA AAT CCA CAG GCT CAC CCA 1104
Cys Arg Ser Ser Val Leu Glu Trp Pro Ala Asn Pro Gln Ala His Pro
355 360 365
TAC TTC TGG CAG TGC CTG AAA AAT GCC CTG ACC ACA GAC AAT CAT GTG 1152
Tyr Phe Trp Gln Cys Leu Lys Asn Ala Leu Thr Thr Asp Asn His Val
370 375 380
GCT TAT AGT CAA ATG TTC AAG GAA ACA GTC TAGCTCTCTG AAGAATGTCA 1202
Ala Tyr Ser Gln Mer Phe Lys Glu Thr Val
385 390
CCACCTAGGA CATGCCTTGG TACCTGAAGT TTTCATAAAG GTTTCCATAA ATGAAGGTCT 1262
GAATTTTTCC TAACAGTTGT CATGGCTCAG ATTGGTGGGA AATCATCAAT ATATGGCTAA 1322
GAAATTAAGA AGGGGAGACT GATAGAAGAT AATTTCTTTC TTCATGTGCC ATGCTCAGTT 1382
AAATATTTCC CCTAGCTCAA ATCTGAAAAA CTGTGCCTAG GAGACAACAC AAGGCTTTGA 1442
TTTATCTGCA TACAATTGAT AAGAGCCACA CATCTGCCCT GAAGAAGTAC TAGTAGTTTT 1502
AGTAGTAGGG TAAAAATTAC ACAAGCTTTC TCTCTCTCTG ATACTGAACT GTACCAGAGT 1562
TCAATGAAAT AAAAGCCCAG AGAACTTCTC AGTAAATGGT TTCATTATCA TGTAGTATCC 1622
ACCATGCAAT ATGCCACAAA ACCGCTACTG GTACAGGACA GCTGGTAGCT GCTTCAAGGC 1682
CTCTTATCAT TTTCTTGGGG CCCATGGAGG GGTTCTCTGG GAAAAAGGGA AGGTTTTTTT 1742
TGGCCATCCA TGAA 1756
SPEIPWNSLPPEVFEGMPPNLKNLSLAKNGLKSFFWDRLQLLKHLEILDLSHNQLTKVPERLANCSKSLTTLILK
HNQIRQLTKYFLEDALQLRYLDISSNKIQVIQKTSFPENVLNNLEMLVLHHNRFLCNCDAVWFVWWVNHTDVTIP
YLATDVTCVGPGAHKGQSVISLDLYTCELDLTNLILFSVSISSVLFLMVVMTTSHLFFWDMWYIYYFWKAKIKGY
PASAIPWSPCYDAFIVYDTKNSAVTEWVLQELVAKLEDPREKHFNLCLEERDWLPGQPVLENLSQSIQLSKKTVF
VMTQKYAKTESFKMAFYLSHQRLLDEKVDVIILIFLERPLQKSKFLQLRKRLCRSSVLEWPANPQAHPYFWQCLK
NALTTDNHVAYSQMFKETV
表7:诸如灵长类动物或人等哺乳动物的DNAX Toll样受体7(DTLR7)的核苷酸序列和氨基酸序列。
上游(SEQ ID NO:15和16):
G AAT TCC AGA CTT ATA AAC TTG AAA AAT CTC TAT TTG GCC TGG AAC 46
Asn Ser Arg Leu Ile Asn Leu Lys Asn Leu Tyr Leu Ala Trp Asn
1 5 10 15
TGC TAT TTT AAC AAA GTT TGC GAG AAA ACT AAC ATA GAA GAT GGA GTA 94
Cys Tyr Phe Asn Lys Val Cys Glu Lys Thr Asn Ile Glu Asp Gly Val
20 25 30
TTT GAA ACG CTG ACA AAT TTG GAG TTG CTA TCA CTA TCT TTC AAT TCT 142
Phe Glu Thr Leu Thr Asn Leu Glu Leu Leu Ser Leu Ser Phe Asn Ser
35 40 45
CTT TCA CAT GTG CCA CCC AAA CTG CCA AGC TCC CTA CGC AAA CTT TTT 190
Leu Ser His Val Pro Pro Lys Leu Pro Ser Ser Leu Arg Lys Leu Phe
50 55 60
CTG AGC AAC ACC CAG ATC AAA TAC ATT AGT GAA GAA GAT TTC AAG GGA 238
Leu Ser Asn Thr Gln Ile Lys Tyr Ile Ser Glu Glu Asp Phe Lys Gly
65 70 75
TTG ATA AAT TTA ACA TTA CTA GAT TTA AGC GGG AAC TGT CCG AGG TGC 286
Leu Ile Asn Leu Thr Leu Leu Asp Leu Ser Gly Asn Cys Pro Arg Cys
80 85 90 95
TTC AAT GCC CCA TTT CCA TGC GTG CCT TGT GAT GGT GGT GCT TCA ATT 334
Phe Asn Ala Pro Phe Pro Cys Val Pro Cys Asp Gly Gly Ala Ser Ile
100 105 110
AAT ATA GAT CGT TTT GCT TTT CAA AAC TTG ACC CAA CTT CGA TAC CTA 382
Asn Ile Asp Arg Phe Ala Phe Gln Asn Leu Thr Gln Leu Arg Tyr Leu
115 120 125
AAC CTC TCT AGC ACT TCC CTC AGG AAG ATT AAT GCT GCC TGG TTT AAA 430
Asn Leu Ser Ser Thr Ser Leu Arg Lys Ile Asn Ala Ala Trp Phe Lys
130 135 140
AAT ATG CCT CAT CTG AAG GTG CTG GAT CTT GAA TTC AAC TAT TTA GTG 478
Asn Met Pro His Leu Lys Val Leu Asp Leu Glu Phe Asn Tyr Leu Val
145 150 155
GGA GAA ATA GCC TCT GGG GCA TTT TTA ACG ATG CTG CCC CGC TTA GAA 526
Gly Glu Ile Ala Ser Gly Ala Phe Leu Thr Met Leu Pro Arg Leu Glu
160 165 170 175
ATA CTT GAC TTG TCT TTT AAC TAT ATA AAG GGG AGT TAT CCA CAG CAT 574
Ile Leu Asp Leu Ser Phe Asn Tyr Ile Lys Gly Ser Tyr Pro Gln His
180 185 190
ATT AAT ATT TCC AGA AAC TTC TCT AAA CTT TTG TCT CTA CGG GCA TTG 622
I1e Asn Ile Ser Arg Asn Phe Ser Lys Leu Leu Ser Leu Arg Ala Leu
195 200 205
CAT TTA AGA GGT TAT GTG TTC CAG GAA CTC AGA GAA GAT GAT TTC CAG 670
His Leu Arg Gly Tyr Val Phe Gln Glu Leu Arg Glu Asp Asp Phe Gln
210 215 220
CCC CTG ATG CAG CTT CCA AAC TTA TCG ACT ATC AAC TTG GGT ATT AAT 718
Pro Leu Met Gln Leu Pro Asn Leu Ser Thr Ile Asn Leu Gly Ile Asn
225 230 235
TTT ATT AAG CAA ATC GAT TTC AAA CTT TTC CAA AAT TTC TCC AAT CEG 766
Phe Ile Lys Gln Ile Asp Phe Lys Leu Phe Gln Asn Phe Ser Asn Leu
240 245 250 255
GAA ATT ATT TAC TTG TCA GAA AAC AGA ATA TCA CCG TTG GTA AAA GAT 814
Glu Ile Ile Tyr Leu Ser Glu Asn Arg Ile Ser Pro Leu Val Lys Asp
260 265 270
ACC CGG CAG AGT TAT GCA AAT AGT TCC TCT TTT CAA CGT CAT ATC CGG 862
Thr Arg Gln Ser Tyr Ala Asn Ser Ser Ser Phe Gln Arg His Ile Arg
275 280 285
AAA CGA CGC TCA ACA GAT TTT GAG TTT GAC CCA CAT TCG AAC TTT TAT 910
Lys Arg Arg Ser Thr Asp Phe Glu Phe Asp Pro His Ser Asn Phe Tyr
290 295 300
CAT TTC ACC CGT CCT TTA ATA AAG CCA CAA TGT GCT GCT TAT GGA AAA 958
His Phe Thr Arg Pro Leu Ile Lys Pro Gln Cys Ala Ala Tyr Gly Lys
305 310 315
GCC TTA GAT TTA AGC CTC AAC AGT ATT TTC TT 990
Ala Leu Asp Leu Ser Leu Asn Ser Ile Phe
320 325
NSRLINLKNLYLAWNCYFNKVCEKTNIEDGVFETLTNLELLSLSFNSLSHVPPKLPSSLRKLFLSNTQIKYISE
EDFKGLINLTLLDLSGNCPRCFNAPFPCVPCDGGASINIDRFAFQNLTQLRYLNLSSTSLRKINAAWFKNMPHL
KVLDLEFNYLVGEIASGAFLTMLPRLEILDLSFNYIKGSYPQHINISRNFSKLLSLRALHLRGYVFQELREDDF
QPLMQLPNLSTINLGINFIKQIDFKLFQNFSNLEIIYLSENRISPLVKDTRQSYANSSSFQRHIRKRRSTDFEF
DPHSNFYHFTRPLIKPQCAAYGKALDLSLNSIF
下游(SEQ ID NO:17和18):
CAG TCT CTT TCC ACA TCC CAA ACT TTC TAT GAT GCT TAC ATT TCT TAT 48
Gln Ser Leu Ser Thr Ser Gln Thr Phe Tyr Asp Ala Tyr Ile Ser Tyr
1 5 10 15
GAC ACC AAA GAT GCC TCT GTT ACT GAC TGG GTG ATA AAT GAG CTG CGC 96
Asp Thr Lys Asp Ala Ser Val Thr Asp Trp Val Ile Asn Glu Leu Arg
20 25 30
TAC CAC CTT GAA GAG AGC CGA GAC AAA AAC GTT CTC CTT TGT CTA GAG 144
Tyr His Leu Glu Glu Ser Arg Asp Lys Asn Val Leu Leu Cys Leu Glu
35 40 45
GAG AGG GAT TGG GAC CCG GGA TTG GCC ATC ATC GAC AAC CTC ATG CAG 192
Glu Arg Asp Trp Asp Pro Gly Leu Ala Ile Ile Asp Asn Leu Met Gln
50 55 60
AGC ATC AAC CAA AGC AAG AAA ACA GTA TTT GTT TTA ACC AAA AAA TAT 240
Ser Ile Asn Gln Ser Lys Lys Thr Val Phe Val Leu Thr Lys Lys Tyr
65 70 75 80
GCA AAA AGC TGG AAC TTT AAA ACA GCT TTT TAC TTG GGC TTG CAG AGG 288
Ala Lys Ser Trp Asn Phe Lys Thr Ala Phe Tyr Leu Gly Leu Gln Arg
85 90 95
CTA ATG GGT GAG AAC ATG GAT GTG ATT ATA TTT ATC CTG CTG GAG CCA 336
Leu Met Gly Glu Asn Met Asp Val Ile Ile Phe Ile Leu Leu Glu Pro
100 105 110
GTG TTA CAG CAT TCT CCG TAT TTG AGG CTA CGG CAG CGG ATC TGT AAG 384
Val Leu Gln His Ser Pro Tyr Leu Arg Leu Arg Gln Arg Ile Cys Lys
115 120 125
AGC TCC ATC CTC CAG TGG CCT GAC AAC CCG AAG GCA GAA AGG TTG TTT 432
Ser Ser Ile Leu Gln Trp Pro Asp Asn Pro Lys Ala Glu Arg Leu Phe
130 135 140
TGG CAA ACT CTG AGA AAT GTG GTC TTG ACT GAA AAT GAT TCA CGG TAT 480
Trp Gln Thr Leu Arg Asn Val Val Leu Thr Glu Asn Asp Ser Arg Tyr
145 150 155 160
AAC AAT ATG TAT GTC GAT TCC ATT AAG CAA TAC TAACTGACGT TAAGTCATGA 533
Asn Asn Met Tyr Val Asp Ser Ile Lys Gln Tyr
165 170
TTTCGCGCCA TAATAAAGAT GCAAAGGAAT GACATTTCCG TATTAGTTAT CTATTGCTAC 593
GGTAACCAAA TTACTCCCAA AAACCTTACG TCGGTTTCAA AACAACCACA TTCTGCTGGC 653
CCCACAGTTT TTGAGGGTCA GGAGTCCAGG CCCAGCATAA CTGGGTCTTC TGCTTCAGGG 713
TGTCTCCAGA GGCTGCAATG TAGGTGTTCA CCAGAGACAT AGGCATCACT GGGGTCACAC 773
TCCATGTGGT TGTTTTCTGG ATTCAATTCC TCCTGGGCTA TTGGCCAAAG GCTATACTCA 833
TGTAAGCCAT GCGAGCCTAT CCCACAACGG CAGCTTGCTT CATCAGAGCT AGCAAAAAAG 893
AGAGGTTGCT AGCAAGATGA AGTCACAATC TTTTGTAATC GAATCAAAAA AGTGATATCT 953
CATCACTTTG GCCATATTCT ATTTGTTAGA AGTAAACCAC AGGTCCCACC AGCTCCATGG 1013
GAGTGACCAC CTCAGTCCAG GGAAAACAGC TGAAGACCAA GATGGTGAGC TCTGATTGCT 1073
TCAGTTGGTC ATCAACTATT TTCCCTTGAC TGCTGTCCTG GGATGGCCGG CTATCTTGAT 1133
GGATAGATTG TGAATATCAG GAGGCCAGGG ATCACTGTGG ACCATCTTAG CAGTTGACCT 1193
AACACATCTT CTTTTCAATA TCTAAGAACT TTTGCCACTG TGACTAATGG TCCTAATATT 1253
AAGCTGTTGT TTATATTTAT CATATATCTA TGGCTACATG GTTATATTAT GCTGTGGTTG 1313
CGTTCGGTTT TATTTACAGT TGCTTTTACA AATATTTGCT GTAACATTTG ACTTCTAAGG 1373
TTTAGATGCC ATTTAAGAAC TGAGATGGAT AGCTTTTAAA GCATCTTTTA CTTCTTACCA 1433
TTTTTTAAAA GTATGCAGCT AAATTCGAAG CTTTTGGTCT ATATTGTTAA TTGCCATTGC 1493
TGTAAATCTT AAAATGAATG AATAAAAATG TTTCATTTTA AAAAAAAAAA AAAAAAAAAA 1553
AAAA 1557
QSLSTSQTFYDAYISYDTKDASVTDWVINELRYHLEESRDKNVLLCLEERDWDPGLAIIDNLMQSINQSKKTVFV
LTKKYAKSWNFKTAFYLGLQRLMGENMDVIIFILLEPVLQHSPYLRLRQRICKSSILQWPDNPKAERLFWQTLRN
VVLTENDSRYNNMYVDSIKQY
诸如人等灵长类动物的DTLR7的其他序列(SEQ ID NO:37和37)。
atg ctg acc tgc att ttc ctg cta ata tct ggt tcc tgt gag tta tgc 48
Met Leu Thr Cys Ile Phe Leu Leu Ile Ser Gly Ser Cys Glu Leu Cys
-15 -10 -5
gcc gaa gaa aat ttt tct aga agc tat cct tgt gat gag aaa aag caa 96
Ala Glu Glu Asn Phe Ser Arg Ser Tyr Pro Cys Asp Glu Lys Lys Gln
-1 1 5 10 15
aat gac rca gtt att gca gag tgc agc aat cgt cga cta cag gaa gtt 144
Asn Asp Ser Val Ile Ala Glu Cys Ser Asn Arg Arg Leu Gln Glu Val
20 25 30
ccc caa acg gtg ggc aaa tat gtg aca gaa cta gac ctg tct gat aat 192
Pro Gln Thr Val Gly Lys Tyr Val Thr Glu Leu Asp Leu Set Asp Asn
35 40 45
ttc atc aca cac ata acg aat gaa tca ttt caa ggg ctg caa aat ctc 240
Phe Ile Thr His Ile Thr Asn Glu Ser Phe Gln Gly Leu Gln Asn Leu
50 55 60
act aaa ata aat cta aac cac aac ccc aat gta cag cac cag aac gga 288
Thr Lys Ile Asn Leu Asn His Asn Pro Asn Val Gln His Gln Asn Gly
65 70 75
aat ccc ggt ata caa tca aat ggc ttg aat atc aca gac ggg gca ttc 336
Asn Pro Gly Ile Gln Set Asn Gly Leu Asn Ile Thr Asp Gly Ala Phe
80 85 90 95
ctc aac cta aaa aac cta agg gag tta ctg ctt gaa gac aac cag tta 384
Leu Asn Leu Lys Asn Leu Arg Glu Leu Leu Leu Glu Asp Asn Gln Leu
100 105 110
ccc caa ata ccc tct ggt ttg cca gag tct ttg aca gaa ctt agt cta 432
Pro Gln Ile Pro Ser Gly Leu Pro Glu Ser Leu Thr Glu Leu Ser Leu
115 120 125
att caa aac aat ata tac aac ata act aaa gag ggc att tca aga ctt 480
Ile Gln Asn Asn Ile Tyr Asn Ile Thr Lys Glu Gly Ile Ser Arg Leu
130 135 140
ata aac ttg aaa aat ctc tat ttg gcc tgg aac tgc tat ttt aac aaa 528
Ile Asn Leu Lys Asn Leu Tyr Leu Ala Trp Asn Cys Tyr Phe Asn Lys
145 150 155
gtt tgc gag aaa act aac ata gaa gat gga gta ttt gaa acg ctg aca 576
Val Cys Glu Lys Thr Asn Ile Glu Asp Gly Val Phe Glu Thr Leu Thr
160 165 170 175
aat ttg gag ttg cta tca cta tct ttc aat tct ctt tca cat gtg cca 624
Asn Leu Glu Leu Leu Ser Leu Ser Phe Asn Ser Leu Ser His Val Pro
180 185 190
ccc aaa ctg cca agc tcc cta cgc aaa ctt ttt ctg agc aac acc cag 672
Pro Lys Leu Pro Ser Set Leu Arg Lys Leu Phe Leu Set Asn Thr Gln
195 200 205
atc aaa tac att agt gaa gaa gat ttc aag gga ttg ata aat tta aca 720
Ile Lys Tyr Ile Ser Glu Glu Asp Phe Lys Gly Leu Ile Asn Leu Thr
210 215 220
tta cta gat tta agc ggg aac tgt ccg agg tgc ttc aat gcc cca ttt 768
Leu Leu Asp Leu Ser Gly Asn Cys Pro Arg Cys Phe Asn Ala Pro Phe
225 230 235
cca tgc gtg cct tgt gat ggt ggt gct tca att aat ata gat cgt ttt 816
Pro Cys Val Pro Cys Asp Gly Gly Ala Ser Ile Asn Ile Asp Arg Phe
240 245 250 255
gct ttt caa aac ttg acc caa ctt cga tac cta aac ctc tct agc act 864
Ala Phe Gln Asn Leu Thr Gln Leu Arg Tyr Leu Asn Leu Ser Ser Thr
260 265 270
tcc ctc agg aag att aat gct gcc tgg ttt aaa aat atg cct cat ctg 912
Ser Leu Arg Lys Ile Asn Ala Ala Trp Phe Lys Asn Met Pro His Leu
275 280 285
aag gtg ctg gat ctt gaa ttc aac tat tta gtg gga gaa ata gcc tct 960
Lys Val Leu Asp Leu Glu Phe Asn Tyr Leu Val Gly Glu Ile Ala Ser
290 295 300
ggg gca ttt tta acg atg ctg ccc cgc tta gaa ata ctt gac ttg tct 1008
Gly Ala Phe Leu Thr Met Leu Pro Arg Leu Glu Ile Leu Asp Leu Ser
305 310 315
ttt aac tat ata aag ggg agt tat cca cag cat att aat att tcc aga 1056
Phe Asn Tyr Ile Lys Gly Ser Tyr Pro Gln His Ile Asn Ile Ser Arg
320 325 330 335
aac ttc tct aaa ctt ttg tct cta cgg gca ttg cat tta aga ggt tat 1104
Asn Phe Ser Lys Leu Leu Ser Leu Arg Ala Leu His Leu Arg Gly Tyr
340 345 350
gtg ttc cag gaa ctc aga gaa gat gat ttc cag ccc ctg atg cag ctt 1152
Val Phe Gln Glu Leu Arg Glu Asp Asp Phe Gln Pro Leu Met Gln Leu
355 360 365
cca aac tta tcg act atc aac ttg ggt att aat ttt att aag caa atc 1200
Pro Asn Leu Ser Thr Ile Asn Leu Gly Ile Asn Phe Ile Lys Gln Ile
370 375 380
gat ttc aaa ctt ttc caa aat ttc tcc aat ctg gaa att att tac ttg 1248
Asp Phe Lys Leu Phe Gln Asn Phe Ser Asn Leu Glu Ile Ile Tyr Leu
385 390 395
tca gaa aac aga ata tca ccg ttg gta aaa gat acc cgg cag agt tat 1296
Ser Glu Asn Arg Ile Ser Pro Leu Val Lys Asp Thr Arg Gln Ser Tyr
400 405 410 415
gca aat agt tcc tct ttt caa cgt cat atc cgg aaa cga cgc tca aca 1344
Ala Asn Ser Ser Ser Phe Gln Arg His Ile Arg Lys Arg Arg Ser Thr
420 425 430
gat ttt gag ttt gac cca cat tcg aac ttt tat cat ttc acc cgt cct 1392
Asp Phe Glu Phe Asp Pro His Ser Asn Phe Tyr His Phe Thr Arg Pro
435 440 445
tta ata aag cca caa tgt gct gct tat gga aaa gcc tta gat tta agc 1440
Leu Ile Lys Pro Gln Cys Ala Ala Tyr Gly Lys Ala Leu Asp Leu Ser
450 455 460
ctc aac agt att ttc ttc att ggg cca aac caa ttt gaa aat ctt cct 1488
Leu Asn Ser Ile Phe Phe Ile Gly Pro Asn Gln Phe Glu Asn Leu Pro
465 470 475
gac att gcc tgt tta aat ctg tct gca aat agc aat gct caa gtg tta 1536
Asp Ile Ala Cys Leu Asn Leu Ser Ala Asn Ser Asn Ala Gln Val Leu
480 485 490 495
agt gga act gaa ttt tca gcc att cct cat gtc aaa tat ttg gat ttg 1584
Ser Gly Thr Glu Phe Ser Ala Ile Pro His Val Lys Tyr Leu Asp Leu
500 505 510
aca aac aat aga cta gac ttt gat aat gct agt gct ctt act gaa ttg 1632
Thr Asn Asn Arg Leu Asp Phe Asp Asn Ala Ser Ala Leu Thr Glu Leu
515 520 525
tcc gac ttg gaa gtt cta gat ctc agc tat aat tca cac tat ttc aga 1680
Ser Asp Leu Glu Val Leu Asp Leu Ser Tyr Asn Ser His Tyr Phe Arg
530 535 540
ata gca ggc gta aca cat cat cta gaa ttt att caa aat ttc aca aat 1728
Ile Ala Gly Val Thr His His Leu Glu Phe Ile Gln Asn Phe Thr Asn
545 550 555
cta aaa gtt tta aac ttg agc cac aac aac att tat act tta aca gat 1776
Leu Lys Val Leu Asn Leu Ser His Asn Asn Ile Tyr Thr Leu Thr Asp
560 565 570 575
aag tat aac ctg gaa agc aag tcc ctg gta gaa tta gtt ttc agt ggc 1824
Lys Tyr Asn Leu Glu Ser Lys Ser Leu Val Glu Leu Val Phe Ser Gly
580 585 590
aat cgc ctt gac att ttg tgg aat gat gat gac aac agg tat atc tcc 1872
Asn Arg Leu Asp Ile Leu Trp Asn Asp Asp Asp Asn Arg Tyr Ile Ser
595 600 605
att ttc aaa ggt ctc aag aat ctg aca cgt ctg gat tta tcc ctt aat 1920
Ile Phe Lys Gly Leu Lys Asn Leu Thr Arg Leu Asp Leu Ser Leu Asn
610 615 620
agg ctc aag cac atc cca aat gaa gca ttc ctt aat ttg cca gcg agt 1968
Arg Leu Lys His Ile Pro Asn Glu Ala Phe Leu Asn Leu Pro Ala Ser
625 630 635
ctc act gaa cta cat ata aat gat aat atg tta aag ttt ttt aac tgg 2016
Leu Thr Glu Leu His Ile Asn Asp Asn Met Leu Lys Phe Phe Asn Trp
640 645 650 655
aca tta ctc cag cag ttt cct cgt ctc gag ttg ctt gac tta cgt gga 2064
Thr Leu Leu Gln Gln Phe Pro Arg Leu Glu Leu Leu Asp Leu Arg Gly
660 665 670
aac aaa cta ctc ttt tta act gat agc cta tct gac ttt aca tct tcc 2112
Asn Lys Leu Leu Phe Leu Thr Asp Ser Leu Ser Asp Phe Thr Ser Ser
675 680 685
ctt cgg aca ctg ctg ctg agt cat aac agg att tcc cac cta ccc tct 2160
Leu Arg Thr Leu Leu Leu Ser His Asn Arg Ile Ser His Leu Pro Ser
690 695 700
ggc ttt ctt tct gaa gtc agt agt ctg aag cac ctc gat tta agt tcc 2208
Gly Phe Leu Ser Glu Val Ser Ser Leu Lys His Leu Asp Leu Ser Ser
705 710 715
aat ctg cta aaa aca atm aac aaa tcc gca ctt gaa act aag acc acc 2256
Asn Leu Leu Lys Thr Xaa Asn Lys Ser Ala Leu Glu Thr Lys Thr Thr
720 725 730 735
acc aaa tta tct atg ttg gaa cta cac gga aac ccc ttt gaa tgc acc 2304
Thr Lys Leu Ser Met Leu Glu Leu His Gly Asn Pro Phe Glu Cys Thr
740 745 750
tgt gac att gga gat ttc cga aga tgg atg gat gaa cat ctg aat gtc 2352
Cys Asp Ile Gly Asp Phe Arg Arg Trp Met Asp Glu His Leu Asn Val
755 760 765
aaa att ccc aga ctg gta gat gtc att tgt gcc agt cct ggg gat caa 2400
Lys Ile Pro Arg Leu Val Asp Val Ile Cys Ala Ser Pro Gly Asp Gln
770 775 780
aga ggg aag agt att gtg agt ctg gag cta aca act tgt gtt tca gat 2448
Arg Gly Lys Ser Ile Val Ser Leu Glu Leu Thr Thr Cys Val Ser Asp
785 790 795
gtc act gca gtg ata tta ttt ttc ttc acg ttc ttt atc acc acc atg 2496
Val Thr Ala Val Ile Leu Phe Phe Phe Thr Phe Phe Ile Thr Thr Met
800 805 810 815
gtt atg ttg gct gcc ctg gct cac cat ttg ttt tac tgg gat gtt tgg 2544
Val Met Leu Ala Ala Leu Ala His His Leu Phe Tyr Trp Asp Val Trp
820 825 830
ttt ata tat aat gtg tgt tta gct aag tta aaa ggc tac agg tct ctt 2592
Phe Ile Tyr Asn Val Cys Leu Ala Lys Leu Lys Gly Tyr Arg Ser Leu
835 840 845
tcc aca tcc caa act ttc tat gat gct tac att tct tat gac acc aaa 2640
Ser Thr Ser Gln Thr Phe Tyr Asp Ala Tyr Ile Ser Tyr Asp Thr Lys
850 855 860
gat gcc tct gtt act gac tgg gtg ata aat gag ctg cgc tac cac ctt 2688
Asp Ala Ser Val Thr Asp Trp Val Ile Asn Glu Leu Arg Tyr His Leu
865 870 875
gaa gag agc cga gac aaa aac gtt ctc ctt tgt cta gag gag agg gat 2736
Glu Glu Ser Arg Asp Lys Asn Val Leu Leu Cys Leu Glu Glu Arg Asp
880 885 890 895
tgg gac ccg gga ttg gcc atc atc gac aac ctc atg cag agc atc aac 2784
Trp Asp Pro Gly Leu Ala Ile Ile Asp Asn Leu Met Gln Set Ile Asn
900 905 910
caa agc aag aaa aca gta ttt gtt tta acc aaa aaa tat gca aaa agc 2832
Gln Ser Lys Lys Thr Val Phe Val Leu Thr Lys Lys Tyr Ala Lys Ser
915 920 925
tgg aac ttt aaa aca gct ttt tac ttg gcc ttg cag agg cta atg ggt 2880
Trp Asn Phe Lys Thr Ala Phe Tyr Leu Ala Leu Gln Arg Leu Met Gly
930 935 940
gag aac atg gat gtg att ata ttt atc ctg ctg gag cca gtg tta cag 2928
Glu Asn Met Asp Val Ile Ile Phe Ile Leu Leu Glu Pro Val Leu Gln
945 950 955
cat tct ccg tat ttg agg cta cgg cag cgg atc tgt aag agc tcc atc 2976
His Ser Pro Tyr Leu Arg Leu Arg Gln Arg Ile Cys Lys Set Set Ile
960 965 970 975
ctc cag tgg cct gac aac ccg aag gca gaa ggc ttg ttt tgg caa act 3024
Leu Gln Trp Pro Asp Asn Pro Lys Ala Glu Gly Leu Phe Trp Gln Thr
980 985 990
ctg aga aat gtg gtc ttg act gaa aat gat tca cgg tat aac aat atg 3072
Leu Arg Asn Val Val Leu Thr Glu Asn Asp Ser Arg Tyr Asn Asn Met
995 1000 1005
tat gtc gat tcc att aag caa tac taa 3099
Tyr Val Asp Ser Ile Lys Gln Tyr
1010 1015
MLTCIFLLISGSCELCAEENFSRSYPCDEKKQNDSVIAECSNRRLQEVPQTVGKYVTELDLSDNFITHI
TNESFQGLQNLTKINLNHNPNVQHQNGNPGIQSNGLNITDGAFLNLKNLRELLLEDNQLPQIPSGLPES
LTELSLIQNNIYNITKEGISRLINLKNLYLAWNCYFNKVCEKTNIEDGVFETLTNLELLSLSFNSLSHV
PPKLPSSLRKLFLSNTQIKYISEEDFKGLINLTLLDLSGNCPRCFNAPFPCVPCDGGASINIDRFAFQN
LTQLRYLNLSSTSLRKINAAWFKNMPHLKVLDLEFNYLVGEIASGAFLTMLPRLEILDLSFNYIKGSYP
QHINISRNFSKLLSLRALHLRGYVFQELREDDFQPLMQLPNLSTINLGINFIKQIDFKLFQNFSNLEII
YLSENRISPLVKDTRQSYANSSSFQRHIRKRRSTDFEFDPHSNFYHFTRPLIKPQCAAYGKALDLSLNS
IFFIGPNQFENLPDIACLNLSANSNAQVLSGTEFSAIPHVKYLDLTNNRLDFDNASALTELSDLEVLDL
SYNSHYFRIAGVTHHLEFIQNFTNLKVLNLSHNNIYTLTDKYNLESKSLVELVFSGNRLDILWNDDDNR
YISIFKGLKNLTRLDLSLNRLKHIPNEAFLNLPASLTELHINDNMLKFFNWTLLQQFPRLELLDLRGNK
LLFLTDSLSDFTSSLRTLLLSHNRISHLPSGFLSEVSSLKHLDLSSNLLKTINKSALETKTTTKLSMLE
LHGNPFECTCDIGDFRRWMDEHLNVKIPRLVDVICASPGDQRGKSIVSLELTTCVSDVTAVILFFFTFF
ITTMVMLAALAHHLFYWDVWFIYNVCLAKLKGYRSLSTSQTFYDAYISYDTKDASVTDWVINELRYHLE
ESRDKNVLLCLEERDWDPGLAIIDNLMQSINQSKKTVFVLTKKYAKSWNFKTAFYLALQRLMGENMDVI
IFILLEPVLQHSPYLRLRQRICKSSILQWPDNPKAEGLFWQTLRNVVLTENDSRYNNMYVDSIKQY
表8:诸如灵长类动物或人等哺乳动物的DNAX Toll样受体8(DTLR8)的部分核苷酸序列和氨基酸序列(见SEQ ID NO:19和20)。
AAT GAA TTG ATC CCC AAT CTA GAG AAG GAA GAT GGT TCT ATC TTG ATT 48
Asn Glu Leu Ile Pro Asn Leu Glu Lys Glu Asp Gly Ser Ile Leu Ile
1 5 10 15
TGC CTT TAT GAA AGC TAC TTT GAC CCT GGC AAA AGC ATT AGT GAA AAT 96
Cys Leu Tyr Glu Ser Tyr Phe Asp Pro Gly Lys Ser Ile Ser Glu Asn
20 25 30
ATT GTA AGC TTC ATT GAG AAA AGC TAT AAG TCC ATC TTT GTT TTG TCC 144
Ile Val Ser Phe Ile Glu Lys Ser Tyr Lys Ser Ile Phe Val Leu Ser
35 40 45
CCC AAC TTT GTC CAG AAT GAG TGG TGC CAT TAT GAA TTC TAC TTT GCC 192
Pro Asn Phe Val Gln Asn Glu Trp Cys His Tyr Glu Phe Tyr Phe Ala
50 55 60
CAC CAC AAT CTC TTC CAT GAA AAT TCT GAT CAC ATA ATT CTT ATC TTA 240
His His Asn Leu Phe His Glu Asn Ser Asp His Ile Ile Leu Ile Leu
65 70 75 80
CTG GAA CCC ATT CCA TTC TAT TGC ATT CCC ACC AGG TAT CAT AAA CTG 288
Leu Glu Pro Ile Pro Phe Tyr Cys Ile Pro Thr Arg Tyr His Lys Leu
85 90 95
GAA GCT CTC CTG GAA AAA AAA GCA TAC TTG GAA TGG CCC AAG GAT AGG 336
Glu Ala Leu Leu Glu Lys Lys Ala Tyr Leu Glu Trp Pro Lys Asp Arg
100 105 110
CGT AAA TGT GGG CTT TTC TGG GCA AAC CTT CGA GCT GCT GTT AAT GTT 384
Arg Lys Cys Gly Leu Phe Trp Ala Asn Leu Arg Ala Ala Val Asn Val
115 120 125
AAT GTA TTA GCC ACC AGA GAA ATG TAT GAA CTG CAG ACA TTC ACA GAG 432
Asn Val Leu Ala Thr Arg Glu Met Tyr Glu Leu Gln Thr Phe Thr Glu
130 135 140
TTA AAT GAA GAG TCT CGA GGT TCT ACA ATC TCT CTG ATG AGA ACA GAC 480
Leu Asn Glu Glu Ser Arg Gly Ser Thr Ile Ser Leu Met Arg Thr Asp
145 150 155 160
TGT CTA TAAAATCCCA CAGTCCTTGG GAAGTTGGGG ACCACATACA CTGTTGGGAT 536
Cys Leu
GTACATTGAT ACAACCTTTA TGATGGCAAT TTGACAATAT TTATTAAAAT AAAAAATGGT 596
TATTCCCTTC AAAAAAAAAA AAAAAAAAAA AAA 629
NELIPNLEKEDGSILICLYESYFDPGKSISENIVSFIEKSYKSIFVLSPNFVQNEWCHYEFYFAHHNLFHENS
HIILILLEPIPFYCIPTRYHKLEALLEKKAYLEWPKDRRKCGLFWANLRAAVNVNVLATREMYELQTFTELNE
SRGSTISLMRTDCL
诸如人等灵长类动物的附加序列(SEQ ID NO:31和32);用C表示的第4和23位核苷酸可以是A、C、G或T;用C表示的第845位核苷酸可以是C或T:
C TCC GAT GCC AAG ATT CGG CAC CAG GCA TAT TCA GAG GTC ATG ATG 46
Ser Asp Ala Lys Ile Arg His Gln Ala Tyr Ser Glu Val Met Met
1 5 10 15
GTT GGA TGG TCA GAT TCA TAC ACC TGT GAA TAC CCT TTA AAC CTA AGG 94
Val Gly Trp Ser Asp Ser Tyr Thr Cys Glu Tyr Pro Leu Asn Leu Arg
20 25 30
GGA ACT AGG TTA AAA GAC GTT CAT CTC CAC GAA TTA TCT TGC AAC ACA 142
Gly Thr Arg Leu Lys Asp Val His Leu His Glu Leu Ser Cys Asn Thr
35 40 45
GCT CTG TTG ATT GTC ACC ATT GTG GTT ATT ATG CTA GTT CTG GGG TTG 190
Ala Leu Leu Ile Val Thr Ile Val Val Ile Met Leu Val Leu Gly Leu
50 55 60
GCT GTG GCC TTC TGC TGT CTC CAC TTT GAT CTG CCC TGG TAT CTC AGG 238
Ala Val Ala Phe Cys Cys Leu His Phe Asp Leu Pro Trp Tyr Leu Arg
65 70 75
ATG CTA GGT CAA TGC ACA CAA ACA TGG CAC AGG GTT AGG AAA ACA ACC 286
Met Leu Gly Gln Cys Thr Gln Thr Trp His Arg Val Arg Lys Thr Thr
80 85 90 95
CAA GAA CAA CTC AAG AGA AAT GTC CGA TTC CAC GCA TTT ATT TCA TAC 334
Gln Glu Gln Leu Lys Arg Asn Val Arg Phe His Ala Phe Ile Ser Tyr
100 105 110
AGT GAA CAT GAT TCT CTG TGG GTG AAG AAT GAA TTG ATC CCC AAT CTA 382
Ser Glu His Asp Ser Leu Trp Val Lys Asn Glu Leu Ile Pro Asn Leu
115 120 125
GAG AAG GAA GAT GGT TCT ATC TTG ATT TGC CTT TAT GAA AGC TAC TTT 430
Glu Lys Glu Asp Gly Ser Ile Leu Ile Cys Leu Tyr Glu Ser Tyr Phe
130 135 140
GAC CCT GGC AAA AGC ATT AGT GAA AAT ATT GTA AGC TTC ATT GAG AAA 478
Asp Pro Gly Lys Ser Ile Ser Glu Asn Ile Val Ser Phe Ile Glu Lys
145 150 155
AGC TAT AAG TCC ATC TTT GTT TTG TCT CCC AAC TTT GTC CAG AAT GAG 526
Ser Tyr Lys Ser Ile Phe Val Leu Ser Pro Asn Phe Val Gln Asn Glu
160 165 170 175
TGG TGC CAT TAT GAA TTC TAC TTT GCC CAC CAC AAT CTC TTC CAT GAA 574
Trp Cys His Tyr Glu Phe Tyr Phe Ala His His Asn Leu Phe His Glu
180 185 190
AAT TCT GAT CAC ATA ATT CTT ATC TTA CTG GAA CCC ATT CCA TTC TAT 622
Asn Ser Asp His Ile Ile Leu Ile Leu Leu Glu Pro Ile Pro Phe Tyr
195 200 205
TGC ATT CCC ACC AGG TAT CAT AAA CTG GAA GCT CTC CTG GAA AAA AAA 670
Cys Ile Pro Thr Arg Tyr His Lys Leu Glu Ala Leu Leu Glu Lys Lys
210 215 220
GCA TAC TTG GAA TGG CCC AAG GAT AGG CGT AAA TGT GGG CTT TTC TGG 718
Ala Tyr Leu Glu Trp Pro Lys Asp Arg Arg Lys Cys Gly Leu Phe Trp
225 230 235
GCA AAC CTT CGA GCT GCT GTT AAT GTT AAT GTA TTA GCC ACC AGA GAA 766
Ala Asn Leu Arg Ala Ala Val Asn Val Asn Val Leu Ala Thr Arg Glu
240 245 250 255
ATG TAT GAA CTG CAG ACA TTC ACA GAG TTA AAT GAA GAG TCT CGA GGT 814
Met Tyr Glu Leu Gln Thr Phe Thr Glu Leu Asn Glu Glu Ser Arg Gly
260 265 270
TCT ACA ATC TCT CTG ATG AGA ACA GAC TGT CTA TAAAATCCCA CAGTCCTTGG 867
Ser Thr Ile Ser Leu Met Arg Thr Asp Cys Leu
275 280
GAAGTTGGGG ACCACATACA CTGTTGGGAT GTACATTGAT ACAACCTTTA TGATGGCAAT 927
TTGACAATAT TTATTAAAAT AAAAAATGGT TATTCCCTTC AAAAAAAAAA AAAAAAAAAA 987
AAAAAAAAAA AA 999
SDAKIRHQAYSEVMMVGWSDSYTCEYPLNLRGTRLKDVHLHELSCNTALLIVTIVVIMLVLGLAVAFCCLHFDK
WYLRMLGQCTQTWHRVRKTTQEQLKRNVRFHAFISYSEHDSLWVKNELIPNLEKEDGSILICLYESYFDPGKST
ENIVSFIEKSYKSIFVLSPNFVQNEWCHYEFYFAHHNLFHENSDHIILILLEPIPFYCIPTRYHKLEALLEKKA
LEWPKDRRKCGLFWANLRAAVNVNVLATREMYELQTFTELNEESRGSTISLMRTDCL
诸如人等灵长类动物的DTLR8的其他序列(SEQ ID NO:38和39):
gaatcatcca cgcacctgca gctctgctga gagagtgcaa gccgtggggg ttttgagctc 60
atcttcatca ttcatatgag gaaataagtg gtaaaatcct tggaaataca atg aga 116
Met Arg
ctc atc aga aac att tac ata ttt tgt agt att gtt atg aca gca gag 164
Leu Ile Arg Asn Ile Tyr Ile Phe Cys Ser Ile Val Met Thr Ala Glu
-15 -10 -5
ggt gat gct cca gag ctg cca gaa gaa agg gaa ctg atg acc aac tgc 212
Gly Asp Ala Pro Glu Leu Pro Glu Glu Arg Glu Leu Met Thr Asn Cys
-1 1 5 10 15
tcc aac atg tct cta aga aag gtt ccc gca gac ttg acc cca gcc aca 260
Ser Asn Met Ser Leu Arg Lys Val Pro Ala Asp Leu Thr Pro Ala Thr
20 25 30
acg aca ctg gat tta tcc tat aac ctc ctt ttt caa ctc cag agt tca 308
Thr Thr Leu Asp Leu Ser Tyr Asn Leu Leu Phe Gln Leu Gln Set Set
35 40 45
gat ttt cat tct gtc tcc aaa ctg aga gtt ttg att cta tgc cat aac 356
Asp Phe His Ser Val Ser Lys Leu Arg Val Leu Ile Leu Cys His Asn
50 55 60
aga att caa cag ctg gat ctc aaa acc ttt gaa ttc aac aag gag tta 404
Arg Ile Gln Gln Leu Asp Leu Lys Thr Phe Glu Phe Asn Lys Glu Leu
65 70 75
aga tat tta gat ttg tct aat aac aga ctg aag agt gta act tgg tat 452
Arg Tyr Leu Asp Leu Ser Asn Asn Arg Leu Lys Ser Val Thr Trp Tyr
80 85 90 95
tta ctg gca ggt ctc agg tat tta gat ctt tct ttt aat gac ttt gac 500
Leu Leu Ala Gly Leu Arg Tyr Leu Asp Leu Ser Phe Asn Asp Phe Asp
100 105 110
acc atg cct atc tgt gag gaa gct ggc aac atg tca cac ctg gaa atc 548
Thr Met Pro Ile Cys Glu Glu Ala Gly Asn Met Ser His Leu Glu Ile
115 120 125
cta ggt ttg agt ggg gca aaa ata caa aaa tca gat ttc cag aaa att 596
Leu Gly Leu Ser Gly Ala Lys Ile Gln Lys Ser Asp Phe Gln Lys Ile
130 135 140
gct cat ctg cat cta aat act gtc ttc tta gga ttc aga act ctt cct 644
Ala His Leu His Leu Asn Thr Val Phe Leu Gly Phe Arg Thr Leu Pro
145 150 155
cat tat gaa gaa ggt agc ctg ccc atc tta aac aca aca aaa ctg cac 692
His Tyr Glu Glu Gly Ser Leu Pro Ile Leu Asn Thr Thr Lys Leu His
160 165 170 175
att gtt tta cca atg gac aca aat ttc tgg gtt ctt ttg cgt gat gga 740
Ile Val Leu Pro Met Asp Thr Asn Phe Trp Val Leu Leu Arg Asp Gly
180 185 190
atc aag act tca aaa ata tta gaa atg aca aat ata gat ggc aaa agc 788
Ile Lys Thr Ser Lys Ile Leu Glu Met Thr Asn Ile Asp Gly Lys Ser
195 200 205
caa ttt gta agt tat gaa atg caa cga aat ctt agt tta gaa aat gct 836
Gln Phe Val Ser Tyr Glu Met Gln Arg Asn Leu Ser Leu Glu Asn Ala
210 215 220
aag aca tcg gtt cta ttg ctt aat aaa gtt gat tta ctc tgg gac gac 884
Lys Thr Ser Val Leu Leu Leu Asn Lys Val Asp Leu Leu Trp Asp Asp
225 230 235
ctt ttc ctt atc tta caa ttt gtt tgg cat aca tca gtg gaa cac ttt 932
Leu Phe Leu Ile Leu Gln Phe Val Trp His Thr Ser Val Glu His Phe
240 245 250 255
cag atc cga aat gtg act ttt ggt ggt aag gct tat ctt gac cac aat 980
Gln Ile Arg Asn Val Thr Phe Gly Gly Lys Ala Tyr Leu Asp His Asn
260 265 270
tca ttt gac tac tca aat act gta atg aga act ata aaa ttg gag cat 1028
Ser Phe Asp Tyr Ser Asn Thr Val Met Arg Thr Ile Lys Leu Glu His
275 280 285
gta cat ttc aga gtg ttt tac att caa cag gat aaa atc tat ttg ctt 1076
Val His Phe Arg Val Phe Tyr Ile Gln Gln Asp Lys Ile Tyr Leu Leu
290 295 300
ttg acc aaa atg gac ata gaa aac ctg aca ata tca aat gca caa atg 1124
Leu Thr Lys Met Asp Ile Glu Asn Leu Thr Ile Set Asn Ala Gln Met
305 310 315
cca cac atg ctt ttc ccg aat tat cct acg aaa ttc caa tat tta aat 1172
Pro His Met Leu Phe Pro Asn Tyr Pro Thr Lys Phe Gln Tyr Leu Asn
320 325 330 335
ttt gcc aat aat atc tta aca gac gag ttg ttt aaa aga act atc caa 1220
Phe Ala Asn Asn Ile Leu Thr Asp Glu Leu Phe Lys Arg Thr Ile Gln
340 345 350
ctg cct cac ttg aaa act ctc att ttg aat ggc aat aaa ctg gag aca 1268
Leu Pro His Leu Lys Thr Leu Ile Leu Asn Gly Asn Lys Leu Glu Thr
355 360 365
ctt tct tta gta agt tgc ttt gct aac aac aca ccc ttg gaa cac ttg 1316
Leu Ser Leu Val Ser Cys Phe Ala Asn Asn Thr Pro Leu Glu His Leu
370 375 380
gat ctg agt caa aat cta tta caa cat aaa aat gat gaa aat tgc tca 1364
Asp Leu Ser Gln Asn Leu Leu Gln His Lys Asn Asp Glu Asn Cys Ser
385 390 395
tgg cca gaa act gtg gtc aat atg aat ctg tca tac aat aaa ttg tct 1412
Trp Pro Glu Thr Val Val Asn Met Asn Leu Ser Tyr Asn Lys Leu Ser
400 405 410 415
gat tct gtc ttc agg tgc ttg ccc aaa agt att caa ata ctt gac cta 1460
Asp Ser Val Phe Arg Cys Leu Pro Lys Ser Ile Gln Ile Leu Asp Leu
420 425 430
aat aat aac caa atc caa act gta cct aaa gag act att cat ctg atg 1508
Asn Asn Asn Gln Ile Gln Thr Val Pro Lys Glu Thr Ile His Leu Met
435 440 445
gcc tta cga gaa cta aat att gca ttt aat ttt cta act gat ctc cct 1556
Ala Leu Arg Glu Leu Asn Ile Ala Phe Asn Phe Leu Thr Asp Leu Pro
450 455 460
gga tgc agt cat ttc agt aga ctt tca gtt ctg aac att gaa atg aac 1604
Gly Cys Ser His Phe Ser Arg Leu Ser Val Leu Asn Ile Glu Met Asn
465 470 475
ttc att ctc agc cca tct ctg gat ttt gtt cag agc tgc cag gaa gtt 1652
Phe Ile Leu Ser Pro Ser Leu Asp Phe Val Gln Ser Cys Gln Glu Val
480 485 490 495
aaa act cta aat gcg gga aga aat cca ttc cgg tgt acc tgt gaa tta 1700
Lys Thr Leu Asn Ala Gly Arg Asn Pro Phe Arg Cys Thr Cys Glu Leu
500 505 510
aaa aat ttc att cag ctt gaa aca tat tca gag gtc atg atg gtt gga 1748
Lys Asn Phe Ile Gln Leu Glu Thr Tyr Ser Glu Val Met Met Val Gly
515 520 525
tgg tca gat tca tao acc tgt gaa tac cct tta aac cta agg gga act 1796
Trp Ser Asp Ser Tyr Thr Cys Glu Tyr Pro Leu Asn Leu Arg Gly Thr
530 535 540
agg tta aaa gac gtt cat ctc cac gaa ttatct tgc aac aca gct ctg 1844
Arg Leu Lys Asp Val His Leu His Glu Leu Ser Cys Asn Thr Ala Leu
545 550 555
ttg att gtc acc att gtg gtt att atg cta gtt ctg ggg ttg gct gtg 1892
Leu Ile Val Thr Ile Val Val Ile Met Leu Val Leu Gly Leu Ala Val
560 565 570 575
gcc ttc tgc tgt ctc cac ttt gat ctg ccc tgg tat ctc agg atg cta 1940
Ala Phe Cys Cys Leu His Phe Asp Leu Pro Trp Tyr Leu Arg Met Leu
580 585 590
ggt caa tgc aca caa aca tgg cac agg gtt agg aaa aca acc caa gaa 1988
Gly Gln Cys Thr Gln Thr Trp His Arg Val Arg Lys Thr Thr Gln Glu
595 600 605
caa ctc aag aga aat gtc cga ttc cac gca ttt att tca tac agt gaa 2036
Gln Leu Lys Arg Asn Val Arg Phe His Ala Phe Ile Ser Tyr Ser Glu
610 615 620
cat gat tct ctg tgg gtg aag aat gaa ttg atc ccc aat cta gag aag 2084
His Asp Ser Leu Trp Val Lys Asn Glu Leu Ile Pro Asn Leu Glu Lys
625 630 635
gaa gat ggt tct atc ttg att tgc ctt tat gaa agc tac ttt gac cct 2132
Glu Asp Gly Ser Ile Leu Ile Cys Leu Tyr Glu Ser Tyr Phe Asp Pro
640 645 650 655
ggc aaa agc att agt gaa aat att gta agc ttc att gag aaa agc tat 2180
Gly Lys Ser Ile Ser Glu Asn Ile Val Ser Phe Ile Glu Lys Ser Tyr
660 665 670
aag tcc atc ttt gtt ttg tct ccc aac ttt gtc cag aat gag tgg tgc 2228
Lys Ser Ile Phe Val Leu Ser Pro Asn Phe Val Gln Asn Glu Trp Cys
675 680 685
cat tat gaa ttc tac ttt gcc cac cac aat ctc ttc cat gaa aat tct 2276
His Tyr Glu Phe Tyr Phe Ala His His Asn Leu Phe His Glu Asn Ser
690 695 700
gat cat ata att ctt atc tta ctg gaa ccc att cca ttc tat tgc att 2324
Asp His Ile Ile Leu Ile Leu Leu Glu Pro Ile Pro Phe Tyr Cys Ile
705 710 715
ccc acc agg tat cat aaa ctg aaa gct ctc ctg gaa aaa aaa gca tac 2372
Pro Thr Arg Tyr His Lys Leu Lys Ala Leu Leu Glu Lys Lys Ala Tyr
720 725 730 735
ttg gaa tgg ccc aag gat agg cgt aaa tgt ggg ctt ttc tgg gca aac 2420
Leu Glu Trp Pro Lys Asp Arg Arg Lys Cys Gly Leu Phe Trp Ala Asn
740 745 750
ctt cga gct gct att aat gtt aat gta tta gcc acc aga gaa atg tat 2468
Leu Arg Ala Ala Ile Asn Val Asn Val Leu Ala Thr Arg Glu Met Tyr
755 760 765
gaa ctg cag aca ttc aca gag tta aat gaa gag tct cga ggt tct aca 2516
Glu Leu Gln Thr Phe Thr Glu Leu Asn Glu Glu Ser Arg Gly Ser Thr
770 775 780
atc tct ctg atg aga aca gat tgt cta taaaatccca cagtccttgg 2563
Ile Ser Leu Met Arg Thr Asp Cys Leu
785 790
gaagttgggg accacataca ctgttgggat gtacattgat acaaccttta tgatggcaat 2623
ttgacaatat ttattaaaat aaaaaatggt tattcccttc atatcagttt ctagaaggat 2683
ttctaagaat gtatcctata gaaacacctt cacaagttta taagggctta tggaaaaagg 2743
tgttcatccc aggattgttt ataatcatga aaaatgtggc caggtgcagt ggctcactct 2803
tgtaatccca gcactatggg aggccaaggt gggtgaccca cgaggtcaag agatggagac 2863
catcctggcc aacatggtga aaccctgtct ctactaaaaa tacaaaaatt agctgggcgt 2923
gatggtgcac gcctgtagtc ccagctactt gggaggctga ggcaggagaa tcgcttgaac 2983
ccgggaggtg gcagttgcag tgagctgaga tcgagccact gcactccagc ctggtgacag 3043
agc 3046
MRLIRNIYIFCSIVMTAEGDAPELPEERELMTNCSNMSLRKVPADLTPATTTLDLSYNLLFQLQSSDFH
SVSKLRVLILCHNRIQQLDLKTFEFNKELRYLDLSNNRLKSVTWYLLAGLRYLDLSFNDFDTMPICEEA
GNMSHLEILGLSGAKIQKSDFQKIAHLHLNTVFLGFRTLPHYEEGSLPILNTTKLHIVLPMDTNFWVLL
RDGIKTSKILEMTNIDGKSQFVSYEMQRNLSLENAKTSVLLLNKVDLLWDDLFLILQFVWHTSVEHFQI
RNVTFGGKAYLDHNSFDYSNTVMRTIKLEHVHFRVFYIQQDKIYLLLTKMDIENLTISNAQMPHMLFPN
YPTKFQYLNFANNILTDELFKRTIQLPHLKTLILNGNKLETLSLVSCFANNTPLEHLDLSQNLLQHKND
ENCSWPETVVNMNLSYNKLSDSVFRCLPKSIQILDLNNNQIQTVPKETIHLMALRELNIAFNFLTDLPG
CSHFSRLSVLNIEMNFILSPSLDFVQSCQEVKTLNAGRNPFRCTCELKNFIQLETYSEVMMVGWSDSYT
CEYPLNLRGTRLKDVHLHELSCNTALLIVTIVVIMLVLGLAVAFCCLHFDLPWYLRMLGQCTQTWHRVR
KTTQEQLKRNVRFHAFISYSEHDSLWVKNELIPNLEKEDGSILICLYESYFDPGKSISENIVSFIEKSY
KSIFVLSPNFVQNEWCHYEFYFAHHNLFHENSDHIILILLEPIPFYCIPTRYHKLKALLEKKAYLEWPK
DRRKCGLFWANLRAAINVNVLATREMYELQTFTELNEESRGSTISLMRTDCL
表9:诸如灵长类动物或人等哺乳动物的DNAX Toll样受体9(DTLR9)的部分核苷酸序列和氨基酸序列(见SEQ ID NO:21和22)。
AAG AAC TCC AAA GAA AAC CTC CAG TTT CAT GCT TTT ATT TCA TAT AGT 48
Lys Asn Ser Lys Glu Asn Leu Gln Phe His Ala Phe Ile Ser Tyr Ser
1 5 10 15
GAA CAT GAT TCT GCC TGG GTG AAA AGT GAA TTG GTA CCT TAC CTA GAA 96
Glu His Asp Ser Ala Trp Val Lys Ser Glu Leu Val Pro Tyr Leu Glu
20 25 30
AAA GAA GAT ATA CAG ATT TGT CTT CAT GAG AGA AAC TTT GTC CCT GGC 144
Lys Glu Asp Ile Gln Ile Cys Leu His Glu Arg Asn Phe Val Pro Gly
35 40 45
AAG AGC ATT GTG GAA AAT ATC ATC AAC TGC ATT GAG AAG AGT TAC AAG 192
Lys Ser Ile Val Glu Asn Ile Ile Asn Cys Ile Glu Lys Ser Tyr Lys
50 55 60
TCC ATC TTT GTT TTG TCT CCC AAC TTT GTC CAG AGT GAG TGG TGC CAT 240
Ser Ile Phe Val Leu Ser Pro Asn Phe Val Gln Ser Glu Trp Cys His
65 70 75 80
TAC GAA CTC TAT TTT GCC CAT CAC AAT CTC TTT CAT GAA GGA TCT AAT 288
Tyr Glu Leu Tyr Phe Ala His His Asn Leu Phe His Glu Gly Ser Asn
85 90 95
AAC TTA ATC CTC ATC TTA CTG GAA CCC ATT CCA CAG AAC AGC ATT CCC 336
Asn Leu Ile Leu Ile Leu Leu Glu Pro Ile Pro Gln Asn Ser Ile Pro
100 105 110
AAC AAG TAC CAC AAG CTG AAG GCT CTC ATG ACG CAG CGG ACT TAT TTG 384
Asn Lys Tyr His Lys Leu Lys Ala Leu Met Thr Gln Arg Thr Tyr Leu
115 120 125
CAG TGG CCC AAG GAG AAA AGC AAA CGT GGG CTC TTT TGG GCT 426
Gln Trp Pro Lys Glu Lys Ser Lys Arg Gly Leu Phe Trp Ala
130 135 140
A 427
KNSKENLQFHAFISYSEHDSAWVKSELVPYLEKEDIQICLHERNFVPGKSIVENIINCIEKSYKSIFVLSPNE
SEWCHYELYFAHHNLFHEGSNNLILILLEPIPQNSIPNKYHKLKALMTQRTYLQWPKEKSKRGLFWA
诸如人等灵长类动物的DTLR9的其他序列(SEQ ID NO:40和41):
aagaatttgg actcatatca agatgctctg aagaagaaca accctttagg atagccactg 60
caacatc atg acc aaa gac aaa gaa cct att gtt aaa agc ttc cat ttt 109
Met Thr Lys Asp Lys Glu Pro Ile Val Lys Ser Phe His Phe
-30 -25 -20
gtt tgc ctt atg atc ata ata gtt gga acc aga atc cag ttc tcc gac 157
Val Cys Leu Met Ile Ile Ile Val Gly Thr Arg Ile Gln Phe Ser Asp
-15 -10 -5
gga aat gaa ttt gca gta gac aag tca aaa aga ggt ctt att cat gtt 205
Gly Asn Glu Phe Ala Val Asp Lys Ser Lys Arg Gly Leu Ile His Val
-1 1 5 10 15
cca aaa gac cta ccg ctg aaa acc aaa gtc tta gat atg tct cag aac 253
Pro Lys Asp Leu Pro Leu Lys Thr Lys Val Leu Asp Met Ser Gln Asn
20 25 30
rac atc gct gag ctt cag gtc tct gac atg agc ttt cta tca gag ttg 301
Tyr Ile Ala Glu Leu Gln Val Ser Asp Met Ser Phe Leu Ser Glu Leu
35 40 45
aca gtt ttg aga ctt tcc cat aac aga atc cag cta ctt gat tta agt 349
Thr Val Leu Arg Leu Ser His Asn Arg Ile Gln Leu Leu Asp Leu Ser
50 55 60
gtt ttc aag ttc aac cag gat tta gaa tat ttg gat tta tct cat aat 397
Val Phe Lys Phe Asn Gln Asp Leu Glu Tyr Leu Asp Leu Ser His Asn
65 70 75
cag ttg caa aag ata tcc tgc cat cct att gtg agt ttc agg cat tta 445
Gln Leu Gln Lys Ile Ser Cys His Pro Ile Val Ser Phe Arg His Leu
80 85 90 95
gat ctc tca ttc aat gat ttc aag gcc ctg ccc atc tgt aag gaa ttt 493
Asp Leu Ser Phe Asn Asp Phe Lys Ala Leu Pro Ile Cys Lys Glu Phe
100 105 110
ggc aac tta tca caa ctg aat ttc ttg gga ttg agt gct atg aag ctg 541
Gly Asn Leu Ser Gln Leu Asn Phe Leu Gly Leu Ser Ala Met Lys Leu
115 120 125
caa aaa tta gat ttg ctg cca att gct cac ttg cat cta agt tat atc 589
Gln Lys Leu Asp Leu Leu Pro Ile Ala His Leu His Leu Set Tyr Ile
130 135 140
ctt ctg gat tta aga aat tat tat ata aaa gaa aat gag aca gaa agt 637
Leu Leu Asp Leu Arg Asn Tyr Tyr Ile Lys Glu Asn Glu Thr Glu Ser
145 150 155
cta caa att ctg aat gca aaa acc ctt cac ctt gtt ttt cac cca act 685
Leu Gln Ile Leu Asn Ala Lys Thr Leu His Leu Val Phe His Pro Thr
160 165 170 175
agt tta ttc gct atc caa gtg aac ata tca gtt aat act tta ggg tgc 733
Ser Leu Phe Ala Ile Gln Val Asn Ile Ser Val Asn Thr Leu Gly Cys
180 185 190
tta caa ctg act aat att aaa ttg aat gat gac aac tgt caa gtt ttc 781
Leu Gln Leu Thr Asn Ile Lys Leu Asn Asp Asp Asn Cys Gln Val Phe
195 200 205
att aaa ttt tta tca gaa ctc acc aga ggt cca acc tta ctg aat ttt 829
Ile Lys Phe Leu Ser Glu Leu Thr Arg Gly Pro Thr Leu Leu Asn Phe
210 215 220
acc ctc aac cac ata gaa acg act tgg aaa tgc ctg gtc aga gtc ttt 877
Thr Leu Asn His Ile Glu Thr Thr Trp Lys Cys Leu Val Arg Val Phe
225 230 235
caa ttt ctt tgg ccc aaa cct gtg gaa tat ctc aat att tac aat tta 925
Gln Phe Leu Trp Pro Lys Pro Val Glu Tyr Leu Asn Ile Tyr Asn Leu
240 245 250 255
aca ata att gaa agc att cgt gaa gaa gat ttt act tat tct aaa acg 973
Thr Ile Ile Glu Ser Ile Arg Glu Glu Asp Phe Thr Tyr Ser Lys Thr
260 265 270
aca ttg aaa gca ttg aca ata gaa cat atc acg aac caa gtt ttt ctg 1021
Thr Leu Lys Ala Leu Thr Ile Glu His Ile Thr Asn Gln Val Phe Leu
275 280 285
ttt tca cag aca gct ttg tac acc gtg ttt tct gag atg aac att atg 1069
Phe Ser Gln Thr Ala Leu Tyr Thr Val Phe Ser Glu Met Asn Ile Met
290 295 300
atg tta acc att tca gat aca cct ttt ata cac atg ctg tgt cct cat 1117
Met Leu Thr Ile Ser Asp Thr Pro Phe Ile His Met Leu Cys Pro His
305 310 315
gca cca agc aca ttc aag ttt ttg aac ttt acc cag aac gtt ttc aca 1165
Ala Pro Ser Thr Phe Lys Phe Leu Asn Phe Thr Gln Asn Val Phe Thr
320 325 330 335
gat agt att ttt gaa aaa tgt tcc acg tta gtt aaa ttg gag aca ctt 1213
Asp Ser Ile Phe Glu Lys Cys Ser Thr Leu Val Lys Leu Glu Thr Leu
340 345 350
atc tta caa aag aat gga tta aaa gac ctt ttc aaa gta ggt ctc atg 1261
Ile Leu Gln Lys Asn Gly Leu Lys Asp Leu Phe Lys Val Gly Leu Met
355 360 365
acg aag gat atg cct tct ttg gaa ata ctg gat gtt agc tgg aat tct 1309
Thr Lys Asp Met Pro Ser Leu Glu Ile Leu Asp Val Ser Trp Asn Ser
370 375 380
ttg gaa tct ggt aga cat aaa gaa aac tgc act tgg gtt gag agt ata 1357
Leu Glu Ser Gly Arg His Lys Glu Asn Cys Thr Trp Val Glu Ser Ile
385 390 395
gtg gtg tta aat ttg tct tca aat atg ctt act gac tct gtt ttc aga 1405
Val Val Leu Asn Leu Ser Ser Asn Met Leu Thr Asp Ser Val Phe Arg
400 405 410 415
tgt tta cct ccc agg atc aag gta ctt gat ctt cac agc aat aaa ata 1453
Cys Leu Pro Pro Arg Ile Lys Val Leu Asp Leu His Ser Asn Lys Ile
420 425 430
aag agc gtt cct aaa caa gtc gta aaa ctg gaa gct ttg caa gaa ctc 1501
Lys Ser Val Pro Lys Gln Val Val Lys Leu Glu Ala Leu Gln Glu Leu
435 440 445
aat gtt gct ttc aat tct tta act gac ctt cct gga tgt ggc agc ttt 1549
Asn Val Ala Phe Asn Ser Leu Thr Asp Leu Pro Gly Cys Gly Ser Phe
450 455 460
agc agc ctt tct gta ttg atc att gat cac aat tca gtt tcc cac cca 1597
Ser Ser Leu Ser Val Leu Ile Ile Asp His Asn Ser Val Ser His Pro
465 470 475
tcg gct gat ttc ttc cag agc tgc cag aag atg agg tca ata aaa gca 1645
Ser Ala Asp Phe Phe Gln Ser Cys Gln Lys Met Arg Ser Ile Lys Ala
480 485 490 495
ggg gac aat cca ttc caa tgt acc tgt gag cta aga gaa ttt gtc aaa 1693
Gly Asp Asn Pro Phe Gln Cys Thr Cys Glu Leu Arg Glu Phe Val Lys
500 505 510
aat ata gac caa gta tca agt gaa gtg tta gag ggc tgg cct gat tct 1741
Asn Ile Asp Gln Val Ser Ser Glu Val Leu Glu Gly Trp Pro Asp Ser
515 520 525
tat aag tgt gac tac cca gaa agt tat aga gga agc cca cta aag gac 1789
Tyr Lys Cys Asp Tyr Pro Glu Ser Tyr Arg Gly Ser Pro Leu Lys Asp
530 535 540
ttt cac atg tct gaa tta tcc tgc aac ata act ctg ctg atc gtc acc 1837
Phe His Met Ser Glu Leu Ser Cys Asn Ile Thr Leu Leu Ile Val Thr
545 550 555
atc ggt gcc acc atg ctg gtg ttg gct gtg act gtg acc tcc ctc tgc 1885
Ile Gly Ala Thr Met Leu Val Leu Ala Val Thr Val Thr Ser Leu Cys
560 565 570 575
atc tac ttg gat ctg ccc tgg tat ctc agg atg gtg tgc cag tgg acc 1933
Ile Tyr Leu Asp Leu Pro Trp Tyr Leu Arg Met Val Cys Gln Trp Thr
580 585 590
cag act cgg cgc agg gcc agg aac ata ccc tta gaa gaa ctc caa aga 1981
Gln Thr Arg Arg Arg Ala Arg Asn Ile Pro Leu Glu Glu Leu Gln Arg
595 600 605
aac ctc cag ttt cat gct ttt att tca tat agt gaa cat gat tct gcc 2029
Asn Leu Gln Phe His Ala Phe Ile Ser Tyr Ser Glu His Asp Ser Ala
610 615 620
tgg gtg aaa agt gaa ttg gta cct tac cta gaa aaa gaa gat ata cag 2077
Trp Val Lys Ser Glu Leu Val Pro Tyr Leu Glu Lys Glu Asp Ile Gln
625 630 635
att tgt ctt cat gag agg aac ttt gtc cct ggc aag agc att gtg gaa 2125
Ile Cys Leu His Glu Arg Asn Phe Val Pro Gly Lys Ser Ile Val Glu
640 645 650 655
aat atc atc aac tgc att gag aag agt tac aag tcc atc ttt gtt ttg 2173
Asn Ile Ile Asn Cys Ile Glu Lys Ser Tyr Lys Ser Ile Phe Val Leu
660 665 670
tct ccc aac ttt gtc cag agt gag tgg tgc cat tac gaa ctc tat ttt 2221
Ser Pro Asn Phe Val Gln Ser Glu Trp Cys His Tyr Glu Leu Tyr Phe
675 680 685
gcc cat cac aat ctc ttt cat gaa gga tct aat aac tta atc ctc atc 2269
Ala His His Asn Leu Phe His Glu Gly Ser Asn Asn Leu Ile Leu Ile
690 695 700
tta ctg gaa ccc att cca cag aac agc att ccc aac aag tac cac aag 2317
Leu Leu Glu Pro Ile Pro Gln Asn Ser Ile Pro Asn Lys Tyr His Lys
705 710 715
ctg aag gct ctc atg acg cag cgg act tat ttg cag tgg ccc aag gag 2365
Leu Lys Ala Leu Met Thr Gln Arg Thr Tyr Leu Gln Trp Pro Lys Glu
720 725 730 735
aaa agc aaa cgt ggg ctc ttt tgg gct aac att aga gcc gct ttt aat 2413
Lys Ser Lys Arg Gly Leu Phe Trp Ala Asn Ile Arg Ala Ala Phe Asn
740 745 750
atg aaa tta aca cta gtc act gaa aac aat gat gtg aaa tct 2455
Met Lys Leu Thr Leu Val Thr Glu Asn Asn Asp Val Lys Ser
755 760 765
taaaaaaatt taggaaattc aacttaagaa accattattt acttggatga tggtgaatag 2515
tacagtcgta agtnactgtc tggaggtgcc tccattatcc tcatgccttc aggaaagact 2575
taacaaaaac aatgtttcat ctggggaact gagctaggcg gtgaggttag cctgccagtt 2635
agagacagcc cagtctcttc tggtttaatc attatgtttc aaattgaaac agtctctttt 2695
gagtaaatgc tcagtttttc agctcctctc cactctgctt tcccaaatgg attctgttgg 2755
tgaag 2760
MTKDKEPIVKSFHFVCLMIIIVGTRIQFSDGNEFAVDKSKRGLIHVPKDLPLKTKVLDMSQNYIAELQV
SDMSFLSELTVLRLSHNRIQLLDLSVFKFNQDLEYLDLSHNQLQKISCHPIVSFRHLDLSFNDFKALPI
CKEFGNLSQLNFLGLSAMKLQKLDLLPIAHLHLSYILLDLRNYYIKENRTESLQILNAKTLHLVFHPTS
LFAIQVNISVNTLGCLQLTNIKLNDDNCQVFIKFLSELTRGPTLLNFTLNHIETTWKCLVRVFQFLWPK
PVEYLNIYNLTIIESIREEDFTYSKTTLKALTIEHITNQVFLFSQTALYTVFSEMNIMMLTISDTPFIH
MLCPHAPSTFKFLNFTQNVFTDSIFEKCSTLVKLETLILQKNGLKDLFKVGLMTKDMPSLEILDVSWNS
LESGRHKENCTWVESIVVLNLSSMMLTDSVFRCLPPRIKVLDLHSNKIKSVPKQVVKLEALQELNVAFN
SLTDLPGCGSFSSLSVLIIDHNSVSHPSADFFQSCQKMRSIKAGDNPFQCTCELREFVKNIDQVSSEVL
EGWPDSYKCDYPESYRGSPLKDFHMSELSCNITLLIVTIGATMLVLAVTVTSLCIYLDLPWYLRMVCQW
TQTRRRARNIPLEELQRNLQFHAFISYSEHDSAWVKSELVPYLEKEDIQICLHERNFVPGKSIVENIIN
CIEKSYKSIFVLSPNFVQSEWCHYELYFAHHNLFHEGSNNLILILLEPIPQNSIPNKYHKLKALMTQRT
YLQWPKEKSKRGLFWANIRAAFNMKLTLVTENNDVKS
表10:诸如灵长类动物或人等哺乳动物的DNAX Toll样受体10(DTLR10)的核苷酸序列和氨基酸序列(见SEQ ID NO:23和24)。用A表示的第54、103和345位核苷酸的每个都可以是A或G;用G表示的第313位核苷酸可以是G或T;并且用C表示的第316、380、407和408位核苷酸的每个都可以是A、C、G或T。
GCT TCC ACC TGT GCC TGG CCT GGC TTC CCT GGC GGG GGC GGC AAA GTG 48
Ala Ser Thr Cys Ala Trp Pro Gly Phe Pro Gly Gly Gly Gly Lys Val
1 5 10 15
GGC GAA ATG AGG ATG CCC TGC CCT ACG ATG CCT TCG TGG TCT TCG ACA 96
Gly Glu Met Arg Met Pro Cys Pro Thr Mer Pro Ser Trp Ser Ser Thr
20 25 30
AAA CGC AGA GCG CAG TGG CAG ACT GGG TGT ACA ACG AGC TTC GGG GGC 144
Lys Arg Arg Ala Gln Trp Gln Thr Gly Cys Thr Thr Ser Phe Gly Gly
35 40 45
AGC TGG AGG AGT GCC GTG GGC GCT GGG CAC TCC GCC TGT GCC TGG AGG 192
Ser Trp Arg Ser Ala Val Gly Ala Gly His Ser Ala Cys Ala Trp Arg
50 55 60
AAC GCG ACT GGC TGC CTG GCA AAA CCC TCT TTG AGA ACC TGT GGG CCT 240
Asn Ala Thr Gly Cys Leu Ala Lys Pro Ser Leu Arg Thr Cys Gly Pro
65 70 75 80
CGG TCT ATG GCA GCC GCA AGA CGC TGT TTG TGC TGG CCC ACA CGG ACC 288
Arg Ser Met Ala Ala Ala Arg Arg Cys Leu Cys Trp Pro Thr Arg Thr
85 90 95
GGG TCA GTG GTC TCT TGC GCG CCA GTT CTC CTG CTG GCC CAG CAG CGC 336
Gly Ser Val Val Ser Cys Ala Pro Val Leu Leu Leu Ala Gln Gln Arg
100 105 110
CTG CTG GAA GAC CGC AAG GAC GTC GTG GTG CTG GTG ATC CTA ACG CCT 384
Leu Leu Glu Asp Arg Lys Asp Val Val Val Leu Val Ile Leu Thr Pro
115 120 125
GAC GGC CAA GCC TCC CGA CTA CCC GAT GCG CTG ACC AGC GCC TCT GCC 432
Asp Gly Gln Ala Ser Arg Leu Pro Asp Ala Leu Thr Ser Ala Ser Ala
130 135 140
GCC AGA GTG TCC TCC TCT GGC CCC ACC AGC CCA GTG GTC GCG CAG CTT 480
Ala Arg Val Ser Ser Ser Gly Pro Thr Ser Pro Val Val Ala Gln Leu
145 150 155 160
CTG AGG CCA GCA TGC ATG GCC CTG ACC AGG GAC AAC CAC CAC TTC TAT 528
Leu Arg Pro Ala Cys Mer Ala Leu Thr Arg Asp Asn His His Phe Tyr
165 170 175
AAC CGG AAC TTC TGC CAG GGA ACC CAC GGC CGA ATA GCC GTG AGC CGG 576
Asn Arg Asn Phe Cys Gln Gly Thr His Gly Arg Ile Ala Val Ser Arg
180 185 190
AAT CCT GCA CGG TGC CAC CTC CAC ACA CAC CTA ACA TAT GCC TGC CTG 624
Asn Pro Ala Arg Cys His Leu His Thr His Leu Thr Tyr Ala Cys Leu
195 200 205
ATC TGACCAACAC ATGCTCGCCA CCCTCACCAC ACACC 662
Ile
ASTCAWPGFPGGGGKVGEMRMPCPTMPSWSSTKRRAQWQTGCTTSFGGSWRSAVGAGHSACAWRNATGCLAKPSL
RTCGPRSMAAARRCLCWPTRTGSVVSCAPVLLLAQQRLLEDRKDVVVLVILTPDGQASRLPDALTSASAARVSSS
GPTSPVVAQLLRPACMALTRDNHHFYNRNFCQGTHGRIAVSRNPARCHLHTHLTYACLI
诸如人等灵长类动物的DTLR10的附加序列(SEQ ID NO:33和34);用A表示的第854位核苷酸可以是A或T;并且用C表示的第1171和1172位核苷酸的每个都可以是A、C、G或T:
CTG CCT GCT GGC ACC CGG CTC CGG AGG CTG GAT GTC AGC TGC AAC AGC 48
Leu Pro Ala Gly Thr Arg Leu Arg Arg Leu Asp Val Ser Cys Asn Ser
1 5 10 15
ATC AGC TTC GTG GCC CCC GGC TTC TTT TCC AAG GCC AAG GAG CTG CGA 96
Ile Ser Phe Val Ala Pro Gly Phe Phe Ser Lys Ala Lys Glu Leu Arg
20 25 30
GAG CTC AAC CTT AGC GCC AAC GCC CTC AAG ACA GTG GAC CAC TCC TGG 144
Glu Leu Asn Leu Ser Ala Asn Ala Leu Lys Thr Val Asp His Ser Trp
35 40 45
TTT GGG CCC CTG GCG AGT GCC CTG CAA ATA CTA GAT GTA AGC GCC AAC 192
Phe Gly Pro Leu Ala Ser Ala Leu Gln Ile Leu Asp Val Ser Ala Asn
50 55 60
CCT CTG CAC TGC GCC TGT GGG GCG GCC TTT ATG GAC TTC CTG CTG GAG 240
Pro Leu His Cys Ala Cys Gly Ala Ala Phe Met Asp Phe Leu Leu Glu
65 70 75 80
GTG CAG GCT GCC GTG CCC GGT CTG CCC AGC CGG GTG AAG TGT GGC AGT 288
Val Gln Ala Ala Val Pro Gly Leu Pro Ser Arg Val Lys Cys Gly Ser
85 90 95
CCG GGC CAG CTC CAG GGC CTC AGC ATC TTT GCA CAG GAC CTG CGC CTC 336
Pro Gly Gln Leu Gln Gly Leu Ser Ile Phe Ala Gln Asp Leu Arg Leu
100 105 110
TGC CTG GAT GAG GCC CTC TCC TGG GAC TGT TTC GCC CTC TCG CTG CTG 384
Cys Leu Asp Glu Ala Leu Ser Trp Asp Cys Phe Ala Leu Ser Leu Leu
115 120 125
GCT GTG GCT CTG GGC CTG GGT GTG CCC ATG CTG CAT CAC CTC TGT GGC 432
Ala Val Ala Leu Gly Leu Gly Val Pro Met Leu His His Leu Cys Gly
130 135 140
TGG GAC CTC TGG TAC TGC TTC CAC CTG TGC CTG GCC TGG CTT CCC TGG 480
Trp Asp Leu Trp Tyr Cys Phe His Leu Cys Leu Ala Trp Leu Pro Trp
145 150 155 160
CGG GGG CGG CAA AGT GGG CGA GAT GAG GAT GCC CTG CCC TAC GAT GCC 528
Arg Gly Arg Gln Ser Gly Arg Asp Glu Asp Ala Leu Pro Tyr Asp Ala
165 170 175
TTC GTG GTC TTC GAC AAA ACG CAG AGC GCA GTG GCA GAC TGG GTG TAC 576
Phe Val Val Phe Asp Lys Thr Gln Ser Ala Val Ala Asp Trp Val Tyr
180 185 190
AAC GAG CTT CGG GGG CAG CTG GAG GAG TGC CGT GGG CGC TGG GCA CTC 624
Asn Glu Leu Arg Gly Gln Leu Glu Glu Cys Arg Gly Arg Trp Ala Leu
195 200 205
CGC CTG TGC CTG GAG GAA CGC GAC TGG CTG CCT GGC AAA ACC CTC TTT 672
Arg Leu Cys Leu Glu Glu Arg Asp Trp Leu Pro Gly Lys Thr Leu Phe
210 215 220
GAG AAC CTG TGG GCC TCG GTC TAT GGC AGC CGC AAG ACG CTG TTT GTG 720
Glu Asn Leu Trp Ala Ser Val Tyr Gly Ser Arg Lys Thr Leu Phe Val
225 230 235 240
CTG GCC CAC ACG GAC CGG GTC AGT GGT CTC TTG CGC GCC AGC TTC CTG 768
Leu Ala His Thr Asp Arg Val Ser Gly Leu Leu Arg Ala Ser Phe Leu
245 250 255
CTG GCC CAG CAG CGC CTG CTG GAG GAC CGC AAG GAC GTC GTG GTG CTG 816
Leu Ala Gln Gln Arg Leu Leu Glu Asp Arg Lys Asp Val Val Val Leu
260 265 270
GTG ATC CTG AGC CCT GAC GGC CGC CGC TCC CGC TAC GAG CGG CTG CGC 864
Val Ile Leu Ser Pro Asp Gly Arg Arg Ser Arg Tyr Glu Arg Leu Arg
275 280 285
CAG CGC CTC TGC CGC CAG AGT GTC CTC CTC TGG CCC CAC CAG CCC AGT 912
Gln Arg Leu Cys Arg Gln Ser Val Leu Leu Trp Pro His Gln Pro Ser
290 295 300
GGT CAG CGC AGC TTC TGG GCC CAG CTG GGC ATG GCC CTG ACC AGG GAC 960
Gly Gln Arg Ser Phe Trp Ala Gln Leu Gly Met Ala Leu Thr Arg Asp
305 310 315 320
AAC CAC CAC TTC TAT AAC CGG AAC TTC TGC CAG GGA CCC ACG GCC GAA 1008
Asn His His Phe Tyr Asn Arg Asn Phe Cys Gln Gly Pro Thr Ala Glu
325 330 335
TAGCCGTGAG CCGGAATCCT GCACGGTGCC ACCTCCACAC TCACCTCACC TCTGCCTGCC 1068
TGGTCTGACC CTCCCCTGCT CGCCTCCCTC ACCCCACACC TGACACAGAG CAGGCACTCA 1128
ATAAATGCTA CCGAAGGCTA AAAAAAAAAA AAAAAAAAAA AACCA 1173
LPAGTRLRRLDVSCNSISFVAPGFFSKAKELRELNLSANALKTVDHSWFGPLASALQILDVSANPLHCACGAAFM
DFLLEVQAAVPGLPSRVKCGSPGQLQGLSIFAQDLRLCLDEALSWDCFALSLLAVALGLGVPMLHHLCGWDLWYC
FHLCLAWLPWRGRQSGRDEDALPYDAFVVFDKTQSAVADWVYNELRGQLEECRGRWALRLCLEERDWLPGKTLFE
NLWASVYGSRKTLFVLAHTDRVSGLLRASFLLAQQRLLEDRKDVVVLVILSPDGRRSRY.RLRQRLCRQSVLLWP
HQPSGQRSFWAQLGMALTRDNHHFYNRNFCQGPTAE
诸如人等灵长类动物的DTLRl0的其他序列(SEQ ID NO:42和43):
atg ccc atg aag tgg agt ggg tgg agg tgg agc tgg ggg ccg gcc act 48
Met Pro Met Lys Trp Ser Gly Trp Arg Trp Ser Trp Gly Pro Ala Thr
-45 -40 -35
cac aca gcc ctc cca ccc cca cag ggt ttc tgc cgc agc gcc ctg cac 96
His Thr Ala Leu Pro Pro Pro Gln Gly Phe Cys Arg Ser Ala Leu His
-30 -25 -20
ccg ctg tct ctc ctg gtg cag gcc atc atg ctg gcc atg acc ctg gcc 144
Pro Leu Ser Leu Leu Val Gln Ala Ile Met Leu Ala Met Thr Leu Ala
-15 -10 -5 -1
ctg ggt acc ttg cct gcc ttc cta ccc tgt gag ctc cag ccc cac ggc 192
Leu Gly Thr Leu Pro Ala Phe Leu Pro Cys Glu Leu Gln Pro His Gly
1 5 10 15
ctg gtg aac tgc aac tgg ctg ttc ctg aag tct gtg ccc cac ttc tcc 240
Leu Val Asn Cys Asn Trp Leu Phe Leu Lys Ser Val Pro His Phe Ser
20 25 30
atg gca gca ccc cgt ggc aat gtc acc agc ctt tcc ttg tcc tcc aac 288
Met Ala Ala Pro Arg Gly Asn Val Thr Ser Leu Ser Leu Ser Ser Asn
35 40 45
cgc atc cac cac ctc cat gat tct gac ttt gcc cac ctg ccc agc ctg 336
Arg Ile His His Leu His Asp Ser Asp Phe Ala His Leu Pro Ser Leu
50 55 60
cgg cat ctc aac ctc aag tgg aac tgc ccg ccg gtt ggc ctc agc ccc 384
Arg His Leu Asn Leu Lys Trp Asn Cys Pro Pro Val Gly Leu Ser Pro
65 70 75 80
atg cac ttc ccc tgc cac atg acc atc gag ccc agc acc ttc ttg gct 432
Met His Phe Pro Cys His Met Thr Ile Glu Pro Ser Thr Phe Leu Ala
85 90 95
gtg ccc acc ctg gaa gag cta aac ctg agc tac aac aac atc atg act 480
Val Pro Thr Leu Glu Glu Leu Asn Leu Ser Tyr Asn Asn Ile Met Thr
100 105 110
gtg cct gcg ctg ccc aaa tcc ctc ata tcc ctg tcc ctc agc cat acc 528
Val Pro Ala Leu Pro Lys Ser Leu Ile Ser Leu Ser Leu Ser His Thr
115 120 125
aac atc ctg atg cta gac tct gcc agc ctc gcc ggc ctg cat gcc ctg 576
Asn Ile Leu Met Leu Asp Ser Ala Ser Leu Ala Gly Leu His Ala Leu
130 135 140
cgc ttc cta ttc atg gac ggc aac tgt tat tac aag aac ccc tgc agg 624
Arg Phe Leu Phe Met Asp Gly Asn Cys Tyr Tyr Lys Asn Pro Cys Arg
145 150 155 160
cag gca ctg gag gtg gcc ccg ggt gcc ctc ctt ggc ctg ggc aac ctc 672
Gln Ala Leu Glu Val Ala Pro Gly Ala Leu Leu Gly Leu Gly Asn Leu
165 170 175
acc cac ctg tca ctc aag tac aac aac ctc act gtg gtg ccc cgc aac 720
Thr His Leu Ser Leu Lys Tyr Asn Asn Leu Thr Val Val Pro Arg Asn
180 185 190
ctg cct tcc agc ctg gag tat ctg ctg ttg tcc tac aac cgc atc gtc 768
Leu Pro Ser Ser Leu Glu Tyr Leu Leu Leu Ser Tyr Asn Arg Ile Val
195 200 205
aaa ctg gcg cct gag gac ctg gcc aat ctg acc gcc ctg cgt gtg ctc 816
Lys Leu Ala Pro Glu Asp Leu Ala Asn Leu Thr Ala Leu Arg Val Leu
210 215 220
gat gtg ggc gga aat tgc cgc cgc tgc gac cac gct ccc aac ccc tgc 864
Asp Val Gly Gly Asn Cys Arg Arg Cys Asp His Ala Pro Asn Pro Cys
225 230 235 240
atg gag tgc cct cgt cac ttc ccc cag cta cat ccc gat acc ttc agc 912
Met Glu Cys Pro Arg His Phe Pro Gln Leu His Pro Asp Thr Phe Ser
245 250 255
cac ctg agc cgt ctt gaa ggc ctg gtg ttg aag gac agt tct ctc tcc 960
His Leu Ser Arg Leu Glu Gly Leu Val Leu Lys Asp Ser Ser Leu Ser
260 265 270
tgg ctg aat gcc agt tgg ttc cgt ggg ctg gga aac ctc cga gtg ctg 1008
Trp Leu Asn Ala Ser Trp Phe Arg Gly Leu Gly Asn Leu Arg Val Leu
275 280 285
gac ctg agt gag aac ttc ctc tac aaa tgc atc act aaa acc aag gcc 1056
Asp Leu Ser Glu Asn Phe Leu Tyr Lys Cys Ile Thr Lys Thr Lys Ala
290 295 300
ttc cag ggc cta aca cag ctg cgc aag ctt aac ctg tcc ttc aat tac 1104
Phe Gln Gly Leu Thr Gln Leu Arg Lys Leu Asn Leu Ser Phe Asn Tyr
305 310 315 320
caa aag agg gtg tcc ttt gcc cac ctg tct ctg gcc cct tcc ttc ggg 1152
Gln Lys Arg Val Ser Phe Ala His Leu Ser Leu Ala Pro Ser Phe Gly
325 330 335
agc ctg gtc gcc ctg aag gag ctg gac atg cac ggc atc ttc ttc cgc 1200
Ser Leu Val Ala Leu Lys Glu Leu Asp Met His Gly Ile Phe Phe Arg
340 345 350
tca ctc gat gag acc acg ctc cgg cca ctg gcc cgc ctg ccc atg ctc 1248
Ser Leu Asp Glu Thr Thr Leu Arg Pro Leu Ala Arg Leu Pro Met Leu
355 360 365
cag act ctg cgt ctg cag atg aac ttc atc aac cag gcc cag ctc ggc 1296
Gln Thr Leu Arg Leu Gln Met Asn Phe Ile Asn Gln Ala Gln Leu Gly
370 375 380
atc ttc agg gcc ttc cct ggc ctg cgc tac gtg gac ctg tcg gac aac 1344
Ile Phe Arg Ala Phc Pro Gly Leu Arg Tyr Val Asp Leu Ser Asp Asn
385 390 395 400
cgc atc agc gga gct tcg gag ctg aca gcc acc atg ggg gag gca gat 1392
Arg Ile Ser Gly Ala Ser Glu Leu Thr Ala Thr Met Gly Glu Ala Asp
405 410 415
gga ggg gag aag gtc tgg ctg cag cct ggg gac ctt gct ccg gcc cca 1440
Gly Gly Glu Lys Val Trp Leu Gln Pro Gly Asp Leu Ala Pro Ala Pro
420 425 430
gtg gac act ccc agc tct gaa gac ttc agg ccc aac tgc agc acc ctc 1488
Val Asp Thr Pro Ser Ser Glu Asp Phe Arg Pro Asn Cys Ser Thr Leu
435 440 445
aac ttc acc ttg gat ctg tca cgg aac aac ctg gtg acc gtg cag ccg 1536
Asn Phe Thr Leu Asp Leu Ser Arg Asn Asn Leu Val Thr Val Gln Pro
450 455 460
gag atg ttt gcc cag ctc tcg cac ctg cag tgc ctg cgc ctg agc cac 1584
Glu Met Phe Ala Gln Leu Ser His Leu Gln Cys Leu Arg Leu Ser His
465 470 475 480
aac tgc atc tcg cag gca gtc aat ggc tcc cag ttc ctg ccg ctg acc 1632
Asn Cys Ile Ser Gln Ala Val Asn Gly Ser Gln Phe Leu Pro Leu Thr
485 490 495
ggt ctg cag gtg cta gac ctg tcc cac aat aag ctg gac ctc tac cac 1680
Gly Leu Gln Val Leu Asp Leu Ser His Asn Lys Leu Asp Leu Tyr His
500 505 510
gag cac tca ttc acg gag cta cca cga ctg gag gcc ctg gac ctc agc 1728
Glu His Ser Phe Thr Glu Leu Pro Arg Leu Glu Ala Leu Asp Leu Ser
515 520 525
tac aac agc cag ccc ttt ggc atg cag ggc gtg ggc cac aac ttc agc 1776
Tyr Asn Ser Gln Pro Phe Gly Met Gln Gly Val Gly His Asn Phe Ser
530 535 540
ttc gtg gct cac ctg cgc acc ctg cgc cac ctc agc ctg gcc cac aac 1824
Phe Val Ala His Leu Arg Thr Leu Arg His Leu Ser Leu Ala His Asn
545 550 555 560
aac atc cac agc caa gtg tcc cag cag ctc tgc agt acg tcg ctg cgg 1872
Asn Ile His Ser Gln Val Ser Gln Gln Leu Cys Ser Thr Ser Leu Arg
565 570 575
gcc ctg gac ttc agc ggc aat gca ctg ggc cat atg tgg gcc gag gga 1920
Ala Leu Asp Phe Ser Gly Asn Ala Leu Gly His Met Trp Ala Glu Gly
580 585 590
gac ctc tat ctg cac ttc ttc caa ggc ctg agc ggt ttg atc tgg ctg 1968
Asp Leu Tyr Leu His Phe Phe Gln Gly Leu Ser Gly Leu Ile Trp Leu
595 600 605
gac ttg tcc cag aac cgc ctg cac acc ctc ctg ccc caa acc ctg cgc 2016
Asp Leu Ser Gln Asn Arg Leu His Thr Leu Leu Pro Gln Thr Leu Arg
610 615 620
aac ctc ccc aag agc cta cag gtg ctg cgt ctc cgt gac aat tac ctg 2064
Asn Leu Pro Lys Ser Leu Gln Val Leu Arg Leu Arg Asp Asn Tyr Leu
625 630 635 640
gcc ttc ttt aag tgg tgg agc ctc cac ttc ctg ccc aaa ctg gaa gtc 2112
Ala Phe Phe Lys Trp Trp Ser Leu His Phe Leu Pro Lys Leu Glu Val
645 650 655
ctc gac ctg gca gga aac cag ctg aag gcc ctg acc aat ggc agc ctg 2160
Leu Asp Leu Ala Gly Asn Gln Leu Lys Ala Leu Thr Asn Gly Ser Leu
660 665 670
cct gct ggc acc cgg ctc cgg agg ctg gat gtc agc tgc aac agc atc 2208
Pro Ala Gly Thr Arg Leu Arg Arg Leu Asp Val Ser Cys Asn Ser Ile
675 680 685
agc ttc gtg gcc ccc ggc ttc ttt tcc aag gcc aag gag ctg cga gag 2256
Ser Phe Val Ala Pro Gly Phe Phe Ser Lys Ala Lys Glu Leu Arg Glu
690 695 700
ctc aac ctt agc gcc aac gcc ctc aag aca gtg gac cac tcc tgg ttt 2304
Leu Asn Leu Ser Ala Asn Ala Leu Lys Thr Val Asp His Ser Trp Phe
705 710 715 720
ggg ccc ctg gcg agt gcc ctg caa ata cta gat gta agc gcc aac cct 2352
Gly Pro Leu Ala Ser Ala Leu Gln Ile Leu Asp Val Ser Ala Asn Pro
725 730 735
ctg cac tgc gcc tgt ggg gcg gcc ttt atg gac ttc ctg ctg gag gtg 2400
Leu His Cys Ala Cys Gly Ala Ala Phe Met Asp Phe Leu Leu Glu Val
740 745 750
cag gct gcc gtg ccc ggt ctg ccc agc cgg gtg aag tgt ggc agt ccg 2448
Gln Ala Ala Val Pro Gly Leu Pro Ser Arg Val Lys Cys Gly Ser Pro
755 760 765
ggc cag ctc cag ggc ctc agc atc ttt gca cag gac ctg cgc ctc tgc 2496
Gly Gln Leu Gln Gly Leu Ser Ile Phe Ala Gln Asp Leu Arg Leu Cys
770 775 780
ctg gat gag gcc ctc tcc tgg gac tgt ttc gcc ctc tcg ctg ctg gct 2544
Leu Asp Glu Ala Leu Ser Trp Asp Cys Phe Ala Leu Ser Leu Leu Ala
785 790 795 800
gtg gct ctg ggc ctg ggt gtg ccc atg ctg cat cac ctc tgt ggc tgg 2592
Val Ala Leu Gly Leu Gly Val Pro Met Leu His His Leu Cys Gly Trp
805 810 815
gac ctc tgg tac tgc ttc cac ctg tgc ctg gcc tgg ctt ccc tgg cgg 2640
Asp Leu Trp Tyr Cys Phe His Leu Cys Leu Ala Trp Leu Pro Trp Arg
820 825 830
ggg cgg caa agt ggg cga gat gag gat gcc ctg ccc tac gat gcc ttc 2688
Gly Arg Gln Ser Gly Arg Asp Glu Asp Ala Leu Pro Tyr Asp Ala Phe
835 840 845
gtg gtc ttc gac aaa acg cag agc gca gtg gca gac tgg gtg tac aac 2736
Val Val Phe Asp Lys Thr Gln Ser Ala Val Ala Asp Trp Val Tyr Asn
850 855 860
gag ctt cgg ggg cag ctg gag gag tgc cgt ggg cgc tgg gca ctc cgc 2784
Glu Leu Arg Gly Gln Leu Glu Glu Cys Arg Gly Arg Trp Ala Leu Arg
865 870 875 880
ctg tgc ctg gag gaa cgc gac tgg ctg cct ggc aaa acc ctc ttt gag 2832
Leu Cys Leu Glu Glu Arg Asp Trp Leu Pro Gly Lys Thr Leu Phe Glu
885 890 895
aac ctg tgg gcc tcg gtc tat ggc agc cgc aag acg ctg ttt gtg ctg 2880
Asn Leu Trp Ala Ser Val Tyr Gly Ser Arg Lys Thr Leu Phe Val Leu
900 905 910
gcc cac acg gac cgg gtc agt ggt ctc ttg cgc gcc agc ttc ctg ctg 2928
Ala His Thr Asp Arg Val Ser Gly Leu Leu Arg Ala Ser Phe Leu Leu
915 920 925
gcc cag cag cgc ctg ctg gag gac cgc aag gac gtc gtg gtg ctg gtg 2976
Ala Gln Gln Arg Leu Leu Glu Asp Arg Lys Asp Val Val Val Leu Val
930 935 940
atc ctg agc cct gac ggc cgc cgc tcc cgc tat gtg cgg ctg cgc cag 3024
Ile Leu Ser Pro Asp Gly Arg Arg Ser Arg Tyr Val Arg Leu Arg Gln
945 950 955 960
cgc ctc tgc cgc cag agt gtc ctc ctc tgg ccc cac cag ccc agt ggt 3072
Arg Leu Cys Arg Gln Ser Val Leu Leu Trp Pro His Gln Pro Ser Gly
965 970 975
cag cgc agc ttc tgg gcc cag ctg ggc atg gcc ctg acc agg gac aac 3120
Gln Arg Ser Phe Trp Ala Gln Leu Gly Met Ala Leu Thr Arg Asp Asn
980 985 990
cac cac ttc tat aac cgg aac ttc tgc cag gga ccc acg gcc gaa tag 3168
His His Phe Tyr Asn Arg Asn Phe Cys Gln Gly Pro Thr Ala Glu
995 1000 1005
MPMKWSGWRWSWGPATHTALPPPQGFCRSALHPLSLLVQAIMLAMTLALGTLPAFLPCELQPHGLVNCN
WLFLKSVPHFSMAAPRGNVTSLSLSSNRIHHLHDSDFAHLPSLRHLNLKWNCPPVGLSPMHFPCHMTIE
PSTFLAVPTLEELNLSYNNIMTVPALPKSLISLSLSHTNILMLDSASLAGLHALRFLFMDGNCYYKNPC
RQALEVAPGALLGLGNLTHLSLKYNNLTVVPRNLPSSLEYLLLSYNRIVKLAPEDLANLTALRVLDVGG
NCRRCDHAPNPCMECPRHFPQLHPDTFSHLSRLEGLVLKDSSLSWLNASWFRGLGNLRVLDLSENFLYK
CITKTKAFQGLTQLRKLNLSFNYQKRVSFAHLSLAPSFGSLVALKELDMHGIFFRSLDETTLRPLARLP
MLQTLRLQMNFINQAQLGIFRAFPGLRYVDLSDNRISGASELTATMGEADGGEKVWLQPGDLAPAPVDT
PSSEDFRPNCSTLNFTLDLSRNNLVTVQPEMFAQLSHLQCLRLSHNCISQAVNGSQFLPLTGLQVLDLS
HNKLDLYHEHSFTELPRLEALDLSYNSQPFGMQGVGHNFSFVAHLRTLRHLSLAHNNIHSQVSQQLCST
SLRALDFSGNALGHMWAEGDLYLHFFQGLSGLIWLDLSQNRLHTLLPQTLRNLPKSLQVLRLRDNYLAF
FKWWSLHFLPKLEVLDLAGNQLKALTNGSLPAGTRLRRLDVSCNSISFVAPGFFSKAKELRELNLSANA
LKTVDHSWFGPLASALQILDVSANPLHCACGAAFMDFLLEVQAAVPGLPSRVKCGSPGQLQGLSIFAQD
LRLCLDEALSWDCFALSLLAVALGLGVPMLHHLCGWDLWYCFHLCLAWLPWRGRQSGRDEDALPYDAFV
VFDKTQSAVADWVYNELRGQLEECRGRWALRLCLEERDWLPGKTLFENLWASVYGSRKTLFVLAHTDRV
SGLLRASFLLAQQRLLEDRKDVVVLVILSPDGRRSRYVRLRQRLCRQSVLLWPHQPSGQRSFWAQLGMA
LTRDNHHFYNRNFCQGPTAE
诸如小鼠等啮齿动物的DTLR10的部分核苷酸序列(SEQ ID NO:35):
TGGCCCACAC GGACCGCGTC AGTGGCCTCC TGCGCACCAG CTTCCTGCTG GCTCAGCAGC 60
GCCTGTTGGA AGACCGCAAG GACGTGGTGG TGTTGGTGAT CCTGCGTCCG GATGCCCCAC 120
CGTCCCGCTA TGTGCGACTG CGCCAGCGTC TCTGCCGCCA GAGTGTGCTC TTCTGGCCCC 180
AGCGACCCAA CGGGCAGGGG GGCTTCTGGG CCCAGCTGAG TACAGCCCTG ACTAGGGACA 240
ACCGCCACTT CTATAACCAG AACTTCTGCC GGGGACCTAC AGCAGAATAG CTCAGAGCAA 300
CAGCTGGAAA CAGCTGCATC TTCATGTCTG GTTCCCGAGT TGCTCTGCCT GCCTTGCTCT 360
GTCTTACTAC ACCGCTATTT GGCAAGTGCG CAATATATGC TACCAAGCCA CCAGGCCCAC 420
GGAGCAAAGG TTGGCTGTAA AGGGTAGTTT TCTTCCCATG CATCTTTCAG GAGAGTGAAG 480
ATAGACACCA AACCCAC 497
诸如小鼠等啮齿动物的DTLR10的其他序列(SEQ ID NO:44和45):
aac ctg tcc ttc aat tac cgc aag aag gta tcc ttt gcc cgc ctc cac 48
Asn Leu Ser Phe Asn Tyr Arg Lys Lys Val Ser Phe Ala Arg Leu His
1 5 10 15
ctg gca agt tcc ttt aag aac ctg gtg tca ctg cag gag ctg aac atg 96
Leu Ala Ser Ser Phe Lys Asn Leu Val Ser Leu Gln Glu Leu Asn Met
20 25 30
aac ggc atc ttc ttc cgc ttg ctc aac aag tac acg ctc aga tgg ctg 144
Asn Gly Ile Phe Phe Arg Leu Leu Asn Lys Tyr Thr Leu Arg Trp Leu
35 40 45
gcc gat ctg ccc aaa ctc cac act ctg cat ctt caa atg aac ttc atc 192
Ala Asp Leu Pro Lys Leu His Thr Leu His Leu Gln Met Asn Phe Ile
50 55 60
aac cag gca cag ctc agc atc ttt ggt acc ttc cga gcc ctt cgc ttt 240
Asn Gln Ala Gln Leu Ser Ile Phe Gly Thr Phe Arg Ala Leu Arg Phe
65 70 75 80
gtg gac ttg tca gac aat cgc atc agt ggg cct tca acg ctg tca gaa 288
Val Asp Leu Ser Asp Asn Arg Ile Ser Gly Pro Ser Thr Leu Ser Glu
85 90 95
gcc acc cct gaa gag gca gat gat gca gag cag gag gag ctg ttg tct 336
Ala Thr Pro Glu Glu Ala Asp Asp Ala Glu Gln Glu Glu Leu Leu Ser
100 105 110
gcg gat cct cac cca gct ccg ctg agc acc cct gct tct aag aac ttc 384
Ala Asp Pro His Pro Ala Pro Leu Ser Thr Pro Ala Ser Lys Asn Phe
115 120 125
atg gac agg tgt aag aac ttc aag ttc aac atg gac ctg tct cgg aac 432
Met Asp Arg Cys Lys Asn Phe Lys Phe Asn Met Asp Leu Ser Arg Asn
130 135 140
aac ctg gtg act atc aca gca gag atg ttt gta aat ctc tca cgc ctc 480
Asn Leu Val Thr Ile Thr Ala Glu Met Phe Val Asn Leu Ser Arg Leu
145 150 155 160
cag tgt ctt agc ctg agc cac aac tca att gca cag gct gtc aat ggc 528
Gln Cys Leu Ser Leu Ser His Asn Ser Ile Ala Gln Ala Val Asn Gly
165 170 175
tct cag ttc ctg ccg ctg acc ggt ctg cag gtg cta gac ctg tcc cac 576
Ser Gln Phe Leu Pro Leu Thr Gly Leu Gln Val Leu Asp Leu Ser His
180 185 190
aat aag ctg gac ctc tac cac gag cac tca ttc acg gag cta cca cga 624
Asn Lys Leu Asp Leu Tyr His Glu His Ser Phe Thr Glu Leu Pro Arg
195 200 205
ctg gag gcc ctg gac ctc agc tac aac agc cag ccc ttt agc atg aag 672
Leu Glu Ala Leu Asp Leu Ser Tyr Asn Ser Gln Pro Phe Ser Met Lys
210 215 220
ggt ata ggc cac aat ttc agt ttt gtg acc cat ctg tcc atg cta cag 720
Gly Ile Gly His Asn Phe Ser Phe Val Thr His Leu Ser Met Leu Gln
225 230 235 240
agc ctt agc ctg gca cac aat gac att cat acc cgt gtg tcc tca cat 768
Ser Leu Ser Leu Ala His Asn Asp Ile His Thr Arg Val Ser Ser His
245 250 255
ctc aac agc aac tca gtg agg ttt ctt gac ttc agc ggc aac ggt atg 816
Leu Asn Ser Asn Ser Val Arg Phe Leu Asp Phe Ser Gly Asn Gly Met
260 265 270
ggc cgc atg tgg gat gag ggg ggc ctt tat ctc cat ttc ttc caa ggc 864
Gly Arg Met Trp Asp Glu Gly Gly Leu Tyr Leu His Phe Phe Gln Gly
275 280 285
ctg agt ggc gtg ctg aag ctg gac ctg tct caa aat aac ctg cat atc 912
Leu Ser Gly Val Leu Lys Leu Asp Leu Ser Gln Asn Asn Leu His Ile
290 295 300
ctc cgg ccc cag aac ctt gac aac ctc ccc aag agc ctg aag ctg ctg 960
Leu Arg Pro Gln Asn Leu Asp Asn Leu Pro Lys Ser Leu Lys Leu Leu
305 310 315 320
agc ctc cga gac aac tac cta tct ttc ttt aac tgg acc agt ctg tcc 1008
Ser Leu Arg Asp Asn Tyr Leu Ser Phe Phe Asn Trp Thr Ser Leu Ser
325 330 335
ttc cta ccc aac ctg gaa gtc cta gac ctg gca ggc aac cag cta aag 1056
Phe Leu Pro Asn Leu Glu Val Leu Asp Leu Ala Gly Asn Gln Leu Lys
340 345 350
gcc ctg acc aat ggc acc ctg cct aat ggc acc ctc ctc cag aaa ctc 1104
Ala Leu Thr Asn Gly Thr Leu Pro Asn Gly Thr Leu Leu Gln Lys Leu
355 360 365
gat gtc agt agc aac agt atc gtc tct gtg gcc ccc ggc ttc ttt tcc 1152
Asp Val Ser Ser Asn Ser Ile Val Ser Val Ala Pro Gly Phe Phe Ser
370 375 380
aag gcc aag gag ctg cga gag ctc aac ctt agc gcc aac gcc ctc aag 1200
Lys Ala Lys Glu Leu Arg Glu Leu Asn Leu Ser Ala Asn Ala Leu Lys
385 390 395 400
aca gtg gac cac tcc tgg ttt ggg ccc att gtg atg aac ctg aca gtt 1248
Thr Val Asp His Ser Trp Phe Gly Pro Ile Val Met Asn Leu Thr Val
405 410 415
cta gac gtg aga agc aac cct ctg cac tgt gcc tgt ggg gca gcc ttc 1296
Leu Asp Val Arg Ser Asn Pro Leu His Cys Ala Cys Gly Ala Ala Phe
420 425 430
gta gac tta ctg ttg gag gtg cag acc aag gtg cct ggc ctg gct aat 1344
Val Asp Leu Leu Leu Glu Val Gln Thr Lys Val Pro Gly Leu Ala Asn
435 440 445
ggt gtg aag tgt ggc agc ccc ggc cag ctg cag ggc cgt agc atc ttc 1392
Gly Val Lys Cys Gly Ser Pro Gly Gln Leu Gln Gly Arg Ser Ile Phe
450 455 460
gcg cag gac ctg cgg ctg tgc ctg gat gag gtc ctc tct tgg gac tgc 1440
Ala Gln Asp Leu Arg Leu Cys Leu Asp Glu Val Leu Ser Trp Asp Cys
465 470 475 480
ttt ggc ctt tca ctc ttg gct gtg gcc gtg ggc atg gtg gtg cct ata 1488
Phe Gly Leu Set Leu Leu Ala Val Ala Val Gly Met Val Val Pro Ile
485 490 495
ctg cac cat ctc tgc ggc tgg gac gtc tgg tac tgt ttt cat ctg tgc 1536
Leu His His Leu Cys Gly Trp Asp Val Trp Tyr Cys Phe His Leu Cys
500 505 510
ctg gca tgg cta cct ttg cta gcc cgc agc cga cgc agc gcc caa act 1584
Leu Ala Trp Leu Pro Leu Leu Ala Arg Ser Arg Arg Ser Ala Gln Thr
515 520 525
ctc cct tat gat gcc ttc gtg gtg ttc gat aag gca cag agc gca gtt 1632
Leu Pro Tyr Asp Ala Phe Val Val Phe Asp Lys Ala Gln Ser Ala Val
530 535 540
gcc gac tgg gtg tat aac gag ctg cgg gtg cgg ctg gag gag cgg cgc 1680
Ala Asp Trp Val Tyr Asn Glu Leu Arg Val Arg Leu Glu Glu Arg Arg
545 550 555 560
ggc cgc tgg gca ctc cgc ctg tgc ctg gag gac cga gat tgg ctg cct 1728
Gly Arg Trp Ala Leu Arg Leu Cys Leu Glu Asp Arg Asp Trp Leu Pro
565 570 575
ggc cag acg ctc ttc gag aac ctc tgg gct tcc atc tat ggg agc cgc 1776
Gly Gln Thr Leu Phe Glu Asn Leu Trp Ala Ser Ile Tyr Gly Ser Arg
580 585 590
aag act cta ttt gtg ctg gcc cac acg gac cgc gtc agt ggc ctc ctg 1824
Lys Thr Leu Phe Val Leu Ala His Thr Asp Arg Val Ser Gly Leu Leu
595 600 605
cgc acc agc ttc ctg ctg gct cag cag cgc ctg ttg gaa gac cgc aag 1872
Arg Thr Ser Phe Leu Leu Ala Gln Gln Arg Leu Leu Glu Asp Arg Lys
610 615 620
gac gtg gtg gtg ttg gtg atc ctg cgt ccg gat gcc cac cgc tcc cgc 1920
Asp Val Val Val Leu Val Ile Leu Arg Pro Asp Ala His Arg Ser Arg
625 630 635 640
tat gtg cga ctg cgc cag cgt ctc tgc cgc cag agt gtg ctc ttc tgg 1968
Tyr Val Arg Leu Arg Gln Arg Leu Cys Arg Gln Ser Val Leu Phe Trp
645 650 655
ccc cag cag ccc aac ggg cag ggg ggc ttc tgg gcc cag ctg agt aca 2016
Pro Gln Gln Pro Asn Gly Gln Gly Gly Phe Trp Ala Gln Leu Ser Thr
660 665 670
gcc ctg act agg gac aac cgc cac ttc tat aac cag aac ttc tgc cgg 2064
Ala Leu Thr Arg Asp Asn Arg His Phe Tyr Asn Gln Asn Phe Cys Arg
675 680 685
gga cct aca gca gaa tagctcagag caacagctgg aaacagctgc atcttcatgt 2119
Gly Pro Thr Ala Glu
690
ctggttcccg agttgctctg cctgccttgc tctgtcttac tacaccgcta tttggcaagt 2179
gcgcaatata tgctaccaag ccaccaggcc cacggagcaa aggttggctg taaagggtag 2239
ttttcttccc atgcatcttt caggagagtg aagatagaca ccaaacccac 2289
NLSFNYRKKVSFARLHLASSFKNLVSLQELNMNGIFFRLLNKYTLRWLADLPKLHTLHLQMNFINQAQL
SIFGTFRALRFVDLSDNRISGPSTLSEATPEEADDAEQEELLSADPHPAPLSTPASKNFMDRCKNFKFN
MDLSRNNLVTITAEMFVNLSRLQCLSLSHNSIAQAVNGSQFLPLTGLQVLDLSHNKLDLYHEHSFTELP
RLEALDLSYNSQPFSMKGIGHNFSFVTHLSMLQSLSLAHNDIHTRVSSHLNSNSVRFLDFSGNGMGRMW
DEGGLYLHFFQGLSGVLKLDLSQNNLHILRPQNLDNLPKSLKLLSLRDNYLSFFNWTSLSFLPNLEVLD
LAGNQLKALTNGTLPNGTLLQKLDVSSNSIVSVAPGFFSKAKELRELNLSANALKTVDHSWFGPIVMNL
TVLDVRSNPLHCACGAAFVDLLLEVQTKVPGLANGVKCGSPGQLQGRSIFAQDLRLCLDEVLSWDCFGL
SLLAVAVGMVVPILHHLCGWDVWYCFHLCLAWLPLLARSRRSAQTLPYDAFVVFDKAQSAVADWVYNEL
RVRLEERRGRWALRLCLEDRDWLPGQTLFENLWASIYGSRKTLFVLAHTDRVSGLLRTSFLLAQQRLLE
DRKDVVVLVILRPDAHRSRYVRLRQRLCRQSVLFWPQQPNGQGGFWAQLSTALTRDNRHFYNQNFCRGP
TAE
表11:人DTLRs胞内结构域的比较。DTLR1为SEQ ID NO:2;DTLR2为SEQ ID NO:4;DTLR3为SEQ ID NO:6;DTLR4为SEQ ID NO:8;DTLR5为SEQ ID NO:10;DTLR6为SEQ ID NO:12。在DTLRs中特别重要并且保守的残基,如特征性残基,对应于SEQ ID NO:18的tyr10-tyr13残基;trp26残基;cys46残基;trp52残基;pro54-gly55残基;ser69残基;lys71残基;trp134-pro135残基;以及phe144-trp145残基。
DTLR1 QRNLQFHAFISYSGHD---SFWVKNELLPNLEKEG-----MQICLHERNF
DTLR9 KENLQFHAFISYSEHD---SAWVKSELVPYLEKED-----IQICLHERNF
DTLR8 ------------------------NELIPNLEKEDGS---ILICLYESYF
DTLR2 SRNICYDAFVSYSERD---AYWVENLMVQELENFNPP---FKLCLHKRDF
DTLR6 SPDCCYDAFIVYDTKDPAVTEWVLAELVAKLEDPREK--HFNLCLEERDW
DTLR7 TSQTFYDAYISYDTKDASVTDWVINELRYHLEESRDK--NVLLCLEERDW
DTLR10 EDALPYDAFVVFDKTXSAVADWVYNELRGQLEECRGRW-ALRLCLEERDW
DTLR4 RGENIYDAFVIYSSQD---EDWVRNELVKNLEEGVPP---FQLCLHYRDF
DTLR5 PDMYKYDAYLCFSSKD---FTWVQNALLKHLDTQYSDQNRFNLCFEERDF
DTLR3 TEQFEYAAYIIHAYKD---KDWVWEHFSSMEKEDQS----LKFCLEERDF
: . . :*: :
DTLR1 VPGKSIVENIITC-IEKSYKSIFVLSPNFVQSEWCH-YELYFAHHNLFHE
DTLR9 VPGKSIVENIINC-IEKSYKSIFVLSPNFVQSEWCH-YELYFAHHNLFHE
DTLR8 DPGKSISENIVSF-IEKSYKSIFVLSPNFVQNEWCH-YEFYFAHHNLFHE
DTLR2 IPGKWIIDNIIDS-IEKSHKTVFVLSENFVKSEWCK-YELDFSHFRLFEE
DTLR6 LPGQPVLENLSQS-IQLSKKTVFVMTDKYAKTENFK-IAFYLSHQRLMDE
DTLR7 DPGLAIIDNLMQS-INQSKKTVFVLTKKYAKSWNFK-TAFYLXLQRLMGE
DTLR10 LPGKTLFENLWAS-VYGSRKTLFVLAHTDRVSGLLR-AIFLLAQQRLLE-
DTLR4 IPGVAIAANIIHEGFHKSRKVIVVVSQHFIQSRWCI-FEYEIAQTWQFLS
DTLR5 VPGENRIANIQDA-IWNSRKIVCLVSRHFLRDGWCL-EAFSYAQGRCLSD
DTLR3 EAGVFELEAIVNS-IKRSRKIIFVITHHLLKDPLCKRFKVHHAVQQAIEQ
.* : . * * : ::: :
DTLR1 GSNSLILILLEPIPQYSIPSSYHKLKSLMARRTYLEWPKEKSKRGLFWAN
DTLR9 GSNNLILILLEPIPQNSIPNKYHKLKALMTQRTYLQWPKEKSKRGLFWA-
DTLR8 NSDHIILILLEPIPFYCIPTRYHKLEALLEKKAYLEWPKDRRKCGLFWAN
DTLR2 NNDAAILILLEPIEKKAIPQRFCKLRKIMNTKTYLEWPMDEAQREGFWVN
DTLR6 KVDVIILIFLEKPFQK---SKFLQLRKRLCGSSVLEWPTNPQAHPYFWQC
DTLR7 NMDVIIFILLEPVLQH---SPYLRLRQRICKSSILQWPDNPKAERLFWQT
DTLR10 --------------------------------------------------
DTLR4 SRAGIIFIVLQKVEKT-LLRQQVELYRLLSRNTYLEWEDSVLGRHIFWRR
DTLR5 LNSALIMVVVGSLSQY-QLMKHQSIRGFVQKQQYLRWPEDLQDVGWFLHK
DTLR3 NLDSIILVFLEEIPDYKLNHALCLRRGMFKSHCILNWPVQKERIGAFRHK
DTLR1 LRAAINIKLTEQAKK--------------------------
DTLR9 -----------------------------------------
DTLR8 LRAAVNVNVLATREMYELQTFTELNEESRGSTISLMRTDCL
DTLR2 LRAAIKS----------------------------------
DTLR6 LKNALATDNHVAYSQVFKETV--------------------
DTLR7 LXNVVLTENDSRYNNMYVDSIKQY-----------------
DTLR10 -----------------------------------------
DTLR4 LRKALLDGKSWNPEGTVGTGCNWQEATSI------------
DTLR5 LSQQILKKEKEKKKDNNIPLQTVATIS--------------
DTLR3 LQVALGSKNSVH-----------------------------
跨膜区约相当于灵长类动物DTLR7 SEQ ID NO:37的802-818(791-823);DTLR8 SEQ ID NO:39的559-575(550-586);DTLR9SEQ ID NO:41的553-569(549-582);DTLR10 SEQ ID NO:43的796-810(790-814);以及DTLR10 SEQ ID NO:45的481-497(475-503)。
文中使用的术语DNAX Toll样受体2(DTLR2)是指包含一种具有或含有表2所示氨基酸序列的蛋白或肽段的蛋白质或其实质性片段。类似的关系也适用于DTLR3和表3;DTLR4和表4;DTLR5和表5;DTLR6和表6;DTLR7和表7;DTLR8和表8;DTLR9和表9;以及DTLR10和表10。诸如小鼠等啮齿动物的DTLR11序列是由,例如,EST AA739083提供;DTLR13序列是由EST AI019567提供;DTLR14序列是由ESTsAI390330和AA244663提供。
本发明还包括已提供序列的各种DTLR等位体的蛋白变体,如突变型蛋白激动剂或拮抗剂。这些激动剂或拮抗剂的序列差异一般约小于10%,因而常具有1-11倍(fold)取代,如2倍、3倍、5倍、7倍和其他倍取代。此外还包括所述蛋白的等位变体和其他变体,如天然多形性变体。该变体通常能够与其相应生物学受体高亲和力结合,如至少约100nM的亲和力,通常是高于约30nM的亲和力,优选的是高于约10nM的亲和力,更优选的是高于约3nM的亲和力。文中使用的该术语还涉及与所述哺乳动物蛋白相关的天然形式,例如其等位体、多形性变体和代谢变体。
本发明还包括一些具有与表2所示氨基酸序列基本相同的氨基酸序列的蛋白或肽。包括带有较少取代的序列变体,例如,优选的是少于约3-5个取代。类似的特性也适用于表3、4、5、6、7、8、9或10中提供的其他DTLR序列。
实质性多肽“片段”或“肽段”是指至少含有约8个氨基酸的氨基酸残基段,一般是至少10个氨基酸,更一般的是至少12个氨基酸,通常是至少14个氨基酸,更多的是至少16个氨基酸,典型的是至少18个氨基酸,更典型的是至少20个氨基酸,常见的是至少22个氨基酸,更常见的是至少24个氨基酸,优选的是至少26个氨基酸,更优选的是至少28个氨基酸,而在特别优选实施方案中是至少约30个或更多个氨基酸。不同蛋白的片段可以在适当长度的序列段上彼此进行序列比较。
氨基酸序列同源性或序列同一性的测定方法是优化残基匹配,如果必要,还可以在需要的地方引入空位。可参考,例如,Needleham,et a1.,(1970)
J.Mol.Biol.48:443-453;Sankoff,et al.,(1983)《
时间扭曲、信息串编辑和大分子:序列比较的理论与实践》第一章,Addison-Wesley,Reading,MA;和IntelliGenetics,Mountain View,CA的软件包;威斯康星大学遗传计算机组(GCG),Madison,WI;以及NCBI(NIH);其内容均在此引入作为参考。如果保守取代也被认为是匹配,则测定方法会有所改变。保守取代通常包括在下述分组内部进行的取代:甘氨酸,丙氨酸;缬氨酸,异亮氨酸,亮氨酸;天冬氨酸,谷氨酸;天冬酰胺,谷氨酰胺;丝氨酸,苏氨酸;赖氨酸,精氨酸;和苯丙氨酸,酪氨酸。同源氨基酸序列应包括细胞因子序列的天然等位变体和种间变体。典型的同源蛋白或同源肽应该与表2、3、4、5、6、7、8、9或10所示氨基酸序列片段具有50-100%(若引入空位)至60-100%(若包括保守取代)的同源性。同源性测定值应至少约为70%,一般是至少76%,更一般的是至少81%,通常是至少85%,更多的是至少88%,典型的是至少90%,更典型的是至少92%,常见的是至少94%,更常见的是至少95%,优选的是至少96%,更优选的是至少97%,而在特别优选实施方案中是至少98%或更高。同源性程度会随比较片段的长度而有所改变。诸如等位变体等同源蛋白或同源肽应具有表2、3、4、5、6、7、8、9或10所述实施方案的大多数生物活性。通过氨基酸序列比较或核苷酸序列比较而获得的特别引人注意的区域是图2A-2B中标明的区块1-10的内部区域或块内区。
文中使用的术语“生物活性”是指通过相应配体对炎症反应、先天免疫和/或形态发生过程产生的影响,但不局限于此。举例来说,这些受体应能够象IL-1受体那样调节磷酸酶或磷酸化酶的活性,这些活性可通过标准方法来方便地测量。可参考,例如,Hardie,et al.(1995年版)《
蛋白激酶手册》第I和第II卷,Academic Press,SanDiego,CA;Hanks,et al.(1991)
Meth.Enzymol.200:38-62;Hunter,et al.(1992)Cell 70:375-388;Lewin(1990)
Cell61:743-752;Pines,et al.(1991)
Cold Spring Harbor Symp.Quant. Biol.56:449-463;以及Parker.et al.(1993)
Nature363:736-738。通过对配体结合的调节,这些受体可表现出非常类似于可调节酶的生物活性。但其周转率更接近于酶,而不是受体复合物。此外,诱导这种酶学活性所需占用的受体数量要少于大多数受体系统,其所需数量更接近于每个细胞数十个受体,相比之下,大多数受体都需要以每个细胞数千个受体的数量来进行触发。这些受体或其部分可作为磷酸标记酶,用来标记常规底物或特定底物。
诸如DTLR等受体的配体、激动剂、拮抗剂和类似物包括那些可以调节对Toll配体样蛋白的特征性细胞应答的分子,当该受体,例如,是一种天然受体或一种抗体时,则还包括那些具有配体-受体相互作用中的其他标准的结合竞争结构特性的分子。这些细胞应答的调节可能是通过不同Toll配体与细胞受体的结合来进行,这些细胞受体与I型或II型IL-1受体有关,但可能截然不同。可参考,例如,Belvinand Anderson(1996)
Ann.Rev.Cell Dev.Biol.12:393-416;Morisato and Anderson(1995)
Ann.Rev.Genetics 29:371-3991和Hultmark(1994)
Nature 367:116-117。
与此类似,配体可以是能够与所述受体或其类似物结合的天然配体分子,也可以是天然配体的功能类似物。这种功能类似物可以是结构被改变的配体,也可以是完全不相关但其分子形状能够与适当配体结合决定簇相互作用的分子。这些配体可作为激动剂,也可作为拮抗剂。可参考,例如,Goodman,et al.(1990年版)《
Goodman & Gilman’s:治疗的药学基础》,Pergamon Press,New York。
还可通过受体或抗体以及其他效应剂或配体的分子结构研究来进行合理的药物设计。效应剂可以是调节随配体结合而产生的其他功能的其他蛋白,也可以是通常情况下与受体相互作用的其他蛋白。测定哪些位点与其他特定蛋白相互作用的一种方法是进行物理结构测定,如X-射线晶体学或二维NMR技术。这些测定方法可以提供线索,以了解分子接触区由哪些氨基酸残基构成。有关蛋白质结构测定的详细描述,可参考,例如,Blundell and Johnson(1976)《
蛋白质晶 体学》Academic Press,New York,其内容在此引入作为参考。
II.活性
Toll样受体蛋白具有多种不同的生物活性,例如在磷酸代谢中能够将其添加至特定底物,也能够将其去除,这些底物一般为蛋白质。其结果通常是对炎症功能、其他先天性免疫应答或形态学作用进行调节。DTLR2、3、4、5、6、7、8、9或10蛋白与其他Toll样受体蛋白同源,但又各自存在结构上的差异。例如,人DTLR2基因编码序列可能与小鼠DTLR2的核苷酸编码序列具有约70%的同一性。在氨基酸水平上可能也具有适当的同一性。
DTLRs的生物活性涉及在底物上添加或去除磷酸基团,其方式通常具有特异性,但有时也以非特异性方式来进行。底物鉴定或酶活性条件的测定可采用标准方法,有关这些方法的描述,可参考,例如,Hardie,et al.(1995年版)《
蛋白激酶手册》第I和第II卷,AcademicPress,San Diego,CA;Hanks,et al.1991)
Meth.Enzymol.200:38-62;Hunter,et al.(1992)Cell 70:375-388;Lewin(1990)Cell 61:743-752;Pines,et al.(1991)
Cold Spring Harbor Symp. Quant.Biol.56:449-463;以及Parker,et al.(1993)
Nature363:736-738。
III.核酸
本发明涉及分离的核酸或核酸片段的应用,这些核酸或核酸片段可以编码,例如,这些蛋白或其密切相关蛋白,也编码其片段,如编码相应的多肽,优选的是具有生物活性的多肽。此外,本发明还包括分离的或重组的DNA,该DNA编码那些单独具有或整体上具有各DTLRs的特征性序列的蛋白或多肽。该核酸通常能够在适当条件下与表2-10的一种核酸序列段杂交,但优选的是不与表1所示的相应片段杂交。上述生物活性蛋白或多肽可以是全长蛋白,也可以是片段,并且通常含有一段与表2-10所示序列高度同源的氨基酸序列。此外,本发明还包括分离的或重组的核酸或其片段的应用,这些核酸或其片段编码一些含有与DTLR2-10蛋白相同的片段的蛋白。这些分离的核酸可以在5’和3’侧翼区含有相应的调控序列,如启动子、增强子、poly-A附加信号和来源于天然基因的其他调控序列。
“分离的”核酸是指RNA、DNA或混聚物核酸基本上被纯化,例如与源物种中自然伴随天然序列的其他组分相分离,如核糖体、聚合酶和基因组侧翼序列。该术语包括与天然环境脱离的核酸序列,还包括经重组或克隆而与天然成分明显不同的DNA分离物,以及化学合成的类似物或由异源系统生物合成的类似物。基本纯的分子包括完全被纯化或基本上被纯化的分离分子。
分离的核酸通常是一种匀质的分子组合物,但在某些实施方案中也会含有异质成分,优选的是含有少量异质成分。该异质成分通常位于聚合物末端或对特定生物学功能或生物活性不重要的部分。
“重组”核酸一般是按照其产生方法或结构来定义的。根据其产生方法,重组核酸是指,例如,用核酸重组技术获得的产物,这些技术涉及,例如,对核苷酸序列进行人工干预。这种干预通常包括体外操作,但在某些情况下也包括更经典的动物育种技术。作为选择,制备重组核酸的方法也可以是将天然条件下彼此不连续的两种片段融合成一种序列,但重组核酸应排除天然产物,如自然状态下存在的天然突变体。因此,重组核酸包括由任一种非天然载体转化的细胞获得的产物,以及含有经任何寡核苷酸合成方法产生的序列的核酸。这种方法通常是将密码子替换成编码相同氨基酸或保守氨基酸的冗余密码子,同时引入或去除一种限制酶识别位点序列。作为选择,还可利用该方法将具有特定功能的核酸片段相融合,以产生一种不以常见的天然形式存在的单一遗传实体,该遗传实体具有所需的功能组合,如编码一种融合蛋白。这些人工操作一般是针对限制酶识别位点,但也可以设计引入其他位点特异性靶序列,如启动子、DNA复制位点、调节序列、控制序列或其他有用的特性。类似的概念也适用于重组体,如融合蛋白和多肽。此外还包括重复二聚体。此外还特别包括一些合成核酸,根据遗传密码冗余性,这些核酸可以编码与DTLR2-5片段相同的多肽,也编码由多种不同相关分子,如其他IL-1受体家族成员,获得的融合序列。
核酸的“片段”是指至少约17个核苷酸的连续序列段,一般是至少21个核苷酸,更一般的是至少25个核苷酸,普通的是至少30个核苷酸,更普通的是至少35个核苷酸,通常是至少39个核苷酸,更多的是至少45个核苷酸,典型的是至少50个核苷酸,更典型的是至少55个核苷酸,常见的是至少60个核苷酸,更常见的是至少66个核苷酸,优选的是至少72个核苷酸,更优选的是至少79个核苷酸,而在特别优选实施方案中是至少85个或更多个核苷酸。不同遗传序列的片段通常可以在适当长度的序列段上彼此进行比较,特别是诸如下述结构域等特定的片段。
编码DTLR2-10的核酸特别适用于鉴定那些编码自身或密切相关蛋白的基因、mRNA和cDNA,以及编码多形性变体、等位变体或其他遗传变体的DNAs,如来源于不同个体或相关物种的变体。这些筛选方法的优选探针是在不同多形性变体中保守的白介素区域,或含有缺乏特异性的核苷酸的区域,优选的则是全长或近似全长的区域。在其他情况下,多形性变体特异性序列会具有更大的作用。
本发明还包括含有与上述分离DNA相同或高度同源的核酸序列的重组核酸分子和片段。具体地说,这些序列通常与调控转录、翻译及DNA复制的DNA片段有效联接。这些附加片段通常能协助所需核酸片段的表达。
当同源核酸序列或高度同一的核酸序列彼此比较或与表2-10所示序列比较时,可表现出明显的相似性。核酸的同源性标准可采用该领域常用的通过序列比较获得同源性的方法,也可根据杂交条件来确定。下文将更详细地描述用于比较的杂交条件。
核酸序列比较中的基本相同是指,比较片段或其互补链在通过适当的核苷酸插入或缺失而实现优化排列后,其核苷酸的同一性至少约为60%,一般是至少66%,普通的是至少71%,通常是至少76%,更多的是至少80%,常见的是至少84%,更常见的是至少88%,典型的是至少91%,更典型的是至少约93%,优选的是至少约95%,更优选的是至少约96-98%或更高,而在特别实施方案中则高达约99%或更高,包括,例如,编码结构域的片段,如下文描述的片段。作为选择,基本相同还可以指片段能够在选择性杂交条件下与一种链或其互补链杂交,该链通常采用来源于表2-10的一种序列。典型的选择性杂交出现在至少约14个核苷酸的序列段具有至少约55%同源性的情况下,更典型的是具有至少约65%的同源性,优选的是具有至少约75%的同源性,更优选的是具有至少约90%的同源性。参考Kanehisa(1984)
Nucl.Acids Res.12:203-213,其内容在此引入作为参考。进行上述同源性比较的序列段可以具有更长的长度,在某些实施方案中,该序列段至少长约17个核苷酸,一般是至少长约20个核苷酸,普通的是至少长约24个核苷酸,常见的是至少长约28个核苷酸,典型的是至少长约32个核苷酸,更典型的是至少长约40个核苷酸,优选的是至少长约50个核苷酸,更优选的是至少长约75-100个或更多个核苷酸。
在涉及杂交同源性的描述中,严格条件是指盐浓度、温度、有机溶剂以及杂交反应中受控制的其他典型参数的严格组合条件。常见的严格温度条件包括温度高于约30℃,更常见的是高于约37℃,典型的是高于约45℃,更典型的是高于约55℃,优选的是高于约65℃,更优选的是高于约70℃。普通的严格盐浓度条件是低于约500mM,常见的是低于约400mM,更常见的是低于约300mM,典型的是低于约200mM,优选的是低于约100mM,更优选的是低于约80mM,甚至低于约20mM。但与任何单个参数的测定值相比,参数的组合更加重要。可参考,例如,Wetmur and Dayidson(1968)
J.Mol.Biol.31:349-370,其内容在此引入作为参考。
作为选择,通常还可以将一种序列作为序列比较的参照序列,并将测试序列与其进行比较。当使用一种序列比较算法时,可以将测试序列和参照序列输入计算机,如果需要,还可以设定子序列坐标,并设定序列比较程序的参数。再根据设定的程序参数,由该序列比较算法计算出测试序列相对于参照序列的同一性百分率。
比较序列的最佳比对可以通过,例如,Smith and Waterman(1981)
Adv.Appl.Math.2:482的局部同源性算法、Needlman andWunsch(1970)
J.Mol.Biol.48:443的同源性比较算法、Pearsonand Lipman(1988)
Proc.Nat’l Acad.Sci.USA 85:2444的相似性搜索方法、这些算法的计算机程序(Wisconsin遗传学软件包中的GAP、BESTFIT、FASTA和TFASTA,Genetics Computer Group,575Science Dr.,Madison,WI)或通过目测来实现(通常可参考Ausubelet al.,见上文)。
例如,PILEUP就是一种有效的算法。它可以通过渐进的成对比较由一组相关序列产生多序列比对结果,以显示这些序列的关系和同一性百分率。PILEUP还可以绘制一种树状图,以显示用于产生比较结果的集簇关系。PILEUP采用的是Feng and Doolittle(1987)
J.Mol. Evol.35:351-360的一种简化的渐进比较方法。该方法与Higginsand Sharp(1989)
CABIOS 5:151-153中描述的方法类似。该程序可以对300个序列进行比对,每个序列的最大长度为5,000个核苷酸或氨基酸。多序列比对的过程是先将两种最相似的序列进行成对比对,以产生这两种比对序列的簇。然后将该簇与下一个最相关序列或比对序列簇进行比对。两种序列簇的比对方法只是两种单独序列的成对比对方法的简单延伸。通过一系列的渐进性成对比对可以获得最终的比对结果。该程序的运行方法是指定特定的序列及其序列比较区域的氨基酸或核苷酸坐标,并设定程序参数。例如,可以将一种参照序列与其他测试序列进行比较,以利用下述参数来确定序列的同一性百分率:默认的空位权重(3.00)、默认的空位长度权重(0.10)以及加权的末端空位。
另一种适用于计算序列同一性百分率和序列相似性百分率的算法实例是Altschul,et al.(1990)
J.Mol.Biol.215:403-410中描述的BLAST算法。从国家生物技术信息中心(http:www.ncbi.nlm.nih.gov/)可以公开获得用于实现BLAST分析的软件。该算法包括,先确定高评分值的序列对(HSPs),方法是在查询序列中确定长度为W的短字符串,该字符串与数据库序列中长度相同的字符串进行比较时能够符合或满足一些正评分阈值T。T是指邻近字符串评分阈值(Altschul,et al.,见上文)。将这些最初的邻近字符串采样结果作为起始点来进行搜索,以寻找含有这些字符串的更长的HSPs。然后将采样字符串沿每条序列进行双向延伸,只要累计比较评分值能够增加。采样字符串在每个方向上延伸的终止条件是:累计比较评分值下降到比其达到的最大值低一定值X;由于一个或多个负评分值比较残基的累加而使累计评分值达到0或0以下;或到达任一序列的末端。BLAST算法的参数W、T和X确定了该算法的灵敏度和速度。BLAST程序的字符串长度(W)默认值为11,BLOSUM62评分矩阵(参考Henikoff and Henikoff(1989)
Proc.Nat’l Acad. Sci.USA 89:10915)排列(B)默认值为50,期望值(E)默认值为10,M=5,N=4,并且是两条链均比较。
除计算序列的同一性百分率外,BLAST算法还可以对两种序列之间的相似性进行统计分析(可参考,例如,Karlin and Altschul(1993)
Proc.Nat’l Acad.Sci.USA 90:5873-5787)。BLAST算法提供的一种相似性度量方法是最小的总概率值(P(N)),它可作为一种指标来说明两种核苷酸或氨基酸序列之间偶然出现匹配的概率。例如,一种核酸被认为与参照序列相似的条件是,该测试核酸与参照核酸比较的最小总概率值小于约0.1,更优选的是小于约0.01,最优选的是小于约0.001。
如下文所述,两种多肽的核酸序列基本相同的另一种指标是,第一种核酸编码的多肽能够与第二种核酸编码的多肽发生免疫交叉反应。因此,一种多肽与另一种多肽基本相同的条件通常是,例如,两种肽只因保守取代而有所不同。如下文所述,两种核酸序列基本相同的另一指标是这两种分子能够在严格条件下彼此杂交。
通过核苷酸取代、核苷酸缺失、核苷酸插入以及核苷酸序列段的倒位可以很容易地对分离DNA进行修饰。这些修饰可以产生编码该蛋白或其衍生物的新DNA序列。这些修饰序列可用于产生突变蛋白(突变型蛋白质),或用于促进不同物种的表达。表达的加强可能涉及基因扩增、转录增加、翻译增加以及其他机制。这些突变的DTLR样衍生物包括预定的或位点特异性的突变蛋白或其片段,包括由遗传密码简并性获得的沉寂突变。文中使用的“突变型DTLR”包括上述DTLR同源性定义范围内的多肽,以及具有一种通过缺失、取代或插入而与天然存在的其他DTLR样蛋白有所不同的氨基酸序列的多肽。具体地说,“位点特异性突变型DTLR”包括与表2-10的蛋白基本同源的蛋白,并且通常具有文中公开的形式的大多数生物活性或作用。
位点特异性突变位点是预先确定的,但并不要求突变体必须具有位点特异性。通过在基因中插入或去除氨基酸并使其表达可以实现哺乳动物DTLR的诱变。可以用取代、缺失、插入或其任意组合来获得最终的构建体。插入包括氨基或羧基末端的融合。可以对预定密码子进行随机诱变,再根据预期活性对表达的哺乳动物DTLR突变体进行筛选。对序列已知的DNA的预定位点进行置换突变的方法已为该领域所熟知,例如,可采用M13引物诱变方法。还可以参考Sambrook,etal.(1989)和Ausubel,et al.(1987及定期增刊)。
DNA中的突变通常不应该使编码序列脱离可读框,优选的是不会产生可杂交形成环袢或发夹等二级mRNA结构的互补区。
Beaucage and Carruthers(1981)
Tetra.Letts.22:1859-1862中描述的亚磷酰胺法可用于产生适当的合成DNA片段。获得双链片段的方法通常是合成互补链并使链在适当条件下退火,或是将互补链与适当的引物序列混合并用DNA聚合酶来产生。
通常可以用聚合酶链反应(PCR)技术来实现诱变。作为选择,诱变引物法也是在预定位点产生特定突变的常用方法。可参考,例如,Innis,et al.(1990年版)《
PCR方法及应用指南》Academic Press,San Diego,CA;和Dieffenbach and Dyeksler(1995年版)《
PCR 引物实验手册》Cold Spring Harbor Press,CSH,NY。
IV.蛋白、肽
如上所述,本发明包括灵长类动物的DTLR2-10,如具有表2-10所示序列的DTLR2-10,以及上文描述的其他形式。此外还涉及等位变体和其他变体,包括,例如,将这些序列的某些部分与表位标记或功能结构域等其他部分相结合的融合蛋白。
本发明还提供一些重组蛋白,例如用啮齿动物的这些蛋白的片段产生的异源融合蛋白。异源融合蛋白是指自然条件下一般不会以这种方式融合的蛋白或蛋白片段的一种融合体。因此,DTLR与IL-1受体的融合产物是一种通过标准肽键将两种序列融合的连续蛋白分子,这种蛋白分子通常是作为单一的翻译产物而产生,并且具有每种来源肽的特性,如序列或抗原性。类似的概念也适用于异源核酸序列。
此外,还可以用IL-1受体或其他DTLRs等相关蛋白的类似的功能域或结构域来组合产生新构建体,这些相关蛋白包括物种变体。例如,可以将不同的新融合多肽或多肽片段之间的配体结合区或其他区域进行“交换”。可参考,例如,Cunningham,et al.(1989)
Science243:1330-1336;和O’Dowd.et al.(1988)
J.Biol.Chem.263:15985-15992,其内容均在此引入作为参考。因此,通过受体结合特性的功能性联接可以产生具有新特性组合的新嵌合多肽。例如,可以添加其他相关受体分子的配体结合区,也可以将其替换成该分子或相关蛋白的其他结构域。所得蛋白通常具有组合的功能和特性。例如,融合蛋白可含有一种能够使其局限于特定亚细胞器的定向结构域。
候选的融合配偶体及序列可选自不同的序列数据库,如GenBank,c/o IntelliGenetics,Mountain View,CA;和BCG,University ofWisconsin Biotechnology Computing Group,Madison,WI,其内容均在此引入作为参考。
本发明特别提供一些能够与Toll配体结合并且(或者)在信号转导中受影响的突变蛋白。人DTLR1-10与IL-1家族其他成员的结构比较可说明其具有保守的性质(残基)。例如可参考图3A。人DTLR序列与IL-1家族其他成员的比较结果显示出结构及功能上的多种共性。还可参考Bazan,et al.(1996)
Nature 379:591;Lodi,et al.(1994)
Science 263:1762-1766;Sayle and Milner-White(1995)TIBS 20:374-376;和Gronenberg,et al.(1991)
Protein Engineering 4:263-269。
IL-1α和IL-1β配体能够与作为初级受体的I型IL-1受体结合,继而该复合物能够与III型IL-1受体形成高亲和性受体复合物。这些受体亚基有可能是IL-1家族新成员的共用受体。
DTLR2-10序列的其他物种对应物中的类似变异,例如在对应区域中的类似变异,应能够产生与配体或底物相互作用的类似功能。特别优选的是用小鼠序列或人序列进行替换。反之,配体结合相互作用区以外的保守取代可以使大多数信号活性得以保留。
灵长类动物DTLR2-10的“衍生物”包括氨基酸序列突变体、糖基化变体、代谢衍生物以及与其他化学基团的共价结合物或聚集物。共价衍生物的制备方法可以是,例如,用该领域熟知的方法使功能基团与DTLR氨基酸侧链或N-或C-末端的基团进行联接。这些衍生物可包括羧基末端的或含羧基侧链残基的脂肪族酯类衍生物或酰胺类衍生物、含羟基残基的O-酰基衍生物,以及氨基末端氨基酸或赖氨酸或精氨酸等含氨基残基的N-酰基衍生物,但不局限于此。酰基可选自烷基基团,包括C3-C18标准烷基,因此形成的是烷酰芳酰类衍生物。
具体地说,糖基化衍生物包括,例如,通过在多肽合成及加工过程中或在进一步的加工过程中改变其糖基化模式而产生的糖基化衍生物。其实现的特别优选方法是将这种多肽暴露于由正常情况下能够进行这种加工的细胞获得的糖基化酶,如哺乳动物糖基化酶。还可以考虑去糖基化酶。此外还包括一级氨基酸序列相同但含有其他次要修饰的形式,包括磷酸化氨基酸残基,如磷酸酪氨酸、磷酸丝氨酸或磷酸苏氨酸。
主要的一组衍生物是这些受体或其片段与其他蛋白的共价偶联物。这些衍生物可以通过诸如N-末端或C-末端融合体的重组培养物来加以合成,也可以利用该领域已知的能够通过活性侧链基团与蛋白交联的制剂来合成。与交联制剂的优选衍生化位点是游离氨基、糖类基团,以及半胱氨酸残基。
此外还提供这些受体与其他同源或异源蛋白的融合多肽。同源多肽可以是不同受体的融合体,由此可以产生,例如,一种对多种不同Toll配体具有结合特异性的杂合蛋白,或一种对底物作用的特异性变宽或减弱的受体。也可以构建具有衍生蛋白组合特性或活性的异源融合体。典型的实例是萤光素酶等报告多肽与配体结合区等受体片段或受体结构域的融合体,这样可以很容易地检测特定配体的存在情况或定位。可参考,例如,Dull,et al.,美国专利4,859,609,其内容在此引入作为参考。其他基因融合配偶体包括谷胱甘肽-S-转移酶(GST)、细菌β-半乳糖苷酶、trpE、蛋白A、β-内酰胺酶、α-淀粉酶、乙醇脱氢酶和酵母α交配因子。可参考,例如,Godowski,et al.(1988)
Science 241:812-816。
Beaucage and Carruthers(1981)
Tetra.Letts.22:1859-1862中描述的亚磷酰胺法可用于产生适当的合成DNA片段。获得双链片段的方法通常是合成互补链并使链在适当条件下退火,或是将互补链与适当的引物序列混合并用DNA聚合酶来产生。
这些多肽还可以含有已被化学修饰的氨基酸残基,如磷酸化、磺化、生物素化,或被添加或去除了其他基团,特别是分子形状与磷酸基团类似的基团。一些实施方案中的修饰是有效的标记试剂或纯化目标,如亲和配体。
融合蛋白的制备一般可以通过核酸重组法或多肽合成法来实现。有关核酸操作及表达技术的综合描述,可参考,例如,Sambrook,etal.(1989)《
分子克隆实验手册》(第二版),Vols.1-3,Cold SpringHarbor Laboratory和Ausubel,et al.(1987年版及定期增刊)《
最 新分子生物学实验方法》,Greene/Wiley,New York;其内容均在此引入作为参考。有关多肽合成技术的描述,可参考,例如,Merrifield(1963)
J.Amer.Chem.Soc.85:2149-2156;Merrifield(1986)Science 232:341-347;和Atherton,et al.(1989)《
固相肽合成 实验方法》,IRL Press,Oxford;其内容均在此引入作为参考。有关制备较长多肽的方法,还可以参考Dawson,et al.(1994)
Science266:776-779。
本发明还涉及除氨基酸序列变体或糖基化变体之外的其他DTLR2-10衍生物的应用。这些衍生物可包括与化学基团的共价结合物或聚集物。这些衍生物通常可分为3类:(1)盐类衍生物,(2)侧链及末端残基的共价修饰物,以及(3)吸附复合物,例如吸附于细胞膜的复合物。这些共价衍生物或聚集物可作为免疫原,也可作为免疫测定法中的试剂,还可以作为诸如用于亲和纯化受体或抗体等其他结合分子的纯化方法中的试剂。举例来说,可以用该领域熟知的方法将Toll配体通过共价结合固定在一种固体支持物上,如固定在溴化氰活化的Sepharose上,也可以通过或不通过戊二醛交联将Toll配体吸附于聚烯烃表面,从而用于DTLR受体、抗体或其他类似分子的测定或纯化。还可以用一种可检测基团标记该配体,例如用氯胺T方法进行放射性碘标记,或与稀土螯合物共价结合,或与另一种荧光基团偶联,从而用于诊断测定。
本发明的DTLR可作为抗原,用于产生对该DTLR或其不同片段具有特异性的抗血清或抗体,例如能使其有别于其他IL-1受体家族成员。纯化的DTLR可用于筛选由包含该蛋白的混合制品免疫获得的单克隆抗体或抗原结合片段。术语“抗体”还特别包括天然抗体的抗原结合片段,如Fab、Fab2、Fv等。纯化的DTLR还可作为试剂,用于检测随表达水平提高而产生的抗体或导致产生抗内源受体抗体的免疫学病症。此外,还可以如下文所述,将DTLR片段作为免疫原用于产生本发明的抗体。例如,本发明涉及一些对表2-10所示氨基酸序列或其片段或多种同源肽具有结合亲和力或由其诱发产生的抗体。本发明还特别涉及一些对可能暴露或确实暴露于天然DTLR蛋白外表面的特定片段具有结合亲和力或由这些片段诱发产生的抗体。
抑制配体与受体结合的结果可能是通过竞争性抑制来阻断对该受体配体的生理学反应。因此,本发明的体外测定方法通常是采用抗体、这些抗体的抗原结合区,或与固相底物连接的片段。这些测定方法还可用于诊断测定配体结合区突变或修饰的影响,或诸如影响信号功能或酶功能的其他突变和修饰的作用。
本发明还涉及竞争性药物筛选测定法的应用,例如用受体的中和抗体或其片段来竞争测试化合物与配体或其他抗体的结合。由此可利用中和抗体或其片段来检测具有一种或多种受体结合位点的多肽的存在情况,还可用来占据受体上可能与配体结合的结合位点。
V.核酸及蛋白的制备
编码蛋白或其片段的DNA的获得方法可以是化学合成、筛选cDNA库,或筛选由多种细胞系或组织样品制备的基因组库。天然序列的分离可采用标准的方法,诸如文中的表2-10提供了这些序列。其他物种的对应物的鉴定可采用杂交技术、或不同的PCR技术,并结合或采用搜索GenBank等序列数据库的方法。
该DNA可以在多种不同的宿主细胞中表达,以合成可继而用于,例如,产生多克隆抗体或单克隆抗体的全长受体或其片段;用于结合测定;用于构建并表达经过改造的配体结合区或激酶/磷酸酶结构域;并用于结构/功能研究。变体或片段的表达可以在适当表达载体转化或转染的宿主细胞中进行。除含有来自重组宿主的杂质之外,这些分子可以基本上不含其他蛋白杂质或细胞杂质,因而特别适合与一种药学容许的载体和/或稀释剂组合而作为药用组合物使用。该蛋白或其片段可作为与其他蛋白的融合体来加以表达。
表达载体通常是含有所需受体基因或其片段的自我复制型DNA或RNA构建体,其中的基因或其片段通常与一些能够被适当宿主细胞识别的适当遗传调控元件有效连接。这些调控元件能够在适当宿主内影响表达。为影响表达所必需的特定调控元件的类型取决于最终使用的宿主细胞。这些遗传调控元件通常可包含一种原核启动子系统或一种真核启动子表达调控系统,典型的则还包含一种转录启动子、一种可选的用于调控转录起始的操纵基因、用于提高mRNA表达水平的转录增强子、一种编码适当核糖体结合位点的序列,以及终止转录和翻译的序列。表达载体通常还可以包含一种复制起始点,以使该载体的复制不依赖于宿主细胞。
本发明的载体包括那些含有所述蛋白编码DNA或编码等效生物活性多肽的DNA片段的载体。这些DNA可以用病毒启动子来调控,也可以编码一种选择标记。本发明还涉及这些表达载体的应用,这些载体能够在原核或真核宿主中表达真核蛋白的编码cDNA,其中,所述载体与所述宿主相容,并且在所述载体中插入有真核受体的编码cDNA,从而使含有该载体的宿主在生长时能够表达所述cDNA。表达载体通常被设计成能够在其宿主细胞内稳定复制或扩增,从而极大地提高每个细胞中的所需基因总拷贝数。表达载体在宿主细胞内复制的能力并不总是必需具备的条件,例如,可以用不含能被宿主细胞识别的复制起始点的载体在多种宿主中实现该蛋白或其片段的瞬时表达。此外,还可以使用能通过重组将蛋白编码区或其片段整合到宿主DNA中的载体。
文中描述的载体包括质粒、病毒、细菌噬菌体、整合型DNA片段,以及能够使DNA片段整合到宿主基因组中的其他载体。表达载体是一些含有遗传调控元件的特化载体,这些调控元件可影响与其有效连接的基因的表达。质粒是最常用的载体,但该领域已了解或将要了解的其他所有功能等价载体都适合于在本发明中使用。可参考,例如,Pouwels,et al.(1985及增刊)《
克隆载体实验手册》Elsevier,N.Y.和Rodriquez,et al.(eds.)《
载体:分子克隆载体及其应用概论》Buttersworth,Boston,1988,其内容均在此引入作为参考。
转化细胞是已被利用DNA重组技术构建的受体载体转化或转染的细胞,优选的是哺乳动物细胞。被转化的宿主细胞通常可表达所需蛋白或其片段,但如果是用于其DNA的克隆、扩增和操作,则这些转化细胞无需表达目的蛋白。本发明还涉及将转化细胞培养在营养培养基中,从而使所述受体能够聚集在细胞膜上。可以从培养物中回收目的蛋白,在某些情况下,也可以从培养基中回收目的蛋白。
就本发明而言,可以将功能上相关的核酸序列彼此有效连接。举例来说,可以将一种原序列或分泌型导肽与一种多肽的DNA有效连接,条件是这种这种原序列或分泌型导肽可作为原蛋白表达,或可参与将所述多肽引导至细胞膜,或可参与所述多肽的分泌。可以将一种启动子与编码序列有效连接,条件是这种启动子可调控该多肽的转录;还可以将核糖体结合位点与编码序列有效连接,条件是这种核糖体结合位点的定位能够使翻译得以进行。有效连接通常是指邻接并且可读框吻合,但有些遗传元件,如阻抑物基因,不与操纵子序列邻接,而只是与其连接并调控其表达。
适当的宿主细胞包括原核细胞、低等真核细胞和高等真核细胞。原核细胞包括革兰氏阴性生物和革兰氏阳性生物,如
大肠杆菌和
枯草 芽孢杆菌。低等真核细胞包括酵母,如
酿酒酵母和
毕赤酵母,以及网柄菌属中的物种。高等真核细胞包括由动物细胞建立的组织培养细胞系,包括非哺乳动物来源的细胞,如昆虫细胞和鸟类细胞,还包括哺乳动物来源的细胞,如人、灵长类动物和啮齿动物的细胞。
原核宿主-载体系统包括多种用于不同物种的载体。文中使用的大肠杆菌及其载体一般包括用于其他原核细胞的等价载体。用于DNA扩增的典型载体是pBR322或其多种衍生物。用于表达受体或其片段的载体包括那些含有lac启动子(pUC系列);trp启动子(pBR322-trp);Ipp启动子(pIN系列);λ-pP或pR启动子(pOTS);或杂合启动子,如ptac(pDR540),但不局限于此。可参考《
载体:分子克隆载 体及其应用概论》(eds.Rodriquez and Denhardt),Buttersworth,Boston,第10章,pp.205-236中的Brosius,et al.(1988)“采用λ-、trp-、lac-及Ipp-衍生启动子的表达载体”。
可以用含有DTLR系列的载体转化低等真核细胞,如酵母和网柄菌属细胞。就本发明而言,最常用的低等真核细胞宿主为面包酵母,即酿酒酵母。一般是将其作为典型的低等真核细胞,但也可以使用其他多种菌株和物种。酵母载体通常含有一种复制起始点(整合型除外)、一种选择基因、一种启动子、受体或其片段的编码DNA,以及翻译终止序列、多腺苷酸化序列和转录终止序列。适当的酵母表达载体可含有组成型启动子,如3-磷酸甘油酸激酶启动子和其他多种糖酵解酶基因启动子,也可含有诱导型启动子,如乙醇脱氢酶2启动子或金属硫蛋白启动子。适当的载体包括下述类型的衍生物:自我复制的低拷贝型(如YRp-系列)、自我复制的高拷贝型(YEp-系列);整合型(YIp-系列)或微小染色体(如YCp-系列)。
高等真核组织培养细胞一般是用于表达功能活性白介素蛋白的优选宿主细胞。原则上可以使用任何非脊椎动物或脊椎动物来源的高等真核组织培养细胞系,如昆虫杆状病毒表达系统。这些细胞的转化或转染以及增殖已成为常规的方法。可使用的细胞系实例包括HeLa细胞、中国仓鼠卵巢(CHO)细胞系、幼鼠肾(BRK)细胞系、昆虫细胞系、鸟类细胞系,以及猴(COS)细胞系。这些细胞系的表达载体通常含有一种复制起始点、一种启动子、一种翻译起始位点、RNA剪接位点(如果使用基因组DNA)、一种多腺苷酸化位点和一种转录终止位点。这些载体特别还可以包含一种选择基因或扩增基因。适当的表达载体可以是带有来源于诸如腺病毒、SV40、细小病毒、痘苗病毒或巨细胞病毒等的启动子的质粒、病毒或逆转录病毒。适当表达载体的典型实例包括pCDNA1;pCD,参考Okayama,et al.(1985)
Mol.Cell Biol.5:1136-1142;pMClneo PolyA,参考Thomas,et al.(1987)Cell 51:503-512;和杆状病毒载体,如pAC373或pAC610。
分泌型蛋白的可读框通常编码一种多肽,该多肽含有一种在其N端与信号肽共价连接的成熟产物或分泌产物。该信号肽在成熟多肽或活性多肽分泌之前被剪切。剪切位点可通过经验法则高度精确地加以预测,如von-Hei jne(1986)
Nucleic Acids Research 14:4683-4690,而信号肽的精确氨基酸组成似乎不对其功能起关键性作用,如Randall,et al.(1989)
Science 243:1156-1159;Kaiser,et al.(1987)
Science 235:312-317。
经常会希望在某种系统中以特定或规定的糖基化模式来表达这些多肽。在这种情况下,通常是获得由表达系统天然产生的糖基化模式。但也可以对这种模式进行修饰,方法是将多肽,如非糖基化形式的多肽,暴露于被引入到异源表达系统中的适当糖基化蛋白。例如,可以将一种或多种编码哺乳动物糖基化酶或其他糖基化酶的基因与受体基因进行共转化。利用这种方法可以在原核细胞或其他细胞内获得某些哺乳动物糖基化模式。
DTLR的来源可以是表达重组DTLR的真核或原核宿主,如上文描述的宿主。此外还可以是小鼠Swiss 3T3成纤维细胞等细胞系,但本发明还涉及其他哺乳动物细胞系,优选的则是来源于人的细胞系。
由于序列为已知,因而可通过传统的肽合成方法来制备灵长类动物DTLRs或其片段或衍生物。这些方法包括,例如,Stewart and Young(1984)《
固相肽合成》,Pierce Chemical Co.,Rockford,IL;Bodanszky and Bodanszky(1984)《
肽合成的实践方法》,Springer-Verlag,New York;和Bodanszky(1984)《
肽合成原理》,Springer-Verlag,New York中描述的方法,其内容均在此引入作为参考。举例来说,可以使用叠氮法、酰基氯法、酸酐法、混合酐法、活化酯法(如对硝基苯酯、N-羟基琥珀酰亚胺基酯,或氰基甲酯)、碳化二亚胺唑法、氧化还原法,或二环己基碳二亚胺(DCCD)/附加剂法。上述方法可采用固相和液相合成。部分DTLR序列也可采用类似的技术。
根据上文描述的常用肽合成方法可以制备出适当的DTLR蛋白或其片段或衍生物,一般可通过所谓的逐步方式来进行,该方式包括将氨基酸按照序列逐个与末端氨基酸缩合,也可以用肽段与末端氨基酸结合。结合反应中不使用的氨基通常需加以保护,以避免在错误位置发生结合。
如果采用固相合成法,则可通过羧基将C-末端氨基酸结合于一种不溶性载体或支持物。不溶性载体要能够与活性羧基结合,除此之外并没有特别的限制。不溶性载体的实例包括卤甲基树脂,如氯甲基树脂或溴甲基树脂,羟甲基树脂、苯酚树脂、叔-烷氧基羰基肼化树脂,等等。
肽的逐步合成可通过氨基被保护的氨基酸的依次结合来实现,这种氨基酸的活性羧基能够与此前形成的肽或肽链上的活性氨基缩合。当合成了全部序列之后,可以使肽与不溶性载体分离,以获得所需的肽。有关这种固相方法的综合描述,可参考Merrifield,et al.(1963)
J.Am.Chem.Soc.85:2149-2156,其内容在此引入作为参考。
利用肽分离方法可以从反应混合物中分离和纯化出已制备好的肽及其片段,例如可采用抽提、沉淀、电泳、不同形式的层析等方法。根据预期的应用,可以获得不同纯度的本发明所述受体。可以用下文公开的蛋白纯化技术来进行纯化,也可以用免疫吸附亲和层析方法中描述的抗体来进行纯化。免疫吸附亲和层析的实施方法是,先将抗体与固体支持物连接,再用连接的抗体与适当细胞的增溶裂解物、表达该受体的其他细胞的裂解物,或经过DNA技术处理而能够产生该蛋白的细胞的裂解物或上清液相接触,见下文。
一般而言,纯化蛋白的纯度至少约为40%,普通的是至少约50%,常见的是至少约60%,典型的是至少约70%,更典型的是至少约80%,优选的是至少约90%,更优选的是至少约95%,而在特别实施方案中是至少约97%-99%或更高。纯度一般是根据重量来计算,但也可以根据摩尔数来计算。可以根据需要采用不同的测定方法。
VI.抗体
灵长类动物等不同哺乳动物的DTLR蛋白及其片段都可用于产生抗体,这些蛋白及其片段可以是自然界中存在的天然形式,也可以是重组形式,其区别在于天然受体的抗体更可能识别只以天然构象存在的表位。变性抗原的检测也具有应用价值,例如可用于蛋白印迹分析。此外还包括抗独特型抗体,这些抗体可作为天然受体或抗体的激动剂或拮抗剂使用。
优选的抗体应同时具备亲和性和选择性。通常是优选高亲和性,而选择性能够使抗体鉴别不同亚类的实施方案。具体地说,理想的是获得一些能够与,例如,不同特定组合的相关成员特征性结合但不与其他成员结合的抗体制剂。这些不同的亚类组合可以特异性地形式,例如,可利用免疫亲和、筛选等标准方法来产生或筛选这些试剂。
对蛋白的特定片段具有抗性的抗体,包括结合片段和单链型抗体,可通过将该片段与免疫原性蛋白的结合物对动物进行免疫的方法来产生。单克隆抗体可由分泌该抗体的细胞获得。可以根据与正常蛋白或缺陷型蛋白的结合能力对这些抗体进行筛选,也可以根据激动活性或拮抗活性进行筛选。这些单克隆抗体通常能够以至少约1mM的KD进行结合,更多的是至少约300μM,典型的是至少约100μM,更典型的是至少约30μM,优选的是至少约10μM,更优选的是至少约3μM或更低。
本发明的抗体,包括抗原结合片段,可具有明显的诊断或治疗价值。这些抗体可作为强拮抗剂与其受体结合,并抑制配体的结合或该受体引发生物学应答的能力,例如对其底物的作用。这些抗体还可作为非中和性抗体使用,并且可联接于毒素或放射性核素,使其能够与产生白介素的细胞或定位于白介素产生源的细胞相结合。
此外,还可以将这些抗体直接或通过连接物间接偶联于药物或其他治疗剂。
本发明的抗体还可用于诊断性应用。这些抗体可作为俘获抗体或非中和性抗体与受体结合,但不抑制配体或底物的结合。这些抗体可作为中和抗体用于竞争性结合测定。此外还可用于配体的检测或定量。这些抗体可作为试剂用于蛋白印迹分析,或用于相应蛋白的免疫重叠或免疫纯化。
由于融合的或共价联接的多肽可作为免疫原使用,因而可以将蛋白片段与其他物质相联接,特别是与多肽相联接。哺乳动物DTLR及其片段可以与多种免疫原融合或共价联接,如钥孔血兰素、牛血清白蛋白、破伤风类毒素,等等。有关多克隆抗血清制备方法的描述,可参考《
微生物学》,Hoeber Medical Division,Harper and Row,1969;Landsteiner(1962)《
血清反应的特异性》,Dover Publications,New York;和Williams,et al.(1967)《
免疫学及免疫化学方法》,Vol.1,Academic Press,New York;其内容均在此引入作为参考。典型的方法涉及用一种抗原对动物进行超免疫。再于重复免疫短时间后收集动物血并分离γ球蛋白。
有时需要由多种哺乳动物宿主制备单克隆抗体,如小鼠、啮齿动物、灵长类动物、人,等等。有关这些单克隆抗体制备技术的描述,可参考,例如,Stites,et al.(eds.)《
基础及临床免疫学》(第四版),Lange Medical Publications,Los Altos,CA及其参考文献;Harlow and Lane(1988)《
抗体实验手册》,CSH Press;Goding(1986)《
单克隆抗体原理及实践》(第二版)Academic Press,NewYork;特别参考Kohler and Milstein(1975)
Nature 256:495-497中讨论的一种产生单克隆抗体的方法。这些文献均在此引入作为参考。简单地说,该方法涉及用一种免疫原对动物进行注射。然后将动物处死并取其脾细胞,再将这些细胞与骨髓瘤细胞融合。所得的杂合细胞或“杂交瘤”能够在体外繁殖。然后对杂交瘤群体进行筛选,以分离出单克隆,每种克隆可分泌单一种类的抗免疫原抗体。由此获得的单种抗体是无限增殖的克隆化单个B细胞的产物,这些细胞是免疫动物在对免疫原性物质上的特定识别位点产生应答的过程中产生的。
其他适当技术则涉及在体外将淋巴细胞暴露于抗原性多肽,作为选择,还可以对噬菌体或其他载体中携带的抗体库进行筛选。可参考Huse,et al.(1989)“在λ噬菌体中产生大型免疫球蛋白组合库”,Science 246:1275-1781;和Ward.et al.(1989)
Nature341:544-546,其内容均在此引入作为参考。本发明的多肽和抗体可经过改造或不经改造加以使用,如嵌合抗体或人源化抗体。通常可以将多肽和抗体进行标记,方法是与一种提供可检测信号的底物共价或非共价结合。已知有多种标记和结合物可以使用,并且在科技文献和专利文献中有广泛的描述。适当的标记包括放射性核素、酶、底物、辅因子、抑制剂、荧光基团、化学发光基团、磁性颗粒,等等。讲述这些标记的使用方法的专利包括美国专利3,817,837;3,850,752;3,939,350;3,996,345;4,277,437;4,275,149;和4,366,241。此外还可以产生重组的或嵌合的免疫球蛋白,可参考Cabilly,美国专利4,816,567;也可以在转基因小鼠中产生,可参考Mendez,et al.(1997)
Nature Genetics 15:146-156。这些文献均在此引入作为参考。
本发明的抗体还可用于DTLRs的亲和层析分离。在制备的层析柱中,可以将抗体连接于一种固体支持物,如琼脂糖、Sephadex等颗粒,然后使细胞裂解物过柱,洗涤,再用浓度递增的温和变性剂过柱,由此释放出纯化的蛋白。该蛋白可用于纯化抗体。
这些抗体还可用于在表达库中筛选特定的表达产物。该方法中使用的抗体通常带有一种标记基团,以便于通过抗体结合来检测抗原的存在情况。
用DTLR诱发的抗体还可用于产生抗独特型抗体。这些抗体可用于检测或诊断与该蛋白的表达有关或与表达该蛋白的细胞有关的多种免疫学病情。这些抗体还可作为配体的激动剂或拮抗剂使用,这些抗体可能是天然配体的竞争性抑制物,也可能是天然配体的替代物。
利用免疫测定法通常可检测出能够与一种抗体,例如用含有SEQID NO:4、6、8、10、12、14、16、18、20、22或24所示氨基酸序列的一种特定免疫原诱发产生的抗体,发生特异性结合或特异性免疫反应的DTLR蛋白。该免疫测定法通常使用一种多克隆抗血清,例如用SEQ ID NO:4、6、8、10、12、14、16、18、20、22或24的一种蛋白诱发产生的抗血清。这种抗血清经过筛选而对DTLR1等其他IL-1R家族成员具有低交叉反应性,优选的则是对相同物种的其他IL-1R家族成员具有低交叉反应性,在用于免疫测定之前,可通过免疫吸附来去除所有的交叉反应性。
为了产生用于免疫测定的抗血清,可以按文中所述分离出SEQ IDNO:4、6、8、10、12、14、16、18、20、22或24的蛋白或其组合。例如,可以在哺乳动物细胞系中产生重组蛋白。用选定的蛋白对适当宿主进行免疫,例如对Balb/c等近交系小鼠进行免疫,免疫一般采用标准的佐剂,如弗氏佐剂,并使用标准的小鼠免疫方案(参考Harlowand Lane,见上文)。作为选择,按本文所示序列合成并与一种载体蛋白连接的合成肽也可作为免疫原使用。收集多克隆血清,并利用免疫测定法,例如将免疫原固定在固体支持物上的固相免疫测定法,对免疫原蛋白进行滴定。选择那些效价为104或更高的多克隆抗血清,并利用上文Harlow and Lane第570-573页中描述的竞争性结合免疫测定法检验这些抗血清对小鼠DTLRs或人DTLR1等其他IL-1R家族成员的交叉反应性。优选的是在该测定中至少将两种DTLR家族成员与人DTLR2-10中的一些蛋白协同检测。这些IL-1R家族成员可以是通过文中描述的标准分子生物学及蛋白化学技术产生并分离的重组蛋白。
交叉反应性的测定可采用竞争性结合免疫测定法。举例来说,可以将SEQ ID NO:4、6、8、10、12、14、16、18、20、22和/或24的蛋白或其不同的片段固定在一种固体支持物上。测试反应中添加的蛋白可以对固定化抗原与抗血清的结合产生竞争作用。将上述蛋白对固定化蛋白与抗血清结合的竞争能力同SEQ ID NO:4、6、8、10、12、14、16、18、20、22和/或24的蛋白进行比较。利用标准方法计算出上述蛋白的交叉反应性百分率。挑选并合并那些与每种上述蛋白的交叉反应性低于10%的抗血清。然后用上述蛋白对合并的抗血清进行免疫吸附,以去除交叉反应性抗体。
然后将免疫吸附的合并抗血清用于上文描述的竞争性结合免疫测定,以此将第二种蛋白与免疫原蛋白(如SEQ ID NO:4、6、8、10、12、14、16、18、20、22和/或24的IL-1R样蛋白)进行比较。为了进行比较,可以在较宽的浓度范围内对这两种蛋白均进行检测,并确定每种蛋白在50%抑制固定化蛋白与抗血清结合时的必需量。如果第二种蛋白的必需量低于一种或多种选定蛋白的必需量的两倍,则可认为这第二种蛋白能够与免疫原诱发产生的抗体特异性结合。
应该了解的是,这些DTLR蛋白属于一种同源蛋白家族,迄今为止,该家族至少包括10种已确认的基因。对DTLR2-10等特定的基因产物而言,该术语不仅是指文中公开的氨基酸序列,而且还涉及属于等位变体、非等位变体或物种变体的其他蛋白。此外,该术语还包括非天然突变体,其突变的引入方法可以是利用单点突变等常规重组技术故意产生突变,也可以是将相应蛋白编码DNA的小段序列切除,还可以是替换成新氨基酸或添加新氨基酸。这些微小变体必须要基本保持原分子的免疫特性和/或生物活性。因此,这些变体包括能够与指定的天然IL-1R相关蛋白,如SEQ ID NO:2、4、6、8、10、12、14、16、18、20、22或24中显示的DTLR蛋白,产生特异性免疫反应的蛋白。检测变体蛋白生物学特性的方法可以是,在适当细胞系中表达该蛋白,并检测对淋巴细胞的适当影响。被认为不重要的特定蛋白修饰包括用化学性质类似的氨基酸进行保守取代,这与上文有关整个IL-1R家族的描述相同。本发明所述蛋白的成分的确定方法可以是,将蛋白与DTLR2-10的蛋白进行优化比较,以及用文中描述的传统免疫测定法来检测免疫特性。
VII.试剂盒及定量方法
本发明的天然IL-1R样分子和重组IL-1R样分子均特别适合于在试剂盒及测定方法中使用。举例来说,这些方法可用于结合活性筛选,如筛选这些蛋白的配体。近几年已发展了一些自动测定的方法,因而每年可对数万种化合物进行筛选。可参考,例如,BIOMEK自动化工作站,Beckman Insturments,Palo Alto,California和Fodor,et al.(1991)Science 251:767-773,其内容均在此引入作为参考。后者描述的是利用多种在固相底物上合成的特定聚合物来进行结合测试的方法。本发明可用于获得大量处于活性状态的可溶性纯化DTLRs,这非常有利于建立一些适合筛选配体或激动剂/拮抗剂同源蛋白的测定方法。
可以将纯化的DTLR直接包被在平板上,以便于在上述的配体筛选技术中使用。也可以用这些蛋白的非中和性抗体作为俘获抗体,将相应的受体固定在固相载体上,从而应用于,例如,诊断领域。
本发明还涉及将DTLR2-10及其片段、肽和融合产物应用于多种诊断试剂盒及方法,以检测该蛋白或其配体的存在情况。此外,或作为选择,还可以在这些试剂盒及方法中引入这些分子的抗体。试剂盒通常会含有一种隔室,其中装有特定的DTLR肽段或基因段,或装有能够识别肽段或基因段的试剂。肽的识别试剂通常是一种受体或抗体,基因段的识别试剂通常是一种杂交探针。
用于检测诸如DTLR4等样品浓度的优选试剂盒通常含有已知对DTLR4具有结合亲和力的一种标记化合物,如配体或抗体,一种作为阳性对照的DTLR4源(天然的或重组的),以及将结合标记化合物与游离标记化合物分离的一种工具,如用于固定测试样品中的DTLR4的一种固相载体。通常还提供装有试剂的隔室以及说明书。
对哺乳动物DTLR或其片段具有特异性的抗体,包括抗原结合片段,或受体片段可应用于诊断领域,以检测配体和/或其片段的水平是否提高。诊断测定可以是均质的(不需要将游离试剂与抗体-抗原复合物分离的步骤),也可以是异质的(需要一个分离步骤)。目前已有多种商品化的测定方法,如放射免疫测定法(RIA)、酶联免疫吸附测定法(ELISA)、酶免疫测定法(EIA)、酶放大免疫测定技术(EMIT)、底物标记荧光免疫测定法(SLFIA),等等。例如,未标记抗体的使用方法是用一种标记的二抗来识别这种抗DTLR4或其特定片段的抗体。此外,在文献中已对这些测定方法进行了广泛地论述。例如,可参考Harlow and Lane(1988)《
抗体实验手册》,CSH.和Coligan(1991年版及定期增刊)《
最新免疫学实验方法》Greene/Wiley,New York。
抗独特型抗体在作为DTLR4的激动剂或拮抗剂方面可能具有类似的用途。在适当情况下,这些抗体也可作为治疗剂使用。
通常可将用于诊断测定的试剂以试剂盒的形式提供,以便于优化测定的灵敏度。就本发明而言,可根据测定的性质来提供测定方法和标记物、该标记物可以是标记抗体或非标记抗体,也可以是标记的配体。试剂盒中通常还包含其他附加剂,如缓冲液、稳定剂、产生信号所需的物质,如酶底物,等等。优选的是该试剂盒中还包含正确使用的说明书,以及所含试剂在使用之后的处理说明书。该试剂盒通常含有一些用于装各种有效试剂的隔室,并且带有正确使用的说明书以及试剂的处理说明书。理想的是这些试剂以冻干粉末的形式提供,当进行测定时,可将这些试剂以适当的浓度重构于一种水性介质中。
用于诊断测定的上述组分不经改变即可使用,也可以用多种方法进行调整。例如,标记的方法可以是共价或非共价连接一种直接或间接提供可检测信号的基团。在所有这些测定方法中,可以对测试化合物、DTLR或其抗体进行直接标记,也可以进行间接标记。可用于直接标记的标记基团包括:125I等放射性标记、过氧化物酶和碱性磷酸酶等酶类(美国专利3,645,090),以及可通过荧光强度、波长偏移或荧光偏振来检测变化的荧光标记(美国专利3,940,475)。这两个专利均在此引入作为参考。可用于间接标记的方法包括将一种组分生物素化,然后与偶联了一种上述标记基团的抗生物素蛋白结合。
此外,还有多种将结合配体与游离配体分离的方法,作为选择,还有多种将结合化合物与游离化合物分离的方法。可以将DTLR固定在不同的基质上,然后进行洗涤。适当的基质包括ELISA平板等塑料制品、滤器和珠。将受体固定于基质的方法包括,但不局限于,直接粘连于塑料制品、使用一种俘获抗体、化学偶联法,以及生物素-抗生物素蛋白法。该方法的最后步骤涉及利用某些方法将抗体/抗原复合物沉淀,这些方法包括,例如,使用聚乙二醇等有机溶剂,或使用硫酸铵等盐类。其他适当的分离技术包括,但不局限于,Rattle,etal.(1984)
Clin.Chem.30(9):1457-1461描述的荧光素抗体磁化颗粒法和美国专利4,659,678描述的双抗体磁化颗粒分离法,其内容均在此引入作为参考。
在文献中已广泛报导了将蛋白或片段与不同标记连接的方法,在此无需详细论述。许多方法涉及利用碳化二亚胺或活性酯上的活性羧基来形成肽键,或通过巯基与氯乙酰基等活性卤素的反应来形成硫醚,或利用马来酰亚胺等活性烯烃进行连接,等等。在这些应用领域中也可以使用融合蛋白。
作为诊断性应用的另一方面,本发明涉及来源于DTLR序列的寡核苷酸或多核苷酸序列的应用。这些序列可作为探针,用于检测可能患有免疫功能障碍的患者体内的相应DTLR水平。在文献中已对RNA及DNA核苷酸序列的制备方法、标记方法和优选长度进行了大量论述。寡核苷酸探针通常应含有至少约14个核苷酸,常见的是至少约18个核苷酸,而多核苷酸可含有多达数千个碱基。可使用的标记有许多种,最常用的是放射性核素,特别是32P。但也可以使用其他技术,例如可在多核苷酸中引入生物素修饰的核苷酸。然后可将生物素作为抗生物素蛋白或抗体的结合位点,这些抗生物素蛋白或抗体可带有不同的标记,如放射性核素、荧光剂、酶,等等。作为选择,还可以使用能够识别特定双链体的抗体,包括DNA双链体、RNA双链体、DNA-RNA杂合双链体,或DNA-蛋白双链体。继而可将这些抗体标记,并在所述双链体与一种表面结合的情况下进行测定,从而在该表面上形成双链体时能够对抗体与该双链体结合的情况进行检测。将探针应用于新反义RNA的实现方法可以是任意的常规技术,包括核酸杂交技术、加减筛选技术、重组探测技术、杂交体释放翻译技术(HRT)和杂交体中止翻译技术(HART)。此外还包括聚合酶链反应(PCR)等扩增技术。
此外还涉及可用于定性或定量检测其他标记的诊断试剂盒。诊断或预后结果可能取决于多种标记的组合指征。因此,这些试剂盒可用于组合标记的测定。可参考,例如,Viallet,et al.(1989)
Progress in Growth Factor Res.1:89-97。
VIII.治疗性应用
本发明提供一些具有显著的治疗应用价值的试剂。DTLRs(天然的或重组的)及其片段、突变型蛋白受体、抗体以及确实与这些受体和抗体具有结合亲和力的化合物都可用于治疗其配体的受体表现出异常表达的疾病。这些异常通常表现为免疫功能障碍。此外,本发明对多种与配体表达异常或异常触发配体应答有关的疾病或病症也具有显著的治疗应用价值。研究表明,Toll配体可参与形态发生,如背腹极性建立,以及免疫应答,特别是原发性先天应答。可参考,例如,Sun,et al.(1991)
Eur.J.Biochem.196:247-254;Hultmark(1994)
Nature 367:116-117。
可以将重组DTLRs或其突变蛋白、激动剂、拮抗剂或抗体加以纯化,然后给药于患者。这些试剂可与其他活性成分,如药学容许的常规载体或稀释剂,以及不具有生理毒性的稳定剂和赋形剂联合用于治疗。这些联合制剂可以是无菌的,例如被过滤除菌,并可将其以剂量形式在剂量瓶中冻干或在稳定水性制剂中保存。本发明还涉及非补体结合性抗体及其结合片段的应用。
可利用DTLR或其片段进行配体筛选,以鉴定对受体具有结合亲和力的分子。随后可利用生物测定方法来检测是否假定配体具有阻断内源刺激活性的竞争性结合能力。由于受体片段可阻断配体活性,因而可作为阻断剂或拮抗剂使用。此外,具有内源刺激活性的化合物可以激活受体,从而可作为激动剂来刺激配体的活性,如信号诱发活性。本发明还涉及DTLR的拮抗剂抗体的治疗性应用。
有效治疗所需的试剂量取决于多种不同的因素,包括给药方法、目标部位、患者的生理状态,以及使用的其他药剂。因此,应该对治疗剂量进行滴定,以获得最佳的安全性和效力。通常可以将这些试剂的体外剂量作为指导来确定原位给药剂量。用动物来测试对特定疾病的治疗有效剂量的方法可以为人用剂量的确定提供进一步的预指标。有关其他多种注意事项的描述,可参考,例如,Gilman,et al.(1990年版)《
Goodman & Gilman’s:治疗的药学基础》,第8版,Pergamon Press;和《
Remington药物科学》(最新版),MackPublishing Co.,Easton,Penn.;其内容均在此引入作为参考。这些文献还描述了给药的方法,下文也将对此进行讨论,这些给药方法包括,例如,口服给药、静脉内给药、腹膜内给药、肌内给药、经皮肤扩散,等等。药学容许的载体包括水、盐水、缓冲液,以及
Merck Index,Merck & Co.,Rahway,New Jersey中描述的其他化合物。由于假定配体与其受体之间可能具有很高的结合亲和力或周转率,因而可首先预期低剂量的这些试剂就会有效。对信号通路的研究表明,极低剂量的配体即可产生影响。因此,在含有适当载体的条件下,通常可将剂量范围预定在1mM浓度以下,典型的是约在10μM浓度以下,常见的是约在100nM以下,优选的是约在10pM(皮摩尔)以下,最优选的是约在1fM(飞摩尔)以下。通常可以用缓释制剂或缓释装置来实现连续给药。
可以将DTLRs或其片段,抗体或其片段、拮抗剂和激动剂直接给药于待治疗的宿主,也可以根据化合物的大小,将其与卵白蛋白或血清白蛋白等运载蛋白偶联,然后进行给药。可以将治疗剂以任意的传统剂量形式进行给药。可以将活性成分单独给药,但优选的是将其作为药剂提供。药剂中至少含有一种上述的活性成分,并含有一种或多种相容性载体。每种载体都要与其他成分相容,并且不会对患者造成损伤,因而必须具有药学或生理相容性。这些药剂包括适用于口服给药、直肠给药、鼻部给药,或胃肠外(包括皮下、肌内、静脉内和皮内)给药的那些药剂。这些药剂可方便地以单位剂量形式提供,并且可利用制药领域中熟知的任意方法加以制备。可参考,例如,Gilman,et al.(1990年版)《
Goodman & Gilman’s:治疗的药学基础》,第8版,Pergamon Press;和《
Remington药物科学》最新版),Mack Publishing Co.,Easton,Penn.;Avis,et al.(1993年版)《
药物剂型:胃肠外给药法》Dekker,NY;Lieberman,et al.(1990年版)《
药物剂型:片剂》Dekker,NY;以及Lieberman,et al.(1990年版)《
药物剂型:分散体系》Dekker,NY。本发明的治疗方法可与其他治疗剂联合或结合使用,特别是其他IL-1家族成员的激动剂或拮抗剂。
IX.配体
上文有关Toll配体的描述可提供鉴定配体的方法。该配体应能够以适当的高亲和力与相应受体特异性结合。有多种构建体可用于对受体进行标记,以检测其配体。例如,可以直接标记DTLR,也可将其融合于二次标记型标记物,如FLAG或其他表位标记等等,这些方法均可用于实现对受体的检测。检测的方法可以是组织学方法,如用于生化纯化的亲和法,也可以是标记或筛选表达克隆的方法。还可利用双杂交选择系统,由现有的DTLR序列获得适当的构建体。可参考,例如,Fields and Song(1989)
Nature 340:245-246。
一般而言,有关DTLRs的描述可类似地应用于分别针对DTLR2、DTLR3、DTLR4、DTLR5、DTLR6、DTLR7、DTLR8、DTLR9和/或DTLR10试剂及组合物的特定实施方案。
参考下文的实施例可以更好地了解本发明的适用范围,但这些实施例不是要将本发明限制于特定的实施方案。
实施例
I.常规方法
一些标准方法是来自,例如,Manistis,et al.(1982)《
分 子克隆实验手册》Cold Spring Harbor Laboratory,Cold SpringHarbor Press;Sambrook,et al.(1989)《
分子克隆实验手册》(第二版),1-3卷,CSH Press,NY;Ausubel,et al.,《
生物学》,Greene Publishing Associates,Brooklyn,NY;或Ausubel,et al.,(1987及增刊)《
最新分子生物学实验方法》,Greene/Wiley,NewYork的描述或参考文献。蛋白纯化方法包括硫酸铵沉淀法、柱层析法、电泳法、离心法、结晶法,等等。可参考,例如,Ausubel,et al.,(1987及定期增刊);Coligan,et al.,(1996年版及定期增刊)《
最新蛋白质科学实验方法》Greene/Wiley,New York;Deutschet(1990)《
酶学方法》,第182卷中的“蛋白质纯化指南”和该丛书中的其他部分;以及Pharmacia,Piscataway,N.J.或Bio-Rad,Richmond,CA等制造商为其蛋白纯化制品的使用提供的参考文献。利用重组技术可以实现与适当片段的融合,例如与FLAG序列或能够被蛋白酶去除的等价序列相融合。可参考,例如,Hochuli(1989)Chemische Industrie 12:69-70;Setlow(ed.)《
遗传工程的原理 和方法》12:87-98,Plenum Press,N.Y.中的Hochuli(1990)“通过金属螯合吸附来纯化重组蛋白”;以及Crowe,et al.,(1992)《
QIAexpress:高水平表达和蛋白纯化系统》QUIAGEN,Inc.,Chatsworth,CA。
有关标准的免疫学技术和检测方法的描述可参考,例如,Hertzenberg,et al.(1996年版)《
Weir’s实验免疫学手册》1-4卷,Blackwell Science;Coligan(1991)《
最新免疫学实验方法》,Wiley/Greene,NY;以及《
酶学方法》第70、73、74、84、92、93、108、116、121、132、150、162和163卷。
检测血管生物学活动的方法已为该领域所熟知。包括对肿瘤或其他组织中的血管发生和血管舒张活动进行检测,例如检测动脉平滑肌增殖(可参考,例如,Koyoma et al.(1996)
Cell 87:1069-1078)、单核细胞与血管上皮的粘联(参考McEvoy,et al.(1997)
J.Exp. Med.185:2069-2077),等等。还可参考Ross(1993)
Nature362:801-809;Rekhter and Gordon(1995)
Am.J.Pathol.147:668-677;Thyberg,et al.(1990)
Atherosclerosis10:966-990;以及Gumbiner(1996)
Cell 84:345-357。
有关神经细胞生物学活动检测方法的描述,可参考,例如,Wouterlood《
神经科学实验方法》(1995年版)第10章,Elsevier;《
神经科学方法》Academic Press;以及《神经学方法》Humana Press,Totowa,NJ。有关发育系统方法学的描述,可参考,例如,Meisami(ed.)《
人类生长和发育生物学手册》CRC Press;和Chrispeels(ed.)《
发育生物学中的分子技术和方法》Interscience。
计算机序列分析的实现可采用,例如,现有软件程序,包括由GCG(U.Wisconsin)和GenBank活动的程序。此外,还可以使用公共序列数据库,如GenBank、NCBI、EMBO的数据库和其他数据库。可利用这些生物信息学工具来预测跨膜基序和其他重要基序。
适用于IL-10受体的多种技术也可同样适用于DTLRs,如USSN08/110,683(IL-10受体)中的技术,其内容在此完全引入作为参考。
II.新的人类受体家族
缩写:DTLR,DNAX Toll-样受体;IL-1R,白介素-1受体;TH,Toll同源性;LRR,亮氨酸富集重复序列;EST,表达序列标记;STS,序列标记位点;FISH,荧光原位杂交。
果蝇Toll胞质区与人白介素-1(IL-1)受体之间的序列同源性的发现使人们确信这两种分子都可触发与Rel型转录因子核定位有关的信号通路。在昆虫和脊椎动物中,这种保守的信号机制控制着在进化上比较古老的免疫应答。本发明描述的是一类新的假定人受体的分子克隆方法,这些受体的蛋白结构无论是胞内段还是胞外段都与果蝇Toll十分相似。被命名为DTLRs 1-5的五种人Toll-样受体都可能是果蝇分子的直向同源体,因而可能是人类先天免疫力中未知的重要成分;令人感兴趣的是,DTLRs在脊椎动物中的进化保守性表明这些DTLRs具有另一种与Toll在果蝇胚胎背腹形成过程中的作用相类似的功能,即作为早期形态发生模式形成过程中的调节物。多组织mRNA印迹的结果说明,人DTLRs的表达模式有显著的不同。此外,荧光原位杂交和序列标记点数据库分析的结果表明,DTLR同源基因位于4号染色体(DTLRs 1、2和3)、9号染色体(DTLR4)和1号染色体(DTLR5)上。对参与比较的多种昆虫及人DTLRs、脊椎动物IL-1受体、MyD88因子和植物抗病蛋白的Toll同源域进行结构预测,由此可发现一种带有酸性活性位点的平行β/α折叠;值得注意的是,在广泛参与细菌感觉信息传导的一类反应调节因子中也发现了与其类似的结构。
在果蝇与人的普通胚胎形状和模式中已蕴含了使其产生显著形态差异的因素,但由这些因素引发的细胞复杂性却有很大不同。DeRobertis和Sasai(1996)
Nature 380:37-40;和Arendt andNübler-Jung(1997)
Mech.Develop.61:7-21。昆虫与脊椎动物在发育方向上的趋异性是由非常类似的信号通路决定的,这说明由不同基因库形成的蛋白网络和生化机制具有很强的保守性。Miklos andRubin(1996)
Cell 86:521-529;和Chothia(1994)
Develop.1994Suppl.,27-33。绘制这些调节通路的进化图的有效方法是通过种间蛋白序列和结构的比较来推断其可能的分子成分(和生物学功能)。Miklos and Rubin(1996)
Cell 86:521-529;Chothia(1994)
Develop.1994 Suppl.,27-33(3-5);和Banfi,et al.(1996)
Nature Genet.13:167-174。
体轴的确立,无论是由先天不对称性导致还是由外界信号触发,都是胚胎发育过程中普遍的一个关键步骤。DeRobertis and Sasai(1996)
Nature 380:37-40;和Arendt and Nübler-Jung(1997)Mech.Develop.61:7-21。目前已对背腹极性形成模型系统的系统发生原理和细胞机制给予了特别的关注。DeRobertis and Sasai(1996)Nature 380:37-40;和Arendt and Nübler-Jung(1997)
Mech. Develop.61:7-21。通过对果蝇胚胎的研究,人们提出了一种典型的分子策略来解释这种转变,即少量基因的顺序作用使转录因子Dorsal形成腹侧化梯度。St.Johnston and NüssleinVolhard(1992)
Cell68:201-219;和Morisato and Anderson(1995)
Ann.Rev.Genet.29:371-399。
该信号通路的中心是跨膜受体Toll,该受体可与母体分泌的腹侧因子Sp_tzle结合,然后将信号传递到胞质侧结合的辅助分子Tube,并且激活Ser/Thr激酶Pelle,这种激酶对Dorsal与其抑制剂Cactus的解离具有催化作用,从而使Dorsal能够迁移至腹侧核(Morisatoand Anderson(1995)
Ann.Rev.Genet.29:371-399;和Belvin andAnderson(1996)
Ann.Rev.Cell Develop.Biol.12:393-416)。此外,成体果蝇中的Toll通路还可以对强抗菌因子的诱导进行调控(Lemaitre,et al.(1996)
Cell 86:973-983);在果蝇的免疫防御中,这种作用可加强与其机制类似的IL-1通路,在脊椎动物体内,IL-1通路控制着宿主的免疫应答和炎症反应。Belvin and Anderson(1996)
Ann.Rey.Cell Develop.Biol.12:393-416;和Wasserman(1993)
Molec.Biol.Cell 4:767-771。IL-1受体中的Toll相关胞质区可指导Pelle样激酶IRAK的结合以及潜伏NF-κB/I-κB复合体的激活,该复合体相当于Dorsal与Cactus的结合体。Belvin andAnderson(1996)
Ann.Rev.Cell Develop.Biol.12:393-416;和Wasserman(1993)
Molec.Biol.Cell 4:767-771。
本发明描述了四种被称为DTLRs 2-5的(按Chiang and Beachy(194)
Mech.Develop.47:225-239的命名)新的Toll样受体的克隆方法及分子特性,与脊椎动物IL-1受体相比,这些分子构成了一种与果蝇Toll同源体关系更密切的受体家族。DTLR的序列来源于人ESTs;这些cDNAs片段被用来绘制上述五种DTLRs在人类组织中的完整表达模式和同源基因的染色体定位图谱,以及缩小用于获得全长cDNA的cDNA库的选择范围。在其他研究成果的支持下(Banfi,etal.(1996)
Nature Genet.13:167-174;和Wang.et al.(1996)J.Biol.Chem.271:4468-4476),我们根据结构保守性和分子简约性组装了一种与果蝇的主要调节模式相对应的人类生物系统。此外还根据Toll同源(TH)结构域的推荐三级折叠结构提出了Toll信号通路的生物学机制,该结构域是DTLRs、IL-1受体大家族、哺乳动物MyD88因子和植物抗病蛋白的共有核心组件。Mitcham,et al.(1996)
J.Biol.Chem.271:5777-5783;和Hardiman,et al.(1996)
Oncogene 13:2467-2475。我们认为,参与昆虫、植物和动物的形态发生及原发免疫的信号通路(Belvin and Anderson(1996)Ann.Rev.Cell Develop.Biol.12:393-416;和Wilson.et al.(1997)
Curr.Biol.7:175-178)可能来源于细菌的双组分通路。
计算机分析
利用国家生物技术信息中心(NCBI)的BLAST服务器从EST数据库(dbEST)中获得与昆虫DTLRs相关的人序列(Altschul,et al.(1994)
Nature Genet.6:119-129)。利用基于模式和基于图形的更灵敏的方法(Bork and Gibson(1996)
Meth.Enzymol.266:163-184)分离出非冗余数据库中提供的脊椎动物蛋白和植物蛋白所共有的DTLR家族信号结构域。DTLR胞内区和胞外区序列的渐进性比对是利用ClustalW来完成(Thompson,et al.(1994)
Nucleic Acids Res.22:4673-4680);此外还通过Neighbor-Joining算法用该程序计算出比对序列的分支等级(5000次自举重复保证了3个分组结果的可信度)。
用Consensus程序(网址http://www.bork.embl-heidelberg.de/Alignment/consensus.html)绘制出严格度不同的几种保守比较图。用PRINTS蛋白指纹库(http://www.biochem.ucl.ac.uk/bsm/dbbrowser/PRINTS/PRINTS.html)在DTLRs胞外区明确鉴别出大量的亮氨酸富集重复(LRRs)(Attwood,et al.(1997)
Nucleic Acids Res.25:212-217),其使用的是一种能够与不同LRRs的N-端和C-端特性灵活匹配的复合基序(PRINTS代码Leurichrpt)。用三态精确度大于72%的两种预测算法推导出胞内结构域比较结果的共有二级结构,以此来提高识别的效果(Fischer,et al.(1996)
FASEB J.10:126-136)。神经网络程序PHD(Rost and Sander(1994)Proteins 19:55-72)和统计预测方法DSC(King and Sternberg(1996)
Protein Sci.5:2298-2310)都可以通过网络服务器来实现(链接地址分别为http://www.embl-heidelberg.de/predictprotein/phd_pred.html和http://bonsai.lif.icnet.uk/bmm/dsc/dsc_read_align.html)。胞内区可以编码THD区,有关其论述,可参考,例如,Hardiman,etal.(1996)
Oncogene 13:2467-2475;和Rock,et al.(1998)
Proc. Nat’l Acad.Sci.USA 95:588-593,其内容均在此引入作为参考。该结构域在将磷酸基团转移至底物的受体信号机制中具有十分重要的作用。
人DTLR全长cDNAs的克隆方法
利用来源于Toll样Humrsc786序列(GenBank登录号D13637)(Nomura,et al.(1994)
DNA Res.1:27-35)的PCR引物来探测由人红白血病TF-1细胞系获得的cDNA库(Kitamura,et al.(1989)Blood 73:375-380),以产生DTLRl的cDNA序列。其余的DTLR序列则来自dbEST,通过Research Genetics(Huntsville,AL)从I.M.A.G.E.协会(Lennon,et al.(1996)
Genomics 33:151-152)获得的相应EST克隆是:Clone编号80633和117262(DTLR2)、144675(DTLR3)、202057(DTLR4)以及277229(DTLR5)。人DTLRs2-4的全长cDNAs的克隆方法是分别对λgt10噬菌体、成人肺、胎盘和胎儿肝脏的5’-Stretch Plus cDNA库(Clontech)进行DNA杂交筛选;DTLR5的序列来源于人多发性硬化症斑块的EST。对所有阳性克隆进行测序和序列比较,以确定每种DTLR的ORFs:DTLR1(2366bp的克隆,ORF含有786个氨基酸)、DTLR2(2600bp的克隆,ORF含有784个氨基酸)、DTLR3(3029bp的克隆,ORF含有904个氨基酸)、DTLR4(3811bp的克隆,ORF含有879个氨基酸),以及DTLR5(1275bp的克隆,ORF含有370个氨基酸)。DTLR6-10也使用类似的方法。与DTLR3和DTLR4杂交的探针的产生方法是分别以人胎盘(Stratagene)和成人肝(Clontech)的cDNA库为模板进行PCR;引物对来源于相应的EST序列。PCR反应采用的是T.aquaticusTaqplus DNA聚合酶(Stratagene),其反应条件如下:1x(94℃ 2分钟),30x(55℃ 20秒;72℃ 30秒;94℃ 20秒),1x(72℃8分钟)。用于DTLR2全长cDNA筛选的探针是通过第一个EST克隆(ID#80633)的EcoRI/XbaI酶切产生的一种900bp片段。
mRNA印迹和染色体定位
每道含有约2μg poly(A)+RNA的人多种组织(Cat#1,2)及癌细胞系的印迹膜(Cat#7751-1)购自Clontech(Palo Alto,CA)。用于DTLRs1-4的探针是分离的全长cDNAs,用于DTLR5的探针是EST克隆(ID#277229)质粒的插入片段。简单地说,是通过Amersham的Rediprime随机引物标记试剂盒(RPN1633)用[α-32P]dATP对探针进行放射性标记。预杂交和杂交是在65℃和0.5M Na2HPO4、7%SDS、0.5M EDTA(pH 8.0)的条件下进行。所有严格洗涤的方法都是在65℃条件下先用2xSSC、0.1%SDS洗两次,每次40分钟,再用0.1xSSC、0.1%SDS洗20分钟。然后将印迹膜置于X-射线胶片(Kodak)上,并附加增感屏,使其于-70℃曝光。通过cDNA库印迹法(14)用选定的人DTLR克隆进行更详细的研究,以检测造血细胞亚类中这些基因的表达。
按照Heng and Tsui(1994)
Meth.Molec.Biol.33:109-122中描述的荧光原位杂交法(FISH)用不同的全长序列(DTLRs2-4)或部分序列(DTLR5)作为探针进行人染色体制图。这些分析均由SeeDNABiotech Inc.(Ontario,Canada)完成。利用网络服务器在人-鼠畸形同源数据库(http://www.hgmp.mrc.ac.uk/DHMHD/hum_chromel.html)中搜索与定位的DTLR基因有关的人类综合症(或位于同线基因位点的小鼠缺陷)。对DTLRs6-10也可采用类似的方法。昆虫及人DTLR胞外结构域的保守结构
果蝇的Toll家族至少包括四种不同的基因产物:参与果蝇胚胎背腹模式形成的典型受体Toll(Morisato and anderson(1995)
Ann. Rev.Genet.29:371-399),第二种是同样参与胚胎早期发育的“18Wheeler”(18w)(Chiang and Beachy(1994)
Mech.Develop.47:225-239;Eldon,et al.(1994)
Develop.120:885-899);另两种是由雄性特异性转录本(Mst)基因位点下游的不完整Toll样ORFs预测的受体或由“序列标记位点”(STS)Dm2245(GenBank号G01378)编码的受体(Mitcham,et al.(1996)
J.Biol.Chem.271:5777-5783)。Toll和18w的胞外区特征性地含有约24个氨基酸的不完整LRR基序(Chiang and Beachy(1994)
Mech.Develop.47:225-239;和Eldon,et al.(1994)
Develop.120:885-899)。相似LRRs的串接排列通常可形成不同细胞表面分子的粘联性触角,其通用三级结构可能与核糖核酸酶抑制剂的马蹄形折叠架相似,其中的17个LRRs表现为一种含有28个残基的重复β/α发卡状基序(Buchanan and Gay(1996)
Prog.Biophys.Molec.Biol.65:1-44)。有人提出,蛇根碱受体多LRR胞外域与胱氨酸折叠簇糖蛋白激素的结合可能是通过弧形β折叠的凹面来进行(Kajava,et al.(1995)
Structure 3:867-877),Toll对Sp_tzle的特异性识别可能也符合该模型;令人感兴趣的是,Sp_tzle和果蝇孤配体Trunk中的半胱氨酸模式预示其具有类似的胱氨酸簇三级结构(Belvin andAnderson(1996)
Ann.Rev.Cell Develop.Biol.12:393-416;和Casanova,et al.(1995)
Genes Develop.9:2539-2544)。
DTLRs1-4分别具有18、19、24和22个类似的LRR排列(DTLR5不完整链目前含有近膜端的4个LRRs),序列及模式分析(Altschul,et al.(1994)
Nature Genet.6:119-129;和Bork and Bibson(1996)Meth.Enzymol.266:162-184)的结果表明,与其关系最为密切的是分别含有22个和31个LRR的Toll胞外域和18w胞外域(Mst ORF片段含有16个LRRs)(图1)。Toll、18w和Mst ORF的胞外域都带有位置不同的一种约90个残基的半胱氨酸富集区(与膜边界的距离分别为4、6和2个LRRs),与此明显不同的是,人DTLR链都不包含该区域。这些半胱氨酸簇均由“顶部”(使一个LRR终止)和“底部”(叠加在一个LRR之上)这两种不同的组件构成(Chiang andBeachy(1994)
Mech.Develop.47:225-239;Eldon,et al.(1994)Develop.120:885-899;和Buchanan and Gay(1996)
Prog.Biophys. Molec.Biol.65:1-44);在果蝇和人的DTLRs中,“顶部”组件还作为保守的近膜间隔区重复出现(图1)。我们认为,当“顶部”与“底部”结合时,果蝇受体(和其他LRR蛋白)中灵活定位的半胱氨酸簇可形成一种具有配对末端的紧密组件,该组件可以在不改变DTLR胞外域总体折叠的情况下插入到任一对LRRs之间;在其他蛋白的结构中也含有类似的“突出”结构域(Russell(1994)
Protein Engin.7:1407-1410)。
TH信号结构域的分子设计
通过Toll受体与I型IL-1受体(IL-1R1)的序列比较发现了一种相似性较低的胞质结构域,据推测,这种约200个氨基酸的结构域可通过类似的Rel型转录因子来调节信号通路。Belvin and Aderson(1996)
Ann.Rev.Cell Develop.Biol.12:393-416;和(Belvinand Aderson(1996)
Ann.Rev.Cell Develop.Biol.12:393-416;和Wasserman(1993)
Molec.Biol.Cell 4:767-771)。最近发现,该功能示意图还包括两种来源于烟草和亚麻的植物抗病蛋白,其特征是含有一种N-端TH组件,随后是核苷酸结合区(NTPase)和LRR区(Wilson,et al.(1997)
Curr.Biol.7:175-178);相比之下,MyD88在TH链前端含有“死亡结构域”,这是一种细胞内髓样分化标记(Mitcham,et al.(1996)
J.Biol.Chem.271:5777-5783;和Hardiman,et al.(1996)
Oncogene 13:2467-2475)(图1)。新IL-1型受体包括附属信号分子IL-1R3、孤受体IL-1R4(也被称为ST2/Fit-1/T1)、IL-1R5(IL-1R相关蛋白),以及IL-1R6(IL-lR相关蛋白-2)(Mitcham,et al.(1996)
J.Biol.Chem.271:5777-5783;Hardiman,et al.(1996)
Oncogene 13:2467-2475)。我们已利用新的人DTLR序列对TH共用组件进行了构象分析,并针对其进化路线提出了一种结构上的解释:最小的TH结构域折叠是由10个含有128个氨基酸的保守序列区决组成;比较结果中的空位标明了序列和长度可变环的可能位置(图2A-2B)。
PHD(Rost and Sander(1994)
Proteins 19:55-72)和DSC(Kingand Sternberg(1996)
Protein Sci.5:2298-2310)是利用多种比较序列的保守和变异模式进行预测的两种算法,由这两种算法产生的TH信号组件结果非常明显,并且彼此吻合(图2A-2B)。每个区块都含有一种不连续的二级结构元件:β链(标为A-E)与α螺旋(编号1-5)交替的模式是β/α类折叠的特征,即平行β-片层的两侧表面均带有α螺旋。据推测,疏水性β-链A、C和D构成了β-片层的“内部”板条结构,而较短的两亲性β-链B和E构成了独特的“边缘”部分(图2A-2B)。这种分配方式与核心β-片层中B-A-C-D-E的β链顺序相吻合(图2C);折叠比较(“制图法”)及识别(“穿线法”)程序(Fischer,et al.(1996)
FASEB J.10:126-136)可明显地重现这种双缠绕β/α拓扑结构。对TH结构域的这种大概结构进行功能预测的结果十分令人惊奇,据多序列比较结果显示,许多保守的带电残基都位于β-折叠片的C-末端:βA末端的Asp16残基(区块编号方案——图2A-2B)、βB之后的Arg39和Asp40、α3的第一个转角中的Glu75,以及βD-α4环中或βE之后的保守性较低的Glu/Asp残基(图2A-2B)。其他4个保守残基(Asp7残基、Glu28残基和Arg57-Arg/Lys58残基对)的位置与β片层的对侧N末端盐桥网络相容。其他DTLR实施方案的比较结果也表现出类似的特征,特别重要的是具有这些特征的肽段,如具有这些特征的20个氨基酸的序列段。
TH结构域的信号功能依赖于其结构的完整性。目前已对IL-1R1和Toll的TH组件内的失活突变或缺失进行了分类。Heguy,et al.(1992)
J.Biol.Chem.267:2605-2609;Croston,et al.(1995)J.Biol.Chem.270:16514-16517;Schneider,et al.(1991)
Genes Develop.5:797-807;Norris and Manley.(1992)
Genes Develop.6:1654-1667;Norris and Manley(1995)
Genes Develop.9:358-369;和Norris and Manley(1996)
Genes Develop.10:862-872。与人DTLR1-5最小化TH结构域之后的延伸链(长度分别为8、0、6、22和18个残基)最相似的是长度为4个氨基酸残基的Mst ORF短“尾”。Toll和18w则带有不相关的102个残基和207个残基的尾(图2A-2B),其作用可能是对融合的TH结构域的信号功能进行负调节。Norris and Manley(1995)
Genes Develop.9:358-369;和Norris and Manley(1996)
Genes Develop.10:862-872。
利用由多序列比较结构产生的系统树可以最好地阐明带有TH结构域的异种蛋白之间的进化关系(图3)。4个主要的分支将植物蛋白、MyD88因子、IL-1受体和Toll样分子分隔开来;最后一个分支将果蝇和人的DTLRs分组。
人DTLR基因的染色体分布
为了研究人DTLR新基因家族的遗传连锁情况,我们通过FISH对5种基因中的4种进行了染色体基因位点制图(图4)。在此之前,DTLR1基因的图谱已通过人类基因组计划而确定:STS数据库中具有Humrsc786 cDNA的基因位点(dbSTS登录号G06709,相当于STSWI-7804或SHGC-12827)(Nomura,et al.(1994)
DNA Res.1:27-35),它将该基因定位于第4号染色体约4p14的D4S1587-D42405标记之间(50-56cM)。这一定位最近已被FISH分析结果所证实。Taguchi,et al.(1996)
Genomics 32:486-488。我们的这项工作将剩余DTLR基因的位点定位于染色体4q32(DTLR2)、4q35(DTLR3)、9q32-33(DTLR4)和1q33.3(DTLR5)。在此期间还产生了亲本DTLR2 EST(克隆编号80633)的STS(dbSTS登录号为T57791的STS SHGC-33147),并根据我们的检测结果将其定位于第4号染色体4q32的D4S424-D4S1548标记之间(143-153cM)。
DTLR基因的差异表达
Toll和18w在果蝇体内均表现出复杂的时空表达模式,说明二者在胚胎模式形成之外可能还具有其他功能。St.Johnston andNüsslein-Volhard(1992)
Cell 68:201-219;Morisato and Anderson(1995)
Ann.Rev.Genet.29:371-399;Belvin and Anderson(1996)
Ann.Rev.Cell Develop.Biol.12:39 3-416;Lemaitre,et al.(1996)
Cell 86:973-983;Chiang and Beachy(1994)
Mech. Develop.47:225-239;和Eldon,et al.(1994)
Develop.120:885-899。我们已检测了DTLR转录体的空间分布,方法是用放射性标记的DTLR cDNAs对不同的人类组织和癌细胞系进行mRNA印迹分析(图5)。结果表明,DTLR1以高于其他受体的水平被广泛表达。在卵巢和脾中分别存在DTLR1的3.0kB“短”转录体和8.0kB“长”转录体(图5,A和B面),估计这是可变剪接的结果。癌细胞mRNA的结果图表明,伯基特淋巴瘤Raji细胞系中也存在明显的DTLR1过表达(图5,C面)。DTLR2 mRNA的表达没有DTLR1那么广泛,在肺中检测到的转录体为4.0kB,而心脏、脑和肌肉中明显含有4.4kB的转录体。DTLR3的组织分布模式与DTLR2类似(图5,E面)。此外,DTLR3的转录体主要有两种,长度分别为4.0和6.0kB,并且在胎盘和胰腺中的表达水平最高。相比之下,DTLR4和DTLR5的信使转录体似乎具有很强的组织特异性。只在胎盘中检测到DTLR4的一种转录体,长度约为7.0kB。在卵巢和外周血单核细胞中观测到一种很弱的DTLR5信号,长度为4.0kB。
进化上比较古老的调节系统的组分
利用基因组比较方法可以重建信号通路的原始分子蓝图和趋异方向。Miklos and Rubin(1996)
Cell 86:521-529;Chothia(1994)Develop.1994 Suppl.,27-33;Banfi,et al.(1996)
J.Biol.Chem.271:4468-4476。我们利用该推理方法鉴定出一种新的人类基因家族,该家族目前编码5种类似的受体,即DTLRs1-5,这些受体是以Toll为代表的果蝇基因家族的直向进化对应物(图1-3)。人与果蝇DTLRs的结构保守性,包括保守的LRR胞外结构域和胞内TH区块(图1),提示果蝇体内与Toll偶联的稳健通路也存在于脊椎动物体内。最好的证据则来自一种重复通路:多种IL-1系统及其受体融合TH结构域库、IRAK、NF-κB和I-κB同源物(Belvin and Anderson(1996)Ann.Rev.Cell Develop.Biol.12:393-416;Wasserman(1993)Molec.Biol.Cell 4:767-771;Hardiman,et al.(1996)
Oncogene13:2467-2475;和Cao,et al.(1996)
Science 271:1128-1131);此外还鉴定出一种Tube样因子。目前还不清楚DTLRs是否能够与IL-IR信号通路有效偶联,也不清楚使用一组平行蛋白是否可行。据推测,人DTLRs与IL-1受体的不同之处在于其LRR架可保持对Sp_tzle/Trunk相关胱氨酸簇因子的亲和力;目前已分离出一些适用于该模型的候选DTLR配体(被称为PENs)。
利用在信号通路中相互作用的蛋白折叠的保守性可以对信号转导的生化机制进行判断。Miklos and Rubin(1996)
Cell 86:521-529;Chothia(1994)
Develop.1994 Suppl.,27-33。目前的Toll信号通路示意图中包含一些通过其结构、作用或结果来严密定义的分子:Pelle是一种Ser/Thr激酶(磷酸化),Dorsal是一种NF-κB样转录因子(与DNA结合),Cactus是一种锚蛋白重复抑制因子(与Dorsal结合,降解)。Belvin and Anderson(1996)
Ann.Rev.Cell Develop. Biol.12:393-416。相比之下,Toll TH结构域和Tube的功能仍不清楚。与其它细胞因子受体相同(Heldin(1995)
Cell 80:213-223),配体介导的Toll二聚体形成可能是一种触发事件:Toll近膜区的游离半胱氨酸可用于产生组成型活性受体对(Schneider,et al.(1991)
Genes Develop.5:797-807),而Torso-Toll嵌合受体是以二聚体的形式发挥信号功能(Galindo,et al.(1995)
Develop.121:2209-2218);但Toll胞外域的严重截断或全部丧失可导致胞内域的信号作用出现混乱(Norris and Manley(1995)
Genes Develop.9:358-369;和Winans and Hashimoto(1995)
Molec.Biol.Cell6:587-596),这使人联想起带有催化结构域的癌基因受体(Heldin(1995)
Cell 80:213-223)。Tube位于膜上,它能够与Pelle的N端(死亡)结构域结合,并且被磷酸化,但是用双杂交分析方法未能检测到Toll-Tube或Toll-Pelle相互作用(Galindo,et al.(1995)Develop.121:2209-2218;和Gtoβhans,et al.(1994)
Nature 372:563-566);该结果说明Toll TH结构域的构象“状态”可通过某种方式对因子募集产生影响。Norris and Manley(1996)
Genes Develop.10:862-872;和Galindo,et al.(1995)
Develop.121:2209-2218。
在这些繁杂的问题中,最重要的是Toll TH组件的结构特性。为解决这个问题,我们根据昆虫、植物、脊椎动物以及人DTLR的TH序列的进化多样性提取出该保守蛋白的最小化核心,并将其用于结构预测和折叠识别(图2)。明显预测出的(β/α)5TH结构域折叠带有不对称的酸性残基簇,这种拓扑结构与细菌双组分通路中的应答调节因子相同(Volz(1993)
Biochemistry 32:11741-11753;和Parkinson(1993)
Cell 73:857-871)(图2A-2C)。典型的趋化调节因子CheY能够与核心β-片层C-末端的“天冬氨酸袋”中的二价阳离子短暂结合;该阳离子可提供静电稳定性,并且有利于激活恒定Asp的磷酸化。Volz(1993)
Biochemistry 32:11741-11753。TH结构域的酸性套也能够俘获阳离子,但激活作用和随后的信号作用则依赖于特异性结合一种负电基团:阴离子配体可陷落于精密的氢键网络,并以此来克服结合位点的强负电位。Ledvina,et al.(1996)
Proc.Natl.Acad. Sci.USA 93:6786-6791。令人感兴趣的是,在Tube/Pelle复合物与Toll的组装过程或植物和脊椎动物的类似系统中,TH结构域可能不只是作为被动支架,而是能作为真正的构象引发物主动参与信号转导系统。Toll二聚体形成可以通过调节性受体尾来促进其暴露(Norris and Manley(1995)
Genes Develop.9:358-369;Norrisand Manley(1996)
Genes Develop.10:862-872),或通过TH袋的小分子激活物来促进其结合,这可能是Tube/Pelle复合物有条件结合的原因。但细胞内的“游离”TH组件(Norris and Manley(1995)Genes Develop.9:358-369;Winans and Hashimoto(1995)
Molec. Biol.Cell 6:587-596)可通过激活和结合游离的Tube/Pelle复合物来发挥其CheY样引发物的催化作用。
形态发生受体和免疫防御
昆虫与脊椎动物免疫系统之间的进化关系被记录在DNA中:昆虫抗微生物因子编码基因的上游基序与已知能结合NF-κB转录因子的哺乳动物体急性期应答因子相似。Hultmark(1993)
Trends Genet.9:178-183。被细菌攻击后,Dorsal和两种Dorsal相关因子Dif及Relish都有助于诱导这些防御蛋白产生(Reichhart,et al.(1993)C.R.Acad.Sci.Paris 316:1218-1224;Ip,et al.(1993)
Cell75:753-763;和Dushay,et al.(1996)
Proc.Natl.Acad.Sci. USA 93:10343-10347);Toll或其他DTLRs可能会调节成体果蝇中的这些快速免疫应答(Lemaitre,et al.(1996)
Cell 86:973-983;和Rosetto.et al.(1995)
Biochem.Biophys.Res.Commun.209:111-116)。这些类似于脊椎动物IL-1炎症反应的机制可以证明Toll信号通路的功能多样性,并且表明胚胎模式形成和先天免疫之间存在古老的协同性(Belvin and Anderson(1996)
Ann.Rev.Cell Develop. Biol.12:393-416;Lemaitre,et al.(1996)
Cell 86:973-983;Wasserman(1993)
Molec.Biol.Cell 4:767-771;Wilson,et al.(1997)
Curt.Biol.7:175-178;Hultmark(1993)
Trends Genet.9:178-183;Reichhart,et al.(1993)
C.R.Acad.Sci.Paris316:1218-1224;Ip,et al.(1993)Cell 75:753-763;Dushay,etal.(1996)
Proc.Natl.Acad.Sci.USA 93:10343-10347;Rosetto,et al.(1995)
Biochem.Biophys.Res.Commun.209:111-116;Medzhitov and Janeway(1997)
Curr.Opin.Immunol.9:4-9;和Medzhitov and Janeway(1997)
Curr.Opin.Immunol.9:4-9)。昆虫和人的DTLR蛋白具有更高的同源性,因而在替换与IL-1系统类似的纯免疫系统方面,其生物学功能的重叠性也更强,并且可以在脊椎动物胚胎的背腹形成或其他变化中借用一些潜在的调节分子。DeRobertis and Sasai(1996)
Nature 380:37-40;和Arendt andNübler-Jung(1997)
Mech.Develop.61:7-21。
最近发现的脊椎动物Wnt模式形成因子Frizzled受体也反映在本文描述的新兴但稳健的人类受体家族中。Wang,et al.(1996)
J. Biol.Chem.271:4468-4476。由于在早期发育过程中有其他多种细胞因子-受体系统发挥作用(Lemaire and Kodjabachian(1996)Trends Genet.12:525-531),因而致密胚胎与细长成体的不同细胞环境可能是导致类似信号通路及其扩散性引发物,如形态发生和免疫防御中的DTLR,在不同时间产生不同生物学效应的简单原因。对昆虫、植物和人的Toll相关系统而言(Hardiman,et al.(1996)Oncogene 13:2467-2475;Wilson,et al.(1997)
Curr.Biol.7:175-178),这些信号要经过调节性TH结构域,令人感兴趣的是,该TH结构域类似于细菌的转导系统(Parkinson(1993)
Cell73:857-871)。
具体地说,DTLR6具有使其成为该家族成员的结构特性。此外,该家族的成员可能涉及多种主要的发育疾病,并且可参与先天免疫系统的功能。详细地说,DTLR6定位于X染色体,其位置是用于研究主要发育异常疾病的热点。例如可参考Sanger中心:人类X染色体网站http://www.sanger.ac.uk/HGP/ChrX/index.shtml;以及Baylor医学院人类基因组测序网站http://gc.bcm.tmc.edu:8088/cgi-bin/seq/home。
保存的PAC的登录号为AC003046。该登录号包括两种PACs的序列:RPC-164K3和RPC-263P4。这两种PAC序列定位于人染色体Xp22,在Baylor网站中是定位于STS标记DXS704和DXS7166之间。该区域是用于研究几种发育异常疾病的“热点”。
III.DTLR片段的PCR扩增
选择两种适当的引物序列(见表1-10)。选择含有适当信息的mRNA样品,如能够表达特定基因的样品,进行RT-PCR以产生部分或全长cDNA。可参考,例如,Innis,et al.(1990年版)《
PCR实验 方法及应用指南》Academic Press,San Diego,CA和Dieffenbachand Dveksler(1995年版)《
PCR引物实验手册》Cold Spring HarborPress,CSH,NY。由此来确定适用于在cDNA库中探测全长基因的序列。基因组中的DTLR6序列是连续的,其他DTLRs可能也是如此。因而对基因组DNA进行PCR可能会产生连续的全长序列,然后可以使用染色体步查法。作为选择,序列库中可含有与所述实施方案的某些部分相关的序列或其密切相关的形式,如可变剪接形式,等等。此外,还可以将表达克隆技术应用于cDNA库。
IV.DTLRs的组织分布
目前已获得了这些DTLRs的每种编码基因的信息。参考图5A-5F。对其他细胞和组织的检测可采用适当的技术,如PCR、免疫测定、杂交,等等。组织和器官的cDNA制品可由Clontech,Mountain View,CA等公司购买。按文中所述对天然表达源进行检测的方法也同样适用。
DNA印迹分析:用适当的限制酶对来源于灵长类动物扩增cDNA库的DNA(5μg)进行消化,以释放其插入序列,用1%琼脂糖凝胶进行电泳,然后转移到尼龙膜上(Schleicher and Schuell,Keene,NH)。
用于分离人mRNA的样品通常包括,例如:外周血单核细胞(单核细胞、T细胞、NK细胞、粒细胞、B细胞),静息型(T100);外周血单核细胞,用抗CD3活化2、6、12小时后收集(T101);T细胞,TH0克隆Mot 72,静息型(T102);T细胞,TH0克隆Mot 72,用抗CD28和抗CD3活化3、6、12小时后收集(T103);T细胞,TH0克隆Mot 72,用特定的肽无反应性处理2、7、12小时后收集(T104);T细胞,TH1克隆HY06,静息型(T107);T细胞,TH1克隆HY06,用抗CD28和抗CD3活化3、6、12小时后收集(T108);T细胞,TH1克隆HY06,用特定的肽无反应性处理2、6、12小时后收集(T109);T细胞,TH2克隆HY935,静息型(T110);T细胞,TH2克隆HY935,用抗CD28和抗CD3活化2、7、12小时后收集(T111);T细胞,在抗CD28、IL-4和抗IFN-γ中极化27天的CD4+CD45RO-T细胞,用抗CD3和抗CD28活化4小时至极化TH2(T116);Jurkat和Hut78肿瘤T细胞系,静息型(T117);T细胞克隆,合并的AD130.2、Tc783.12、Tc783.13、Tc783.58、Tc782.69,静息型(T118);T细胞,随机γδT细胞克隆,静息型(T119);脾细胞,静息型(B100);脾细胞,用抗CD40和IL-4活化(B101);EBV系B细胞,合并的WT49、RSB、JY、CVIR、721.221、RM3、HSY,静息型(B102);JY系B细胞,用PMA和离子霉素活化1、6小时后收集(B103);合并的NK20克隆,静息型(K100);合并的NK20克隆,用PMA和离子霉素活化6小时(K101);NKL克隆,来自LGL白血病患者的外周血,IL-2处理(K106);NK细胞毒性克隆640-A30-1,静息型(K107);TF1系造血细胞前体,用PMA和离子霉素活化1、6小时后收集(C100);U937前单核细胞系,静息型(M100);U937前单核细胞系,用PMA和离子霉素活化1、6小时后收集(M101);净化的单核细胞,用LPS、IFNγ、抗IL-10活化1、2、6、12、24小时后收集(M102);净化的单核细胞,用LPS、IFNγ、IL-10活化1、2、6、12、24小时后收集(M103);净化的单核细胞,用LPS、IFNγ、抗IL-10活化4、16小时后收集(M106);净化的单核细胞,用LPS、IFNγ、IL-10活化4、16小时后收集(M107);净化的单核细胞,用LPS活化1小时(M108);净化的单核细胞,用LPS活化6小时(M109);DC 70%CD1a+,来自CD34+GM-CSF,TNFα处理12天,静息型(D101);DC 70%CD1a+,来自CD34+GM-CSF,TNFα处理12天,用PMA和离子霉素活化1小时(D102);DC 70%CD1a+,来自CD 34+GM-CSF,TNFα处理12天,用PMA和离子霉素活化6小时(D103);DC 95%CD1a+,来自CD34+GM-CSF,TNFα处理12天后FACS分类,用PMA和离子霉素活化1、6小时后收集(D104);DC 95%CD14+,不含CD34+GM-CSF,TNFα处理12天后FACS分类,用PMA和离子霉素活化1、6小时后收集(D105);DC CD1a+CD86+,来自CD34+GM-CSF,TNFα处理12天后FACS分类,用PMA和离子霉素活化1、6小时后收集(D106);来自单核细胞GM-CSF的DC,IL-4处理5天,静息型(D107);来自单核细胞GM-CSF的DC,IL-4处理5天,静息型(D108);来自单核细胞GM-CSF的DC,IL-4处理5天,用LPS活化4、16小时后收集(D109);来自单核细胞GM-CSF的DC,IL-4处理5天,用TNFα、单核细胞上清液活化4、16小时后收集(D110);平滑肌瘤L11良性肿瘤(X101);正常子宫肌层M5(0115);恶性平滑肌肉瘤GS1(X103);MRC5系肺纤维母细胞瘤,用PMA和离子霉素活化1、6小时后收集(C101);CHA系肾上皮癌细胞,用PMA和离子霉素活化1、6小时后收集(C102);28周男性胎肾(0100);28周男性胎肺(0101);28周男性胎肝(0102);28周男性胎心(0103);28周男性胎脑(0104);28周男性胎儿胆囊(0106);28周男性胎儿小肠(0107);28周男性胎儿脂肪组织(0108);25周女性胎儿卵巢(0109);25周女性胎儿子宫(0110);28周男性胎儿睾丸(0111);28周男性胎脾(0112);28周成熟胎盘(0113);以及发炎的扁桃体,来自12岁的个体(X100)。
用于分离小鼠mRNA的样品包括,例如:静息的小鼠成纤维细胞L细胞系(C200);Braf:ER(Braf与雌激素受体的融合体)转染的细胞,对照;T细胞,TH1极化(来自脾的Me114幼CD4+细胞,用IFN-γ和抗IL-4极化7天;T200);T细胞,TH2极化(来自脾的Me114幼CD4+细胞,用IL-4和抗IFN-γ极化7天;T201);T细胞,TH1高度极化(参考Openshaw,et al.(1995)
J.Exp.Med.182:1357-1367;用抗CI-3活化2、6、16小时后收集;T202);T细胞,TH2高度极化(参考Openshaw,et al.(1995)
J.Exp.Med.182:1357-1367;用抗CD-3活化2、6、16小时后收集;T203);CD44-CD25+前T细胞,从胸腺中分类获得(T204);TH1 T细胞克隆D1.1,在最后一次抗原刺激之后静息3周(T205);TH1 T细胞克隆D1.1,用10μg/mlConA刺激15小时(T206);TH2 T细胞克隆CDC35,在最后一次抗原刺激之后静息3周(T207);TH2 T细胞克隆CDC35,用10μg/mlConA刺激15小时(T208);来自脾的Me114+幼T细胞,静息型(T209);Me114+T细胞,用IFN-γ/IL12/抗IL-4极化6、12、24小时至TH1后收集(T210);Me114+T细胞,用IL-4/抗IFN-γ极化6、13、24小时至TH2后收集(T211);未经刺激的成熟B细胞白血病A20细胞系(B200);未经刺激的CH12系B细胞(B201);未经刺激的脾大B细胞(B202);整个脾的B细胞,LPS活化(B203);经三碘苯甲酰氨基葡糖富集的脾树突状细胞,静息型(D200);骨髓树突状细胞,静息型(D201);单核细胞RAW264.7细胞系,用LPS活化4小时(M200);来自GM和M-CSF的骨髓巨噬细胞(M201);巨噬细胞J774细胞系,静息型(M202);巨噬细胞J774细胞系,用LPS和抗IL-10处理0.5、1、3、6、12小时后收集(M203);巨噬细胞J774细胞系,用LPS和IL-10处理0.5、1、3、5、12小时后收集(M203);气雾剂诱发的小鼠肺组织,Th2致敏,用气雾剂OVA诱发7、14、23小时后收集(参考Garlisi,et al.(1995)
临床免疫学和免疫病理学,75:75-83;X206);日本圆线虫感染的肺组织(参考Coffman,etal.(1989)
Science 245:308-310;X200);正常的成熟全肺(0200);全肺,rag-1(参考Schwarz,et al.(1993)
Immunodeficiency4:249-252;0205);经IL-10处理的脾(参考Kuhn,et al.(1991)Cell 75:263-274;X201);正常的成熟全脾(0201);全脾,rag-1(0207);经IL-10处理的派尔集合淋巴结(0202);正常的全派尔集合淋巴结(0210);经IL-10处理的肠系膜淋巴结(X203);正常的总肠系膜淋巴结(0211);经IL-10处理的结肠(X203);正常全结肠(0212);NOD小鼠胰腺(参考Makino,et al.(1980)
Jikken Dobutsu 29:1-13;X205);全胸腺,rag-1(0208);全肾,rag-1(0209);全心脏,rag-1(0202);全脑,rag-1(0203);全睾丸,rag-1(0204);全肝脏,rag-1(0206);大鼠正常关节组织(0300);和大鼠关节炎性关节组织(X300)。
检测发现,DTLR10能够在2型前树突状细胞(pDC2)中高水平表达。可参考,例如,Rissoan,et al.(1999)
Science 283:1183-1186;和Siegal,et al.(1999)
Science 284:1835-1837。但在单核细胞中不表达。DTLR10的局限性表达进一步表明其在宿主免疫防御中的受体作用。pDC 2细胞是天然的产干扰素细胞(NIPC),它可以在对单纯疱疹病毒感染的应答过程中产生大量的IFNα。
V.DTLRs物种对应物的克隆方法
可使用多种策略来获得这些DTLRs的物种对应物,优选的是来源于其他灵长类动物。一种方法是利用近亲物种的DNA特征进行杂交。它可作为对进化上相似的物种进行研究的有效中间步骤。另一种方法是使用特定的PCR引物,该方法的基础是要确定人类等特定物种的基因之间的相似部分或差异部分,如高度保守或非保守的多肽或核苷酸序列区域。作为选择,还可将抗体用于表达克隆化。
VI.哺乳动物DTLR蛋白的产生
设计一种适当的融合构建体,如GST融合构建体,用于,例如,在大肠杆菌中表达。举例来说,可以构建一种小鼠IGIF pGex质粒,并将其转化到大肠杆菌中。将新转化的细胞培养在含有50μg/ml氨苄青霉素的LB培养基中,并用IPTG(Sigma,St.Louis,MO)进行诱导。诱导过夜,然后收集细菌,并分离出含有DTLR蛋白的沉淀。用2升TE缓冲液(50mM Tris-base pH8.0、10mM EDTA和2mMpefabloc)使沉淀匀浆。使该匀浆液3次穿过微量流化器(Microfluidics,Newton,MA)。用Sorval GS-3转头使流化上清13,000rpm离心1小时。将获得的含有DTLR蛋白的上清液过滤,并使其穿过在50mM Tris-base pH8.0中平衡的谷胱甘肽-SEPHAROSE柱。将含有DTLR-GST融合蛋白的组分合并,然后用凝血酶(EnzymeResearch Laboratories,Inc.,South Bend,IN)剪切。然后使剪切的合并物穿过50mM Tris-base平衡的Q-SEPHAROSE柱。将含有DTLR的组分合并,再用冷蒸馏水稀释,以降低其传导率,然后只需再次穿过新平衡的Q-Sepharose柱,或是接着再穿过一种免疫亲和抗体柱。将含有DTLR蛋白的组分合并,分装,并保存于-70℃冰箱。
DTLR1蛋白的CD谱比较结果说明该蛋白已正确折叠。可参考Hazuda,et al.(1969)
J.Biol.Chem.264:1689-1693。
VII.用DTLRs进行生物学测定
生物学测定通常涉及蛋白的配体结合特性或受体的激酶/磷酸酶活性。与其他多种酶的作用相同,受体的这种活性通常是可逆的,并且可调节磷酸酶或磷酸化酶的活性,这些活性可通过标准方法来方便地检测。可参考,例如,Hardie,et al.(1995年版)《
蛋白激酶 手册》第I和第II卷,Academic Press,San Diego,CA;Hanks,etal.(1991)
Meth.Enzymol.200:38-62;Hunter,et al.(1992)Cell 70:375-388;Lewin(1990)
Cell 61:743-752;Pines,et al.(1991)
Cold Spring Harbor Symp.Quant.Biol.56:449-463;和Parker,et al.(1993)
Nature 363:736-738。
白介素1s家族包括一些可作为炎症疾病重要介体的分子。有关的综合论述,可参考Dinarello(1996)“
白介素-1在疾病中的生物学 基础”
Blood 87:2095-2147。有人认为,可能有多种Toll配体在疾病初期发挥着重要作用,特别是炎症反应初期。与IL-1家族有关的新蛋白的发现有助于鉴定那些可作为疾病初期主要成分的分子,并且有助于产生范围更广、疗效更高的治疗策略。
VIII.诸如DTLR4的特异性抗体的制备
用纯化DTLR4等重组蛋白或稳定转染的NIH-3T3细胞对近交Ba1b/c小鼠进行腹膜内免疫。在适当时间点用附加或不加佐剂的蛋白对动物进行再次注射促升,以进一步刺激抗体的产生。收集血清,或收集脾中产生的杂交瘤细胞。
作为选择,还可以用该基因或其片段转化的内源性或外源性细胞对Balb/c小鼠进行免疫,也可以用分离的抗原富集膜对Ba1b/c小鼠进行免疫。在适当的时间收集血清,一般是在多次给药之后收集血清。为了产生免疫应答,可以用多种基因治疗技术来实现,例如,原位蛋白产生。
可以制备单克隆抗体。例如将脾细胞与适当的融合配偶体相融合,并利用标准方法在生长培养基中挑选杂交瘤细胞。利用诸如ELISA或其他测定方法筛选那些含有能够与所需DTLR结合的抗体的杂交瘤细胞上清液。还可以选择或制备能够特异性识别特定DTLR的抗体。
用于产生单克隆或多克隆抗体的另一种方法是将合成肽或纯化蛋白呈递至免疫系统。可参考,例如,Coligan(1991)《
最新免疫学 实验方法》Wiley/Greene;和Harlow and Lane(1989)《
抗体实验 手册》Cold Spring Harbor Press。在适当条件下,可以按上文所述对结合剂进行标记,如荧光标记等,也可以将结合剂固定在一种基质上,以便在淘选方法中使用。还可以将核酸引入动物体内的细胞,以产生用于诱导免疫应答的抗原。可参考,例如,Wang,et al.(1993)Proc.Nat’l.Acad.Sci.90:4156-4160;Barry,et al.(1994)BioTechniques 16:616-619;和Xiang,et al.(1995)Immunity2:129-135。
IX.诸如DTLR5的融合蛋白的产生
用DTLR5可以制备多种融合构建体。可以将该基因部分与FLAG等表位标记相融合,也可以与一种双杂交系统构建体相融合。可参考,例如,Fields and Song(1989)
Nature 340:245-246。
该表位标记可以在表达克隆化过程中使用,以便于利用抗-FLAG抗体来检测结合配偶体,如相应DTLR5的配体。也可以用双杂交系统来分离与DTLR5特异性结合的蛋白。
X.DTLRs的染色体制图
制备铺展染色体。在用培养72小时并经植物凝集素刺激的淋巴细胞制备的铺展染色体上进行原位杂交。在最后7个小时的培养过程中添加5-溴脱氧尿嘧啶,以确保杂交后染色体能够良好地显带。
以B细胞总cDNA为模板,用引物扩增出适当的片段,如PCR片段,然后将该片段克隆到适当的载体中。利用3H缺口平移技术对该载体进行标记。按Mattei,et al.(1985)Hum.Genet.69:327-331中的描述使标记探针与中期铺展染色体杂交。
用核径迹乳剂(KODAK NTB2)包被玻片,并使其曝光,例如,4℃曝光18天。为避免银颗粒在显带过程中移动,首先用Giemsa缓冲溶液对铺展染色体染色,并进行中期照相。然后用荧光染料-光解-Giemsa(FPG)法进行R-显带操作,中期照相,并进行分析。
作为选择,还可以按上文所述进行FISH。DTLR基因位于不同的染色体上。DTLR2和DTLR3定位于人类第4号染色体;DTLR4定位于人类第9号染色体,而DTLR5定位于人类第1号染色体。参考图4A-4D。
XI.结构活性关系
利用标准的程序和分析方法来确定特定残基的关键程度。实施标准诱变分析的方法是,例如,在预定位点,如上文确定的位点,产生多种不同的变体,并评定这些变体的生物活性。利用该方法可以最终确定那些可以改变活性的位点,也可以针对特定的位置来确定那些被取代后能够保留、阻断或调节生物活性的残基。
作为选择,对天然变体进行分析的结果可以表明那些能够耐受天然突变的位点。这可以通过对个体或杂交品系或物种的种群变异分析来实现。对选定的个体样品进行分析的方法可以是,例如,PCR分析和测序。由此可以对种群多态性进行评定。
XI.DTLR配体的分离
利用DTLR的结合特异性可将其作为特异性结合剂来鉴定其结合配偶体,这非常类似于使用。可以如上所述对结合剂进行标记,如荧光标记等,也可以将其固定在一种基质上,以便在淘选方法中使用。
该结合组合物可用于筛选由一种细胞系制备的表达库,该细胞系可以表达一种结合配偶体,即配体,优选的是与膜结合的配体。利用标准的染色技术对表面表达的配体进行检测或分类,或用淘选方法对表面表达转化细胞进行筛选。胞内表达的筛选可通过多种染色或免疫荧光方法来实现。还可参考McMahan,et al.(1991)
EMBO J.10:2821-2832。
举例来说,第0天,用10ng/ml溶于PBS的纤连蛋白对双池玻片进行预包被,每池使用1ml,室温预包被30分钟。PBS洗涤一次。然后将COS细胞铺片,每池使用1.5ml生长培养基,其中含有2-3×105个细胞。37℃保温过夜。
第一天,制备含有66μg/ml DEAE-葡聚糖、66μM氯奎和4μg DNA的无血清DME溶液,每种样品制备0.5ml。每组样品制备一个阳性对照,如稀释度为1和1/200的DTLR-FLAG cDNA,并制备一个阴性对照。用无血清DME洗涤细胞。添加DNA溶液,并于37℃保温5小时。去除培养基,并添加0.5ml溶于DME的10%DMSO,作用2.5分钟。去除并用DME洗涤一次。添加1.5ml生长培养基并保温过夜。
第二天,改变培养基。第三天或第四天,固定细胞并染色。用Hank’s缓冲盐溶液(HBSS)将细胞洗涤两次,再用4%多聚甲醛(PFA)/葡糖将细胞固定5分钟。然后用HBSS将细胞洗涤三次。去除所有液体,然后将玻片保存于-80℃。按下述方法进行保温,每池使用0.5ml。添加含有32μl/ml 1M叠氮化钠的HBSS/皂角苷(0.1%),保温20分钟。然后用HBSS/皂角苷将细胞洗涤一次。在细胞中添加适当的DTLR或DTLR/抗体复合物,保温30分钟。再用HBSS/皂角苷将细胞洗涤两次。在适当条件下可以先添加抗体并保温30分钟。再添加稀释度为1/200的二抗,如Vector抗小鼠抗体,并保温30分钟。制备ELISA溶液,如Vector Elite ABC辣根过氧化物酶溶液,并预保温30分钟。每2.5ml HBSS/皂角苷使用,例如,1滴溶液A(抗生物素蛋白)和1滴溶液B(生物素)。用HBSS/皂角苷将细胞洗涤两次。添加ABC HRP溶液并保温30分钟。用HBSS将细胞洗涤两次,第二次洗2分钟,以封闭细胞,然后添加Vector二氨基联苯胺(DAB),保温5-10分钟。每5ml蒸馏水中添加2滴缓冲液、4滴DAB和2滴双氧水。仔细移去样品池,并将玻片置水中洗涤。空气干燥数分钟,然后添加1滴Crystal Mount和盖玻片。85-90℃烘焙5分钟。
鉴定阳性染色样品池,然后逐步进行亚克隆,以分离能够结合的单个基因。
作为选择,还可以用DTLR试剂来亲和纯化或分选能够表达假定配体的细胞。可参考,例如,Sambrool,et al.或Ausubel,et al.。
另一种策略是用淘选方法筛选与膜结合的受体。如上所述构建受体cDNA。可以将配体固定化,并用于固定表达细胞。可以用能够识别诸如DTLR融合构建体中的FLAG序列的适当抗体来进行固定,也可以利用由一抗产生的抗体来实现固定。通过循环筛选和扩增即可实现适当克隆的富集,并且最终分离出受体表达克隆。
还可以用哺乳动物DTLRs对噬菌体表达库进行筛选。利用适当的标记技术,如抗-FLAG抗体,可以特异性地标记适当克隆。
文中的所有引文均以相同的程度在此引入作为参考,其方式就如同每个出版物或专利申请都单独地特别指明被引入作为参考。
该领域的技术人员都了解,在不脱离本发明的精神和范围的情况下,本发明可以有多种修改和变化。文中描述的特定实施方案只作为实例加以提供,而本发明将局限于附属权利要求的条款,以及与这些权利要求的授权相同的整个范围;本发明并不局限于文中以实例方式提供的特定实施方案。
人类具有两种不同类型的树突状细胞(DC)前体。经GMCSF和IL-4培养之后的外周血单核细胞(pDC1)可形成未成熟的髓样DCs。经CD40配体(CD40L)刺激之后,这些未成熟细胞可变为成熟的髓样DCs(DC1)。经IL-3培养之后,来自血液或扁桃体的CD4+CD3-CD11c-浆细胞样细胞(pDC2)可形成不同类型的未成熟DC,经CD40L刺激后可分化为成熟的DCs(DC2)。Rissoan,et al.(1999)
Science283:1183-1186。
根据Siegal,et al.(1999)
Science 284:1835-1837的描述,pDC2是“天然的产干扰素细胞”(IPC)。干扰素(IFNs)是抗病毒免疫应答中最重要的细胞因子。人类血液中的“天然产干扰素细胞”(NIPCs)可表达CD4和II-类主要组织相容性复合体蛋白,但由于这些细胞很稀少,并且会迅速凋亡,同时又缺乏同系标记物,所以至今还未能对其进行分离和进一步检测。本文的纯化NIPCs是CD4(+)CD11c-2型树突状细胞前体(pDC2s),被微生物诱发后,该细胞能产生比其他血细胞多200-1000倍的IFN。因此,pDC2s是一类免疫系统效应细胞,它在抗病毒和抗肿瘤免疫应答中起着关键作用。这些细胞被认为是HIV感染患者内的重要细胞。
Toll-样受体(TLR)分子属于IL-1/Toll受体家族。目前已鉴定出TLR2和TLR4的配体,这些配体的功能与宿主对微生物抗原或损伤的免疫应答有关。Takeuchi,et al.(1999)
Immunity 11:443-451;和Noshino,et al.(1999)
J.Immunol.162:3749-3752。TLRs的表达模式可能是限制性的。Muzio,et al.(2000)
J.Immunol.164:5998-6004。这些结果说明:i)TLR10能够在pDC2s中高水平表达,并且其表达局限于pDC2s,以及ii)pDC2属于NIPC,TLR10可能在宿主先天免疫应答中起着重要作用。
序列表
SEQ ID NO:1提供灵长类动物的DTLR1核苷酸序列。
SEQ ID NO:2提供灵长类动物的DTLR1多肽序列。
SEQ ID NO:3提供灵长类动物的DTLR2核苷酸序列。
SEQ ID NO:4提供灵长类动物的DTLR2多肽序列。
SEQ ID NO:5提供灵长类动物的DTLR3核苷酸序列。
SEQ ID NO:6提供灵长类动物的DTLR3多肽序列。
SEQ ID NO:7提供灵长类动物的DTLR4核苷酸序列。
SEQ ID NO:8提供灵长类动物的DTLR4多肽序列。
SEQ ID NO:9提供灵长类动物的DTLR5核苷酸序列。
SEQ ID NO:10提供灵长类动物的DTLR5多肽序列。
SEQ ID NO:11提供灵长类动物的DTLR6核苷酸序列。
SEQ ID NO:12提供灵长类动物的DTLR6多肽序列。
SEQ ID NO:13提供啮齿类动物的DTLR6核苷酸序列。
SEQ ID NO:14提供啮齿类动物的DTLR6多肽序列。
SEQ ID NO:15提供灵长类动物的DTLR7核苷酸序列。
SEQ ID NO:16提供灵长类动物的DTLR7多肽序列。
SEQ ID NO:17提供灵长类动物的DTLR7核苷酸序列。
SEQ ID NO:18提供灵长类动物的DTLR7多肽序列。
SEQ ID NO:19提供灵长类动物的DTLR8核苷酸序列。
SEQ ID NO:20提供灵长类动物的DTLR8多肽序列。
SEQ ID NO:21提供灵长类动物的DTLR9核苷酸序列。
SEQ ID NO:22提供灵长类动物的DTLR9多肽序列。
SEQ ID NO:23提供灵长类动物的DTLR10核苷酸序列。
SEQ ID NO:24提供灵长类动物的DTLR10多肽序列。
SEQ ID NO:25提供灵长类动物的DTLR4核苷酸序列。
SEQ ID NO:26提供灵长类动物的DTLR4多肽序列。
SEQ ID NO:27提供啮齿类动物的DTLR6核苷酸序列。
SEQ ID NO:28提供啮齿类动物的DTLR6多肽序列。
SEQ ID NO:29提供啮齿类动物的DTLR6核苷酸序列。
SEQ ID NO:30提供啮齿类动物的DTLR6多肽序列。
SEQ ID NO:31提供灵长类动物的DTLR8核苷酸序列。
SEQ ID NO:32提供灵长类动物的DTLR8多肽序列。
SEQ ID NO:33提供灵长类动物的DTLR10核苷酸序列。
SEQ ID NO:34提供灵长类动物的DTLR10多肽序列。
SEQ ID NO:35提供啮齿类动物的DTLR10核苷酸序列。
SEQ ID NO:36提供灵长类动物的DTLR7核苷酸序列。
SEQ ID NO:37提供灵长类动物的DTLR7多肽序列。
SEQ ID NO:38提供灵长类动物的DTLR8核苷酸序列。
SEQ ID NO:39提供灵长类动物的DTLR8多肽序列。
SEQ ID NO:40提供灵长类动物的DTLR9核苷酸序列。
SEQ ID NO:41提供灵长类动物的DTLR9多肽序列。
SEQ ID NO:42提供灵长类动物的DTLR10核苷酸序列。
SEQ ID NO:43提供灵长类动物的DTLR10多肽序列。
SEQ ID NO:42提供啮齿类动物的DTLR10核苷酸序列。
SEQ ID NO:43提供啮齿类动物的DTLR10多肽序列。
<110>先灵公司
<120>人受体蛋白;相关的试剂和方法
<130>DX0724XKP
<140>
<141>
<160>45
<170>PatentIn Ver.2.0
<210>1
<211>2367
<212>DNA
<213>未知
<220>
<223>未知生物的说明:灵长类动物;推测为
人(Homo sapiens)
<220>
<221>CDS
<222>(1)..(2358)
<220>
<221>mat肽
<222>(67)..(2358)
<400>1
atg act agc atc ttc cat ttt gcc att atc ttc atg tta ata ctt cag 48
Met Thr Ser Ile Phe His Phe Ala Ile Ile Phe Met Leu Ile Leu Gln
-20 -15 -10
atc aga ata caa tta tct gaa gaa agt gaa ttt tta gtt gat agg tca 96
Ile Arg Ile Gln Leu Ser Glu Glu Ser Glu Phe Leu Val Asp Arg Ser
-5 -1 1 5 10
aaa aac ggt ctc atc cac gtt cct aaa gac cta tcc cag aaa aca aca 144
Lys Asn Gly Leu Ile His Val Pro Lys Asp Leu Ser Gln Lys Thr Thr
15 20 25
atc tta aat ata tcg caa aat tat ata tct gag ctt tgg act tct gac 192
Ile Leu Asn Ile Ser Gln Asn Tyr Ile Ser Glu Leu Trp Thr Ser Asp
30 35 40
atc tta tca ctg tca aaa ctg agg att ttg ata att tct cat aat aga 240
Ile Leu Ser Leu Ser Lys Leu Arg Ile Leu Ile Ile Ser His Asn Arg
45 50 55
atc cag tat ctt gat atc agt gtt ttc aaa ttc aac cag gaa ttg gaa 288
Ile Gln Tyr Leu Asp Ile Ser Val Phe Lys Phe Asn Gln Glu Leu Glu
60 65 70
tac ttg gat ttg tcc cac aac aag ttg gtg aag att tct tgc cac cct 336
Tyr Leu Asp Leu Ser His Asn Lys Leu Val Lys Ile Ser Cys His Pro
75 80 85 90
act gtg aac ctc aag cac ttg gac ctg tca ttt aat gca ttt gat gcc 384
Thr Val Asn Leu Lys His Leu Asp Leu Ser Phe Asn Ala Phe Asp Ala
95 100 105
ctg cct ata tgc aaa gag ttt ggc aat atg tct caa cta aaa ttt ctg 432
Leu Pro Ile Cys Lys Glu Phe Gly Asn Met Ser Gln Leu Lys Phe Leu
110 115 120
ggg ttg agc acc aca cac tta gaa aaa tct agt gtg ctg cca att gct 480
Gly Leu Ser Thr Thr His Leu Glu Lys Ser Ser Val Leu Pro Ile Ala
125 130 135
cat ttg aat atc agc aag gtc ttg ctg gtc tta gga gag act tat ggg 528
His Leu Asn Ile Ser Lys Val Leu Leu Val Leu Gly Glu Thr Tyr Gly
140 145 150
gaa aaa gaa gac cct gag ggc ctt caa gac ttt aac act gag agt ctg 576
Glu Lys Glu Asp Pro Glu Gly Leu Gln Asp Phe Asn Thr Glu Ser Leu
155 160 165 170
cac att gtg ttc ccc aca aac aaa gaa ttc cat ttt att ttg gat gtg 624
His Ile Val Phe Pro Thr Asn Lys Glu Phe His Phe Ile Leu Asp Val
175 180 185
tca gtc aag act gta gca aat ctg gaa cta tct aat atc aaa tgt gtg 672
Ser Val Lys Thr Val Ala Asn Leu Glu Leu Ser Asn Ile Lys Cys Val
190 195 200
cta gaa gat aac aaa tgt tct tac ttc cta agt att ctg gcg aaa ctt 720
Leu Glu Asp Asn Lys Cys Ser Tyr Phe Leu Ser Ile Leu Ala Lys Leu
205 210 215
caa aca aat cca aag tta tca agt ctt acc tta aac aac att gaa aca 768
Gln Thr Asn Pro Lys Leu Ser Ser Leu Thr Leu Asn Asn Ile Glu Thr
220 225 230
act tgg aat tct ttc att agg atc ctc caa cta gtt tgg cat aca act 816
Thr Trp Asn Ser Phe Ile Arg Ile Leu Gln Leu Val Trp His Thr Thr
235 240 245 250
gta tgg tat ttc tca att tca aac gtg aag cta cag ggt cag ctg gac 864
Val Trp Tyr Phe Ser Ile Ser Asn Val Lys Leu Gln Gly Gln Leu Asp
255 260 265
ttc aga gat ttt gat tat tct ggc act tcc ttg aag gcc ttg tct ata 912
Phe Arg Asp Phe Asp Tyr Ser Gly Thr Ser Leu Lys Ala Leu Ser Ile
270 275 280
cac caa gtt gtc agc gat gtg ttc ggt ttt ccg caa agt tat atc tat 960
His Gln Val Val Ser Asp Val Phe Gly Phe Pro Gln Ser Tyr Ile Tyr
285 290 295
gaa atc ttt tcg aat atg aac atc aaa aat ttc aca gtg tct ggt aca 1008
Glu Ile Phe Ser Asn Met Asn Ile Lys Asn Phe Thr Val Ser Gly Thr
300 305 310
cgc atg gtc cac atg ctt tgc cca tcc aaa att agc ccg ttc ctg cat 1056
Arg Met Val His Met Leu Cys Pro Ser Lys Ile Ser Pro Phe Leu His
315 320 325 330
ttg gat ttt tcc aat aat ctc tta aca gac acg gtt ttt gaa aat tgt 1104
Leu Asp Phe Ser Asn Asn Leu Leu Thr Asp Thr Val Phe Glu Asn Cys
335 340 345
ggg cac ctt act gag ttg gag aca ctt att tta caa atg aat caa tta 1152
Gly His Leu Thr Glu Leu Glu Thr Leu Ile Leu Gln Met Asn Gln Leu
350 355 360
aaa gaa ctt tca aaa ata gct gaa atg act aca cag atg aag tct ctg 1200
Lys Glu Leu Ser Lys Ile Ala Glu Met Thr Thr Gln Met Lys Ser Leu
365 370 375
caa caa ttg gat att agc cag aat tct gta agc tat gat gaa aag aaa 1248
Gln Gln Leu Asp Ile Ser Gln Asn Ser Val Ser Tyr Asp Glu Lys Lys
380 385 390
gga gac tgt tct tgg act aaa agt tta tta agt tta aat atg tct tca 1296
Gly Asp Cys Ser Trp Thr Lys Ser Leu Leu Ser Leu Asn Met Ser Ser
395 400 405 410
aat ata ctt act gac act att ttc aga tgt tta cct ccc agg atc aag 1344
Asn Ile Leu Thr Asp Thr Ile Phe Arg Cys Leu Pro Pro Arg Ile Lys
415 420 425
gta ctt gat ctt cac agc aat aaa ata aag agc att cct aaa caa gtc 1392
Val Leu Asp Leu His Ser Asn Lys Ile Lys Ser Ile Pro Lys Gln Val
430 435 440
gta aaa ctg gaa gct ttg caa gaa ctc aat gtt gct ttc aat tct tta 1440
Val Lys Leu Glu Ala Leu Gln Glu Leu Asn Val Ala Phe Asn Ser Leu
445 450 455
act gac ctt cct gga tgt ggc agc ttt agc agc ctt tct gta ttg atc 1488
Thr Asp Leu Pro Gly Cys Gly Ser Phe Ser Ser Leu Ser Val Leu Ile
460 465 470
att gat cac aat tca gtt tcc cac cca tca gct gat ttc ttc cag agc 1536
Ile Asp His Asn Ser Val Ser His Pro Ser Ala Asp Phe Phe Gln Ser
475 480 485 490
tgc cag aag atg agg tca ata aaa gca ggg gac aat cca ttc caa tgt 1584
Cys Gln Lys Met Arg Ser Ile Lys Ala Gly Asp Asn Pro Phe Gln Cys
495 500 505
acc tgt gag ctc gga gaa ttt gtc aaa aat ata gac caa gta tca agt 1632
Thr Cys Glu Leu Gly Glu Phe Val Lys Asn Ile Asp Gln Val Ser Ser
510 515 520
gaa gtg tta gag ggc tgg cct gat tct tat aag tgt gac tac ccg gaa 1680
Glu Val Leu Glu Gly Trp Pro Asp Ser Tyr Lys Cys Asp Tyr Pro Glu
525 530 535
agt tat aga gga acc cta cta aag gac ttt cac atg tct gaa tta tcc 1728
Ser Tyr Arg Gly Thr Leu Leu Lys Asp Phe His Met Ser Glu Leu Ser
540 545 550
tgc aac ata act ctg ctg atc gtc acc atc gtt gcc acc atg ctg gtg 1776
Cys Asn Ile Thr Leu Leu Ile Val Thr Ile Val Ala Thr Met Leu Val
555 560 565 570
ttg gct gtg act gtg acc tcc ctc tgc atc tac ttg gat ctg ccc tgg 1824
Leu Ala Val Thr Val Thr Ser Leu Cys Ile Tyr Leu Asp Leu Pro Trp
575 580 585
tat ctc agg atg gtg tgc cag tgg acc cag acc cgg cgc agg gcc agg 1872
Tyr Leu Arg Met Val Cys Gln Trp Thr Gln Thr Arg Arg Arg Ala Arg
590 595 600
aac ata ccc tta gaa gaa ctc caa aga aat ctc cag ttt cat gca ttt 1920
Asn Ile Pro Leu Glu Glu Leu Gln Arg Asn Leu Gln Phe His Ala Phe
605 610 615
att tca tat agt ggg cac gat tct ttc tgg gtg aag aat gaa tta ttg 1968
Ile Ser Tyr Ser Gly His Asp Ser Phe Trp Val Lys Asn Glu Leu Leu
620 625 630
cca aac cta gag aaa gaa ggt atg cag att tgc ctt cat gag aga aac 2016
Pro Asn Leu Glu Lys Glu Gly Met Gln Ile Cys Leu His Glu Arg Asn
635 640 645 650
ttt gtt cct ggc aag agc att gtg gaa aat atc atc acc tgc att gag 2064
Phe Val Pro Gly Lys Ser Ile Val Glu Asn Ile Ile Thr Cys Ile Glu
655 660 665
aag agt tac aag tcc arc ttt gtt ttg tct ccc aac ttt gtc cag agt 2112
Lys Ser Tyr Lys Ser Ile Phe Val Leu Ser Pro Asn Phe Val Gln Ser
670 675 680
gaa tgg tgc cat tat gaa ctc tac ttt gcc cat cac aat ctc ttt cat 2160
Glu Trp Cys His Tyr Glu Leu Tyr Phe Ala His His Asn Leu Phe His
685 690 695
gaa gga tct aat agc tta atc ctg atc ttg ctg gaa ccc att ccg cag 2208
Glu Gly Ser Asn Ser Leu Ile Leu Ile Leu Leu Glu Pro Ile Pro Gln
700 705 710
tac tcc att cct agc agt tat cac aag ctc aaa agt ctc atg gcc agg 2256
Tyr Ser Ile Pro Ser Ser Tyr His Lys Leu Lys Ser Leu Met Ala Arg
715 720 725 730
agg act tat ttg gaa tgg ccc aag gaa aag agc aaa cgt ggc ctt ttt 2304
Arg Thr Tyr Leu Glu Trp Pro Lys Glu Lys Ser Lys Arg Gly Leu Phe
735 740 745
tgg gct aac tta agg gca gcc att aat att aag ctg aca gag caa gca 2352
Trp Ala Asn Leu Arg Ala Ala Ile Asn Ile Lys Leu Thr Glu Gln Ala
750 755 760
aag aaa tagtctaga 2367
Lys Lys
<210>2
<211>786
<212>PRT
<213>未知
<400>2
Met Thr Ser Ile Phe His Phe Ala Ile Ile Phe Met Leu Ile Leu Gln
-20 -15 -10
Ile Arg Ile Gln Leu Ser Glu Glu Ser Glu Phe Leu Val Asp Arg Ser
-5 -1 1 5 10
Lys Asn Gly Leu Ile His Val Pro Lys Asp Leu Ser Gln Lys Thr Thr
15 20 25
Ile Leu Asn Ile Ser Gln Asn Tyr Ile Ser Glu Leu Trp Thr Ser Asp
30 35 40
Ile Leu Ser Leu Ser Lys Leu Arg Ile Leu Ile Ile Ser His Asn Arg
45 50 55
Ile Gln Tyr Leu Asp Ile Ser Val Phe Lys Phe Asn Gln Glu Leu Glu
60 65 70
Tyr Leu Asp Leu Ser His Asn Lys Leu Val Lys Ile Ser Cys His Pro
75 80 85 90
Thr Val Asn Leu Lys His Leu Asp Leu Ser Phe Asn Ala Phe Asp Ala
95 100 105
Leu Pro Ile Cys Lys Glu Phe Gly Asn Met Ser Gln Leu Lys Phe Leu
110 115 120
Gly Leu Ser Thr Thr His Leu Glu Lys Ser Ser Val Leu Pro Ile Ala
125 130 135
His Leu Asn Ile Ser Lys Val Leu Leu Val Leu Gly Glu Thr Tyr Gly
140 145 150
Glu Lys Glu Asp Pro Glu Gly Leu Gln Asp Phe Asn Thr Glu Ser Leu
155 160 165 170
His Ile Val Phe Pro Thr Asn Lys Glu Phe His Phe Ile Leu Asp Val
175 180 185
Ser Val Lys Thr Val Ala Asn Leu Glu Leu Ser Asn Ile Lys Cys Val
190 195 200
Leu Glu Asp Asn Lys Cys Ser Tyr Phe Leu Ser Ile Leu Ala Lys Leu
205 210 215
Gln Thr Asn Pro Lys Leu Ser Ser Leu Thr Leu Asn Asn Ile Glu Thr
220 225 230
Thr Trp Asn Ser Phe Ile Arg Ile Leu Gln Leu Val Trp His Thr Thr
235 240 245 250
Val Trp Tyr Phe Ser Ile Ser Asn Val Lys Leu Gln Gly Gln Leu Asp
255 260 265
Phe Arg Asp Phe Asp Tyr Ser Gly Thr Ser Leu Lys Ala Leu Ser Ile
270 275 280
His Gln Val Val Ser Asp Val Phe Gly Phe Pro Gln Ser Tyr Ile Tyr
285 290 295
Glu Ile Phe Ser Asn Met Asn Ile Lys Asn Phe Thr Val Ser Gly Thr
300 305 310
Arg Met Val His Met Leu Cys Pro Ser Lys Ile Ser Pro Phe Leu His
315 320 325 330
Leu Asp Phe Ser Asn Asn Leu Leu Thr Asp Thr Val Phe Glu Asn Cys
335 340 345
Gly His Leu Thr Glu Leu Glu Thr Leu Ile Leu Gln Met Asn Gln Leu
350 355 360
Lys Glu Leu Ser Lys Ile Ala Glu Met Thr Thr Gln Met Lys Ser Leu
365 370 375
Gln Gln Leu Asp Ile Ser Gln Asn Ser Val Ser Tyr Asp Glu Lys Lys
380 385 390
Gly Asp Cys Ser Trp Thr Lys Ser Leu Leu Ser Leu Asn Met Ser Ser
395 400 405 410
Asn Ile Leu Thr Asp Thr Ile Phe Arg Cys Leu Pro Pro Arg Ile Lys
415 420 425
Val Leu Asp Leu His Ser Asn Lys Ile Lys Ser Ile Pro Lys Gln Val
430 435 440
Val Lys Leu Glu Ala Leu Gln Glu Leu Asn Val Ala Phe Asn Ser Leu
445 450 455
Thr Asp Leu Pro Gly Cys Gly Ser Phe Ser Ser Leu Ser Val Leu Ile
460 465 470
Ile Asp His Asn Ser Val Ser His Pro Ser Ala Asp Phe Phe Gln Ser
475 480 485 490
Cys Gln Lys Met Arg Ser Ile Lys Ala Gly Asp Asn Pro Phe Gln Cys
495 500 505
Thr Cys Glu Leu Gly Glu Phe Val Lys Asn Ile Asp Gln Val Ser Ser
510 515 520
Glu Val Leu Glu Gly Trp Pro Asp Ser Tyr Lys Cys Asp Tyr Pro Glu
525 530 535
Ser Tyr Arg Gly Thr Leu Leu Lys Asp Phe His Met Ser Glu Leu Ser
540 545 550
Cys Asn Ile Thr Leu Leu Ile Val Thr Ile Val Ala Thr Met Leu Val
555 560 565 570
Leu Ala Val Thr Val Thr Ser Leu Cys Ile Tyr Leu Asp Leu Pro Trp
575 580 585
Tyr Leu Arg Mer Val Cys Gln Trp Thr Gln Thr Arg Arg Arg Ala Arg
590 595 600
Asn Ile Pro Leu Glu Glu Leu Gln Arg Asn Leu Gln Phe His Ala Phe
605 610 615
Ile Ser Tyr Ser Gly His Asp Ser Phe Trp Val Lys Asn Glu Leu Leu
620 625 630
Pro Asn Leu Glu Lys Glu Gly Met Gln Ile Cys Leu His Glu Arg Asn
635 640 645 650
Phe Val Pro Gly Lys Ser Ile Val Glu Asn Ile Ile Thr Cys Ile Glu
655 660 665
Lys Ser Tyr Lys Ser Ile Phe Val Leu Ser Pro Asn Phe Val Gln Ser
670 675 680
Glu Trp Cys His Tyr Glu Leu Tyr Phe Ala His His Asn Leu Phe His
685 690 695
Glu Gly Ser Asn Ser Leu Ile Leu Ile Leu Leu Glu Pro Ile Pro Gln
700 705 710
Tyr Ser Ile Pro Ser Ser Tyr His Lys Leu Lys Ser Leu Met Ala Arg
715 720 725 730
Arg Thr Tyr Leu Glu Trp Pro Lys Glu Lys Ser Lys Arg Gly Leu Phe
735 740 745
Trp Ala Asn Leu Arg Ala Ala Ile Asn Ile Lys Leu Thr Glu Gln Ala
750 755 760
Lys Lys
<210>3
<211>2355
<212>DNA
<213>未知
<220>
<223>未知生物的说明:灵长类动物;推测为
人(Homo sapiens)
<220>
<221>CDS
<222>(L)..(2352)
<220>
<22l>mat_肽
<222>(67)..(2352)
<400>3
atg cca cat act ttg tgg atg gtg tgg gtc ttg ggg gtc atc atc agc 48
Met Pro His Thr Leu Trp Met Val Trp Val Leu Gly Val Ile Ile Ser
-20 -15 -10
ctc tcc aag gaa gaa tcc tcc aat cag gct tct ctg tct tgt gac cgc 96
Leu Ser Lys Glu Glu Ser Ser Asn Gln Ala Ser Leu Ser Cys Asp Arg
-5 -1 1 5 10
aat ggt atc tgc aag ggc agc tca gga tct tta aac tcc att ccc tca 144
Asn Gly Ile Cys Lys Gly Ser Ser Gly Ser Leu Asn Ser Ile Pro Ser
15 20 25
ggg ctc aca gaa gct gta aaa agc ctt gac ctg tcc aac aac agg atc 192
Gly Leu Thr Glu Ala Val Lys Ser Leu Asp Leu Ser Asn Asn Arg Ile
30 35 40
acc tac att agc aac agt gac cta cag agg tgt gtg aac ctc cag gct 240
Thr Tyr Ile Ser Asn Ser Asp Leu Gln Arg Cys Val Asn Leu Gln Ala
45 50 55
ctg gtg ctg aca tcc aat gga att aac aca ata gag gaa gat tct ttt 288
Leu Val Leu Thr Ser Asn Gly Ile Asn Thr Ile Glu Glu Asp Ser Phe
60 65 70
tct tcc ctg ggc agt ctt gaa cat tta gac tta tcc tat aat tac tta 336
Ser Ser Leu Gly Ser Leu Glu His Leu Asp Leu Ser Tyr Asn Tyr Leu
75 80 85 90
tct aat tta tcg tct tcc tgg ttc aag ccc ctt tct tct tta aca ttc 384
Ser Asn Leu Ser Ser Ser Trp Phe Lys Pro Leu Ser Ser Leu Thr Phe
95 100 105
tta aac tta ctg gga aat cct tac aaa acc cta ggg gaa aca tct ctt 432
Leu Asn Leu Leu Gly Asn Pro Tyr Lys Thr Leu Gly Glu Thr Ser Leu
110 115 120
ttt tct cat ctc aca aaa ttg caa atc ctg aga gtg gga aat atg gac 480
Phe Ser His Leu Thr Lys Leu Gln Ile Leu Arg Val Gly Asn Met Asp
125 130 135
acc ttc act aag att caa aga aaa gat ttt gct gga ctt acc ttc ctt 528
Thr Phe Thr Lys Ile Gln Arg Lys Asp Phe Ala Gly Leu Thr Phe Leu
140 145 150
gag gaa ctt gag att gat gct tca gat cta cag agc tat gag cca aaa 576
Glu Glu Leu Glu Ile Asp Ala Ser Asp Leu Gln Ser Tyr Glu Pro Lys
155 160 165 170
agt ttg aag tca att cag aac gta agt cat ctg atc ctt cat atg aag 624
Ser Leu Lys Ser Ile Gln Asn Val Ser His Leu Ile Leu His Met Lys
175 180 185
cag cat att tta ctg ctg gag att ttt gta gat gtt aca agt tcc gtg 672
Gln His Ile Leu Leu Leu Glu Ile Phe Val Asp Val Thr Ser Ser Val
190 195 200
gaa tgt ttg gaa ctg cga gat act gat ttg gac act ttc cat ttt tca 720
Glu Cys Leu Glu Leu Arg Asp Thr Asp Leu Asp Thr Phe His Phe Ser
205 210 215
gaa cta tcc act ggt gaa aca aat tca ttg att aaa aag ttt aca ttt 768
Glu Leu Ser Thr Gly Glu Thr Asn Ser Leu Ile Lys Lys Phe Thr Phe
220 225 230
aga aat gtg aaa atc acc gat gaa agt ttg ttt cag gtt atg aaa ctt 816
Arg Asn Val Lys Ile Thr Asp Glu Ser Leu Phe Gln Val Met Lys Leu
235 240 245 250
ttg aat cag att tct gga ttg tta gaa tta gag ttt gat gac tgt acc 864
Leu Asn Gln Ile Ser Gly Leu Leu Glu Leu Glu Phe Asp Asp Cys Thr
255 260 265
ctt aat gga gtt ggt aat ttt aga gca tct gat aat gac aga gtt ata 912
Leu Asn Gly Val Gly Asn Phe Arg Ala Ser Asp Asn Asp Arg Val Ile
270 275 280
gat cca ggt aaa gtg gaa acg tta aca atc cgg agg ctg cat att cca 960
Asp Pro Gly Lys Val Glu Thr Leu Thr Ile Arg Arg Leu His Ile Pro
285 290 295
agg ttt tac tta ttt tat gat ctg agc act tta tat tca ctt aca gaa 1008
Arg Phe Tyr Leu Phe Tyr Asp Leu Ser Thr Leu Tyr Ser Leu Thr Glu
300 305 310
aga gtt aaa aga atc aca gta gaa aac agt aaa gtt ttt ctg gtt cct 1056
Arg Val Lys Arg Ile Thr Val Glu Asn Ser Lys Val Phe Leu Val Pro
315 320 325 330
tgt tta ctt tca caa cat tta aaa tca tta gaa tac ttg gat ctc agt 1104
Cys Leu Leu Ser Gln His Leu Lys Ser Leu Glu Tyr Leu Asp Leu Ser
335 340 345
gaa aat ttg atg gtt gaa gaa tac ttg aaa aat tca gcc tgt gag gat 1152
Glu Asn Leu Met Val Glu Glu Tyr Leu Lys Asn Ser Ala Cys Glu Asp
350 355 360
gcc tgg ccc tct cta caa act tta att tta agg caa aat cat ttg gca 1200
Ala Trp Pro Ser Leu Gln Thr Leu Ile Leu Arg Gln Asn His Leu Ala
365 370 375
tca ttg gaa aaa acc gga gag act ttg ctc act ctg aaa aac ttg act 1248
ser Leu Glu Lys Thr Gly Glu Thr Leu Leu Thr Leu Lys Asn Leu Thr
380 385 390
aac att gat atc agt aag aat agt ttt cat tct atg cct gaa act tgt 1296
Asn Ile Asp Ile Ser Lys Asn Ser Phe His Ser Met Pro Glu Thr Cys
395 400 405 410
cag tgg cca gaa aag atg aaa tat ttg aac tta tcc agc aca cga ata 1344
Gln Trp Pro Glu Lys Met Lys Tyr Leu Asn Leu Ser Ser Thr Arg Ile
415 420 425
cac agt gta aca ggc tgc att ccc aag aca ctg gaa att tta gat gtt 1392
His Ser Val Thr Gly Cys Ile Pro Lys Thr Leu Glu Ile Leu Asp Val
430 435 440
agc aac aac aat ctc aat tta ttt tct ttg aat ttg ccg caa ctc aaa 1440
Ser Asn Asn Asn Leu Asn Leu Phe Ser Leu Asn Leu Pro Gln Leu Lys
445 450 455
gaa ctt tat att tcc aga aat aag ttg atg act cta cca gat gcc tcc 1488
Glu Leu Tyr Ile Ser Arg Asn Lys Leu Met Thr Leu Pro Asp Ala Ser
460 465 470
ctc tta ccc atg tta cta gta ttg aaa atc agt agg aat gca ata act 1536
Leu Leu Pro Met Leu Leu Val Leu Lys Ile Ser Arg Asn Ala Ile Thr
475 480 485 490
acg ttt tct aag gag caa ctt gac tca ttt cac aca ctg aag act ttg 1584
Thr Phe Ser Lys Glu Gln Leu Asp Ser Phe His Thr Leu Lys Thr Leu
495 500 505
gaa gct ggt ggc aat aac ttc att tgc tcc tgt gaa ttc ctc tcc ttc 1632
Glu Ala Gly Gly Asn Asn Phe Ile Cys Ser Cys Glu Phe Leu Ser Phe
510 515 520
act cag gag cag caa gca ctg gcc aaa gtc ttg att gat tgg cca gca 1680
Thr Gln Glu Gln Gln Ala Leu Ala Lys Val Leu Ile Asp Trp Pro Ala
525 530 535
aat tac ctg tgt gac tct cca tcc cat gtg cgt ggc cag cag gtt cag 1728
Asn Tyr Leu Cys Asp Ser Pro Ser His Val Arg Gly Gln Gln Val Gln
540 545 550
gat gtc cgc ctc tcg gtg tcg gaa tgt cac agg aca gca ctg gtg tct 1776
Asp Val Arg Leu Ser Val Ser Glu Cys His Arg Thr Ala Leu Val Ser
555 560 565 570
ggc atg tgc tgt gct ctg ttc ctg ctg atc ctg ctc acg ggg gtc ctg 1824
Gly Met Cys Cys Ala Leu Phe Leu Leu Ile Leu Leu Thr Gly Val Leu
575 580 585
tgc cac cgt ttc cat ggc ctg tgg tat atg aaa atg atg tgg gcc tgg 1872
Cys His Arg Phe His Gly Leu Trp Tyr Met Lys Met Met Trp Ala Trp
590 595 600
ctc cag gcc aaa agg aag ccc agg aaa gct ccc agc agg aac atc tgc 1920
Leu Gln Ala Lys Arg Lys Pro Arg Lys Ala Pro Ser Arg Asn Ile Cys
605 610 615
tat gat gca ttt gtt tct tac agt gag cgg gat gcc tac tgg gtg gag 1968
Tyr Asp Ala Phe Val Ser Tyr Ser Glu Arg Asp Ala Tyr Trp Val Glu
620 625 630
aac ctt atg gtc cag gag ctg gag aac ttc aat ccc ccc ttc aag ttg 2016
Asn Leu Met Val Gln Glu Leu Glu Asn Phe Asn Pro Pro Phe Lys Leu
635 640 645 650
tgt ctt cat aag cgg gac ttc att cct ggc aag tgg atc att gac aat 2064
Cys Leu His Lys Arg Asp Phe Ile Pro Gly Lys Trp Ile Ile Asp Asn
655 660 665
atc att gac tcc att gaa aag agc cac aaa act gtc ttt gtg ctt tct 2112
Ile Ile Asp Ser Ile Glu Lys Ser His Lys Thr Val Phe Val Leu Ser
670 675 680
gaa aac ttt gtg aag agt gag tgg tgc aag tat gaa ctg gac ttc tcc 2160
Glu Asn Phe Val Lys Ser Glu Trp Cys Lys Tyr Glu Leu Asp Phe Ser
685 690 695
cat ttc cgt ctt ttt gaa gag aac aat gat gct gcc att ctc att ctt 2208
His Phe Arg Leu Phe Glu Glu Asn Asn Asp Ala Ala Ile Leu Ile Leu
700 705 710
ctg gag ccc att gag aaa aaa gcc att ccc cag cgc ttc tgc aag ctg 2256
Leu Glu Pro Ile Glu Lys Lys Ala Ile Pro Gln Arg Phe Cys Lys Leu
715 720 725 730
cgg aag ata atg aac acc aag acc tac ctg gag tgg ccc atg gac gag 2304
Arg Lys Ile Met Asn Thr Lys Thr Tyr Leu Glu Trp Pro Met Asp Glu
735 740 745
gct cag cgg gaa gga ttt tgg gta aat ctg aga gct gcg ata aag tcc 2352
Ala Gln Arg Glu Gly Phe Trp Val Asn Leu Arg Ala Ala Ile Lys Ser
750 755 760
tag 2355
<210>4
<211>784
<212>PRT
<213>未知
<400>4
Met Pro His Thr Leu Trp Met Val Trp Val Leu Gly Val Ile Ile Ser
-20 -15 -10
Leu Ser Lys Glu Glu Ser Ser Asn Gln Ala Ser Leu Ser Cys Asp Arg
-5 -1 1 5 10
Asn Gly Ile Cys Lys Gly Ser Ser Gly Ser Leu Asn Ser Ile Pro Ser
15 20 25
Gly Leu Thr Glu Ala Val Lys Ser Leu Asp Leu Ser Asn Asn Arg Ile
30 35 40
Thr Tyr Ile Ser Asn Ser Asp Leu Gln Arg Cys Val Asn Leu Gln Ala
45 50 55
Leu Val Leu Thr Ser Asn Gly Ile Asn Thr Ile Glu Glu Asp Ser Phe
60 65 70
Ser Ser Leu Gly Ser Leu Glu His Leu Asp Leu Ser Tyr Asn Tyr Leu
75 80 85 90
Ser Asn Leu Ser Ser Ser Trp Phe Lys Pro Leu Ser Ser Leu Thr Phe
95 100 105
Leu Asn Leu Leu Gly Asn Pro Tyr Lys Thr Leu Gly Glu Thr Ser Leu
110 115 120
Phe Ser His Leu Thr Lys Leu Gln Ile Leu Arg Val Gly Asn Met Asp
125 130 135
Thr Phe Thr Lys Ile Gln Arg Lys Asp Phe Ala Gly Leu Thr Phe Leu
140 145 150
Glu Glu Leu Glu Ile Asp Ala Ser Asp Leu Gln Ser Tyr Glu Pro Lys
155 160 165 170
Ser Leu Lys Ser Ile Gln Asn Val Ser His Leu Ile Leu His Met Lys
175 180 185
Gln His Ile Leu Leu Leu Glu Ile Phe Val Asp Val Thr Ser Ser Val
190 195 200
Glu Cys Leu Glu Leu Arg Asp Thr Asp Leu Asp Thr Phe His Phe Ser
205 210 215
Glu Leu Ser Thr Gly Glu Thr Asn Ser Leu Ile Lys Lys Phe Thr Phe
220 225 230
Arg Asn Val Lys Ile Thr Asp Glu Ser Leu Phe Gln Val Met Lys Leu
235 240 245 250
Leu Asn Gln Ile Ser Gly Leu Leu Glu Leu Glu Phe Asp Asp Cys Thr
255 260 265
Leu Asn Gly Val Gly Asn Phe Arg Ala Ser Asp Asn Asp Arg Val Ile
270 275 280
Asp Pro Gly Lys Val Glu Thr Leu Thr Ile Arg Arg Leu His Ile Pro
285 290 295
Arg Phe Tyr Leu Phe Tyr Asp Leu Ser Thr Leu Tyr Ser Leu Thr Glu
300 305 310
Arg Val Lys Arg Ile Thr Val Glu Asn Ser Lys Val Phe Leu Val Pro
315 320 325 330
Cys Leu Leu Ser Gln His Leu Lys Ser Leu Glu Tyr Leu Asp Leu Ser
335 340 345
Glu Asn Leu Met Val Glu Glu Tyr Leu Lys Asn Ser Ala Cys Glu Asp
350 355 360
Ala Trp Pro Ser Leu Gln Thr Leu Ile Leu Arg Gln Asn His Leu Ala
365 370 375
Ser Leu Glu Lys Thr Gly Glu Thr Leu Leu Thr Leu Lys Asn Leu Thr
380 385 390
Asn Ile Asp Ile Ser Lys Asn Ser Phe His Ser Met Pro Glu Thr Cys
395 400 405 410
Gln Trp Pro Glu Lys Met Lys Tyr Leu Asn Leu Ser Ser Thr Arg Ile
415 420 425
His Ser Val Thr Gly Cys Ile Pro Lys Thr Leu Glu Ile Leu Asp Val
430 435 440
Ser Asn Asn Asn Leu Asn Leu Phe Ser Leu Asn Leu Pro Gln Leu Lys
445 450 455
Glu Leu Tyr Ile Ser Arg Asn Lys Leu Met Thr Leu Pro Asp Ala Ser
460 465 470
Leu Leu Pro Met Leu Leu Val Leu Lys Ile Ser Arg Asn Ala Ile Thr
475 480 485 490
Thr Phe Ser Lys Glu Gln Leu Asp Ser Phe His Thr Leu Lys Thr Leu
495 500 505
Glu Ala Gly Gly Asn Asn Phe Ile Cys Ser Cys Glu Phe Leu Ser Phe
510 515 520
Thr Gln Glu Gln Gln Ala Leu Ala Lys Val Leu Ile Asp Trp Pro Ala
525 530 535
Asn Tyr Leu Cys Asp Ser Pro Ser His Val Arg Gly Gln Gln Val Gln
540 545 550
Asp Val Arg Leu Ser Val Ser Glu Cys His Arg Thr Ala Leu Val Ser
555 560 565 570
Gly Met Cys Cys Ala Leu Phe Leu Leu Ile Leu Leu Thr Gly Val Leu
575 580 585
Cys His Arg Phe His Gly Leu Trp Tyr Met Lys Met Met Trp Ala Trp
590 595 600
Leu Gln Ala Lys Arg Lys Pro Arg Lys Ala Pro Ser Arg Asn Ile Cys
605 610 615
Tyr Asp Ala Phe Val Ser Tyr Ser Glu Arg Asp Ala Tyr Trp Val Glu
620 625 630
Asn Leu Met Val Gln Glu Leu Glu Asn Phe Asn Pro Pro Phe Lys Leu
635 640 645 650
Cys Leu His Lys Arg Asp Phe Ile Pro Gly Lys Trp Ile Ile Asp Asn
655 660 665
Ile Ile Asp Ser Ile Glu Lys Ser His Lys Thr Val Phe Val Leu Ser
670 675 680
Glu Asn Phe Val Lys Ser Glu Trp Cys Lys Tyr Glu Leu Asp Phe Ser
685 690 695
His Phe Arg Leu Phe Glu Glu Asn Asn Asp Ala Ala Ile Leu Ile Leu
700 705 710
Leu Glu Pro Ile Glu Lys Lys Ala Ile Pro Gln Arg Phe Cys Lys Leu
715 720 725 730
Arg Lys Ile Met Asn Thr Lys Thr Tyr Leu Glu Trp Pro Met Asp Glu
735 740 745
Ala Gln Arg Glu Gly Phe Trp Val Asn Leu Arg Ala Ala Ile Lys Ser
750 755 760
<210>5
<211>2715
<212>DNA
<213>未知
<220>
<223>未知生物的说明:灵长类动物;推测为
人(Homo sapiens)
<220>
<221>CDS
<222>(1)..(2712)
<220>
<221>mat_肽
<222>(64)..(2712)
<400>5
atg aga cag act ttg cct tgt atc tac ttt tgg ggg ggc ctt ttg ccc 48
Met Arg Gln Thr Leu Pro Cys Ile Tyr Phe Trp Gly Gly Leu Leu Pro
-20 -15 -10
ttt ggg atg ctg tgt gca tcc tcc acc acc aag tgc act gtt agc cat 96
Phe Gly Met Leu Cys Ala Ser Ser Thr Thr Lys Cys Thr Val Ser His
-5 -1 1 5 10
gaa gtt gct gac tgc agc cac ctg aag ttg act cag gta ccc gat gat 144
Glu Val Ala Asp Cys Ser His Leu Lys Leu Thr Gln Val Pro Asp Asp
15 20 25
cta ccc aca aac ata aca gtg ttg aac ctt acc cat aat caa ctc aga 192
Leu Pro Thr Asn Ile Thr Val Leu Asn Leu Thr His Asn Gln Leu Arg
30 35 40
aga tta cca gcc gcc aac ttc aca agg tat agc cag cta act agc ttg 240
Arg Leu Pro Ala Ala Asn Phe Thr Arg Tyr Ser Gln Leu Thr Ser Leu
45 50 55
gat gta gga ttt aac acc atc tca aaa ctg gag cca gaa ttg tgc cag 288
Asp Val Gly Phe Asn Thr Ile Ser Lys Leu Glu Pro Glu Leu Cys Gln
60 65 70 75
aaa ctt ccc atg tta aaa gtt ttg aac ctc cag cac aat gag cta tct 336
Lys Leu Pro Met Leu Lys Val Leu Asn Leu Gln His Asn Glu Leu Ser
80 85 90
caa ctt tct gat aaa acc ttt gcc ttc tgc acg aat ttg act gaa ctc 384
Gln Leu Ser Asp Lys Thr Phe Ala Phe Cys Thr Asn Leu Thr Glu Leu
95 100 105
cat ctc atg tcc aac tca atc cag aaa att aaa aat aat ccc ttt gtc 432
His Leu Met Ser Asn Ser Ile Gln Lys Ile Lys Asn Asn Pro Phe Val
110 115 120
aag cag aag aat tta atc aca tta gat ctg tct cat aat ggc ttg tca 480
Lys Gln Lys Asn Leu Ile Thr Leu Asp Leu Ser His Asn Gly Leu Ser
125 130 135
tct aca aaa tta gga act cag gtt cag ctg gaa aat ctc caa gag ctt 528
Ser Thr Lys Leu Gly Thr Gln Val Gln Leu Glu Asn Leu Gln Glu Leu
140 145 150 155
cta tta tca aac aat aaa att caa gcg cta aaa agt gaa gaa ctg gat 576
Leu Leu Ser Asn Asn Lys Ile Gln Ala Leu Lys Ser Glu Glu Leu Asp
160 165 170
atc ttt gcc aat tca tct tta aaa aaa tta gag ttg tca tcg aat caa 624
Ile Phe Ala Asn Ser Ser Leu Lys Lys Leu Glu Leu Ser Ser Asn Gln
175 180 185
att aaa gag ttt tct cca ggg tgt ttt cac gca att gga aga tta ttt 672
Ile Lys Glu Phe Ser Pro Gly Cys Phe His Ala Ile Gly Arg Leu Phe
190 195 200
ggc ctc ttt ctg aac aat gtc cag ctg ggt ccc agc ctt aca gag aag 720
Gly Leu Phe Leu Asn Asn Val Gln Leu Gly Pro Ser Leu Thr Glu Lys
205 210 215
cta tgt ttg gaa tta gca aac aca agc att cgg aat ctg tct ctg agt 768
Leu Cys Leu Glu Leu Ala Asn Thr Ser Ile Arg Asn Leu Ser Leu Ser
220 225 230 235
aac agc cag ctg tcc acc acc agc aat aca act ttc ttg gga cta aag 816
Asn Ser Gln Leu Ser Thr Thr Ser Asn Thr Thr Phe Leu Gly Leu Lys
240 245 250
tgg aca aat ctc act atg ctc gat ctt tcc tac aac aac tta aat gtg 864
Trp Thr Asn Leu Thr Met Leu Asp Leu Ser Tyr Asn Asn Leu Asn Val
255 260 265
gtt ggt aac gat tcc ttt get tgg ctt cca caa cta gaa tat ttc ttc 912
Val Gly Asn Asp Ser Phe Ala Trp Leu Pro Gln Leu Glu Tyr Phe Phe
270 275 280
cta gag tat aat aat ata cag cat ttg ttt tct cac tct ttg cac ggg 960
Leu Glu Tyr Asn Asn Ile Gln His Leu Phe Ser His Ser Leu His Gly
285 290 295
ctt ttc aat gtg agg tac ctg aat ttg aaa cgg tct ttt act aaa caa 1008
Leu Phe Asn Val Arg Tyr Leu Asn Leu Lys Arg Ser Phe Thr Lys Gln
300 305 310 315
agt att tcc ctt gcc tca ctc ccc aag att gat gat ttt tct ttt cag 1056
Ser Ile Ser Leu Ala Ser Leu Pro Lys Ile Asp Asp Phe Ser Phe Gln
320 325 330
tgg cta aaa tgt ttg gag cac ctt aac atg gaa gat aat gat att cca 1104
Trp Leu Lys Cys Leu Glu His Leu Asn Met Glu Asp Asn Asp Ile Pro
335 340 345
ggc ata aaa agc aat atg ttc aca gga ttg ata aac ctg aaa tac tta 1152
Gly Ile Lys Ser Asn Met Phe Thr Gly Leu Ile Asn Leu Lys Tyr Leu
350 355 360
agt cta tcc aac tcc ttt aca agt ttg cga act ttg aca aat gaa aca 1200
Ser Leu Ser Asn Ser Phe Thr Ser Leu Arg Thr Leu Thr Asn Glu Thr
365 370 375
ttt gta tca ctt gct cat tct ccc tta cac ata ctc aac cta acc aag 1248
Phe Val Ser Leu Ala His Ser Pro Leu His Ile Leu Asn Leu Thr Lys
380 385 390 395
aat aaa atc tca aaa ata gag agt gat gct ttc tct tgg ttg ggc cac 1296
Asn Lys Ile Ser Lys Ile Glu Ser Asp Ala Phe Ser Trp Leu Gly His
400 405 410
cta gaa gta ctt gac ctg ggc ctt aat gaa att ggg caa gaa ctc aca 1344
Leu Glu Val Leu Asp Leu Gly Leu Asn Glu Ile Gly Gln Glu Leu Thr
415 420 425
ggc cag gaa tgg aga ggt cta gaa aat att ttc gaa atc tat ctt tcc 1392
Gly Gln Glu Trp Arg Gly Leu Glu Asn Ile Phe Glu Ile Tyr Leu Ser
430 435 440
tac aac aag tac ctg cag ctg act agg aac tcc ttt gcc ttg gtc cca 1440
Tyr Asn Lys Tyr Leu Gln Leu Thr Arg Asn Ser Phe Ala Leu Val Pro
445 450 455
agc ctt caa cga ctg atg ctc cga agg gtg gcc ctt aaa aat gtg gat 1488
Ser Leu Gln Arg Leu Met Leu Arg Arg Val Ala Leu Lys Asn Val Asp
460 465 470 475
agc tct cct tca cca ttc cag cct ctt cgt aac ttg acc att ctg gat 1536
Ser Ser Pro Ser Pro Phe Gln Pro Leu Arg Asn Leu Thr Ile Leu Asp
480 485 490
cta agc aac aac aac ata gcc aac ata aat gat gac atg ttg gag ggt 1584
Leu Ser Asn Asn Asn Ile Ala Asn Ile Asn Asp Asp Met Leu Glu Gly
495 500 505
ctt gag aaa cta gaa att ctc gat ttg cag cat aac aac tta gca cgg 1632
Leu Glu Lys Leu Glu Ile Leu Asp Leu Gln His Asn Asn Leu Ala Arg
510 515 520
ctc tgg aaa cac gca aac cct ggt ggt ccc att tat ttc cta aag ggt 1680
Leu Trp Lys His Ala Asn Pro Gly Gly Pro Ile Tyr Phe Leu Lys Gly
525 530 535
ctg tct cac ctc cac atc ctt aac ttg gag tcc aac ggc ttt gac gag 1728
Leu Ser His Leu His Ile Leu Asn Leu Glu Ser Asn Gly Phe Asp Glu
540 545 550 555
atc cca gtt gag gtc ttc aag gat tta ttt gaa cta aag atc atc gat 1776
Ile Pro Val Glu Val Phe Lys Asp Leu Phe Glu Leu Lys Ile Ile Asp
560 565 570
tta gga ttg aat aat tta aac aca ctt cca gca tct gtc ttt aat aat 1824
Leu Gly Leu Asn Asn Leu Asn Thr Leu Pro Ala Ser Val Phe Asn Asn
575 580 585
cag gtg tct cta aag tca ttg aac ctt cag aag aat ctc ata aca tcc 1872
Gln Val Ser Leu Lys Ser Leu Asn Leu Gln Lys Asn Leu Ile Thr Ser
590 595 600
gtt gag aag aag gtt ttc ggg cca gct ttc agg aac ctg act gag tta 1920
Val Glu Lys Lys Val Phe Gly Pro Ala Phe Arg Asn Leu Thr Glu Leu
605 610 615
gat atg cgc ttt aat ccc ttt gat tgc acg tgt gaa agt att gcc tgg 1968
Asp Met Arg Phe Asn Pro Phe Asp Cys Thr Cys Glu Ser Ile Ala Trp
620 625 630 635
ttt gtt aat tgg att aac gag acc cat acc aac atc cct gag ctg tca 2016
Phe Val Asn Trp Ile Asn Glu Thr His Thr Asn Ile Pro Glu Leu Ser
640 645 650
agc cac tac ctt tgc aac act cca cct cac tat cat ggg ttc cca gtg 2064
Ser His Tyr Leu Cys Asn Thr Pro Pro His Tyr His Gly Phe Pro Val
655 660 665
aga ctt ttt gat aca tca tct tgc aaa gac agt gcc ccc ttt gaa ctc 2112
Arg Leu Phe Asp Thr Ser Ser Cys Lys Asp Ser Ala Pro Phe Glu Leu
670 675 680
ttt ttc atg atc aat acc agt atc ctg ttg att ttt atc ttt att gta 2160
Phe Phe Met Ile Asn Thr Ser Ile Leu Leu Ile Phe Ile Phe Ile Val
685 690 695
ctt ctc atc cac ttt gag ggc tgg agg ata tct ttt tat tgg aat gtt 2208
Leu Leu Ile His Phe Glu Gly Trp Arg Ile Ser Phe Tyr Trp Asn Val
700 705 710 715
tca gta cat cga gtt ctt ggt ttc aaa gaa ata gac aga cag aca gaa 2256
Ser Val His Arg Val Leu Gly Phe Lys Glu Ile Asp Arg Gln Thr Glu
720 725 730
cag ttt gaa tat gca gca tat ata att cat gcc tat aaa gat aag gat 2304
Gln Phe Glu Tyr Ala Ala Tyr Ile Ile His Ala Tyr Lys Asp Lys Asp
735 740 745
tgg gtc tgg gaa cat ttc tct tca atg gaa aag gaa gac caa tct ctc 2352
Trp Val Trp Glu His Phe Ser Ser Met Glu Lys Glu Asp Gln Ser Leu
750 755 760
aaa ttt tgt ctg gaa gaa agg gac ttt gag gcg ggt gtt ttt gaa cta 2400
Lys Phe Cys Leu Glu Glu Arg Asp Phe Glu Ala Gly Val Phe Glu Leu
765 770 775
gaa gca att gtt aac agc atc aaa aga agc aga aaa att att ttt gtt 2448
Glu Ala Ile Val Asn Ser Ile Lys Arg Ser Arg Lys Ile Ile Phe Val
780 785 790 795
ata aca cac cat cta tta aaa gac cca tta tgc aaa aga ttc aag gta 2496
Ile Thr His His Leu Leu Lys Asp Pro Leu Cys Lys Arg Phe Lys Val
800 805 810
cat cat gca gtt caa caa gct att gaa caa aat ctg gat tcc att ata 2544
His His Ala Val Gln Gln Ala Ile Glu Gln Asn Leu Asp Ser Ile Ile
815 820 825
ttg gtt ttc ctt gag gag att cca gat tat aaa ctg aac cat gca ctc 2592
Leu Val Phe Leu Glu Glu Ile Pro Asp Tyr Lys Leu Asn His Ala Leu
830 835 840
tgt ttg cga aga gga atg ttt aaa tct cac tgc atc ttg aac tgg cca 2640
Cys Leu Arg Arg Gly Met Phe Lys Ser His Cys Ile Leu Asn Trp Pro
845 850 855
gtt cag aaa gaa cgg ata ggt gcc ttt cgt cat aaa ttg caa gta gca 2688
Val Gln Lys Glu Arg Ile Gly Ala Phe Arg His Lys Leu Gln Val Ala
860 865 870 875
ctt gga tcc aaa aac tct gta cat taa 2715
Leu Gly Ser Lys Asn Ser Val His
880
<210>6
<211>904
<212>PRT
<213>未知
<400>6
Met Arg Gln Thr Leu Pro Cys Ile Tyr Phe Trp Gly Gly Leu Leu Pro
-20 -15 -10
Phe Gly Met Leu Cys Ala Ser Ser Thr Thr Lys Cys Thr Val Ser His
-5 -1 1 5 10
Glu Val Ala Asp Cys Ser His Leu Lys Leu Thr Gln Val Pro Asp Asp
15 20 25
Leu Pro Thr Asn Ile Thr Val Leu Asn Leu Thr His Asn Gln Leu Arg
30 35 40
Arg Leu Pro Ala Ala Asn Phe Thr Arg Tyr Ser Gln Leu Thr Ser Leu
45 50 55
Asp Val Gly Phe Asn Thr Ile Ser Lys Leu Glu Pro Glu Leu Cys Gln
60 65 70 75
Lys Leu Pro Met Leu Lys Val Leu Asn Leu Gln His Asn Glu Leu Ser
80 85 90
Gln Leu Ser Asp Lys Thr Phe Ala Phe Cys Thr Asn Leu Thr Glu Leu
95 100 105
His Leu Met Ser Asn Ser Ile Gln Lys Ile Lys Asn Asn Pro Phe Val
110 115 120
Lys Gln Lys Asn Leu Ile Thr Leu Asp Leu Ser His Asn Gly Leu Ser
125 130 135
Ser Thr Lys Leu Gly Thr Gln Val Gln Leu Glu Asn Leu Gln Glu Leu
140 145 150 155
Leu Leu Ser Asn Asn Lys Ile Gln Ala Leu Lys Ser Glu Glu Leu Asp
160 165 170
Ile Phe Ala Asn Ser Ser Leu Lys Lys Leu Glu Leu Ser Ser Asn Gln
175 180 185
Ile Lys Glu Phe Ser Pro Gly Cys Phe His Ala Ile Gly Arg Leu Phe
190 195 200
Gly Leu Phe Leu Asn Asn Val Gln Leu Gly Pro Ser Leu Thr Glu Lys
205 210 215
Leu Cys Leu Glu Leu Ala Asn Thr Ser Ile Arg Asn Leu Ser Leu Ser
220 225 230 235
Asn Ser Gln Leu Ser Thr Thr Ser Asn Thr Thr Phe Leu Gly Leu Lys
240 245 250
Trp Thr Asn Leu Thr Met Leu Asp Leu Ser Tyr Asn Asn Leu Asn Val
255 260 265
Val Gly Asn Asp Ser Phe Ala Trp Leu Pro Gln Leu Glu Tyr Phe Phe
270 275 280
Leu Glu Tyr Asn Asn Ile Gln His Leu Phe Ser His Ser Leu His Gly
285 290 295
Leu Phe Asn Val Arg Tyr Leu Asn Leu Lys Arg Ser Phe Thr Lys Gln
300 305 310 315
Ser lle Ser Leu Ala Ser Leu Pro Lys Ile Asp Asp Phe Ser Phe Gln
320 325 330
Trp Leu Lys Cys Leu Glu His Leu Asn Met Glu Asp Asn Asp Ile Pro
335 340 345
Gly Ile Lys Ser Asn Met Phe Thr Gly Leu Ile Asn Leu Lys Tyr Leu
350 355 360
Ser Leu Ser Asn Ser Phe Thr Ser Leu Arg Thr Leu Thr Asn Glu Thr
365 370 375
Phe Val Ser Leu Ala His Ser Pro Leu His Ile Leu Asn Leu Thr Lys
380 385 390 395
Asn Lys Ile Ser Lys Ile Glu Ser Asp Ala Phe Ser Trp Leu Gly His
400 405 410
Leu Glu Val Leu Asp Leu Gly Leu Asn Glu Ile Gly Gln Glu Leu Thr
415 420 425
Gly Gln Glu Trp Arg Gly Leu Glu Asn Ile Phe Glu Ile Tyr Leu Ser
430 435 440
Tyr Asn Lys Tyr Leu Gln Leu Thr Arg Asn Ser Phe Ala Leu Val Pro
445 450 455
Ser Leu Gln Arg Leu Mer Leu Arg Arg Val Ala Leu Lys Asn Val Asp
460 465 470 475
Ser Ser Pro Ser Pro Phe Gln Pro Leu Arg Asn Leu Thr Ile Leu Asp
480 485 490
Leu Ser Asn Asn Asn Ile Ala Asn Ile Asn Asp Asp Met Leu Glu Gly
495 500 505
Leu Glu Lys Leu Glu Ile Leu Asp Leu Gln His Asn Asn Leu Ala Arg
510 515 520
Leu Trp Lys His Ala Asn Pro Gly Gly Pro Ile Tyr Phe Leu Lys Gly
525 530 535
Leu Ser His Leu His Ile Leu Asn Leu Glu Ser Asn Gly Phe Asp Glu
540 545 550 555
Ile Pro Val Glu Val Phe Lys Asp Leu Phe Glu Leu Lys Ile Ile Asp
560 565 570
Leu Gly Leu Asn Asn Leu Asn Thr Leu Pro Ala Ser Val Phe Asn Asn
575 580 585
Gln Val Ser Leu Lys Ser Leu Asn Leu Gln Lys Asn Leu Ile Thr Ser
590 595 600
Val Glu Lys Lys Val Phe Gly Pro Ala Phe Arg Asn Leu Thr Glu Leu
605 610 615
Asp Met Arg Phe Asn Pro Phe Asp Cys Thr Cys Glu Ser Ile Ala Trp
620 625 630 635
Phe Val Asn Trp Ile Asn Glu Thr His Thr Asn Ile Pro Glu Leu Ser
640 645 650
Ser His Tyr Leu Cys Asn Thr Pro Pro His Tyr His Gly Phe Pro Val
655 660 665
Arg Leu Phe Asp Thr Ser Ser Cys Lys Asp Ser Ala Pro Phe Glu Leu
670 675 680
Phe Phe Met Ile Asn Thr Ser Ile Leu Leu Ile Phe Ile Phe Ile Val
685 690 695
Leu Leu Ile His Phe Glu Gly Trp Arg Ile Ser Phe Tyr Trp Asn Val
700 705 710 715
Ser Val His Arg Val Leu Gly Phe Lys Glu Ile Asp Arg Gln Thr Glu
720 725 730
Gln Phe Glu Tyr Ala Ala Tyr Ile Ile His Ala Tyr Lys Asp Lys Asp
735 740 745
Trp Val Trp Glu His Phe Ser Ser Met Glu Lys Glu Asp Gln Ser Leu
750 755 760
Lys Phe Cys Leu Glu Glu Arg Asp Phe Glu Ala Gly Val Phe Glu Leu
765 770 775
Glu Ala Ile Val Asn Ser Ile Lys Arg Ser Arg Lys Ile Ile Phe Val
780 785 790 795
Ile Thr His His Leu Leu Lys Asp Pro Leu Cys Lys Arg Phe Lys Val
800 805 810
His His Ala Val Gln Gln Ala Ile Glu Gln Asn Leu Asp Ser Ile Ile
815 820 825
Leu Val Phe Leu Glu Glu Ile Pro Asp Tyr Lys Leu Asn His Ala Leu
830 835 840
Cys Leu Arg Arg Gly Met Phe Lys Ser His Cys Ile Leu Asn Trp Pro
845 850 855
Val Gln Lys Glu Arg Ile Gly Ala Phe Arg His Lys Leu Gln Val Ala
860 865 870 875
Leu Gly Ser Lys Asn Ser Val His
880
<210>7
<211>2400
<212>DNA
<213>未知
<220>
<223>未知生物的说明:灵长类动物;推测为
人(Homo sapiens)
<220>
<221>CDS
<222>(1)..(2397)
<400>7
atg gag ctg aat ttc tac aaa atc ccc gac aac ctc ccc ttc tca acc 48
Met Glu Leu Asn Phe Tyr Lys Ile Pro Asp Asn Leu Pro Phe Ser Thr
1 5 10 15
aag aac ctg gac ctg agc ttt aat ccc ctg agg cat tta ggc agc tat 96
Lys Asn Leu Asp Leu Ser Phe Asn Pro Leu Arg His Leu Gly Ser Tyr
20 25 30
agc ttc ttc agt ttc cca gaa ctg cag gtg ctg gat tta tcc agg tgt 144
Ser Phe Phe Ser Phe Pro Glu Leu Gln Val Leu Asp Leu Ser Arg Cys
35 40 45
gaa atc cag aca att gaa gat ggg gca tat cag agc cta agc cac ctc 192
Glu Ile Gln Thr Ile Glu Asp Gly Ala Tyr Gln Ser Leu Ser His Leu
50 55 60
tct acc tta ata ttg aca gga aac ccc atc cag agt tta gcc ctg gga 240
Ser Thr Leu Ile Leu Thr Gly Asn Pro Ile Gln Ser Leu Ala Leu Gly
65 70 75 80
gcc ttt tct gga cta tca agt tta cag aag ctg gtg gct gtg gag aca 288
Ala Phe Ser Gly Leu Ser Ser Leu Gln Lys Leu Val Ala Val Glu Thr
85 90 95
aat cta gca tct cta gag aac ttc ccc att gga cat ctc aaa act ttg 336
Asn Leu Ala Ser Leu Glu Asn Phe Pro Ile Gly His Leu Lys Thr Leu
100 105 110
aaa gaa ctt aat gtg gct cac aat ctt atc caa tct ttc aaa tta cct 384
Lys Glu Leu Asn Val Ala His Asn Leu Ile Gln Ser Phe Lys Leu Pro
115 120 125
gag tat ttt tct aat ctg acc aat cta gag cac ttg gac ctt tcc agc 432
Glu Tyr Phe Ser Asn Leu Thr Asn Leu Glu His Leu Asp Leu Ser Ser
130 135 140
aac aag att caa agt att tat tgc aca gac ttg cgg gtt cta cat caa 480
Asn Lys Ile Gln Ser Ile Tyr Cys Thr Asp Leu Arg Val Leu His Gln
145 150 155 160
atg ccc cta ctc aat ctc tct tta gac ctg tcc ctg aac cct atg aac 528
Met Pro Leu Leu Asn Leu Ser Leu Asp Leu Ser Leu Asn Pro Met Asn
165 170 175
ttt atc caa cca ggt gca ttt aaa gaa att agg ctt cat aag ctg act 576
Phe Ile Gln Pro Gly Ala Phe Lys Glu Ile Arg Leu His Lys Leu Thr
180 185 190
tta aga aat aat ttt gat agt tta aat gta atg aaa act tgt att caa 624
Leu Arg Asn Asn Phe Asp Ser Leu Asn Val Met Lys Thr Cys Ile Gln
195 200 205
ggt ctg gct ggt tta gaa gtc cat cgt ttg gtt ctg gga gaa ttt aga 672
Gly Leu Ala Gly Leu Glu Val His Arg Leu Val Leu Gly Glu Phe Arg
210 215 220
aat gaa gga aac ttg gaa aag ttt gac aaa tct gct cta gag ggc ctg 720
Asn Glu Gly Asn Leu Glu Lys Phe Asp Lys Ser Ala Leu Glu Gly Leu
225 230 235 240
tgc aat ttg acc att gaa gaa ttc cga tta gca tac tta gac tac tac 768
Cys Asn Leu Thr Ile Glu Glu Phe Arg Leu Ala Tyr Leu Asp Tyr Tyr
245 250 255
ctc gat gat att att gac tta ttt aat tgt ttg aca aat gtt tct tca 816
Leu Asp Asp Ile Ile Asp Leu Phe Asn Cys Leu Thr Asn Val Ser Ser
260 265 270
ttt tcc ctg gtg agt gtg act att gaa agg gta aaa gac ttt tct tat 864
Phe Ser Leu Val Ser Val Thr Ile Glu Arg Val Lys Asp Phe Ser Tyr
275 280 285
aat ttc gga tgg caa cat tta gaa tta gtt aac tgt aaa ttt gga cag 912
Asn Phe Gly Trp Gln His Leu Glu Leu Val Asn Cys Lys Phe Gly Gln
290 295 300
ttt ccc aca ttg aaa ctc aaa tct ctc aaa agg ctt act ttc act tcc 960
Phe Pro Thr Leu Lys Leu Lys Ser Leu Lys Arg Leu Thr Phe Thr Ser
305 310 315 320
aac aaa ggt ggg aat gct ttt tca gaa gtt gat cta cca agc ctt gag 1008
Asn Lys Gly Gly Asn Ala Phe Ser Glu Val Asp Leu Pro Ser Leu Glu
325 330 335
ttt cta gat ctc agt aga aat ggc ttg agt ttc aaa ggt tgc tgt tct 1056
Phe Leu Asp Leu Ser Arg Asn Gly Leu Ser Phe Lys Gly Cys Cys Ser
340 345 350
caa agt gat ttt ggg aca acc agc cta aag tat tta gat ctg agc ttc 1104
Gln Ser Asp Phe Gly Thr Thr Ser Leu Lys Tyr Leu Asp Leu Ser Phe
355 360 365
aat ggt gtt att acc atg agt tca aac ttc ttg ggc tta gaa caa cta 1152
Asn Gly Val Ile Thr Met Ser Ser Asn Phe Leu Gly Leu Glu Gln Leu
370 375 380
gaa cat ctg gat ttc cag cat tcc aat ttg aaa caa atg agt gag ttt 1200
Glu His Leu Asp Phe Gln His Ser Asn Leu Lys Gln Met Ser Glu Phe
385 390 395 400
tca gta ttc cta tca ctc aga aac ctc att tac ctt gac att tct cat 1248
Ser Val Phe Leu Ser Leu Arg Asn Leu Ile Tyr Leu Asp Ile Ser His
405 410 415
act cac acc aga gtt gct ttc aat ggc atc ttc aat ggc ttg tcc agt 1296
Thr His Thr Arg Val Ala Phe Asn Gly Ile Phe Asn Gly Leu Ser Ser
420 425 430
ctc gaa gtc ttg aaa atg gct ggc aat tct ttc cag gaa aac ttc ctt 1344
Leu Glu Val Leu Lys Met Ala Gly Asn Ser Phe Gln Glu Asn Phe Leu
435 440 445
cca gat atc ttc aca gag ctg aga aac ttg acc ttc ctg gac ctc tct 1392
Pro Asp Ile Phe Thr Glu Leu Arg Asn Leu Thr Phe Leu Asp Leu Ser
450 455 460
cag tgt caa ctg gag cag ttg tct cca aca gca ttt aac tca ctc tcc 1440
Gln Cys Gln Leu Glu Gln Leu Ser Pro Thr Ala Phe Asn Ser Leu Ser
465 470 475 480
agt ctt cag gta cta aat atg agc cac aac aac ttc ttt tca ttg gat 1488
Ser Leu Gln Val Leu Asn Met Ser His Asn Asn Phe Phe Ser Leu Asp
485 490 495
acg ttt cct tat aag tgt ctg aac tcc ctc cag gtt ctt gat tac agt 1536
Thr Phe Pro Tyr Lys Cys Leu Asn Ser Leu Gln Val Leu Asp Tyr Ser
500 505 510
ctc aat cac ata atg act tcc aaa aaa cag gaa cta cag cat ttt cca 1584
Leu Asn His Ile Met Thr Ser Lys Lys Gln Glu Leu Gln His Phe Pro
515 520 525
agt agt cta gct ttc tta aat ctt act cag aat gac ttt gct tgt act 1632
Ser Ser Leu Ala Phe Leu Asn Leu Thr Gln Asn Asp Phe Ala Cys Thr
530 535 540
tgt gaa cac cag agt ttc ctg caa tgg atc aag gac cag agg cag ctc 1680
Cys Glu His Gln Ser Phe Leu Gln Trp Ile Lys Asp Gln Arg Gln Leu
545 550 555 560
ttg gtg gaa gtt gaa cga atg gaa tgt gca aca cct tca gat aag cag 1728
Leu Val Glu Val Glu Arg Met Glu Cys Ala Thr Pro Ser Asp Lys Gln
565 570 575
ggc atg cct gtg ctg agt ttg aat atc acc tgt cag atg aat aag acc 1776
Gly Met Pro Val Leu Ser Leu Asn Ile Thr Cys Gln Met Asn Lys Thr
580 585 590
atc att ggt gtg tcg gtc ctc agt gtg ctt gta gta tct gtt gta gca 1824
Ile Ile Gly Val Ser Val Leu Ser Val Leu Val Val Ser Val Val Ala
595 600 605
gtt ctg gtc tat aag ttc tat ttt cac ctg atg ctt ctt gct ggc tgc 1872
Val Leu Val Tyr Lys Phe Tyr Phe His Leu Met Leu Leu Ala Gly Cys
610 615 620
ata aag tat ggt aga ggt gaa aac atc tat gat gcc ttt gtt atc tac 1920
Ile Lys Tyr Gly Arg Gly Glu Asn Ile Tyr Asp Ala Phe Val Ile Tyr
625 630 635 640
tca agc cag gat gag gac tgg gta agg aat gag cta gta aag aat tta 1968
Ser Ser Gln Asp Glu Asp Trp Val Arg Asn Glu Leu Val Lys Asn Leu
645 650 655
gaa gaa ggg gtg cct cca ttt cag ctc tgc ctt cac tac aga gac ttt 2016
Glu Glu Gly Val Pro Pro Phe Gln Leu Cys Leu His Tyr Arg Asp Phe
660 665 670
att ccc ggt gtg gcc att gct gcc aac atc atc cat gaa ggt ttc cat 2064
Ile Pro Gly Val Ala Ile Ala Ala Asn Ile Ile His Glu Gly Phe His
675 680 685
aaa agc cga aag gtg att gtt gtg gtg tcc cag cac ttc atc cag agc 2112
Lys Ser Arg Lys Val Ile Val Val Val Ser Gln His Phe Ile Gln Ser
690 695 700
cgc tgg tgt atc ttt gaa tat gag att gct cag acc tgg cag ttt ctg 2160
Arg Trp Cys Ile Phe Glu Tyr Glu Ile Ala Gln Thr Trp Gln Phe Leu
705 710 715 720
agc agt cgt gct ggt atc atc ttc att gtc ctg cag aag gtg gag aag 2208
Ser Ser Arg Ala Gly Ile Ile Phe Ile Val Leu Gln Lys Val Glu Lys
725 730 735
acc ctg ctc agg cag cag gtg gag ctg tac cgc ctt ctc agc agg aac 2256
Thr Leu Leu Arg Gln Gln Val Glu Leu Tyr Arg Leu Leu Ser Arg Asn
740 745 750
act tac ctg gag tgg gag gac agt gtc ctg ggg cgg cac atc ttc tgg 2304
Thr Tyr Leu Glu Trp Glu Asp Ser Val Leu Gly Arg His Ile Phe Trp
755 760 765
aga cga ctc aga aaa gcc ctg ctg gat ggt aaa tca tgg aat cca gaa 2352
Arg Arg Leu Arg Lys Ala Leu Leu Asp Gly Lys Ser Trp Asn Pro Glu
770 775 780
gga aca gtg ggt aca gga tgc aat tgg cag gaa gca aca tct atc tga 2400
Gly Thr Val Gly Thr Gly Cys Asn Trp Gln Glu Ala Thr Ser Ile
785 790 795
<210>8
<211>799
<212>PRT
<213>未知
<400>8
Met Glu Leu Asn Phe Tyr Lys Ile Pro Asp Asn Leu Pro Phe Ser Thr
1 5 10 15
Lys Asn Leu Asp Leu Ser Phe Asn Pro Leu Arg His Leu Gly Ser Tyr
20 25 30
Ser Phe Phe Ser Phe Pro Glu Leu Gln Val Leu Asp Leu Ser Arg Cys
35 40 45
Glu Ile Gln Thr Ile Glu Asp Gly Ala Tyr Gln Ser Leu Ser His Leu
50 55 60
Ser Thr Leu Ile Leu Thr Gly Asn Pro Ile Gln Ser Leu Ala Leu Gly
65 70 75 80
Ala Phe Ser Gly Leu Ser Ser Leu Gln Lys Leu Val Ala Val Glu Thr
85 90 95
Asn Leu Ala Ser Leu Glu Asn Phe Pro Ile Gly His Leu Lys Thr Leu
100 105 110
Lys Glu Leu Asn Val Ala His Asn Leu Ile Gln Ser Phe Lys Leu Pro
115 120 125
Glu Tyr Phe Ser Asn Leu Thr Asn Leu Glu His Leu Asp Leu Ser Ser
130 135 140
Asn Lys Ile Gln Ser Ile Tyr Cys Thr Asp Leu Arg Val Leu His Gln
145 150 155 160
Met Pro Leu Leu Asn Leu Ser Leu Asp Leu Ser Leu Asn Pro Met Asn
165 170 175
Phe Ile Gln Pro Gly Ala Phe Lys Glu Ile Arg Leu His Lys Leu Thr
180 185 190
Leu Arg Asn Asn Phe Asp Ser Leu Asn Val Met Lys Thr Cys Ile Gln
195 200 205
Gly Leu Ala Gly Leu Glu Val His Arg Leu Val Leu Gly Glu Phe Arg
210 215 220
Asn Glu Gly Asn Leu Glu Lys Phe Asp Lys Ser Ala Leu Glu Gly Leu
225 230 235 240
Cys Asn Leu Thr Ile Glu Glu Phe Arg Leu Ala Tyr Leu Asp Tyr Tyr
245 250 255
Leu Asp Asp Ile Ile Asp Leu Phe Asn Cys Leu Thr Asn Val Ser Ser
260 265 270
Phe Ser Leu Val Sar Val Thr Ile Glu Arg Val Lys Asp Phe Ser Tyr
275 280 285
Asn Phe Gly Trp Gln His Leu Glu Leu Val Asn Cys Lys Phe Gly Gln
290 295 300
Phe Pro Thr Leu Lys Leu Lys Ser Leu Lys Arg Leu Thr Phe Thr Ser
305 310 315 320
Asn Lys Gly Gly Asn Ala Phe Ser Glu Val Asp Leu Pro Ser Leu Glu
325 330 335
Phe Leu Asp Leu Ser Arg Asn Gly Leu Ser Phe Lys Gly Cys Cys Ser
340 345 350
Gln Ser Asp Phe Gly Thr Thr Ser Leu Lys Tyr Leu Asp Leu Ser Phe
355 360 365
Asn Gly Val Ile Thr Met Ser Ser Asn Phe Leu Gly Leu Glu Gln Leu
370 375 380
Glu His Leu Asp Phe Gln His Ser Asn Leu Lys Gln Met Ser Glu Phe
385 390 395 400
Ser Val Phe Leu Ser Leu Arg Asn Leu Ile Tyr Leu Asp Ile Ser His
405 410 415
Thr His Thr Arg Val Ala Phe Asn Gly Ile Phe Asn Gly Leu Ser Ser
420 425 430
Leu Glu Val Leu Lys Met Ala Gly Asn Ser Phe Gln Glu Asn Phe Leu
435 440 445
Pro Asp Ile Phe Thr Glu Leu Arg Asn Leu Thr Phe Leu Asp Leu Ser
450 455 460
Gln Cys Gln Leu Glu Gln Leu Ser Pro Thr Ala Phe Asn Ser Leu Ser
465 470 475 480
Ser Leu Gln Val Leu Asn Met Ser His Asn Asn Phe Phe Ser Leu Asp
485 490 495
Thr Phe Pro Tyr Lys Cys Leu Asn Ser Leu Gln Val Leu Asp Tyr Ser
500 505 510
Leu Asn His Ile Met Thr Ser Lys Lys Gln Glu Leu Gln His Phe Pro
515 520 525
Ser Ser Leu Ala Phe Leu Asn Leu Thr Gln Asn Asp Phe Ala Cys Thr
530 535 540
Cys Glu His Gln Ser Phe Leu Gln Trp Ile Lys Asp Gln Arg Gln Leu
545 550 555 560
Leu Val Glu Val Glu Arg Met Glu Cys Ala Thr Pro Ser Asp Lys Gln
565 570 575
Gly Met Pro Val Leu Ser Leu Asn Ile Thr Cys Gln Met Asn Lys Thr
580 585 590
Ile Ile Gly Val Ser Val Leu Ser Val Leu Val Val Ser Val Val Ala
595 600 605
Val Leu Val Tyr Lys Phe Tyr Phe His Leu Met Leu Leu Ala Gly Cys
610 615 620
Ile Lys Tyr Gly Arg Gly Glu Asn Ile Tyr Asp Ala Phe Val Ile Tyr
625 630 635 640
Ser Ser Gln Asp Glu Asp Trp Val Arg Asn Glu Leu Val Lys Asn Leu
645 650 655
Glu Glu Gly Val Pro Pro Phe Gln Leu Cys Leu His Tyr Arg Asp Phe
660 665 670
Ile Pro Gly Val Ala Ile Ala Ala Asn Ile Ile His Glu Gly Phe His
675 680 685
Lys Ser Arg Lys Val Ile Val Val Val Ser Gln His Phe Ile Gln Ser
690 695 700
Arg Trp Cys Ile Phe Glu Tyr Glu Ile Ala Gln Thr Trp Gln Phe Leu
705 710 715 720
ser Ser Arg Ala Gly Ile Ile Phe Ile Val Leu Gln Lys Val Glu Lys
725 730 735
Thr Leu Leu Arg Gln Gln Val Glu Leu Tyr Arg Leu Leu Ser Arg Asn
740 745 750
Thr Tyr Leu Glu Trp Glu Asp Ser Val Leu Gly Arg His Ile Phe Trp
755 760 765
Arg Arg Leu Arg Lys Ala Leu Leu Asp Gly Lys Ser Trp Asn Pro Glu
770 775 780
Gly Thr Val Gly Thr Gly Cys Asn Trp Gln Glu Ala Thr Ser Ile
785 790 795
<210>9
<211>1275
<212>DNA
<213>未知
<220>
<223>未知生物的说明:灵长类动物;推测为
人(Homo sapiens)
<220>
<221>CDS
<222>(1)..(1095)
<400>9
tgt tgg gat gtt ttt gag gga ctt tct cat ctt caa gtt ctg tat ttg 48
Cys Trp Asp Val Phe Glu Gly Leu Ser His Leu Gln Val Leu Tyr Leu
1 5 10 15
aat cat aac tat ctt aat tcc ctt cca cca gga gta ttt agc cat ctg 96
Asn His Asn Tyr Leu Asn Ser Leu Pro Pro Gly Val Phe Ser His Leu
20 25 30
act gca tta agg gga cta agc ctc aac tcc aac agg ctg aca gtt ctt 144
Thr Ala Leu Arg Gly Leu Ser Leu Asn Ser Asn Arg Leu Thr Val Leu
35 40 45
tct cac aat gat tta cct gct aat tta gag atc ctg gac ata tcc agg 192
Ser His Asn Asp Leu Pro Ala Asn Leu Glu Ile Leu Asp Ile Ser Arg
50 55 60
aac cag ctc cta gct cct aat cct gat gta ttt gta tca ctt agt gtc 240
Asn Gln Leu Leu Ala Pro Asn Pro Asp Val Phe Val Ser Leu Ser Val
65 70 75 80
ttg gat ata act cat aac aag ttc att tgt gaa tgt gaa ctt agc act 288
Leu Asp Ile Thr His Asn Lys Phe Ile Cys Glu Cys Glu Leu Ser Thr
85 90 95
ttt atc aat tgg ctt aat cac acc aat gtc act ata gct ggg cct cct 336
Phe Ile Asn Trp Leu Asn His Thr Asn Val Thr Ile Ala Gly Pro Pro
100 105 110
gca gac ata tat tgt gtg tac cct gac tcg ttc tct ggg gtt tcc ctc 384
Ala Asp Ile Tyr Cys Val Tyr Pro Asp Ser Phe Ser Gly Val Ser Leu
115 120 125
ttc tct ctt tcc acg gaa ggt tgt gat gaa gag gaa gtc tta aag tcc 432
Phe Ser Leu Ser Thr Glu Gly Cys Asp Glu Glu Glu Val Leu Lys Ser
130 135 140
cta aag ttc tcc ctt ttc att gta tgc act gtc act ctg act ctg ttc 480
Leu Lys Phe Set Leu Phe Ile Val Cys Thr Val Thr Leu Thr Leu Phe
145 150 155 160
ctc atg acc atc ctc aca gtc aca aag ttc cgg ggc ttc tgt ttt atc 528
Leu Met Thr Ile Leu Thr Val Thr Lys Phe Arg Gly Phe Cys Phe Ile
165 170 175
tgt tat aag aca gcc cag aga ctg gtg ttc aag gac cat ccc cag ggc 576
Cys Tyr Lys Thr Ala Gln Arg Leu Val Phe Lys Asp His Pro Gln Gly
180 185 190
aca gaa cct gat atg tac aaa tat gat gcc tat ttg tgc ttc agc agc 624
Thr Glu Pro Asp Met Tyr Lys Tyr Asp Ala Tyr Leu Cys Phe Ser Ser
195 200 205
aaa gac ttc aca tgg gtg cag aat gct ttg ctc aaa cac ctg gac act 672
Lys Asp Phe Thr Trp Val Gln Asn Ala Leu Leu Lys His Leu Asp Thr
210 215 220
caa tac agt gac caa aac aga ttc aac ctg tgc ttt gaa gaa aga gac 720
Gln Tyr Ser Asp Gln Asn Arg Phe Asn Leu Cys Phe Glu Glu Arg Asp
225 230 235 240
ttt gtc cca gga gaa aac cgc att gcc aat atc cag gat gcc atc tgg 768
Phe Val Pro Gly Glu Asn Arg Ile Ala Asn Ile Gln Asp Ala Ile Trp
245 250 255
aac agt aga aag atc gtt tgt ctt gtg agc aga cac ttc ctt aga gat 816
Asn Ser Arg Lys Ile Val Cys Leu Val Ser Arg His Phe Leu Arg Asp
260 265 270
ggc tgg tgc ctt gaa gcc ttc agt tat gcc cag ggc agg tgc tta tct 864
Gly Trp Cys Leu Glu Ala Phe Ser Tyr Ala Gln Gly Arg Cys Leu Ser
275 280 285
gac ctt aac agt gct ctc atc atg gtg gtg gtt ggg tcc ttg tcc cag 912
Asp Leu Asn Ser Ala Leu Ile Met Val Val Val Gly Ser Leu Ser Gln
290 295 300
tac cag ttg atg aaa cat caa tcc atc aga ggc ttt gta cag aaa cag 960
Tyr Gln Leu Met Lys His Gln Ser Ile Arg Gly Phe Val Gln Lys Gln
305 310 315 320
cag tat ttg agg tgg cct gag gat ctc cag gat gtt ggc tgg ttt ctt 1008
Gln Tyr Leu Arg Trp Pro Glu Asp Leu Gln Asp Val Gly Trp Phe Leu
325 330 335
cat aaa ctc tct caa cag ata cta aag aaa gaa aag gaa aag aag aaa 1056
His Lys Leu Ser Gln Gln Ile Leu Lys Lys Glu Lys Glu Lys Lys Lys
340 345 350
gac aat aac att ccg ttg caa act gta gca acc atc tcc taatcaaagg 1105
Asp Asn Asn Ile Pro Leu Gln Thr Val Ala Thr Ile Ser
355 360 365
agcaatttcc aacttatctc aagccacaaa taactcttca ctttgtattt gcaccaagtt 1165
atcattttgg ggtcctctct ggaggttttt tttttctttt tgctactatg aaaacaacat 1225
aaatctctca attttcgtat caaaaaaaaa aaaaaaaaaa tggcggccgc 1275
<210>10
<211>365
<212>PRT
<213>未知
<400>10
Cys Trp Asp Val Phe Glu Gly Leu Ser His Leu Gln Val Leu Tyr Leu
1 5 10 15
Asn His Asn Tyr Leu Asn Ser Leu Pro Pro Gly Val Phe Ser His Leu
20 25 30
Thr Ala Leu Arg Gly Leu Ser Leu Asn Ser Asn Arg Leu Thr Val Leu
35 40 45
Ser His Asn Asp Leu Pro Ala Asn Leu Glu Ile Leu Asp Ile Ser Arg
50 55 60
Asn Gln Leu Leu Ala Pro Asn Pro Asp Val Phe Val Ser Leu Ser Val
65 70 75 80
Leu Asp Ile Thr His Asn Lys Phe Ile Cys Glu Cys Glu Leu Ser Thr
85 90 95
Phe Ile Asn Trp Leu Asn His Thr Asn Val Thr Ile Ala Gly Pro Pro
100 105 110
Ala Asp Ile Tyr Cys Val Tyr Pro Asp Ser Phe Ser Gly Val Ser Leu
115 120 125
Phe Ser Leu Ser Thr Glu Gly Cys Asp Glu Glu Glu Val Leu Lys Ser
130 135 140
Leu Lys Phe Ser Leu Phe Ile Val Cys Thr Val Thr Leu Thr Leu Phe
145 150 155 160
Leu Met Thr Ile Leu Thr Val Thr Lys Phe Arg Gly Phe Cys Phe Ile
165 170 175
Cys Tyr Lys Thr Ala Gln Arg Leu Val Phe Lys Asp His Pro Gln Gly
180 185 190
Thr Glu Pro Asp Met Tyr Lys Tyr Asp Ala Tyr Leu Cys Phe Ser Ser
195 200 205
Lys Asp Phe Thr Trp Val Gln Asn Ala Leu Leu Lys His Leu Asp Thr
210 215 220
Gln Tyr Ser Asp Gln Asn Arg Phe Asn Leu Cys Phe Glu Glu Arg Asp
225 230 235 240
Phe Val Pro Gly Glu Asn Arg Ile Ala Asn Ile Gln Asp Ala Ile Trp
245 250 255
Asn Ser Arg Lys Ile Val Cys Leu Val Ser Arg His Phe Leu Arg Asp
260 265 270
Gly Trp Cys Leu Glu Ala Phe Ser Tyr Ala Gln Gly Arg Cys Leu Ser
275 280 285
Asp Leu Asn Ser Ala Leu Ile Met Val Val Val Gly Ser Leu Ser Gln
290 295 300
Tyr Gln Leu Met Lys His Gln Ser Ile Arg Gly Phe Val Gln Lys Gln
305 310 315 320
Gln Tyr Leu Arg Trp Pro Glu Asp Leu Gln Asp Val Gly Trp Phe Leu
325 330 335
His Lys Leu Ser Gln Gln Ile Leu Lys Lys Glu Lys Glu Lys Lys Lys
340 345 350
Asp Asn Asn Ile Pro Leu Gln Thr Val Ala Thr Ile Ser
355 360 365
<210>11
<211>3138
<212>DNA
<213>未知
<220>
<223>未知生物的说明:灵长类动物;推测为
人(Homo sapiens)
<220>
<221>CDS
<222>(1)..(3135)
<220>
<221>mat_肽
<222>(67)..(3135)
<400>11
atg tgg aca ctg aag aga cta att ctt atc ctt ttt aac ata atc cta 48
Met Trp Thr Leu Lys Arg Leu Ile Leu Ile Leu Phe Asn Ile Ile Leu
-20 -15 -10
att tcc aaa ctc ctt ggg gct aga tgg ttt cct aaa act ctg ccc tgt 96
Ile Ser Lys Leu Leu Gly Ala Arg Trp Phe Pro Lys Thr Leu Pro Cys
-5 -1 1 5 10
gat gtc act ctg gat gtt cca aag aac cat gtg atc gtg gac tgc aca 144
Asp Val Thr Leu Asp Val Pro Lys Asn His Val Ile Val Asp Cys Thr
15 20 25
gac aag cat ttg aca gaa att cct gga ggt att ccc acg aac acc acg 192
Asp Lys His Leu Thr Glu Ile Pro Gly Gly Ile Pro Thr Asn Thr Thr
30 35 40
aac ctc acc ctc acc att aac cac ata cca gac atc tcc cca gcg tcc 240
Asn Leu Thr Leu Thr Ile Asn His Ile Pro Asp Ile Ser Pro Ala Ser
45 50 55
ttt cac aga ctg gac cat ctg gta gag atc gat ttc aga tgc aac tgt 288
Phe His Arg Leu Asp His Leu Val Glu Ile Asp Phe Arg Cys Asn Cys
60 65 70
gta cct att cca ctg ggg tca aaa aac aac atg tgc atc aag agg ctg 336
Val Pro Ile Pro Leu Gly Ser Lys Asn Asn Met Cys Ile Lys Arg Leu
75 80 85 90
cag att aaa ccc aga agc ttt agt gga ctc act tat tta aaa tcc ctt 384
Gln Ile Lys Pro Arg Ser Phe Ser Gly Leu Thr Tyr Leu Lys Ser Leu
95 100 105
tac ctg gat gga aac cag cta cta gag ata ccg cag ggc ctc ccg cct 432
Tyr Leu Asp Gly Asn Gln Leu Leu Glu Ile Pro Gln Gly Leu Pro Pro
110 115 120
agc tta cag ctt ctc agc ctt gag gcc aac aac atc ttt tcc atc aga 480
Ser Leu Gln Leu Leu Ser Leu Glu Ala Asn Asn Ile Phe Ser Ile Arg
125 130 135
aaa gag aat cta aca gaa ctg gcc aac ata gaa ata ctc tac ctg ggc 528
Lys Glu Asn Leu Thr Glu Leu Ala Asn Ile Glu Ile Leu Tyr Leu Gly
140 145 150
caa aac tgt tat tat cga aat cct tgt tat gtt tca tat tca ata gag 576
Gln Asn Cys Tyr Tyr Arg Asn Pro Cys Tyr Val Ser Tyr Ser Ile Glu
155 160 165 170
aaa gat gcc ttc cta aac ttg aca aag tta aaa gtg ctc tcc ctg aaa 624
Lys Asp Ala Phe Leu Asn Leu TAr Lys Leu Lys Val Leu Ser Leu Lys
175 180 185
gat aac aat gtc aca gcc gtc cct act gtt ttg cca tct act tta aca 672
Asp Asn Asn Val Thr Ala Val Pro Thr Val Leu Pro Ser Thr Leu Thr
190 195 200
gaa cta tat ctc tac aac aac atg att gca aaa atc caa gaa gat gat 720
Glu Leu Tyr Leu Tyr Asn Asn Met Ile Ala Lys Ile Gln Glu Asp Asp
205 210 215
ttt aat aac ctc aac caa tta caa att ctt gac cta agt gga aat tgc 768
Phe Asn Asn Leu Asn Gln Leu Gln Ile Leu Asp Leu Ser Gly Asn Cys
220 225 230
cct cgt tgt tat aat gcc cca ttt cct tgt gcg ccg tgt aaa aat aat 816
Pro Arg Cys Tyr Asn Ala Pro Phe Pro Cys Ala Pro Cys Lys Asn Asn
235 240 245 250
tct ccc cta cag atc cct gta aat gct ttt gat gcg ctg aca gaa tta 864
Ser Pro Leu Gln Ile Pro Val Asn Ala Phe Asp Ala Leu Thr Glu Leu
255 260 265
aaa gtt tta cgt cta cac agt aac tct ctt cag cat gtg ccc cca aga 912
Lys Val Leu Arg Leu His Ser Asn Ser Leu Gln His Val Pro Pro Arg
270 275 280
tgg ttt aag aac atc aac aaa ctc cag gaa ctg gat ctg tcc caa aac 960
Trp Phe Lys Asn Ile Asn Lys Leu Gln Glu Leu Asp Leu Ser Gln Asn
285 290 295
ttc ttg gcc aaa gaa att ggg gat gct aaa ttt ctg cat ttt ctc ccc 1008
Phe Leu Ala Lys Glu Ile Gly Asp Ala Lys Phe Leu His Phe Leu Pro
300 305 310
agc ctc atc caa ttg gat ctg tct ttc aat ttt gaa ctt cag gtc tat 1056
Ser Leu Ile Gln Leu Asp Leu Ser Phe Asn Phe Glu Leu Gln Val Tyr
315 320 325 330
cgt gca tct atg aat cta tca caa gca ttt tct tca ctg aaa agc ctg 1104
Arg Ala Ser Met Asn Leu Ser Gln Ala Phe Ser Ser Leu Lys Ser Leu
335 340 345
aaa att ctg cgg atc aga gga tat gtc ttt aaa gag ttg aaa agc ttt 1152
Lys Ile Leu Arg Ile Arg Gly Tyr Val Phe Lys Glu Leu Lys Ser Phe
350 355 360
aac ctc tcg cca tta cat aat ctt caa aat ctt gaa gtt ctt gat ctt 1200
Asn Leu Ser Pro Leu His Asn Leu Gln Asn Leu Glu Val Leu Asp Leu
365 370 375
ggc act aac ttt ata aaa att gct aac ctc agc atg ttt aaa caa ttt 1248
Gly Thr Asn Phe Ile Lys Ile Ala Asn Leu Ser Met Phe Lys Gln Phe
380 385 390
aaa aga ctg aaa gtc ata gat ctt tca gtg aat aaa ata tca cct tca 1296
Lys Arg Leu Lys Val Ile Asp Leu Ser Val Asn Lys Ile Ser Pro Ser
395 400 405 410
gga gat tca agt gaa gtt ggc ttc tgc tca aat gcc aga act tct gta 1344
Gly Asp Ser Ser Glu Val Gly Phe Cys Ser Asn Ala Arg Thr Ser Val
415 420 425
gaa agt tat gaa ccc cag gtc ctg gaa caa tta cat tat ttc aga tat 1392
Glu Ser Tyr Glu Pro Gln Val Leu Glu Gln Leu His Tyr Phe Arg Tyr
430 435 440
gat aag tat gca agg agt tgc aga ttc aaa aac aaa gag gct tct ttc 1440
Asp Lys Tyr Ala Arg Ser Cys Arg Phe Lys Asn Lys Glu Ala Ser Phe
445 450 455
atg tct gtt aat gaa agc tgc tac aag tat ggg cag acc ttg gat cta 1488
Met Ser Val Asn Glu Ser Cys Tyr Lys Tyr Gly Gln Thr Leu Asp Leu
460 465 470
agt aaa aat agt ata ttt ttt gtc aag tcc tct gat ttt cag cat ctt 1536
Ser Lys Asn Ser Ile Phe Phe Val Lys Ser Ser Asp Phe Gln His Leu
475 480 485 490
tct ttc ctc aaa tgc ctg aat ctg tca gga aat ctc att agc caa act 1584
Ser Phe Leu Lys Cys Leu Asn Leu Ser Gly Asn Leu Ile Ser Gln Thr
495 500 505
ctt aat ggc agt gaa ttc caa cct tta gca gag ctg aga tat ttg gac 1632
Leu Asn Gly Ser Glu Phe Gln Pro Leu Ala Glu Leu Arg Tyr Leu Asp
510 515 520
ttc tcc aac aac cgg ctt gat tta ctc cat tca aca gca ttt gaa gag 1680
Phe Ser Asn Asn Arg Leu Asp Leu Leu His Ser Thr Ala Phe Glu Glu
525 530 535
ctt cac aaa ctg gaa gtt ctg gat ata agc agt aat agc cat tat ttt 1728
Leu His Lys Leu Glu Val Leu Asp Ile Ser Ser Asn Ser His Tyr Phe
540 545 550
caa tca gaa gga att act cat atg cta aac ttt acc aag aac cta aag 1776
Gln Ser Glu Gly Ile Thr His Met Leu Asn Phe Thr Lys Asn Leu Lys
555 560 565 570
gtt ctg cag aaa ctg atg atg aac gac aat gac atc tct tcc tcc acc 1824
Val Leu Gln Lys Leu Met Met Asn Asp Asn Asp Ile Ser Ser Ser Thr
575 580 585
agc agg acc atg gag agt gag tct ctt aga act ctg gaa ttc aga gga 1872
Ser Arg Thr Met Glu Ser Glu Ser Leu Arg Thr Leu Glu Phe Arg Gly
590 595 600
aat cac tta gat gtt tta tgg aga gaa ggt gat aac aga tac tta caa 1920
Asn His Leu Asp Val Leu Trp Arg Glu Gly Asp Asn Arg Tyr Leu Gln
605 610 615
tta ttc aag aat ctg cta aaa tta gag gaa tta gac atc tct aaa aat 1968
Leu Phe Lys Asn Leu Leu Lys Leu Glu Glu Leu Asp Ile Ser Lys Asn
620 625 630
Lcc cta agt ttc ttg cct tct gga gtt ttt gat ggt atg cct cca aat 2016
Ser Leu Ser Phe Leu Pro Ser Gly Val Phe Asp Gly Met Pro Pro Asn
635 640 645 650
cta aag aat ctc tct ttg gcc aaa aat ggg ctc aaa tct ttc agt tgg 2064
Leu Lys Asn Leu Ser Leu Ala Lys Asn Gly Leu Lys Ser Phe Ser Trp
655 660 665
aag aaa ctc cag tgt cta aag aac ctg gaa act ttg gac ctc agc cac 2112
Lys Lys Leu Gln Cys Leu Lys Asn Leu Glu Thr Leu Asp Leu Ser His
670 675 680
aac caa ctg acc act gtc cct gag aga tta tcc aac tgt tcc aga agc 2160
Asn Gln Leu Thr Thr Val Pro Glu Arg Leu Ser Asn Cys Ser Arg Ser
685 690 695
ctc aag aat ctg att ctt aag aat aat caa atc agg agt ctg acg aag 2208
Leu Lys Asn Leu Ile Leu Lys Asn Asn Gln Ile Arg Ser Leu Thr Lys
700 705 710
tat ttt cta caa gat gcc ttc cag ttg cga tat ctg gat ctc agc tca 2256
Tyr Phe Leu Gln Asp Ala Phe Gln Leu Arg Tyr Leu Asp Leu Ser Ser
715 720 725 730
aat aaa atc cag atg atc caa aag acc agc ttc cca gaa aat gtc ctc 2304
Asn Lys Ile Gln Met Ile Gln Lys Thr Ser Phe Pro Glu Asn Val Leu
735 740 745
aac aat ctg aag atg ttg ctt ttg cat cat aat cgg ttt ctg tgc acc 2352
Asn Asn Leu Lys Met Leu Leu Leu His His Asn Arg Phe Leu Cys Thr
750 755 760
tgt gat gct gtg tgg ttt gtc tgg tgg gtt aac cat acg gag gtg act 2400
Cys Asp Ala Val Trp Phe Val Trp Trp Val Asn His Thr Glu Val Thr
765 770 775
att cct tac ctg gcc aca gat gtg act tgt gtg ggg cca gga gca cac 2448
Ile Pro Tyr Leu Ala Thr Asp Val Thr Cys Val Gly Pro Gly Ala His
780 785 790
aag ggc caa agt gtg atc tcc ctg gat ctg tac acc tgt gag tta gat 2496
Lys Gly Gln Ser Val Ile Ser Leu Asp Leu Tyr Thr Cys Glu Leu Asp
795 800 805 810
ctg act aac ctg att ctg ttc tca ctt tcc ata tct gta tct ctc ttt 2544
Leu Thr Asn Leu Ile Leu Phe Ser Leu Ser Ile Ser Val Ser Leu Phe
815 820 825
ctc atg gtg atg atg aca gca agt cac ctc tat ttc tgg gat gtg tgg 2592
Leu Met Val Met Met Thr Ala Ser His Leu Tyr Phe Trp Asp Val Trp
830 835 840
tat att tac cat ttc tgt aag gcc aag ata aag ggg tat cag cgt cta 2640
Tyr Ile Tyr His Phe Cys Lys Ala Lys Ile Lys Gly Tyr Gln Arg Leu
845 850 855
ata tca cca gac tgt tgc tat gat gct ttt att gtg tat gac act aaa 2688
Ile Ser Pro Asp Cys Cys Tyr Asp Ala Phe Ile Val Tyr Asp Thr Lys
860 865 870
gac cca gct gtg acc gag tgg gtt ttg gct gag ctg gtg gcc aaa ctg 2736
Asp Pro Ala Val Thr Glu Trp Val Leu Ala Glu Leu Val Ala Lys Leu
875 880 885 890
gaa gac cca aga gag aaa cat ttt aat tta tgt ctc gag gaa agg gac 2784
Glu Asp Pro Arg Glu Lys His Phe Asn Leu Cys Leu Glu Glu Arg Asp
895 900 905
tgg tta cca ggg cag cca gtt ctg gaa aac ctt tcc cag agc ata cag 2832
Trp Leu Pro Gly Gln Pro Val Leu Glu Asn Leu Ser Gln Ser Ile Gln
910 915 920
ctt agc aaa aag aca gtg ttt gtg atg aca gac aag tat gca aag act 2880
Leu Ser Lys Lys Thr Val Phe Val Met Thr Asp Lys Tyr Ala Lys Thr
925 930 935
gaa aat ttt aag ata gca ttt tac ttg tcc cat cag agg ctc atg gat 2928
Glu Asn Phe Lys Ile Ala Phe Tyr Leu Ser His Gln Arg Leu Met Asp
940 945 950
gaa aaa gtt gat gtg att atc ttg ata ttt ctt gag aag ccc ttt cag 2976
Glu Lys Val Asp Val Ile Ile Leu Ile Phe Leu Glu Lys Pro Phe Gln
955 960 965 970
aag tcc aag ttc ctc cag ctc cgg aaa agg ctc tgt ggg agt tct gtc 3024
Lys Ser Lys Phe Leu Gln Leu Arg Lys Arg Leu Cys Gly Ser Ser Val
975 980 985
ctt gag tgg cca aca aac ccg caa gct cac cca tac ttc tgg cag tgt 3072
Leu Glu Trp Pro Thr Asn Pro Gln Ala His Pro Tyr Phe Trp Gln Cys
990 995 1000
cta aag aac gcc ctg gcc aca gac aat cat gtg gcc tat agt cag gtg 3120
Leu Lys Asn Ala Leu Ala Thr Asp Asn His Val Ala Tyr Ser Gln Val
1005 1010 1015
ttc aag gaa acg gtc tag 3138
Phe Lys Glu Thr Val
1020
<210>12
<211>1045
<212>PRT
<213>未知
<400>12
Met Trp Thr Leu Lys Arg Leu Ile Leu Ile Leu Phe Asn Ile Ile Leu
-20 -15 -10
Ile Ser Lys Leu Leu Gly Ala Arg Trp Phe Pro Lys Thr Leu Pro Cys
-5 -1 1 5 10
Asp Val Thr Leu Asp Val Pro Lys Asn His Val Ile Val Asp Cys Thr
15 20 25
Asp Lys His Leu Thr Glu Ile Pro Gly Gly Ile Pro Thr Asn Thr Thr
30 35 40
Asn Leu Thr Leu Thr Ile Asn His Ile Pro Asp Ile Ser Pro Ala Ser
45 50 55
Phe His Arg Leu Asp His Leu Val Glu Ile Asp Phe Arg Cys Asn Cys
60 65 70
Val Pro Ile Pro Leu Gly Ser Lys Asn Asn Met Cys Ile Lys Arg Leu
75 80 85 90
Gln Ile Lys Pro Arg Ser Phe Ser Gly Leu Thr Tyr Leu Lys Ser Leu
95 100 105
Tyr Leu Asp Gly Asn Gln Leu Leu Glu Ile Pro Gln Gly Leu Pro Pro
110 115 120
Ser Leu Gln Leu Leu Ser Leu Glu Ala Asn Asn Ile Phe Ser Ile Arg
125 130 135
Lys Glu Asn Leu Thr Glu Leu Ala Asn Ile Glu Ile Leu Tyr Leu Gly
140 145 150
Gln Asn Cys Tyr Tyr Arg Asn Pro Cys Tyr Val Ser Tyr Ser Ile Glu
155 160 165 170
Lys Asp Ala Phe Leu Asn Leu Thr Lys Leu Lys Val Leu Ser Leu Lys
175 180 185
Asp Asn Asn Val Thr Ala Val Pro Thr Val Leu Pro Ser Thr Leu Thr
190 195 200
Glu Leu Tyr Leu Tyr Asn Asn Met Ile Ala Lys Ile Gln Glu Asp Asp
205 210 215
Phe Asn Asn Leu Asn Gln Leu Gln Ile Leu Asp Leu Ser Gly Asn Cys
220 225 230
Pro Arg Cys Tyr Asn Ala Pro Phe Pro Cys Ala Pro Cys Lys Asn Asn
235 240 245 250
Ser Pro Leu Gln Ile Pro Val Asn Ala Phe Asp Ala Leu Thr Glu Leu
255 260 265
Lys Val Leu Arg Leu His Ser Asn Ser Leu Gln His Val Pro Pro Arg
270 275 280
Trp Phe Lys Asn Ile Asn Lys Leu Gln Glu Leu Asp Leu Ser Gln Asn
285 290 295
Phe Leu Ala Lys Glu Ile Gly Asp Ala Lys Phe Leu His Phe Leu Pro
300 305 310
Ser Leu Ile Gln Leu Asp Leu Ser Phe Asn Phe Glu Leu Gln Val Tyr
315 320 325 330
Arg Ala Ser Met Asn Leu Ser Gln Ala Phe Ser Ser Leu Lys Ser Leu
335 340 345
Lys Ile Leu Arg Ile Arg Gly Tyr Val Phe Lys Glu Leu Lys Ser Phe
350 355 360
Asn Leu Ser Pro Leu His Asn Leu Gln Asn Leu Glu Val Leu Asp Leu
365 370 375
Gly Thr Asn Phe Ile Lys Ile Ala Asn Leu Ser Met Phe Lys Gln Phe
380 385 390
Lys Arg Leu Lys Val Ile Asp Leu Ser Val Asn Lys Ile Ser Pro Ser
395 400 405 410
Gly Asp Ser Ser Glu Val Gly Phe Cys Ser Asn Ala Arg Thr Ser Val
415 420 425
Glu Ser Tyr Glu Pro Gln Val Leu Glu Gln Leu His Tyr Phe Arg Tyr
430 435 440
Asp Lys Tyr Ala Arg Ser Cys Arg Phe Lys Asn Lys Glu Ala Ser Phe
445 450 455
Met Ser Val Asn Glu Ser Cys Tyr Lys Tyr Gly Gln Thr Leu Asp Leu
460 465 470
Ser Lys Asn Ser Ile Phe Phe Val Lys Ser Ser Asp Phe Gln His Leu
475 480 485 490
Ser Phe Leu Lys Cys Leu Asn Leu Ser Gly Asn Leu Ile Ser Gln Thr
495 500 505
Leu Asn Gly Ser Glu Phe Gln Pro Leu Ala Glu Leu Arg Tyr Leu Asp
510 515 520
Phe Ser Asn Asn Arg Leu Asp Leu Leu His Ser Thr Ala Phe Glu Glu
525 530 535
Leu His Lys Leu Glu Val Leu Asp Ile Ser Ser Asn Ser His Tyr Phe
540 545 550
Gln Ser Glu Gly Ile Thr His Met Leu Asn Phe Thr Lys Asn Leu Lys
555 560 565 570
Val Leu Gln Lys Leu Met Met Asn Asp Asn Asp Ile Ser Ser Ser Thr
575 580 585
Ser Arg Thr Met Glu Ser Glu Ser Leu Arg Thr Leu Glu Phe Arg Gly
590 595 600
Asn His Leu Asp Val Leu Trp Arg Glu Gly Asp Asn Arg Tyr Leu Gln
605 610 615
Leu Phe Lys Asn Leu Leu Lys Leu Glu Glu Leu Asp Ile Ser Lys Asn
620 625 630
Ser Leu Ser Phe Leu Pro Ser Gly Val Phe Asp Gly Met Pro Pro Asn
635 640 645 650
Leu Lys Asn Leu Ser Leu Ala Lys Asn Gly Leu Lys Ser Phe Ser Trp
655 660 665
Lys Lys Leu Gln Cys Leu Lys Asn Leu Glu Thr Leu Asp Leu Ser His
670 675 680
Asn Gln Leu Thr Thr Val Pro Glu Arg Leu Ser Asn Cys Ser Arg Ser
685 690 695
Leu Lys Asn Leu Ile Leu Lys Asn Asn Gln Ile Arg Ser Leu Thr Lys
700 705 710
Tyr Phe Leu Gln Asp Ala Phe Gln Leu Arg Tyr Leu Asp Leu Ser Ser
715 720 725 730
Asn Lys Ile Gln Met Ile Gln Lys Thr Ser Phe Pro Glu Asn Val Leu
735 740 745
Asn Asn Leu Lys Met Leu Leu Leu His His Asn Arg Phe Leu Cys Thr
750 755 760
Cys Asp Ala Val Trp Phe Val Trp Trp Val Asn His Thr Glu Val Thr
765 770 775
Ile Pro Tyr Leu Ala Thr Asp Val Thr Cys Val Gly Pro Gly Ala His
780 785 790
Lys Gly Gln Ser Val Ile Ser Leu Asp Leu Tyr Thr Cys Glu Leu Asp
795 800 805 810
Leu Thr Asn Leu Ile Leu Phe Ser Leu Ser Ile Ser Val Ser Leu Phe
815 820 825
Leu Met Val Met Met Thr Ala Ser His Leu Tyr Phe Trp Asp Val Trp
830 835 840
Tyr Ile Tyr His Phe Cys Lys Ala Lys Ile Lys Gly Tyr Gln Arg Leu
845 850 855
Ile Ser Pro Asp Cys Cys Tyr Asp Ala Phe Ile Val Tyr Asp Thr Lys
860 865 870
Asp Pro Ala Val Thr Glu Trp Val Leu Ala Glu Leu Val Ala Lys Leu
875 880 885 890
Glu Asp Pro Arg Glu Lys His Phe Asn Leu Cys Leu Glu Glu Arg Asp
895 900 905
Trp Leu Pro Gly Gln Pro Val Leu Glu Asn Leu Ser Gln Ser Ile Gln
910 915 920
Leu Ser Lys Lys Thr Val Phe Val Met Thr Asp Lys Tyr Ala Lys Thr
925 930 935
Glu Asn Phe Lys Ile Ala Phe Tyr Leu Ser His Gln Arg Leu Met Asp
940 945 950
Glu Lys Val Asp Val Ile Ile Leu Ile Phe Leu Glu Lys Pro Phe Gln
955 960 965 970
Lys Ser Lys Phe Leu Gln Leu Arg Lys Arg Leu Cys Gly Ser Ser Val
975 980 985
Leu Glu Trp Pro Thr Asn Pro Gln Ala His Pro Tyr Phe Trp Gln Cys
990 995 1000
Leu Lys Asn Ala Leu Ala Thr Asp Asn His Val Ala Tyr Ser Gln Val
1005 1010 1015
Phe Lys Glu Thr Val
1020
<210>13
<211>180
<212>DNA
<213>未知
<220>
<223>未知生物的说明:啮齿类动物;推测为
小鼠(Mus musculus)
<220>
<221>CDS
<222>(1)..(177)
<400>13
ctt gga aaa cct ctt cag aag tct aag ttt ctt cag ctc agg aag aga 48
Leu Gly Lys Pro Leu Gln Lys Ser Lys Phe Leu Gln Leu Arg Lys Arg
1 5 10 15
ctc tgc agg agc tct gtc ctt gag tgg cct gca aat cca cag gct cac 96
Leu Cys Arg Ser Ser Val Leu Glu Trp Pro Ala Asn Pro Gln Ala His
20 25 30
cca tac ttc tgg cag tgc ctg aaa aat gcc ctg acc aca gac aat cat 144
Pro Tyr Phe Trp Gln Cys Leu Lys Asn Ala Leu Thr Thr Asp Asn His
35 40 45
gtg gct tat agt caa atg ttc aag gaa aca gtc tag 180
Val Ala Tyr Ser Gln Met Phe Lys Glu Thr Val
50 55
<210>14
<211>59
<212>PRT
<213>未知
<400>14
Leu Gly Lys Pro Leu Gln Lys Ser Lys Phe Leu Gln Leu Arg Lys Arg
1 5 10 15
Leu Cys Arg Ser Ser Val Leu Glu Trp Pro Ala Asn Pro Gln Ala His
20 25 30
Pro Tyr Phe Trp Gln Cys Leu Lys Asn Ala Leu Thr Thr Asp Asn His
35 40 45
Val Ala Tyr Ser Gln Met Phe Lys Glu Thr Val
50 55
<210>15
<211>990
<212>DNA
<213>未知
<220>
<223>未知生物的说明:灵长类动物;推测为
人(Homo sapiens)
<220>
<221>CDS
<222>(2)..(988)
<400>15
g aat tcc aga ctt ata aac ttg aaa aat ctc tat ttg gcc tgg aac tgc 49
Asn Ser Arg Leu Ile Asn Leu Lys Asn Leu Tyr Leu Ala Trp Asn Cys
1 5 10 15
tat ttt aac aaa gtt tgc gag aaa act aac ata gaa gat gga gta ttt 97
Tyr Phe Asn Lys Val Cys Glu Lys Thr Asn Ile Glu Asp Gly Val Phe
20 25 30
gaa acg ctg aca aat ttg gag ttg cta tca cta tct ttc aat tct ctt 145
Glu Thr Leu Thr Asn Leu Glu Leu Leu Ser Leu Ser Phe Asn Ser Leu
35 40 45
tca cat gtg cca ccc aaa ctg cca agc tcc cta cgc aaa ctt ttt ctg 193
Ser His Val Pro Pro Lys Leu Pro Ser Ser Leu Arg Lys Leu Phe Leu
50 55 60
agc aac acc cag atc aaa tac att agt gaa gaa gat ttc aag gga ttg 241
Ser Asn Thr Gln Ile Lys Tyr Ile Ser Glu Glu Asp Phe Lys Gly Leu
65 70 75 80
ata aat tta aca tta cta gat tta agc ggg aac tgt ccg agg tgc ttc 289
Ile Asn Leu Thr Leu Leu Asp Leu Ser Gly Asn Cys Pro Arg Cys Phe
85 90 95
aat gcc cca ttt cca tgc gtg cct tgt gat ggt ggt gct tca att aat 337
Asn Ala Pro Phe Pro Cys Val Pro Cys Asp Gly Gly Ala Ser Ile Asn
100 105 110
ata gat cgt ttt gct ttt caa aac ttg acc caa ctt cga tac cta aac 385
Ile Asp Arg Phe Ala Phe Gln Asn Leu Thr Gln Leu Arg Tyr Leu Asn
115 120 125
ctc tct agc act tcc ctc agg aag att aat gct gcc tgg ttt aaa aat 433
Leu Ser Ser Thr Ser Leu Arg Lys Ile Asn Ala Ala Trp Phe Lys Asn
130 135 140
atg cct cat ctg aag gtg ctg gat ctt gaa ttc aac tat tta gtg gga 481
Met Pro His Leu Lys Val Leu Asp Leu Glu Phe Asn Tyr Leu Val Gly
145 150 155 160
gaa ata gcc tct ggg gca ttt tta acg atg ctg ccc cgc tta gaa ata 529
Glu Ile Ala Ser Gly Ala Phe Leu Thr Met Leu Pro Arg Leu Glu Ile
165 170 175
ctt gac ttg tct ttt aac tat ata aag ggg agt tat cca cag cat att 577
Leu Asp Leu Ser Phe Asn Tyr Ile Lys Gly Ser Tyr Pro Gln His Ile
180 185 190
aat att tcc aga aac ttc tct aaa ctt ttg tct cta cgg gca ttg cat 625
Asn Ile Ser Arg Asn Phe Ser Lys Leu Leu Ser Leu Arg Ala Leu His
195 200 205
tta aga ggt tat gtg ttc cag gaa ctc aga gaa gat gat ttc cag ccc 673
Leu Arg Gly Tyr Val Phe Gln Glu Leu Arg Glu Asp Asp Phe Gln Pro
210 215 220
ctg atg cag ctt cca aac tta tcg act atc aac ttg ggt att aat ttt 721
Leu Met Gln Leu Pro Asn Leu Ser Thr Ile Asn Leu Gly Ile Asn Phe
225 230 235 240
att aag caa atc gat ttc aaa ctt ttc caa aat ttc tcc aat ctg gaa 769
Ile Lys Gln Ile Asp Phe Lys Leu Phe Gln Asn Phe Ser Asn Leu Glu
245 250 255
att att tac ttg tca gaa aac aga ata tca ccg ttg gta aaa gat acc 817
Ile Ile Tyr Leu Ser Glu Asn Arg Ile Ser Pro Leu Val Lys Asp Thr
260 265 270
cgg cag agt tat gca aat agt tcc tct ttt caa cgt cat atc cgg aaa 865
Arg Gln Ser Tyr Ala Asn Ser Ser Ser Phe Gln Arg His Ile Arg Lys
275 280 285
cga cgc tca aca gat ttt gag ttt gac cca cat tcg aac ttt tat cat 913
Arg Arg Ser Thr Asp Phe Glu Phe Asp Pro His Ser Asn Phe Tyr His
290 295 300
ttc acc cgt cct tta ata aag cca caa tgt gct gct tat gga aaa gcc 961
Phe Thr Arg Pro Leu Ile Lys Pro Gln Cys Ala Ala Tyr Gly Lys Ala
305 310 315 320
tta gat tta agc ctc aac agt att ttc tt 990
Leu Asp Leu Ser Leu Asn Ser Ile Phe
325
<210>16
<211>329
<212>PRT
<213>未知
<400>16
Asn Ser Arg Leu Ile Asn Leu Lys Asn Leu Tyr Leu Ala Trp Asn Cys
1 5 10 15
Tyr Phe Asn Lys Val Cys Glu Lys Thr Asn Ile Glu Asp Gly Val Phe
20 25 30
Glu Thr Leu Thr Asn Leu Glu Leu Leu Ser Leu Ser Phe Asn Ser Leu
35 40 45
Ser His Val Pro Pro Lys Leu Pro Ser Ser Leu Arg Lys Leu Phe Leu
50 55 60
Ser Asn Thr Gln Ile Lys Tyr Ile Ser Glu Glu Asp Phe Lys Gly Leu
65 70 75 80
Ile Asn Leu Thr Leu Leu Asp Leu Ser Gly Asn Cys Pro Arg Cys Phe
85 90 95
Asn Ala Pro Phe Pro Cys Val Pro Cys Asp Gly Gly Ala Ser Ile Asn
100 105 110
Ile Asp Arg Phe Ala Phe Gln Asn Leu Thr Gln Leu Arg Tyr Leu Asn
115 120 125
Leu Ser Ser Thr Ser Leu Arg Lys Iie Asn Ala Ala Trp Phe Lys Asn
130 135 140
Met Pro His Leu Lys Val Leu Asp Leu Glu Phe Asn Tyr Leu Val Gly
145 150 155 160
Glu Ile Ala Ser Gly Ala Phe Leu Thr Met Leu Pro Arg Leu Glu Ile
165 170 175
Leu Asp Leu Ser Phe Asn Tyr Ile Lys Gly Ser Tyr Pro Gln His Ile
180 185 190
Asn Ile Ser Arg Asn Phe Ser Lys Leu Leu Ser Leu Arg Ala Leu His
195 200 205
Leu Arg Gly Tyr Val Phe Gln Glu Leu Arg Glu Asp Asp Phe Gln Pro
210 215 220
Leu Met Gln Leu Pro Asn Leu Ser Thr Ile Asn Leu Gly Ile Asn Phe
225 230 235 240
Ile Lys Gln Ile Asp Phe Lys Leu Phe Gln Asn Phe Ser Asn Leu Glu
245 250 255
Ile Ile Tyr Leu Ser Glu Asn Arg Ile Ser Pro Leu Val Lys Asp Thr
260 265 270
Arg Gln Ser Tyr Ala Asn Ser Ser Ser Phe Gln Arg His Ile Arg Lys
275 280 285
Arg Arg Ser Thr Asp Phe Glu Phe Asp Pro His Ser Asn Phe Tyr His
290 295 300
Phe Thr Arg Pro Leu Ile Lys Pro Gln Cys Ala Ala Tyr Gly Lys Ala
305 310 315 320
Leu Asp Leu Ser Leu Asn Ser Ile Phe
325
<210>17
<211>1557
<212>DNA
<213>未知
<220>
<223>未知生物的说明:灵长类动物;推测为
人(Homo sapiells)
<220>
<221>CDS
<222>(1)..(513)
<220>
<221>misc_特征
<222>(93)..(149)
<223>Xaa翻译取决于遗传密码
<400>17
cag tct ctt tcc aca tcc caa act ttc tat gat gct tac att tct tat 48
Gln Ser Leu Ser Thr Ser Gln Thr Phe Tyr Asp Ala Tyr Ile Ser Tyr
1 5 10 15
gac acc aaa gat gcc tct gtt act gac tgg gtg ata aat gag ctg cgc 96
Asp Thr Lys Asp Ala Ser Val Thr Asp Trp Val Ile Asn Glu Leu Arg
20 25 30
tac cac ctt gaa gag agc cga gac aaa aac gtt ctc ctt tgt cta gag 144
Tyr His Leu Glu Glu Ser Arg Asp Lys Asn Val Leu Leu Cys Leu Glu
35 40 45
gag agg gat tgg gac ccg gga ttg gcc atc atc gac aac ctc atg cag 192
Glu Arg Asp Trp Asp Pro Gly Leu Ala Ile Ile Asp Asn Leu Met Gln
50 55 60
agc atc aac caa agc aag aaa aca gta ttt gtt tta acc aaa aaa tat 240
Ser Ile Asn Gln Ser Lys Lys Thr Val Phe Val Leu Thr Lys Lys Tyr
65 70 75 80
gca aaa agc tgg aac ttt aaa aca gct ttt tac ttg gsc ttg cag agg 288
Ala Lys Ser Trp Asn Phe Lys Thr Ala Phe Tyr Leu Xaa Leu Gln Arg
85 90 95
cta atg ggt gag aac atg gat gtg att ata ttt atc ctg ctg gag cca 336
Leu Met Gly Glu Asn Met Asp Val Ile Ile Phe Ile Leu Leu Glu Pro
100 105 110
gtg tta cag cat tct ccg tat ttg agg cta cgg cag cgg atc tgt aag 384
Val Leu Gln His Ser Pro Tyr Leu Arg Leu Arg Gln Arg Ile Cys Lys
115 120 125
agc tcc atc ctc cag tgg cct gac aac ccg aag gca gaa agg ttg ttt 432
Ser Ser Ile Leu Gln Trp Pro Asp Asn Pro Lys Ala Glu Arg Leu Phe
130 135 140
tgg caa act ctg wga aat gtg gtc ttg act gaa aat gat tca cgg tat 480
Trp Gln Thr Leu Xaa Asn Val Val Leu Thr Glu Asn Asp Ser Arg Tyr
145 150 155 160
aac aat atg tat gtc gat tcc att aag caa tac taactgacgt taagtcatga 533
Asn Asn Met Tyr Val Asp Ser Ile Lys Gln Tyr
165 170
tttcgcgcca taataaagat gcaaaggaat gacatttcng tattagttat ctattgctan 593
ggtaacnaaa ttantcccaa aaancttang tnggtttnaa aacaacnaca ttntgctggn 653
cccacagttt ttgagggtca ggagtccagg cccagcataa ctgggtcttc tgcttcaggg 713
tgtctncaga ggctgcaatg taggtgttca ccagagacat aggcatcact ggggtcacac 773
tncatgtggt tgttttctgg attcaattcc tcctgggcta ttggccaaag gctatactca 833
tgtaagccat gcgagcctat cccacaangg cagcttgctt catcagagct agcaaaaaag 893
agaggttgct agcaagatga agtcacaatc ttttgtaatc gaatcaaaaa agtgatatct 953
catcactttg gccatattct atttgttaga agtaaaccac aggtcccacc agctccatgg 1013
gagtgaccac ctcagtccag ggaaaacagc tgaagaccaa gatggtgagc tctgattgct 1073
tcagttggtc atcaactatt ttcccttgac tgctgtcctg ggatggccgg ctatcttgat 1133
ggatagattg tgaatatcag gaggccaggg atcactgtgg accatcttag cagttgacct 1193
aacacatctt cttttcaata tctaagaact tttgccactg tgactaatgg tcctaatatt 1253
aagctgttgt ttatatttat catatatcta tggctacatg gttatattat gctgtggttg 1313
cgttcggttt tatttacagt tgcttttaca aatatttgct gtaacatttg acttctaagg 1373
tttagatgcc atttaagaac tgagatggat agcttttaaa gcatctttta cttcttacca 1433
ttttttaaaa gtatgcagct aaattcgaag cttttggtct atattgttaa ttgccattgc 1493
tgtaaatctt aaaatgaatg aataaaaatg tttcatttta aaaaaaaaaa aaaaaaaaaa 1553
aaaa 1557
<210>18
<211>171
<212>PRT
<213>未知
<400>18
Gln Ser Leu Ser Thr Ser Gln Thr Phe Tyr Asp Ala Tyr Ile Ser Tyr
1 5 10 15
Asp Thr Lys Asp Ala Ser Val Thr Asp Trp Val Ile Asn Glu Leu Arg
20 25 30
Tyr His Leu Glu Glu Ser Arg Asp Lys Asn Val Leu Leu Cys Leu Glu
35 40 45
Glu Arg Asp Trp Asp Pro Gly Leu Ala Ile Ile Asp Asn Leu Met Gln
50 55 60
Ser Ile Asn Gln Ser Lys Lys Thr Val Phe Val Leu Thr Lys Lys Tyr
65 70 75 80
Ala Lys Ser Trp Asn Phe Lys Thr Ala Phe Tyr Leu Xaa Leu Gln Arg
85 90 95
Leu Met Gly Glu Asn Met Asp Val Ile Ile Phe Ile Leu Leu Glu Pro
100 105 110
Val Leu Gln His Ser Pro Tyr Leu Arg Leu Arg Gln Arg Ile Cys Lys
115 120 125
Ser Ser Ile Leu Gln Trp Pro Asp Asn Pro Lys Ala Glu Arg Leu Phe
130 135 140
Trp Gln Thr Leu Xaa Asn Val Val Leu Thr Glu Asn Asp Ser Arg Tyr
145 150 155 160
Asn Asn Met Tyr Val Asp Ser Ile Lys Gln Tyr
165 170
<210>19
<211>629
<212>DNA
<213>未知
<220>
<223>未知生物的说明:灵长类动物;推测为
人(Homo sapiens)
<220>
<221>CDS
<222>(1)..(486)
<220>
<221>misc_特征
<222>(48)..(75)
<223>Xaa翻译取决于遗传密码
<400>19
aat gaa ttg atc ccc aat cta gag aag gaa gat ggt tct atc ttg att 48
Asn Glu Leu Ile Pro Asn Leu Glu Lys Glu Asp Gly Ser Ile Leu Ile
1 5 10 15
tgc ctt tat gaa agc tac ttt gac cct ggc aaa agc att agt gaa aat 96
Cys Leu Tyr Glu Ser Tyr Phe Asp Pro Gly Lys Ser Ile Ser Glu Asn
20 25 30
att gta agc ttc att gag aaa agc tat aag tcc atc ttt gtt ttg tcy 144
Ile Val Ser Phe Ile Glu Lys Ser Tyr Lys Ser Ile Phe Val Leu Xaa
35 40 45
ccc aac ttt gtc cag aat gag tgg tgc cat tat gaa ttc tac ttt gcc 192
Pro Asn Phe Val Gln Asn Glu Trp Cys His Tyr Glu Phe Tyr Phe Ala
50 55 60
cac cac aat ctc ttc cat gaa aat tct gat cay ata att ctt atc tta 240
His His Asn Leu Phe His Glu Asn Ser Asp Xaa Ile Ile Leu Ile Leu
65 70 75 80
ctg gaa ccc att cca ttc tat tgc att ccc acc agg tat cat aaa ctg 288
Leu Glu Pro Ile Pro Phe Tyr Cys Ile Pro Thr Arg Tyr His Lys Leu
85 90 95
gaa gct ctc ctg gaa aaa aaa gca tac ttg gaa tgg ccc aag gat agg 336
Glu Ala Leu Leu Glu Lys Lys Ala Tyr Leu Glu Trp Pro Lys Asp Arg
100 105 110
cgt aaa tgt ggg ctt ttc tgg gca aac ctt cga gct gct gtt aat gtt 384
Arg Lys Cys Gly Leu Phe Trp Ala Asn Leu Arg Ala Ala Val Asn Val
115 120 125
aat gta tta gcc acc aga gaa atg tat gaa ctg cag aca ttc aca gag 432
Asn Val Leu Ala Thr Arg Glu Met Tyr Glu Leu Gln Thr Phe Thr Glu
130 135 140
tta aat gaa gag tct cga ggt tct aca atc tct ctg atg aga aca gac 480
Leu Asn Glu Glu Ser Arg Gly Ser Thr Ile Ser Leu Met Arg Thr Asp
145 150 155 160
tgt cta taaaatccca cagtccttgg gaagttgggg accacataca ctgttgggat 536
Cys Leu
gtacattgat acaaccttta tgatggcaat ttgacaatat ttattaaaat aaaaaatggt 596
tattcccttc aaaaaaaaaa aaaaaaaaaa aaa 629
<210>20
<211>162
<212>PRT
<213>未知
<400>20
Asn Glu Leu Ile Pro Asn Leu Glu Lys Glu Asp Gly Ser Ile Leu Ile
1 5 10 15
Cys Leu Tyr Glu Ser Tyr Phe Asp Pro Gly Lys Ser Ile Ser Glu Asn
20 25 30
Ile Val Ser Phe Ile Glu Lys Ser Tyr Lys Ser Ile Phe Val Leu Xaa
35 40 45
Pro Asn Phe Val Gln Asn Glu Trp Cys His Tyr Glu Phe Tyr Phe Ala
50 55 60
His His Asn Leu Phe His Glu Asn Ser Asp Xaa Ile Ile Leu Ile Leu
65 70 75 80
Leu Glu Pro Ile Pro Phe Tyr Cys Ile Pro Thr Arg Tyr His Lys Leu
85 90 95
Glu Ala Leu Leu Glu Lys Lys Ala Tyr Leu Glu Trp Pro Lys Asp Arg
100 105 110
Arg Lys Cys Gly Leu Phe Trp Ala Asn Leu Arg Ala Ala Val Asn Val
115 120 125
Asn Val Leu Ala Thr Arg Glu Met Tyr Glu Leu Gln Thr Phe Thr Glu
130 135 140
Leu Asn Glu Glu Ser Arg Gly Ser Thr Ile Ser Leu Met Arg Thr Asp
145 150 155 160
Cys Leu
<210>21
<211>427
<212>DNA
<213>未知
<220>
<223>未知生物的说明:灵长类动物;推测为
人(Homo sapiens)
<220>
<221>CDS
<222>(1)..(426)
<400>21
aag aac tcc aaa gaa aac ctc cag ttt cat gct ttt att tca tat agt 48
Lys Asn Ser Lys Glu Asn Leu Gln Phe His Ala Phe Ile Ser Tyr Ser
1 5 10 15
gaa cat gat tct gcc tgg gtg aaa agt gaa ttg gta cct tac cta gaa 96
Glu His Asp Ser Ala Trp Val Lys Ser Glu Leu Val Pro Tyr Leu Glu
20 25 30
aaa gaa gat ata cag att tgt ctt cat gag aga aac ttt gtc cct ggc 144
Lys Glu Asp Ile Gln Ile Cys Leu His Glu Arg Asn Phe Val Pro Gly
35 40 45
aag agc att gtg gaa aat atc atc aac tgc att gag aag agt tac aag 192
Lys Ser Ile Val Glu Asn Ile Ile Asn Cys Ile Glu Lys Ser Tyr Lys
50 55 60
tcc atc ttt gtt ttg tct ccc aac ttt gtc cag agt gag tgg tgc cat 240
Ser Ile Phe Val Leu Ser Pro Asn Phe Val Gln Ser Glu Trp Cys His
65 70 75 80
tac gaa ctc tat ttt gcc cat cac aat ctc ttt cat gaa gga tct aat 288
Tyr Glu Leu Tyr Phe Ala His His Asn Leu Phe His Glu Gly Ser Asn
85 90 95
aac tta atc ctc atc tta ctg gaa ccc att cca cag aac agc att ccc 336
Asn Leu Ile Leu Ile Leu Leu Glu Pro Ile Pro Gln Asn Ser Ile Pro
100 105 110
aac aag tac cac aag ctg aag gct ctc atg acg cag cgg act tat ttg 384
Asn Lys Tyr His Lys Leu Lys Ala Leu Met Thr Gln Arg Thr Tyr Leu
115 120 125
cag tgg ccc aag gag aaa agc aaa cgt ggg ctc ttt tgg gct a 427
Gln Trp Pro Lys Glu Lys Ser Lys Arg Gly Leu Phe Trp Ala
130 135 140
<210>22
<211>142
<212>PRT
<213>未知
<400>22
Lys Asn Ser Lys Glu Asn Leu Gln Phe His Ala Phe Ile Ser Tyr Ser
1 5 10 15
Glu His Asp Ser Ala Trp Val Lys Ser Glu Leu Val Pro Tyr Leu Glu
20 25 30
Lys Glu Asp Ile Gln Ile Cys Leu His Glu Arg Asn Phe Val Pro Gly
35 40 45
Lys Ser Ile Val Glu Asn Ile Ile Asn Cys Ile Glu Lys Ser Tyr Lys
50 55 60
Ser Ile Phe Val Leu Ser Pro Asn Phe Val Gln Ser Glu Trp Cys His
65 70 75 80
Tyr Glu Leu Tyr Phe Ala His His Asn Leu Phe His Glu Gly Ser Asn
85 90 95
Asn Leu Ile Leu Ile Leu Leu Glu Pro Ile Pro Gln Asn Ser Ile Pro
100 105 110
Asn Lys Tyr His Lys Leu Lys Ala Leu Met Thr Gln Arg Thr Tyr Leu
115 120 125
Gln Trp Pro Lys Glu Lys Ser Lys Arg Gly Leu Phe Trp Ala
130 135 140
<210>23
<211>662
<212>DNA
<213>未知
<220>
<223>未知生物的说明:灵长类动物;推测为
人(Homo sapiens)
<220>
<221>CDS
<222>(1)..(627)
<220>
<221>misc_特征
<222>(18)..(136)
<223>Xaa翻译取决于遗传密码
<400>23
gct tcc acc tgt gcc tgg cct ggc ttc cct ggc ggg ggc ggc aaa gtg 48
Ala Ser Thr Cys Ala Trp Pro Gly Phe Pro Gly Gly Gly Gly Lys Val
1 5 10 15
ggc gar atg agg atg ccc tgc cct acg atg cct tcg tgg tct tcg aca 96
Gly Xaa Met Arg Met Pro Cys Pro Thr Met Pro Ser Trp Ser Ser Thr
20 25 30
aaa cgc rga gcg cag tgg cag act ggg tgt aca acg agc ttc ggg ggc 144
Lys Arg Xaa Ala Gln Trp Gln Thr Gly Cys Thr Thr Ser Phe Gly Gly
35 40 45
agc tgg agg agt gcc gtg ggc gct ggg cac tcc gcc tgt gcc tgg agg 192
Ser Trp Arg Ser Ala Val Gly Ala Gly His Ser Ala Cys Ala Trp Arg
50 55 60
aac gcg act ggc tgc ctg gca aaa ccc tct ttg aga acc tgt ggg cct 240
Asn Ala Thr Gly Cys Leu Ala Lys Pro Ser Leu Arg Thr Cys Gly Pro
65 70 75 80
cgg tct atg gca gcc gca aga cgc tgt ttg tgc tgg ccc aca cgg acc 288
Arg Ser Met Ala Ala Ala Arg Arg Cys Leu Cys Trp Pro Thr Arg Thr
85 90 95
ggg tca gtg gtc tct tgc gcg cca ktt ntc ctg ctg gcc cag cag cgc 336
Gly Ser Val Val Ser Cys Ala Pro Xaa Xaa Leu Leu Ala Gln Gln Arg
100 105 110
ctg ctg gar gac cgc aag gac gtc gtg gtg ctg gtg atc cta ang cct 384
Leu Leu Xaa Asp Arg Lys Asp Val Val Val Leu Val Ile Leu Xaa Pro
115 120 125
gac ggc caa gcc tcc cga cta cnn gat gcg ctg acc agc gcc tct gcc 432
Asp Gly Gln Ala Ser Arg Leu Xaa Asp Ala Leu Thr Ser Ala Ser Ala
130 135 140
gcc aga gtg tcc tcc tct ggc ccc acc agc cca gtg gtc gcg cag ctt 480
Ala Arg Val Ser Ser Ser Gly Pro Thr Ser Pro Val Val Ala Gln Leu
145 150 155 160
ctg agg cca gca tgc atg gcc ctg acc agg gac aac cac cac ttc tat 528
Leu Arg Pro Ala Cys Met Ala Leu Thr Arg Asp Asn His His Phe Tyr
165 170 175
aac cgg aac ttc tgc cag gga acc cac ggc cga ata gcc gtg agc cgg 576
Asn Arg Asn Phe Cys Gln Gly Thr His Gly Arg Ile Ala Val Ser Arg
180 185 190
aat cct gca cgg tgc cac ctc cac aca cac cta aca tat gcc tgc ctg 624
Asn Pro Ala Arg Cys His Leu His Thr His Leu Thr Tyr Ala Cys Leu
195 200 205
atc tgaccaacac atgctcgcca ccctcaccac acacc 662
Ile
<210>24
<211>209
<212>PRT
<213>未知
<400>24
Ala Ser Thr Cys Ala Trp Pro Gly Phe Pro Gly Gly Gly Gly Lys Val
1 5 10 15
Gly Xaa Met Arg Met Pro Cys Pro Thr Met Pro Ser Trp Ser Ser Thr
20 25 30
Lys Arg Xaa Ala Gln Trp Gln Thr Gly Cys Thr Thr Ser Phe Gly Gly
35 40 45
Ser Trp Arg Ser Ala Val Gly Ala Gly His Ser Ala Cys Ala Trp Arg
50 55 60
Asn Ala Thr Gly Cys Leu Ala Lys Pro Ser Leu Arg Thr Cys Gly Pro
65 70 75 80
Arg Ser Met Ala Ala Ala Arg Arg Cys Leu Cys Trp Pro Thr Arg Thr
85 90 95
Gly Ser Val Val Ser Cys Ala Pro Xaa Xaa Leu Leu Ala Gln Gln Arg
100 105 110
Leu Leu Xaa Asp Arg Lys Asp Val Val Val Leu Val Ile Leu Xaa Pro
115 120 125
Asp Gly Gln Ala Ser Arg Leu Xaa Asp Ala Leu Thr Ser Ala Ser Ala
130 135 140
Ala Arg Val Ser Ser Ser Gly Pro Thr Ser Pro Val Val Ala Gln Leu
145 150 155 160
Leu Arg Pro Ala Cys Met Ala Leu Thr Arg Asp Asn His His Phe Tyr
165 170 175
Asn Arg Asn Phe Cys Gln Gly Thr His Gly Arg Ile Ala Val Sar Arg
180 185 190
Asn Pro Ala Arg Cys His Leu His Thr His Leu Thr Tyr Ala Cys Leu
195 200 205
Ile
<210>25
<211>4865
<212>DNA
<213>未知
<220>
<223>未知生物的说明:灵长类动物;推测为
人(Homo sapiens)
<220>
<221>CDS
<222>(107)..(2617)
<220>
<221>mat_肽
<222>(173)..(2617)
<220>
<221>misc_特征
<222>(189)
<223>Xaa翻译取决于遗传密码
<400>25
aaaatactcc cttgcctcaa aaactgctcg gtcaaacggt gatagcaaac cacgcattca 60
cagggccact gctgctcaca naascagtga ggatgatgcc aggatg atg tct gcc 115
Met Ser Ala
-20
tcg cgc ctg gct ggg act ctg atc cca gcc atg gcc ttc ctc tcc tgc 163
Ser Arg Leu Ala Gly Thr Leu Ile Pro Ala Met Ala Phe Leu Ser Cys
-15 -10 -5
gtg aga cca gaa agc tgg gag ccc tgc gtg gag gtt cct aat att act 211
Val Arg Pro Glu Ser Trp Glu Pro Cys Val Glu Val Pro Asn Ile Thr
-1 1 5 10
tat caa tgc atg gag ctg aat ttc tac aaa atc ccc gac aac ctc ccc 259
Tyr Gln Cys Met Glu Leu Asn Phe Tyr Lys Ile Pro Asp Asn Leu Pro
15 20 25
ttc tca acc aag aac ctg gac ctg agc ttt aat ccc ctg agg cat tta 307
Phe Ser Thr Lys Asn Leu Asp Leu Ser Phe Asn Pro Leu Arg His Leu
30 35 40 45
ggc agc tat agc ttc ttc agt ttc cca gaa ctg cag gtg ctg gat tta 355
Gly Ser Tyr Ser Phe Phe Ser Phe Pro Glu Leu Gln Val Leu Asp Leu
50 55 60
tcc agg tgt gaa atc cag aca att gaa gat ggg gca tat cag agc cta 403
Ser Arg Cys Glu Ile Gln Thr Ile Glu Asp Gly Ala Tyr Gln Ser Leu
65 70 75
agc cac ctc tct acc tta ata ttg aca gga aac ccc atc cag agt tta 451
Ser His Leu Ser Thr Leu Ile Leu Thr Gly Asn Pro Ile Gln Ser Leu
80 85 90
gcc ctg gga gcc ttt tct gga cta tca agt tta cag aag ctg gtg gct 499
Ala Leu Gly Ala Phe Ser Gly Leu Ser Ser Leu Gln Lys Leu Val Ala
95 100 105
gtg gag aca aat cta gca tct cta gag aac ttc ccc att gga cat ctc 547
Val Glu Thr Asn Leu Ala Ser Leu Glu Asn Phe Pro Ile Gly His Leu
110 115 120 125
aaa act ttg aaa gaa ctt aat gtg gct cac aat ctt atc caa tct ttc 595
Lys Thr Leu Lys Glu Leu Asn Val Ala His Asn Leu Ile Gln Ser Phe
130 135 140
aaa tta cct gag tat ttt tct aat ctg acc aat cta gag cac ttg gac 643
Lys Leu Pro Glu Tyr Phe Ser Asn Leu Thr Asn Leu Glu His Leu Asp
145 150 155
ctt tcc agc aac aag att caa agt att tat tgc aca gac ttg cgg gtt 691
Leu Ser Ser Asn Lys Ile Gln Ser Ile Tyr Cys Thr Asp Leu Arg Val
160 165 170
cta cat caa atg ccc cta ctc aat ctc tct tta gac ctg tcc ctg aay 739
Leu His Gln Met Pro Leu Leu Asn Leu Ser Leu Asp Leu Ser Leu Xaa
175 180 185
cct atg aac ttt atc caa cca ggt gca ttt aaa gaa att agg ctt cat 787
Pro Met Asn Phe Ile Gln Pro Gly Ala Phe Lys Glu Ile Arg Leu His
190 195 200 205
aag ctg act tta aga aat aat ttt gat agt tta aat gta atg aaa act 835
Lys Leu Thr Leu Arg Asn Asn Phe Asp Ser Leu Asn Val Met Lys Thr
210 215 220
tgt att caa ggt ctg gct ggt tta gaa gtc cat cgt ttg gtt ctg gga 883
Cys Ile Gln Gly Leu Ala Gly Leu Glu Val His Arg Leu Val Leu Gly
225 230 235
gaa ttt aga aat gaa gga aac ttg gaa aag ttt gac aaa tct gct cta 931
Glu Phe Arg Asn Glu Gly Asn Leu Glu Lys Phe Asp Lys Ser Ala Leu
240 245 250
gag ggc ctg tgc aat ttg acc att gaa gaa ttc cga tta gca tac tta 979
Glu Gly Leu Cys Asn Leu Thr Ile Glu Glu Phe Arg Leu Ala Tyr Leu
255 260 265
gac tac tac ctc gat gat att att gac tta ttt aat tgt ttg aca aat 1027
Asp Tyr Tyr Leu Asp Asp Ile Ile Asp Leu Phe Asn Cys Leu Thr Asn
270 275 280 285
gtt tct tca ttt tcc ctg gtg agt gtg act att gaa agg gta aaa gac 1075
Val Ser Ser Phe Ser Leu Val Ser Val Thr Ile Glu Arg Val Lys Asp
290 295 300
ttt tct tat aat ttc gga tgg caa cat tta gaa tta gtt aac tgt aaa 1123
Phe Ser Tyr Asn Phe Gly Trp Gln His Leu Glu Leu Val Asn Cys Lys
305 310 315
ttt gga cag ttt ccc aca ttg aaa ctc aaa tct ctc aaa agg ctt act 1171
Phe Gly Gln Phe Pro Thr Leu Lys Leu Lys Ser Leu Lys Arg Leu Thr
320 325 330
ttc act tcc aac aaa ggt ggg aat gct ttt tca gaa gtt gat cta cca 1219
Phe Thr Ser Asn Lys Gly Gly Asn Ala Phe Ser Glu Val Asp Leu Pro
335 340 345
agc ctt gag ttt cta gat ctc agt aga aat ggc ttg agt ttc aaa ggt 1267
Ser Leu Glu Phe Leu Asp Leu Ser Arg Asn Gly Leu Ser Phe Lys Gly
350 355 360 365
tgc tgt tct caa agt gat ttt ggg aca acc agc cta aag tat tta gat 1315
Cys Cys Ser Gln Ser Asp Phe Gly Thr Thr Ser Leu Lys Tyr Leu Asp
370 375 380
ctg agc ttc aat ggt gtt att acc atg agt tca aac ttc ttg ggc tta 1363
Leu Ser Phe Asn Gly Val Ile Thr Met Ser Ser Asn Phe Leu Gly Leu
385 390 395
gaa caa cta gaa cat ctg gat ttc cag cat tcc aat ttg aaa caa atg 1411
Glu Gln Leu Glu His Leu Asp Phe Gln His Ser Asn Leu Lys Gln Met
400 405 410
agt gag ttt tca gta ttc cta tca ctc aga aac ctc att tac ctt gac 1459
Ser Glu Phe Ser Val Phe Leu Ser Leu Arg Asn Leu Ile Tyr Leu Asp
415 420 425
att tct cat act cac acc aga gtt gct ttc aat ggc atc ttc aat ggc 1507
Ile Ser His Thr His Thr Arg Val Ala Phe Asn Gly Ile Phe Asn Gly
430 435 440 445
ttg tcc agt ctc gaa gtc ttg aaa atg gct ggc aat tct ttc cag gaa 1555
Leu Ser Ser Leu Glu Val Leu Lys Met Ala Gly Asn Ser Phe Gln Glu
450 455 460
aac ttc ctt cca gat atc ttc aca gag ctg aga aac ttg acc ttc ctg 1603
Asn Phe Leu Pro Asp Ile Phe Thr Glu Leu Arg Asn Leu Thr Phe Leu
465 470 475
gac ctc tct cag tgt caa ctg gag cag ttg tct cca aca gca ttt aac 1651
Asp Leu Ser Gln Cys Gln Leu Glu Gln Leu Ser Pro Thr Ala Phe Asn
480 485 490
tca ctc tcc agt ctt cag gta cta aat atg agc cac aac aac ttc ttt 1699
Ser Leu Ser Ser Leu Gln Val Leu Asn Met Ser His Asn Asn Phe Phe
495 500 505
tca ttg gat acg ttt cct tat aag tgt ctg aac tcc ctc cag gtt ctt 1747
Ser Leu Asp Thr Phe Pro Tyr Lys Cys Leu Asn Ser Leu Gln Val Leu
510 515 520 525
gat tac agt ctc aat cac ata atg act tcc aaa aaa cag gaa cta cag 1795
Asp Tyr Ser Leu Asn His Ile Met Thr Ser Lys Lys Gln Glu Leu Gln
530 535 540
cat ttt cca agt agt cta gct ttc tta aat ctt act cag aat gac ttt 1843
His Phe Pro Ser Ser Leu Ala Phe Leu Asn Leu Thr Gln Asn Asp Phe
545 550 555
gct tgt act tgt gaa cac cag agt ttc ctg caa tgg atc aag gac cag 1891
Ala Cys Thr Cys Glu His Gln Ser Phe Leu Gln Trp Ile Lys Asp Gln
560 565 570
agg cag ctc ttg gtg gaa gtt gaa cga atg gaa tgt gca aca cct tca 1939
Arg Gln Leu Leu Val Glu Val Glu Arg Met Glu Cys Ala Thr Pro Ser
575 580 585
gat aag cag ggc atg cct gtg ctg agt ttg aat atc acc tgt cag atg 1987
Asp Lys Gln Gly Met Pro Val Leu Ser Leu Asn Ile Thr Cys Gln Met
590 595 600 605
aat aag acc atc att ggt gtg tcg gtc ctc agt gtg ctt gta gta tct 2035
Asn Lys Thr Ile Ile Gly Val Ser Val Leu Ser Val Leu Val Val Ser
610 615 620
gtt gta gca gtt ctg gtc tat aag ttc tat ttt cac ctg atg ctt ctt 2083
Val Val Ala Val Leu Val Tyr Lys Phe Tyr Phe His Leu Met Leu Leu
625 630 635
gct ggc tgc ata aag tat ggt aga ggt gaa aac atc tat gat gcc ttt 2131
Ala Gly Cys Ile Lys Tyr Gly Arg Gly Glu Asn Ile Tyr Asp Ala Phe
640 645 650
gtt atc tac tca agc cag gat gag gac tgg gta agg aat gag cta gta 2179
Val Ile Tyr Ser Ser Gln Asp Glu Asp Trp Val Arg Asn Glu Leu Val
655 660 665
aag aat tta gaa gaa ggg gtg cct cca ttt cag ctc tgc ctt cac tac 2227
Lys Asn Leu Glu Glu Gly Val Pro Pro Phe Gln Leu Cys Leu His Tyr
670 675 680 685
aga gac ttt att ccc ggt gtg gcc att gct gcc aac atc atc cat gaa 2275
Arg Asp Phe Ile Pro Gly Val Ala Ile Ala Ala Asn Ile Ile His Glu
690 695 700
ggt ttc cat aaa agc cga aag gtg att gtt gtg gtg tcc cag cac ttc 2323
Gly Phe His Lys Ser Arg Lys Val Ile Val Val Val Ser Gln His Phe
705 710 715
atc cag agc cgc tgg tgt atc ttt gaa tat gag att gct cag acc tgg 2371
Ile Gln Ser Arg Trp Cys Ile Phe Glu Tyr Glu Ile Ala Gln Thr Trp
720 725 730
cag ttt ctg agc agt cgt gct ggt atc atc ttc att gtc ctg cag aag 2419
Gln Phe Leu Ser Ser Arg Ala Gly Ile Ile Phe Ile Val Leu Gln Lys
735 740 745
gtg gag aag acc ctg ctc agg cag cag gtg gag ctg tac cgc ctt ctc 2467
Val Glu Lys Thr Leu Leu Arg Gln Gln Val Glu Leu Tyr Arg Leu Leu
750 755 760 765
agc agg aac act tac ctg gag tgg gag gac agt gtc ctg ggg cgg cac 2515
Ser Arg Asn Thr Tyr Leu Glu Trp Glu Asp Ser Val Leu Gly Arg His
770 775 780
atc ttc tgg aga cga ctc aga aaa gcc ctg ctg gat ggt aaa tca tgg 2563
Ile Phe Trp Arg Arg Leu Arg Lys Ala Leu Leu Asp Gly Lys Ser Trp
785 790 795
aat cca gaa gga aca gtg ggt aca gga tgc aat tgg cag gaa gca aca 2611
Asn Pro Glu Gly Thr Val Gly Thr Gly Cys Asn Trp Gln Glu Ala Thr
800 805 810
tct arc tgaagaggaa aaataaaaac ctcctgaggc atttcttgcc cagctgggtc 2667
Ser Ile
815
caacacttgt tcagttaata agtattaaat gctgccacat gtcaggcctt atgctaaggg 2727
tgagtaattc catggtgcac tagatatgca gggctgctaa tctcaaggag cttccagtgc 2787
agagggaata aatgctagac taaaatacag agtcttccag gtgggcattt caaccaactc 2847
agtcaaggaa cccatgacaa agaaagtcat ttcaactctt acctcatcaa gttgaataaa 2907
gacagagaaa acagaaagag acattgttct tttcctgagt cttttgaatg gaaattgtat 2967
tatgttatag ccatcataaa accattttgg tagttttgac tgaactgggt gttcactttt 3027
tcctttttga ttgaatacaa tttaaattct acttgatgac tgcagtcgtc aaggggctcc 3087
tgatgcaaga tgccccttcc attttaagtc tgtctcctta cagakgttaa agtctantgg 3147
ctaattccta aggaaacctg attaacacat gctcacaacc atcctggtca ttctcganca 3207
tgttctattt tttaactaat cacccctgat atatttttat ttttatatat ccagttttca 3267
tttttttacg tcttgcctat aagctaatat cataaataag gttgtttaag acgtgcttca 3327
aatatccata ttaaccacta tttttcaagg aagtatggaa aagtacactc tgtcactttg 3387
tcactcgatg tcattccaaa gttattgcct actaagtaat gactgtcatg aaagcagcat 3447
tgaaataatt tgtttaaagg gggcactctt ttaaacggga agaaaatttc cgcttcctgg 3507
tcttatcatg gacaatttgg gctakaggca kgaaggaagt gggatkacct caggangtca 3567
ccttttcttg attccagaaa catatgggct gataaacccg gggtgacctc atgaaatgag 3627
ttgcagcaga wgtttatttt tttcagaaca agtgatgttt gatggacctm tgaatctmtt 3687
tagggagaca cagatggctg ggatccctcc cctgtaccct tctcactgmc aggagaacta 3747
cgtgtgaagg tattcaaggc agggagtata cattgctgtt tcctgttggg caatgctcct 3807
tgaccacatt ttgggaagag tggatgttat cattgagaaa acaatgtgtc tggaattaat 3867
ggggttctta taaagaaggt tcccagaaaa gaatgttcat tccagcttct tcaggaaaca 3927
ggaacattca aggaaaagga caatcaggat gtcatcaggg aaatgaaaat aaaaaccaca 3987
atgagatatc accttatacc aggtagatgg ctactataaa aaaatgaagt gtcatcaagg 4047
atatagagaa attggaaccc ttcttcactg ctggagggaa tggaaaatgg tgtagccgtt 4107
atgaaaaaca gtacggaggt ttctcaaaaa ttaaaaatag aactgctata tgatccagca 4167
atctcacttc tgtatatata cccaaaataa ttgaaatcag aatttcaaga aaatatttac 4227
actcccatgt tcattgtggc actcttcaca atcactgttt ccaaagttat ggaaacaacc 4287
caaatttcca ttggaaaata aatggacaaa ggaaatgtgc atataacgta caatggggat 4347
attattcagc ctaaaaaaag gggggatcct gttatttatg acaacatgaa taaacccgga 4407
ggccattatg ctatgtaaaa tgagcaagta acagaaagac aaatactgcc tgatttcatt 4467
tatatgaggt tctaaaatag tcaaactcat agaagcagag aatagaacag tggttcctag 4527
ggaaaaggag gaagggagaa atgaggaaat agggagttgt ctaattggta taaaattata 4587
gtatgcaaga tgaattagct ctaaagatca gctgtatagc agagttcgta taatgaacaa 4647
tactgtatta tgcacttaac attttgttaa gagggtacct ctcatgttaa gtgttcttac 4707
catatacata tacacaagga agcttttgga ggtgatggat atatttatta ccttgattgt 4767
ggtgatggtt tgacaggtat gtgactatgt ctaaactcat caaattgtat acattaaata 4827
tatgcagttt tataatatca aaaaaaaaaa aaaaaaaa 4865
<210>26
<211>837
<212>PRT
<213>未知
<400>26
Met Ser Ala Ser Arg Leu Ala Gly Thr Leu Ile Pro Ala Met Ala Phe
-20 -15 -10
Leu Ser Cys Val Arg Pro Glu Ser Trp Glu Pro Cys Val Glu Val Pro
-5 -1 1 5 10
Asn Ile Thr Tyr Gln Cys Met Glu Leu Asn Phe Tyr Lys Ile Pro Asp
15 20 25
Asn Leu Pro Phe Ser Thr Lys Asn Leu Asp Leu Ser Phe Asn Pro Leu
30 35 40
Arg His Leu Gly Ser Tyr Ser Phe Phe Ser Phe Pro Glu Leu Gln Val
45 50 55
Leu Asp Leu Ser Arg Cys Glu Ile Gln Thr Ile Glu Asp Gly Ala Tyr
60 65 70
Gln Ser Leu Ser His Leu Ser Thr Leu Ile Leu Thr Gly Asn Pro Ile
75 80 85 90
Gln Ser Leu Ala Leu Gly Ala Phe Ser Gly Leu Ser Ser Leu Gln Lys
95 100 105
Leu Val Ala Val Glu Thr Asn Leu Ala Ser Leu Glu Asn Phe Pro Ile
110 115 120
Gly His Leu Lys Thr Leu Lys Glu Leu Asn Val Ala His Asn Leu Ile
125 130 135
Gln Ser Phe Lys Leu Pro Glu Tyr Phe Ser Asn Leu Thr Asn Leu Glu
140 145 150
His Leu Asp Leu Ser Ser Asn Lys Ile Gln Ser Ile Tyr Cys Thr Asp
155 160 165 170
Leu Arg Val Leu His Gln Met Pro Leu Leu Asn Leu Ser Leu Asp Leu
175 180 185
Ser Leu Xaa Pro Met Asn Phe Ile Gln Pro Gly Ala Phe Lys Glu Ile
190 195 200
Arg Leu His Lys Leu Thr Leu Arg Asn Asn Phe Asp Ser Leu Asn Val
205 210 215
Met Lys Thr Cys Ile Gln Gly Leu Ala Gly Leu Glu Val His Arg Leu
220 225 230
Val Leu Gly Glu Phe Arg Asn Glu Gly Asn Leu Glu Lys Phe Asp Lys
235 240 245 250
Ser Ala Leu Glu Gly Leu Cys Asn Leu Thr Ile Glu Glu Phe Arg Leu
255 260 265
Ala Tyr Leu Asp Tyr Tyr Leu Asp Asp Ile Ile Asp Leu Phe Asn Cys
270 275 280
Leu Thr Asn Val Ser Ser Phe Ser Leu Val Ser Val Thr Ile Glu Arg
285 290 295
Val Lys Asp Phe Ser Tyr Asn Phe Gly Trp Gln His Leu Glu Leu Val
300 305 310
Asn Cys Lys Phe Gly Gln Phe Pro Thr Leu Lys Leu Lys Ser Leu Lys
315 320 325 330
Arg Leu Thr Phe Thr Ser Asn Lys Gly Gly Asn Ala Phe Ser Glu Val
335 340 345
Asp Leu Pro Ser Leu Glu Phe Leu Asp Leu Ser Arg Asn Gly Leu Ser
350 355 360
Phe Lys Gly Cys Cys Ser Gln Ser Asp Phe Gly Thr Thr Ser Leu Lys
365 370 375
Tyr Leu Asp Leu Ser Phe Asn Gly Val Ile Thr Met Ser Ser Asn Phe
380 385 390
Leu Gly Leu Glu Gln Leu Glu His Leu Asp Phe Gln His Ser Asn Leu
395 400 405 410
Lys Gln Met Ser Glu Phe Ser Val Phe Leu Ser Leu Arg Asn Leu Ile
415 420 425
Tyr Leu Asp Ile Ser His Thr His Thr Arg Val Ala Phe Asn Gly Ile
430 435 440
Phe Asn Gly Leu Ser Ser Leu Glu Val Leu Lys Met Ala Gly Asn Ser
445 450 455
Phe Gln Glu Asn Phe Leu Pro Asp Ile Phe Thr Glu Leu Arg Asn Leu
460 465 470
Thr Phe Leu Asp Leu Ser Gln Cys Gln Leu Glu Gln Leu Ser Pro Thr
475 480 485 490
Ala Phe Asn Ser Leu Ser Ser Leu Gln Val Leu Asn Met Ser His Asn
495 500 505
Asn Phe Phe Ser Leu Asp Thr Phe Pro Tyr Lys Cys Leu Asn Ser Leu
510 515 520
Gln Val Leu Asp Tyr Ser Leu Asn His Ile Met Thr Ser Lys Lys Gln
525 530 535
Glu Leu Gln His Phe Pro Ser Ser Leu Ala Phe Leu Asn Leu Thr Gln
540 545 550
Asn Asp Phe Ala Cys Thr Cys Glu His Gln Ser Phe Leu Gln Trp Ile
555 560 565 570
Lys Asp Gln Arg Gln Leu Leu Val Glu Val Glu Arg Met Glu Cys Ala
575 580 585
Thr Pro Ser Asp Lys Gln Gly Met Pro Val Leu Ser Leu Asn Ile Thr
590 595 600
Cys Gln Met Asn Lys Thr Ile Ile Gly Val Ser Val Leu Ser Val Leu
605 610 615
Val Val Ser Val Val Ala Val Leu Val Tyr Lys Phe Tyr Phe His Leu
620 625 630
Met Leu Leu Ala Gly Cys Ile Lys Tyr Gly Arg Gly Glu Asn Ile Tyr
635 640 645 650
Asp Ala Phe Val Ile Tyr Ser Ser Gln Asp Glu Asp Trp Val Arg Asn
655 660 665
Glu Leu Val Lys Asn Leu Glu Glu Gly Val Pro Pro Phe Gln Leu Cys
670 675 680
Leu His Tyr Arg Asp Phe Ile Pro Gly Val Ala Ile Ala Ala Asn Ile
685 690 695
Ile His Glu Gly Phe His Lys Ser Arg Lys Val Ile Val Val Val Ser
700 705 710
Gln His Phe Ile Gln Ser Arg Trp Cys Ile Phe Glu Tyr Glu Ile Ala
715 720 725 730
Gln Thr Trp Gln Phe Leu Ser Ser Arg Ala Gly Ile Ile Phe Ile Val
735 740 745
Leu Gln Lys Val Glu Lys Thr Leu Leu Arg Gln Gln Val Glu Leu Tyr
750 755 760
Arg Leu Leu Ser Arg Asn Thr Tyr Leu Glu Trp Glu Asp Ser Val Leu
765 770 775
Gly Arg His Ile Phe Trp Arg Arg Leu Arg Lys Ala Leu Leu Asp Gly
780 785 790
Lys Ser Trp Asn Pro Glu Gly Thr Val Gly Thr Gly Cys Asn Trp Gln
795 800 805 810
Glu Ala Thr Ser Ile
815
<210>27
<211>300
<212>DNA
<213>未知
<220>
<223>未知生物的说明:啮齿类动物;推测为
小鼠(Mus musculus)
<220>
<221>CDS
<222>(1)..(300)
<220>
<221>misc_特征
<222>(62)..(100)
<223>Xaa翻译取决于遗传密码
<400>27
tcc tat tct atg gaa aaa gat gct ttc cta ttt atg aga aat ttg aag 48
Ser Tyr Ser Met Glu Lys Asp Ala Phe Leu Phe Met Arg Asn Leu Lys
1 5 10 15
gtt ctc tca cta aaa gat aac aat gtc aca gct gtc ccc acc act ttg 96
Val Leu Ser Leu Lys Asp Asn Asn Val Thr Ala Val Pro Thr Thr Leu
20 25 30
cca cct aat tta cta gag ctc tat ctt tat aac aat atc att aag aaa 144
Pro Pro Asn Leu Leu Glu Leu Tyr Leu Tyr Asn Asn Ile Ile Lys Lys
35 40 45
atc caa gaa aat gat ttc aat aac ctc aat gag ttg caa gtn ctt gac 192
Ile Gln Glu Asn Asp Phe Asn Asn Leu Asn Glu Leu Gln Xaa Leu Asp
50 55 60
cta ngt gga aat tgc cct cga tgt nat aat gtc cca tat ccg tgt aca 240
Leu Xaa Gly Asn Cys Pro Arg Cys Xaa Asn Val Pro Tyr Pro Cys Thr
65 70 75 80
ccg tgt gaa aat aat tcc ccc tta cag atc cat gan aat gct ttc aat 288
Pro Cys Glu Asn Asn Ser Pro Leu Gln Ile His Xaa Asn Ala Phe Asn
85 90 95
tca tcg aca gan 300
Ser Ser Thr Xaa
100
<210>28
<211>100
<212>PRT
<213>未知
<400>28
Ser Tyr Ser Met Glu Lys Asp Ala Phe Leu Phe Met Arg Asn Leu Lys
1 5 10 15
Val Leu Ser Leu Lys Asp Asn Asn Val Thr Ala Val Pro Thr Thr Leu
20 25 30
Pro Pro Asn Leu Leu Glu Leu Tyr Leu Tyr Asn Asn Ile Ile Lys Lys
35 40 45
Ile Gln Glu Asn Asp Phe Asn Asn Leu Asn Glu Leu Gln Xaa Leu Asp
50 55 60
Leu Xaa Gly Asn Cys Pro Arg Cys Xaa Asn Val Pro Tyr Pro Cys Thr
65 70 75 80
Pro Cys Glu Asn Asn Ser Pro Leu Gln Ile His Xaa Asn Ala Phe Asn
85 90 95
Ser Ser Thr Xaa
100
<210>29
<211>1756
<212>DNA
<213>未知
<220>
<223>未知生物的说明:啮齿类动物;推测为
小鼠(Mus musculus)
<220>
<221>CDS
<222>(1)..(1182)
<400>29
tct cca gaa att ccc tgg aat tcc ttg cct cct gag gtt ttt gag ggt 48
Ser Pro Glu Ile Pro Trp Asn Ser Leu Pro Pro Glu Val Phe Glu Gly
1 5 10 15
atg ccg cca aat cta aag aat ctc tcc ttg gcc aaa aat ggg ctc aaa 96
Met Pro Pro Asn Leu Lys Asn Leu Ser Leu Ala Lys Asn Gly Leu Lys
20 25 30
tct ttc ttt tgg gac aga ctc cag tta ctg aag cat ttg gaa att ttg 144
Ser Phe Phe Trp Asp Arg Leu Gln Leu Leu Lys His Leu Glu Ile Leu
35 40 45
gac ctc agc cat aac cag ctg aca aaa gta cct gag aga ttg gcc aac 192
Asp Leu Ser His Asn Gln Leu Thr Lys Val Pro Glu Arg Leu Ala Asn
50 55 60
tgt tcc aaa agt ctc aca aca ctg att ctt aag cat aat caa atc agg 240
Cys Ser Lys Ser Leu Thr Thr Leu Ile Leu Lys His Asn Gln Ile Arg
65 70 75 80
caa ttg aca aaa tat ttt cta gaa gat gct ttg caa ttg cgc tat cta 288
Gln Leu Thr Lys Tyr Phe Leu Glu Asp Ala Leu Gln Leu Arg Tyr Leu
85 90 95
gac atc agt tca aat aaa atc cag gtc att cag aag act agc ttc cca 336
Asp Ile Ser Ser Asn Lys Ile Gln Val Ile Gln Lys Thr Ser Phe Pro
100 105 110
gaa aat gtc ctc aac aat ctg gag atg ttg gtt tta cat cac aat cgc 384
Glu Asn Val Leu Asn Asn Leu Glu Met Leu Val Leu His His Asn Arg
115 120 125
ttt ctt tgc aac tgt gat gct gtg tgg ttt gtc tgg tgg gtt aac cat 432
Phe Leu Cys Asn Cys Asp Ala Val Trp Phe Val Trp Trp Val Asn His
130 135 140
aca gat gtt act att cca tac ctg gcc act gat gtg act tgt gta ggt 480
Thr Asp Val Thr Ile Pro Tyr Leu Ala Thr Asp Val Thr Cys Val Gly
145 150 155 160
cca gga gca cac aaa ggt caa agt gtc ata tcc ctt gat ctg tat acg 528
Pro Gly Ala His Lys Gly Gln Ser Val Ile Ser Leu Asp Leu Tyr Thr
165 170 175
tgt gag tta gat ctc aca aac ctg att ctg ttc tca gtt tcc ata tca 576
Cys Glu Leu Asp Leu Thr Asn Leu Ile Leu Phe Ser Val Ser Ile Ser
180 185 190
tca gtc ctc ttt ctt atg gta gtt atg aca aca agt cac ctc ttt ttc 624
Ser Val Leu Phe Leu Met Val Val Met Thr Thr Ser His Leu Phe Phe
195 200 205
tgg gat atg tgg tac att tat tat ttt tgg aaa gca aag ata aag ggg 672
Trp Asp Met Trp Tyr Ile Tyr Tyr Phe Trp Lys Ala Lys Ile Lys Gly
210 215 220
tat cca gca tct gca atc cca tgg agt cct tgt tat gat gct ttt att 720
Tyr Pro Ala Ser Ala Ile Pro Trp Ser Pro Cys Tyr Asp Ala Phe Ile
225 230 235 240
gtg tat gac act aaa aac tca gct gtg aca gaa tgg gtt ttg cag gag 768
Val Tyr Asp Thr Lys Asn Ser Ala Val Thr Glu Trp Val Leu Gln Glu
245 250 255
ctg gtg gca aaa ttg gaa gat cca aga gaa aaa cac ttc aat ttg tgt 816
Leu Val Ala Lys Leu Glu Asp Pro Arg Glu Lys His Phc Asn Leu Cys
260 265 270
cta gaa gaa aga gac tgg cta cca gga cag cca gtt cta gaa aac ctt 864
Leu Glu Glu Arg Asp Trp Leu Pro Gly Gln Pro Val Leu Glu Asn Leu
275 280 285
tcc cag agc ata cag ctc agc aaa aag aca gtg ttt gtg atg aca cag 912
Ser Gln Ser Ile Gln Leu Ser Lys Lys Thr Val Phe Val Met Thr Gln
290 295 300
aaa tat gct aag act gag agt ttt aag atg gca ttt tat ttg tct cat 960
Lys Tyr Ala Lys Thr Glu Ser Phe Lys Met Ala Phe Tyr Leu Ser His
305 310 315 320
cag agg ctc ctg gat gaa aaa gtg gat gtg att atc ttg ata ttc ttg 1008
Gln Arg Leu Leu Asp Glu Lys Val Asp Val Ile Ile Leu Ile Phe Leu
325 330 335
gaa aga cct ctt cag aag tct aag ttt ctt cag ctc agg aag aga ctc 1056
Glu Arg Pro Leu Gln Lys Ser Lys Phe Leu Gln Leu Arg Lys Arg Leu
340 345 350
tgc agg agc tct gtc ctt gag tgg cct gca aat cca cag gct cac cca 1104
Cys Arg Ser Ser Val Leu Glu Trp Pro Ala Asn Pro Gln Ala His Pro
355 360 365
tac ttc tgg cag tgc ctg aaa aat gcc ctg acc aca gac aat cat gtg 1152
Tyr Phe Trp Gln Cys Leu Lys Asn Ala Leu Thr Thr Asp Asn His Val
370 375 380
gct tat agt caa atg ttc aag gaa aca gtc tagctctctg aagaatgtca 1202
Ala Tyr Ser Gln Met Phe Lys Glu Thr Val
385 390
ccacctagga catgccttgg tacctgaagt tttcataaag gtttccataa atgaaggtct 1262
gaatttttcc taacagttgt catggctcag attggtggga aatcatcaat atatggctaa 1322
gaaattaaga aggggagact gatagaagat aatttctttc ttcatgtgcc atgctcagtt 1382
aaatatttcc cctagctcaa atctgaaaaa ctgtgcctag gagacaacac aaggctttga 1442
tttatctgca tacaattgat aagagccaca catctgccct gaagaagtac tagtagtttt 1502
agtagtaggg taaaaattac acaagctttc tctctctctg atactgaact gtaccagagt 1562
tcaatgaaat aaaagcccag agaacttctc agtaaatggt ttcattatca tgtagtatcc 1622
accatgcaat atgccacaaa rccgctactg gtacaggaca gntggtagct gcttcaakgc 1682
ctcttatcat tttcttgggg cccatggagg ggttctytgg gaaadaggga agkttttttt 1742
tggccatcca tgaa 1756
<210>30
<211>394
<212>PRT
<213>未知
<400>30
Ser Pro Glu Ile Pro Trp Asn Ser Leu Pro Pro Glu Val Phe Glu Gly
1 5 10 15
Met Pro Pro Asn Leu Lys Asn Leu Ser Leu Ala Lys Asn Gly Leu Lys
20 25 30
Ser Phe Phe Trp Asp Arg Leu Gln Leu Leu Lys His Leu Glu Ile Leu
35 40 45
Asp Leu Ser His Asn Gln Leu Thr Lys Val Pro Glu Arg Leu Ala Asn
50 55 60
Cys Ser Lys Ser Leu Thr Thr Leu Ile Leu Lys His Asn Gln Ile Arg
65 70 75 80
Gln Leu Thr Lys Tyr Phe Leu Glu Asp Ala Leu Gln Leu Arg Tyr Leu
85 90 95
Asp Ile Ser Ser Asn Lys Ile Gln Val Ile Gln Lys Thr Ser Phe Pro
100 105 110
Glu Asn Val Leu Asn Asn Leu Glu Met Leu Val Leu His His Asn Arg
115 120 125
Phe Leu Cys Asn Cys Asp Ala Val Trp Phe Val Trp Trp Val Asn His
130 135 140
Thr Asp Val Thr Ile Pro Tyr Leu Ala Thr Asp Val Thr Cys Val Gly
145 150 155 160
Pro Gly Ala His Lys Gly Gln Ser Val Ile Ser Leu Asp Leu Tyr Thr
165 170 175
Cys Glu Leu Asp Leu Thr Asn Leu Ile Leu Phe Ser Val Ser Ile Ser
180 185 190
Ser Val Leu Phe Leu Met Val Val Met Thr Thr Ser His Leu Phe Phe
195 200 205
Trp Asp Met Trp Tyr Ile Tyr Tyr Phe Trp Lys Ala Lys Ile Lys Gly
210 215 220
Tyr Pro Ala Ser Ala Ile Pro Trp Ser Pro Cys Tyr Asp Ala Phe Ile
225 230 235 240
Val Tyr Asp Thr Lys Asn Ser Ala Val Thr Glu Trp Val Leu Gln Glu
245 250 255
Leu Val Ala Lys Leu Glu Asp Pro Arg Glu Lys His Phe Asn Leu Cys
260 265 270
Leu Glu Glu Arg Asp Trp Leu Pro Gly Gln Pro Val Leu Glu Asn Leu
275 280 285
Ser Gln Ser Ile Gln Leu Ser Lys Lys Thr Val Phe Val Met Thr Gln
290 295 300
Lys Tyr Ala Lys Thr Glu Ser Phe Lys Met Ala Phe Tyr Leu Ser His
305 310 315 320
Gln Arg Leu Leu Asp Glu Lys Val Asp Val Ile Ile Leu Ile Phe Leu
325 330 335
Glu Arg Pro Leu Gln Lys Ser Lys Phe Leu Gln Leu Arg Lys Arg Leu
340 345 350
Cys Arg Ser Ser Val Leu Glu Trp Pro Ala Asn Pro Gln Ala His Pro
355 360 365
Tyr Phe Trp Gln Cys Leu Lys Asn Ala Leu Thr Thr Asp Asn His Val
370 375 380
Ala Tyr Ser Gln Met Phe Lys Glu Thr Val
385 390
<210>31
<211>999
<212>DNA
<213>未知
<220>
<223>未知生物的说明:灵长类动物;推测为
人(Homo sapiens)
<220>
<221>CDS
<222>(2)..(847)
<220>
<221>misc_特征
<222>(1)..(282)
<223>Xaa翻译取决于遗传密码
<400>31
c tcn gat gcc aag att cgg cac nag gca tat tca gag gtc atg atg gtt 49
Xaa Asp Ala Lys Ile Arg His Xaa Ala Tyr Ser Glu Val Met Met Val
1 5 10 15
gga tgg tca gat tca tac acc tgt gaa tac cct tta aac cta agg gga 97
Gly Trp Ser Asp Ser Tyr Thr Cys Glu Tyr Pro Leu Asn Leu Arg Gly
20 25 30
act agg tta aaa gac gtt cat ctc cac gaa tta tct tgc aac aca gct 145
Thr Arg Leu Lys Asp Val His Leu His Glu Leu Ser Cys Asn Thr Ala
35 40 45
ctg ttg att gtc acc att gtg gtt att atg cta gtt ctg ggg ttg gct 193
Leu Leu Ile Val Thr Ile Val Val Ile Met Leu Val Leu Gly Leu Ala
50 55 60
gtg gcc ttc tgc tgt ctc cac ttt gat ctg ccc tgg tat ctc agg atg 241
Val Ala Phe Cys Cys Leu His Phe Asp Leu Pro Trp Tyr Leu Arg Met
65 70 75 80
cta ggt caa tgc aca caa aca tgg cac agg gtt agg aaa aca acc caa 289
Leu Gly Gln Cys Thr Gln Thr Trp His Arg Val Arg Lys Thr Thr Gln
85 90 95
gaa caa ctc aag aga aat gtc cga ttc cac gca ttt att tca tac agt 337
Glu Gln Leu Lys Arg Asn Val Arg Phe His Ala Phe Ile Ser Tyr Ser
100 105 110
gaa cat gat tct ctg tgg gtg aag aat gaa ttg atc ccc aat cta gag 385
Glu His Asp Ser Leu Trp Val Lys Asn Glu Leu Ile Pro Asn Leu Glu
115 120 125
aag gaa gat ggt tct atc ttg att tgc ctt tat gaa agc tac ttt gac 433
Lys Glu Asp Gly Ser Ile Leu Ile Cys Leu Tyr Glu Ser Tyr Phe Asp
130 135 140
cct ggc aaa agc att agt gaa aat att gta agc ttc att gag aaa agc 481
Pro Gly Lys Ser Ile Ser Glu Asn Ile Val Ser Phe Ile Glu Lys Ser
145 150 155 160
tat aag tcc atc ttt gtt ttg tct ccc aac ttt gtc cag aat gag tgg 529
Tyr Lys Ser Ile Phe Val Leu Ser Pro Asn Phe Val Gln Asn Glu Trp
165 170 175
tgc cat tat gaa ttc tac ttt gcc cac cac aat ctc ttc cat gaa aat 577
Cys His Tyr Glu Phe Tyr Phe Ala His His Asn Leu Phe His Glu Asn
180 185 190
tct gat cac ata att ctt atc tta ctg gaa ccc att cca ttc tat tgc 625
Ser Asp His Ile Ile Leu Ile Leu Leu Glu Pro Ile Pro Phe Tyr Cys
195 200 205
att ccc acc agg tat cat aaa ctg raa gct ctc ctg gaa aaa aaa gca 673
Ile Pro Thr Arg Tyr His Lys Leu Xaa Ala Leu Leu Glu Lys Lys Ala
210 215 220
tac ttg gaa tgg ccc aag gat agg cgt aaa tgt ggg ctt tty tgg gca 721
Tyr Leu Glu Trp Pro Lys Asp Arg Arg Lys Cys Gly Leu Xaa Trp Ala
225 230 235 240
aac ctt cga gct gct gtt aat gtt aat gta tta gcc acc aga gaa atg 769
Asn Leu Arg Ala Ala Val Asn Val Asn Val Leu A la Thr Arg Glu Met
245 250 255
tat gaa ctg cag aca ttc aca gag tta aat gaa gag tct cga ggt tct 817
Tyr Glu Leu Gln Thr Phe Thr Glu Leu Asn Glu Glu Ser Arg Gly Ser
260 265 270
aca atc tyt ctg atg aga aca gac tgt yta taaaatccca cagtccttgg 867
Thr Ile Xaa Leu Met Arg Thr Asp Cys Xaa
275 280
gaagttgggg accacataca ctgttgggat gtacattgat acaaccttta tgatggcaat 927
ttgacaatat ttattaaaat aaaaaatggt tattcccttc aaaaaaaaaa aaaaaaaaaa 987
aaaaaaaaaa aa 999
<210>32
<211>282
<212>PRT
<213>未知
<400>32
Xaa Asp Ala Lys Ile Arg His Xaa Ala Tyr Ser Glu Val Met Met Val
1 5 10 15
Gly Trp Ser Asp Ser Tyr Thr Cys Glu Tyr Pro Leu Asn Leu Arg Gly
20 25 30
Thr Arg Leu Lys Asp Val His Leu His Glu Leu Ser Cys Asn Thr Ala
35 40 45
Leu Leu Ile Val Thr Ile Val Val Ile Met Leu Val Leu Gly Leu Ala
50 55 60
Val Ala Phe Cys Cys Leu His Phe Asp Leu Pro Trp Tyr Leu Arg Met
65 70 75 80
Leu Gly Gln Cys Thr Gln Thr Trp His Arg Val Arg Lys Thr Thr Gln
85 90 95
Glu Gln Leu Lys Arg Asn Val Arg Phe His Ala Phe Ile Ser Tyr Ser
100 105 110
Glu His Asp Ser Leu Trp Val Lys Asn Glu Leu Ile Pro Asn Leu Glu
115 120 125
Lys Glu Asp Gly Ser Ile Leu Ile Cys Leu Tyr Glu Ser Tyr Phe Asp
130 135 140
Pro Gly Lys Ser Ile Ser Glu Asn Ile Val Ser Phe Ile Glu Lys Ser
145 150 155 160
Tyr Lys Ser Ile Phe Val Leu Ser Pro Asn Phe Val Gln Asn Glu Trp
165 170 175
Cys His Tyr Glu Phe Tyr Phe Ala His His Asn Leu Phe His Glu Asn
180 185 190
Ser Asp His Ile Ile Leu Ile Leu Leu Glu Pro Ile Pro Phe Tyr Cys
195 200 205
Ile Pro Thr Arg Tyr His Lys Leu Xaa Ala Leu Leu Glu Lys Lys Ala
210 215 220
Tyr Leu Glu Trp Pro Lys Asp Arg Arg Lys Cys Gly Leu Xaa Trp Ala
225 230 235 240
Asn Leu Arg Ala Ala Val Asn Val Asn Val Leu Ala Thr Arg Glu Met
245 250 255
Tyr Gtu Leu Gln Thr Phe Thr Glu Leu Asn Glu Glu Ser Arg Gly Ser
260 265 270
Thr Ile Xaa Leu Met Arg Thr Asp Cys Xaa
275 280
<210>33
<211>1173
<212>DNA
<213>未知
<220>
<223>未知生物的说明:灵长类动物;推测为
人(Homo sapiens)
<220>
<22l>CDS
<222>(1)..(1008)
<220>
<22l>misc_特征
<222>(285)
<223>Xaa翻译取决于遗传密码
<400>33
ctg cct gct ggc acc cgg ctc cgg agg ctg gat gtc agc tgc aac agc 48
Leu Pro Ala Gly Thr Arg Leu Arg Arg Leu Asp Val Ser Cys Asn Ser
1 5 10 15
atc agc ttc gtg gcc ccc ggc ttc ttt tcc aag gcc aag gag ctg cga 96
Ile Ser Phe Val Ala Pro Gly Phe Phe Ser Lys Ala Lys Glu Leu Arg
20 25 30
gag ctc aac ctt agc gcc aac gcc ctc aag aca gtg gac cac tcc tgg 144
Glu Leu Asn Leu Ser Ala Asn Ala Leu Lys Thr Val Asp His Ser Trp
35 40 45
ttt ggg ccc ctg gcg agt gcc ctg caa ata cta gat gta agc gcc aac 192
Phe Gly Pro Leu Ala Ser Ala Leu Gln Ile Leu Asp Val Ser Ala Asn
50 55 60
cct ctg cac tgc gcc tgt ggg gcg gcc ttt atg gac ttc ctg ctg gag 240
Pro Leu His Cys Ala Cys Gly Ala Ala Phe Met Asp Phe Leu Leu Glu
65 70 75 80
gtg cag gct gcc gtg ccc ggt ctg ccc agc cgg gtg aag tgt ggc agt 288
Val Gln Ala Ala Val Pro Gly Leu Pro Ser Arg Val Lys Cys Gly Ser
85 90 95
ccg ggc cag ctc cag ggc ctc agc atc ttt gca cag gac ctg cgc ctc 336
Pro Gly Gln Leu Gln Gly Leu Ser Ile Phe Ala Gln Asp Leu Arg Leu
100 105 110
tgc ctg gat gag gcc ctc tcc tgg gac tgt ttc gcc ctc tcg ctg ctg 384
Cys Leu Asp Glu Ala Leu Ser Trp Asp Cys Phe Ala Leu Ser Leu Leu
115 120 125
gct gtg gct ctg ggc ctg ggt gtg ccc atg ctg cat cac ctc tgt ggc 432
Ala Val Ala Leu Gly Leu Gly Val Pro Met Leu His His Leu Cys Gly
130 135 140
tgg gac ctc tgg tac tgc ttc cac ctg tgc ctg gcc tgg ctt ccc tgg 480
Trp Asp Leu Trp Tyr Cys Phe His Leu Cys Leu Ala Trp Leu Pro Trp
145 150 155 160
cgg ggg cgg caa agt ggg cga gat gag gat gcc ctg ccc tac gat gcc 528
Arg Gly Arg Gln Ser Gly Arg Asp Glu Asp Ala Leu Pro Tyr Asp Ala
165 170 175
ttc gtg gtc ttc gac aaa acg cag agc gca gtg gca gac tgg gtg tac 576
Phe Val Val Phe Asp Lys Thr Gln Ser Ala Val Ala Asp Trp Val Tyr
180 185 190
aac gag ctt cgg ggg cag ctg gag gag tgc cgt ggg cgc tgg gca ctc 624
Asn Glu Leu Arg Gly Gln Leu Glu Glu Cys Arg Gly Arg Trp Ala Leu
195 200 205
cgc ctg tgc ctg gag gaa cgc gac tgg ctg cct ggc aaa acc ctc ttt 672
Arg Leu Cys Leu Glu Glu Arg Asp Trp Leu Pro Gly Lys Thr Leu Phe
210 215 220
gag aac ctg tgg gcc tcg gtc tat ggc agc cgc aag acg ctg ttt gtg 720
Glu Asn Leu Trp Ala Ser Val Tyr Gly Ser Arg Lys Thr Leu Phe Val
225 230 235 240
ctg gcc cac acg gac cgg gtc agt ggt ctc ttg cgc gcc agc ttc ctg 768
Leu Ala His Thr Asp Arg Val Ser Gly Leu Leu Arg Ala Ser Phe Leu
245 250 255
ctg gcc cag cag cgc ctg ctg gag gac cgc aag gac gtc gtg gtg ctg 816
Leu Ala Gln Gln Arg Leu Leu Glu Asp Arg Lys Asp Val Val Val Leu
260 265 270
gtg atc ctg agc cct gac ggc cgc cgc tcc cgc tac gkg cgg ctg cgc 864
Val Ile Leu Ser Pro Asp Gly Arg Arg Ser Arg Tyr Xaa Arg Leu Arg
275 280 285
cag cgc ctc tgc cgc cag agt gtc ctc ctc tgg ccc cac cag ccc agt 912
Gln Arg Leu Cys Arg Gln Ser Val Leu Leu Trp Pro His Gln Pro Ser
290 295 300
ggt cag cgc agc ttc tgg gcc cag ctg ggc atg gcc ctg acc agg gac 960
Gly Gln Arg Ser Phe Trp Ala Gln Leu Gly Met Ala Leu Thr Arg Asp
305 310 315 320
aac cac cac ttc tat aac cgg aac ttc tgc cag gga ccc acg gcc gaa 1008
Asn His His Phe Tyr Asn Arg Asn Phe Cys Gln Gly Pro Thr Ala Glu
325 330 335
tagccgtgag ccggaatcct gcacggtgcc acctccacac tcacctcacc tctgcctgcc 1068
tggtctgacc ctcccctgct cgcctccctc accccacacc tgacacagag caggcactca 1128
ataaatgcta ccgaaggcta aaaaaaaaaa aaaaaaaaaa aanna 1173
<210>34
<211>336
<212>PRT
<213>未知
<400>34
Leu Pro Ala Gly Thr Arg Leu Arg Arg Leu Asp Val Ser Cys Asn Ser
1 5 10 15
Ile Ser Phe Val Ala Pro Gly Phe Phe Ser Lys Ala Lys Glu Leu Arg
20 25 30
Glu Leu Asn Leu Ser Ala Asn Ala Leu Lys Thr Val Asp His Ser Trp
35 40 45
Phe Gly Pro Leu Ala Ser Ala Leu Gln Ile Leu Asp Val Ser Ala Asn
50 55 60
Pro Leu His Cys Ala Cys Gly Ala Ala Phe Met Asp Phe Leu Leu Glu
65 70 75 80
Val Gln Ala Ala Val Pro Gly Leu Pro Ser Arg Val Lys Cys Gly Ser
85 90 95
Pro Gly Gln Leu Gln Gly Leu Ser Ile Phe Ala Gln Asp Leu Arg Leu
100 105 110
Cys Leu Asp Glu Ala Leu Ser Trp Asp Cys Phe Ala Leu Ser Leu Leu
115 120 125
Ala Val Ala Leu Gly Leu Gly Val Pro Met Leu His His Leu Cys Gly
130 135 140
Trp Asp Leu Trp Tyr Cys Phe His Leu Cys Leu Ala Trp Leu Pro Trp
145 150 155 160
Arg Gly Arg Gln Ser Gly Arg Asp Glu Asp Ala Leu Pro Tyr Asp Ala
165 170 175
Phe Val Val Phe Asp Lys Thr Gln Ser Ala Val Ala Asp Trp Val Tyr
180 185 190
Asn Glu Leu Arg Gly Gln Leu Glu Glu Cys Arg Gly Arg Trp Ala Leu
195 200 205
Arg Leu Cys Leu Glu Glu Arg Asp Trp Leu Pro Gly Lys Thr Leu Phe
210 215 220
Glu Asn Leu Trp Ala Ser Val Tyr Gly Ser Arg Lys Thr Leu Phe Val
225 230 235 240
Leu Ala His Thr Asp Arg Val Ser Gly Leu Leu Arg Ala Ser Phe Leu
245 250 255
Leu Ala Gln Gln Arg Leu Leu Glu Asp Arg Lys Asp Val Val Val Leu
260 265 270
Val Ile Leu Ser Pro Asp Gly Arg Arg Ser Arg Tyr Xaa Arg Leu Arg
275 280 285
Gln Arg Leu Cys Arg Gln Ser Val Leu Leu Trp Pro His Gln Pro Ser
290 295 300
Gly Gln Arg Ser Phe Trp Ala Gln Leu Gly Met Ala Leu Thr Arg Asp
305 310 315 320
Asn His His Phe Tyr Asn Arg Asn Phe Cys Gln Gly Pro Thr Ala Glu
325 330 335
<210>35
<211>497
<212>DNA
<213>未知
<220>
<223>未知生物的说明:啮齿类动物;推测为
小鼠(Mus musculus)
<400>35
tggcccacac ggaccgcgtc agtggcctcc tgcgcaccag cttcctgctg gctcagcagc 60
gcctgttgga agaccgcaag gacgtggtgg tgttggtgat cctgcgtccg gatgccccac 120
cgtcccgcta tgtgcgactg cgccagcgtc tctgccgcca gagtgtgctc ttctggcccc 180
agcgacccaa cgggcagggg ggcttctggg cccagctgag tacagccctg actagggaca 240
accgccactt ctataaccag aacttctgcc ggggacctac agcagaatag ctcagagcaa 300
cagctggaaa cagctgcatc ttcatgtctg gttcccgagt tgctctgcct gccttgctct 360
gtcttactac accgctattt ggcaagtgcg caatatatgc taccaagcca ccaggcccac 420
ggagcaaagg ttggctgtaa agggtagttt tcttcccatg catctttcag gagagtgaag 480
atagacacca aacccac 497
<210>36
<211>3099
<212>DNA
<213>未知
<220>
<223>未知生物的说明:灵长类动物;推测为
人(Homo sapiens)
<220>
<22l>CDS
<222>(1)..(3096)
<220>
<22l>mat_肽
<222>(52)..(3096)
<220>
<221>misc_特征
<222>(725)
<223>Xaa翻译取决于遗传密码
<400>36
atg ctg acc tgc att ttc ctg cta ata tct ggt tcc tgt gag tta tgc 48
Met Leu Thr Cys Ile Phe Leu Leu Ile Ser Gly Ser Cys Glu Leu Cys
-15 -10 -5
gcc gaa gaa aat ttt tct aga agc tat cct tgt gat gag aaa aag caa 96
Ala Glu Glu Asn Phe Ser Arg Ser Tyr Pro Cys Asp Glu Lys Lys Gln
-1 l 5 10 15
aat gac tca gtt att gca gag tgc agc aat cgt cga cta cag gaa gtt 144
Asn Asp Ser Val Ile Ala Glu Cys Ser Asn Arg Arg Leu Gln Glu Val
20 25 30
ccc caa acg gtg ggc aaa tat gtg aca gaa cta gac ctg tct gat aat 192
Pro Gln Thr Val Gly Lys Tyr Val Thr Glu Leu Asp Leu Ser Asp Asn
35 40 45
ttc atc aca cac ata acg aat gaa tca ttt caa ggg ctg caa aat ctc 240
Phe Ile Thr His Ile Thr Asn Glu Ser Phe Gln G1y Leu Gln Asn Leu
50 55 60
act aaa ata aat cta aac cac aac ccc aat gta cag cac cag aac gga 288
Thr Lys Ile Asn Leu Asn His Asn Pro Asn Val Gln His Gln Asn Gly
65 70 75
aat ccc ggt ata caa tca aat ggc ttg aat atc aca gac ggg gca ttc 336
Asn Pro Gly Ile Gln Ser Asn Gly Leu Asn Ile Thr Asp Gly Ala Phe
80 85 90 95
ctc aac cta aaa aac cta agg gag tta ctg ctt gaa gac aac cag tta 384
Leu Asn Leu Lys Asn Leu Arg Glu Leu Leu Leu Glu Asp Asn Gln Leu
100 105 110
ccc caa ata ccc tct ggt ttg cca gag tct ttg aca gaa ctt agt cta 432
Pro Gln Ile Pro Ser Gly Leu Pro Glu Ser Leu Thr Glu Leu Ser Leu
115 120 125
att caa aac aat ata tac aac ata act aaa gag ggc att tca aga ctt 480
Ile Gln Asn Asn Ile Tyr Asn Ile Thr Lys Glu Gly Ile Ser Arg Leu
130 135 140
ata aac ttg aaa aat ctc tat ttg gcc tgg aac tgc tat ttt aac aaa 528
Ile Asn Leu Lys Asn Leu Tyr Leu Ala Trp Asn Cys Tyr Phe Asn Lys
145 150 155
gtt tgc gag aaa act aac ata gaa gat gga gta ttt gaa acg ctg aca 576
Val Cys Glu Lys Thr Asn Ile Glu Asp Gly Val Phe Glu Thr Leu Thr
160 165 170 175
aat ttg gag ttg cta tca cta tct ttc aat tct ctt tca cat gtg cca 624
Asn Leu Glu Leu Leu Ser Leu Ser Phe Asn Ser Leu Ser His Val Pro
180 185 190
ccc aaa ctg cca agc tcc cta cgc aaa ctt ttt ctg agc aac acc cag 672
Pro Lys Leu Pro Ser Ser Leu Arg Lys Leu Phe Leu Ser Asn Thr Gln
195 200 205
atc aaa tac att agt gaa gaa gat ttc aag gga ttg ata aat tta aca 720
Ile Lys Tyr Ile Ser Glu Glu Asp Phe Lys Gly Leu Ile Asn Leu Thr
210 215 220
tta cta gat tta agc ggg aac tgt ccg agg tgc ttc aat gcc cca ttt 768
Leu Leu Asp Leu Ser Gly Asn Cys Pro Arg Cys Phe Asn Ala Pro Phe
225 230 235
cca tgc gtg cct tgt gat ggt ggt gct tca att aat ata gat cgt ttt 816
Pro Cys Val Pro Cys Asp Gly Gly Ala Ser Ile Asn Ile Asp Arg Phe
240 245 250 255
gct ttt caa aac ttg acc caa ctt cga tac cta aac ctc tct agc act 864
Ala Phe Gln Asn Leu Thr Gln Leu Arg Tyr Leu Asn Leu Ser Ser Thr
260 265 270
tcc ctc agg aag att aat gct gcc tgg ttt aaa aat atg cct cat ctg 912
Ser Leu Arg Lys Ile Asn Ala Ala Trp Phe Lys Asn Met Pro His Leu
275 280 285
aag gtg ctg gat ctt gaa ttc aac tat tta gtg gga gaa ata gcc tct 960
Lys Val Leu Asp Leu Glu Phe Asn Tyr Leu Val Gly Glu Ile Ala Ser
290 295 300
ggg gca ttt tta acg atg ctg ccc cgc tta gaa ata ctt gac ttg tct 1008
Gly Ala Phe Leu Thr Met Leu Pro Arg Leu Glu Ile Leu Asp Leu Ser
305 310 315
ttt aac tat ata aag ggg agt tat cca cag cat att aat att tcc aga 1056
Phe Asn Tyr Ile Lys Gly Ser Tyr Pro Gln His Ile Asn Ile Ser Arg
320 325 330 335
aac ttc tct aaa ctt ttg tct cta cgg gca ttg cat tta aga ggt tat 1104
Asn Phe Ser Lys Leu Leu Ser Leu Arg Ala Leu His Leu Arg Gly Tyr
340 345 350
gtg ttc cag gaa ctc aga gaa gat gat ttc cag ccc ctg atg cag ctt 1152
Val Phe Gln Glu Leu Arg Glu Asp Asp Phe Gln Pro Leu Met Gln Leu
355 360 365
cca aac tta tcg act atc aac ttg ggt att aat ttt att aag caa atc 1200
Pro Asn Leu Ser Thr Ilc Asn Leu Gly Ile Asn Phe Ile Lys Gln Ile
370 375 380
gat ttc aaa ctt ttc caa aat ttc tcc aat ctg gaa att att tac ttg 1248
Asp Phe Lys Leu Phe Gln Asn Phe Ser Asn Leu Glu Ile Ile Tyr Leu
385 390 395
tca gaa aac aga ata tca ccg ttg gta aaa gat acc cgg cag agt tat 1296
Ser Glu Asn Arg Ile Ser Pro Leu Val Lys Asp Thr Arg Gln Ser Tyr
400 405 410 415
gca aat agt tcc tct ttt caa cgt cat atc cgg aaa cga cgc tca aca 1344
Ala Asn Ser Ser Ser Phe Gln Arg His Ile Arg Lys Arg Arg Ser Thr
420 425 430
gat ttt gag ttt gac cca cat tcg aac ttt tat cat ttc acc cgt cct 1392
Asp Phe Glu Phe Asp Pro His Ser Asn Phe Tyr His Phe Thr Arg Pro
435 440 445
tta ata aag cca caa tgt gct gct tat gga aaa gcc tta gat tta agc 1440
Leu Ile Lys Pro Gln Cys Ala Ala Tyr Gly Lys Ala Leu Asp Leu Ser
450 455 460
ctc aac agt att ttc ttc att ggg cca aac caa ttt gaa aat ctt cct 1488
Leu Asn Ser Ile Phe Phe Ile Gly Pro Asn Gln Phe Glu Asn Leu Pro
465 470 475
gac att gcc tgt tta aat ctg tct gca aat agc aat gct caa gtg tta 1536
Asp Ile Ala Cys Leu Asn Leu Ser Ala Asn Ser Asn Ala Gln Val Leu
480 485 490 495
agt gga act gaa ttt tca gcc att cct cat gtc aaa tat ttg gat ttg 1584
Ser Gly Thr Glu Phe Ser Ala Ile Pro His Val Lys Tyr Leu Asp Leu
500 505 510
aca aac aat aga cta gac ttt gat aat gct agt gct ctt act gaa ttg 1632
Thr Asn Asn Arg Leu Asp Phe Asp Asn Ala Ser Ala Leu Thr Glu Leu
515 520 525
tcc gac ttg gaa gtt cta gat ctc agc tat aat tca cac tat ttc aga 1680
Ser Asp Leu Glu Val Leu Asp Leu Ser Tyr Asn Ser His Tyr Phe Arg
530 535 540
ata gca ggc gta aca cat cat cta gaa ttt att caa aat ttc aca aat 1728
Ile Ala Gly Val Thr His His Leu Glu Phe Ile Gln Asn Phe Thr Asn
545 550 555
cta aaa gtt tta aac ttg agc cac aac aac att tat act tta aca gat 1776
Leu Lys Val Leu Asn Leu Ser His Asn Asn Ile Tyr Thr Leu Thr Asp
560 565 570 575
aag tat aac ctg gaa agc aag tcc ctg gta gaa tta gtt ttc agt ggc 1824
Lys Tyr Asn Leu Glu Ser Lys Ser Leu Val Glu Leu Val Phe Ser Gly
580 585 590
aat cgc ctt gac att ttg tgg aat gat gat gac aac agg tat atc tcc 1872
Asn Arg Leu Asp Ile Leu Trp Asn Asp Asp Asp Asn Arg Tyr Ile Ser
595 600 605
att ttc aaa ggt ctc aag aat ctg aca cgt ctg gat tta tcc ctt aat 1920
Ile Phe Lys Gly Leu Lys Asn Leu Thr Arg Leu Asp Leu Ser Leu Asn
610 615 620
agg ctc aag cac atc cca aat gaa gca ttc ctt aat ttg cca gcg agt 1968
Arg Leu Lys His Ile Pro Asn Glu Ala Phe Leu Asn Leu Pro Ala Ser
625 630 635
ctc act gaa cta cat ata aat gat aat atg tta aag ttt ttt aac tgg 2016
Leu Thr Glu Leu His Ile Asn Asp Asn Met Leu Lys Phe Phe Asn Trp
640 645 650 655
aca tta ctc cag cag ttt cct cgt ctc gag ttg ctt gac tta cgt gga 2064
Thr Leu Leu Gln Gln Phe Pro Arg Leu Glu Leu Leu Asp Leu Arg Gly
660 665 670
aac aaa cta ctc ttt tta act gat agc cta tct gac ttt aca tct tcc 2112
Asn Lys Leu Leu Phe Leu Thr Asp Ser Leu Ser Asp Phe Thr Ser Ser
675 680 685
ctt cgg aca ctg ctg ctg agt cat aac agg att tcc cac cta ccc tct 2160
Leu Arg Thr Leu Leu Leu Ser His Asn Arg Ile Ser His Leu Pro Ser
690 695 700
ggc ttt ctt tct gaa gtc agt agt ctg aag cac ctc gat tta agt tcc 2208
Gly Phe Leu Ser Glu Val Ser Ser Leu Lys His Leu Asp Leu Ser Ser
705 710 715
aat ctg cta aaa aca atm aac aaa tcc gca ctt gaa act aag acc acc 2256
Asn Leu Leu Lys Thr Xaa Asn Lys Ser Ala Leu Glu Thr Lys Thr Thr
720 725 730 735
acc aaa tta tct atg ttg gaa cta cac gga aac ccc ttt gaa tgc acc 2304
Thr Lys Leu Ser Met Leu Glu Leu His Gly Asn Pro Phe Glu Cys Thr
740 745 750
tgt gac att gga gat ttc cga aga tgg atg gat gaa cat ctg aat gtc 2352
Cys Asp Ile Gly Asp Phe Arg Arg Trp Met Asp Glu His Leu Asn Val
755 760 765
aaa att ccc aga ctg gta gat gtc att tgt gcc agt cct ggg gat caa 2400
Lys Ile Pro Arg Leu Val Asp Val Ile Cys Ala Ser Pro Gly Asp Gln
770 775 780
aga ggg aag agt att gtg agt ctg gag cta aca act tgt gtt tca gat 2448
Arg Gly Lys Ser Ile Val Ser Leu Glu Leu Thr Thr Cys Val Ser Asp
785 790 795
gtc act gca gtg ata tta ttt ttc ttc acg ttc ttt atc acc acc atg 2496
Val Thr Ala Val Ile Leu Phe Phe Phe Thr Phe Phe Ile Thr Thr Met
800 805 810 815
gtt atg ttg gct gcc ctg gct cac cat ttg ttt tac tgg gat gtt tgg 2544
Val Met Leu Ala Ala Leu Ala His His Leu Phe Tyr Trp Asp Val Trp
820 825 830
ttt ata tat aat gtg tgt tta gct aag tta aaa ggc tac agg tct ctt 2592
Phe Ile Tyr Asn Val Cys Leu Ala Lys Leu Lys Gly Tyr Arg Ser Leu
835 840 845
tcc aca tcc caa act ttc tat gat gct tac att tct tat gac acc aaa 2640
Ser Thr Ser Gln Thr Phe Tyr Asp Ala Tyr Ile Ser Tyr Asp Thr Lys
850 855 860
gat gcc tct gtt act gac tgg gtg ata aat gag ctg cgc tac cac ctt 2688
Asp Ala Ser Val Thr Asp Trp Val Ile Asn Glu Leu Arg Tyr His Leu
865 870 875
gaa gag agc cga gac aaa aac gtt ctc ctt tgt cta gag gag agg gat 2736
Glu Glu Ser Arg Asp Lys Asn Val Leu Leu Cys Leu Glu Glu Arg Asp
880 885 890 895
tgg gac ccg gga ttg gcc atc atc gac aac ctc atg cag agc atc aac 2784
Trp Asp Pro Gly Leu Ala Ile Ile Asp Asn Leu Met Gln Ser Ile Asn
900 905 910
caa agc aag aaa aca gta ttt gtt tta acc aaa aaa tat gca aaa agc 2832
Gln Ser Lys Lys Thr Val Phe Val Leu Thr Lys Lys Tyr Ala Lys Ser
915 920 925
tgg aac ttt aaa aca gct ttt tac ttg gcc ttg cag agg cta atg ggt 2880
Trp Asn Phe Lys Thr Ala Phe Tyr Leu Ala Leu Gln Arg Leu Met Gly
930 935 940
gag aac atg gat gtg att ata ttt atc ctg ctg gag cca gtg tta cag 2928
Glu Asn Met Asp Val Ile Ile Phe Ile Leu Leu Glu Pro Val Leu Gln
945 950 955
cat tct ccg tat ttg agg cta cgg cag cgg atc tgt aag agc tcc atc 2976
His Ser Pro Tyr Leu Arg Leu Arg Gln Arg Ile Cys Lys Ser Ser Ile
960 965 970 975
ctc cag tgg cct gac aac ccg aag gca gaa ggc ttg ttt tgg caa act 3024
Leu Gln Trp Pro Asp Asn Pro Lys Ala Glu Gly Leu Phe Trp Gln Thr
980 985 990
ctg aga aat gtg gtc ttg act gaa aat gat tca cgg tat aac aat atg 3072
Leu Arg Asn Val Val Leu Thr Glu Asn Asp Ser Arg Tyr Asn Asn Met
995 1000 1005
tat gtc gat tcc att aag caa tac taa 3099
Tyr Val Asp Ser Ile Lys Gln Tyr
1010 1015
<210>37
<211>1032
<212>PRT
<213>未知
<400>37
Met Leu Thr Cys Ile Phe Leu Leu Ile Ser Gly Ser Cys Glu Leu Cys
-15 -10 -5
Ala Glu Glu Asn Phe Ser Arg Ser Tyr Pro Cys Asp Glu Lys Lys Gln
-1 1 5 10 15
Asn Asp Ser Val Ile Ala Glu Cys Ser Asn Arg Arg Leu Gln Glu Val
20 25 30
Pro Gln Thr Val Gly Lys Tyr Val Thr Glu Leu Asp Leu Ser Asp Asn
35 40 45
Phe Ile Thr His Ile Thr Asn Glu Ser Phe Gln Gly Leu Gln Asn Leu
50 55 60
Thr Lys Ile Asn Leu Asn His Asn Pro Asn Val Gln His Gln Asn Gly
65 70 75
Asn Pro Gly Ile Gln Ser Asn Gly Leu Asn Ile Thr Asp Gly Ala Phe
80 85 90 95
Leu Asn Leu Lys Asn Leu Arg Glu Leu Leu Leu Glu Asp Asn Gln Leu
100 105 110
Pro Gln Ile Pro Ser Gly Leu Pro Glu Ser Leu Thr Glu Leu Ser Leu
115 120 125
Ile Gln Asn Asn Ile Tyr Asn Ile Thr Lys Glu Gly Ile Ser Arg Leu
130 135 140
Ile Asn Leu Lys Asn Leu Tyr Leu Ala Trp Asn Cys Tyr Phe Asn Lys
145 150 155
Val Cys Glu Lys Thr Asn Ile Glu Asp Gly Val Phe Glu Thr Leu Thr
160 165 170 175
Asn Leu Glu Leu Leu Ser Leu Ser Phe Asn Ser Leu Ser His Val Pro
180 185 190
Pro Lys Leu Pro Ser Ser Leu Arg Lys Leu Phe Leu Ser Asn Thr Gln
195 200 205
Ile Lys Tyr Ile Ser Glu Glu Asp Phe Lys Gly Leu Ile Asn Leu Thr
210 215 220
Leu Leu Asp Leu Ser Gly Asn Cys Pro Arg Cys Phe Asn Ala Pro Phe
225 230 235
Pro Cys Val Pro Cys Asp Gly Gly Ala Ser Ile Asn Ile Asp Arg Phe
240 245 250 255
Ala Phe Gln Asn Leu Thr Gln Leu Arg Tyr Leu Asn Leu Ser Ser Thr
260 265 270
Ser Leu Arg Lys Ile Ash Ala Ala Trp Phe Lys Asn Met Pro His Leu
275 280 285
Lys Val Leu Asp Leu Glu Phe Asn Tyr Leu Val Gly Glu Ile Ala Ser
290 295 300
Gly Ala Phe Leu Thr Met Leu Pro Arg Leu Glu Ile Leu Asp Leu Ser
305 310 315
Phe Asn Tyr Ile Lys Gly Ser Tyr Pro Gln His Ile Asn Ile Ser Arg
320 325 330 335
Asn Phe Ser Lys Leu Leu Ser Leu Arg Ala Leu His Leu Arg Gly Tyr
340 345 350
Val Phe Gln Glu Leu Arg Glu Asp Asp Phe Gln Pro Leu Met Gln Leu
355 360 365
Pro Asn Leu Ser Thr Ile Asn Leu Gly Ile Asn Phe Ile Lys Gln Ile
370 375 380
Asp Phe Lys Leu Phe Gln Asn Phe Ser Asn Leu Glu Ile Ile Tyr Leu
385 390 395
Ser Glu Asn Arg Ile Ser Pro Leu Val Lys Asp Thr Arg Gln Ser Tyr
400 405 410 415
Ala Asn Ser Ser Ser Phe Gln Arg His Ile Arg Lys Arg Arg Ser Thr
420 425 430
Asp Phe Glu Phe Asp Pro His Ser Asn Phe Tyr His Phe Thr Arg Pro
435 440 445
Leu Ile Lys Pro Gln Cys Ala Ala Tyr Gly Lys Ala Leu Asp Leu Ser
450 455 460
Leu Asn Ser Ile Phe Phe Ile Gly Pro Asn Gln Phe Glu Asn Leu Pro
465 470 475
Asp Ile Ala Cys Leu Asn Leu Ser Ala Asn Ser Asn Ala Gln Val Leu
480 485 490 495
Ser Gly Thr Glu Phe Ser Ala Ile Pro His Val Lys Tyr Leu Asp Leu
500 505 510
Thr Asn Asn Arg Leu Asp Phe Asp Asn Ala Ser Ala Leu Thr Glu Leu
515 520 525
Ser Asp Leu Glu Val Leu Asp Leu Ser Tyr Asn Ser His Tyr Phe Arg
530 535 540
Ile Ala Gly Val Thr His His Leu Glu Phe Ile Gln Asn Phe Thr Asn
545 550 555
Leu Lys Val Leu Asn Leu Ser His Asn Asn Ile Tyr Thr Leu Thr Asp
560 565 570 575
Lys Tyr Asn Leu Glu Ser Lys Ser Leu Val Glu Leu Val Phe Ser Gly
580 585 590
Asn Arg Leu Asp Ile Leu Trp Asn Asp Asp Asp Asn Arg Tyr Ile Ser
595 600 605
Ile Phe Lys Gly Leu Lys Asn Leu Thr Arg Leu Asp Leu Ser Leu Asn
610 615 620
Arg Leu Lys His Ile Pro Asn Glu Ala Phe Leu Asn Leu Pro Ala Ser
625 630 635
Leu Thr Glu Leu His Ile Asn Asp Asn Met Leu Lys Phe Phe Asn Trp
640 645 650 655
Thr Leu Leu Gln Gln Phe Pro Arg Leu Glu Leu Leu Asp Leu Arg Gly
660 665 670
Asn Lys Leu Leu Phe Leu Thr Asp Ser Leu Ser Asp Phe Thr Ser Ser
675 680 685
Leu Arg Thr Leu Leu Leu Ser His Asn Arg Ile Ser His Leu Pro Ser
690 695 700
Gly Phe Leu Ser Glu Val Ser Ser Leu Lys His Leu Asp Leu Ser Ser
705 710 715
Asn Leu Leu Lys Thr Xaa Asn Lys Ser Ala Leu Glu Thr Lys Thr Thr
720 725 730 735
Thr Lys Leu Ser Met Leu Glu Leu His Gly Asn Pro Phe Glu Cys Thr
740 745 750
Cys Asp Ile Gly Asp Phe Arg Arg Trp Met Asp Glu His Leu Asn Val
755 760 765
Lys Ile Pro Arg Leu Val Asp Val Ile Cys Ala Ser Pro Gly Asp Gln
770 775 780
Arg Gly Lys Ser Ile Val Ser Leu Glu Leu Thr Thr Cys Val Ser Asp
785 790 795
Val Thr Ala Val Ile Leu Phe Phe Phe Thr Phe Phe Ile Thr Thr Met
800 805 810 815
Val Met Leu Ala Ala Leu Ala His His Leu Phe Tyr Trp Asp Val Trp
820 825 830
Phe Ile Tyr Asn Val Cys Leu Ala Lys Leu Lys Gly Tyr Arg Ser Leu
835 840 845
Ser Thr Ser Gln Thr Phe Tyr Asp Ala Tyr Ile Ser Tyr Asp Thr Lys
850 855 860
Asp Ala Ser Val Thr Asp Trp Val Ile Asn Glu Leu Arg Tyr His Leu
865 870 875
Glu Glu Ser Arg Asp Lys Asn Val Leu Leu Cys Leu Glu Glu Arg Asp
880 885 890 895
Trp Asp Pro Gly Leu Ala Ile Ile Asp Asn Leu Met Gln Ser Ile Asn
900 905 910
Gln Sar Lys Lys Thr Val Phe Val Leu Thr Lys Lys Tyr Ala Lys Ser
915 920 925
Trp Asn Phe Lys Thr Ala Phe Tyr Leu Ala Leu Gln Arg Leu Met Gly
930 935 940
Glu Asn Met Asp Val Ile Ile Phe Ile Leu Leu Glu Pro Val Leu Gln
945 950 955
His Ser Pro Tyr Leu Arg Leu Arg Gln Arg Ile Cys Lys Ser Ser Ile
960 965 970 975
Leu Gln Trp Pro Asp Asn Pro Lys Ala Glu Gly Leu Phe Trp Gln Thr
980 985 990
Leu Arg Asn Val Val Leu Thr Glu Asn Asp Ser Arg Tyr Asn Asn Met
995 1000 1005
Tyr Val Asp Ser Ile Lys Gln Tyr
1010 1015
<210>38
<211>3046
<212>DNA
<213>未知
<220>
<223>未知生物的说明:灵长类动物;推测为
人(Homo sapiens)
<220>
<221>CDS
<222>(111)..(2543)
<220>
<221>mat_肽
<222>(168)..(2543)
<400>38
gaatcatcca cgcacctgca gctctgctga gagagtgcaa gccgtggggg ttttgagctc 60
atcttcatca ttcatatgag gaaataagtg gtaaaatcct tggaaataca atg aga 116
Met Arg
ctc atc aga aac att tac ata ttt tgt agt att gtt atg aca gca gag 164
Leu Ile Arg Asn Ile Tyr Ile Phe Cys Ser Ile Val Met Thr Ala Glu
-15 -10 -5
ggt gat gct cca gag ctg cca gaa gaa agg gaa ctg atg acc aac tgc 212
Gly Asp Ala Pro Glu Leu Pro Glu Glu Arg Glu Leu Met Thr Asn Cys
-1 1 5 10 15
tcc aac atg tct cta aga aag gtt ccc gca gac ttg acc cca gcc aca 260
Ser Asn Met Ser Leu Arg Lys Val Pro Ala Asp Leu Thr Pro Ala Thr
20 25 30
acg aca ctg gat tta tcc tat aac ctc ctt ttt caa ctc cag agt tca 308
Thr Thr Leu Asp Leu Ser Tyr Asn Leu Leu Phe Gln Leu Gln Ser Ser
35 40 45
gat ttt cat tct gtc tcc aaa ctg aga gtt ttg att cta tgc cat aac 356
Asp Phe His Ser Val Ser Lys Leu Arg Val Leu Ile Leu Cys His Asn
50 55 60
aga att caa cag ctg gat ctc aaa acc ttt gaa ttc aac aag gag tta 404
Arg Ile Gln Gln Leu Asp Leu Lys Thr Phe Glu Phe Asn Lys Glu Leu
65 70 75
aga tat tta gat ttg tct aat aac aga ctg aag agt gta act tgg tat 452
Arg Tyr Leu Asp Leu Ser Asn Asn Arg Leu Lys Ser Val Thr Trp Tyr
80 85 90 95
tta ctg gca ggt ctc agg tat tta gat ctt tct ttt aat gac ttt gac 500
Leu Leu Ala Gly Leu Arg Tyr Leu Asp Leu Ser Phe Asn Asp Phe Asp
100 105 110
acc atg cct atc tgt gag gaa gct ggc aac atg tca cac ctg gaa atc 548
Thr Met Pro Ile Cys Glu Glu Ala Gly Asn Met Ser His Leu Glu Ile
115 120 125
cta ggt ttg agt ggg gca aaa ata caa aaa tca gat ttc cag aaa att 596
Leu Gly Leu Ser Gly Ala Lys Ile Gln Lys Ser Asp Phe Gln Lys Ile
130 135 140
gct cat ctg cat cta aat act gtc ttc tta gga ttc aga act ctt cct 644
Ala His Leu His Leu Asn Thr Val Phe Leu Gly Phe Arg Thr Leu Pro
145 150 155
cat tat gaa gaa ggt agc ctg ccc atc tta aac aca aca aaa ctg cac 692
His Tyr Glu Glu Gly Ser Leu Pro Ile Leu Asn Thr Thr Lys Leu His
160 165 170 175
att gtt tta cca atg gac aca aat ttc tgg gtt ctt ttg cgt gat gga 740
Ile Val Leu Pro Met Asp Thr Asn Phe Trp Val Leu Leu Arg Asp Gly
180 185 190
atc aag act tca aaa ata tta gaa atg aca aat ata gat ggc aaa agc 788
Ile Lys Thr Ser Lys Ile Leu Glu Met Thr Asn Ile Asp Gly Lys Ser
195 200 205
caa ttt gta agt tat gaa atg caa cga aat ctt agt tta gaa aat gct 836
Gln Phe Val Ser Tyr Glu Met Gln Arg Asn Leu Ser Leu Glu Asn Ala
210 215 220
aag aca tcg gtt cta ttg ctt aat aaa gtt gat tta ctc tgg gac gac 884
Lys Thr Ser Val Leu Leu Leu Asn Lys Val Asp Leu Leu Trp Asp Asp
225 230 235
ctt ttc ctt atc tta caa ttt gtt tgg cat aca tca gtg gaa cac ttt 932
Leu Phe Leu Ile Leu Gln Phe Val Trp His Thr Ser Val Glu His Phe
240 245 250 255
cag atc cga aat gtg act ttt ggt ggt aag gct tat ctt gac cac aat 980
Gln Ile Arg Asn Val Thr Phe Gly Gly Lys Ala Tyr Leu Asp His Asn
260 265 270
tca ttt gac tac tca aat act gta atg aga act ata aaa ttg gag cat 1028
Ser Phe Asp Tyr Ser Asn Thr Val Met Arg Thr Ile Lys Leu Glu His
275 280 285
gta cat ttc aga gtg ttt tac att caa cag gat aaa atc tat ttg ctt 1076
Val His Phe Arg Val Phe Tyr Ile Gln Gln Asp Lys Ile Tyr Leu Leu
290 295 300
ttg acc aaa atg gac ata gaa aac ctg aca ata tca aat gca caa atg 1124
Leu Thr Lys Met Asp Ile Glu Asn Leu Thr Ile Ser Asn Ala Gln Met
305 310 315
cca cac atg ctt ttc ccg aat tat cct acg aaa ttc caa tat tta aat 1172
Pro His Met Leu Phe Pro Asn Tyr Pro Thr Lys Phe Gln Tyr Leu Asn
320 325 330 335
ttt gcc aat aat atc tta aca gac gag ttg ttt aaa aga act atc caa 1220
Phe Ala Asn Asn Ile Leu Thr Asp Glu Leu Phe Lys Arg Thr Ile Gln
340 345 350
ctg cct cac ttg aaa act ctc att ttg aat ggc aat aaa ctg gag aca 1268
Leu Pro His Leu Lys Thr Leu Ile Leu Asn Gly Asn Lys Leu Glu Thr
355 360 365
ctt tct tta gta agt tgc ttt gct aac aac aca ccc ttg gaa cac ttg 1316
Leu Ser Leu Val Ser Cys Phe Ala Asn Asn Thr Pro Leu Glu His Leu
370 375 380
gat ctg agt caa aat cta tta caa cat aaa aat gat gaa aat tgc tca 1364
Asp Leu Ser Gln Asn Leu Leu Gln His Lys Asn Asp Glu Asn Cys Ser
385 390 395
tgg cca gaa act gtg gtc aat atg aat ctg tca tac aat aaa ttg tct 1412
Trp Pro Glu Thr Val Val Asn Met Asn Leu Ser Tyr Asn Lys Leu Ser
400 405 410 415
gat tct gtc ttc agg tgc ttg ccc aaa agt att caa ata ctt gac cta 1460
Asp Ser Val Phe Arg Cys Leu Pro Lys Ser Ile Gln Ile Leu Asp Leu
420 425 430
aat aat aac caa atc caa act gta cct aaa gag act att cat ctg atg 1508
Asn Asn Asn Gln Ile Gln Thr Val Pro Lys Glu Thr Ile His Leu Met
435 440 445
gcc tta cga gaa cta aat att gca ttt aat ttt cta act gat ctc cct 1556
Ala Leu Arg Glu Leu Asn Ile Ala Phe Asn Phe Leu Thr Asp Leu Pro
450 455 460
gga tgc agt cat ttc agt aga ctt tca gtt ctg aac att gaa atg aac 1604
Gly Cys Ser His Phe Sar Arg Leu Ser Val Leu Asn Ile Glu Met Asn
465 470 475
ttc att ctc agc cca tct ctg gat ttt gtt cag agc tgc cag gaa gtt 1652
Phe Ile Leu Set Pro Ser Leu Asp Phe Val Gln Ser Cys Gln Glu Val
480 485 490 495
aaa act cta aat gcg gga aga aat cca ttc cgg tgt acc tgt gaa tta 1700
Lys Thr Leu Asn Ala Gly Arg Asn Pro Phe Arg Cys Thr Cys Glu Leu
500 505 510
aaa aat ttc att cag ctt gaa aca tat tca gag gtc atg atg gtt gga 1748
Lys Asn Phe Ile Gln Leu Glu Thr Tyr Ser Glu Val Met Met Val Gly
515 520 525
tgg tca gat tca tac acc tgt gaa tac cct tta aac cta agg gga act 1796
Trp Ser Asp Ser Tyr Thr Cys Glu Tyr Pro Leu Asn Leu Arg Gly Thr
530 535 540
agg tta aaa gac gtt cat ctc cac gaa tta tct tgc aac aca gct ctg 1844
Arg Leu Lys Asp Val His Leu His Glu Leu Ser Cys Asn Thr Ala Leu
545 550 555
ttg att gtc acc att gtg gtt att atg cta gtt ctg ggg ttg gct gtg 1892
Leu Ile Val Thr Ile Val Val Ile Met Leu Val Leu Gly Leu Ala Val
560 565 570 575
gcc ttc tgc tgt ctc cac ttt gat ctg ccc tgg tat ctc agg atg cta 1940
Ala Phe Cys Cys Leu His Phe Asp Leu Pro Trp Tyr Leu Arg Met Leu
5850 585 590
ggt caa tgc aca caa aca tgg cac agg gtt agg aaa aca acc caa gaa 1988
Gly Gln Cys Thr Gln Thr Trp His Arg Val Arg Lys Thr Thr Gln Glu
595 600 605
caa ctc aag aga aat gtc cga ttc cac gca ttt att tca tac agt gaa 2036
Gln Leu Lys Arg Asn Val Arg Phe His Ala Phe Ile Ser Tyr Ser Glu
610 615 620
cat gat tct ctg tgg gtg aag aat gaa ttg atc ccc aat cta gag aag 2084
His Asp Set Leu Trp Val Lys Asn Glu Leu Ile Pro Asn Leu Glu Lys
625 630 635
gaa gat ggt tct atc ttg att tgc ctt tat gaa agc tac ttt gac cct 2132
Glu Asp Gly Ser Ile Leu Ile Cys Leu Tyr Glu Ser Tyr Phe Asp Pro
640 645 650 655
ggc aaa agc att agt gaa aat att gta agc ttc att gag aaa agc tat 2180
Gly Lys Ser Ile Ser Glu Asn Ile Val Ser Phe Ile Glu Lys Ser Tyr
660 665 670
aag tcc atc ttt gtt ttg tct ccc aac ttt gtc cag aat gag tgg tgc 2228
Lys Ser Ile Phe Val Leu Ser Pro Asn Phe Val Gln Asn Glu Trp Cys
675 680 685
cat tat gaa ttc tac ttt gcc cac cac aat ctc ttc cat gaa aat tct 2276
His Tyr Glu Phe Tyr Phe Ala His His Asn Leu Phe His Glu Asn Ser
690 695 700
gat cat ata att ctt atc tta ctg gaa ccc att cca ttc tat tgc att 2324
Asp His Ile Ile Leu Ile Leu Leu Glu Pro Ile Pro Phe Tyr Cys Ile
705 710 715
ccc acc agg tat cat aaa ctg aaa gct ctc ctg gaa aaa aaa gca tac 2372
Pro Thr Arg Tyr His Lys Leu Lys Ala Leu Leu Glu Lys Lys Ala Tyr
720 725 730 735
ttg gaa tgg ccc aag gat agg cgt aaa tgt ggg ctt ttc tgg gca aac 2420
Leu Glu Trp Pro Lys Asp Arg Arg Lys Cys Gly Leu Phe Trp Ala Asn
740 745 750
ctt cga gct gct att aat gtt aat gta tta gcc acc aga gaa atg tat 2468
Leu Arg Ala Ala Ile Asn Val Asn Val Leu Ala Thr Arg Glu Met Tyr
755 760 765
gaa ctg cag aca ttc aca gag tta aat gaa gag tct cga ggt tct aca 2516
Glu Leu Gln Thr Phe Thr Glu Leu Asn Glu glu Ser Arg Gly Ser Thr
770 775 780
atc tct ctg atg aga aca gat tgt cta taaaatccca cagtccttgg 2563
Ile Ser Leu Met Arg Thr Asp Cys Leu
785 790
gaagttgggg accacataca ctgttgggat gtacattgat acaaccttta tgatggcaat 2623
ttgacaatat ttattaaaat aaaaaatggt tattcccttc atatcagttt ctagaaggat 2683
ttctaagaat gtatcctata gaaacacctt cacaagttta taagggctta tggaaaaagg 2743
tgttcatccc aggattgttt ataatcatga aaaatgtggc caggtgcagt ggctcactct 2803
tgtaatccca gcactatggg aggccaaggt gggtgaccca cgaggtcaag agatggagac 2863
catcctggcc aacatggtga aaccctgtct ctactaaaaa tacaaaaatt agctgggcgt 2923
gatggtgcac gcctgtagtc ccagctactt gggaggctga ggcaggagaa tcgcttgaac 2983
ccgggaggtg gcagttgcag tgagctgaga tcgagccact gcactccagc ctggtgacag 3043
agc 3046
<210>39
<211>811
<212>PRT
<213>未知
<400>39
Met Arg Leu Ile Arg Asn Ile Tyr Ile Phe Cys Ser Ile Val Met Thr
-15 -10 -5
Ala Glu Gly Asp Ala Pro Glu Leu Pro Glu Glu Arg Glu Leu Met Thr
-1 1 5 10
Asn Cys Ser Asn Met Ser Leu Arg Lys Val Pro Ala Asp Leu Thr Pro
15 20 25
Ala Thr Thr Thr Leu Asp Leu Ser Tyr Asn Leu Leu Phe Gln Leu Gln
30 35 40 45
Ser Ser Asp Phe His Ser Val Ser Lys Leu Arg Val Leu Ile Leu Cys
50 55 60
His Asn Arg Ile Gln Gln Leu Asp Leu Lys Thr Phe Glu Phe Asn Lys
65 70 75
Glu Leu Arg Tyr Leu Asp Leu Ser Asn Asn Arg Leu Lys Ser Val Thr
80 85 90
Trp Tyr Leu Leu Ala Gly Leu Arg Tyr Leu Asp Leu Ser Phe Asn Asp
95 100 105
Phe Asp Thr Met Pro Ile Cys Glu Glu Ala Gly Asn Met Ser His Leu
110 115 120 125
Glu Ile Leu Gly Leu Ser Gly Ala Lys Ile Gln Lys Ser Asp Phe Gln
130 135 140
Lys Ile Ala His Leu His Leu Asn Thr Val Phe Leu Gly Phe Arg Thr
145 150 155
Leu Pro His Tyr Glu Glu Gly Ser Leu Pro Ile Leu Asn Thr Thr Lys
160 165 170
Leu His Ile Val Leu Pro Met Asp Thr Asn Phe Trp Val Leu Leu Arg
175 180 185
Asp Gly Ile Lys Thr Ser Lys Ile Leu Glu Met Thr Asn Ile Asp Gly
190 l95 200 205
Lys Ser Gln Phe Val Ser Tyr Glu Met Gln Arg Asn Leu Ser Leu Glu
210 215 220
Asn Ala Lys Thr Ser Val Leu Leu Leu Asn Lys Val Asp Leu Leu Trp
225 230 235
Asp Asp Leu Phe Leu Ile Leu G1n Phe Val Trp His Thr Ser Val Glu
240 245 250
His Phe Gln Ile Arg Asn Val Thr Phe Gly Gly Lys Ala Tyr Leu Asp
255 260 265
His Asn Ser Phe Asp Tyr Ser Asn Thr Val Met Arg Thr Ile Lys Leu
270 275 280 285
Glu His Val His Phe Arg Val Phe Tyr Ile Gln Gln Asp Lys Ile Tyr
290 295 300
Leu Leu Leu Thr Lys Met Asp Ile Glu Asn Leu Thr Ile Ser Asn Ala
305 310 315
Gln Met Pro His Met Leu Phe Pro Asn Tyr Pro Thr Lys Phe Gln Tyr
320 325 330
Leu Asn Phe Ala Asn Asn Ile Leu Thr Asp Glu Leu Phe Lys Arg Thr
335 340 345
Ile Gln Leu Pro His Leu Lys Thr Leu Ile Leu Asn Gly Asn Lys Leu
350 355 360 365
Glu Thr Leu Ser Leu Val Ser Cys Phe Ala Asn Asn Thr Pro Leu Glu
370 375 380
His Leu Asp Leu Ser Gln Asn Leu Leu Gln His Lys Asn Asp Glu Asn
385 390 395
Cys Ser Trp Pro Glu Thr Val Val Asn Met Asn Leu Ser Tyr Asn Lys
400 405 410
Leu Ser Asp Ser Val Phe Arg Cys Leu Pro Lys Ser Ile Gln Ile Leu
415 420 425
Asp Leu Asn Asn Asn Gln Ile Gln Thr Val Pro Lys Glu Thr Ile His
430 435 440 445
Leu Met Ala Leu Arg Glu Leu Asn Ile Ala Phe Asn Phe Leu Thr Asp
450 455 460
Leu Pro Gly Cys Ser His Phe Ser Arg Leu Ser Val Leu Asn Ile Glu
465 470 475
Met Asn Phe Ile Leu Ser Pro Ser Leu Asp Phe Val Gln Ser Cys Gln
480 485 490
Glu Val Lys Thr Leu Asn Ala Gly Arg Asn Pro Phe Arg Cys Thr Cys
495 500 505
Glu Leu Lys Asn Phe Ile Gln Leu Glu Thr Tyr Ser Glu Val Met Met
510 515 520 525
Val Gly Trp Ser Asp Ser Tyr Thr Cys Glu Tyr Pro Leu Asn Leu Arg
530 535 540
Gly Thr Arg Leu Lys Asp Val His Leu His Glu Leu Ser Cys Asn Thr
545 550 555
Ala Leu Leu Ile Val Thr Ile Val Val Ile Met Leu Val Leu Gly Leu
560 565 570
Ala Val Ala Phe Cys Cys Leu His Phe Asp Leu Pro Trp Tyr Leu Arg
575 580 585
Met Leu Gly Gln Cys Thr Gln Thr Trp His Arg Val Arg Lys Thr Thr
590 595 600 605
Gln Glu Gln Leu Lys Arg Asn Val Arg Phe His Ala Phe Ile Ser Tyr
610 615 620
Ser Glu His Asp Ser Leu Trp Val Lys Asn Glu Leu Ile Pro Asn Leu
625 630 635
Glu Lys Glu Asp Gly Ser Ile Leu Ile Cys Leu Tyr Glu Ser Tyr Phe
640 645 650
Asp Pro Gly Lys Ser Ile Ser Glu Asn Ile Val Ser Phe Ile Glu Lys
655 660 665
Ser Tyr Lys Ser Ile Phe Val Leu Ser Pro Asn Phe Val Gln Asn Glu
670 675 680 685
Trp Cys His Tyr Glu Phe Tyr Phe Ala His His Asn Leu Phe His Glu
690 695 700
Asn Ser Asp His Ile Ile Leu Ile Leu Leu Glu Pro Ile Pro Phe Tyr
705 710 715
Cys Ile Pro Thr Arg Tyr His Lys Leu Lys Ala Leu Leu Glu Lys Lys
720 725 730
Ala Tyr Leu Glu Trp Pro Lys Asp Arg Arg Lys Cys Gly Leu Phe Trp
735 740 745
Ala Asn Leu Arg Ala Ala Ile Asn Val Asn Val Leu Ala Thr Arg Glu
750 755 760 765
Met Tyr Glu Leu Gln Thr Phe Thr Glu Leu Asn Glu Glu Ser Arg Gly
770 775 780
Ser Thr Ile Ser Leu Met Arg Thr Asp Cys Leu
785 790
<210>40
<211>2760
<212>DNA
<213>未知
<220>
<223>未知生物的说明:灵长类动物;推测为
人(Homo sapiens)
<220>
<221>CDS
<222>(68)..(2455)
<220>
<221>mat_肽
<222>(161)..(2455)
<220>
<22l>misc_特征
<222>(2529)
<223>n可以是a,c,g,或t
<400>40
aagaatttgg actcatatca agatgctctg aagaagaaca accctttagg atagccactg 60
caacatc atg acc aaa gac aaa gaa cct att gtt aaa agc ttc cat ttt 109
Met Thr Lys Asp Lys Glu Pro Ile Val Lys Ser Phe His Phe
-30 -25 -20
gtt tgc ctt atg atc ata ata gtt gga acc aga atc cag ttc tcc gac 157
Val Cys Leu Met Ile Ile Ile Val Gly Thr Arg Ile Gln Phe Ser Asp
-15 -10 -5
gga aat gaa ttt gca gta gac aag tca aaa aga ggt ctt att cat gtt 205
Gly Asn Glu Phe Ala Val Asp Lys Ser Lys Arg Gly Leu Ile His Val
-1 1 5 10 15
cca aaa gac cta ccg ctg aaa acc aaa gtc tta gat atg tct cag aac 253
Pro Lys Asp Leu Pro Leu Lys Thr Lys Val Leu Asp Met Ser Gln Asn
20 25 30
tac atc gct gag ctt cag gtc tct gac atg agc ttt cta tca gag ttg 301
Tyr Ile Ala Glu Leu Gln Val Ser Asp Met Ser Phe Leu Ser Glu Leu
35 40 45
aca gtt ttg aga ctt tcc cat aac aga atc cag cta ctt gat tta agt 349
Thr Val Leu Arg Leu Ser His Asn Arg Ile Gln Leu Leu Asp Leu Ser
50 55 60
gtt ttc aag ttc aac cag gat tta gaa tat ttg gat tta tct cat aat 397
Val Phe Lys Phe Asn Gln Asp Leu Glu Tyr Leu Asp Leu Ser His Asn
65 70 75
cag ttg caa aag ata tcc tgc cat cct att gtg agt ttc agg cat tta 445
Gln Leu Gln Lys Ile Ser Cys His Pro Ile Val Ser Phe Arg His Leu
80 85 90 95
gat ctc tca ttc aat gat ttc aag gcc ctg ccc atc tgt aag gaa ttt 493
Asp Leu Ser Phe Asn Asp Phe Lys Ala Leu Pro Ile Cys Lys Glu Phe
100 105 110
ggc aac tta tca caa ctg aat ttc ttg gga ttg agt gct atg aag ctg 541
Gly Asn Leu Ser Gln Leu Asn Phe Leu Gly Leu Ser Ala Met Lys Leu
115 120 125
caa aaa tta gat ttg ctg cca att gct cac ttg cat cta agt tat atc 589
Gln Lys Leu Asp Leu Leu Pro Ile Ala His Leu His Leu Ser Tyr Ile
130 135 140
ctt ctg gat tta aga aat tat tat ata aaa gaa aat gag aca gaa agt 637
Leu Leu Asp Leu Arg Asn Tyr Tyr Ile Lys Glu Asn Glu Thr Glu Ser
145 150 155
cta caa att ctg aat gca aaa acc ctt cac ctt gtt ttt cac cca act 685
Leu Gln Ile Leu Asn Ala Lys Thr Leu His Leu Val Phe His Pro Thr
160 165 170 175
agt tta ttc gct atc caa gtg aac ata tca gtt aat act tta ggg tgc 733
Ser Leu Phe Ala Ile Gln Val Asn Ile Ser Val Asn Thr Leu Gly Cys
180 185 190
tta caa ctg act aat att aaa ttg aat gat gac aac tgt caa gtt ttc 781
Leu Gln Leu Thr Asn Ile Lys Leu Asn Asp Asp Asn Cys Gln Val Phe
195 200 205
att aaa ttt tta tca gaa ctc acc aga ggt cca acc tta ctg aat ttt 829
Ile Lys Phe Leu Ser Glu Leu Thr Arg Gly Pro Thr Leu Leu Asn Phe
210 215 220
acc ctc aac cac ata gaa acg act tgg aaa tgc ctg gtc aga gtc ttt 877
Thr Leu Asn His Ile Glu Thr Thr Trp Lys Cys Leu Val Arg Val Phe
225 230 235
caa ttt ctt tgg ccc aaa cct gtg gaa tat ctc aat att tac aat tta 925
Gln Phe Leu Trp Pro Lys Pro Val Glu Tyr Leu Asn Ile Tyr Asn Leu
240 245 250 255
aca ata att gaa agc att cgt gaa gaa gat ttt act tat tct aaa acg 973
Thr Ile Ile Glu Ser Ile Arg Glu Glu Asp Phe Thr Tyr Ser Lys Thr
260 265 270
aca ttg aaa gca ttg aca ata gaa cat atc acg aac caa gtt ttt ctg 1021
Thr Leu Lys Ala Leu Thr Ile Glu His Ile Thr Asn Gln Val Phe Leu
275 280 285
ttt tca cag aca gct ttg tac acc gtg ttt tct gag atg aac att atg 1069
Phe Ser Gln Thr Ala Leu Tyr Thr Val Phe Ser Glu Met Asn Ile Met
290 295 300
atg tta acc att tca gat aca cct ttt ata cac atg ctg tgt cct cat 1117
Met Leu Thr Ile Ser Asp Thr Pro Phe Ile His Met Leu Cys Pro His
305 310 315
gca cca agc aca ttc aag ttt ttg aac ttt acc cag aac gtt ttc aca 1165
Ala Pro Ser Thr Phe Lys Phe Leu Asn Phe Thr Gln Asn Val Phe Thr
320 325 330 335
gat agt att ttt gaa aaa tgt tcc acg tta gtt aaa ttg gag aca ctt 1213
Asp Ser Ile Phe Glu Lys Cys Ser Thr Leu Val Lys Leu Glu Thr Leu
340 345 350
atc tta caa aag aat gga tta aaa gac ctt ttc aaa gta ggt ctc atg 1261
Ile Leu Gln Lys Asn Gly Leu Lys Asp Leu Phe Lys Val Gly Leu Met
355 360 365
acg aag gat atg cct tct ttg gaa ata ctg gat gtt agc tgg aat tct 1309
Thr Lys Asp Met Pro Ser Leu Glu Ile Leu Asp Val Ser Trp Asn Ser
370 375 380
ttg gaa tct ggt aga cat aaa gaa aac tgc act tgg gtt gag agt ata 1357
Leu Glu Ser Gly Arg His Lys Glu Asn Cys Thr Trp Val Glu Ser Ile
385 390 395
gtg gtg tta aat ttg tct tca aat atg ctt act gac tct gtt ttc aga 1405
Val Val Leu Asn Leu Ser Ser Asn Met Leu Thr Asp Ser Val Phe Arg
400 405 410 415
tgt tta cct ccc agg atc aag gta ctt gat ctt cac agc aat aaa ata 1453
Cys Leu Pro Pro Arg Ile Lys Val Leu Asp Leu His Ser Asn Lys Ile
420 425 430
aag agc gtt cct aaa caa gtc gta aaa ctg gaa gct ttg caa gaa ctc 1501
Lys Ser Val Pro Lys Gln Val Val Lys Leu Glu Ala Leu Gln Glu Leu
435 440 445
aat gtt gct ttc aat tct tta act gac ctt cct gga tgt ggc agc ttt 1549
Asn Val Ala Phe Asn Ser Leu Thr Asp Leu Pro Gly Cys Gly Ser Phe
450 455 460
agc agc ctt tct gta ttg atc att gat cac aat tca gtt tcc cac cca 1597
Ser Ser Leu Ser Val Leu Ile Ile Asp His Asn Ser Val Ser His Pro
465 470 475
tcg gct gat ttc ttc cag agc tgc cag aag atg agg tca ata aaa gca 1645
Ser Ala Asp Phe Phe Gln Ser Cys Gln Lys Met Arg Ser Ile Lys Ala
480 485 490 495
ggg gac aat cca ttc caa tgt acc tgt gag cta aga gaa ttt gtc aaa 1693
Gly Asp Asn Pro Phe Gln Cys Thr Cys Glu Leu Arg Glu Phe Val Lys
500 505 510
aat ata gac caa gta tca agt gaa gtg tta gag ggc tgg cct gat tct 1741
Asn Ile Asp Gln Val Ser Ser Glu Val Leu Glu Gly Trp Pro Asp Ser
515 520 525
tat aag tgt gac tac cca gaa agt tat aga gga agc cca cta aag gac 1789
Tyr Lys Cys Asp Tyr Pro Glu Ser Tyr Arg Gly Ser Pro Leu Lys Asp
530 535 540
ttt cac atg tct gaa tta tcc tgc aac ata act ctg ctg atc gtc acc 1837
Phe His Met Ser Glu Leu Ser Cys Asn Ile Thr Leu Leu Ile Val Thr
545 550 555
atc ggt gcc acc atg ctg gtg ttg gct gtg act gtg acc tcc ctc tgc 1885
Ile Gly Ala Thr Met Leu Val Leu Ala Val Thr Val Thr Ser Leu Cys
560 565 570 575
atc tac ttg gat ctg ccc tgg tat ctc agg atg gtg tgc cag tgg acc 1933
Ile Tyr Leu Asp Leu Pro Trp Tyr Leu Arg Met Val Cys Gln Trp Thr
580 585 590
cag act cgg cgc agg gcc agg aac ata ccc tta gaa gaa ctc caa aga 1981
Gln Thr Arg Arg Arg Ala Arg Asn Ile Pro Leu Glu Glu Leu Gln Arg
595 600 605
aac ctc cag ttt cat gct ttt att tca tat agt gaa cat gat tct gcc 2029
Asn Leu Gln Phe His Ala Phe Ile Ser Tyr Ser Glu His Asp Ser Ala
610 615 620
tgg gtg aaa agt gaa ttg gta cct tac cta gaa aaa gaa gat ata cag 2077
Trp Val Lys Ser Glu Leu Val Pro Tyr Leu Glu Lys Glu Asp Ile Gln
625 630 635
att tgt ctt cat gag agg aac ttt gtc cct ggc aag agc att gtg gaa 2125
Ile Cys Leu His Glu Arg Asn Phe Val Pro Gly Lys Ser Ile Val Glu
640 645 650 655
aat atc atc aac tgc att gag aag agt tac aag tcc atc ttt gtt ttg 2173
Asn Ile Ile Asn Cys Ile Glu Lys Ser Tyr Lys Ser Ile Phe Val Leu
660 665 670
tct ccc aac ttt gtc cag agt gag tgg tgc cat tac gaa ctc tat ttt 2221
Ser Pro Asn Phe Val Gln Ser Glu Trp Cys His Tyr Glu Leu Tyr Phe
675 680 685
gcc cat cac aat ctc ttt cat gaa gga tct aat aac tta atc ctc atc 2269
Ala His His Asn Leu Phe His Glu Gly ser Asn Asn Leu Ile Leu Ile
690 695 700
tta ctg gaa ccc att cca cag aac agc att ccc aac aag tac cac aag 2317
Leu Leu Glu Pro Ile Pro Gln Asn Ser Ile Pro Asn Lys Tyr His Lys
705 710 715
ctg aag gct ctc atg acg cag cgg act tat ttg cag tgg ccc aag gag 2365
Leu Lys Ala Leu Met Thr Gln Arg Thr Tyr Leu Gln Trp Pro Lys Glu
720 725 730 735
aaa agc aaa cgt ggg ctc ttt tgg gct aac att aga gcc gct ttt aat 2415
Lys Ser Lys Arg Gly Leu Phe Trp Ala Asn Ile Arg Ala Ala Phe Asn
740 745 750
atg aaa tta aca cta gtc act gaa aac aat gat gtg aaa tct 2455
Met Lys Leu Thr Leu Val Thr Glu Asn Asn Asp Val Lys Ser
755 760 765
taaaaaaatt taggaaattc aacttaagaa accattattt acttggatga tggtgaatag 2515
tacagtcgta agtnactgtc tggaggtgcc tccattatcc tcatgccttc aggaaagact 2575
taacaaaaac aatgtttcat ctggggaact gagctaggcg gtgaggttag cctgccagtt 2635
agagacagcc cagtctcttc tggtttaatc attatgtttc aaattgaaac agtctctttt 2695
gagtaaatgc tcagtttttc agctcctctc cactctgctt tcccaaatgg attctgttgg 2755
tgaag 2760
<210>41
<211>796
<212>PRT
<213>未知
<400>41
Met Thr Lys Asp Lys Glu Pro Ile Val Lys Ser Phe His Phe Val Cys
-30 -25 -20
Leu Met Ile Ile Ile Val Gly Thr Arg Ile Gln Phe Ser Asp Gly Asn
-15 -10 -5 -1 1
Glu Phe Ala Val Asp Lys Ser Lys Arg Gly Leu Ile His Val Pro Lys
5 10 15
Asp Leu Pro Leu Lys Thr Lys Val Leu Asp Met Ser Gln Asn Tyr Ile
20 25 30
Ala Glu Leu Gln Val Ser Asp Met Ser Phe Leu Ser Glu Leu Thr Val
35 40 45
Leu Arg Leu Ser His Asn Arg Ile Gln Leu Leu Asp Leu Ser Val Phe
50 55 60 65
Lys Phe Asn Gln Asp Leu Glu Tyr Leu Asp Leu Ser His Asn Gln Leu
70 75 80
Gln Lys Ile Ser Cys His Pro Ile Val Ser Phe Arg His Leu Asp Leu
85 90 95
Ser Phe Asn Asp Phe Lys Ala Leu Pro Ile Cys Lys Glu Phe Gly Asn
100 105 110
Leu Ser Gln Leu Asn Phe Leu Gly Leu Ser Ala Met Lys Leu Gln Lys
115 120 125
Leu Asp Leu Leu Pro Ile Ala His Leu His Leu Ser Tyr Ile Leu Leu
130 135 140 145
Asp Leu Arg Asn Tyr Tyr Ile Lys Glu Asn Glu Thr Glu Ser Leu Gln
150 155 160
Ile Leu Asn Ala Lys Thr Leu His Leu Val Phe His Pro Thr Ser Leu
165 170 175
Phe Ala Ile Gln Val Asn Ile Ser Val Asn Thr Leu Gly Cys Leu Gln
180 185 190
Leu Thr Asn Ile Lys Leu Asn Asp Asp Asn Cys Gln Val Phe Ile Lys
195 200 205
Phe Leu Ser Glu Leu Thr Arg Gly Pro Thr Leu Leu Asn Phe Thr Leu
210 215 220 225
Asn His Ile Glu Thr Thr Trp Lys Cys Leu Val Arg Val Phe Gln Phe
230 235 240
Leu Trp Pro Lys Pro Val Glu Tyr Leu Asn Ile Tyr Asn Leu Thr Ile
245 250 255
Ile Glu Ser Ile Arg Glu Glu Asp Phe Thr Tyr Ser Lys Thr Thr Leu
260 265 270
Lys Ala Leu Thr Ile Glu His Ile Thr Asn Gln Val Phe Leu Phe Ser
275 280 285
Gln Thr Ala Leu Tyr Thr Val Phe Ser Glu Met Asn Ile Met Met Leu
290 295 300 305
Thr Ile Ser Asp Thr Pro Phe Ile His Met Leu Cys Pro His Ala Pro
310 315 320
Ser Thr Phe Lys Phe Leu Asn Phe Thr Gln Asn Val Phe Thr Asp Ser
325 330 335
Ile Phe Glu Lys Cys Ser Thr Leu Val Lys Leu Glu Thr Leu Ile Leu
340 345 350
Gln Lys Asn Gly Leu Lys Asp Leu Phe Lys Val Gly Leu Met Thr Lys
355 360 365
Asp Met Pro Ser Leu Glu Ile Leu Asp Val Ser Trp Asn Ser Leu Glu
370 375 380 385
Ser Gly Arg His Lys Glu Asn Cys Thr Trp Val Glu Ser Ile Val Val
390 395 400
Leu Asn Leu Ser Ser Asn Met Leu Thr Asp Ser Val Phe Arg Cys Leu
405 410 415
Pro Pro Arg Ile Lys Val Leu Asp Leu His Ser Asn Lys Ile Lys Ser
420 425 430
Val Pro Lys Gln Val Val Lys Leu Glu Ala Leu Gln Glu Leu Asn Val
435 440 445
Ala Phe Asn Ser Leu Thr Asp Leu Pro Gly Cys Gly Ser Phe Ser Ser
450 455 460 465
Leu Ser Val Leu Ile Ile Asp His Asn Ser Val Ser His Pro Ser Ala
470 475 480
Asp Phe Phe Gln Ser Cys Gln Lys Met Arg Ser Ile Lys Ala Gly Asp
485 490 495
Asn Pro Phe Gln Cys Thr Cys Glu Leu Arg Glu Phe Val Lys Asn Ile
500 505 510
Asp Gln Val Ser Ser Glu Val Leu Glu Gly Trp Pro Asp Ser Tyr Lys
515 520 525
Cys Asp Tyr Pro Glu Ser Tyr Arg Gly Ser Pro Leu Lys Asp Phe His
530 535 540 545
Met Ser Glu Leu Ser Cys Asn Ile Thr Leu Leu Ile Val Thr Ile Gly
550 555 560
Ala Thr Met Leu Val Leu Ala Val Thr Val Thr Ser Leu Cys Ile Tyr
565 570 575
Leu Asp Leu Pro Trp Tyr Leu Arg Met Val Cys Gln Trp Thr Gln Thr
580 585 590
Arg Arg Arg Ala Arg Asn Ile Pro Leu Glu Glu Leu Gln Arg Asn Leu
595 600 605
Gln Phe His Ala Phe Ile Ser Tyr Ser Glu His Asp Ser Ala Trp Val
610 615 620 625
Lys Ser Glu Leu Val Pro Tyr Leu Glu Lys Glu Asp Ile Gln Ile Cys
630 635 640
Leu His Glu Arg Asn Phe Val Pro Gly Lys Ser Ile Val Glu Asn Ile
645 650 655
Ile Asn Cys Ile Glu Lys Ser Tyr Lys Ser Ile Phe Val Leu Ser Pro
660 665 670
Asn Phe Val Gln Ser Glu Trp Cys His Tyr Glu Leu Tyr Phe Ala His
675 680 685
His Asn Leu Phe His Glu Gly Ser Asn Asn Leu Ile Leu Ile Leu Leu
690 695 700 705
Glu Pro Ile Pro Gln Asn Ser Ile Pro Asn Lys Tyr His Lys Leu Lys
710 715 720
Ala Leu Met Thr Gln Arg Thr Tyr Leu Gln Trp Pro Lys Glu Lys Ser
725 730 735
Lys Arg Gly Leu Phe Trp Ala Asn Ile Arg Ala Ala Phe Asn Met Lys
740 745 750
Leu Thr Leu Val Thr Glu Asn Asn Asp Val Lys Ser
755 760 765
<210>42
<211>3168
<212>DNA
<213>未知
<220>
<223>未知生物的说明:灵长类动物;推测为
人(Homo sapiens)
<220>
<221>CDS
<222>(1)..(3165)
<220>
<221>mat_肽
<222>(144)..(3165)
<400>42
atg ccc atg aag tgg agt ggg tgg agg tgg agc tgg ggg ccg gcc act 48
Met Pro Met Lys Trp Ser Gly Trp Arg Trp Ser Trp Gly Pro Ala Thr
-45 -40 -35
cac aca gcc ctc cca ccc cca cag ggt ttc tgc cgc agc gcc ctg cac 96
His Thr Ala Leu Pro Pro Pro Gln Gly Phe Cys Arg Ser Ala Leu His
-30 -25 -20
ccg ctg tct ctc ctg gtg cag gcc atc atg ctg gcc atg acc ctg gcc 144
Pro Leu Ser Leu Leu Val Gln Ala Ile Met Leu Ala Met Thr Leu Ala
-15 -10 -5 -1
ctg ggt acc ttg cct gcc ttc cta ccc tgt gag ctc cag ccc cac ggc 192
Leu Gly Thr Leu Pro Ala Phe Leu Pro Cys Glu Leu Gln Pro His Gly
1 5 10 15
ctg gtg aac tgc aac tgg ctg ttc ctg aag tct gtg ccc cac ttc tcc 240
Leu Val Asn Cys Asn Trp Leu Phe Leu Lys Ser Val Pro His Phe Ser
20 25 30
atg gca gca ccc cgt ggc aat gtc acc agc ctt tcc ttg tcc tcc aac 288
Met Ala Ala Pro Arg Gly Asn Val Thr Ser Leu Ser Leu Ser Ser Asn
35 40 45
cgc atc cac cac ctc cat gat tct gac ttt gcc cac ctg ccc agc ctg 336
Arg Ile His His Leu His Asp Ser Asp Phe Ala His Leu Pro Ser Leu
50 55 60
cgg cat ctc aac ctc aag tgg aac tgc ccg ccg gtt ggc ctc agc ccc 384
Arg His Leu Asn Leu Lys Trp Asn Cys Pro Pro Val Gly Leu Ser Pro
65 70 75 80
atg cac ttc ccc tgc cac atg acc atc gag ccc agc acc ttc ttg gct 432
Met His Phe Pro Cys His Met Thr Ile Glu Pro Ser Thr Phe Leu Ala
85 90 95
gtg ccc acc ctg gaa gag cta aac ctg agc tac aac aac atc atg act 480
Val Pro Thr Leu Glu Glu Leu Asn Leu Ser Tyr Asn Asn Ile Met Thr
100 105 110
gtg cct gcg ctg ccc aaa tcc ctc ata tcc ctg tcc ctc agc cat acc 528
Val Pro Ala Leu Pro Lys Ser Leu Ile Ser Leu Ser Leu Ser His Thr
115 120 125
aac atc ctg atg cta gac tct gcc agc ctc gcc ggc ctg cat gcc ctg 576
Asn Ile Leu Met Leu Asp Ser Ala Ser Leu Ala Gly Leu His Ala Leu
130 135 140
cgc ttc cta ttc atg gac ggc aac tgt tat tac aag aac ccc tgc agg 624
Arg Phe Leu Phe Met Asp Gly Asn Cys Tyr Tyr Lys Asn Pro Cys Arg
145 150 155 160
cag gca ctg gag gtg gcc ccg ggt gcc ctc ctt ggc ctg ggc aac ctc 672
Gln Ala Leu Glu Val Ala Pro Gly Ala Leu Leu Gly Leu Gly Asn Leu
165 170 175
acc cac ctg tca ctc aag tac aac aac ctc act gtg gtg ccc cgc aac 720
Thr His Leu Ser Leu Lys Tyr Asn Asn Leu Thr Val Val Pro Arg Asn
180 185 190
ctg cct tcc agc ctg gag tat ctg ctg ttg tcc tac aac cgc atc gtc 768
Leu Pro Ser Ser Leu Glu Tyr Leu Leu Leu Ser Tyr Asn Arg Ile Val
195 200 205
aaa ctg gcg cct gag gac ctg gcc aat ctg acc gcc ctg cgt gtg ctc 816
Lys Leu Ala Pro Glu Asp Leu Ala Asn Leu Thr Ala Leu Arg Val Leu
210 215 220
gat gtg ggc gga aat tgc cgc cgc tgc gac cac gct ccc aac ccc tgc 864
Asp Val Gly Gly Asn Cys Arg Arg Cys Asp His Ala Pro Asn Pro Cys
225 230 235 240
atg gag tgc cct cgt cac ttc ccc cag cta cat ccc gat acc ttc agc 912
Met Glu Cys Pro Arg His Phe Pro Gln Leu His Pro Asp Thr Phe Ser
245 250 255
cac ctg agc cgt ctt gaa ggc ctg gtg ttg aag gac agt tct ctc tcc 960
His Leu Ser Arg Leu Glu Gly Leu Vat Leu Lys Asp Ser Ser Leu Ser
260 265 270
tgg ctg aat gcc agt tgg ttc cgt ggg ctg gga aac ctc cga gtg ctg 1008
Trp Leu Asn Ala Ser Trp Phe Arg Gly Leu Gly Asn Leu Arg Val Leu
275 280 285
gac ctg agt gag aac ttc ctc tac aaa tgc atc act aaa acc aag gcc 1056
Asp Leu Ser Glu Asn Phe Leu Tyr Lys Cys Ile Thr Lys Thr Lys Ala
290 295 300
ttc cag ggc cta aca cag ctg cgc aag ctt aac ctg tcc ttc aat tac 1104
Phe Gln Gly Leu Thr Gln Leu Arg Lys Leu Asn Leu Ser Phe Asn Tyr
305 310 315 320
caa aag agg gtg tcc ttt gcc cac ctg tct ctg gcc oct tcc ttc ggg 1152
Gln Lys Arg Val Ser Phe Ala His Leu Ser Leu Ala Pro Ser Phe Gly
325 330 335
agc ctg gtc gcc ctg aag gag ctg gac atg cac ggc atc ttc ttc cgc 1200
Ser Leu Val Ala Leu Lys Glu Leu Asp Met His Gly Ile Phe Phe Arg
340 345 350
tca ctc gat gag acc acg ctc cgg cca ctg gcc cgc ctg ccc atg ctc 1248
Ser Leu Asp Glu Thr Thr Leu Arg Pro Leu Ala Arg Leu Pro Met Leu
355 360 365
cag act ctg cgt ctg cag atg aac ttc atc aac cag gcc cag ctc ggc 1296
Gln Thr Leu Arg Leu Gln Met Asn Phe Ile Asn Gln Ala Gln Leu Gly
370 375 380
atc ttc agg gcc ttc cct ggc ctg cgc tac gtg gac ctg tcg gac aac 1344
Ile Phe Arg Ala Phe Pro Gly Leu Arg Tyr Val Asp Leu Ser Asp Asn
385 390 395 400
cgc atc agc gga gct tcg gag ctg aca gcc acc atg ggg gag gca gat 1392
Arg Ile Ser Gly Ala Ser Glu Leu Thr Ala Thr Met Gly Glu Ala Asp
405 410 415
gga ggg gag aag gtc tgg ctg cag cct ggg gac ctt gct ccg gcc cca 1440
Gly Gly Glu Lys Val Trp Leu Gln Pro Gly Asp Leu Ala Pro Ala Pro
420 425 430
gtg gac act ccc agc tct gaa gac ttc agg ccc aac tgc agc acc ctc 1488
Val Asp Thr Pro Ser Ser Glu Asp Phe Arg Pro Asn Cys Ser Thr Leu
435 440 445
aac ttc acc ttg gat ctg tca cgg aac aac ctg gtg acc gtg cag ccg 1536
Asn Phe Thr Leu Asp Leu Ser Arg Asn Asn Leu Val Thr Val Gln Pro
450 455 460
gag atg ttt gcc cag ctc tcg cac ctg cag tgc ctg cgc ctg agc cac 1584
Glu Met Phe Ala Gln Leu Ser His Leu Gln Cys Leu Arg Leu Ser His
465 470 475 480
aac tgc atc tcg cag gca gtc aat ggc tcc cag ttc ctg ccg ctg acc 1632
Asn Cys Ile Ser Gln Ala Val Asn Gly Ser Gln Phe Leu Pro Leu Thr
485 490 495
ggt ctg cag gtg cta gac ctg tcc cac aat aag ctg gac ctc tac cac 1680
Gly Leu Gln Val Leu Asp Leu Ser His Asn Lys Leu Asp Leu Tyr His
500 505 510
gag cac tca ttc acg gag cta cca cga ctg gag gcc ctg gac ctc agc 1728
Glu His Ser Phe Thr Glu Leu Pro Arg Leu Glu Ala Leu Asp Leu Ser
515 520 525
tac aac agc cag ccc ttt ggc atg cag ggc gtg ggc cac aac ttc agc 1776
Tyr Asn Ser Gln Pro Phe Gly Met Gln Gly Val Gly His Asn Phe Ser
530 535 540
ttc gtg gct cac ctg cgc acc ctg cgc cac ctc agc ctg gcc cac aac 1824
Phe Val Ala His Leu Arg Thr Leu Arg His Leu Ser Leu Ala His Asn
545 550 555 560
aac atc cac agc caa gtg tcc cag cag ctc tgc agt acg tcg ctg cgg 1872
Asn Ile His Ser Gln Val Ser Gln Gln Leu Cys Ser Thr Ser Leu Arg
565 570 575
gcc ctg gac ttc agc ggc aat gca ctg ggc cat atg tgg gcc gag gga 1920
Ala Leu Asp Phe Ser Gly Asn Ala Leu Gly His Met Trp Ala Glu Gly
580 585 590
gac ctc tat ctg cac ttc ttc caa ggc ctg agc ggt ttg atc tgg ctg 1968
Asp Leu Tyr Leu His Phe Phe Gln Gly Leu Ser Gly Leu Ile Trp Leu
595 600 605
gac ttg tcc cag aac cgc ctg cac acc ctc ctg ccc caa acc ctg cgc 2016
Asp Leu Ser Gln Asn Arg Leu His Thr Leu Leu Pro Gln Thr Leu Arg
610 615 620
aac ctc ccc aag agc cta cag gtg ctg cgt ctc cgt gac aat tac ctg 2064
Asn Leu Pro Lys Ser Leu Gln Val Leu Arg Leu Arg Asp Asn Tyr Leu
625 630 635 640
gcc ttc ttt aag tgg tgg agc ctc cac ttc ctg ccc aaa ctg gaa gtc 2112
Ala Phe Phe Lys Trp Trp Ser Leu His Phe Leu Pro Lys Leu Glu Val
645 650 655
ctc gac ctg gca gga aac cag ctg aag gcc ctg acc aat ggc agc ctg 2160
Leu Asp Leu Ala Gly Asn Gln Leu Lys Ala Leu Thr Asn Gly Ser Leu
660 665 670
cct gct ggc acc cgg ctc cgg agg ctg gat gtc agc tgc aac agc atc 2208
Pro Ala Gly Thr Arg Leu Arg Arg Leu Asp Val Ser Cys Asn Ser Ile
675 680 685
agc ttc gtg gcc ccc ggc ttc ttt tcc aag gcc aag gag ctg cga gag 2256
Ser Phe Val Ala Pro Gly Phe Phe Ser Lys Ala Lys Glu Leu Arg Glu
690 695 700
ctc aac ctt agc gcc aac gcc ctc aag aca gtg gac cac tcc tgg ttt 2304
Leu Asn Leu Ser Ala Asn Ala Leu Lys Thr Val Asp His Ser Trp Phe
705 710 715 720
ggg ccc ctg gcg agt gcc ctg caa ata cta gat gta agc gcc aac cct 2352
Gly Pro Leu Ala Ser Ala Leu Gln Ile Leu Asp Val Ser Ala Asn Pro
725 730 735
ctg cac tgc gcc tgt ggg gcg gcc ttt atg gac ttc ctg ctg gag gtg 2400
Leu His Cys Ala Cys Gly Ala Ala Phe Met Asp Phe Leu Leu Glu Val
740 745 750
cag gct gcc gtg ccc ggt ctg ccc agc cgg gtg aag tgt ggc agt ccg 2448
Gln Ala Ala Val Pro Gly Leu Pro Ser Arg Val Lys Cys Gly Ser Pro
755 760 765
ggc cag ctc cag ggc ctc agc atc ttt gca cag gac ctg cgc ctc tgc 2496
Gly Gln Leu Gln Gly Leu Ser Ile Phe Ala Gln Asp Leu Arg Leu Cys
770 775 780
ctg gat gag gcc ctc tcc tgg gac tgt ttc gcc ctc tcg ctg ctg gct 2544
Leu Asp Glu Ala Leu Ser Trp Asp Cys Phe Ala Leu Ser Leu Leu Ala
785 790 795 800
gtg gct ctg ggc ctg ggt gtg ccc atg ctg cat cac ctc tgt ggc tgg 2592
Val Ala Leu Gly Leu Gly Val Pro Met Leu His His Leu Cys Gly Trp
805 810 815
gac ctc tgg tac tgc ttc cac ctg tgc ctg gcc tgg ctt ccc tgg cgg 2640
Asp Leu Trp Tyr Cys Phe His Leu Cys Leu Ala Trp Leu Pro Trp Arg
820 825 830
ggg cgg caa agt ggg cga gat gag gat gcc ctg ccc tac gat gcc ttc 2688
Gly Arg Gln Ser Gly Arg Asp Glu Asp Ala Leu Pro Tyr Asp Ala Phe
835 840 845
gtg gtc ttc gac aaa acg cag agc gca gtg gca gac tgg gtg tac aac 2736
Val Val Phe Asp Lys Thr Gln Ser Ala Val Ala Asp Trp Val Tyr Asn
850 855 860
gag ctt cgg ggg cag ctg gag gag tgc cgt ggg cgc tgg gca ctc cgc 2784
Glu Leu Arg Gly Gln Leu Glu Glu Cys Arg Gly Arg Trp Ala Leu Arg
865 870 875 880
ctg tgc ctg gag gaa cgc gac tgg ctg cct ggc aaa acc ctc ttt gag 2832
Leu Cys Leu Glu Glu Arg Asp Trp Leu Pro Gly Lys Thr Leu Phe Glu
885 890 895
aac ctg tgg gcc tcg gtc tat ggc agc cgc aag acg ctg ttt gtg ctg 2880
Asn Leu Trp Ala Ser Val Tyr Gly Ser Arg Lys Thr Leu Phe Val Leu
900 905 910
gcc cac acg gac cgg gtc agt ggt ctc ttg cgc gcc agc ttc ctg ctg 2928
Ala His Thr Asp Arg Val Ser Gly Leu Leu Arg Ala Ser Phe Leu Leu
915 920 925
gcc cag cag cgc ctg ctg gag gac cgc aag gac gtc gtg gtg ctg gtg 2976
Ala Gln Gln Arg Leu Leu Glu Asp Arg Lys Asp Val Val Val Leu Val
930 935 940
atc ctg agc cct gac ggc cgc cgc tcc cgc tat gtg cgg ctg cgc cag 3024
Ile Leu Ser Pro Asp Gly Arg Arg Ser Arg Tyr Val Arg Leu Arg Gln
945 950 955 960
cgc ctc tgc cgc cag agt gtc ctc ctc tgg ccc cac cag ccc agt ggt 3072
Arg Leu Cys Arg Gln Ser Val Leu Leu Trp Pro His Gln Pro Ser Gly
965 970 975
cag cgc agc ttc tgg gcc cag ctg ggc atg gcc ctg acc agg gac aac 3120
Gln Arg Ser Phe Trp Ala Gln Leu Gly Met Ala Leu Thr Arg Asp Asn
980 985 990
cac cac ttc tat aac cgg aac ttc tgc cag gga ccc acg gcc gaa tag 3168
His His Phe Tyr Asn Arg Asn Phe Cys Gln Gly Pro Thr Ala Glu
995 1000 1005
<210>43
<211>1055
<212>PRT
<213>未知
<400>43
Met Pro Met Lys Trp Ser Gly Trp Arg Trp Ser Trp Gly Pro Ala Thr
-45 -40 -35
His Thr Ala Leu Pro Pro Pro Gln Gly Phe Cys Arg Ser Ala Leu His
-30 -25 -20
Pro Leu Ser Leu Leu Val Gln Ala Ile Met Leu Ala Met Thr Leu Ala
-15 -10 -5 -1
Leu Gly Thr Leu Pro Ala Phe Leu Pro Cys Glu Leu Gln Pro His Gly
1 5 10 15
Leu Val Asn Cys Asn Trp Leu Phe Leu Lys Ser Val Pro His Phe Ser
20 25 30
Met Ala Ala Pro Arg Gly Asn Val Thr Ser Leu Ser Leu Ser Ser Asn
35 40 45
Arg Ile His His Leu His Asp Ser Asp Phe Ala His Leu Pro Ser Leu
50 55 60
Arg His Leu Asn Leu Lys Trp Asn Cys Pro Pro Val Gly Leu Ser Pro
65 70 75 80
Met His Phe Pro Cys His Met Thr Ile Glu Pro Ser Thr Phe Leu Ala
85 90 95
Val Pro Thr Leu Glu Glu Leu Asn Leu Ser Tyr Asn Asn Ile Met Thr
100 105 110
Val Pro Ala Leu Pro Lys Ser Leu Ile Ser Leu Ser Leu Ser His Thr
115 120 125
Asn Ile Leu Met Leu Asp Ser Ala Ser Leu Ala Gly Leu His Ala Leu
130 135 140
Arg Phe Leu Phe Met Asp Gly Asn Cys Tyr Tyr Lys Asn Pro Cys Arg
145 150 155 160
Gln Ala Leu Glu Val Ala Pro Gly Ala Leu Leu Gly Leu Gly Asn Leu
165 170 175
Thr His Leu Ser Leu Lys Tyr Asn Asn Leu Thr Val Val Pro Arg Asn
180 185 190
Leu Pro Ser Ser Leu Glu Tyr Leu Leu Leu Ser Tyr Asn Arg Ile Val
195 200 205
Lys Leu Ala Pro Glu Asp Leu Ala Asn Leu Thr Ala Leu Arg Val Leu
210 215 220
Asp Val Gly Gly Asn Cys Arg Arg Cys Asp His Ala Pro Asn Pro Cys
225 230 235 240
Met Glu Cys Pro Arg His Phe Pro Gln Leu His Pro Asp Thr Phe Ser
245 250 255
His Leu Ser Arg Leu Glu Gly Leu Val Leu Lys Asp Ser Ser Leu Ser
260 265 270
Trp Leu Asn Ala Ser Trp Phe Arg Gly Leu Gly Asn Leu Arg Val Leu
275 280 285
Asp Leu Ser Glu Asn Phe Leu Tyr Lys Cys Ila Thr Lys Thr Lys Ala
290 295 300
Phe Gln Gly Leu Thr Gln Leu Arg Lys Leu Asn Leu Ser Phe Asn Tyr
305 310 315 320
Gln Lys Arg Val Ser Phe Ala His Leu Ser Leu Ala Pro Ser Phe Gly
325 330 335
Ser Leu Val Ala Leu Lys Glu Leu Asp Met His Gly Ile Phe Phe Arg
340 345 350
Ser Leu Asp Glu Thr Thr Leu Arg Pro Leu Ala Arg Leu Pro Met Leu
355 360 365
Gln Thr Leu Arg Leu Gln Met Asn Phe Ile Asn Gln Ala Gln Leu Gly
370 375 380
Ile Phe Arg Ala Phe Pro Gly Leu Arg Tyr Val Asp Leu Ser Asp Asn
385 390 395 400
Arg Ile Ser Gly Ala Ser Glu Leu Thr Ala Thr Met Gly Glu Ala Asp
405 410 415
Gly Gly Glu Lys Val Trp Leu Gln Pro Gly Asp Leu Ala Pro Ala Pro
420 425 430
Val Asp Thr Pro Ser Ser Glu Asp Phe Arg Pro Asn Cys Ser Thr Leu
435 440 445
Asn Phe Thr Leu Asp Leu Ser Arg Asn Asn Leu Val Thr Val Gln Pro
450 455 460
Glu Met Phe Ala Gln Leu Ser His Leu Gln Cys Leu Arg Leu Ser His
465 470 475 480
Asn Cys Ile Ser Gln Ala Val Asn Gly Ser Gln Phe Leu Pro Leu Thr
485 490 495
Gly Leu Gln Val Leu Asp Leu Ser His Asn Lys Leu Asp Leu Tyr His
500 505 510
Glu His Ser Phe Thr Glu Leu Pro Arg Leu Glu Ala Leu Asp Leu Ser
515 520 525
Tyr Asn Ser Gln Pro Phe Gly Met Gln Gly Val Gly His Asn Phe Ser
530 535 540
Phe Val Ala His Leu Arg Thr Leu Arg His Leu Ser Leu Ala His Asn
545 550 555 560
Asn Ile His Ser Gln Val Ser Gln Gln Leu Cys Ser Thr Ser Leu Arg
565 570 575
Ala Leu Asp Phe Ser Gly Asn Ala Leu Gly His Met Trp Ala Glu Gly
580 585 590
Asp Leu Tyr Leu His Phe Phe Gln Gly Leu Ser Gly Leu Ile Trp Leu
595 600 605
Asp Leu Ser Gln Asn Arg Leu His Thr Leu Leu Pro Gln Thr Leu Arg
610 615 620
Asn Leu Pro Lys Ser Leu Gln Val Leu Arg Leu Arg Asp Asn Tyr Leu
625 630 635 640
Ala Phe Phe Lys Trp Trp Ser Leu His Phe Leu Pro Lys Leu Glu Val
645 650 655
Leu Asp Leu Ala Gly Asn Gln Leu Lys Ala Leu Thr Asn Gly Ser Leu
660 665 670
Pro Ala Gly Thr Arg Leu Arg Arg Leu Asp Val Ser Cys Asn Ser Ile
675 680 685
Ser Phe Val Ala Pro Gly Phe Phe Ser Lys Ala Lys Glu Leu Arg Glu
690 695 700
Leu Asn Leu Ser Ala Asn Ala Leu Lys Thr Val Asp His Ser Trp Phe
705 710 715 720
Gly Pro Leu Ala Ser Ala Leu Gln Ile Leu Asp Val Ser Ala Asn Pro
725 730 735
Leu His Cys Ala Cys Gly Ala Ala Phe Met Asp Phe Leu Leu Glu Val
740 745 750
Gln Ala Ala Val Pro Gly Leu Pro Ser Arg Val Lys Cys Gly Ser Pro
755 760 765
Gly Gln Leu Gln Gly Leu Ser Ile Phe Ala Gln Asp Leu Arg Leu Cys
770 775 780
Leu Asp Glu Ala Leu Ser Trp Asp Cys Phe Ala Leu Ser Leu Leu Ala
785 790 795 800
Val Ala Leu Gly Leu Gly Val Pro Met Leu His His Leu Cys Gly Trp
805 810 815
Asp Leu Trp Tyr Cys Phe His Leu Cys Leu Ala Trp Leu Pro Trp Arg
820 825 830
Gly Arg Gln ser Gly Arg Asp Glu Asp Ala Leu Pro Tyr Asp Ala Phe
835 840 845
Val Val Phe Asp Lys Thr Gln Ser Ala Val Ala Asp Trp Val Tyr Asn
850 855 860
Glu Leu Arg Gly Gln Leu Glu Glu Cys Arg Gly Arg Trp Ala Leu Arg
865 870 875 880
Leu Cys Leu Glu Glu Arg Asp Trp Leu Pro Gly Lys Thr Leu Phe Glu
885 890 895
Asn Leu Trp Ala Ser Val Tyr Gly Ser Arg Lys Thr Leu Phe Val Leu
900 905 910
Ala His Thr Asp Arg Val Ser Gly Leu Leu Arg Ala Ser Phe Leu Leu
915 920 925
Ala Gln Gln Arg Leu Leu Glu Asp Arg Lys Asp Val Val Val Leu Val
930 935 940
Ile Leu Ser Pro Asp Gly Arg Arg Ser Arg Tyr Val Arg Leu Arg Gln
945 950 955 960
Arg Leu Cys Arg Gln Ser Val Leu Leu Trp Pro His Gln Pro Ser Gly
965 970 975
Gln Arg Ser Phe Trp Ala Gln Leu Gly Met Ala Leu Thr Arg Asp Asn
980 985 990
His His Phe Tyr Asn Arg Asn Phe Cys Gln Gly Pro Thr Ala Glu
995 1000 1005
<210>44
<211>2289
<212>DNA
<213>未知
<220>
<223>未知生物的说明:啮齿类动物;推测为
小鼠(Mus musculus)
<220>
<221>CDS
<222>(1)..(2079)
<400>44
aac ctg tcc ttc aat tac cgc aag aag gta tcc ttt gcc cgc ctc cac 48
Asn Leu Ser Phe Asn Tyr Arg Lys Lys Val Ser Phe Ala Arg Leu His
1 5 l0 15
ctg gca agt tcc ttt aag aac ctg gtg tca ctg cag gag ctg aac atg 96
Leu Ala Ser Ser Phe Lys Asn Leu Val Ser Leu Gln Glu Leu Asn Met
20 25 30
aac ggc atc ttc ttc cgc ttg ctc aac aag tac acg ctc aga tgg ctg 144
Asn Gly Ile Phe Phe Arg Leu Leu Asn Lys Tyr Thr Leu Arg Trp Leu
35 40 45
gcc gat ctg ccc aaa ctc cac act ctg cat ctt caa atg aac ttc atc 192
Ala Asp Leu Pro Lys Leu His Thr Leu His Leu Gln Met Asn Phe Ile
50 55 60
aac cag gca cag ctc agc atc ttt ggt acc ttc cga gcc ctt cgc ttt 240
Asn Gln Ala Gln Leu Ser Ile Phe Gly Thr Phe Arg Ala Leu Arg Phe
65 70 75 80
gtg gac ttg tca gac aat cgc atc agt ggg cct tca acg ctg tca gaa 288
Val Asp Leu Ser Asp Asn Arg Ile Ser Gly Pro Ser Thr Leu Ser Glu
85 90 95
gcc acc cct gaa gag gca gat gat gca gag cag gag gag ctg ttg tct 336
Ala Thr Pro Glu Glu Ala Asp Asp Ala Glu Gln Glu Glu Leu Leu Ser
100 105 110
gcg gat cct cac cca gct ccg ctg agc acc cct gct tct aag aac ttc 384
Ala Asp Pro His Pro Ala Pro Leu Ser Thr Pro Ala Ser Lys Asn Phe
115 120 125
atg gac agg tgt aag aac ttc aag ttc aac atg gac ctg tct cgg aac 432
Met Asp Arg Cys Lys Asn Phe Lys Phe Asn Met Asp Leu Ser Arg Asn
130 135 140
aac ctg gtg act atc aca gca gag atg ttt gta aat ctc tca cgc ctc 480
Asn Leu Val Thr Ile Thr Ala Glu Met Phe Val Asn Leu Ser Arg Leu
145 150 155 160
cag tgt ctt agc ctg agc cac aac tca att gca cag gct gtc aat ggc 528
Gln Cys Leu Ser Leu Ser His Asn Ser Ile Ala Gln Ala Val Asn Gly
165 170 175
tct cag ttc ctg ccg ctg acc ggt ctg cag gtg cta gac ctg tcc cac 576
Ser Gln Phe Leu Pro Leu Thr Gly Leu Gln Val Leu Asp Leu Ser His
180 185 190
aat aag ctg gac ctc tac cac gag cac tca ttc acg gag cta cca cga 624
Asn Lys Leu Asp Leu Tyr His Glu His Ser Phe Thr Glu Leu Pro Arg
195 200 205
ctg gag gcc ctg gac ctc agc tac aac agc cag ccc ttt agc atg aag 672
Leu Glu Ala Leu Asp Leu Ser Tyr Asn Ser Gln Pro Phe Ser Met Lys
210 215 220
ggt ata ggc cac aat ttc agt ttt gtg acc cat ctg tcc atg cta cag 720
Gly Ile Gly His Asn Phe Ser Phe Val Thr His Leu Ser Met Leu Gln
225 230 235 240
agc ctt agc ctg gca cac aat gac att cat acc cgt gtg tcc tca cat 768
Ser Leu Ser Leu Ala His Asn Asp Ile His Thr Arg Val Ser Ser His
245 250 255
ctc aac agc aac tca gtg agg ttt ctt gac ttc agc ggc aac ggt atg 816
Leu Asn Ser Asn Ser Val Arg Phe Leu Asp Phe Ser Gly Asn Gly Met
260 265 270
ggc cgc atg tgg gat gag ggg ggc ctt tat ctc cat ttc ttc caa ggc 864
Gly Arg Met Trp Asp Glu Gly Gly Leu Tyr Leu His Phe Phe Gln Gly
275 280 285
ctg agt ggc gtg ctg aag ctg gac ctg tct caa aat aac ctg cat atc 912
Leu Ser Gly Val Leu Lys Leu Asp Leu Ser Gln Asn Asn Leu His Ile
290 295 300
ctc cgg ccc cag aac ctt gac aac ctc ccc aag agc ctg aag ctg ctg 960
Leu Arg Pro Gln Asn Leu Asp Asn Leu Pro Lys Ser Leu Lys Leu Leu
305 310 315 320
agc ctc cga gac aac tac cta tct ttc ttt aac tgg acc agt ctg tcc 1008
Ser Leu Arg Asp Asn Tyr Leu Ser Phe Phe Asn Trp Thr Ser Leu Ser
325 330 335
ttc cta ccc aac ctg gaa gtc cta gac ctg gca ggc aac cag cta aag 1056
Phe Leu Pro Asn Leu Glu Val Leu Asp Leu Ala Gly Asn Gln Leu Lys
340 345 350
gcc ctg acc aat ggc acc ctg cct aat ggc acc ctc ctc cag aaa ctc 1104
Ala Leu Thr Asn Gly Thr Leu Pro Asn Gly Thr Leu Leu Gln Lys Leu
355 360 365
gat gtc agt agc aac agt atc gtc tct gtg gcc ccc ggc ttc ttt tcc 1152
Asp Val Ser Ser Asn Ser Ile Val Ser Val Ala Pro Gly Phe Phe Ser
370 375 380
aag gcc aag gag ctg cga gag ctc aac ctt agc gcc aac gcc ctc aag 1200
Lys Ala Lys Glu Leu Arg Glu Leu Asn Leu Ser Ala Asn Ala Leu Lys
385 390 395 400
aca gtg gac cac tcc tgg ttt ggg ccc att gtg atg aac ctg aca gtt 1248
Thr Val Asp His Ser Trp Phe Gly Pro Ile Val Met Asn Leu Thr Val
405 410 415
cta gac gtg aga agc aac cct ctg cac tgt gcc tgt ggg gca gcc ttc 1296
Leu Asp Val Arg Ser Asn Pro Leu His Cys Ala Cys Gly Ala Ala Phe
420 425 430
gta gac tta ctg ttg gag gtg cag acc aag gtg cct ggc ctg gct aat 1344
Val Asp Leu Leu Leu Glu Val Gln Thr Lys Val Pro Gly Leu Ala Asn
435 440 445
ggt gtg aag tgt ggc agc ccc ggc cag ctg cag ggc cgt agc atc ttc 1392
Gly Val Lys Cys Gly Ser Pro Gly Gln Leu Gln Gly Arg Ser Ile Phe
450 455 460
gcg cag gac ctg cgg ctg tgc ctg gat gag gtc ctc tct tgg gac tgc 1440
Ala Gln Asp Leu Arg Leu Cys Leu Asp Glu Val Leu Ser Trp Asp Cys
465 470 475 480
ttt ggc ctt tca ctc ttg gct gtg gcc gtg ggc atg gtg gtg cct ata 1488
Phe Gly Leu Ser Leu Leu Ala Val Ala Val Gly Met Val Val Pro Ile
485 490 495
ctg cac cat ctc tgc ggc tgg gac gtc tgg tac tgt ttt cat ctg tgc 1536
Leu His His Leu Cys Gly Trp Asp Val Trp Tyr Cys Phe His Leu Cys
500 505 510
ctg gca tgg cta cct ttg cta gcc cgc agc cga cgc agc gcc caa act 1584
Leu Ala Trp Leu Pro Leu Leu Ala Arg Ser Arg Arg Ser Ala Gln Thr
515 520 525
ctc cct tat gat gcc ttc gtg gtg ttc gat aag gca cag agc gca gtt 1632
Leu Pro Tyr Asp Ala Phe Val Val Phe Asp Lys Ala Gln Ser Ala Val
530 535 540
gcc gac tgg gtg tat aac gag ctg cgg gtg cgg ctg gag gag cgg cgc 1680
Ala Asp Trp Val Tyr Asn Glu Leu Arg Val Arg Leu Glu Glu Arg Arg
545 550 555 560
ggc cgc tgg gca ctc cgc ctg tgc ctg gag gac cga gat tgg ctg cct 1728
Gly Arg Trp Ala Leu Arg Leu Cys Leu Glu Asp Arg Asp Trp Leu Pro
565 570 575
ggc cag acg ctc ttc gag aac ctc tgg gct tcc atc tat ggg agc cgc 1776
Gly Gln Thr Leu Phe Glu Asn Leu Trp Ala Ser Ile Tyr Gly Ser Arg
580 585 590
aag act cta ttt gtg ctg gcc cac acg gac cgc gtc agt ggc ctc ctg 1824
Lys Thr Leu Phe Val Leu Ala His Thr Asp Arg Val Ser Gly Leu Leu
595 600 605
cgc acc agc ttc ctg ctg gct cag cag cgc ctg ttg gaa gac cgc aag 1872
Arg Thr Ser Phe Leu Leu Ala Gln Gln Arg Leu Leu Glu Asp Arg Lys
610 615 620
gac gtg gtg gtg ttg gtg atc ctg cgt ccg gat gcc cac cgc tcc cgc 1920
Asp Val Val Val Leu Val Ile Leu Arg Pro Asp Ala His Arg Ser Arg
625 630 635 640
tat gtg cga ctg cgc cag cgt ctc tgc cgc cag agt gtg ctc ttc tgg 1968
Tyr Val Arg Leu Arg Gln Arg Leu Cys Arg Gln Ser Val Leu Phe Trp
645 650 655
ccc cag cag ccc aac ggg cag ggg ggc ttc tgg gcc cag ctg agt aca 2016
Pro Gln Gln Pro Asn Gly Gln Gly Gly Phe Trp Ala Gln Leu Ser Thr
660 665 670
gcc ctg act agg gac aac cgc cac ttc tat aac cag aac ttc tgc cgg 2064
Ala Leu Thr Arg Asp Asn Arg His Phe Tyr Asn Gln Asn Phe Cys Arg
675 680 685
gga cct aca gca gaa tagctcagag caacagctgg aaacagctgc atcttcatgt 2119
Gly Pro Thr Ala Glu
690
ctggttcccg agttgctctg cctgccttgc tctgtcttac tacaccgcta tttggcaagt 2179
gcgcaatata tgctaccaag ccaccaggcc cacggagcaa aggttggctg taaagggtag 2239
ttttcttccc atgcatcttt caggagagtg aagatagaca ccaaacccac 2289
<210>45
<211>693
<212>PRT
<213>未知
<400>45
Asn Leu Ser Phe Asn Tyr Arg Lys Lys Val Ser Phe Ala Arg Leu His
1 5 10 15
Leu Ala Ser Ser Phe Lys Asn Leu Val Ser Leu Gln Glu Leu Asn Met
20 25 30
Asn Gly Ile Phe Phe Arg Leu Leu Asn Lys Tyr Thr Leu Arg Trp Leu
35 40 45
Ala Asp Leu Pro Lys Leu His Thr Leu His Leu Gln Met Asn Phe Ile
50 55 60
Asn Gln Ala Gln Leu Ser Ile Phe Gly Thr Phe Arg Ala Leu Arg Phe
65 70 75 80
Val Asp Leu Ser Asp Asn Arg Ile Ser Gly Pro Ser Thr Leu Ser Glu
85 90 95
Ala Thr Pro Glu Glu Ala Asp Asp Ala Glu Gln Glu Glu Leu Leu Ser
100 105 110
Ala Asp Pro His Pro Ala Pro Leu Ser Thr Pro Ala Ser Lys Asn Phe
115 120 125
Met Asp Arg Cys Lys Asn Phe Lys Phe Asn Mer Asp Leu Ser Arg Asn
130 135 140
Asn Leu Val Thr Ile Thr Ala Glu Met Phe Val Asn Leu Ser Arg Leu
145 150 155 160
Gln Cys Leu Ser Leu Ser His Asn Ser Ile Ala Gln Ala Val Asn Gly
165 170 175
Ser Gln Phe Leu Pro Leu Thr Gly Leu Gln Val Leu Asp Leu Ser His
180 185 190
Asn Lys Leu Asp Leu Tyr His Glu His Ser Phe Thr Glu Leu Pro Arg
195 200 205
Leu Glu Ala Leu Asp Leu Ser Tyr Asn Ser Gln Pro Phe Ser Met Lys
210 215 220
Gly Ile Gly His Asn Phe Ser Phe Val Thr His Leu Ser Met Leu Gln
225 230 235 240
Ser Leu Ser Leu Ala His Asn Asp Ile His Thr Arg Val Ser Ser His
245 250 255
Leu Asn Ser Asn Ser Val Arg Phe Leu Asp Phe Ser Gly Asn Gly Met
260 265 270
Gly Arg Met Trp Asp Glu Gly Gly Leu Tyr Leu His Phe Phe Gln Gly
275 280 285
Leu Ser Gly Val Leu Lys Leu Asp Leu Ser Gln Asn Asn Leu His Ile
290 295 300
Leu Arg Pro Gln Asn Leu Asp Asn Leu Pro Lys Ser Leu Lys Leu Leu
305 310 315 320
Ser Leu Arg Asp Asn Tyr Leu Ser Phe Phe Asn Trp Thr Ser Leu Ser
325 330 335
Phe Leu Pro Asn Leu Glu Val Leu Asp Leu Ala Gly Asn Gln Leu Lys
340 345 350
Ala Leu Thr Asn Gly Thr Leu Pro Asn Gly Thr Leu Leu Gln Lys Leu
355 360 365
Asp Val Ser Ser Asn Ser Ile Val Ser Val Ala Pro Gly Phe Phe Ser
370 375 380
Lys Ala Lys Glu Leu Arg Glu Leu Asn Leu Ser Ala Asn Ala Leu Lys
385 390 395 400
Thr Val Asp His Ser Trp Phe Gly Pro Ile Val Met Asn Leu Thr Val
405 410 415
Leu Asp Val Arg Ser Asn Pro Leu His Cys Ala Cys Gly Ala Ala Phe
420 425 430
Val Asp Leu Leu Leu Glu Val Gln Thr Lys Val Pro Gly Leu Ala Asn
435 440 445
Gly Val Lys Cys Gly Ser Pro Gly Gln Leu Gln Gly Arg Ser Ile Phe
450 455 460
Ala Gln Asp Leu Arg Leu Cys Leu Asp Glu Val Leu Ser Trp Asp Cys
465 470 475 480
Phe Gly Leu Ser Leu Leu Ala Val Ala Val Gly Met Val Val Pro Ile
485 490 495
Leu His His Leu Cys Gly Trp Asp Val Trp Tyr Cys Phe His Leu Cys
500 505 510
Leu Ala Trp Leu Pro Leu Leu Ala Arg Ser Arg Arg Ser Ala Gln Thr
515 520 525
Leu Pro Tyr Asp Ala Phe Val Val Phe Asp Lys Ala Gln Ser Ala Val
530 535 540
Ala Asp Trp Val Tyr Asn Glu Leu Arg Val Arg Leu Glu Glu Arg Arg
545 550 555 560
Gly Arg Trp Ala Leu Arg Leu Cys Leu Glu Asp Arg Asp Trp Leu Pro
565 570 575
Gly Gln Thr Leu Phe Glu Asn Leu Trp Ala Ser Ile Tyr Gly Ser Arg
580 585 590
Lys Thr Leu Phe Val Leu Ala His Thr Asp Arg Val Ser Gly Leu Leu
595 600 605
Arg Thr Ser Phe Leu Leu Ala Gln Gln Arg Leu Leu Glu Asp Arg Lys
610 615 620
Asp Val Val Val Leu Val Ile Leu Arg Pro Asp Ala His Arg Ser Arg
625 630 635 640
Tyr Val Arg Leu Arg Gln Arg Leu Cys Arg Gln Ser Val Leu Phe Trp
645 650 655
Pro Gln Gln Pro Asn Gly Gln Gly Gly Phe Trp Ala Gln Leu Ser Thr
660 665 670
Ala Leu Thr Arg Asp Asn Arg His Phe Tyr Asn Gln Asn Phe Cys Arg
675 680 685
Gly Pro Thr Ala Glu
690
Claims (10)
1.一种抗原性多肽,其包含SEQ ID NO:41的至少20个连续氨基酸残基。
2.一种抗原性多肽,其包含SEQ ID NO:41的氨基酸残基1-765。
3.一种抗原性多肽,其包含SEQ ID NO:41。
4.一种抗原性多肽,其为SEQ ID NO:41。
5.一种融合蛋白,其包含权利要求1-4中任一项的多肽。
6.一种抗体或抗体片段,其特异性地与权利要求1-4中任一项的多肽结合。
7.一种核酸,其编码权利要求1-4中任一项的多肽。
8.一种表达载体,其包含权利要求7的核酸。
9.一种宿主细胞,其包含权利要求8的载体。
10.一种重组生产权利要求1-4中任一项的多肽的方法,其包括在表达多肽的条件下培养权利要求9的宿主细胞。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US20755800P | 2000-05-25 | 2000-05-25 | |
US60/207558 | 2000-05-25 |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB01813453XA Division CN1222539C (zh) | 2000-05-25 | 2001-05-23 | 人受体蛋白;相关的试剂和方法 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN1721443A true CN1721443A (zh) | 2006-01-18 |
Family
ID=22771074
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA2005100885947A Pending CN1721443A (zh) | 2000-05-25 | 2001-05-23 | 人受体蛋白;相关的试剂和方法 |
CNB01813453XA Expired - Fee Related CN1222539C (zh) | 2000-05-25 | 2001-05-23 | 人受体蛋白;相关的试剂和方法 |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB01813453XA Expired - Fee Related CN1222539C (zh) | 2000-05-25 | 2001-05-23 | 人受体蛋白;相关的试剂和方法 |
Country Status (13)
Country | Link |
---|---|
EP (4) | EP1908837A3 (zh) |
JP (1) | JP2003533996A (zh) |
KR (1) | KR100907356B1 (zh) |
CN (2) | CN1721443A (zh) |
AU (3) | AU6488901A (zh) |
CA (1) | CA2410082A1 (zh) |
HK (1) | HK1049017A1 (zh) |
HU (1) | HUP0302240A2 (zh) |
IL (2) | IL152725A0 (zh) |
MX (1) | MXPA02011618A (zh) |
NZ (2) | NZ522327A (zh) |
WO (1) | WO2001090151A2 (zh) |
ZA (1) | ZA200208856B (zh) |
Families Citing this family (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6303321B1 (en) | 1999-02-11 | 2001-10-16 | North Shore-Long Island Jewish Research Institute | Methods for diagnosing sepsis |
JP2002034565A (ja) * | 2000-07-19 | 2002-02-05 | Japan Science & Technology Corp | 細菌dnaを特異的に認識する受容体タンパク質 |
US7220723B2 (en) | 2001-05-15 | 2007-05-22 | The Feinstein Institute For Medical Research | Inhibitors of the interaction between HMGB polypeptides and toll-like receptor 2 as anti-inflammatory agents |
US7304034B2 (en) | 2001-05-15 | 2007-12-04 | The Feinstein Institute For Medical Research | Use of HMGB fragments as anti-inflammatory agents |
WO2003031573A2 (en) * | 2001-10-05 | 2003-04-17 | Coley Pharmaceutical Gmbh | Toll-like receptor 3 signaling agonists and antagonists |
US7696169B2 (en) | 2003-06-06 | 2010-04-13 | The Feinstein Institute For Medical Research | Inhibitors of the interaction between HMGB polypeptides and toll-like receptor 2 as anti-inflammatory agents |
WO2005026209A2 (en) | 2003-09-11 | 2005-03-24 | Critical Therapeutics, Inc. | Monoclonal antibodies against hmgb1 |
US8039588B2 (en) * | 2004-05-21 | 2011-10-18 | The Uab Research Foundation | Variable lymphocyte receptors |
US8821884B2 (en) * | 2004-07-27 | 2014-09-02 | The Regents Of The University Of California | Compositions and methods using MD-2 mutants and chimeric proteins |
EP1657257A1 (en) * | 2004-11-11 | 2006-05-17 | Klinikum der Universität Regensburg | Recombinant TLR-MD-2 fusion proteins |
WO2006054129A1 (en) | 2004-11-19 | 2006-05-26 | Institut Gustave Roussy | Improved treatment of cancer by double-stranded rna |
CN101415438B (zh) * | 2004-11-30 | 2013-03-27 | 森托科尔公司 | Toll样受体3拮抗剂、方法和用途 |
US7700728B2 (en) | 2005-03-24 | 2010-04-20 | Schering Corporation | Use of chimeric receptors in a screening assay for identifying agonists and antagonists of cell receptors |
US7498409B2 (en) | 2005-03-24 | 2009-03-03 | Schering Corporation | Screening assay for TLR7, TLR8 and TLR9 agonists and antagonists |
US7982007B2 (en) * | 2007-12-26 | 2011-07-19 | Centocor, Inc. | Cynomolgus toll-like receptor 3 |
KR20120118044A (ko) | 2010-01-27 | 2012-10-25 | 다케다 야쿠힌 고교 가부시키가이샤 | 항암제로 유도되는 말초 신경 장애를 억제하기 위한 화합물 |
EP2713737B1 (en) | 2011-06-01 | 2016-04-20 | Janus Biotherapeutics, Inc. | Novel immune system modulators |
JP6093759B2 (ja) | 2011-06-01 | 2017-03-08 | ジャナス バイオセラピューティクス,インク. | 新規の免疫系調節剤 |
IN2014KN00948A (zh) | 2011-10-04 | 2015-08-21 | Janus Biotherapeutics Inc | |
SG11202010939WA (en) | 2018-05-31 | 2020-12-30 | Daiichi Sankyo Co Ltd | Anti-human tlr7 antibody |
KR102265432B1 (ko) * | 2019-08-20 | 2021-06-15 | 주식회사 케어젠 | 피부 미백 활성을 갖는 펩타이드 및 이의 용도 |
Family Cites Families (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB1319315A (en) | 1969-06-19 | 1973-06-06 | Citizen Watch Co Ltd | Calendar timepiece |
US3940475A (en) | 1970-06-11 | 1976-02-24 | Biological Developments, Inc. | Radioimmune method of assaying quantitatively for a hapten |
NL154598B (nl) | 1970-11-10 | 1977-09-15 | Organon Nv | Werkwijze voor het aantonen en bepalen van laagmoleculire verbindingen en van eiwitten die deze verbindingen specifiek kunnen binden, alsmede testverpakking. |
US3817837A (en) | 1971-05-14 | 1974-06-18 | Syva Corp | Enzyme amplification assay |
US3939350A (en) | 1974-04-29 | 1976-02-17 | Board Of Trustees Of The Leland Stanford Junior University | Fluorescent immunoassay employing total reflection for activation |
US3996345A (en) | 1974-08-12 | 1976-12-07 | Syva Company | Fluorescence quenching with immunological pairs in immunoassays |
US4277437A (en) | 1978-04-05 | 1981-07-07 | Syva Company | Kit for carrying out chemically induced fluorescence immunoassay |
US4215149A (en) | 1978-11-17 | 1980-07-29 | Standard Brands Incorporated | Process for improving the palatability of pet food |
US4366241A (en) | 1980-08-07 | 1982-12-28 | Syva Company | Concentrating zone method in heterogeneous immunoassays |
US4659678A (en) | 1982-09-29 | 1987-04-21 | Serono Diagnostics Limited | Immunoassay of antigens |
US4816567A (en) | 1983-04-08 | 1989-03-28 | Genentech, Inc. | Recombinant immunoglobin preparations |
US4859609A (en) | 1986-04-30 | 1989-08-22 | Genentech, Inc. | Novel receptors for efficient determination of ligands and their antagonists or agonists |
ES2279572T3 (es) * | 1997-05-07 | 2007-08-16 | Schering Corporation | Proteinas receptoras humanas de tipo toll, reactivos y metodos relacionados. |
DK1887014T3 (da) * | 1997-10-17 | 2010-08-02 | Genentech Inc | Humane Toll-homologer |
JP2000128900A (ja) * | 1998-10-26 | 2000-05-09 | Japan Science & Technology Corp | 新規トル様(Toll−like)レセプター及びその遺伝子 |
GB0001704D0 (en) * | 2000-01-25 | 2000-03-15 | Glaxo Group Ltd | Protein |
US20050095587A1 (en) * | 2000-02-24 | 2005-05-05 | Panzer Scott R. | Molecules for disease detection and treatment |
-
2001
- 2001-05-23 NZ NZ522327A patent/NZ522327A/en not_active IP Right Cessation
- 2001-05-23 NZ NZ534666A patent/NZ534666A/en not_active IP Right Cessation
- 2001-05-23 EP EP07123071A patent/EP1908837A3/en not_active Withdrawn
- 2001-05-23 IL IL15272501A patent/IL152725A0/xx unknown
- 2001-05-23 WO PCT/US2001/016766 patent/WO2001090151A2/en active IP Right Grant
- 2001-05-23 CA CA002410082A patent/CA2410082A1/en not_active Abandoned
- 2001-05-23 CN CNA2005100885947A patent/CN1721443A/zh active Pending
- 2001-05-23 CN CNB01813453XA patent/CN1222539C/zh not_active Expired - Fee Related
- 2001-05-23 EP EP01939361A patent/EP1283849A2/en not_active Withdrawn
- 2001-05-23 KR KR1020027015868A patent/KR100907356B1/ko not_active IP Right Cessation
- 2001-05-23 AU AU6488901A patent/AU6488901A/xx active Pending
- 2001-05-23 EP EP06014230A patent/EP1714980A1/en not_active Withdrawn
- 2001-05-23 HU HU0302240A patent/HUP0302240A2/hu unknown
- 2001-05-23 JP JP2001586962A patent/JP2003533996A/ja active Pending
- 2001-05-23 EP EP20040005422 patent/EP1433792A3/en not_active Withdrawn
- 2001-05-23 MX MXPA02011618A patent/MXPA02011618A/es active IP Right Grant
-
2002
- 2002-10-31 ZA ZA200208856A patent/ZA200208856B/en unknown
-
2003
- 2003-02-21 HK HK03101314.4A patent/HK1049017A1/zh unknown
-
2006
- 2006-09-26 AU AU2006222684A patent/AU2006222684B2/en not_active Ceased
-
2008
- 2008-06-12 IL IL192111A patent/IL192111A0/en unknown
-
2009
- 2009-02-12 AU AU2009200539A patent/AU2009200539A1/en not_active Ceased
Also Published As
Publication number | Publication date |
---|---|
WO2001090151A3 (en) | 2002-10-17 |
AU2006222684A1 (en) | 2006-10-19 |
CN1222539C (zh) | 2005-10-12 |
AU2006222684B2 (en) | 2008-12-11 |
ZA200208856B (en) | 2004-03-24 |
EP1433792A8 (en) | 2004-10-06 |
EP1433792A3 (en) | 2004-11-17 |
EP1283849A2 (en) | 2003-02-19 |
IL192111A0 (en) | 2008-12-29 |
IL152725A0 (en) | 2003-06-24 |
EP1714980A1 (en) | 2006-10-25 |
KR100907356B1 (ko) | 2009-07-10 |
CN1444602A (zh) | 2003-09-24 |
HUP0302240A2 (hu) | 2003-10-28 |
AU2009200539A1 (en) | 2009-03-05 |
EP1433792A2 (en) | 2004-06-30 |
NZ534666A (en) | 2006-01-27 |
EP1908837A2 (en) | 2008-04-09 |
CA2410082A1 (en) | 2001-11-29 |
NZ522327A (en) | 2004-09-24 |
EP1908837A3 (en) | 2008-06-11 |
MXPA02011618A (es) | 2003-03-10 |
AU6488901A (en) | 2001-12-03 |
HK1049017A1 (zh) | 2003-04-25 |
JP2003533996A (ja) | 2003-11-18 |
WO2001090151A2 (en) | 2001-11-29 |
KR20030003761A (ko) | 2003-01-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1225555C (zh) | 人Toll样受体蛋白、相关试剂和方法 | |
CN1222539C (zh) | 人受体蛋白;相关的试剂和方法 | |
CN1575337A (zh) | 哺乳动物细胞因子受体亚单位蛋白、相关试剂及方法 | |
CN1221569C (zh) | 介导细胞吸附和信号传递的细胞表面分子 | |
CN1217956C (zh) | 可溶性单链t细胞受体蛋白 | |
CN1444652A (zh) | 哺乳动物受体蛋白;相关试剂和方法 | |
CN1348465A (zh) | 通过il-13和il-13受体链的拮抗作用治疗纤维化 | |
CN1827639A (zh) | 新的细胞因子zalpha11配体 | |
CN1271387A (zh) | 哺乳动物细胞因子∶白介素-b30和相关试剂 | |
CN1372594A (zh) | Ox2受体的同系物 | |
CN1245533A (zh) | 哺乳动物趋化因子 | |
CN1649897A (zh) | Falp蛋白 | |
CN1886155A (zh) | 分离的哺乳动物膜蛋白基因和相关试剂 | |
AU2001264889A1 (en) | Human receptor proteins; related reagents and methods | |
CN1717413A (zh) | 新的pth应答基因 | |
CN1377891A (zh) | 树突状细胞来源的锌指蛋白、其编码序列和用途 | |
CN1908012A (zh) | 介导细胞吸附和信号传递的细胞表面分子 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 1085224 Country of ref document: HK |
|
C12 | Rejection of a patent application after its publication | ||
RJ01 | Rejection of invention patent application after publication | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: WD Ref document number: 1085224 Country of ref document: HK |