KR20220069917A - Vector compositions and methods of use thereof for the treatment of lysosomal storage disorders - Google Patents
Vector compositions and methods of use thereof for the treatment of lysosomal storage disorders
- Publication number
- KR20220069917A (application number KR1020227002362A)
- Authority
- KR
- South Korea
- Prior art keywords
- leu
- vector
- composition
- gly
- ala
- Prior art date
- 239000013598 vector Substances 0.000 title claims abstract description 251
- 239000000203 mixture Substances 0.000 title claims abstract description 184
- 238000000034 method Methods 0.000 title claims abstract description 149
- 208000015439 Lysosomal storage disease Diseases 0.000 title claims abstract description 144
- 238000011282 treatment Methods 0.000 title description 42
- 108090000790 Enzymes Proteins 0.000 claims abstract description 263
- 102000004190 Enzymes Human genes 0.000 claims abstract description 262
- 230000002132 lysosomal effect Effects 0.000 claims abstract description 198
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 91
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 91
- 239000002157 polynucleotide Substances 0.000 claims abstract description 91
- 108091000080 Phosphotransferase Proteins 0.000 claims abstract description 34
- 102000020233 phosphotransferase Human genes 0.000 claims abstract description 34
- 210000004027 cell Anatomy 0.000 claims description 206
- 230000014509 gene expression Effects 0.000 claims description 114
- 102100023231 Lysosomal alpha-mannosidase Human genes 0.000 claims description 110
- 101710135169 Lysosomal alpha-mannosidase Proteins 0.000 claims description 110
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 80
- 230000026731 phosphorylation Effects 0.000 claims description 55
- 238000006366 phosphorylation reaction Methods 0.000 claims description 55
- 241000282414 Homo sapiens Species 0.000 claims description 53
- 150000007523 nucleic acids Chemical group 0.000 claims description 51
- 239000013603 viral vector Substances 0.000 claims description 47
- 108010009380 alpha-N-acetyl-D-glucosaminidase Proteins 0.000 claims description 39
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 35
- 102100034561 Alpha-N-acetylglucosaminidase Human genes 0.000 claims description 33
- 239000013607 AAV vector Substances 0.000 claims description 31
- 239000013604 expression vector Substances 0.000 claims description 27
- 208000024891 symptom Diseases 0.000 claims description 27
- 241000702421 Dependoparvovirus Species 0.000 claims description 20
- 238000012546 transfer Methods 0.000 claims description 20
- 241000701022 Cytomegalovirus Species 0.000 claims description 18
- 108010030291 alpha-Galactosidase Proteins 0.000 claims description 14
- 102000005840 alpha-Galactosidase Human genes 0.000 claims description 14
- 210000004962 mammalian cell Anatomy 0.000 claims description 14
- 239000002253 acid Substances 0.000 claims description 13
- 238000000338 in vitro Methods 0.000 claims description 13
- 210000000234 capsid Anatomy 0.000 claims description 12
- 238000001727 in vivo Methods 0.000 claims description 12
- 238000001990 intravenous administration Methods 0.000 claims description 12
- 108010028144 alpha-Glucosidases Proteins 0.000 claims description 11
- 238000011161 development Methods 0.000 claims description 10
- 238000003776 cleavage reaction Methods 0.000 claims description 8
- 238000007913 intrathecal administration Methods 0.000 claims description 8
- 230000007017 scission Effects 0.000 claims description 8
- 238000007920 subcutaneous administration Methods 0.000 claims description 8
- 238000001361 intraarterial administration Methods 0.000 claims description 7
- 238000007918 intramuscular administration Methods 0.000 claims description 7
- 238000007914 intraventricular administration Methods 0.000 claims description 5
- 230000009885 systemic effect Effects 0.000 claims description 4
- 230000000699 topical effect Effects 0.000 claims description 4
- 210000003705 ribosome Anatomy 0.000 claims description 2
- 241001655883 Adeno-associated virus - 1 Species 0.000 claims 3
- 241000702423 Adeno-associated virus - 2 Species 0.000 claims 3
- 241000202702 Adeno-associated virus - 3 Species 0.000 claims 3
- 241000580270 Adeno-associated virus - 4 Species 0.000 claims 3
- 241001634120 Adeno-associated virus - 5 Species 0.000 claims 3
- 241000972680 Adeno-associated virus - 6 Species 0.000 claims 3
- 241001164823 Adeno-associated virus - 7 Species 0.000 claims 3
- 241001164825 Adeno-associated virus - 8 Species 0.000 claims 3
- 102100024295 Maltase-glucoamylase Human genes 0.000 claims 3
- 101000588395 Bacillus subtilis (strain 168) Beta-hexosaminidase Proteins 0.000 claims 1
- 239000008194 pharmaceutical composition Substances 0.000 abstract description 36
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 abstract description 9
- 229940088598 enzyme Drugs 0.000 description 231
- 230000000694 effects Effects 0.000 description 100
- 108090000623 proteins and genes Proteins 0.000 description 97
- 210000001519 tissue Anatomy 0.000 description 79
- NBSCHQHZLSJFNQ-QTVWNMPRSA-N D-Mannose-6-phosphate Chemical compound OC1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H](O)[C@@H]1O NBSCHQHZLSJFNQ-QTVWNMPRSA-N 0.000 description 65
- 102000004196 processed proteins & peptides Human genes 0.000 description 57
- 102000004169 proteins and genes Human genes 0.000 description 57
- 235000018102 proteins Nutrition 0.000 description 50
- 229920001184 polypeptide Polymers 0.000 description 47
- 108020004414 DNA Proteins 0.000 description 44
- 102000053602 DNA Human genes 0.000 description 44
- 101710145225 Cation-independent mannose-6-phosphate receptor Proteins 0.000 description 42
- 102100037182 Cation-independent mannose-6-phosphate receptor Human genes 0.000 description 42
- 239000000758 substrate Substances 0.000 description 42
- 238000001415 gene therapy Methods 0.000 description 37
- 238000002641 enzyme replacement therapy Methods 0.000 description 34
- 150000001875 compounds Chemical class 0.000 description 33
- 241001465754 Metazoa Species 0.000 description 32
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 30
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 28
- 150000002632 lipids Chemical class 0.000 description 28
- 210000004072 lung Anatomy 0.000 description 28
- 241000699670 Mus sp. Species 0.000 description 27
- 238000010172 mouse model Methods 0.000 description 26
- 102000002464 Galactosidases Human genes 0.000 description 25
- 108010093031 Galactosidases Proteins 0.000 description 25
- 230000001225 therapeutic effect Effects 0.000 description 24
- 239000002502 liposome Substances 0.000 description 21
- 239000000523 sample Substances 0.000 description 20
- 210000002966 serum Anatomy 0.000 description 20
- 239000003814 drug Substances 0.000 description 19
- 230000006870 function Effects 0.000 description 19
- 201000010099 disease Diseases 0.000 description 18
- 239000012634 fragment Substances 0.000 description 18
- 108020001507 fusion proteins Proteins 0.000 description 18
- 102000037865 fusion proteins Human genes 0.000 description 18
- 230000001965 increasing effect Effects 0.000 description 18
- 108010039650 imiglucerase Proteins 0.000 description 17
- 210000004185 liver Anatomy 0.000 description 17
- 239000003636 conditioned culture medium Substances 0.000 description 16
- 102000039446 nucleic acids Human genes 0.000 description 16
- 108020004707 nucleic acids Proteins 0.000 description 16
- YUDPTGPSBJVHCN-VMMWWAARSA-N 4-methyl-7-[(2r,3s,4s,5s,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxychromen-2-one Chemical compound C1=CC=2C(C)=CC(=O)OC=2C=C1O[C@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@@H]1O YUDPTGPSBJVHCN-VMMWWAARSA-N 0.000 description 15
- 102100033342 Lysosomal acid glucosylceramidase Human genes 0.000 description 15
- 238000009472 formulation Methods 0.000 description 15
- 239000002105 nanoparticle Substances 0.000 description 15
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 14
- 229920001542 oligosaccharide Polymers 0.000 description 14
- 150000002482 oligosaccharides Chemical class 0.000 description 14
- 239000013612 plasmid Substances 0.000 description 14
- 230000008901 benefit Effects 0.000 description 13
- 230000004700 cellular uptake Effects 0.000 description 13
- 239000003937 drug carrier Substances 0.000 description 13
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 13
- 229960002127 imiglucerase Drugs 0.000 description 13
- 210000003712 lysosome Anatomy 0.000 description 13
- 230000001868 lysosomic effect Effects 0.000 description 13
- 208000015872 Gaucher disease Diseases 0.000 description 12
- 241000880493 Leptailurus serval Species 0.000 description 12
- 125000003275 alpha amino acid group Chemical group 0.000 description 12
- 235000001014 amino acid Nutrition 0.000 description 12
- 239000000047 product Substances 0.000 description 12
- 229940124597 therapeutic agent Drugs 0.000 description 12
- 102100033448 Lysosomal alpha-glucosidase Human genes 0.000 description 11
- 241000700605 Viruses Species 0.000 description 11
- 150000001413 amino acids Chemical class 0.000 description 11
- 238000004458 analytical method Methods 0.000 description 11
- 230000001939 inductive effect Effects 0.000 description 11
- 238000000746 purification Methods 0.000 description 11
- 108010073969 valyllysine Proteins 0.000 description 11
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 10
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 10
- 239000004480 active ingredient Substances 0.000 description 10
- 238000003556 assay Methods 0.000 description 10
- 210000003734 kidney Anatomy 0.000 description 10
- 108010051242 phenylalanylserine Proteins 0.000 description 10
- 230000009467 reduction Effects 0.000 description 10
- 241000713666 Lentivirus Species 0.000 description 9
- 208000008955 Mucolipidoses Diseases 0.000 description 9
- -1 chromosomes Proteins 0.000 description 9
- 239000003623 enhancer Substances 0.000 description 9
- 238000002347 injection Methods 0.000 description 9
- 239000007924 injection Substances 0.000 description 9
- 238000004519 manufacturing process Methods 0.000 description 9
- 239000000693 micelle Substances 0.000 description 9
- 201000007769 mucolipidosis Diseases 0.000 description 9
- 230000002829 reductive effect Effects 0.000 description 9
- 210000000952 spleen Anatomy 0.000 description 9
- 230000008685 targeting Effects 0.000 description 9
- 241000701161 unidentified adenovirus Species 0.000 description 9
- 241001430294 unidentified retrovirus Species 0.000 description 9
- YUDPTGPSBJVHCN-YMILTQATSA-N 4-methylumbelliferyl beta-D-glucoside Chemical compound C1=CC=2C(C)=CC(=O)OC=2C=C1O[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O YUDPTGPSBJVHCN-YMILTQATSA-N 0.000 description 8
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 8
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 8
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 8
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 8
- 108010079364 N-glycylalanine Proteins 0.000 description 8
- 108700008625 Reporter Genes Proteins 0.000 description 8
- 108700019146 Transgenes Proteins 0.000 description 8
- 229920004890 Triton X-100 Polymers 0.000 description 8
- 238000009825 accumulation Methods 0.000 description 8
- 108010044940 alanylglutamine Proteins 0.000 description 8
- 201000008333 alpha-mannosidosis Diseases 0.000 description 8
- 108010068265 aspartyltyrosine Proteins 0.000 description 8
- 210000004556 brain Anatomy 0.000 description 8
- 208000035475 disorder Diseases 0.000 description 8
- 108010050848 glycylleucine Proteins 0.000 description 8
- 108010057821 leucylproline Proteins 0.000 description 8
- 229920000642 polymer Polymers 0.000 description 8
- 102000005962 receptors Human genes 0.000 description 8
- 108020003175 receptors Proteins 0.000 description 8
- 230000001105 regulatory effect Effects 0.000 description 8
- 230000002441 reversible effect Effects 0.000 description 8
- 239000000126 substance Substances 0.000 description 8
- 108010038745 tryptophylglycine Proteins 0.000 description 8
- 208000002267 Anti-neutrophil cytoplasmic antibody-associated vasculitis Diseases 0.000 description 7
- 108010047495 alanylglycine Proteins 0.000 description 7
- 108010087924 alanylproline Proteins 0.000 description 7
- 239000003795 chemical substances by application Substances 0.000 description 7
- 238000010586 diagram Methods 0.000 description 7
- 230000002255 enzymatic effect Effects 0.000 description 7
- 108010037850 glycylvaline Proteins 0.000 description 7
- 230000036541 health Effects 0.000 description 7
- 210000005260 human cell Anatomy 0.000 description 7
- 108010005942 methionylglycine Proteins 0.000 description 7
- 108010029020 prolylglycine Proteins 0.000 description 7
- 108010090894 prolylleucine Proteins 0.000 description 7
- 238000011084 recovery Methods 0.000 description 7
- 229920002477 rna polymer Polymers 0.000 description 7
- 108010026333 seryl-proline Proteins 0.000 description 7
- 239000000243 solution Substances 0.000 description 7
- 230000003612 virological effect Effects 0.000 description 7
- 230000003442 weekly effect Effects 0.000 description 7
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 6
- 102000005720 Glutathione transferase Human genes 0.000 description 6
- 108010070675 Glutathione transferase Proteins 0.000 description 6
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 6
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 6
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 6
- DNIAPMSPPWPWGF-UHFFFAOYSA-N Propylene glycol Chemical compound CC(O)CO DNIAPMSPPWPWGF-UHFFFAOYSA-N 0.000 description 6
- 125000000539 amino acid group Chemical group 0.000 description 6
- 108010047857 aspartylglycine Proteins 0.000 description 6
- 229940106189 ceramide Drugs 0.000 description 6
- 239000002738 chelating agent Substances 0.000 description 6
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 6
- 210000001808 exosome Anatomy 0.000 description 6
- 238000002474 experimental method Methods 0.000 description 6
- 150000002305 glucosylceramides Chemical class 0.000 description 6
- 108010049041 glutamylalanine Proteins 0.000 description 6
- 210000002216 heart Anatomy 0.000 description 6
- 238000007490 hematoxylin and eosin (H&E) staining Methods 0.000 description 6
- 239000004615 ingredient Substances 0.000 description 6
- 108010045758 lysosomal proteins Proteins 0.000 description 6
- 108010003700 lysyl aspartic acid Proteins 0.000 description 6
- 239000000463 material Substances 0.000 description 6
- 210000003205 muscle Anatomy 0.000 description 6
- 230000007935 neutral effect Effects 0.000 description 6
- 125000003729 nucleotide group Chemical group 0.000 description 6
- 239000002245 particle Substances 0.000 description 6
- 230000000069 prophylactic effect Effects 0.000 description 6
- 150000003839 salts Chemical class 0.000 description 6
- 238000012360 testing method Methods 0.000 description 6
- 108010061238 threonyl-glycine Proteins 0.000 description 6
- 238000013518 transcription Methods 0.000 description 6
- 230000035897 transcription Effects 0.000 description 6
- 210000003462 vein Anatomy 0.000 description 6
- 241000699666 Mus <mouse, genus> Species 0.000 description 5
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 5
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 5
- 241000700584 Simplexvirus Species 0.000 description 5
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 5
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 5
- 108010005233 alanylglutamic acid Proteins 0.000 description 5
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 5
- 239000003963 antioxidant agent Substances 0.000 description 5
- 235000006708 antioxidants Nutrition 0.000 description 5
- 108010077245 asparaginyl-proline Proteins 0.000 description 5
- 239000011324 bead Substances 0.000 description 5
- 210000004748 cultured cell Anatomy 0.000 description 5
- 230000003247 decreasing effect Effects 0.000 description 5
- 238000009826 distribution Methods 0.000 description 5
- 239000008103 glucose Substances 0.000 description 5
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 5
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 5
- 108010000761 leucylarginine Proteins 0.000 description 5
- 210000002540 macrophage Anatomy 0.000 description 5
- 108020004999 messenger RNA Proteins 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 239000002773 nucleotide Substances 0.000 description 5
- 239000000546 pharmaceutical excipient Substances 0.000 description 5
- 238000002360 preparation method Methods 0.000 description 5
- 239000003755 preservative agent Substances 0.000 description 5
- 239000011780 sodium chloride Substances 0.000 description 5
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 5
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 4
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 4
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 4
- 241000282472 Canis lupus familiaris Species 0.000 description 4
- 108090000565 Capsid Proteins Proteins 0.000 description 4
- YBSQGNFRWZKFMJ-UHFFFAOYSA-N Cerebroside B Natural products CCCCCCCCCCCCCCC(O)C(=O)NC(C(O)C=CCCC=C(C)CCCCCCCCC)COC1OC(CO)C(O)C(O)C1O YBSQGNFRWZKFMJ-UHFFFAOYSA-N 0.000 description 4
- 102100023321 Ceruloplasmin Human genes 0.000 description 4
- 241000699802 Cricetulus griseus Species 0.000 description 4
- 102000001189 Cyclic Peptides Human genes 0.000 description 4
- 108010069514 Cyclic Peptides Proteins 0.000 description 4
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical class OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 4
- 241000588724 Escherichia coli Species 0.000 description 4
- 241000282326 Felis catus Species 0.000 description 4
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 4
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 4
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 4
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 4
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 4
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 4
- 101001040800 Homo sapiens Integral membrane protein GPR180 Proteins 0.000 description 4
- 101000934372 Homo sapiens Macrosialin Proteins 0.000 description 4
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 4
- 108010065920 Insulin Lispro Proteins 0.000 description 4
- 102100021244 Integral membrane protein GPR180 Human genes 0.000 description 4
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 4
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 4
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 4
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 4
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 4
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 4
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 4
- 102100025136 Macrosialin Human genes 0.000 description 4
- 241000124008 Mammalia Species 0.000 description 4
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 4
- 108010066427 N-valyltryptophan Proteins 0.000 description 4
- 241000288906 Primates Species 0.000 description 4
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 4
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 4
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 4
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 4
- 102000019199 alpha-Mannosidase Human genes 0.000 description 4
- 108010012864 alpha-Mannosidase Proteins 0.000 description 4
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 4
- 108010093581 aspartyl-proline Proteins 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 4
- 230000015556 catabolic process Effects 0.000 description 4
- 238000005119 centrifugation Methods 0.000 description 4
- 229940049197 cerezyme Drugs 0.000 description 4
- 238000012512 characterization method Methods 0.000 description 4
- 239000007979 citrate buffer Substances 0.000 description 4
- 238000012761 co-transfection Methods 0.000 description 4
- 108010016616 cysteinylglycine Proteins 0.000 description 4
- 230000007547 defect Effects 0.000 description 4
- 231100000673 dose–response relationship Toxicity 0.000 description 4
- 235000019441 ethanol Nutrition 0.000 description 4
- 239000000284 extract Substances 0.000 description 4
- 238000001476 gene delivery Methods 0.000 description 4
- 230000002068 genetic effect Effects 0.000 description 4
- 108010079547 glutamylmethionine Proteins 0.000 description 4
- RWSXRVCMGQZWBV-WDSKDSINSA-N glutathione Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@@H](CS)C(=O)NCC(O)=O RWSXRVCMGQZWBV-WDSKDSINSA-N 0.000 description 4
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 4
- 108010089804 glycyl-threonine Proteins 0.000 description 4
- 108010034529 leucyl-lysine Proteins 0.000 description 4
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 4
- 230000000670 limiting effect Effects 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 238000001294 liquid chromatography-tandem mass spectrometry Methods 0.000 description 4
- 108010038320 lysylphenylalanine Proteins 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 230000035772 mutation Effects 0.000 description 4
- 239000006225 natural substrate Substances 0.000 description 4
- 229910052757 nitrogen Inorganic materials 0.000 description 4
- 210000001672 ovary Anatomy 0.000 description 4
- 238000004806 packaging method and process Methods 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- 210000000352 storage cell Anatomy 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- 238000011277 treatment modality Methods 0.000 description 4
- 108010084932 tryptophyl-proline Proteins 0.000 description 4
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 3
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- 229920000936 Agarose Polymers 0.000 description 3
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 3
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 3
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 3
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 3
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 3
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 3
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 3
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 3
- 241000711404 Avian avulavirus 1 Species 0.000 description 3
- WVDDGKGOMKODPV-UHFFFAOYSA-N Benzyl alcohol Chemical compound OCC1=CC=CC=C1 WVDDGKGOMKODPV-UHFFFAOYSA-N 0.000 description 3
- 241000283690 Bos taurus Species 0.000 description 3
- 239000004322 Butylated hydroxytoluene Substances 0.000 description 3
- 108010090461 DFG peptide Proteins 0.000 description 3
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 3
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 3
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 3
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 3
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 3
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 3
- 208000032007 Glycogen storage disease due to acid maltase deficiency Diseases 0.000 description 3
- 206010053185 Glycogen storage disease type II Diseases 0.000 description 3
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 3
- 241000238631 Hexapoda Species 0.000 description 3
- 241000282412 Homo Species 0.000 description 3
- 101000997662 Homo sapiens Lysosomal acid glucosylceramidase Proteins 0.000 description 3
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 3
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 3
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 3
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 3
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 3
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 3
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 3
- 239000000232 Lipid Bilayer Substances 0.000 description 3
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 3
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 3
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 3
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 3
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 3
- MUBZPKHOEPUJKR-UHFFFAOYSA-N Oxalic acid Chemical compound OC(=O)C(O)=O MUBZPKHOEPUJKR-UHFFFAOYSA-N 0.000 description 3
- 229910019142 PO4 Inorganic materials 0.000 description 3
- 241001494479 Pecora Species 0.000 description 3
- NOFBJKKOPKJDCO-KKXDTOCCSA-N Phe-Ala-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NOFBJKKOPKJDCO-KKXDTOCCSA-N 0.000 description 3
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 3
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 3
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 3
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 3
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 3
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 3
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 3
- 241000589516 Pseudomonas Species 0.000 description 3
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 3
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 3
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 3
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 3
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 3
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 3
- 241000282887 Suidae Species 0.000 description 3
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 3
- UHXOYRWHIQZAKV-SZMVWBNQSA-N Trp-Pro-Arg Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O UHXOYRWHIQZAKV-SZMVWBNQSA-N 0.000 description 3
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 3
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 3
- 238000010521 absorption reaction Methods 0.000 description 3
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 3
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 3
- 108010070944 alanylhistidine Proteins 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 235000010354 butylated hydroxytoluene Nutrition 0.000 description 3
- 150000001783 ceramides Chemical class 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 230000004186 co-expression Effects 0.000 description 3
- 238000000576 coating method Methods 0.000 description 3
- 235000018417 cysteine Nutrition 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 239000003085 diluting agent Substances 0.000 description 3
- 239000002552 dosage form Substances 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 238000004520 electroporation Methods 0.000 description 3
- 238000001952 enzyme assay Methods 0.000 description 3
- 239000013613 expression plasmid Substances 0.000 description 3
- 230000002538 fungal effect Effects 0.000 description 3
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 3
- 235000011187 glycerol Nutrition 0.000 description 3
- 201000004502 glycogen storage disease II Diseases 0.000 description 3
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 3
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 3
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 3
- 108010010147 glycylglutamine Proteins 0.000 description 3
- 108010081551 glycylphenylalanine Proteins 0.000 description 3
- 108010036413 histidylglycine Proteins 0.000 description 3
- 230000005847 immunogenicity Effects 0.000 description 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 3
- 210000003292 kidney cell Anatomy 0.000 description 3
- 108010064235 lysylglycine Proteins 0.000 description 3
- 108010017391 lysylvaline Proteins 0.000 description 3
- 239000002609 medium Substances 0.000 description 3
- 108010056582 methionylglutamic acid Proteins 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 210000002569 neuron Anatomy 0.000 description 3
- 239000000816 peptidomimetic Substances 0.000 description 3
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 3
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 3
- 235000021317 phosphate Nutrition 0.000 description 3
- 239000008363 phosphate buffer Substances 0.000 description 3
- 150000003904 phospholipids Chemical class 0.000 description 3
- 229920000575 polymersome Polymers 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 210000001236 prokaryotic cell Anatomy 0.000 description 3
- 108010031719 prolyl-serine Proteins 0.000 description 3
- 108010004914 prolylarginine Proteins 0.000 description 3
- 108010070643 prolylglutamic acid Proteins 0.000 description 3
- 230000002685 pulmonary effect Effects 0.000 description 3
- 238000001525 receptor binding assay Methods 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 230000001177 retroviral effect Effects 0.000 description 3
- 238000007363 ring formation reaction Methods 0.000 description 3
- 230000003248 secreting effect Effects 0.000 description 3
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 3
- 239000002904 solvent Substances 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 239000000725 suspension Substances 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 238000002560 therapeutic procedure Methods 0.000 description 3
- 230000014616 translation Effects 0.000 description 3
- 108010029384 tryptophyl-histidine Proteins 0.000 description 3
- 239000003981 vehicle Substances 0.000 description 3
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- PKOHVHWNGUHYRE-ZFWWWQNUSA-N (2s)-1-[2-[[(2s)-2-amino-3-(1h-indol-3-yl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)NCC(=O)N1CCC[C@H]1C(O)=O PKOHVHWNGUHYRE-ZFWWWQNUSA-N 0.000 description 2
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 2
- HOMYIYLRRDTKAA-UHFFFAOYSA-N 2-hydroxy-N-[3-hydroxy-1-[3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxyoctadeca-4,8-dien-2-yl]hexadecanamide Chemical compound CCCCCCCCCCCCCCC(O)C(=O)NC(C(O)C=CCCC=CCCCCCCCCC)COC1OC(CO)C(O)C(O)C1O HOMYIYLRRDTKAA-UHFFFAOYSA-N 0.000 description 2
- GHCZTIFQWKKGSB-UHFFFAOYSA-N 2-hydroxypropane-1,2,3-tricarboxylic acid;phosphoric acid Chemical compound OP(O)(O)=O.OC(=O)CC(O)(C(O)=O)CC(O)=O GHCZTIFQWKKGSB-UHFFFAOYSA-N 0.000 description 2
- YUDPTGPSBJVHCN-NZBFACKJSA-N 4-methyl-7-[(2s,3s,4s,5s,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxychromen-2-one Chemical compound C1=CC=2C(C)=CC(=O)OC=2C=C1O[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@@H]1O YUDPTGPSBJVHCN-NZBFACKJSA-N 0.000 description 2
- HSHNITRMYYLLCV-UHFFFAOYSA-N 4-methylumbelliferone Chemical compound C1=C(O)C=CC2=C1OC(=O)C=C2C HSHNITRMYYLLCV-UHFFFAOYSA-N 0.000 description 2
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 2
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 2
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 2
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 2
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 2
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 2
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 2
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 2
- 241000710929 Alphavirus Species 0.000 description 2
- BIOCIVSVEDFKDJ-GUBZILKMSA-N Arg-Arg-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O BIOCIVSVEDFKDJ-GUBZILKMSA-N 0.000 description 2
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 2
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 2
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 2
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 2
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 2
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 2
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 2
- ZCSHHTFOZULVLN-SZMVWBNQSA-N Arg-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 ZCSHHTFOZULVLN-SZMVWBNQSA-N 0.000 description 2
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 2
- UTSMXMABBPFVJP-SZMVWBNQSA-N Arg-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UTSMXMABBPFVJP-SZMVWBNQSA-N 0.000 description 2
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 2
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 2
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 2
- JZRLLSOWDYUKOK-SRVKXCTJSA-N Asn-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N JZRLLSOWDYUKOK-SRVKXCTJSA-N 0.000 description 2
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 2
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 2
- LZLCLRQMUQWUHJ-GUBZILKMSA-N Asn-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N LZLCLRQMUQWUHJ-GUBZILKMSA-N 0.000 description 2
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 2
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 2
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 2
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 2
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 2
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 2
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 2
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 2
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 2
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 2
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 2
- LIQNMKIBMPEOOP-IHRRRGAJSA-N Asp-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)O)N LIQNMKIBMPEOOP-IHRRRGAJSA-N 0.000 description 2
- NONWUQAWAANERO-BZSNNMDCSA-N Asp-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 NONWUQAWAANERO-BZSNNMDCSA-N 0.000 description 2
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 2
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 2
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 2
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 2
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 241000186216 Corynebacterium Species 0.000 description 2
- GSNRZJNHMVMOFV-ACZMJKKPSA-N Cys-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N GSNRZJNHMVMOFV-ACZMJKKPSA-N 0.000 description 2
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 2
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 2
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 2
- 102100031149 Deoxyribonuclease gamma Human genes 0.000 description 2
- 241000605786 Desulfovibrio sp. Species 0.000 description 2
- 101100118093 Drosophila melanogaster eEF1alpha2 gene Proteins 0.000 description 2
- 241000283086 Equidae Species 0.000 description 2
- 208000024720 Fabry Disease Diseases 0.000 description 2
- 241000710831 Flavivirus Species 0.000 description 2
- 102100028496 Galactocerebrosidase Human genes 0.000 description 2
- 108010042681 Galactosylceramidase Proteins 0.000 description 2
- 108010010803 Gelatin Proteins 0.000 description 2
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 2
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 2
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 2
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 2
- UWKPRVKWEKEMSY-DCAQKATOSA-N Gln-Lys-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWKPRVKWEKEMSY-DCAQKATOSA-N 0.000 description 2
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 2
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 2
- UWMDGPFFTKDUIY-HJGDQZAQSA-N Gln-Pro-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWMDGPFFTKDUIY-HJGDQZAQSA-N 0.000 description 2
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 2
- RSUVOPBMWMTVDI-XEGUGMAKSA-N Glu-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(O)=O)C)C(O)=O)=CNC2=C1 RSUVOPBMWMTVDI-XEGUGMAKSA-N 0.000 description 2
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 2
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 2
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 2
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 2
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 2
- 108010017544 Glucosylceramidase Proteins 0.000 description 2
- 102000004547 Glucosylceramidase Human genes 0.000 description 2
- 108010024636 Glutathione Proteins 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 2
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 2
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 2
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 2
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 2
- JNGHLWWFPGIJER-STQMWFEESA-N Gly-Pro-Tyr Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JNGHLWWFPGIJER-STQMWFEESA-N 0.000 description 2
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 2
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 2
- IMRNSEPSPFQNHF-STQMWFEESA-N Gly-Ser-Trp Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O IMRNSEPSPFQNHF-STQMWFEESA-N 0.000 description 2
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 2
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 2
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 2
- AEMRFAOFKBGASW-UHFFFAOYSA-N Glycolic acid Chemical compound OCC(O)=O AEMRFAOFKBGASW-UHFFFAOYSA-N 0.000 description 2
- 101710154606 Hemagglutinin Proteins 0.000 description 2
- SVHKVHBPTOMLTO-DCAQKATOSA-N His-Arg-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SVHKVHBPTOMLTO-DCAQKATOSA-N 0.000 description 2
- FLYSHWAAHYNKRT-JYJNAYRXSA-N His-Gln-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FLYSHWAAHYNKRT-JYJNAYRXSA-N 0.000 description 2
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 2
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 2
- 101000979046 Homo sapiens Lysosomal alpha-mannosidase Proteins 0.000 description 2
- 108090000144 Human Proteins Proteins 0.000 description 2
- 102000003839 Human Proteins Human genes 0.000 description 2
- 241000700588 Human alphaherpesvirus 1 Species 0.000 description 2
- 241000701074 Human alphaherpesvirus 2 Species 0.000 description 2
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 2
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 2
- LOXMWQOKYBGCHF-JBDRJPRFSA-N Ile-Cys-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O LOXMWQOKYBGCHF-JBDRJPRFSA-N 0.000 description 2
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 2
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 2
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 2
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 2
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 2
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 2
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 2
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 2
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 2
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 2
- FEHQLKKBVJHSEC-SZMVWBNQSA-N Leu-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FEHQLKKBVJHSEC-SZMVWBNQSA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 2
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 2
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 2
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- PKKMDPNFGULLNQ-AVGNSLFASA-N Leu-Met-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O PKKMDPNFGULLNQ-AVGNSLFASA-N 0.000 description 2
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 2
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 2
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 2
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 2
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 2
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 2
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 2
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 2
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 2
- WBRJVRXEGQIDRK-XIRDDKMYSA-N Leu-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 WBRJVRXEGQIDRK-XIRDDKMYSA-N 0.000 description 2
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 2
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 2
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 2
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 2
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 2
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 2
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 2
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 2
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 2
- 229930195725 Mannitol Natural products 0.000 description 2
- 241000712079 Measles morbillivirus Species 0.000 description 2
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 2
- 208000002678 Mucopolysaccharidoses Diseases 0.000 description 2
- 241001529936 Murinae Species 0.000 description 2
- 241000699660 Mus musculus Species 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- 108010047562 NGR peptide Proteins 0.000 description 2
- 102000015636 Oligopeptides Human genes 0.000 description 2
- 108010038807 Oligopeptides Proteins 0.000 description 2
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 2
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 2
- 102000004316 Oxidoreductases Human genes 0.000 description 2
- 108090000854 Oxidoreductases Proteins 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- MDHZEOMXGNBSIL-DLOVCJGASA-N Phe-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N MDHZEOMXGNBSIL-DLOVCJGASA-N 0.000 description 2
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 2
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 2
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 2
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 2
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 2
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 2
- ODGNUUUDJONJSC-UFYCRDLUSA-N Phe-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O ODGNUUUDJONJSC-UFYCRDLUSA-N 0.000 description 2
- MHNBYYFXWDUGBW-RPTUDFQQSA-N Phe-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O MHNBYYFXWDUGBW-RPTUDFQQSA-N 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 2
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- 239000002202 Polyethylene glycol Substances 0.000 description 2
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 2
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 2
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 2
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 2
- ZYBUKTMPPFQSHL-JYJNAYRXSA-N Pro-Asp-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZYBUKTMPPFQSHL-JYJNAYRXSA-N 0.000 description 2
- YKQNVTOIYFQMLW-IHRRRGAJSA-N Pro-Cys-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 YKQNVTOIYFQMLW-IHRRRGAJSA-N 0.000 description 2
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 2
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 2
- WSRWHZRUOCACLJ-UWVGGRQHSA-N Pro-Gly-His Chemical compound C([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H]1NCCC1)C1=CN=CN1 WSRWHZRUOCACLJ-UWVGGRQHSA-N 0.000 description 2
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 2
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 2
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 2
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 2
- FZXSYIPVAFVYBH-KKUMJFAQSA-N Pro-Tyr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O FZXSYIPVAFVYBH-KKUMJFAQSA-N 0.000 description 2
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 2
- 101710176177 Protein A56 Proteins 0.000 description 2
- 108010076504 Protein Sorting Signals Proteins 0.000 description 2
- LCTONWCANYUPML-UHFFFAOYSA-N Pyruvic acid Chemical compound CC(=O)C(O)=O LCTONWCANYUPML-UHFFFAOYSA-N 0.000 description 2
- 108010079005 RDV peptide Proteins 0.000 description 2
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 2
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 2
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 2
- HZWAHWQZPSXNCB-BPUTZDHNSA-N Ser-Arg-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HZWAHWQZPSXNCB-BPUTZDHNSA-N 0.000 description 2
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 2
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 2
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 2
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 2
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 2
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 2
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 2
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 2
- AMRRYKHCILPAKD-FXQIFTODSA-N Ser-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N AMRRYKHCILPAKD-FXQIFTODSA-N 0.000 description 2
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 2
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 2
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 2
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 2
- 241000607714 Serratia sp. Species 0.000 description 2
- 241000194017 Streptococcus Species 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 2
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 2
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 2
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 2
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 2
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 2
- IJKNKFJZOJCKRR-GBALPHGKSA-N Thr-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 IJKNKFJZOJCKRR-GBALPHGKSA-N 0.000 description 2
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 2
- HYLNRGXEQACDKG-NYVOZVTQSA-N Trp-Asn-Trp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HYLNRGXEQACDKG-NYVOZVTQSA-N 0.000 description 2
- DEZKIRSBKKXUEV-NYVOZVTQSA-N Trp-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N DEZKIRSBKKXUEV-NYVOZVTQSA-N 0.000 description 2
- HNIWONZFMIPCCT-SIXJUCDHSA-N Trp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N HNIWONZFMIPCCT-SIXJUCDHSA-N 0.000 description 2
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 2
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 2
- KRCPXGSWDOGHAM-XIRDDKMYSA-N Trp-Lys-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O KRCPXGSWDOGHAM-XIRDDKMYSA-N 0.000 description 2
- UQHPXCFAHVTWFU-BVSLBCMMSA-N Trp-Phe-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UQHPXCFAHVTWFU-BVSLBCMMSA-N 0.000 description 2
- UJGDFQRPYGJBEH-AAEUAGOBSA-N Trp-Ser-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N UJGDFQRPYGJBEH-AAEUAGOBSA-N 0.000 description 2
- WSMVEHPVOYXPAQ-XIRDDKMYSA-N Trp-Ser-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N WSMVEHPVOYXPAQ-XIRDDKMYSA-N 0.000 description 2
- WBZOZLNLXVBCNW-LTHWPDAASA-N Trp-Thr-Ile Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)[C@@H](C)O)=CNC2=C1 WBZOZLNLXVBCNW-LTHWPDAASA-N 0.000 description 2
- DVLHKUWLNKDINO-PMVMPFDFSA-N Trp-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DVLHKUWLNKDINO-PMVMPFDFSA-N 0.000 description 2
- CRHFOYCJGVJPLE-AVGNSLFASA-N Tyr-Gln-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CRHFOYCJGVJPLE-AVGNSLFASA-N 0.000 description 2
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 2
- XQYHLZNPOTXRMQ-KKUMJFAQSA-N Tyr-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XQYHLZNPOTXRMQ-KKUMJFAQSA-N 0.000 description 2
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 2
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 2
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 2
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 2
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 2
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 2
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 2
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 2
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 2
- KVRLNEILGGVBJX-IHRRRGAJSA-N Val-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CN=CN1 KVRLNEILGGVBJX-IHRRRGAJSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 2
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 2
- HVRRJRMULCPNRO-BZSNNMDCSA-N Val-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 HVRRJRMULCPNRO-BZSNNMDCSA-N 0.000 description 2
- NGXQOQNXSGOYOI-BQFCYCMXSA-N Val-Trp-Gln Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 NGXQOQNXSGOYOI-BQFCYCMXSA-N 0.000 description 2
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- 239000012190 activator Substances 0.000 description 2
- 239000002671 adjuvant Substances 0.000 description 2
- 239000000443 aerosol Substances 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- 108010011559 alanylphenylalanine Proteins 0.000 description 2
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 2
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 230000000844 anti-bacterial effect Effects 0.000 description 2
- 229940121375 antifungal agent Drugs 0.000 description 2
- 239000003429 antifungal agent Substances 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 239000012736 aqueous medium Substances 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 2
- 235000010323 ascorbic acid Nutrition 0.000 description 2
- 239000011668 ascorbic acid Substances 0.000 description 2
- 229960005070 ascorbic acid Drugs 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- WPYMKLBDIGXBTP-UHFFFAOYSA-N benzoic acid Chemical compound OC(=O)C1=CC=CC=C1 WPYMKLBDIGXBTP-UHFFFAOYSA-N 0.000 description 2
- 230000008827 biological function Effects 0.000 description 2
- 239000012472 biological sample Substances 0.000 description 2
- 210000000601 blood cell Anatomy 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 239000001506 calcium phosphate Substances 0.000 description 2
- 229910000389 calcium phosphate Inorganic materials 0.000 description 2
- 235000011010 calcium phosphates Nutrition 0.000 description 2
- 210000004413 cardiac myocyte Anatomy 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- OSASVXMJTNOKOY-UHFFFAOYSA-N chlorobutanol Chemical compound CC(C)(O)C(Cl)(Cl)Cl OSASVXMJTNOKOY-UHFFFAOYSA-N 0.000 description 2
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 2
- 235000015165 citric acid Nutrition 0.000 description 2
- 239000003086 colorant Substances 0.000 description 2
- 238000002648 combination therapy Methods 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- YPHMISFOHDHNIV-FSZOTQKASA-N cycloheximide Chemical compound C1[C@@H](C)C[C@H](C)C(=O)[C@@H]1[C@H](O)CC1CC(=O)NC(=O)C1 YPHMISFOHDHNIV-FSZOTQKASA-N 0.000 description 2
- 150000001945 cysteines Chemical class 0.000 description 2
- 230000007812 deficiency Effects 0.000 description 2
- 230000002950 deficient Effects 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 238000002716 delivery method Methods 0.000 description 2
- 108010031616 deoxyribonuclease gamma Proteins 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- RNPXCFINMKSQPQ-UHFFFAOYSA-N dicetyl hydrogen phosphate Chemical compound CCCCCCCCCCCCCCCCOP(O)(=O)OCCCCCCCCCCCCCCCC RNPXCFINMKSQPQ-UHFFFAOYSA-N 0.000 description 2
- XBDQKXXYIPTUBI-UHFFFAOYSA-N dimethylselenoniopropionate Natural products CCC(O)=O XBDQKXXYIPTUBI-UHFFFAOYSA-N 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 239000006185 dispersion Substances 0.000 description 2
- 239000003995 emulsifying agent Substances 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 210000003743 erythrocyte Anatomy 0.000 description 2
- BEFDCLMNVWHSGT-UHFFFAOYSA-N ethenylcyclopentane Chemical compound C=CC1CCCC1 BEFDCLMNVWHSGT-UHFFFAOYSA-N 0.000 description 2
- MMXKVMNBHPAILY-UHFFFAOYSA-N ethyl laurate Chemical compound CCCCCCCCCCCC(=O)OCC MMXKVMNBHPAILY-UHFFFAOYSA-N 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 230000002349 favourable effect Effects 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 229930182830 galactose Natural products 0.000 description 2
- 239000008273 gelatin Substances 0.000 description 2
- 229920000159 gelatin Polymers 0.000 description 2
- 235000019322 gelatine Nutrition 0.000 description 2
- 235000011852 gelatine desserts Nutrition 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 2
- 229960003180 glutathione Drugs 0.000 description 2
- 150000004676 glycans Chemical class 0.000 description 2
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 239000000185 hemagglutinin Substances 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 230000013632 homeostatic process Effects 0.000 description 2
- 238000003364 immunohistochemistry Methods 0.000 description 2
- 230000001976 improved effect Effects 0.000 description 2
- 208000016245 inborn errors of metabolism Diseases 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 238000001802 infusion Methods 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 230000002601 intratumoral effect Effects 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 2
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 2
- 210000000265 leukocyte Anatomy 0.000 description 2
- 230000007774 long-term Effects 0.000 description 2
- 239000012139 lysis buffer Substances 0.000 description 2
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 2
- 235000010355 mannitol Nutrition 0.000 description 2
- 239000000594 mannitol Substances 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 206010028093 mucopolysaccharidosis Diseases 0.000 description 2
- 210000000663 muscle cell Anatomy 0.000 description 2
- 231100000252 nontoxic Toxicity 0.000 description 2
- 230000003000 nontoxic effect Effects 0.000 description 2
- 230000000174 oncolytic effect Effects 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 150000007524 organic acids Chemical class 0.000 description 2
- 230000036961 partial effect Effects 0.000 description 2
- 230000037361 pathway Effects 0.000 description 2
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 2
- 230000007030 peptide scission Effects 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 2
- 239000010452 phosphate Substances 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 229920005862 polyol Polymers 0.000 description 2
- 150000003077 polyols Chemical class 0.000 description 2
- 230000004481 post-translational protein modification Effects 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 230000002335 preservative effect Effects 0.000 description 2
- 230000002265 prevention Effects 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 230000017854 proteolysis Effects 0.000 description 2
- YGSDEFSMJLZEOE-UHFFFAOYSA-N salicylic acid Chemical compound OC(=O)C1=CC=CC=C1O YGSDEFSMJLZEOE-UHFFFAOYSA-N 0.000 description 2
- 229920006395 saturated elastomer Polymers 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- DAEPDZWVDSPTHF-UHFFFAOYSA-M sodium pyruvate Chemical compound [Na+].CC(=O)C([O-])=O DAEPDZWVDSPTHF-UHFFFAOYSA-M 0.000 description 2
- JAJWGJBVLPIOOH-IZYKLYLVSA-M sodium taurocholate Chemical compound [Na+].C([C@H]1C[C@H]2O)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(=O)NCCS([O-])(=O)=O)C)[C@@]2(C)[C@@H](O)C1 JAJWGJBVLPIOOH-IZYKLYLVSA-M 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 235000010199 sorbic acid Nutrition 0.000 description 2
- 239000004334 sorbic acid Substances 0.000 description 2
- 229940075582 sorbic acid Drugs 0.000 description 2
- 239000000600 sorbitol Substances 0.000 description 2
- 235000010356 sorbitol Nutrition 0.000 description 2
- 108010005652 splenotritin Proteins 0.000 description 2
- 239000003381 stabilizer Substances 0.000 description 2
- 238000010186 staining Methods 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 235000000346 sugar Nutrition 0.000 description 2
- 150000008163 sugars Chemical class 0.000 description 2
- 239000000829 suppository Substances 0.000 description 2
- 239000004094 surface-active agent Substances 0.000 description 2
- 239000000375 suspending agent Substances 0.000 description 2
- 230000004797 therapeutic response Effects 0.000 description 2
- 239000002562 thickening agent Substances 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 238000011200 topical administration Methods 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 2
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 2
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- 239000000080 wetting agent Substances 0.000 description 2
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 1
- NTUPOKHATNSWCY-PMPSAXMXSA-N (2s)-2-[[(2s)-1-[(2r)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C([C@@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=CC=C1 NTUPOKHATNSWCY-PMPSAXMXSA-N 0.000 description 1
- WRIDQFICGBMAFQ-UHFFFAOYSA-N (E)-8-Octadecenoic acid Natural products CCCCCCCCCC=CCCCCCCC(O)=O WRIDQFICGBMAFQ-UHFFFAOYSA-N 0.000 description 1
- GVJHHUAWPYXKBD-IEOSBIPESA-N (R)-alpha-Tocopherol Natural products OC1=C(C)C(C)=C2O[C@@](CCC[C@H](C)CCC[C@H](C)CCCC(C)C)(C)CCC2=C1C GVJHHUAWPYXKBD-IEOSBIPESA-N 0.000 description 1
- BJEPYKJPYRNKOW-REOHCLBHSA-N (S)-malic acid Chemical compound OC(=O)[C@@H](O)CC(O)=O BJEPYKJPYRNKOW-REOHCLBHSA-N 0.000 description 1
- IIZPXYDJLKNOIY-JXPKJXOSSA-N 1-palmitoyl-2-arachidonoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCC\C=C/C\C=C/C\C=C/C\C=C/CCCCC IIZPXYDJLKNOIY-JXPKJXOSSA-N 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- LBLYYCQCTBFVLH-UHFFFAOYSA-N 2-Methylbenzenesulfonic acid Chemical compound CC1=CC=CC=C1S(O)(=O)=O LBLYYCQCTBFVLH-UHFFFAOYSA-N 0.000 description 1
- QMOQBVOBWVNSNO-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QMOQBVOBWVNSNO-UHFFFAOYSA-N 0.000 description 1
- KISWVXRQTGLFGD-UHFFFAOYSA-N 2-[[2-[[6-amino-2-[[2-[[2-[[5-amino-2-[[2-[[1-[2-[[6-amino-2-[(2,5-diamino-5-oxopentanoyl)amino]hexanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carbonyl]amino]-3-hydroxypropanoyl]amino]-5-oxopentanoyl]amino]-5-(diaminomethylideneamino)p Chemical compound C1CCN(C(=O)C(CCCN=C(N)N)NC(=O)C(CCCCN)NC(=O)C(N)CCC(N)=O)C1C(=O)NC(CO)C(=O)NC(CCC(N)=O)C(=O)NC(CCCN=C(N)N)C(=O)NC(CO)C(=O)NC(CCCCN)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 KISWVXRQTGLFGD-UHFFFAOYSA-N 0.000 description 1
- LQJBNNIYVWPHFW-UHFFFAOYSA-N 20:1omega9c fatty acid Natural products CCCCCCCCCCC=CCCCCCCCC(O)=O LQJBNNIYVWPHFW-UHFFFAOYSA-N 0.000 description 1
- BMYNFMYTOJXKLE-UHFFFAOYSA-N 3-azaniumyl-2-hydroxypropanoate Chemical compound NCC(O)C(O)=O BMYNFMYTOJXKLE-UHFFFAOYSA-N 0.000 description 1
- YUDPTGPSBJVHCN-JZYAIQKZSA-N 4-Methylumbelliferyl-alpha-D-glucopyranoside Chemical compound C1=CC=2C(C)=CC(=O)OC=2C=C1O[C@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O YUDPTGPSBJVHCN-JZYAIQKZSA-N 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- IMIZPWSVYADSCN-UHFFFAOYSA-N 4-methyl-2-[[4-methyl-2-[[4-methyl-2-(pyrrolidine-2-carbonylamino)pentanoyl]amino]pentanoyl]amino]pentanoic acid Chemical compound CC(C)CC(C(O)=O)NC(=O)C(CC(C)C)NC(=O)C(CC(C)C)NC(=O)C1CCCN1 IMIZPWSVYADSCN-UHFFFAOYSA-N 0.000 description 1
- PSGQCCSGKGJLRL-UHFFFAOYSA-N 4-methyl-2h-chromen-2-one Chemical group C1=CC=CC2=C1OC(=O)C=C2C PSGQCCSGKGJLRL-UHFFFAOYSA-N 0.000 description 1
- YUDPTGPSBJVHCN-CHUNWDLHSA-N 4-methylumbelliferyl alpha-D-galactoside Chemical compound C1=CC=2C(C)=CC(=O)OC=2C=C1O[C@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O YUDPTGPSBJVHCN-CHUNWDLHSA-N 0.000 description 1
- 108020005029 5' Flanking Region Proteins 0.000 description 1
- QSBYPNXLFMSGKH-UHFFFAOYSA-N 9-Heptadecenoic acid Natural products CCCCCCCC=CCCCCCCCC(O)=O QSBYPNXLFMSGKH-UHFFFAOYSA-N 0.000 description 1
- 241000093709 Acetobacterium sp. Species 0.000 description 1
- 241000588625 Acinetobacter sp. Species 0.000 description 1
- 241000131104 Actinobacillus sp. Species 0.000 description 1
- 241001156739 Actinobacteria <phylum> Species 0.000 description 1
- 241000456624 Actinobacteria bacterium Species 0.000 description 1
- 241001147825 Actinomyces sp. Species 0.000 description 1
- 108010024878 Adenovirus E1A Proteins Proteins 0.000 description 1
- 241000256173 Aedes albopictus Species 0.000 description 1
- SBGXWWCLHIOABR-UHFFFAOYSA-N Ala Ala Gly Ala Chemical compound CC(N)C(=O)NC(C)C(=O)NCC(=O)NC(C)C(O)=O SBGXWWCLHIOABR-UHFFFAOYSA-N 0.000 description 1
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 1
- ZFXQNADNEBRERM-BJDJZHNGSA-N Ala-Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 ZFXQNADNEBRERM-BJDJZHNGSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- DWINFPQUSSHSFS-UVBJJODRSA-N Ala-Arg-Trp Chemical compound N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O DWINFPQUSSHSFS-UVBJJODRSA-N 0.000 description 1
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 1
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 1
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 1
- DAEFQZCYZKRTLR-ZLUOBGJFSA-N Ala-Cys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O DAEFQZCYZKRTLR-ZLUOBGJFSA-N 0.000 description 1
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 1
- CRWFEKLFPVRPBV-CIUDSAMLSA-N Ala-Gln-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CRWFEKLFPVRPBV-CIUDSAMLSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- BTBUEVAGZCKULD-XPUUQOCRSA-N Ala-Gly-His Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CN=CN1 BTBUEVAGZCKULD-XPUUQOCRSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 1
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 1
- NJWJSLCQEDMGNC-MBLNEYKQSA-N Ala-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N)O NJWJSLCQEDMGNC-MBLNEYKQSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 1
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 1
- OPZJWMJPCNNZNT-DCAQKATOSA-N Ala-Leu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N OPZJWMJPCNNZNT-DCAQKATOSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- UWIQWPWWZUHBAO-ZLIFDBKOSA-N Ala-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)CC(C)C)C(O)=O)=CNC2=C1 UWIQWPWWZUHBAO-ZLIFDBKOSA-N 0.000 description 1
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 1
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 1
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 1
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 1
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 1
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 1
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 1
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 1
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 1
- DYJJJCHDHLEFDW-FXQIFTODSA-N Ala-Pro-Cys Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N DYJJJCHDHLEFDW-FXQIFTODSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- UCDOXFBTMLKASE-HERUPUMHSA-N Ala-Ser-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N UCDOXFBTMLKASE-HERUPUMHSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- ZVWXMTTZJKBJCI-BHDSKKPTSA-N Ala-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 ZVWXMTTZJKBJCI-BHDSKKPTSA-N 0.000 description 1
- LFFOJBOTZUWINF-ZANVPECISA-N Ala-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O)=CNC2=C1 LFFOJBOTZUWINF-ZANVPECISA-N 0.000 description 1
- XPBVBZPVNFIHOA-UVBJJODRSA-N Ala-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 XPBVBZPVNFIHOA-UVBJJODRSA-N 0.000 description 1
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 1
- BHFOJPDOQPWJRN-XDTLVQLUSA-N Ala-Tyr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(O)=O BHFOJPDOQPWJRN-XDTLVQLUSA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- BOKLLPVAQDSLHC-FXQIFTODSA-N Ala-Val-Cys Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N BOKLLPVAQDSLHC-FXQIFTODSA-N 0.000 description 1
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- ZDILXFDENZVOTL-BPNCWPANSA-N Ala-Val-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDILXFDENZVOTL-BPNCWPANSA-N 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- 241001135756 Alphaproteobacteria Species 0.000 description 1
- 241000565344 Anhinga anhinga Species 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- KGSJCPBERYUXCN-BPNCWPANSA-N Arg-Ala-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KGSJCPBERYUXCN-BPNCWPANSA-N 0.000 description 1
- KJGNDQCYBNBXDA-GUBZILKMSA-N Arg-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N KJGNDQCYBNBXDA-GUBZILKMSA-N 0.000 description 1
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 1
- JTKLCCFLSLCCST-SZMVWBNQSA-N Arg-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JTKLCCFLSLCCST-SZMVWBNQSA-N 0.000 description 1
- RWWPBOUMKFBHAL-FXQIFTODSA-N Arg-Asn-Cys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O RWWPBOUMKFBHAL-FXQIFTODSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- RRGPUNYIPJXJBU-GUBZILKMSA-N Arg-Asp-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O RRGPUNYIPJXJBU-GUBZILKMSA-N 0.000 description 1
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 1
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 1
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 1
- PPPXVIBMLFWNSK-BQBZGAKWSA-N Arg-Gly-Cys Chemical compound C(C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N PPPXVIBMLFWNSK-BQBZGAKWSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- QKSAZKCRVQYYGS-UWVGGRQHSA-N Arg-Gly-His Chemical compound N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O QKSAZKCRVQYYGS-UWVGGRQHSA-N 0.000 description 1
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 1
- CVKOQHYVDVYJSI-QTKMDUPCSA-N Arg-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N)O CVKOQHYVDVYJSI-QTKMDUPCSA-N 0.000 description 1
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 1
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 1
- FNXCAFKDGBROCU-STECZYCISA-N Arg-Ile-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FNXCAFKDGBROCU-STECZYCISA-N 0.000 description 1
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- IIAXFBUTKIDDIP-ULQDDVLXSA-N Arg-Leu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IIAXFBUTKIDDIP-ULQDDVLXSA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- OGSQONVYSTZIJB-WDSOQIARSA-N Arg-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OGSQONVYSTZIJB-WDSOQIARSA-N 0.000 description 1
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 1
- MTYLORHAQXVQOW-AVGNSLFASA-N Arg-Lys-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O MTYLORHAQXVQOW-AVGNSLFASA-N 0.000 description 1
- QBQVKUNBCAFXSV-ULQDDVLXSA-N Arg-Lys-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QBQVKUNBCAFXSV-ULQDDVLXSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- HIMXTOIXVXWHTB-DCAQKATOSA-N Arg-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HIMXTOIXVXWHTB-DCAQKATOSA-N 0.000 description 1
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 1
- INXWADWANGLMPJ-JYJNAYRXSA-N Arg-Phe-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CC1=CC=CC=C1 INXWADWANGLMPJ-JYJNAYRXSA-N 0.000 description 1
- VEAIMHJZTIDCIH-KKUMJFAQSA-N Arg-Phe-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEAIMHJZTIDCIH-KKUMJFAQSA-N 0.000 description 1
- MNBHKGYCLBUIBC-UFYCRDLUSA-N Arg-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCNC(N)=N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MNBHKGYCLBUIBC-UFYCRDLUSA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 1
- 108010051330 Arg-Pro-Gly-Pro Proteins 0.000 description 1
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 1
- YFHATWYGAAXQCF-JYJNAYRXSA-N Arg-Pro-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YFHATWYGAAXQCF-JYJNAYRXSA-N 0.000 description 1
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 1
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 1
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- JQHASVQBAKRJKD-GUBZILKMSA-N Arg-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JQHASVQBAKRJKD-GUBZILKMSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- FBXMCPLCVYUWBO-BPUTZDHNSA-N Arg-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N FBXMCPLCVYUWBO-BPUTZDHNSA-N 0.000 description 1
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- AOJYORNRFWWEIV-IHRRRGAJSA-N Arg-Tyr-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 AOJYORNRFWWEIV-IHRRRGAJSA-N 0.000 description 1
- XRLOBFSLPCHYLQ-ULQDDVLXSA-N Arg-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O XRLOBFSLPCHYLQ-ULQDDVLXSA-N 0.000 description 1
- QJWLLRZTJFPCHA-STECZYCISA-N Arg-Tyr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QJWLLRZTJFPCHA-STECZYCISA-N 0.000 description 1
- QCTOLCVIGRLMQS-HRCADAONSA-N Arg-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O QCTOLCVIGRLMQS-HRCADAONSA-N 0.000 description 1
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 1
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 1
- CNBIWSCSSCAINS-UFYCRDLUSA-N Arg-Tyr-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNBIWSCSSCAINS-UFYCRDLUSA-N 0.000 description 1
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 1
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 1
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 1
- WHLDJYNHXOMGMU-JYJNAYRXSA-N Arg-Val-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WHLDJYNHXOMGMU-JYJNAYRXSA-N 0.000 description 1
- ANAHQDPQQBDOBM-UHFFFAOYSA-N Arg-Val-Tyr Natural products CC(C)C(NC(=O)C(N)CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O ANAHQDPQQBDOBM-UHFFFAOYSA-N 0.000 description 1
- 241000186073 Arthrobacter sp. Species 0.000 description 1
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 1
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 1
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 1
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 1
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 1
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 1
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 1
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 1
- SPIPSJXLZVTXJL-ZLUOBGJFSA-N Asn-Cys-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O SPIPSJXLZVTXJL-ZLUOBGJFSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- OGMDXNFGPOPZTK-GUBZILKMSA-N Asn-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N OGMDXNFGPOPZTK-GUBZILKMSA-N 0.000 description 1
- COUZKSSMBFADSB-AVGNSLFASA-N Asn-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N COUZKSSMBFADSB-AVGNSLFASA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 1
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- VXLBDJWTONZHJN-YUMQZZPRSA-N Asn-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N VXLBDJWTONZHJN-YUMQZZPRSA-N 0.000 description 1
- WQLJRNRLHWJIRW-KKUMJFAQSA-N Asn-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)O WQLJRNRLHWJIRW-KKUMJFAQSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- FODVBOKTYKYRFJ-CIUDSAMLSA-N Asn-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N FODVBOKTYKYRFJ-CIUDSAMLSA-N 0.000 description 1
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 1
- DMRUJUFCPVHHKP-UHFFFAOYSA-N Asn-Met-Asn-Gln Chemical compound NC(=O)CC(N)C(=O)NC(CCSC)C(=O)NC(CC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O DMRUJUFCPVHHKP-UHFFFAOYSA-N 0.000 description 1
- UYRPHDGXHKBZHJ-CIUDSAMLSA-N Asn-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N UYRPHDGXHKBZHJ-CIUDSAMLSA-N 0.000 description 1
- AEZCCDMZZJOGII-DCAQKATOSA-N Asn-Met-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O AEZCCDMZZJOGII-DCAQKATOSA-N 0.000 description 1
- ZVUMKOMKQCANOM-AVGNSLFASA-N Asn-Phe-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVUMKOMKQCANOM-AVGNSLFASA-N 0.000 description 1
- BKZFBJYIVSBXCO-KKUMJFAQSA-N Asn-Phe-His Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O BKZFBJYIVSBXCO-KKUMJFAQSA-N 0.000 description 1
- YUUIAUXBNOHFRJ-IHRRRGAJSA-N Asn-Phe-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O YUUIAUXBNOHFRJ-IHRRRGAJSA-N 0.000 description 1
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 1
- RBOBTTLFPRSXKZ-BZSNNMDCSA-N Asn-Phe-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RBOBTTLFPRSXKZ-BZSNNMDCSA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 1
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 1
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 1
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- UPAGTDJAORYMEC-VHWLVUOQSA-N Asn-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)N)N UPAGTDJAORYMEC-VHWLVUOQSA-N 0.000 description 1
- ULZOQOKFYMXHPZ-AQZXSJQPSA-N Asn-Trp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ULZOQOKFYMXHPZ-AQZXSJQPSA-N 0.000 description 1
- LGCVSPFCFXWUEY-IHPCNDPISA-N Asn-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N LGCVSPFCFXWUEY-IHPCNDPISA-N 0.000 description 1
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 1
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 1
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 1
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- HTOZUYZQPICRAP-BPUTZDHNSA-N Asp-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N HTOZUYZQPICRAP-BPUTZDHNSA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 1
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- ZCKYZTGLXIEOKS-CIUDSAMLSA-N Asp-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N ZCKYZTGLXIEOKS-CIUDSAMLSA-N 0.000 description 1
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 1
- SPKRHJOVRVDJGG-CIUDSAMLSA-N Asp-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N SPKRHJOVRVDJGG-CIUDSAMLSA-N 0.000 description 1
- OEUQMKNNOWJREN-AVGNSLFASA-N Asp-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N OEUQMKNNOWJREN-AVGNSLFASA-N 0.000 description 1
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 1
- RATOMFTUDRYMKX-ACZMJKKPSA-N Asp-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N RATOMFTUDRYMKX-ACZMJKKPSA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- CRNKLABLTICXDV-GUBZILKMSA-N Asp-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N CRNKLABLTICXDV-GUBZILKMSA-N 0.000 description 1
- RWHHSFSWKFBTCF-KKUMJFAQSA-N Asp-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N RWHHSFSWKFBTCF-KKUMJFAQSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- ZXRQJQCXPSMNMR-XIRDDKMYSA-N Asp-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N ZXRQJQCXPSMNMR-XIRDDKMYSA-N 0.000 description 1
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 1
- XUVTWGPERWIERB-IHRRRGAJSA-N Asp-Pro-Phe Chemical compound N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O XUVTWGPERWIERB-IHRRRGAJSA-N 0.000 description 1
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 1
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 description 1
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 1
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 1
- VXEORMGBKTUUCM-KWBADKCTSA-N Asp-Val-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O VXEORMGBKTUUCM-KWBADKCTSA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 1
- 241000351920 Aspergillus nidulans Species 0.000 description 1
- 240000006439 Aspergillus oryzae Species 0.000 description 1
- 235000002247 Aspergillus oryzae Nutrition 0.000 description 1
- 241000416162 Astragalus gummifer Species 0.000 description 1
- 241000194110 Bacillus sp. (in: Bacteria) Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 241001518086 Bartonella henselae Species 0.000 description 1
- 241001135724 Bdellovibrio sp. Species 0.000 description 1
- 239000005711 Benzoic acid Substances 0.000 description 1
- 241001135529 Bordetella sp. Species 0.000 description 1
- 241000283730 Bos primigenius Species 0.000 description 1
- 101001028834 Bos taurus Cation-independent mannose-6-phosphate receptor Proteins 0.000 description 1
- 241000508772 Brucella sp. Species 0.000 description 1
- 241001508395 Burkholderia sp. Species 0.000 description 1
- 239000004255 Butylated hydroxyanisole Substances 0.000 description 1
- 102100021935 C-C motif chemokine 26 Human genes 0.000 description 1
- 238000010354 CRISPR gene editing Methods 0.000 description 1
- 108090000835 CX3C Chemokine Receptor 1 Proteins 0.000 description 1
- 102100039196 CX3C chemokine receptor 1 Human genes 0.000 description 1
- 241000589994 Campylobacter sp. Species 0.000 description 1
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 1
- 241000222122 Candida albicans Species 0.000 description 1
- 241000282465 Canis Species 0.000 description 1
- 241000168484 Capnocytophaga sp. Species 0.000 description 1
- 208000005623 Carcinogenesis Diseases 0.000 description 1
- 241000207206 Cardiobacterium Species 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- 241000010804 Caulobacter vibrioides Species 0.000 description 1
- 241000700199 Cavia porcellus Species 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 108010035563 Chloramphenicol O-acetyltransferase Proteins 0.000 description 1
- 241000191368 Chlorobi Species 0.000 description 1
- 241001142109 Chloroflexi Species 0.000 description 1
- 102000010792 Chromogranin A Human genes 0.000 description 1
- 108010038447 Chromogranin A Proteins 0.000 description 1
- 108091062157 Cis-regulatory element Proteins 0.000 description 1
- 241000873310 Citrobacter sp. Species 0.000 description 1
- 241000193464 Clostridium sp. Species 0.000 description 1
- 102100022641 Coagulation factor IX Human genes 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 229920002261 Corn starch Polymers 0.000 description 1
- 241000186249 Corynebacterium sp. Species 0.000 description 1
- 241000700626 Cowpox virus Species 0.000 description 1
- 241000709687 Coxsackievirus Species 0.000 description 1
- 241000709675 Coxsackievirus B3 Species 0.000 description 1
- 241000699800 Cricetinae Species 0.000 description 1
- 241000192700 Cyanobacteria Species 0.000 description 1
- BGIRVSMUAJMGOK-FXQIFTODSA-N Cys-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CS)N BGIRVSMUAJMGOK-FXQIFTODSA-N 0.000 description 1
- RRIJEABIXPKSGP-FXQIFTODSA-N Cys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CS RRIJEABIXPKSGP-FXQIFTODSA-N 0.000 description 1
- CEZSLNCYQUFOSL-BQBZGAKWSA-N Cys-Arg-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O CEZSLNCYQUFOSL-BQBZGAKWSA-N 0.000 description 1
- LHLSSZYQFUNWRZ-NAKRPEOUSA-N Cys-Arg-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LHLSSZYQFUNWRZ-NAKRPEOUSA-N 0.000 description 1
- XGIAHEUULGOZHH-GUBZILKMSA-N Cys-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N XGIAHEUULGOZHH-GUBZILKMSA-N 0.000 description 1
- YRJICXCOIBUCRP-CIUDSAMLSA-N Cys-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N YRJICXCOIBUCRP-CIUDSAMLSA-N 0.000 description 1
- WVJHEDOLHPZLRV-CIUDSAMLSA-N Cys-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N WVJHEDOLHPZLRV-CIUDSAMLSA-N 0.000 description 1
- SFUUYRSAJPWTGO-SRVKXCTJSA-N Cys-Asn-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SFUUYRSAJPWTGO-SRVKXCTJSA-N 0.000 description 1
- BVFQOPGFOQVZTE-ACZMJKKPSA-N Cys-Gln-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O BVFQOPGFOQVZTE-ACZMJKKPSA-N 0.000 description 1
- VKAWJBQTFCBHQY-GUBZILKMSA-N Cys-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N VKAWJBQTFCBHQY-GUBZILKMSA-N 0.000 description 1
- SFRQEQGPRTVDPO-NRPADANISA-N Cys-Gln-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O SFRQEQGPRTVDPO-NRPADANISA-N 0.000 description 1
- VBPGTULCFGKGTF-ACZMJKKPSA-N Cys-Glu-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VBPGTULCFGKGTF-ACZMJKKPSA-N 0.000 description 1
- DZIGZIIJIGGANI-FXQIFTODSA-N Cys-Glu-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DZIGZIIJIGGANI-FXQIFTODSA-N 0.000 description 1
- VIRYODQIWJNWNU-NRPADANISA-N Cys-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N VIRYODQIWJNWNU-NRPADANISA-N 0.000 description 1
- URDUGPGPLNXXES-WHFBIAKZSA-N Cys-Gly-Cys Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O URDUGPGPLNXXES-WHFBIAKZSA-N 0.000 description 1
- RWAZRMXTVSIVJR-YUMQZZPRSA-N Cys-Gly-His Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CNC=N1)C(O)=O RWAZRMXTVSIVJR-YUMQZZPRSA-N 0.000 description 1
- LBOLGUYQEPZSKM-YUMQZZPRSA-N Cys-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N LBOLGUYQEPZSKM-YUMQZZPRSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- XXDATQFUGMAJRV-XIRDDKMYSA-N Cys-Leu-Trp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XXDATQFUGMAJRV-XIRDDKMYSA-N 0.000 description 1
- SDDJEOCJUFKAPV-BPUTZDHNSA-N Cys-Met-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CS)CCSC)C(O)=O)=CNC2=C1 SDDJEOCJUFKAPV-BPUTZDHNSA-N 0.000 description 1
- NMWZMKLDGZXRKP-BZSNNMDCSA-N Cys-Phe-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NMWZMKLDGZXRKP-BZSNNMDCSA-N 0.000 description 1
- KSMSFCBQBQPFAD-GUBZILKMSA-N Cys-Pro-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 KSMSFCBQBQPFAD-GUBZILKMSA-N 0.000 description 1
- TXGDWPBLUFQODU-XGEHTFHBSA-N Cys-Pro-Thr Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O TXGDWPBLUFQODU-XGEHTFHBSA-N 0.000 description 1
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 1
- BCWIFCLVCRAIQK-ZLUOBGJFSA-N Cys-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O BCWIFCLVCRAIQK-ZLUOBGJFSA-N 0.000 description 1
- ZGERHCJBLPQPGV-ACZMJKKPSA-N Cys-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N ZGERHCJBLPQPGV-ACZMJKKPSA-N 0.000 description 1
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 1
- IXPSSIBVVKSOIE-SRVKXCTJSA-N Cys-Ser-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O IXPSSIBVVKSOIE-SRVKXCTJSA-N 0.000 description 1
- NDNZRWUDUMTITL-FXQIFTODSA-N Cys-Ser-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NDNZRWUDUMTITL-FXQIFTODSA-N 0.000 description 1
- DGQJGBDBFVGLGL-ZKWXMUAHSA-N Cys-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DGQJGBDBFVGLGL-ZKWXMUAHSA-N 0.000 description 1
- ALTQTAKGRFLRLR-GUBZILKMSA-N Cys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N ALTQTAKGRFLRLR-GUBZILKMSA-N 0.000 description 1
- 201000003883 Cystic fibrosis Diseases 0.000 description 1
- UHDGCWIWMRVCDJ-CCXZUQQUSA-N Cytarabine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@@H](O)[C@H](O)[C@@H](CO)O1 UHDGCWIWMRVCDJ-CCXZUQQUSA-N 0.000 description 1
- 108010080611 Cytosine Deaminase Proteins 0.000 description 1
- 102000000311 Cytosine Deaminase Human genes 0.000 description 1
- 102100029588 Deoxycytidine kinase Human genes 0.000 description 1
- 108010033174 Deoxycytidine kinase Proteins 0.000 description 1
- 102100036912 Desmin Human genes 0.000 description 1
- 108010044052 Desmin Proteins 0.000 description 1
- FEWJPZIEWOKRBE-JCYAYHJZSA-N Dextrotartaric acid Chemical compound OC(=O)[C@H](O)[C@@H](O)C(O)=O FEWJPZIEWOKRBE-JCYAYHJZSA-N 0.000 description 1
- GZDFHIJNHHMENY-UHFFFAOYSA-N Dimethyl dicarbonate Chemical compound COC(=O)OC(=O)OC GZDFHIJNHHMENY-UHFFFAOYSA-N 0.000 description 1
- 241000255925 Diptera Species 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 239000006144 Dulbecco's modified Eagle's medium Substances 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 241000588905 Eikenella sp. Species 0.000 description 1
- LVGKNOAMLMIIKO-UHFFFAOYSA-N Elaidinsaeure-aethylester Natural products CCCCCCCCC=CCCCCCCCC(=O)OCC LVGKNOAMLMIIKO-UHFFFAOYSA-N 0.000 description 1
- 241000588697 Enterobacter cloacae Species 0.000 description 1
- 241000588921 Enterobacteriaceae Species 0.000 description 1
- 241001495410 Enterococcus sp. Species 0.000 description 1
- 241001518861 Erysipelothrix sp. Species 0.000 description 1
- 241000488157 Escherichia sp. Species 0.000 description 1
- 239000001856 Ethyl cellulose Substances 0.000 description 1
- ZZSNKZQZMQGXPY-UHFFFAOYSA-N Ethyl cellulose Chemical compound CCOCC1OC(OC)C(OCC)C(OCC)C1OC1C(O)C(O)C(OC)C(CO)O1 ZZSNKZQZMQGXPY-UHFFFAOYSA-N 0.000 description 1
- 241001267419 Eubacterium sp. Species 0.000 description 1
- 241000192125 Firmicutes Species 0.000 description 1
- 241000589564 Flavobacterium sp. Species 0.000 description 1
- 241000589601 Francisella Species 0.000 description 1
- 241000187808 Frankia sp. Species 0.000 description 1
- 241000223195 Fusarium graminearum Species 0.000 description 1
- 101150115151 GAA gene Proteins 0.000 description 1
- 101150028412 GBA gene Proteins 0.000 description 1
- 101000834253 Gallus gallus Actin, cytoplasmic 1 Proteins 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- 102100039289 Glial fibrillary acidic protein Human genes 0.000 description 1
- 101710193519 Glial fibrillary acidic protein Proteins 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 description 1
- UWZLBXOBVKRUFE-HGNGGELXSA-N Gln-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N UWZLBXOBVKRUFE-HGNGGELXSA-N 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- OYTPNWYZORARHL-XHNCKOQMSA-N Gln-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N OYTPNWYZORARHL-XHNCKOQMSA-N 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- KJRXLVZYJJLUCV-DCAQKATOSA-N Gln-Arg-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KJRXLVZYJJLUCV-DCAQKATOSA-N 0.000 description 1
- ZFADFBPRMSBPOT-KKUMJFAQSA-N Gln-Arg-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZFADFBPRMSBPOT-KKUMJFAQSA-N 0.000 description 1
- SOBBAYVQSNXYPQ-ACZMJKKPSA-N Gln-Asn-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SOBBAYVQSNXYPQ-ACZMJKKPSA-N 0.000 description 1
- PONUFVLSGMQFAI-AVGNSLFASA-N Gln-Asn-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PONUFVLSGMQFAI-AVGNSLFASA-N 0.000 description 1
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 1
- JKPGHIQCHIIRMS-AVGNSLFASA-N Gln-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N JKPGHIQCHIIRMS-AVGNSLFASA-N 0.000 description 1
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 1
- UICOTGULOUGGLC-NUMRIWBASA-N Gln-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UICOTGULOUGGLC-NUMRIWBASA-N 0.000 description 1
- DHNWZLGBTPUTQQ-QEJZJMRPSA-N Gln-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N DHNWZLGBTPUTQQ-QEJZJMRPSA-N 0.000 description 1
- IXFVOPOHSRKJNG-LAEOZQHASA-N Gln-Asp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IXFVOPOHSRKJNG-LAEOZQHASA-N 0.000 description 1
- FJAYYNIXQNERSO-ACZMJKKPSA-N Gln-Cys-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FJAYYNIXQNERSO-ACZMJKKPSA-N 0.000 description 1
- ZDJZEGYVKANKED-NRPADANISA-N Gln-Cys-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O ZDJZEGYVKANKED-NRPADANISA-N 0.000 description 1
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 1
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 1
- NXPXQIZKDOXIHH-JSGCOSHPSA-N Gln-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N NXPXQIZKDOXIHH-JSGCOSHPSA-N 0.000 description 1
- BVELAHPZLYLZDJ-HGNGGELXSA-N Gln-His-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O BVELAHPZLYLZDJ-HGNGGELXSA-N 0.000 description 1
- GFLNKSQHOBOMNM-AVGNSLFASA-N Gln-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GFLNKSQHOBOMNM-AVGNSLFASA-N 0.000 description 1
- KQOPMGBHNQBCEL-HVTMNAMFSA-N Gln-His-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KQOPMGBHNQBCEL-HVTMNAMFSA-N 0.000 description 1
- XWIBVSAEUCAAKF-GVXVVHGQSA-N Gln-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N XWIBVSAEUCAAKF-GVXVVHGQSA-N 0.000 description 1
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 1
- TWTWUBHEWQPMQW-ZPFDUUQYSA-N Gln-Ile-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWTWUBHEWQPMQW-ZPFDUUQYSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- QDXMSSWCEVYOLZ-SZMVWBNQSA-N Gln-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QDXMSSWCEVYOLZ-SZMVWBNQSA-N 0.000 description 1
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 1
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 1
- WEAVZFWWIPIANL-SRVKXCTJSA-N Gln-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N WEAVZFWWIPIANL-SRVKXCTJSA-N 0.000 description 1
- DQLVHRFFBQOWFL-JYJNAYRXSA-N Gln-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)O DQLVHRFFBQOWFL-JYJNAYRXSA-N 0.000 description 1
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 1
- HHRAEXBUNGTOGZ-IHRRRGAJSA-N Gln-Phe-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O HHRAEXBUNGTOGZ-IHRRRGAJSA-N 0.000 description 1
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 1
- DRNMNLKUUKKPIA-HTUGSXCWSA-N Gln-Phe-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)CCC(N)=O)C(O)=O DRNMNLKUUKKPIA-HTUGSXCWSA-N 0.000 description 1
- PDXIOFXRBVDSHD-JBACZVJFSA-N Gln-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CCC(=O)N)N PDXIOFXRBVDSHD-JBACZVJFSA-N 0.000 description 1
- PIUPHASDUFSHTF-CIUDSAMLSA-N Gln-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O PIUPHASDUFSHTF-CIUDSAMLSA-N 0.000 description 1
- NPMFDZGLKBNFOO-SRVKXCTJSA-N Gln-Pro-His Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NPMFDZGLKBNFOO-SRVKXCTJSA-N 0.000 description 1
- MFORDNZDKAVNSR-SRVKXCTJSA-N Gln-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O MFORDNZDKAVNSR-SRVKXCTJSA-N 0.000 description 1
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- OKARHJKJTKFQBM-ACZMJKKPSA-N Gln-Ser-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OKARHJKJTKFQBM-ACZMJKKPSA-N 0.000 description 1
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 1
- KPNWAJMEMRCLAL-GUBZILKMSA-N Gln-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KPNWAJMEMRCLAL-GUBZILKMSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- GHAXJVNBAKGWEJ-AVGNSLFASA-N Gln-Ser-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GHAXJVNBAKGWEJ-AVGNSLFASA-N 0.000 description 1
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 1
- SYTFJIQPBRJSOK-NKIYYHGXSA-N Gln-Thr-His Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 SYTFJIQPBRJSOK-NKIYYHGXSA-N 0.000 description 1
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 1
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- DITJVHONFRJKJW-BPUTZDHNSA-N Gln-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DITJVHONFRJKJW-BPUTZDHNSA-N 0.000 description 1
- GTBXHETZPUURJE-KKUMJFAQSA-N Gln-Tyr-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GTBXHETZPUURJE-KKUMJFAQSA-N 0.000 description 1
- WIMVKDYAKRAUCG-IHRRRGAJSA-N Gln-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WIMVKDYAKRAUCG-IHRRRGAJSA-N 0.000 description 1
- UBRQJXFDVZNYJP-AVGNSLFASA-N Gln-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UBRQJXFDVZNYJP-AVGNSLFASA-N 0.000 description 1
- KHHDJQRWIFHXHS-NRPADANISA-N Gln-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHHDJQRWIFHXHS-NRPADANISA-N 0.000 description 1
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 1
- FTMLQFPULNGION-ZVZYQTTQSA-N Gln-Val-Trp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O FTMLQFPULNGION-ZVZYQTTQSA-N 0.000 description 1
- 208000010055 Globoid Cell Leukodystrophy Diseases 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 1
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 1
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 description 1
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- LJLPOZGRPLORTF-CIUDSAMLSA-N Glu-Asn-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LJLPOZGRPLORTF-CIUDSAMLSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- RTOOAKXIJADOLL-GUBZILKMSA-N Glu-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N RTOOAKXIJADOLL-GUBZILKMSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 description 1
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 1
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 1
- PBFGQTGPSKWHJA-QEJZJMRPSA-N Glu-Asp-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PBFGQTGPSKWHJA-QEJZJMRPSA-N 0.000 description 1
- OWVURWCRZZMAOZ-XHNCKOQMSA-N Glu-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)C(=O)O OWVURWCRZZMAOZ-XHNCKOQMSA-N 0.000 description 1
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 1
- ALCAUWPAMLVUDB-FXQIFTODSA-N Glu-Gln-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ALCAUWPAMLVUDB-FXQIFTODSA-N 0.000 description 1
- XMVLTPMCUJTJQP-FXQIFTODSA-N Glu-Gln-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N XMVLTPMCUJTJQP-FXQIFTODSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- XIKYNVKEUINBGL-IUCAKERBSA-N Glu-His-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O XIKYNVKEUINBGL-IUCAKERBSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 1
- GRHXUHCFENOCOS-ZPFDUUQYSA-N Glu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GRHXUHCFENOCOS-ZPFDUUQYSA-N 0.000 description 1
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 1
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 1
- JDUKCSSHWNIQQZ-IHRRRGAJSA-N Glu-Phe-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JDUKCSSHWNIQQZ-IHRRRGAJSA-N 0.000 description 1
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 1
- TWYFJOHWGCCRIR-DCAQKATOSA-N Glu-Pro-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYFJOHWGCCRIR-DCAQKATOSA-N 0.000 description 1
- ZKONLKQGTNVAPR-DCAQKATOSA-N Glu-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N ZKONLKQGTNVAPR-DCAQKATOSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- JPUNZXVHHRZMNL-XIRDDKMYSA-N Glu-Pro-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JPUNZXVHHRZMNL-XIRDDKMYSA-N 0.000 description 1
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- WXONSNSSBYQGNN-AVGNSLFASA-N Glu-Ser-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WXONSNSSBYQGNN-AVGNSLFASA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- ZQNCUVODKOBSSO-XEGUGMAKSA-N Glu-Trp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O ZQNCUVODKOBSSO-XEGUGMAKSA-N 0.000 description 1
- BPCLDCNZBUYGOD-BPUTZDHNSA-N Glu-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 BPCLDCNZBUYGOD-BPUTZDHNSA-N 0.000 description 1
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 1
- RQZGFWKQLPJOEQ-YUMQZZPRSA-N Gly-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN)CN=C(N)N RQZGFWKQLPJOEQ-YUMQZZPRSA-N 0.000 description 1
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 1
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 1
- MXXXVOYFNVJHMA-IUCAKERBSA-N Gly-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN MXXXVOYFNVJHMA-IUCAKERBSA-N 0.000 description 1
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- DJTXYXZNNDDEOU-WHFBIAKZSA-N Gly-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)C(=O)N DJTXYXZNNDDEOU-WHFBIAKZSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 1
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 1
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 1
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 1
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- YDWZGVCXMVLDQH-WHFBIAKZSA-N Gly-Cys-Asn Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O YDWZGVCXMVLDQH-WHFBIAKZSA-N 0.000 description 1
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 1
- BULIVUZUDBHKKZ-WDSKDSINSA-N Gly-Gln-Asn Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BULIVUZUDBHKKZ-WDSKDSINSA-N 0.000 description 1
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 1
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- JNGJGFMFXREJNF-KBPBESRZSA-N Gly-Glu-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JNGJGFMFXREJNF-KBPBESRZSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- UPADCCSMVOQAGF-LBPRGKRZSA-N Gly-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)CN)C(O)=O)=CNC2=C1 UPADCCSMVOQAGF-LBPRGKRZSA-N 0.000 description 1
- HPAIKDPJURGQLN-KBPBESRZSA-N Gly-His-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 HPAIKDPJURGQLN-KBPBESRZSA-N 0.000 description 1
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 1
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 1
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 1
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 1
- ISSDODCYBOWWIP-GJZGRUSLSA-N Gly-Pro-Trp Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISSDODCYBOWWIP-GJZGRUSLSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 1
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- MREVELMMFOLESM-HOCLYGCPSA-N Gly-Trp-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O MREVELMMFOLESM-HOCLYGCPSA-N 0.000 description 1
- GWNIGUKSRJBIHX-STQMWFEESA-N Gly-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN)O GWNIGUKSRJBIHX-STQMWFEESA-N 0.000 description 1
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 1
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 1
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 1
- 229920002527 Glycogen Polymers 0.000 description 1
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 1
- 208000031886 HIV Infections Diseases 0.000 description 1
- 241000606841 Haemophilus sp. Species 0.000 description 1
- 241000590008 Helicobacter sp. Species 0.000 description 1
- 241001494519 Heliobacterium sp. Species 0.000 description 1
- 102000001554 Hemoglobins Human genes 0.000 description 1
- 108010054147 Hemoglobins Proteins 0.000 description 1
- 229920002971 Heparan sulfate Polymers 0.000 description 1
- HTTJABKRGRZYRN-UHFFFAOYSA-N Heparin Chemical compound OC1C(NC(=O)C)C(O)OC(COS(O)(=O)=O)C1OC1C(OS(O)(=O)=O)C(O)C(OC2C(C(OS(O)(=O)=O)C(OC3C(C(O)C(O)C(O3)C(O)=O)OS(O)(=O)=O)C(CO)O2)NS(O)(=O)=O)C(C(O)=O)O1 HTTJABKRGRZYRN-UHFFFAOYSA-N 0.000 description 1
- 208000009889 Herpes Simplex Diseases 0.000 description 1
- PDSUIXMZYNURGI-AVGNSLFASA-N His-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 PDSUIXMZYNURGI-AVGNSLFASA-N 0.000 description 1
- JHVCZQFWRLHUQR-DCAQKATOSA-N His-Arg-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N JHVCZQFWRLHUQR-DCAQKATOSA-N 0.000 description 1
- ZIMTWPHIKZEHSE-UWVGGRQHSA-N His-Arg-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O ZIMTWPHIKZEHSE-UWVGGRQHSA-N 0.000 description 1
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 1
- HRGGKHFHRSFSDE-CIUDSAMLSA-N His-Asn-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N HRGGKHFHRSFSDE-CIUDSAMLSA-N 0.000 description 1
- RXVOMIADLXPJGW-GUBZILKMSA-N His-Asp-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RXVOMIADLXPJGW-GUBZILKMSA-N 0.000 description 1
- ZNNNYCXPCKACHX-DCAQKATOSA-N His-Gln-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNNNYCXPCKACHX-DCAQKATOSA-N 0.000 description 1
- NELVFWFDOKRTOR-SDDRHHMPSA-N His-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O NELVFWFDOKRTOR-SDDRHHMPSA-N 0.000 description 1
- AKEDPWJFQULLPE-IUCAKERBSA-N His-Glu-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O AKEDPWJFQULLPE-IUCAKERBSA-N 0.000 description 1
- OSZUPUINVNPCOE-SDDRHHMPSA-N His-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O OSZUPUINVNPCOE-SDDRHHMPSA-N 0.000 description 1
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 1
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 1
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 1
- KAFZDWMZKGQDEE-SRVKXCTJSA-N His-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KAFZDWMZKGQDEE-SRVKXCTJSA-N 0.000 description 1
- PMWSGVRIMIFXQH-KKUMJFAQSA-N His-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 PMWSGVRIMIFXQH-KKUMJFAQSA-N 0.000 description 1
- ORERHHPZDDEMSC-VGDYDELISA-N His-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ORERHHPZDDEMSC-VGDYDELISA-N 0.000 description 1
- IWXMHXYOACDSIA-PYJNHQTQSA-N His-Ile-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O IWXMHXYOACDSIA-PYJNHQTQSA-N 0.000 description 1
- VYUXYMRNGALHEA-DLOVCJGASA-N His-Leu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O VYUXYMRNGALHEA-DLOVCJGASA-N 0.000 description 1
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 1
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 1
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 1
- SVVULKPWDBIPCO-BZSNNMDCSA-N His-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SVVULKPWDBIPCO-BZSNNMDCSA-N 0.000 description 1
- ZFDKSLBEWYCOCS-BZSNNMDCSA-N His-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CC=CC=C1 ZFDKSLBEWYCOCS-BZSNNMDCSA-N 0.000 description 1
- WHKLDLQHSYAVGU-ACRUOGEOSA-N His-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WHKLDLQHSYAVGU-ACRUOGEOSA-N 0.000 description 1
- VDHOMPFVSABJKU-ULQDDVLXSA-N His-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N VDHOMPFVSABJKU-ULQDDVLXSA-N 0.000 description 1
- YEKYGQZUBCRNGH-DCAQKATOSA-N His-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CO)C(=O)O YEKYGQZUBCRNGH-DCAQKATOSA-N 0.000 description 1
- KAXZXLSXFWSNNZ-XVYDVKMFSA-N His-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KAXZXLSXFWSNNZ-XVYDVKMFSA-N 0.000 description 1
- STGQSBKUYSPPIG-CIUDSAMLSA-N His-Ser-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 STGQSBKUYSPPIG-CIUDSAMLSA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 1
- AHEBIAHEZWQVHB-QTKMDUPCSA-N His-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O AHEBIAHEZWQVHB-QTKMDUPCSA-N 0.000 description 1
- VXZZUXWAOMWWJH-QTKMDUPCSA-N His-Thr-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VXZZUXWAOMWWJH-QTKMDUPCSA-N 0.000 description 1
- JVEKQAYXFGIISZ-HOCLYGCPSA-N His-Trp-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JVEKQAYXFGIISZ-HOCLYGCPSA-N 0.000 description 1
- XSEAJSPAOTZXJE-IHPCNDPISA-N His-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)[C@H](CC4=CN=CN4)N XSEAJSPAOTZXJE-IHPCNDPISA-N 0.000 description 1
- PBJOQLUVSGXRSW-YTQUADARSA-N His-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CN=CN4)N)C(=O)O PBJOQLUVSGXRSW-YTQUADARSA-N 0.000 description 1
- PZUZIHRPOVVHOT-KBPBESRZSA-N His-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CN=CN1 PZUZIHRPOVVHOT-KBPBESRZSA-N 0.000 description 1
- DAKSMIWQZPHRIB-BZSNNMDCSA-N His-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DAKSMIWQZPHRIB-BZSNNMDCSA-N 0.000 description 1
- KFQDSSNYWKZFOO-LSJOCFKGSA-N His-Val-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KFQDSSNYWKZFOO-LSJOCFKGSA-N 0.000 description 1
- GYXDQXPCPASCNR-NHCYSSNCSA-N His-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N GYXDQXPCPASCNR-NHCYSSNCSA-N 0.000 description 1
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 1
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 1
- 101000718525 Homo sapiens Alpha-galactosidase A Proteins 0.000 description 1
- 101000897493 Homo sapiens C-C motif chemokine 26 Proteins 0.000 description 1
- 101000877379 Homo sapiens ETS-related transcription factor Elf-3 Proteins 0.000 description 1
- 101100236307 Homo sapiens GAA gene Proteins 0.000 description 1
- 101001072477 Homo sapiens N-acetylglucosamine-1-phosphotransferase subunit gamma Proteins 0.000 description 1
- 101001072470 Homo sapiens N-acetylglucosamine-1-phosphotransferase subunits alpha/beta Proteins 0.000 description 1
- 101100434895 Homo sapiens NAGLU gene Proteins 0.000 description 1
- 101000760175 Homo sapiens Zinc finger protein 35 Proteins 0.000 description 1
- 241000598436 Human T-cell lymphotropic virus Species 0.000 description 1
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 1
- 241000713340 Human immunodeficiency virus 2 Species 0.000 description 1
- 241000216646 Hydrogenophaga pseudoflava Species 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 1
- CWJQMCPYXNVMBS-STECZYCISA-N Ile-Arg-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N CWJQMCPYXNVMBS-STECZYCISA-N 0.000 description 1
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 1
- REJKOQYVFDEZHA-SLBDDTMCSA-N Ile-Asp-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N REJKOQYVFDEZHA-SLBDDTMCSA-N 0.000 description 1
- LLHYWBGDMBGNHA-VGDYDELISA-N Ile-Cys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LLHYWBGDMBGNHA-VGDYDELISA-N 0.000 description 1
- WEWCEPOYKANMGZ-MMWGEVLESA-N Ile-Cys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N WEWCEPOYKANMGZ-MMWGEVLESA-N 0.000 description 1
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 1
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 1
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 1
- JRYQSFOFUFXPTB-RWRJDSDZSA-N Ile-Gln-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N JRYQSFOFUFXPTB-RWRJDSDZSA-N 0.000 description 1
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 1
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 1
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 1
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 1
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 1
- UASTVUQJMLZWGG-PEXQALLHSA-N Ile-His-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N UASTVUQJMLZWGG-PEXQALLHSA-N 0.000 description 1
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 1
- YBGTWSFIGHUWQE-MXAVVETBSA-N Ile-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CN=CN1 YBGTWSFIGHUWQE-MXAVVETBSA-N 0.000 description 1
- VNDQNDYEPSXHLU-JUKXBJQTSA-N Ile-His-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N VNDQNDYEPSXHLU-JUKXBJQTSA-N 0.000 description 1
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 1
- NUKXXNFEUZGPRO-BJDJZHNGSA-N Ile-Leu-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NUKXXNFEUZGPRO-BJDJZHNGSA-N 0.000 description 1
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 1
- SVZFKLBRCYCIIY-CYDGBPFRSA-N Ile-Pro-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVZFKLBRCYCIIY-CYDGBPFRSA-N 0.000 description 1
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- MITYXXNZSZLHGG-OBAATPRFSA-N Ile-Trp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N MITYXXNZSZLHGG-OBAATPRFSA-N 0.000 description 1
- DTPGSUQHUMELQB-GVARAGBVSA-N Ile-Tyr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 DTPGSUQHUMELQB-GVARAGBVSA-N 0.000 description 1
- RMJWFINHACYKJI-SIUGBPQLSA-N Ile-Tyr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RMJWFINHACYKJI-SIUGBPQLSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 1
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 1
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 102000014150 Interferons Human genes 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- 241001454354 Kingella Species 0.000 description 1
- 241000588754 Klebsiella sp. Species 0.000 description 1
- 241000235649 Kluyveromyces Species 0.000 description 1
- 244000285963 Kluyveromyces fragilis Species 0.000 description 1
- 235000014663 Kluyveromyces fragilis Nutrition 0.000 description 1
- 241001138401 Kluyveromyces lactis Species 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- 208000028226 Krabbe disease Diseases 0.000 description 1
- 241000710912 Kunjin virus Species 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- 229930182816 L-glutamine Natural products 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- RNKSNIBMTUYWSH-YFKPBYRVSA-N L-prolylglycine Chemical compound [O-]C(=O)CNC(=O)[C@@H]1CCC[NH2+]1 RNKSNIBMTUYWSH-YFKPBYRVSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- 241000186610 Lactobacillus sp. Species 0.000 description 1
- 241000194034 Lactococcus lactis subsp. cremoris Species 0.000 description 1
- 241000178948 Lactococcus sp. Species 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 241000589268 Legionella sp. Species 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 1
- QUAAUWNLWMLERT-IHRRRGAJSA-N Leu-Arg-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O QUAAUWNLWMLERT-IHRRRGAJSA-N 0.000 description 1
- VKOAHIRLIUESLU-ULQDDVLXSA-N Leu-Arg-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VKOAHIRLIUESLU-ULQDDVLXSA-N 0.000 description 1
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 1
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 1
- XYUBOFCTGPZFSA-WDSOQIARSA-N Leu-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 XYUBOFCTGPZFSA-WDSOQIARSA-N 0.000 description 1
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 1
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- IIKJNQWOQIWWMR-CIUDSAMLSA-N Leu-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N IIKJNQWOQIWWMR-CIUDSAMLSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- BOFAFKVZQUMTID-AVGNSLFASA-N Leu-Gln-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BOFAFKVZQUMTID-AVGNSLFASA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 1
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 1
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- LKXANTUNFMVCNF-IHPCNDPISA-N Leu-His-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LKXANTUNFMVCNF-IHPCNDPISA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 1
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 1
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 1
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 1
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- ONHCDMBHPQIPAI-YTQUADARSA-N Leu-Trp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N ONHCDMBHPQIPAI-YTQUADARSA-N 0.000 description 1
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 1
- UFPLDOKWDNTTRP-ULQDDVLXSA-N Leu-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=C(O)C=C1 UFPLDOKWDNTTRP-ULQDDVLXSA-N 0.000 description 1
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- NTXYXFDMIHXTHE-WDSOQIARSA-N Leu-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 NTXYXFDMIHXTHE-WDSOQIARSA-N 0.000 description 1
- MSFITIBEMPWCBD-ULQDDVLXSA-N Leu-Val-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MSFITIBEMPWCBD-ULQDDVLXSA-N 0.000 description 1
- 241001627205 Leuconostoc sp. Species 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 241001084338 Listeria sp. Species 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 1
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- CKSXSQUVEYCDIW-AVGNSLFASA-N Lys-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N CKSXSQUVEYCDIW-AVGNSLFASA-N 0.000 description 1
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 1
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 1
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 1
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 1
- ZAENPHCEQXALHO-GUBZILKMSA-N Lys-Cys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZAENPHCEQXALHO-GUBZILKMSA-N 0.000 description 1
- VSJXPNCQYGOLFM-XIRDDKMYSA-N Lys-Cys-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O VSJXPNCQYGOLFM-XIRDDKMYSA-N 0.000 description 1
- GGNOBVSOZPHLCE-GUBZILKMSA-N Lys-Gln-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GGNOBVSOZPHLCE-GUBZILKMSA-N 0.000 description 1
- LXNPMPIQDNSMTA-AVGNSLFASA-N Lys-Gln-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 LXNPMPIQDNSMTA-AVGNSLFASA-N 0.000 description 1
- CKSBRMUOQDNPKZ-SRVKXCTJSA-N Lys-Gln-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CKSBRMUOQDNPKZ-SRVKXCTJSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 1
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- ZMMDPRTXLAEMOD-BZSNNMDCSA-N Lys-His-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZMMDPRTXLAEMOD-BZSNNMDCSA-N 0.000 description 1
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 1
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- MTBLFIQZECOEBY-IHRRRGAJSA-N Lys-Met-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O MTBLFIQZECOEBY-IHRRRGAJSA-N 0.000 description 1
- ZJSZPXISKMDJKQ-JYJNAYRXSA-N Lys-Phe-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=CC=C1 ZJSZPXISKMDJKQ-JYJNAYRXSA-N 0.000 description 1
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 1
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 1
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 1
- XGZDDOKIHSYHTO-SZMVWBNQSA-N Lys-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 XGZDDOKIHSYHTO-SZMVWBNQSA-N 0.000 description 1
- KXYLFJIQDIMURW-IHPCNDPISA-N Lys-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCCN)=CNC2=C1 KXYLFJIQDIMURW-IHPCNDPISA-N 0.000 description 1
- YUTZYVTZDVZBJJ-IHPCNDPISA-N Lys-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 YUTZYVTZDVZBJJ-IHPCNDPISA-N 0.000 description 1
- HONVOXINDBETTI-KKUMJFAQSA-N Lys-Tyr-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CC=C(O)C=C1 HONVOXINDBETTI-KKUMJFAQSA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 1
- FPQMQEOVSKMVMA-ACRUOGEOSA-N Lys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCCCN)N)O FPQMQEOVSKMVMA-ACRUOGEOSA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- 101150030800 MAN2B1 gene Proteins 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 208000027933 Mannosidase Deficiency disease Diseases 0.000 description 1
- 241001576959 Megasphaera sp. Species 0.000 description 1
- QRHWTCJBCLGYRB-FXQIFTODSA-N Met-Ala-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O QRHWTCJBCLGYRB-FXQIFTODSA-N 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- BVXXDMUMHMXFER-BPNCWPANSA-N Met-Ala-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVXXDMUMHMXFER-BPNCWPANSA-N 0.000 description 1
- DLAFCQWUMFMZSN-GUBZILKMSA-N Met-Arg-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N DLAFCQWUMFMZSN-GUBZILKMSA-N 0.000 description 1
- CWFYZYQMUDWGTI-GUBZILKMSA-N Met-Arg-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O CWFYZYQMUDWGTI-GUBZILKMSA-N 0.000 description 1
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 1
- AHZNUGRZHMZGFL-GUBZILKMSA-N Met-Arg-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCNC(N)=N AHZNUGRZHMZGFL-GUBZILKMSA-N 0.000 description 1
- IVCPHARVJUYDPA-FXQIFTODSA-N Met-Asn-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IVCPHARVJUYDPA-FXQIFTODSA-N 0.000 description 1
- HDNOQCZWJGGHSS-VEVYYDQMSA-N Met-Asn-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HDNOQCZWJGGHSS-VEVYYDQMSA-N 0.000 description 1
- TUSOIZOVPJCMFC-FXQIFTODSA-N Met-Asp-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O TUSOIZOVPJCMFC-FXQIFTODSA-N 0.000 description 1
- DNDVVILEHVMWIS-LPEHRKFASA-N Met-Asp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DNDVVILEHVMWIS-LPEHRKFASA-N 0.000 description 1
- FVKRQMQQFGBXHV-QXEWZRGKSA-N Met-Asp-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FVKRQMQQFGBXHV-QXEWZRGKSA-N 0.000 description 1
- CEGVMWAVGBRVFS-XGEHTFHBSA-N Met-Cys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CEGVMWAVGBRVFS-XGEHTFHBSA-N 0.000 description 1
- YLLWCSDBVGZLOW-CIUDSAMLSA-N Met-Gln-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O YLLWCSDBVGZLOW-CIUDSAMLSA-N 0.000 description 1
- CRGKLOXHKICQOL-GARJFASQSA-N Met-Gln-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N CRGKLOXHKICQOL-GARJFASQSA-N 0.000 description 1
- RZJOHSFAEZBWLK-CIUDSAMLSA-N Met-Gln-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N RZJOHSFAEZBWLK-CIUDSAMLSA-N 0.000 description 1
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 1
- DJDFBVNNDAUPRW-GUBZILKMSA-N Met-Glu-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DJDFBVNNDAUPRW-GUBZILKMSA-N 0.000 description 1
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 1
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 1
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 1
- AEQVPPGEJJBFEE-CYDGBPFRSA-N Met-Ile-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEQVPPGEJJBFEE-CYDGBPFRSA-N 0.000 description 1
- FTQOFRPGLYXRFM-CYDGBPFRSA-N Met-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCSC)N FTQOFRPGLYXRFM-CYDGBPFRSA-N 0.000 description 1
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 1
- JYPITOUIQVSCKM-IHRRRGAJSA-N Met-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCSC)N JYPITOUIQVSCKM-IHRRRGAJSA-N 0.000 description 1
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 1
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 1
- WPTHAGXMYDRPFD-SRVKXCTJSA-N Met-Lys-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O WPTHAGXMYDRPFD-SRVKXCTJSA-N 0.000 description 1
- VBGGTAPDGFQMKF-AVGNSLFASA-N Met-Lys-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O VBGGTAPDGFQMKF-AVGNSLFASA-N 0.000 description 1
- CNAGWYQWQDMUGC-IHRRRGAJSA-N Met-Phe-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CNAGWYQWQDMUGC-IHRRRGAJSA-N 0.000 description 1
- JQHYVIKEFYETEW-IHRRRGAJSA-N Met-Phe-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=CC=C1 JQHYVIKEFYETEW-IHRRRGAJSA-N 0.000 description 1
- BQHLZUMZOXUWNU-DCAQKATOSA-N Met-Pro-Glu Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BQHLZUMZOXUWNU-DCAQKATOSA-N 0.000 description 1
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 1
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- KYJHWKAMFISDJE-RCWTZXSCSA-N Met-Thr-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCSC KYJHWKAMFISDJE-RCWTZXSCSA-N 0.000 description 1
- NDJSSFWDYDUQID-YTWAJWBKSA-N Met-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N)O NDJSSFWDYDUQID-YTWAJWBKSA-N 0.000 description 1
- HMEVNCOJHJTLNB-BVSLBCMMSA-N Met-Trp-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N HMEVNCOJHJTLNB-BVSLBCMMSA-N 0.000 description 1
- ALTHVGNGGZZSAC-SRVKXCTJSA-N Met-Val-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N ALTHVGNGGZZSAC-SRVKXCTJSA-N 0.000 description 1
- KPVLLNDCBYXKNV-CYDGBPFRSA-N Met-Val-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KPVLLNDCBYXKNV-CYDGBPFRSA-N 0.000 description 1
- VYDLZDRMOFYOGV-TUAOUCFPSA-N Met-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N VYDLZDRMOFYOGV-TUAOUCFPSA-N 0.000 description 1
- OTKQHDPECKUDSB-SZMVWBNQSA-N Met-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 OTKQHDPECKUDSB-SZMVWBNQSA-N 0.000 description 1
- IQJMEDDVOGMTKT-SRVKXCTJSA-N Met-Val-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IQJMEDDVOGMTKT-SRVKXCTJSA-N 0.000 description 1
- 102000006890 Methyl-CpG-Binding Protein 2 Human genes 0.000 description 1
- 108010072388 Methyl-CpG-Binding Protein 2 Proteins 0.000 description 1
- 241000191936 Micrococcus sp. Species 0.000 description 1
- 241000187723 Micromonospora sp. Species 0.000 description 1
- 239000004909 Moisturizer Substances 0.000 description 1
- 241000588628 Moraxella sp. Species 0.000 description 1
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 1
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 1
- 241000187488 Mycobacterium sp. Species 0.000 description 1
- 241000202944 Mycoplasma sp. Species 0.000 description 1
- 102000047918 Myelin Basic Human genes 0.000 description 1
- 101710107068 Myelin basic protein Proteins 0.000 description 1
- 241000863422 Myxococcus xanthus Species 0.000 description 1
- GXCLVBGFBYZDAG-UHFFFAOYSA-N N-[2-(1H-indol-3-yl)ethyl]-N-methylprop-2-en-1-amine Chemical compound CN(CCC1=CNC2=C1C=CC=C2)CC=C GXCLVBGFBYZDAG-UHFFFAOYSA-N 0.000 description 1
- OVRNDRQMDRJTHS-UHFFFAOYSA-N N-acetyl-D-glucosamine Natural products CC(=O)NC1C(O)OC(CO)C(O)C1O OVRNDRQMDRJTHS-UHFFFAOYSA-N 0.000 description 1
- OVRNDRQMDRJTHS-RTRLPJTCSA-N N-acetyl-D-glucosamine Chemical compound CC(=O)N[C@H]1C(O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-RTRLPJTCSA-N 0.000 description 1
- MBLBDJOUHNCFQT-LXGUWJNJSA-N N-acetylglucosamine Natural products CC(=O)N[C@@H](C=O)[C@@H](O)[C@H](O)[C@H](O)CO MBLBDJOUHNCFQT-LXGUWJNJSA-N 0.000 description 1
- 102100036713 N-acetylglucosamine-1-phosphotransferase subunit gamma Human genes 0.000 description 1
- 102100036710 N-acetylglucosamine-1-phosphotransferase subunits alpha/beta Human genes 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 101150003688 NAGLU gene Proteins 0.000 description 1
- 241001440871 Neisseria sp. Species 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 108010065395 Neuropep-1 Proteins 0.000 description 1
- 241000187681 Nocardia sp. Species 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 241000320412 Ogataea angusta Species 0.000 description 1
- 239000005642 Oleic acid Substances 0.000 description 1
- ZQPPMHVWECSIRJ-UHFFFAOYSA-N Oleic acid Natural products CCCCCCCCC=CCCCCCCCC(O)=O ZQPPMHVWECSIRJ-UHFFFAOYSA-N 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 229940029536 PANVAC Drugs 0.000 description 1
- 241000606580 Pasteurella sp. Species 0.000 description 1
- 235000019483 Peanut oil Nutrition 0.000 description 1
- 241000606012 Pectinatus Species 0.000 description 1
- 241000604136 Pediococcus sp. Species 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- 241000192033 Peptostreptococcus sp. Species 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 1
- YMORXCKTSSGYIG-IHRRRGAJSA-N Phe-Arg-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N YMORXCKTSSGYIG-IHRRRGAJSA-N 0.000 description 1
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 1
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 1
- IWRZUGHCHFZYQZ-UFYCRDLUSA-N Phe-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 IWRZUGHCHFZYQZ-UFYCRDLUSA-N 0.000 description 1
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 1
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 1
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 description 1
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 1
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 1
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 1
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 1
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 1
- KOUUGTKGEQZRHV-KKUMJFAQSA-N Phe-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KOUUGTKGEQZRHV-KKUMJFAQSA-N 0.000 description 1
- IILUKIJNFMUBNF-IHRRRGAJSA-N Phe-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O IILUKIJNFMUBNF-IHRRRGAJSA-N 0.000 description 1
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 1
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 1
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 1
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 1
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 1
- VJLLEKDQJSMHRU-STQMWFEESA-N Phe-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O VJLLEKDQJSMHRU-STQMWFEESA-N 0.000 description 1
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 1
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 1
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 1
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 description 1
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 description 1
- MIICYIIBVYQNKE-QEWYBTABSA-N Phe-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N MIICYIIBVYQNKE-QEWYBTABSA-N 0.000 description 1
- HTXVATDVCRFORF-MGHWNKPDSA-N Phe-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N HTXVATDVCRFORF-MGHWNKPDSA-N 0.000 description 1
- JQLQUPIYYJXZLJ-ZEWNOJEFSA-N Phe-Ile-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 JQLQUPIYYJXZLJ-ZEWNOJEFSA-N 0.000 description 1
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 1
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 1
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- FENSZYFJQOFSQR-FIRPJDEBSA-N Phe-Phe-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FENSZYFJQOFSQR-FIRPJDEBSA-N 0.000 description 1
- YMTMNYNEZDAGMW-RNXOBYDBSA-N Phe-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N YMTMNYNEZDAGMW-RNXOBYDBSA-N 0.000 description 1
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 1
- CZQZSMJXFGGBHM-KKUMJFAQSA-N Phe-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O CZQZSMJXFGGBHM-KKUMJFAQSA-N 0.000 description 1
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 1
- GZGPMBKUJDRICD-ULQDDVLXSA-N Phe-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O GZGPMBKUJDRICD-ULQDDVLXSA-N 0.000 description 1
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 1
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 1
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 1
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 1
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 1
- MJOJSHOTYWABPR-WIRXVTQYSA-N Phe-Trp-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MJOJSHOTYWABPR-WIRXVTQYSA-N 0.000 description 1
- QTDBZORPVYTRJU-KKXDTOCCSA-N Phe-Tyr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O QTDBZORPVYTRJU-KKXDTOCCSA-N 0.000 description 1
- MMPBPRXOFJNCCN-ZEWNOJEFSA-N Phe-Tyr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MMPBPRXOFJNCCN-ZEWNOJEFSA-N 0.000 description 1
- ZTVSVSFBHUVYIN-UFYCRDLUSA-N Phe-Tyr-Met Chemical compound C([C@@H](C(=O)N[C@@H](CCSC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=C(O)C=C1 ZTVSVSFBHUVYIN-UFYCRDLUSA-N 0.000 description 1
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 1
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 1
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 1
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 1
- XBCOOBCTVMMQSC-BVSLBCMMSA-N Phe-Val-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 XBCOOBCTVMMQSC-BVSLBCMMSA-N 0.000 description 1
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 1
- 241000235648 Pichia Species 0.000 description 1
- 241000709664 Picornaviridae Species 0.000 description 1
- 241000607000 Plesiomonas Species 0.000 description 1
- 229920001213 Polysorbate 20 Polymers 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- ALJGSKMBIUEJOB-FXQIFTODSA-N Pro-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 ALJGSKMBIUEJOB-FXQIFTODSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 1
- KDIIENQUNVNWHR-JYJNAYRXSA-N Pro-Arg-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KDIIENQUNVNWHR-JYJNAYRXSA-N 0.000 description 1
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 1
- NUZHSNLQJDYSRW-BZSNNMDCSA-N Pro-Arg-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NUZHSNLQJDYSRW-BZSNNMDCSA-N 0.000 description 1
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 1
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 1
- GLEOIKLQBZNKJZ-WDSKDSINSA-N Pro-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 GLEOIKLQBZNKJZ-WDSKDSINSA-N 0.000 description 1
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 1
- KIGGUSRFHJCIEJ-DCAQKATOSA-N Pro-Asp-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O KIGGUSRFHJCIEJ-DCAQKATOSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- HQVPQXMCQKXARZ-FXQIFTODSA-N Pro-Cys-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O HQVPQXMCQKXARZ-FXQIFTODSA-N 0.000 description 1
- XJROSHJRQTXWAE-XGEHTFHBSA-N Pro-Cys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XJROSHJRQTXWAE-XGEHTFHBSA-N 0.000 description 1
- PZSCUPVOJGKHEP-CIUDSAMLSA-N Pro-Gln-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PZSCUPVOJGKHEP-CIUDSAMLSA-N 0.000 description 1
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 1
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 1
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 1
- KTFZQPLSPLWLKN-KKUMJFAQSA-N Pro-Gln-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KTFZQPLSPLWLKN-KKUMJFAQSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 1
- QGOZJLYCGRYYRW-KKUMJFAQSA-N Pro-Glu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QGOZJLYCGRYYRW-KKUMJFAQSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 1
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 1
- QNZLIVROMORQFH-BQBZGAKWSA-N Pro-Gly-Cys Chemical compound C1C[C@H](NC1)C(=O)NCC(=O)N[C@@H](CS)C(=O)O QNZLIVROMORQFH-BQBZGAKWSA-N 0.000 description 1
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 1
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 1
- FJLODLCIOJUDRG-PYJNHQTQSA-N Pro-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FJLODLCIOJUDRG-PYJNHQTQSA-N 0.000 description 1
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 1
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 1
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 1
- VWHJZETTZDAGOM-XUXIUFHCSA-N Pro-Lys-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VWHJZETTZDAGOM-XUXIUFHCSA-N 0.000 description 1
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 1
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 1
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 1
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 1
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 1
- XSXABUHLKPUVLX-JYJNAYRXSA-N Pro-Ser-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O XSXABUHLKPUVLX-JYJNAYRXSA-N 0.000 description 1
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- PKHDJFHFMGQMPS-RCWTZXSCSA-N Pro-Thr-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKHDJFHFMGQMPS-RCWTZXSCSA-N 0.000 description 1
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 1
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 1
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 1
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 description 1
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 241001521757 Propionibacterium sp. Species 0.000 description 1
- 241001656788 Propionispira Species 0.000 description 1
- 241000588769 Proteus <enterobacteria> Species 0.000 description 1
- 241000588770 Proteus mirabilis Species 0.000 description 1
- 241000334216 Proteus sp. Species 0.000 description 1
- 241000589776 Pseudomonas putida Species 0.000 description 1
- 241000589774 Pseudomonas sp. Species 0.000 description 1
- 241000589615 Pseudomonas syringae Species 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 235000019485 Safflower oil Nutrition 0.000 description 1
- 241000607149 Salmonella sp. Species 0.000 description 1
- 208000025820 Sanfilippo syndrome type B Diseases 0.000 description 1
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 1
- 241001037420 Selenomonas sp. Species 0.000 description 1
- 238000010266 Sephadex chromatography Methods 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- QWZIOCFPXMAXET-CIUDSAMLSA-N Ser-Arg-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QWZIOCFPXMAXET-CIUDSAMLSA-N 0.000 description 1
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 1
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 1
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
- CTRHXXXHUJTTRZ-ZLUOBGJFSA-N Ser-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O CTRHXXXHUJTTRZ-ZLUOBGJFSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 1
- MPPHJZYXDVDGOF-BWBBJGPYSA-N Ser-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CO MPPHJZYXDVDGOF-BWBBJGPYSA-N 0.000 description 1
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 1
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 1
- VDVYTKZBMFADQH-AVGNSLFASA-N Ser-Gln-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VDVYTKZBMFADQH-AVGNSLFASA-N 0.000 description 1
- GYXVUTAOICLGKJ-ACZMJKKPSA-N Ser-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N GYXVUTAOICLGKJ-ACZMJKKPSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- BRIZMMZEYSAKJX-QEJZJMRPSA-N Ser-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N BRIZMMZEYSAKJX-QEJZJMRPSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- UGHCUDLCCVVIJR-VGDYDELISA-N Ser-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N UGHCUDLCCVVIJR-VGDYDELISA-N 0.000 description 1
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 1
- ZUDXUJSYCCNZQJ-DCAQKATOSA-N Ser-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N ZUDXUJSYCCNZQJ-DCAQKATOSA-N 0.000 description 1
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- VXYQOFXBIXKPCX-BQBZGAKWSA-N Ser-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N VXYQOFXBIXKPCX-BQBZGAKWSA-N 0.000 description 1
- FZEUTKVQGMVGHW-AVGNSLFASA-N Ser-Phe-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZEUTKVQGMVGHW-AVGNSLFASA-N 0.000 description 1
- TVPQRPNBYCRRLL-IHRRRGAJSA-N Ser-Phe-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O TVPQRPNBYCRRLL-IHRRRGAJSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- UYLKOSODXYSWMQ-XGEHTFHBSA-N Ser-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N)O UYLKOSODXYSWMQ-XGEHTFHBSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- OJFFAQFRCVPHNN-JYBASQMISA-N Ser-Thr-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OJFFAQFRCVPHNN-JYBASQMISA-N 0.000 description 1
- AXKJPUBALUNJEO-UBHSHLNASA-N Ser-Trp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O AXKJPUBALUNJEO-UBHSHLNASA-N 0.000 description 1
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 1
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 1
- VEVYMLNYMULSMS-AVGNSLFASA-N Ser-Tyr-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEVYMLNYMULSMS-AVGNSLFASA-N 0.000 description 1
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 1
- OSFZCEQJLWCIBG-BZSNNMDCSA-N Ser-Tyr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSFZCEQJLWCIBG-BZSNNMDCSA-N 0.000 description 1
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- LSHUNRICNSEEAN-BPUTZDHNSA-N Ser-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CO)N LSHUNRICNSEEAN-BPUTZDHNSA-N 0.000 description 1
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 1
- 241000607758 Shigella sp. Species 0.000 description 1
- 241000589196 Sinorhizobium meliloti Species 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 241001180364 Spirochaetes Species 0.000 description 1
- 241000202911 Spiroplasma sp. Species 0.000 description 1
- 241000256251 Spodoptera frugiperda Species 0.000 description 1
- 241000139725 Sporomusa sp. Species 0.000 description 1
- 241001147693 Staphylococcus sp. Species 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 241000983364 Stenotrophomonas sp. Species 0.000 description 1
- 235000014962 Streptococcus cremoris Nutrition 0.000 description 1
- 241000194022 Streptococcus sp. Species 0.000 description 1
- 241000187398 Streptomyces lividans Species 0.000 description 1
- 241000187180 Streptomyces sp. Species 0.000 description 1
- KDYFGRWQOYBRFD-UHFFFAOYSA-N Succinic acid Natural products OC(=O)CCC(O)=O KDYFGRWQOYBRFD-UHFFFAOYSA-N 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- 102000001435 Synapsin Human genes 0.000 description 1
- 108050009621 Synapsin Proteins 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- FEWJPZIEWOKRBE-UHFFFAOYSA-N Tartaric acid Natural products [H+].[H+].[O-]C(=O)C(O)C(O)C([O-])=O FEWJPZIEWOKRBE-UHFFFAOYSA-N 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 1
- LHUBVKCLOVALIA-HJGDQZAQSA-N Thr-Arg-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LHUBVKCLOVALIA-HJGDQZAQSA-N 0.000 description 1
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 1
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 1
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 1
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 1
- DCCGCVLVVSAJFK-NUMRIWBASA-N Thr-Asp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O DCCGCVLVVSAJFK-NUMRIWBASA-N 0.000 description 1
- NOWXWJLVGTVJKM-PBCZWWQYSA-N Thr-Asp-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O NOWXWJLVGTVJKM-PBCZWWQYSA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 1
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 1
- VLIUBAATANYCOY-GBALPHGKSA-N Thr-Cys-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VLIUBAATANYCOY-GBALPHGKSA-N 0.000 description 1
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 1
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 1
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 1
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- IGGFFPOIFHZYKC-PBCZWWQYSA-N Thr-His-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O IGGFFPOIFHZYKC-PBCZWWQYSA-N 0.000 description 1
- WBCCCPZIJIJTSD-TUBUOCAGSA-N Thr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H]([C@@H](C)O)N WBCCCPZIJIJTSD-TUBUOCAGSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- ISLDRLHVPXABBC-IEGACIPQSA-N Thr-Leu-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISLDRLHVPXABBC-IEGACIPQSA-N 0.000 description 1
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 1
- ZXIHABSKUITPTN-IXOXFDKPSA-N Thr-Lys-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O ZXIHABSKUITPTN-IXOXFDKPSA-N 0.000 description 1
- DCRHJDRLCFMEBI-RHYQMDGZSA-N Thr-Lys-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O DCRHJDRLCFMEBI-RHYQMDGZSA-N 0.000 description 1
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 1
- UGFSAPWZBROURT-IXOXFDKPSA-N Thr-Phe-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N)O UGFSAPWZBROURT-IXOXFDKPSA-N 0.000 description 1
- JMBRNXUOLJFURW-BEAPCOKYSA-N Thr-Phe-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N)O JMBRNXUOLJFURW-BEAPCOKYSA-N 0.000 description 1
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 1
- MEBDIIKMUUNBSB-RPTUDFQQSA-N Thr-Phe-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MEBDIIKMUUNBSB-RPTUDFQQSA-N 0.000 description 1
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- NYQIZWROIMIQSL-VEVYYDQMSA-N Thr-Pro-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O NYQIZWROIMIQSL-VEVYYDQMSA-N 0.000 description 1
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 1
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 1
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 1
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- VGNLMPBYWWNQFS-ZEILLAHLSA-N Thr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O VGNLMPBYWWNQFS-ZEILLAHLSA-N 0.000 description 1
- KHTIUAKJRUIEMA-HOUAVDHOSA-N Thr-Trp-Asp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 KHTIUAKJRUIEMA-HOUAVDHOSA-N 0.000 description 1
- XEVHXNLPUBVQEX-DVJZZOLTSA-N Thr-Trp-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N)O XEVHXNLPUBVQEX-DVJZZOLTSA-N 0.000 description 1
- MYNYCUXMIIWUNW-IEGACIPQSA-N Thr-Trp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MYNYCUXMIIWUNW-IEGACIPQSA-N 0.000 description 1
- VGNKUXWYFFDWDH-BEMMVCDISA-N Thr-Trp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N)O VGNKUXWYFFDWDH-BEMMVCDISA-N 0.000 description 1
- BGHVVGPELPHRCI-HZTRNQAASA-N Thr-Trp-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N)O BGHVVGPELPHRCI-HZTRNQAASA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 102100037357 Thymidylate kinase Human genes 0.000 description 1
- 229920001615 Tragacanth Polymers 0.000 description 1
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 1
- 241000499912 Trichoderma reesei Species 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 239000013504 Triton X-100 Substances 0.000 description 1
- AVYVKJMBNLPWRX-WFBYXXMGSA-N Trp-Ala-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 AVYVKJMBNLPWRX-WFBYXXMGSA-N 0.000 description 1
- PXYJUECTGMGIDT-WDSOQIARSA-N Trp-Arg-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 PXYJUECTGMGIDT-WDSOQIARSA-N 0.000 description 1
- MVHHTXAUJCIOMZ-WDSOQIARSA-N Trp-Arg-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N MVHHTXAUJCIOMZ-WDSOQIARSA-N 0.000 description 1
- ICNFHVUVCNWUAB-SZMVWBNQSA-N Trp-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ICNFHVUVCNWUAB-SZMVWBNQSA-N 0.000 description 1
- MHNHRNHJMXAVHZ-AAEUAGOBSA-N Trp-Asn-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N MHNHRNHJMXAVHZ-AAEUAGOBSA-N 0.000 description 1
- ADBFWLXCCKIXBQ-XIRDDKMYSA-N Trp-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ADBFWLXCCKIXBQ-XIRDDKMYSA-N 0.000 description 1
- IQGJAHMZWBTRIF-UBHSHLNASA-N Trp-Asp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IQGJAHMZWBTRIF-UBHSHLNASA-N 0.000 description 1
- PMIJXCLOQFMOKZ-BPUTZDHNSA-N Trp-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N PMIJXCLOQFMOKZ-BPUTZDHNSA-N 0.000 description 1
- RERIQEJUYCLJQI-QRTARXTBSA-N Trp-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RERIQEJUYCLJQI-QRTARXTBSA-N 0.000 description 1
- KOVOKXBHGVXQMG-BPUTZDHNSA-N Trp-Cys-Met Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(O)=O)=CNC2=C1 KOVOKXBHGVXQMG-BPUTZDHNSA-N 0.000 description 1
- JZHJLBPBQKPTNX-UBHSHLNASA-N Trp-Cys-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 JZHJLBPBQKPTNX-UBHSHLNASA-N 0.000 description 1
- CZSMNLQMRWPGQF-XEGUGMAKSA-N Trp-Gln-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CZSMNLQMRWPGQF-XEGUGMAKSA-N 0.000 description 1
- WPSYJHFHZYJXMW-JSGCOSHPSA-N Trp-Gln-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O WPSYJHFHZYJXMW-JSGCOSHPSA-N 0.000 description 1
- MDDYTWOFHZFABW-SZMVWBNQSA-N Trp-Gln-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 MDDYTWOFHZFABW-SZMVWBNQSA-N 0.000 description 1
- PTAWAMWPRFTACW-SZMVWBNQSA-N Trp-Gln-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PTAWAMWPRFTACW-SZMVWBNQSA-N 0.000 description 1
- KDWZQYUTMJSYRJ-BHYGNILZSA-N Trp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O KDWZQYUTMJSYRJ-BHYGNILZSA-N 0.000 description 1
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 1
- WLBZWXXGSOLJBA-HOCLYGCPSA-N Trp-Gly-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 WLBZWXXGSOLJBA-HOCLYGCPSA-N 0.000 description 1
- PVRRBEROBJQPJX-SZMVWBNQSA-N Trp-His-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PVRRBEROBJQPJX-SZMVWBNQSA-N 0.000 description 1
- XGFGVFMXDXALEV-XIRDDKMYSA-N Trp-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N XGFGVFMXDXALEV-XIRDDKMYSA-N 0.000 description 1
- UJRIVCPPPMYCNA-HOCLYGCPSA-N Trp-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UJRIVCPPPMYCNA-HOCLYGCPSA-N 0.000 description 1
- RRVUOLRWIZXBRQ-IHPCNDPISA-N Trp-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RRVUOLRWIZXBRQ-IHPCNDPISA-N 0.000 description 1
- WMBFONUKQXGLMU-WDSOQIARSA-N Trp-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WMBFONUKQXGLMU-WDSOQIARSA-N 0.000 description 1
- HJXOFWKCWLHYIJ-SZMVWBNQSA-N Trp-Lys-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HJXOFWKCWLHYIJ-SZMVWBNQSA-N 0.000 description 1
- VDUJEEQMRQCLHB-YTQUADARSA-N Trp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O VDUJEEQMRQCLHB-YTQUADARSA-N 0.000 description 1
- GQEXFCQNAJHJTI-IHPCNDPISA-N Trp-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GQEXFCQNAJHJTI-IHPCNDPISA-N 0.000 description 1
- GIAMKIPJSRZVJB-IHPCNDPISA-N Trp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GIAMKIPJSRZVJB-IHPCNDPISA-N 0.000 description 1
- BIBZRFIKOLGWFQ-XIRDDKMYSA-N Trp-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O BIBZRFIKOLGWFQ-XIRDDKMYSA-N 0.000 description 1
- OJKVFAWXPGCJMF-BPUTZDHNSA-N Trp-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CO)C(=O)O OJKVFAWXPGCJMF-BPUTZDHNSA-N 0.000 description 1
- RNDWCRUOGGQDKN-UBHSHLNASA-N Trp-Ser-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RNDWCRUOGGQDKN-UBHSHLNASA-N 0.000 description 1
- UMIACFRBELJMGT-GQGQLFGLSA-N Trp-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UMIACFRBELJMGT-GQGQLFGLSA-N 0.000 description 1
- HIZDHWHVOLUGOX-BPUTZDHNSA-N Trp-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O HIZDHWHVOLUGOX-BPUTZDHNSA-N 0.000 description 1
- SEXRBCGSZRCIPE-LYSGOOTNSA-N Trp-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O SEXRBCGSZRCIPE-LYSGOOTNSA-N 0.000 description 1
- UPUNWAXSLPBMRK-XTWBLICNSA-N Trp-Thr-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UPUNWAXSLPBMRK-XTWBLICNSA-N 0.000 description 1
- IEESWNWYUOETOT-BVSLBCMMSA-N Trp-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1ccccc1)C(O)=O IEESWNWYUOETOT-BVSLBCMMSA-N 0.000 description 1
- BABINGWMZBWXIX-BPUTZDHNSA-N Trp-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BABINGWMZBWXIX-BPUTZDHNSA-N 0.000 description 1
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 1
- NIHNMOSRSAYZIT-BPNCWPANSA-N Tyr-Ala-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NIHNMOSRSAYZIT-BPNCWPANSA-N 0.000 description 1
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 1
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 1
- CRWOSTCODDFEKZ-HRCADAONSA-N Tyr-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CRWOSTCODDFEKZ-HRCADAONSA-N 0.000 description 1
- DYEGCOJHFNJBKB-UFYCRDLUSA-N Tyr-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 DYEGCOJHFNJBKB-UFYCRDLUSA-N 0.000 description 1
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 1
- OEVJGIHPQOXYFE-SRVKXCTJSA-N Tyr-Asn-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OEVJGIHPQOXYFE-SRVKXCTJSA-N 0.000 description 1
- BVWADTBVGZHSLW-IHRRRGAJSA-N Tyr-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BVWADTBVGZHSLW-IHRRRGAJSA-N 0.000 description 1
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 1
- QNJYPWZACBACER-KKUMJFAQSA-N Tyr-Asp-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O QNJYPWZACBACER-KKUMJFAQSA-N 0.000 description 1
- NGALWFGCOMHUSN-AVGNSLFASA-N Tyr-Gln-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NGALWFGCOMHUSN-AVGNSLFASA-N 0.000 description 1
- HZZKQZDUIKVFDZ-AVGNSLFASA-N Tyr-Gln-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)O HZZKQZDUIKVFDZ-AVGNSLFASA-N 0.000 description 1
- KEHKBBUYZWAMHL-DZKIICNBSA-N Tyr-Gln-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O KEHKBBUYZWAMHL-DZKIICNBSA-N 0.000 description 1
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 1
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 1
- HDSKHCBAVVWPCQ-FHWLQOOXSA-N Tyr-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HDSKHCBAVVWPCQ-FHWLQOOXSA-N 0.000 description 1
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 1
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 1
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 1
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 1
- OHNXAUCZVWGTLL-KKUMJFAQSA-N Tyr-His-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CS)C(=O)O)N)O OHNXAUCZVWGTLL-KKUMJFAQSA-N 0.000 description 1
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 1
- AZZLDIDWPZLCCW-ZEWNOJEFSA-N Tyr-Ile-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O AZZLDIDWPZLCCW-ZEWNOJEFSA-N 0.000 description 1
- OHOVFPKXPZODHS-SJWGOKEGSA-N Tyr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OHOVFPKXPZODHS-SJWGOKEGSA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 1
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 1
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 1
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- BJCILVZEZRDIDR-PMVMPFDFSA-N Tyr-Leu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 BJCILVZEZRDIDR-PMVMPFDFSA-N 0.000 description 1
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 1
- VUVVMFSDLYKHPA-PMVMPFDFSA-N Tyr-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC3=CC=C(C=C3)O)N VUVVMFSDLYKHPA-PMVMPFDFSA-N 0.000 description 1
- WTTRJMAZPDHPGS-KKXDTOCCSA-N Tyr-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(O)=O WTTRJMAZPDHPGS-KKXDTOCCSA-N 0.000 description 1
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 1
- PHKQVWWHRYUCJL-HJOGWXRNSA-N Tyr-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PHKQVWWHRYUCJL-HJOGWXRNSA-N 0.000 description 1
- QKXAEWMHAAVVGS-KKUMJFAQSA-N Tyr-Pro-Glu Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O QKXAEWMHAAVVGS-KKUMJFAQSA-N 0.000 description 1
- VXFXIBCCVLJCJT-JYJNAYRXSA-N Tyr-Pro-Pro Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N1CCC[C@H]1C(O)=O VXFXIBCCVLJCJT-JYJNAYRXSA-N 0.000 description 1
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 1
- XYBNMHRFAUKPAW-IHRRRGAJSA-N Tyr-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XYBNMHRFAUKPAW-IHRRRGAJSA-N 0.000 description 1
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 1
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 1
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 1
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 1
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 1
- KLQPIEVIKOQRAW-IZPVPAKOSA-N Tyr-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KLQPIEVIKOQRAW-IZPVPAKOSA-N 0.000 description 1
- AKRHKDCELJLTMD-BVSLBCMMSA-N Tyr-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N AKRHKDCELJLTMD-BVSLBCMMSA-N 0.000 description 1
- GZWPQZDVTBZVEP-BZSNNMDCSA-N Tyr-Tyr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O GZWPQZDVTBZVEP-BZSNNMDCSA-N 0.000 description 1
- KHPLUFDSWGDRHD-SLFFLAALSA-N Tyr-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O KHPLUFDSWGDRHD-SLFFLAALSA-N 0.000 description 1
- KSGKJSFPWSMJHK-JNPHEJMOSA-N Tyr-Tyr-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSGKJSFPWSMJHK-JNPHEJMOSA-N 0.000 description 1
- ABSXSJZNRAQDDI-KJEVXHAQSA-N Tyr-Val-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ABSXSJZNRAQDDI-KJEVXHAQSA-N 0.000 description 1
- LFTYTUAZOPRMMI-CFRASDGPSA-N UDP-N-acetyl-alpha-D-glucosamine Chemical compound O1[C@H](CO)[C@@H](O)[C@H](O)[C@@H](NC(=O)C)[C@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 LFTYTUAZOPRMMI-CFRASDGPSA-N 0.000 description 1
- 108010044965 UDP-N-acetylglucosamine-lysosomal-enzyme N-acetylglucosaminephosphotransferase Proteins 0.000 description 1
- LFTYTUAZOPRMMI-UHFFFAOYSA-N UNPD164450 Natural products O1C(CO)C(O)C(O)C(NC(=O)C)C1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 LFTYTUAZOPRMMI-UHFFFAOYSA-N 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- 108010005705 Ubiquitinated Proteins Proteins 0.000 description 1
- 241001125316 Ureaplasma sp. Species 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 1
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 1
- NMPXRFYMZDIBRF-ZOBUZTSGSA-N Val-Asn-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NMPXRFYMZDIBRF-ZOBUZTSGSA-N 0.000 description 1
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- KXUKIBHIVRYOIP-ZKWXMUAHSA-N Val-Asp-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KXUKIBHIVRYOIP-ZKWXMUAHSA-N 0.000 description 1
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 1
- IRLYZKKNBFPQBW-XGEHTFHBSA-N Val-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N)O IRLYZKKNBFPQBW-XGEHTFHBSA-N 0.000 description 1
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 1
- AAOPYWQQBXHINJ-DZKIICNBSA-N Val-Gln-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AAOPYWQQBXHINJ-DZKIICNBSA-N 0.000 description 1
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 1
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- WJVLTYSHNXRCLT-NHCYSSNCSA-N Val-His-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WJVLTYSHNXRCLT-NHCYSSNCSA-N 0.000 description 1
- SDSCOOZQQGUQFC-GVXVVHGQSA-N Val-His-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N SDSCOOZQQGUQFC-GVXVVHGQSA-N 0.000 description 1
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 1
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- WDIWOIRFNMLNKO-ULQDDVLXSA-N Val-Leu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WDIWOIRFNMLNKO-ULQDDVLXSA-N 0.000 description 1
- JKHXYJKMNSSFFL-IUCAKERBSA-N Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN JKHXYJKMNSSFFL-IUCAKERBSA-N 0.000 description 1
- MLADEWAIYAPAAU-IHRRRGAJSA-N Val-Lys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MLADEWAIYAPAAU-IHRRRGAJSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 1
- YQMILNREHKTFBS-IHRRRGAJSA-N Val-Phe-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YQMILNREHKTFBS-IHRRRGAJSA-N 0.000 description 1
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 1
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- NSUUANXHLKKHQB-BZSNNMDCSA-N Val-Pro-Trp Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC2=CC=CC=C12 NSUUANXHLKKHQB-BZSNNMDCSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 1
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 1
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 1
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 1
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 1
- QTXGUIMEHKCPBH-FHWLQOOXSA-N Val-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 QTXGUIMEHKCPBH-FHWLQOOXSA-N 0.000 description 1
- VBTFUDNTMCHPII-UHFFFAOYSA-N Val-Trp-Tyr Natural products C=1NC2=CC=CC=C2C=1CC(NC(=O)C(N)C(C)C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 VBTFUDNTMCHPII-UHFFFAOYSA-N 0.000 description 1
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 1
- PGBMPFKFKXYROZ-UFYCRDLUSA-N Val-Tyr-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N PGBMPFKFKXYROZ-UFYCRDLUSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 1
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 1
- 108010051583 Ventricular Myosins Proteins 0.000 description 1
- 241000607284 Vibrio sp. Species 0.000 description 1
- 241000604955 Wolbachia sp. Species 0.000 description 1
- 241000605941 Wolinella Species 0.000 description 1
- 241001148118 Xanthomonas sp. Species 0.000 description 1
- 241000269370 Xenopus <genus> Species 0.000 description 1
- 241000235015 Yarrowia lipolytica Species 0.000 description 1
- 241000131891 Yersinia sp. Species 0.000 description 1
- 102100024672 Zinc finger protein 35 Human genes 0.000 description 1
- 241000588902 Zymomonas mobilis Species 0.000 description 1
- ATBOMIWRCZXYSZ-XZBBILGWSA-N [1-[2,3-dihydroxypropoxy(hydroxy)phosphoryl]oxy-3-hexadecanoyloxypropan-2-yl] (9e,12e)-octadeca-9,12-dienoate Chemical compound CCCCCCCCCCCCCCCC(=O)OCC(COP(O)(=O)OCC(O)CO)OC(=O)CCCCCCC\C=C\C\C=C\CCCCC ATBOMIWRCZXYSZ-XZBBILGWSA-N 0.000 description 1
- 239000003070 absorption delaying agent Substances 0.000 description 1
- DPXJVFZANSGRMM-UHFFFAOYSA-N acetic acid;2,3,4,5,6-pentahydroxyhexanal;sodium Chemical compound [Na].CC(O)=O.OCC(O)C(O)C(O)C(O)C=O DPXJVFZANSGRMM-UHFFFAOYSA-N 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 229960004150 aciclovir Drugs 0.000 description 1
- MKUXAQIIEYXACX-UHFFFAOYSA-N aciclovir Chemical compound N1C(N)=NC(=O)C2=C1N(COCCO)C=N2 MKUXAQIIEYXACX-UHFFFAOYSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 239000013543 active substance Substances 0.000 description 1
- 238000001261 affinity purification Methods 0.000 description 1
- 239000002386 air freshener Substances 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 150000001298 alcohols Chemical class 0.000 description 1
- 229940053991 aldehydes and derivative Drugs 0.000 description 1
- 235000010443 alginic acid Nutrition 0.000 description 1
- 239000000783 alginic acid Substances 0.000 description 1
- 229920000615 alginic acid Polymers 0.000 description 1
- 229960001126 alginic acid Drugs 0.000 description 1
- 150000004781 alginic acids Chemical class 0.000 description 1
- 150000001338 aliphatic hydrocarbons Chemical class 0.000 description 1
- 229940087168 alpha tocopherol Drugs 0.000 description 1
- AWUCVROLDVIAJX-UHFFFAOYSA-N alpha-glycerophosphate Natural products OCC(O)COP(O)(O)=O AWUCVROLDVIAJX-UHFFFAOYSA-N 0.000 description 1
- BJEPYKJPYRNKOW-UHFFFAOYSA-N alpha-hydroxysuccinic acid Natural products OC(=O)C(O)CC(O)=O BJEPYKJPYRNKOW-UHFFFAOYSA-N 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 150000001412 amines Chemical class 0.000 description 1
- 150000001414 amino alcohols Chemical class 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 229940035676 analgesics Drugs 0.000 description 1
- 239000012491 analyte Substances 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 150000001450 anions Chemical class 0.000 description 1
- 239000000730 antalgic agent Substances 0.000 description 1
- 229940121363 anti-inflammatory agent Drugs 0.000 description 1
- 239000002260 anti-inflammatory agent Substances 0.000 description 1
- 230000003110 anti-inflammatory effect Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 230000002421 anti-septic effect Effects 0.000 description 1
- 230000001640 apoptogenic effect Effects 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 1
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 239000000823 artificial membrane Substances 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 210000003567 ascitic fluid Anatomy 0.000 description 1
- 239000012131 assay buffer Substances 0.000 description 1
- 210000001130 astrocyte Anatomy 0.000 description 1
- DMLAVOWQYNRWNQ-UHFFFAOYSA-N azobenzene Chemical compound C1=CC=CC=C1N=NC1=CC=CC=C1 DMLAVOWQYNRWNQ-UHFFFAOYSA-N 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 244000052616 bacterial pathogen Species 0.000 description 1
- 229940092524 bartonella henselae Drugs 0.000 description 1
- SRSXLGNVWSONIS-UHFFFAOYSA-N benzenesulfonic acid Chemical compound OS(=O)(=O)C1=CC=CC=C1 SRSXLGNVWSONIS-UHFFFAOYSA-N 0.000 description 1
- 229940092714 benzenesulfonic acid Drugs 0.000 description 1
- 235000010233 benzoic acid Nutrition 0.000 description 1
- 235000019445 benzyl alcohol Nutrition 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 102000005936 beta-Galactosidase Human genes 0.000 description 1
- DLRVVLDZNNYCBX-ZZFZYMBESA-N beta-melibiose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1OC[C@@H]1[C@@H](O)[C@H](O)[C@@H](O)[C@H](O)O1 DLRVVLDZNNYCBX-ZZFZYMBESA-N 0.000 description 1
- 239000003012 bilayer membrane Substances 0.000 description 1
- 239000011230 binding agent Substances 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 238000010170 biological method Methods 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- 230000006287 biotinylation Effects 0.000 description 1
- 238000007413 biotinylation Methods 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 239000010839 body fluid Substances 0.000 description 1
- 230000037396 body weight Effects 0.000 description 1
- 210000002449 bone cell Anatomy 0.000 description 1
- 210000001185 bone marrow Anatomy 0.000 description 1
- KDYFGRWQOYBRFD-NUQCWPJISA-N butanedioic acid Chemical compound O[14C](=O)CC[14C](O)=O KDYFGRWQOYBRFD-NUQCWPJISA-N 0.000 description 1
- 235000019282 butylated hydroxyanisole Nutrition 0.000 description 1
- 230000036952 cancer formation Effects 0.000 description 1
- 229940095731 candida albicans Drugs 0.000 description 1
- 239000007894 caplet Substances 0.000 description 1
- 239000002775 capsule Substances 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 239000001768 carboxy methyl cellulose Substances 0.000 description 1
- 231100000504 carcinogenesis Toxicity 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 239000012876 carrier material Substances 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 229920002301 cellulose acetate Polymers 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- 229960004926 chlorobutanol Drugs 0.000 description 1
- 235000012000 cholesterol Nutrition 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 229940110456 cocoa butter Drugs 0.000 description 1
- 235000019868 cocoa butter Nutrition 0.000 description 1
- 238000001246 colloidal dispersion Methods 0.000 description 1
- 239000000084 colloidal system Substances 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 239000000356 contaminant Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 235000005687 corn oil Nutrition 0.000 description 1
- 239000002285 corn oil Substances 0.000 description 1
- 239000008120 corn starch Substances 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 235000012343 cottonseed oil Nutrition 0.000 description 1
- 239000002385 cottonseed oil Substances 0.000 description 1
- 239000006071 cream Substances 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 108010069495 cysteinyltyrosine Proteins 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 108010000742 dTMP kinase Proteins 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 230000002074 deregulated effect Effects 0.000 description 1
- 210000005045 desmin Anatomy 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 229940093541 dicetylphosphate Drugs 0.000 description 1
- 235000014113 dietary fatty acids Nutrition 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- UGMCXQCYOVCMTB-UHFFFAOYSA-K dihydroxy(stearato)aluminium Chemical compound CCCCCCCCCCCCCCCCCC(=O)O[Al](O)O UGMCXQCYOVCMTB-UHFFFAOYSA-K 0.000 description 1
- BPHQZTVXXXJVHI-UHFFFAOYSA-N dimyristoyl phosphatidylglycerol Chemical compound CCCCCCCCCCCCCC(=O)OCC(COP(O)(=O)OCC(O)CO)OC(=O)CCCCCCCCCCCCC BPHQZTVXXXJVHI-UHFFFAOYSA-N 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 239000007884 disintegrant Substances 0.000 description 1
- 239000002270 dispersing agent Substances 0.000 description 1
- 239000002612 dispersion medium Substances 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 230000004064 dysfunction Effects 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 235000019325 ethyl cellulose Nutrition 0.000 description 1
- 229920001249 ethyl cellulose Polymers 0.000 description 1
- LVGKNOAMLMIIKO-QXMHVHEDSA-N ethyl oleate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OCC LVGKNOAMLMIIKO-QXMHVHEDSA-N 0.000 description 1
- 229940093471 ethyl oleate Drugs 0.000 description 1
- 230000029142 excretion Effects 0.000 description 1
- 239000000194 fatty acid Substances 0.000 description 1
- 229930195729 fatty acid Natural products 0.000 description 1
- 150000004665 fatty acids Chemical class 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 239000000945 filler Substances 0.000 description 1
- 239000000796 flavoring agent Substances 0.000 description 1
- XRECTZIEBJDKEO-UHFFFAOYSA-N flucytosine Chemical compound NC1=NC(=O)NC=C1F XRECTZIEBJDKEO-UHFFFAOYSA-N 0.000 description 1
- 229960004413 flucytosine Drugs 0.000 description 1
- 235000013355 food flavoring agent Nutrition 0.000 description 1
- 235000003599 food sweetener Nutrition 0.000 description 1
- 235000019253 formic acid Nutrition 0.000 description 1
- 150000004674 formic acids Chemical class 0.000 description 1
- 101150022753 galc gene Proteins 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- IRSCQMHQWWYFCW-UHFFFAOYSA-N ganciclovir Chemical compound O=C1NC(N)=NC2=C1N=CN2COC(CO)CO IRSCQMHQWWYFCW-UHFFFAOYSA-N 0.000 description 1
- 229960002963 ganciclovir Drugs 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 239000007897 gelcap Substances 0.000 description 1
- 239000003349 gelling agent Substances 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 210000005046 glial fibrillary acidic protein Anatomy 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- 229940096919 glycogen Drugs 0.000 description 1
- 150000002334 glycols Chemical class 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 239000008187 granular material Substances 0.000 description 1
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
- A61K48/005—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'active' part of the composition delivered, i.e. the nucleic acid delivered
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P3/00—Drugs for disorders of the metabolism
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
- C12N9/1288—Transferases for other substituted phosphate groups (2.7.8)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
- C12N9/2465—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1) acting on alpha-galactose-glycoside bonds, e.g. alpha-galactosidase (3.2.1.22)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
- C12N9/2477—Hemicellulases not provided in a preceding group
- C12N9/2488—Mannanases
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y207/00—Transferases transferring phosphorus-containing groups (2.7)
- C12Y207/08—Transferases for other substituted phosphate groups (2.7.8)
- C12Y207/08017—UDP-N-acetylglucosamine--lysosomal-enzyme N-acetylglucosaminephosphotransferase (2.7.8.17)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y302/00—Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
- C12Y302/01—Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
- C12Y302/0102—Alpha-glucosidase (3.2.1.20)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y302/00—Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
- C12Y302/01—Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
- C12Y302/01022—Alpha-galactosidase (3.2.1.22)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y302/00—Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
- C12Y302/01—Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
- C12Y302/01024—Alpha-mannosidase (3.2.1.24)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y302/00—Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
- C12Y302/01—Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
- C12Y302/01045—Glucosylceramidase (3.2.1.45), i.e. beta-glucocerebrosidase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y302/00—Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
- C12Y302/01—Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
- C12Y302/01046—Galactosylceramidase (3.2.1.46)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y302/00—Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
- C12Y302/01—Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
- C12Y302/0105—Alpha-N-acetylglucosaminidase (3.2.1.50)
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2217/00—Genetically modified animals
- A01K2217/07—Animals genetically altered by homologous recombination
- A01K2217/075—Animals genetically altered by homologous recombination inducing loss of function, i.e. knock out
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2227/00—Animals characterised by species
- A01K2227/10—Mammal
- A01K2227/105—Murine
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2267/00—Animals characterised by purpose
- A01K2267/03—Animal model, e.g. for test or diseases
- A01K2267/035—Animal model for multifactorial diseases
- A01K2267/0362—Animal model for lipid/glucose metabolism, e.g. obesity, type-2 diabetes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2750/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
- C12N2750/00011—Details
- C12N2750/14011—Parvoviridae
- C12N2750/14111—Dependovirus, e.g. adenoassociated viruses
- C12N2750/14141—Use of virus, viral particle or viral elements as a vector
- C12N2750/14143—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/10—Plasmid DNA
- C12N2800/106—Plasmid DNA for vertebrates
- C12N2800/107—Plasmid DNA for vertebrates for mammalian
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/20—Vector systems having a special element relevant for transcription transcription of more than one cistron
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2840/00—Vectors comprising a special translation-regulating system
- C12N2840/20—Vectors comprising a special translation-regulating system translation of more than one cistron
- C12N2840/203—Vectors comprising a special translation-regulating system translation of more than one cistron having an IRES
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y302/00—Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
- C12Y302/01—Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
- C12Y302/01021—Beta-glucosidase (3.2.1.21)
Abstract
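Provided herein are compositions and methods that use a bicistronic vector to treat or prevent a lysosomal storage disorder (LSD) in a subject. The disclosed compositions comprise a bicistronic vector comprising a promoter, an internal ribosome entry site (IRES), a polynucleotide encoding a lysosomal enzyme, and a polynucleotide encoding a modified GlcNAc-1 phosphotransferase (GlcNAc-1 PTase). The methods comprise administering to the subject a pharmaceutical composition comprising a bicistronic vector disclosed herein.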
Description
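Related Applications
This application claims the benefit of U.S. provisional application USSN 62/869,781, filed July 2, 2019, and USSN 62/869,808, filed July 2, 2019, the contents of which are incorporated herein by reference in their entirety.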
Incorporation of Sequence Listing
The contents of the text file named "M6PT-002/01WO_SeqList.txt", created on July 1, 2020 and 611 KB in size, are incorporated by reference in their entirety.
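Technical Field
The disclosure relates to compositions and methods for treating lysosomal storage disorders. More specifically, the disclosure relates to the field of treating lysosomal disorders using improved gene therapy and improved enzyme replacement therapy (ERT).
Lysosomal storage disorders (LSDs) are inherited metabolic disorders that result from defects in lysosomal function. About 50 distinct LSDs have been identified to date, but treatments are reported for only a few of them (fewer than 10). Thus, there is an unmet need in the art for safe and effective treatments for LSDs. The present disclosure provides two solutions to this unmet need, through either enzyme replacement therapy (ERT) or gene therapy.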
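The present disclosure provides a composition comprising a vector that comprises a sequence encoding a promoter, a first polynucleotide sequence encoding a lysosomal enzyme, and a second polynucleotide sequence encoding a modified N-acetylglucosamine-1-phosphotransferase (GlcNAc-1 PTase, PTase), wherein the promoter is capable of driving expression in a mammalian cell and is operably linked to the first polynucleotide and the second polynucleotide.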
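In some embodiments of the compositions of the disclosure, the vector further comprises a sequence encoding an internal ribosome entry site (IRES). In some embodiments, the sequence encoding the IRES is located between the sequence encoding the lysosomal enzyme and the sequence encoding the modified GlcNAc-1 PTase. In some embodiments, the vector comprises, from 5' to 3', the sequence encoding the modified GlcNAc-1 PTase, the sequence encoding the IRES, and the sequence encoding the lysosomal enzyme. In some embodiments, the vector comprises, from 5' to 3', the sequence encoding the lysosomal enzyme, the sequence encoding the IRES, and the sequence encoding the modified GlcNAc-1 PTase.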
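The two 5'-to-3' arrangements described above can be sketched as a small data model. This is an illustrative sketch only, not part of the disclosure: the names `Element`, `bicistronic_cassette`, and `describe` are hypothetical, and placing the promoter at the 5' end of both coding sequences is an assumption made for illustration (the text states only that the promoter is operably linked to both polynucleotides).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Element:
    kind: str   # "promoter", "cds" (coding sequence), or "ires"
    label: str  # human-readable name of the element


def bicistronic_cassette(enzyme, ptase_first=False):
    """Return the cassette elements in 5'-to-3' order.

    Assumption (illustrative): the promoter sits 5' of both coding sequences.
    ptase_first selects which of the two described orientations is built.
    """
    promoter = Element("promoter", "promoter")
    ires = Element("ires", "IRES")
    enzyme_cds = Element("cds", enzyme)
    ptase_cds = Element("cds", "modified GlcNAc-1 PTase")
    if ptase_first:
        first, second = ptase_cds, enzyme_cds
    else:
        first, second = enzyme_cds, ptase_cds
    # The IRES always lies between the two coding sequences.
    return [promoter, first, ires, second]


def describe(cassette):
    """Render a cassette as a 5'-to-3' string."""
    return "5'-" + " - ".join(e.label for e in cassette) + "-3'"


print(describe(bicistronic_cassette("GBA")))
print(describe(bicistronic_cassette("GBA", ptase_first=True)))
```

In both orientations the IRES occupies the position between the two coding sequences; only the order of the lysosomal enzyme and the modified GlcNAc-1 PTase changes.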
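In some embodiments of the compositions of the disclosure, the vector further comprises a sequence encoding a cleavage site. In some embodiments, the cleavage site comprises a sequence encoding a 2A self-cleaving peptide.
In some embodiments of the compositions of the disclosure, the vector is an expression vector. In some embodiments, the expression vector comprises a plasmid.
In some embodiments of the compositions of the disclosure, the vector is a delivery vector. In some embodiments, the delivery vector comprises a viral vector. In some embodiments, the viral vector comprises an AAV vector or a lentiviral vector. In some embodiments, the AAV vector comprises a sequence isolated or derived from an AAV of serotype AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, or AAV9. In some embodiments, the delivery vector comprises a non-viral vector. In some embodiments, the non-viral vector comprises a liposome, a lipid nanoparticle (LNP), a micelle, a polymersome, a nanoparticle, a polymeric nanoparticle, or an exosome.
In some embodiments of the compositions of the disclosure, the vector is a viral vector. In some embodiments, the viral vector comprises an AAV vector or a lentiviral vector. In some embodiments, the AAV vector comprises a sequence isolated or derived from an AAV of serotype AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, or AAV9. In some embodiments of the compositions of the disclosure, the vector is a non-viral vector. In some embodiments, the non-viral vector comprises a liposome, a lipid nanoparticle (LNP), a micelle, a polymersome, a nanoparticle, a polymeric nanoparticle, or an exosome.
In some embodiments of the compositions of the disclosure, the vector is a viral vector. In some embodiments, the vector is a lentiviral vector. In some embodiments, the vector is an adenoviral vector or an adeno-associated virus (AAV) vector. In some embodiments, the AAV vector comprises a serotype selected from the group consisting of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, and AAV9. In some embodiments, the AAV vector comprises a sequence encoding a capsid isolated or derived from one or more serotypes selected from the group consisting of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, and AAV9. In some embodiments, the AAV vector comprises a sequence encoding at least one inverted terminal repeat (ITR) isolated or derived from one or more serotypes selected from the group consisting of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, and AAV9.
In some embodiments of the compositions of the disclosure, the vector is a bicistronic vector.
In some embodiments of the compositions of the disclosure, the vector is a multicistronic vector.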
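In some embodiments of the compositions of the disclosure, the promoter comprises a ubiquitous promoter. In some embodiments, the promoter is capable of driving expression in a mammalian cell. In some embodiments, the promoter is capable of driving expression in a human cell.
In some embodiments of the compositions of the disclosure, the promoter comprises a cell type-specific promoter. In some embodiments, the promoter is capable of driving expression in a mammalian cell. In some embodiments, the promoter is capable of driving expression in a human cell. In some embodiments, the promoter is capable of driving expression in a neural cell, including but not limited to a neuron or a glial cell. In some embodiments, the promoter is capable of driving expression in a muscle cell, including but not limited to a smooth muscle cell, a striated muscle cell, or a cardiac muscle cell. In some embodiments, the promoter is capable of driving expression in a lung cell. In some embodiments, the promoter is capable of driving expression in a bone cell. In some embodiments, the promoter is capable of driving expression in a blood cell, including but not limited to a red blood cell, a white blood cell, a progenitor cell thereof, or a hematopoietic stem cell. In some embodiments, the promoter is capable of driving expression in an immune cell, including but not limited to a T cell, a B cell, or a macrophage. In some embodiments, the promoter is capable of driving expression in a cell of the spleen or pancreas. In some embodiments, the promoter is capable of driving expression in a kidney cell.
In some embodiments of the compositions of the disclosure, the promoter is a human T-lymphotropic virus type I (HTLV-I) promoter.
In some embodiments of the compositions of the disclosure, the promoter is a CBh promoter. In some embodiments, the CBh promoter comprises a CMV early enhancer fused to a modified chicken β-actin promoter.
In some embodiments of the compositions of the disclosure, the promoter is a CEF or hCEFI promoter. In some embodiments, the hCEFI promoter comprises a human CMV enhancer operably linked to a human EF1a promoter. In some embodiments, the hCEFI promoter comprises the sequence of SEQ ID NO: 161.
In some embodiments of the compositions of the disclosure, the promoter comprises a constitutive promoter. In some embodiments, the constitutive promoter comprises a cytomegalovirus (CMV) promoter.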
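In some embodiments of the compositions of the disclosure, the vector comprises the nucleic acid sequence of SEQ ID NO: 1.
In some embodiments of the compositions of the disclosure, the polynucleotide encoding the modified GlcNAc-1 PTase comprises the nucleic acid sequence of SEQ ID NO: 4.
In some embodiments of the compositions of the disclosure, the lysosomal enzyme is involved in at least one lysosomal storage disorder (LSD) listed in Table 1A, Table 1B, or Table 1C.
In some embodiments of the compositions of the disclosure, the lysosomal enzyme comprises at least one lysosomal enzyme listed in Table 1A, Table 1B, or Table 1C.
In some embodiments of the compositions of the disclosure, the lysosomal enzyme is selected from the group consisting of β-glucocerebrosidase (GCase/GBA, encoded by the GBA gene), galactosylceramidase (GALC), α-galactosidase (encoded by the GLA gene), α-N-acetylglucosaminidase (NAGLU), acid α-glucosidase (GAA), and lysosomal acid α-mannosidase (LAMAN).
In some embodiments of the compositions of the disclosure, the lysosomal enzyme comprises β-glucocerebrosidase (GCase/GBA). In some embodiments, the polynucleotide encoding the lysosomal enzyme comprises the nucleic acid sequence of SEQ ID NO: 5.
In some embodiments of the compositions of the disclosure, the lysosomal enzyme comprises galactosylceramidase (GALC). In some embodiments, the polynucleotide encoding the lysosomal enzyme comprises the nucleic acid sequence of SEQ ID NO: 6. In some embodiments, the polynucleotide encoding the lysosomal enzyme comprises the nucleic acid sequence of SEQ ID NO: 23.
In some embodiments of the compositions of the disclosure, the lysosomal enzyme comprises α-galactosidase (GLA). In some embodiments, the polynucleotide encoding the lysosomal enzyme comprises the nucleic acid sequence of SEQ ID NO: 7.
In some embodiments of the compositions of the disclosure, the lysosomal enzyme comprises α-N-acetylglucosaminidase (NAGLU). In some embodiments, the polynucleotide encoding the lysosomal enzyme comprises the nucleic acid sequence of SEQ ID NO: 8.
In some embodiments of the compositions of the disclosure, the lysosomal enzyme comprises acid α-glucosidase (GAA). In some embodiments, the polynucleotide encoding the lysosomal enzyme comprises the nucleic acid sequence of SEQ ID NO: 9.
In some embodiments of the compositions of the disclosure, the lysosomal enzyme comprises lysosomal acid α-mannosidase (LAMAN). In some embodiments, the polynucleotide encoding the lysosomal enzyme comprises the nucleic acid sequence of SEQ ID NO: 10.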
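The disclosure provides a method of treating a lysosomal storage disorder (LSD), the method comprising administering to a subject an effective amount of a composition of the disclosure, wherein the composition increases phosphorylation of a lysosomal enzyme involved in the LSD, thereby treating the LSD. The disclosure provides a method of treating a lysosomal storage disorder (LSD), the method comprising administering to a subject an effective amount of a composition of the disclosure, wherein the composition increases N-linked oligosaccharide phosphorylation of a lysosomal enzyme involved in the LSD, thereby treating the LSD. In some embodiments, the subject presents a sign or symptom of the LSD. In some embodiments, the subject has been diagnosed with the LSD.
The disclosure provides a method of preventing the development or onset of a lysosomal storage disorder (LSD), the method comprising administering to a subject an effective amount of a composition of the disclosure, wherein the composition increases phosphorylation of a lysosomal enzyme involved in the LSD, thereby preventing the development of the LSD in the subject. In some embodiments, the subject is at risk of developing the LSD or of its onset. In some embodiments, the subject presents a sign or symptom of the LSD.
The disclosure provides a method of improving phosphorylation of a lysosomal enzyme involved in a lysosomal storage disorder (LSD), the method comprising administering to a subject an effective amount of a composition of the disclosure, wherein the composition increases phosphorylation of the lysosomal enzyme. In some embodiments, the subject presents a sign or symptom of the LSD. In some embodiments, the subject is at risk of developing the LSD or of its onset. In some embodiments, the subject has been diagnosed with the LSD.
The disclosure provides a method of improving phosphorylation of a lysosomal enzyme involved in a lysosomal storage disorder (LSD), the method comprising contacting a cell with an effective amount of a composition of the disclosure, wherein the composition increases phosphorylation of the lysosomal enzyme. In some embodiments, the cell is in vitro or ex vivo. In some embodiments, the cell is in vivo. In some embodiments, a subject comprises the cell. In some embodiments, the subject presents a sign or symptom of the LSD. In some embodiments, the subject is at risk of developing the LSD or of its onset. In some embodiments, the subject has been diagnosed with the LSD.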
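In some embodiments of the methods of the disclosure, the lysosomal enzyme is involved in at least one lysosomal storage disorder (LSD) listed in Table 1A, Table 1B or Table 1C.
In some embodiments of the methods of the disclosure, the lysosomal enzyme is at least one lysosomal enzyme listed in Table 1A, Table 1B or Table 1C.
In some embodiments of the methods of the disclosure, the lysosomal enzyme comprises one or more of β-glucocerebrosidase (GCase/GBA), galactosylceramidase (GALC), α-galactosidase (GLA), α-N-acetylglucosaminidase (NAGLU), acid α-glucosidase (GAA) and lysosomal acid α-mannosidase (LAMAN).
In some embodiments of the methods of the disclosure, the administering comprises a systemic route of administration. In some embodiments, the systemic route of administration is enteral, parenteral, oral, intramuscular (IM), subcutaneous (SC), intravenous (IV), intra-arterial (IA), intraspinal, intraventricular, intrathoracic or intracerebroventricular.
In some embodiments of the methods of the disclosure, the administering comprises a local route of administration.
In some embodiments of the methods of the disclosure, the subject is a human. In some embodiments, the subject is male. In some embodiments, the subject is female.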
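For the purpose of illustrating the disclosure, certain embodiments of the disclosure are depicted in the drawings. However, the disclosure is not limited to the precise arrangements and instrumentalities of the embodiments depicted in the drawings.
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
FIGS. 1A-1C are a series of diagrams and a graph depicting the S1-S3 bicistronic vector. FIG. 1A: CMV-S1S3 vector. FIG. 1B: pLL01: pCMV-MCS-IRES-S1S3 vector. FIG. 1C: Graph showing the expression levels of CMV-S1S3 and pLL01 (CPM: counts per minute).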
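FIGS. 2A-2C are a series of diagrams and histograms depicting the generation of a GBA bicistronic expression plasmid in the S1-S3 bicistronic vector. FIG. 2A: pLL11: pCMV-hGBA-IRES-S1S3 vector. FIG. 2B: GBA activity in conditioned medium. FIG. 2C: Histogram showing the percentage of PTase activity.
FIGS. 3A-3C are a series of graphs and histograms showing that bicistronic expression increases phosphorylation of the GBA enzyme.
FIGS. 4A-4D are a series of diagrams, graphs and histograms showing that bicistronic expression increases phosphorylation of the GAA enzyme.
FIGS. 5A-5D are a series of diagrams, graphs and histograms showing that bicistronic expression increases phosphorylation of the GALC enzyme.
FIGS. 6A-6D are a series of diagrams, graphs and histograms showing that bicistronic expression increases phosphorylation of the NAGLU enzyme.
FIGS. 7A-7D are a series of diagrams, graphs and histograms showing that bicistronic expression increases phosphorylation of the GLA enzyme.
FIGS. 8A-8D are a series of diagrams, graphs and histograms showing that bicistronic expression increases phosphorylation of the LAMAN enzyme.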
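FIGS. 9A-9E are a series of graphs demonstrating that the S1-S3 PTase bicistronic vector of the disclosure significantly increases CI-MPR binding of the GBA enzyme and its cellular uptake for the treatment of Gaucher disease (A-C). Panels D and E demonstrate that a single point mutation in the GBA enzyme increases its stability but does not affect its binding to CI-MPR.
FIGS. 10A-10C are a series of graphs demonstrating that the S1-S3 PTase bicistronic vector of the disclosure significantly increases CI-MPR binding of the GAA enzyme and its cellular uptake for the treatment of Pompe disease.
FIGS. 11A-11C are a series of graphs demonstrating that the S1-S3 PTase bicistronic vector of the disclosure significantly increases CI-MPR binding of the GALC enzyme and its cellular uptake for the treatment of Krabbe disease.
FIGS. 12A-12C are a series of graphs demonstrating that the S1-S3 PTase bicistronic vector of the disclosure significantly increases CI-MPR binding of the NAGLU enzyme and its cellular uptake for the treatment of MPS IIIB disease.
FIGS. 13A-13C are a series of graphs demonstrating that the S1-S3 PTase bicistronic vector of the disclosure significantly increases CI-MPR binding of the GLA enzyme and its cellular uptake for the treatment of Fabry disease.
FIGS. 14A-14C are a series of graphs demonstrating that the S1-S3 PTase bicistronic vector of the disclosure significantly increases CI-MPR binding of the LAMAN enzyme and its cellular uptake for the treatment of α-mannosidosis.
FIGS. 15A-15B are a schematic and a graph demonstrating that the S1-S3 PTase bicistronic vector of the disclosure, delivered by an AAV9 vector, can be used as a gene therapy for the treatment of mucolipidosis.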
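FIGS. 16A-16B are a pair of graphs depicting the elevated glucosylceramide levels observed in the liver, lung and spleen of 20-week-old Gaucher D409V/null mice. Accumulation of glucocerebroside (GC), the natural substrate of GBA, was measured in tissue homogenates. The accumulation of GC in the lung is a statistically and therapeutically valuable finding, as it is a known unmet need of the current standard of care. Glucosylceramide was extracted by taking a 20 μL aliquot of tissue homogenate and appropriate controls, adding 200 μL of methanol/ACN/H2O (v:v:v = 85:10:5), mixing at 800 rpm for 5 minutes, and then centrifuging at 3220 g and 4 °C for 15 minutes. 50 μL of the supernatant was recovered, dried under nitrogen, resuspended in methanol/ACN/H2O (v:v:v = 85:10:5), and injected directly for LC-MS/MS analysis.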
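FIGS. 17A-17C are a series of graphs demonstrating that GCaseM6P has a longer half-life and greater tissue uptake than imiglucerase in the GBA D409V/null mouse model. A PK/PD study in the Gaucher D409V/null mouse model was performed using the standard of care, imiglucerase, and purified GBA produced by transient co-expression in Expi293 cells using a bicistronic vector encoding S1-S3 PTase and a natural variant of GBA. This variant GCase has greater stability under neutral and slightly alkaline conditions. Briefly, three animals were given a tail-vein injection of about 1.5 mg/kg of recombinant GCase. For the serum pharmacokinetic data, plasma samples were collected at 2, 10, 20, 40 and 60 minutes. Activity was measured using the synthetic substrate 4-methylumbelliferyl-β-D-glucopyranoside (4MU-Glc). Activity was normalized within each animal by setting the 2-minute time point to 100% activity, with subsequent time points expressed as a percentage of the t = 2 minute value. The stabilized GCase expressed in the presence of S1-S3 PTase appears to have a longer half-life. This longer half-life reflects a combination of a more stable enzyme and a different clearance pathway. To measure the amount of GCase taken up by tissues 2 hours after enzyme injection, tissues were harvested and homogenized, and activity was measured using the 4MU-Glc substrate. Activity was normalized to total protein in the homogenate, as determined by the BCA method for protein determination. The true benefit of a stable GCase with proper phosphorylation is seen in the tissue uptake data shown. For all tissues evaluated, more activity was found with the stabilized GCase expressed using the bicistronic S1-S3 PTase vector platform. This is most dramatic in lung, muscle and brain, where imiglucerase has little activity. Taking the tissue and serum data together, the benefit of a more stable GCase with greater N-linked oligosaccharide phosphorylation is evident in that more enzyme is delivered to the affected tissues. This is the first time that a significant amount of GCase has been delivered to lung, muscle and heart at such a dose.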
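The per-animal normalization described for FIGS. 17A-17C (the 2-minute reading set to 100%, later readings expressed as a percentage of it) can be illustrated with a minimal Python sketch. The readings below are invented for illustration and are not data from this disclosure; the function names are hypothetical, and the half-life estimate assumes simple single-exponential clearance, which the text does not state.

```python
import math

def normalize_to_first(activities):
    """Express each reading as a percentage of the first (t = 2 min) reading."""
    baseline = activities[0]
    if baseline <= 0:
        raise ValueError("baseline activity must be positive")
    return [100.0 * a / baseline for a in activities]

def half_life(times_min, percent_remaining):
    """Estimate t1/2 from a least-squares line through ln(percent) vs. time.

    Assumes single-exponential decay (an illustrative simplification).
    """
    ys = [math.log(p) for p in percent_remaining]
    n = len(times_min)
    mean_t = sum(times_min) / n
    mean_y = sum(ys) / n
    sxx = sum((t - mean_t) ** 2 for t in times_min)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times_min, ys))
    slope = sxy / sxx
    if slope >= 0:
        raise ValueError("activity does not decay over the sampled interval")
    return math.log(2) / -slope

# Sampling times from the text; the activity values are hypothetical and
# constructed to halve every 10 minutes.
times = [2, 10, 20, 40, 60]
raw = [800.0 * 0.5 ** ((t - 2) / 10) for t in times]
pct = normalize_to_first(raw)
t_half = half_life(times, pct)
```

Normalizing within each animal before fitting removes differences in injected amount between animals, so the slope reflects clearance rather than dose.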
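FIGS. 18A-18E are a series of photographs and bar graphs demonstrating that GCaseM6P ERT reduced tissue macrophages (anti-CD68 staining) better than imiglucerase in the GBA D409V/null mouse model. An efficacy study in the D409V Gaucher mouse model was performed using the standard of care, Cerezyme, and purified GBA (M0111) transiently co-expressed in Expi293 cells using a bicistronic vector encoding S1S3 PTase and a natural variant of GBA reported to be more stable under neutral and slightly alkaline conditions. Gaucher mice about 20 weeks old were treated weekly with about 1.5 mg/kg of enzyme for 4 weeks. After 4 weeks, liver and lung tissues were harvested and fixed in 4% paraformaldehyde-PBS, pH 7.4, for immunohistochemistry using a CD68 antibody. M0111 has greater efficacy than the current standard of care, as evidenced by the reduction of macrophages in affected tissues visualized with the CD68 antibody.
FIGS. 19A-19C are a series of photographs demonstrating that GCaseM6P ERT reduced the number and size of Gaucher storage cells (hematoxylin and eosin (H&E) staining) better than imiglucerase in the GBA D409V/null mouse model. An efficacy study in the D409V Gaucher mouse model was performed using the standard of care, Cerezyme, and purified GBA transiently co-expressed in Expi293 cells using a bicistronic vector encoding S1-S3 PTase and a natural variant of GBA reported to be more stable under neutral and slightly alkaline conditions. Gaucher mice about 20 weeks old were treated weekly with about 1.5 mg/kg of enzyme for 4 weeks. After 4 weeks, liver and lung tissues were harvested and fixed in 4% paraformaldehyde-PBS, pH 7.4, for hematoxylin and eosin (H&E) staining. GCaseM6P has greater efficacy than the current standard of care, as evidenced by the reduction of storage cells in affected tissues visualized by H&E staining.
FIGS. 20A-20B are a pair of graphs demonstrating that GCaseM6P ERT reduced accumulated substrate better than imiglucerase in the GBA D409V/null mouse model. Gaucher mice about 20 weeks old were treated weekly with about 1.5 mg/kg of enzyme for 4 weeks. Tissue samples were collected and homogenized for glucosylceramide analysis. Accumulation of glucocerebroside (GC), the natural substrate of GCase, was measured in tissue homogenates. An important value is the GC accumulation in the lung, which is a known unmet need of the current standard of care. Glucosylceramide was extracted from 20 μL of tissue homogenate and appropriate controls by adding 200 μL of methanol/ACN/H2O (v:v:v = 85:10:5), mixing at 800 rpm for 5 minutes, and then centrifuging at 3220 g and 4 °C for 15 minutes. 50 μL of the supernatant was recovered, dried under nitrogen, resuspended in methanol/ACN/H2O (v:v:v = 85:10:5), and injected directly for LC-MS/MS analysis. For the two ceramides measured, GCaseM6P-treated animals had lower levels after the ERT regimen than imiglucerase-treated animals.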
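FIGS. 21A-21D are a series of graphs showing the results of an in vivo AAV-mediated gene therapy study for the treatment of Gaucher disease. To determine the effect of AAV9 gene therapy with a transgene for bicistronic expression of stable GBA + S1-S3 PTase under three different promoters, 15-week-old GBA D409V/null mice were administered a moderate dose, 5E11 vg, of AAV9-stable GBA + S1-S3 PTase. To determine the amount of GBA produced by the tissues, tissues were harvested 2 weeks after AAV9 injection and homogenized, and activity was measured using the 4MU-Glc substrate. Activity was normalized to total protein in the homogenate, as determined by the BCA method for protein determination.
FIGS. 22A-22C are a series of graphs depicting the results of in vitro studies using lysosomal alpha-mannosidase (LAMAN) as an ERT.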
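FIGS. 23A-23B are a photograph and a corresponding data table depicting LAMAN enzyme expression, purification and characterization. Two preparations of LAMAN were transiently expressed in Expi293 cells in the presence (M0611) or absence of a bicistronic vector encoding S1-S3 PTase. Both were purified using an HPC4 affinity tag. A significant increase in phosphorylation was demonstrated by measuring the amount of each LAMAN species that bound, in a dose-dependent manner, to immobilized cation-independent mannose 6-phosphate receptor. The amount of bound LAMAN is based on activity measured with the synthetic substrate 4-methylumbelliferyl-α-D-mannopyranoside (4MU-Man). The specificity of binding through phosphorylated oligosaccharides was confirmed by the ability of added mannose 6-phosphate to block binding. Of note is the ability of LAMANM6P (M0611) to bind the receptor even in the presence of M6P. LAMANM6P (M0611, P-0030) and LAMAN (P-0031) were selected for in vivo animal studies.
FIG. 23C is a graph depicting LAMANM6P (M0611) enzyme expression, purification and characterization. Two preparations of LAMAN were transiently expressed in Expi293 cells in the presence or absence of a bicistronic vector encoding the S1-S3 variant of PTase. Both were purified using an HPC4 tag. A significant increase in phosphorylation was demonstrated by measuring the amount of each LAMAN species that bound, in a dose-dependent manner, to immobilized cation-independent mannose 6-phosphate receptor. The amount of bound LAMAN was determined by activity measured with the synthetic substrate 4-methylumbelliferyl-α-D-mannopyranoside (4MU-Man). The specificity of binding through phosphorylated oligosaccharides was confirmed by the ability of added mannose 6-phosphate to block binding. Of note is the ability of M0611 to bind the receptor even in the presence of M6P. LAMANM6P (M0611, P-0030) and LAMAN (P-0031) were selected for in vivo animal studies.
FIGS. 24A-24B are a pair of graphs demonstrating the biodistribution of LAMAN and LAMANM6P enzymes in wild-type mice for enzyme replacement therapy. To assess the difference in tissue uptake between LAMAN and LAMANM6P (LAMAN co-expressed with S1-S3 PTase), 2 mg/kg of each preparation was injected via the tail vein into wild-type mice (n=4). At 2 hours and 8 hours after administration, tissues were harvested and homogenized, and activity was measured using the 4MU-Man substrate. Activity was normalized to total protein in the homogenate, as determined by the BCA method for protein determination. The benefit of LAMANM6P (LAMAN co-expressed with S1S3 PTase) is seen in the tissue uptake data. For liver, spleen, heart, lung and brain, there was greater activity in the tissue at 2 hours. This trend also held at 8 hours, except for the lung. This may be a result of the high variability observed in the assay for this tissue. The only exception to this observation was the kidney. Endogenous LAMAN activity is subtracted from all samples. Higher LAMAN enzyme activity was detected in most tissues of mice injected with the LAMANM6P enzyme.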
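The tissue readout described for FIGS. 24A-24B combines two steps: activity is divided by total protein in the homogenate, and endogenous activity is subtracted. A minimal Python sketch of that arithmetic follows. The function names, units and numbers are hypothetical, and the text does not specify whether the endogenous value comes from uninjected control animals; that is assumed here.

```python
def specific_activity(activity, protein_mg):
    """Activity per mg of total protein (protein measured by, e.g., a BCA assay)."""
    if protein_mg <= 0:
        raise ValueError("protein amount must be positive")
    return activity / protein_mg

def net_uptake(treated_activity, treated_protein_mg, endogenous_specific_activity):
    """Specific activity attributable to injected enzyme.

    The endogenous value is assumed (for illustration) to be the specific
    activity measured in the same tissue of uninjected control animals.
    """
    treated = specific_activity(treated_activity, treated_protein_mg)
    return treated - endogenous_specific_activity

# Hypothetical homogenate: 500 activity units in 2 mg protein, with an
# endogenous background of 50 units/mg in control tissue.
net = net_uptake(500.0, 2.0, 50.0)
```

Dividing by protein first makes homogenates of different size comparable; subtracting the background afterwards keeps both terms in the same units.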
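FIGS. 25A-25B are a pair of graphs demonstrating the biodistribution of LAMAN and LAMANM6P enzymes in wild-type mice for enzyme replacement therapy. To assess the difference in tissue uptake between LAMAN and LAMANM6P (LAMAN co-expressed with S1-S3 PTase), 10 mg/kg of each preparation was injected via the tail vein into wild-type mice (n=4). At 2 hours and 8 hours after administration, tissues were harvested and homogenized, and activity was measured using the 4MU-Man substrate. Activity was normalized to total protein in the homogenate, as determined by the BCA method for protein determination. The benefit of LAMANM6P (LAMAN co-expressed with S1-S3 PTase) is seen in the tissue uptake data. For liver, spleen, heart, lung and brain, there was greater activity in the tissue at 2 hours. This trend also held at 8 hours, except for the kidney. This may be a result of the high variability observed in the assay for this tissue.
FIGS. 26A-26B are a schematic and a graph depicting the AAV9 design and in vitro testing for mucolipidosis gene therapy (GTx). 293T cells were transduced with various M0021 (AAV9-CAGp-S1-S3) viruses and cultured for 2 days before the PTase activity assay.
FIGS. 27A-27B are a pair of graphs demonstrating that M0021 treatment reduces serum lysosomal enzyme levels in ML II mice. To determine the effect of S1-S3 PTase gene therapy, 34-week-old female mice were administered a moderate dose of M0021 (AAV9-CAGp-S1-S3), 4e12 vg (2e13 vg/kg). One phenotype of ML II is elevated serum levels of lysosomal enzymes, because the enzymes cannot be targeted to lysosomes within cells. A promising result was observed: LAMAN and ManB activities in serum were reduced one week after treatment. This result is important because it demonstrates the ability to affect a described phenotype of the ML II mouse model.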
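FIGS. 28A-28C are a series of graphs demonstrating that M0021 treatment increases phosphorylation of lysosomal enzymes in ML II. To further understand the effect of S1-S3 PTase gene therapy on the reduction in serum LAMAN and ManB activity, CI-MPR binding of the enzymes found in serum was assessed using the immobilized receptor binding assay described above. Briefly, known amounts of activity are added in increasing amounts to immobilized CI-MPR. Unbound enzyme is washed away, and the remaining bound enzyme is measured using the appropriate synthetic substrate: Man-b-4MU (ManB) or 4MU-Man (LAMAN). AAV9-S1S3 gene therapy in ML II mice increases glycan phosphorylation of lysosomal enzymes. Total phosphorylated lysosomal enzyme in serum is normalized to normal or slightly higher levels after 3 weeks.
FIGS. 29A-29C are a series of graphs depicting enzyme activity and selected GCase substrates in lung and liver 2 weeks after injection of AAV9-hTLV-GBAM6P gene therapy in Gaucher mice. AAV9-hTLV-GBA-S1S3 is otherwise known as AAV9-hTLV-GBAM6P, where M6P denotes the S1S3 construct. Two weeks after administration of AAV9 hTLV-GBA or AAV9 hTLV-GBAM6P (a transgene comprising a bicistronic vector with GBA and S1-S3 PTase), expression was elevated in the liver for both constructs (FIG. 29A). When liver glucosyl-β-ceramide levels were measured (FIGS. 29B, C), the greatest reduction in accumulated substrate was observed in AAV9 hTLV-GBAM6P-treated animals, even though GCase activity in the liver was lower than in AAV9 hTLV-GBA-treated animals. Greater substrate reduction with less activity indicates the importance of N-linked oligosaccharide phosphorylation for gene therapy in terms of cellular uptake and lysosomal targeting. In the lung, GCase activity in AAV9-treated animals is low. However, AAV9-hTLV-GBAM6P-treated animals showed a significant reduction in accumulated glucosyl-β-ceramide levels in the lung (FIGS. 29B, C). A slight reduction was observed for AAV9-hTLV-GBA-treated animals. This demonstrates that a phosphorylated transgene product with high affinity for CI-MPR can yield an effective therapy even at low activity levels, through efficient cellular uptake and lysosomal targeting.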
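Lysosomal storage disorders (LSDs) are inborn errors of metabolism that arise from defects in lysosomal function. Currently, about 50 different LSDs have been identified, but only a few of them (fewer than 10) are reported to have a treatment. Patients are currently treated by intravenous infusion of enzyme replacement therapy (ERT), which addresses the symptoms of the disease by supplementing the enzyme missing in the patient. The goal of ERT is to introduce a sufficient amount of normal enzyme into the lysosomes of defective cells to clear the stored material and restore lysosomal function. To ensure efficient uptake of the ERT into affected lysosomes, the ERT must contain high levels of mannose 6-phosphate (M6P). Ideally, LSD patients should be treated by administering the missing enzyme with highly saturated levels of M6P, to enable effective delivery to the lysosome. However, this is very challenging because the phosphorylation process that adds M6P to lysosomal enzymes is inherently inefficient. The recent discovery of the S1-S3 variant of GlcNAc-1 PTase significantly improves the phosphorylation of lysosomal enzymes. In addition, there is a need for gene therapy approaches that provide patients with long-term treatment of LSDs.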
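The disclosure provides expression vectors, compositions and methods for producing a lysosomal enzyme operably linked to an S1-S3 variant of GlcNAc-1-phosphotransferase. The S1-S3 variant of GlcNAc-1-phosphotransferase significantly increases the transport of the operably linked lysosomal enzyme into cells and out of the serum or kidney, thereby increasing uptake, distribution and lysosomal enzyme activity.
The disclosure provides gene therapy vectors, compositions and methods for producing a lysosomal enzyme operably linked to an S1-S3 variant of GlcNAc-1-phosphotransferase. The disclosure demonstrates that expression of the S1-S3 variant increases the uptake, distribution and activity of endogenous lysosomal enzymes.
The disclosure provides ERTs, vectors, compositions and methods for producing lysosomal enzymes with properly phosphorylated N-linked oligosaccharides by co-expression with S1-S3 PTase from a novel bicistronic vector. Bicistronic expression of S1-S3 PTase and a lysosomal enzyme significantly increases the M6P content of the expressed lysosomal enzyme. A sufficiently phosphorylated enzyme allows efficient uptake and lysosomal delivery of the enzyme. This enables better tissue distribution, cellular uptake, lysosomal targeting and substrate reduction. The disclosure provides gene therapy vectors, compositions and methods for producing high levels of expression or high levels of activity of M6P lysosomal enzymes by co-expression of S1-S3 PTase. Bicistronic expression of the S1-S3 variant of PTase significantly increases the level of M6P content on the lysosomal enzyme. With high M6P on the surface of the lysosomal enzyme, the enzyme can be delivered to tissue cells with increased uptake, distribution and efficacy in vitro and in vivo.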
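The vectors, compositions and methods of the disclosure can be used for enzyme replacement therapy (ERT).
Alternatively or in addition, the vectors, compositions and methods of the disclosure can be used for gene therapy.
Numerous lysosomal enzymes are described, and their use in both ERT and gene therapy is demonstrated. Importantly, the vectors, compositions and methods of the disclosure can be used with any lysosomal enzyme to increase cellular uptake of the lysosomal enzyme and, consequently, to increase the activity of the lysosomal enzyme in one or more tissues of the body.
In some embodiments, the compositions and methods of the disclosure comprising S1-S3 PTase operably linked to a lysosomal protein increase uptake and activity of the lysosomal protein in one or more of the spleen, the brain, one or more lungs, or one or more muscles of a subject.
In some embodiments, the vectors, compositions and methods of the disclosure comprising S1-S3 GlcNAc-1-phosphotransferase, including embodiments in which a bicistronic vector comprises a sequence encoding S1-S3 GlcNAc-1-phosphotransferase and a sequence encoding a lysosomal protein, increase uptake and activity of the encoded lysosomal protein in one or more of the spleen, the brain, one or more lungs, or one or more muscles of a subject.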
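Exemplary Embodiments
The disclosure provides a composition comprising a vector comprising a polynucleotide encoding a lysosomal enzyme and a polynucleotide encoding a modified GlcNAc-1 phosphotransferase (GlcNAc-1 PTase).
The disclosure provides a composition comprising a bicistronic vector comprising a polynucleotide encoding a lysosomal enzyme and a polynucleotide encoding a modified GlcNAc-1 phosphotransferase (GlcNAc-1 PTase).
In some embodiments of the compositions of the disclosure, the bicistronic vector comprises an internal ribosome entry site (IRES) positioned upstream of the polynucleotide encoding the modified GlcNAc-1 PTase and downstream of the polynucleotide encoding the lysosomal enzyme. In some embodiments, the bicistronic vector comprises an IRES positioned downstream of the polynucleotide encoding the modified GlcNAc-1 PTase and upstream of the polynucleotide encoding the lysosomal enzyme.
In some embodiments of the compositions of the disclosure, the bicistronic vector comprises a promoter. In some embodiments, the bicistronic vector comprises a constitutive promoter. In some embodiments, the constitutive promoter comprises a cytomegalovirus (CMV) promoter. In some embodiments, the promoter is operably linked to the polynucleotide encoding the lysosomal enzyme or to the polynucleotide encoding the modified GlcNAc-1 PTase. In some embodiments, the promoter is operably linked to the polynucleotide encoding the lysosomal enzyme and to the polynucleotide encoding the modified GlcNAc-1 PTase.
In some embodiments of the compositions of the disclosure, the bicistronic vector comprises the nucleic acid sequence of SEQ ID NO: 1.
In some embodiments of the compositions of the disclosure, the polynucleotide encoding the modified GlcNAc-1 phosphotransferase comprises the nucleic acid sequence of SEQ ID NO: 4.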
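In some embodiments of the compositions of the disclosure, the encoded lysosomal enzyme is involved in at least one lysosomal storage disorder (LSD) as listed in Table 1. In some embodiments, the encoded lysosomal enzyme or a variant thereof causes at least one lysosomal storage disorder (LSD) as listed in Table 1A, Table 1B or Table 1C. In some embodiments, the activity or function of the encoded lysosomal enzyme or a variant thereof is reduced, inhibited or dysregulated in at least one lysosomal storage disorder (LSD) as listed in Table 1A, Table 1B or Table 1C.
In some embodiments of the compositions of the disclosure, the lysosomal enzyme comprises a lysosomal enzyme listed in Table 1A, Table 1B or Table 1C. In some embodiments, the lysosomal enzyme comprises at least one lysosomal enzyme listed in Table 1A, Table 1B or Table 1C. In some embodiments, the lysosomal enzyme comprises one or more lysosomal enzyme(s) listed in Table 1A, Table 1B or Table 1C. In some embodiments, the lysosomal enzyme is selected from the group consisting of β-glucocerebrosidase (GCase, GBA), galactosylceramidase (GALC), α-galactosidase (GLA), α-N-acetylglucosaminidase (NAGLU), acid α-glucosidase (GAA) and lysosomal acid α-mannosidase (LAMAN). In some embodiments, the lysosomal enzyme comprises β-glucocerebrosidase (GCase, GBA). In some embodiments, the lysosomal enzyme comprises galactosylceramidase (GALC). In some embodiments, the lysosomal enzyme comprises α-galactosidase (GLA). In some embodiments, the lysosomal enzyme comprises α-N-acetylglucosaminidase (NAGLU). In some embodiments, the lysosomal enzyme comprises acid α-glucosidase (GAA). In some embodiments, the lysosomal enzyme comprises lysosomal acid α-mannosidase (LAMAN). In some embodiments, the polynucleotide encoding the lysosomal enzyme comprises a nucleic acid sequence of SEQ ID NOs: 5 to 10.
The disclosure provides a composition comprising a bicistronic vector comprising a constitutive promoter, an internal ribosome entry site (IRES) and a polynucleotide encoding a modified GlcNAc-1 phosphotransferase (GlcNAc-1 PTase).
In some embodiments of the compositions of the disclosure, the composition further comprises a pharmaceutically acceptable carrier.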
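In some embodiments of the vectors of the disclosure, the vector is a viral vector. In some embodiments, the viral vector is an adenovirus, an adeno-associated virus (AAV), a retrovirus or a lentivirus. In some embodiments, the viral vector comprises an adenovirus. In some embodiments, the viral vector comprises an AAV vector. In some embodiments, the AAV vector comprises a sequence isolated or derived from one or more AAVs of serotype AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8 and AAV9. In some embodiments, the AAV vector comprises a sequence isolated or derived from an AAV of serotype 1 (AAV1). In some embodiments, the AAV vector comprises a sequence isolated or derived from an AAV of serotype 2 (AAV2). In some embodiments, the AAV vector comprises a sequence isolated or derived from an AAV of serotype 3 (AAV3). In some embodiments, the AAV vector comprises a sequence isolated or derived from an AAV of serotype 4 (AAV4). In some embodiments, the AAV vector comprises a sequence isolated or derived from an AAV of serotype 5 (AAV5). In some embodiments, the AAV vector comprises a sequence isolated or derived from an AAV of serotype 6 (AAV6). In some embodiments, the AAV vector comprises a sequence isolated or derived from an AAV of serotype 7 (AAV7). In some embodiments, the AAV vector comprises a sequence isolated or derived from an AAV of serotype 8 (AAV8). In some embodiments, the AAV vector comprises a sequence isolated or derived from an AAV of serotype 9 (AAV9).
In some embodiments of the vectors of the disclosure, the vector is an expression vector. In some embodiments, the expression vector comprises the polynucleotide sequence of SEQ ID NO: 1.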
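The disclosure provides a cell comprising a vector of the disclosure. In some embodiments, the cell is a mammalian cell. In some embodiments, the cell is a primate cell. In some embodiments, the cell is a human cell. In some embodiments, the cell is a cultured cell. In some embodiments, the cell is an immortalized or stabilized cell line. In some embodiments, the cell is a Chinese hamster ovary (CHO) cell. In some embodiments, the cell is a human embryonic kidney 293 (HEK293) cell.
The disclosure provides a cell comprising a bicistronic vector of the disclosure. In some embodiments, the cell is a mammalian cell. In some embodiments, the cell is a primate cell. In some embodiments, the cell is a human cell. In some embodiments, the cell is a cultured cell. In some embodiments, the cell is an immortalized or stabilized cell line. In some embodiments, the cell is a Chinese hamster ovary (CHO) cell. In some embodiments, the cell is a human embryonic kidney 293 (HEK293) cell.
The disclosure provides a cell comprising a composition of the disclosure. In some embodiments, the cell is a mammalian cell. In some embodiments, the cell is a primate cell. In some embodiments, the cell is a human cell. In some embodiments, the cell is a cultured cell. In some embodiments, the cell is an immortalized or stabilized cell line. In some embodiments, the cell is a Chinese hamster ovary (CHO) cell. In some embodiments, the cell is a human embryonic kidney 293 (HEK293) cell.
The disclosure provides a pharmaceutical composition comprising a lysosomal enzyme expressed by a vector of the disclosure and a pharmaceutically acceptable carrier.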
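The disclosure provides a method of treating a lysosomal storage disorder (LSD), the method comprising administering to a subject a composition of the disclosure, thereby treating the LSD.
The disclosure provides a method of treating a lysosomal storage disorder (LSD), the method comprising administering to a subject a therapeutically effective amount of a composition of the disclosure, wherein the composition increases phosphorylation of a lysosomal enzyme, thereby treating the LSD.
The disclosure provides a method of treating a subject suffering from a lysosomal storage disorder (LSD), the method comprising administering to the subject a pharmaceutical composition of the disclosure, thereby increasing phosphorylation of a lysosomal enzyme and treating the subject.
The disclosure provides a method of preventing the development of a lysosomal storage disorder (LSD) in a subject in need thereof, the method comprising administering to the subject a pharmaceutical composition of the disclosure, thereby increasing phosphorylation of a lysosomal enzyme and preventing the development of the LSD in the subject.
The disclosure provides a method of improving phosphorylation of a lysosomal enzyme involved in a lysosomal storage disorder (LSD) in a subject in need thereof, the method comprising administering to the subject a composition of the disclosure, wherein the composition increases phosphorylation of the lysosomal enzyme.
In some embodiments of the methods of the disclosure, the lysosomal enzyme is involved in at least one lysosomal storage disorder (LSD) as listed in Table 1A, Table 1B or Table 1C.
In some embodiments of the methods of the disclosure, the lysosomal enzyme comprises a lysosomal storage disorder (LSD) listed in Table 1A, Table 1B or Table 1C. In some embodiments, the lysosomal enzyme comprises at least one lysosomal storage disorder (LSD) listed in Table 1A, Table 1B or Table 1C. In some embodiments, the lysosomal enzyme comprises one or more lysosomal storage disorder(s) (LSD(s)) listed in Table 1A, Table 1B or Table 1C.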
Enzyme Replacement Therapy (ERT)
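Provided herein is a composition comprising a bicistronic expression vector comprising a polynucleotide encoding a lysosomal enzyme and a polynucleotide encoding a modified GlcNAc-1 phosphotransferase (GlcNAc-1 PTase). In some embodiments, the disclosed bicistronic expression vector comprises an internal ribosome entry site (IRES) positioned upstream of the polynucleotide encoding the modified GlcNAc-1 PTase and downstream of the polynucleotide encoding the lysosomal enzyme. In other embodiments, the disclosed bicistronic expression vector comprises an IRES positioned downstream of the polynucleotide encoding the modified GlcNAc-1 PTase and upstream of the polynucleotide encoding the lysosomal enzyme.
Provided herein is a mammalian cell comprising the disclosed bicistronic expression vector.
Provided herein is a pharmaceutical composition comprising a lysosomal enzyme expressed by a bicistronic vector as disclosed herein and a pharmaceutically acceptable carrier.
Provided herein are a method of treating a subject suffering from a lysosomal storage disorder (LSD) and a method of preventing the development of a lysosomal storage disorder (LSD) in a subject in need thereof.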
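Gene Therapy
Provided herein is a composition comprising a bicistronic viral vector comprising a polynucleotide encoding a lysosomal enzyme and a polynucleotide encoding a modified GlcNAc-1 phosphotransferase (GlcNAc-1 PTase). In some embodiments, the disclosed bicistronic viral vector comprises an internal ribosome entry site (IRES) positioned upstream of the polynucleotide encoding the modified GlcNAc-1 PTase and downstream of the polynucleotide encoding the lysosomal enzyme. In other embodiments, the disclosed bicistronic viral vector comprises an IRES positioned downstream of the polynucleotide encoding the modified GlcNAc-1 PTase and upstream of the polynucleotide encoding the lysosomal enzyme. In some embodiments, the viral vector is an adenovirus, an adeno-associated virus (AAV), a retrovirus or a lentivirus.
Provided herein are a method of treating a subject suffering from a lysosomal storage disorder (LSD) and a method of preventing the development of a lysosomal storage disorder (LSD) in a subject in need thereof, by administering the disclosed bicistronic viral vector to the subject.
Further provided herein is a method of improving phosphorylation of a lysosomal enzyme involved in an LSD in a subject in need thereof.
Provided herein are compositions and methods that use a bicistronic vector to treat or prevent a lysosomal storage disorder (LSD) in a subject.
The disclosure provides a composition comprising a bicistronic vector comprising a promoter, an internal ribosome entry site (IRES), a polynucleotide encoding a lysosomal enzyme and a polynucleotide encoding a modified GlcNAc-1 phosphotransferase (GlcNAc-1 PTase). The methods of the disclosure comprise administering to a subject a pharmaceutical composition comprising a bicistronic vector disclosed herein.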
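Definitions
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the invention pertains. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the invention, the preferred materials and methods are described herein. In describing and claiming the invention, the following terminology is used.
It is also to be understood that the terminology used herein is for the purpose of describing some embodiments only and is not intended to be limiting.
As used herein, the articles "a" and "an" refer to one or to more than one (i.e., to at least one) of the grammatical object of the article. For example, "an element" means one element or more than one element.
As used herein when referring to a measurable value such as an amount, a temporal duration and the like, the term "about" is meant to encompass variations of ±20% or ±10%, more preferably ±5%, even more preferably ±1%, and still more preferably ±0.1% from the specified value, as such variations are appropriate to perform the disclosed methods.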
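The tiered tolerance in the definition of "about" can be made concrete with a short Python sketch. The helper name and the example values are hypothetical and are not part of the disclosure; the default of ±20% corresponds to the broadest tier named in the definition.

```python
def within_about(value, nominal, tolerance_pct=20.0):
    """Return True if value lies within +/- tolerance_pct percent of nominal.

    The definition lists tolerances of 20, 10, 5, 1 and 0.1 percent; the
    widest (20) is used as the default here.
    """
    if tolerance_pct < 0:
        raise ValueError("tolerance must be non-negative")
    return abs(value - nominal) <= abs(nominal) * tolerance_pct / 100.0

# Hypothetical check against a nominal dose of 1.5 mg/kg.
broad = within_about(1.7, 1.5)                      # 0.2 off, limit 0.3
narrow = within_about(1.7, 1.5, tolerance_pct=5.0)  # 0.2 off, limit 0.075
```

A value can therefore be "about" the nominal value under the broad tier while falling outside the narrower, more preferred tiers.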
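The term "2A" or "2A peptide" or "2A-like peptide" refers to a self-processing viral peptide. 2A peptides can separate different protein coding sequences in a single ORF transcription unit [see Ryan et al., 1991, J Gen Virol 72:2727-2732]. Although referred to as a "self-cleaving" peptide or protease site, the mechanism by which a 2A sequence generates two proteins from one transcript is ribosomal skipping: formation of a normal peptide bond is impaired at 2A, which yields two discontinuous protein fragments from a single translation event. Linking with 2A peptide sequences results in cellular expression of multiple discrete proteins (in essentially equimolar amounts) derived from a single ORF [see de Felipe et al., 2006, Trends Biotechnol 24:68-75].
The term "biological" or "biological sample" refers to a sample obtained from an organism or from a component (for example, a cell) of an organism. The sample may be of any biological tissue or fluid. Frequently, the sample will be a "clinical sample", which is a sample derived from a patient. Such samples include, but are not limited to, bone marrow, cardiac tissue, saliva, blood, lymph fluid, blood cells (e.g., white blood cells), tissue or fine needle biopsy samples, urine, peritoneal fluid, pleural fluid, or cells therefrom. Biological samples may also include sections of tissue, such as frozen sections taken for histological purposes.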
본원에 사용된 바와 같이, 용어 "유도체"는 바이러스의 유도체가 주형 바이러스 핵산 또는 아미노산 서열과 관련하여 핵산 또는 아미노산 서열의 차이를 가질 수 있음을 명시한다.
"질환"은, 동물이 항상성을 유지할 수 없고, 질환이 개선되지 않는 경우, 동물의 건강이 계속 악화되는 동물의 건강 상태이다.
대조적으로, 동물의 "장애"는 동물이 항상성을 유지할 수 있는 건강 상태이지만, 동물의 건강 상태는 장애가 없는 경우보다 덜 양호한 상태이다. 치료하지 않고 방치하여도, 장애가 반드시 동물의 건강 상태를 추가로 저하시키는 것은 아니다.
"발현 벡터"는 발현되는 뉴클레오티드 서열에 작동가능하게 연결된 발현 조절 서열을 포함하는 재조합 폴리뉴클레오티드를 포함하는 벡터를 지칭한다. 발현 벡터는 발현에 충분한 시스-작용 요소를 포함하고; 발현을 위한 다른 요소는 숙주 세포에 의해 또는 시험관내 발현 시스템에서 공급될 수 있다. 발현 벡터는, 재조합 폴리뉴클레오티드를 포함하는, 코스미드, 플라스미드(예: 네이키드 또는 리포솜에 함유됨) 및 바이러스(예: 렌티바이러스, 레트로바이러스, 아데노바이러스 및 아데노-연관 바이러스) 등의 당해 기술분야에 공지된 모든 것들을 포함한다. 일부 실시양태에서, 개시된 벡터는 본 명세서에서 바이러스 벡터로 지칭된다. 일부 실시양태에서, 개시된 벡터는 본 명세서에서 발현 벡터로 지칭된다.
본 명세서에 사용된 바와 같이, "더 높은"은, 대조군 참조보다, 적어도 10% 이상, 예를 들면, 20%, 30%, 40%, 또는 50%, 60%, 70%, 80%, 90% 이상 및/또는 1.1배, 1.2배, 1.4배, 1.6배, 1.8배, 2.0배 이상, 및 이들 사이의 임의의 및 모든 또는 부분적 증분인 발현 수준을 지칭한다. 본 명세서에 개시된 바와 같이, 참조 값보다 높은 발현 수준은, 건강한 대상체에서 측정되거나 당해 기술분야에서 정의 또는 사용되는 발현(mRNA 또는 단백질)으로부터의 정상 또는 대조군 수준보다 높은 발현 수준(mRNA 또는 단백질)을 지칭한다.
본 명세서에 사용된 바와 같이, "더 낮은"은, 대조군 참조보다, 적어도 10% 이상 더 낮은, 예를 들면, 20%, 30%, 40%, 또는 50%, 60%, 70%, 80%, 90% 더 낮은, 및/또는 1.1배, 1.2배, 1.4배, 1.6배, 1.8배, 2.0배 이상 더 낮은, 및 그 사이의 임의의 및 모든 또는 부분적 증분인 발현 수준을 지칭한다. 본 명세서에 개시된 바와 같이, 참조 값보다 낮은 발현 수준은, 건강한 대상체에서 측정되거나 당해 기술분야에서 정의 또는 사용되는 발현(mRNA 또는 단백질)로부터의 정상 또는 대조군 수준보다 낮은 발현 수준(mRNA 또는 단백질)을 지칭한다.
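The "higher" and "lower" definitions can be read as a comparison against a control or reference value. The sketch below uses only the minimum 10% threshold stated in the text. It treats "lower" as a percentage reduction, which is an assumption, since the fold-based wording for "lower" is ambiguous. The "comparable" label and all function names are illustrative.

```python
def fold_change(sample: float, reference: float) -> float:
    """Ratio of a sample expression level to the control/reference level."""
    if reference <= 0:
        raise ValueError("reference level must be positive")
    return sample / reference


def classify_expression(sample: float, reference: float, min_fraction: float = 0.10) -> str:
    """Label a level 'higher' or 'lower' when it differs from the reference
    by at least `min_fraction` (10% by default, i.e. 1.1-fold for 'higher')."""
    ratio = fold_change(sample, reference)
    if ratio >= 1 + min_fraction:
        return "higher"
    if ratio <= 1 - min_fraction:
        return "lower"
    return "comparable"  # hypothetical label for levels inside the 10% band
```

A stricter cutoff, such as 50% or 2.0-fold, could be modeled by passing a larger `min_fraction`.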
본 명세서에 사용된 바와 같이, "대조군" 또는 "참조"라는 용어는 호환적으로 사용될 수 있고, 비교의 기준으로서 사용되는 값을 지칭한다.
본원에 사용된 바와 같이, "병용 요법"은 제1 약제가 다른 약제와 함께 투여되는 것을 의미한다. "병용하여" 또는 "조합하여"는 다른 치료 방식에 추가하여 하나의 치료 방식의 투여를 지칭한다. 이와 같이, "조합하여"는 다른 치료 방식을 개인에게 전달하기 전, 도중 또는 후에 하나의 치료 방식을 투여하는 것을 지칭한다. 이러한 조합은 단일 치료 섭생 또는 섭생의 일부로 간주된다. 예를 들면, 본 개시의 벡터 또는 벡터를 포함하는 조성물은 제2 치료제와 조합하여 대상체에게 제공 또는 투여될 수 있다. 일부 실시양태에서, 본 개시의 벡터 및 조성물은 제2 치료제와 동시에 또는 순차로 대상체에게 제공 또는 투여된다. 일부 실시양태에서, 본 개시의 벡터 및 조성물은 제2 치료제와 동시에 대상체에게 제공 또는 투여된다. 일부 실시양태에서, 본 개시의 벡터 및 조성물은 제2 치료제와 함께 대상체에게 순차적으로 제공 또는 투여된다. 일부 실시양태에서, 본 개시의 벡터 및 조성물은 제2 치료제의 투여 전에 대상체에게 제공 또는 투여된다. 일부 실시양태에서, 본 개시의 벡터 및 조성물은 제2 치료제의 투여 후에 대상체에게 제공 또는 투여된다. 일부 실시양태에서, 제2 치료제는 본 개시의 조성물의 제2 벡터를 포함한다. 일부 실시양태에서, 제2 치료제는, 이를 암호화하는 본 개시의 벡터 또는 조성물을 포함하여, 본 개시의 리소좀 효소의 변이체 형태를 포함한다. 일부 실시양태에서, 제2 치료제는 리소좀 축적 장애의 징후 또는 증상을 완화하기 위한 하나 이상의 약제를 포함한다. 일부 실시양태에서, 제2 치료제는 하나 이상의 항염증제 또는 면역억제제를 포함한다.
본 명세서에 사용된 용어 "작동가능하게 연결된"은 핵산 서열의 발현이, 그것이 공간적으로 연결되어 있는 프로모터의 조절하에 있는 것을 의미한다. 프로모터는 그 조절하에 핵산 서열의 5'(상류)에 위치할 수 있다.
본 명세서에 사용된 바와 같이, "일차 세포"는, 살아있는 조직(즉, 생검 물질)으로부터 직접 채취하고, 시험관내에서 성장을 위해 확립되고, 모집단 배가를 거의 받지 않기 때문에, 연속적 종양형성 또는 인공적 불사멸화 세포주와 비교하여, 이들이 유래하는 조직의 주요 기능적 구성요소 및 특성을 보다 잘 대표하는 세포를 지칭한다.
본 명세서에서 사용된 바와 같이, 용어 "펩티드", "폴리펩티드" 및 "단백질"은 호환적으로 사용되고, 펩티드 결합에 의해 공유 결합된 아미노산 잔기로 구성되는 화합물을 지칭한다. 단백질 또는 펩티드는 적어도 2개의 아미노산을 포함해야 하고, 단백질 또는 펩티드의 서열을 구성할 수 있는 최대 아미노산 수에는 제한이 없다. 폴리펩티드는 펩티드 결합에 의해 서로 결합된 2개 이상의 아미노산을 포함하는 임의의 펩티드 또는 단백질을 포함한다. 본 명세서에 사용된 바와 같이, 이 용어는, 예를 들면, 당해 기술분야에서 일반적으로 펩티드, 올리고펩티드 및 올리고머로 지칭되는 단쇄 및 당해 기술분야에서 일반적으로 단백질로서 지칭되는 보다 긴 사슬 둘 다를 지칭하고, 이들 중에는 다수 유형이 있다. "폴리펩티드"는, 예를 들면, 생물학적으로 활성인 단편, 실질적으로 상동성 폴리펩티드, 올리고펩티드, 동종이량체, 이종이량체, 폴리펩티드의 변이체, 변형된 폴리펩티드, 유도체, 유사체, 융합 단백질 등을 포함한다. 폴리펩티드는 천연 펩티드, 재조합 펩티드, 합성 펩티드 또는 이들의 조합을 포함한다.
본원에 사용된 용어 "프로모터"는 핵산의 발현을 부여, 활성화 또는 증강시킬 수 있는 합성 또는 천연-유래 분자를 의미할 수 있다. 본 명세서에 사용된 바와 같이, 프로모터는 폴리뉴클레오티드 서열의 특이적 전사를 개시하는데 필요한, 세포의 합성 기구 또는 도입된 합성 기구에 의해 인식되는 DNA 서열로서 정의된다.
본원에 사용된 바와 같이, 용어 "프로모터/조절 서열"은 프로모터/조절 서열에 작동가능하게 연결된 유전자 산물의 발현에 필요한 핵산 서열을 의미한다. 일부 경우에, 이 서열은 코어 프로모터 서열일 수 있고, 다른 경우에, 이 서열은 또한 유전자 산물의 발현에 필요한 인핸서 서열 및 기타 조절 요소를 포함할 수 있다. 프로모터/조절 서열은, 예를 들면, 조직 특이적 방식으로 유전자 산물을 발현하는 것일 수 있다.
"구성적" 프로모터는, 유전자 산물을 암호화하거나 특정하는 폴리뉴클레오티드와 작동가능하게 연결되면, 세포의 대부분 또는 모든 생리학적 조건하에서 유전자 산물을 세포에서 생성시키는 뉴클레오티드 서열이다.
"유도성" 프로모터는, 유전자 산물을 암호화하거나 특정하는 폴리뉴클레오티드와 작동가능하게 연결된 경우, 프로모터에 대응하는 유도인자가 세포에 존재하는 경우에만 실질적으로 세포에서 유전자 산물을 생성시키는 뉴클레오티드 서열이다.
본 명세서에 사용된 용어 "RNA"는 리보핵산으로 정의된다.
본 발명의 문맥에서 사용되는 용어 "치료"는, 질환 또는 장애에 대한 치료적 치료, 뿐만 아니라 예방적 또는 억제적 수단을 포함하는 것을 의미한다. 본 명세서에 사용된 바와 같이, 용어 "치료" 및 "치료하다" 및 "치료하는" 등의 관련 용어는 질환 상태 또는 이의 적어도 하나의 증상의 진행, 중증도 및/또는 지속기간의 감소를 의미한다. 따라서, 용어 '치료'는 대상체에게 이익을 줄 수 있는 임의의 섭생을 지칭한다. 치료는 기존 상태와 관련되거나 예방적(예방적 치료)일 수 있다. 치료에는 치유, 완화 또는 예방 효과가 포함될 수 있다. 본 명세서에서 "치료적" 및 "예방적" 치료에 대한 언급은 이들의 가장 넓은 문맥에서 고려되어야 한다. 용어 "치료적"은 반드시 대상체가 완전히 회복될 때까지 치료된다는 것을 의미하는 것은 아니다. 유사하게, "예방적"은 반드시 대상체가 최종적으로 질환 상태에 걸리지 않는다는 것을 의미하는 것은 아니다. 따라서, 예를 들면, 용어 치료는 질환 또는 장애의 발병 전 또는 후에 약제의 투여를 포함하고, 이에 의해 질환 또는 장애의 모든 징후를 예방하거나 제거한다. 또 다른 예로서, 질환의 증상을 퇴치하기 위한 질환의 임상 증상의 후에 약제의 투여는 질환의 "치료"를 포함한다.
본원에 사용된 바와 같이, 용어 "핵산"은 데옥시리보핵산(DNA), 및 적절한 경우 리보핵산(RNA) 등의 폴리뉴클레오티드를 지칭한다. 이 용어는 또한, 등가물로서, 뉴클레오티드 유사체로부터 제조된 RNA 또는 DNA 중 어느 하나의 유사체, 및 기재되는 실시양태에 적용가능한 경우, 단일-가닥(센스 또는 안티센스) 및 이중-가닥 폴리뉴클레오티드를 포함하는 것으로 이해되어야 한다. EST, 염색체, cDNA, mRNA 및 rRNA는 핵산으로 지칭될 수 있는 분자의 대표적 예이다.
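As used herein, the term "pharmaceutical composition" refers to a mixture of at least one compound useful within the invention with other chemical components, such as carriers, stabilizers, diluents, adjuvants, dispersing agents, suspending agents, thickening agents, and/or excipients. The pharmaceutical composition facilitates administration of the compound to an organism. Multiple techniques for administering a compound exist in the art, including, but not limited to, intratumoral, intravenous, intrapleural, oral, aerosol, parenteral, ophthalmic, pulmonary, and topical administration.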
어구 "약제학적으로 허용되는 담체"는 약학적으로 허용되는 염, 약학적으로 허용되는 물질, 조성물 또는 담체, 예를 들면, 액체 또는 고체 충전제, 희석제, 부형제, 용매 또는 캡슐화 물질을 포함하고, 이는 이의 의도된 기능을 수행할 수 있도록 대상체 내로 또는 대상체에 본 발명의 화합물의 운반 또는 전달에 관여한다. 전형적으로, 이러한 화합물은 하나의 기관 또는 신체의 일부로부터 또 다른 기관 또는 신체의 일부로 운반되거나 수송된다. 각 염 또는 담체는, 제형의 다른 성분과 적합성이 있고 대상체에게 유해하지 않다는 의미에서 "허용"되어야 한다. 약제학적으로 허용되는 담체로서 작용할 수 있는 물질의 몇몇 예는 하기를 포함한다: 락토오스, 글루코오스 및 수크로오스 등의 당; 옥수수 전분 및 감자 전분 등의 전분; 나트륨 카복시메틸 셀룰로오스, 에틸 셀룰로오스 및 셀룰로오스 아세테이트 등의 셀룰로오스 및 이의 유도체; 분말 트라가칸트; 맥아; 젤라틴; 활석; 코코아 버터 및 좌약 왁스 등의 부형제; 땅콩유, 면실유, 홍화유, 참기름, 올리브유, 옥수수유 및 대두유 등의 오일; 프로필렌 글리콜 등의 글리콜; 글리세린, 소르비톨, 만니톨 및 폴리에틸렌 글리콜 등의 폴리올; 에틸 올레이트 및 에틸 라우레이트 등의 에스테르; 아가; 수산화마그네슘 및 수산화알루미늄 등의 완충제; 알긴산; 피로겐-비함유 물; 등장 식염수; 링거액; 에틸 알코올; 인산염 완충액; 희석제; 과립화제; 윤활제; 접합제; 붕해제; 습윤제; 유화제; 착색제; 이형제; 코팅제; 감미료; 향미제; 방향제; 방부제; 항산화제; 가소제; 겔화제; 증점제; 경화제; 유착제; 현탁제; 계면활성제; 보습제; 담체; 안정제; 및 약제학적 제형에 사용되는 기타 무독성 적합성 물질, 또는 이들의 임의의 조합. 본 명세서에 사용된 바와 같이, "약제학적으로 허용되는 담체"는 또한 화합물의 활성과 적합성이 있고 대상체에게 생리학적으로 허용되는 임의의 및 모든 코팅, 항균 및 항진균제, 및 흡수 지연제 등을 포함한다. 보충적 활성 화합물이 또한 조성물에 도입될 수 있다.
본 명세서에 사용된 바와 같이, 용어 "유효량" 또는 "치료학적 유효량"은, 특정 질환 상태를 예방하는 데 필요하거나, 질환 상태 또는 이의 적어도 하나의 증상 또는 이와 연관된 상태의 중증도를 감소시키고/시키거나 개선시키는, 본 발명의 벡터로부터 생성된 바이러스 입자 또는 감염 단위의 양을 의미한다.
본 명세서에 사용된 "대상체" 또는 "환자"는 인간 또는 비-인간 포유동물일 수 있다. 비-인간 포유동물은, 예를 들면, 가축 및 애완동물, 예를 들면 양, 소, 돼지, 개, 고양이 및 뮤린 포유동물을 포함한다. 바람직하게는, 대상체는 인간이다.
범위: 본 개시 전체에 걸쳐, 일부 실시양태는 범위 형식으로 제시될 수 있다. 범위 형식의 기재는 단순히 편의상 및 간결함을 위한 것이고, 개시 범위에 대한 융통성이 없는 제한으로서 해석되어서는 안 된다는 것을 이해해야 한다. 따라서, 범위에 대한 기재는, 이의 범위 내의 개개 수치 뿐만 아니라, 가능한 모든 하위 범위를 구체적으로 개시하는 것으로 간주되어야 한다. 예를 들면, 1 내지 6 등의 범위의 기재는 1 내지 3, 1 내지 4, 1 내지 5, 2 내지 4, 2 내지 6, 3 내지 6 등의 하위 범위, 게다가 해당 범위 내의 개별 수치, 예를 들면, 1, 2, 2.7, 3, 4, 5, 5.3 및 6을 구체적으로 개시하는 것으로 간주되어야 한다. 이는 범위의 폭에 관계없이 적용된다.
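The range convention can be made concrete with a small sketch that lists the integer-bounded subranges of a range such as 1 to 6 and checks whether an individual value (including a non-integer such as 2.7) falls inside it. The helper names are hypothetical. Only integer endpoints are enumerated, although the text also covers non-integer values.

```python
from itertools import combinations


def integer_subranges(low: int, high: int) -> list:
    """All (start, end) pairs with low <= start < end <= high,
    including the full range (low, high) itself."""
    points = range(low, high + 1)
    return [(a, b) for a, b in combinations(points, 2)]


def in_range(value: float, low: float, high: float) -> bool:
    """True if `value` is an individual number within the closed range."""
    return low <= value <= high
```

For the range 1 to 6 this yields pairs such as (1, 3), (2, 4), and (3, 6), matching the subranges listed in the text.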
조성물
본 명세서에 제공된 것은, 바이시스트론 발현 벡터를 포함하는 약제를 대상체에게 투여함으로써 대상체에서 리소좀 축적 장애(LSD)를 치료 또는 예방하기 위한 조성물 및 방법이다.
일부 실시양태에서, 본 개시는 리소좀 효소를 암호화하는 폴리뉴클레오티드 및 변형된 GlcNAc-1 포스포트랜스퍼라제(GlcNAc-1 PTase)를 암호화하는 폴리뉴클레오티드를 포함하는 바이시스트론 벡터를 포함하는 조성물을 제공한다. 일 실시양태에서, 리소좀 효소를 암호화하는 폴리뉴클레오티드 및 변형된 GlcNAc-1 포스포트랜스퍼라제(GlcNAc-1 PTase)를 암호화하는 폴리뉴클레오티드는 작동가능하게 연결된다.
일부 실시양태에서, 본 개시는 구성적 프로모터, 내부 리보솜 진입 부위(IRES) 및 변형된 GlcNAc-1 포스포트랜스퍼라제(GlcNAc-1 PTase)를 암호화하는 폴리뉴클레오티드를 포함하는 바이시스트론 벡터를 포함하는 조성물을 제공한다.
일부 실시양태에서, 바이시스트론 벡터는 변형된 GlcNAc-1 PTase를 암호화하는 폴리뉴클레오티드 전방 및 리소좀 효소를 암호화하는 폴리뉴클레오티드 후방에 위치하는 IRES를 포함한다. 다른 실시양태에서, 바이시스트론 벡터는 변형된 GlcNAc-1 PTase를 암호화하는 폴리뉴클레오티드 후방 및 리소좀 효소를 암호화하는 폴리뉴클레오티드 전방에 위치하는 IRES를 포함한다.
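The two IRES arrangements described above differ only in which coding sequence sits upstream of the IRES. The sketch below lists the cassette elements in 5' to 3' order, assuming a single upstream promoter. The element labels and the function name are illustrative and are not part of the disclosure.

```python
ENZYME = "lysosomal enzyme ORF"
PTASE = "modified GlcNAc-1 PTase ORF"


def cassette_layout(first_cistron: str) -> list:
    """Return cassette elements in 5'->3' order.

    first_cistron: "enzyme" places the IRES after the enzyme ORF and before
    the PTase ORF; "ptase" gives the opposite arrangement.
    """
    if first_cistron == "enzyme":
        first, second = ENZYME, PTASE
    elif first_cistron == "ptase":
        first, second = PTASE, ENZYME
    else:
        raise ValueError("first_cistron must be 'enzyme' or 'ptase'")
    # Assumption: one promoter drives a single transcript carrying both ORFs.
    return ["promoter", first, "IRES", second]
```

In either layout the IRES lies between the two open reading frames, so that the second one is translated from the same transcript.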
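The sequence of the IRES may be a sequence known in the art or a variant thereof. An IRES variant may be modified or mutated. In one embodiment, the sequence of the IRES comprises SEQ ID NO: 3. In another embodiment, the sequence of the IRES is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% similar to SEQ ID NO: 3.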
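In one embodiment, the polynucleotide of the lysosomal enzyme is operably linked to 2A DNA encoding a 2A peptide, which in turn is operably linked to the polynucleotide of the modified GlcNAc-1 phosphotransferase (GlcNAc-1 PTase). Various 2A peptides known in the art can be used in the disclosed bicistronic vectors, including, but not limited to, T2A, P2A, E2A, and F2A. In some embodiments, GSG residues may be added to the 5' end of the peptide to improve cleavage efficiency.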
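A minimal sketch of a 2A-linked fusion follows, assuming the commonly reported amino acid sequences for T2A, P2A, E2A, and F2A. These sequences are not given in this specification and are included for illustration only. The sketch models ribosome skipping as a split between the final glycine and proline of the 2A motif. The upstream and downstream sequences used in any call are toy placeholders, not SEQ ID NOs of the disclosure.

```python
# Commonly reported 2A peptide sequences (one-letter code); assumption, not from the source.
TWO_A = {
    "T2A": "EGRGSLLTCGDVEENPGP",
    "P2A": "ATNFSLLKQAGDVEENPGP",
    "E2A": "QCTNYALLKLAGDVESNPGP",
    "F2A": "VKQTLNFDLLKLAGDVESNPGP",
}


def fuse_with_2a(upstream: str, downstream: str, kind: str = "P2A", gsg_linker: bool = True) -> str:
    """Build the single-ORF translation product: upstream - (GSG) - 2A - downstream."""
    linker = "GSG" if gsg_linker else ""
    return upstream + linker + TWO_A[kind] + downstream


def skip_products(polyprotein: str, kind: str = "P2A"):
    """Split the polyprotein where ribosome skipping occurs, i.e. before the
    final proline of the 2A motif (first occurrence of the motif is used)."""
    site = polyprotein.index(TWO_A[kind]) + len(TWO_A[kind]) - 1
    return polyprotein[:site], polyprotein[site:]
```

Under this model, the upstream protein retains most of the 2A peptide at its C-terminus and the downstream protein begins with a proline.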
일부 실시양태에서, 바이시스트론 바이러스 벡터는 리소좀 효소를 암호화하는 폴리뉴클레오티드 및 변형된 GlcNAc-1 PTase를 암호화하는 폴리뉴클레오티드에 작동가능하게 연결된 프로모터를 포함한다.
일부 실시양태에서, 바이시스트론 발현 벡터는 프로모터를 포함한다.
프로모터는 구성적, 유도성/억제성 또는 세포형 특이적일 수 있다. 특정 실시양태에서, 프로모터는 구성적일 수 있다. 포유동물 세포에 대한 구성적 프로모터의 비제한적 예는 CMV, UBC, EF1a, SV40, PGK, CAG, CBA/CAGGS/ACTB, CBh, MeCP2, U6 및 H1을 포함한다. 일부 실시양태에서, 현재 개시된 바이시스트론 벡터는 구성적 프로모터를 포함한다. 일부 실시양태에서, 구성적 프로모터는 사이토메갈로바이러스(CMV) 프로모터이다. 일부 실시양태에서, CMV 프로모터의 폴리뉴클레오티드는 서열번호 2의 핵산 서열을 포함한다.
다른 실시양태에서, 프로모터는 유도성 프로모터일 수 있다. 유도성 프로모터는 테트라사이클린, 열 충격, 스테로이드 호르몬, 중금속, 포르볼 에스테르, 아데노바이러스 E1A 요소, 인터페론 및 혈청 유도성 프로모터로 이루어진 그룹으로부터 선택될 수 있다.
상이한 실시양태에서, 프로모터는 세포형 특이적일 수 있다. 예를 들면, 뉴런(예: 시냅신), 성상세포(예: GFAP), 희돌기교세포(예: 미엘린 염기성 단백질), 소교세포(예: CX3CR1), 신경내분비 세포(예: 크로모그라닌 A), 근육 세포(예: 데스민, Mb) 또는 심근세포(예: 알파 미오신 중쇄 프로모터)에 대한 세포형 특이적 프로모터를 사용할 수 있다. 예시적 실시양태에서, 프로모터는 Nrl(간상체 광수용체 특이적) 프로모터 또는 HBB(헤모글로빈 베타) 프로모터일 수 있다. 프로모터는, 발현을 추가로 증강시키고/시키거나 핵산의 공간적 발현 및/또는 시간적 발현을 변경하기 위해 하나 이상의 특이적 전사 조절 서열을 추가로 포함할 수 있다.
벡터에서 발견되는 인핸서 서열은 또한 그 안에 포함되는 유전자의 발현을 조절한다. 통상, 인핸서는 단백질 인자와 결합하여 유전자의 전사를 증강시킨다. 인핸서는, 그것이 조절하는 유전자의 상류 또는 하류에 위치할 수 있다. 인핸서는 또한, 특정 세포 또는 조직 유형에서 전사를 증강시키기 위해 조직 특이적일 수 있다. 일 실시양태에서, 본원의 바이시스트론 벡터는 벡터 내에 존재하는 유전자의 전사를 촉진하기 위한 하나 이상의 인핸서를 포함한다. 인핸서의 비제한적 예는 CMV 인핸서 및 SP1 인핸서를 포함한다.
일부 실시양태에서, 하나 초과의 프로모터가 폴리펩티드를 암호화하는 각각의 폴리뉴클레오티드에 작동가능하게 연결될 수 있고, 프로모터는 동일하거나 상이할 수 있다. 프로모터와 발현되는 핵산 서열 사이의 거리는 그 프로모터와 그것이 조절하는 천연 핵산 서열 사이의 거리와 대략 동일할 수 있다. 당해 기술분야에 공지된 바와 같이, 이 거리의 변동은 프로모터 기능의 손실 없이 적응될 수 있다.
바이시스트론 벡터 내의 폴리펩티드의 발현을 평가하기 위해, 벡터는 또한, 선별가능한 마커 유전자 또는 리포터 유전자 또는 이들 둘 모두를 포함하여, 바이러스 벡터를 통해 형질감염 또는 감염시키고자 하는 세포 모집단으로부터 발현 세포의 동정 및 선택을 용이하게 할 수 있다. 일부 실시양태에서, 선택가능 마커는 DNA의 개별 조각에 운반될 수 있고, 공-형질감염 절차에 사용될 수 있다. 선택가능한 마커 및 리포터 유전자의 둘 다에, 숙주 세포에서의 발현을 가능하게 하는 적절한 조절 서열을 인접시킬 수 있다. 유용한 선택가능한 마커는, 예를 들면, neo 등의 항생제 내성 유전자를 포함한다.
리포터 유전자는 잠재적으로 형질감염된 세포를 동정하고 조절 서열의 기능을 평가하기 위해 사용된다. 일반적으로, 리포터 유전자는 수용자 생물 또는 조직에 존재하지 않거나, 이에 의해 발현되지 않고 그 발현이 효소 활성 등의 일부 용이하게 검출가능한 특성에 의해 나타나는 폴리펩티드를 암호화하는 유전자이다. 리포터 유전자의 발현은 DNA가 수용자 세포에 도입된 후의 적절한 시간에 검정된다. 적합한 리포터 유전자는 루시페라제, 베타-갈락토시다제, 클로람페니콜 아세틸 트랜스퍼라제, 분비된 알칼리성 포스파타제, 또는 녹색 형광 단백질 유전자를 암호화하는 유전자를 포함할 수 있다[참조: 예를 들면, Ui-Tei et al., 2000 FEBS Letters 479: 79-82]. 적합한 발현 시스템은 공지되어 있고, 공지된 기술을 사용하여 제조하거나, 상업적으로 입수할 수 있다. 일반적으로, 리포터 유전자의 최고 수준의 발현을 나타내는 최소 5' 인접 영역을 갖는 작제물이 프로모터로서 동정된다. 이러한 프로모터 영역은 리포터 유전자에 연결될 수 있고, 프로모터-유도 전사를 조절하는 능력에 대해 약제를 평가하기 위해 사용될 수 있다.
유전자를 세포 내로 도입하고 발현시키는 방법은 당해 기술분야에 공지되어 있다. 발현 벡터와 관련하여, 벡터는 당해 기술분야의 임의의 방법에 의해 숙주 세포, 예를 들면, 포유동물, 박테리아, 효모 또는 곤충 세포에 용이하게 도입될 수 있다. 예를 들면, 발현 벡터는 물리적, 화학적 또는 생물학적 수단에 의해 숙주 세포로 전달될 수 있다.
폴리뉴클레오티드를 숙주 세포에 도입하기 위한 물리적 방법에는 인산칼슘 침전, 리포펙션, 입자 충격, 미세주입, 전기천공법 등이 포함된다. 벡터 및/또는 외인성 핵산을 포함하는 세포를 생산하는 방법은 당해 기술분야에 공지되어 있다[참조: 예를 들면, Sambrook et al., 2001, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, New York]. 폴리뉴클레오티드를 숙주 세포에 도입하기 위한 바람직한 방법은 인산칼슘 형질감염이다.
목적의 폴리뉴클레오티드를 숙주 세포에 도입하기 위한 생물학적 방법에는 DNA 및 RNA 벡터의 사용이 포함된다. 바이러스 벡터, 특히 레트로바이러스 벡터는 유전자를 포유동물, 예를 들면, 인간 세포에 삽입하기 위해 가장 광범위하게 사용되는 방법이 되었다. 다른 바이러스 벡터는 렌티바이러스, 폭스바이러스, 단순 헤르페스 바이러스 I, 아데노바이러스 및 아데노-연관 바이러스 등으로부터 유래될 수 있다[참조: 예를 들면, 미국 특허 제5,350,674호 및 제5,585,362호].
폴리뉴클레오티드를 숙주 세포에 도입하기 위한 화학적 수단은 거대분자 복합체, 나노캡슐, 미소구체, 비드 등의 콜로이드 분산 시스템, 및 수중유 에멀젼, 미셀, 혼합 미셀 및 리포솜을 포함하는 지질-기반 시스템을 포함한다. 시험관내 및 생체내 전달 비히클로서 사용하기 위한 예시적 콜로이드 시스템은 리포솜(예를 들면, 인공 막 소포)이다.
비-바이러스 전달 시스템이 사용되는 일부 실시양태에서, 예시적 전달 비히클은 리포솜이다. 핵산을 숙주 세포로 도입하기 위해 지질 제형의 사용이 고려된다(시험관내, 생체외 또는 생체내). 일부 실시양태에서, 핵산은 지질과 회합될 수 있다. 지질과 회합된 핵산은 리포솜의 수성 내부에 캡슐화되거나, 리포솜의 지질 이중층 내에 산재되거나, 리포솜 및 올리고뉴클레오티드 둘 모두와 관련되는 연결 분자를 통해 리포솜에 부착되거나, 리포솜에 포획되거나, 리포솜과 복합체를 형성하거나, 지질을 포함하는 용액에 분산되거나, 지질과 혼합되거나, 지질과 조합되거나, 지질에 현탁액으로 함유되거나, 미셀과 함께 함유되거나 미셀과 복합체를 형성하거나, 또는 달리는 지질과 회합될 수 있다. 지질, 지질/DNA 또는 지질/발현 벡터 회합 조성물은 용액 중의 임의의 특정 구조로 한정되지 않는다. 예를 들면, 이들은 미셀로서 또는 "붕괴된" 구조로 이중층 구조로 존재할 수 있다. 이들은 또한, 단순히 용액에 산재되어, 크기 또는 형상이 균일하지 않은 응집체를 형성할 수도 있다. 지질은 천연에 존재하는 지질 또는 합성 지질일 수 있는 지방성 물질이다. 예를 들면, 지질에는, 세포질에서 자연적으로 존재하는 지방 액적과, 지방산, 알코올, 아민, 아미노 알코올 및 알데하이드 등의 장쇄 지방족 탄화수소 및 이들의 유도체를 포함하는 화합물 부류가 포함된다.
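Lipids suitable for use can be obtained from commercial sources. For example, dimyristyl phosphatidylcholine ("DMPC") can be obtained from Sigma, St. Louis, MO; dicetyl phosphate ("DCP") can be obtained from K & K Laboratories (Plainview, NY); cholesterol ("Chol") can be obtained from Calbiochem-Behring; and dimyristyl phosphatidylglycerol ("DMPG") and other lipids can be obtained from Avanti Polar Lipids, Inc. (Birmingham, AL). Stock solutions of lipids in chloroform or chloroform/methanol can be stored at about -20°C. Chloroform is used as the only solvent because it evaporates more readily than methanol. "Liposome" is a generic term encompassing a variety of unilamellar and multilamellar lipid vehicles formed by the generation of enclosed lipid bilayers or aggregates. Liposomes can be characterized as having vesicular structures with a phospholipid bilayer membrane and an inner aqueous medium. Multilamellar liposomes have multiple lipid layers separated by aqueous medium. They form spontaneously when phospholipids are suspended in an excess of aqueous solution. The lipid components undergo self-rearrangement before the formation of closed structures and entrap water and dissolved solutes between the lipid bilayers [see Ghosh et al., 1991 Glycobiology 5: 505-10]. However, compositions that have structures in solution different from the normal vesicular structure are also encompassed. For example, the lipids may assume a micellar structure or merely exist as nonuniform aggregates of lipid molecules. Lipofectamine-nucleic acid complexes are also contemplated.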
외인성 핵산을 숙주 세포에 도입하기 위해 사용되는 방법에 관계없이, 숙주 세포에서 재조합 DNA 서열의 존재를 확인하기 위해, 다양한 검정이 수행될 수 있다. 이러한 검정은, 예를 들면, 서던 및 노던 블롯팅, RT-PCR 및 PCR 등의 당업자에게 공지되어 있는 "분자 생물학적" 분석; 예를 들면, 면역학적 수단(ELISA 및 웨스턴 블롯)에 의해 또는 본 개시의 범위 내에 속하는 약제를 동정하기 위해 본 명세서에 기재된 검정에 의해 특정 펩티드의 존재 또는 부재를 검출하는 것 등의 "생화학적" 검정을 포함한다.
유전자 치료용 벡터
본 명세서에 개시된 바와 같이 대상체에서 LSD를 치료 또는 예방하기 위해 사용되는 벡터는 복제 및 임의로 진핵 세포에서의 통합에 적합하다. 전형적 벡터는 목적하는 핵산 서열의 발현 조절에 유용한 전사 및 번역 터미네이터, 개시 서열 및 프로모터를 함유한다.
본 개시의 벡터는 또한 표준 유전자 전달 프로토콜을 사용하여 핵산 면역화 및 유전자 치료에 사용될 수 있다. 유전자 전달 방법은 당해 기술분야에 공지되어 있다[참조: 예를 들면, 미국 특허 제5,399,346호, 제5,580,859호, 제5,589,466호, 그 전체가 참조에 의해 본 명세서에 도입된다]. 또 다른 실시양태에서, 본 개시는 유전자 치료 벡터를 제공한다.
본 개시의 단리된 핵산은 다수 유형의 벡터로 클로닝될 수 있다. 예를 들면, 핵산은 플라스미드, 파지미드, 파지 유도체, 동물 바이러스 및 코스미드를 포함하지만 이들로 한정되지 않는 벡터 내로 클로닝될 수 있다. 목적 벡터에는 발현 벡터, 복제 벡터, 프로브 생성 벡터 및 서열분석 벡터가 포함된다.
추가로, 벡터는 바이러스 벡터의 형태로 세포에 제공될 수 있다. 바이러스 벡터 기술은 당해 기술분야에 공지되어 있고, 예를 들면, 문헌[참조: Sambrook et al. (2001, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, New York)] 및 기타 바이러스학 및 분자 생물학 매뉴얼에 기재되어 있다. 벡터로서 유용한 바이러스는 레트로바이러스, 아데노바이러스, 아데노-연관 바이러스, 헤르페스 바이러스 및 렌티바이러스를 포함하지만 이들로 한정되지 않는다. 일반적으로, 적합한 벡터는 적어도 하나의 생물에서 기능적 복제 기점, 프로모터 서열, 편리한 제한 엔도뉴클레아제 부위, 및 하나 이상의 선택가능한 마커를 함유한다(예를 들면, WO 01/96584; WO 01/29058; 및 미국 특허 제6,326,193호).
포유동물 세포로의 유전자 전달을 위해, 다수의 바이러스 기반 시스템이 개발되었다. 예를 들면, 레트로바이러스는 유전자 전달 시스템에 편리한 플랫폼을 제공한다. 선택된 유전자는 당해 기술분야에 공지된 기술을 사용하여 벡터에 삽입되고, 레트로바이러스 입자에 패키징될 수 있다. 이어서, 재조합 바이러스를 단리하여, 생체내 또는 생체외에서 대상체의 세포로 전달할 수 있다. 다수의 레트로바이러스 시스템이 당해 기술분야에 공지되어 있다. 일부 실시양태에서, 아데노바이러스 벡터가 사용된다. 다수의 아데노바이러스 벡터가 당해 기술분야에 공지되어 있다. 일 실시양태에서, 렌티바이러스 벡터가 사용된다.
예를 들면, 렌티바이러스 등의 레트로바이러스에서 유래하는 벡터는, 이들이 딸 세포에서 도입유전자의 장기적으로 안정한 통합 및 이의 전파를 가능하게 하기 때문에, 장기적 유전자 전달을 달성하기 위한 적합한 도구이다. 렌티바이러스 벡터는 간세포 등의 비증식성 세포를 형질도입할 수 있다는 점에서 마우스 백혈병 바이러스 등의 종양-레트로바이러스에서 유래하는 벡터에 비해 추가 이점이 있다. 이들은 또한 면역원성이 낮다는 추가 이점을 갖고 있다. 바람직한 실시양태에서, 조성물은 아데노-연관 바이러스(AAV)로부터 유래하는 벡터를 포함한다. 아데노-연관 바이러스(AAV) 벡터는 다양한 장애의 치료를 위한 강력한 유전자 전달 도구로 되었다. AAV 벡터는, 병원성의 결여, 최소 면역원성, 안정적이고 효율적인 방식으로 유사분열 후의 세포를 형질도입하는 능력을 포함하여, 유전자 치료에 이상적으로 적합하도록 하는 다수의 특징을 갖고 있다. AAV 벡터 내에 함유된 특정 유전자의 발현은 AAV 혈청형, 프로모터 및 전달 방법의 적절한 조합을 선택함으로써 하나 이상의 유형의 세포를 특이적으로 표적화할 수 있다.
In some embodiments, the disclosed bicistronic viral vector comprises an adenovirus (e.g., Ad-SYE, AdSur-SYE, Ad5/3-MDA7/IL-24, Ad-SB, Ad-CRISPR, oncolytic Ad); an adeno-associated virus, AAV (e.g., AAV-MeCP2, AAV1, AAV5, Dual AAV9, AAV8, AAV9, AAVrh10, AAVhu37); a herpes simplex virus, HSV (e.g., HSV1, HSV2, HSV-1, HF10 oncolytic HSV-2); a retrovirus (e.g., RRV/Toca 511, GRV); a lentivirus (e.g., HIV-1, HIV-2); an alphavirus (SFV, M1); a flavivirus (Kunjin virus); a rhabdovirus (VSV); a measles virus (e.g., MV-Edm); a Newcastle disease virus (e.g., NDV90); a picornavirus such as Coxsackievirus (e.g., CVB3, CAV21, EV1); or a poxvirus (e.g., PANVAC, VV, VV-GLV-1h153, CPXV).
일 실시양태에서, 개시된 바이시스트론 바이러스 벡터는 아데노바이러스, 아데노-연관 바이러스(AAV), 알파바이러스, 플라비바이러스, 단순 헤르페스 바이러스(HSV), 홍역 바이러스, 랍도바이러스, 레트로바이러스, 렌티바이러스, 뉴캐슬병 바이러스(NDV), 폭스바이러스 또는 피코르나바이러스이다. 일 실시양태에서, 개시된 바이시스트론 바이러스 벡터는 아데노바이러스, 아데노-연관 바이러스(AAV), 레트로바이러스 또는 렌티바이러스이다.
일 실시양태에서, 리소좀 효소를 암호화하는 폴리뉴클레오티드 및 변형된 GlcNAc-1 PTase를 암호화하는 폴리뉴클레오티드는 AAV 벡터 내에 함유된다. 30개 이상의 천연 존재 AAV 혈청형을 사용할 수 있다. AAV 캡시드에는 다수의 천연 변이체가 존재하여, 골격근에 특히 적합한 특성을 갖는 AAV의 동정 및 사용을 가능하게 한다. AAV 바이러스는 종래의 분자 생물학 기술을 사용하여 조작할 수 있고, 이는, 몇 가지 열거하면, 핵산 서열의 세포 특이적 전달, 면역원성의 최소화, 안정성 및 입자 수명의 조정, 효율적 분해, 핵으로의 정확한 전달을 위해 이들 입자를 최적화할 수 있게 한다.
AAV의 사용은, 비교적 독성이 없고, 효율적 유전자 전달을 제공하며, 특정 목적에 용이하게 최적화될 수 있기 때문에, DNA의 외인성 전달의 일반적 모드이다. 인간 또는 비인간 영장류(NHP)로부터 단리되고 충분히 특성화된 AAV의 혈청형 중에서, 인간 혈청형 2는 유전자 전달 벡터로서 개발된 최초의 AAV이고; 그것은 상이한 표적 조직 및 동물 모델에서 효율적 유전자 전달 실험에 광범위하게 사용되어 왔다. 일부 인간 질환 모델에 대한 AAV2 기반 벡터의 실험적 적용의 임상 시험이 진행 중이고, 예를 들면, 낭포성 섬유증 및 혈우병 B 등의 질환의 치료가 포함된다. 기타 유용한 AAV 혈청형에는 AAV1, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8 및 AAV9이 포함된다.
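Preferred AAV fragments for assembly into vectors include the cap proteins, including vp1, vp2, vp3, and the hypervariable regions; the rep proteins, including rep 78, rep 68, rep 52, and rep 40; and the sequences encoding these proteins. These fragments may be readily utilized in a variety of vector systems and host cells. Such fragments may be used alone, in combination with other AAV serotype sequences or fragments, or in combination with elements from other AAV or non-AAV viral sequences. As used herein, artificial AAV serotypes include, without limitation, AAV with a non-naturally occurring capsid protein. Such an artificial capsid may be generated by any suitable technique, using a selected AAV sequence (e.g., a fragment of a vp1 capsid protein) in combination with heterologous sequences which may be obtained from a different selected AAV serotype, non-contiguous portions of the same AAV serotype, a non-AAV viral source, or a non-viral source. An artificial AAV serotype may be, without limitation, a chimeric AAV capsid, a recombinant AAV capsid, or a "humanized" AAV capsid. Thus, exemplary AAVs, or artificial AAVs, suitable for expression of the lysosomal enzyme of interest and the modified GlcNAc-1 PTase include AAV2/8 (see U.S. Pat. No. 7,282,199), AAV2/5 (available from the National Institutes of Health), AAV2/9 (International Publication No. WO2005/033321), AAV2/6 (U.S. Pat. No. 6,156,303), and AAVrh8 (International Publication No. WO2003/042397), among others.
In one embodiment, the vectors useful in the compositions and methods described herein contain, at a minimum, sequences encoding a selected AAV serotype capsid, e.g., an AAV8 capsid, or a fragment thereof. In another embodiment, useful vectors contain, at a minimum, sequences encoding a selected AAV serotype rep protein, e.g., an AAV8 rep protein, or a fragment thereof. Optionally, such vectors may contain both AAV cap and rep proteins. In vectors in which both AAV rep and cap are provided, the AAV rep and AAV cap sequences can both be of one serotype origin, e.g., all of AAV8 origin. Alternatively, vectors may be used in which the rep sequences are from an AAV serotype that differs from the one providing the cap sequences. In one embodiment, the rep and cap sequences are expressed from separate sources (e.g., separate vectors, or a host cell and a vector). In another embodiment, these rep sequences are fused in frame to cap sequences of a different AAV serotype to form a chimeric AAV vector, such as AAV2/8 described in U.S. Pat. No. 7,282,199.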
적합한 재조합 아데노-연관 바이러스(AAV)는, 본 명세서에 정의된 바와 같이, 아데노-연관 바이러스(AAV) 혈청형 캡시드 단백질 또는 이의 단편을 암호화하는 핵산 서열; 기능적 rep 유전자; 적어도, AAV 역방향 말단 반복체(ITR) 및 리소좀 효소를 암호화하는 폴리뉴클레오티드 및 변형된 GlcNAc-1 PTase를 암호화하는 폴리뉴클레오티드로 구성된 미니유전자; 및 AAV 캡시드 단백질 내로 미니유전자의 패키징을 가능하게 하기에 충분한 헬퍼 기능을 함유하는 숙주 세포를 배양함으로써 생성된다. AAV 캡시드에 AAV 미니유전자를 패키징하기 위해 숙주 세포에서 배양되는 데 필요한 성분은 트랜스로 숙주 세포에 제공될 수 있다. 또는, 필수 구성요소(예: 미니유전자, rep 서열, 캡 서열 및/또는 헬퍼 기능) 중 임의의 하나 이상은 당업자에게 공지된 방법을 사용하여 하나 이상의 필수 구성요소를 함유하도록 조작된 안정한 숙주 세포에 의해 제공될 수 있다.
가장 적합하게는, 이러한 안정한 숙주 세포는 구성적 프로모터의 조절하에 필요한 구성요소(들)를 함유할 것이다. 그러나, 필요한 구성요소는 유도성 프로모터의 조절하에 있을 수 있다. 적합한 유도성 및 구성적 프로모터의 예는 본 명세서의 다른 곳에서 제공되고, 당해 기술분야에 공지되어 있다. 또 다른 대안에서, 선택된 안정한 숙주 세포는 구성적 프로모터의 조절하에 선택된 구성요소(들) 및 하나 이상의 유도성 프로모터의 조절하에 있는 다른 선택된 구성요소(들)를 함유할 수 있다. 예를 들면, 293 세포(구성적 프로모터의 조절하에 E1 헬퍼 기능을 함유함)로부터 유래하지만 유도성 프로모터의 조절하에 rep 및/또는 cap 단백질을 함유하는 안정한 숙주 세포가 생성될 수 있다. 추가로 기타 안정한 숙주 세포는 당업자에 의해 생성될 수 있다.
본 개시의 rAAV를 생성하는데 필요한 미니유전자, rep 서열, 캡 서열, 및 헬퍼 기능은 그 위에 운반된 서열을 전달하는 임의의 유전적 요소의 형태로 패키징 숙주 세포에 전달될 수 있다. 선택된 유전적 요소는 본 명세서에 기재된 방법 및 당해 기술분야에서 이용가능한 임의의 기타 방법을 포함하는 임의의 적합한 방법을 사용하여 전달될 수 있다. 본 개시의 임의의 실시양태를 구성하기 위해 사용되는 방법은 핵산 조작의 숙련가에게 공지되어 있고, 유전 공학, 재조합 공학 및 합성 기술을 포함한다[참조: 예를 들면, Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, NY]. 유사하게는, rAAV 비리온을 생성하는 방법은 공지되어 있고, 적절한 방법의 선택은 본 개시에 대한 제한은 아니다[참조: 예를 들면, K. Fisher et al., 1993 J. Virol., 70:520-532 및 미국 특허 제5,478,745호 등].
달리 명시되지 않는 한, AAV ITR 및 본 명세서에 기재된 기타 선택된 AAV 성분은 AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9 또는 기타 것을 포함하지만 이들로 한정되지 않는 임의의 AAV 혈청형 또는 기타 공지된 또는 미지의 AAV 혈청형 중에서 용이하게 선택될 수 있다. 이들 ITR 또는 기타 AAV 성분은 당업자에게 이용가능한 기술을 사용하여 AAV 혈청형으로부터 용이하게 단리될 수 있다. 이러한 AAV는 학술적, 상업적 또는 공적 공급원[참조: 예를 들면, American Type Culture Collection, Manassas, VA]로부터 단리되거나 수득될 수 있다. 또는, AAV 서열은, 문헌 또는 데이터베이스, 예를 들면, GenBank, PubMed 등에서 이용가능하도록 공개된 서열을 참조함으로써 합성 또는 기타 적절한 수단을 통해 수득할 수 있다.
일부 실시양태에서, 바이시스트론 벡터는 서열번호 1의 핵산 서열을 포함한다. 다른 실시양태에서, 바이시스트론 벡터는 서열번호 1과 적어도 20%, 적어도 30%, 적어도 40%, 적어도 50%, 적어도 60%, 적어도 70%, 적어도 75%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 95%, 적어도 99% 유사성을 갖는 핵산 서열을 포함한다.
일부 실시양태에서, 암호화된 리소좀 효소는 하기 표 1A, 표 1B 또는 표 1C에 수록된 바와 같은 적어도 하나의 리소좀 축적 장애(LSD)에 관여한다. 다른 실시양태에서, 리소좀 효소는 하기 표 1A, 표 1B 또는 표 1C에 수록된 바와 같은 적어도 하나이다.
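[Table 1A] ERT embodiments (enzymes with UniProt accession numbers)
[Table 1B] Gene therapy embodiments (enzymes with UniProt accession numbers)
[Table 1C] Lysosomal disorders (proteins with UniProt accession numbers)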
일부 실시양태에서, 리소좀 효소는 β-글루코세브로시다제(GBA), 갈락토실세레미다제(GALC), α-갈락토시다제(GLA), α-N-아세틸글루코사미니다제(NAGLU), 산 α-글루코시다제(GAA) 및 리소좀 산 α-만노시다제(LAMAN)로 이루어진 그룹으로부터 선택된다. 또 다른 실시양태에서, 리소좀 효소를 암호화하는 폴리뉴클레오티드는 서열번호 5 내지 10의 핵산 서열을 포함한다. 다른 실시양태에서, 리소좀 효소는 서열번호 5 내지 10과 적어도 50%, 적어도 60%, 적어도 70%, 적어도 75%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 95%, 적어도 99% 유사성을 갖는 폴리뉴클레오티드에 의해 암호화된다.
일부 실시양태에서, S1-S3 PTase는 서열번호 4의 핵산 서열을 포함하는 폴리뉴클레오티드에 의해 암호화된다. 다른 실시양태에서, GlcNAc-1 PTase는 서열번호 4와 적어도 50%, 적어도 60%, 적어도 70%, 적어도 75%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 95%, 적어도 99% 유사성을 갖는 폴리뉴클레오티드에 의해 암호화된다.
본 개시는 또한, 본 명세서에 개시된 것들과 실질적인 상동성을 갖는 임의 형태의 폴리펩티드 또는 폴리뉴클레오티드를 포함하는 것으로 해석되어야 한다.
바람직하게는, "실질적으로 상동성"인 폴리펩티드는, 본 명세서에 개시된 펩티드의 아미노산 서열과 약 50% 상동성, 보다 바람직하게는 약 70% 상동성, 훨씬 더 바람직하게는 약 80% 상동성, 더욱 바람직하게는 약 90% 상동성, 훨씬 더 바람직하게는 약 95% 상동성, 및 훨씬 더 바람직하게는 약 99% 상동성이다.
또는, 폴리펩티드는 재조합 수단에 의해 또는 더 긴 폴리펩티드로부터의 절단에 의해 제조될 수 있다. 펩티드의 조성은 아미노산 분석 또는 서열분석에 의해 확인할 수 있다. 본 개시에 따른 폴리펩티드의 변이체는 (i) 하나 이상의 아미노산 잔기가 보존 또는 비보존 아미노산 잔기(바람직하게는 보존 아미노산 잔기)로 치환되고 이러한 치환된 아미노산 잔기가 유전 코드에 의해 암호화된 것일 수도 있고 아닐 수도 있는 것, (ii) 하나 이상의 변형된 아미노산 잔기, 예를 들면, 치환기의 부착에 의해 변형된 잔기가 존재하는 것, (iii) 폴리펩티드가 본 개시의 폴리펩티드의 선택적 스플라이스 변이체인 것, (iv) 폴리펩티드의 단편 및/또는 (v) 폴리펩티드가 리더 또는 분비 서열 또는 정제(예: His-태그) 또는 검출(예: Sv5 에피토프 태그)에 사용되는 서열 등의 또 다른 폴리펩티드와 융합되어 있는 것일 수 있다. 단편에는 원래 서열의 단백질 분해 절단(다중 부위 단백질 분해 포함)을 통해 생성된 폴리펩티드가 포함된다. 변이체는 번역 후 또는 화학적으로 변형될 수 있다. 이러한 변이체는 본 명세서의 교시로부터 당업자의 범위 내에 있는 것으로 간주된다.
당해 기술분야에 공지된 바와 같이, 2개의 폴리펩티드 사이의 "유사성"은 아미노산 서열 및 1개 폴리펩티드의 이의 보존된 아미노산 치환체를 제2 폴리펩티드의 서열과 비교함으로써 결정된다. 변이체는 원래 서열과 상이한, 바람직하게는 목적 세그먼트당 잔기의 40% 미만으로 원래 서열과 상이한, 더 바람직하게는 목적 세그먼트당 잔기의 25% 미만으로 원래 서열과 상이한, 보다 바람직하게는 목적 세그먼트당 잔기의 10% 미만으로 상이한, 가장 바람직하게는 목적 세그먼트당 단 수개의 잔기에서 원래 단백질 서열과 상이하고 동시에 원래 서열의 기능 및/또는 유비퀴틴 또는 유비퀴틴화된 단백질에 결합하는 능력을 보존하기 위해 원래 서열과 충분히 상동성인 폴리펩티드 서열을 포함하는 것으로 정의된다. 본 개시는 원래 아미노산 서열과 적어도 60%, 65%, 70%, 72%, 74%, 76%, 78%, 80%, 90%, 또는 95% 유사 또는 동일한 아미노산 서열을 포함한다. 2개의 폴리펩티드 사이의 동일성 정도는 당업자에게 널리 공지된 컴퓨터 알고리즘 및 방법을 사용하여 결정된다. 2개의 아미노산 서열 사이의 동일성은 바람직하게는 BLASTP 알고리즘[참조: BLAST Manual, Altschul, S., et al., NCBI NLM NIH Bethesda, Md. 20894, Altschul, S., et al., J. Mol. Biol. 215: 403-410(1990)]을 사용함으로써 결정된다.
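The percentage thresholds used throughout (for example, at least 60% or at least 95% identical) can be illustrated with a simplified identity calculation over two sequences that have already been aligned. This is a sketch only and not the BLASTP algorithm referenced above. It assumes gaps are written as "-" and that identity is computed over all alignment columns. The function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of alignment columns holding identical residues.

    Both inputs must be pre-aligned to the same length. Columns where both
    sequences have a gap are ignored; a residue opposite a gap is a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == gap and y == gap)]
    if not columns:
        raise ValueError("no comparable columns")
    matches = sum(1 for x, y in columns if x == y and x != gap)
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float) -> bool:
    """True if the identity is at least `threshold` percent."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

Different tools normalize by different lengths (alignment length, shorter sequence, or query length), so a reported percentage depends on the convention chosen.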
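The polypeptides disclosed herein may be post-translationally modified. For example, post-translational modifications that fall within the scope of the present disclosure include signal peptide cleavage, glycosylation, acetylation, isoprenylation, proteolysis, myristoylation, protein folding, and proteolytic processing. Some modifications or processing events require the introduction of additional biological machinery. For example, processing events such as signal peptide cleavage and core glycosylation are examined by adding canine microsomal membranes or Xenopus egg extracts to a standard translation reaction.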
본 개시의 폴리펩티드는 번역후 변형에 의해 또는 번역 동안 비천연 아미노산을 도입함으로써 형성된 비천연 아미노산을 포함할 수 있다. 단백질 번역 동안 비천연 아미노산을 도입하기 위해 다양한 접근 방식이 이용가능하다.
본원에 사용된 용어 "기능적으로 동등"은, 본 개시의 리소좀 효소의 특정 아미노산 서열의 적어도 하나의 생물학적 기능 또는 활성을 바람직하게는 보유하는 폴리펩티드를 지칭한다.
폴리펩티드는, 융합 단백질을 제조하기 위해, 단백질 등의 다른 분자와 접합시킬 수 있다. 이것은, 예를 들면, 수득된 융합 단백질이 본 개시의 리소좀 효소의 기능성을 유지한다는 조건하에, N-말단 또는 C-말단 융합 단백질의 합성에 의해 달성될 수 있다.
폴리펩티드는 종래의 방법을 사용하여 포스포릴화될 수 있다. 일 실시양태에서, 본원에 개시된 리소좀 효소는 본원에 개시된 변형된 GlcNAc-1 포스포트랜스퍼라제(GlcNAc-1 PTase) 덕분에 포스포릴화될 수 있다.
펩티드 또는 키메라 단백질의 환상 유도체가 또한 본 명세서에서 고려된다. 환화는 펩티드 또는 키메라 단백질이 다른 분자와의 회합에 보다 바람직한 형태를 취하도록 할 수 있다. 환화는 당해 기술분야에 공지된 기술을 사용하여 달성될 수 있다. 예를 들면, 유리 설프하이드릴 그룹을 갖는 2개의 적절하게 이격된 성분 사이에 디설파이드 결합을 형성할 수 있거나, 한 성분의 아미노 그룹과 다른 성분의 카복실 그룹 사이에 아미드 결합을 형성할 수 있다.
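Cyclization may also be achieved using an azobenzene-containing amino acid. The components that form the bonds may be side chains of amino acids, non-amino acid components, or a combination of the two. In one embodiment, cyclic peptides may comprise a beta-turn in the right position. Beta-turns may be introduced into the peptides of the present disclosure by adding the amino acids Pro-Gly at the right position. It may be desirable to produce a cyclic peptide that is more flexible than the cyclic peptides containing peptide bond linkages as described above. A more flexible peptide may be prepared by introducing cysteines at the right and left positions of the peptide and forming a disulfide bridge between the two cysteines. The two cysteines are arranged so as not to deform the beta-sheet and turn. The peptide is more flexible as a result of the length of the disulfide linkage and the smaller number of hydrogen bonds in the beta-sheet portion. The relative flexibility of a cyclic peptide can be determined by molecular dynamics simulations.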
태그
일 실시양태에서, 본 명세서에 개시된 폴리펩티드는 태그의 아미노산 서열을 추가로 포함한다. 태그에는 다음이 포함되지만 이들로 한정되지 않는다: 폴리히스티딘 태그(His-태그)(예: H6 및 H10 등) 또는 IMAC 시스템에서 사용하기 위한 기타 태그(예: Ni2+ 친화성 컬럼 등), GST 융합, MBP 융합, 스트렙트아비딘-태그, 박테리아 효소 BIRA의 BSP 비오티닐화 표적 서열 및 항체에 의해 지시되는 태그 에피토프(예: c-myc 태그, FLAG-태그, HPC4-태그 등). 당업자에 의해 관찰되는 바와 같이, 태그 펩티드는 본 개시의 융합 단백질의 정제, 검사, 선택 및/또는 시각화에 사용될 수 있다. 일 실시양태에서, 태그는 검출 태그 및/또는 정제 태그이다. 태그 서열은 본 개시의 단백질의 기능을 간섭하지 않는 것이 이해될 것이다.
리더 및 분비 서열
따라서, 본 개시의 폴리펩티드는 리더 또는 분비 서열, 또는 정제 또는 검출에 사용되는 서열 등의 또 다른 폴리펩티드 또는 태그에 융합될 수 있다. 일부 실시양태에서, 본 개시의 폴리펩티드는 본 개시의 폴리펩티드의 신속한 고친화성 정제의 기초를 제공하는 글루타티온-S-트랜스퍼라제 단백질 태그를 포함한다. 실제로, 이 GST 융합 단백질은 글루타티온에 대한 높은 친화성을 통해 세포로부터 정제될 수 있다. 아가로스 비드는 글루타티온에 결합될 수 있고, 이러한 글루타티온-아가로스 비드는 GST-단백질에 결합한다. 따라서, 특정 실시양태에서, 폴리펩티드는 고체 지지체에 결합될 수 있다. 일부 실시양태에서, 폴리펩티드가 GST 부분을 포함하는 경우, 폴리펩티드는 글루타티온-변형된 지지체에 커플링된다. 일부 실시양태에서, 글루타티온 변형된 지지체는 글루타티온-아가로스 비드이다. 추가로, 프로테아제 절단 부위를 암호화하는 서열을 친화성 태그와 폴리펩티드 서열 사이에 포함시킬 수 있고, 따라서 이 특정 효소와 함께 인큐베이션 후에 결합 태그의 제거를 가능하게 하고, 따라서 대응하는 목적 단백질의 정제를 용이하게 한다.
본 명세서에 개시된 폴리펩티드는 또한, 표적 단백질, 및/또는 키메라 단백질을 목적하는 세포 성분 또는 세포 유형 또는 조직으로 지시할 수 있는 표적화 도메인에 융합 또는 통합될 수 있다. 키메라 단백질은 또한, 추가 아미노산 서열 또는 도메인을 함유할 수 있다. 키메라 단백질은 다양한 구성요소가 다른 공급원에서 유래한다는 의미에서 재조합체이고, 따라서 자연에서는 함께 발견되지 않는다(즉, 이종성임).
본 개시의 조성물의 일부 실시양태에서, 폴리펩티드는 본 개시의 리소좀 단백질의 펩티드모방체를 포함하거나, 벡터는 본 개시의 리소좀 단백질의 펩티드모방체를 암호화한다. 펩티드모방체는 펩티드 및 단백질을 기반으로 하거나, 이들로부터 유래하는 화합물이다.
다른 분자와 접합된 본 개시의 펩티드 또는 키메라 단백질을 포함하는 N-말단 또는 C-말단 융합 단백질은, 재조합 기술을 통해, 펩티드 또는 키메라 단백질의 N-말단 또는 C-말단, 및 선택된 단백질의 서열 또는 목적하는 생물학적 기능을 갖는 선택가능한 마커를 융합함으로써 제조될 수 있다. 수득된 융합 단백질은, 본 명세서에 기재된 바와 같이, 선택된 단백질 또는 마커 단백질에 융합된 펩티드 또는 키메라 단백질을 포함하는 리소좀 효소를 함유한다. 융합 단백질을 제조하기 위해 사용될 수 있는 단백질의 예는 면역글로불린, 글루타티온-S-트랜스퍼라제(GST), 헤마글루티닌(HA) 및 절두형 myc를 포함한다.
본원에 개시된 폴리펩티드 및 키메라 단백질은 염산, 황산, 브롬화수소산, 인산 등의 무기산 또는 포름산, 아세트산, 프로피온산, 글리콜산, 락트산, 피루브산, 옥살산, 숙신산, 말산, 타르타르산, 시트르산, 벤조산, 살리실산, 벤젠술폰산 및 톨루엔술폰산 등의 유기산과 반응함으로써 약제학적 염으로 전환될 수 있다.
변형된 세포
일부 실시양태에서, 본 개시는 본 개시의 벡터를 포함하는 세포를 제공한다. 일부 실시양태에서, 벡터는 바이러스 벡터(예를 들면, AAV 또는 렌티바이러스 벡터)이다. 일부 실시양태에서, 벡터는 비-바이러스 벡터(예를 들면, 리포솜, 나노입자, 지질 나노입자, 미셀, 폴리머솜, 엑소좀)이다. 일부 실시양태에서, 벡터는 발현 벡터이다. 일부 실시양태에서, 벡터는 적어도 2개의 서열의 바이시스트론, 폴리시스트론 또는 멀티시스트론 발현을 가능하게 하는 적어도 하나의 요소를 함유한다. 일부 실시양태에서, 벡터는 본 개시의 리소좀 효소를 암호화하는 서열을 포함한다. 대안적으로 또는 추가로, 일부 실시양태에서, 벡터는 본 개시의 S1S3 작제물을 암호화하는 서열을 포함한다. 일부 실시양태에서, 리소좀 효소는 표 1A, 표 1B 또는 표 1C에 수록된 효소 중 하나 이상이다. 일부 실시양태에서, 벡터는 리소좀 효소를 암호화하는 핵산 또는 아미노산 서열을 포함하고, 표 1A, 표 1B 또는 표 1C에 수록된 효소 중 하나 이상이다.
일부 실시양태에서, 본 개시의 벡터를 포함하는 세포는 본 개시의 변형된 세포이다. 일부 실시양태에서, 본 개시의 벡터를 포함하는 세포는 비-천연 존재이다.
일부 실시양태에서, 세포는 인간 서열을 발현하고/하거나 인간 단백질을 생산할 수 있는 포유동물 세포이다. 일부 실시양태에서, 포유동물 세포는 마우스, 랫트, 기니 피그, 래빗, 고양이, 개 또는 비인간 영장류로부터 단리되거나 유래된다.
일부 실시양태에서, 세포는 인간 서열을 발현하고/하거나 인간 단백질을 생산할 수 있는 인간 세포이다.
일부 실시양태에서, 세포는, 본 개시의 벡터를 발현하도록 변형되고 생체외에서 배양된 1차 세포이다. 일부 실시양태에서, 배양된 세포는 불사멸화되거나, 기타 방법으로 변형되어 시험관내에서 세포의 무기한 증식을 촉진하고, 배양된 세포주를 생성한다.
숙주 세포
일부 실시양태에서, 본 개시는 본 개시의 바이시스트론 벡터를 포함하는 세포를 제공한다. 세포는 원핵 세포 또는 진핵 세포일 수 있다. 적절한 세포는 박테리아, 효모, 진균, 곤충 및 포유동물 세포를 포함하지만 이들로 한정되지 않는다.
일부 실시양태에서, 본 개시는 본 개시의 바이시스트론 벡터를 포함하는 포유동물 세포를 제공한다.
개시된 바이시스트론 벡터를 포함하는 숙주 세포는 단백질 발현 및 임의로 정제에 사용될 수 있다. 숙주로부터 발현된 단백질을 발현시키고 임의로 정제하는 방법은 당해 기술분야에서 표준적이다.
일부 실시양태에서, 본 개시의 벡터를 포함하는 숙주 세포는 본 개시의 효소 작제물에 의해 암호화되는 폴리펩티드를 생산하기 위해 사용될 수 있다. 일반적으로, 본 개시의 폴리펩티드의 생산은 효소 작제물을 포함하는 벡터로 숙주 세포를 형질감염시키고, 이어서 세포가 목적하는 폴리펩티드를 전사 및 번역하도록 세포를 배양하는 것을 포함한다. 이어서, 단리된 숙주 세포를 용해하여, 후속 정제를 위해 발현된 폴리펩티드를 추출할 수 있다.
In some embodiments, the host cell is a prokaryotic cell. Non-limiting examples of suitable prokaryotic cells include E. coli and other Enterobacteriaceae, Escherichia sp., Campylobacter sp., Wolinella sp., Desulfovibrio sp., Vibrio sp., Pseudomonas sp., Bacillus sp., Listeria sp., Staphylococcus sp., Streptococcus sp., Peptostreptococcus sp., Megasphaera sp., Pectinatus sp., Selenomonas sp., Zymophilus sp., Actinomyces sp., Arthrobacter sp., Frankia sp., Micromonospora sp., Nocardia sp., Propionibacterium sp., Streptomyces sp., Lactobacillus sp., Lactococcus sp., Leuconostoc sp., Pediococcus sp., Acetobacterium sp., Eubacterium sp., Heliobacterium sp., Heliospirillum sp., Sporomusa sp., Spiroplasma sp., Ureaplasma sp., Erysipelothrix sp., Corynebacterium sp., Enterococcus sp., Clostridium sp., Mycoplasma sp., Mycobacterium sp., Actinobacteria sp., Salmonella sp., Shigella sp., Moraxella sp., Helicobacter sp., Stenotrophomonas sp., Micrococcus sp., Neisseria sp., Bdellovibrio sp., Haemophilus sp., Klebsiella sp., Proteus mirabilis, Enterobacter cloacae, Serratia sp., Citrobacter sp., Proteus sp., Yersinia sp., Acinetobacter sp., Actinobacillus sp., Bordetella sp., Brucella sp., Capnocytophaga sp., Cardiobacterium sp., Eikenella sp., Francisella sp., Kingella sp., Pasteurella sp., Flavobacterium sp., Xanthomonas sp., Burkholderia sp., Aeromonas sp., Plesiomonas sp., Legionella sp., and alpha-proteobacteria such as Wolbachia sp., cyanobacteria, spirochaetes, green sulfur and green non-sulfur bacteria, fastidious Gram-negative bacilli, Enterobacteriaceae glucose-fermenting Gram-negative bacilli, non-glucose-fermenting Gram-negative bacilli, and glucose-fermenting, oxidase-positive Gram-negative bacilli. Bacterial host cells particularly useful for protein expression include Gram-negative bacteria, such as Escherichia coli, Pseudomonas fluorescens, Pseudomonas haloplanctis, Pseudomonas putida AC 10, Pseudomonas pseudoflava, Bartonella henselae, Pseudomonas syringae, Caulobacter crescentus, Zymomonas mobilis, Rhizobium meliloti, and Myxococcus xanthus, and Gram-positive bacteria, such as Bacillus subtilis, Corynebacterium, Streptococcus cremoris, Streptococcus lividans, and Streptomyces lividans. E. coli is one of the most widely used expression hosts. Accordingly, techniques for overexpression in E. coli are well developed and readily available to one of skill in the art.
In addition, Pseudomonas fluorescens is commonly used for high-level production of recombinant proteins (i.e., for the development of biotherapeutics and vaccines).
In some embodiments, the host cell is a yeast or fungal cell. Fungal host cells particularly useful for protein expression include Aspergillus oryzae, Aspergillus niger, Trichoderma reesei, Aspergillus nidulans, and Fusarium graminearum. Yeast host cells particularly useful for protein expression include Candida albicans, Candida maltosa, Hansenula polymorpha, Kluyveromyces fragilis, Kluyveromyces lactis, Pichia guilliermondii, Pichia pastoris, Saccharomyces cerevisiae, Schizosaccharomyces pombe, and Yarrowia lipolytica.
일부 실시양태에서, 숙주 세포는 곤충 세포이다. 비제한적 예는 스포도프테라 프루기페르다(Spodoptera frugiperda) 세포주(예: Sf9 또는 Sf21), 초파리 세포주, 또는 모기 세포주(예: 아에데스 알보픽투스(Aedes albopictus) 유래 세포주)를 포함한다.
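In some embodiments, the host cell is a mammalian cell. Mammalian host cells useful for protein expression include Chinese hamster ovary (CHO) cells, HeLa cells, human embryonic kidney 293 (HEK293) cells, baby hamster kidney (BHK) cells, monkey kidney cells (COS), human hepatocellular carcinoma cells (e.g., Hep G2), and cells from Bos primigenius and Mus musculus. In certain embodiments, the host cell is a CHO cell. Additionally, the mammalian host cell may be an established, commercially available cell line (e.g., from the American Type Culture Collection (ATCC), Manassas, VA). The host cell may be an immortalized cell. Alternatively, the host cell may be a primary cell.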
일부 실시양태에서, 숙주 세포는 높은 수준의 목적 단백질을 생산하도록 조작되었다.
개시의 방법
일부 실시양태에서, 본 개시는, 본원에 개시되어 있는 리소좀 축적 장애(LSD)를 앓고 있는 대상체를 치료하는 방법을 제공한다. 상기 방법은 본 명세서의 다른 곳에 개시된 바와 같이 바이시스트론 벡터에 의해 발현되는 리소좀 효소를 포함하는 약제학적 조성물을 대상체에게 투여하고, 이에 의해 리소좀 효소의 포스포릴화를 증가시키고 대상체를 치료하는 것을 포함한다.
일부 실시양태에서, 본 개시는, 이를 필요로 하는 대상체에서 리소좀 축적 장애(LSD)의 발생을 예방하는 방법을 제공한다. 상기 방법은 본 명세서의 다른 곳에 개시된 바와 같이 바이시스트론 벡터에 의해 발현되는 리소좀 효소를 포함하는 약제학적 조성물을 대상체에게 투여하고, 이에 의해 리소좀 효소의 포스포릴화를 증가시키고 대상체에서 LSD의 발생을 예방하는 것을 포함한다.
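In some embodiments, the lysosomal enzyme is involved in at least one lysosomal storage disorder (LSD) as listed in Table 1A, Table 1B, or Table 1C. In other embodiments, the lysosomal enzyme is at least one of the enzymes listed in Table 1A, Table 1B, or Table 1C.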
추가 실시양태에서, 투여는 장내, 비경구, 경구, 근육내(IM), 피하(SC), 정맥내(IV) 및 동맥내(IA)로 이루어진 그룹으로부터 선택된 투여 경로를 포함한다. 개시된 방법에 사용될 수 있는 추가 투여 경로는 본 명세서의 다른 곳에서 상세히 설명된다.
병용 요법
본 명세서에 기재된 바와 같은 LSD를 치료 또는 예방하기 위한 조성물 및 방법은 LSD의 치료에 유용한 적어도 하나의 추가 화합물과 조합되는 경우에 유용할 수 있다. 추가 화합물은 LSD의 증상을 치료, 예방 또는 감소시키는 것으로 공지되어 있는 상업적으로 입수가능한 화합물을 포함할 수 있다. 화합물은 당해 기술분야에 공지된 ERT일 수 있지만, 이들로 한정되지 않는다.
약제학적 조성물 및 제형
또한, 본 명세서에는, 본 개시의 바이시스트론 벡터에 의해 발현되는 리소좀 효소를 포함하는 약제학적 조성물이 제공된다.
이러한 약제학적 조성물은 대상체에게 투여하기에 적합한 형태이거나, 약제학적 조성물은 하나 이상의 약제학적으로 허용되는 담체, 하나 이상의 추가 성분, 또는 이들의 일부 조합을 추가로 포함할 수 있다. 약제학적 조성물의 다양한 성분은 당해 기술분야에 공지되어 있는 바와 같이 생리학적으로 허용되는 양이온 또는 음이온과 함께 생리학적으로 허용되는 염의 형태로 존재할 수 있다.
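In some embodiments of the disclosure, the pharmaceutical compositions useful for practicing the methods of the disclosure can be administered to deliver a dose of between 1 ng/kg/day and 100 mg/kg/day. In some embodiments of the disclosure, the pharmaceutical compositions useful for practicing the disclosure can be administered to deliver a dose of between 1 ng/kg/day and 500 mg/kg/day. The relative amounts of the active ingredient, the pharmaceutically acceptable carrier, and any additional ingredients in a pharmaceutical composition of the disclosure will vary depending on the identity, size, and condition of the subject treated, and further depending on the route by which the composition is administered. By way of example, the composition can comprise between 0.1% and 100% (w/w) active ingredient.
In some embodiments of the disclosure, the pharmaceutical compositions useful for practicing the methods of the disclosure can be administered to deliver a dose of between 1 ng/kg and 100 mg/kg. In some embodiments of the disclosure, the pharmaceutical compositions useful for practicing the disclosure can be administered to deliver a dose of between 1 ng/kg and 500 mg/kg. In some embodiments of the disclosure, the pharmaceutical composition is given daily, weekly, every two weeks, monthly, or yearly. The relative amounts of the active ingredient, the pharmaceutically acceptable carrier, and any additional ingredients in a pharmaceutical composition of the disclosure will vary depending on the identity, size, and condition of the subject treated, and further depending on the route by which the composition is administered. By way of example, the composition can comprise between 0.1% and 100% (w/w) active ingredient.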
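Pharmaceutical compositions useful in the methods of the disclosure can be suitably developed for inhalational, oral, rectal, vaginal, parenteral, topical, transdermal, pulmonary, intranasal, buccal, ophthalmic, intrathecal, intravenous, or another route of administration. Other contemplated formulations include projected nanoparticles, liposomal preparations, resealed erythrocytes containing the active ingredient, and immunologically based formulations. The route(s) of administration will be readily apparent to the skilled artisan and will depend on any number of factors, including the type and severity of the disease being treated and the type and age of the veterinary or human patient being treated.
The formulations of the pharmaceutical compositions described herein can be prepared by any method known or hereafter developed in the art of pharmacology. In general, such preparatory methods include the step of bringing the active ingredient into association with a carrier or one or more other accessory ingredients and then, if necessary or desirable, shaping or packaging the product into a desired single-dose or multi-dose unit. In some embodiments, the compositions disclosed herein can be formulated in a natural capsid, in a modified capsid, or as naked RNA, or can be encapsulated in a protective coat.
The amount of the active ingredient is generally equal to the dosage of the active ingredient that would be administered to a subject, or a convenient fraction of such a dosage, for example one-half or one-third of such a dosage. The unit dosage form can be for a single daily dose or for one of multiple daily doses (e.g., about 1 to 4 or more times per day). When multiple daily doses are used, the unit dosage form can be the same or different for each dose.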
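Although the descriptions of pharmaceutical compositions provided herein are principally directed to pharmaceutical compositions suitable for ethical administration to humans, the skilled artisan will understand that such compositions are generally suitable for administration to animals of all sorts. Modification of pharmaceutical compositions suitable for administration to humans in order to render them suitable for administration to various animals is well understood, and the ordinarily skilled veterinary pharmacologist can design and perform such modification as needed. Subjects to which administration of the pharmaceutical compositions of the disclosure is contemplated include, but are not limited to, humans and other primates, and mammals including commercially relevant mammals such as cattle, pigs, horses, sheep, cats, and dogs. In one embodiment, the subject is a human or a non-human mammal such as, but not limited to, an equine, an ovine, a bovine, a porcine, a canine, a feline, or a murine. In one embodiment, the subject is a human.
In one embodiment, the compositions are formulated using one or more pharmaceutically acceptable excipients or carriers. In some embodiments, the disclosure provides a pharmaceutical composition for treating a subject suffering from an LSD. In some embodiments, the disclosure provides a pharmaceutical composition comprising a lysosomal enzyme expressed by a bicistronic vector of the disclosure and a pharmaceutically acceptable carrier.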
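Useful pharmaceutically acceptable carriers include, but are not limited to, glycerol, water, saline, ethanol, and other pharmaceutically acceptable salt solutions such as phosphates and salts of organic acids. The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, a polyol (for example, glycerol, propylene glycol, and liquid polyethylene glycol), suitable mixtures thereof, and vegetable oils. Proper fluidity can be maintained, for example, by the use of a coating such as lecithin, by maintaining the required particle size in the case of a dispersion, and by the use of surfactants. Prevention of the action of microorganisms can be achieved with various antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In some embodiments, it is preferable to include isotonic agents in the composition, for example, sugars, sodium chloride, or polyalcohols such as mannitol and sorbitol. Prolonged absorption of injectable compositions can be brought about by including in the composition an agent that delays absorption, for example, aluminum monostearate or gelatin.
Formulations can be employed in admixture with conventional excipients, i.e., pharmaceutically acceptable organic or inorganic carrier substances suitable for oral, parenteral, nasal, intravenous, subcutaneous, enteral, or any other suitable mode of administration known in the art. The pharmaceutical preparations can be sterilized and, if desired, mixed with auxiliary agents, e.g., lubricants, preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic pressure, buffers, coloring, flavoring, and/or aromatic substances. They can also be combined, where desired, with other active agents, e.g., other analgesic agents.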
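The disclosed compositions can comprise a preservative at about 0.005% to 2.0% by total weight of the composition. The preservative is used to prevent spoilage upon exposure to contaminants in the environment. Examples of preservatives useful in accordance with the disclosure include, but are not limited to, those selected from the group consisting of benzyl alcohol, sorbic acid, parabens, imidurea, and combinations thereof. In some embodiments, the preservative is a combination of about 0.5% to 2.0% benzyl alcohol and 0.05% to 0.5% sorbic acid.
The composition can include an antioxidant and a chelating agent that inhibit degradation of the compound. Preferred antioxidants for some compounds are BHT, BHA, alpha-tocopherol, and ascorbic acid in the preferred range of about 0.01% to 0.3%, and more preferably BHT in the range of 0.03% to 0.1%, by total weight of the composition. Preferably, the chelating agent is present in an amount of 0.01% to 0.5% by total weight of the composition. Particularly preferred chelating agents include edetate salts (e.g., disodium edetate) and citric acid in the range of about 0.01% to 0.20%, and more preferably 0.02% to 0.10%, by total weight of the composition. The chelating agent is useful for chelating metal ions in the composition that can be detrimental to the shelf life of the formulation. In some embodiments, BHT and disodium edetate are the antioxidant and chelating agent, respectively, for some compounds; other suitable and equivalent antioxidants and chelating agents can be substituted, as would be known to those skilled in the art.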
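The weight-percent ranges above translate directly into excipient masses for a given batch. The following is a minimal sketch of that arithmetic; the percentage ranges are taken from the text, while the function name, the dictionary layout, and the 1000 g batch size are illustrative assumptions rather than anything specified in the disclosure.

```python
# Weight-percent ranges (by total weight of the composition) quoted in the text.
# The dictionary layout and the batch size below are illustrative assumptions.
EXCIPIENT_RANGES_PCT = {
    "benzyl alcohol": (0.5, 2.0),
    "sorbic acid": (0.05, 0.5),
    "BHT": (0.03, 0.1),
    "disodium edetate": (0.02, 0.10),
}


def mass_range_g(batch_mass_g: float, pct_range: tuple[float, float]) -> tuple[float, float]:
    """Convert a (low, high) weight-percent range into grams for one batch."""
    if batch_mass_g <= 0:
        raise ValueError("batch mass must be positive")
    low_pct, high_pct = pct_range
    return (batch_mass_g * low_pct / 100.0, batch_mass_g * high_pct / 100.0)


# Hypothetical 1000 g batch.
batch_masses = {
    name: mass_range_g(1000.0, pct) for name, pct in EXCIPIENT_RANGES_PCT.items()
}
for name, (low_g, high_g) in batch_masses.items():
    print(f"{name}: {low_g:.2f} g to {high_g:.2f} g")
```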
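Administration/Dosing
The regimen of administration can affect what constitutes an effective amount. For example, the therapeutic formulations can be administered to the patient before or after a surgical intervention related to the lysosomal storage disorder (LSD), or shortly after the patient is diagnosed with an LSD. Further, several divided dosages, as well as staggered dosages, can be administered daily or sequentially, or the dose can be continuously infused or given as a bolus injection. Further, the dosages of the therapeutic formulations can be proportionally increased or decreased as indicated by the exigencies of the therapeutic or prophylactic situation.
Administration of the compositions of the disclosure to a patient, preferably a mammal, more preferably a human, can be carried out using known procedures, at dosages and for periods of time effective to treat a lysosomal storage disorder (LSD) in the subject. The effective amount of the therapeutic compound necessary to achieve a therapeutic effect can vary according to factors such as the activity of the particular compound employed; the time of administration; the rate of excretion of the compound; the duration of the treatment; other drugs, compounds, or materials used in combination with the compound; the state of the disease or disorder; the age, sex, weight, condition, general health, and prior medical history of the patient being treated; and like factors well known in the medical arts. Dosage regimens can be adjusted to provide the optimum therapeutic response. For example, several divided doses can be administered daily, or the dose can be proportionally reduced as indicated by the exigencies of the therapeutic situation. A non-limiting example of an effective dose range for a therapeutic compound of the disclosure is from about 0.01 to 50 mg/kg of body weight per day.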
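As a worked illustration of the non-limiting range quoted above, a per-kilogram dose range can be converted into a total daily dose range for one subject. This is only an arithmetic sketch: the 0.01 and 50 mg/kg/day bounds come from the text, whereas the function name and the 70 kg body weight are assumptions chosen for the example and carry no clinical meaning.

```python
# Bounds of the non-limiting effective dose range quoted in the text.
LOW_MG_PER_KG_DAY = 0.01
HIGH_MG_PER_KG_DAY = 50.0


def daily_dose_range_mg(body_weight_kg: float) -> tuple[float, float]:
    """Return the (low, high) total daily dose in mg for a given body weight."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    return (
        LOW_MG_PER_KG_DAY * body_weight_kg,
        HIGH_MG_PER_KG_DAY * body_weight_kg,
    )


# Hypothetical 70 kg subject (an assumption, not a value from the disclosure).
low_mg, high_mg = daily_dose_range_mg(70.0)
print(f"{low_mg:.2f} mg/day to {high_mg:.0f} mg/day")
```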
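The compound can be administered to a subject as frequently as several times daily, or less frequently, such as once a day, once a week, once every two weeks, or once a month, or even less frequently, such as once every several months or once every several years. It is understood that the amount of compound dosed per day can be administered, in non-limiting examples, every day, every other day, every 2 days, every 3 days, every 4 days, or every 5 days. For example, with every-other-day administration, a 5 mg per day dose is initiated on Monday, followed by a 5 mg per day dose on Wednesday and a 5 mg per day dose on Friday. The frequency of dosing will be readily apparent to the skilled artisan and will depend on any number of factors, such as, but not limited to, the type and severity of the disease being treated and the type and age of the animal. Actual dosage levels of the active ingredient in the pharmaceutical compositions of the disclosure can be varied to obtain an amount of the active ingredient that is effective to achieve the desired therapeutic response for a particular patient, composition, and mode of administration, without being toxic to the patient. A medical professional having ordinary skill in the art, e.g., a physician or veterinarian, can readily determine and prescribe the effective amount of the pharmaceutical composition required. For example, the physician or veterinarian can start doses of the compounds of the disclosure employed in the pharmaceutical composition at levels lower than those required to achieve the desired therapeutic effect and gradually increase the dosage until the desired effect is achieved.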
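The every-other-day example in the preceding paragraph (a dose on Monday, Wednesday, and Friday) can be generated programmatically. The sketch below assumes a start date of 2024-01-01, which falls on a Monday; the function name and the date are illustrative choices, and only the 5 mg amount and the two-day spacing come from the text.

```python
from datetime import date, timedelta


def dosing_dates(start: date, interval_days: int, n_doses: int) -> list[date]:
    """Return n_doses administration dates spaced interval_days apart."""
    if interval_days < 1 or n_doses < 1:
        raise ValueError("interval_days and n_doses must be at least 1")
    return [start + timedelta(days=interval_days * i) for i in range(n_doses)]


# Every-other-day dosing starting on a Monday (2024-01-01 is an assumed date).
schedule = dosing_dates(date(2024, 1, 1), interval_days=2, n_doses=3)
for day in schedule:
    print(day.isoformat(), "5 mg")
```

Changing `interval_days` to 3, 4, or 5 gives the other non-limiting spacings listed in the text.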
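In some embodiments, it is especially advantageous to formulate the compound in dosage unit form for ease of administration and uniformity of dosage. Dosage unit form as used herein refers to physically discrete units suited as unitary dosages for the patients to be treated; each unit contains a predetermined quantity of therapeutic compound calculated to produce the desired therapeutic effect in association with the required pharmaceutical vehicle. The dosage unit forms of the disclosure are dictated by, and directly dependent on, (a) the unique characteristics of the therapeutic compound and the particular therapeutic effect to be achieved, and (b) the limitations inherent in the art of compounding/formulating such a therapeutic compound for the treatment of an LSD.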
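Routes of Administration
One skilled in the art will recognize that although more than one route can be used for administration, a particular route can provide a more immediate and more effective response than another route.
Routes of administration of the disclosed compositions include inhalational, oral, nasal, rectal, parenteral, sublingual, transdermal, transmucosal (e.g., sublingual, lingual, (trans)buccal, (trans)urethral, vaginal (e.g., trans- and perivaginal), (intra)nasal, and (trans)rectal), intravesical, intrapulmonary, intraduodenal, intragastric, intrathecal, intracisterna magna (ICM), intraspinal, intraventricular, intracerebroventricular, subcutaneous, intramuscular, intradermal, intra-arterial, intravenous, intrabronchial, and topical administration. Suitable compositions and dosage forms include, for example, tablets, capsules, caplets, pills, gel caps, troches, dispersions, suspensions, solutions, syrups, granules, beads, transdermal patches, gels, powders, pellets, magmas, lozenges, creams, pastes, plasters, lotions, discs, suppositories, liquid sprays for nasal or oral administration, dry powder or aerosolized formulations for inhalation, compositions and formulations for intravesical administration, and the like. It should be understood that the formulations and compositions useful in the disclosure are not limited to the particular formulations and compositions described herein. In one embodiment, the treatment of an LSD comprises a route of administration selected from the group consisting of inhalational, oral, rectal, vaginal, parenteral, topical, transdermal, pulmonary, intranasal, buccal, ophthalmic, intrahepatic arterial, intrapleural, intrathecal, intratumoral, intravenous, and combinations thereof.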
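Gene Therapy Administration
One skilled in the art recognizes that different delivery methods can be used to administer a vector into a cell. Examples include: (1) methods that use physical means, such as electroporation (electricity), a gene gun (physical force), or application of large volumes of liquid (pressure); and (2) methods in which the vector is complexed with another entity, such as a liposome, an aggregated protein, or a transporter molecule.
Furthermore, the actual dose and schedule can vary depending on whether the compositions are administered in combination with other pharmaceutical compositions, or depending on interindividual differences in pharmacokinetics, drug disposition, and metabolism. Similarly, amounts can vary in in vitro applications depending on the particular cell line used (e.g., based on the number of vector receptors present on the cell surface, or on the ability of the particular vector used for gene transfer to replicate in that cell line). Furthermore, the amount of vector to be added per cell can vary with the length and stability of the therapeutic gene inserted in the vector, as well as with the nature of the sequence; it is a parameter that must be determined empirically and can be altered by factors not inherent to the methods of the disclosure (for example, the cost associated with synthesis). One skilled in the art can readily make any necessary adjustments according to the exigencies of the particular situation.
Cells containing the therapeutic agent can also contain a suicide gene, i.e., a gene that encodes a product that can be used to destroy the cell. In many gene therapy situations, it is desirable to be able to express a gene for therapeutic purposes in a host cell while also retaining the capacity to destroy the host cell at will. The therapeutic agent can be linked to a suicide gene whose expression is not activated in the absence of an activator compound. When death of a cell into which both the agent and the suicide gene have been introduced is desired, the activator compound is administered to the cell, thereby activating expression of the suicide gene and killing the cell. Examples of suicide gene/prodrug combinations that can be used are herpes simplex virus thymidine kinase (HSV-tk) and ganciclovir or acyclovir; oxidoreductase and cycloheximide; cytosine deaminase and 5-fluorocytosine; thymidine kinase thymidylate kinase (Tdk::Tmk) and AZT; and deoxycytidine kinase and cytosine arabinoside.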
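Treatment
The disclosure includes methods of treating a lysosomal enzyme deficiency in a subject diagnosed with an LSD or at risk of developing an LSD. The methods improve phosphorylation of the lysosomal enzyme, thereby treating the subject or preventing the development of the LSD in the subject. In addition, the methods improve the patient's quality of life. In one embodiment, a method of the disclosure comprises administering to the subject a composition comprising a polynucleotide encoding a lysosomal enzyme and a polynucleotide encoding a GlcNAc-1 PTase.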
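Nucleic acid sequences:
pLL01 bicistronic vector sequence (SEQ ID NO: 1) (CMV promoter: italic and underlined. IRES: bold and italic. S1-S3: bold and underlined.)
CMV sequence (SEQ ID NO: 2)
IRES sequence (SEQ ID NO: 3)
Modified GlcNAc-1 phosphotransferase (GlcNAc-1 PTase), S1-S3 sequence (SEQ ID NO: 4)
hGBA wild-type sequence (SEQ ID NO: 5):
hGBA natural variant sequence (SEQ ID NO: 162): hGBA (K360N) sequence. Nucleotides at the mutation site are bold and underlined.
hGBA engineered variant sequence (SEQ ID NO: 163): hGBA (C165S) sequence. Nucleotides at the mutation site are bold and underlined.
mGALC sequence (SEQ ID NO: 6):
hGLA sequence (SEQ ID NO: 7):
hNAGLU sequence (SEQ ID NO: 8):
hGAA sequence (SEQ ID NO: 9):
hGAA (SEQ ID NO: 164; UniProt accession number P10253-1)
hLAMAN sequence (SEQ ID NO: 10):
hGALC sequence (SEQ ID NO: 23; GenBank accession number BC036518.2):
CEF promoter sequence (SEQ ID NO: 161):
Primers used in the study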
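Examples
The disclosure is now described with reference to the following examples. These examples are provided for the purpose of illustration only, and the disclosure should in no way be construed as being limited to these examples, but rather should be construed to encompass any and all variations that become evident as a result of the teaching provided herein.
Without further description, it is believed that one of ordinary skill in the art can, using the preceding description and the following illustrative examples, make and use the compounds of the disclosure and practice the claimed methods. The following working examples therefore specifically point out preferred embodiments of the disclosure and are not to be construed as limiting the remainder of the disclosure in any way.
The materials and methods used in these experiments are now described.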
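Cell lines: HEK293T cells were maintained in DMEM (Corning) containing 0.11 g/L sodium pyruvate and 4.5 g/L glucose, supplemented with 10% (vol/vol) FBS (Gibco), 100,000 U/L penicillin, 100 mg/L streptomycin (Invitrogen), and 2 mM L-glutamine (Invitrogen). Expi293 cells (Invitrogen) were grown in suspension in Expi293 expression medium (Invitrogen).
DNA constructs: The CMV-S1S3 plasmid was provided by Prof. Stuart Kornfeld of Washington University School of Medicine in St. Louis. The bicistronic vector pLL01 was generated in two steps. In the first step, the 486 bp IRES sequence was amplified from a PTase α/β and γ bicistronic construct (provided by Prof. Stuart Kornfeld), and the S1-S3 gene fragment was obtained by PCR from the plasmid CMV-S1S3. In the second step, these two fragments were joined by overlap extension PCR to form the IRES-S1S3 fragment. The IRES-S1S3 fragment was digested with the restriction enzymes HpaI and PmeI (NEB) and ligated into the pcDNA3.1(+) vector. To generate the bicistronic plasmids pLL11, pLL21, pLL31, pLL41, pLL51, and pLL61, the hGBA, hGAA, mGALC, hNAGLU, hGLA, and hLAMAN genes were amplified with their specific primers (Table 1) and inserted into the bicistronic vector (pLL01).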
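Cloning steps such as these depend on where the relevant restriction sites fall in the vector. As a minimal sketch, the snippet below scans a 100-nucleotide excerpt of SEQ ID NO: 1 (nucleotides 891-990 as printed in the sequence listing, covering the multiple cloning site) for the recognition sequences of the enzymes named in this document. The recognition sequences are the standard ones for these enzymes; the function and variable names are illustrative, and the scan says nothing about uniqueness across the full 7709-nucleotide vector, which would require the complete sequence.

```python
# Standard recognition sequences for the enzymes named in the text.
# All five are palindromic, so scanning one strand is sufficient.
RECOGNITION_SITES = {
    "NheI": "GCTAGC",
    "BamHI": "GGATCC",
    "NotI": "GCGGCCGC",
    "HpaI": "GTTAAC",
    "PmeI": "GTTTAAAC",
}

# Nucleotides 891-990 of SEQ ID NO: 1 as printed in the listing, spaces removed.
MCS_EXCERPT = (
    "gctggctagc" "gtttaaactt" "aagcttggta" "ccgagctcgg" "atccactagt"
    "ccagtgtggt" "ggaattctgc" "agatatccag" "cacagtggcg" "gccgctgatt"
)


def find_sites(sequence: str, sites: dict[str, str]) -> dict[str, list[int]]:
    """Return the 0-based start positions of every occurrence of each site."""
    seq = sequence.upper()
    hits: dict[str, list[int]] = {}
    for enzyme, motif in sites.items():
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start)
            start = seq.find(motif, start + 1)  # step by one to allow overlaps
        hits[enzyme] = positions
    return hits


mcs_hits = find_sites(MCS_EXCERPT, RECOGNITION_SITES)
for enzyme, positions in mcs_hits.items():
    print(enzyme, positions)
```

Positions are offsets within the excerpt; adding 891 converts them to 1-based coordinates in SEQ ID NO: 1.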
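Phosphotransferase assay: HEK293T or Expi293 cells were harvested and lysed in lysis buffer (25 mM Tris-Cl, pH 7.2, 150 mM NaCl, 1% Triton X-100, and protease inhibitor cocktail). 5 μL of cell extract was incubated at 37 °C for 0.5 hour in a final volume of 50 μL of phosphotransferase assay buffer (50 mM Tris-Cl, pH 7.4, 10 mM MgCl2, 10 mM MnCl2, 2 mg/mL BSA, 2 mM ATP) in the presence of 75 mM UDP-GlcNAc, 1 mCi UDP-[3H]GlcNAc, and 100 mM α-methyl mannoside (αMM). The reaction was stopped by adding 1 mL of 2 mM EDTA, pH 8.0, and the sample was subjected to QAE-Sephadex chromatography.
Enzyme production: Expi293 cells were transfected with the empty vector, a bicistronic plasmid, or the corresponding single-expression plasmid. Media were harvested after 2 to 3 days. For production of GBA, 30 μM isofagomine was included during cell culture to stabilize the secreted enzyme; the conditioned medium was then dialyzed overnight at 4 °C against PBS buffer to remove the isofagomine before the enzyme activity assay.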
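Enzyme activity assays: The following substrates were used for enzyme activity assays: 4-methylumbelliferyl β-D-glucopyranoside (GCase/GBA enzyme substrate, M3633, Sigma), 4-methylumbelliferyl α-D-glucopyranoside (GAA enzyme substrate, M9766, Sigma), 6-hexadecanoylamino-4-methylumbelliferyl β-D-galactopyranoside (GALC enzyme substrate, EH05989, Carbosynth), 4-methylumbelliferyl-N-acetyl-α-D-glucosaminide (NAGLU enzyme substrate, 474500, Millipore), 4-methylumbelliferyl α-D-galactopyranoside (GLA enzyme substrate, M7633, Sigma), and 4-methylumbelliferyl α-D-mannopyranoside (LAMAN enzyme substrate, M3657, Sigma). GBA enzyme activity was assayed in citrate-phosphate buffer, pH 5.0, with 0.25% TX-100, 0.25% sodium taurocholate, and 1 mM GBA substrate. GAA enzyme activity was assayed in citrate buffer, pH 4.0, with 0.25% TX-100 and 1 mM GAA substrate. GALC enzyme activity was assayed in citrate-phosphate buffer, pH 4.0, with 0.25% TX-100, 0.6% sodium taurocholate, 0.2% oleic acid, and 0.1 mM GALC substrate. NAGLU enzyme activity was assayed in citrate buffer, pH 4.0, with 0.25% TX-100 and 1 mM NAGLU substrate. GLA enzyme activity was assayed in citrate buffer, pH 4.5, with 0.25% TX-100 and 1 mM GLA substrate. LAMAN enzyme activity was assayed in citrate buffer, pH 4.0, with 0.25% TX-100 and 1 mM LAMAN substrate.
CI-MPR binding assay: CI-MPR binding was performed in high-binding 96-well plates (Costar 3601). Plates were coated with 50 μL of purified bovine CI-MPR at 10 μg/mL for 1 hour at room temperature (RT) and blocked with 2% BSA for an additional hour at RT. Aliquots of conditioned media from transfected Expi293 cells were diluted in Hepes buffer (40 mM Hepes, pH 6.8, 150 mM NaCl, 0.05% Tween-20) and incubated with the immobilized CI-MPR for 1 hour at RT to allow binding of phosphorylated lysosomal enzymes. After three washes, lysosomal enzyme activity was assayed by the 4-methylumbelliferone method.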
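Example 1: Generation of an empty bicistronic vector containing the phosphotransferase (S1-S3) for lysosomal enzyme expression
GlcNAc-1-phosphotransferase (GlcNAc-1-PTase, also referred to as PTase), an α2β2γ2 hexamer encoded by two genes (GNPTAB and GNPTG), is responsible for generating the phosphorylated oligosaccharides required for lysosomal targeting via the cation-independent mannose 6-phosphate receptor (CI-MPR). Phosphorylation of expressed lysosomal enzymes is markedly increased by co-transfection with an engineered truncated PTase (S1-S3). This study uses the S1-S3 construct to produce phosphorylated lysosomal enzymes for the treatment of lysosomal storage diseases (LSDs, including but not limited to Gaucher disease, Pompe disease, and α-mannosidosis).
To produce highly phosphorylated therapeutic lysosomal enzymes for enzyme replacement therapy (ERT), the therapeutic lysosomal enzyme and S1-S3 are co-expressed in the same cell. When S1-S3 and the lysosomal enzyme are expressed from separate vectors, a stable cell line expressing both must be generated in two steps: (a) generating a stable cell line expressing PTase S1-S3; and (b) generating, on the basis of the S1-S3 stable cell line, a second cell line that additionally expresses the therapeutic lysosomal enzyme. To avoid this time-consuming two-step procedure, disclosed herein is a bicistronic vector that incorporates an internal ribosome entry site (IRES), which allows two separate genes to be expressed under a single promoter.
Bicistronic expression can also be applied to gene therapy for lysosomal storage diseases (LSDs). The empty bicistronic vector, pLL01, contains the 486 bp IRES sequence and the S1-S3 gene under the cytomegalovirus (CMV) promoter of the pcDNA3.1(+) plasmid vector (FIG. 1B). pLL01 has three unique restriction enzyme cleavage sites in its multiple cloning site, located upstream of the IRES sequence, which allow insertion of a therapeutic lysosomal enzyme gene. To examine S1-S3 expression from pLL01, HEK293 cells were transfected with equal amounts of pcDNA3.1(+), CMV-S1S3 (FIG. 1A), or pLL01 plasmid. After 48 hours, cells were harvested and lysed in lysis buffer (25 mM Tris buffer, pH 7.4, 150 mM NaCl, 1% TX-100 with protease inhibitor cocktail). Phosphotransferase activity assays were performed on whole-cell extracts expressing pcDNA3.1(+), CMV-S1S3, or pLL01 to determine S1-S3 expression. As shown in FIG. 1C, relative to the CMV-S1S3 sample, the phosphotransferase activity of the pcDNA3.1(+) sample was negligible, whereas the bicistronic vector pLL01 retained 9.3% of the activity.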
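Example 2: Bicistronic expression enhances phosphorylation of therapeutic lysosomal enzymes
Because expression of S1-S3 from the bicistronic vector is low (9.3%; see Example 1), this study was designed to determine whether the low S1-S3 activity is sufficient to phosphorylate lysosomal enzymes. Six different lysosomal enzymes were tested in the bicistronic vector: acid β-glucosidase (GBA), acid α-glucosidase (GAA), galactosylceramidase (GALC), α-N-acetylglucosaminidase (NAGLU), α-galactosidase (GLA), and acid α-mannosidase (LAMAN).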
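Acid β-glucosidase (GBA): GBA is the lysosomal enzyme that degrades its substrate, glucocerebroside, in the lysosome. Deficiency of GBA in the lysosome causes Gaucher disease, the most common lysosomal storage disease (LSD). To test the phosphorylation of GBA in the bicistronic vector disclosed herein, the GBA bicistronic plasmid pLL11 was generated by inserting the 1611 bp human GBA cDNA sequence, including the stop codon, into the empty bicistronic vector pLL01 via the NheI and NotI restriction sites (FIG. 2A). Equal amounts of pLL11, or of GBA plasmid with or without CMV-S1S3 plasmid, were transfected into Expi293 cells. After 48 hours, cells and conditioned media were harvested separately. Surprisingly, GBA activity in the pLL11 conditioned medium was 240 nmol/hour/mL, more than twice that of media produced by GBA alone (96 nmol/hour/mL) or by GBA and S1-S3 co-transfection (90 nmol/hour/mL; FIG. 2B). In addition to GBA expression, expression of S1-S3 was quantified by phosphotransferase assay using cell extracts. Similar to the bicistronic vector pLL01 lacking GBA, the pLL11 sample showed 7.5% of the phosphotransferase expression of the GBA and S1-S3 co-transfection sample (FIG. 2C).
Because S1-S3 expression was reduced in the bicistronic vector, the consequence of the low phosphotransferase expression for GBA phosphorylation was determined. To this end, conditioned media from pLL11, GBA alone, and GBA co-transfected with S1-S3 were harvested, and the degree of phosphorylation was quantified by cation-independent mannose 6-phosphate receptor (CI-MPR) binding experiments. GBA produced from the bicistronic vector disclosed herein showed much higher binding to the CI-MPR at the plateau (FIG. 3A). Nevertheless, when the percentage of receptor binding was calculated from points in the linear range, 44% of the GBA produced from the disclosed bicistronic vector bound the CI-MPR, essentially the same as GBA produced by co-transfection with S1-S3 (43%) and about 10-fold higher than GBA produced with the endogenous phosphotransferase (4.5%; FIG. 3B).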
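The text does not state how the percentage of receptor binding was computed from the linear-range points. One plausible approach, sketched here purely as an assumption, is to fit bound activity against loaded activity with a least-squares line through the origin and read the slope as the bound fraction. The function name and the data points are hypothetical; the data were chosen only so that the result reproduces the 44% figure reported for pLL11.

```python
def fraction_bound(loaded_activity: list[float], bound_activity: list[float]) -> float:
    """Least-squares slope through the origin of bound versus loaded activity."""
    if len(loaded_activity) != len(bound_activity) or not loaded_activity:
        raise ValueError("need two equal-length, non-empty series")
    sum_xx = sum(x * x for x in loaded_activity)
    if sum_xx == 0:
        raise ValueError("loaded activity must not be all zeros")
    sum_xy = sum(x * y for x, y in zip(loaded_activity, bound_activity))
    return sum_xy / sum_xx


# Hypothetical linear-range points (activity units are arbitrary here).
loaded = [1.0, 2.0, 4.0]
bound = [0.44, 0.88, 1.76]
percent_bound = 100.0 * fraction_bound(loaded, bound)
print(f"{percent_bound:.1f}% bound")
```

Restricting the fit to the linear range matters because points near the plateau reflect receptor saturation rather than the phosphorylated fraction of the enzyme.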
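Titration has been widely used in the art to determine the concentration of an identified analyte. Here, the concentration of CI-MPR in the binding experiment was titrated. Serially diluted CI-MPR was immobilized on a 96-well plate, and equal amounts of GBA enzyme produced with the bicistronic vector disclosed herein or with the endogenous phosphotransferase (PTase) were added to the plate for the receptor binding assay. As shown in FIG. 3C, GBA binding from the pLL11 sample depended on the CI-MPR concentration and saturated when the receptor concentration reached 15 μg/mL, whereas binding of GBA produced with the endogenous PTase remained low. These data indicate that the disclosed bicistronic vector greatly increases the phosphorylation level of the GBA enzyme.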
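Acid α-glucosidase (GAA): The lysosomal enzyme GAA is essential for the degradation of glycogen to glucose in the lysosome. Mutations in the GAA gene are associated with the lysosomal storage disorder Pompe disease. To generate the GAA bicistronic plasmid pLL21, a 2859 base pair (bp) human GAA gene fragment containing the stop codon was amplified and, after digestion with the restriction enzymes NheI and NotI, inserted into the bicistronic vector pLL01 (FIG. 4A). Sequence-verified pLL21 and GAA plasmids were transfected into Expi293 cells. After 48 hours, conditioned media were collected for GAA activity and CI-MPR binding experiments. Similar to GBA, GAA activity in the pLL21 conditioned medium was higher than with GAA single expression (FIG. 4B). Binding of the pLL21 conditioned medium was faster and higher than that of the GAA single-expression conditioned medium (FIG. 4C). During a 1-hour incubation, 72.5% of the GAA from the pLL21 conditioned medium bound the CI-MPR, whereas only 21.5% of the GAA from GAA single expression bound (FIG. 4D). These data indicate that the bicistronic expression platform disclosed herein can greatly increase the phosphorylation of the GAA enzyme.
Galactosylceramidase (GALC): In the lysosome, the GALC enzyme is responsible for the catabolism of galactosylceramide by removing galactose from ceramide derivatives. Genetic deficiency of the GALC enzyme causes Krabbe disease. To test the GALC enzyme in the bicistronic expression system disclosed herein, the bicistronic plasmid pLL31 was generated by inserting the mouse GALC gene into the vector pLL01 (FIG. 5A). GALC enzyme activity in conditioned medium harvested from pLL31-transfected Expi293 cells was similar to that of GALC-alone medium (0.86 nmol/μL/h versus 0.62 nmol/μL/h; FIG. 5B). CI-MPR binding results showed that bicistronic expression of GALC with S1-S3 increased CI-MPR binding from 28.4% to 56.8% (FIGs. 5C and 5D).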
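α-N-acetylglucosaminidase (NAGLU): The NAGLU gene encodes an enzyme that degrades heparan sulfate in the lysosome. Defects in the NAGLU enzyme result in Sanfilippo syndrome type B, also known as mucopolysaccharidosis (MPS) IIIB. NAGLU enzyme produced in cell lines for ERT carries no phosphate on its mannose residues, and a clinical trial of this ERT failed earlier this year. To express NAGLU in the bicistronic vector disclosed herein, the same procedure as above was used. The 2229 bp human NAGLU gene was inserted into the pLL01 bicistronic vector (FIG. 6A), and the NAGLU bicistronic plasmid pLL41 and the NAGLU single-expression plasmid were transfected into Expi293 cells. Using conditioned media, NAGLU activity of the pLL41 sample was found to be higher than that of the NAGLU single-expression sample (FIG. 6B). In terms of CI-MPR binding, almost no NAGLU binding was detected from the NAGLU single-expression sample, even though a large amount of enzyme was loaded (up to 9 nmol/hour; FIGs. 6C-6D). In contrast, up to 25% of the NAGLU produced by the bicistronic vector bound the CI-MPR (FIGs. 6C-6D).
α-galactosidase (GLA): The lysosomal enzyme GLA hydrolyzes melibiose into galactose and glucose and can metabolize globotriaosylceramide (GL-3). Deficiency of GLA enzyme activity causes an X-linked disorder, Fabry disease. To generate the GLA bicistronic plasmid pLL51, the human GLA gene fragment and the bicistronic vector pLL01 were digested with BamHI and NotI and ligated with T4 ligase (FIG. 7A). The correct pLL51 clone and the GLA single plasmid were transfected into and expressed in Expi293 cells. GLA activity assays and CI-MPR binding experiments were performed using conditioned media. As shown in FIG. 7B, GLA activity was similar in the GLA-alone and pLL51 conditioned media. Titration curves with these two media indicate that the pLL51 sample bound the CI-MPR faster than the GLA sample (FIG. 7C). The overall binding percentage for the pLL51 sample was 62.1%, nearly twice that of the GLA sample (33.1%; FIG. 7D).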
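Acid α-mannosidase (LAMAN): The genetic disease α-mannosidosis is caused by defects in the lysosomal enzyme LAMAN, which is encoded by the MAN2B1 gene. Because the human LAMAN enzyme is barely phosphorylated, hLAMAN is a good candidate for the disclosed bicistronic expression. The 3033 bp human LAMAN gene was inserted into the pLL01 bicistronic vector (FIG. 8A) and expressed in Expi293 cells for subsequent study. LAMAN activity in conditioned medium from the LAMAN bicistronic plasmid pLL61 was slightly lower than with LAMAN single expression (FIG. 8B). When binding to the CI-MPR was titrated, almost no binding of LAMAN enzyme to the CI-MPR was detected with the LAMAN single-expression sample, whereas a large amount of LAMAN enzyme from the pLL61 sample interacted with the CI-MPR (FIG. 8C). Binding of LAMAN to the CI-MPR increased from 1.6% to 75.2% with S1-S3 bicistronic expression (FIG. 8D).
The six enzymes above can be divided into two groups based on their basal phosphorylation levels. Group 1 comprises the poorly phosphorylated lysosomal enzymes (GBA, NAGLU, and LAMAN), which are poor substrates for wild-type PTase during enzyme production. Group 2 comprises the highly phosphorylated enzymes (GAA, GALC, and GLA), which are regarded as good substrates for wild-type PTase and receive a substantial amount of phosphate. Bicistronic expression with S1-S3 as disclosed herein significantly increased the phosphorylation of all six lysosomal enzymes regardless of their basal phosphorylation level. In view of these findings, the bicistronic vector pLL01 disclosed herein can be used to produce highly phosphorylated lysosomal enzymes for treating all lysosomal storage diseases. Clearly, the bicistronic vector disclosed herein offers great advantages for ERT and gene therapy in the treatment of lysosomal storage disorders.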
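The CI-MPR binding percentages reported in Example 2 can be collected in one place to compare the size of the effect across enzymes. The sketch below uses only the percentages stated in the text; the dictionary layout and function name are illustrative. NAGLU is left out of the fold calculation because its single-expression binding was reported as almost undetectable (with up to 25% binding under bicistronic expression), so no baseline value is available.

```python
# CI-MPR binding (%) reported in Example 2:
# (single expression with endogenous PTase, bicistronic expression with S1-S3).
REPORTED_BINDING_PCT = {
    "GBA": (4.5, 44.0),
    "GAA": (21.5, 72.5),
    "GALC": (28.4, 56.8),
    "GLA": (33.1, 62.1),
    "LAMAN": (1.6, 75.2),
}


def fold_increase(baseline_pct: float, bicistronic_pct: float) -> float:
    """Ratio of bicistronic to baseline CI-MPR binding."""
    if baseline_pct <= 0:
        raise ValueError("baseline must be positive to compute a fold change")
    return bicistronic_pct / baseline_pct


fold_by_enzyme = {
    name: fold_increase(baseline, bicistronic)
    for name, (baseline, bicistronic) in REPORTED_BINDING_PCT.items()
}
for name, fold in sorted(fold_by_enzyme.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: {fold:.1f}-fold")
```

The ordering makes the grouping in the text visible: the poorly phosphorylated enzymes (GBA, LAMAN) show the largest fold increases, while the enzymes that are already good PTase substrates (GAA, GALC, GLA) roughly double or triple.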
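Example 3: Treatment of Gaucher disease
Enzyme replacement therapy (ERT)
An expression vector comprising a sequence encoding GBA and a sequence encoding S1-S3 PTase can be used to treat or prevent a sign or symptom of Gaucher disease. The following study demonstrates that, in an art-recognized standard mouse model of Gaucher disease, expression of (GCase/GBA)-S1-S3 leads to expression of (GCase/GBA)-S1-S3, transport of GBA-S1-S3 from the circulating bloodstream into cells, and increased GBA activity in cells that take up the GBA-S1-S3 complex. Even a modest increase in (GCase/GBA) activity resulting from expression and uptake of the GBA-S1-S3 complex leads to significant functional recovery in the mouse model.
Expression of GBA using a bicistronic expression vector with S1-S3 PTase produces a recombinant protein with higher levels of phosphorylated oligosaccharides that can be used to treat or prevent a sign or symptom of Gaucher disease. The following study demonstrates that, in an art-recognized standard mouse model of Gaucher disease, ERT using the recombinant protein expressed with the bicistronic vector with S1-S3 PTase leads to a longer half-life, increased tissue uptake, greater substrate reduction, and better correction of histopathology compared with the current standard of care.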
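FIGs. 16A-16B are a pair of graphs depicting the elevated glucosylceramide levels observed in the liver, lung, and spleen of 20-week-old Gaucher D409V/null mice. Accumulation of glucocerebroside (GC), the natural substrate of GBA, was measured in tissue homogenates. Accumulation of GC in the lung is a statistically and therapeutically meaningful finding, and it is a known unmet need of the current standard of care. Glucosylceramide was extracted from 20 μL aliquots of tissue homogenate and appropriate controls by adding 200 μL of methanol/ACN/H2O (v:v:v = 85:10:5), mixing at 800 rpm for 5 minutes, and centrifuging at 3220 g and 4 °C for 15 minutes. 50 μL of supernatant was recovered, dried under nitrogen, resuspended in methanol/ACN/H2O (v:v:v = 85:10:5), and injected directly for LC-MS/MS analysis.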
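FIGs. 17A-17C are a series of graphs demonstrating that GCaseM6P has a longer half-life and greater tissue uptake than imiglucerase in the GBA D409V/null mouse model. A PK/PD study in the Gaucher D409V/null mouse model was performed using the standard of care, imiglucerase, and purified GBA produced by transient co-expression in Expi293 cells using a bicistronic vector encoding S1-S3 PTase and a natural variant of GBA. This variant GCase has greater stability under neutral and slightly alkaline conditions. Briefly, three animals were injected via the tail vein with approximately 1.5 mg/kg of recombinant GCase. For serum pharmacokinetic data, plasma samples were collected at 2, 10, 20, 40, and 60 minutes. Activity was measured using the synthetic substrate 4-methylumbelliferyl-β-D-glucopyranoside (4MU-Glc). Activity was normalized within each animal by setting the 2-minute time point to 100% activity, with subsequent time points expressed as a percentage of the t = 2 minute value. The stabilized GCase expressed in the presence of S1-S3 PTase showed a longer half-life. This longer half-life reflects a combination of greater enzyme stability and different clearance pathways. To measure the amount of GCase taken up by tissues 2 hours after enzyme injection, tissues were harvested and homogenized, and activity was measured using the 4MU-Glc substrate. Activity was normalized to total protein in the homogenate as determined by the BCA method for protein measurement. The true advantage of a stable GCase with appropriate phosphorylation is seen in the tissue uptake data presented. For all tissues evaluated, more activity was found with the stabilized GCase expressed using the bicistronic S1-S3 PTase vector platform. This was most dramatic in lung, muscle, and brain, where imiglucerase showed little activity. Taking the tissue and serum data together, the advantage of a more stable GCase with greater phosphorylation of N-linked oligosaccharides in delivering more enzyme to affected tissues is clear. This is the first time that significant amounts of GCase have been delivered to lung, muscle, and heart at such a dose.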
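The normalization described above (the 2-minute sample set to 100%, later samples expressed as a percentage of it) can be followed by a simple half-life estimate. The sketch below assumes single-exponential clearance and fits a straight line to the natural log of percent remaining; that model, the function names, and the activity values are all assumptions for illustration, since the text does not describe how half-life was derived. Only the sampling times are taken from the text.

```python
import math


def normalize_to_first(activities: list[float]) -> list[float]:
    """Express each activity as a percentage of the first (t = 2 min) sample."""
    first = activities[0]
    if first <= 0:
        raise ValueError("first time point must have positive activity")
    return [100.0 * a / first for a in activities]


def half_life_minutes(times_min: list[float], percent_remaining: list[float]) -> float:
    """Fit ln(percent) = intercept + slope * t; half-life is -ln(2) / slope."""
    n = len(times_min)
    logs = [math.log(p) for p in percent_remaining]
    mean_t = sum(times_min) / n
    mean_y = sum(logs) / n
    s_tt = sum((t - mean_t) ** 2 for t in times_min)
    s_ty = sum((t - mean_t) * (y - mean_y) for t, y in zip(times_min, logs))
    slope = s_ty / s_tt
    return -math.log(2) / slope


# Sampling times from the text; activities are hypothetical values generated
# for an enzyme that decays with a 10-minute half-life.
times = [2, 10, 20, 40, 60]
raw_activity = [800.0 * 0.5 ** ((t - 2) / 10) for t in times]
percent = normalize_to_first(raw_activity)
t_half = half_life_minutes(times, percent)
print(f"estimated half-life: {t_half:.1f} min")
```

Normalizing within each animal before fitting removes differences in injected activity between animals, so the fitted slope reflects clearance alone.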
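FIGs. 18A-18E are a series of photographs and bar graphs demonstrating that GCaseM6P ERT reduced tissue macrophages (anti-CD68 staining) better than imiglucerase in the GBA D409V/null mouse model. An efficacy study in the D409V Gaucher mouse model was performed using the standard of care, Cerezyme, and purified GBA (M0111) transiently co-expressed in Expi293 cells using a bicistronic vector encoding S1S3 PTase and a natural variant of GBA reported to have greater stability under neutral and slightly alkaline conditions. Gaucher mice approximately 20 weeks old were treated weekly with approximately 1.5 mg/kg of enzyme for 4 weeks. After 4 weeks, liver and lung tissues were harvested and fixed in 4% paraformaldehyde-PBS, pH 7.4, for immunohistochemistry with a CD68 antibody. M0111 showed greater efficacy than the current standard of care, as evidenced by the reduction of macrophages in affected tissues visualized with the CD68 antibody.
FIGs. 19A-19C are a series of photographs demonstrating that GCaseM6P ERT reduced the number and size of Gaucher storage cells (hematoxylin and eosin (H&E) staining) better than imiglucerase in the GBA D409V/null mouse model. An efficacy study in the D409V Gaucher mouse model was performed using the standard of care, Cerezyme, and purified GBA transiently co-expressed in Expi293 cells using a bicistronic vector encoding S1-S3 PTase and a natural variant of GBA reported to have greater stability under neutral and slightly alkaline conditions. Gaucher mice approximately 20 weeks old were treated weekly with approximately 1.5 mg/kg of enzyme for 4 weeks. After 4 weeks, liver and lung tissues were harvested and fixed in 4% paraformaldehyde-PBS, pH 7.4, for hematoxylin and eosin (H&E) staining. GCaseM6P showed greater efficacy than the current standard of care, as evidenced by the reduction of storage cells in affected tissues visualized by H&E staining.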
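FIGs. 20A-20B are a pair of graphs demonstrating that GCaseM6P ERT reduced accumulated substrate better than imiglucerase in the GBA D409V/null mouse model. Gaucher mice approximately 20 weeks old were treated weekly with approximately 1.5 mg/kg of enzyme for 4 weeks. Tissue samples were collected and homogenized for glucosylceramide analysis. Accumulation of glucocerebroside (GC), the natural substrate of GCase, was measured in tissue homogenates. Of particular importance is the accumulation of GC in the lung, which is a known unmet need of the current standard of care. Glucosylceramide was extracted from 20 μL aliquots of tissue homogenate and appropriate controls by adding 200 μL of methanol/ACN/H2O (v:v:v = 85:10:5), mixing at 800 rpm for 5 minutes, and centrifuging at 3220 g and 4 °C for 15 minutes. 50 μL of supernatant was recovered, dried under nitrogen, resuspended in methanol/ACN/H2O (v:v:v = 85:10:5), and injected directly for LC-MS/MS analysis. For the two ceramides measured, GCaseM6P-treated animals had lower levels after ERT than imiglucerase-treated animals.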
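Gene therapy
A delivery vector carrying a bicistronic vector comprising a sequence encoding GBA and a sequence encoding S1-S3 PTase can be used to treat or prevent a sign or symptom of Gaucher disease. In some embodiments, the delivery vector is a viral vector. In some embodiments, the viral vector is an AAV vector. In some embodiments, the AAV vector is an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, or AAV9 vector. In some embodiments, the viral vector is a lentiviral vector. In some embodiments, the vector is a non-viral vector. In some embodiments, the non-viral vector is a liposome, an LNP, a polymer nanoparticle, a nanoparticle, a micelle, a polymer, or an exosome. The following study demonstrates that, in an art-recognized standard mouse model of Gaucher disease, expression of GBA and S1-S3 PTase using the bicistronic vector leads to expression of GBAM6P, increased activity in tissues and serum, and reduced substrate. This demonstrates that a phosphorylated transgene product with high affinity for the CI-MPR can lead to effective treatment even at low activity levels, owing to efficient cellular uptake and lysosomal targeting.
FIGs. 21A-21D are a series of graphs showing the results of an in vivo AAV-mediated gene therapy study for the treatment of Gaucher disease. To determine the effect of AAV9 gene therapy with a bicistronic transgene expressing stable GBA + S1-S3 PTase under three different promoters, 15-week-old GBA D409V/null mice were administered a moderate dose, 5E11 vg, of AAV9-stable GBA + S1-S3 PTase. To determine the amount of GBA produced by tissues, tissues were harvested 2 weeks after AAV9 injection and homogenized, and activity was measured using the 4MU-Glc substrate. Activity was normalized to total protein in the homogenate as determined by the BCA method for protein determination.
FIGs. 29A-29C are a series of graphs showing enzyme activity and selected GCase substrates in lung and liver 2 weeks after injection of AAV9-hTLV-GBAM6P gene therapy in Gaucher mice. AAV9-hTLV-GBA-S1S3 is otherwise known as AAV9-hTLV-GBAM6P, where M6P denotes the S1S3 construct. Two weeks after administration of AAV9 hTLV-GBA or AAV9 hTLV-GBAM6P (a transgene carrying the bicistronic vector with GBA and S1-S3 PTase), expression in the liver was elevated for both constructs (FIG. 29A). When hepatic glucosyl-β-ceramide levels were measured (FIGs. 29B and 29C), the greatest reduction in accumulated substrate was observed in AAV9 hTLV-GBAM6P-treated animals, despite lower GCase activity in the liver compared with AAV9 hTLV-GBA-treated animals. This greater substrate reduction with less activity points to the importance of N-linked oligosaccharide phosphorylation for gene therapy in terms of cellular uptake and lysosomal targeting. In the lung, GCase activity in AAV9-treated animals was low. However, AAV9-hTLV-GBAM6P-treated animals showed a significant reduction in accumulated glucosyl-β-ceramide levels in the lung (FIGs. 29B and 29C). A slight reduction was observed in AAV9-hTLV-GBA-treated animals. This demonstrates that a phosphorylated transgene product with high affinity for the CI-MPR can lead to effective treatment even at low activity levels, owing to efficient cellular uptake and lysosomal targeting.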
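Example 4: Treatment of α-mannosidosis
Enzyme replacement therapy (ERT)
An expression vector comprising a sequence encoding LAMAN and a sequence encoding S1-S3 PTase can be used to treat or prevent a sign or symptom of α-mannosidosis. The following study demonstrates that, in a mouse model, expression of LAMAN-S1-S3 leads to expression of LAMAN-S1-S3, transport of LAMAN-S1-S3 from the circulating bloodstream into cells, and increased LAMAN activity in cells that take up the LAMAN-S1-S3 complex. Even a modest increase in LAMAN resulting from expression and uptake of the LAMAN-S1-S3 complex leads to significant functional recovery in the mouse model.
Expression of LAMAN using a bicistronic expression vector with S1-S3 PTase produces a recombinant protein containing higher levels of phosphorylated oligosaccharides that can be used to treat or prevent a sign or symptom of α-mannosidosis. The following study demonstrates that, in wild-type mice, ERT using recombinant LAMAN protein expressed with the bicistronic vector with S1-S3 PTase leads to greater uptake and broader distribution in tissues.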
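FIGs. 22A-22C are a series of graphs showing the results of an in vitro study using lysosomal alpha-mannosidase (LAMAN) as an ERT.
FIGs. 23A-23B are a photograph and a corresponding data table depicting the expression, purification, and characterization of LAMAN enzyme. Two preparations of LAMAN were transiently expressed in Expi293 cells with (M0611) or without a bicistronic vector encoding S1-S3 PTase. Both were purified using the HPC4 affinity tag. A significant increase in phosphorylation was demonstrated by measuring the amount of LAMAN bound to the immobilized cation-independent mannose 6-phosphate receptor in a dose-dependent manner. The amount of bound LAMAN is based on activity measured with the synthetic substrate 4-methylumbelliferyl-α-D-mannopyranoside (4MU-Man). The specificity of binding via phosphorylated oligosaccharides was confirmed by the ability of added mannose 6-phosphate to block binding. Of note is the ability of LAMANM6P (M0611) to bind the receptor even in the presence of M6P. LAMANM6P (M0611, P-0030) and LAMAN (P-0031) were selected for in vivo animal studies.
FIG. 23C is a graph depicting the expression, purification, and characterization of LAMANM6P (M0611) enzyme. Two preparations of LAMAN were transiently expressed in Expi293 cells in the presence or absence of a bicistronic vector encoding the S1-S3 variant of PTase. Both were purified using the HPC4 tag. A significant increase in phosphorylation was demonstrated by measuring the amount of LAMAN bound to the immobilized cation-independent mannose 6-phosphate receptor in a dose-dependent manner. The amount of bound LAMAN was determined by activity measured with the synthetic substrate 4-methylumbelliferyl-α-D-mannopyranoside (4MU-Man). The specificity of binding via phosphorylated oligosaccharides was confirmed by the ability of added mannose 6-phosphate to block binding. Of note is the ability of M0611 to bind the receptor even in the presence of M6P. LAMANM6P (M0611, P-0030) and LAMAN (P-0031) were selected for in vivo animal studies.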
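FIGs. 24A-24B are a pair of graphs showing the biodistribution of LAMAN and LAMANM6P enzymes in wild-type mice for enzyme replacement therapy. To evaluate differences in tissue uptake between LAMAN and LAMANM6P (LAMAN co-expressed with S1-S3 PTase), 2 mg/kg of each preparation was injected via the tail vein into wild-type mice (n=4). At 2 and 8 hours after administration, tissues were harvested and homogenized, and activity was measured using the 4MU-Man substrate. Activity was normalized to total protein in the homogenate as determined by the BCA method for protein determination. The advantage of LAMANM6P (LAMAN co-expressed with S1S3 PTase) is seen in the tissue uptake data. For liver, spleen, heart, lung, and brain, tissue activity was greater at 2 hours. This trend also held at 8 hours, except in the lung, which may be a result of the high variability observed in the assay of this tissue. The only exception to this observation was the kidney. Endogenous LAMAN activity was subtracted from all samples. Higher LAMAN enzyme activity was detected in most tissues of mice injected with the LAMANM6P enzyme.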
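The two data-reduction steps mentioned above, normalization to total protein and subtraction of endogenous activity, amount to a short calculation. The sketch below is illustrative only: the function names, the units, and every number are assumptions, and the text does not specify how the endogenous baseline was estimated (the mean of untreated animals is used here as one reasonable choice).

```python
def specific_activity(activity_nmol_per_h: float, protein_mg: float) -> float:
    """Normalize measured activity to total protein (e.g., from a BCA assay)."""
    if protein_mg <= 0:
        raise ValueError("protein amount must be positive")
    return activity_nmol_per_h / protein_mg


def net_uptake(treated: list[float], untreated: list[float]) -> list[float]:
    """Subtract the mean endogenous specific activity from each treated value."""
    if not untreated:
        raise ValueError("need at least one untreated control")
    baseline = sum(untreated) / len(untreated)
    return [value - baseline for value in treated]


# Hypothetical (activity in nmol/h, protein in mg) pairs for one tissue.
treated = [specific_activity(a, p) for a, p in [(120.0, 2.0), (90.0, 1.5)]]
untreated = [specific_activity(a, p) for a, p in [(20.0, 2.0), (10.0, 1.0)]]
net = net_uptake(treated, untreated)
print(net)
```

Because wild-type mice express their own LAMAN, the subtraction step is what isolates the contribution of the injected enzyme.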
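FIGs. 25A-25B are a pair of graphs demonstrating the biodistribution of LAMAN and LAMANM6P enzymes in wild-type mice for enzyme replacement therapy. To evaluate differences in tissue uptake between LAMAN and LAMANM6P (LAMAN co-expressed with S1-S3 PTase), 10 mg/kg of each preparation was injected via the tail vein into wild-type mice (n=4). At 2 and 8 hours after administration, tissues were harvested and homogenized, and activity was measured using the 4MU-Man substrate. Activity was normalized to total protein in the homogenate as determined by the BCA method for protein determination. The advantage of LAMANM6P (LAMAN co-expressed with S1-S3 PTase) is seen in the tissue uptake data. For liver, spleen, heart, lung, and brain, tissue activity was greater at 2 hours. This trend also held at 8 hours, except in the kidney, which may be a result of the high variability observed in the assay of this tissue.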
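Gene therapy
A delivery vector comprising a sequence encoding LAMAN and a sequence encoding S1-S3 modified GlcNAc-1 phosphotransferase (GlcNAc-1 PTase) can be used to treat or prevent a sign or symptom of α-mannosidosis. In some embodiments, the delivery vector is a viral vector. In some embodiments, the viral vector is an AAV vector. In some embodiments, the AAV vector is an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, or AAV9 vector. In some embodiments, the viral vector is a lentiviral vector. In some embodiments, the vector is a non-viral vector. In some embodiments, the non-viral vector is a liposome, an LNP, a polymer nanoparticle, a nanoparticle, a micelle, a polymer, or an exosome. The following study demonstrates that, in a mouse model of α-mannosidosis, expression of LAMAN-S1-S3 leads to expression of LAMAN-S1-S3, transport of LAMAN-S1-S3 from the circulating bloodstream into cells, and increased LAMAN activity in cells that take up the LAMAN-S1-S3 complex. Even a modest increase in LAMAN resulting from expression and uptake of the LAMAN-S1-S3 complex leads to significant functional recovery in the mouse model.
Alternatively or in addition, a delivery vector comprising a sequence encoding S1-S3 modified GlcNAc-1 phosphotransferase (GlcNAc-1 PTase) can be used to treat or prevent a sign or symptom of α-mannosidosis. Expression of S1-S3 can increase uptake of endogenous LAMAN by body tissues, thereby leading to significant functional recovery in the mouse model.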
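Example 5: Treatment of mucolipidosis
Enzyme replacement therapy (ERT)
An expression vector comprising a sequence encoding S1-S3 modified GlcNAc-1 phosphotransferase (GlcNAc-1 PTase) can be used to treat or prevent a sign or symptom of mucolipidosis. The following study demonstrates that expression of S1-S3 leads to expression of S1-S3, transport of S1-S3 and one or more lysosomal enzymes from the circulating bloodstream into cells, and increased activity of one or more lysosomal enzymes in cells that take up the S1-S3 complex. Even a modest increase in the S1-S3 complex resulting from expression and uptake of the S1-S3 complex and one or more lysosomal enzymes leads to significant functional recovery.
Gene therapy
A delivery vector comprising a sequence encoding S1-S3 modified GlcNAc-1 phosphotransferase (GlcNAc-1 PTase) can be used to treat or prevent a sign or symptom of mucolipidosis. In some embodiments, the delivery vector is a viral vector. In some embodiments, the viral vector is an AAV vector. In some embodiments, the AAV vector is an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, or AAV9 vector. In some embodiments, the viral vector is a lentiviral vector. In some embodiments, the vector is a non-viral vector. In some embodiments, the non-viral vector is a liposome, an LNP, a polymer nanoparticle, a nanoparticle, a micelle, a polymer, or an exosome. A delivery vector comprising a sequence encoding a soluble S1-S3 modified GlcNAc-1 phosphotransferase (GlcNAc-1 PTase) can also be used to treat or prevent a sign or symptom of mucolipidosis. The following study demonstrates that expression of S1-S3 PTase leads to expression of S1-S3 PTase, and that cellular S1-S3 activity corrects the serum levels of mistrafficked lysosomal enzymes by increasing N-linked oligosaccharide phosphorylation, so that the enzymes are efficiently targeted to lysosomes.
Alternatively or in addition, a delivery vector comprising a sequence encoding S1-S3 modified GlcNAc-1 phosphotransferase (GlcNAc-1 PTase) can be used to treat or prevent a sign or symptom of mucolipidosis. Expression of S1-S3 PTase can increase uptake of one or more endogenous lysosomal enzymes by body tissues, thereby leading to significant functional recovery in the mouse model.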
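FIGs. 26A-26B are a schematic and a graph depicting the AAV9 design and in vitro testing for mucolipidosis gene therapy (GTx). 293T cells were transduced with various M0021 (AAV9-CAGp-S1-S3) viruses and cultured for 2 days before the PTase activity assay.
FIGs. 27A-27B are a pair of graphs demonstrating that M0021 treatment reduces serum lysosomal enzyme levels in ML II mice. To determine the effect of S1-S3 PTase gene therapy, 34-week-old female mice were administered a moderate dose of M0021 (AAV9-CAGp-S1-S3), 4e12 vg (2e13 vg/kg). One phenotype of ML II is elevated serum levels of lysosomal enzymes, which results from the inability to target them to lysosomes within cells. A promising result was observed: LAMAN and ManB activities in serum decreased just 1 week after treatment. This result is important because it demonstrates the ability to affect a described phenotype of the ML II mouse model.
FIGs. 28A-28C are a series of graphs demonstrating that M0021 treatment increases phosphorylation of lysosomal enzymes in ML II. To further understand the effect of S1-S3 PTase gene therapy on the reduction in serum activity of LAMAN and ManB, CI-MPR binding of the enzymes found in serum was evaluated using the immobilized receptor binding assay described above. Briefly, increasing amounts of enzyme of known activity were added to immobilized CI-MPR. Unbound enzyme was washed away, and the remaining bound enzyme was measured using the appropriate synthetic substrate: Man-β-4MU (ManB) or 4MU-Man (LAMAN). AAV9-S1S3 gene therapy in ML II mice increased glycan phosphorylation of lysosomal enzymes. Total phosphorylated lysosomal enzymes in serum returned to normal levels, or slightly higher, after 3 weeks.
The disclosures of each and every patent, patent application, and publication cited herein are hereby incorporated herein by reference in their entirety. While the disclosure has been described with reference to specific embodiments, it is apparent that other embodiments and variations of the disclosure can be devised by those skilled in the art without departing from the true spirit and scope of the disclosure. The appended claims are intended to be construed to include all such embodiments and equivalent variations.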
SEQUENCE LISTING
<110> M6P Therapeutics
Do, Cuong
Liu, Lin
<120> VECTOR COMPOSITIONS AND METHODS OF USING SAME FOR TREATMENT OF
LYSOSOMAL STORAGE DISORDERS
<130> M6PT-002/01WO
<150> 62/869,781
<151> 2019-07-02
<150> 62/869,808
<151> 2019-07-02
<160> 164
<170> PatentIn version 3.5
<210> 1
<211> 7709
<212> DNA
<213> artificial sequence
<220>
<223> expression vector
<400> 1
gacggatcgg gagatctccc gatcccctat ggtgcactct cagtacaatc tgctctgatg 60
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420
attgacgtca atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt 480
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagc 900
gtttaaactt aagcttggta ccgagctcgg atccactagt ccagtgtggt ggaattctgc 960
agatatccag cacagtggcg gccgctgatt aacctcagga ctagtggtta ttttccacca 1020
tattgccgtc ttttggcaat gtgagggccc ggaaacctgg ccctgtcttc ttgacgagca 1080
ttcctagggg tctttcccct ctcgccaaag gaatgcaagg tctgttgaat gtcgtgaagg 1140
aagcagttcc tctggaagct tcttgaagac aaacaacgtc tgtagcgacc ctttgcaggc 1200
agcggaaccc cccacctggc gacaggtgcc tctgcggcca aaagccacgt gtataagata 1260
cacctgcaaa ggcggcacaa ccccagtgcc acgttgtgag ttggatagtt gtggaaagag 1320
tcaaatggct cacctcaagc gtattcaaca aggggctgaa ggatgcccag aaggtacccc 1380
attgtatggg atctgatctg gggcctcggt gcacatgctt tacatgtgtt tagtcgaggt 1440
taaaaaacgt ctaggccccc cgaaccacgg ggacgtggtt ttcctttgaa agtttgttaa 1500
catgctgttc aagctcctgc agagacagac ctatacctgc ctgtcccaca ggtatgggct 1560
ctacgtgtgc ttcttgggcg tcgttgtcac catcgtctcc gccttccagt tcggagaggt 1620
ggttctggaa tggagccgag atcaatacca tgttttgttt gattcctata gagacaatat 1680
tgctggaaag tcctttcaga atcggctttg tctgcccatg ccgattgacg ttgtttacac 1740
ctgggtgaat ggcacagatc ttgaactact gaaggaacta acagaattaa aaagatcaaa 1800
acgtgatcca ttaataccag aatgtcaagg taaacaaaca ccagaaaaag ataaatgtta 1860
tagagatgac atctctgcca gtcgttttga agataacgaa gaactgaggt actcattgcg 1920
atctatcgag aggcatgcac catgggttcg gaatattttc attgtcacca acgggcagat 1980
tccatcctgg ctgaaccttg acaatcctcg agtgacaata gtaacacacc aggatgtttt 2040
tcgaaatttg agccacttgc ctacctttag ttcacctgct attgaaagtc acattcatcg 2100
catcgaaggg ctgtcccaga agtttattta cctaaatgat gatgtcatgt ttgggaagga 2160
tgtctggcca gatgattttt acagtcactc caaaggccag aaggtttatt tgacatggcc 2220
tgtgccaaac ggaggtagcg gaggtgatac atttgcagat tccctcagat atgtaaataa 2280
aattctaaat agcaagtttg gattcacatc gcggaaagtc cctgctcaca tgcctcacat 2340
gattgaccgg attgttatgc aagaactgca agatatgttc cctgaagaat ttgacaagac 2400
gtcatttcac aaagtgcgcc attctgagga tatgcagttt gccttctctt atttttatta 2460
tctcatgagt gcagtgcagc cactgaatat atctcaagtc tttgatgaag ttgatacaga 2520
tcaatctggt gtcttgtctg acagagaaat ccgaacactg gctaccagaa ttcacgaact 2580
gccgttaagt ttgcaggatt tgacaggtct ggaacacatg ctaataaatt gctcaaaaat 2640
gcttcctgct gatatcacgc agctaaataa tattccacca actcaggaat cctactatga 2700
tcccaacctg ccaccggtca ctaaaagtct agtaacaaac tgtaaaccag taactgacaa 2760
aatccacaaa gcatataagg acaaaaacaa atataggttt gaaatcatgg gagaagaaga 2820
aatcgctttt aaaatgattc gtaccaacgt ttctcatgtg gttggccagt tggatgacat 2880
aagaaaaaac cctaggaagt ttgtttgcct gaatgacaac attgaccaca atcataaaga 2940
tgctcagaca gtgaaggctg ttctcaggga cttctatgaa tccatgttcc ccataccttc 3000
ccaatttgaa ctgccaagag agtatcgaaa ccgtttcctt catatgcatg agctgcagga 3060
atggagggct tatcgagaca aattgaagtt ttggacccat tgtgtactag caacattgat 3120
tatgtttact atattctcat tttttgctga gcagttaatt gcacttaagc ggaagatatt 3180
tcccagaagg aggatacaca aagaagctag tcccaatcga atcagagtat ctagaggagg 3240
taagcctatc cctaaccctc tcctcggtct cgattctacg tgagtttaaa cccgctgatc 3300
agcctcgact gtgccttcta gttgccagcc atctgttgtt tgcccctccc ccgtgccttc 3360
cttgaccctg gaaggtgcca ctcccactgt cctttcctaa taaaatgagg aaattgcatc 3420
gcattgtctg agtaggtgtc attctattct ggggggtggg gtggggcagg acagcaaggg 3480
ggaggattgg gaagacaata gcaggcatgc tggggatgcg gtgggctcta tggcttctga 3540
ggcggaaaga accagctggg gctctagggg gtatccccac gcgccctgta gcggcgcatt 3600
aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca gcgccctagc 3660
gcccgctcct ttcgctttct tcccttcctt tctcgccacg ttcgccggct ttccccgtca 3720
agctctaaat cgggggctcc ctttagggtt ccgatttagt gctttacggc acctcgaccc 3780
caaaaaactt gattagggtg atggttcacg tagtgggcca tcgccctgat agacggtttt 3840
tcgccctttg acgttggagt ccacgttctt taatagtgga ctcttgttcc aaactggaac 3900
aacactcaac cctatctcgg tctattcttt tgatttataa gggattttgc cgatttcggc 3960
ctattggtta aaaaatgagc tgatttaaca aaaatttaac gcgaattaat tctgtggaat 4020
gtgtgtcagt tagggtgtgg aaagtcccca ggctccccag caggcagaag tatgcaaagc 4080
atgcatctca attagtcagc aaccaggtgt ggaaagtccc caggctcccc agcaggcaga 4140
agtatgcaaa gcatgcatct caattagtca gcaaccatag tcccgcccct aactccgccc 4200
atcccgcccc taactccgcc cagttccgcc cattctccgc cccatggctg actaattttt 4260
tttatttatg cagaggccga ggccgcctct gcctctgagc tattccagaa gtagtgagga 4320
ggcttttttg gaggcctagg cttttgcaaa aagctcccgg gagcttgtat atccattttc 4380
ggatctgatc aagagacagg atgaggatcg tttcgcatga ttgaacaaga tggattgcac 4440
gcaggttctc cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca 4500
atcggctgct ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt 4560
gtcaagaccg acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg 4620
tggctggcca cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga 4680
agggactggc tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct 4740
cctgccgaga aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg 4800
gctacctgcc cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg 4860
gaagccggtc ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc 4920
gaactgttcg ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat 4980
ggcgatgcct gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac 5040
tgtggccggc tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt 5100
gctgaagagc ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct 5160
cccgattcgc agcgcatcgc cttctatcgc cttcttgacg agttcttctg agcgggactc 5220
tggggttcga aatgaccgac caagcgacgc ccaacctgcc atcacgagat ttcgattcca 5280
ccgccgcctt ctatgaaagg ttgggcttcg gaatcgtttt ccgggacgcc ggctggatga 5340
tcctccagcg cggggatctc atgctggagt tcttcgccca ccccaacttg tttattgcag 5400
cttataatgg ttacaaataa agcaatagca tcacaaattt cacaaataaa gcattttttt 5460
cactgcattc tagttgtggt ttgtccaaac tcatcaatgt atcttatcat gtctgtatac 5520
cgtcgacctc tagctagagc ttggcgtaat catggtcata gctgtttcct gtgtgaaatt 5580
gttatccgct cacaattcca cacaacatac gagccggaag cataaagtgt aaagcctggg 5640
gtgcctaatg agtgagctaa ctcacattaa ttgcgttgcg ctcactgccc gctttccagt 5700
cgggaaacct gtcgtgccag ctgcattaat gaatcggcca acgcgcgggg agaggcggtt 5760
tgcgtattgg gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc 5820
tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca gaatcagggg 5880
ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg 5940
ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac aaaaatcgac 6000
gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg 6060
gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac ctgtccgcct 6120
ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat ctcagttcgg 6180
tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct 6240
gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac 6300
tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt 6360
tcttgaagtg gtggcctaac tacggctaca ctagaagaac agtatttggt atctgcgctc 6420
tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc aaacaaacca 6480
ccgctggtag cggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc 6540
aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt 6600
aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa 6660
aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat 6720
gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct 6780
gactccccgt cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg 6840
caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag 6900
ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc cagtctatta 6960
attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg 7020
ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg 7080
gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggttagct 7140
ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca ctcatggtta 7200
tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt tctgtgactg 7260
gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc 7320
cggcgtcaat acgggataat accgcgccac atagcagaac tttaaaagtg ctcatcattg 7380
gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga tccagttcga 7440
tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg 7500
ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat 7560
gttgaatact catactcttc ctttttcaat attattgaag catttatcag ggttattgtc 7620
tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca 7680
catttccccg aaaagtgcca cctgacgtc 7709
<210> 2
<211> 508
<212> DNA
<213> Cytomegalovirus
<400> 2
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt 60
gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca 120
atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc 180
aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta 240
catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac 300
catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg 360
atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg 420
ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt 480
acggtgggag gtctatataa gcagagct 508
<210> 3
<211> 486
<212> DNA
<213> EMC virus
<400> 3
ggttattttc caccatattg ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg 60
tcttcttgac gagcattcct aggggtcttt cccctctcgc caaaggaatg caaggtctgt 120
tgaatgtcgt gaaggaagca gttcctctgg aagcttcttg aagacaaaca acgtctgtag 180
cgaccctttg caggcagcgg aaccccccac ctggcgacag gtgcctctgc ggccaaaagc 240
cacgtgtata agatacacct gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga 300
tagttgtgga aagagtcaaa tggctcacct caagcgtatt caacaagggg ctgaaggatg 360
cccagaaggt accccattgt atgggatctg atctggggcc tcggtgcaca tgctttacat 420
gtgtttagtc gaggttaaaa aacgtctagg ccccccgaac cacggggacg tggttttcct 480
ttgaaa 486
<210> 4
<211> 1728
<212> DNA
<213> artificial sequence
<220>
<223> modified GlcNAc-1 phosphotransferase
<400> 4
atgctgttca agctcctgca gagacagacc tatacctgcc tgtcccacag gtatgggctc 60
tacgtgtgct tcttgggcgt cgttgtcacc atcgtctccg ccttccagtt cggagaggtg 120
gttctggaat ggagccgaga tcaataccat gttttgtttg attcctatag agacaatatt 180
gctggaaagt cctttcagaa tcggctttgt ctgcccatgc cgattgacgt tgtttacacc 240
tgggtgaatg gcacagatct tgaactactg aaggaactaa cagaattaaa aagatcaaaa 300
cgtgatccat taataccaga atgtcaaggt aaacaaacac cagaaaaaga taaatgttat 360
agagatgaca tctctgccag tcgttttgaa gataacgaag aactgaggta ctcattgcga 420
tctatcgaga ggcatgcacc atgggttcgg aatattttca ttgtcaccaa cgggcagatt 480
ccatcctggc tgaaccttga caatcctcga gtgacaatag taacacacca ggatgttttt 540
cgaaatttga gccacttgcc tacctttagt tcacctgcta ttgaaagtca cattcatcgc 600
atcgaagggc tgtcccagaa gtttatttac ctaaatgatg atgtcatgtt tgggaaggat 660
gtctggccag atgattttta cagtcactcc aaaggccaga aggtttattt gacatggcct 720
gtgccaaacg gaggtagcgg aggtgataca tttgcagatt ccctcagata tgtaaataaa 780
attctaaata gcaagtttgg attcacatcg cggaaagtcc ctgctcacat gcctcacatg 840
attgaccgga ttgttatgca agaactgcaa gatatgttcc ctgaagaatt tgacaagacg 900
tcatttcaca aagtgcgcca ttctgaggat atgcagtttg ccttctctta tttttattat 960
ctcatgagtg cagtgcagcc actgaatata tctcaagtct ttgatgaagt tgatacagat 1020
caatctggtg tcttgtctga cagagaaatc cgaacactgg ctaccagaat tcacgaactg 1080
ccgttaagtt tgcaggattt gacaggtctg gaacacatgc taataaattg ctcaaaaatg 1140
cttcctgctg atatcacgca gctaaataat attccaccaa ctcaggaatc ctactatgat 1200
cccaacctgc caccggtcac taaaagtcta gtaacaaact gtaaaccagt aactgacaaa 1260
atccacaaag catataagga caaaaacaaa tataggtttg aaatcatggg agaagaagaa 1320
atcgctttta aaatgattcg taccaacgtt tctcatgtgg ttggccagtt ggatgacata 1380
agaaaaaacc ctaggaagtt tgtttgcctg aatgacaaca ttgaccacaa tcataaagat 1440
gctcagacag tgaaggctgt tctcagggac ttctatgaat ccatgttccc cataccttcc 1500
caatttgaac tgccaagaga gtatcgaaac cgtttccttc atatgcatga gctgcaggaa 1560
tggagggctt atcgagacaa attgaagttt tggacccatt gtgtactagc aacattgatt 1620
atgtttacta tattctcatt ttttgctgag cagttaattg cacttaagcg gaagatattt 1680
cccagaagga ggatacacaa agaagctagt cccaatcgaa tcagagta 1728
<210> 5
<211> 1611
<212> DNA
<213> Homo sapiens
<400> 5
atggagtttt caagtccttc cagagaggaa tgtcccaagc ctttgagtag ggtaagcatc 60
atggctggca gcctcacagg attgcttcta cttcaggcag tgtcgtgggc atcaggtgcc 120
cgcccctgca tccctaaaag cttcggctac agctcggtgg tgtgtgtctg caatgccaca 180
tactgtgact cctttgaccc cccgaccttt cctgcccttg gtaccttcag ccgctatgag 240
agtacacgca gtgggcgacg gatggagctg agtatggggc ccatccaggc taatcacacg 300
ggcacaggcc tgctactgac cctgcagcca gaacagaagt tccagaaagt gaagggattt 360
ggaggggcca tgacagatgc tgctgctctc aacatccttg ccctgtcacc ccctgcccaa 420
aatttgctac ttaaatcgta cttctctgaa gaaggaatcg gatataacat catccgggta 480
cccatggcca gctgtgactt ctccatccgc acctacacct atgcagacac ccctgatgat 540
ttccagttgc acaacttcag cctcccagag gaagatacca agctcaagat acccctgatt 600
caccgagccc tgcagttggc ccagcgtccc gtttcactcc ttgccagccc ctggacatca 660
cccacttggc tcaagaccaa tggagcggtg aatgggaagg ggtcactcaa gggacagccc 720
ggagacatct accaccagac ctgggccaga tactttgtga agttcctgga tgcctatgct 780
gagcacaagt tacagttctg ggcagtgaca gctgaaaatg agccttctgc tgggctgttg 840
agtggatacc ccttccagtg cctgggcttc acccctgaac atcagcgaga cttcattgcc 900
cgtgacctag gtcctaccct cgccaacagt actcaccaca atgtccgcct actcatgctg 960
gatgaccaac gcttgctgct gccccactgg gcaaaggtgg tactgacaga cccagaagca 1020
gctaaatatg ttcatggcat tgctgtacat tggtacctgg actttctggc tccagccaaa 1080
gccaccctag gggagacaca ccgcctgttc cccaacacca tgctctttgc ctcagaggcc 1140
tgtgtgggct ccaagttctg ggagcagagt gtgcggctag gctcctggga tcgagggatg 1200
cagtacagcc acagcatcat cacgaacctc ctgtaccatg tggtcggctg gaccgactgg 1260
aaccttgccc tgaaccccga aggaggaccc aattgggtgc gtaactttgt cgacagtccc 1320
atcattgtag acatcaccaa ggacacgttt tacaaacagc ccatgttcta ccaccttggc 1380
cacttcagca agttcattcc tgagggctcc cagagagtgg ggctggttgc cagtcagaag 1440
aacgacctgg acgcagtggc actgatgcat cccgatggct ctgctgttgt ggtcgtgcta 1500
aaccgctcct ctaaggatgt gcctcttacc atcaaggatc ctgctgtggg cttcctggag 1560
acaatctcac ctggctactc cattcacacc tacctgtggc gtcgccagtg a 1611
<210> 6
<211> 2052
<212> DNA
<213> Mus musculus
<220>
<221> misc_feature
<222> (922)..(922)
<223> n is a, c, g, or t
<220>
<221> misc_feature
<222> (930)..(930)
<223> n is a, c, g, or t
<400> 6
atggctaaca gccaacctaa ggcttcccag caacgccaag caaaagtcat gaccgccgcc 60
gcgggctcgg cgagccgtgt tgcggtgccc ttattgttgt gtgcgctgct agtgcccggt 120
ggcgcctacg tgctggacga ctctgacggc ctgggcagag agttcgacgg catcggcgct 180
gtgtctggcg gcggagccac aagcagactg ctggtcaact accccgagcc ctacagaagc 240
gagatcctgg actacctgtt caagcccaac ttcggcgcca gcctgcacat cctgaaggtg 300
gaaatcggcg gcgacggcca gaccaccgac ggcacagagc ccagccacat gcactacgag 360
ctggatgaga actacttcag aggctacgag tggtggctga tgaaggaagc caagaagaga 420
aaccccgaca tcatcctgat gggcctgcct tggagcttcc ccggctggct gggcaagggc 480
ttcagctggc cctacgtgaa cctgcagctg accgcctact acgtcgtgcg gtggattctg 540
ggcgccaagc actaccacga cctggacatc gactacatcg gcatctggaa cgagaggccc 600
ttcgacgcca actacatcaa agaactgagg aagatgctgg attaccaggg cctgcagaga 660
gtgcggatca ttgccagcga caacctgtgg gagcccatca gcagctccct gctgctggac 720
caggacctgt ggaaggtcgt cgacgtgatc ggcgcccact accctggcac ctacaccgtg 780
tggaacgcca agatgagcgg caagaagctg tggtccagcg aggacttcag caccatcaac 840
agcaacgtgg gagccggctg ctggtccaga atcctgaacc agaattacat caacggcaac 900
atgaccagca caatcgcctg gnaacctggn ggccagctac tacgaggact gccctacggc 960
agatccggcc tgatgaccgc ccaggaacct tggagcggcc actacgtggt ggcttcccca 1020
atctgggtgt ccgcccacac cacccagttc acccagcctg gctggtacta cctgaaaacc 1080
gtgggccacc tggaaaaggg cggcagctac gtggccctga ccgatggcct gggcaacctg 1140
accatcatca tcgagacaat gagccaccag cacagcatgt gcatcagacc ctacctgccc 1200
tactacaacg tgtcccacca gctggccaca ttcaccctga agggcagcct gagagagatc 1260
caggaactgc aggtctggta caccaagctg ggcacccccc agcagagact gcacttcaag 1320
cagctggaca ccctgtggct gctggacggc agcggcagct tcaccctgga actggaagag 1380
gacgaaatct tcaccctgac cacactgacc accggcagaa agggcagcta ccccccacct 1440
cctagcagca agccattccc caccaactac aaggacgact tcaacgtgga ataccccctg 1500
ttcagcgagg cccccaactt cgccgaccag accggcgtgt tcgagtacta catgaacaac 1560
gaggacagag agcacaggtt caccctgaga caggtgctga accagaggcc catcacctgg 1620
gctgccgacg ccagcagcac catctccgtg atcggggacc accactggac caacatgacc 1680
gtgcagtgcg acgtgtacat cgagacacct agaagcggcg gagtgtttat cgccggcaga 1740
gtgaacaagg gcggcatcct gatcagatcc gctacaggcg tgttcttctg gatcttcgcc 1800
aacggcagct acagagtgac cgccgacctg ggcggctgga tcacatacgc ctctggccac 1860
gccgacgtga ccgccaagag atggtacacc ctgaccctgg gcatcaaggg ctacttcgcc 1920
ttcggcatgc tgaacggcac catcctgtgg aagaacgtgc gcgtgaagta ccccggccac 1980
ggctgggctg ccatcggcac ccacacattc gagttcgccc agttcgacaa ctttcgcgtg 2040
gaagctgctc gc 2052
<210> 7
<211> 1290
<212> DNA
<213> Homo sapiens
<400> 7
atgcagctga ggaacccaga actacatctg ggctgcgcgc ttgcgcttcg cttcctggcc 60
ctcgtttcct gggacatccc tggggctaga gcactggaca atggattggc aaggacgcct 120
accatgggct ggctgcactg ggagcgcttc atgtgcaacc ttgactgcca ggaagagcca 180
gattcctgca tcagtgagaa gctcttcatg gagatggcag agctcatggt ctcagaaggc 240
tggaaggatg caggttatga gtacctctgc attgatgact gttggatggc tccccaaaga 300
gattcagaag gcagacttca ggcagaccct cagcgctttc ctcatgggat tcgccagcta 360
gctaattatg ttcacagcaa aggactgaag ctagggattt atgcagatgt tggaaataaa 420
acctgcgcag gcttccctgg gagttttgga tactacgaca ttgatgccca gacctttgct 480
gactggggag tagatctgct aaaatttgat ggttgttact gtgacagttt ggaaaatttg 540
gcagatggtt ataagcacat gtccttggcc ctgaatagga ctggcagaag cattgtgtac 600
tcctgtgagt ggcctcttta tatgtggccc tttcaaaagc ccaattatac agaaatccga 660
cagtactgca atcactggcg aaattttgct gacattgatg attcctggaa aagtataaag 720
agtatcttgg actggacatc ttttaaccag gagagaattg ttgatgttgc tggaccaggg 780
ggttggaatg acccagatat gttagtgatt ggcaactttg gcctcagctg gaatcagcaa 840
gtaactcaga tggccctctg ggctatcatg gctgctcctt tattcatgtc taatgacctc 900
cgacacatca gccctcaagc caaagctctc cttcaggata aggacgtaat tgccatcaat 960
caggacccct tgggcaagca agggtaccag cttagacagg gagacaactt tgaagtgtgg 1020
gaacgacctc tctcaggctt agcctgggct gtagctatga taaaccggca ggagattggt 1080
ggacctcgct cttataccat cgcagttgct tccctgggta aaggagtggc ctgtaatcct 1140
gcctgcttca tcacacagct cctccctgtg aaaaggaagc tagggttcta tgaatggact 1200
tcaaggttaa gaagtcacat aaatcccaca ggcactgttt tgcttcagct agaaaataca 1260
atgcagatgt cattaaaaga cttactttaa 1290
<210> 8
<211> 2232
<212> DNA
<213> Homo sapiens
<400> 8
atggaggcgg tggcggtggc cgcggcggtg ggggtccttc tcctggccgg ggccgggggc 60
gcggcaggcg acgaggcccg ggaggcggcg gccgtgcggg cgctcgtggc ccggctgctg 120
gggccaggcc ccgcggccga cttctccgtg tcggtggagc gcgctctggc tgccaagccg 180
ggcttggaca cctacagcct gggcggcggc ggcgcggcgc gcgtgcgggt gcgcggctcc 240
acgggcgtgg cggccgccgc ggggctgcac cgctacctgc gcgacttctg tggctgccac 300
gtggcctggt ccggctctca gctgcgcctg ccgcggccac tgccagccgt gccgggggag 360
ctgaccgagg ccacgcccaa caggtaccgc tattaccaga atgtgtgcac gcaaagctac 420
tctttcgtgt ggtgggactg ggcccgctgg gagcgagaga tagactggat ggcgctgaat 480
ggcatcaacc tggcactggc ctggagcggc caggaggcca tctggcagcg ggtgtacctg 540
gccttgggcc tgacccaggc agagatcaat gagttcttta ctggtcctgc cttcctggcc 600
tgggggcgaa tgggcaacct gcacacctgg gatggccccc tgcccccctc ctggcacatc 660
aagcagcttt acctgcagca ccgggtcctg gaccagatgc gctccttcgg catgacccca 720
gtgctgcctg cattcgcggg gcatgttccc gaggctgtca ccagggtgtt ccctcaggtc 780
aatgtcacga agatgggcag ttggggccac tttaactgtt cctactcctg ctccttcctt 840
ctggctccgg aagaccccat attccccatc atcgggagcc tcttcctgcg agagctgatc 900
aaagagtttg gcacagacca catctatggg gccgacactt tcaatgagat gcagccacct 960
tcctcagagc cctcctacct tgccgcagcc accactgccg tctatgaggc catgactgca 1020
gtggatactg aggctgtgtg gctgctccaa ggctggctct tccagcacca gccgcagttc 1080
tgggggcccg cccagatcag ggctgtgctg ggagctgtgc cccgtggccg cctcctggtt 1140
ctggacctgt ttgctgagag ccagcctgtg tatacccgca ctgcctcctt ccagggccag 1200
cccttcatct ggtgcatgct gcacaacttt gggggaaacc atggtctttt tggagcccta 1260
gaggctgtga acggaggccc agaagctgcc cgcctcttcc ccaactccac catggtaggc 1320
acgggcatgg cccccgaggg catcagccag aacgaagtgg tctattccct catggctgag 1380
ctgggctggc gaaaggaccc agtgccagat ttggcagcct gggtgaccag ctttgccgcc 1440
cggcggtatg gggtctccca cccggacgca ggggcagcgt ggaggctact gctccggagt 1500
gtgtacaact gctccgggga ggcctgcagg ggccacaatc gtagcccgct ggtcaggcgg 1560
ccgtccctac agatgaatac cagcatctgg tacaaccgat ctgatgtgtt tgaggcctgg 1620
cggctgctgc tcacatctgc tccctccctg gccaccagcc ccgccttccg ctacgacctg 1680
ctggacctca ctcggcaggc agtgcaggag ctggtcagct tgtactatga ggaggcaaga 1740
agcgcctacc tgagcaagga gctggcctcc ctgttgaggg ctggaggcgt cctggcctat 1800
gagctgctgc cggcactgga cgaggtgctg gctagtgaca gccgcttctt gctgggcagc 1860
tggctagagc aggcccgagc agcggcagtc agtgaggccg aggccgattt ctacgagcag 1920
aacagccgct accagctgac cttgtggggg ccagaaggca acatcctgga ctatgccaac 1980
aagcagctgg cggggttggt ggccaactac tacacccctc gctggcggct tttcctggag 2040
gcgctggttg acagtgtggc ccagggcatc cctttccaac agcaccagtt tgacaaaaat 2100
gtcttccaac tggagcaggc cttcgttctc agcaagcaga ggtaccccag ccagccgcga 2160
ggagacactg tggacctggc caagaagatc ttcctcaaat attacccccg ctgggtggcc 2220
ggctcttggt ga 2232
<210> 9
<211> 2860
<212> DNA
<213> Homo sapiens
<400> 9
atgggagtga ggcacccgcc ctgctcccac cggctcctgg ccgtctgcgc cctcgtgtcc 60
ttggcaaccg ctgcactcct ggggcacatc ctactccatg atttcctgct ggttccccga 120
gagctgagtg gctcctcccc agtcctggag gagactcacc cagctcacca gcagggagcc 180
agcagaccag ggccccggga tgcccaggca caccccggcc gtcccagagc agtgcccaca 240
cagtgcgacg tcccccccaa cagccgcttc gattgcgccc ctgacaaggc catcacccag 300
gaacagtgcg aggcccgcgg ctgttgctac atccctgcaa agcaggggct gcagggagcc 360
cagatggggc agccctggtg cttcttccca cccagctacc ccagctacaa gctggagaac 420
ctgagctcct ctgaaatggg ctacacggcc accctgaccc gtaccacccc caccttcttc 480
cccaaggaca tcctgaccct gcggctggac gtgatgatgg agactgagaa ccgcctccac 540
ttcacgatca aagatccagc taacaggcgc tacgaggtgc ccttggagac cccgcatgtc 600
cacagccggg caccgtcccc actctacagc gtggagttct ccgaggagcc cttcggggtg 660
atcgtgcgcc ggcagctgga cggccgcgtg ctgctgaaca cgacggtggc gcccctgttc 720
tttgcggacc agttccttca gctgtccacc tcgctgccct cgcagtatat cacaggcctc 780
gccgagcacc tcagtcccct gatgctcagc accagctgga ccaggatcac cctgtggaac 840
cgggaccttg cgcccacgcc cggtgcgaac ctctacgggt ctcacccttt ctacctggcg 900
ctggaggacg gcgggtcggc acacggggtg ttcctgctaa acagcaatgc catggatgtg 960
gtcctgcagc cgagccctgc ccttagctgg aggtcgacag gtgggatcct ggatgtctac 1020
atcttcctgg gcccagagcc caagagcgtg gtgcagcagt acctggacgt tgtgggatac 1080
ccgttcatgc cgccatactg gggcctgggc ttccacctgt gccgctgggg ctactcctcc 1140
accgctatca cccgccaggt ggtggagaac atgaccaggg cccacttccc cctggacgtc 1200
cagtggaacg acctggacta catggactcc cggagggact tcacgttcaa caaggatggc 1260
ttccgggact tcccggccat ggtgcaggag ctgcaccagg gcggccggcg ctacatgatg 1320
atcgtggatc ctgccatcag cagctcgggc cctgccggga gctacaggcc ctacgacgag 1380
ggtctgcgga ggggggtttt catcaccaac gagaccggcc agccgctgat tgggaaggta 1440
tggcccgggt ccactgcctt ccccgacttc accaacccca cagccctggc ctggtgggag 1500
gacatggtgg ctgagttcca tgaccaggtg cccttcgacg gcatgtggat tgacatgaac 1560
gagccttcca acttcatcag gggctctgag gacggctgcc ccaacaatga gctggagaac 1620
ccaccctacg tgcctggggt ggttgggggg accctccagg cggccaccat ctgtgcctcc 1680
agccaccagt ttctctccac acactacaac ctgcacaacc tctacggcct gaccgaagcc 1740
atcgcctccc acagggcgct ggtgaaggct cgggggacac gcccatttgt gatctcccgc 1800
tcgacctttg ctggccacgg ccgatacgcc ggccactgga cgggggacgt gtggagctcc 1860
tgggagcagc tcgcctcctc cgtgccagaa atcctgcagt ttaacctgct gggggtgcct 1920
ctggtcgggg ccgacgtctg cggcttcctg ggcaacacct cagaggagct gtgtgtgcgc 1980
tggacccagc tgggggcctt ctaccccttc atgcggaacc acaacagcct gctcagtctg 2040
ccccaggagc cgtacagctt cagcgagccg gcccagcagg ccatgaggaa ggccctcacc 2100
ctgcgctacg cactcctccc ccacctctac acactgttcc accaggccca cgtcgcgggg 2160
gagaccgtgg cccggcccct cttcctggag ttccccaagg actctagcac ctggactgtg 2220
gaccaccagc tcctgtgggg ggaggccctg ctcatcaccc cagtgctcca ggccgggaag 2280
gccgaagtga ctggctactt ccccttgggc acatggtacg acctgcagac ggtgccagta 2340
gaggcccttg gcagcctccc acccccacct gcagctcccc gtgagccagc catccacagc 2400
gaggggcagt gggtgacgct gccggccccc ctggacacca tcaacgtcca cctccgggct 2460
gggtacatca tccccctgca gggccctggc ctcacaacca cagagtcccg ccagcagccc 2520
atggccctgg ctgtggccct gaccaagggt ggggaggccc gaggggagct gttctgggac 2580
gatggagaga gcctggaagt gctggagcga ggggcctaca cacaggtcat cttcctggcc 2640
aggaataaca cgatcgtgaa tgagctggta cgtgtgacca gtgagggagc tggcctgcag 2700
ctgcagaagg tgactgtcct gggcgtggcc acggcgcccc agcaggtcct ctccaacggt 2760
gtccctgtct ccaacttcac ctacagcccc gacaccaagg tcctggacat ctgtgtctcg 2820
ctgttgatgg gagagcagtt tctcgtcagc tggtgttags 2860
<210> 10
<211> 3033
<212> DNA
<213> Homo sapiens
<400> 10
atgggcgcct acgcgcgggc ttcgggggtc tgcgctcgcg gctgcctgga ctcagcaggc 60
ccctggacca tgtcccgcgc cctgcggcca ccgctcccgc ctctctgctt tttccttttg 120
ttgctggcgg ctgccggtgc tcgggccggg ggatacgaga catgccccac agtgcagccg 180
aacatgctga acgtgcacct gctgcctcac acacatgatg acgtgggctg gctcaaaacc 240
gtggaccagt acttttatgg aatcaagaat gacatccagc acgccggtgt gcagtacatc 300
ctggactcgg tcatctctgc cttgctggca gatcccaccc gtcgcttcat ttacgtggag 360
attgccttct tctcccgttg gtggcaccag cagacaaatg ccacacagga agtcgtgcga 420
gaccttgtgc gccaggggcg cctggagttc gccaatggtg gctgggtgat gaacgatgag 480
gcagccaccc actacggtgc catcgtggac cagatgacac ttgggctgcg ctttctggag 540
gacacatttg gcaatgatgg gcgaccccgt gtggcctggc acattgaccc cttcggccac 600
tctcgggagc aggcctcgct gtttgcgcag atgggcttcg acggcttctt ctttgggcgc 660
cttgattatc aagataagtg ggtacggatg cagaagctgg agatggagca ggtgtggcgg 720
gccagcacca gcctgaagcc cccgaccgcg gacctcttca ctggtgtgct tcccaatggt 780
tacaacccgc caaggaatct gtgctgggat gtgctgtgtg tcgatcagcc gctggtggag 840
gaccctcgca gccccgagta caacgccaag gagctggtcg attacttcct aaatgtggcc 900
actgcccagg gccggtatta ccgcaccaac cacactgtga tgaccatggg ctcggacttc 960
caatatgaga atgccaacat gtggttcaag aaccttgaca agctcatccg gctggtaaat 1020
gcgcagcagg caaaaggaag cagtgtccat gttctctact ccacccccgc ttgttacctc 1080
tgggagctga acaaggccaa cctcacctgg tcagtgaaac atgacgactt cttcccttac 1140
gcggatggcc cccaccagtt ctggaccggt tacttttcca gtcggccggc cctcaaacgc 1200
tacgagcgcc tcagctacaa cttcctgcag gtgtgcaacc agctggaggc gctggtgggc 1260
ctggcggcca acgtgggacc ctatggctcc ggagacagtg cacccctcaa tgaggcgatg 1320
gctgtgctcc agcatcacga cgccgtcagc ggcacctccc gccagcacgt ggccaacgac 1380
tacgcgcgcc agcttgcggc aggctggggg ccttgcgagg ttcttctgag caacgcgctg 1440
gcgcggctca gaggcttcaa agatcacttc accttttgcc aacagctaaa catcagcatc 1500
tgcccgctca gccagacggc ggcgcgcttc caggtcatcg tttataatcc cctggggcgg 1560
aaggtgaatt ggatggtacg gctgccggtc agcgaaggcg ttttcgttgt gaaggacccc 1620
aatggcagga cagtgcccag cgatgtggta atatttccca gctcagacag ccaggcgcac 1680
cctccggagc tgctgttctc agcctcactg cccgccctgg gcttcagcac ctattcagta 1740
gcccaggtgc ctcgctggaa gccccaggcc cgcgcaccac agcccatccc cagaagatcc 1800
tggtcccctg ctttaaccat cgaaaatgag cacatccggg caacgtttga tcctgacaca 1860
gggctgttga tggagattat gaacatgaat cagcaactcc tgctgcctgt tcgccagacc 1920
ttcttctggt acaacgccag tataggtgac aacgaaagtg accaggcctc aggtgcctac 1980
atcttcagac ccaaccaaca gaaaccgctg cctgtgagcc gctgggctca gatccacctg 2040
gtgaagacac ccttggtgca ggaggtgcac cagaacttct cagcttggtg ttcccaggtg 2100
gttcgcctgt acccaggaca gcggcacctg gagctagagt ggtcggtggg gccgatacct 2160
gtgggcgaca cctgggggaa ggaggtcatc agccgttttg acacaccgct ggagacaaag 2220
ggacgcttct acacagacag caatggccgg gagatcctgg agaggaggcg ggattatcga 2280
cccacctgga aactgaacca gacggagccc gtggcaggaa actactatcc agtcaacacc 2340
cggatttaca tcacggatgg aaacatgcag ctgactgtgc tgactgaccg ctcccagggg 2400
ggcagcagcc tgagagatgg ctcgctggag ctcatggtgc accgaaggct gctgaaggac 2460
gatggacgcg gagtatcgga gccactaatg gagaacgggt cgggggcgtg ggtgcgaggg 2520
cgccacctgg tgctgctgga cacagcccag gctgcagccg ccggacaccg gctcctggcg 2580
gagcaggagg tcctggcccc tcaggtggtg ctggccccgg gtggcggcgc cgcctacaat 2640
ctcggggctc ctccgcgcac gcagttctca gggctgcgca gggacctgcc gccctcggtg 2700
cacctgctca cgctggccag ctggggcccc gaaatggtgc tgctgcgctt ggagcaccag 2760
tttgccgtag gagaggattc cggacgtaac ctgagcgccc ccgttacctt gaacttgagg 2820
gacctgttct ccaccttcac catcacccgc ctgcaggaga ccacgctggt ggccaaccag 2880
ctccgcgagg cagcctccag gctcaagtgg acaacaaaca caggccccac accccaccaa 2940
actccgtacc agctggaccc ggccaacatc acgctggaac ccatggaaat ccgcactttc 3000
ctggcctcag ttcaatggaa ggaggtggat ggt 3033
<210> 11
<211> 32
<212> DNA
<213> artificial sequence
<220>
<223> GBA forward primer
<400> 11
ctgctagcca ccatggagtt ttcaagtcct tc 32
<210> 12
<211> 31
<212> DNA
<213> artificial sequence
<220>
<223> GBA reverse primer
<400> 12
atagcggccg ctcactggcg acgccacagg t 31
<210> 13
<211> 35
<212> DNA
<213> artificial sequence
<220>
<223> GAA forward primer
<400> 13
ctgctagcca ccatgggagt gaggcacccg ccctg 35
<210> 14
<211> 38
<212> DNA
<213> artificial sequence
<220>
<223> GAA reverse primer
<400> 14
atagcggccg ctcaacacca gctgacgaga aactgctc 38
<210> 15
<211> 35
<212> DNA
<213> artificial sequence
<220>
<223> GALC forward primer
<400> 15
ctgctagcca ccatggctaa cagccaacct aaggc 35
<210> 16
<211> 39
<212> DNA
<213> artificial sequence
<220>
<223> GALC reverse primer
<400> 16
atagcggccg ctcagcgagc agcttccacg cgaaagttg 39
<210> 17
<211> 34
<212> DNA
<213> artificial sequence
<220>
<223> NAGLU forward primer
<400> 17
ctgctagcca ccatggaagc cgtggctgtc gcag 34
<210> 18
<211> 37
<212> DNA
<213> artificial sequence
<220>
<223> NAGLU reverse primer
<400> 18
atagcggccg ctcaccaact accagccacc catctag 37
<210> 19
<211> 33
<212> DNA
<213> artificial sequence
<220>
<223> GLA forward primer
<400> 19
ctggatccac catgcagctg aggaacccag aac 33
<210> 20
<211> 38
<212> DNA
<213> artificial sequence
<220>
<223> GLA reverse primer
<400> 20
atagcggccg ctcaaagtaa gtcttttaat gacatctg 38
<210> 21
<211> 35
<212> DNA
<213> artificial sequence
<220>
<223> LAMAN forward primer
<400> 21
ctgctagcca ccatgggcgc ctacgcgcgg gcttc 35
<210> 22
<211> 38
<212> DNA
<213> artificial sequence
<220>
<223> LAMAN reverse primer
<400> 22
atagcggccg ctcaaccatc cacctccttc cattgaac 38
<210> 23
<211> 3749
<212> DNA
<213> Homo sapiens
<400> 23
aaaagctatg actgcggccg cgggttcggc gggccgcgcc gcggtgccct tgctgctgtg 60
tgcgctgctg gcgcccggcg gcgcgtacgt gctcgacgac tccgacgggc tgggccggga 120
gttcgacggc atcggcgcgg tcagcggcgg cggggcaacc tcccgacttc tagtaaatta 180
cccagagccc tatcgttctc agatattgga ttatctcttt aagccgaatt ttggtgcctc 240
tttgcatatt ttaaaagtgg aaataggtgg tgatgggcag acaacagatg gcactgagcc 300
ctcccacatg cattatgcac tagatgagaa ttatttccga ggatacgagt ggtggttgat 360
gaaagaagct aagaagagga atcccaatat tacactcatt gggttgccat ggtcattccc 420
tggatggctg ggaaaaggtt tcgactggcc ttatgtcaat cttcagctga ctgcctatta 480
tgtcgtgacc tggattgtgg gcgccaagcg ttaccatgat ttggacattg attatattgg 540
aatttggaat gagaggtcat ataatgccaa ttatattaag atattaagaa aaatgctgaa 600
ttatcaaggt ctccagcgag tgaaaatcat agcaagtgat aatctctggg agtccatctc 660
tgcatccatg ctccttgatg ccgaactctt caaggtggtt gatgttatag gggctcatta 720
tcctggaacc cattcagcaa aagatgcaaa gttgactggg aagaagcttt ggtcttctga 780
agactttagc actttaaata gtgacatggg tgcaggctgc tggggtcgca ttttaaatca 840
gaattatatc aatggctata tgacttccac aatcgcatgg aatttagtgg ctagttacta 900
tgaacagttg ccttatggga gatgcgggtt gatgacggcc caggagccat ggagtgggca 960
ctacgtggta gaatctcctg tctgggtatc agctcatacc actcagttta ctcaacctgg 1020
ctggtattac ctgaagacag ttggccattt agagaaagga ggaagctacg tagctctgac 1080
tgatggctta gggaacctca ccatcatcat tgaaaccatg agtcataaac attctaagtg 1140
catacggcca tttcttcctt atttcaatgt gtcacaacaa tttgccacct ttgttcttaa 1200
gggatctttt agtgaaatac cagagctaca ggtatggtat accaaacttg gaaaaacatc 1260
cgaaagattt ctttttaagc agctggattc tctatggctc cttgacagcg atggcagttt 1320
cacactgagc ctgcatgaag atgagctgtt cacactcacc actctcacca ctggtcgcaa 1380
aggcagctac ccgcttcctc caaaatccca gcccttccca agtacctata aggatgattt 1440
caatgttgat tacccatttt ttagtgaagc tccaaacttt gctgatcaaa ctggtgtatt 1500
tgaatatttt acaaatattg aagaccctgg cgagcatcac ttcacgctac gccaagttct 1560
caaccagaga cccattacgt gggctgccga tgcatccaac acaatcagta ttataggaga 1620
ctacaactgg accaatctga ctacaaagtg tgatgtttac atagagaccc ctgacacagg 1680
aggtgtgttc attgcaggaa gagtaaataa aggtggtatt ttgattagaa gtgccagagg 1740
aattttcttc tggatttttg caaatggatc ttacagggtt acaggtgatt tagctggatg 1800
gattatatat gctttaggac gtgttgaagt tacagcaaaa aaatggtata cactcacgtt 1860
aactattaag ggtcatttcg cctctggcat gctgaatgac aagtctctgt ggacagacat 1920
ccctgtgaat tttccaaaga atggctgggc tgcaattgga actcactcct ttgaatttgc 1980
acagtttgac aactttcttg tggaagccac acgctaatac ttaacagggc atcatagaat 2040
actctggatt ttcttccctt ctttttggtt ttggttcaga gccaattctt gtttcattgg 2100
aacagtatat gaggcttttg agactaaaaa taatgaagag taaaagggga gagaaattta 2160
tttttaattt accctgtgga agattttatt agaattaatt ccaaggggaa aactggtgaa 2220
tctttaacat tacctggtgt gttccctaac attcaaactg tgcattggcc atacccttag 2280
gagtggtttg agtagtacag acctcgaagc cttgctgcta acactgaggt agctctcttc 2340
atcttatttg caagcggtcc tgtagatggc agtaacttga tcatcactga gatgtattta 2400
tgcatgctga ccgtgtgtcc aagtgagcca gtgtcttcat cacaagatga tgctgccata 2460
atagaaagct gaagaacact agaagtagct ttttgaaaac cacttcaacc tgttatgctt 2520
tatgctctaa aaagtatttt ttttattttc ctttttaaga tgatactttt gaaatgcagg 2580
atatgatgag tgggatgatt ttaaaaatgc ctctttaata aactacctct aacactattt 2640
ctgtggtaat agatattagc agattaattg ggttatttgc attatttaat ttttttgatt 2700
ccaagttttg gtcttgtaac cactataact ctctgtgaac atttttccag gtggctggaa 2760
gaaggaagaa aacctgatat agccaatgct gttgtagtcg tttcctcagc ctcatctcac 2820
tgtgctgtgg tctgtcctca catgtgcact ggtaacagac tcacacagct gatgaatgct 2880
tttctctcct tatgtgtgga aggaggggag cacttagaca tttgctaact cccagaattg 2940
gatcatctcc taagatgtac ttacttttta aagtccaaat atgtttatat ttaaatatac 3000
gtgagcatgt tcatcatgtt gtatgattta tactaagcat taatgtggct ctatgtagca 3060
aatcagttat tcatgtaggt aaagtaaatc tagaattatt tataagaatt actcattgaa 3120
ctaattctac tatttaggaa tttgtaagag tctaacatag gcttagctac agtgaagttt 3180
tgcattgctt ttgaagacaa gaagataagt gctagaataa ataagattac agagaaaatt 3240
ttttgttaaa accaagtgat ttccagctga tgtatctaat attttttaaa acgaacatta 3300
tagaggtgta atttatttac aataaaatgt tcctacttta aatatacaat tcagtgagtt 3360
ttgataaatt gatataccca tgtaaccaac actccagtca agcttcagaa tatttccatc 3420
accccagaag gttctcttgt atacctgctc agtcagttcc tttcactccc gattgttggc 3480
agccattgat aggaattcta tcactatagg ttagttttct ttgttccaga acatcatgaa 3540
agcggcgtca tgtactgtgt attcttatga atggtttctt tccatcagca taatgatttg 3600
agatttgtcc atgttgtgtg attcagtggt ttgttccttc ttatttctga agagttttcc 3660
attgtatgaa tataccacaa tttgtttcct ccccaccagt ttctgatact acaattaaaa 3720
ctgtctacat ttacaaaaaa aaaaaaaaa 3749
<210> 24
<211> 1248
<212> DNA
<213> Homo sapiens
<400> 24
atgactgggg agcgacccag cacggcgctc ccggacagac gctgggggcc gcggattctg 60
ggcttctggg gaggctgtag ggtttgggtg tttgccgcga tcttcctgct gctgtctctg 120
gcagcctcct ggtccaaggc tgagaacgac ttcggtctgg tgcagccgct ggtgaccatg 180
gagcaactgc tgtgggtgag cgggagacag atcggctcag tggacacctt ccgcatcccg 240
ctcatcacag ccactccgcg gggcactctt ctcgcctttg ctgaggcgag gaaaatgtcc 300
tcatccgatg agggggccaa gttcatcgcc ctgcggaggt ccatggacca gggcagcaca 360
tggtctccta cagcgttcat tgtcaatgat ggggatgtcc ccgatgggct gaaccttggg 420
gcagtagtga gcgatgttga gacaggagta gtatttcttt tctactccct ttgtgctcac 480
aaggccggct gccaggtggc ctctaccatg ttggtatgga gcaaggatga tggtgtttcc 540
tggagcacac cccggaatct ctccctggat attggcactg aagtgtttgc ccctggaccg 600
ggctctggta ttcagaaaca gcgggagcca cggaagggcc gcctcatcgt gtgtggccat 660
gggacgctgg agcgggacgg agtcttctgt ctcctcagcg atgatcatgg tgcctcctgg 720
cgctacggaa gtggggtcag cggcatcccc tacggtcagc ccaagcagga aaatgatttc 780
aatcctgatg aatgccagcc ctatgagctc ccagatggct cagtcgtcat caatgcccga 840
aaccagaaca actaccactg ccactgccga attgtcctcc gcagctatga tgcctgtgat 900
acactaaggc cccgtgatgt gaccttcgac cctgagctcg tggaccctgt ggtagctgca 960
ggagctgtag tcaccagctc cggcattgtc ttcttctcca acccagcaca tccagagttc 1020
cgagtgaacc tgaccctgcg atggagcttc agcaatggta cctcatggcg gaaagagaca 1080
gtccagctat ggccaggccc cagtggctat tcatccctgg caaccctgga gggcagcatg 1140
gatggagagg agcaggcccc ccagctctac gtcctgtatg agaaaggccg gaaccactac 1200
acagagagca tctccgtggc caaaatcagt gtctatggga cactctga 1248
<210> 25
<211> 415
<212> PRT
<213> Homo sapiens
<400> 25
Met Thr Gly Glu Arg Pro Ser Thr Ala Leu Pro Asp Arg Arg Trp Gly
1 5 10 15
Pro Arg Ile Leu Gly Phe Trp Gly Gly Cys Arg Val Trp Val Phe Ala
20 25 30
Ala Ile Phe Leu Leu Leu Ser Leu Ala Ala Ser Trp Ser Lys Ala Glu
35 40 45
Asn Asp Phe Gly Leu Val Gln Pro Leu Val Thr Met Glu Gln Leu Leu
50 55 60
Trp Val Ser Gly Arg Gln Ile Gly Ser Val Asp Thr Phe Arg Ile Pro
65 70 75 80
Leu Ile Thr Ala Thr Pro Arg Gly Thr Leu Leu Ala Phe Ala Glu Ala
85 90 95
Arg Lys Met Ser Ser Ser Asp Glu Gly Ala Lys Phe Ile Ala Leu Arg
100 105 110
Arg Ser Met Asp Gln Gly Ser Thr Trp Ser Pro Thr Ala Phe Ile Val
115 120 125
Asn Asp Gly Asp Val Pro Asp Gly Leu Asn Leu Gly Ala Val Val Ser
130 135 140
Asp Val Glu Thr Gly Val Val Phe Leu Phe Tyr Ser Leu Cys Ala His
145 150 155 160
Lys Ala Gly Cys Gln Val Ala Ser Thr Met Leu Val Trp Ser Lys Asp
165 170 175
Asp Gly Val Ser Trp Ser Thr Pro Arg Asn Leu Ser Leu Asp Ile Gly
180 185 190
Thr Glu Val Phe Ala Pro Gly Pro Gly Ser Gly Ile Gln Lys Gln Arg
195 200 205
Glu Pro Arg Lys Gly Arg Leu Ile Val Cys Gly His Gly Thr Leu Glu
210 215 220
Arg Asp Gly Val Phe Cys Leu Leu Ser Asp Asp His Gly Ala Ser Trp
225 230 235 240
Arg Tyr Gly Ser Gly Val Ser Gly Ile Pro Tyr Gly Gln Pro Lys Gln
245 250 255
Glu Asn Asp Phe Asn Pro Asp Glu Cys Gln Pro Tyr Glu Leu Pro Asp
260 265 270
Gly Ser Val Val Ile Asn Ala Arg Asn Gln Asn Asn Tyr His Cys His
275 280 285
Cys Arg Ile Val Leu Arg Ser Tyr Asp Ala Cys Asp Thr Leu Arg Pro
290 295 300
Arg Asp Val Thr Phe Asp Pro Glu Leu Val Asp Pro Val Val Ala Ala
305 310 315 320
Gly Ala Val Val Thr Ser Ser Gly Ile Val Phe Phe Ser Asn Pro Ala
325 330 335
His Pro Glu Phe Arg Val Asn Leu Thr Leu Arg Trp Ser Phe Ser Asn
340 345 350
Gly Thr Ser Trp Arg Lys Glu Thr Val Gln Leu Trp Pro Gly Pro Ser
355 360 365
Gly Tyr Ser Ser Leu Ala Thr Leu Glu Gly Ser Met Asp Gly Glu Glu
370 375 380
Gln Ala Pro Gln Leu Tyr Val Leu Tyr Glu Lys Gly Arg Asn His Tyr
385 390 395 400
Thr Glu Ser Ile Ser Val Ala Lys Ile Ser Val Tyr Gly Thr Leu
405 410 415
<210> 26
<211> 1497
<212> DNA
<213> Homo sapiens
<400> 26
atgacttcca gtccccgggc gcctcctgga gagcaaggac gcgggggagc agagatgatc 60
cgagccgcgc cgccgccgct gttcctgctg ctgctgctgc tgctgctgct agtgtcctgg 120
gcgtcccgag gcgaggcagc ccccgaccag gacgagatcc agcgcctccc cgggctggcc 180
aagcagccgt ctttccgcca gtactccggc tacctcaaag gctccggctc caagcacctc 240
cactactggt ttgtggagtc ccagaaggat cccgagaaca gccctgtggt gctttggctc 300
aatgggggtc ccggctgcag ctcactagat gggctcctca cagagcatgg ccccttcctg 360
gtccagccag atggtgtcac cctggagtac aacccctatt cttggaatct gattgccaat 420
gtgttatacc tggagtcccc agctggggtg ggcttctcct actccgatga caagttttat 480
gcaactaatg acactgaggt cgcccagagc aattttgagg cccttcaaga tttcttccgc 540
ctctttccgg agtacaagaa caacaaactt ttcctgaccg gggagagcta tgctggcatc 600
tacatcccca ccctggccgt gctggtcatg caggatccca gcatgaacct tcaggggctg 660
gctgtgggca atggactctc ctcctatgag cagaatgaca actccctggt ctactttgcc 720
tactaccatg gccttctggg gaacaggctt tggtcttctc tccagaccca ctgctgctct 780
caaaacaagt gtaacttcta tgacaacaaa gacctggaat gcgtgaccaa tcttcaggaa 840
gtggcccgca tcgtgggcaa ctctggcctc aacatctaca atctctatgc cccgtgtgct 900
ggaggggtgc ccagccattt taggtatgag aaggacactg ttgtggtcca ggatttgggc 960
aacatcttca ctcgcctgcc actcaagcgg atgtggcatc aggcactgct gcgctcaggg 1020
gataaagtgc gcatggaccc cccctgcacc aacacaacag ctgcttccac ctacctcaac 1080
aacccgtacg tgcggaaggc cctcaacatc ccggagcagc tgccacaatg ggacatgtgc 1140
aactttctgg taaacttaca gtaccgccgt ctctaccgaa gcatgaactc ccagtatctg 1200
aagctgctta gctcacagaa ataccagatc ctattatata atggagatgt agacatggcc 1260
tgcaatttca tgggggatga gtggtttgtg gattccctca accagaagat ggaggtgcag 1320
cgccggccct ggttagtgaa gtacggggac agcggggagc agattgccgg cttcgtgaag 1380
gagttctccc acatcgcctt tctcacgatc aagggcgccg gccacatggt tcccaccgac 1440
aagcccctcg ctgccttcac catgttctcc cgcttcctga acaagcagcc atactga 1497
<210> 27
<211> 480
<212> PRT
<213> Homo sapiens
<400> 27
Met Ile Arg Ala Ala Pro Pro Pro Leu Phe Leu Leu Leu Leu Leu Leu
1 5 10 15
Leu Leu Leu Val Ser Trp Ala Ser Arg Gly Glu Ala Ala Pro Asp Gln
20 25 30
Asp Glu Ile Gln Arg Leu Pro Gly Leu Ala Lys Gln Pro Ser Phe Arg
35 40 45
Gln Tyr Ser Gly Tyr Leu Lys Gly Ser Gly Ser Lys His Leu His Tyr
50 55 60
Trp Phe Val Glu Ser Gln Lys Asp Pro Glu Asn Ser Pro Val Val Leu
65 70 75 80
Trp Leu Asn Gly Gly Pro Gly Cys Ser Ser Leu Asp Gly Leu Leu Thr
85 90 95
Glu His Gly Pro Phe Leu Val Gln Pro Asp Gly Val Thr Leu Glu Tyr
100 105 110
Asn Pro Tyr Ser Trp Asn Leu Ile Ala Asn Val Leu Tyr Leu Glu Ser
115 120 125
Pro Ala Gly Val Gly Phe Ser Tyr Ser Asp Asp Lys Phe Tyr Ala Thr
130 135 140
Asn Asp Thr Glu Val Ala Gln Ser Asn Phe Glu Ala Leu Gln Asp Phe
145 150 155 160
Phe Arg Leu Phe Pro Glu Tyr Lys Asn Asn Lys Leu Phe Leu Thr Gly
165 170 175
Glu Ser Tyr Ala Gly Ile Tyr Ile Pro Thr Leu Ala Val Leu Val Met
180 185 190
Gln Asp Pro Ser Met Asn Leu Gln Gly Leu Ala Val Gly Asn Gly Leu
195 200 205
Ser Ser Tyr Glu Gln Asn Asp Asn Ser Leu Val Tyr Phe Ala Tyr Tyr
210 215 220
His Gly Leu Leu Gly Asn Arg Leu Trp Ser Ser Leu Gln Thr His Cys
225 230 235 240
Cys Ser Gln Asn Lys Cys Asn Phe Tyr Asp Asn Lys Asp Leu Glu Cys
245 250 255
Val Thr Asn Leu Gln Glu Val Ala Arg Ile Val Gly Asn Ser Gly Leu
260 265 270
Asn Ile Tyr Asn Leu Tyr Ala Pro Cys Ala Gly Gly Val Pro Ser His
275 280 285
Phe Arg Tyr Glu Lys Asp Thr Val Val Val Gln Asp Leu Gly Asn Ile
290 295 300
Phe Thr Arg Leu Pro Leu Lys Arg Met Trp His Gln Ala Leu Leu Arg
305 310 315 320
Ser Gly Asp Lys Val Arg Met Asp Pro Pro Cys Thr Asn Thr Thr Ala
325 330 335
Ala Ser Thr Tyr Leu Asn Asn Pro Tyr Val Arg Lys Ala Leu Asn Ile
340 345 350
Pro Glu Gln Leu Pro Gln Trp Asp Met Cys Asn Phe Leu Val Asn Leu
355 360 365
Gln Tyr Arg Arg Leu Tyr Arg Ser Met Asn Ser Gln Tyr Leu Lys Leu
370 375 380
Leu Ser Ser Gln Lys Tyr Gln Ile Leu Leu Tyr Asn Gly Asp Val Asp
385 390 395 400
Met Ala Cys Asn Phe Met Gly Asp Glu Trp Phe Val Asp Ser Leu Asn
405 410 415
Gln Lys Met Glu Val Gln Arg Arg Pro Trp Leu Val Lys Tyr Gly Asp
420 425 430
Ser Gly Glu Gln Ile Ala Gly Phe Val Lys Glu Phe Ser His Ile Ala
435 440 445
Phe Leu Thr Ile Lys Gly Ala Gly His Met Val Pro Thr Asp Lys Pro
450 455 460
Leu Ala Ala Phe Thr Met Phe Ser Arg Phe Leu Asn Lys Gln Pro Tyr
465 470 475 480
<210> 28
<211> 3036
<212> DNA
<213> Homo sapiens
<400> 28
atgggcgcct acgcgcgggc ttcgggggtc tgcgctcgcg gctgcctgga ctcagcaggc 60
ccctggacca tgtcccgcgc cctgcggcca ccgctcccgc ctctctgctt tttccttttg 120
ttgctggcgg ctgccggtgc tcgggccggg ggatacgaga catgccccac agtgcagccg 180
aacatgctga acgtgcacct gctgcctcac acacatgatg acgtgggctg gctcaaaacc 240
gtggaccagt acttttatgg aatcaagaat gacatccagc acgccggtgt gcagtacatc 300
ctggactcgg tcatctctgc cttgctggca gatcccaccc gtcgcttcat ttacgtggag 360
attgccttct tctcccgttg gtggcaccag cagacaaatg ccacacagga agtcgtgcga 420
gaccttgtgc gccaggggcg cctggagttc gccaatggtg gctgggtgat gaacgatgag 480
gcagccaccc actacggtgc catcgtggac cagatgacac ttgggctgcg ctttctggag 540
gacacatttg gcaatgatgg gcgaccccgt gtggcctggc acattgaccc cttcggccac 600
tctcgggagc aggcctcgct gtttgcgcag atgggcttcg acggcttctt ctttgggcgc 660
cttgattatc aagataagtg ggtacggatg cagaagctgg agatggagca ggtgtggcgg 720
gccagcacca gcctgaagcc cccgaccgcg gacctcttca ctggtgtgct tcccaatggt 780
tacaacccgc caaggaatct gtgctgggat gtgctgtgtg tcgatcagcc gctggtggag 840
gaccctcgca gccccgagta caacgccaag gagctggtcg attacttcct aaatgtggcc 900
actgcccagg gccggtatta ccgcaccaac cacactgtga tgaccatggg ctcggacttc 960
caatatgaga atgccaacat gtggttcaag aaccttgaca agctcatccg gctggtaaat 1020
gcgcagcagg caaaaggaag cagtgtccat gttctctact ccacccccgc ttgttacctc 1080
tgggagctga acaaggccaa cctcacctgg tcagtgaaac atgacgactt cttcccttac 1140
gcggatggcc cccaccagtt ctggaccggt tacttttcca gtcggccggc cctcaaacgc 1200
tacgagcgcc tcagctacaa cttcctgcag gtgtgcaacc agctggaggc gctggtgggc 1260
ctggcggcca acgtgggacc ctatggctcc ggagacagtg cacccctcaa tgaggcgatg 1320
gctgtgctcc agcatcacga cgccgtcagc ggcacctccc gccagcacgt ggccaacgac 1380
tacgcgcgcc agcttgcggc aggctggggg ccttgcgagg ttcttctgag caacgcgctg 1440
gcgcggctca gaggcttcaa agatcacttc accttttgcc aacagctaaa catcagcatc 1500
tgcccgctca gccagacggc ggcgcgcttc caggtcatcg tttataatcc cctggggcgg 1560
aaggtgaatt ggatggtacg gctgccggtc agcgaaggcg ttttcgttgt gaaggacccc 1620
aatggcagga cagtgcccag cgatgtggta atatttccca gctcagacag ccaggcgcac 1680
cctccggagc tgctgttctc agcctcactg cccgccctgg gcttcagcac ctattcagta 1740
gcccaggtgc ctcgctggaa gccccaggcc cgcgcaccac agcccatccc cagaagatcc 1800
tggtcccctg ctttaaccat cgaaaatgag cacatccggg caacgtttga tcctgacaca 1860
gggctgttga tggagattat gaacatgaat cagcaactcc tgctgcctgt tcgccagacc 1920
ttcttctggt acaacgccag tataggtgac aacgaaagtg accaggcctc aggtgcctac 1980
atcttcagac ccaaccaaca gaaaccgctg cctgtgagcc gctgggctca gatccacctg 2040
gtgaagacac ccttggtgca ggaggtgcac cagaacttct cagcttggtg ttcccaggtg 2100
gttcgcctgt acccaggaca gcggcacctg gagctagagt ggtcggtggg gccgatacct 2160
gtgggcgaca cctgggggaa ggaggtcatc agccgttttg acacaccgct ggagacaaag 2220
ggacgcttct acacagacag caatggccgg gagatcctgg agaggaggcg ggattatcga 2280
cccacctgga aactgaacca gacggagccc gtggcaggaa actactatcc agtcaacacc 2340
cggatttaca tcacggatgg aaacatgcag ctgactgtgc tgactgaccg ctcccagggg 2400
ggcagcagcc tgagagatgg ctcgctggag ctcatggtgc accgaaggct gctgaaggac 2460
gatggacgcg gagtatcgga gccactaatg gagaacgggt cgggggcgtg ggtgcgaggg 2520
cgccacctgg tgctgctgga cacagcccag gctgcagccg ccggacaccg gctcctggcg 2580
gagcaggagg tcctggcccc tcaggtggtg ctggccccgg gtggcggcgc cgcctacaat 2640
ctcggggctc ctccgcgcac gcagttctca gggctgcgca gggacctgcc gccctcggtg 2700
cacctgctca cgctggccag ctggggcccc gaaatggtgc tgctgcgctt ggagcaccag 2760
tttgccgtag gagaggattc cggacgtaac ctgagcgccc ccgttacctt gaacttgagg 2820
gacctgttct ccaccttcac catcacccgc ctgcaggaga ccacgctggt ggccaaccag 2880
ctccgcgagg cagcctccag gctcaagtgg acaacaaaca caggccccac accccaccaa 2940
actccgtacc agctggaccc ggccaacatc acgctggaac ccatggaaat ccgcactttc 3000
ctggcctcag ttcaatggaa ggaggtggat ggttag 3036
<210> 29
<211> 1011
<212> PRT
<213> Homo sapiens
<400> 29
Met Gly Ala Tyr Ala Arg Ala Ser Gly Val Cys Ala Arg Gly Cys Leu
1 5 10 15
Asp Ser Ala Gly Pro Trp Thr Met Ser Arg Ala Leu Arg Pro Pro Leu
20 25 30
Pro Pro Leu Cys Phe Phe Leu Leu Leu Leu Ala Ala Ala Gly Ala Arg
35 40 45
Ala Gly Gly Tyr Glu Thr Cys Pro Thr Val Gln Pro Asn Met Leu Asn
50 55 60
Val His Leu Leu Pro His Thr His Asp Asp Val Gly Trp Leu Lys Thr
65 70 75 80
Val Asp Gln Tyr Phe Tyr Gly Ile Lys Asn Asp Ile Gln His Ala Gly
85 90 95
Val Gln Tyr Ile Leu Asp Ser Val Ile Ser Ala Leu Leu Ala Asp Pro
100 105 110
Thr Arg Arg Phe Ile Tyr Val Glu Ile Ala Phe Phe Ser Arg Trp Trp
115 120 125
His Gln Gln Thr Asn Ala Thr Gln Glu Val Val Arg Asp Leu Val Arg
130 135 140
Gln Gly Arg Leu Glu Phe Ala Asn Gly Gly Trp Val Met Asn Asp Glu
145 150 155 160
Ala Ala Thr His Tyr Gly Ala Ile Val Asp Gln Met Thr Leu Gly Leu
165 170 175
Arg Phe Leu Glu Asp Thr Phe Gly Asn Asp Gly Arg Pro Arg Val Ala
180 185 190
Trp His Ile Asp Pro Phe Gly His Ser Arg Glu Gln Ala Ser Leu Phe
195 200 205
Ala Gln Met Gly Phe Asp Gly Phe Phe Phe Gly Arg Leu Asp Tyr Gln
210 215 220
Asp Lys Trp Val Arg Met Gln Lys Leu Glu Met Glu Gln Val Trp Arg
225 230 235 240
Ala Ser Thr Ser Leu Lys Pro Pro Thr Ala Asp Leu Phe Thr Gly Val
245 250 255
Leu Pro Asn Gly Tyr Asn Pro Pro Arg Asn Leu Cys Trp Asp Val Leu
260 265 270
Cys Val Asp Gln Pro Leu Val Glu Asp Pro Arg Ser Pro Glu Tyr Asn
275 280 285
Ala Lys Glu Leu Val Asp Tyr Phe Leu Asn Val Ala Thr Ala Gln Gly
290 295 300
Arg Tyr Tyr Arg Thr Asn His Thr Val Met Thr Met Gly Ser Asp Phe
305 310 315 320
Gln Tyr Glu Asn Ala Asn Met Trp Phe Lys Asn Leu Asp Lys Leu Ile
325 330 335
Arg Leu Val Asn Ala Gln Gln Ala Lys Gly Ser Ser Val His Val Leu
340 345 350
Tyr Ser Thr Pro Ala Cys Tyr Leu Trp Glu Leu Asn Lys Ala Asn Leu
355 360 365
Thr Trp Ser Val Lys His Asp Asp Phe Phe Pro Tyr Ala Asp Gly Pro
370 375 380
His Gln Phe Trp Thr Gly Tyr Phe Ser Ser Arg Pro Ala Leu Lys Arg
385 390 395 400
Tyr Glu Arg Leu Ser Tyr Asn Phe Leu Gln Val Cys Asn Gln Leu Glu
405 410 415
Ala Leu Val Gly Leu Ala Ala Asn Val Gly Pro Tyr Gly Ser Gly Asp
420 425 430
Ser Ala Pro Leu Asn Glu Ala Met Ala Val Leu Gln His His Asp Ala
435 440 445
Val Ser Gly Thr Ser Arg Gln His Val Ala Asn Asp Tyr Ala Arg Gln
450 455 460
Leu Ala Ala Gly Trp Gly Pro Cys Glu Val Leu Leu Ser Asn Ala Leu
465 470 475 480
Ala Arg Leu Arg Gly Phe Lys Asp His Phe Thr Phe Cys Gln Gln Leu
485 490 495
Asn Ile Ser Ile Cys Pro Leu Ser Gln Thr Ala Ala Arg Phe Gln Val
500 505 510
Ile Val Tyr Asn Pro Leu Gly Arg Lys Val Asn Trp Met Val Arg Leu
515 520 525
Pro Val Ser Glu Gly Val Phe Val Val Lys Asp Pro Asn Gly Arg Thr
530 535 540
Val Pro Ser Asp Val Val Ile Phe Pro Ser Ser Asp Ser Gln Ala His
545 550 555 560
Pro Pro Glu Leu Leu Phe Ser Ala Ser Leu Pro Ala Leu Gly Phe Ser
565 570 575
Thr Tyr Ser Val Ala Gln Val Pro Arg Trp Lys Pro Gln Ala Arg Ala
580 585 590
Pro Gln Pro Ile Pro Arg Arg Ser Trp Ser Pro Ala Leu Thr Ile Glu
595 600 605
Asn Glu His Ile Arg Ala Thr Phe Asp Pro Asp Thr Gly Leu Leu Met
610 615 620
Glu Ile Met Asn Met Asn Gln Gln Leu Leu Leu Pro Val Arg Gln Thr
625 630 635 640
Phe Phe Trp Tyr Asn Ala Ser Ile Gly Asp Asn Glu Ser Asp Gln Ala
645 650 655
Ser Gly Ala Tyr Ile Phe Arg Pro Asn Gln Gln Lys Pro Leu Pro Val
660 665 670
Ser Arg Trp Ala Gln Ile His Leu Val Lys Thr Pro Leu Val Gln Glu
675 680 685
Val His Gln Asn Phe Ser Ala Trp Cys Ser Gln Val Val Arg Leu Tyr
690 695 700
Pro Gly Gln Arg His Leu Glu Leu Glu Trp Ser Val Gly Pro Ile Pro
705 710 715 720
Val Gly Asp Thr Trp Gly Lys Glu Val Ile Ser Arg Phe Asp Thr Pro
725 730 735
Leu Glu Thr Lys Gly Arg Phe Tyr Thr Asp Ser Asn Gly Arg Glu Ile
740 745 750
Leu Glu Arg Arg Arg Asp Tyr Arg Pro Thr Trp Lys Leu Asn Gln Thr
755 760 765
Glu Pro Val Ala Gly Asn Tyr Tyr Pro Val Asn Thr Arg Ile Tyr Ile
770 775 780
Thr Asp Gly Asn Met Gln Leu Thr Val Leu Thr Asp Arg Ser Gln Gly
785 790 795 800
Gly Ser Ser Leu Arg Asp Gly Ser Leu Glu Leu Met Val His Arg Arg
805 810 815
Leu Leu Lys Asp Asp Gly Arg Gly Val Ser Glu Pro Leu Met Glu Asn
820 825 830
Gly Ser Gly Ala Trp Val Arg Gly Arg His Leu Val Leu Leu Asp Thr
835 840 845
Ala Gln Ala Ala Ala Ala Gly His Arg Leu Leu Ala Glu Gln Glu Val
850 855 860
Leu Ala Pro Gln Val Val Leu Ala Pro Gly Gly Gly Ala Ala Tyr Asn
865 870 875 880
Leu Gly Ala Pro Pro Arg Thr Gln Phe Ser Gly Leu Arg Arg Asp Leu
885 890 895
Pro Pro Ser Val His Leu Leu Thr Leu Ala Ser Trp Gly Pro Glu Met
900 905 910
Val Leu Leu Arg Leu Glu His Gln Phe Ala Val Gly Glu Asp Ser Gly
915 920 925
Arg Asn Leu Ser Ala Pro Val Thr Leu Asn Leu Arg Asp Leu Phe Ser
930 935 940
Thr Phe Thr Ile Thr Arg Leu Gln Glu Thr Thr Leu Val Ala Asn Gln
945 950 955 960
Leu Arg Glu Ala Ala Ser Arg Leu Lys Trp Thr Thr Asn Thr Gly Pro
965 970 975
Thr Pro His Gln Thr Pro Tyr Gln Leu Asp Pro Ala Asn Ile Thr Leu
980 985 990
Glu Pro Met Glu Ile Arg Thr Phe Leu Ala Ser Val Gln Trp Lys Glu
995 1000 1005
Val Asp Gly
1010
<210> 30
<211> 2640
<212> DNA
<213> Homo sapiens
<400> 30
atgcgcctcc acctgctcct gctgctcgcg ctgtgcggtg caggcaccac cgccgcggag 60
ctcagttaca gcttgcgtgg caactggagc atctgcaatg ggaacggctc gctggagctg 120
cccggggcgg tccctggctg cgtgcacagc gccttgttcc agcagggcct gatccaggat 180
tcttactaca gatttaatga ccttaactac agatgggtct ctttggataa ctggacctat 240
agcaaagaat ttaaaatccc ctttgaaatt agcaaatggc aaaaagtaaa tttgattctt 300
gagggagtgg atacggtttc aaaaatcctg ttcaatgaag tcactattgg ggaaacagac 360
aatatgttca atagatatag ctttgatatt accaacgtgg tcagggacgt gaactccatt 420
gagctgcgtt tccagtcagc ggtgttgtat gcagcacagc agagcaaagc tcacactcgc 480
taccaggttc ccccagactg ccctccactt gtgcagaagg gtgaatgcca tgtcaacttt 540
gttcggaagg agcaatgttc ctttagttgg gactgggggc cttcctttcc tacccaggga 600
atctggaaag atgttagaat tgaagcctat aatatttgtc acctgaacta cttcacattt 660
tccccaatat atgataagag tgcccaggag tggaatctgg aaatagagtc tacatttgat 720
gttgtcagct caaagccagt tggtggtcaa gtgatcgtag ccatccctaa gttgcaaaca 780
caacagacat acagcattga acttcaacct gggaaaagga ttgttgagct atttgtgaac 840
attagcaaga atattactgt agaaacttgg tggcctcatg gacatggaaa ccagactggg 900
tacaacatga ctgttctttt tgaactggat ggaggcttaa atattgaaaa atcagctaag 960
gtttatttta ggacagtgga acttatagaa gagcctataa aagggtctcc tggtttgagt 1020
ttctatttca aaattaatgg atttcccata tttctaaaag gctcaaactg gatcccagca 1080
gattcattcc aggaccgagt aacctctgag ttgttacggc tccttttaca gtctgttgtg 1140
gatgctaata tgaatactct tcgggtttgg ggaggaggaa tttatgagca ggatgaattc 1200
tatgaactct gtgatgaact aggaataatg gtatggcagg attttatgtt tgcctgtgcc 1260
ctttatccaa ctgatcaggg cttcctggat tcagtgacag cagaagttgc ctaccagatc 1320
aagagactga aatctcatcc ttctatcatc atatggagtg gcaataatga aaatgaggag 1380
gcgctgatga tgaattggta tcatatcagt ttcactgacc ggccaatcta catcaaggac 1440
tatgtgacac tctatgtgaa aaacatcaga gagctcgtac tggcaggaga caagagtcgt 1500
ccttttatta cgtccagtcc tacaaatggg gctgaaactg ttgcagaagc ctgggtctct 1560
caaaacccta atagcaatta ttttggtgat gtacattttt atgactatat cagtgattgc 1620
tggaactgga aagttttccc aaaagctcga tttgcatctg aatatggata tcagtcctgg 1680
ccgtccttca gtacattaga aaaggtctcg tctacagagg actggtcttt caatagcaag 1740
ttttcacttc atcgacaaca tcacgaaggt ggtaacaaac aaatgcttta tcaggctgga 1800
cttcatttca aactccccca aagcacagat ccattacgca catttaaaga taccatctac 1860
cttactcagg tgatgcaggc ccagtgtgtc aaaacagaaa ctgaattcta ccgccgtagt 1920
cgcagcgaga tagtggatca gcaagggcac acgatggggg cactttattg gcagttgaat 1980
gacatctggc aagctccttc ctgggcttct cttgagtacg gaggaaagtg gaaaatgctt 2040
cattactttg ctcagaattt ctttgctcca ctgttgccag taggctttga gaatgaaaac 2100
acgttctata tctatggtgt gtcagatctt cactcggatt attcgatgac actcagtgtg 2160
agagtccata catggagctc cctggagccc gtgtgctctc gtgtgactga acgttttgtg 2220
atgaaaggag gagaggctgt ctgcctttat gaggagccag tgtctgaatt gctgaggaga 2280
tgtgggaatt gcacacggga aagctgtgtg gtttcctttt acctttcagc tgaccatgaa 2340
ctcctgagcc cgaccaacta ccacttcttg tcctcaccga aggaggccgt ggggctctgc 2400
aaggcgcaga tcactgccat catctctcag caaggtgaca tatttgtttt tgacctggag 2460
acctcagctg tcgctccctt tgtttggttg gatgtaggaa gcatcccagg gagatttagt 2520
gacaatggtt tcctcatgac tgagaagaca cgaactatat tattttaccc ttgggagccc 2580
accagcaaga atgagttgga gcaatctttt catgtgacct ccttaacaga tatttactga 2640
<210> 31
<211> 879
<212> PRT
<213> Homo sapiens
<400> 31
Met Arg Leu His Leu Leu Leu Leu Leu Ala Leu Cys Gly Ala Gly Thr
1 5 10 15
Thr Ala Ala Glu Leu Ser Tyr Ser Leu Arg Gly Asn Trp Ser Ile Cys
20 25 30
Asn Gly Asn Gly Ser Leu Glu Leu Pro Gly Ala Val Pro Gly Cys Val
35 40 45
His Ser Ala Leu Phe Gln Gln Gly Leu Ile Gln Asp Ser Tyr Tyr Arg
50 55 60
Phe Asn Asp Leu Asn Tyr Arg Trp Val Ser Leu Asp Asn Trp Thr Tyr
65 70 75 80
Ser Lys Glu Phe Lys Ile Pro Phe Glu Ile Ser Lys Trp Gln Lys Val
85 90 95
Asn Leu Ile Leu Glu Gly Val Asp Thr Val Ser Lys Ile Leu Phe Asn
100 105 110
Glu Val Thr Ile Gly Glu Thr Asp Asn Met Phe Asn Arg Tyr Ser Phe
115 120 125
Asp Ile Thr Asn Val Val Arg Asp Val Asn Ser Ile Glu Leu Arg Phe
130 135 140
Gln Ser Ala Val Leu Tyr Ala Ala Gln Gln Ser Lys Ala His Thr Arg
145 150 155 160
Tyr Gln Val Pro Pro Asp Cys Pro Pro Leu Val Gln Lys Gly Glu Cys
165 170 175
His Val Asn Phe Val Arg Lys Glu Gln Cys Ser Phe Ser Trp Asp Trp
180 185 190
Gly Pro Ser Phe Pro Thr Gln Gly Ile Trp Lys Asp Val Arg Ile Glu
195 200 205
Ala Tyr Asn Ile Cys His Leu Asn Tyr Phe Thr Phe Ser Pro Ile Tyr
210 215 220
Asp Lys Ser Ala Gln Glu Trp Asn Leu Glu Ile Glu Ser Thr Phe Asp
225 230 235 240
Val Val Ser Ser Lys Pro Val Gly Gly Gln Val Ile Val Ala Ile Pro
245 250 255
Lys Leu Gln Thr Gln Gln Thr Tyr Ser Ile Glu Leu Gln Pro Gly Lys
260 265 270
Arg Ile Val Glu Leu Phe Val Asn Ile Ser Lys Asn Ile Thr Val Glu
275 280 285
Thr Trp Trp Pro His Gly His Gly Asn Gln Thr Gly Tyr Asn Met Thr
290 295 300
Val Leu Phe Glu Leu Asp Gly Gly Leu Asn Ile Glu Lys Ser Ala Lys
305 310 315 320
Val Tyr Phe Arg Thr Val Glu Leu Ile Glu Glu Pro Ile Lys Gly Ser
325 330 335
Pro Gly Leu Ser Phe Tyr Phe Lys Ile Asn Gly Phe Pro Ile Phe Leu
340 345 350
Lys Gly Ser Asn Trp Ile Pro Ala Asp Ser Phe Gln Asp Arg Val Thr
355 360 365
Ser Glu Leu Leu Arg Leu Leu Leu Gln Ser Val Val Asp Ala Asn Met
370 375 380
Asn Thr Leu Arg Val Trp Gly Gly Gly Ile Tyr Glu Gln Asp Glu Phe
385 390 395 400
Tyr Glu Leu Cys Asp Glu Leu Gly Ile Met Val Trp Gln Asp Phe Met
405 410 415
Phe Ala Cys Ala Leu Tyr Pro Thr Asp Gln Gly Phe Leu Asp Ser Val
420 425 430
Thr Ala Glu Val Ala Tyr Gln Ile Lys Arg Leu Lys Ser His Pro Ser
435 440 445
Ile Ile Ile Trp Ser Gly Asn Asn Glu Asn Glu Glu Ala Leu Met Met
450 455 460
Asn Trp Tyr His Ile Ser Phe Thr Asp Arg Pro Ile Tyr Ile Lys Asp
465 470 475 480
Tyr Val Thr Leu Tyr Val Lys Asn Ile Arg Glu Leu Val Leu Ala Gly
485 490 495
Asp Lys Ser Arg Pro Phe Ile Thr Ser Ser Pro Thr Asn Gly Ala Glu
500 505 510
Thr Val Ala Glu Ala Trp Val Ser Gln Asn Pro Asn Ser Asn Tyr Phe
515 520 525
Gly Asp Val His Phe Tyr Asp Tyr Ile Ser Asp Cys Trp Asn Trp Lys
530 535 540
Val Phe Pro Lys Ala Arg Phe Ala Ser Glu Tyr Gly Tyr Gln Ser Trp
545 550 555 560
Pro Ser Phe Ser Thr Leu Glu Lys Val Ser Ser Thr Glu Asp Trp Ser
565 570 575
Phe Asn Ser Lys Phe Ser Leu His Arg Gln His His Glu Gly Gly Asn
580 585 590
Lys Gln Met Leu Tyr Gln Ala Gly Leu His Phe Lys Leu Pro Gln Ser
595 600 605
Thr Asp Pro Leu Arg Thr Phe Lys Asp Thr Ile Tyr Leu Thr Gln Val
610 615 620
Met Gln Ala Gln Cys Val Lys Thr Glu Thr Glu Phe Tyr Arg Arg Ser
625 630 635 640
Arg Ser Glu Ile Val Asp Gln Gln Gly His Thr Met Gly Ala Leu Tyr
645 650 655
Trp Gln Leu Asn Asp Ile Trp Gln Ala Pro Ser Trp Ala Ser Leu Glu
660 665 670
Tyr Gly Gly Lys Trp Lys Met Leu His Tyr Phe Ala Gln Asn Phe Phe
675 680 685
Ala Pro Leu Leu Pro Val Gly Phe Glu Asn Glu Asn Thr Phe Tyr Ile
690 695 700
Tyr Gly Val Ser Asp Leu His Ser Asp Tyr Ser Met Thr Leu Ser Val
705 710 715 720
Arg Val His Thr Trp Ser Ser Leu Glu Pro Val Cys Ser Arg Val Thr
725 730 735
Glu Arg Phe Val Met Lys Gly Gly Glu Ala Val Cys Leu Tyr Glu Glu
740 745 750
Pro Val Ser Glu Leu Leu Arg Arg Cys Gly Asn Cys Thr Arg Glu Ser
755 760 765
Cys Val Val Ser Phe Tyr Leu Ser Ala Asp His Glu Leu Leu Ser Pro
770 775 780
Thr Asn Tyr His Phe Leu Ser Ser Pro Lys Glu Ala Val Gly Leu Cys
785 790 795 800
Lys Ala Gln Ile Thr Ala Ile Ile Ser Gln Gln Gly Asp Ile Phe Val
805 810 815
Phe Asp Leu Glu Thr Ser Ala Val Ala Pro Phe Val Trp Leu Asp Val
820 825 830
Gly Ser Ile Pro Gly Arg Phe Ser Asp Asn Gly Phe Leu Met Thr Glu
835 840 845
Lys Thr Arg Thr Ile Leu Phe Tyr Pro Trp Glu Pro Thr Ser Lys Asn
850 855 860
Glu Leu Glu Gln Ser Phe His Val Thr Ser Leu Thr Asp Ile Tyr
865 870 875
<210> 32
<211> 1041
<212> DNA
<213> Homo sapiens
<400> 32
atggcgcgga agtcgaactt gcctgtgctt ctcgtgccgt ttctgctctg ccaggcccta 60
gtgcgctgct ccagccctct gcccctggtc gtcaacactt ggccctttaa gaatgcaacc 120
gaagcagcgt ggagggcatt agcatctgga ggctctgccc tggatgcagt ggagagcggc 180
tgtgccatgt gtgagagaga gcagtgtgac ggctctgtag gctttggagg aagtcctgat 240
gaacttggag aaaccacact agatgccatg atcatggatg gcactactat ggatgtagga 300
gcagtaggag atctcagacg aattaaaaat gctattggtg tggcacggaa agtactggaa 360
catacaacac acacactttt agtaggagag tcagccacca catttgctca aagtatgggg 420
tttatcaatg aagacttatc taccactgct tctcaagctc ttcattcaga ttggcttgct 480
cggaattgcc agccaaatta ttggaggaat gttataccag atccctcaaa atactgcgga 540
ccctacaaac cacctggtat cttaaagcag gatattccta tccataaaga aacagaagat 600
gatcgtggtc atgacactat tggcatggtt gtaatccata agacaggaca tattgctgct 660
ggtacatcta caaatggtat aaaattcaaa atacatggcc gtgtaggaga ctcaccaata 720
cctggagctg gagcctatgc tgacgatact gcaggggcag ccgcagccac tgggaatggt 780
gatatattga tgcgcttcct gccaagctac caagctgtag aatacatgag aagaggagaa 840
gatccaacca tagcttgcca aaaagtgatt tcaagaatcc agaagcattt tccagaattc 900
tttggggctg ttatatgtgc caatgtgact ggaagttacg gtgctgcttg caataaactt 960
tcaacattta ctcagtttag tttcatggtt tataattccg aaaaaaatca gccaactgag 1020
gaaaaagtgg actgcatcta a 1041
<210> 33
<211> 346
<212> PRT
<213> Homo sapiens
<400> 33
Met Ala Arg Lys Ser Asn Leu Pro Val Leu Leu Val Pro Phe Leu Leu
1 5 10 15
Cys Gln Ala Leu Val Arg Cys Ser Ser Pro Leu Pro Leu Val Val Asn
20 25 30
Thr Trp Pro Phe Lys Asn Ala Thr Glu Ala Ala Trp Arg Ala Leu Ala
35 40 45
Ser Gly Gly Ser Ala Leu Asp Ala Val Glu Ser Gly Cys Ala Met Cys
50 55 60
Glu Arg Glu Gln Cys Asp Gly Ser Val Gly Phe Gly Gly Ser Pro Asp
65 70 75 80
Glu Leu Gly Glu Thr Thr Leu Asp Ala Met Ile Met Asp Gly Thr Thr
85 90 95
Met Asp Val Gly Ala Val Gly Asp Leu Arg Arg Ile Lys Asn Ala Ile
100 105 110
Gly Val Ala Arg Lys Val Leu Glu His Thr Thr His Thr Leu Leu Val
115 120 125
Gly Glu Ser Ala Thr Thr Phe Ala Gln Ser Met Gly Phe Ile Asn Glu
130 135 140
Asp Leu Ser Thr Thr Ala Ser Gln Ala Leu His Ser Asp Trp Leu Ala
145 150 155 160
Arg Asn Cys Gln Pro Asn Tyr Trp Arg Asn Val Ile Pro Asp Pro Ser
165 170 175
Lys Tyr Cys Gly Pro Tyr Lys Pro Pro Gly Ile Leu Lys Gln Asp Ile
180 185 190
Pro Ile His Lys Glu Thr Glu Asp Asp Arg Gly His Asp Thr Ile Gly
195 200 205
Met Val Val Ile His Lys Thr Gly His Ile Ala Ala Gly Thr Ser Thr
210 215 220
Asn Gly Ile Lys Phe Lys Ile His Gly Arg Val Gly Asp Ser Pro Ile
225 230 235 240
Pro Gly Ala Gly Ala Tyr Ala Asp Asp Thr Ala Gly Ala Ala Ala Ala
245 250 255
Thr Gly Asn Gly Asp Ile Leu Met Arg Phe Leu Pro Ser Tyr Gln Ala
260 265 270
Val Glu Tyr Met Arg Arg Gly Glu Asp Pro Thr Ile Ala Cys Gln Lys
275 280 285
Val Ile Ser Arg Ile Gln Lys His Phe Pro Glu Phe Phe Gly Ala Val
290 295 300
Ile Cys Ala Asn Val Thr Gly Ser Tyr Gly Ala Ala Cys Asn Lys Leu
305 310 315 320
Ser Thr Phe Thr Gln Phe Ser Phe Met Val Tyr Asn Ser Glu Lys Asn
325 330 335
Gln Pro Thr Glu Glu Lys Val Asp Cys Ile
340 345
<210> 34
<211> 1401
<212> DNA
<213> Homo sapiens
<400> 34
atgcgggctc cggggatgag gtcgcggccg gcgggtcccg cgctgttgct gctgctgctc 60
ttcctcggag cggccgagtc ggtgcgtcgg gcccagcctc cgcgccgcta caccccagac 120
tggccgagcc tggattctcg gccgctgccg gcctggttcg acgaagccaa gttcggggtg 180
ttcatccact ggggcgtgtt ctcggtgccc gcctggggca gcgagtggtt ctggtggcac 240
tggcagggcg aggggcggcc gcagtaccag cgcttcatgc gcgacaacta cccgcccggc 300
ttcagctacg ccgacttcgg accgcagttc actgcgcgct tcttccaccc ggaggagtgg 360
gccgacctct tccaggccgc gggcgccaag tatgtagttt tgacgacaaa gcatcacgaa 420
ggcttcacaa actggccgag tcctgtgtct tggaactgga actccaaaga cgtggggcct 480
catcgggatt tggttggtga attgggaaca gctctccgga agaggaacat ccgctatgga 540
ctataccact cactcttaga gtggttccat ccactctatc tacttgataa gaaaaatggc 600
ttcaaaacac agcattttgt cagtgcaaaa acaatgccag agctgtacga ccttgttaac 660
agctataaac ctgatctgat ctggtctgat ggggagtggg aatgtcctga tacttactgg 720
aactccacaa attttctttc atggctctac aatgacagcc ctgtcaagga tgaggtggta 780
gtaaatgacc gatggggtca gaactgttcc tgtcaccatg gaggatacta taactgtgaa 840
gataaattca agccacagag cttgccagat cacaagtggg agatgtgcac cagcattgac 900
aagttttcct ggggctatcg tcgtgacatg gcattgtctg atgttacaga agaatctgaa 960
atcatttcgg aactggttca gacagtaagt ttgggaggca actatcttct gaacattgga 1020
ccaactaaag atggactgat tgttcccatc ttccaagaaa ggcttcttgc tgttgggaaa 1080
tggctgagca tcaatgggga ggctatctat gcctccaaac catggcgggt gcaatgggaa 1140
aagaacacaa catctgtatg gtatacctca aagggatcgg ctgtttatgc catttttctg 1200
cactggccag aaaatggagt cttaaacctt gaatccccca taactacctc aactacaaag 1260
ataacaatgc tgggaattca aggagatctg aagtggtcca cagatccaga taaaggtctc 1320
ttcatctctc taccccagtt gccaccctct gctgtccccg cagagtttgc ttggactata 1380
aagctgacag gagtgaagta a 1401
<210> 35
<211> 466
<212> PRT
<213> Homo sapiens
<400> 35
Met Arg Ala Pro Gly Met Arg Ser Arg Pro Ala Gly Pro Ala Leu Leu
1 5 10 15
Leu Leu Leu Leu Phe Leu Gly Ala Ala Glu Ser Val Arg Arg Ala Gln
20 25 30
Pro Pro Arg Arg Tyr Thr Pro Asp Trp Pro Ser Leu Asp Ser Arg Pro
35 40 45
Leu Pro Ala Trp Phe Asp Glu Ala Lys Phe Gly Val Phe Ile His Trp
50 55 60
Gly Val Phe Ser Val Pro Ala Trp Gly Ser Glu Trp Phe Trp Trp His
65 70 75 80
Trp Gln Gly Glu Gly Arg Pro Gln Tyr Gln Arg Phe Met Arg Asp Asn
85 90 95
Tyr Pro Pro Gly Phe Ser Tyr Ala Asp Phe Gly Pro Gln Phe Thr Ala
100 105 110
Arg Phe Phe His Pro Glu Glu Trp Ala Asp Leu Phe Gln Ala Ala Gly
115 120 125
Ala Lys Tyr Val Val Leu Thr Thr Lys His His Glu Gly Phe Thr Asn
130 135 140
Trp Pro Ser Pro Val Ser Trp Asn Trp Asn Ser Lys Asp Val Gly Pro
145 150 155 160
His Arg Asp Leu Val Gly Glu Leu Gly Thr Ala Leu Arg Lys Arg Asn
165 170 175
Ile Arg Tyr Gly Leu Tyr His Ser Leu Leu Glu Trp Phe His Pro Leu
180 185 190
Tyr Leu Leu Asp Lys Lys Asn Gly Phe Lys Thr Gln His Phe Val Ser
195 200 205
Ala Lys Thr Met Pro Glu Leu Tyr Asp Leu Val Asn Ser Tyr Lys Pro
210 215 220
Asp Leu Ile Trp Ser Asp Gly Glu Trp Glu Cys Pro Asp Thr Tyr Trp
225 230 235 240
Asn Ser Thr Asn Phe Leu Ser Trp Leu Tyr Asn Asp Ser Pro Val Lys
245 250 255
Asp Glu Val Val Val Asn Asp Arg Trp Gly Gln Asn Cys Ser Cys His
260 265 270
His Gly Gly Tyr Tyr Asn Cys Glu Asp Lys Phe Lys Pro Gln Ser Leu
275 280 285
Pro Asp His Lys Trp Glu Met Cys Thr Ser Ile Asp Lys Phe Ser Trp
290 295 300
Gly Tyr Arg Arg Asp Met Ala Leu Ser Asp Val Thr Glu Glu Ser Glu
305 310 315 320
Ile Ile Ser Glu Leu Val Gln Thr Val Ser Leu Gly Gly Asn Tyr Leu
325 330 335
Leu Asn Ile Gly Pro Thr Lys Asp Gly Leu Ile Val Pro Ile Phe Gln
340 345 350
Glu Arg Leu Leu Ala Val Gly Lys Trp Leu Ser Ile Asn Gly Glu Ala
355 360 365
Ile Tyr Ala Ser Lys Pro Trp Arg Val Gln Trp Glu Lys Asn Thr Thr
370 375 380
Ser Val Trp Tyr Thr Ser Lys Gly Ser Ala Val Tyr Ala Ile Phe Leu
385 390 395 400
His Trp Pro Glu Asn Gly Val Leu Asn Leu Glu Ser Pro Ile Thr Thr
405 410 415
Ser Thr Thr Lys Ile Thr Met Leu Gly Ile Gln Gly Asp Leu Lys Trp
420 425 430
Ser Thr Asp Pro Asp Lys Gly Leu Phe Ile Ser Leu Pro Gln Leu Pro
435 440 445
Pro Ser Ala Val Pro Ala Glu Phe Ala Trp Thr Ile Lys Leu Thr Gly
450 455 460
Val Lys
465
<210> 36
<211> 2232
<212> DNA
<213> Homo sapiens
<400> 36
atggaggcgg tggcggtggc cgcggcggtg ggggtccttc tcctggccgg ggccgggggc 60
gcggcaggcg acgaggcccg ggaggcggcg gccgtgcggg cgctcgtggc ccggctgctg 120
gggccaggcc ccgcggccga cttctccgtg tcggtggagc gcgctctggc tgccaagccg 180
ggcttggaca cctacagcct gggcggcggc ggcgcggcgc gcgtgcgggt gcgcggctcc 240
acgggcgtgg cggccgccgc ggggctgcac cgctacctgc gcgacttctg tggctgccac 300
gtggcctggt ccggctctca gctgcgcctg ccgcggccac tgccagccgt gccgggggag 360
ctgaccgagg ccacgcccaa caggtaccgc tattaccaga atgtgtgcac gcaaagctac 420
tctttcgtgt ggtgggactg ggcccgctgg gagcgagaga tagactggat ggcgctgaat 480
ggcatcaacc tggcactggc ctggagcggc caggaggcca tctggcagcg ggtgtacctg 540
gccttgggcc tgacccaggc agagatcaat gagttcttta ctggtcctgc cttcctggcc 600
tgggggcgaa tgggcaacct gcacacctgg gatggccccc tgcccccctc ctggcacatc 660
aagcagcttt acctgcagca ccgggtcctg gaccagatgc gctccttcgg catgacccca 720
gtgctgcctg cattcgcggg gcatgttccc gaggctgtca ccagggtgtt ccctcaggtc 780
aatgtcacga agatgggcag ttggggccac tttaactgtt cctactcctg ctccttcctt 840
ctggctccgg aagaccccat attccccatc atcgggagcc tcttcctgcg agagctgatc 900
aaagagtttg gcacagacca catctatggg gccgacactt tcaatgagat gcagccacct 960
tcctcagagc cctcctacct tgccgcagcc accactgccg tctatgaggc catgactgca 1020
gtggatactg aggctgtgtg gctgctccaa ggctggctct tccagcacca gccgcagttc 1080
tgggggcccg cccagatcag ggctgtgctg ggagctgtgc cccgtggccg cctcctggtt 1140
ctggacctgt ttgctgagag ccagcctgtg tatacccgca ctgcctcctt ccagggccag 1200
cccttcatct ggtgcatgct gcacaacttt gggggaaacc atggtctttt tggagcccta 1260
gaggctgtga acggaggccc agaagctgcc cgcctcttcc ccaactccac catggtaggc 1320
acgggcatgg cccccgaggg catcagccag aacgaagtgg tctattccct catggctgag 1380
ctgggctggc gaaaggaccc agtgccagat ttggcagcct gggtgaccag ctttgccgcc 1440
cggcggtatg gggtctccca cccggacgca ggggcagcgt ggaggctact gctccggagt 1500
gtgtacaact gctccgggga ggcctgcagg ggccacaatc gtagcccgct ggtcaggcgg 1560
ccgtccctac agatgaatac cagcatctgg tacaaccgat ctgatgtgtt tgaggcctgg 1620
cggctgctgc tcacatctgc tccctccctg gccaccagcc ccgccttccg ctacgacctg 1680
ctggacctca ctcggcaggc agtgcaggag ctggtcagct tgtactatga ggaggcaaga 1740
agcgcctacc tgagcaagga gctggcctcc ctgttgaggg ctggaggcgt cctggcctat 1800
gagctgctgc cggcactgga cgaggtgctg gctagtgaca gccgcttctt gctgggcagc 1860
tggctagagc aggcccgagc agcggcagtc agtgaggccg aggccgattt ctacgagcag 1920
aacagccgct accagctgac cttgtggggg ccagaaggca acatcctgga ctatgccaac 1980
aagcagctgg cggggttggt ggccaactac tacacccctc gctggcggct tttcctggag 2040
gcgctggttg acagtgtggc ccagggcatc cctttccaac agcaccagtt tgacaaaaat 2100
gtcttccaac tggagcaggc cttcgttctc agcaagcaga ggtaccccag ccagccgcga 2160
ggagacactg tggacctggc caagaagatc ttcctcaaat attacccccg ctgggtggcc 2220
ggctcttggt ga 2232
<210> 37
<211> 743
<212> PRT
<213> Homo sapiens
<400> 37
Met Glu Ala Val Ala Val Ala Ala Ala Val Gly Val Leu Leu Leu Ala
1 5 10 15
Gly Ala Gly Gly Ala Ala Gly Asp Glu Ala Arg Glu Ala Ala Ala Val
20 25 30
Arg Ala Leu Val Ala Arg Leu Leu Gly Pro Gly Pro Ala Ala Asp Phe
35 40 45
Ser Val Ser Val Glu Arg Ala Leu Ala Ala Lys Pro Gly Leu Asp Thr
50 55 60
Tyr Ser Leu Gly Gly Gly Gly Ala Ala Arg Val Arg Val Arg Gly Ser
65 70 75 80
Thr Gly Val Ala Ala Ala Ala Gly Leu His Arg Tyr Leu Arg Asp Phe
85 90 95
Cys Gly Cys His Val Ala Trp Ser Gly Ser Gln Leu Arg Leu Pro Arg
100 105 110
Pro Leu Pro Ala Val Pro Gly Glu Leu Thr Glu Ala Thr Pro Asn Arg
115 120 125
Tyr Arg Tyr Tyr Gln Asn Val Cys Thr Gln Ser Tyr Ser Phe Val Trp
130 135 140
Trp Asp Trp Ala Arg Trp Glu Arg Glu Ile Asp Trp Met Ala Leu Asn
145 150 155 160
Gly Ile Asn Leu Ala Leu Ala Trp Ser Gly Gln Glu Ala Ile Trp Gln
165 170 175
Arg Val Tyr Leu Ala Leu Gly Leu Thr Gln Ala Glu Ile Asn Glu Phe
180 185 190
Phe Thr Gly Pro Ala Phe Leu Ala Trp Gly Arg Met Gly Asn Leu His
195 200 205
Thr Trp Asp Gly Pro Leu Pro Pro Ser Trp His Ile Lys Gln Leu Tyr
210 215 220
Leu Gln His Arg Val Leu Asp Gln Met Arg Ser Phe Gly Met Thr Pro
225 230 235 240
Val Leu Pro Ala Phe Ala Gly His Val Pro Glu Ala Val Thr Arg Val
245 250 255
Phe Pro Gln Val Asn Val Thr Lys Met Gly Ser Trp Gly His Phe Asn
260 265 270
Cys Ser Tyr Ser Cys Ser Phe Leu Leu Ala Pro Glu Asp Pro Ile Phe
275 280 285
Pro Ile Ile Gly Ser Leu Phe Leu Arg Glu Leu Ile Lys Glu Phe Gly
290 295 300
Thr Asp His Ile Tyr Gly Ala Asp Thr Phe Asn Glu Met Gln Pro Pro
305 310 315 320
Ser Ser Glu Pro Ser Tyr Leu Ala Ala Ala Thr Thr Ala Val Tyr Glu
325 330 335
Ala Met Thr Ala Val Asp Thr Glu Ala Val Trp Leu Leu Gln Gly Trp
340 345 350
Leu Phe Gln His Gln Pro Gln Phe Trp Gly Pro Ala Gln Ile Arg Ala
355 360 365
Val Leu Gly Ala Val Pro Arg Gly Arg Leu Leu Val Leu Asp Leu Phe
370 375 380
Ala Glu Ser Gln Pro Val Tyr Thr Arg Thr Ala Ser Phe Gln Gly Gln
385 390 395 400
Pro Phe Ile Trp Cys Met Leu His Asn Phe Gly Gly Asn His Gly Leu
405 410 415
Phe Gly Ala Leu Glu Ala Val Asn Gly Gly Pro Glu Ala Ala Arg Leu
420 425 430
Phe Pro Asn Ser Thr Met Val Gly Thr Gly Met Ala Pro Glu Gly Ile
435 440 445
Ser Gln Asn Glu Val Val Tyr Ser Leu Met Ala Glu Leu Gly Trp Arg
450 455 460
Lys Asp Pro Val Pro Asp Leu Ala Ala Trp Val Thr Ser Phe Ala Ala
465 470 475 480
Arg Arg Tyr Gly Val Ser His Pro Asp Ala Gly Ala Ala Trp Arg Leu
485 490 495
Leu Leu Arg Ser Val Tyr Asn Cys Ser Gly Glu Ala Cys Arg Gly His
500 505 510
Asn Arg Ser Pro Leu Val Arg Arg Pro Ser Leu Gln Met Asn Thr Ser
515 520 525
Ile Trp Tyr Asn Arg Ser Asp Val Phe Glu Ala Trp Arg Leu Leu Leu
530 535 540
Thr Ser Ala Pro Ser Leu Ala Thr Ser Pro Ala Phe Arg Tyr Asp Leu
545 550 555 560
Leu Asp Leu Thr Arg Gln Ala Val Gln Glu Leu Val Ser Leu Tyr Tyr
565 570 575
Glu Glu Ala Arg Ser Ala Tyr Leu Ser Lys Glu Leu Ala Ser Leu Leu
580 585 590
Arg Ala Gly Gly Val Leu Ala Tyr Glu Leu Leu Pro Ala Leu Asp Glu
595 600 605
Val Leu Ala Ser Asp Ser Arg Phe Leu Leu Gly Ser Trp Leu Glu Gln
610 615 620
Ala Arg Ala Ala Ala Val Ser Glu Ala Glu Ala Asp Phe Tyr Glu Gln
625 630 635 640
Asn Ser Arg Tyr Gln Leu Thr Leu Trp Gly Pro Glu Gly Asn Ile Leu
645 650 655
Asp Tyr Ala Asn Lys Gln Leu Ala Gly Leu Val Ala Asn Tyr Tyr Thr
660 665 670
Pro Arg Trp Arg Leu Phe Leu Glu Ala Leu Val Asp Ser Val Ala Gln
675 680 685
Gly Ile Pro Phe Gln Gln His Gln Phe Asp Lys Asn Val Phe Gln Leu
690 695 700
Glu Gln Ala Phe Val Leu Ser Lys Gln Arg Tyr Pro Ser Gln Pro Arg
705 710 715 720
Gly Asp Thr Val Asp Leu Ala Lys Lys Ile Phe Leu Lys Tyr Tyr Pro
725 730 735
Arg Trp Val Ala Gly Ser Trp
740
<210> 38
<211> 2034
<212> DNA
<213> Homo sapiens
<400> 38
atgccggggt tcctggttcg catcctccct ctgttgctgg ttctgctgct tctgggccct 60
acgcgcggct tgcgcaatgc cacccagagg atgtttgaaa ttgactatag ccgggactcc 120
ttcctcaagg atggccagcc atttcgctac atctcaggaa gcattcacta ctcccgtgtg 180
ccccgcttct actggaagga ccggctgctg aagatgaaga tggctgggct gaacgccatc 240
cagacgtatg tgccctggaa ctttcatgag ccctggccag gacagtacca gttttctgag 300
gaccatgatg tggaatattt tcttcggctg gctcatgagc tgggactgct ggttatcctg 360
aggcccgggc cctacatctg tgcagagtgg gaaatgggag gattacctgc ttggctgcta 420
gagaaagagt ctattcttct ccgctcctcc gacccagatt acctggcagc tgtggacaag 480
tggttgggag tccttctgcc caagatgaag cctctcctct atcagaatgg agggccagtt 540
ataacagtgc aggttgaaaa tgaatatggc agctactttg cctgtgattt tgactacctg 600
cgcttcctgc agaagcgctt tcgccaccat ctgggggatg atgtggttct gtttaccact 660
gatggagcac ataaaacatt cctgaaatgt ggggccctgc agggcctcta caccacggtg 720
gactttggaa caggcagcaa catcacagat gctttcctaa gccagaggaa gtgtgagccc 780
aaaggaccct tgatcaattc tgaattctat actggctggc tagatcactg gggccaacct 840
cactccacaa tcaagaccga agcagtggct tcctccctct atgatatact tgcccgtggg 900
gcgagtgtga acttgtacat gtttataggt gggaccaatt ttgcctattg gaatggggcc 960
aactcaccct atgcagcaca gcccaccagc tacgactatg atgccccact gagtgaggct 1020
ggggacctca ctgagaagta ttttgctctg cgaaacatca tccagaagtt tgaaaaagta 1080
ccagaaggtc ctatccctcc atctacacca aagtttgcat atggaaaggt cactttggaa 1140
aagttaaaga cagtgggagc agctctggac attctgtgtc cctctgggcc catcaaaagc 1200
ctttatccct tgacatttat ccaggtgaaa cagcattatg ggtttgtgct gtaccggaca 1260
acacttcctc aagattgcag caacccagca cctctctctt cacccctcaa tggagtccac 1320
gatcgagcat atgttgctgt ggatgggatc ccccagggag tccttgagcg aaacaatgtg 1380
atcactctga acataacagg gaaagctgga gccactctgg accttctggt agagaacatg 1440
ggacgtgtga actatggtgc atatatcaac gattttaagg gtttggtttc taacctgact 1500
ctcagttcca atatcctcac ggactggacg atctttccac tggacactga ggatgcagtg 1560
tgcagccacc tggggggctg gggacaccgt gacagtggcc accatgatga agcctgggcc 1620
cacaactcat ccaactacac gctcccggcc ttttatatgg ggaacttctc cattcccagt 1680
gggatcccag acttgcccca ggacaccttt atccagtttc ctggatggac caagggccag 1740
gtctggatta atggctttaa ccttggccgc tattggccag cccggggccc tcagttgacc 1800
ttgtttgtgc cccagcacat cctgatgacc tcggccccaa acaccatcac cgtgctggaa 1860
ctggagtggg caccctgcag cagtgatgat ccagaactat gtgctgtgac gttcgtggac 1920
aggccagtta ttggctcatc tgtgacctac gatcatccct ccaaacctgt tgaaaaaaga 1980
ctcatgcccc cacccccgca aaaaaacaaa gattcatggc tggaccatgt atga 2034
<210> 39
<211> 677
<212> PRT
<213> Homo sapiens
<400> 39
Met Pro Gly Phe Leu Val Arg Ile Leu Pro Leu Leu Leu Val Leu Leu
1 5 10 15
Leu Leu Gly Pro Thr Arg Gly Leu Arg Asn Ala Thr Gln Arg Met Phe
20 25 30
Glu Ile Asp Tyr Ser Arg Asp Ser Phe Leu Lys Asp Gly Gln Pro Phe
35 40 45
Arg Tyr Ile Ser Gly Ser Ile His Tyr Ser Arg Val Pro Arg Phe Tyr
50 55 60
Trp Lys Asp Arg Leu Leu Lys Met Lys Met Ala Gly Leu Asn Ala Ile
65 70 75 80
Gln Thr Tyr Val Pro Trp Asn Phe His Glu Pro Trp Pro Gly Gln Tyr
85 90 95
Gln Phe Ser Glu Asp His Asp Val Glu Tyr Phe Leu Arg Leu Ala His
100 105 110
Glu Leu Gly Leu Leu Val Ile Leu Arg Pro Gly Pro Tyr Ile Cys Ala
115 120 125
Glu Trp Glu Met Gly Gly Leu Pro Ala Trp Leu Leu Glu Lys Glu Ser
130 135 140
Ile Leu Leu Arg Ser Ser Asp Pro Asp Tyr Leu Ala Ala Val Asp Lys
145 150 155 160
Trp Leu Gly Val Leu Leu Pro Lys Met Lys Pro Leu Leu Tyr Gln Asn
165 170 175
Gly Gly Pro Val Ile Thr Val Gln Val Glu Asn Glu Tyr Gly Ser Tyr
180 185 190
Phe Ala Cys Asp Phe Asp Tyr Leu Arg Phe Leu Gln Lys Arg Phe Arg
195 200 205
His His Leu Gly Asp Asp Val Val Leu Phe Thr Thr Asp Gly Ala His
210 215 220
Lys Thr Phe Leu Lys Cys Gly Ala Leu Gln Gly Leu Tyr Thr Thr Val
225 230 235 240
Asp Phe Gly Thr Gly Ser Asn Ile Thr Asp Ala Phe Leu Ser Gln Arg
245 250 255
Lys Cys Glu Pro Lys Gly Pro Leu Ile Asn Ser Glu Phe Tyr Thr Gly
260 265 270
Trp Leu Asp His Trp Gly Gln Pro His Ser Thr Ile Lys Thr Glu Ala
275 280 285
Val Ala Ser Ser Leu Tyr Asp Ile Leu Ala Arg Gly Ala Ser Val Asn
290 295 300
Leu Tyr Met Phe Ile Gly Gly Thr Asn Phe Ala Tyr Trp Asn Gly Ala
305 310 315 320
Asn Ser Pro Tyr Ala Ala Gln Pro Thr Ser Tyr Asp Tyr Asp Ala Pro
325 330 335
Leu Ser Glu Ala Gly Asp Leu Thr Glu Lys Tyr Phe Ala Leu Arg Asn
340 345 350
Ile Ile Gln Lys Phe Glu Lys Val Pro Glu Gly Pro Ile Pro Pro Ser
355 360 365
Thr Pro Lys Phe Ala Tyr Gly Lys Val Thr Leu Glu Lys Leu Lys Thr
370 375 380
Val Gly Ala Ala Leu Asp Ile Leu Cys Pro Ser Gly Pro Ile Lys Ser
385 390 395 400
Leu Tyr Pro Leu Thr Phe Ile Gln Val Lys Gln His Tyr Gly Phe Val
405 410 415
Leu Tyr Arg Thr Thr Leu Pro Gln Asp Cys Ser Asn Pro Ala Pro Leu
420 425 430
Ser Ser Pro Leu Asn Gly Val His Asp Arg Ala Tyr Val Ala Val Asp
435 440 445
Gly Ile Pro Gln Gly Val Leu Glu Arg Asn Asn Val Ile Thr Leu Asn
450 455 460
Ile Thr Gly Lys Ala Gly Ala Thr Leu Asp Leu Leu Val Glu Asn Met
465 470 475 480
Gly Arg Val Asn Tyr Gly Ala Tyr Ile Asn Asp Phe Lys Gly Leu Val
485 490 495
Ser Asn Leu Thr Leu Ser Ser Asn Ile Leu Thr Asp Trp Thr Ile Phe
500 505 510
Pro Leu Asp Thr Glu Asp Ala Val Arg Ser His Leu Gly Gly Trp Gly
515 520 525
His Arg Asp Ser Gly His His Asp Glu Ala Trp Ala His Asn Ser Ser
530 535 540
Asn Tyr Thr Leu Pro Ala Phe Tyr Met Gly Asn Phe Ser Ile Pro Ser
545 550 555 560
Gly Ile Pro Asp Leu Pro Gln Asp Thr Phe Ile Gln Phe Pro Gly Trp
565 570 575
Thr Lys Gly Gln Val Trp Ile Asn Gly Phe Asn Leu Gly Arg Tyr Trp
580 585 590
Pro Ala Arg Gly Pro Gln Leu Thr Leu Phe Val Pro Gln His Ile Leu
595 600 605
Met Thr Ser Ala Pro Asn Thr Ile Thr Val Leu Glu Leu Glu Trp Ala
610 615 620
Pro Cys Ser Ser Asp Asp Pro Glu Leu Cys Ala Val Thr Phe Val Asp
625 630 635 640
Arg Pro Val Ile Gly Ser Ser Val Thr Tyr Asp His Pro Ser Lys Pro
645 650 655
Val Glu Lys Arg Leu Met Pro Pro Pro Pro Gln Lys Asn Lys Asp Ser
660 665 670
Trp Leu Asp His Val
675
<210> 40
<211> 1590
<212> DNA
<213> Homo sapiens
<400> 40
atgacaagct ccaggctttg gttttcgctg ctgctggcgg cagcgttcgc aggacgggcg 60
acggccctct ggccctggcc tcagaacttc caaacctccg accagcgcta cgtcctttac 120
ccgaacaact ttcaattcca gtacgatgtc agctcggccg cgcagcccgg ctgctcagtc 180
ctcgacgagg ccttccagcg ctatcgtgac ctgcttttcg gttccgggtc ttggccccgt 240
ccttacctca cagggaaacg gcatacactg gagaagaatg tgttggttgt ctctgtagtc 300
acacctggat gtaaccagct tcctactttg gagtcagtgg agaattatac cctgaccata 360
aatgatgacc agtgtttact cctctctgag actgtctggg gagctctccg aggtctggag 420
acttttagcc agcttgtttg gaaatctgct gagggcacat tctttatcaa caagactgag 480
attgaggact ttccccgctt tcctcaccgg ggcttgctgt tggatacatc tcgccattac 540
ctgccactct ctagcatcct ggacactctg gatgtcatgg cgtacaataa attgaacgtg 600
ttccactggc atctggtaga tgatccttcc ttcccatatg agagcttcac ttttccagag 660
ctcatgagaa aggggtccta caaccctgtc acccacatct acacagcaca ggatgtgaag 720
gaggtcattg aatacgcacg gctccggggt atccgtgtgc ttgcagagtt tgacactcct 780
ggccacactt tgtcctgggg accaggtatc cctggattac tgactccttg ctactctggg 840
tctgagccct ctggcacctt tggaccagtg aatcccagtc tcaataatac ctatgagttc 900
atgagcacat tcttcttaga agtcagctct gtcttcccag atttttatct tcatcttgga 960
ggagatgagg ttgatttcac ctgctggaag tccaacccag agatccagga ctttatgagg 1020
aagaaaggct tcggtgagga cttcaagcag ctggagtcct tctacatcca gacgctgctg 1080
gacatcgtct cttcttatgg caagggctat gtggtgtggc aggaggtgtt tgataataaa 1140
gtaaagattc agccagacac aatcatacag gtgtggcgag aggatattcc agtgaactat 1200
atgaaggagc tggaactggt caccaaggcc ggcttccggg cccttctctc tgccccctgg 1260
tacctgaacc gtatatccta tggccctgac tggaaggatt tctacatagt ggaacccctg 1320
gcatttgaag gtacccctga gcagaaggct ctggtgattg gtggagaggc ttgtatgtgg 1380
ggagaatatg tggacaacac aaacctggtc cccaggctct ggcccagagc aggggctgtt 1440
gccgaaaggc tgtggagcaa caagttgaca tctgacctga catttgccta tgaacgtttg 1500
tcacacttcc gctgtgaatt gctgaggcga ggtgtccagg cccaacccct caatgtaggc 1560
ttctgtgagc aggagtttga acagacctga 1590
<210> 41
<211> 529
<212> PRT
<213> Homo sapiens
<400> 41
Met Thr Ser Ser Arg Leu Trp Phe Ser Leu Leu Leu Ala Ala Ala Phe
1 5 10 15
Ala Gly Arg Ala Thr Ala Leu Trp Pro Trp Pro Gln Asn Phe Gln Thr
20 25 30
Ser Asp Gln Arg Tyr Val Leu Tyr Pro Asn Asn Phe Gln Phe Gln Tyr
35 40 45
Asp Val Ser Ser Ala Ala Gln Pro Gly Cys Ser Val Leu Asp Glu Ala
50 55 60
Phe Gln Arg Tyr Arg Asp Leu Leu Phe Gly Ser Gly Ser Trp Pro Arg
65 70 75 80
Pro Tyr Leu Thr Gly Lys Arg His Thr Leu Glu Lys Asn Val Leu Val
85 90 95
Val Ser Val Val Thr Pro Gly Cys Asn Gln Leu Pro Thr Leu Glu Ser
100 105 110
Val Glu Asn Tyr Thr Leu Thr Ile Asn Asp Asp Gln Cys Leu Leu Leu
115 120 125
Ser Glu Thr Val Trp Gly Ala Leu Arg Gly Leu Glu Thr Phe Ser Gln
130 135 140
Leu Val Trp Lys Ser Ala Glu Gly Thr Phe Phe Ile Asn Lys Thr Glu
145 150 155 160
Ile Glu Asp Phe Pro Arg Phe Pro His Arg Gly Leu Leu Leu Asp Thr
165 170 175
Ser Arg His Tyr Leu Pro Leu Ser Ser Ile Leu Asp Thr Leu Asp Val
180 185 190
Met Ala Tyr Asn Lys Leu Asn Val Phe His Trp His Leu Val Asp Asp
195 200 205
Pro Ser Phe Pro Tyr Glu Ser Phe Thr Phe Pro Glu Leu Met Arg Lys
210 215 220
Gly Ser Tyr Asn Pro Val Thr His Ile Tyr Thr Ala Gln Asp Val Lys
225 230 235 240
Glu Val Ile Glu Tyr Ala Arg Leu Arg Gly Ile Arg Val Leu Ala Glu
245 250 255
Phe Asp Thr Pro Gly His Thr Leu Ser Trp Gly Pro Gly Ile Pro Gly
260 265 270
Leu Leu Thr Pro Cys Tyr Ser Gly Ser Glu Pro Ser Gly Thr Phe Gly
275 280 285
Pro Val Asn Pro Ser Leu Asn Asn Thr Tyr Glu Phe Met Ser Thr Phe
290 295 300
Phe Leu Glu Val Ser Ser Val Phe Pro Asp Phe Tyr Leu His Leu Gly
305 310 315 320
Gly Asp Glu Val Asp Phe Thr Cys Trp Lys Ser Asn Pro Glu Ile Gln
325 330 335
Asp Phe Met Arg Lys Lys Gly Phe Gly Glu Asp Phe Lys Gln Leu Glu
340 345 350
Ser Phe Tyr Ile Gln Thr Leu Leu Asp Ile Val Ser Ser Tyr Gly Lys
355 360 365
Gly Tyr Val Val Trp Gln Glu Val Phe Asp Asn Lys Val Lys Ile Gln
370 375 380
Pro Asp Thr Ile Ile Gln Val Trp Arg Glu Asp Ile Pro Val Asn Tyr
385 390 395 400
Met Lys Glu Leu Glu Leu Val Thr Lys Ala Gly Phe Arg Ala Leu Leu
405 410 415
Ser Ala Pro Trp Tyr Leu Asn Arg Ile Ser Tyr Gly Pro Asp Trp Lys
420 425 430
Asp Phe Tyr Ile Val Glu Pro Leu Ala Phe Glu Gly Thr Pro Glu Gln
435 440 445
Lys Ala Leu Val Ile Gly Gly Glu Ala Cys Met Trp Gly Glu Tyr Val
450 455 460
Asp Asn Thr Asn Leu Val Pro Arg Leu Trp Pro Arg Ala Gly Ala Val
465 470 475 480
Ala Glu Arg Leu Trp Ser Asn Lys Leu Thr Ser Asp Leu Thr Phe Ala
485 490 495
Tyr Glu Arg Leu Ser His Phe Arg Cys Glu Leu Leu Arg Arg Gly Val
500 505 510
Gln Ala Gln Pro Leu Asn Val Gly Phe Cys Glu Gln Glu Phe Glu Gln
515 520 525
Thr
<210> 42
<211> 1671
<212> DNA
<213> Homo sapiens
<400> 42
atggagctgt gcgggctggg gctgccccgg ccgcccatgc tgctggcgct gctgttggcg 60
acactgctgg cggcgatgtt ggcgctgctg actcaggtgg cgctggtggt gcaggtggcg 120
gaggcggctc gggccccgag cgtctcggcc aagccggggc cggcgctgtg gcccctgccg 180
ctcttggtga agatgacccc gaacctgctg catctcgccc cggagaactt ctacatcagc 240
cacagcccca attccacggc gggcccctcc tgcaccctgc tggaggaagc gtttcgacga 300
tatcatggct atatttttgg tttctacaag tggcatcatg aacctgctga attccaggct 360
aaaacccagg ttcagcaact tcttgtctca atcacccttc agtcagagtg tgatgctttc 420
cccaacatat cttcagatga gtcttatact ttacttgtga aagaaccagt ggctgtcctt 480
aaggccaaca gagtttgggg agcattacga ggtttagaga cctttagcca gttagtttat 540
caagattctt atggaacttt caccatcaat gaatccacca ttattgattc tccaaggttt 600
tctcacagag gaattttgat tgatacatcc agacattatc tgccagttaa gattattctt 660
aaaactctgg atgccatggc ttttaataag tttaatgttc ttcactggca catagttgat 720
gaccagtctt tcccatatca gagcatcact tttcctgagt taagcaataa aggaagctat 780
tctttgtctc atgtttatac accaaatgat gtccgtatgg tgattgaata tgccagatta 840
cgaggaattc gagtcctgcc agaatttgat acccctgggc atacactatc ttggggaaaa 900
ggtcagaaag acctcctgac tccatgttac agtagacaaa acaagttgga ctcttttgga 960
cctataaacc ctactctgaa tacaacatac agcttcctta ctacattttt caaagaaatt 1020
agtgaggtgt ttccagatca attcattcat ttgggaggag atgaagtgga atttaaatgt 1080
tgggaatcaa atccaaaaat tcaagatttc atgaggcaaa aaggctttgg cacagatttt 1140
aagaaactag aatctttcta cattcaaaag gttttggata ttattgcaac cataaacaag 1200
ggatccattg tctggcagga ggtttttgat gataaagcaa agcttgcgcc gggcacaata 1260
gttgaagtat ggaaagacag cgcatatcct gaggaactca gtagagtcac agcatctggc 1320
ttccctgtaa tcctttctgc tccttggtac ttagatttga ttagctatgg acaagattgg 1380
aggaaatact ataaagtgga acctcttgat tttggcggta ctcagaaaca gaaacaactt 1440
ttcattggtg gagaagcttg tctatgggga gaatatgtgg atgcaactaa cctcactcca 1500
agattatggc ctcgggcaag tgctgttggt gagagactct ggagttccaa agatgtcaga 1560
gatatggatg acgcctatga cagactgaca aggcaccgct gcaggatggt cgaacgtgga 1620
atagctgcac aacctcttta tgctggatat tgtaaccatg agaacatgta a 1671
<210> 43
<211> 556
<212> PRT
<213> Homo sapiens
<400> 43
Met Glu Leu Cys Gly Leu Gly Leu Pro Arg Pro Pro Met Leu Leu Ala
1 5 10 15
Leu Leu Leu Ala Thr Leu Leu Ala Ala Met Leu Ala Leu Leu Thr Gln
20 25 30
Val Ala Leu Val Val Gln Val Ala Glu Ala Ala Arg Ala Pro Ser Val
35 40 45
Ser Ala Lys Pro Gly Pro Ala Leu Trp Pro Leu Pro Leu Ser Val Lys
50 55 60
Met Thr Pro Asn Leu Leu His Leu Ala Pro Glu Asn Phe Tyr Ile Ser
65 70 75 80
His Ser Pro Asn Ser Thr Ala Gly Pro Ser Cys Thr Leu Leu Glu Glu
85 90 95
Ala Phe Arg Arg Tyr His Gly Tyr Ile Phe Gly Phe Tyr Lys Trp His
100 105 110
His Glu Pro Ala Glu Phe Gln Ala Lys Thr Gln Val Gln Gln Leu Leu
115 120 125
Val Ser Ile Thr Leu Gln Ser Glu Cys Asp Ala Phe Pro Asn Ile Ser
130 135 140
Ser Asp Glu Ser Tyr Thr Leu Leu Val Lys Glu Pro Val Ala Val Leu
145 150 155 160
Lys Ala Asn Arg Val Trp Gly Ala Leu Arg Gly Leu Glu Thr Phe Ser
165 170 175
Gln Leu Val Tyr Gln Asp Ser Tyr Gly Thr Phe Thr Ile Asn Glu Ser
180 185 190
Thr Ile Ile Asp Ser Pro Arg Phe Ser His Arg Gly Ile Leu Ile Asp
195 200 205
Thr Ser Arg His Tyr Leu Pro Val Lys Ile Ile Leu Lys Thr Leu Asp
210 215 220
Ala Met Ala Phe Asn Lys Phe Asn Val Leu His Trp His Ile Val Asp
225 230 235 240
Asp Gln Ser Phe Pro Tyr Gln Ser Ile Thr Phe Pro Glu Leu Ser Asn
245 250 255
Lys Gly Ser Tyr Ser Leu Ser His Val Tyr Thr Pro Asn Asp Val Arg
260 265 270
Met Val Ile Glu Tyr Ala Arg Leu Arg Gly Ile Arg Val Leu Pro Glu
275 280 285
Phe Asp Thr Pro Gly His Thr Leu Ser Trp Gly Lys Gly Gln Lys Asp
290 295 300
Leu Leu Thr Pro Cys Tyr Ser Arg Gln Asn Lys Leu Asp Ser Phe Gly
305 310 315 320
Pro Ile Asn Pro Thr Leu Asn Thr Thr Tyr Ser Phe Leu Thr Thr Phe
325 330 335
Phe Lys Glu Ile Ser Glu Val Phe Pro Asp Gln Phe Ile His Leu Gly
340 345 350
Gly Asp Glu Val Glu Phe Lys Cys Trp Glu Ser Asn Pro Lys Ile Gln
355 360 365
Asp Phe Met Arg Gln Lys Gly Phe Gly Thr Asp Phe Lys Lys Leu Glu
370 375 380
Ser Phe Tyr Ile Gln Lys Val Leu Asp Ile Ile Ala Thr Ile Asn Lys
385 390 395 400
Gly Ser Ile Val Trp Gln Glu Val Phe Asp Asp Lys Ala Lys Leu Ala
405 410 415
Pro Gly Thr Ile Val Glu Val Trp Lys Asp Ser Ala Tyr Pro Glu Glu
420 425 430
Leu Ser Arg Val Thr Ala Ser Gly Phe Pro Val Ile Leu Ser Ala Pro
435 440 445
Trp Tyr Leu Asp Leu Ile Ser Tyr Gly Gln Asp Trp Arg Lys Tyr Tyr
450 455 460
Lys Val Glu Pro Leu Asp Phe Gly Gly Thr Gln Lys Gln Lys Gln Leu
465 470 475 480
Phe Ile Gly Gly Glu Ala Cys Leu Trp Gly Glu Tyr Val Asp Ala Thr
485 490 495
Asn Leu Thr Pro Arg Leu Trp Pro Arg Ala Ser Ala Val Gly Glu Arg
500 505 510
Leu Trp Ser Ser Lys Asp Val Arg Asp Met Asp Asp Ala Tyr Asp Arg
515 520 525
Leu Thr Arg His Arg Cys Arg Met Val Glu Arg Gly Ile Ala Ala Gln
530 535 540
Pro Leu Tyr Ala Gly Tyr Cys Asn His Glu Asn Met
545 550 555
<210> 44
<211> 582
<212> DNA
<213> Homo sapiens
<400> 44
atgcagtccc tgatgcaggc tcccctcctg atcgccctgg gcttgcttct cgcggcccct 60
gcgcaagccc acctgaaaaa gccatcccag ctcagtagct tttcctggga taactgtgat 120
gaagggaagg accctgcggt gatcagaagc ctgactctgg agcctgaccc catcatcgtt 180
cctggaaatg tgaccctcag tgtcatgggc agcaccagtg tccccctgag ttctcctctg 240
aaggtggatt tagttttgga gaaggaggtg gctggcctct ggatcaagat cccatgcaca 300
gactacattg gcagctgtac ctttgaacac ttctgtgatg tgcttgacat gttaattcct 360
actggggagc cctgcccaga gcccctgcgt acctatgggc ttccttgcca ctgtcccttc 420
aaagaaggaa cctactcact gcccaagagc gaattcgttg tgcctgacct ggagctgccc 480
agttggctca ccaccgggaa ctaccgcata gagagcgtcc tgagcagcag tgggaagcgt 540
ctgggctgca tcaagatcgc tgcctctcta aagggcatat aa 582
<210> 45
<211> 193
<212> PRT
<213> Homo sapiens
<400> 45
Met Gln Ser Leu Met Gln Ala Pro Leu Leu Ile Ala Leu Gly Leu Leu
1 5 10 15
Leu Ala Ala Pro Ala Gln Ala His Leu Lys Lys Pro Ser Gln Leu Ser
20 25 30
Ser Phe Ser Trp Asp Asn Cys Asp Glu Gly Lys Asp Pro Ala Val Ile
35 40 45
Arg Ser Leu Thr Leu Glu Pro Asp Pro Ile Ile Val Pro Gly Asn Val
50 55 60
Thr Leu Ser Val Met Gly Ser Thr Ser Val Pro Leu Ser Ser Pro Leu
65 70 75 80
Lys Val Asp Leu Val Leu Glu Lys Glu Val Ala Gly Leu Trp Ile Lys
85 90 95
Ile Pro Cys Thr Asp Tyr Ile Gly Ser Cys Thr Phe Glu His Phe Cys
100 105 110
Asp Val Leu Asp Met Leu Ile Pro Thr Gly Glu Pro Cys Pro Glu Pro
115 120 125
Leu Arg Thr Tyr Gly Leu Pro Cys His Cys Pro Phe Lys Glu Gly Thr
130 135 140
Tyr Ser Leu Pro Lys Ser Glu Phe Val Val Pro Asp Leu Glu Leu Pro
145 150 155 160
Ser Trp Leu Thr Thr Gly Asn Tyr Arg Ile Glu Ser Val Leu Ser Ser
165 170 175
Ser Gly Lys Arg Leu Gly Cys Ile Lys Ile Ala Ala Ser Leu Lys Gly
180 185 190
Ile
<210> 46
<211> 1611
<212> DNA
<213> Homo sapiens
<400> 46
atggagtttt caagtccttc cagagaggaa tgtcccaagc ctttgagtag ggtaagcatc 60
atggctggca gcctcacagg attgcttcta cttcaggcag tgtcgtgggc atcaggtgcc 120
cgcccctgca tccctaaaag cttcggctac agctcggtgg tgtgtgtctg caatgccaca 180
tactgtgact cctttgaccc cccgaccttt cctgcccttg gtaccttcag ccgctatgag 240
agtacacgca gtgggcgacg gatggagctg agtatggggc ccatccaggc taatcacacg 300
ggcacaggcc tgctactgac cctgcagcca gaacagaagt tccagaaagt gaagggattt 360
ggaggggcca tgacagatgc tgctgctctc aacatccttg ccctgtcacc ccctgcccaa 420
aatttgctac ttaaatcgta cttctctgaa gaaggaatcg gatataacat catccgggta 480
cccatggcca gctgtgactt ctccatccgc acctacacct atgcagacac ccctgatgat 540
ttccagttgc acaacttcag cctcccagag gaagatacca agctcaagat acccctgatt 600
caccgagccc tgcagttggc ccagcgtccc gtttcactcc ttgccagccc ctggacatca 660
cccacttggc tcaagaccaa tggagcggtg aatgggaagg ggtcactcaa gggacagccc 720
ggagacatct accaccagac ctgggccaga tactttgtga agttcctgga tgcctatgct 780
gagcacaagt tacagttctg ggcagtgaca gctgaaaatg agccttctgc tgggctgttg 840
agtggatacc ccttccagtg cctgggcttc acccctgaac atcagcgaga cttcattgcc 900
cgtgacctag gtcctaccct cgccaacagt actcaccaca atgtccgcct actcatgctg 960
gatgaccaac gcttgctgct gccccactgg gcaaaggtgg tactgacaga cccagaagca 1020
gctaaatatg ttcatggcat tgctgtacat tggtacctgg actttctggc tccagccaaa 1080
gccaccctag gggagacaca ccgcctgttc cccaacacca tgctctttgc ctcagaggcc 1140
tgtgtgggct ccaagttctg ggagcagagt gtgcggctag gctcctggga tcgagggatg 1200
cagtacagcc acagcatcat cacgaacctc ctgtaccatg tggtcggctg gaccgactgg 1260
aaccttgccc tgaaccccga aggaggaccc aattgggtgc gtaactttgt cgacagtccc 1320
atcattgtag acatcaccaa ggacacgttt tacaaacagc ccatgttcta ccaccttggc 1380
cacttcagca agttcattcc tgagggctcc cagagagtgg ggctggttgc cagtcagaag 1440
aacgacctgg acgcagtggc actgatgcat cccgatggct ctgctgttgt ggtcgtgcta 1500
aaccgctcct ctaaggatgt gcctcttacc atcaaggatc ctgctgtggg cttcctggag 1560
acaatctcac ctggctactc cattcacacc tacctgtggc gtcgccagtg a 1611
<210> 47
<211> 536
<212> PRT
<213> Homo sapiens
<400> 47
Met Glu Phe Ser Ser Pro Ser Arg Glu Glu Cys Pro Lys Pro Leu Ser
1 5 10 15
Arg Val Ser Ile Met Ala Gly Ser Leu Thr Gly Leu Leu Leu Leu Gln
20 25 30
Ala Val Ser Trp Ala Ser Gly Ala Arg Pro Cys Ile Pro Lys Ser Phe
35 40 45
Gly Tyr Ser Ser Val Val Cys Val Cys Asn Ala Thr Tyr Cys Asp Ser
50 55 60
Phe Asp Pro Pro Thr Phe Pro Ala Leu Gly Thr Phe Ser Arg Tyr Glu
65 70 75 80
Ser Thr Arg Ser Gly Arg Arg Met Glu Leu Ser Met Gly Pro Ile Gln
85 90 95
Ala Asn His Thr Gly Thr Gly Leu Leu Leu Thr Leu Gln Pro Glu Gln
100 105 110
Lys Phe Gln Lys Val Lys Gly Phe Gly Gly Ala Met Thr Asp Ala Ala
115 120 125
Ala Leu Asn Ile Leu Ala Leu Ser Pro Pro Ala Gln Asn Leu Leu Leu
130 135 140
Lys Ser Tyr Phe Ser Glu Glu Gly Ile Gly Tyr Asn Ile Ile Arg Val
145 150 155 160
Pro Met Ala Ser Cys Asp Phe Ser Ile Arg Thr Tyr Thr Tyr Ala Asp
165 170 175
Thr Pro Asp Asp Phe Gln Leu His Asn Phe Ser Leu Pro Glu Glu Asp
180 185 190
Thr Lys Leu Lys Ile Pro Leu Ile His Arg Ala Leu Gln Leu Ala Gln
195 200 205
Arg Pro Val Ser Leu Leu Ala Ser Pro Trp Thr Ser Pro Thr Trp Leu
210 215 220
Lys Thr Asn Gly Ala Val Asn Gly Lys Gly Ser Leu Lys Gly Gln Pro
225 230 235 240
Gly Asp Ile Tyr His Gln Thr Trp Ala Arg Tyr Phe Val Lys Phe Leu
245 250 255
Asp Ala Tyr Ala Glu His Lys Leu Gln Phe Trp Ala Val Thr Ala Glu
260 265 270
Asn Glu Pro Ser Ala Gly Leu Leu Ser Gly Tyr Pro Phe Gln Cys Leu
275 280 285
Gly Phe Thr Pro Glu His Gln Arg Asp Phe Ile Ala Arg Asp Leu Gly
290 295 300
Pro Thr Leu Ala Asn Ser Thr His His Asn Val Arg Leu Leu Met Leu
305 310 315 320
Asp Asp Gln Arg Leu Leu Leu Pro His Trp Ala Lys Val Val Leu Thr
325 330 335
Asp Pro Glu Ala Ala Lys Tyr Val His Gly Ile Ala Val His Trp Tyr
340 345 350
Leu Asp Phe Leu Ala Pro Ala Lys Ala Thr Leu Gly Glu Thr His Arg
355 360 365
Leu Phe Pro Asn Thr Met Leu Phe Ala Ser Glu Ala Cys Val Gly Ser
370 375 380
Lys Phe Trp Glu Gln Ser Val Arg Leu Gly Ser Trp Asp Arg Gly Met
385 390 395 400
Gln Tyr Ser His Ser Ile Ile Thr Asn Leu Leu Tyr His Val Val Gly
405 410 415
Trp Thr Asp Trp Asn Leu Ala Leu Asn Pro Glu Gly Gly Pro Asn Trp
420 425 430
Val Arg Asn Phe Val Asp Ser Pro Ile Ile Val Asp Ile Thr Lys Asp
435 440 445
Thr Phe Tyr Lys Gln Pro Met Phe Tyr His Leu Gly His Phe Ser Lys
450 455 460
Phe Ile Pro Glu Gly Ser Gln Arg Val Gly Leu Val Ala Ser Gln Lys
465 470 475 480
Asn Asp Leu Asp Ala Val Ala Leu Met His Pro Asp Gly Ser Ala Val
485 490 495
Val Val Val Leu Asn Arg Ser Ser Lys Asp Val Pro Leu Thr Ile Lys
500 505 510
Asp Pro Ala Val Gly Phe Leu Glu Thr Ile Ser Pro Gly Tyr Ser Ile
515 520 525
His Thr Tyr Leu Trp Arg Arg Gln
530 535
<210> 48
<211> 1575
<212> DNA
<213> Homo sapiens
<400> 48
atgtacgccc tcttcctcct ggccagcctc ctgggcgcgg ctctagccgg cccggtcctt 60
ggactgaaag aatgcaccag gggctcggca gtgtggtgcc agaatgtgaa gacggcgtcc 120
gactgcgggg cagtgaagca ctgcctgcag accgtttgga acaagccaac agtgaaatcc 180
cttccctgcg acatatgcaa agacgttgtc accgcagctg gtgatatgct gaaggacaat 240
gccactgagg aggagatcct tgtttacttg gagaagacct gtgactggct tccgaaaccg 300
aacatgtctg cttcatgcaa ggagatagtg gactcctacc tccctgtcat cctggacatc 360
attaaaggag aaatgagccg tcctggggag gtgtgctctg ctctcaacct ctgcgagtct 420
ctccagaagc acctagcaga gctgaatcac cagaagcagc tggagtccaa taagatccca 480
gagctggaca tgactgaggt ggtggccccc ttcatggcca acatccctct cctcctctac 540
cctcaggacg gcccccgcag caagccccag ccaaaggata atggggacgt ttgccaggac 600
tgcattcaga tggtgactga catccagact gctgtacgga ccaactccac ctttgtccag 660
gccttggtgg aacatgtcaa ggaggagtgt gaccgcctgg gccctggcat ggccgacata 720
tgcaagaact atatcagcca gtattctgaa attgctatcc agatgatgat gcacatgcaa 780
cccaaggaga tctgtgcgct ggttgggttc tgtgatgagg tgaaagagat gcccatgcag 840
actctggtcc ccgccaaagt ggcctccaag aatgtcatcc ctgccctgga actggtggag 900
cccattaaga agcacgaggt cccagcaaag tctgatgttt actgtgaggt gtgtgaattc 960
ctggtgaagg aggtgaccaa gctgattgac aacaacaaga ctgagaaaga aatactcgac 1020
gcttttgaca aaatgtgctc gaagctgccg aagtccctgt cggaagagtg ccaggaggtg 1080
gtggacacgt acggcagctc catcctgtcc atcctgctgg aggaggtcag ccctgagctg 1140
gtgtgcagca tgctgcacct ctgctctggc acgcggctgc ctgcactgac cgttcacgtg 1200
actcagccaa aggacggtgg cttctgcgaa gtgtgcaaga agctggtggg ttatttggat 1260
cgcaacctgg agaaaaacag caccaagcag gagatcctgg ctgctcttga gaaaggctgc 1320
agcttcctgc cagaccctta ccagaagcag tgtgatcagt ttgtggcaga gtacgagccc 1380
gtgctgatcg agatcctggt ggaggtgatg gatccttcct tcgtgtgctt gaaaattgga 1440
gcctgcccct cggcccataa gcccttgttg ggaactgaga agtgtatatg gggcccaagc 1500
tactggtgcc agaacacaga gacagcagcc cagtgcaatg ctgtcgagca ttgcaaacgc 1560
catgtgtgga actag 1575
<210> 49
<211> 524
<212> PRT
<213> Homo sapiens
<400> 49
Met Tyr Ala Leu Phe Leu Leu Ala Ser Leu Leu Gly Ala Ala Leu Ala
1 5 10 15
Gly Pro Val Leu Gly Leu Lys Glu Cys Thr Arg Gly Ser Ala Val Trp
20 25 30
Cys Gln Asn Val Lys Thr Ala Ser Asp Cys Gly Ala Val Lys His Cys
35 40 45
Leu Gln Thr Val Trp Asn Lys Pro Thr Val Lys Ser Leu Pro Cys Asp
50 55 60
Ile Cys Lys Asp Val Val Thr Ala Ala Gly Asp Met Leu Lys Asp Asn
65 70 75 80
Ala Thr Glu Glu Glu Ile Leu Val Tyr Leu Glu Lys Thr Cys Asp Trp
85 90 95
Leu Pro Lys Pro Asn Met Ser Ala Ser Cys Lys Glu Ile Val Asp Ser
100 105 110
Tyr Leu Pro Val Ile Leu Asp Ile Ile Lys Gly Glu Met Ser Arg Pro
115 120 125
Gly Glu Val Cys Ser Ala Leu Asn Leu Cys Glu Ser Leu Gln Lys His
130 135 140
Leu Ala Glu Leu Asn His Gln Lys Gln Leu Glu Ser Asn Lys Ile Pro
145 150 155 160
Glu Leu Asp Met Thr Glu Val Val Ala Pro Phe Met Ala Asn Ile Pro
165 170 175
Leu Leu Leu Tyr Pro Gln Asp Gly Pro Arg Ser Lys Pro Gln Pro Lys
180 185 190
Asp Asn Gly Asp Val Cys Gln Asp Cys Ile Gln Met Val Thr Asp Ile
195 200 205
Gln Thr Ala Val Arg Thr Asn Ser Thr Phe Val Gln Ala Leu Val Glu
210 215 220
His Val Lys Glu Glu Cys Asp Arg Leu Gly Pro Gly Met Ala Asp Ile
225 230 235 240
Cys Lys Asn Tyr Ile Ser Gln Tyr Ser Glu Ile Ala Ile Gln Met Met
245 250 255
Met His Met Gln Pro Lys Glu Ile Cys Ala Leu Val Gly Phe Cys Asp
260 265 270
Glu Val Lys Glu Met Pro Met Gln Thr Leu Val Pro Ala Lys Val Ala
275 280 285
Ser Lys Asn Val Ile Pro Ala Leu Glu Leu Val Glu Pro Ile Lys Lys
290 295 300
His Glu Val Pro Ala Lys Ser Asp Val Tyr Cys Glu Val Cys Glu Phe
305 310 315 320
Leu Val Lys Glu Val Thr Lys Leu Ile Asp Asn Asn Lys Thr Glu Lys
325 330 335
Glu Ile Leu Asp Ala Phe Asp Lys Met Cys Ser Lys Leu Pro Lys Ser
340 345 350
Leu Ser Glu Glu Cys Gln Glu Val Val Asp Thr Tyr Gly Ser Ser Ile
355 360 365
Leu Ser Ile Leu Leu Glu Glu Val Ser Pro Glu Leu Val Cys Ser Met
370 375 380
Leu His Leu Cys Ser Gly Thr Arg Leu Pro Ala Leu Thr Val His Val
385 390 395 400
Thr Gln Pro Lys Asp Gly Gly Phe Cys Glu Val Cys Lys Lys Leu Val
405 410 415
Gly Tyr Leu Asp Arg Asn Leu Glu Lys Asn Ser Thr Lys Gln Glu Ile
420 425 430
Leu Ala Ala Leu Glu Lys Gly Cys Ser Phe Leu Pro Asp Pro Tyr Gln
435 440 445
Lys Gln Cys Asp Gln Phe Val Ala Glu Tyr Glu Pro Val Leu Ile Glu
450 455 460
Ile Leu Val Glu Val Met Asp Pro Ser Phe Val Cys Leu Lys Ile Gly
465 470 475 480
Ala Cys Pro Ser Ala His Lys Pro Leu Leu Gly Thr Glu Lys Cys Ile
485 490 495
Trp Gly Pro Ser Tyr Trp Cys Gln Asn Thr Glu Thr Ala Ala Gln Cys
500 505 510
Asn Ala Val Glu His Cys Lys Arg His Val Trp Asn
515 520
<210> 50
<211> 80
<212> PRT
<213> Homo sapiens
<400> 50
Ser Asp Val Tyr Cys Glu Val Cys Glu Phe Leu Val Lys Glu Val Thr
1 5 10 15
Lys Leu Ile Asp Asn Asn Lys Thr Glu Lys Glu Ile Leu Asp Ala Phe
20 25 30
Asp Lys Met Cys Ser Lys Leu Pro Lys Ser Leu Ser Glu Glu Cys Gln
35 40 45
Glu Val Val Asp Thr Tyr Gly Ser Ser Ile Leu Ser Ile Leu Leu Glu
50 55 60
Glu Val Ser Pro Glu Leu Val Cys Ser Met Leu His Leu Cys Ser Gly
65 70 75 80
<210> 51
<211> 79
<212> PRT
<213> Homo sapiens
<400> 51
Gly Asp Val Cys Gln Asp Cys Ile Gln Met Val Thr Asp Ile Gln Thr
1 5 10 15
Ala Val Arg Thr Asn Ser Thr Phe Val Gln Ala Leu Val Glu His Val
20 25 30
Lys Glu Glu Cys Asp Arg Leu Gly Pro Gly Met Ala Asp Ile Cys Lys
35 40 45
Asn Tyr Ile Ser Gln Tyr Ser Glu Ile Ala Ile Gln Met Met Met His
50 55 60
Met Gln Pro Lys Glu Ile Cys Ala Leu Val Gly Phe Cys Asp Glu
65 70 75
<210> 52
<211> 1272
<212> DNA
<213> Homo sapiens
<400> 52
atgggcatgt accctggcgt cctggtgccc agctcccggg ggggcctgcc cctggaggag 60
gtgaccgtgg ccgaagtcct ggctgcccga ggctacctca caggaatggc cggcaagtgg 120
caccttgggg tggggcctga gggggccttc ctgccccccc atcagggctt ccatcgattt 180
ctaggcatcc cgtactccca cgaccagggc ccctgccaga acctgacctg cttcccgccg 240
gccactcctt gcgacggtgg ctgtgaccag ggcctggtcc ccatcccact gttggccaac 300
ctgtccgtgg aggcgcagcc cccctggctg cccggactag aggcccgcta catggctttc 360
gcccatgacc tcatggccga cgcccagcgc caggatcgcc ccttcttcct gtactatgcc 420
tctcaccaca cccactaccc tcagttcagt gggcagagct ttgcagagcg ttcaggccgc 480
gggccatttg gggactccct gatggagctg gatgcagctg tggggaccct gatgacagcc 540
ataggggacc tggggctgct tgaagagacg ctggtcatct tcactgcaga caatggacct 600
gagaccatgc gtatgtcccg aggcggctgc tccggtctct tgcggtgtgg aaagggaacg 660
acctacgagg gcggtgtccg agagcctgcc ttggccttct ggccaggtca tatcgctccc 720
ggcgtgaccc acgagctggc cagctccctg gacctgctgc ctaccctggc agccctggct 780
ggggccccac tgcccaatgt caccttggat ggctttgacc tcagccccct gctgctgggc 840
acaggcaaga gccctcggca gtctctcttc ttctacccgt cctacccaga cgaggtccgt 900
ggggtttttg ctgtgcggac tggaaagtac aaggctcact tcttcaccca gggctctgcc 960
cacagtgata ccactgcaga ccctgcctgc cacgcctcca gctctctgac tgctcatgag 1020
cccccgctgc tctatgacct gtccaaggac cctggtgaga actacaacct gctggggggt 1080
gtggccgggg ccaccccaga ggtgctgcaa gccctgaaac agcttcagct gctcaaggcc 1140
cagttagacg cagctgtgac cttcggcccc agccaggtgg cccggggcga ggaccccgcc 1200
ctgcagatct gctgtcatcc tggctgcacc ccccgcccag cttgctgcca ttgcccagat 1260
ccccatgcct ga 1272
<210> 53
<211> 507
<212> PRT
<213> Homo sapiens
<400> 53
Met Gly Ala Pro Arg Ser Leu Leu Leu Ala Leu Ala Ala Gly Leu Ala
1 5 10 15
Val Ala Arg Pro Pro Asn Ile Val Leu Ile Phe Ala Asp Asp Leu Gly
20 25 30
Tyr Gly Asp Leu Gly Cys Tyr Gly His Pro Ser Ser Thr Thr Pro Asn
35 40 45
Leu Asp Gln Leu Ala Ala Gly Gly Leu Arg Phe Thr Asp Phe Tyr Val
50 55 60
Pro Val Ser Leu Cys Thr Pro Ser Arg Ala Ala Leu Leu Thr Gly Arg
65 70 75 80
Leu Pro Val Arg Met Gly Met Tyr Pro Gly Val Leu Val Pro Ser Ser
85 90 95
Arg Gly Gly Leu Pro Leu Glu Glu Val Thr Val Ala Glu Val Leu Ala
100 105 110
Ala Arg Gly Tyr Leu Thr Gly Met Ala Gly Lys Trp His Leu Gly Val
115 120 125
Gly Pro Glu Gly Ala Phe Leu Pro Pro His Gln Gly Phe His Arg Phe
130 135 140
Leu Gly Ile Pro Tyr Ser His Asp Gln Gly Pro Cys Gln Asn Leu Thr
145 150 155 160
Cys Phe Pro Pro Ala Thr Pro Cys Asp Gly Gly Cys Asp Gln Gly Leu
165 170 175
Val Pro Ile Pro Leu Leu Ala Asn Leu Ser Val Glu Ala Gln Pro Pro
180 185 190
Trp Leu Pro Gly Leu Glu Ala Arg Tyr Met Ala Phe Ala His Asp Leu
195 200 205
Met Ala Asp Ala Gln Arg Gln Asp Arg Pro Phe Phe Leu Tyr Tyr Ala
210 215 220
Ser His His Thr His Tyr Pro Gln Phe Ser Gly Gln Ser Phe Ala Glu
225 230 235 240
Arg Ser Gly Arg Gly Pro Phe Gly Asp Ser Leu Met Glu Leu Asp Ala
245 250 255
Ala Val Gly Thr Leu Met Thr Ala Ile Gly Asp Leu Gly Leu Leu Glu
260 265 270
Glu Thr Leu Val Ile Phe Thr Ala Asp Asn Gly Pro Glu Thr Met Arg
275 280 285
Met Ser Arg Gly Gly Cys Ser Gly Leu Leu Arg Cys Gly Lys Gly Thr
290 295 300
Thr Tyr Glu Gly Gly Val Arg Glu Pro Ala Leu Ala Phe Trp Pro Gly
305 310 315 320
His Ile Ala Pro Gly Val Thr His Glu Leu Ala Ser Ser Leu Asp Leu
325 330 335
Leu Pro Thr Leu Ala Ala Leu Ala Gly Ala Pro Leu Pro Asn Val Thr
340 345 350
Leu Asp Gly Phe Asp Leu Ser Pro Leu Leu Leu Gly Thr Gly Lys Ser
355 360 365
Pro Arg Gln Ser Leu Phe Phe Tyr Pro Ser Tyr Pro Asp Glu Val Arg
370 375 380
Gly Val Phe Ala Val Arg Thr Gly Lys Tyr Lys Ala His Phe Phe Thr
385 390 395 400
Gln Gly Ser Ala His Ser Asp Thr Thr Ala Asp Pro Ala Cys His Ala
405 410 415
Ser Ser Ser Leu Thr Ala His Glu Pro Pro Leu Leu Tyr Asp Leu Ser
420 425 430
Lys Asp Pro Gly Glu Asn Tyr Asn Leu Leu Gly Gly Val Ala Gly Ala
435 440 445
Thr Pro Glu Val Leu Gln Ala Leu Lys Gln Leu Gln Leu Leu Lys Ala
450 455 460
Gln Leu Asp Ala Ala Val Thr Phe Gly Pro Ser Gln Val Ala Arg Gly
465 470 475 480
Glu Asp Pro Ala Leu Gln Ile Cys Cys His Pro Gly Cys Thr Pro Arg
485 490 495
Pro Ala Cys Cys His Cys Pro Asp Pro His Ala
500 505
<210> 54
<211> 1125
<212> DNA
<213> Homo sapiens
<400> 54
atggctgcgc ccgcactagg gctggtgtgt ggacgttgcc ctgagctggg tctcgtcctc 60
ttgctgctgc tgctctcgct gctgtgtgga gcggcaggga gccaggaggc cgggaccggt 120
gcgggcgcgg ggtcccttgc gggttcttgc ggctgcggca cgccccagcg gcctggcgcc 180
catggcagtt cggcagccgc tcaccgatac tcgcgggagg ctaacgctcc gggccccgta 240
cccggagagc ggcaactcgc gcactcaaag atggtcccca tccctgctgg agtatttaca 300
atgggcacag atgatcctca gataaagcag gatggggaag cacctgcgag gagagttact 360
attgatgcct tttacatgga tgcctatgaa gtcagtaata ctgaatttga gaagtttgtg 420
aactcaactg gctatttgac agaggctgag aagtttggcg actcctttgt ctttgaaggc 480
atgttgagtg agcaagtgaa gaccaatatt caacaggcag ttgcagctgc tccctggtgg 540
ttacctgtga aaggcgctaa ctggagacac ccagaagggc ctgactctac tattctgcac 600
aggccggatc atccagttct ccatgtgtcc tggaatgatg cggttgccta ctgcacttgg 660
gcagggaagc ggctgcccac ggaagctgag tgggaataca gctgtcgagg aggcctgcat 720
aatagacttt tcccctgggg caacaaactg cagcccaaag gccagcatta tgccaacatt 780
tggcagggcg agtttccggt gaccaacact ggtgaggatg gcttccaagg aactgcgcct 840
gttgatgcct tccctcccaa tggttatggc ttatacaaca tagtggggaa cgcatgggaa 900
tggacttcag actggtggac tgttcatcat tctgttgaag aaacgcttaa cccaaaaggt 960
cccccttctg ggaaagaccg agtgaagaaa ggtggatcct acatgtgcca taggtcttat 1020
tgttacaggt atcgctgtgc tgctcggagc cagaacacac ctgatagctc tgcttcgaat 1080
ctgggattcc gctgtgcagc cgaccgcctg cccactatgg actga 1125
<210> 55
<211> 374
<212> PRT
<213> Homo sapiens
<400> 55
Met Ala Ala Pro Ala Leu Gly Leu Val Cys Gly Arg Cys Pro Glu Leu
1 5 10 15
Gly Leu Val Leu Leu Leu Leu Leu Leu Ser Leu Leu Cys Gly Ala Ala
20 25 30
Gly Ser Gln Glu Ala Gly Thr Gly Ala Gly Ala Gly Ser Leu Ala Gly
35 40 45
Ser Cys Gly Cys Gly Thr Pro Gln Arg Pro Gly Ala His Gly Ser Ser
50 55 60
Ala Ala Ala His Arg Tyr Ser Arg Glu Ala Asn Ala Pro Gly Pro Val
65 70 75 80
Pro Gly Glu Arg Gln Leu Ala His Ser Lys Met Val Pro Ile Pro Ala
85 90 95
Gly Val Phe Thr Met Gly Thr Asp Asp Pro Gln Ile Lys Gln Asp Gly
100 105 110
Glu Ala Pro Ala Arg Arg Val Thr Ile Asp Ala Phe Tyr Met Asp Ala
115 120 125
Tyr Glu Val Ser Asn Thr Glu Phe Glu Lys Phe Val Asn Ser Thr Gly
130 135 140
Tyr Leu Thr Glu Ala Glu Lys Phe Gly Asp Ser Phe Val Phe Glu Gly
145 150 155 160
Met Leu Ser Glu Gln Val Lys Thr Asn Ile Gln Gln Ala Val Ala Ala
165 170 175
Ala Pro Trp Trp Leu Pro Val Lys Gly Ala Asn Trp Arg His Pro Glu
180 185 190
Gly Pro Asp Ser Thr Ile Leu His Arg Pro Asp His Pro Val Leu His
195 200 205
Val Ser Trp Asn Asp Ala Val Ala Tyr Cys Thr Trp Ala Gly Lys Arg
210 215 220
Leu Pro Thr Glu Ala Glu Trp Glu Tyr Ser Cys Arg Gly Gly Leu His
225 230 235 240
Asn Arg Leu Phe Pro Trp Gly Asn Lys Leu Gln Pro Lys Gly Gln His
245 250 255
Tyr Ala Asn Ile Trp Gln Gly Glu Phe Pro Val Thr Asn Thr Gly Glu
260 265 270
Asp Gly Phe Gln Gly Thr Ala Pro Val Asp Ala Phe Pro Pro Asn Gly
275 280 285
Tyr Gly Leu Tyr Asn Ile Val Gly Asn Ala Trp Glu Trp Thr Ser Asp
290 295 300
Trp Trp Thr Val His His Ser Val Glu Glu Thr Leu Asn Pro Lys Gly
305 310 315 320
Pro Pro Ser Gly Lys Asp Arg Val Lys Lys Gly Gly Ser Tyr Met Cys
325 330 335
His Arg Ser Tyr Cys Tyr Arg Tyr Arg Cys Ala Ala Arg Ser Gln Asn
340 345 350
Thr Pro Asp Ser Ser Ala Ser Asn Leu Gly Phe Arg Cys Ala Ala Asp
355 360 365
Arg Leu Pro Thr Met Asp
370
<210> 56
<211> 1989
<212> DNA
<213> Homo sapiens
<400> 56
atggctgagt ggctactctc ggcttcctgg caacgccgag cgaaagctat gactgcggcc 60
gcgggttcgg cgggccgcgc cgcggtgccc ttgctgctgt gtgcgctgct ggcgcccggc 120
ggcgcgtacg tgctcgacga ctccgacggg ctgggccggg agttcgacgg catcggcgcg 180
gtcagcggcg gcgggccgaa ttttggtgcc tctttgcata ttttaaaagt ggaaataggt 240
ggtgatgggc agacaacaga cggcactgag ccctcccaca tgcattatgc actagatgag 300
aattatttcc gaggatacga gtggtggttg atgaaagaag ctaagaagag gaatcccaat 360
attacactca ttgggttgcc atggtcattc cctggatggc tgggaaaagg tttcgactgg 420
ccttatgtca atcttcagct gactgcctat tatgtcgtga cctggattgt gggcgccaag 480
cgttaccatg atttggacat tgattatatt ggaatttgga atgagaggtc atataatgcc 540
aattatatta agatattaag aaaaatgctg aattatcaag gtctccagcg agtgaaaatc 600
atagcaagtg ataatctctg ggagtccatc tctgcatcca tgctccttga tgccgaactc 660
ttcaaggtgg ttgatgttat aggggctcat tatcctggaa cccattcagc aaaagatgca 720
aagttgactg ggaagaagct ttggtcttct gaagacttta gcactttaaa tagtgacatg 780
ggtgcaggct gctggggtcg cattttaaat cagaattata tcaatggcta tatgacttcc 840
acaatcgcat ggaatttagt ggctagttac tatgaacagt tgccttatgg gagatgcggg 900
ttgatgacgg cccaggagcc atggagtggg cactacgtgg tagaatctcc tgtctgggta 960
tcagctcata ccactcagtt tactcaacct ggctggtatt acctgaagac agttggccat 1020
ttagagaaag gaggaagcta cgtagctctg actgatggct tagggaacct caccatcatc 1080
attgaaacca tgagtcataa acattctaag tgcatacggc catttcttcc ttatttcaat 1140
gtgtcacaac aatttgccac ctttgttctt aagggatctt ttagtgaaat accagagcta 1200
caggtatggt ataccaaact tggaaaaaca tccgaaagat ttctttttaa gcagctggat 1260
tctctatggc tccttgacag cgatggcagt ttcacactga gcctgcatga agatgagctg 1320
ttcacactca ccactctcac cactggtcgc aaaggcagct acccgcttcc tccaaaatcc 1380
cagcccttcc caagtaccta taaggatgat ttcaatgttg attacccatt ttttagtgaa 1440
gctccaaact ttgctgatca aactggtgta tttgaatatt ttacaaatat tgaagaccct 1500
ggcgagcatc acttcacgct acgccaagtt ctcaaccaga gacccattac atgggctgcc 1560
gatgcatcca acacaatcag tattatagga gactacaact ggaccaatct gactataaag 1620
tgtgatgtat acatagagac ccctgacaca ggaggtgtgt tcattgcagg aagagtaaat 1680
aaaggtggta ttttgattag aagtgccaga ggaattttct tctggatttt tgcaaatgga 1740
tcttacaggg ttacaggtga tttagctgga tggattatat atgctttagg acgtgttgaa 1800
gttacagcaa aaaaatggta tacactcacg ttaactatta agggtcattt cacctctggc 1860
atgctgaatg acaagtctct gtggacagac atccctgtga attttccaaa gaatggctgg 1920
gctgcaattg gaactcactc ctttgaattt gcacagtttg acaactttct tgtggaagcc 1980
acacgctaa 1989
<210> 57
<211> 685
<212> PRT
<213> Homo sapiens
<400> 57
Met Ala Glu Trp Leu Leu Ser Ala Ser Trp Gln Arg Arg Ala Lys Ala
1 5 10 15
Met Thr Ala Ala Ala Gly Ser Ala Gly Arg Ala Ala Val Pro Leu Leu
20 25 30
Leu Cys Ala Leu Leu Ala Pro Gly Gly Ala Tyr Val Leu Asp Asp Ser
35 40 45
Asp Gly Leu Gly Arg Glu Phe Asp Gly Ile Gly Ala Val Ser Gly Gly
50 55 60
Gly Ala Thr Ser Arg Leu Leu Val Asn Tyr Pro Glu Pro Tyr Arg Ser
65 70 75 80
Gln Ile Leu Asp Tyr Leu Phe Lys Pro Asn Phe Gly Ala Ser Leu His
85 90 95
Ile Leu Lys Val Glu Ile Gly Gly Asp Gly Gln Thr Thr Asp Gly Thr
100 105 110
Glu Pro Ser His Met His Tyr Ala Leu Asp Glu Asn Tyr Phe Arg Gly
115 120 125
Tyr Glu Trp Trp Leu Met Lys Glu Ala Lys Lys Arg Asn Pro Asn Ile
130 135 140
Thr Leu Ile Gly Leu Pro Trp Ser Phe Pro Gly Trp Leu Gly Lys Gly
145 150 155 160
Phe Asp Trp Pro Tyr Val Asn Leu Gln Leu Thr Ala Tyr Tyr Val Val
165 170 175
Thr Trp Ile Val Gly Ala Lys Arg Tyr His Asp Leu Asp Ile Asp Tyr
180 185 190
Ile Gly Ile Trp Asn Glu Arg Ser Tyr Asn Ala Asn Tyr Ile Lys Ile
195 200 205
Leu Arg Lys Met Leu Asn Tyr Gln Gly Leu Gln Arg Val Lys Ile Ile
210 215 220
Ala Ser Asp Asn Leu Trp Glu Ser Ile Ser Ala Ser Met Leu Leu Asp
225 230 235 240
Ala Glu Leu Phe Lys Val Val Asp Val Ile Gly Ala His Tyr Pro Gly
245 250 255
Thr His Ser Ala Lys Asp Ala Lys Leu Thr Gly Lys Lys Leu Trp Ser
260 265 270
Ser Glu Asp Phe Ser Thr Leu Asn Ser Asp Met Gly Ala Gly Cys Trp
275 280 285
Gly Arg Ile Leu Asn Gln Asn Tyr Ile Asn Gly Tyr Met Thr Ser Thr
290 295 300
Ile Ala Trp Asn Leu Val Ala Ser Tyr Tyr Glu Gln Leu Pro Tyr Gly
305 310 315 320
Arg Cys Gly Leu Met Thr Ala Gln Glu Pro Trp Ser Gly His Tyr Val
325 330 335
Val Glu Ser Pro Val Trp Val Ser Ala His Thr Thr Gln Phe Thr Gln
340 345 350
Pro Gly Trp Tyr Tyr Leu Lys Thr Val Gly His Leu Glu Lys Gly Gly
355 360 365
Ser Tyr Val Ala Leu Thr Asp Gly Leu Gly Asn Leu Thr Ile Ile Ile
370 375 380
Glu Thr Met Ser His Lys His Ser Lys Cys Ile Arg Pro Phe Leu Pro
385 390 395 400
Tyr Phe Asn Val Ser Gln Gln Phe Ala Thr Phe Val Leu Lys Gly Ser
405 410 415
Phe Ser Glu Ile Pro Glu Leu Gln Val Trp Tyr Thr Lys Leu Gly Lys
420 425 430
Thr Ser Glu Arg Phe Leu Phe Lys Gln Leu Asp Ser Leu Trp Leu Leu
435 440 445
Asp Ser Asp Gly Ser Phe Thr Leu Ser Leu His Glu Asp Glu Leu Phe
450 455 460
Thr Leu Thr Thr Leu Thr Thr Gly Arg Lys Gly Ser Tyr Pro Leu Pro
465 470 475 480
Pro Lys Ser Gln Pro Phe Pro Ser Thr Tyr Lys Asp Asp Phe Asn Val
485 490 495
Asp Tyr Pro Phe Phe Ser Glu Ala Pro Asn Phe Ala Asp Gln Thr Gly
500 505 510
Val Phe Glu Tyr Phe Thr Asn Ile Glu Asp Pro Gly Glu His His Phe
515 520 525
Thr Leu Arg Gln Val Leu Asn Gln Arg Pro Ile Thr Trp Ala Ala Asp
530 535 540
Ala Ser Asn Thr Ile Ser Ile Ile Gly Asp Tyr Asn Trp Thr Asn Leu
545 550 555 560
Thr Ile Lys Cys Asp Val Tyr Ile Glu Thr Pro Asp Thr Gly Gly Val
565 570 575
Phe Ile Ala Gly Arg Val Asn Lys Gly Gly Ile Leu Ile Arg Ser Ala
580 585 590
Arg Gly Ile Phe Phe Trp Ile Phe Ala Asn Gly Ser Tyr Arg Val Thr
595 600 605
Gly Asp Leu Ala Gly Trp Ile Ile Tyr Ala Leu Gly Arg Val Glu Val
610 615 620
Thr Ala Lys Lys Trp Tyr Thr Leu Thr Leu Thr Ile Lys Gly His Phe
625 630 635 640
Thr Ser Gly Met Leu Asn Asp Lys Ser Leu Trp Thr Asp Ile Pro Val
645 650 655
Asn Phe Pro Lys Asn Gly Trp Ala Ala Ile Gly Thr His Ser Phe Glu
660 665 670
Phe Ala Gln Phe Asp Asn Phe Leu Val Glu Ala Thr Arg
675 680 685
<210> 58
<211> 1290
<212> DNA
<213> Homo sapiens
<400> 58
atgcagctga ggaacccaga actacatctg ggctgcgcgc ttgcgcttcg cttcctggcc 60
ctcgtttcct gggacatccc tggggctaga gcactggaca atggattggc aaggacgcct 120
accatgggct ggctgcactg ggagcgcttc atgtgcaacc ttgactgcca ggaagagcca 180
gattcctgca tcagtgagaa gctcttcatg gagatggcag agctcatggt ctcagaaggc 240
tggaaggatg caggttatga gtacctctgc attgatgact gttggatggc tccccaaaga 300
gattcagaag gcagacttca ggcagaccct cagcgctttc ctcatgggat tcgccagcta 360
gctaattatg ttcacagcaa aggactgaag ctagggattt atgcagatgt tggaaataaa 420
acctgcgcag gcttccctgg gagttttgga tactacgaca ttgatgccca gacctttgct 480
gactggggag tagatctgct aaaatttgat ggttgttact gtgacagttt ggaaaatttg 540
gcagatggtt ataagcacat gtccttggcc ctgaatagga ctggcagaag cattgtgtac 600
tcctgtgagt ggcctcttta tatgtggccc tttcaaaagc ccaattatac agaaatccga 660
cagtactgca atcactggcg aaattttgct gacattgatg attcctggaa aagtataaag 720
agtatcttgg actggacatc ttttaaccag gagagaattg ttgatgttgc tggaccaggg 780
ggttggaatg acccagatat gttagtgatt ggcaactttg gcctcagctg gaatcagcaa 840
gtaactcaga tggccctctg ggctatcatg gctgctcctt tattcatgtc taatgacctc 900
cgacacatca gccctcaagc caaagctctc cttcaggata aggacgtaat tgccatcaat 960
caggacccct tgggcaagca agggtaccag cttagacagg gagacaactt tgaagtgtgg 1020
gaacgacctc tctcaggctt agcctgggct gtagctatga taaaccggca ggagattggt 1080
ggacctcgct cttataccat cgcagttgct tccctgggta aaggagtggc ctgtaatcct 1140
gcctgcttca tcacacagct cctccctgtg aaaaggaagc tagggttcta tgaatggact 1200
tcaaggttaa gaagtcacat aaatcccaca ggcactgttt tgcttcagct agaaaataca 1260
atgcagatgt cattaaaaga cttactttaa 1290
<210> 59
<211> 429
<212> PRT
<213> Homo sapiens
<400> 59
Met Gln Leu Arg Asn Pro Glu Leu His Leu Gly Cys Ala Leu Ala Leu
1 5 10 15
Arg Phe Leu Ala Leu Val Ser Trp Asp Ile Pro Gly Ala Arg Ala Leu
20 25 30
Asp Asn Gly Leu Ala Arg Thr Pro Thr Met Gly Trp Leu His Trp Glu
35 40 45
Arg Phe Met Cys Asn Leu Asp Cys Gln Glu Glu Pro Asp Ser Cys Ile
50 55 60
Ser Glu Lys Leu Phe Met Glu Met Ala Glu Leu Met Val Ser Glu Gly
65 70 75 80
Trp Lys Asp Ala Gly Tyr Glu Tyr Leu Cys Ile Asp Asp Cys Trp Met
85 90 95
Ala Pro Gln Arg Asp Ser Glu Gly Arg Leu Gln Ala Asp Pro Gln Arg
100 105 110
Phe Pro His Gly Ile Arg Gln Leu Ala Asn Tyr Val His Ser Lys Gly
115 120 125
Leu Lys Leu Gly Ile Tyr Ala Asp Val Gly Asn Lys Thr Cys Ala Gly
130 135 140
Phe Pro Gly Ser Phe Gly Tyr Tyr Asp Ile Asp Ala Gln Thr Phe Ala
145 150 155 160
Asp Trp Gly Val Asp Leu Leu Lys Phe Asp Gly Cys Tyr Cys Asp Ser
165 170 175
Leu Glu Asn Leu Ala Asp Gly Tyr Lys His Met Ser Leu Ala Leu Asn
180 185 190
Arg Thr Gly Arg Ser Ile Val Tyr Ser Cys Glu Trp Pro Leu Tyr Met
195 200 205
Trp Pro Phe Gln Lys Pro Asn Tyr Thr Glu Ile Arg Gln Tyr Cys Asn
210 215 220
His Trp Arg Asn Phe Ala Asp Ile Asp Asp Ser Trp Lys Ser Ile Lys
225 230 235 240
Ser Ile Leu Asp Trp Thr Ser Phe Asn Gln Glu Arg Ile Val Asp Val
245 250 255
Ala Gly Pro Gly Gly Trp Asn Asp Pro Asp Met Leu Val Ile Gly Asn
260 265 270
Phe Gly Leu Ser Trp Asn Gln Gln Val Thr Gln Met Ala Leu Trp Ala
275 280 285
Ile Met Ala Ala Pro Leu Phe Met Ser Asn Asp Leu Arg His Ile Ser
290 295 300
Pro Gln Ala Lys Ala Leu Leu Gln Asp Lys Asp Val Ile Ala Ile Asn
305 310 315 320
Gln Asp Pro Leu Gly Lys Gln Gly Tyr Gln Leu Arg Gln Gly Asp Asn
325 330 335
Phe Glu Val Trp Glu Arg Pro Leu Ser Gly Leu Ala Trp Ala Val Ala
340 345 350
Met Ile Asn Arg Gln Glu Ile Gly Gly Pro Arg Ser Tyr Thr Ile Ala
355 360 365
Val Ala Ser Leu Gly Lys Gly Val Ala Cys Asn Pro Ala Cys Phe Ile
370 375 380
Thr Gln Leu Leu Pro Val Lys Arg Lys Leu Gly Phe Tyr Glu Trp Thr
385 390 395 400
Ser Arg Leu Arg Ser His Ile Asn Pro Thr Gly Thr Val Leu Leu Gln
405 410 415
Leu Glu Asn Thr Met Gln Met Ser Leu Lys Asp Leu Leu
420 425
<210> 60
<211> 1653
<212> DNA
<213> Homo sapiens
<400> 60
atgccgccac cccggaccgg ccgaggcctt ctctggctgg gtctggttct gagctccgtc 60
tgcgtcgccc tcggatccga aacgcaggcc aactcgacca cagatgctct gaacgttctt 120
ctcatcatcg tggatgacct gcgcccctcc ctgggctgtt atggggataa gctggtgagg 180
tccccaaata ttgaccaact ggcatcccac agcctcctct tccagaatgc ctttgcgcag 240
caagcagtgt gcgccccgag ccgcgtttct ttcctcactg gcaggagacc tgacaccacc 300
cgcctgtacg acttcaactc ctactggagg gtgcacgctg gaaacttctc caccatcccc 360
cagtacttca aggagaatgg ctatgtgacc atgtcggtgg gaaaagtctt tcaccctggg 420
atatcttcta accataccga tgattctccg tatagctggt cttttccacc ttatcatcct 480
tcctctgaga agtatgaaaa cactaagaca tgtcgagggc cagatggaga actccatgcc 540
aacctgcttt gccctgtgga tgtgctggat gttcccgagg gcaccttgcc tgacaaacag 600
agcactgagc aagccataca gttgttggaa aagatgaaaa cgtcagccag tcctttcttc 660
ctggccgttg ggtatcataa gccacacatc cccttcagat accccaagga atttcagaag 720
ttgtatccct tggagaacat caccctggcc cccgatcccg aggtccctga tggcctaccc 780
cctgtggcct acaacccctg gatggacatc aggcaacggg aagacgtcca agccttaaac 840
atcagtgtgc cgtatggtcc aattcctgtg gactttcagc ggaaaatccg ccagagctac 900
tttgcctctg tgtcatattt ggatacacag gtcggccgcc tcttgagtgc tttggacgat 960
cttcagctgg ccaacagcac catcattgca tttacctcgg atcatgggtg ggctctaggt 1020
gaacatggag aatgggccaa atacagcaat tttgatgttg ctacccatgt tcccctgata 1080
ttctatgttc ctggaaggac ggcttcactt ccggaggcag gcgagaagct tttcccttac 1140
ctcgaccctt ttgattccgc ctcacagttg atggagccag gcaggcaatc catggacctt 1200
gtggaacttg tgtctctttt tcccacgctg gctggacttg caggactgca ggttccacct 1260
cgctgccccg ttccttcatt tcacgttgag ctgtgcagag aaggcaagaa ccttctgaag 1320
cattttcgat tccgtgactt ggaagaggat ccgtacctcc ctggtaatcc ccgtgaactg 1380
attgcctata gccagtatcc ccggccttca gacatccctc agtggaattc tgacaagccg 1440
agtttaaaag atataaagat catgggctat tccatacgca ccatagacta taggtatact 1500
gtgtgggttg gcttcaatcc tgatgaattt ctagctaact tttctgacat ccatgcaggg 1560
gaactgtatt ttgtggattc tgacccattg caggatcaca atatgtataa tgattcccaa 1620
ggtggagatc ttttccagtt gttgatgcct tga 1653
<210> 61
<211> 550
<212> PRT
<213> Homo sapiens
<400> 61
Met Pro Pro Pro Arg Thr Gly Arg Gly Leu Leu Trp Leu Gly Leu Val
1 5 10 15
Leu Ser Ser Val Cys Val Ala Leu Gly Ser Glu Thr Gln Ala Asn Ser
20 25 30
Thr Thr Asp Ala Leu Asn Val Leu Leu Ile Ile Val Asp Asp Leu Arg
35 40 45
Pro Ser Leu Gly Cys Tyr Gly Asp Lys Leu Val Arg Ser Pro Asn Ile
50 55 60
Asp Gln Leu Ala Ser His Ser Leu Leu Phe Gln Asn Ala Phe Ala Gln
65 70 75 80
Gln Ala Val Cys Ala Pro Ser Arg Val Ser Phe Leu Thr Gly Arg Arg
85 90 95
Pro Asp Thr Thr Arg Leu Tyr Asp Phe Asn Ser Tyr Trp Arg Val His
100 105 110
Ala Gly Asn Phe Ser Thr Ile Pro Gln Tyr Phe Lys Glu Asn Gly Tyr
115 120 125
Val Thr Met Ser Val Gly Lys Val Phe His Pro Gly Ile Ser Ser Asn
130 135 140
His Thr Asp Asp Ser Pro Tyr Ser Trp Ser Phe Pro Pro Tyr His Pro
145 150 155 160
Ser Ser Glu Lys Tyr Glu Asn Thr Lys Thr Cys Arg Gly Pro Asp Gly
165 170 175
Glu Leu His Ala Asn Leu Leu Cys Pro Val Asp Val Leu Asp Val Pro
180 185 190
Glu Gly Thr Leu Pro Asp Lys Gln Ser Thr Glu Gln Ala Ile Gln Leu
195 200 205
Leu Glu Lys Met Lys Thr Ser Ala Ser Pro Phe Phe Leu Ala Val Gly
210 215 220
Tyr His Lys Pro His Ile Pro Phe Arg Tyr Pro Lys Glu Phe Gln Lys
225 230 235 240
Leu Tyr Pro Leu Glu Asn Ile Thr Leu Ala Pro Asp Pro Glu Val Pro
245 250 255
Asp Gly Leu Pro Pro Val Ala Tyr Asn Pro Trp Met Asp Ile Arg Gln
260 265 270
Arg Glu Asp Val Gln Ala Leu Asn Ile Ser Val Pro Tyr Gly Pro Ile
275 280 285
Pro Val Asp Phe Gln Arg Lys Ile Arg Gln Ser Tyr Phe Ala Ser Val
290 295 300
Ser Tyr Leu Asp Thr Gln Val Gly Arg Leu Leu Ser Ala Leu Asp Asp
305 310 315 320
Leu Gln Leu Ala Asn Ser Thr Ile Ile Ala Phe Thr Ser Asp His Gly
325 330 335
Trp Ala Leu Gly Glu His Gly Glu Trp Ala Lys Tyr Ser Asn Phe Asp
340 345 350
Val Ala Thr His Val Pro Leu Ile Phe Tyr Val Pro Gly Arg Thr Ala
355 360 365
Ser Leu Pro Glu Ala Gly Glu Lys Leu Phe Pro Tyr Leu Asp Pro Phe
370 375 380
Asp Ser Ala Ser Gln Leu Met Glu Pro Gly Arg Gln Ser Met Asp Leu
385 390 395 400
Val Glu Leu Val Ser Leu Phe Pro Thr Leu Ala Gly Leu Ala Gly Leu
405 410 415
Gln Val Pro Pro Arg Cys Pro Val Pro Ser Phe His Val Glu Leu Cys
420 425 430
Arg Glu Gly Lys Asn Leu Leu Lys His Phe Arg Phe Arg Asp Leu Glu
435 440 445
Glu Asp Pro Tyr Leu Pro Gly Asn Pro Arg Glu Leu Ile Ala Tyr Ser
450 455 460
Gln Tyr Pro Arg Pro Ser Asp Ile Pro Gln Trp Asn Ser Asp Lys Pro
465 470 475 480
Ser Leu Lys Asp Ile Lys Ile Met Gly Tyr Ser Ile Arg Thr Ile Asp
485 490 495
Tyr Arg Tyr Thr Val Trp Val Gly Phe Asn Pro Asp Glu Phe Leu Ala
500 505 510
Asn Phe Ser Asp Ile His Ala Gly Glu Leu Tyr Phe Val Asp Ser Asp
515 520 525
Pro Leu Gln Asp His Asn Met Tyr Asn Asp Ser Gln Gly Gly Asp Leu
530 535 540
Phe Gln Leu Leu Met Pro
545 550
<210> 62
<211> 1962
<212> DNA
<213> Homo sapiens
<400> 62
atgcgtcccc tgcgcccccg cgccgcgctg ctggcgctcc tggcctcgct cctggccgcg 60
cccccggtgg ccccggccga ggccccgcac ctggtgcatg tggacgcggc ccgcgcgctg 120
tggcccctgc ggcgcttctg gaggagcaca ggcttctgcc ccccgctgcc acacagccag 180
gctgaccagt acgtcctcag ctgggaccag cagctcaacc tcgcctatgt gggcgccgtc 240
cctcaccgcg gcatcaagca ggtccggacc cactggctgc tggagcttgt caccaccagg 300
gggtccactg gacggggcct gagctacaac ttcacccacc tggacgggta cctggacctt 360
ctcagggaga accagctcct cccagggttt gagctgatgg gcagcgcctc gggccacttc 420
actgactttg aggacaagca gcaggtgttt gagtggaagg acttggtctc cagcctggcc 480
aggagataca tcggtaggta cggactggcg catgtttcca agtggaactt cgagacgtgg 540
aatgagccag accaccacga ctttgacaac gtctccatga ccatgcaagg cttcctgaac 600
tactacgatg cctgctcgga gggtctgcgc gccgccagcc ccgccctgcg gctgggaggc 660
cccggcgact ccttccacac cccaccgcga tccccgctga gctggggcct cctgcgccac 720
tgccacgacg gtaccaactt cttcactggg gaggcgggcg tgcggctgga ctacatctcc 780
ctccacagga agggtgcgcg cagctccatc tccatcctgg agcaggagaa ggtcgtcgcg 840
cagcagatcc ggcagctctt ccccaagttc gcggacaccc ccatttacaa cgacgaggcg 900
gacccgctgg tgggctggtc cctgccacag ccgtggaggg cggacgtgac ctacgcggcc 960
atggtggtga aggtcatcgc gcagcatcag aacctgctac tggccaacac cacctccgcc 1020
ttcccctacg cgctcctgag caacgacaat gccttcctga gctaccaccc gcaccccttc 1080
gcgcagcgca cgctcaccgc gcgcttccag gtcaacaaca cccgcccgcc gcacgtgcag 1140
ctgttgcgca agccggtgct cacggccatg gggctgctgg cgctgctgga tgaggagcag 1200
ctctgggccg aagtgtcgca ggccgggacc gtcctggaca gcaaccacac ggtgggcgtc 1260
ctggccagcg cccaccgccc ccagggcccg gccgacgcct ggcgcgccgc ggtgctgatc 1320
tacgcgagcg acgacacccg cgcccacccc aaccgcagcg tcgcggtgac cctgcggctg 1380
cgcggggtgc cccccggccc gggcctggtc tacgtcacgc gctacctgga caacgggctc 1440
tgcagccccg acggcgagtg gcggcgcctg ggccggcccg tcttccccac ggcagagcag 1500
ttccggcgca tgcgcgcggc tgaggacccg gtggccgcgg cgccccgccc cttacccgcc 1560
ggcggccgcc tgaccctgcg ccccgcgctg cggctgccgt cgcttttgct ggtgcacgtg 1620
tgtgcgcgcc ccgagaagcc gcccgggcag gtcacgcggc tccgcgccct gcccctgacc 1680
caagggcagc tggttctggt ctggtcggat gaacacgtgg gctccaagtg cctgtggaca 1740
tacgagatcc agttctctca ggacggtaag gcgtacaccc cggtcagcag gaagccatcg 1800
accttcaacc tctttgtgtt cagcccagac acaggtgctg tctctggctc ctaccgagtt 1860
cgagccctgg actactgggc ccgaccaggc cccttctcgg accctgtgcc gtacctggag 1920
gtccctgtgc caagagggcc cccatccccg ggcaatccat ga 1962
<210> 63
<211> 653
<212> PRT
<213> Homo sapiens
<400> 63
Met Arg Pro Leu Arg Pro Arg Ala Ala Leu Leu Ala Leu Leu Ala Ser
1 5 10 15
Leu Leu Ala Ala Pro Pro Val Ala Pro Ala Glu Ala Pro His Leu Val
20 25 30
His Val Asp Ala Ala Arg Ala Leu Trp Pro Leu Arg Arg Phe Trp Arg
35 40 45
Ser Thr Gly Phe Cys Pro Pro Leu Pro His Ser Gln Ala Asp Gln Tyr
50 55 60
Val Leu Ser Trp Asp Gln Gln Leu Asn Leu Ala Tyr Val Gly Ala Val
65 70 75 80
Pro His Arg Gly Ile Lys Gln Val Arg Thr His Trp Leu Leu Glu Leu
85 90 95
Val Thr Thr Arg Gly Ser Thr Gly Arg Gly Leu Ser Tyr Asn Phe Thr
100 105 110
His Leu Asp Gly Tyr Leu Asp Leu Leu Arg Glu Asn Gln Leu Leu Pro
115 120 125
Gly Phe Glu Leu Met Gly Ser Ala Ser Gly His Phe Thr Asp Phe Glu
130 135 140
Asp Lys Gln Gln Val Phe Glu Trp Lys Asp Leu Val Ser Ser Leu Ala
145 150 155 160
Arg Arg Tyr Ile Gly Arg Tyr Gly Leu Ala His Val Ser Lys Trp Asn
165 170 175
Phe Glu Thr Trp Asn Glu Pro Asp His His Asp Phe Asp Asn Val Ser
180 185 190
Met Thr Met Gln Gly Phe Leu Asn Tyr Tyr Asp Ala Cys Ser Glu Gly
195 200 205
Leu Arg Ala Ala Ser Pro Ala Leu Arg Leu Gly Gly Pro Gly Asp Ser
210 215 220
Phe His Thr Pro Pro Arg Ser Pro Leu Ser Trp Gly Leu Leu Arg His
225 230 235 240
Cys His Asp Gly Thr Asn Phe Phe Thr Gly Glu Ala Gly Val Arg Leu
245 250 255
Asp Tyr Ile Ser Leu His Arg Lys Gly Ala Arg Ser Ser Ile Ser Ile
260 265 270
Leu Glu Gln Glu Lys Val Val Ala Gln Gln Ile Arg Gln Leu Phe Pro
275 280 285
Lys Phe Ala Asp Thr Pro Ile Tyr Asn Asp Glu Ala Asp Pro Leu Val
290 295 300
Gly Trp Ser Leu Pro Gln Pro Trp Arg Ala Asp Val Thr Tyr Ala Ala
305 310 315 320
Met Val Val Lys Val Ile Ala Gln His Gln Asn Leu Leu Leu Ala Asn
325 330 335
Thr Thr Ser Ala Phe Pro Tyr Ala Leu Leu Ser Asn Asp Asn Ala Phe
340 345 350
Leu Ser Tyr His Pro His Pro Phe Ala Gln Arg Thr Leu Thr Ala Arg
355 360 365
Phe Gln Val Asn Asn Thr Arg Pro Pro His Val Gln Leu Leu Arg Lys
370 375 380
Pro Val Leu Thr Ala Met Gly Leu Leu Ala Leu Leu Asp Glu Glu Gln
385 390 395 400
Leu Trp Ala Glu Val Ser Gln Ala Gly Thr Val Leu Asp Ser Asn His
405 410 415
Thr Val Gly Val Leu Ala Ser Ala His Arg Pro Gln Gly Pro Ala Asp
420 425 430
Ala Trp Arg Ala Ala Val Leu Ile Tyr Ala Ser Asp Asp Thr Arg Ala
435 440 445
His Pro Asn Arg Ser Val Ala Val Thr Leu Arg Leu Arg Gly Val Pro
450 455 460
Pro Gly Pro Gly Leu Val Tyr Val Thr Arg Tyr Leu Asp Asn Gly Leu
465 470 475 480
Cys Ser Pro Asp Gly Glu Trp Arg Arg Leu Gly Arg Pro Val Phe Pro
485 490 495
Thr Ala Glu Gln Phe Arg Arg Met Arg Ala Ala Glu Asp Pro Val Ala
500 505 510
Ala Ala Pro Arg Pro Leu Pro Ala Gly Gly Arg Leu Thr Leu Arg Pro
515 520 525
Ala Leu Arg Leu Pro Ser Leu Leu Leu Val His Val Cys Ala Arg Pro
530 535 540
Glu Lys Pro Pro Gly Gln Val Thr Arg Leu Arg Ala Leu Pro Leu Thr
545 550 555 560
Gln Gly Gln Leu Val Leu Val Trp Ser Asp Glu His Val Gly Ser Lys
565 570 575
Cys Leu Trp Thr Tyr Glu Ile Gln Phe Ser Gln Asp Gly Lys Ala Tyr
580 585 590
Thr Pro Val Ser Arg Lys Pro Ser Thr Phe Asn Leu Phe Val Phe Ser
595 600 605
Pro Asp Thr Gly Ala Val Ser Gly Ser Tyr Arg Val Arg Ala Leu Asp
610 615 620
Tyr Trp Ala Arg Pro Gly Pro Phe Ser Asp Pro Val Pro Tyr Leu Glu
625 630 635 640
Val Pro Val Pro Arg Gly Pro Pro Ser Pro Gly Asn Pro
645 650
<210> 64
<211> 1509
<212> DNA
<213> Homo sapiens
<400> 64
atgagctgcc ccgtgcccgc ctgctgcgcg ctgctgctag tcctggggct ctgccgggcg 60
cgtccccgga acgcactgct gctcctcgcg gatgacggag gctttgagag tggcgcgtac 120
aacaacagcg ccatcgccac cccgcacctg gacgccttgg cccgccgcag cctcctcttt 180
cgcaatgcct tcacctcggt cagcagctgc tctcccagcc gcgccagcct cctcactggc 240
ctgccccagc atcagaatgg gatgtacggg ctgcaccagg acgtgcacca cttcaactcc 300
ttcgacaagg tgcggagcct gccgctgctg ctcagccaag ctggtgtgcg cacaggcatc 360
atcgggaaga agcacgtggg gccggagacc gtgtacccgt ttgactttgc gtacacggag 420
gagaatggct ccgtcctcca ggtggggcgg aacatcacta gaattaagct gctcgtccgg 480
aaattcctgc agactcagga tgaccggcct ttcttcctct acgtcgcctt ccacgacccc 540
caccgctgtg ggcactccca gccccagtac ggaaccttct gtgagaagtt tggcaacgga 600
gagagcggca tgggtcgtat cccagactgg accccccagg cctacgaccc actggacgtg 660
ctggtgcctt acttcgtccc caacaccccg gcagcccgag ccgacctggc cgctcagtac 720
accaccgtcg gccgcatgga ccaaggagtt ggactggtgc tccaggagct gcgtgacgcc 780
ggtgtcctga acgacacact ggtgatcttc acgtccgaca acgggatccc cttccccagc 840
ggcaggacca acctgtactg gccgggcact gctgaaccct tactggtgtc atccccggag 900
cacccaaaac gctggggcca agtcagcgag gcctacgtga gcctcctaga cctcacgccc 960
accatcttgg attggttctc gatcccgtac cccagctacg ccatctttgg ctcgaagacc 1020
atccacctca ctggccggtc cctcctgccg gcgctggagg ccgagcccct ctgggccacc 1080
gtctttggca gccagagcca ccacgaggtc accatgtcct accccatgcg ctccgtgcag 1140
caccggcact tccgcctcgt gcacaacctc aacttcaaga tgccctttcc catcgaccag 1200
gacttctacg tctcacccac cttccaggac ctcctgaacc gcaccacagc tggtcagccc 1260
acgggctggt acaaggacct ccgtcattac tactaccggg cgcgctggga gctctacgac 1320
cggagccggg acccccacga gacccagaac ctggccaccg acccgcgctt tgctcagctt 1380
ctggagatgc ttcgggacca gctggccaag tggcagtggg agacccacga cccctgggtg 1440
tgcgcccccg acggcgtcct ggaggagaag ctctctcccc agtgccagcc cctccacaat 1500
gagctgtga 1509
<210> 65
<211> 502
<212> PRT
<213> Homo sapiens
<400> 65
Met Ser Cys Pro Val Pro Ala Cys Cys Ala Leu Leu Leu Val Leu Gly
1 5 10 15
Leu Cys Arg Ala Arg Pro Arg Asn Ala Leu Leu Leu Leu Ala Asp Asp
20 25 30
Gly Gly Phe Glu Ser Gly Ala Tyr Asn Asn Ser Ala Ile Ala Thr Pro
35 40 45
His Leu Asp Ala Leu Ala Arg Arg Ser Leu Leu Phe Arg Asn Ala Phe
50 55 60
Thr Ser Val Ser Ser Cys Ser Pro Ser Arg Ala Ser Leu Leu Thr Gly
65 70 75 80
Leu Pro Gln His Gln Asn Gly Met Tyr Gly Leu His Gln Asp Val His
85 90 95
His Phe Asn Ser Phe Asp Lys Val Arg Ser Leu Pro Leu Leu Leu Ser
100 105 110
Gln Ala Gly Val Arg Thr Gly Ile Ile Gly Lys Lys His Val Gly Pro
115 120 125
Glu Thr Val Tyr Pro Phe Asp Phe Ala Tyr Thr Glu Glu Asn Gly Ser
130 135 140
Val Leu Gln Val Gly Arg Asn Ile Thr Arg Ile Lys Leu Leu Val Arg
145 150 155 160
Lys Phe Leu Gln Thr Gln Asp Asp Arg Pro Phe Phe Leu Tyr Val Ala
165 170 175
Phe His Asp Pro His Arg Cys Gly His Ser Gln Pro Gln Tyr Gly Thr
180 185 190
Phe Cys Glu Lys Phe Gly Asn Gly Glu Ser Gly Met Gly Arg Ile Pro
195 200 205
Asp Trp Thr Pro Gln Ala Tyr Asp Pro Leu Asp Val Leu Val Pro Tyr
210 215 220
Phe Val Pro Asn Thr Pro Ala Ala Arg Ala Asp Leu Ala Ala Gln Tyr
225 230 235 240
Thr Thr Val Gly Arg Met Asp Gln Gly Val Gly Leu Val Leu Gln Glu
245 250 255
Leu Arg Asp Ala Gly Val Leu Asn Asp Thr Leu Val Ile Phe Thr Ser
260 265 270
Asp Asn Gly Ile Pro Phe Pro Ser Gly Arg Thr Asn Leu Tyr Trp Pro
275 280 285
Gly Thr Ala Glu Pro Leu Leu Val Ser Ser Pro Glu His Pro Lys Arg
290 295 300
Trp Gly Gln Val Ser Glu Ala Tyr Val Ser Leu Leu Asp Leu Thr Pro
305 310 315 320
Thr Ile Leu Asp Trp Phe Ser Ile Pro Tyr Pro Ser Tyr Ala Ile Phe
325 330 335
Gly Ser Lys Thr Ile His Leu Thr Gly Arg Ser Leu Leu Pro Ala Leu
340 345 350
Glu Ala Glu Pro Leu Trp Ala Thr Val Phe Gly Ser Gln Ser His His
355 360 365
Glu Val Thr Met Ser Tyr Pro Met Arg Ser Val Gln His Arg His Phe
370 375 380
Arg Leu Val His Asn Leu Asn Phe Lys Met Pro Phe Pro Ile Asp Gln
385 390 395 400
Asp Phe Tyr Val Ser Pro Thr Phe Gln Asp Leu Leu Asn Arg Thr Thr
405 410 415
Ala Gly Gln Pro Thr Gly Trp Tyr Lys Asp Leu Arg His Tyr Tyr Tyr
420 425 430
Arg Ala Arg Trp Glu Leu Tyr Asp Arg Ser Arg Asp Pro His Glu Thr
435 440 445
Gln Asn Leu Ala Thr Asp Pro Arg Phe Ala Gln Leu Leu Glu Met Leu
450 455 460
Arg Asp Gln Leu Ala Lys Trp Gln Trp Glu Thr His Asp Pro Trp Val
465 470 475 480
Cys Ala Pro Asp Gly Val Leu Glu Glu Lys Leu Ser Pro Gln Cys Gln
485 490 495
Pro Leu His Asn Glu Leu
500
<210> 66
<211> 1908
<212> DNA
<213> Homo sapiens
<400> 66
atgagcgggg cgggcagggc gctggccgcg ctgctgctgg ccgcgtccgt gctgagcgcc 60
gcgctgctgg cccccggcgg ctcttcgggg cgcgatgccc aggccgcgcc gccacgagac 120
ttagacaaaa aaagacatgc agagctgaag atggatcagg ctttgctact catccataat 180
gaacttctct ggaccaactt gaccgtctac tggaaatctg aatgctgtta tcactgcttg 240
tttcaggttc tggtaaacgt tcctcagagt ccaaaagcag ggaagcctag tgctgcagct 300
gcctctgtca gcacccagca cggatctatc ctgcagctga acgacacctt ggaagagaaa 360
gaagtttgta ggttggaata cagatttgga gaatttggaa actattctct cttggtaaag 420
aacatccata atggagttag tgaaattgcc tgtgacctgg ctgtgaacga ggatccagtt 480
gatagtaacc ttcctgtgag cattgcattc cttattggtc ttgctgtcat cattgtgata 540
tcctttctga ggctcttgtt gagtttggat gactttaaca attggatttc taaagccata 600
agttctcgag aaactgatcg cctcatcaat tctgagctgg gatctcccag caggacagac 660
cctctcgatg gtgatgttca gccagcaacg tggcgtctat ctgccctgcc gccccgcctc 720
cgcagcgtgg acaccttcag ggggattgct cttatactca tggtctttgt caattatgga 780
ggaggaaaat attggtactt caaacatgca agttggaatg ggctgacagt ggctgacctc 840
gtgttcccgt ggtttgtatt tattatggga tcttccattt ttctatcgat gacttctata 900
ctgcaacggg ggtgttcaaa attcagattg ctggggaaga ttgcatggag gagtttcctg 960
ttaatctgca taggaattat cattgtgaat cccaattatt gccttggtcc attgtcttgg 1020
gacaaggtgc gcattcctgg tgtgctgcag cgattgggag tgacatactt tgtggttgct 1080
gtgttggagc tcctctttgc taaacctgtg cctgaacatt gtgcctcgga gaggagctgc 1140
ctttctcttc gagacatcac gtccagctgg ccccagtggc tgctcatcct ggtgctggaa 1200
ggcctgtggc tgggcttgac attcctcctg ccagtccctg ggtgccctac tggttatctt 1260
ggtcctgggg gcattggaga ttttggcaag tatccaaatt gcactggagg agctgcaggc 1320
tacatcgacc gcctgctgct gggagacgat cacctttacc agcacccatc ttctgctgta 1380
ctttaccaca ccgaggtggc ctatgacccc gagggcatcc tgggcaccat caactccatc 1440
gtgatggcct ttttaggagt tcaggcagga aaaatactat tgtattacaa ggctcggacc 1500
aaagacatcc tgattcgatt cactgcttgg tgttgtattc ttgggctcat ttctgttgct 1560
ctgacgaagg tttctgaaaa tgaaggcttt attccagtaa acaaaaatct ctggtccctt 1620
tcgtatgtca ctacgctcag ttcttttgcc ttcttcatcc tgctggtcct gtacccagtt 1680
gtggatgtga aggggctgtg gacaggaacc ccattctttt atccaggaat gaattccatt 1740
ctggtatatg tcggccacga ggtgtttgag aactacttcc cctttcagtg gaagctgaag 1800
gacaaccagt cccacaagga gcacctgact cagaacatcg tcgccactgc cctctgggtg 1860
ctcattgcct acatcctcta tagaaagaag attttttgga aaatctga 1908
<210> 67
<211> 663
<212> PRT
<213> Homo sapiens
<400> 67
Met Thr Gly Ala Arg Ala Ser Ala Ala Glu Gln Arg Arg Ala Gly Arg
1 5 10 15
Ser Gly Gln Ala Arg Ala Ala Glu Arg Ala Ala Gly Met Ser Gly Ala
20 25 30
Gly Arg Ala Leu Ala Ala Leu Leu Leu Ala Ala Ser Val Leu Ser Ala
35 40 45
Ala Leu Leu Ala Pro Gly Gly Ser Ser Gly Arg Asp Ala Gln Ala Ala
50 55 60
Pro Pro Arg Asp Leu Asp Lys Lys Arg His Ala Glu Leu Lys Met Asp
65 70 75 80
Gln Ala Leu Leu Leu Ile His Asn Glu Leu Leu Trp Thr Asn Leu Thr
85 90 95
Val Tyr Trp Lys Ser Glu Cys Cys Tyr His Cys Leu Phe Gln Val Leu
100 105 110
Val Asn Val Pro Gln Ser Pro Lys Ala Gly Lys Pro Ser Ala Ala Ala
115 120 125
Ala Ser Val Ser Thr Gln His Gly Ser Ile Leu Gln Leu Asn Asp Thr
130 135 140
Leu Glu Glu Lys Glu Val Cys Arg Leu Glu Tyr Arg Phe Gly Glu Phe
145 150 155 160
Gly Asn Tyr Ser Leu Leu Val Lys Asn Ile His Asn Gly Val Ser Glu
165 170 175
Ile Ala Cys Asp Leu Ala Val Asn Glu Asp Pro Val Asp Ser Asn Leu
180 185 190
Pro Val Ser Ile Ala Phe Leu Ile Gly Leu Ala Val Ile Ile Val Ile
195 200 205
Ser Phe Leu Arg Leu Leu Leu Ser Leu Asp Asp Phe Asn Asn Trp Ile
210 215 220
Ser Lys Ala Ile Ser Ser Arg Glu Thr Asp Arg Leu Ile Asn Ser Glu
225 230 235 240
Leu Gly Ser Pro Ser Arg Thr Asp Pro Leu Asp Gly Asp Val Gln Pro
245 250 255
Ala Thr Trp Arg Leu Ser Ala Leu Pro Pro Arg Leu Arg Ser Val Asp
260 265 270
Thr Phe Arg Gly Ile Ala Leu Ile Leu Met Val Phe Val Asn Tyr Gly
275 280 285
Gly Gly Lys Tyr Trp Tyr Phe Lys His Ala Ser Trp Asn Gly Leu Thr
290 295 300
Val Ala Asp Leu Val Phe Pro Trp Phe Val Phe Ile Met Gly Ser Ser
305 310 315 320
Ile Phe Leu Ser Met Thr Ser Ile Leu Gln Arg Gly Cys Ser Lys Phe
325 330 335
Arg Leu Leu Gly Lys Ile Ala Trp Arg Ser Phe Leu Leu Ile Cys Ile
340 345 350
Gly Ile Ile Ile Val Asn Pro Asn Tyr Cys Leu Gly Pro Leu Ser Trp
355 360 365
Asp Lys Val Arg Ile Pro Gly Val Leu Gln Arg Leu Gly Val Thr Tyr
370 375 380
Phe Val Val Ala Val Leu Glu Leu Leu Phe Ala Lys Pro Val Pro Glu
385 390 395 400
His Cys Ala Ser Glu Arg Ser Cys Leu Ser Leu Arg Asp Ile Thr Ser
405 410 415
Ser Trp Pro Gln Trp Leu Leu Ile Leu Val Leu Glu Gly Leu Trp Leu
420 425 430
Gly Leu Thr Phe Leu Leu Pro Val Pro Gly Cys Pro Thr Gly Tyr Leu
435 440 445
Gly Pro Gly Gly Ile Gly Asp Phe Gly Lys Tyr Pro Asn Cys Thr Gly
450 455 460
Gly Ala Ala Gly Tyr Ile Asp Arg Leu Leu Leu Gly Asp Asp His Leu
465 470 475 480
Tyr Gln His Pro Ser Ser Ala Val Leu Tyr His Thr Glu Val Ala Tyr
485 490 495
Asp Pro Glu Gly Ile Leu Gly Thr Ile Asn Ser Ile Val Met Ala Phe
500 505 510
Leu Gly Val Gln Ala Gly Lys Ile Leu Leu Tyr Tyr Lys Ala Arg Thr
515 520 525
Lys Asp Ile Leu Ile Arg Phe Thr Ala Trp Cys Cys Ile Leu Gly Leu
530 535 540
Ile Ser Val Ala Leu Thr Lys Val Ser Glu Asn Glu Gly Phe Ile Pro
545 550 555 560
Val Asn Lys Asn Leu Trp Ser Leu Ser Tyr Val Thr Thr Leu Ser Ser
565 570 575
Phe Ala Phe Phe Ile Leu Leu Val Leu Tyr Pro Val Val Asp Val Lys
580 585 590
Gly Leu Trp Thr Gly Thr Pro Phe Phe Tyr Pro Gly Met Asn Ser Ile
595 600 605
Leu Val Tyr Val Gly His Glu Val Phe Glu Asn Tyr Phe Pro Phe Gln
610 615 620
Trp Lys Leu Lys Asp Asn Gln Ser His Lys Glu His Leu Thr Gln Asn
625 630 635 640
Ile Val Ala Thr Ala Leu Trp Val Leu Ile Ala Tyr Ile Leu Tyr Arg
645 650 655
Lys Lys Ile Phe Trp Lys Ile
660
<210> 68
<211> 1956
<212> DNA
<213> Homo sapiens
<400> 68
atggcccggg ggtcggcggt tgcctgggcg gcgctcgggc cgttgttgtg gggctgcgcg 60
ctggggctgc agggcgggat gctgtacccc caggagagcc cgtcgcggga gtgcaaggag 120
ctggacggcc tctggagctt ccgcgccgac ttctctgaca accgacgccg gggcttcgag 180
gagcagtggt accggcggcc gctgtgggag tcaggcccca ccgtggacat gccagttccc 240
tccagcttca atgacatcag ccaggactgg cgtctgcggc attttgtcgg ctgggtgtgg 300
tacgaacggg aggtgatcct gccggagcga tggacccagg acctgcgcac aagagtggtg 360
ctgaggattg gcagtgccca ttcctatgcc atcgtgtggg tgaatggggt cgacacgcta 420
gagcatgagg ggggctacct ccccttcgag gccgacatca gcaacctggt ccaggtgggg 480
cccctgccct cccggctccg aatcactatc gccatcaaca acacactcac ccccaccacc 540
ctgccaccag ggaccatcca atacctgact gacacctcca agtatcccaa gggttacttt 600
gtccagaaca catattttga ctttttcaac tacgctggac tgcagcggtc tgtacttctg 660
tacacgacac ccaccaccta catcgatgac atcaccgtca ccaccagcgt ggagcaagac 720
agtgggctgg tgaattacca gatctctgtc aagggcagta acctgttcaa gttggaagtg 780
cgtcttttgg atgcagaaaa caaagtcgtg gcgaatggga ctgggaccca gggccaactt 840
aaggtgccag gtgtcagcct ctggtggccg tacctgatgc acgaacgccc tgcctatctg 900
tattcattgg aggtgcagct gactgcacag acgtcactgg ggcctgtgtc tgacttctac 960
acactccctg tggggatccg cactgtggct gtcaccaaga gccagttcct catcaatggg 1020
aaacctttct atttccacgg tgtcaacaag catgaggatg cggacatccg agggaagggc 1080
ttcgactggc cgctgctggt gaaggacttc aacctgcttc gctggcttgg tgccaacgct 1140
ttccgtacca gccactaccc ctatgcagag gaagtgatgc agatgtgtga ccgctatggg 1200
attgtggtca tcgatgagtg tcccggcgtg ggcctggcgc tgccgcagtt cttcaacaac 1260
gtttctctgc atcaccacat gcaggtgatg gaagaagtgg tgcgtaggga caagaaccac 1320
cccgcggtcg tgatgtggtc tgtggccaac gagcctgcgt cccacctaga atctgctggc 1380
tactacttga agatggtgat cgctcacacc aaatccttgg acccctcccg gcctgtgacc 1440
tttgtgagca actctaacta tgcagcagac aagggggctc cgtatgtgga tgtgatctgt 1500
ttgaacagct actactcttg gtatcacgac tacgggcacc tggagttgat tcagctgcag 1560
ctggccaccc agtttgagaa ctggtataag aagtatcaga agcccattat tcagagcgag 1620
tatggagcag aaacgattgc agggtttcac caggatccac ctctgatgtt cactgaagag 1680
taccagaaaa gtctgctaga gcagtaccat ctgggtctgg atcaaaaacg cagaaaatac 1740
gtggttggag agctcatttg gaattttgcc gatttcatga ctgaacagtc accgacgaga 1800
gtgctgggga ataaaaaggg gatcttcact cggcagagac aaccaaaaag tgcagcgttc 1860
cttttgcgag agagatactg gaagattgcc aatgaaacca ggtatcccca ctcagtagcc 1920
aagtcacaat gtttggaaaa cagcctgttt acttga 1956
<210> 69
<211> 651
<212> PRT
<213> Homo sapiens
<400> 69
Met Ala Arg Gly Ser Ala Val Ala Trp Ala Ala Leu Gly Pro Leu Leu
1 5 10 15
Trp Gly Cys Ala Leu Gly Leu Gln Gly Gly Met Leu Tyr Pro Gln Glu
20 25 30
Ser Pro Ser Arg Glu Cys Lys Glu Leu Asp Gly Leu Trp Ser Phe Arg
35 40 45
Ala Asp Phe Ser Asp Asn Arg Arg Arg Gly Phe Glu Glu Gln Trp Tyr
50 55 60
Arg Arg Pro Leu Trp Glu Ser Gly Pro Thr Val Asp Met Pro Val Pro
65 70 75 80
Ser Ser Phe Asn Asp Ile Ser Gln Asp Trp Arg Leu Arg His Phe Val
85 90 95
Gly Trp Val Trp Tyr Glu Arg Glu Val Ile Leu Pro Glu Arg Trp Thr
100 105 110
Gln Asp Leu Arg Thr Arg Val Val Leu Arg Ile Gly Ser Ala His Ser
115 120 125
Tyr Ala Ile Val Trp Val Asn Gly Val Asp Thr Leu Glu His Glu Gly
130 135 140
Gly Tyr Leu Pro Phe Glu Ala Asp Ile Ser Asn Leu Val Gln Val Gly
145 150 155 160
Pro Leu Pro Ser Arg Leu Arg Ile Thr Ile Ala Ile Asn Asn Thr Leu
165 170 175
Thr Pro Thr Thr Leu Pro Pro Gly Thr Ile Gln Tyr Leu Thr Asp Thr
180 185 190
Ser Lys Tyr Pro Lys Gly Tyr Phe Val Gln Asn Thr Tyr Phe Asp Phe
195 200 205
Phe Asn Tyr Ala Gly Leu Gln Arg Ser Val Leu Leu Tyr Thr Thr Pro
210 215 220
Thr Thr Tyr Ile Asp Asp Ile Thr Val Thr Thr Ser Val Glu Gln Asp
225 230 235 240
Ser Gly Leu Val Asn Tyr Gln Ile Ser Val Lys Gly Ser Asn Leu Phe
245 250 255
Lys Leu Glu Val Arg Leu Leu Asp Ala Glu Asn Lys Val Val Ala Asn
260 265 270
Gly Thr Gly Thr Gln Gly Gln Leu Lys Val Pro Gly Val Ser Leu Trp
275 280 285
Trp Pro Tyr Leu Met His Glu Arg Pro Ala Tyr Leu Tyr Ser Leu Glu
290 295 300
Val Gln Leu Thr Ala Gln Thr Ser Leu Gly Pro Val Ser Asp Phe Tyr
305 310 315 320
Thr Leu Pro Val Gly Ile Arg Thr Val Ala Val Thr Lys Ser Gln Phe
325 330 335
Leu Ile Asn Gly Lys Pro Phe Tyr Phe His Gly Val Asn Lys His Glu
340 345 350
Asp Ala Asp Ile Arg Gly Lys Gly Phe Asp Trp Pro Leu Leu Val Lys
355 360 365
Asp Phe Asn Leu Leu Arg Trp Leu Gly Ala Asn Ala Phe Arg Thr Ser
370 375 380
His Tyr Pro Tyr Ala Glu Glu Val Met Gln Met Cys Asp Arg Tyr Gly
385 390 395 400
Ile Val Val Ile Asp Glu Cys Pro Gly Val Gly Leu Ala Leu Pro Gln
405 410 415
Phe Phe Asn Asn Val Ser Leu His His His Met Gln Val Met Glu Glu
420 425 430
Val Val Arg Arg Asp Lys Asn His Pro Ala Val Val Met Trp Ser Val
435 440 445
Ala Asn Glu Pro Ala Ser His Leu Glu Ser Ala Gly Tyr Tyr Leu Lys
450 455 460
Met Val Ile Ala His Thr Lys Ser Leu Asp Pro Ser Arg Pro Val Thr
465 470 475 480
Phe Val Ser Asn Ser Asn Tyr Ala Ala Asp Lys Gly Ala Pro Tyr Val
485 490 495
Asp Val Ile Cys Leu Asn Ser Tyr Tyr Ser Trp Tyr His Asp Tyr Gly
500 505 510
His Leu Glu Leu Ile Gln Leu Gln Leu Ala Thr Gln Phe Glu Asn Trp
515 520 525
Tyr Lys Lys Tyr Gln Lys Pro Ile Ile Gln Ser Glu Tyr Gly Ala Glu
530 535 540
Thr Ile Ala Gly Phe His Gln Asp Pro Pro Leu Met Phe Thr Glu Glu
545 550 555 560
Tyr Gln Lys Ser Leu Leu Glu Gln Tyr His Leu Gly Leu Asp Gln Lys
565 570 575
Arg Arg Lys Tyr Val Val Gly Glu Leu Ile Trp Asn Phe Ala Asp Phe
580 585 590
Met Thr Glu Gln Ser Pro Thr Arg Val Leu Gly Asn Lys Lys Gly Ile
595 600 605
Phe Thr Arg Gln Arg Gln Pro Lys Ser Ala Ala Phe Leu Leu Arg Glu
610 615 620
Arg Tyr Trp Lys Ile Ala Asn Glu Thr Arg Tyr Pro His Ser Val Ala
625 630 635 640
Lys Ser Gln Cys Leu Glu Asn Ser Leu Phe Thr
645 650
<210> 70
<211> 1659
<212> DNA
<213> Homo sapiens
<400> 70
atgcggctcc tgcctctagc cccaggtcgg ctccggcggg gcagcccccg ccacctgccc 60
tcctgcagcc cagcgctgct actgctggtg ctgggcggct gcctgggggt cttcggggtg 120
gctgcgggaa cccggaggcc caacgtggtg ctgctcctca cggacgacca ggacgaagtg 180
ctcggcggca tgacaccgct aaagaaaacc aaagctctca tcggagagat ggggatgact 240
ttttccagtg cttatgtgcc aagtgctctc tgctgcccca gcagagccag tatcctgaca 300
ggaaagtacc cacataatca tcacgttgtg aacaacactc tggaggggaa ctgcagtagt 360
aagtcctggc agaagatcca agaaccaaat actttcccag caattctcag atcaatgtgt 420
ggttatcaga ccttttttgc agggaaatat ttaaatgagt acggagcccc agatgcaggt 480
ggactagaac acgttcctct gggttggagt tactggtatg ccttggaaaa gaattctaag 540
tattataatt acaccctgtc tatcaatggg aaggcacgga agcatggtga aaactatagt 600
gtggactacc tgacagatgt tttggctaat gtctccttgg actttctgga ctacaagtcc 660
aactttgagc ccttcttcat gatgatcgcc actccagcgc ctcattcgcc ttggacagct 720
gcacctcagt accagaaggc tttccagaat gtctttgcac caagaaacaa gaacttcaac 780
atccatggaa cgaacaagca ctggttaatt aggcaagcca agactccaat gactaattct 840
tcaatacagt ttttagataa tgcatttagg aaaaggtggc aaactctcct ctcagttgat 900
gaccttgtgg agaaactggt caagaggctg gagttcactg gggagctcaa caacacttac 960
atcttctata cctcagacaa tggctatcac acaggacagt tttccttgcc aatagacaag 1020
agacagctgt atgagtttga tatcaaagtt ccactgttgg ttcgaggacc tgggatcaaa 1080
ccaaatcaga caagcaagat gctggttgcc aacattgact tgggtcctac tattttggac 1140
attgctggct acgacctaaa taagacacag atggatggga tgtccttatt gcccattttg 1200
agaggtgcca gtaacttgac ctggcgatca gatgtcctgg tggaatacca aggagaaggc 1260
cgtaacgtca ctgacccaac atgcccttcc ctgagtcctg gcgtatctca atgcttccca 1320
gactgtgtat gtgaagatgc ttataacaat acctatgcct gtgtgaggac aatgtcagca 1380
ttgtggaatt tgcagtattg cgagtttgat gaccaggagg tgtttgtaga agtctataat 1440
ctgactgcag acccagacca gatcactaac attgctaaaa ccatagaccc agagctttta 1500
ggaaagatga actatcggtt aatgatgtta cagtcctgtt ctgggccaac ctgtcgcact 1560
ccaggggttt ttgaccccgg atacaggttt gacccccgtc tcatgttcag caatcgcggc 1620
agtgtcagga ctcgaagatt ttccaaacat cttctgtag 1659
<210> 71
<211> 552
<212> PRT
<213> Homo sapiens
<400> 71
Met Arg Leu Leu Pro Leu Ala Pro Gly Arg Leu Arg Arg Gly Ser Pro
1 5 10 15
Arg His Leu Pro Ser Cys Ser Pro Ala Leu Leu Leu Leu Val Leu Gly
20 25 30
Gly Cys Leu Gly Val Phe Gly Val Ala Ala Gly Thr Arg Arg Pro Asn
35 40 45
Val Val Leu Leu Leu Thr Asp Asp Gln Asp Glu Val Leu Gly Gly Met
50 55 60
Thr Pro Leu Lys Lys Thr Lys Ala Leu Ile Gly Glu Met Gly Met Thr
65 70 75 80
Phe Ser Ser Ala Tyr Val Pro Ser Ala Leu Cys Cys Pro Ser Arg Ala
85 90 95
Ser Ile Leu Thr Gly Lys Tyr Pro His Asn His His Val Val Asn Asn
100 105 110
Thr Leu Glu Gly Asn Cys Ser Ser Lys Ser Trp Gln Lys Ile Gln Glu
115 120 125
Pro Asn Thr Phe Pro Ala Ile Leu Arg Ser Met Cys Gly Tyr Gln Thr
130 135 140
Phe Phe Ala Gly Lys Tyr Leu Asn Glu Tyr Gly Ala Pro Asp Ala Gly
145 150 155 160
Gly Leu Glu His Val Pro Leu Gly Trp Ser Tyr Trp Tyr Ala Leu Glu
165 170 175
Lys Asn Ser Lys Tyr Tyr Asn Tyr Thr Leu Ser Ile Asn Gly Lys Ala
180 185 190
Arg Lys His Gly Glu Asn Tyr Ser Val Asp Tyr Leu Thr Asp Val Leu
195 200 205
Ala Asn Val Ser Leu Asp Phe Leu Asp Tyr Lys Ser Asn Phe Glu Pro
210 215 220
Phe Phe Met Met Ile Ala Thr Pro Ala Pro His Ser Pro Trp Thr Ala
225 230 235 240
Ala Pro Gln Tyr Gln Lys Ala Phe Gln Asn Val Phe Ala Pro Arg Asn
245 250 255
Lys Asn Phe Asn Ile His Gly Thr Asn Lys His Trp Leu Ile Arg Gln
260 265 270
Ala Lys Thr Pro Met Thr Asn Ser Ser Ile Gln Phe Leu Asp Asn Ala
275 280 285
Phe Arg Lys Arg Trp Gln Thr Leu Leu Ser Val Asp Asp Leu Val Glu
290 295 300
Lys Leu Val Lys Arg Leu Glu Phe Thr Gly Glu Leu Asn Asn Thr Tyr
305 310 315 320
Ile Phe Tyr Thr Ser Asp Asn Gly Tyr His Thr Gly Gln Phe Ser Leu
325 330 335
Pro Ile Asp Lys Arg Gln Leu Tyr Glu Phe Asp Ile Lys Val Pro Leu
340 345 350
Leu Val Arg Gly Pro Gly Ile Lys Pro Asn Gln Thr Ser Lys Met Leu
355 360 365
Val Ala Asn Ile Asp Leu Gly Pro Thr Ile Leu Asp Ile Ala Gly Tyr
370 375 380
Asp Leu Asn Lys Thr Gln Met Asp Gly Met Ser Leu Leu Pro Ile Leu
385 390 395 400
Arg Gly Ala Ser Asn Leu Thr Trp Arg Ser Asp Val Leu Val Glu Tyr
405 410 415
Gln Gly Glu Gly Arg Asn Val Thr Asp Pro Thr Cys Pro Ser Leu Ser
420 425 430
Pro Gly Val Ser Gln Cys Phe Pro Asp Cys Val Cys Glu Asp Ala Tyr
435 440 445
Asn Asn Thr Tyr Ala Cys Val Arg Thr Met Ser Ala Leu Trp Asn Leu
450 455 460
Gln Tyr Cys Glu Phe Asp Asp Gln Glu Val Phe Val Glu Val Tyr Asn
465 470 475 480
Leu Thr Ala Asp Pro Asp Gln Ile Thr Asn Ile Ala Lys Thr Ile Asp
485 490 495
Pro Glu Leu Leu Gly Lys Met Asn Tyr Arg Leu Met Met Leu Gln Ser
500 505 510
Cys Ser Gly Pro Thr Cys Arg Thr Pro Gly Val Phe Asp Pro Gly Tyr
515 520 525
Arg Phe Asp Pro Arg Leu Met Phe Ser Asn Arg Gly Ser Val Arg Thr
530 535 540
Arg Arg Phe Ser Lys His Leu Leu
545 550
<210> 72
<211> 1602
<212> DNA
<213> Homo sapiens
<400> 72
atgggtccgc gcggcgcggc gagcttgccc cgaggccccg gacctcggcg gctgctcctc 60
cccgtcgtcc tcccgctgct gctgctgctg ttgttggcgc cgccgggctc gggcgccggg 120
gccagccggc cgccccacct ggtcttcttg ctggcagacg acctaggctg gaacgacgtc 180
ggcttccacg gctcccgcat ccgcacgccg cacctggacg cgctggcggc cggcggggtg 240
ctcctggaca actactacac gcagccgctg tgcacgccgt cgcggagcca gctgctcact 300
ggccgctacc agatccgtac aggtttacag caccaaataa tctggccctg tcagcccagc 360
tgtgttcctc tggatgaaaa actcctgccc cagctcctaa aagaagcagg ttatactacc 420
catatggtcg gaaaatggca cctgggaatg taccggaaag aatgccttcc aacccgccga 480
ggatttgata cctactttgg atatctcctg ggtagtgaag attattattc ccatgaacgc 540
tgtacattaa ttgacgctct gaatgtcaca cgatgtgctc ttgattttcg agatggcgaa 600
gaagttgcaa caggatataa aaatatgtat tcaacaaaca tattcaccaa aagggctata 660
gccctcataa ctaaccatcc accagagaag cctctgtttc tctaccttgc tctccagtct 720
gtgcatgagc cccttcaggt ccctgaggaa tacttgaagc catatgactt tatccaagac 780
aagaacaggc atcactatgc aggaatggtg tcccttatgg atgaagcagt aggaaatgtc 840
actgcagctt taaaaagcag tgggctctgg aacaacacgg tgttcatctt ttctacagat 900
aacggagggc agactttggc agggggtaat aactggcccc ttcgaggaag aaaatggagc 960
ctgtgggaag gaggcgtccg aggggtgggc tttgtggcaa gccccttgct gaagcagaag 1020
ggcgtgaaga accgggagct catccacatc tctgactggc tgccaacact cgtgaagctg 1080
gccaggggac acaccaatgg cacaaagcct ctggatggct tcgacgtgtg gaaaaccatc 1140
agtgaaggaa gcccatcccc cagaattgag ctgctgcata atattgaccc gaacttcgtg 1200
gactcttcac cgtgtcccag gaacagcatg gctccagcaa aggatgactc ttctcttcca 1260
gaatattcag cctttaacac atctgtccat gctgcaatta gacatggaaa ttggaaactc 1320
ctcacgggct acccaggctg tggttactgg ttccctccac cgtctcaata caatgtttct 1380
gagataccct catcagaccc accaaccaag accctctggc tctttgatat tgatcgggac 1440
cctgaagaaa gacatgacct gtccagagaa tatcctcaca tcgtcacaaa gctcctgtcc 1500
cgcctacagt tctaccataa acactcagtc cccgtgtact tccctgcaca ggacccccgc 1560
tgtgatccca aggccactgg ggtgtggggc ccttggatgt ag 1602
<210> 73
<211> 533
<212> PRT
<213> Homo sapiens
<400> 73
Met Gly Pro Arg Gly Ala Ala Ser Leu Pro Arg Gly Pro Gly Pro Arg
1 5 10 15
Arg Leu Leu Leu Pro Val Val Leu Pro Leu Leu Leu Leu Leu Leu Leu
20 25 30
Ala Pro Pro Gly Ser Gly Ala Gly Ala Ser Arg Pro Pro His Leu Val
35 40 45
Phe Leu Leu Ala Asp Asp Leu Gly Trp Asn Asp Val Gly Phe His Gly
50 55 60
Ser Arg Ile Arg Thr Pro His Leu Asp Ala Leu Ala Ala Gly Gly Val
65 70 75 80
Leu Leu Asp Asn Tyr Tyr Thr Gln Pro Leu Cys Thr Pro Ser Arg Ser
85 90 95
Gln Leu Leu Thr Gly Arg Tyr Gln Ile Arg Thr Gly Leu Gln His Gln
100 105 110
Ile Ile Trp Pro Cys Gln Pro Ser Cys Val Pro Leu Asp Glu Lys Leu
115 120 125
Leu Pro Gln Leu Leu Lys Glu Ala Gly Tyr Thr Thr His Met Val Gly
130 135 140
Lys Trp His Leu Gly Met Tyr Arg Lys Glu Cys Leu Pro Thr Arg Arg
145 150 155 160
Gly Phe Asp Thr Tyr Phe Gly Tyr Leu Leu Gly Ser Glu Asp Tyr Tyr
165 170 175
Ser His Glu Arg Cys Thr Leu Ile Asp Ala Leu Asn Val Thr Arg Cys
180 185 190
Ala Leu Asp Phe Arg Asp Gly Glu Glu Val Ala Thr Gly Tyr Lys Asn
195 200 205
Met Tyr Ser Thr Asn Ile Phe Thr Lys Arg Ala Ile Ala Leu Ile Thr
210 215 220
Asn His Pro Pro Glu Lys Pro Leu Phe Leu Tyr Leu Ala Leu Gln Ser
225 230 235 240
Val His Glu Pro Leu Gln Val Pro Glu Glu Tyr Leu Lys Pro Tyr Asp
245 250 255
Phe Ile Gln Asp Lys Asn Arg His His Tyr Ala Gly Met Val Ser Leu
260 265 270
Met Asp Glu Ala Val Gly Asn Val Thr Ala Ala Leu Lys Ser Ser Gly
275 280 285
Leu Trp Asn Asn Thr Val Phe Ile Phe Ser Thr Asp Asn Gly Gly Gln
290 295 300
Thr Leu Ala Gly Gly Asn Asn Trp Pro Leu Arg Gly Arg Lys Trp Ser
305 310 315 320
Leu Trp Glu Gly Gly Val Arg Gly Val Gly Phe Val Ala Ser Pro Leu
325 330 335
Leu Lys Gln Lys Gly Val Lys Asn Arg Glu Leu Ile His Ile Ser Asp
340 345 350
Trp Leu Pro Thr Leu Val Lys Leu Ala Arg Gly His Thr Asn Gly Thr
355 360 365
Lys Pro Leu Asp Gly Phe Asp Val Trp Lys Thr Ile Ser Glu Gly Ser
370 375 380
Pro Ser Pro Arg Ile Glu Leu Leu His Asn Ile Asp Pro Asn Phe Val
385 390 395 400
Asp Ser Ser Pro Cys Pro Arg Asn Ser Met Ala Pro Ala Lys Asp Asp
405 410 415
Ser Ser Leu Pro Glu Tyr Ser Ala Phe Asn Thr Ser Val His Ala Ala
420 425 430
Ile Arg His Gly Asn Trp Lys Leu Leu Thr Gly Tyr Pro Gly Cys Gly
435 440 445
Tyr Trp Phe Pro Pro Pro Ser Gln Tyr Asn Val Ser Glu Ile Pro Ser
450 455 460
Ser Asp Pro Pro Thr Lys Thr Leu Trp Leu Phe Asp Ile Asp Arg Asp
465 470 475 480
Pro Glu Glu Arg His Asp Leu Ser Arg Glu Tyr Pro His Ile Val Thr
485 490 495
Lys Leu Leu Ser Arg Leu Gln Phe Tyr His Lys His Ser Val Pro Val
500 505 510
Tyr Phe Pro Ala Gln Asp Pro Arg Cys Asp Pro Lys Ala Thr Gly Val
515 520 525
Trp Gly Pro Trp Met
530
<210> 74
<211> 1308
<212> DNA
<213> Homo sapiens
<400> 74
atggcagccc acctgcttcc catctgcgcc ctcttcctga ccttactcga tatggcccaa 60
ggctttaggg gccccttgct acccaaccgg cccttcacca ccgtctggaa tgcaaacacc 120
cagtggtgcc tggagaggca cggtgtggac gtggatgtca gtgtcttcga tgtggtagcc 180
aacccagggc agaccttccg cggccctgac atgacaattt tctatagctc ccagctgggc 240
acctacccct actacacgcc cactggggag cctgtgtttg gtggtctgcc ccagaatgcc 300
agcctgattg cccacctggc ccgcacattc caggacatcc tggctgccat acctgctcct 360
gacttctcag ggctggcagt catcgactgg gaggcatggc gcccacgctg ggccttcaac 420
tgggacacca aggacattta ccggcagcgc tcacgggcac tggtacaggc acagcaccct 480
gattggccag ctcctcaggt ggaggcagta gcccaggacc agttccaggg agctgcacgg 540
gcctggatgg caggcaccct ccagctgggg cgggcactgc gtcctcgcgg cctctggggc 600
ttctatggct tccctgactg ctacaactat gactttctaa gccccaacta caccggccag 660
tgcccatcag gcatccgtgc ccaaaatgac cagctagggt ggctgtgggg ccagagccgt 720
gccctctatc ccagcatcta catgcccgca gtgctggagg gcacagggaa gtcacagatg 780
tatgtgcaac accgtgtggc cgaggcattc cgtgtggctg tggctgctgg tgaccccaat 840
ctgccggtgc tgccctatgt ccagatcttc tatgacacga caaaccactt tctgcccctg 900
gatgagctgg agcacagcct gggggagagt gcggcccagg gggcagctgg agtggtgctc 960
tgggtgagct gggaaaatac aagaaccaag gaatcatgtc aggccatcaa ggagtatatg 1020
gacactacac tggggccctt catcctgaac gtgaccagtg gggcccttct ctgcagtcaa 1080
gccctgtgct ccggccatgg ccgctgtgtc cgccgcacca gccaccccaa agccctcctc 1140
ctccttaacc ctgccagttt ctccatccag ctcacgcctg gtggtgggcc cctgagcctg 1200
cggggtgccc tctcacttga agatcaggca cagatggctg tggagttcaa atgtcgatgc 1260
taccctggct ggcaggcacc gtggtgtgag cggaagagca tgtggtga 1308
<210> 75
<211> 435
<212> PRT
<213> Homo sapiens
<400> 75
Met Ala Ala His Leu Leu Pro Ile Cys Ala Leu Phe Leu Thr Leu Leu
1 5 10 15
Asp Met Ala Gln Gly Phe Arg Gly Pro Leu Leu Pro Asn Arg Pro Phe
20 25 30
Thr Thr Val Trp Asn Ala Asn Thr Gln Trp Cys Leu Glu Arg His Gly
35 40 45
Val Asp Val Asp Val Ser Val Phe Asp Val Val Ala Asn Pro Gly Gln
50 55 60
Thr Phe Arg Gly Pro Asp Met Thr Ile Phe Tyr Ser Ser Gln Leu Gly
65 70 75 80
Thr Tyr Pro Tyr Tyr Thr Pro Thr Gly Glu Pro Val Phe Gly Gly Leu
85 90 95
Pro Gln Asn Ala Ser Leu Ile Ala His Leu Ala Arg Thr Phe Gln Asp
100 105 110
Ile Leu Ala Ala Ile Pro Ala Pro Asp Phe Ser Gly Leu Ala Val Ile
115 120 125
Asp Trp Glu Ala Trp Arg Pro Arg Trp Ala Phe Asn Trp Asp Thr Lys
130 135 140
Asp Ile Tyr Arg Gln Arg Ser Arg Ala Leu Val Gln Ala Gln His Pro
145 150 155 160
Asp Trp Pro Ala Pro Gln Val Glu Ala Val Ala Gln Asp Gln Phe Gln
165 170 175
Gly Ala Ala Arg Ala Trp Met Ala Gly Thr Leu Gln Leu Gly Arg Ala
180 185 190
Leu Arg Pro Arg Gly Leu Trp Gly Phe Tyr Gly Phe Pro Asp Cys Tyr
195 200 205
Asn Tyr Asp Phe Leu Ser Pro Asn Tyr Thr Gly Gln Cys Pro Ser Gly
210 215 220
Ile Arg Ala Gln Asn Asp Gln Leu Gly Trp Leu Trp Gly Gln Ser Arg
225 230 235 240
Ala Leu Tyr Pro Ser Ile Tyr Met Pro Ala Val Leu Glu Gly Thr Gly
245 250 255
Lys Ser Gln Met Tyr Val Gln His Arg Val Ala Glu Ala Phe Arg Val
260 265 270
Ala Val Ala Ala Gly Asp Pro Asn Leu Pro Val Leu Pro Tyr Val Gln
275 280 285
Ile Phe Tyr Asp Thr Thr Asn His Phe Leu Pro Leu Asp Glu Leu Glu
290 295 300
His Ser Leu Gly Glu Ser Ala Ala Gln Gly Ala Ala Gly Val Val Leu
305 310 315 320
Trp Val Ser Trp Glu Asn Thr Arg Thr Lys Glu Ser Cys Gln Ala Ile
325 330 335
Lys Glu Tyr Met Asp Thr Thr Leu Gly Pro Phe Ile Leu Asn Val Thr
340 345 350
Ser Gly Ala Leu Leu Cys Ser Gln Ala Leu Cys Ser Gly His Gly Arg
355 360 365
Cys Val Arg Arg Thr Ser His Pro Lys Ala Leu Leu Leu Leu Asn Pro
370 375 380
Ala Ser Phe Ser Ile Gln Leu Thr Pro Gly Gly Gly Pro Leu Ser Leu
385 390 395 400
Arg Gly Ala Leu Ser Leu Glu Asp Gln Ala Gln Met Ala Val Glu Phe
405 410 415
Lys Cys Arg Cys Tyr Pro Gly Trp Gln Ala Pro Trp Cys Glu Arg Lys
420 425 430
Ser Met Trp
435
<210> 76
<211> 2859
<212> DNA
<213> Homo sapiens
<400> 76
atgggagtga ggcacccgcc ctgctcccac cggctcctgg ccgtctgcgc cctcgtgtcc 60
ttggcaaccg ctgcactcct ggggcacatc ctactccatg atttcctgct ggttccccga 120
gagctgagtg gctcctcccc agtcctggag gagactcacc cagctcacca gcagggagcc 180
agcagaccag ggccccggga tgcccaggca caccccggcc gtcccagagc agtgcccaca 240
cagtgcgacg tcccccccaa cagccgcttc gattgcgccc ctgacaaggc catcacccag 300
gaacagtgcg aggcccgcgg ctgttgctac atccctgcaa agcaggggct gcagggagcc 360
cagatggggc agccctggtg cttcttccca cccagctacc ccagctacaa gctggagaac 420
ctgagctcct ctgaaatggg ctacacggcc accctgaccc gtaccacccc caccttcttc 480
cccaaggaca tcctgaccct gcggctggac gtgatgatgg agactgagaa ccgcctccac 540
ttcacgatca aagatccagc taacaggcgc tacgaggtgc ccttggagac cccgcatgtc 600
cacagccggg caccgtcccc actctacagc gtggagttct ccgaggagcc cttcggggtg 660
atcgtgcgcc ggcagctgga cggccgcgtg ctgctgaaca cgacggtggc gcccctgttc 720
tttgcggacc agttccttca gctgtccacc tcgctgccct cgcagtatat cacaggcctc 780
gccgagcacc tcagtcccct gatgctcagc accagctgga ccaggatcac cctgtggaac 840
cgggaccttg cgcccacgcc cggtgcgaac ctctacgggt ctcacccttt ctacctggcg 900
ctggaggacg gcgggtcggc acacggggtg ttcctgctaa acagcaatgc catggatgtg 960
gtcctgcagc cgagccctgc ccttagctgg aggtcgacag gtgggatcct ggatgtctac 1020
atcttcctgg gcccagagcc caagagcgtg gtgcagcagt acctggacgt tgtgggatac 1080
ccgttcatgc cgccatactg gggcctgggc ttccacctgt gccgctgggg ctactcctcc 1140
accgctatca cccgccaggt ggtggagaac atgaccaggg cccacttccc cctggacgtc 1200
cagtggaacg acctggacta catggactcc cggagggact tcacgttcaa caaggatggc 1260
ttccgggact tcccggccat ggtgcaggag ctgcaccagg gcggccggcg ctacatgatg 1320
atcgtggatc ctgccatcag cagctcgggc cctgccggga gctacaggcc ctacgacgag 1380
ggtctgcgga ggggggtttt catcaccaac gagaccggcc agccgctgat tgggaaggta 1440
tggcccgggt ccactgcctt ccccgacttc accaacccca cagccctggc ctggtgggag 1500
gacatggtgg ctgagttcca tgaccaggtg cccttcgacg gcatgtggat tgacatgaac 1560
gagccttcca acttcatcag gggctctgag gacggctgcc ccaacaatga gctggagaac 1620
ccaccctacg tgcctggggt ggttgggggg accctccagg cggccaccat ctgtgcctcc 1680
agccaccagt ttctctccac acactacaac ctgcacaacc tctacggcct gaccgaagcc 1740
atcgcctccc acagggcgct ggtgaaggct cgggggacac gcccatttgt gatctcccgc 1800
tcgacctttg ctggccacgg ccgatacgcc ggccactgga cgggggacgt gtggagctcc 1860
tgggagcagc tcgcctcctc cgtgccagaa atcctgcagt ttaacctgct gggggtgcct 1920
ctggtcgggg ccgacgtctg cggcttcctg ggcaacacct cagaggagct gtgtgtgcgc 1980
tggacccagc tgggggcctt ctaccccttc atgcggaacc acaacagcct gctcagtctg 2040
ccccaggagc cgtacagctt cagcgagccg gcccagcagg ccatgaggaa ggccctcacc 2100
ctgcgctacg cactcctccc ccacctctac acactgttcc accaggccca cgtcgcgggg 2160
gagaccgtgg cccggcccct cttcctggag ttccccaagg actctagcac ctggactgtg 2220
gaccaccagc tcctgtgggg ggaggccctg ctcatcaccc cagtgctcca ggccgggaag 2280
gccgaagtga ctggctactt ccccttgggc acatggtacg acctgcagac ggtgccagta 2340
gaggcccttg gcagcctccc acccccacct gcagctcccc gtgagccagc catccacagc 2400
gaggggcagt gggtgacgct gccggccccc ctggacacca tcaacgtcca cctccgggct 2460
gggtacatca tccccctgca gggccctggc ctcacaacca cagagtcccg ccagcagccc 2520
atggccctgg ctgtggccct gaccaagggt ggggaggccc gaggggagct gttctgggac 2580
gatggagaga gcctggaagt gctggagcga ggggcctaca cacaggtcat cttcctggcc 2640
aggaataaca cgatcgtgaa tgagctggta cgtgtgacca gtgagggagc tggcctgcag 2700
ctgcagaagg tgactgtcct gggcgtggcc acggcgcccc agcaggtcct ctccaacggt 2760
gtccctgtct ccaacttcac ctacagcccc gacaccaagg tcctggacat ctgtgtctcg 2820
ctgttgatgg gagagcagtt tctcgtcagc tggtgttag 2859
<210> 77
<211> 952
<212> PRT
<213> Homo sapiens
<400> 77
Met Gly Val Arg His Pro Pro Cys Ser His Arg Leu Leu Ala Val Cys
1 5 10 15
Ala Leu Val Ser Leu Ala Thr Ala Ala Leu Leu Gly His Ile Leu Leu
20 25 30
His Asp Phe Leu Leu Val Pro Arg Glu Leu Ser Gly Ser Ser Pro Val
35 40 45
Leu Glu Glu Thr His Pro Ala His Gln Gln Gly Ala Ser Arg Pro Gly
50 55 60
Pro Arg Asp Ala Gln Ala His Pro Gly Arg Pro Arg Ala Val Pro Thr
65 70 75 80
Gln Cys Asp Val Pro Pro Asn Ser Arg Phe Asp Cys Ala Pro Asp Lys
85 90 95
Ala Ile Thr Gln Glu Gln Cys Glu Ala Arg Gly Cys Cys Tyr Ile Pro
100 105 110
Ala Lys Gln Gly Leu Gln Gly Ala Gln Met Gly Gln Pro Trp Cys Phe
115 120 125
Phe Pro Pro Ser Tyr Pro Ser Tyr Lys Leu Glu Asn Leu Ser Ser Ser
130 135 140
Glu Met Gly Tyr Thr Ala Thr Leu Thr Arg Thr Thr Pro Thr Phe Phe
145 150 155 160
Pro Lys Asp Ile Leu Thr Leu Arg Leu Asp Val Met Met Glu Thr Glu
165 170 175
Asn Arg Leu His Phe Thr Ile Lys Asp Pro Ala Asn Arg Arg Tyr Glu
180 185 190
Val Pro Leu Glu Thr Pro His Val His Ser Arg Ala Pro Ser Pro Leu
195 200 205
Tyr Ser Val Glu Phe Ser Glu Glu Pro Phe Gly Val Ile Val Arg Arg
210 215 220
Gln Leu Asp Gly Arg Val Leu Leu Asn Thr Thr Val Ala Pro Leu Phe
225 230 235 240
Phe Ala Asp Gln Phe Leu Gln Leu Ser Thr Ser Leu Pro Ser Gln Tyr
245 250 255
Ile Thr Gly Leu Ala Glu His Leu Ser Pro Leu Met Leu Ser Thr Ser
260 265 270
Trp Thr Arg Ile Thr Leu Trp Asn Arg Asp Leu Ala Pro Thr Pro Gly
275 280 285
Ala Asn Leu Tyr Gly Ser His Pro Phe Tyr Leu Ala Leu Glu Asp Gly
290 295 300
Gly Ser Ala His Gly Val Phe Leu Leu Asn Ser Asn Ala Met Asp Val
305 310 315 320
Val Leu Gln Pro Ser Pro Ala Leu Ser Trp Arg Ser Thr Gly Gly Ile
325 330 335
Leu Asp Val Tyr Ile Phe Leu Gly Pro Glu Pro Lys Ser Val Val Gln
340 345 350
Gln Tyr Leu Asp Val Val Gly Tyr Pro Phe Met Pro Pro Tyr Trp Gly
355 360 365
Leu Gly Phe His Leu Cys Arg Trp Gly Tyr Ser Ser Thr Ala Ile Thr
370 375 380
Arg Gln Val Val Glu Asn Met Thr Arg Ala His Phe Pro Leu Asp Val
385 390 395 400
Gln Trp Asn Asp Leu Asp Tyr Met Asp Ser Arg Arg Asp Phe Thr Phe
405 410 415
Asn Lys Asp Gly Phe Arg Asp Phe Pro Ala Met Val Gln Glu Leu His
420 425 430
Gln Gly Gly Arg Arg Tyr Met Met Ile Val Asp Pro Ala Ile Ser Ser
435 440 445
Ser Gly Pro Ala Gly Ser Tyr Arg Pro Tyr Asp Glu Gly Leu Arg Arg
450 455 460
Gly Val Phe Ile Thr Asn Glu Thr Gly Gln Pro Leu Ile Gly Lys Val
465 470 475 480
Trp Pro Gly Ser Thr Ala Phe Pro Asp Phe Thr Asn Pro Thr Ala Leu
485 490 495
Ala Trp Trp Glu Asp Met Val Ala Glu Phe His Asp Gln Val Pro Phe
500 505 510
Asp Gly Met Trp Ile Asp Met Asn Glu Pro Ser Asn Phe Ile Arg Gly
515 520 525
Ser Glu Asp Gly Cys Pro Asn Asn Glu Leu Glu Asn Pro Pro Tyr Val
530 535 540
Pro Gly Val Val Gly Gly Thr Leu Gln Ala Ala Thr Ile Cys Ala Ser
545 550 555 560
Ser His Gln Phe Leu Ser Thr His Tyr Asn Leu His Asn Leu Tyr Gly
565 570 575
Leu Thr Glu Ala Ile Ala Ser His Arg Ala Leu Val Lys Ala Arg Gly
580 585 590
Thr Arg Pro Phe Val Ile Ser Arg Ser Thr Phe Ala Gly His Gly Arg
595 600 605
Tyr Ala Gly His Trp Thr Gly Asp Val Trp Ser Ser Trp Glu Gln Leu
610 615 620
Ala Ser Ser Val Pro Glu Ile Leu Gln Phe Asn Leu Leu Gly Val Pro
625 630 635 640
Leu Val Gly Ala Asp Val Cys Gly Phe Leu Gly Asn Thr Ser Glu Glu
645 650 655
Leu Cys Val Arg Trp Thr Gln Leu Gly Ala Phe Tyr Pro Phe Met Arg
660 665 670
Asn His Asn Ser Leu Leu Ser Leu Pro Gln Glu Pro Tyr Ser Phe Ser
675 680 685
Glu Pro Ala Gln Gln Ala Met Arg Lys Ala Leu Thr Leu Arg Tyr Ala
690 695 700
Leu Leu Pro His Leu Tyr Thr Leu Phe His Gln Ala His Val Ala Gly
705 710 715 720
Glu Thr Val Ala Arg Pro Leu Phe Leu Glu Phe Pro Lys Asp Ser Ser
725 730 735
Thr Trp Thr Val Asp His Gln Leu Leu Trp Gly Glu Ala Leu Leu Ile
740 745 750
Thr Pro Val Leu Gln Ala Gly Lys Ala Glu Val Thr Gly Tyr Phe Pro
755 760 765
Leu Gly Thr Trp Tyr Asp Leu Gln Thr Val Pro Val Glu Ala Leu Gly
770 775 780
Ser Leu Pro Pro Pro Pro Ala Ala Pro Arg Glu Pro Ala Ile His Ser
785 790 795 800
Glu Gly Gln Trp Val Thr Leu Pro Ala Pro Leu Asp Thr Ile Asn Val
805 810 815
His Leu Arg Ala Gly Tyr Ile Ile Pro Leu Gln Gly Pro Gly Leu Thr
820 825 830
Thr Thr Glu Ser Arg Gln Gln Pro Met Ala Leu Ala Val Ala Leu Thr
835 840 845
Lys Gly Gly Glu Ala Arg Gly Glu Leu Phe Trp Asp Asp Gly Glu Ser
850 855 860
Leu Glu Val Leu Glu Arg Gly Ala Tyr Thr Gln Val Ile Phe Leu Ala
865 870 875 880
Arg Asn Asn Thr Ile Val Asn Glu Leu Val Arg Val Thr Ser Glu Gly
885 890 895
Ala Gly Leu Gln Leu Gln Lys Val Thr Val Leu Gly Val Ala Thr Ala
900 905 910
Pro Gln Gln Val Leu Ser Asn Gly Val Pro Val Ser Asn Phe Thr Tyr
915 920 925
Ser Pro Asp Thr Lys Val Leu Asp Ile Cys Val Ser Leu Leu Met Gly
930 935 940
Glu Gln Phe Leu Val Ser Trp Cys
945 950
<210> 78
<211> 1893
<212> DNA
<213> Homo sapiens
<400> 78
atgccccgct acggagcgtc actccgccag agctgcccca ggtccggccg ggagcaggga 60
caagacggga ccgccggagc ccccggactc ctttggatgg gcctggtgct ggcgctggcg 120
ctggcgctgg cgctggcgct ggctctgtct gactctcggg ttctctgggc tccggcagag 180
gctcaccctc tttctcccca aggccatcct gccaggttac atcgcatagt gccccggctc 240
cgagatgtct ttgggtgggg gaacctcacc tgcccaatct gcaaaggtct attcaccgcc 300
atcaacctcg ggctgaagga acccaatgtg gctcgcgtgg gctccgtggc catcaagctg 360
tgcaatctgc tgaagatagc accacctgcc gtgtgccaat ccattgtcca cctctttgag 420
gatgacatgg tggaggtgtg gagacgctca gtgctgagcc catctgaggc ctgtggcctg 480
ctcctgggct ccacctgtgg gcactgggac attttctcat cttggaacat ctctttgcct 540
actgtgccga agccgccccc caaaccccct agccccccag ccccaggtgc ccctgtcagc 600
cgcatcctct tcctcactga cctgcactgg gatcatgact acctggaggg cacggaccct 660
gactgtgcag acccactgtg ctgccgccgg ggttctggcc tgccgcccgc atcccggcca 720
ggtgccggat actggggcga atacagcaag tgtgacctgc ccctgaggac cctggagagc 780
ctgttgagtg ggctgggccc agccggccct tttgatatgg tgtactggac aggagacatc 840
cccgcacatg atgtctggca ccagactcgt caggaccaac tgcgggccct gaccaccgtc 900
acagcacttg tgaggaagtt cctggggcca gtgccagtgt accctgctgt gggtaaccat 960
gaaagcacac ctgtcaatag cttccctccc cccttcattg agggcaacca ctcctcccgc 1020
tggctctatg aagcgatggc caaggcttgg gagccctggc tgcctgccga agccctgcgc 1080
accctcagaa ttggggggtt ctatgctctt tccccatacc ccggtctccg cctcatctct 1140
ctcaatatga atttttgttc ccgtgagaac ttctggctct tgatcaactc cacggatccc 1200
gcaggacagc tccagtggct ggtgggggag cttcaggctg ctgaggatcg aggagacaaa 1260
gtgcatataa ttggccacat tcccccaggg cactgtctga agagctggag ctggaattat 1320
taccgaattg tagccaggta tgagaacacc ctggctgctc agttctttgg ccacactcat 1380
gtggatgaat ttgaggtctt ctatgatgaa gagactctga gccggccgct ggctgtagcc 1440
ttcctggcac ccagtgcaac tacctacatc ggccttaatc ctggttaccg tgtgtaccaa 1500
atagatggaa actactccgg gagctctcac gtggtcctgg accatgagac ctacatcctg 1560
aatctgaccc aggcaaacat accgggagcc ataccgcact ggcagcttct ctacagggct 1620
cgagaaacct atgggctgcc caacacactg cctaccgcct ggcacaacct ggtatatcgc 1680
atgcggggcg acatgcaact tttccagacc ttctggtttc tctaccataa gggccaccca 1740
ccctcggagc cctgtggcac gccctgccgt ctggctactc tttgtgccca gctctctgcc 1800
cgtgctgaca gccctgctct gtgccgccac ctgatgccag atgggagcct cccagaggcc 1860
cagagcctgt ggccaaggcc actgttttgc tag 1893
<210> 79
<211> 631
<212> PRT
<213> Homo sapiens
<400> 79
Met Pro Arg Tyr Gly Ala Ser Leu Arg Gln Ser Cys Pro Arg Ser Gly
1 5 10 15
Arg Glu Gln Gly Gln Asp Gly Thr Ala Gly Ala Pro Gly Leu Leu Trp
20 25 30
Met Gly Leu Val Leu Ala Leu Ala Leu Ala Leu Ala Leu Ala Leu Ala
35 40 45
Leu Ser Asp Ser Arg Val Leu Trp Ala Pro Ala Glu Ala His Pro Leu
50 55 60
Ser Pro Gln Gly His Pro Ala Arg Leu His Arg Ile Val Pro Arg Leu
65 70 75 80
Arg Asp Val Phe Gly Trp Gly Asn Leu Thr Cys Pro Ile Cys Lys Gly
85 90 95
Leu Phe Thr Ala Ile Asn Leu Gly Leu Lys Lys Glu Pro Asn Val Ala
100 105 110
Arg Val Gly Ser Val Ala Ile Lys Leu Cys Asn Leu Leu Lys Ile Ala
115 120 125
Pro Pro Ala Val Cys Gln Ser Ile Val His Leu Phe Glu Asp Asp Met
130 135 140
Val Glu Val Trp Arg Arg Ser Val Leu Ser Pro Ser Glu Ala Cys Gly
145 150 155 160
Leu Leu Leu Gly Ser Thr Cys Gly His Trp Asp Ile Phe Ser Ser Trp
165 170 175
Asn Ile Ser Leu Pro Thr Val Pro Lys Pro Pro Pro Lys Pro Pro Ser
180 185 190
Pro Pro Ala Pro Gly Ala Pro Val Ser Arg Ile Leu Phe Leu Thr Asp
195 200 205
Leu His Trp Asp His Asp Tyr Leu Glu Gly Thr Asp Pro Asp Cys Ala
210 215 220
Asp Pro Leu Cys Cys Arg Arg Gly Ser Gly Leu Pro Pro Ala Ser Arg
225 230 235 240
Pro Gly Ala Gly Tyr Trp Gly Glu Tyr Ser Lys Cys Asp Leu Pro Leu
245 250 255
Arg Thr Leu Glu Ser Leu Leu Ser Gly Leu Gly Pro Ala Gly Pro Phe
260 265 270
Asp Met Val Tyr Trp Thr Gly Asp Ile Pro Ala His Asp Val Trp His
275 280 285
Gln Thr Arg Gln Asp Gln Leu Arg Ala Leu Thr Thr Val Thr Ala Leu
290 295 300
Val Arg Lys Phe Leu Gly Pro Val Pro Val Tyr Pro Ala Val Gly Asn
305 310 315 320
His Glu Ser Thr Pro Val Asn Ser Phe Pro Pro Pro Phe Ile Glu Gly
325 330 335
Asn His Ser Ser Arg Trp Leu Tyr Glu Ala Met Ala Lys Ala Trp Glu
340 345 350
Pro Trp Leu Pro Ala Glu Ala Leu Arg Thr Leu Arg Ile Gly Gly Phe
355 360 365
Tyr Ala Leu Ser Pro Tyr Pro Gly Leu Arg Leu Ile Ser Leu Asn Met
370 375 380
Asn Phe Cys Ser Arg Glu Asn Phe Trp Leu Leu Ile Asn Ser Thr Asp
385 390 395 400
Pro Ala Gly Gln Leu Gln Trp Leu Val Gly Glu Leu Gln Ala Ala Glu
405 410 415
Asp Arg Gly Asp Lys Val His Ile Ile Gly His Ile Pro Pro Gly His
420 425 430
Cys Leu Lys Ser Trp Ser Trp Asn Tyr Tyr Arg Ile Val Ala Arg Tyr
435 440 445
Glu Asn Thr Leu Ala Ala Gln Phe Phe Gly His Thr His Val Asp Glu
450 455 460
Phe Glu Val Phe Tyr Asp Glu Glu Thr Leu Ser Arg Pro Leu Ala Val
465 470 475 480
Ala Phe Leu Ala Pro Ser Ala Thr Thr Tyr Ile Gly Leu Asn Pro Gly
485 490 495
Tyr Arg Val Tyr Gln Ile Asp Gly Asn Tyr Ser Gly Ser Ser His Val
500 505 510
Val Leu Asp His Glu Thr Tyr Ile Leu Asn Leu Thr Gln Ala Asn Ile
515 520 525
Pro Gly Ala Ile Pro His Trp Gln Leu Leu Tyr Arg Ala Arg Glu Thr
530 535 540
Tyr Gly Leu Pro Asn Thr Leu Pro Thr Ala Trp His Asn Leu Val Tyr
545 550 555 560
Arg Met Arg Gly Asp Met Gln Leu Phe Gln Thr Phe Trp Phe Leu Tyr
565 570 575
His Lys Gly His Pro Pro Ser Glu Pro Cys Gly Thr Pro Cys Arg Leu
580 585 590
Ala Thr Leu Cys Ala Gln Leu Ser Ala Arg Ala Asp Ser Pro Ala Leu
595 600 605
Cys Arg His Leu Met Pro Asp Gly Ser Leu Pro Glu Ala Gln Ser Leu
610 615 620
Trp Pro Arg Pro Leu Phe Cys
625 630
<210> 80
<211> 1170
<212> DNA
<213> Homo sapiens
<400> 80
atgaactgct gcatcgggct gggagagaaa gctcgcgggt cccaccgggc ctcctaccca 60
agtctcagcg cgcttttcac cgaggcctca attctgggat ttggcagctt tgctgtgaaa 120
gcccaatgga cagaggactg cagaaaatca acctatcctc cttcaggacc aactgtcttc 180
cctgctgtta taaggtacag aggtgcagtt ccatggtaca ccataaatct tgacttacca 240
ccctacaaaa gatggcatga attgatgctt gacaaggcac cagtgcctgg cctacttggc 300
aactttcctg gcccttttga agaggaaatg aagggtattg ccgctgttac tgatatacct 360
ttaggagaga ttatttcatt caatattttt tatgaattat ttaccatttg tacttcaata 420
gtagcagaag acaaaaaagg tcatctaata catgggagaa acatggattt tggagtattt 480
cttgggtgga acataaataa tgatacctgg gtcataactg agcaactaaa acctttaaca 540
gtgaatttgg atttccaaag aaacaacaaa actgtcttca aggcttcaag ctttgctggc 600
tatgtgggca tgttaacagg attcaaacca ggactgttca gtcttacact gaatgaacgt 660
ttcagtataa atggtggtta tctgggtatt ctagaatgga ttctgggaaa gaaagatgtc 720
atgtggatag ggttcctcac tagaacagtt ctggaaaata gcacaagtta tgaagaagcc 780
aagaatttat tgaccaagac caagatattg gccccagcct actttatcct gggaggcaac 840
cagtctgggg aaggttgtgt gattacacga gacagaaagg aatcattgga tgtatatgaa 900
ctcgatgcta agcagggtag atggtatgtg gtacaaacaa attatgaccg ttggaaacat 960
cccttcttcc ttgatgatcg cagaacgcct gcaaagatgt gtctgaaccg caccagccaa 1020
gagaatatct catttgaaac catgtatgat gtcctgtcaa caaaacctgt cctcaacaag 1080
ctgaccgtat acacaacctt gatagatgtt accaaaggtc aattcgaaac ttacctgcgg 1140
gactgccctg acccttgtat aggttggtga 1170
<210> 81
<211> 395
<212> PRT
<213> Homo sapiens
<400> 81
Met Pro Gly Arg Ser Cys Val Ala Leu Val Leu Leu Ala Ala Ala Val
1 5 10 15
Ser Cys Ala Val Ala Gln His Ala Pro Pro Trp Thr Glu Asp Cys Arg
20 25 30
Lys Ser Thr Tyr Pro Pro Ser Gly Pro Thr Tyr Arg Gly Ala Val Pro
35 40 45
Trp Tyr Thr Ile Asn Leu Asp Leu Pro Pro Tyr Lys Arg Trp His Glu
50 55 60
Leu Met Leu Asp Lys Ala Pro Val Leu Lys Val Ile Val Asn Ser Leu
65 70 75 80
Lys Asn Met Ile Asn Thr Phe Val Pro Ser Gly Lys Ile Met Gln Val
85 90 95
Val Asp Glu Lys Leu Pro Gly Leu Leu Gly Asn Phe Pro Gly Pro Phe
100 105 110
Glu Glu Glu Met Lys Gly Ile Ala Ala Val Thr Asp Ile Pro Leu Gly
115 120 125
Glu Ile Ile Ser Phe Asn Ile Phe Tyr Glu Leu Phe Thr Ile Cys Thr
130 135 140
Ser Ile Val Ala Glu Asp Lys Lys Gly His Leu Ile His Gly Arg Asn
145 150 155 160
Met Asp Phe Gly Val Phe Leu Gly Trp Asn Ile Asn Asn Asp Thr Trp
165 170 175
Val Ile Thr Glu Gln Leu Lys Pro Leu Thr Val Asn Leu Asp Phe Gln
180 185 190
Arg Asn Asn Lys Thr Val Phe Lys Ala Ser Ser Phe Ala Gly Tyr Val
195 200 205
Gly Met Leu Thr Gly Phe Lys Pro Gly Leu Phe Ser Leu Thr Leu Asn
210 215 220
Glu Arg Phe Ser Ile Asn Gly Gly Tyr Leu Gly Ile Leu Glu Trp Ile
225 230 235 240
Leu Gly Lys Lys Asp Val Met Trp Ile Gly Phe Leu Thr Arg Thr Val
245 250 255
Leu Glu Asn Ser Thr Ser Tyr Glu Glu Ala Lys Asn Leu Leu Thr Lys
260 265 270
Thr Lys Ile Leu Ala Pro Ala Tyr Phe Ile Leu Gly Gly Asn Gln Ser
275 280 285
Gly Glu Gly Cys Val Ile Thr Arg Asp Arg Lys Glu Ser Leu Asp Val
290 295 300
Tyr Glu Leu Asp Ala Lys Gln Gly Arg Trp Tyr Val Val Gln Thr Asn
305 310 315 320
Tyr Asp Arg Trp Lys His Pro Phe Phe Leu Asp Asp Arg Arg Thr Pro
325 330 335
Ala Lys Met Cys Leu Asn Arg Thr Ser Gln Glu Asn Ile Ser Phe Glu
340 345 350
Thr Met Tyr Asp Val Leu Ser Thr Lys Pro Val Leu Asn Lys Leu Thr
355 360 365
Val Tyr Thr Thr Leu Ile Asp Val Thr Lys Gly Gln Phe Glu Thr Tyr
370 375 380
Leu Arg Asp Cys Pro Asp Pro Cys Ile Gly Trp
385 390 395
<210> 82
<211> 1200
<212> DNA
<213> Homo sapiens
<400> 82
atgaaaatgc ggttcttggg gttggtggtc tgtttggttc tctggaccct gcattctgag 60
gggtctggag ggaaactgac agctgtggat cctgaaacaa acatgaatgt gagtgaaatt 120
atctcttact ggggattccc tagtgaggaa tacctagttg agacagaaga tggatatatt 180
ctgtgcctta accgaattcc tcatgggagg aagaaccatt ctgacaaagg tcccaaacca 240
gttgtcttcc tgcaacatgg cttgctggca gattctagta actgggtcac aaaccttgcc 300
aacagcagcc tgggcttcat tcttgctgat gctggttttg acgtgtggat gggcaacagc 360
agaggaaata cctggtctcg gaaacataag acactctcag tttctcagga tgaattctgg 420
gctttcagtt atgatgagat ggcaaaatat gacctaccag cttccattaa cttcattctg 480
aataaaactg gccaagaaca agtgtattat gtgggtcatt ctcaaggcac cactataggt 540
tttatagcat tttcacagat ccctgagctg gctaaaagga ttaaaatgtt ttttgccctg 600
ggtcctgtgg cttccgtcgc cttctgtact agccctatgg ccaaattagg acgattacca 660
gatcatctca ttaaggactt atttggagac aaagaatttc ttccccagag tgcgtttttg 720
aagtggctgg gtacccacgt ttgcactcat gtcatactga aggagctctg tggaaatctc 780
tgttttcttc tgtgtggatt taatgagaga aatttaaata tgtctagagt ggatgtatat 840
acaacacatt ctcctgctgg aacttctgtg caaaacatgt tacactggag ccaggctgtt 900
aaattccaaa agtttcaagc ctttgactgg ggaagcagtg ccaagaatta ttttcattac 960
aaccagagtt atcctcccac atacaatgtg aaggacatgc ttgtgccgac tgcagtctgg 1020
agcgggggtc acgactggct tgcagatgtc tacgacgtca atatcttact gactcagatc 1080
accaacttgg tgttccatga gagcattccg gaatgggagc atcttgactt catttggggc 1140
ctggatgccc cttggaggct ttataataaa attattaatc taatgaggaa atatcagtga 1200
<210> 83
<211> 399
<212> PRT
<213> Homo sapiens
<400> 83
Met Lys Met Arg Phe Leu Gly Leu Val Val Cys Leu Val Leu Trp Thr
1 5 10 15
Leu His Ser Glu Gly Ser Gly Gly Lys Leu Thr Ala Val Asp Pro Glu
20 25 30
Thr Asn Met Asn Val Ser Glu Ile Ile Ser Tyr Trp Gly Phe Pro Ser
35 40 45
Glu Glu Tyr Leu Val Glu Thr Glu Asp Gly Tyr Ile Leu Cys Leu Asn
50 55 60
Arg Ile Pro His Gly Arg Lys Asn His Ser Asp Lys Gly Pro Lys Pro
65 70 75 80
Val Val Phe Leu Gln His Gly Leu Leu Ala Asp Ser Ser Asn Trp Val
85 90 95
Thr Asn Leu Ala Asn Ser Ser Leu Gly Phe Ile Leu Ala Asp Ala Gly
100 105 110
Phe Asp Val Trp Met Gly Asn Ser Arg Gly Asn Thr Trp Ser Arg Lys
115 120 125
His Lys Thr Leu Ser Val Ser Gln Asp Glu Phe Trp Ala Phe Ser Tyr
130 135 140
Asp Glu Met Ala Lys Tyr Asp Leu Pro Ala Ser Ile Asn Phe Ile Leu
145 150 155 160
Asn Lys Thr Gly Gln Glu Gln Val Tyr Tyr Val Gly His Ser Gln Gly
165 170 175
Thr Thr Ile Gly Phe Ile Ala Phe Ser Gln Ile Pro Glu Leu Ala Lys
180 185 190
Arg Ile Lys Met Phe Phe Ala Leu Gly Pro Val Ala Ser Val Ala Phe
195 200 205
Cys Thr Ser Pro Met Ala Lys Leu Gly Arg Leu Pro Asp His Leu Ile
210 215 220
Lys Asp Leu Phe Gly Asp Lys Glu Phe Leu Pro Gln Ser Ala Phe Leu
225 230 235 240
Lys Trp Leu Gly Thr His Val Cys Thr His Val Ile Leu Lys Glu Leu
245 250 255
Cys Gly Asn Leu Cys Phe Leu Leu Cys Gly Phe Asn Glu Arg Asn Leu
260 265 270
Asn Met Ser Arg Val Asp Val Tyr Thr Thr His Ser Pro Ala Gly Thr
275 280 285
Ser Val Gln Asn Met Leu His Trp Ser Gln Ala Val Lys Phe Gln Lys
290 295 300
Phe Gln Ala Phe Asp Trp Gly Ser Ser Ala Lys Asn Tyr Phe His Tyr
305 310 315 320
Asn Gln Ser Tyr Pro Pro Thr Tyr Asn Val Lys Asp Met Leu Val Pro
325 330 335
Thr Ala Val Trp Ser Gly Gly His Asp Trp Leu Ala Asp Val Tyr Asp
340 345 350
Val Asn Ile Leu Leu Thr Gln Ile Thr Asn Leu Val Phe His Glu Ser
355 360 365
Ile Pro Glu Trp Glu His Leu Asp Phe Ile Trp Gly Leu Asp Ala Pro
370 375 380
Trp Arg Leu Tyr Asn Lys Ile Ile Asn Leu Met Arg Lys Tyr Gln
385 390 395
<210> 84
<211> 990
<212> DNA
<213> Homo sapiens
<400> 84
atgtgggggc tcaaggttct gctgctacct gtggtgagct ttgctctgta ccctgaggag 60
atactggaca cccactggga gctatggaag aagacccaca ggaagcaata taacaacaag 120
gtggatgaaa tctctcggcg tttaatttgg gaaaaaaacc tgaagtatat ttccatccat 180
aaccttgagg cttctcttgg tgtccataca tatgaactgg ctatgaacca cctgggggac 240
atgaccagtg aagaggtggt tcagaagatg actggactca aagtacccct gtctcattcc 300
cgcagtaatg acacccttta tatcccagaa tgggaaggta gagccccaga ctctgtcgac 360
tatcgaaaga aaggatatgt tactcctgtc aaaaatcagg gtcagtgtgg ttcctgttgg 420
gcttttagct ctgtgggtgc cctggagggc caactcaaga agaaaactgg caaactctta 480
aatctgagtc cccagaacct agtggattgt gtgtctgaga atgatggctg tggagggggc 540
tacatgacca atgccttcca atatgtgcag aagaaccggg gtattgactc tgaagatgcc 600
tacccatatg tgggacagga agagagttgt atgtacaacc caacaggcaa ggcagctaaa 660
tgcagagggt acagagagat ccccgagggg aatgagaaag ccctgaagag ggcagtggcc 720
cgagtgggac ctgtctctgt ggccattgat gcaagcctga cctccttcca gttttacagc 780
aaaggtgtgt attatgatga aagctgcaat agcgataatc tgaaccatgc agttttggca 840
gtgggatatg gaatccagaa gggaaacaag cactggataa ttaaaaacag ctggggagaa 900
aactggggaa acaaaggata tatcctcatg gctcgaaata agaacaacgc ctgtggcatt 960
gccaacctgg ccagcttccc caagatgtga 990
<210> 85
<211> 329
<212> PRT
<213> Homo sapiens
<400> 85
Met Trp Gly Leu Lys Val Leu Leu Leu Pro Val Val Ser Phe Ala Leu
1 5 10 15
Tyr Pro Glu Glu Ile Leu Asp Thr His Trp Glu Leu Trp Lys Lys Thr
20 25 30
His Arg Lys Gln Tyr Asn Asn Lys Val Asp Glu Ile Ser Arg Arg Leu
35 40 45
Ile Trp Glu Lys Asn Leu Lys Tyr Ile Ser Ile His Asn Leu Glu Ala
50 55 60
Ser Leu Gly Val His Thr Tyr Glu Leu Ala Met Asn His Leu Gly Asp
65 70 75 80
Met Thr Ser Glu Glu Val Val Gln Lys Met Thr Gly Leu Lys Val Pro
85 90 95
Leu Ser His Ser Arg Ser Asn Asp Thr Leu Tyr Ile Pro Glu Trp Glu
100 105 110
Gly Arg Ala Pro Asp Ser Val Asp Tyr Arg Lys Lys Gly Tyr Val Thr
115 120 125
Pro Val Lys Asn Gln Gly Gln Cys Gly Ser Cys Trp Ala Phe Ser Ser
130 135 140
Val Gly Ala Leu Glu Gly Gln Leu Lys Lys Lys Thr Gly Lys Leu Leu
145 150 155 160
Asn Leu Ser Pro Gln Asn Leu Val Asp Cys Val Ser Glu Asn Asp Gly
165 170 175
Cys Gly Gly Gly Tyr Met Thr Asn Ala Phe Gln Tyr Val Gln Lys Asn
180 185 190
Arg Gly Ile Asp Ser Glu Asp Ala Tyr Pro Tyr Val Gly Gln Glu Glu
195 200 205
Ser Cys Met Tyr Asn Pro Thr Gly Lys Ala Ala Lys Cys Arg Gly Tyr
210 215 220
Arg Glu Ile Pro Glu Gly Asn Glu Lys Ala Leu Lys Arg Ala Val Ala
225 230 235 240
Arg Val Gly Pro Val Ser Val Ala Ile Asp Ala Ser Leu Thr Ser Phe
245 250 255
Gln Phe Tyr Ser Lys Gly Val Tyr Tyr Asp Glu Ser Cys Asn Ser Asp
260 265 270
Asn Leu Asn His Ala Val Leu Ala Val Gly Tyr Gly Ile Gln Lys Gly
275 280 285
Asn Lys His Trp Ile Ile Lys Asn Ser Trp Gly Glu Asn Trp Gly Asn
290 295 300
Lys Gly Tyr Ile Leu Met Ala Arg Asn Lys Asn Asn Ala Cys Gly Ile
305 310 315 320
Ala Asn Leu Ala Ser Phe Pro Lys Met
325
<210> 86
<211> 1692
<212> DNA
<213> Homo sapiens
<400> 86
atgggactcc aagcctgcct cctagggctc tttgccctca tcctctctgg caaatgcagt 60
tacagcccgg agcccgacca gcggaggacg ctgcccccag gctgggtgtc cctgggccgt 120
gcggaccctg aggaagagct gagtctcacc tttgccctga gacagcagaa tgtggaaaga 180
ctctcggagc tggtgcaggc tgtgtcggat cccagctctc ctcaatacgg aaaatacctg 240
accctagaga atgtggctga tctggtgagg ccatccccac tgaccctcca cacggtgcaa 300
aaatggctct tggcagccgg agcccagaag tgccattctg tgatcacaca ggactttctg 360
acttgctggc tgagcatccg acaagcagag ctgctgctcc ctggggctga gtttcatcac 420
tatgtgggag gacctacgga aacccatgtt gtaaggtccc cacatcccta ccagcttcca 480
caggccttgg ccccccatgt ggactttgtg gggggactgc accgttttcc cccaacatca 540
tccctgaggc aacgtcctga gccgcaggtg acagggactg taggcctgca tctgggggta 600
accccctctg tgatccgtaa gcgatacaac ttgacctcac aagacgtggg ctctggcacc 660
agcaataaca gccaagcctg tgcccagttc ctggagcagt atttccatga ctcagacctg 720
gctcagttca tgcgcctctt cggtggcaac tttgcacatc aggcatcagt agcccgtgtg 780
gttggacaac agggccgggg ccgggccggg attgaggcca gtctagatgt gcagtacctg 840
atgagtgctg gtgccaacat ctccacctgg gtctacagta gccctggccg gcatgaggga 900
caggagccct tcctgcagtg gctcatgctg ctcagtaatg agtcagccct gccacatgtg 960
catactgtga gctatggaga tgatgaggac tccctcagca gcgcctacat ccagcgggtc 1020
aacactgagc tcatgaaggc tgccgctcgg ggtctcaccc tgctcttcgc ctcaggtgac 1080
agtggggccg ggtgttggtc tgtctctgga agacaccagt tccgccctac cttccctgcc 1140
tccagcccct atgtcaccac agtgggaggc acatccttcc aggaaccttt cctcatcaca 1200
aatgaaattg ttgactatat cagtggtggt ggcttcagca atgtgttccc acggccttca 1260
taccaggagg aagctgtaac gaagttcctg agctctagcc cccacctgcc accatccagt 1320
tacttcaatg ccagtggccg tgcctaccca gatgtggctg cactttctga tggctactgg 1380
gtggtcagca acagagtgcc cattccatgg gtgtccggaa cctcggcctc tactccagtg 1440
tttgggggga tcctatcctt gatcaatgag cacaggatcc ttagtggccg cccccctctt 1500
ggctttctca acccaaggct ctaccagcag catggggcag gactctttga tgtaacccgt 1560
ggctgccatg agtcctgtct ggatgaagag gtagagggcc agggtttctg ctctggtcct 1620
ggctgggatc ctgtaacagg ctggggaaca cccaacttcc cagctttgct gaagactcta 1680
ctcaacccct ga 1692
<210> 87
<211> 563
<212> PRT
<213> Homo sapiens
<400> 87
Met Gly Leu Gln Ala Cys Leu Leu Gly Leu Phe Ala Leu Ile Leu Ser
1 5 10 15
Gly Lys Cys Ser Tyr Ser Pro Glu Pro Asp Gln Arg Arg Thr Leu Pro
20 25 30
Pro Gly Trp Val Ser Leu Gly Arg Ala Asp Pro Glu Glu Glu Leu Ser
35 40 45
Leu Thr Phe Ala Leu Arg Gln Gln Asn Val Glu Arg Leu Ser Glu Leu
50 55 60
Val Gln Ala Val Ser Asp Pro Ser Ser Pro Gln Tyr Gly Lys Tyr Leu
65 70 75 80
Thr Leu Glu Asn Val Ala Asp Leu Val Arg Pro Ser Pro Leu Thr Leu
85 90 95
His Thr Val Gln Lys Trp Leu Leu Ala Ala Gly Ala Gln Lys Cys His
100 105 110
Ser Val Ile Thr Gln Asp Phe Leu Thr Cys Trp Leu Ser Ile Arg Gln
115 120 125
Ala Glu Leu Leu Leu Pro Gly Ala Glu Phe His His Tyr Val Gly Gly
130 135 140
Pro Thr Glu Thr His Val Val Arg Ser Pro His Pro Tyr Gln Leu Pro
145 150 155 160
Gln Ala Leu Ala Pro His Val Asp Phe Val Gly Gly Leu His Arg Phe
165 170 175
Pro Pro Thr Ser Ser Leu Arg Gln Arg Pro Glu Pro Gln Val Thr Gly
180 185 190
Thr Val Gly Leu His Leu Gly Val Thr Pro Ser Val Ile Arg Lys Arg
195 200 205
Tyr Asn Leu Thr Ser Gln Asp Val Gly Ser Gly Thr Ser Asn Asn Ser
210 215 220
Gln Ala Cys Ala Gln Phe Leu Glu Gln Tyr Phe His Asp Ser Asp Leu
225 230 235 240
Ala Gln Phe Met Arg Leu Phe Gly Gly Asn Phe Ala His Gln Ala Ser
245 250 255
Val Ala Arg Val Val Gly Gln Gln Gly Arg Gly Arg Ala Gly Ile Glu
260 265 270
Ala Ser Leu Asp Val Gln Tyr Leu Met Ser Ala Gly Ala Asn Ile Ser
275 280 285
Thr Trp Val Tyr Ser Ser Pro Gly Arg His Glu Gly Gln Glu Pro Phe
290 295 300
Leu Gln Trp Leu Met Leu Leu Ser Asn Glu Ser Ala Leu Pro His Val
305 310 315 320
His Thr Val Ser Tyr Gly Asp Asp Glu Asp Ser Leu Ser Ser Ala Tyr
325 330 335
Ile Gln Arg Val Asn Thr Glu Leu Met Lys Ala Ala Ala Arg Gly Leu
340 345 350
Thr Leu Leu Phe Ala Ser Gly Asp Ser Gly Ala Gly Cys Trp Ser Val
355 360 365
Ser Gly Arg His Gln Phe Arg Pro Thr Phe Pro Ala Ser Ser Pro Tyr
370 375 380
Val Thr Thr Val Gly Gly Thr Ser Phe Gln Glu Pro Phe Leu Ile Thr
385 390 395 400
Asn Glu Ile Val Asp Tyr Ile Ser Gly Gly Gly Phe Ser Asn Val Phe
405 410 415
Pro Arg Pro Ser Tyr Gln Glu Glu Ala Val Thr Lys Phe Leu Ser Ser
420 425 430
Ser Pro His Leu Pro Pro Ser Ser Tyr Phe Asn Ala Ser Gly Arg Ala
435 440 445
Tyr Pro Asp Val Ala Ala Leu Ser Asp Gly Tyr Trp Val Val Ser Asn
450 455 460
Arg Val Pro Ile Pro Trp Val Ser Gly Thr Ser Ala Ser Thr Pro Val
465 470 475 480
Phe Gly Gly Ile Leu Ser Leu Ile Asn Glu His Arg Ile Leu Ser Gly
485 490 495
Arg Pro Pro Leu Gly Phe Leu Asn Pro Arg Leu Tyr Gln Gln His Gly
500 505 510
Ala Gly Leu Phe Asp Val Thr Arg Gly Cys His Glu Ser Cys Leu Asp
515 520 525
Glu Glu Val Glu Gly Gln Gly Phe Cys Ser Gly Pro Gly Trp Asp Pro
530 535 540
Val Thr Gly Trp Gly Thr Pro Asn Phe Pro Ala Leu Leu Lys Thr Leu
545 550 555 560
Leu Asn Pro
<210> 88
<211> 612
<212> DNA
<213> Homo sapiens
<400> 88
atggcgtcgc ccggctgcct gtggctcttg gctgtggctc tcctgccatg gacctgcgct 60
tctcgggcgc tgcagcatct ggacccgccg gcgccgctgc cgttggtgat ctggcatggg 120
atgggtgttt ttggactccc tcgatgccca ggagagagct ctcacatctg tgacttcatc 180
cgaaaaacac tgaatgctgg ggcgtactcc aaagttgttc aggaacgcct cgtgcaagcc 240
gaatactggc atgaccccat aaaggaggat gtgtatcgca accacagcat cttcttggca 300
gatataaatc aggagcgggg tatcaatgag tcctacaaga aaaacctgat ggccctgaag 360
aagtttgtga tggtgaaatt cctcaatgat tccattgtgg accctgtaga ttcggagtgg 420
tttggatttt acagaagtgg ccaagccaag gaaaccattc ccttacagga gacctccctg 480
tacacacagg accgcctggg gctaaaggaa atggacaatg caggacagct agtgtttctg 540
gctacagaag gggaccatct tcagttgtct gaagaatggt tttatgccca catcatacca 600
ttccttggat ga 612
<210> 89
<211> 306
<212> PRT
<213> Homo sapiens
<400> 89
Met Ala Ser Pro Gly Cys Leu Trp Leu Leu Ala Val Ala Leu Leu Pro
1 5 10 15
Trp Thr Cys Ala Ser Arg Ala Leu Gln His Leu Asp Pro Pro Ala Pro
20 25 30
Leu Pro Leu Val Ile Trp His Gly Met Gly Asp Ser Cys Cys Asn Pro
35 40 45
Leu Ser Met Gly Ala Ile Lys Lys Met Val Glu Lys Lys Ile Pro Gly
50 55 60
Ile Tyr Val Leu Ser Leu Glu Ile Gly Lys Thr Leu Met Glu Asp Val
65 70 75 80
Glu Asn Ser Phe Phe Leu Asn Val Asn Ser Gln Val Thr Thr Val Cys
85 90 95
Gln Ala Leu Ala Lys Asp Pro Lys Leu Gln Gln Gly Tyr Asn Ala Met
100 105 110
Gly Phe Ser Gln Gly Gly Gln Phe Leu Arg Ala Val Ala Gln Arg Cys
115 120 125
Pro Ser Pro Pro Met Ile Asn Leu Ile Ser Val Gly Gly Gln His Gln
130 135 140
Gly Val Phe Gly Leu Pro Arg Cys Pro Gly Glu Ser Ser His Ile Cys
145 150 155 160
Asp Phe Ile Arg Lys Thr Leu Asn Ala Gly Ala Tyr Ser Lys Val Val
165 170 175
Gln Glu Arg Leu Val Gln Ala Glu Tyr Trp His Asp Pro Ile Lys Glu
180 185 190
Asp Val Tyr Arg Asn His Ser Ile Phe Leu Ala Asp Ile Asn Gln Glu
195 200 205
Arg Gly Ile Asn Glu Ser Tyr Lys Lys Asn Leu Met Ala Leu Lys Lys
210 215 220
Phe Val Met Val Lys Phe Leu Asn Asp Ser Ile Val Asp Pro Val Asp
225 230 235 240
Ser Glu Trp Phe Gly Phe Tyr Arg Ser Gly Gln Ala Lys Glu Thr Ile
245 250 255
Pro Leu Gln Glu Thr Ser Leu Tyr Thr Gln Asp Arg Leu Gly Leu Lys
260 265 270
Glu Met Asp Asn Ala Gly Gln Leu Val Phe Leu Ala Thr Glu Gly Asp
275 280 285
His Leu Gln Leu Ser Glu Glu Trp Phe Tyr Ala His Ile Ile Pro Phe
290 295 300
Leu Gly
305
<210> 90
<211> 1104
<212> DNA
<213> Homo sapiens
<400> 90
atgataagga attggctgac tatttttatc ctttttcccc tgaagctcgt agagaaatgt 60
gagtcaagcg tcagcctcac tgttcctcct gtcgtaaagc tggagaacgg cagctcgacc 120
aacgtcagcc tcaccctgcg gccaccatta aatgcaaccc tggtgatcac ttttgaaatc 180
acatttcgtt ccaaaaatat tactatcctt gagctccccg atgaagttgt ggtgcctcct 240
ggagtgacaa actcctcttt tcaagtgaca tctcaaaatg ttggacaact tactgtttat 300
ctacatggaa atcactccaa tcagaccggc ccgaggatac gctttcttgt gatccgcagc 360
agcgccatta gcatcataaa ccaggtgatt ggctggatct actttgtggc ctggtccatc 420
tccttctacc ctcaggtgat catgaattgg aggcggaaaa gtgtcattgg tctgagcttc 480
gacttcgtgg ctctgaacct gacgggcttc gtggcctaca gtgtattcaa catcggcctc 540
ctctgggtgc cctacatcaa ggagcagttt ctcctcaaat accccaacgg agtgaacccc 600
gtgaacagca acgacgtctt cttcagcctg cacgcggttg tcctcacgct gatcatcatc 660
gtgcagtgct gcctgtatga gcgcggtggc cagcgcgtgt cctggcctgc catcggcttc 720
ctggtgctcg cgtggctctt cgcatttgtc accatgatcg tggctgcagt gggagtgacc 780
acgtggctgc agtttctctt ctgcttctcc tacatcaagc tcgcagtcac gctggtcaag 840
tattttccac aggcctacat gaacttttac tacaaaagca ctgagggctg gagcattggc 900
aacgtgctcc tggacttcac cgggggcagc ttcagcctcc tgcagatgtt cctccagtcc 960
tacaacaacg accagtggac gctgatcttc ggagacccaa ccaagtttgg actcggggtc 1020
ttctccatcg tcttcgacgt cgtcttcttc atccagcact tctgtttgta cagaaagaga 1080
ccggggtatg accagctgaa ctag 1104
<210> 91
<211> 367
<212> PRT
<213> Homo sapiens
<400> 91
Met Ile Arg Asn Trp Leu Thr Ile Phe Ile Leu Phe Pro Leu Lys Leu
1 5 10 15
Val Glu Lys Cys Glu Ser Ser Val Ser Leu Thr Val Pro Pro Val Val
20 25 30
Lys Leu Glu Asn Gly Ser Ser Thr Asn Val Ser Leu Thr Leu Arg Pro
35 40 45
Pro Leu Asn Ala Thr Leu Val Ile Thr Phe Glu Ile Thr Phe Arg Ser
50 55 60
Lys Asn Ile Thr Ile Leu Glu Leu Pro Asp Glu Val Val Val Pro Pro
65 70 75 80
Gly Val Thr Asn Ser Ser Phe Gln Val Thr Ser Gln Asn Val Gly Gln
85 90 95
Leu Thr Val Tyr Leu His Gly Asn His Ser Asn Gln Thr Gly Pro Arg
100 105 110
Ile Arg Phe Leu Val Ile Arg Ser Ser Ala Ile Ser Ile Ile Asn Gln
115 120 125
Val Ile Gly Trp Ile Tyr Phe Val Ala Trp Ser Ile Ser Phe Tyr Pro
130 135 140
Gln Val Ile Met Asn Trp Arg Arg Lys Ser Val Ile Gly Leu Ser Phe
145 150 155 160
Asp Phe Val Ala Leu Asn Leu Thr Gly Phe Val Ala Tyr Ser Val Phe
165 170 175
Asn Ile Gly Leu Leu Trp Val Pro Tyr Ile Lys Glu Gln Phe Leu Leu
180 185 190
Lys Tyr Pro Asn Gly Val Asn Pro Val Asn Ser Asn Asp Val Phe Phe
195 200 205
Ser Leu His Ala Val Val Leu Thr Leu Ile Ile Ile Val Gln Cys Cys
210 215 220
Leu Tyr Glu Arg Gly Gly Gln Arg Val Ser Trp Pro Ala Ile Gly Phe
225 230 235 240
Leu Val Leu Ala Trp Leu Phe Ala Phe Val Thr Met Ile Val Ala Ala
245 250 255
Val Gly Val Thr Thr Trp Leu Gln Phe Leu Phe Cys Phe Ser Tyr Ile
260 265 270
Lys Leu Ala Val Thr Leu Val Lys Tyr Phe Pro Gln Ala Tyr Met Asn
275 280 285
Phe Tyr Tyr Lys Ser Thr Glu Gly Trp Ser Ile Gly Asn Val Leu Leu
290 295 300
Asp Phe Thr Gly Gly Ser Phe Ser Leu Leu Gln Met Phe Leu Gln Ser
305 310 315 320
Tyr Asn Asn Asp Gln Trp Thr Leu Ile Phe Gly Asp Pro Thr Lys Phe
325 330 335
Gly Leu Gly Val Phe Ser Ile Val Phe Asp Val Val Phe Phe Ile Gln
340 345 350
His Phe Cys Leu Tyr Arg Lys Arg Pro Gly Tyr Asp Gln Leu Asn
355 360 365
<210> 92
<211> 1674
<212> DNA
<213> Homo sapiens
<400> 92
atggcggcgg gggcggagtc ggcgcggccg cctctgggcg ggaccgcggg gactagacgt 60
ggccgcgggg cggtgtcatc gcccccgccc cgcccggtcc agccagctcg gcccgggggc 120
ttcgggctgt cgggccggcg ctcccttctc tgccaggtgg cgagtacacc tgctcacgta 180
ggcgtcatga ggtctccggt tcgagacctg gcccggaacg atggcgagga gagcacggac 240
cgcacgcctc ttctaccggg cgccccacgg gccgaagccg ctccagtgtg ctgctctgct 300
cgttacaact tagcaatttt ggcctttttt ggtttcttca ttgtgtatgc attacgtgtg 360
aatctgagtg ttgcgttagt ggatatggta gattcaaata caactttaga agataataga 420
acttccaagg cgtgtccaga gcattctgct cccataaaag ttcatcataa tcaaacgggt 480
aagaagtacc aatgggatgc agaaactcaa ggatggattc tcggttcctt tttttatggc 540
tacatcatca cacagattcc tggaggatat gttgccagca aaataggggg gaaaatgctg 600
ctaggatttg ggatccttgg cactgctgtc ctcaccctgt tcactcccat tgctgcagat 660
ttaggagttg gaccactcat tgtactcaga gcactagaag gactaggaga gggtgttaca 720
tttccagcca tgcatgccat gtggtcttct tgggctcccc ctcttgaaag aagcaaactt 780
cttagcattt catatgcagg agcacagctt gggacagtaa tttctcttcc tctttctgga 840
ataatttgct actatatgaa ttggacttat gtcttctact tttttggtac tattggaata 900
ttttggtttc ttttgtggat ctggttagtt agtgacacac cacaaaaaca caagagaatt 960
tcccattatg aaaaggaata cattctttca tcattaagaa atcagctttc ttcacagaag 1020
tcagtgccgt gggtacccat tttaaaatcc ctgccacttt gggctatcgt agttgcacac 1080
ttttcttaca actggacttt ttatacttta ttgacattat tgcctactta tatgaaggag 1140
atcctaaggt tcaatgttca agagaatggg tttttatctt cattgcctta tttaggctct 1200
tggttatgta tgatcctgtc tggtcaagct gctgacaatt taagggcaaa atggaatttt 1260
tcaactttat gtgttcgcag aatttttagc cttataggaa tgattggacc tgcagtattc 1320
ctggtagctg ctggcttcat tggctgtgat tattctttgg ccgttgcttt cctaactata 1380
tcaacaacac tgggaggctt ttgctcttct ggatttagca tcaaccatct ggatattgct 1440
ccttcgtatg ctggtatcct cctgggcatc acaaatacat ttgccactat tccaggaatg 1500
gttgggcccg tcattgctaa aagtctgacc cctgataaca ctgttggaga atggcaaacc 1560
gtgttctata ttgctgctgc tattaatgtt tttggtgcca ttttctttac actattcgcc 1620
aaaggtgaag tacaaaactg ggctctcaat gatcaccatg gacacagaca ctga 1674
<210> 93
<211> 557
<212> PRT
<213> Homo sapiens
<400> 93
Met Ala Ala Gly Ala Glu Ser Ala Arg Pro Pro Leu Gly Gly Thr Ala
1 5 10 15
Gly Thr Arg Arg Gly Arg Gly Ala Val Ser Ser Pro Pro Pro Arg Pro
20 25 30
Val Gln Pro Ala Arg Pro Gly Gly Phe Gly Leu Ser Gly Arg Arg Ser
35 40 45
Leu Leu Cys Gln Val Ala Ser Thr Pro Ala His Val Gly Val Met Arg
50 55 60
Ser Pro Val Arg Asp Leu Ala Arg Asn Asp Gly Glu Glu Ser Thr Asp
65 70 75 80
Arg Thr Pro Leu Leu Pro Gly Ala Pro Arg Ala Glu Ala Ala Pro Val
85 90 95
Cys Cys Ser Ala Arg Tyr Asn Leu Ala Ile Leu Ala Phe Phe Gly Phe
100 105 110
Phe Ile Val Tyr Ala Leu Arg Val Asn Leu Ser Val Ala Leu Val Asp
115 120 125
Met Val Asp Ser Asn Thr Thr Leu Glu Asp Asn Arg Thr Ser Lys Ala
130 135 140
Cys Pro Glu His Ser Ala Pro Ile Lys Val His His Asn Gln Thr Gly
145 150 155 160
Lys Lys Tyr Gln Trp Asp Ala Glu Thr Gln Gly Trp Ile Leu Gly Ser
165 170 175
Phe Phe Tyr Gly Tyr Ile Ile Thr Gln Ile Pro Gly Gly Tyr Val Ala
180 185 190
Ser Lys Ile Gly Gly Lys Met Leu Leu Gly Phe Gly Ile Leu Gly Thr
195 200 205
Ala Val Leu Thr Leu Phe Thr Pro Ile Ala Ala Asp Leu Gly Val Gly
210 215 220
Pro Leu Ile Val Leu Arg Ala Leu Glu Gly Leu Gly Glu Gly Val Thr
225 230 235 240
Phe Pro Ala Met His Ala Met Trp Ser Ser Trp Ala Pro Pro Leu Glu
245 250 255
Arg Ser Lys Leu Leu Ser Ile Ser Tyr Ala Gly Ala Gln Leu Gly Thr
260 265 270
Val Ile Ser Leu Pro Leu Ser Gly Ile Ile Cys Tyr Tyr Met Asn Trp
275 280 285
Thr Tyr Val Phe Tyr Phe Phe Gly Thr Ile Gly Ile Phe Trp Phe Leu
290 295 300
Leu Trp Ile Trp Leu Val Ser Asp Thr Pro Gln Lys His Lys Arg Ile
305 310 315 320
Ser His Tyr Glu Lys Glu Tyr Ile Leu Ser Ser Leu Arg Asn Gln Leu
325 330 335
Ser Ser Gln Lys Ser Val Pro Trp Val Pro Ile Leu Lys Ser Leu Pro
340 345 350
Leu Trp Ala Ile Val Val Ala His Phe Ser Tyr Asn Trp Thr Phe Tyr
355 360 365
Thr Leu Leu Thr Leu Leu Pro Thr Tyr Met Lys Glu Ile Leu Arg Phe
370 375 380
Asn Val Gln Glu Asn Gly Phe Leu Ser Ser Leu Pro Tyr Leu Gly Ser
385 390 395 400
Trp Leu Cys Met Ile Leu Ser Gly Gln Ala Ala Asp Asn Leu Arg Ala
405 410 415
Lys Trp Asn Phe Ser Thr Leu Cys Val Arg Arg Ile Phe Ser Leu Ile
420 425 430
Gly Met Ile Gly Pro Ala Val Phe Leu Val Ala Ala Gly Phe Ile Gly
435 440 445
Cys Asp Tyr Ser Leu Ala Val Ala Phe Leu Thr Ile Ser Thr Thr Leu
450 455 460
Gly Gly Phe Cys Ser Ser Gly Phe Ser Ile Asn His Leu Asp Ile Ala
465 470 475 480
Pro Ser Tyr Ala Gly Ile Leu Leu Gly Ile Thr Asn Thr Phe Ala Thr
485 490 495
Ile Pro Gly Met Val Gly Pro Val Ile Ala Lys Ser Leu Thr Pro Asp
500 505 510
Asn Thr Val Gly Glu Trp Gln Thr Val Phe Tyr Ile Ala Ala Ala Ile
515 520 525
Asn Val Phe Gly Ala Ile Phe Phe Thr Leu Phe Ala Lys Gly Glu Val
530 535 540
Gln Asn Trp Ala Leu Asn Asp His His Gly His Arg His
545 550 555
<210> 94
<211> 996
<212> DNA
<213> Homo sapiens
<400> 94
atgcgcccgg ccttggcggt gggcctggtg ttcgcaggct gctgcagtaa cgtgatcttc 60
ctagagctcc tggcccggaa gcatccagga tgtgggaaca ttgtgacatt tgcacaattt 120
ttatttattg ctgtggaagg cttcctcttt gaagctgatt tgggaaggaa gccaccagct 180
atcccaataa ggtactatgc cataatggtg accatgttct tcaccgtgag cgtggtgaac 240
aactatgccc tgaatctcaa cattgccatg cccctgcata tgatatttag atccggttct 300
ctaattgcca acatgattct aggaattatc attttgaaga aaagatacag tatattcaaa 360
tatacctcca ttgccctggt gtctgtgggg atatttattt gcacttttat gtcagcaaag 420
caggtgactt cccagtccag cttgagtgag aatgatggat tccaggcatt tgtgtggtgg 480
ttactaggta ttggggcatt gacttttgct cttctgatgt cagcaaggat ggggatattc 540
caagagactc tctacaaacg atttgggaaa cactccaagg aggctttgtt ttataatcac 600
gcccttccac ttccgggttt cgtcttcttg gcttctgata tttatgacca tgcagttcta 660
ttcaataagt ctgagttata tgaaattccc gtcatcggag tgaccctgcc catcatgtgg 720
ttctacctcc tcatgaacat catcactcag tacgtgtgca tccggggtgt gtttatcctc 780
accacagaat gcgcctccct caccgtcacg ctcgtcgtga ccctacgcaa atttgtgagc 840
ctcatctttt ccatcttgta cttccagaac cccttcaccc tgtggcactg gctgggcacc 900
ttgtttgtct tcattgggac cttaatgtac acagaggtgt ggaacaacct agggaccaca 960
aaaagtgagc ctcagaagga cagcaagaag aactga 996
<210> 95
<211> 331
<212> PRT
<213> Homo sapiens
<400> 95
Met Arg Pro Ala Leu Ala Val Gly Leu Val Phe Ala Gly Cys Cys Ser
1 5 10 15
Asn Val Ile Phe Leu Glu Leu Leu Ala Arg Lys His Pro Gly Cys Gly
20 25 30
Asn Ile Val Thr Phe Ala Gln Phe Leu Phe Ile Ala Val Glu Gly Phe
35 40 45
Leu Phe Glu Ala Asp Leu Gly Arg Lys Pro Pro Ala Ile Pro Ile Arg
50 55 60
Tyr Tyr Ala Ile Met Val Thr Met Phe Phe Thr Val Ser Val Val Asn
65 70 75 80
Asn Tyr Ala Leu Asn Leu Asn Ile Ala Met Pro Leu His Met Ile Phe
85 90 95
Arg Ser Gly Ser Leu Ile Ala Asn Met Ile Leu Gly Ile Ile Ile Leu
100 105 110
Lys Lys Arg Tyr Ser Ile Phe Lys Tyr Thr Ser Ile Ala Leu Val Ser
115 120 125
Val Gly Ile Phe Ile Cys Thr Phe Met Ser Ala Lys Gln Val Thr Ser
130 135 140
Gln Ser Ser Leu Ser Glu Asn Asp Gly Phe Gln Ala Phe Val Trp Trp
145 150 155 160
Leu Leu Gly Ile Gly Ala Leu Thr Phe Ala Leu Leu Met Ser Ala Arg
165 170 175
Met Gly Ile Phe Gln Glu Thr Leu Tyr Lys Arg Phe Gly Lys His Ser
180 185 190
Lys Glu Ala Leu Phe Tyr Asn His Ala Leu Pro Leu Pro Gly Phe Val
195 200 205
Phe Leu Ala Ser Asp Ile Tyr Asp His Ala Val Leu Phe Asn Lys Ser
210 215 220
Glu Leu Tyr Glu Ile Pro Val Ile Gly Val Thr Leu Pro Ile Met Trp
225 230 235 240
Phe Tyr Leu Leu Met Asn Ile Ile Thr Gln Tyr Val Cys Ile Arg Gly
245 250 255
Val Phe Ile Leu Thr Thr Glu Cys Ala Ser Leu Thr Val Thr Leu Val
260 265 270
Val Thr Leu Arg Lys Phe Val Ser Leu Ile Phe Ser Ile Leu Tyr Phe
275 280 285
Gln Asn Pro Phe Thr Leu Trp His Trp Leu Gly Thr Leu Phe Val Phe
290 295 300
Ile Gly Thr Leu Met Tyr Thr Glu Val Trp Asn Asn Leu Gly Thr Thr
305 310 315 320
Lys Ser Glu Pro Gln Lys Asp Ser Lys Lys Asn
325 330
<210> 96
<211> 918
<212> DNA
<213> Homo sapiens
<400> 96
atggcggcgg ggctggcgcg gctcctgttg ctcctcgggc tctcggccgg cgggcccgcg 60
ccggcaggtg cagcgaagat gaaggtggtg gaggagccca acgcgtttgg ggtgaacaac 120
ccgttcttgc ctcaggccag tcgcctccag gccaagaggg atccttcacc cgtgtctgga 180
cccgtgcatc tcttccgact ctcgggcaag tgcttcagcc tggtggagtc cacgtacaag 240
tatgagttct gcccgttcca caacgtgacc cagcacgagc agaccttccg ctggaacgcc 300
tacagtggga tcctcggcat ctggcacgag tgggagatcg ccaacaacac cttcacgggc 360
atgtggatga gggacggtga cgcctgccgt tcccggagcc ggcagagcaa ggtggagctg 420
gcgtgtggaa aaagcaaccg gctggcccat gtgtccgagc cgagcacctg cgtctacgcg 480
ctgacgttcg agacccccct cgtctgccac ccccacgcct tgctagtgta cccaaccctg 540
ccagaggccc tgcagcggca gtgggaccag gtagagcagg acctggccga tgagctgatc 600
accccccagg gccatgagaa gttgctgagg acactttttg aggatgctgg ctacttaaag 660
accccagaag aaaatgaacc cacccagctg gagggaggtc ctgacagctt ggggtttgag 720
accctggaaa actgcaggaa ggctcataaa gaactctcaa aggagatcaa aaggctgaaa 780
ggtttgctca cccagcacgg catcccctac acgaggccca cagaaacttc caacttggag 840
cacttgggcc acgagacgcc cagagccaag tctccagagc agctgcgggg tgacccagga 900
ctgcgtggga gtttgtga 918
<210> 97
<211> 305
<212> PRT
<213> Homo sapiens
<400> 97
Met Ala Ala Gly Leu Ala Arg Leu Leu Leu Leu Leu Gly Leu Ser Ala
1 5 10 15
Gly Gly Pro Ala Pro Ala Gly Ala Ala Lys Met Lys Val Val Glu Glu
20 25 30
Pro Asn Ala Phe Gly Val Asn Asn Pro Phe Leu Pro Gln Ala Ser Arg
35 40 45
Leu Gln Ala Lys Arg Asp Pro Ser Pro Val Ser Gly Pro Val His Leu
50 55 60
Phe Arg Leu Ser Gly Lys Cys Phe Ser Leu Val Glu Ser Thr Tyr Lys
65 70 75 80
Tyr Glu Phe Cys Pro Phe His Asn Val Thr Gln His Glu Gln Thr Phe
85 90 95
Arg Trp Asn Ala Tyr Ser Gly Ile Leu Gly Ile Trp His Glu Trp Glu
100 105 110
Ile Ala Asn Asn Thr Phe Thr Gly Met Trp Met Arg Asp Gly Asp Ala
115 120 125
Cys Arg Ser Arg Ser Arg Gln Ser Lys Val Glu Leu Ala Cys Gly Lys
130 135 140
Ser Asn Arg Leu Ala His Val Ser Glu Pro Ser Thr Cys Val Tyr Ala
145 150 155 160
Leu Thr Phe Glu Thr Pro Leu Val Cys His Pro His Ala Leu Leu Val
165 170 175
Tyr Pro Thr Leu Pro Glu Ala Leu Gln Arg Gln Trp Asp Gln Val Glu
180 185 190
Gln Asp Leu Ala Asp Glu Leu Ile Thr Pro Gln Gly His Glu Lys Leu
195 200 205
Leu Arg Thr Leu Phe Glu Asp Ala Gly Tyr Leu Lys Thr Pro Glu Glu
210 215 220
Asn Glu Pro Thr Gln Leu Glu Gly Gly Pro Asp Ser Leu Gly Phe Glu
225 230 235 240
Thr Leu Glu Asn Cys Arg Lys Ala His Lys Glu Leu Ser Lys Glu Ile
245 250 255
Lys Arg Leu Lys Gly Leu Leu Thr Gln His Gly Ile Pro Tyr Thr Arg
260 265 270
Pro Thr Glu Thr Ser Asn Leu Glu His Leu Gly His Glu Thr Pro Arg
275 280 285
Ala Lys Ser Pro Glu Gln Leu Arg Gly Asp Pro Gly Leu Arg Gly Ser
290 295 300
Leu
305
<210> 98
<211> 3771
<212> DNA
<213> Homo sapiens
<400> 98
atgctgttca agctcctgca gagacagacc tatacctgcc tgtcccacag gtatgggctc 60
tacgtgtgct tcttgggcgt cgttgtcacc atcgtctccg ccttccagtt cggagaggtg 120
gttctggaat ggagccgaga tcaataccat gttttgtttg attcctatag agacaatatt 180
gctggaaagt cctttcagaa tcggctttgt ctgcccatgc cgattgacgt tgtttacacc 240
tgggtgaatg gcacagatct tgaactactg aaggaactac agcaggtcag agaacagatg 300
gaggaggagc agaaagcaat gagagaaatc cttgggaaaa acacaacgga acctactaag 360
aagagtgaga agcagttaga gtgtttgcta acacactgca ttaaggtgcc aatgcttgtc 420
ctggacccag ccctgccagc caacatcacc ctgaaggacc tgccatctct ttatccttct 480
tttcattctg ccagtgacat tttcaatgtt gcaaaaccaa aaaacccttc taccaatgtc 540
tcagttgttg tttttgacag tactaaggat gttgaagatg cccactctgg actgcttaaa 600
ggaaatagca gacagacagt atggaggggc tacttgacaa cagataaaga agtccctgga 660
ttagtgctaa tgcaagattt ggctttcctg agtggatttc caccaacatt caaggaaaca 720
aatcaactaa aaacaaaatt gccagaaaat ctttcctcta aagtcaaact gttgcagttg 780
tattcagagg ccagtgtagc gcttctaaaa ctgaataacc ccaaggattt tcaagaattg 840
aataagcaaa ctaagaagaa catgaccatt gatggaaaag aactgaccat aagtcctgca 900
tatttattat gggatctgag cgccatcagc cagtctaagc aggatgaaga catctctgcc 960
agtcgttttg aagataacga agaactgagg tactcattgc gatctatcga gaggcatgca 1020
ccatgggttc ggaatatttt cattgtcacc aacgggcaga ttccatcctg gctgaacctt 1080
gacaatcctc gagtgacaat agtaacacac caggatgttt ttcgaaattt gagccacttg 1140
cctaccttta gttcacctgc tattgaaagt cacattcatc gcatcgaagg gctgtcccag 1200
aagtttattt acctaaatga tgatgtcatg tttgggaagg atgtctggcc agatgatttt 1260
tacagtcact ccaaaggcca gaaggtttat ttgacatggc ctgtgccaaa ctgtgccgag 1320
ggctgcccag gttcctggat taaggatggc tattgtgaca aggcttgtaa taattcagcc 1380
tgcgattggg atggtgggga ttgctctgga aacagtggag ggagtcgcta tattgcagga 1440
ggtggaggta ctgggagtat tggagttgga cagccctggc agtttggtgg aggaataaac 1500
agtgtctctt actgtaatca gggatgtgcg aattcctggc tcgctgataa gttctgtgac 1560
caagcatgca atgtcttgtc ctgtgggttt gatgctggcg actgtgggca agatcatttt 1620
catgaattgt ataaagtgat ccttctccca aaccagactc actatattat tccaaaaggt 1680
gaatgcctgc cttatttcag ctttgcagaa gtagccaaaa gaggagttga aggtgcctat 1740
agtgacaatc caataattcg acatgcttct attgccaaca agtggaaaac catccacctc 1800
ataatgcaca gtggaatgaa tgccaccaca atacatttta atctcacgtt tcaaaataca 1860
aacgatgaag agttcaaaat gcagataaca gtggaggtgg acacaaggga gggaccaaaa 1920
ctgaattcta cagcccagaa gggttacgaa aatttagtta gtcccataac acttcttcca 1980
gaggcggaaa tcctttttga ggatattccc aaagaaaaac gcttcccgaa gtttaagaga 2040
catgatgtta actcaacaag gagagcccag gaagaggtga aaattcccct ggtaaatatt 2100
tcactccttc caaaagacgc ccagttgagt ctcaatacct tggatttgca actggaacat 2160
ggagacatca ctttgaaagg atacaatttg tccaagtcag ccttgctgag atcatttctg 2220
atgaactcac agcatgctaa aataaaaaat caagctataa taacagatga aacaaatgac 2280
agtttggtgg ctccacagga aaaacaggtt cataaaagca tcttgccaaa cagcttagga 2340
gtgtctgaaa gattgcagag gttgactttt cctgcagtga gtgtaaaagt gaatggtcat 2400
gaccagggtc agaatccacc cctggacttg gagaccacag caagatttag agtggaaact 2460
cacacccaaa aaaccatagg cggaaatgtg acaaaagaaa agcccccatc tctgattgtt 2520
ccactggaaa gccagatgac aaaagaaaag aaaatcacag ggaaagaaaa agagaacagt 2580
agaatggagg aaaatgctga aaatcacata ggcgttactg aagtgttact tggaagaaag 2640
ctgcagcatt acacagatag ttacttgggc tttttgccat gggagaaaaa aaagtatttc 2700
caagatcttc tcgacgaaga agagtcattg aagacacaat tggcatactt cactgatagc 2760
aaaaatactg ggaggcaact aaaagataca tttgcagatt ccctcagata tgtaaataaa 2820
attctaaata gcaagtttgg attcacatcg cggaaagtcc ctgctcacat gcctcacatg 2880
attgaccgga ttgttatgca agaactgcaa gatatgttcc ctgaagaatt tgacaagacg 2940
tcatttcaca aagtgcgcca ttctgaggat atgcagtttg ccttctctta tttttattat 3000
ctcatgagtg cagtgcagcc actgaatata tctcaagtct ttgatgaagt tgatacagat 3060
caatctggtg tcttgtctga cagagaaatc cgaacactgg ctaccagaat tcacgaactg 3120
ccgttaagtt tgcaggattt gacaggtctg gaacacatgc taataaattg ctcaaaaatg 3180
cttcctgctg atatcacgca gctaaataat attccaccaa ctcaggaatc ctactatgat 3240
cccaacctgc caccggtcac taaaagtcta gtaacaaact gtaaaccagt aactgacaaa 3300
atccacaaag catataagga caaaaacaaa tataggtttg aaatcatggg agaagaagaa 3360
atcgctttta aaatgattcg taccaacgtt tctcatgtgg ttggccagtt ggatgacata 3420
agaaaaaacc ctaggaagtt tgtttgcctg aatgacaaca ttgaccacaa tcataaagat 3480
gctcagacag tgaaggctgt tctcagggac ttctatgaat ccatgttccc cataccttcc 3540
caatttgaac tgccaagaga gtatcgaaac cgtttccttc atatgcatga gctgcaggaa 3600
tggagggctt atcgagacaa attgaagttt tggacccatt gtgtactagc aacattgatt 3660
atgtttacta tattctcatt ttttgctgag cagttaattg cacttaagcg gaagatattt 3720
cccagaagga ggatacacaa agaagctagt cccaatcgaa tcagagtata g 3771
<210> 99
<211> 1256
<212> PRT
<213> Homo sapiens
<400> 99
Met Leu Phe Lys Leu Leu Gln Arg Gln Thr Tyr Thr Cys Leu Ser His
1 5 10 15
Arg Tyr Gly Leu Tyr Val Cys Phe Leu Gly Val Val Val Thr Ile Val
20 25 30
Ser Ala Phe Gln Phe Gly Glu Val Val Leu Glu Trp Ser Arg Asp Gln
35 40 45
Tyr His Val Leu Phe Asp Ser Tyr Arg Asp Asn Ile Ala Gly Lys Ser
50 55 60
Phe Gln Asn Arg Leu Cys Leu Pro Met Pro Ile Asp Val Val Tyr Thr
65 70 75 80
Trp Val Asn Gly Thr Asp Leu Glu Leu Leu Lys Glu Leu Gln Gln Val
85 90 95
Arg Glu Gln Met Glu Glu Glu Gln Lys Ala Met Arg Glu Ile Leu Gly
100 105 110
Lys Asn Thr Thr Glu Pro Thr Lys Lys Ser Glu Lys Gln Leu Glu Cys
115 120 125
Leu Leu Thr His Cys Ile Lys Val Pro Met Leu Val Leu Asp Pro Ala
130 135 140
Leu Pro Ala Asn Ile Thr Leu Lys Asp Leu Pro Ser Leu Tyr Pro Ser
145 150 155 160
Phe His Ser Ala Ser Asp Ile Phe Asn Val Ala Lys Pro Lys Asn Pro
165 170 175
Ser Thr Asn Val Ser Val Val Val Phe Asp Ser Thr Lys Asp Val Glu
180 185 190
Asp Ala His Ser Gly Leu Leu Lys Gly Asn Ser Arg Gln Thr Val Trp
195 200 205
Arg Gly Tyr Leu Thr Thr Asp Lys Glu Val Pro Gly Leu Val Leu Met
210 215 220
Gln Asp Leu Ala Phe Leu Ser Gly Phe Pro Pro Thr Phe Lys Glu Thr
225 230 235 240
Asn Gln Leu Lys Thr Lys Leu Pro Glu Asn Leu Ser Ser Lys Val Lys
245 250 255
Leu Leu Gln Leu Tyr Ser Glu Ala Ser Val Ala Leu Leu Lys Leu Asn
260 265 270
Asn Pro Lys Asp Phe Gln Glu Leu Asn Lys Gln Thr Lys Lys Asn Met
275 280 285
Thr Ile Asp Gly Lys Glu Leu Thr Ile Ser Pro Ala Tyr Leu Leu Trp
290 295 300
Asp Leu Ser Ala Ile Ser Gln Ser Lys Gln Asp Glu Asp Ile Ser Ala
305 310 315 320
Ser Arg Phe Glu Asp Asn Glu Glu Leu Arg Tyr Ser Leu Arg Ser Ile
325 330 335
Glu Arg His Ala Pro Trp Val Arg Asn Ile Phe Ile Val Thr Asn Gly
340 345 350
Gln Ile Pro Ser Trp Leu Asn Leu Asp Asn Pro Arg Val Thr Ile Val
355 360 365
Thr His Gln Asp Val Phe Arg Asn Leu Ser His Leu Pro Thr Phe Ser
370 375 380
Ser Pro Ala Ile Glu Ser His Ile His Arg Ile Glu Gly Leu Ser Gln
385 390 395 400
Lys Phe Ile Tyr Leu Asn Asp Asp Val Met Phe Gly Lys Asp Val Trp
405 410 415
Pro Asp Asp Phe Tyr Ser His Ser Lys Gly Gln Lys Val Tyr Leu Thr
420 425 430
Trp Pro Val Pro Asn Cys Ala Glu Gly Cys Pro Gly Ser Trp Ile Lys
435 440 445
Asp Gly Tyr Cys Asp Lys Ala Cys Asn Asn Ser Ala Cys Asp Trp Asp
450 455 460
Gly Gly Asp Cys Ser Gly Asn Ser Gly Gly Ser Arg Tyr Ile Ala Gly
465 470 475 480
Gly Gly Gly Thr Gly Ser Ile Gly Val Gly Gln Pro Trp Gln Phe Gly
485 490 495
Gly Gly Ile Asn Ser Val Ser Tyr Cys Asn Gln Gly Cys Ala Asn Ser
500 505 510
Trp Leu Ala Asp Lys Phe Cys Asp Gln Ala Cys Asn Val Leu Ser Cys
515 520 525
Gly Phe Asp Ala Gly Asp Cys Gly Gln Asp His Phe His Glu Leu Tyr
530 535 540
Lys Val Ile Leu Leu Pro Asn Gln Thr His Tyr Ile Ile Pro Lys Gly
545 550 555 560
Glu Cys Leu Pro Tyr Phe Ser Phe Ala Glu Val Ala Lys Arg Gly Val
565 570 575
Glu Gly Ala Tyr Ser Asp Asn Pro Ile Ile Arg His Ala Ser Ile Ala
580 585 590
Asn Lys Trp Lys Thr Ile His Leu Ile Met His Ser Gly Met Asn Ala
595 600 605
Thr Thr Ile His Phe Asn Leu Thr Phe Gln Asn Thr Asn Asp Glu Glu
610 615 620
Phe Lys Met Gln Ile Thr Val Glu Val Asp Thr Arg Glu Gly Pro Lys
625 630 635 640
Leu Asn Ser Thr Ala Gln Lys Gly Tyr Glu Asn Leu Val Ser Pro Ile
645 650 655
Thr Leu Leu Pro Glu Ala Glu Ile Leu Phe Glu Asp Ile Pro Lys Glu
660 665 670
Lys Arg Phe Pro Lys Phe Lys Arg His Asp Val Asn Ser Thr Arg Arg
675 680 685
Ala Gln Glu Glu Val Lys Ile Pro Leu Val Asn Ile Ser Leu Leu Pro
690 695 700
Lys Asp Ala Gln Leu Ser Leu Asn Thr Leu Asp Leu Gln Leu Glu His
705 710 715 720
Gly Asp Ile Thr Leu Lys Gly Tyr Asn Leu Ser Lys Ser Ala Leu Leu
725 730 735
Arg Ser Phe Leu Met Asn Ser Gln His Ala Lys Ile Lys Asn Gln Ala
740 745 750
Ile Ile Thr Asp Glu Thr Asn Asp Ser Leu Val Ala Pro Gln Glu Lys
755 760 765
Gln Val His Lys Ser Ile Leu Pro Asn Ser Leu Gly Val Ser Glu Arg
770 775 780
Leu Gln Arg Leu Thr Phe Pro Ala Val Ser Val Lys Val Asn Gly His
785 790 795 800
Asp Gln Gly Gln Asn Pro Pro Leu Asp Leu Glu Thr Thr Ala Arg Phe
805 810 815
Arg Val Glu Thr His Thr Gln Lys Thr Ile Gly Gly Asn Val Thr Lys
820 825 830
Glu Lys Pro Pro Ser Leu Ile Val Pro Leu Glu Ser Gln Met Thr Lys
835 840 845
Glu Lys Lys Ile Thr Gly Lys Glu Lys Glu Asn Ser Arg Met Glu Glu
850 855 860
Asn Ala Glu Asn His Ile Gly Val Thr Glu Val Leu Leu Gly Arg Lys
865 870 875 880
Leu Gln His Tyr Thr Asp Ser Tyr Leu Gly Phe Leu Pro Trp Glu Lys
885 890 895
Lys Lys Tyr Phe Gln Asp Leu Leu Asp Glu Glu Glu Ser Leu Lys Thr
900 905 910
Gln Leu Ala Tyr Phe Thr Asp Ser Lys Asn Thr Gly Arg Gln Leu Lys
915 920 925
Asp Thr Phe Ala Asp Ser Leu Arg Tyr Val Asn Lys Ile Leu Asn Ser
930 935 940
Lys Phe Gly Phe Thr Ser Arg Lys Val Pro Ala His Met Pro His Met
945 950 955 960
Ile Asp Arg Ile Val Met Gln Glu Leu Gln Asp Met Phe Pro Glu Glu
965 970 975
Phe Asp Lys Thr Ser Phe His Lys Val Arg His Ser Glu Asp Met Gln
980 985 990
Phe Ala Phe Ser Tyr Phe Tyr Tyr Leu Met Ser Ala Val Gln Pro Leu
995 1000 1005
Asn Ile Ser Gln Val Phe Asp Glu Val Asp Thr Asp Gln Ser Gly
1010 1015 1020
Val Leu Ser Asp Arg Glu Ile Arg Thr Leu Ala Thr Arg Ile His
1025 1030 1035
Glu Leu Pro Leu Ser Leu Gln Asp Leu Thr Gly Leu Glu His Met
1040 1045 1050
Leu Ile Asn Cys Ser Lys Met Leu Pro Ala Asp Ile Thr Gln Leu
1055 1060 1065
Asn Asn Ile Pro Pro Thr Gln Glu Ser Tyr Tyr Asp Pro Asn Leu
1070 1075 1080
Pro Pro Val Thr Lys Ser Leu Val Thr Asn Cys Lys Pro Val Thr
1085 1090 1095
Asp Lys Ile His Lys Ala Tyr Lys Asp Lys Asn Lys Tyr Arg Phe
1100 1105 1110
Glu Ile Met Gly Glu Glu Glu Ile Ala Phe Lys Met Ile Arg Thr
1115 1120 1125
Asn Val Ser His Val Val Gly Gln Leu Asp Asp Ile Arg Lys Asn
1130 1135 1140
Pro Arg Lys Phe Val Cys Leu Asn Asp Asn Ile Asp His Asn His
1145 1150 1155
Lys Asp Ala Gln Thr Val Lys Ala Val Leu Arg Asp Phe Tyr Glu
1160 1165 1170
Ser Met Phe Pro Ile Pro Ser Gln Phe Glu Leu Pro Arg Glu Tyr
1175 1180 1185
Arg Asn Arg Phe Leu His Met His Glu Leu Gln Glu Trp Arg Ala
1190 1195 1200
Tyr Arg Asp Lys Leu Lys Phe Trp Thr His Cys Val Leu Ala Thr
1205 1210 1215
Leu Ile Met Phe Thr Ile Phe Ser Phe Phe Ala Glu Gln Leu Ile
1220 1225 1230
Ala Leu Lys Arg Lys Ile Phe Pro Arg Arg Arg Ile His Lys Glu
1235 1240 1245
Ala Ser Pro Asn Arg Ile Arg Val
1250 1255
<210> 100
<211> 1743
<212> DNA
<213> Homo sapiens
<400> 100
atgacagccc cggcgggtcc gcgcggctca gagaccgagc ggcttctgac ccccaacccc 60
gggtatggga cccaggcggg gccttcaccg gcccctccga cacccccaga agaggaagac 120
cttcgccgtc gtctcaaata ctttttcatg agtccctgcg acaagtttcg agccaagggc 180
cgcaagccct gcaagctgat gctgcaagtg gtcaagatcc tggtggtcac ggtgcagctc 240
atcctgtttg ggctcagtaa tcagctggct gtgacattcc gggaagagaa caccatcgcc 300
ttccgacacc tcttcctgct gggctactcg gacggagcgg atgacacctt cgcagcctac 360
acgcgggagc agctgtacca ggccatcttc catgctgtgg accagtacct ggcgttgcct 420
gacgtgtcac tgggccggta tgcgtatgtc cgtggtgggg gtgacccttg gaccaatggc 480
tcagggcttg ctctctgcca gcggtactac caccgaggcc acgtggaccc ggccaacgac 540
acatttgaca ttgatccgat ggtggttact gactgcatcc aggtggatcc ccccgagcgg 600
ccccctccgc cccccagcga cgatctcacc ctcttggaaa gcagctccag ttacaagaac 660
ctcacgctca aattccacaa gctggtcaat gtcaccatcc acttccggct gaagaccatt 720
aacctccaga gcctcatcaa taatgagatc ccggactgct ataccttcag cgtcctgatc 780
acgtttgaca acaaagcaca cagtgggcgg atccccatca gcctggagac ccaggcccac 840
atccaggagt gtaagcaccc cagtgtcttc cagcacggag acaacagctt ccggctcctg 900
tttgacgtgg tggtcatcct cacctgctcc ctgtccttcc tcctctgcgc ccgctcactc 960
cttcgaggct tcctgctgca gaacgagttt gtggggttca tgtggcggca gcggggacgg 1020
gtcatcagcc tgtgggagcg gctggaattt gtcaatggct ggtacatcct gctcgtcacc 1080
agcgatgtgc tcaccatctc gggcaccatc atgaagatcg gcatcgaggc caagaacttg 1140
gcgagctacg acgtctgcag catcctcctg ggcacctcga cgctgctggt gtgggtgggc 1200
gtgatccgct acctgacctt cttccacaac tacaatatcc tcatcgccac actgcgggtg 1260
gccctgccca gcgtcatgcg cttctgctgc tgcgtggctg tcatctacct gggctactgc 1320
ttctgtggct ggatcgtgct ggggccctat catgtgaagt tccgctcact ctccatggtg 1380
tctgagtgcc tgttctcgct catcaatggg gacgacatgt ttgtgacgtt cgccgccatg 1440
caggcgcagc agggccgcag cagcctggtg tggctcttct cccagctcta cctttactcc 1500
ttcatcagcc tcttcatcta catggtgctc agcctcttca tcgcgctcat caccggcgcc 1560
tacgacacca tcaagcatcc cggcggcgca ggcgcagagg agagcgagct gcaggcctac 1620
atcgcacagt gccaggacag ccccacctcc ggcaagttcc gccgcgggag cggctcggcc 1680
tgcagccttc tctgctgctg cggaagggac ccctcggagg agcattcgct gctggtgaat 1740
tga 1743
<210> 101
<211> 580
<212> PRT
<213> Homo sapiens
<400> 101
Met Thr Ala Pro Ala Gly Pro Arg Gly Ser Glu Thr Glu Arg Leu Leu
1 5 10 15
Thr Pro Asn Pro Gly Tyr Gly Thr Gln Ala Gly Pro Ser Pro Ala Pro
20 25 30
Pro Thr Pro Pro Glu Glu Glu Asp Leu Arg Arg Arg Leu Lys Tyr Phe
35 40 45
Phe Met Ser Pro Cys Asp Lys Phe Arg Ala Lys Gly Arg Lys Pro Cys
50 55 60
Lys Leu Met Leu Gln Val Val Lys Ile Leu Val Val Thr Val Gln Leu
65 70 75 80
Ile Leu Phe Gly Leu Ser Asn Gln Leu Ala Val Thr Phe Arg Glu Glu
85 90 95
Asn Thr Ile Ala Phe Arg His Leu Phe Leu Leu Gly Tyr Ser Asp Gly
100 105 110
Ala Asp Asp Thr Phe Ala Ala Tyr Thr Arg Glu Gln Leu Tyr Gln Ala
115 120 125
Ile Phe His Ala Val Asp Gln Tyr Leu Ala Leu Pro Asp Val Ser Leu
130 135 140
Gly Arg Tyr Ala Tyr Val Arg Gly Gly Gly Asp Pro Trp Thr Asn Gly
145 150 155 160
Ser Gly Leu Ala Leu Cys Gln Arg Tyr Tyr His Arg Gly His Val Asp
165 170 175
Pro Ala Asn Asp Thr Phe Asp Ile Asp Pro Met Val Val Thr Asp Cys
180 185 190
Ile Gln Val Asp Pro Pro Glu Arg Pro Pro Pro Pro Pro Ser Asp Asp
195 200 205
Leu Thr Leu Leu Glu Ser Ser Ser Ser Tyr Lys Asn Leu Thr Leu Lys
210 215 220
Phe His Lys Leu Val Asn Val Thr Ile His Phe Arg Leu Lys Thr Ile
225 230 235 240
Asn Leu Gln Ser Leu Ile Asn Asn Glu Ile Pro Asp Cys Tyr Thr Phe
245 250 255
Ser Val Leu Ile Thr Phe Asp Asn Lys Ala His Ser Gly Arg Ile Pro
260 265 270
Ile Ser Leu Glu Thr Gln Ala His Ile Gln Glu Cys Lys His Pro Ser
275 280 285
Val Phe Gln His Gly Asp Asn Ser Phe Arg Leu Leu Phe Asp Val Val
290 295 300
Val Ile Leu Thr Cys Ser Leu Ser Phe Leu Leu Cys Ala Arg Ser Leu
305 310 315 320
Leu Arg Gly Phe Leu Leu Gln Asn Glu Phe Val Gly Phe Met Trp Arg
325 330 335
Gln Arg Gly Arg Val Ile Ser Leu Trp Glu Arg Leu Glu Phe Val Asn
340 345 350
Gly Trp Tyr Ile Leu Leu Val Thr Ser Asp Val Leu Thr Ile Ser Gly
355 360 365
Thr Ile Met Lys Ile Gly Ile Glu Ala Lys Asn Leu Ala Ser Tyr Asp
370 375 380
Val Cys Ser Ile Leu Leu Gly Thr Ser Thr Leu Leu Val Trp Val Gly
385 390 395 400
Val Ile Arg Tyr Leu Thr Phe Phe His Asn Tyr Asn Ile Leu Ile Ala
405 410 415
Thr Leu Arg Val Ala Leu Pro Ser Val Met Arg Phe Cys Cys Cys Val
420 425 430
Ala Val Ile Tyr Leu Gly Tyr Cys Phe Cys Gly Trp Ile Val Leu Gly
435 440 445
Pro Tyr His Val Lys Phe Arg Ser Leu Ser Met Val Ser Glu Cys Leu
450 455 460
Phe Ser Leu Ile Asn Gly Asp Asp Met Phe Val Thr Phe Ala Ala Met
465 470 475 480
Gln Ala Gln Gln Gly Arg Ser Ser Leu Val Trp Leu Phe Ser Gln Leu
485 490 495
Tyr Leu Tyr Ser Phe Ile Ser Leu Phe Ile Tyr Met Val Leu Ser Leu
500 505 510
Phe Ile Ala Leu Ile Thr Gly Ala Tyr Asp Thr Ile Lys His Pro Gly
515 520 525
Gly Ala Gly Ala Glu Glu Ser Glu Leu Gln Ala Tyr Ile Ala Gln Cys
530 535 540
Gln Asp Ser Pro Thr Ser Gly Lys Phe Arg Arg Gly Ser Gly Ser Ala
545 550 555 560
Cys Ser Leu Leu Cys Cys Cys Gly Arg Asp Pro Ser Glu Glu His Ser
565 570 575
Leu Leu Val Asn
580
<210> 102
<211> 1233
<212> DNA
<213> Homo sapiens
<400> 102
atggtgtgct tccgcctctt cccggttccg ggctcagggc tcgttctggt ctgcctagtc 60
ctgggagctg tgcggtctta tgcattggaa cttaatttga cagattcaga aaatgccact 120
tgcctttatg caaaatggca gatgaatttc acagtacgct atgaaactac aaataaaact 180
tataaaactg taaccatttc agaccatggc actgtgacat ataatggaag catttgtggg 240
gatgatcaga atggtcccaa aatagcagtg cagttcggac ctggcttttc ctggattgcg 300
aattttacca aggcagcatc tacttattca attgacagcg tctcattttc ctacaacact 360
ggtgataaca caacatttcc tgatgctgaa gataaaggaa ttcttactgt tgatgaactt 420
ttggccatca gaattccatt gaatgacctt tttagatgca atagtttatc aactttggaa 480
aagaatgatg ttgtccaaca ctactgggat gttcttgtac aagcttttgt ccaaaatggc 540
acagtgagca caaatgagtt cctgtgtgat aaagacaaaa cttcaacagt ggcacccacc 600
atacacacca ctgtgccatc tcctactaca acacctactc caaaggaaaa accagaagct 660
ggaacctatt cagttaataa tggcaatgat acttgtctgc tggctaccat ggggctgcag 720
ctgaacatca ctcaggataa ggttgcttca gttattaaca tcaaccccaa tacaactcac 780
tccacaggca gctgccgttc tcacactgct ctacttagac tcaatagcag caccattaag 840
tatctagact ttgtctttgc tgtgaaaaat gaaaaccgat tttatctgaa ggaagtgaac 900
atcagcatgt atttggttaa tggctccgtt ttcagcattg caaataacaa tctcagctac 960
tgggatgccc ccctgggaag ttcttatatg tgcaacaaag agcagactgt ttcagtgtct 1020
ggagcatttc agataaatac ctttgatcta agggttcagc ctttcaatgt gacacaagga 1080
aagtattcta cagctcaaga ctgcagtgca gatgacgaca acttccttgt gcccatagcg 1140
gtgggagctg ccttggcagg agtacttatt ctagtgttgc tggcttattt tattggtctc 1200
aagcaccatc atgctggata tgagcaattt tag 1233
<210> 103
<211> 410
<212> PRT
<213> Homo sapiens
<400> 103
Met Val Cys Phe Arg Leu Phe Pro Val Pro Gly Ser Gly Leu Val Leu
1 5 10 15
Val Cys Leu Val Leu Gly Ala Val Arg Ser Tyr Ala Leu Glu Leu Asn
20 25 30
Leu Thr Asp Ser Glu Asn Ala Thr Cys Leu Tyr Ala Lys Trp Gln Met
35 40 45
Asn Phe Thr Val Arg Tyr Glu Thr Thr Asn Lys Thr Tyr Lys Thr Val
50 55 60
Thr Ile Ser Asp His Gly Thr Val Thr Tyr Asn Gly Ser Ile Cys Gly
65 70 75 80
Asp Asp Gln Asn Gly Pro Lys Ile Ala Val Gln Phe Gly Pro Gly Phe
85 90 95
Ser Trp Ile Ala Asn Phe Thr Lys Ala Ala Ser Thr Tyr Ser Ile Asp
100 105 110
Ser Val Ser Phe Ser Tyr Asn Thr Gly Asp Asn Thr Thr Phe Pro Asp
115 120 125
Ala Glu Asp Lys Gly Ile Leu Thr Val Asp Glu Leu Leu Ala Ile Arg
130 135 140
Ile Pro Leu Asn Asp Leu Phe Arg Cys Asn Ser Leu Ser Thr Leu Glu
145 150 155 160
Lys Asn Asp Val Val Gln His Tyr Trp Asp Val Leu Val Gln Ala Phe
165 170 175
Val Gln Asn Gly Thr Val Ser Thr Asn Glu Phe Leu Cys Asp Lys Asp
180 185 190
Lys Thr Ser Thr Val Ala Pro Thr Ile His Thr Thr Val Pro Ser Pro
195 200 205
Thr Thr Thr Pro Thr Pro Lys Glu Lys Pro Glu Ala Gly Thr Tyr Ser
210 215 220
Val Asn Asn Gly Asn Asp Thr Cys Leu Leu Ala Thr Met Gly Leu Gln
225 230 235 240
Leu Asn Ile Thr Gln Asp Lys Val Ala Ser Val Ile Asn Ile Asn Pro
245 250 255
Asn Thr Thr His Ser Thr Gly Ser Cys Arg Ser His Thr Ala Leu Leu
260 265 270
Arg Leu Asn Ser Ser Thr Ile Lys Tyr Leu Asp Phe Val Phe Ala Val
275 280 285
Lys Asn Glu Asn Arg Phe Tyr Leu Lys Glu Val Asn Ile Ser Met Tyr
290 295 300
Leu Val Asn Gly Ser Val Phe Ser Ile Ala Asn Asn Asn Leu Ser Tyr
305 310 315 320
Trp Asp Ala Pro Leu Gly Ser Ser Tyr Met Cys Asn Lys Glu Gln Thr
325 330 335
Val Ser Val Ser Gly Ala Phe Gln Ile Asn Thr Phe Asp Leu Arg Val
340 345 350
Gln Pro Phe Asn Val Thr Gln Gly Lys Tyr Ser Thr Ala Gln Asp Cys
355 360 365
Ser Ala Asp Asp Asp Asn Phe Leu Val Pro Ile Ala Val Gly Ala Ala
370 375 380
Leu Ala Gly Val Leu Ile Leu Val Leu Leu Ala Tyr Phe Ile Gly Leu
385 390 395 400
Lys His His His Ala Gly Tyr Glu Gln Phe
405 410
<210> 104
<211> 3837
<212> DNA
<213> Homo sapiens
<400> 104
atgaccgctc gcggcctggc ccttggcctc ctcctgctgc tactgtgtcc agcgcaggtg 60
ttttcacagt cctgtgtttg gtatggagag tgtggaattg catatgggga caagaggtac 120
aattgcgaat attctggccc accaaaacca ttgccaaagg atggatatga cttagtgcag 180
gaactctgtc caggattctt ctttggcaat gtcagtctct gttgtgatgt tcggcagctt 240
cagacactaa aagacaacct gcagctgcct ctacagtttc tgtccagatg tccatcctgt 300
ttttataacc tactgaacct gttttgtgag ctgacatgta gccctcgaca gagtcagttt 360
ttgaatgtta cagctactga agattatgtt gatcctgtta caaaccagac gaaaacaaat 420
gtgaaagagt tacaatacta cgtcggacag agttttgcca atgcaatgta caatgcctgc 480
cgggatgtgg aggccccctc aagtaatgac aaggccctgg gactcctgtg tgggaaggac 540
gctgacgcct gtaatgccac caactggatt gaatacatgt tcaataagga caatggacag 600
gcacctttta ccatcactcc tgtgttttca gattttccag tccatgggat ggagcccatg 660
aacaatgcca ccaaaggctg tgacgagtct gtggatgagg tcacagcacc atgtagctgc 720
caagactgct ctattgtctg tggccccaag ccccagcccc cacctcctcc tgctccctgg 780
acgatccttg gcttggacgc catgtatgtc atcatgtgga tcacctacat ggcgtttttg 840
cttgtgtttt ttggagcatt ttttgcagtg tggtgctaca gaaaacggta ttttgtctcc 900
gagtacactc ccatcgatag caatatagct ttttctgtta atgcaagtga caaaggagag 960
gcgtcctgct gtgaccctgt cagcgcagca tttgagggct gcttgaggcg gctgttcaca 1020
cgctgggggt ctttctgcgt ccgaaaccct ggctgtgtca ttttcttctc gctggtcttc 1080
attactgcgt gttcgtcagg cctggtgttt gtccgggtca caaccaatcc agttgacctc 1140
tggtcagccc ccagcagcca ggctcgcctg gaaaaagagt actttgacca gcactttggg 1200
cctttcttcc ggacggagca gctcatcatc cgggcccctc tcactgacaa acacatttac 1260
cagccatacc cttcgggagc tgatgtaccc tttggacctc cgcttgacat acagatactg 1320
caccaggttc ttgacttaca aatagccatc gaaaacatta ctgcctctta tgacaatgag 1380
actgtgacac ttcaagacat ctgcttggcc cctctttcac cgtataacac gaactgcacc 1440
attttgagtg tgttaaatta cttccagaac agccattccg tgctggacca caagaaaggg 1500
gacgacttct ttgtgtatgc cgattaccac acgcactttc tgtactgcgt acgggctcct 1560
gcctctctga atgatacaag tttgctccat gacccttgtc tgggtacgtt tggtggacca 1620
gtgttcccgt ggcttgtgtt gggaggctat gatgatcaaa actacaataa cgccactgcc 1680
cttgtgatta ccttccctgt caataattac tataatgata cagagaagct ccagagggcc 1740
caggcctggg aaaaagagtt tattaatttt gtgaaaaact acaagaatcc caatctgacc 1800
atttccttca ctgctgaacg aagtattgaa gatgaactaa atcgtgaaag tgacagtgat 1860
gtcttcaccg ttgtaattag ctatgccatc atgtttctat atatttccct agccttgggg 1920
cacatgaaaa gctgtcgcag gcttctggtg gattcgaagg tctcactagg catcgcgggc 1980
atcttgatcg tgctgagctc ggtggcttgc tccttgggtg tcttcagcta cattgggttg 2040
cccttgaccc tcattgtgat tgaagtcatc ccgttcctgg tgctggctgt tggagtggac 2100
aacatcttca ttctggtgca ggcctaccag agagatgaac gtcttcaagg ggaaaccctg 2160
gatcagcagc tgggcagggt cctaggagaa gtggctccca gtatgttcct gtcatccttt 2220
tctgagactg tagcattttt cttaggagca ttgtccgtga tgccagccgt gcacaccttc 2280
tctctctttg cgggattggc agtcttcatt gactttcttc tgcagattac ctgtttcgtg 2340
agtctcttgg ggttagacat taaacgtcaa gagaaaaatc ggctagacat cttttgctgt 2400
gtcagaggtg ctgaagatgg aacaagcgtc caggcctcag agagctgttt gtttcgcttc 2460
ttcaaaaact cctattctcc acttctgcta aaggactgga tgagaccaat tgtgatagca 2520
atatttgtgg gtgttctgtc attcagcatc gcagtcctga acaaagtaga tattggattg 2580
gatcagtctc tttcgatgcc agatgactcc tacatggtgg attatttcaa atccatcagt 2640
cagtacctgc atgcgggtcc gcctgtgtac tttgtcctgg aggaagggca cgactacact 2700
tcttccaagg ggcagaacat ggtgtgcggc ggcatgggct gcaacaatga ttccctggtg 2760
cagcagatat ttaacgcggc gcagctggac aactataccc gaataggctt cgccccctcg 2820
tcctggatcg acgattattt cgactgggtg aagccacagt cgtcttgctg tcgagtggac 2880
aatatcactg accagttctg caatgcttca gtggttgacc ctgcctgcgt tcgctgcagg 2940
cctctgactc cggaaggcaa acagaggcct caggggggag acttcatgag attcctgccc 3000
atgttccttt cggataaccc taaccccaag tgtggcaaag ggggacatgc tgcctatagt 3060
tctgcagtta acatcctcct tggccatggc accagggtcg gagccacgta cttcatgacc 3120
taccacaccg tgctgcagac ctctgctgac tttattgacg ctctgaagaa agcccgactt 3180
atagccagta atgtcaccga aaccatgggc attaacggca gtgcctaccg agtatttcct 3240
tacagtgtgt tttatgtctt ctacgaacag tacctgacca tcattgacga cactatcttc 3300
aacctcggtg tgtccctggg cgcgatattt ctggtgacca tggtcctcct gggctgtgag 3360
ctctggtctg cagtcatcat gtgtgccacc atcgccatgg tcttggtcaa catgtttgga 3420
gttatgtggc tctggggcat cagtctgaac gctgtatcct tggtcaacct ggtgatgagc 3480
tgtggcatct ccgtggagtt ctgcagccac ataaccagag cgttcacggt gagcatgaaa 3540
ggcagccgcg tggagcgcgc ggaagaggca cttgcccaca tgggcagctc cgtgttcagt 3600
ggaatcacac ttacaaaatt tggagggatt gtggtgttgg cttttgccaa atctcaaatt 3660
ttccagatat tctacttcag gatgtatttg gccatggtct tactgggagc cactcacgga 3720
ttaatatttc tccctgtctt actcagttac atagggccat cagtaaataa agccaaaagt 3780
tgtgccactg aagagcgata caaaggaaca gagcgcgaac ggcttctaaa tttctag 3837
<210> 105
<211> 1278
<212> PRT
<213> Homo sapiens
<400> 105
Met Thr Ala Arg Gly Leu Ala Leu Gly Leu Leu Leu Leu Leu Leu Cys
1 5 10 15
Pro Ala Gln Val Phe Ser Gln Ser Cys Val Trp Tyr Gly Glu Cys Gly
20 25 30
Ile Ala Tyr Gly Asp Lys Arg Tyr Asn Cys Glu Tyr Ser Gly Pro Pro
35 40 45
Lys Pro Leu Pro Lys Asp Gly Tyr Asp Leu Val Gln Glu Leu Cys Pro
50 55 60
Gly Phe Phe Phe Gly Asn Val Ser Leu Cys Cys Asp Val Arg Gln Leu
65 70 75 80
Gln Thr Leu Lys Asp Asn Leu Gln Leu Pro Leu Gln Phe Leu Ser Arg
85 90 95
Cys Pro Ser Cys Phe Tyr Asn Leu Leu Asn Leu Phe Cys Glu Leu Thr
100 105 110
Cys Ser Pro Arg Gln Ser Gln Phe Leu Asn Val Thr Ala Thr Glu Asp
115 120 125
Tyr Val Asp Pro Val Thr Asn Gln Thr Lys Thr Asn Val Lys Glu Leu
130 135 140
Gln Tyr Tyr Val Gly Gln Ser Phe Ala Asn Ala Met Tyr Asn Ala Cys
145 150 155 160
Arg Asp Val Glu Ala Pro Ser Ser Asn Asp Lys Ala Leu Gly Leu Leu
165 170 175
Cys Gly Lys Asp Ala Asp Ala Cys Asn Ala Thr Asn Trp Ile Glu Tyr
180 185 190
Met Phe Asn Lys Asp Asn Gly Gln Ala Pro Phe Thr Ile Thr Pro Val
195 200 205
Phe Ser Asp Phe Pro Val His Gly Met Glu Pro Met Asn Asn Ala Thr
210 215 220
Lys Gly Cys Asp Glu Ser Val Asp Glu Val Thr Ala Pro Cys Ser Cys
225 230 235 240
Gln Asp Cys Ser Ile Val Cys Gly Pro Lys Pro Gln Pro Pro Pro Pro
245 250 255
Pro Ala Pro Trp Thr Ile Leu Gly Leu Asp Ala Met Tyr Val Ile Met
260 265 270
Trp Ile Thr Tyr Met Ala Phe Leu Leu Val Phe Phe Gly Ala Phe Phe
275 280 285
Ala Val Trp Cys Tyr Arg Lys Arg Tyr Phe Val Ser Glu Tyr Thr Pro
290 295 300
Ile Asp Ser Asn Ile Ala Phe Ser Val Asn Ala Ser Asp Lys Gly Glu
305 310 315 320
Ala Ser Cys Cys Asp Pro Val Ser Ala Ala Phe Glu Gly Cys Leu Arg
325 330 335
Arg Leu Phe Thr Arg Trp Gly Ser Phe Cys Val Arg Asn Pro Gly Cys
340 345 350
Val Ile Phe Phe Ser Leu Val Phe Ile Thr Ala Cys Ser Ser Gly Leu
355 360 365
Val Phe Val Arg Val Thr Thr Asn Pro Val Asp Leu Trp Ser Ala Pro
370 375 380
Ser Ser Gln Ala Arg Leu Glu Lys Glu Tyr Phe Asp Gln His Phe Gly
385 390 395 400
Pro Phe Phe Arg Thr Glu Gln Leu Ile Ile Arg Ala Pro Leu Thr Asp
405 410 415
Lys His Ile Tyr Gln Pro Tyr Pro Ser Gly Ala Asp Val Pro Phe Gly
420 425 430
Pro Pro Leu Asp Ile Gln Ile Leu His Gln Val Leu Asp Leu Gln Ile
435 440 445
Ala Ile Glu Asn Ile Thr Ala Ser Tyr Asp Asn Glu Thr Val Thr Leu
450 455 460
Gln Asp Ile Cys Leu Ala Pro Leu Ser Pro Tyr Asn Thr Asn Cys Thr
465 470 475 480
Ile Leu Ser Val Leu Asn Tyr Phe Gln Asn Ser His Ser Val Leu Asp
485 490 495
His Lys Lys Gly Asp Asp Phe Phe Val Tyr Ala Asp Tyr His Thr His
500 505 510
Phe Leu Tyr Cys Val Arg Ala Pro Ala Ser Leu Asn Asp Thr Ser Leu
515 520 525
Leu His Asp Pro Cys Leu Gly Thr Phe Gly Gly Pro Val Phe Pro Trp
530 535 540
Leu Val Leu Gly Gly Tyr Asp Asp Gln Asn Tyr Asn Asn Ala Thr Ala
545 550 555 560
Leu Val Ile Thr Phe Pro Val Asn Asn Tyr Tyr Asn Asp Thr Glu Lys
565 570 575
Leu Gln Arg Ala Gln Ala Trp Glu Lys Glu Phe Ile Asn Phe Val Lys
580 585 590
Asn Tyr Lys Asn Pro Asn Leu Thr Ile Ser Phe Thr Ala Glu Arg Ser
595 600 605
Ile Glu Asp Glu Leu Asn Arg Glu Ser Asp Ser Asp Val Phe Thr Val
610 615 620
Val Ile Ser Tyr Ala Ile Met Phe Leu Tyr Ile Ser Leu Ala Leu Gly
625 630 635 640
His Met Lys Ser Cys Arg Arg Leu Leu Val Asp Ser Lys Val Ser Leu
645 650 655
Gly Ile Ala Gly Ile Leu Ile Val Leu Ser Ser Val Ala Cys Ser Leu
660 665 670
Gly Val Phe Ser Tyr Ile Gly Leu Pro Leu Thr Leu Ile Val Ile Glu
675 680 685
Val Ile Pro Phe Leu Val Leu Ala Val Gly Val Asp Asn Ile Phe Ile
690 695 700
Leu Val Gln Ala Tyr Gln Arg Asp Glu Arg Leu Gln Gly Glu Thr Leu
705 710 715 720
Asp Gln Gln Leu Gly Arg Val Leu Gly Glu Val Ala Pro Ser Met Phe
725 730 735
Leu Ser Ser Phe Ser Glu Thr Val Ala Phe Phe Leu Gly Ala Leu Ser
740 745 750
Val Met Pro Ala Val His Thr Phe Ser Leu Phe Ala Gly Leu Ala Val
755 760 765
Phe Ile Asp Phe Leu Leu Gln Ile Thr Cys Phe Val Ser Leu Leu Gly
770 775 780
Leu Asp Ile Lys Arg Gln Glu Lys Asn Arg Leu Asp Ile Phe Cys Cys
785 790 795 800
Val Arg Gly Ala Glu Asp Gly Thr Ser Val Gln Ala Ser Glu Ser Cys
805 810 815
Leu Phe Arg Phe Phe Lys Asn Ser Tyr Ser Pro Leu Leu Leu Lys Asp
820 825 830
Trp Met Arg Pro Ile Val Ile Ala Ile Phe Val Gly Val Leu Ser Phe
835 840 845
Ser Ile Ala Val Leu Asn Lys Val Asp Ile Gly Leu Asp Gln Ser Leu
850 855 860
Ser Met Pro Asp Asp Ser Tyr Met Val Asp Tyr Phe Lys Ser Ile Ser
865 870 875 880
Gln Tyr Leu His Ala Gly Pro Pro Val Tyr Phe Val Leu Glu Glu Gly
885 890 895
His Asp Tyr Thr Ser Ser Lys Gly Gln Asn Met Val Cys Gly Gly Met
900 905 910
Gly Cys Asn Asn Asp Ser Leu Val Gln Gln Ile Phe Asn Ala Ala Gln
915 920 925
Leu Asp Asn Tyr Thr Arg Ile Gly Phe Ala Pro Ser Ser Trp Ile Asp
930 935 940
Asp Tyr Phe Asp Trp Val Lys Pro Gln Ser Ser Cys Cys Arg Val Asp
945 950 955 960
Asn Ile Thr Asp Gln Phe Cys Asn Ala Ser Val Val Asp Pro Ala Cys
965 970 975
Val Arg Cys Arg Pro Leu Thr Pro Glu Gly Lys Gln Arg Pro Gln Gly
980 985 990
Gly Asp Phe Met Arg Phe Leu Pro Met Phe Leu Ser Asp Asn Pro Asn
995 1000 1005
Pro Lys Cys Gly Lys Gly Gly His Ala Ala Tyr Ser Ser Ala Val
1010 1015 1020
Asn Ile Leu Leu Gly His Gly Thr Arg Val Gly Ala Thr Tyr Phe
1025 1030 1035
Met Thr Tyr His Thr Val Leu Gln Thr Ser Ala Asp Phe Ile Asp
1040 1045 1050
Ala Leu Lys Lys Ala Arg Leu Ile Ala Ser Asn Val Thr Glu Thr
1055 1060 1065
Met Gly Ile Asn Gly Ser Ala Tyr Arg Val Phe Pro Tyr Ser Val
1070 1075 1080
Phe Tyr Val Phe Tyr Glu Gln Tyr Leu Thr Ile Ile Asp Asp Thr
1085 1090 1095
Ile Phe Asn Leu Gly Val Ser Leu Gly Ala Ile Phe Leu Val Thr
1100 1105 1110
Met Val Leu Leu Gly Cys Glu Leu Trp Ser Ala Val Ile Met Cys
1115 1120 1125
Ala Thr Ile Ala Met Val Leu Val Asn Met Phe Gly Val Met Trp
1130 1135 1140
Leu Trp Gly Ile Ser Leu Asn Ala Val Ser Leu Val Asn Leu Val
1145 1150 1155
Met Ser Cys Gly Ile Ser Val Glu Phe Cys Ser His Ile Thr Arg
1160 1165 1170
Ala Phe Thr Val Ser Met Lys Gly Ser Arg Val Glu Arg Ala Glu
1175 1180 1185
Glu Ala Leu Ala His Met Gly Ser Ser Val Phe Ser Gly Ile Thr
1190 1195 1200
Leu Thr Lys Phe Gly Gly Ile Val Val Leu Ala Phe Ala Lys Ser
1205 1210 1215
Gln Ile Phe Gln Ile Phe Tyr Phe Arg Met Tyr Leu Ala Met Val
1220 1225 1230
Leu Leu Gly Ala Thr His Gly Leu Ile Phe Leu Pro Val Leu Leu
1235 1240 1245
Ser Tyr Ile Gly Pro Ser Val Asn Lys Ala Lys Ser Cys Ala Thr
1250 1255 1260
Glu Glu Arg Tyr Lys Gly Thr Glu Arg Glu Arg Leu Leu Asn Phe
1265 1270 1275
<210> 106
<211> 456
<212> DNA
<213> Homo sapiens
<400> 106
atgcgtttcc tggcagctac attcctgctc ctggcgctca gcaccgctgc ccaggccgaa 60
ccggtgcagt tcaaggactg cggttctgtg gatggagtta taaaggaagt gaatgtgagc 120
ccatgcccca cccaaccctg ccagctgagc aaaggacagt cttacagcgt caatgtcacc 180
ttcaccagca atattcagtc taaaagcagc aaggccgtgg tgcatggcat cctgatgggc 240
gtcccagttc cctttcccat tcctgagcct gatggttgta agagtggaat taactgccct 300
atccaaaaag acaagaccta tagctacctg aataaactac cagtgaaaag cgaatatccc 360
tctataaaac tggtggtgga gtggcaactt caggatgaca aaaaccaaag tctcttctgc 420
tgggaaatcc cagtacagat cgtttctcat ctctaa 456
<210> 107
<211> 151
<212> PRT
<213> Homo sapiens
<400> 107
Met Arg Phe Leu Ala Ala Thr Phe Leu Leu Leu Ala Leu Ser Thr Ala
1 5 10 15
Ala Gln Ala Glu Pro Val Gln Phe Lys Asp Cys Gly Ser Val Asp Gly
20 25 30
Val Ile Lys Glu Val Asn Val Ser Pro Cys Pro Thr Gln Pro Cys Gln
35 40 45
Leu Ser Lys Gly Gln Ser Tyr Ser Val Asn Val Thr Phe Thr Ser Asn
50 55 60
Ile Gln Ser Lys Ser Ser Lys Ala Val Val His Gly Ile Leu Met Gly
65 70 75 80
Val Pro Val Pro Phe Pro Ile Pro Glu Pro Asp Gly Cys Lys Ser Gly
85 90 95
Ile Asn Cys Pro Ile Gln Lys Asp Lys Thr Tyr Ser Tyr Leu Asn Lys
100 105 110
Leu Pro Val Lys Ser Glu Tyr Pro Ser Ile Lys Leu Val Val Glu Trp
115 120 125
Gln Leu Gln Asp Asp Lys Asn Gln Ser Leu Phe Cys Trp Glu Ile Pro
130 135 140
Val Gln Ile Val Ser His Leu
145 150
<210> 108
<211> 1317
<212> DNA
<213> Homo sapiens
<400> 108
atgggaggct gtgcaggctc gcggcggcgc ttttcggatt ccgaggggga ggagaccgtc 60
ccggagcccc ggctccctct gttggaccat cagggcgcgc attggaagaa cgcggtgggc 120
ttctggctgc tgggcctttg caacaacttc tcttatgtgg tgatgctgag tgccgcccac 180
gacatcctta gccacaagag gacatcggga aaccagagcc atgtggaccc aggcccaacg 240
ccgatccccc acaacagctc atcacgattt gactgcaact ctgtctctac ggctgctgtg 300
ctcctggcgg acatcctccc cacactcgtc atcaaattgt tggctcctct tggccttcac 360
ctgctgccct acagcccccg ggttctcgtc agtgggattt gtgctgctgg aagcttcgtc 420
ctggttgcct tttctcattc tgtggggacc agcctgtgtg gtgtggtctt cgctagcatc 480
tcatcaggcc ttggggaggt caccttcctc tccctcactg ccttctaccc cagggccgtg 540
atctcctggt ggtcctcagg gactggggga gctgggctgc tgggggccct gtcctacctg 600
ggcctcaccc aggccggcct ctcccctcag cagaccctgc tgtccatgct gggtatccct 660
gccctgctgc tggccagcta tttcttgttg ctcacatctc ctgaggccca ggaccctgga 720
ggggaagaag aagcagagag cgcagcccgg cagcccctca taagaaccga ggccccggag 780
tcgaagccag gctccagctc cagcctctcc cttcgggaaa ggtggacagt gttcaagggt 840
ctgctgtggt acattgttcc cttggtcgta gtttactttg ccgagtattt cattaaccag 900
ggactttttg aactcctctt tttctggaac acttccctga gtcacgctca gcaataccgc 960
tggtaccaga tgctgtacca ggctggcgtc tttgcctccc gctcttctct ccgctgctgt 1020
cgcatccgtt tcacctgggc cctggccctg ctgcagtgcc tcaacctggt gttcctgctg 1080
gcagacgtgt ggttcggctt tctgccaagc atctacctcg tcttcctgat cattctgtat 1140
gaggggctcc tgggaggcgc agcctacgtg aacaccttcc acaacatcgc cctggagacc 1200
agtgatgagc accgggagtt tgcaatggcg gccacctgca tctctgacac actggggatc 1260
tccctgtcgg ggctcctggc tttgcctctg catgacttcc tctgccagct ctcctga 1317
<210> 109
<211> 438
<212> PRT
<213> Homo sapiens
<400> 109
Met Gly Gly Cys Ala Gly Ser Arg Arg Arg Phe Ser Asp Ser Glu Gly
1 5 10 15
Glu Glu Thr Val Pro Glu Pro Arg Leu Pro Leu Leu Asp His Gln Gly
20 25 30
Ala His Trp Lys Asn Ala Val Gly Phe Trp Leu Leu Gly Leu Cys Asn
35 40 45
Asn Phe Ser Tyr Val Val Met Leu Ser Ala Ala His Asp Ile Leu Ser
50 55 60
His Lys Arg Thr Ser Gly Asn Gln Ser His Val Asp Pro Gly Pro Thr
65 70 75 80
Pro Ile Pro His Asn Ser Ser Ser Arg Phe Asp Cys Asn Ser Val Ser
85 90 95
Thr Ala Ala Val Leu Leu Ala Asp Ile Leu Pro Thr Leu Val Ile Lys
100 105 110
Leu Leu Ala Pro Leu Gly Leu His Leu Leu Pro Tyr Ser Pro Arg Val
115 120 125
Leu Val Ser Gly Ile Cys Ala Ala Gly Ser Phe Val Leu Val Ala Phe
130 135 140
Ser His Ser Val Gly Thr Ser Leu Cys Gly Val Val Phe Ala Ser Ile
145 150 155 160
Ser Ser Gly Leu Gly Glu Val Thr Phe Leu Ser Leu Thr Ala Phe Tyr
165 170 175
Pro Arg Ala Val Ile Ser Trp Trp Ser Ser Gly Thr Gly Gly Ala Gly
180 185 190
Leu Leu Gly Ala Leu Ser Tyr Leu Gly Leu Thr Gln Ala Gly Leu Ser
195 200 205
Pro Gln Gln Thr Leu Leu Ser Met Leu Gly Ile Pro Ala Leu Leu Leu
210 215 220
Ala Ser Tyr Phe Leu Leu Leu Thr Ser Pro Glu Ala Gln Asp Pro Gly
225 230 235 240
Gly Glu Glu Glu Ala Glu Ser Ala Ala Arg Gln Pro Leu Ile Arg Thr
245 250 255
Glu Ala Pro Glu Ser Lys Pro Gly Ser Ser Ser Ser Leu Ser Leu Arg
260 265 270
Glu Arg Trp Thr Val Phe Lys Gly Leu Leu Trp Tyr Ile Val Pro Leu
275 280 285
Val Val Val Tyr Phe Ala Glu Tyr Phe Ile Asn Gln Gly Leu Phe Glu
290 295 300
Leu Leu Phe Phe Trp Asn Thr Ser Leu Ser His Ala Gln Gln Tyr Arg
305 310 315 320
Trp Tyr Gln Met Leu Tyr Gln Ala Gly Val Phe Ala Ser Arg Ser Ser
325 330 335
Leu Arg Cys Cys Arg Ile Arg Phe Thr Trp Ala Leu Ala Leu Leu Gln
340 345 350
Cys Leu Asn Leu Val Phe Leu Leu Ala Asp Val Trp Phe Gly Phe Leu
355 360 365
Pro Ser Ile Tyr Leu Val Phe Leu Ile Ile Leu Tyr Glu Gly Leu Leu
370 375 380
Gly Gly Ala Ala Tyr Val Asn Thr Phe His Asn Ile Ala Leu Glu Thr
385 390 395 400
Ser Asp Glu His Arg Glu Phe Ala Met Ala Ala Thr Cys Ile Ser Asp
405 410 415
Thr Leu Gly Ile Ser Leu Ser Gly Leu Leu Ala Leu Pro Leu His Asp
420 425 430
Phe Leu Cys Gln Leu Ser
435
<210> 110
<211> 866
<212> DNA
<213> Homo sapiens
<400> 110
tcctgcaggc caggcatggc tctgtgagcg ctgatgaggc tgcccgcacg gctcccttcc 60
acctcgacct ctggttctac ttcacactgc agaactgggt tctggacttt gggcgtccca 120
ttgccatgct ggtattccct ctcgagtggt ttccactcaa caagcccagt gttggggact 180
acttccacat ggcctacaac gtcatcacgc cctttctctt gctcaagctc atcgagcggt 240
ccccccgcac cctgccacgc tccatcacgt acgtgagcat catcatcttc atcatgggtg 300
ccagcatcca cctggtgggt gactctgtca accaccgcct gctcttcagt ggctaccagc 360
accacctgtc tgtccgtgag aaccccatca tcaagaatct caagccggag acgctgatcg 420
actcctttga gctgctctac tattatgatg agtacctggg tcactgcatg tggtacatcc 480
ccttcttcct catcctcttc atgtacttca gcggctgctt tactgcctct aaagctgaga 540
gcttgattcc agggcctgcc ctgctcctgg tggcacccag tggcctgtac tactggtacc 600
tggtcaccga gggccagatc ttcatcctct tcatcttcac cttcttcgcc atgctggccc 660
tcgtcctgca ccagaagcgc aagcgcctct tcctggacag caacggcctc ttcctcttct 720
cctccttcgc actgaccctc ttgcttgtgg cgctctgggt cgcctggctg tggaatgacc 780
ctgttctcag gaagaagtac ccgggtgtca tctacgtccc tgagccctgg gctttctaca 840
cccttcacgt cagcagtcgg cactga 866
<210> 111
<211> 311
<212> PRT
<213> Homo sapiens
<400> 111
Met Glu Ala Thr Arg Arg Arg Gln His Leu Gly Ala Thr Gly Gly Pro
1 5 10 15
Gly Ala Gln Leu Gly Ala Ser Phe Leu Gln Ala Arg His Gly Ser Val
20 25 30
Ser Ala Asp Glu Ala Ala Arg Thr Ala Pro Phe His Leu Asp Leu Trp
35 40 45
Phe Tyr Phe Thr Leu Gln Asn Trp Val Leu Asp Phe Gly Arg Pro Ile
50 55 60
Ala Met Leu Val Phe Pro Leu Glu Trp Phe Pro Leu Asn Lys Pro Ser
65 70 75 80
Val Gly Asp Tyr Phe His Met Ala Tyr Asn Val Ile Thr Pro Phe Leu
85 90 95
Leu Leu Lys Leu Ile Glu Arg Ser Pro Arg Thr Leu Pro Arg Ser Ile
100 105 110
Thr Tyr Val Ser Ile Ile Ile Phe Ile Met Gly Ala Ser Ile His Leu
115 120 125
Val Gly Asp Ser Val Asn His Arg Leu Leu Phe Ser Gly Tyr Gln His
130 135 140
His Leu Ser Val Arg Glu Asn Pro Ile Ile Lys Asn Leu Lys Pro Glu
145 150 155 160
Thr Leu Ile Asp Ser Phe Glu Leu Leu Tyr Tyr Tyr Asp Glu Tyr Leu
165 170 175
Gly His Cys Met Trp Tyr Ile Pro Phe Phe Leu Ile Leu Phe Met Tyr
180 185 190
Phe Ser Gly Cys Phe Thr Ala Ser Lys Ala Glu Ser Leu Ile Pro Gly
195 200 205
Pro Ala Leu Leu Leu Val Ala Pro Ser Gly Leu Tyr Tyr Trp Tyr Leu
210 215 220
Val Thr Glu Gly Gln Ile Phe Ile Leu Phe Ile Phe Thr Phe Phe Ala
225 230 235 240
Met Leu Ala Leu Val Leu His Gln Lys Arg Lys Arg Leu Phe Leu Asp
245 250 255
Ser Asn Gly Leu Phe Leu Phe Ser Ser Phe Ala Leu Thr Leu Leu Leu
260 265 270
Val Ala Leu Trp Val Ala Trp Leu Trp Asn Asp Pro Val Leu Arg Lys
275 280 285
Lys Tyr Pro Gly Val Ile Tyr Val Pro Glu Pro Trp Ala Phe Tyr Thr
290 295 300
Leu His Val Ser Ser Arg His
305 310
<210> 112
<211> 861
<212> DNA
<213> Homo sapiens
<400> 112
atgaatcctg cgagcgatgg gggcacatca gagagcattt ttgacctgga ctatgcatcc 60
tgggggatcc gctccacgct gatggtcgct ggctttgtct tctacttggg cgtctttgtg 120
gtctgccacc agctgtcctc ttccctgaat gccacttacc gttctttggt ggccagagag 180
aaggtcttct gggacctggc ggccacgcgt gcagtctttg gtgttcagag cacagccgca 240
ggcctgtggg ctctgctggg ggaccctgtg ctgcatgccg acaaggcgcg tggccagcag 300
aactggtgct ggtttcacat cacgacagca acgggattct tttgctttga aaatgttgca 360
gtccacctgt ccaacttgat cttccggaca tttgacttgt ttctggttat ccaccatctc 420
tttgcctttc ttgggtttct tggctgcttg gtcaatctcc aagctggcca ctatctagct 480
atgaccacgt tgctcctgga gatgagcacg ccctttacct gcgtttcctg gatgctctta 540
aaggcgggct ggtccgagtc tctgttttgg aagctcaacc agtggctgat gattcacatg 600
tttcactgcc gcatggttct aacctaccac atgtggtggg tgtgtttctg gcactgggac 660
ggcctggtca gcagcctgta tctgcctcat ttgacactgt tccttgtcgg actggctctg 720
cttacgctaa tcattaatcc atattggacc cataagaaga ctcagcagct tctcaatccg 780
gtggactgga acttcgcaca gccagaagcc aagagcaggc cagaaggcaa cgggcagctg 840
ctgcggaaga agaggccata g 861
<210> 113
<211> 286
<212> PRT
<213> Homo sapiens
<400> 113
Met Asn Pro Ala Ser Asp Gly Gly Thr Ser Glu Ser Ile Phe Asp Leu
1 5 10 15
Asp Tyr Ala Ser Trp Gly Ile Arg Ser Thr Leu Met Val Ala Gly Phe
20 25 30
Val Phe Tyr Leu Gly Val Phe Val Val Cys His Gln Leu Ser Ser Ser
35 40 45
Leu Asn Ala Thr Tyr Arg Ser Leu Val Ala Arg Glu Lys Val Phe Trp
50 55 60
Asp Leu Ala Ala Thr Arg Ala Val Phe Gly Val Gln Ser Thr Ala Ala
65 70 75 80
Gly Leu Trp Ala Leu Leu Gly Asp Pro Val Leu His Ala Asp Lys Ala
85 90 95
Arg Gly Gln Gln Asn Trp Cys Trp Phe His Ile Thr Thr Ala Thr Gly
100 105 110
Phe Phe Cys Phe Glu Asn Val Ala Val His Leu Ser Asn Leu Ile Phe
115 120 125
Arg Thr Phe Asp Leu Phe Leu Val Ile His His Leu Phe Ala Phe Leu
130 135 140
Gly Phe Leu Gly Cys Leu Val Asn Leu Gln Ala Gly His Tyr Leu Ala
145 150 155 160
Met Thr Thr Leu Leu Leu Glu Met Ser Thr Pro Phe Thr Cys Val Ser
165 170 175
Trp Met Leu Leu Lys Ala Gly Trp Ser Glu Ser Leu Phe Trp Lys Leu
180 185 190
Asn Gln Trp Leu Met Ile His Met Phe His Cys Arg Met Val Leu Thr
195 200 205
Tyr His Met Trp Trp Val Cys Phe Trp His Trp Asp Gly Leu Val Ser
210 215 220
Ser Leu Tyr Leu Pro His Leu Thr Leu Phe Leu Val Gly Leu Ala Leu
225 230 235 240
Leu Thr Leu Ile Ile Asn Pro Tyr Trp Thr His Lys Lys Thr Gln Gln
245 250 255
Leu Leu Asn Pro Val Asp Trp Asn Phe Ala Gln Pro Glu Ala Lys Ser
260 265 270
Arg Pro Glu Gly Asn Gly Gln Leu Leu Arg Lys Lys Arg Pro
275 280 285
<210> 114
<211> 11406
<212> DNA
<213> Homo sapiens
<400> 114
atgagcaccg acagtaactc actggcacgt gaatttctga ccgatgtcaa ccggctttgc 60
aatgcagtgg tccagagggt ggaggccagg gaggaagaag aggaggagac gcacatggca 120
acccttggac agtaccttgt ccatggtcga ggatttctat tacttaccaa gctaaattct 180
ataattgatc aggcattgac atgtagagaa gaactcctga ctcttcttct gtctctcctt 240
ccactggtat ggaagatacc tgtccaagaa gaaaaggcaa cagattttaa cctaccgctc 300
tcagcagata taatcctgac caaagaaaag aactcaagtt cacaaagatc cactcaggaa 360
aaattacatt tagaaggaag tgccctgtct agtcaggttt ctgcaaaagt aaatgttttt 420
cgaaaaagca gacgacagcg taaaattacc catcgctatt ctgtaagaga tgcaagaaag 480
acacagctct ccacctcaga ttcagaagcc aattcagatg aaaaaggcat agcaatgaat 540
aagcatagaa ggccccatct gctgcatcat tttttaacat cgtttcctaa acaagaccac 600
cccaaagcta aacttgaccg cttagcaacc aaagaacaga ctcctccaga tgctatggct 660
ttggaaaatt ccagagagat tattccaaga caggggtcaa acactgacat tttaagtgag 720
ccagctgcct tgtctgttat cagtaacatg aacaattctc catttgactt atgtcatgtt 780
ttgttatctt tattagaaaa agtttgtaag tttgacgtta ccttgaatca taattctcct 840
ttagcagcca gtgtagtgcc cacactaact gaattcctag caggctttgg ggactgctgc 900
agtctgagcg acaacttgga gagtcgagta gtttctgcag gttggaccga agaaccggtg 960
gctttgattc aaaggatgct ctttcgaaca gtgttgcatc ttctgtcagt agatgttagt 1020
actgcagaga tgatgccaga aaatcttagg aaaaatttaa ctgaattgct tagagcagct 1080
ttaaaaatta gaatatgcct agaaaagcag cctgaccctt ttgcaccaag acaaaagaaa 1140
acactgcagg aggttcagga agattttgtg ttttcaaagt atcgtcatag agcccttctt 1200
ttacctgagc ttttggaagg agttcttcag attctgatct gttgtcttca aagtgcagct 1260
tcaaatccct tctacttcag tcaagccatg gatttggttc aagaattcat tcagcatcat 1320
ggatttaatt tatttgaaac agcagttctt caaatggaat ggctggtttt aagagatgga 1380
gttcctcccg aggcctcaga gcatttgaaa gccctaataa atagtgtgat gaaaataatg 1440
agcactgtca aaaaagtgaa atcagagcaa cttcatcatt cgatgtgtac aagaaaaagg 1500
cacagacgat gtgaatattc tcattttatg catcatcacc gagatctctc aggtcttctg 1560
gtttcggctt ttaaaaacca ggtttccaaa aacccatttg aagagactgc agatggagat 1620
gtttattatc ctgagcggtg ctgttgcatt gcagtgtgtg cccatcagtg cttgcgctta 1680
ctacagcagg cttccttgag cagcacttgt gtccagatcc tatcgggtgt tcataacatt 1740
ggaatatgct gttgtatgga tcccaaatct gtaatcattc ctttgctcca tgcttttaaa 1800
ttgccagcac tgaaaaattt tcagcagcat atattgaata tccttaacaa acttattttg 1860
gatcagttag gaggagcaga gatatcacca aaaattaaaa aagcagcttg taatatttgt 1920
actgttgact ctgaccaact agcccaatta gaagagacac tgcagggaaa cttatgtgat 1980
gctgaactct cctcaagttt atccagtcct tcttacagat ttcaagggat cctgcccagc 2040
agtggatctg aagatttgtt gtggaaatgg gatgctttaa aggcttatca gaactttgtt 2100
tttgaagaag acagattaca tagtatacag attgcaaatc acatttgcaa tttaatccag 2160
aaaggcaata tagttgttca gtggaaatta tataattaca tatttaatcc tgtgctccaa 2220
agaggagttg aattagcaca tcattgtcaa cacctaagcg ttacttcagc tcaaagtcat 2280
gtatgtagcc atcataacca gtgcttgcct caggacgtgc ttcagattta tgtaaaaact 2340
ctgcctatcc tgcttaaatc cagggtaata agagatttgt ttttgagttg taatggagta 2400
agtcaaataa tcgaattaaa ttgcttaaat ggtattcgaa gtcattctct aaaagcattt 2460
gaaactctga taatcagcct aggggagcaa cagaaagatg cctcagttcc agatattgat 2520
gggatagaca ttgaacagaa ggagttgtcc tctgtacatg tgggtacttc ttttcatcat 2580
cagcaagctt attcagattc tcctcagagt ctcagcaaat tttatgctgg cctcaaagaa 2640
gcttatccaa agagacggaa gactgttaac caagatgttc atatcaacac aataaaccta 2700
ttcctctgtg tggctttttt atgcgtaagt aaagaagcag agtctgacag ggagtcggcc 2760
aatgactcag aagatacttc tggctatgac agcacagcca gcgagccttt aagtcatatg 2820
ctgccatgta tatctctcga gagccttgtc ttgccttctc ctgaacatat gcaccaagca 2880
gcagacattt ggtctatgtg tcgttggatc tacatgttga gttcagtgtt ccagaaacag 2940
ttttataggc ttggtggttt ccgagtatgc cataagttaa tatttatgat aatacagaaa 3000
ctgttcagaa gtcacaaaga ggagcaagga aaaaaggagg gagatacaag tgtaaatgaa 3060
aaccaggatt taaacagaat ttctcaacct aagagaacta tgaaggaaga tttattatct 3120
ttggctataa aaagtgaccc cataccatca gaactaggta gtctaaaaaa gagtgctgac 3180
agtttaggta aattagagtt acagcatatt tcttccataa atgtggaaga agtttcagct 3240
actgaagccg ctcccgagga agcaaagcta tttacaagtc aagaaagtga gacctcactt 3300
caaagtatac gacttttgga agcccttctg gccatttgtc ttcatggtgc cagaactagt 3360
caacagaaga tggaattgga gttacctaat cagaacttgt ctgtggaaag tatattattt 3420
gaaatgaggg accatctttc ccagtcaaag gtgattgaaa cacaactagc aaagccttta 3480
tttgatgccc tgcttcgagt tgccctcggg aattattcag cagattttga acataatgat 3540
gctatgactg agaagagtca tcaatctgca gaagaattgt catcccagcc tggtgatttt 3600
tcagaagaag ctgaggattc tcagtgttgt agttttaaac ttttagttga agaagaaggt 3660
tacgaagcag atagtgaaag caatcctgaa gatggcgaaa cccaggatga tggggtagac 3720
ttaaagtctg aaacagaagg tttcagtgca tcaagcagtc caaatgactt actcgaaaac 3780
ctcactcaag gggaaataat ttatcctgag atttgtatgc tggaattaaa tttgctttct 3840
gctagtaaag ccaaacttga tgtgcttgcc catgtatttg agagtttttt gaaaattatt 3900
aggcagaaag aaaagaatgt ttttctgctc atgcaacagg gaactgtgaa aaatctttta 3960
ggagggttct tgagtatttt aacacaggat gattctgatt ttcaagcatg ccagagagta 4020
ttggtggatc ttttggtatc tttgatgagt tcaagaacat gttcagaaga gctaaccctt 4080
cttttgagaa tatttctgga gaaatctcct tgtacaaaaa ttcttcttct gggtattctg 4140
aaaattattg aaagtgatac tactatgagc ccttcacagt atctaacctt ccctttactg 4200
cacgctccaa atttaagcaa cggtgtttca tcacaaaagt atcctgggat tttaaacagt 4260
aaggccatgg gtttattgag aagagcacga gtttcacgga gcaagaaaga ggctgataga 4320
gagagttttc cccatcggct gctttcatct tggcacatag ccccagtcca cctgccgttg 4380
ctggggcaaa actgctggcc acacctatca gaaggtttca gtgtttccct gtggtttaat 4440
gtggagtgta tccatgaagc tgagagtact acagaaaaag gaaagaagat aaagaaaaga 4500
aacaaatcat taattttacc agatagcagt tttgatggta cagagagcga cagaccagaa 4560
ggtgcagagt acataaatcc tggtgaaaga ctcatagaag aaggatgtat tcatataatt 4620
tcactgggat ccaaagcgtt gatgatccaa gtgtgggctg atccccacaa tgccactctt 4680
atctttcgtg tgtgcatgga ttcaaatgat gacatgaaag ctgttttact agcacaggtt 4740
gaatcacagg agaatatttt cctcccaagc aaatggcaac atttagtact cacctactta 4800
cagcagcccc aagggaaaag gaggattcat gggaaaatct ccatatgggt ctctggacag 4860
aggaagcctg atgttacttt ggattttatg cttccaagaa aaacaagttt gtcatctgat 4920
agcaataaaa cattttgcat gattggccat tgtttatcat cccaagaaga gtttttgcag 4980
ttggctggaa aatgggacct gggaaatttg cttctcttca acggagctaa ggttggttca 5040
caagaggcct tttatctgta tgcttgtgga cccaaccata catctgtaat gccatgtaag 5100
tatggcaagc cagtcaatga ctactccaaa tatattaata aagaaatttt gcgatgtgaa 5160
caaatcagag aactttttat gaccaagaaa gatgtggata ttggtctctt aattgaaagt 5220
ctttcagttg tttatacaac ttactgtcct gctcagtata ccatctatga accagtgatt 5280
agacttaaag gtcaaatgaa aacccaactc tctcaaagac ccttcagctc aaaagaagtt 5340
cagagcatct tattagaacc tcatcatcta aagaatctcc aacctactga atataaaact 5400
attcaaggca ttctgcacga aattggtgga actggcatat ttgtttttct ctttgccagg 5460
gttgttgaac tcagtagctg tgaagaaact caagcattag cactgcgagt tatactctca 5520
ttaattaaat acaaccaaca aagagtacat gaattagaaa attgtaatgg actttctatg 5580
attcatcagg tgttgatcaa acaaaaatgc attgttgggt tttacatttt gaagaccctt 5640
cttgaaggat gctgtggtga agatattatt tatatgaatg agaatggaga gtttaagttg 5700
gatgtagact ctaatgctat aatccaagat gttaagctgt tagaggaact attgcttgac 5760
tggaagatat ggagtaaagc agagcaaggt gtttgggaaa ctttgctagc agctctagaa 5820
gtcctcatca gagcagatca ccaccagcag atgtttaata ttaagcagtt attgaaagct 5880
caagtggttc atcactttct actgacttgt caggttttgc aggaatacaa agaggggcaa 5940
ctcacaccca tgccccgaga ggtttgtaga tcatttgtga aaattatagc agaagtcctt 6000
ggatctcctc cagatttgga attattgaca attatcttca atttcctttt agcagttcac 6060
cctcctacta atacttacgt ttgtcacaat cccacgaact tctacttttc tttgcacata 6120
gatggcaaga tctttcagga gaaagtgcgg tcaatcatgt acctgaggca ttccagcagt 6180
ggaggaaggt cccttatgag ccctggattt atggtaataa gcccatctgg ttttactgct 6240
tcaccatatg aaggagagaa ttcctctaat attattccac aacagatggc cgcccatatg 6300
ctgcgttcta gaagcctacc agcattccct acttcttcac tactaacgca atcacaaaaa 6360
ctgactggaa gtttgggttg tagtatcgac aggttacaaa atattgcaga tacttatgtt 6420
gccacccaat caaagaaaca aaattctttg gggagttccg acacactgaa aaaaggcaaa 6480
gaggacgcat tcatcagtag ctgtgagtct gcaaaaactg tttgtgaaat ggaagctgtc 6540
ctctcagccc aggtctctgt cagtgatgtc ccaaagggag tgctgggatt tccagtggtc 6600
aaagcagatc ataaacagtt gggagcagaa cccaggtcag aagatgacag tcctggggat 6660
gagtcctgcc cacgccgacc tgattaccta aagggattgg cctccttcca gcgaagccac 6720
agcactattg caagccttgg gctagctttt ccttcacaga acggatctgc agctgttggc 6780
cgttggccaa gtcttgttga tagaaacact gatgattggg aaaactttgc ctattctctt 6840
ggttatgagc caaattacaa ccgaactgca agtgctcaca gtgtaactga agactgtttg 6900
gtacctatat gctgtggatt atatgaactc ctaagtgggg ttcttcttat cctgcctgat 6960
gttttgcttg aagatgtgat ggacaagctt attcaagcag atacactttt ggtcctcgtt 7020
aaccacccat caccagctat acaacaaggt gttattaaac tattagatgc atattttgct 7080
agagcatcta aggaacaaaa agataaattt ctgaagaatc gtggattttc cttgctagcc 7140
aaccagttgt atcttcatcg aggaactcaa gaattgttag aatgcttcat cgaaatgttc 7200
tttggtcgac atattggcct tgatgaagaa tttgatctgg aagatgtgag aaacatggga 7260
ttgtttcaga agtggtctgt cattcctatt ctgggactaa tagagacctc tctatatgac 7320
aacatactct tgcataatgc tcttttactt cttctccaaa ttttaaattc ttgttctaag 7380
gtagcagata tgttgctgga taatggtcta ctctatgtgt tatgtaatac agtagcagcc 7440
ctgaatggat tagaaaagaa cattcccatg agtgaatata aattgcttgc ttgtgatata 7500
cagcaacttt tcatagcagt tacaattcat gcttgcagtt cctcaggctc acaatatttt 7560
agggttattg aagaccttat tgtaatgctt ggatatcttc aaaatagcaa aaacaagagg 7620
acacaaaata tggctgttgc actacagctt agagttctcc aggctgctat ggaatttata 7680
aggaccaccg caaatcatga ctctgaaaac ctcacagatt cactccagtc accttctgct 7740
ccccatcatg cagtagttca aaagcggaaa agcattgctg gtcctcgaaa atttcccctt 7800
gctcaaactg aatcgcttct gatgaaaatg cgttcagtgg caaatgatga gcttcatgtg 7860
atgatgcaac ggagaatgag ccaagagaac cctagccaag caactgaaac ggaacttgcg 7920
cagagactac agaggctcac tgttttagca gtcaacagga ttatttatca agaatttaat 7980
tcagacatta ttgacatttt gagaactcca gaaaatgtaa ctcaaagcaa gacctcagtt 8040
ttccagaccg aaatttctga ggaaaatatt catcatgaac agtcttctgt tttcaatcca 8100
tttcagaaag aaatttttac atatctggta gaaggattca aagtatctat tggttcaagt 8160
aaagccagtg gttccaagca gcaatggact aaaattctgt ggtcttgtaa ggagaccttc 8220
cgaatgcagc ttgggagact actagtgcat attttgtcgc cagcccacgc tgcacaagag 8280
agaaagcaaa tttttgaaat agttcatgaa ccaaatcatc aggaaatact acgagactgt 8340
ctcagcccat ccctacaaca tggagccaag ttagttttgt atttgtcaga gttgatacat 8400
aatcaccaag gtgaattgac tgaagaagag ctaggcacag cagaactgct tatgaatgct 8460
ttgaagttat gtggtcacaa gtgcatccct cccagtgcat caacaaaagc agaccttatt 8520
aaaatgatca aagaggaaca aaagaaatat gaaactgaag aaggagtgaa taaagctgct 8580
tggcagaaaa cagttaacaa taatcaacaa agtctctttc agcgtctgga ttcaaaatca 8640
aaggatatat ctaaaatagc tgcagatatc acccaggcag tgtctctctc ccaaggaaat 8700
gagagaaaaa aggtgatcca gcatattaga ggaatgtata aagtagattt gagtgccagc 8760
agacattggc aggaacttat tcagcagctg acacatgata gagcagtatg gtatgacccc 8820
atctactatc caacctcatg gcagttggat ccaacagaag ggccaaatcg agagaggaga 8880
cgtttacaga gatgttattt aactattcca aataagtatc tccttaggga tagacagaaa 8940
tcagaagatg ttgtcaaacc accactctct tacctgtttg aagacaaaac tcattcttct 9000
ttctcttcta ctgtcaaaga caaagctgca agtgaatcta taagagtgaa tcgaagatgc 9060
atcagtgttg caccatctag agagacagct ggtgaattgt tactaggtaa atgtggaatg 9120
tattttgtgg aagataatgc ttctgataca gttgaaagtt cgagccttca gggagagttg 9180
gaaccagcat cattttcctg gacatatgaa gaaattaaag aagttcacaa gcgttggtgg 9240
caattgagag ataatgctgt agaaatcttt ctaacaaatg gcagaacact cctgttggca 9300
tttgataaca ccaaggttcg tgatgatgta taccacaata tactcacaaa taacctccct 9360
aatcttctgg aatatggtaa catcaccgct ctgacaaatt tatggtatac tgggcaaatt 9420
actaattttg aatatttgac tcacttaaac aaacatgctg gccgatcctt caatgatctc 9480
atgcagtatc ctgtgttccc atttatactt gctgactacg ttagtgagac acttgacctc 9540
aatgatctgt tgatatacag aaatctctct aaacctatag ctgttcagta taaagaaaaa 9600
gaagatcgtt atgtggacac atacaagtac ttggaggaag agtaccgcaa aggagccaga 9660
gaagatgacc ccatgcctcc cgtgcagccc tatcactatg gctcccacta ttccaatagc 9720
ggcactgtgc ttcacttcct ggtcaggatg cctcctttca ctaaaatgtt tttagcctat 9780
caagatcaaa gttttgacat tccagacaga acttttcatt ctacaaatac aacttggcga 9840
ctctcatctt ttgaatctat gactgatgtg aaagaactta tcccagagtt tttctatctt 9900
ccagagttcc tagttaaccg tgaaggtttt gattttggtg tgcgtcagaa tggtgaacgg 9960
gttaatcacg tcaaccttcc cccttgggcg cgtaatgatc ctcgtctttt tatcctcatc 10020
catcggcagg ctctagagtc tgactacgtg tcgcagaaca tctgtcagtg gattgacttg 10080
gtgtttgggt ataagcaaaa ggggaaggct tctgttcaag cgatcaatgt ttttcatcct 10140
gctacatatt ttggaatgga tgtctctgca gttgaagatc cagttcagag acgagcgcta 10200
gaaaccatga taaaaaccta cgggcagact ccccgtcagc tgttccacat ggcccatgtg 10260
agcagacctg gagccaagct caatattgaa ggagagcttc cagctgctgt ggggttgcta 10320
gtgcagtttg ctttcaggga gacccgagaa caggtcaaag aaatcaccta tccgagtcct 10380
ttgtcatgga taaaaggctt gaaatggggg gaatacgtgg gttcccccag tgctccagta 10440
cctgtggtct gcttcagcca gccccacgga gaaagatttg gctctctcca ggctctgccc 10500
accagagcaa tctgtggttt gtcacggaat ttctgtcttc tgatgacata tagcaaggaa 10560
caaggtgtga gaagcatgaa cagtacggac attcagtggt cagccatcct gagctgggga 10620
tatgctgata atattttaag gttgaagagt aaacaaagtg agcctccagt aaactttatt 10680
caaagttcac aacagtacca ggtgactagt tgtgcttggg tgcctgacag ttgccagctg 10740
tttactggaa gcaaatgcgg tgtcatcaca gcctacacaa acagatttac aagcagcacg 10800
ccatcagaaa tagaaatgga gactcaaata catctctatg gtcacacaga agagataacc 10860
agcttatttg tttgcaaacc atacagtata ctgataagtg tgagcagaga cggaacctgc 10920
atcatatggg atttaaacag gttatgctat gtacaaagtc tggcgggaca caaaagccct 10980
gtcacagctg tctctgccag tgaaacctca ggtgatattg ctactgtgtg tgattcagct 11040
ggcggaggca gtgacctcag actctggacg gtgaacgggg atctcgttgg acatgtccac 11100
tgcagggaga tcatctgttc cgtggctttc tccaaccagc ctgagggagt atctatcaat 11160
gtaatcgctg ggggattaga aaatggaatt gtaaggttat ggagcacatg ggacttaaag 11220
cctgtgagag aaattacatt tcccaaatca aataagccca tcatcagcct tacattttct 11280
tgtgatggcc accatttgta cacagcaaac agtgatggga ccgtgattgc ctggtgtcgg 11340
aaggaccagc agcgcttgaa acagccaatg ttctattcct tccttagcag ctatgcagcc 11400
gggtga 11406
<210> 115
<211> 3801
<212> PRT
<213> Homo sapiens
<400> 115
Met Ser Thr Asp Ser Asn Ser Leu Ala Arg Glu Phe Leu Thr Asp Val
1 5 10 15
Asn Arg Leu Cys Asn Ala Val Val Gln Arg Val Glu Ala Arg Glu Glu
20 25 30
Glu Glu Glu Glu Thr His Met Ala Thr Leu Gly Gln Tyr Leu Val His
35 40 45
Gly Arg Gly Phe Leu Leu Leu Thr Lys Leu Asn Ser Ile Ile Asp Gln
50 55 60
Ala Leu Thr Cys Arg Glu Glu Leu Leu Thr Leu Leu Leu Ser Leu Leu
65 70 75 80
Pro Leu Val Trp Lys Ile Pro Val Gln Glu Glu Lys Ala Thr Asp Phe
85 90 95
Asn Leu Pro Leu Ser Ala Asp Ile Ile Leu Thr Lys Glu Lys Asn Ser
100 105 110
Ser Ser Gln Arg Ser Thr Gln Glu Lys Leu His Leu Glu Gly Ser Ala
115 120 125
Leu Ser Ser Gln Val Ser Ala Lys Val Asn Val Phe Arg Lys Ser Arg
130 135 140
Arg Gln Arg Lys Ile Thr His Arg Tyr Ser Val Arg Asp Ala Arg Lys
145 150 155 160
Thr Gln Leu Ser Thr Ser Asp Ser Glu Ala Asn Ser Asp Glu Lys Gly
165 170 175
Ile Ala Met Asn Lys His Arg Arg Pro His Leu Leu His His Phe Leu
180 185 190
Thr Ser Phe Pro Lys Gln Asp His Pro Lys Ala Lys Leu Asp Arg Leu
195 200 205
Ala Thr Lys Glu Gln Thr Pro Pro Asp Ala Met Ala Leu Glu Asn Ser
210 215 220
Arg Glu Ile Ile Pro Arg Gln Gly Ser Asn Thr Asp Ile Leu Ser Glu
225 230 235 240
Pro Ala Ala Leu Ser Val Ile Ser Asn Met Asn Asn Ser Pro Phe Asp
245 250 255
Leu Cys His Val Leu Leu Ser Leu Leu Glu Lys Val Cys Lys Phe Asp
260 265 270
Val Thr Leu Asn His Asn Ser Pro Leu Ala Ala Ser Val Val Pro Thr
275 280 285
Leu Thr Glu Phe Leu Ala Gly Phe Gly Asp Cys Cys Ser Leu Ser Asp
290 295 300
Asn Leu Glu Ser Arg Val Val Ser Ala Gly Trp Thr Glu Glu Pro Val
305 310 315 320
Ala Leu Ile Gln Arg Met Leu Phe Arg Thr Val Leu His Leu Leu Ser
325 330 335
Val Asp Val Ser Thr Ala Glu Met Met Pro Glu Asn Leu Arg Lys Asn
340 345 350
Leu Thr Glu Leu Leu Arg Ala Ala Leu Lys Ile Arg Ile Cys Leu Glu
355 360 365
Lys Gln Pro Asp Pro Phe Ala Pro Arg Gln Lys Lys Thr Leu Gln Glu
370 375 380
Val Gln Glu Asp Phe Val Phe Ser Lys Tyr Arg His Arg Ala Leu Leu
385 390 395 400
Leu Pro Glu Leu Leu Glu Gly Val Leu Gln Ile Leu Ile Cys Cys Leu
405 410 415
Gln Ser Ala Ala Ser Asn Pro Phe Tyr Phe Ser Gln Ala Met Asp Leu
420 425 430
Val Gln Glu Phe Ile Gln His His Gly Phe Asn Leu Phe Glu Thr Ala
435 440 445
Val Leu Gln Met Glu Trp Leu Val Leu Arg Asp Gly Val Pro Pro Glu
450 455 460
Ala Ser Glu His Leu Lys Ala Leu Ile Asn Ser Val Met Lys Ile Met
465 470 475 480
Ser Thr Val Lys Lys Val Lys Ser Glu Gln Leu His His Ser Met Cys
485 490 495
Thr Arg Lys Arg His Arg Arg Cys Glu Tyr Ser His Phe Met His His
500 505 510
His Arg Asp Leu Ser Gly Leu Leu Val Ser Ala Phe Lys Asn Gln Val
515 520 525
Ser Lys Asn Pro Phe Glu Glu Thr Ala Asp Gly Asp Val Tyr Tyr Pro
530 535 540
Glu Arg Cys Cys Cys Ile Ala Val Cys Ala His Gln Cys Leu Arg Leu
545 550 555 560
Leu Gln Gln Ala Ser Leu Ser Ser Thr Cys Val Gln Ile Leu Ser Gly
565 570 575
Val His Asn Ile Gly Ile Cys Cys Cys Met Asp Pro Lys Ser Val Ile
580 585 590
Ile Pro Leu Leu His Ala Phe Lys Leu Pro Ala Leu Lys Asn Phe Gln
595 600 605
Gln His Ile Leu Asn Ile Leu Asn Lys Leu Ile Leu Asp Gln Leu Gly
610 615 620
Gly Ala Glu Ile Ser Pro Lys Ile Lys Lys Ala Ala Cys Asn Ile Cys
625 630 635 640
Thr Val Asp Ser Asp Gln Leu Ala Gln Leu Glu Glu Thr Leu Gln Gly
645 650 655
Asn Leu Cys Asp Ala Glu Leu Ser Ser Ser Leu Ser Ser Pro Ser Tyr
660 665 670
Arg Phe Gln Gly Ile Leu Pro Ser Ser Gly Ser Glu Asp Leu Leu Trp
675 680 685
Lys Trp Asp Ala Leu Lys Ala Tyr Gln Asn Phe Val Phe Glu Glu Asp
690 695 700
Arg Leu His Ser Ile Gln Ile Ala Asn His Ile Cys Asn Leu Ile Gln
705 710 715 720
Lys Gly Asn Ile Val Val Gln Trp Lys Leu Tyr Asn Tyr Ile Phe Asn
725 730 735
Pro Val Leu Gln Arg Gly Val Glu Leu Ala His His Cys Gln His Leu
740 745 750
Ser Val Thr Ser Ala Gln Ser His Val Cys Ser His His Asn Gln Cys
755 760 765
Leu Pro Gln Asp Val Leu Gln Ile Tyr Val Lys Thr Leu Pro Ile Leu
770 775 780
Leu Lys Ser Arg Val Ile Arg Asp Leu Phe Leu Ser Cys Asn Gly Val
785 790 795 800
Ser Gln Ile Ile Glu Leu Asn Cys Leu Asn Gly Ile Arg Ser His Ser
805 810 815
Leu Lys Ala Phe Glu Thr Leu Ile Ile Ser Leu Gly Glu Gln Gln Lys
820 825 830
Asp Ala Ser Val Pro Asp Ile Asp Gly Ile Asp Ile Glu Gln Lys Glu
835 840 845
Leu Ser Ser Val His Val Gly Thr Ser Phe His His Gln Gln Ala Tyr
850 855 860
Ser Asp Ser Pro Gln Ser Leu Ser Lys Phe Tyr Ala Gly Leu Lys Glu
865 870 875 880
Ala Tyr Pro Lys Arg Arg Lys Thr Val Asn Gln Asp Val His Ile Asn
885 890 895
Thr Ile Asn Leu Phe Leu Cys Val Ala Phe Leu Cys Val Ser Lys Glu
900 905 910
Ala Glu Ser Asp Arg Glu Ser Ala Asn Asp Ser Glu Asp Thr Ser Gly
915 920 925
Tyr Asp Ser Thr Ala Ser Glu Pro Leu Ser His Met Leu Pro Cys Ile
930 935 940
Ser Leu Glu Ser Leu Val Leu Pro Ser Pro Glu His Met His Gln Ala
945 950 955 960
Ala Asp Ile Trp Ser Met Cys Arg Trp Ile Tyr Met Leu Ser Ser Val
965 970 975
Phe Gln Lys Gln Phe Tyr Arg Leu Gly Gly Phe Arg Val Cys His Lys
980 985 990
Leu Ile Phe Met Ile Ile Gln Lys Leu Phe Arg Ser His Lys Glu Glu
995 1000 1005
Gln Gly Lys Lys Glu Gly Asp Thr Ser Val Asn Glu Asn Gln Asp
1010 1015 1020
Leu Asn Arg Ile Ser Gln Pro Lys Arg Thr Met Lys Glu Asp Leu
1025 1030 1035
Leu Ser Leu Ala Ile Lys Ser Asp Pro Ile Pro Ser Glu Leu Gly
1040 1045 1050
Ser Leu Lys Lys Ser Ala Asp Ser Leu Gly Lys Leu Glu Leu Gln
1055 1060 1065
His Ile Ser Ser Ile Asn Val Glu Glu Val Ser Ala Thr Glu Ala
1070 1075 1080
Ala Pro Glu Glu Ala Lys Leu Phe Thr Ser Gln Glu Ser Glu Thr
1085 1090 1095
Ser Leu Gln Ser Ile Arg Leu Leu Glu Ala Leu Leu Ala Ile Cys
1100 1105 1110
Leu His Gly Ala Arg Thr Ser Gln Gln Lys Met Glu Leu Glu Leu
1115 1120 1125
Pro Asn Gln Asn Leu Ser Val Glu Ser Ile Leu Phe Glu Met Arg
1130 1135 1140
Asp His Leu Ser Gln Ser Lys Val Ile Glu Thr Gln Leu Ala Lys
1145 1150 1155
Pro Leu Phe Asp Ala Leu Leu Arg Val Ala Leu Gly Asn Tyr Ser
1160 1165 1170
Ala Asp Phe Glu His Asn Asp Ala Met Thr Glu Lys Ser His Gln
1175 1180 1185
Ser Ala Glu Glu Leu Ser Ser Gln Pro Gly Asp Phe Ser Glu Glu
1190 1195 1200
Ala Glu Asp Ser Gln Cys Cys Ser Phe Lys Leu Leu Val Glu Glu
1205 1210 1215
Glu Gly Tyr Glu Ala Asp Ser Glu Ser Asn Pro Glu Asp Gly Glu
1220 1225 1230
Thr Gln Asp Asp Gly Val Asp Leu Lys Ser Glu Thr Glu Gly Phe
1235 1240 1245
Ser Ala Ser Ser Ser Pro Asn Asp Leu Leu Glu Asn Leu Thr Gln
1250 1255 1260
Gly Glu Ile Ile Tyr Pro Glu Ile Cys Met Leu Glu Leu Asn Leu
1265 1270 1275
Leu Ser Ala Ser Lys Ala Lys Leu Asp Val Leu Ala His Val Phe
1280 1285 1290
Glu Ser Phe Leu Lys Ile Ile Arg Gln Lys Glu Lys Asn Val Phe
1295 1300 1305
Leu Leu Met Gln Gln Gly Thr Val Lys Asn Leu Leu Gly Gly Phe
1310 1315 1320
Leu Ser Ile Leu Thr Gln Asp Asp Ser Asp Phe Gln Ala Cys Gln
1325 1330 1335
Arg Val Leu Val Asp Leu Leu Val Ser Leu Met Ser Ser Arg Thr
1340 1345 1350
Cys Ser Glu Glu Leu Thr Leu Leu Leu Arg Ile Phe Leu Glu Lys
1355 1360 1365
Ser Pro Cys Thr Lys Ile Leu Leu Leu Gly Ile Leu Lys Ile Ile
1370 1375 1380
Glu Ser Asp Thr Thr Met Ser Pro Ser Gln Tyr Leu Thr Phe Pro
1385 1390 1395
Leu Leu His Ala Pro Asn Leu Ser Asn Gly Val Ser Ser Gln Lys
1400 1405 1410
Tyr Pro Gly Ile Leu Asn Ser Lys Ala Met Gly Leu Leu Arg Arg
1415 1420 1425
Ala Arg Val Ser Arg Ser Lys Lys Glu Ala Asp Arg Glu Ser Phe
1430 1435 1440
Pro His Arg Leu Leu Ser Ser Trp His Ile Ala Pro Val His Leu
1445 1450 1455
Pro Leu Leu Gly Gln Asn Cys Trp Pro His Leu Ser Glu Gly Phe
1460 1465 1470
Ser Val Ser Leu Trp Phe Asn Val Glu Cys Ile His Glu Ala Glu
1475 1480 1485
Ser Thr Thr Glu Lys Gly Lys Lys Ile Lys Lys Arg Asn Lys Ser
1490 1495 1500
Leu Ile Leu Pro Asp Ser Ser Phe Asp Gly Thr Glu Ser Asp Arg
1505 1510 1515
Pro Glu Gly Ala Glu Tyr Ile Asn Pro Gly Glu Arg Leu Ile Glu
1520 1525 1530
Glu Gly Cys Ile His Ile Ile Ser Leu Gly Ser Lys Ala Leu Met
1535 1540 1545
Ile Gln Val Trp Ala Asp Pro His Asn Ala Thr Leu Ile Phe Arg
1550 1555 1560
Val Cys Met Asp Ser Asn Asp Asp Met Lys Ala Val Leu Leu Ala
1565 1570 1575
Gln Val Glu Ser Gln Glu Asn Ile Phe Leu Pro Ser Lys Trp Gln
1580 1585 1590
His Leu Val Leu Thr Tyr Leu Gln Gln Pro Gln Gly Lys Arg Arg
1595 1600 1605
Ile His Gly Lys Ile Ser Ile Trp Val Ser Gly Gln Arg Lys Pro
1610 1615 1620
Asp Val Thr Leu Asp Phe Met Leu Pro Arg Lys Thr Ser Leu Ser
1625 1630 1635
Ser Asp Ser Asn Lys Thr Phe Cys Met Ile Gly His Cys Leu Ser
1640 1645 1650
Ser Gln Glu Glu Phe Leu Gln Leu Ala Gly Lys Trp Asp Leu Gly
1655 1660 1665
Asn Leu Leu Leu Phe Asn Gly Ala Lys Val Gly Ser Gln Glu Ala
1670 1675 1680
Phe Tyr Leu Tyr Ala Cys Gly Pro Asn His Thr Ser Val Met Pro
1685 1690 1695
Cys Lys Tyr Gly Lys Pro Val Asn Asp Tyr Ser Lys Tyr Ile Asn
1700 1705 1710
Lys Glu Ile Leu Arg Cys Glu Gln Ile Arg Glu Leu Phe Met Thr
1715 1720 1725
Lys Lys Asp Val Asp Ile Gly Leu Leu Ile Glu Ser Leu Ser Val
1730 1735 1740
Val Tyr Thr Thr Tyr Cys Pro Ala Gln Tyr Thr Ile Tyr Glu Pro
1745 1750 1755
Val Ile Arg Leu Lys Gly Gln Met Lys Thr Gln Leu Ser Gln Arg
1760 1765 1770
Pro Phe Ser Ser Lys Glu Val Gln Ser Ile Leu Leu Glu Pro His
1775 1780 1785
His Leu Lys Asn Leu Gln Pro Thr Glu Tyr Lys Thr Ile Gln Gly
1790 1795 1800
Ile Leu His Glu Ile Gly Gly Thr Gly Ile Phe Val Phe Leu Phe
1805 1810 1815
Ala Arg Val Val Glu Leu Ser Ser Cys Glu Glu Thr Gln Ala Leu
1820 1825 1830
Ala Leu Arg Val Ile Leu Ser Leu Ile Lys Tyr Asn Gln Gln Arg
1835 1840 1845
Val His Glu Leu Glu Asn Cys Asn Gly Leu Ser Met Ile His Gln
1850 1855 1860
Val Leu Ile Lys Gln Lys Cys Ile Val Gly Phe Tyr Ile Leu Lys
1865 1870 1875
Thr Leu Leu Glu Gly Cys Cys Gly Glu Asp Ile Ile Tyr Met Asn
1880 1885 1890
Glu Asn Gly Glu Phe Lys Leu Asp Val Asp Ser Asn Ala Ile Ile
1895 1900 1905
Gln Asp Val Lys Leu Leu Glu Glu Leu Leu Leu Asp Trp Lys Ile
1910 1915 1920
Trp Ser Lys Ala Glu Gln Gly Val Trp Glu Thr Leu Leu Ala Ala
1925 1930 1935
Leu Glu Val Leu Ile Arg Ala Asp His His Gln Gln Met Phe Asn
1940 1945 1950
Ile Lys Gln Leu Leu Lys Ala Gln Val Val His His Phe Leu Leu
1955 1960 1965
Thr Cys Gln Val Leu Gln Glu Tyr Lys Glu Gly Gln Leu Thr Pro
1970 1975 1980
Met Pro Arg Glu Val Cys Arg Ser Phe Val Lys Ile Ile Ala Glu
1985 1990 1995
Val Leu Gly Ser Pro Pro Asp Leu Glu Leu Leu Thr Ile Ile Phe
2000 2005 2010
Asn Phe Leu Leu Ala Val His Pro Pro Thr Asn Thr Tyr Val Cys
2015 2020 2025
His Asn Pro Thr Asn Phe Tyr Phe Ser Leu His Ile Asp Gly Lys
2030 2035 2040
Ile Phe Gln Glu Lys Val Arg Ser Ile Met Tyr Leu Arg His Ser
2045 2050 2055
Ser Ser Gly Gly Arg Ser Leu Met Ser Pro Gly Phe Met Val Ile
2060 2065 2070
Ser Pro Ser Gly Phe Thr Ala Ser Pro Tyr Glu Gly Glu Asn Ser
2075 2080 2085
Ser Asn Ile Ile Pro Gln Gln Met Ala Ala His Met Leu Arg Ser
2090 2095 2100
Arg Ser Leu Pro Ala Phe Pro Thr Ser Ser Leu Leu Thr Gln Ser
2105 2110 2115
Gln Lys Leu Thr Gly Ser Leu Gly Cys Ser Ile Asp Arg Leu Gln
2120 2125 2130
Asn Ile Ala Asp Thr Tyr Val Ala Thr Gln Ser Lys Lys Gln Asn
2135 2140 2145
Ser Leu Gly Ser Ser Asp Thr Leu Lys Lys Gly Lys Glu Asp Ala
2150 2155 2160
Phe Ile Ser Ser Cys Glu Ser Ala Lys Thr Val Cys Glu Met Glu
2165 2170 2175
Ala Val Leu Ser Ala Gln Val Ser Val Ser Asp Val Pro Lys Gly
2180 2185 2190
Val Leu Gly Phe Pro Val Val Lys Ala Asp His Lys Gln Leu Gly
2195 2200 2205
Ala Glu Pro Arg Ser Glu Asp Asp Ser Pro Gly Asp Glu Ser Cys
2210 2215 2220
Pro Arg Arg Pro Asp Tyr Leu Lys Gly Leu Ala Ser Phe Gln Arg
2225 2230 2235
Ser His Ser Thr Ile Ala Ser Leu Gly Leu Ala Phe Pro Ser Gln
2240 2245 2250
Asn Gly Ser Ala Ala Val Gly Arg Trp Pro Ser Leu Val Asp Arg
2255 2260 2265
Asn Thr Asp Asp Trp Glu Asn Phe Ala Tyr Ser Leu Gly Tyr Glu
2270 2275 2280
Pro Asn Tyr Asn Arg Thr Ala Ser Ala His Ser Val Thr Glu Asp
2285 2290 2295
Cys Leu Val Pro Ile Cys Cys Gly Leu Tyr Glu Leu Leu Ser Gly
2300 2305 2310
Val Leu Leu Ile Leu Pro Asp Val Leu Leu Glu Asp Val Met Asp
2315 2320 2325
Lys Leu Ile Gln Ala Asp Thr Leu Leu Val Leu Val Asn His Pro
2330 2335 2340
Ser Pro Ala Ile Gln Gln Gly Val Ile Lys Leu Leu Asp Ala Tyr
2345 2350 2355
Phe Ala Arg Ala Ser Lys Glu Gln Lys Asp Lys Phe Leu Lys Asn
2360 2365 2370
Arg Gly Phe Ser Leu Leu Ala Asn Gln Leu Tyr Leu His Arg Gly
2375 2380 2385
Thr Gln Glu Leu Leu Glu Cys Phe Ile Glu Met Phe Phe Gly Arg
2390 2395 2400
His Ile Gly Leu Asp Glu Glu Phe Asp Leu Glu Asp Val Arg Asn
2405 2410 2415
Met Gly Leu Phe Gln Lys Trp Ser Val Ile Pro Ile Leu Gly Leu
2420 2425 2430
Ile Glu Thr Ser Leu Tyr Asp Asn Ile Leu Leu His Asn Ala Leu
2435 2440 2445
Leu Leu Leu Leu Gln Ile Leu Asn Ser Cys Ser Lys Val Ala Asp
2450 2455 2460
Met Leu Leu Asp Asn Gly Leu Leu Tyr Val Leu Cys Asn Thr Val
2465 2470 2475
Ala Ala Leu Asn Gly Leu Glu Lys Asn Ile Pro Met Ser Glu Tyr
2480 2485 2490
Lys Leu Leu Ala Cys Asp Ile Gln Gln Leu Phe Ile Ala Val Thr
2495 2500 2505
Ile His Ala Cys Ser Ser Ser Gly Ser Gln Tyr Phe Arg Val Ile
2510 2515 2520
Glu Asp Leu Ile Val Met Leu Gly Tyr Leu Gln Asn Ser Lys Asn
2525 2530 2535
Lys Arg Thr Gln Asn Met Ala Val Ala Leu Gln Leu Arg Val Leu
2540 2545 2550
Gln Ala Ala Met Glu Phe Ile Arg Thr Thr Ala Asn His Asp Ser
2555 2560 2565
Glu Asn Leu Thr Asp Ser Leu Gln Ser Pro Ser Ala Pro His His
2570 2575 2580
Ala Val Val Gln Lys Arg Lys Ser Ile Ala Gly Pro Arg Lys Phe
2585 2590 2595
Pro Leu Ala Gln Thr Glu Ser Leu Leu Met Lys Met Arg Ser Val
2600 2605 2610
Ala Asn Asp Glu Leu His Val Met Met Gln Arg Arg Met Ser Gln
2615 2620 2625
Glu Asn Pro Ser Gln Ala Thr Glu Thr Glu Leu Ala Gln Arg Leu
2630 2635 2640
Gln Arg Leu Thr Val Leu Ala Val Asn Arg Ile Ile Tyr Gln Glu
2645 2650 2655
Phe Asn Ser Asp Ile Ile Asp Ile Leu Arg Thr Pro Glu Asn Val
2660 2665 2670
Thr Gln Ser Lys Thr Ser Val Phe Gln Thr Glu Ile Ser Glu Glu
2675 2680 2685
Asn Ile His His Glu Gln Ser Ser Val Phe Asn Pro Phe Gln Lys
2690 2695 2700
Glu Ile Phe Thr Tyr Leu Val Glu Gly Phe Lys Val Ser Ile Gly
2705 2710 2715
Ser Ser Lys Ala Ser Gly Ser Lys Gln Gln Trp Thr Lys Ile Leu
2720 2725 2730
Trp Ser Cys Lys Glu Thr Phe Arg Met Gln Leu Gly Arg Leu Leu
2735 2740 2745
Val His Ile Leu Ser Pro Ala His Ala Ala Gln Glu Arg Lys Gln
2750 2755 2760
Ile Phe Glu Ile Val His Glu Pro Asn His Gln Glu Ile Leu Arg
2765 2770 2775
Asp Cys Leu Ser Pro Ser Leu Gln His Gly Ala Lys Leu Val Leu
2780 2785 2790
Tyr Leu Ser Glu Leu Ile His Asn His Gln Gly Glu Leu Thr Glu
2795 2800 2805
Glu Glu Leu Gly Thr Ala Glu Leu Leu Met Asn Ala Leu Lys Leu
2810 2815 2820
Cys Gly His Lys Cys Ile Pro Pro Ser Ala Ser Thr Lys Ala Asp
2825 2830 2835
Leu Ile Lys Met Ile Lys Glu Glu Gln Lys Lys Tyr Glu Thr Glu
2840 2845 2850
Glu Gly Val Asn Lys Ala Ala Trp Gln Lys Thr Val Asn Asn Asn
2855 2860 2865
Gln Gln Ser Leu Phe Gln Arg Leu Asp Ser Lys Ser Lys Asp Ile
2870 2875 2880
Ser Lys Ile Ala Ala Asp Ile Thr Gln Ala Val Ser Leu Ser Gln
2885 2890 2895
Gly Asn Glu Arg Lys Lys Val Ile Gln His Ile Arg Gly Met Tyr
2900 2905 2910
Lys Val Asp Leu Ser Ala Ser Arg His Trp Gln Glu Leu Ile Gln
2915 2920 2925
Gln Leu Thr His Asp Arg Ala Val Trp Tyr Asp Pro Ile Tyr Tyr
2930 2935 2940
Pro Thr Ser Trp Gln Leu Asp Pro Thr Glu Gly Pro Asn Arg Glu
2945 2950 2955
Arg Arg Arg Leu Gln Arg Cys Tyr Leu Thr Ile Pro Asn Lys Tyr
2960 2965 2970
Leu Leu Arg Asp Arg Gln Lys Ser Glu Asp Val Val Lys Pro Pro
2975 2980 2985
Leu Ser Tyr Leu Phe Glu Asp Lys Thr His Ser Ser Phe Ser Ser
2990 2995 3000
Thr Val Lys Asp Lys Ala Ala Ser Glu Ser Ile Arg Val Asn Arg
3005 3010 3015
Arg Cys Ile Ser Val Ala Pro Ser Arg Glu Thr Ala Gly Glu Leu
3020 3025 3030
Leu Leu Gly Lys Cys Gly Met Tyr Phe Val Glu Asp Asn Ala Ser
3035 3040 3045
Asp Thr Val Glu Ser Ser Ser Leu Gln Gly Glu Leu Glu Pro Ala
3050 3055 3060
Ser Phe Ser Trp Thr Tyr Glu Glu Ile Lys Glu Val His Lys Arg
3065 3070 3075
Trp Trp Gln Leu Arg Asp Asn Ala Val Glu Ile Phe Leu Thr Asn
3080 3085 3090
Gly Arg Thr Leu Leu Leu Ala Phe Asp Asn Thr Lys Val Arg Asp
3095 3100 3105
Asp Val Tyr His Asn Ile Leu Thr Asn Asn Leu Pro Asn Leu Leu
3110 3115 3120
Glu Tyr Gly Asn Ile Thr Ala Leu Thr Asn Leu Trp Tyr Thr Gly
3125 3130 3135
Gln Ile Thr Asn Phe Glu Tyr Leu Thr His Leu Asn Lys His Ala
3140 3145 3150
Gly Arg Ser Phe Asn Asp Leu Met Gln Tyr Pro Val Phe Pro Phe
3155 3160 3165
Ile Leu Ala Asp Tyr Val Ser Glu Thr Leu Asp Leu Asn Asp Leu
3170 3175 3180
Leu Ile Tyr Arg Asn Leu Ser Lys Pro Ile Ala Val Gln Tyr Lys
3185 3190 3195
Glu Lys Glu Asp Arg Tyr Val Asp Thr Tyr Lys Tyr Leu Glu Glu
3200 3205 3210
Glu Tyr Arg Lys Gly Ala Arg Glu Asp Asp Pro Met Pro Pro Val
3215 3220 3225
Gln Pro Tyr His Tyr Gly Ser His Tyr Ser Asn Ser Gly Thr Val
3230 3235 3240
Leu His Phe Leu Val Arg Met Pro Pro Phe Thr Lys Met Phe Leu
3245 3250 3255
Ala Tyr Gln Asp Gln Ser Phe Asp Ile Pro Asp Arg Thr Phe His
3260 3265 3270
Ser Thr Asn Thr Thr Trp Arg Leu Ser Ser Phe Glu Ser Met Thr
3275 3280 3285
Asp Val Lys Glu Leu Ile Pro Glu Phe Phe Tyr Leu Pro Glu Phe
3290 3295 3300
Leu Val Asn Arg Glu Gly Phe Asp Phe Gly Val Arg Gln Asn Gly
3305 3310 3315
Glu Arg Val Asn His Val Asn Leu Pro Pro Trp Ala Arg Asn Asp
3320 3325 3330
Pro Arg Leu Phe Ile Leu Ile His Arg Gln Ala Leu Glu Ser Asp
3335 3340 3345
Tyr Val Ser Gln Asn Ile Cys Gln Trp Ile Asp Leu Val Phe Gly
3350 3355 3360
Tyr Lys Gln Lys Gly Lys Ala Ser Val Gln Ala Ile Asn Val Phe
3365 3370 3375
His Pro Ala Thr Tyr Phe Gly Met Asp Val Ser Ala Val Glu Asp
3380 3385 3390
Pro Val Gln Arg Arg Ala Leu Glu Thr Met Ile Lys Thr Tyr Gly
3395 3400 3405
Gln Thr Pro Arg Gln Leu Phe His Met Ala His Val Ser Arg Pro
3410 3415 3420
Gly Ala Lys Leu Asn Ile Glu Gly Glu Leu Pro Ala Ala Val Gly
3425 3430 3435
Leu Leu Val Gln Phe Ala Phe Arg Glu Thr Arg Glu Gln Val Lys
3440 3445 3450
Glu Ile Thr Tyr Pro Ser Pro Leu Ser Trp Ile Lys Gly Leu Lys
3455 3460 3465
Trp Gly Glu Tyr Val Gly Ser Pro Ser Ala Pro Val Pro Val Val
3470 3475 3480
Cys Phe Ser Gln Pro His Gly Glu Arg Phe Gly Ser Leu Gln Ala
3485 3490 3495
Leu Pro Thr Arg Ala Ile Cys Gly Leu Ser Arg Asn Phe Cys Leu
3500 3505 3510
Leu Met Thr Tyr Ser Lys Glu Gln Gly Val Arg Ser Met Asn Ser
3515 3520 3525
Thr Asp Ile Gln Trp Ser Ala Ile Leu Ser Trp Gly Tyr Ala Asp
3530 3535 3540
Asn Ile Leu Arg Leu Lys Ser Lys Gln Ser Glu Pro Pro Val Asn
3545 3550 3555
Phe Ile Gln Ser Ser Gln Gln Tyr Gln Val Thr Ser Cys Ala Trp
3560 3565 3570
Val Pro Asp Ser Cys Gln Leu Phe Thr Gly Ser Lys Cys Gly Val
3575 3580 3585
Ile Thr Ala Tyr Thr Asn Arg Phe Thr Ser Ser Thr Pro Ser Glu
3590 3595 3600
Ile Glu Met Glu Thr Gln Ile His Leu Tyr Gly His Thr Glu Glu
3605 3610 3615
Ile Thr Ser Leu Phe Val Cys Lys Pro Tyr Ser Ile Leu Ile Ser
3620 3625 3630
Val Ser Arg Asp Gly Thr Cys Ile Ile Trp Asp Leu Asn Arg Leu
3635 3640 3645
Cys Tyr Val Gln Ser Leu Ala Gly His Lys Ser Pro Val Thr Ala
3650 3655 3660
Val Ser Ala Ser Glu Thr Ser Gly Asp Ile Ala Thr Val Cys Asp
3665 3670 3675
Ser Ala Gly Gly Gly Ser Asp Leu Arg Leu Trp Thr Val Asn Gly
3680 3685 3690
Asp Leu Val Gly His Val His Cys Arg Glu Ile Ile Cys Ser Val
3695 3700 3705
Ala Phe Ser Asn Gln Pro Glu Gly Val Ser Ile Asn Val Ile Ala
3710 3715 3720
Gly Gly Leu Glu Asn Gly Ile Val Arg Leu Trp Ser Thr Trp Asp
3725 3730 3735
Leu Lys Pro Val Arg Glu Ile Thr Phe Pro Lys Ser Asn Lys Pro
3740 3745 3750
Ile Ile Ser Leu Thr Phe Ser Cys Asp Gly His His Leu Tyr Thr
3755 3760 3765
Ala Asn Ser Asp Gly Thr Val Ile Ala Trp Cys Arg Lys Asp Gln
3770 3775 3780
Gln Arg Leu Lys Gln Pro Met Phe Tyr Ser Phe Leu Ser Ser Tyr
3785 3790 3795
Ala Ala Gly
3800
<210> 116
<211> 5568
<212> DNA
<213> Homo sapiens
<400> 116
atggctgcgt cggagctcta cacaaagttt gccagggttt ggatacctga tccagaggaa 60
gtctggaagt cagcagagct gctcaaagat tataagccag gagataaagt cctcctgctt 120
cacctcgagg aaggaaagga tttggaatac catctagatc caaagaccaa ggagctgcct 180
cacttacgaa atcctgacat acttgttggt gaaaatgacc tcacagccct cagctatctt 240
catgagcctg ctgtgctcca taatctcaga gtccgcttta ttgattccaa acttatttat 300
acgtattgtg gtatagtcct agtagctata aatccctatg aacagctgcc tatttatgga 360
gaagatatta ttaatgcata cagtggtcag aacatgggtg atatggatcc acatatcttt 420
gcagtagctg aagaagctta caagcaaatg gccagagatg aacgaaatca gtccatcatc 480
gtaagtggag agtctggggc aggaaaaaca gtctcagcta agtatgccat gcgatacttt 540
gcaactgtga gtggttctgc cagtgaggcc aatgtggagg aaaaggtctt ggcctccaac 600
cccatcatgg agtccattgg aaatgctaaa acaaccagga atgataatag cagccgtttt 660
gggaagtata ttgagattgg ttttgataag agatatcgaa tcattggtgc caatatgaga 720
acttatcttt tagagaaatc cagagtggta ttccaggcag aagaggagag aaactatcat 780
atcttctatc agctttgtgc ctcagcaaag ttacctgaat ttaaaatgct acgattagga 840
aatgcagata actttaatta cacaaaacaa ggaggcagtc ctgtgattga aggagtggat 900
gatgcaaagg agatggcaca tactaggcag gcctgcactt tgctaggaat tagtgaatct 960
catcaaatgg gaattttccg aatacttgct ggcatccttc acttaggcaa tgttggattt 1020
acatcccgag atgcagacag ctgcacaata cctcccaagc atgaacctct ctgcatcttc 1080
tgtgaactca tgggtgtgga ctatgaggag atgtgtcact ggctctgcca tcggaaactg 1140
gctactgcca cagagacata catcaagccc atctccaagc tgcaggccac gaatgcccgc 1200
gatgctttgg ccaagcacat ctatgccaag ctctttaact ggattgtaga taatgtcaat 1260
caggctctcc attctgctgt caaacagcac tcttttattg gtgtgctaga catttacgga 1320
tttgaaacat ttgagataaa tagttttgaa cagttttgca taaattatgc aaatgaaaaa 1380
ctacagcaac aattcaatat gcatgtcttc aaattggagc aagaagaata tatgaaggaa 1440
caaattccat ggacactcat agatttttat gataatcagc cttgtattaa tcttatagaa 1500
tcaaaactag gcattctaga tttactggat gaggaatgca agatgcctaa aggcacagat 1560
gacacctggg cccaaaaatt gtacaacaca catttgaaca aatgtgcact ctttgaaaag 1620
cctcgtctat caaacaaagc tttcatcatc caacattttg ctgacaaagt ggaataccag 1680
tgtgaaggat ttctcgaaaa gaataaagac accgtttttg aagaacaaat taaagttctt 1740
aaatcaagca agtttaagat gctaccagaa ctatttcaag atgatgagaa ggccatcagt 1800
ccaacttcag ccacctcctc agggcgcaca cccctcacac gaactcctgc aaagcccacc 1860
aaaggcagac caggccaaat ggccaaagag cacaagaaaa cagtggggca tcagttcaga 1920
aactccctgc acctgcttat ggagacactc aatgccacta cccctcacta tgtgcgctgt 1980
atcaagccta atgacttcaa gttcccattc acgtttgatg agaagagggc agtgcagcag 2040
ctgagagcat gtggtgtcct ggaaaccatc cgaatcagtg cggccggttt cccctcacgg 2100
tggacttacc aagaattttt cagccgctac cgtgtcctaa tgaagcagaa agatgtgctg 2160
agtgacagaa agcaaacatg caagaatgtg ttagagaaac tgatactgga caaggacaaa 2220
taccagtttg gtaagacaaa gatctttttc cgtgccggtc aagtggccta tctagaaaaa 2280
ttgagagctg acaaactgag agctgcctgc atccggatcc agaagaccat ccgagggtgg 2340
ctgctgagaa agaagtacct acgcatgcgg aaggcagcca tcaccatgca gagatacgtg 2400
cggggctacc aggcccgatg ctatgctaag tttctgcgca gaaccaaggc agcaaccatc 2460
attcaaaagt actggcgcat gtatgtggtc cgcaggaggt acaagattag acgagctgcc 2520
actatcgttc ttcagtctta cttgcgaggc ttcttggcca gaaataggta tcgcaagata 2580
ctccgtgagc acaaagcagt catcattcag aagcgagtcc ggggctggct ggcccgcaca 2640
cactacaaga ggagcatgca tgccatcatc taccttcagt gctgcttcag gcggatgatg 2700
gccaagcgtg agctaaagaa gctcaaaatc gaggctcgct cagtggagcg ctataagaag 2760
ctgcacatcg gcatggagaa caagatcatg cagctgcagc gcaaagttga tgagcagaac 2820
aaagactaca aatgccttgt ggagaaacta accaatctgg aaggaatata caactctgag 2880
actgagaaac tacgaagtga cttagaacgt cttcaactaa gtgaagagga agcgaaagtt 2940
gccactgggc gggtccttag tctgcaggaa gaaattgcca agctccggaa agacctggag 3000
caaactcgtt cagagaaaaa atgcattgag gaacatgcag atcgatacaa acaagaaaca 3060
gagcagctgg tatcaaatct gaaggaagaa aatactttgc tgaagcaaga aaaagaagcc 3120
ctcaatcacc gcatcgtgca gcaggctaag gagatgacag aaactatgga gaagaagtta 3180
gtagaagaaa cgaaacaact ggaactcgac cttaatgatg aaaggctgag atatcagaac 3240
cttctgaatg agttcagtcg cctggaagaa agatatgatg acctcaagga agagatgacc 3300
cttatggtgc atgtgcctaa gcctggacac aagagaacag actccaccca cagcagcaac 3360
gagtctgaat atatctttag ctctgaaatt gcagaaatgg aagacattcc atcaaggaca 3420
gaggaaccaa gtgagaagaa ggtacctctg gacatgtcat tgttccttaa gctccagaag 3480
cgggtcacag agctggagca ggagaagcag gtgatgcagg atgagctgga ccgcaaggag 3540
gagcaggtgc tccgcagcaa ggccaaggaa gaagaaagac cacaaattag aggtgcagaa 3600
ctggaatatg agtcactcaa gcgtcaagaa ctagaatcag aaaacaaaaa actgaagaat 3660
gagctaaatg agttgcgcaa ggccctcagt gagaaaagtg ccccagaggt gaccgcccca 3720
ggtgcacctg cctaccgtgt cctcatggag cagctgacct ctgtgagcga ggagcttgat 3780
gtccgcaagg aggaagtcct catcttaagg tctcaactgg tgagccagaa agaggccatc 3840
caacccaagg atgacaagaa tacaatgaca gattccacaa tacttttgga agatgtacaa 3900
aaaatgaaag ataaaggtga aatagcacaa gcatacattg gtttgaaaga aacaaataga 3960
tcatctgctc tggattacca tgagttgaat gaggatggag agctgtggct ggtttatgaa 4020
gggttaaaac aagccaacag gctcctggaa tcccagctgc agtcacagaa gaggagccat 4080
gagaatgagg ccgaggccct ccgtggggag atccagagcc tgaaggagga gaacaaccga 4140
cagcagcagc tgctggccca gaacctgcag ctgcccccag aggcccgcat tgaggccagc 4200
ctgcagcacg agatcacccg gctgaccaac gaaaacttgg atttgatgga acaacttgaa 4260
aaacaggata agacggtccg taaactgaaa aaacaactga aagtatttgc caaaaaaatt 4320
ggcgaactag aagtgggcca gatggagaac atatccccag gacagatcat tgatgaaccc 4380
atccgaccag tcaacattcc caggaaagaa aaggatttcc aagggatgct ggaatacaag 4440
aaggaggatg agcaaaaact tgttaagaac ctgattctgg aactgaagcc acgtggtgta 4500
gcagtcaatt tgattccagg attaccggca tatatcctgt tcatgtgtgt tcgacatgct 4560
gactacctga atgatgatca gaaagtaagg tcgttgctaa catcaacaat taacagcatc 4620
aaaaaagtat tgaagaaaag aggtgatgat tttgaaaccg tctccttctg gctctctaac 4680
acatgccgat ttttgcactg cttgaaacag tacagtggag aagagggctt tatgaagcac 4740
aacacatctc gccagaatga acactgcctc accaattttg acctggctga gtatcggcag 4800
gtgctgagtg acttggccat tcagatctac cagcagctcg tgcgggtgtt agagaacatc 4860
cttcagccaa tgattgtctc aggcatgctg gaacatgaaa cgattcaggg cgtgtctggg 4920
gtgaagccca cagggttgag aaagcgaacc tccagtatcg ccgatgaggg cacctacaca 4980
ctggactcca tcctccggca gctcaactcc ttccactcgg tcatgtgtca gcatggcatg 5040
gaccctgaac tgatcaagca ggtggtcaag cagatgttct acatcatagg ggccatcacc 5100
ctgaacaacc ttctcctgcg gaaggacatg tgctcctgga gtaaaggcat gcagatcagg 5160
tacaatgtca gtcaactgga agaatggctg cgtgacaaga atctgatgaa tagtggggct 5220
aaagaaaccc tggaacctct cattcaggct gctcaacttt tgcaagtgaa aaagaaaaca 5280
gatgatgatg cagaagccat ttgttctatg tgcaatgctt taactactgc ccagattgtg 5340
aaagtgttga atttgtatac tccagttaat gagtttgaag aaagagtctc tgtgtcgttc 5400
attcgtacta tacagatgcg tttacgagac aggaaagact ctccccagct gctcatggat 5460
gctaaacaca tctttcctgt cacctttcct ttcaacccat cttccctcgc actagaaacc 5520
atccagattc cagccagcct cggcctgggc ttcatttcac gggtctga 5568
<210> 117
<211> 1855
<212> PRT
<213> Homo sapiens
<400> 117
Met Ala Ala Ser Glu Leu Tyr Thr Lys Phe Ala Arg Val Trp Ile Pro
1 5 10 15
Asp Pro Glu Glu Val Trp Lys Ser Ala Glu Leu Leu Lys Asp Tyr Lys
20 25 30
Pro Gly Asp Lys Val Leu Leu Leu His Leu Glu Glu Gly Lys Asp Leu
35 40 45
Glu Tyr His Leu Asp Pro Lys Thr Lys Glu Leu Pro His Leu Arg Asn
50 55 60
Pro Asp Ile Leu Val Gly Glu Asn Asp Leu Thr Ala Leu Ser Tyr Leu
65 70 75 80
His Glu Pro Ala Val Leu His Asn Leu Arg Val Arg Phe Ile Asp Ser
85 90 95
Lys Leu Ile Tyr Thr Tyr Cys Gly Ile Val Leu Val Ala Ile Asn Pro
100 105 110
Tyr Glu Gln Leu Pro Ile Tyr Gly Glu Asp Ile Ile Asn Ala Tyr Ser
115 120 125
Gly Gln Asn Met Gly Asp Met Asp Pro His Ile Phe Ala Val Ala Glu
130 135 140
Glu Ala Tyr Lys Gln Met Ala Arg Asp Glu Arg Asn Gln Ser Ile Ile
145 150 155 160
Val Ser Gly Glu Ser Gly Ala Gly Lys Thr Val Ser Ala Lys Tyr Ala
165 170 175
Met Arg Tyr Phe Ala Thr Val Ser Gly Ser Ala Ser Glu Ala Asn Val
180 185 190
Glu Glu Lys Val Leu Ala Ser Asn Pro Ile Met Glu Ser Ile Gly Asn
195 200 205
Ala Lys Thr Thr Arg Asn Asp Asn Ser Ser Arg Phe Gly Lys Tyr Ile
210 215 220
Glu Ile Gly Phe Asp Lys Arg Tyr Arg Ile Ile Gly Ala Asn Met Arg
225 230 235 240
Thr Tyr Leu Leu Glu Lys Ser Arg Val Val Phe Gln Ala Glu Glu Glu
245 250 255
Arg Asn Tyr His Ile Phe Tyr Gln Leu Cys Ala Ser Ala Lys Leu Pro
260 265 270
Glu Phe Lys Met Leu Arg Leu Gly Asn Ala Asp Asn Phe Asn Tyr Thr
275 280 285
Lys Gln Gly Gly Ser Pro Val Ile Glu Gly Val Asp Asp Ala Lys Glu
290 295 300
Met Ala His Thr Arg Gln Ala Cys Thr Leu Leu Gly Ile Ser Glu Ser
305 310 315 320
His Gln Met Gly Ile Phe Arg Ile Leu Ala Gly Ile Leu His Leu Gly
325 330 335
Asn Val Gly Phe Thr Ser Arg Asp Ala Asp Ser Cys Thr Ile Pro Pro
340 345 350
Lys His Glu Pro Leu Cys Ile Phe Cys Glu Leu Met Gly Val Asp Tyr
355 360 365
Glu Glu Met Cys His Trp Leu Cys His Arg Lys Leu Ala Thr Ala Thr
370 375 380
Glu Thr Tyr Ile Lys Pro Ile Ser Lys Leu Gln Ala Thr Asn Ala Arg
385 390 395 400
Asp Ala Leu Ala Lys His Ile Tyr Ala Lys Leu Phe Asn Trp Ile Val
405 410 415
Asp Asn Val Asn Gln Ala Leu His Ser Ala Val Lys Gln His Ser Phe
420 425 430
Ile Gly Val Leu Asp Ile Tyr Gly Phe Glu Thr Phe Glu Ile Asn Ser
435 440 445
Phe Glu Gln Phe Cys Ile Asn Tyr Ala Asn Glu Lys Leu Gln Gln Gln
450 455 460
Phe Asn Met His Val Phe Lys Leu Glu Gln Glu Glu Tyr Met Lys Glu
465 470 475 480
Gln Ile Pro Trp Thr Leu Ile Asp Phe Tyr Asp Asn Gln Pro Cys Ile
485 490 495
Asn Leu Ile Glu Ser Lys Leu Gly Ile Leu Asp Leu Leu Asp Glu Glu
500 505 510
Cys Lys Met Pro Lys Gly Thr Asp Asp Thr Trp Ala Gln Lys Leu Tyr
515 520 525
Asn Thr His Leu Asn Lys Cys Ala Leu Phe Glu Lys Pro Arg Leu Ser
530 535 540
Asn Lys Ala Phe Ile Ile Gln His Phe Ala Asp Lys Val Glu Tyr Gln
545 550 555 560
Cys Glu Gly Phe Leu Glu Lys Asn Lys Asp Thr Val Phe Glu Glu Gln
565 570 575
Ile Lys Val Leu Lys Ser Ser Lys Phe Lys Met Leu Pro Glu Leu Phe
580 585 590
Gln Asp Asp Glu Lys Ala Ile Ser Pro Thr Ser Ala Thr Ser Ser Gly
595 600 605
Arg Thr Pro Leu Thr Arg Thr Pro Ala Lys Pro Thr Lys Gly Arg Pro
610 615 620
Gly Gln Met Ala Lys Glu His Lys Lys Thr Val Gly His Gln Phe Arg
625 630 635 640
Asn Ser Leu His Leu Leu Met Glu Thr Leu Asn Ala Thr Thr Pro His
645 650 655
Tyr Val Arg Cys Ile Lys Pro Asn Asp Phe Lys Phe Pro Phe Thr Phe
660 665 670
Asp Glu Lys Arg Ala Val Gln Gln Leu Arg Ala Cys Gly Val Leu Glu
675 680 685
Thr Ile Arg Ile Ser Ala Ala Gly Phe Pro Ser Arg Trp Thr Tyr Gln
690 695 700
Glu Phe Phe Ser Arg Tyr Arg Val Leu Met Lys Gln Lys Asp Val Leu
705 710 715 720
Ser Asp Arg Lys Gln Thr Cys Lys Asn Val Leu Glu Lys Leu Ile Leu
725 730 735
Asp Lys Asp Lys Tyr Gln Phe Gly Lys Thr Lys Ile Phe Phe Arg Ala
740 745 750
Gly Gln Val Ala Tyr Leu Glu Lys Leu Arg Ala Asp Lys Leu Arg Ala
755 760 765
Ala Cys Ile Arg Ile Gln Lys Thr Ile Arg Gly Trp Leu Leu Arg Lys
770 775 780
Lys Tyr Leu Arg Met Arg Lys Ala Ala Ile Thr Met Gln Arg Tyr Val
785 790 795 800
Arg Gly Tyr Gln Ala Arg Cys Tyr Ala Lys Phe Leu Arg Arg Thr Lys
805 810 815
Ala Ala Thr Ile Ile Gln Lys Tyr Trp Arg Met Tyr Val Val Arg Arg
820 825 830
Arg Tyr Lys Ile Arg Arg Ala Ala Thr Ile Val Leu Gln Ser Tyr Leu
835 840 845
Arg Gly Phe Leu Ala Arg Asn Arg Tyr Arg Lys Ile Leu Arg Glu His
850 855 860
Lys Ala Val Ile Ile Gln Lys Arg Val Arg Gly Trp Leu Ala Arg Thr
865 870 875 880
His Tyr Lys Arg Ser Met His Ala Ile Ile Tyr Leu Gln Cys Cys Phe
885 890 895
Arg Arg Met Met Ala Lys Arg Glu Leu Lys Lys Leu Lys Ile Glu Ala
900 905 910
Arg Ser Val Glu Arg Tyr Lys Lys Leu His Ile Gly Met Glu Asn Lys
915 920 925
Ile Met Gln Leu Gln Arg Lys Val Asp Glu Gln Asn Lys Asp Tyr Lys
930 935 940
Cys Leu Val Glu Lys Leu Thr Asn Leu Glu Gly Ile Tyr Asn Ser Glu
945 950 955 960
Thr Glu Lys Leu Arg Ser Asp Leu Glu Arg Leu Gln Leu Ser Glu Glu
965 970 975
Glu Ala Lys Val Ala Thr Gly Arg Val Leu Ser Leu Gln Glu Glu Ile
980 985 990
Ala Lys Leu Arg Lys Asp Leu Glu Gln Thr Arg Ser Glu Lys Lys Cys
995 1000 1005
Ile Glu Glu His Ala Asp Arg Tyr Lys Gln Glu Thr Glu Gln Leu
1010 1015 1020
Val Ser Asn Leu Lys Glu Glu Asn Thr Leu Leu Lys Gln Glu Lys
1025 1030 1035
Glu Ala Leu Asn His Arg Ile Val Gln Gln Ala Lys Glu Met Thr
1040 1045 1050
Glu Thr Met Glu Lys Lys Leu Val Glu Glu Thr Lys Gln Leu Glu
1055 1060 1065
Leu Asp Leu Asn Asp Glu Arg Leu Arg Tyr Gln Asn Leu Leu Asn
1070 1075 1080
Glu Phe Ser Arg Leu Glu Glu Arg Tyr Asp Asp Leu Lys Glu Glu
1085 1090 1095
Met Thr Leu Met Val His Val Pro Lys Pro Gly His Lys Arg Thr
1100 1105 1110
Asp Ser Thr His Ser Ser Asn Glu Ser Glu Tyr Ile Phe Ser Ser
1115 1120 1125
Glu Ile Ala Glu Met Glu Asp Ile Pro Ser Arg Thr Glu Glu Pro
1130 1135 1140
Ser Glu Lys Lys Val Pro Leu Asp Met Ser Leu Phe Leu Lys Leu
1145 1150 1155
Gln Lys Arg Val Thr Glu Leu Glu Gln Glu Lys Gln Val Met Gln
1160 1165 1170
Asp Glu Leu Asp Arg Lys Glu Glu Gln Val Leu Arg Ser Lys Ala
1175 1180 1185
Lys Glu Glu Glu Arg Pro Gln Ile Arg Gly Ala Glu Leu Glu Tyr
1190 1195 1200
Glu Ser Leu Lys Arg Gln Glu Leu Glu Ser Glu Asn Lys Lys Leu
1205 1210 1215
Lys Asn Glu Leu Asn Glu Leu Arg Lys Ala Leu Ser Glu Lys Ser
1220 1225 1230
Ala Pro Glu Val Thr Ala Pro Gly Ala Pro Ala Tyr Arg Val Leu
1235 1240 1245
Met Glu Gln Leu Thr Ser Val Ser Glu Glu Leu Asp Val Arg Lys
1250 1255 1260
Glu Glu Val Leu Ile Leu Arg Ser Gln Leu Val Ser Gln Lys Glu
1265 1270 1275
Ala Ile Gln Pro Lys Asp Asp Lys Asn Thr Met Thr Asp Ser Thr
1280 1285 1290
Ile Leu Leu Glu Asp Val Gln Lys Met Lys Asp Lys Gly Glu Ile
1295 1300 1305
Ala Gln Ala Tyr Ile Gly Leu Lys Glu Thr Asn Arg Ser Ser Ala
1310 1315 1320
Leu Asp Tyr His Glu Leu Asn Glu Asp Gly Glu Leu Trp Leu Val
1325 1330 1335
Tyr Glu Gly Leu Lys Gln Ala Asn Arg Leu Leu Glu Ser Gln Leu
1340 1345 1350
Gln Ser Gln Lys Arg Ser His Glu Asn Glu Ala Glu Ala Leu Arg
1355 1360 1365
Gly Glu Ile Gln Ser Leu Lys Glu Glu Asn Asn Arg Gln Gln Gln
1370 1375 1380
Leu Leu Ala Gln Asn Leu Gln Leu Pro Pro Glu Ala Arg Ile Glu
1385 1390 1395
Ala Ser Leu Gln His Glu Ile Thr Arg Leu Thr Asn Glu Asn Leu
1400 1405 1410
Asp Leu Met Glu Gln Leu Glu Lys Gln Asp Lys Thr Val Arg Lys
1415 1420 1425
Leu Lys Lys Gln Leu Lys Val Phe Ala Lys Lys Ile Gly Glu Leu
1430 1435 1440
Glu Val Gly Gln Met Glu Asn Ile Ser Pro Gly Gln Ile Ile Asp
1445 1450 1455
Glu Pro Ile Arg Pro Val Asn Ile Pro Arg Lys Glu Lys Asp Phe
1460 1465 1470
Gln Gly Met Leu Glu Tyr Lys Lys Glu Asp Glu Gln Lys Leu Val
1475 1480 1485
Lys Asn Leu Ile Leu Glu Leu Lys Pro Arg Gly Val Ala Val Asn
1490 1495 1500
Leu Ile Pro Gly Leu Pro Ala Tyr Ile Leu Phe Met Cys Val Arg
1505 1510 1515
His Ala Asp Tyr Leu Asn Asp Asp Gln Lys Val Arg Ser Leu Leu
1520 1525 1530
Thr Ser Thr Ile Asn Ser Ile Lys Lys Val Leu Lys Lys Arg Gly
1535 1540 1545
Asp Asp Phe Glu Thr Val Ser Phe Trp Leu Ser Asn Thr Cys Arg
1550 1555 1560
Phe Leu His Cys Leu Lys Gln Tyr Ser Gly Glu Glu Gly Phe Met
1565 1570 1575
Lys His Asn Thr Ser Arg Gln Asn Glu His Cys Leu Thr Asn Phe
1580 1585 1590
Asp Leu Ala Glu Tyr Arg Gln Val Leu Ser Asp Leu Ala Ile Gln
1595 1600 1605
Ile Tyr Gln Gln Leu Val Arg Val Leu Glu Asn Ile Leu Gln Pro
1610 1615 1620
Met Ile Val Ser Gly Met Leu Glu His Glu Thr Ile Gln Gly Val
1625 1630 1635
Ser Gly Val Lys Pro Thr Gly Leu Arg Lys Arg Thr Ser Ser Ile
1640 1645 1650
Ala Asp Glu Gly Thr Tyr Thr Leu Asp Ser Ile Leu Arg Gln Leu
1655 1660 1665
Asn Ser Phe His Ser Val Met Cys Gln His Gly Met Asp Pro Glu
1670 1675 1680
Leu Ile Lys Gln Val Val Lys Gln Met Phe Tyr Ile Ile Gly Ala
1685 1690 1695
Ile Thr Leu Asn Asn Leu Leu Leu Arg Lys Asp Met Cys Ser Trp
1700 1705 1710
Ser Lys Gly Met Gln Ile Arg Tyr Asn Val Ser Gln Leu Glu Glu
1715 1720 1725
Trp Leu Arg Asp Lys Asn Leu Met Asn Ser Gly Ala Lys Glu Thr
1730 1735 1740
Leu Glu Pro Leu Ile Gln Ala Ala Gln Leu Leu Gln Val Lys Lys
1745 1750 1755
Lys Thr Asp Asp Asp Ala Glu Ala Ile Cys Ser Met Cys Asn Ala
1760 1765 1770
Leu Thr Thr Ala Gln Ile Val Lys Val Leu Asn Leu Tyr Thr Pro
1775 1780 1785
Val Asn Glu Phe Glu Glu Arg Val Ser Val Ser Phe Ile Arg Thr
1790 1795 1800
Ile Gln Met Arg Leu Arg Asp Arg Lys Asp Ser Pro Gln Leu Leu
1805 1810 1815
Met Asp Ala Lys His Ile Phe Pro Val Thr Phe Pro Phe Asn Pro
1820 1825 1830
Ser Ser Leu Ala Leu Glu Thr Ile Gln Ile Pro Ala Ser Leu Gly
1835 1840 1845
Leu Gly Phe Ile Ser Arg Val
1850 1855
<210> 118
<211> 666
<212> DNA
<213> Homo sapiens
<400> 118
atgtctgatg gagattatga ttacctcatc aagtttttag ctttgggaga ctctggtgta 60
gggaagacca gtgtacttta ccaatataca gatggtaaat ttaactccaa atttatcaca 120
acagtgggca ttgatttcag ggaaaaaaga gtggtgtaca gagccagtgg gccggatgga 180
gccactggca gaggccagag aatccacctg cagttatggg acacagcagg gcaggagagg 240
tttcgtagct taacgacagc gttcttcaga gatgctatgg gttttcttct actttttgat 300
ctgacaaatg agcaaagttt cctcaatgtc agaaactgga taagccagct acagatgcat 360
gcatattgtg aaaacccaga tatagtgctg tgtggaaaca agagtgatct ggaggaccag 420
agagtagtga aagaggagga agccatagca ctcgcagaga aatatggaat cccctacttt 480
gaaactagtg ctgccaatgg gacaaacata agccaagcaa ttgagatgct tctggacctg 540
ataatgaagc gaatggaacg gtgtgtggac aagtcctgga ttcctgaagg agtggtgcga 600
tcaaatggtc atgcctctac ggatcagtta agtgaagaaa aggagaaagg ggcatgtggc 660
tgttga 666
<210> 119
<211> 221
<212> PRT
<213> Homo sapiens
<400> 119
Met Ser Asp Gly Asp Tyr Asp Tyr Leu Ile Lys Phe Leu Ala Leu Gly
1 5 10 15
Asp Ser Gly Val Gly Lys Thr Ser Val Leu Tyr Gln Tyr Thr Asp Gly
20 25 30
Lys Phe Asn Ser Lys Phe Ile Thr Thr Val Gly Ile Asp Phe Arg Glu
35 40 45
Lys Arg Val Val Tyr Arg Ala Ser Gly Pro Asp Gly Ala Thr Gly Arg
50 55 60
Gly Gln Arg Ile His Leu Gln Leu Trp Asp Thr Ala Gly Gln Glu Arg
65 70 75 80
Phe Arg Ser Leu Thr Thr Ala Phe Phe Arg Asp Ala Met Gly Phe Leu
85 90 95
Leu Leu Phe Asp Leu Thr Asn Glu Gln Ser Phe Leu Asn Val Arg Asn
100 105 110
Trp Ile Ser Gln Leu Gln Met His Ala Tyr Cys Glu Asn Pro Asp Ile
115 120 125
Val Leu Cys Gly Asn Lys Ser Asp Leu Glu Asp Gln Arg Val Val Lys
130 135 140
Glu Glu Glu Ala Ile Ala Leu Ala Glu Lys Tyr Gly Ile Pro Tyr Phe
145 150 155 160
Glu Thr Ser Ala Ala Asn Gly Thr Asn Ile Ser Gln Ala Ile Glu Met
165 170 175
Leu Leu Asp Leu Ile Met Lys Arg Met Glu Arg Cys Val Asp Lys Ser
180 185 190
Trp Ile Pro Glu Gly Val Val Arg Ser Asn Gly His Ala Ser Thr Asp
195 200 205
Gln Leu Ser Glu Glu Lys Glu Lys Gly Ala Cys Gly Cys
210 215 220
<210> 120
<211> 1803
<212> DNA
<213> Homo sapiens
<400> 120
atggggaaga aactggatct ttccaagctc actgatgaag aggcccagca tgtcttggaa 60
gttgttcaac gagattttga cctccgaagg aaagaagagg aacggctaga ggcgttgaag 120
ggcaagatta agaaggaaag ctccaagagg gagctgcttt ccgacactgc ccatctgaac 180
gagacccact gcgcccgctg cctgcagccc taccagctgc ttgtgaatag caaaaggcag 240
tgcctggaat gtggcctctt cacctgcaaa agctgtggcc gcgtccaccc ggaggagcag 300
ggctggatct gtgacccctg ccatctggcc agagtcgtga agatcggctc actggagtgg 360
tactatgagc atgtgaaagc ccgcttcaag aggttcggaa gtgccaaggt catccggtcc 420
ctccacgggc ggctgcaggg tggagctggg cctgaactga tatctgaaga gagaagtgga 480
gacagcgacc agacagatga ggatggagaa cctggctcag aggcccaggc ccaggcccag 540
ccctttggca gcaaaaaaaa gcgcctcctc tccgtccacg acttcgactt cgagggagac 600
tcagatgact ccactcagcc tcaaggtcac tccctgcacc tgtcctcagt ccctgaggcc 660
agggacagcc cacagtccct cacagatgag tcctgctcag agaaggcagc ccctcacaag 720
gctgagggcc tggaggaggc tgatactggg gcctctgggt gccactccca tccggaagag 780
cagccgacca gcatctcacc ttccagacac ggcgccctgg ctgagctctg cccgcctgga 840
ggctcccaca ggatggccct ggggactgct gctgcactcg ggtcgaatgt catcaggaat 900
gagcagctgc ccctgcagta cttggccgat gtggacacct ctgatgagga aagcatccgg 960
gctcacgtga tggcctccca ccattccaag cggagaggcc gggcgtcttc tgagagtcag 1020
atctttgagc tgaataagca tatttcagct gtggaatgcc tgctgaccta cctggagaac 1080
acagttgtgc ctcccttggc caagggtcta ggtgctggag tgcgcacgga ggccgatgta 1140
gaggaggagg ccctgaggag gaagctggag gagctgacca gcaacgtcag tgaccaggag 1200
acctcgtccg aggaggagga agccaaggac gaaaaggcag agcccaacag ggacaaatca 1260
gttgggcctc tcccccaggc ggacccggag gtgggcacgg ctgcccatca aaccaacaga 1320
caggaaaaaa gcccccagga ccctggggac cccgtccagt acaacaggac cacagatgag 1380
gagctgtcag agctggagga cagagtggca gtgacggcct cagaagtcca gcaggcagag 1440
agcgaggttt cagacattga atccaggatt gcagccctga gggccgcagg gctcacggtg 1500
aagccctcgg gaaagccccg gaggaagtca aacctcccga tatttctccc tcgagtggct 1560
gggaaacttg gcaagagacc agaggaccca aatgcagacc cttcaagtga ggccaaggca 1620
atggctgtgc cctatcttct gagaagaaag ttcagtaatt ccctgaaaag tcaaggtaaa 1680
gatgatgatt cttttgatcg gaaatcagtg taccgaggct cgctgacaca gagaaacccc 1740
aacgcgagga aaggaatggc cagccacacc ttcgcgaaac ctgtggtggc ccaccagtcc 1800
taa 1803
<210> 121
<211> 600
<212> PRT
<213> Homo sapiens
<400> 121
Met Gly Lys Lys Leu Asp Leu Ser Lys Leu Thr Asp Glu Glu Ala Gln
1 5 10 15
His Val Leu Glu Val Val Gln Arg Asp Phe Asp Leu Arg Arg Lys Glu
20 25 30
Glu Glu Arg Leu Glu Ala Leu Lys Gly Lys Ile Lys Lys Glu Ser Ser
35 40 45
Lys Arg Glu Leu Leu Ser Asp Thr Ala His Leu Asn Glu Thr His Cys
50 55 60
Ala Arg Cys Leu Gln Pro Tyr Gln Leu Leu Val Asn Ser Lys Arg Gln
65 70 75 80
Cys Leu Glu Cys Gly Leu Phe Thr Cys Lys Ser Cys Gly Arg Val His
85 90 95
Pro Glu Glu Gln Gly Trp Ile Cys Asp Pro Cys His Leu Ala Arg Val
100 105 110
Val Lys Ile Gly Ser Leu Glu Trp Tyr Tyr Glu His Val Lys Ala Arg
115 120 125
Phe Lys Arg Phe Gly Ser Ala Lys Val Ile Arg Ser Leu His Gly Arg
130 135 140
Leu Gln Gly Gly Ala Gly Pro Glu Leu Ile Ser Glu Glu Arg Ser Gly
145 150 155 160
Asp Ser Asp Gln Thr Asp Glu Asp Gly Glu Pro Gly Ser Glu Ala Gln
165 170 175
Ala Gln Ala Gln Pro Phe Gly Ser Lys Lys Lys Arg Leu Leu Ser Val
180 185 190
His Asp Phe Asp Phe Glu Gly Asp Ser Asp Asp Ser Thr Gln Pro Gln
195 200 205
Gly His Ser Leu His Leu Ser Ser Val Pro Glu Ala Arg Asp Ser Pro
210 215 220
Gln Ser Leu Thr Asp Glu Ser Cys Ser Glu Lys Ala Ala Pro His Lys
225 230 235 240
Ala Glu Gly Leu Glu Glu Ala Asp Thr Gly Ala Ser Gly Cys His Ser
245 250 255
His Pro Glu Glu Gln Pro Thr Ser Ile Ser Pro Ser Arg His Gly Ala
260 265 270
Leu Ala Glu Leu Cys Pro Pro Gly Gly Ser His Arg Met Ala Leu Gly
275 280 285
Thr Ala Ala Ala Leu Gly Ser Asn Val Ile Arg Asn Glu Gln Leu Pro
290 295 300
Leu Gln Tyr Leu Ala Asp Val Asp Thr Ser Asp Glu Glu Ser Ile Arg
305 310 315 320
Ala His Val Met Ala Ser His His Ser Lys Arg Arg Gly Arg Ala Ser
325 330 335
Ser Glu Ser Gln Ile Phe Glu Leu Asn Lys His Ile Ser Ala Val Glu
340 345 350
Cys Leu Leu Thr Tyr Leu Glu Asn Thr Val Val Pro Pro Leu Ala Lys
355 360 365
Gly Leu Gly Ala Gly Val Arg Thr Glu Ala Asp Val Glu Glu Glu Ala
370 375 380
Leu Arg Arg Lys Leu Glu Glu Leu Thr Ser Asn Val Ser Asp Gln Glu
385 390 395 400
Thr Ser Ser Glu Glu Glu Glu Ala Lys Asp Glu Lys Ala Glu Pro Asn
405 410 415
Arg Asp Lys Ser Val Gly Pro Leu Pro Gln Ala Asp Pro Glu Val Gly
420 425 430
Thr Ala Ala His Gln Thr Asn Arg Gln Glu Lys Ser Pro Gln Asp Pro
435 440 445
Gly Asp Pro Val Gln Tyr Asn Arg Thr Thr Asp Glu Glu Leu Ser Glu
450 455 460
Leu Glu Asp Arg Val Ala Val Thr Ala Ser Glu Val Gln Gln Ala Glu
465 470 475 480
Ser Glu Val Ser Asp Ile Glu Ser Arg Ile Ala Ala Leu Arg Ala Ala
485 490 495
Gly Leu Thr Val Lys Pro Ser Gly Lys Pro Arg Arg Lys Ser Asn Leu
500 505 510
Pro Ile Phe Leu Pro Arg Val Ala Gly Lys Leu Gly Lys Arg Pro Glu
515 520 525
Asp Pro Asn Ala Asp Pro Ser Ser Glu Ala Lys Ala Met Ala Val Pro
530 535 540
Tyr Leu Leu Arg Arg Lys Phe Ser Asn Ser Leu Lys Ser Gln Gly Lys
545 550 555 560
Asp Asp Asp Ser Phe Asp Arg Lys Ser Val Tyr Arg Gly Ser Leu Thr
565 570 575
Gln Arg Asn Pro Asn Ala Arg Lys Gly Met Ala Ser His Thr Phe Ala
580 585 590
Lys Pro Val Val Ala His Gln Ser
595 600
<210> 122
<211> 3285
<212> DNA
<213> Homo sapiens
<400> 122
atgtccagca atagttttcc ttacaatgag cagtccggag gaggggaggc gacggagctg 60
ggtcaggagg cgacctcaac catttccccc tcgggggcct tcggcctctt tagcagcgat 120
ttgaagaaga atgaagatct aaagcaaatg ttagagagca acaaagattc tgctaaactg 180
gatgctatga agcggattgt tgggatgatt gcaaaaggga aaaatgcatc tgaactgttt 240
cctgctgttg tgaagaatgt ggccagtaaa aatattgaga tcaagaagtt ggtatatgtt 300
tacctggttc gatatgctga agaacagcag gatcttgcac tcctgtccat aagcactttt 360
cagcgagctc tgaaggaccc aaaccaacta attcgtgcaa gcgctttgag agttctgtca 420
agtattagag tgccaattat tgtacctatc atgatgcttg ctattaagga agcttctgct 480
gacttatcac catatgttag gaagaatgca gcccatgcaa tacaaaaatt atacagcctt 540
gatccagagc agaaggaaat gttaattgaa gtaattgaaa aacttctgaa agataaaagc 600
acattggtag ctggcagtgt tgtgatggct tttgaagaag tatgcccgga cagaatagat 660
ctgattcata aaaattaccg caagctatgt aacttactag tggatgttga agagtggggg 720
caggttgtca taatccacat gctaactcga tatgctcgga cacagtttgt cagcccttgg 780
aaagagggtg atgaattaga agacaatgga aagaatttct acgaatctga tgatgatcag 840
aaggaaaaga ctgacaaaaa gaagaagccg tatactatgg atccagatca tagactctta 900
attagaaata caaagccttt gcttcagagc aggaatgctg cggtggttat ggcagttgct 960
cagctgtatt ggcacatatc accaaaatct gaagctggca taatttctaa atcactagtg 1020
cgtttacttc gtagcaatag ggaggtgcag tatattgtcc tacaaaatat agcaactatg 1080
tcaattcaaa gaaaggggat gtttgaacct tatctgaaga gtttctatgt taggtcaact 1140
gatccaacta tgatcaagac actgaagctt gaaattttga caaacttggc aaatgaagcc 1200
aacatatcaa ctcttcttcg agaatttcag acctatgtga aaagccagga taaacaattt 1260
gcagcagcca ctattcagac tataggcaga tgtgcaacca acatcttgga agtcactgac 1320
acgtgcctca atggcttggt ctgtctgctg tccaacaggg atgaaatagt tgttgctgaa 1380
agtgtggttg ttataaagaa attactgcaa atgcaacctg cacaacatgg tgaaattatt 1440
aaacatatgg ccaaactcct ggacagtatc actgttcctg ttgctagagc aagtattctt 1500
tggctaattg gagaaaactg tgaacgagtt cctaaaattg cccctgatgt tttgaggaag 1560
atggctaaaa gcttcactag tgaagatgat ctggtaaaac tgcagatatt aaatctggga 1620
gcaaaattgt atttaaccaa ctccaaacag acaaaattgc ttacccagta catattaaat 1680
ctcggcaagt atgatcaaaa ctacgacatc agagaccgta caagatttat taggcagctt 1740
attgttccga atgtaaagag tggagcttta agtaaatatg ccaaaaaaat attcctagca 1800
caaaagcctg caccactgct tgagtctcct tttaaagata gagatcattt ccagcttggc 1860
accttatctc atactctcaa cattaaagct actgggtacc tggaattatc taattggcca 1920
gaggtggcgc ccgacccatc agttcgaaat gtagaagtaa tagagttggc aaaagaatgg 1980
accccagcag gaaaagcaaa gcaagagaat tctgctaaga agttttattc tgaatctgag 2040
gaagaggagg actcttctga tagtagcagt gacagtgaga gtgaatctgg aagtgaaagt 2100
ggagaacaag gcgaaagtgg ggaggaagga gacagcaatg aggacagcag tgaggactcc 2160
tccagtgagc aggacagtga gagtggacgg gagtcaggcc tagaaaacaa aagaacagcc 2220
aagaggaact caaaagccaa aggaaaaagt gattctgaag atggggagaa ggaaaatgaa 2280
aaatctaaaa cttcagattc ttcaaatgac gaatctagtt caatagaaga cagttcttcc 2340
gattctgaat cagagtcaga acctgaaagt gaatctgaat ccagaagagt cactaaggag 2400
aaagaaaaga aaacaaagca agatagaact cctcttacca aagatgtttc acttctagat 2460
ctggatgatt ttaacccagt atccactcca gttgcacttc ccacaccagc tctttctcca 2520
agtttgatgg ctgatcttga aggtttacac ttgtcaactt cctcttcagt catcagtgtc 2580
agtactcctg catttgtacc aacgaaaact cacgtgctgc ttcatcgaat gagtggaaaa 2640
ggactagctg cccattattt ctttccaaga cagccttgca tttttggtga taagatggtc 2700
tctatacaaa taacactgaa taacactact gatcgaaaga tagaaaatat ccacataggg 2760
gaaaaaaaac ttcctatagg catgaaaatg catgttttta atccaataga ctctcttgag 2820
cctgagggat ccattacagt ttcaatgggt attgactttt gtgattctac tcagactgcc 2880
agtttccagt tgtgtaccaa ggatgattgc ttcaatgtta atattcagcc acctgttgga 2940
gaactgcttt tacctgtggc catgtcagag aaagatttta agaaagagca aggagtgcta 3000
acaggaatga atgaaacttc tgctgtaatc attgctgcac cacagaattt cactccctct 3060
gtgatctttc agaaggttgt aaatgtagcc aatgtaggtg cagtcccttc tggccaggat 3120
aatatacaca ggtttgcagc taaaactgtg cacagtgggt cattgatgct agtcacagtg 3180
gaactgaagg aaggctctac agcccagctt atcataaaca ctgagaaaac tgtgattggc 3240
tctgttctgc tgcgggaact gaagcctgtc ctgtctcagg ggtaa 3285
<210> 123
<211> 1094
<212> PRT
<213> Homo sapiens
<400> 123
Met Ser Ser Asn Ser Phe Pro Tyr Asn Glu Gln Ser Gly Gly Gly Glu
1 5 10 15
Ala Thr Glu Leu Gly Gln Glu Ala Thr Ser Thr Ile Ser Pro Ser Gly
20 25 30
Ala Phe Gly Leu Phe Ser Ser Asp Leu Lys Lys Asn Glu Asp Leu Lys
35 40 45
Gln Met Leu Glu Ser Asn Lys Asp Ser Ala Lys Leu Asp Ala Met Lys
50 55 60
Arg Ile Val Gly Met Ile Ala Lys Gly Lys Asn Ala Ser Glu Leu Phe
65 70 75 80
Pro Ala Val Val Lys Asn Val Ala Ser Lys Asn Ile Glu Ile Lys Lys
85 90 95
Leu Val Tyr Val Tyr Leu Val Arg Tyr Ala Glu Glu Gln Gln Asp Leu
100 105 110
Ala Leu Leu Ser Ile Ser Thr Phe Gln Arg Ala Leu Lys Asp Pro Asn
115 120 125
Gln Leu Ile Arg Ala Ser Ala Leu Arg Val Leu Ser Ser Ile Arg Val
130 135 140
Pro Ile Ile Val Pro Ile Met Met Leu Ala Ile Lys Glu Ala Ser Ala
145 150 155 160
Asp Leu Ser Pro Tyr Val Arg Lys Asn Ala Ala His Ala Ile Gln Lys
165 170 175
Leu Tyr Ser Leu Asp Pro Glu Gln Lys Glu Met Leu Ile Glu Val Ile
180 185 190
Glu Lys Leu Leu Lys Asp Lys Ser Thr Leu Val Ala Gly Ser Val Val
195 200 205
Met Ala Phe Glu Glu Val Cys Pro Asp Arg Ile Asp Leu Ile His Lys
210 215 220
Asn Tyr Arg Lys Leu Cys Asn Leu Leu Val Asp Val Glu Glu Trp Gly
225 230 235 240
Gln Val Val Ile Ile His Met Leu Thr Arg Tyr Ala Arg Thr Gln Phe
245 250 255
Val Ser Pro Trp Lys Glu Gly Asp Glu Leu Glu Asp Asn Gly Lys Asn
260 265 270
Phe Tyr Glu Ser Asp Asp Asp Gln Lys Glu Lys Thr Asp Lys Lys Lys
275 280 285
Lys Pro Tyr Thr Met Asp Pro Asp His Arg Leu Leu Ile Arg Asn Thr
290 295 300
Lys Pro Leu Leu Gln Ser Arg Asn Ala Ala Val Val Met Ala Val Ala
305 310 315 320
Gln Leu Tyr Trp His Ile Ser Pro Lys Ser Glu Ala Gly Ile Ile Ser
325 330 335
Lys Ser Leu Val Arg Leu Leu Arg Ser Asn Arg Glu Val Gln Tyr Ile
340 345 350
Val Leu Gln Asn Ile Ala Thr Met Ser Ile Gln Arg Lys Gly Met Phe
355 360 365
Glu Pro Tyr Leu Lys Ser Phe Tyr Val Arg Ser Thr Asp Pro Thr Met
370 375 380
Ile Lys Thr Leu Lys Leu Glu Ile Leu Thr Asn Leu Ala Asn Glu Ala
385 390 395 400
Asn Ile Ser Thr Leu Leu Arg Glu Phe Gln Thr Tyr Val Lys Ser Gln
405 410 415
Asp Lys Gln Phe Ala Ala Ala Thr Ile Gln Thr Ile Gly Arg Cys Ala
420 425 430
Thr Asn Ile Leu Glu Val Thr Asp Thr Cys Leu Asn Gly Leu Val Cys
435 440 445
Leu Leu Ser Asn Arg Asp Glu Ile Val Val Ala Glu Ser Val Val Val
450 455 460
Ile Lys Lys Leu Leu Gln Met Gln Pro Ala Gln His Gly Glu Ile Ile
465 470 475 480
Lys His Met Ala Lys Leu Leu Asp Ser Ile Thr Val Pro Val Ala Arg
485 490 495
Ala Ser Ile Leu Trp Leu Ile Gly Glu Asn Cys Glu Arg Val Pro Lys
500 505 510
Ile Ala Pro Asp Val Leu Arg Lys Met Ala Lys Ser Phe Thr Ser Glu
515 520 525
Asp Asp Leu Val Lys Leu Gln Ile Leu Asn Leu Gly Ala Lys Leu Tyr
530 535 540
Leu Thr Asn Ser Lys Gln Thr Lys Leu Leu Thr Gln Tyr Ile Leu Asn
545 550 555 560
Leu Gly Lys Tyr Asp Gln Asn Tyr Asp Ile Arg Asp Arg Thr Arg Phe
565 570 575
Ile Arg Gln Leu Ile Val Pro Asn Val Lys Ser Gly Ala Leu Ser Lys
580 585 590
Tyr Ala Lys Lys Ile Phe Leu Ala Gln Lys Pro Ala Pro Leu Leu Glu
595 600 605
Ser Pro Phe Lys Asp Arg Asp His Phe Gln Leu Gly Thr Leu Ser His
610 615 620
Thr Leu Asn Ile Lys Ala Thr Gly Tyr Leu Glu Leu Ser Asn Trp Pro
625 630 635 640
Glu Val Ala Pro Asp Pro Ser Val Arg Asn Val Glu Val Ile Glu Leu
645 650 655
Ala Lys Glu Trp Thr Pro Ala Gly Lys Ala Lys Gln Glu Asn Ser Ala
660 665 670
Lys Lys Phe Tyr Ser Glu Ser Glu Glu Glu Glu Asp Ser Ser Asp Ser
675 680 685
Ser Ser Asp Ser Glu Ser Glu Ser Gly Ser Glu Ser Gly Glu Gln Gly
690 695 700
Glu Ser Gly Glu Glu Gly Asp Ser Asn Glu Asp Ser Ser Glu Asp Ser
705 710 715 720
Ser Ser Glu Gln Asp Ser Glu Ser Gly Arg Glu Ser Gly Leu Glu Asn
725 730 735
Lys Arg Thr Ala Lys Arg Asn Ser Lys Ala Lys Gly Lys Ser Asp Ser
740 745 750
Glu Asp Gly Glu Lys Glu Asn Glu Lys Ser Lys Thr Ser Asp Ser Ser
755 760 765
Asn Asp Glu Ser Ser Ser Ile Glu Asp Ser Ser Ser Asp Ser Glu Ser
770 775 780
Glu Ser Glu Pro Glu Ser Glu Ser Glu Ser Arg Arg Val Thr Lys Glu
785 790 795 800
Lys Glu Lys Lys Thr Lys Gln Asp Arg Thr Pro Leu Thr Lys Asp Val
805 810 815
Ser Leu Leu Asp Leu Asp Asp Phe Asn Pro Val Ser Thr Pro Val Ala
820 825 830
Leu Pro Thr Pro Ala Leu Ser Pro Ser Leu Met Ala Asp Leu Glu Gly
835 840 845
Leu His Leu Ser Thr Ser Ser Ser Val Ile Ser Val Ser Thr Pro Ala
850 855 860
Phe Val Pro Thr Lys Thr His Val Leu Leu His Arg Met Ser Gly Lys
865 870 875 880
Gly Leu Ala Ala His Tyr Phe Phe Pro Arg Gln Pro Cys Ile Phe Gly
885 890 895
Asp Lys Met Val Ser Ile Gln Ile Thr Leu Asn Asn Thr Thr Asp Arg
900 905 910
Lys Ile Glu Asn Ile His Ile Gly Glu Lys Lys Leu Pro Ile Gly Met
915 920 925
Lys Met His Val Phe Asn Pro Ile Asp Ser Leu Glu Pro Glu Gly Ser
930 935 940
Ile Thr Val Ser Met Gly Ile Asp Phe Cys Asp Ser Thr Gln Thr Ala
945 950 955 960
Ser Phe Gln Leu Cys Thr Lys Asp Asp Cys Phe Asn Val Asn Ile Gln
965 970 975
Pro Pro Val Gly Glu Leu Leu Leu Pro Val Ala Met Ser Glu Lys Asp
980 985 990
Phe Lys Lys Glu Gln Gly Val Leu Thr Gly Met Asn Glu Thr Ser Ala
995 1000 1005
Val Ile Ile Ala Ala Pro Gln Asn Phe Thr Pro Ser Val Ile Phe
1010 1015 1020
Gln Lys Val Val Asn Val Ala Asn Val Gly Ala Val Pro Ser Gly
1025 1030 1035
Gln Asp Asn Ile His Arg Phe Ala Ala Lys Thr Val His Ser Gly
1040 1045 1050
Ser Leu Met Leu Val Thr Val Glu Leu Lys Glu Gly Ser Thr Ala
1055 1060 1065
Gln Leu Ile Ile Asn Thr Glu Lys Thr Val Ile Gly Ser Val Leu
1070 1075 1080
Leu Arg Glu Leu Lys Pro Val Leu Ser Gln Gly
1085 1090
<210> 124
<211> 1515
<212> DNA
<213> Homo sapiens
<400> 124
atgtttcccc gcgagaagac gtggaacatc tcgttcgcgg gctgcggctt cctcggcgtc 60
tactacgtcg gcgtggcctc ctgcctccgc gagcacgcgc ccttcctggt ggccaacgcc 120
acgcacatct acggcgcctc ggccggggcg ctcacggcca cggcgctggt caccggggtc 180
tgcctgggtg aggctggtgc caagttcatt gaggtatcta aagaggcccg gaagcggttc 240
ctgggccccc tgcacccctc cttcaacctg gtaaagatca tccgcagttt cctgctgaag 300
gtcctgcctg ctgatagcca tgagcatgcc agtgggcgcc tgggcatctc cctgacccgc 360
gtgtcagacg gcgagaatgt cattatatcc cacttcaact ccaaggacga gctcatccag 420
gccaatgtct gcagcggttt catccccgtg tactgtgggc tcatccctcc ctccctccag 480
ggggtgcgct acgtggatgg tggcatttca gacaacctgc cactctatga gcttaagaac 540
accatcacag tgtccccctt ctcgggcgag agtgacatct gtccgcagga cagctccacc 600
aacatccacg agctgcgggt caccaacacc agcatccagt tcaacctgcg caacctctac 660
cgcctctcca aggccctctt cccgccggag cccctggtgc tgcgagagat gtgcaagcag 720
ggataccggg atggcctgcg ctttctgcag cggaacggcc tcctgaaccg gcccaacccc 780
ttgctggcgt tgccccccgc ccgcccccac ggcccagagg acaaggacca ggcagtggag 840
agcgcccaag cggaggatta ctcgcagctg cccggagaag atcacatcct ggagcacctg 900
cccgcccggc tcaatgaggc cctgctggag gcctgcgtgg agcccacgga cctgctgacc 960
accctctcca acatgctgcc tgtgcgtctg gccacggcca tgatggtgcc ctacacgctg 1020
ccgctggaga gcgctctgtc cttcaccatc cgcttgctgg agtggctgcc cgacgttccc 1080
gaggacatcc ggtggatgaa ggagcagacg ggcagcatct gccagtacct ggtgatgcgc 1140
gccaagagga agctgggcag gcacctgccc tccaggctgc cggagcaggt ggagctgcgc 1200
cgcgtccagt cgctgccgtc cgtgccgctg tcctgcgccg cctacagaga ggcactgccc 1260
ggctggatgc gcaacaacct ctcgctgggg gacgcgctgg ccaagtggga ggagtgccag 1320
cgccagctgc tgctcggcct cttctgcacc aacgtggcct tcccgcccga agctctgcgc 1380
atgcgcgcac ccgccgaccc ggctcccgcc cccgcggacc cagcatcccc gcagcaccag 1440
ctggccgggc ctgccccctt gctgagcacc cctgctcccg aggcccggcc cgtgatcggg 1500
gccctggggc tgtga 1515
<210> 125
<211> 504
<212> PRT
<213> Homo sapiens
<400> 125
Met Phe Pro Arg Glu Lys Thr Trp Asn Ile Ser Phe Ala Gly Cys Gly
1 5 10 15
Phe Leu Gly Val Tyr Tyr Val Gly Val Ala Ser Cys Leu Arg Glu His
20 25 30
Ala Pro Phe Leu Val Ala Asn Ala Thr His Ile Tyr Gly Ala Ser Ala
35 40 45
Gly Ala Leu Thr Ala Thr Ala Leu Val Thr Gly Val Cys Leu Gly Glu
50 55 60
Ala Gly Ala Lys Phe Ile Glu Val Ser Lys Glu Ala Arg Lys Arg Phe
65 70 75 80
Leu Gly Pro Leu His Pro Ser Phe Asn Leu Val Lys Ile Ile Arg Ser
85 90 95
Phe Leu Leu Lys Val Leu Pro Ala Asp Ser His Glu His Ala Ser Gly
100 105 110
Arg Leu Gly Ile Ser Leu Thr Arg Val Ser Asp Gly Glu Asn Val Ile
115 120 125
Ile Ser His Phe Asn Ser Lys Asp Glu Leu Ile Gln Ala Asn Val Cys
130 135 140
Ser Gly Phe Ile Pro Val Tyr Cys Gly Leu Ile Pro Pro Ser Leu Gln
145 150 155 160
Gly Val Arg Tyr Val Asp Gly Gly Ile Ser Asp Asn Leu Pro Leu Tyr
165 170 175
Glu Leu Lys Asn Thr Ile Thr Val Ser Pro Phe Ser Gly Glu Ser Asp
180 185 190
Ile Cys Pro Gln Asp Ser Ser Thr Asn Ile His Glu Leu Arg Val Thr
195 200 205
Asn Thr Ser Ile Gln Phe Asn Leu Arg Asn Leu Tyr Arg Leu Ser Lys
210 215 220
Ala Leu Phe Pro Pro Glu Pro Leu Val Leu Arg Glu Met Cys Lys Gln
225 230 235 240
Gly Tyr Arg Asp Gly Leu Arg Phe Leu Gln Arg Asn Gly Leu Leu Asn
245 250 255
Arg Pro Asn Pro Leu Leu Ala Leu Pro Pro Ala Arg Pro His Gly Pro
260 265 270
Glu Asp Lys Asp Gln Ala Val Glu Ser Ala Gln Ala Glu Asp Tyr Ser
275 280 285
Gln Leu Pro Gly Glu Asp His Ile Leu Glu His Leu Pro Ala Arg Leu
290 295 300
Asn Glu Ala Leu Leu Glu Ala Cys Val Glu Pro Thr Asp Leu Leu Thr
305 310 315 320
Thr Leu Ser Asn Met Leu Pro Val Arg Leu Ala Thr Ala Met Met Val
325 330 335
Pro Tyr Thr Leu Pro Leu Glu Ser Ala Leu Ser Phe Thr Ile Arg Leu
340 345 350
Leu Glu Trp Leu Pro Asp Val Pro Glu Asp Ile Arg Trp Met Lys Glu
355 360 365
Gln Thr Gly Ser Ile Cys Gln Tyr Leu Val Met Arg Ala Lys Arg Lys
370 375 380
Leu Gly Arg His Leu Pro Ser Arg Leu Pro Glu Gln Val Glu Leu Arg
385 390 395 400
Arg Val Gln Ser Leu Pro Ser Val Pro Leu Ser Cys Ala Ala Tyr Arg
405 410 415
Glu Ala Leu Pro Gly Trp Met Arg Asn Asn Leu Ser Leu Gly Asp Ala
420 425 430
Leu Ala Lys Trp Glu Glu Cys Gln Arg Gln Leu Leu Leu Gly Leu Phe
435 440 445
Cys Thr Asn Val Ala Phe Pro Pro Glu Ala Leu Arg Met Arg Ala Pro
450 455 460
Ala Asp Pro Ala Pro Ala Pro Ala Asp Pro Ala Ser Pro Gln His Gln
465 470 475 480
Leu Ala Gly Pro Ala Pro Leu Leu Ser Thr Pro Ala Pro Glu Ala Arg
485 490 495
Pro Val Ile Gly Ala Leu Gly Leu
500
<210> 126
<211> 83
<212> PRT
<213> Homo sapiens
<400> 126
Ser Leu Pro Cys Asp Ile Cys Lys Asp Val Val Thr Ala Ala Gly Asp
1 5 10 15
Met Leu Lys Asp Asn Ala Thr Glu Glu Glu Ile Leu Val Tyr Leu Glu
20 25 30
Lys Thr Cys Asp Trp Leu Pro Lys Pro Asn Met Ser Ala Ser Cys Lys
35 40 45
Glu Ile Val Asp Ser Tyr Leu Pro Val Ile Leu Asp Ile Ile Lys Gly
50 55 60
Glu Met Ser Arg Pro Gly Glu Val Cys Ser Ala Leu Asn Leu Cys Glu
65 70 75 80
Ser Leu Gln
<210> 127
<211> 1509
<212> DNA
<213> Homo sapiens
<400> 127
atgagctgcc ccgtgcccgc ctgctgcgcg ctgctgctag tcctggggct ctgccgggcg 60
cgtccccgga acgcactgct gctcctcgcg gatgacggag gctttgagag tggcgcgtac 120
aacaacagcg ccatcgccac cccgcacctg gacgccttgg cccgccgcag cctcctcttt 180
cgcaatgcct tcacctcggt cagcagctgc tctcccagcc gcgccagcct cctcactggc 240
ctgccccagc atcagaatgg gatgtacggg ctgcaccagg acgtgcacca cttcaactcc 300
ttcgacaagg tgcggagcct gccgctgctg ctcagccaag ctggtgtgcg cacaggcatc 360
atcgggaaga agcacgtggg gccggagacc gtgtacccgt ttgactttgc gtacacggag 420
gagaatggct ccgtcctcca ggtggggcgg aacatcacta gaattaagct gctcgtccgg 480
aaattcctgc agactcagga tgaccggcct ttcttcctct acgtcgcctt ccacgacccc 540
caccgctgtg ggcactccca gccccagtac ggaaccttct gtgagaagtt tggcaacgga 600
gagagcggca tgggtcgtat cccagactgg accccccagg cctacgaccc actggacgtg 660
ctggtgcctt acttcgtccc caacaccccg gcagcccgag ccgacctggc cgctcagtac 720
accaccgtcg gccgcatgga ccaaggagtt ggactggtgc tccaggagct gcgtgacgcc 780
ggtgtcctga acgacacact ggtgatcttc acgtccgaca acgggatccc cttccccagc 840
ggcaggacca acctgtactg gccgggcact gctgaaccct tactggtgtc atccccggag 900
cacccaaaac gctggggcca agtcagcgag gcctacgtga gcctcctaga cctcacgccc 960
accatcttgg attggttctc gatcccgtac cccagctacg ccatctttgg ctcgaagacc 1020
atccacctca ctggccggtc cctcctgccg gcgctggagg ccgagcccct ctgggccacc 1080
gtctttggca gccagagcca ccacgaggtc accatgtcct accccatgcg ctccgtgcag 1140
caccggcact tccgcctcgt gcacaacctc aacttcaaga tgccctttcc catcgaccag 1200
gacttctacg tctcacccac cttccaggac ctcctgaacc gcaccacagc tggtcagccc 1260
acgggctggt acaaggacct ccgtcattac tactaccggg cgcgctggga gctctacgac 1320
cggagccggg acccccacga gacccagaac ctggccaccg acccgcgctt tgctcagctt 1380
ctggagatgc ttcgggacca gctggccaag tggcagtggg agacccacga cccctgggtg 1440
tgcgcccccg acggcgtcct ggaggagaag ctctctcccc agtgccagcc cctccacaat 1500
gagctgtga 1509
<210> 128
<211> 502
<212> PRT
<213> Homo sapiens
<400> 128
Met Ser Cys Pro Val Pro Ala Cys Cys Ala Leu Leu Leu Val Leu Gly
1 5 10 15
Leu Cys Arg Ala Arg Pro Arg Asn Ala Leu Leu Leu Leu Ala Asp Asp
20 25 30
Gly Gly Phe Glu Ser Gly Ala Tyr Asn Asn Ser Ala Ile Ala Thr Pro
35 40 45
His Leu Asp Ala Leu Ala Arg Arg Ser Leu Leu Phe Arg Asn Ala Phe
50 55 60
Thr Ser Val Ser Ser Cys Ser Pro Ser Arg Ala Ser Leu Leu Thr Gly
65 70 75 80
Leu Pro Gln His Gln Asn Gly Met Tyr Gly Leu His Gln Asp Val His
85 90 95
His Phe Asn Ser Phe Asp Lys Val Arg Ser Leu Pro Leu Leu Leu Ser
100 105 110
Gln Ala Gly Val Arg Thr Gly Ile Ile Gly Lys Lys His Val Gly Pro
115 120 125
Glu Thr Val Tyr Pro Phe Asp Phe Ala Tyr Thr Glu Glu Asn Gly Ser
130 135 140
Val Leu Gln Val Gly Arg Asn Ile Thr Arg Ile Lys Leu Leu Val Arg
145 150 155 160
Lys Phe Leu Gln Thr Gln Asp Asp Arg Pro Phe Phe Leu Tyr Val Ala
165 170 175
Phe His Asp Pro His Arg Cys Gly His Ser Gln Pro Gln Tyr Gly Thr
180 185 190
Phe Cys Glu Lys Phe Gly Asn Gly Glu Ser Gly Met Gly Arg Ile Pro
195 200 205
Asp Trp Thr Pro Gln Ala Tyr Asp Pro Leu Asp Val Leu Val Pro Tyr
210 215 220
Phe Val Pro Asn Thr Pro Ala Ala Arg Ala Asp Leu Ala Ala Gln Tyr
225 230 235 240
Thr Thr Val Gly Arg Met Asp Gln Gly Val Gly Leu Val Leu Gln Glu
245 250 255
Leu Arg Asp Ala Gly Val Leu Asn Asp Thr Leu Val Ile Phe Thr Ser
260 265 270
Asp Asn Gly Ile Pro Phe Pro Ser Gly Arg Thr Asn Leu Tyr Trp Pro
275 280 285
Gly Thr Ala Glu Pro Leu Leu Val Ser Ser Pro Glu His Pro Lys Arg
290 295 300
Trp Gly Gln Val Ser Glu Ala Tyr Val Ser Leu Leu Asp Leu Thr Pro
305 310 315 320
Thr Ile Leu Asp Trp Phe Ser Ile Pro Tyr Pro Ser Tyr Ala Ile Phe
325 330 335
Gly Ser Lys Thr Ile His Leu Thr Gly Arg Ser Leu Leu Pro Ala Leu
340 345 350
Glu Ala Glu Pro Leu Trp Ala Thr Val Phe Gly Ser Gln Ser His His
355 360 365
Glu Val Thr Met Ser Tyr Pro Met Arg Ser Val Gln His Arg His Phe
370 375 380
Arg Leu Val His Asn Leu Asn Phe Lys Met Pro Phe Pro Ile Asp Gln
385 390 395 400
Asp Phe Tyr Val Ser Pro Thr Phe Gln Asp Leu Leu Asn Arg Thr Thr
405 410 415
Ala Gly Gln Pro Thr Gly Trp Tyr Lys Asp Leu Arg His Tyr Tyr Tyr
420 425 430
Arg Ala Arg Trp Glu Leu Tyr Asp Arg Ser Arg Asp Pro His Glu Thr
435 440 445
Gln Asn Leu Ala Thr Asp Pro Arg Phe Ala Gln Leu Leu Glu Met Leu
450 455 460
Arg Asp Gln Leu Ala Lys Trp Gln Trp Glu Thr His Asp Pro Trp Val
465 470 475 480
Cys Ala Pro Asp Gly Val Leu Glu Glu Lys Leu Ser Pro Gln Cys Gln
485 490 495
Pro Leu His Asn Glu Leu
500
<210> 129
<211> 1569
<212> DNA
<213> Homo sapiens
<400> 129
atggcggcgg ttgtcgcggc gacgaggtgg tggcagctgt tgctggtgct cagcgccgcg 60
gggatggggg cctcgggcgc cccgcagccc cccaacatcc tgctcctgct catggacgac 120
atgggatggg gtgacctcgg ggtgtatgga gagccctcca gagagacccc gaatttggac 180
cggatggctg cagaagggct gcttttccca aacttctatt ctgccaaccc tctgtgctcg 240
ccatcgaggg cggcactgct cacaggacgg ctacccatcc gcaatggctt ctacaccacc 300
aacgcccatg ccagaaacgc ctacacaccg caggagattg tgggcggcat cccagactcg 360
gagcagctcc tgccggagct tctgaagaag gccggctacg tcagcaagat tgtcggcaag 420
tggcatctgg gtcacaggcc ccagttccac cccctgaagc acggatttga tgagtggttt 480
ggatccccca actgccactt tggaccttat gacaacaagg ccaggcccaa catccctgtg 540
tacagggact gggagatggt tggcagatat tatgaagaat ttcctattaa tctgaagacg 600
ggggaagcca acctcaccca gatctacctg caggaagccc tggacttcat taagagacag 660
gcacggcacc accccttttt cctctactgg gctgtcgacg ccacgcacgc acccgtctat 720
gcctccaaac ccttcttggg caccagtcag cgagggcggt atggagacgc cgtccgggag 780
attgatgaca gcattgggaa gatactggag ctcctccaag acctgcacgt cgcggacaac 840
accttcgtct tcttcacgtc ggacaacggc gctgccctca tttccgcccc cgaacaaggt 900
ggcagcaacg gcccctttct gtgtgggaag cagaccacgt ttgaaggagg gatgagggag 960
cctgccctcg catggtggcc agggcacgtc actgcaggcc aggtgagcca ccagctgggc 1020
agcatcatgg acctcttcac caccagcctg gcccttgcgg gcctgacgcc gcccagcgac 1080
agggccattg atggcctcaa cctcctcccc accctcctgc agggccggct gatggacagg 1140
cctatcttct attaccgtgg cgacacgctg atggcggcca ccctcgggca gcacaaggct 1200
cacttctgga cctggaccaa ctcctgggag aacttcagac agggcattga tttctgccct 1260
gggcagaacg tttcaggggt cacaactcac aatctggaag accacacgaa gctgcccctg 1320
atcttccacc tgggacggga cccaggggag aggttccccc tcagctttgc cagcgccgag 1380
taccaggagg ccctcagcag gatcacctcg gtcgtccagc agcaccagga ggccttggtc 1440
cccgcgcagc cccagctcaa cgtgtgcaac tgggcggtca tgaactgggc acctccgggc 1500
tgtgaaaagt tagggaagtg tctgacacct ccagaatcca ttcccaagaa gtgcctctgg 1560
tcccactag 1569
<210> 130
<211> 522
<212> PRT
<213> Homo sapiens
<400> 130
Met Ala Ala Val Val Ala Ala Thr Arg Trp Trp Gln Leu Leu Leu Val
1 5 10 15
Leu Ser Ala Ala Gly Met Gly Ala Ser Gly Ala Pro Gln Pro Pro Asn
20 25 30
Ile Leu Leu Leu Leu Met Asp Asp Met Gly Trp Gly Asp Leu Gly Val
35 40 45
Tyr Gly Glu Pro Ser Arg Glu Thr Pro Asn Leu Asp Arg Met Ala Ala
50 55 60
Glu Gly Leu Leu Phe Pro Asn Phe Tyr Ser Ala Asn Pro Leu Cys Ser
65 70 75 80
Pro Ser Arg Ala Ala Leu Leu Thr Gly Arg Leu Pro Ile Arg Asn Gly
85 90 95
Phe Tyr Thr Thr Asn Ala His Ala Arg Asn Ala Tyr Thr Pro Gln Glu
100 105 110
Ile Val Gly Gly Ile Pro Asp Ser Glu Gln Leu Leu Pro Glu Leu Leu
115 120 125
Lys Lys Ala Gly Tyr Val Ser Lys Ile Val Gly Lys Trp His Leu Gly
130 135 140
His Arg Pro Gln Phe His Pro Leu Lys His Gly Phe Asp Glu Trp Phe
145 150 155 160
Gly Ser Pro Asn Cys His Phe Gly Pro Tyr Asp Asn Lys Ala Arg Pro
165 170 175
Asn Ile Pro Val Tyr Arg Asp Trp Glu Met Val Gly Arg Tyr Tyr Glu
180 185 190
Glu Phe Pro Ile Asn Leu Lys Thr Gly Glu Ala Asn Leu Thr Gln Ile
195 200 205
Tyr Leu Gln Glu Ala Leu Asp Phe Ile Lys Arg Gln Ala Arg His His
210 215 220
Pro Phe Phe Leu Tyr Trp Ala Val Asp Ala Thr His Ala Pro Val Tyr
225 230 235 240
Ala Ser Lys Pro Phe Leu Gly Thr Ser Gln Arg Gly Arg Tyr Gly Asp
245 250 255
Ala Val Arg Glu Ile Asp Asp Ser Ile Gly Lys Ile Leu Glu Leu Leu
260 265 270
Gln Asp Leu His Val Ala Asp Asn Thr Phe Val Phe Phe Thr Ser Asp
275 280 285
Asn Gly Ala Ala Leu Ile Ser Ala Pro Glu Gln Gly Gly Ser Asn Gly
290 295 300
Pro Phe Leu Cys Gly Lys Gln Thr Thr Phe Glu Gly Gly Met Arg Glu
305 310 315 320
Pro Ala Leu Ala Trp Trp Pro Gly His Val Thr Ala Gly Gln Val Ser
325 330 335
His Gln Leu Gly Ser Ile Met Asp Leu Phe Thr Thr Ser Leu Ala Leu
340 345 350
Ala Gly Leu Thr Pro Pro Ser Asp Arg Ala Ile Asp Gly Leu Asn Leu
355 360 365
Leu Pro Thr Leu Leu Gln Gly Arg Leu Met Asp Arg Pro Ile Phe Tyr
370 375 380
Tyr Arg Gly Asp Thr Leu Met Ala Ala Thr Leu Gly Gln His Lys Ala
385 390 395 400
His Phe Trp Thr Trp Thr Asn Ser Trp Glu Asn Phe Arg Gln Gly Ile
405 410 415
Asp Phe Cys Pro Gly Gln Asn Val Ser Gly Val Thr Thr His Asn Leu
420 425 430
Glu Asp His Thr Lys Leu Pro Leu Ile Phe His Leu Gly Arg Asp Pro
435 440 445
Gly Glu Arg Phe Pro Leu Ser Phe Ala Ser Ala Glu Tyr Gln Glu Ala
450 455 460
Leu Ser Arg Ile Thr Ser Val Val Gln Gln His Gln Glu Ala Leu Val
465 470 475 480
Pro Ala Gln Pro Gln Leu Asn Val Cys Asn Trp Ala Val Met Asn Trp
485 490 495
Ala Pro Pro Gly Cys Glu Lys Leu Gly Lys Cys Leu Thr Pro Pro Glu
500 505 510
Ser Ile Pro Lys Lys Cys Leu Trp Ser His
515 520
<210> 131
<211> 1602
<212> DNA
<213> Homo sapiens
<400> 131
atgggtccgc gcggcgcggc gagcttgccc cgaggccccg gacctcggcg gctgctcctc 60
cccgtcgtcc tcccgctgct gctgctgctg ttgttggcgc cgccgggctc gggcgccggg 120
gccagccggc cgccccacct ggtcttcttg ctggcagacg acctaggctg gaacgacgtc 180
ggcttccacg gctcccgcat ccgcacgccg cacctggacg cgctggcggc cggcggggtg 240
ctcctggaca actactacac gcagccgctg tgcacgccgt cgcggagcca gctgctcact 300
ggccgctacc agatccgtac aggtttacag caccaaataa tctggccctg tcagcccagc 360
tgtgttcctc tggatgaaaa actcctgccc cagctcctaa aagaagcagg ttatactacc 420
catatggtcg gaaaatggca cctgggaatg taccggaaag aatgccttcc aacccgccga 480
ggatttgata cctactttgg atatctcctg ggtagtgaag attattattc ccatgaacgc 540
tgtacattaa ttgacgctct gaatgtcaca cgatgtgctc ttgattttcg agatggcgaa 600
gaagttgcaa caggatataa aaatatgtat tcaacaaaca tattcaccaa aagggctata 660
gccctcataa ctaaccatcc accagagaag cctctgtttc tctaccttgc tctccagtct 720
gtgcatgagc cccttcaggt ccctgaggaa tacttgaagc catatgactt tatccaagac 780
aagaacaggc atcactatgc aggaatggtg tcccttatgg atgaagcagt aggaaatgtc 840
actgcagctt taaaaagcag tgggctctgg aacaacacgg tgttcatctt ttctacagat 900
aacggagggc agactttggc agggggtaat aactggcccc ttcgaggaag aaaatggagc 960
ctgtgggaag gaggcgtccg aggggtgggc tttgtggcaa gccccttgct gaagcagaag 1020
ggcgtgaaga accgggagct catccacatc tctgactggc tgccaacact cgtgaagctg 1080
gccaggggac acaccaatgg cacaaagcct ctggatggct tcgacgtgtg gaaaaccatc 1140
agtgaaggaa gcccatcccc cagaattgag ctgctgcata atattgaccc gaacttcgtg 1200
gactcttcac cgtgtcccag gaacagcatg gctccagcaa aggatgactc ttctcttcca 1260
gaatattcag cctttaacac atctgtccat gctgcaatta gacatggaaa ttggaaactc 1320
ctcacgggct acccaggctg tggttactgg ttccctccac cgtctcaata caatgtttct 1380
gagataccct catcagaccc accaaccaag accctctggc tctttgatat tgatcgggac 1440
cctgaagaaa gacatgacct gtccagagaa tatcctcaca tcgtcacaaa gctcctgtcc 1500
cgcctacagt tctaccataa acactcagtc cccgtgtact tccctgcaca ggacccccgc 1560
tgtgatccca aggccactgg ggtgtggggc ccttggatgt ag 1602
<210> 132
<211> 533
<212> PRT
<213> Homo sapiens
<400> 132
Met Gly Pro Arg Gly Ala Ala Ser Leu Pro Arg Gly Pro Gly Pro Arg
1 5 10 15
Arg Leu Leu Leu Pro Val Val Leu Pro Leu Leu Leu Leu Leu Leu Leu
20 25 30
Ala Pro Pro Gly Ser Gly Ala Gly Ala Ser Arg Pro Pro His Leu Val
35 40 45
Phe Leu Leu Ala Asp Asp Leu Gly Trp Asn Asp Val Gly Phe His Gly
50 55 60
Ser Arg Ile Arg Thr Pro His Leu Asp Ala Leu Ala Ala Gly Gly Val
65 70 75 80
Leu Leu Asp Asn Tyr Tyr Thr Gln Pro Leu Cys Thr Pro Ser Arg Ser
85 90 95
Gln Leu Leu Thr Gly Arg Tyr Gln Ile Arg Thr Gly Leu Gln His Gln
100 105 110
Ile Ile Trp Pro Cys Gln Pro Ser Cys Val Pro Leu Asp Glu Lys Leu
115 120 125
Leu Pro Gln Leu Leu Lys Glu Ala Gly Tyr Thr Thr His Met Val Gly
130 135 140
Lys Trp His Leu Gly Met Tyr Arg Lys Glu Cys Leu Pro Thr Arg Arg
145 150 155 160
Gly Phe Asp Thr Tyr Phe Gly Tyr Leu Leu Gly Ser Glu Asp Tyr Tyr
165 170 175
Ser His Glu Arg Cys Thr Leu Ile Asp Ala Leu Asn Val Thr Arg Cys
180 185 190
Ala Leu Asp Phe Arg Asp Gly Glu Glu Val Ala Thr Gly Tyr Lys Asn
195 200 205
Met Tyr Ser Thr Asn Ile Phe Thr Lys Arg Ala Ile Ala Leu Ile Thr
210 215 220
Asn His Pro Pro Glu Lys Pro Leu Phe Leu Tyr Leu Ala Leu Gln Ser
225 230 235 240
Val His Glu Pro Leu Gln Val Pro Glu Glu Tyr Leu Lys Pro Tyr Asp
245 250 255
Phe Ile Gln Asp Lys Asn Arg His His Tyr Ala Gly Met Val Ser Leu
260 265 270
Met Asp Glu Ala Val Gly Asn Val Thr Ala Ala Leu Lys Ser Ser Gly
275 280 285
Leu Trp Asn Asn Thr Val Phe Ile Phe Ser Thr Asp Asn Gly Gly Gln
290 295 300
Thr Leu Ala Gly Gly Asn Asn Trp Pro Leu Arg Gly Arg Lys Trp Ser
305 310 315 320
Leu Trp Glu Gly Gly Val Arg Gly Val Gly Phe Val Ala Ser Pro Leu
325 330 335
Leu Lys Gln Lys Gly Val Lys Asn Arg Glu Leu Ile His Ile Ser Asp
340 345 350
Trp Leu Pro Thr Leu Val Lys Leu Ala Arg Gly His Thr Asn Gly Thr
355 360 365
Lys Pro Leu Asp Gly Phe Asp Val Trp Lys Thr Ile Ser Glu Gly Ser
370 375 380
Pro Ser Pro Arg Ile Glu Leu Leu His Asn Ile Asp Pro Asn Phe Val
385 390 395 400
Asp Ser Ser Pro Cys Pro Arg Asn Ser Met Ala Pro Ala Lys Asp Asp
405 410 415
Ser Ser Leu Pro Glu Tyr Ser Ala Phe Asn Thr Ser Val His Ala Ala
420 425 430
Ile Arg His Gly Asn Trp Lys Leu Leu Thr Gly Tyr Pro Gly Cys Gly
435 440 445
Tyr Trp Phe Pro Pro Pro Ser Gln Tyr Asn Val Ser Glu Ile Pro Ser
450 455 460
Ser Asp Pro Pro Thr Lys Thr Leu Trp Leu Phe Asp Ile Asp Arg Asp
465 470 475 480
Pro Glu Glu Arg His Asp Leu Ser Arg Glu Tyr Pro His Ile Val Thr
485 490 495
Lys Leu Leu Ser Arg Leu Gln Phe Tyr His Lys His Ser Val Pro Val
500 505 510
Tyr Phe Pro Ala Gln Asp Pro Arg Cys Asp Pro Lys Ala Thr Gly Val
515 520 525
Trp Gly Pro Trp Met
530
<210> 133
<211> 3837
<212> DNA
<213> Homo sapiens
<400> 133
atgaccgctc gcggcctggc ccttggcctc ctcctgctgc tactgtgtcc agcgcaggtg 60
ttttcacagt cctgtgtttg gtatggagag tgtggaattg catatgggga caagaggtac 120
aattgcgaat attctggccc accaaaacca ttgccaaagg atggatatga cttagtgcag 180
gaactctgtc caggattctt ctttggcaat gtcagtctct gttgtgatgt tcggcagctt 240
cagacactaa aagacaacct gcagctgcct ctacagtttc tgtccagatg tccatcctgt 300
ttttataacc tactgaacct gttttgtgag ctgacatgta gccctcgaca gagtcagttt 360
ttgaatgtta cagctactga agattatgtt gatcctgtta caaaccagac gaaaacaaat 420
gtgaaagagt tacaatacta cgtcggacag agttttgcca atgcaatgta caatgcctgc 480
cgggatgtgg aggccccctc aagtaatgac aaggccctgg gactcctgtg tgggaaggac 540
gctgacgcct gtaatgccac caactggatt gaatacatgt tcaataagga caatggacag 600
gcacctttta ccatcactcc tgtgttttca gattttccag tccatgggat ggagcccatg 660
aacaatgcca ccaaaggctg tgacgagtct gtggatgagg tcacagcacc atgtagctgc 720
caagactgct ctattgtctg tggccccaag ccccagcccc cacctcctcc tgctccctgg 780
acgatccttg gcttggacgc catgtatgtc atcatgtgga tcacctacat ggcgtttttg 840
cttgtgtttt ttggagcatt ttttgcagtg tggtgctaca gaaaacggta ttttgtctcc 900
gagtacactc ccatcgatag caatatagct ttttctgtta atgcaagtga caaaggagag 960
gcgtcctgct gtgaccctgt cagcgcagca tttgagggct gcttgaggcg gctgttcaca 1020
cgctgggggt ctttctgcgt ccgaaaccct ggctgtgtca ttttcttctc gctggtcttc 1080
attactgcgt gttcgtcagg cctggtgttt gtccgggtca caaccaatcc agttgacctc 1140
tggtcagccc ccagcagcca ggctcgcctg gaaaaagagt actttgacca gcactttggg 1200
cctttcttcc ggacggagca gctcatcatc cgggcccctc tcactgacaa acacatttac 1260
cagccatacc cttcgggagc tgatgtaccc tttggacctc cgcttgacat acagatactg 1320
caccaggttc ttgacttaca aatagccatc gaaaacatta ctgcctctta tgacaatgag 1380
actgtgacac ttcaagacat ctgcttggcc cctctttcac cgtataacac gaactgcacc 1440
attttgagtg tgttaaatta cttccagaac agccattccg tgctggacca caagaaaggg 1500
gacgacttct ttgtgtatgc cgattaccac acgcactttc tgtactgcgt acgggctcct 1560
gcctctctga atgatacaag tttgctccat gacccttgtc tgggtacgtt tggtggacca 1620
gtgttcccgt ggcttgtgtt gggaggctat gatgatcaaa actacaataa cgccactgcc 1680
cttgtgatta ccttccctgt caataattac tataatgata cagagaagct ccagagggcc 1740
caggcctggg aaaaagagtt tattaatttt gtgaaaaact acaagaatcc caatctgacc 1800
atttccttca ctgctgaacg aagtattgaa gatgaactaa atcgtgaaag tgacagtgat 1860
gtcttcaccg ttgtaattag ctatgccatc atgtttctat atatttccct agccttgggg 1920
cacatgaaaa gctgtcgcag gcttctggtg gattcgaagg tctcactagg catcgcgggc 1980
atcttgatcg tgctgagctc ggtggcttgc tccttgggtg tcttcagcta cattgggttg 2040
cccttgaccc tcattgtgat tgaagtcatc ccgttcctgg tgctggctgt tggagtggac 2100
aacatcttca ttctggtgca ggcctaccag agagatgaac gtcttcaagg ggaaaccctg 2160
gatcagcagc tgggcagggt cctaggagaa gtggctccca gtatgttcct gtcatccttt 2220
tctgagactg tagcattttt cttaggagca ttgtccgtga tgccagccgt gcacaccttc 2280
tctctctttg cgggattggc agtcttcatt gactttcttc tgcagattac ctgtttcgtg 2340
agtctcttgg ggttagacat taaacgtcaa gagaaaaatc ggctagacat cttttgctgt 2400
gtcagaggtg ctgaagatgg aacaagcgtc caggcctcag agagctgttt gtttcgcttc 2460
ttcaaaaact cctattctcc acttctgcta aaggactgga tgagaccaat tgtgatagca 2520
atatttgtgg gtgttctgtc attcagcatc gcagtcctga acaaagtaga tattggattg 2580
gatcagtctc tttcgatgcc agatgactcc tacatggtgg attatttcaa atccatcagt 2640
cagtacctgc atgcgggtcc gcctgtgtac tttgtcctgg aggaagggca cgactacact 2700
tcttccaagg ggcagaacat ggtgtgcggc ggcatgggct gcaacaatga ttccctggtg 2760
cagcagatat ttaacgcggc gcagctggac aactataccc gaataggctt cgccccctcg 2820
tcctggatcg acgattattt cgactgggtg aagccacagt cgtcttgctg tcgagtggac 2880
aatatcactg accagttctg caatgcttca gtggttgacc ctgcctgcgt tcgctgcagg 2940
cctctgactc cggaaggcaa acagaggcct caggggggag acttcatgag attcctgccc 3000
atgttccttt cggataaccc taaccccaag tgtggcaaag ggggacatgc tgcctatagt 3060
tctgcagtta acatcctcct tggccatggc accagggtcg gagccacgta cttcatgacc 3120
taccacaccg tgctgcagac ctctgctgac tttattgacg ctctgaagaa agcccgactt 3180
atagccagta atgtcaccga aaccatgggc attaacggca gtgcctaccg agtatttcct 3240
tacagtgtgt tttatgtctt ctacgaacag tacctgacca tcattgacga cactatcttc 3300
aacctcggtg tgtccctggg cgcgatattt ctggtgacca tggtcctcct gggctgtgag 3360
ctctggtctg cagtcatcat gtgtgccacc atcgccatgg tcttggtcaa catgtttgga 3420
gttatgtggc tctggggcat cagtctgaac gctgtatcct tggtcaacct ggtgatgagc 3480
tgtggcatct ccgtggagtt ctgcagccac ataaccagag cgttcacggt gagcatgaaa 3540
ggcagccgcg tggagcgcgc ggaagaggca cttgcccaca tgggcagctc cgtgttcagt 3600
ggaatcacac ttacaaaatt tggagggatt gtggtgttgg cttttgccaa atctcaaatt 3660
ttccagatat tctacttcag gatgtatttg gccatggtct tactgggagc cactcacgga 3720
ttaatatttc tccctgtctt actcagttac atagggccat cagtaaataa agccaaaagt 3780
tgtgccactg aagagcgata caaaggaaca gagcgcgaac ggcttctaaa tttctag 3837
<210> 134
<211> 1278
<212> PRT
<213> Homo sapiens
<400> 134
Met Thr Ala Arg Gly Leu Ala Leu Gly Leu Leu Leu Leu Leu Leu Cys
1 5 10 15
Pro Ala Gln Val Phe Ser Gln Ser Cys Val Trp Tyr Gly Glu Cys Gly
20 25 30
Ile Ala Tyr Gly Asp Lys Arg Tyr Asn Cys Glu Tyr Ser Gly Pro Pro
35 40 45
Lys Pro Leu Pro Lys Asp Gly Tyr Asp Leu Val Gln Glu Leu Cys Pro
50 55 60
Gly Phe Phe Phe Gly Asn Val Ser Leu Cys Cys Asp Val Arg Gln Leu
65 70 75 80
Gln Thr Leu Lys Asp Asn Leu Gln Leu Pro Leu Gln Phe Leu Ser Arg
85 90 95
Cys Pro Ser Cys Phe Tyr Asn Leu Leu Asn Leu Phe Cys Glu Leu Thr
100 105 110
Cys Ser Pro Arg Gln Ser Gln Phe Leu Asn Val Thr Ala Thr Glu Asp
115 120 125
Tyr Val Asp Pro Val Thr Asn Gln Thr Lys Thr Asn Val Lys Glu Leu
130 135 140
Gln Tyr Tyr Val Gly Gln Ser Phe Ala Asn Ala Met Tyr Asn Ala Cys
145 150 155 160
Arg Asp Val Glu Ala Pro Ser Ser Asn Asp Lys Ala Leu Gly Leu Leu
165 170 175
Cys Gly Lys Asp Ala Asp Ala Cys Asn Ala Thr Asn Trp Ile Glu Tyr
180 185 190
Met Phe Asn Lys Asp Asn Gly Gln Ala Pro Phe Thr Ile Thr Pro Val
195 200 205
Phe Ser Asp Phe Pro Val His Gly Met Glu Pro Met Asn Asn Ala Thr
210 215 220
Lys Gly Cys Asp Glu Ser Val Asp Glu Val Thr Ala Pro Cys Ser Cys
225 230 235 240
Gln Asp Cys Ser Ile Val Cys Gly Pro Lys Pro Gln Pro Pro Pro Pro
245 250 255
Pro Ala Pro Trp Thr Ile Leu Gly Leu Asp Ala Met Tyr Val Ile Met
260 265 270
Trp Ile Thr Tyr Met Ala Phe Leu Leu Val Phe Phe Gly Ala Phe Phe
275 280 285
Ala Val Trp Cys Tyr Arg Lys Arg Tyr Phe Val Ser Glu Tyr Thr Pro
290 295 300
Ile Asp Ser Asn Ile Ala Phe Ser Val Asn Ala Ser Asp Lys Gly Glu
305 310 315 320
Ala Ser Cys Cys Asp Pro Val Ser Ala Ala Phe Glu Gly Cys Leu Arg
325 330 335
Arg Leu Phe Thr Arg Trp Gly Ser Phe Cys Val Arg Asn Pro Gly Cys
340 345 350
Val Ile Phe Phe Ser Leu Val Phe Ile Thr Ala Cys Ser Ser Gly Leu
355 360 365
Val Phe Val Arg Val Thr Thr Asn Pro Val Asp Leu Trp Ser Ala Pro
370 375 380
Ser Ser Gln Ala Arg Leu Glu Lys Glu Tyr Phe Asp Gln His Phe Gly
385 390 395 400
Pro Phe Phe Arg Thr Glu Gln Leu Ile Ile Arg Ala Pro Leu Thr Asp
405 410 415
Lys His Ile Tyr Gln Pro Tyr Pro Ser Gly Ala Asp Val Pro Phe Gly
420 425 430
Pro Pro Leu Asp Ile Gln Ile Leu His Gln Val Leu Asp Leu Gln Ile
435 440 445
Ala Ile Glu Asn Ile Thr Ala Ser Tyr Asp Asn Glu Thr Val Thr Leu
450 455 460
Gln Asp Ile Cys Leu Ala Pro Leu Ser Pro Tyr Asn Thr Asn Cys Thr
465 470 475 480
Ile Leu Ser Val Leu Asn Tyr Phe Gln Asn Ser His Ser Val Leu Asp
485 490 495
His Lys Lys Gly Asp Asp Phe Phe Val Tyr Ala Asp Tyr His Thr His
500 505 510
Phe Leu Tyr Cys Val Arg Ala Pro Ala Ser Leu Asn Asp Thr Ser Leu
515 520 525
Leu His Asp Pro Cys Leu Gly Thr Phe Gly Gly Pro Val Phe Pro Trp
530 535 540
Leu Val Leu Gly Gly Tyr Asp Asp Gln Asn Tyr Asn Asn Ala Thr Ala
545 550 555 560
Leu Val Ile Thr Phe Pro Val Asn Asn Tyr Tyr Asn Asp Thr Glu Lys
565 570 575
Leu Gln Arg Ala Gln Ala Trp Glu Lys Glu Phe Ile Asn Phe Val Lys
580 585 590
Asn Tyr Lys Asn Pro Asn Leu Thr Ile Ser Phe Thr Ala Glu Arg Ser
595 600 605
Ile Glu Asp Glu Leu Asn Arg Glu Ser Asp Ser Asp Val Phe Thr Val
610 615 620
Val Ile Ser Tyr Ala Ile Met Phe Leu Tyr Ile Ser Leu Ala Leu Gly
625 630 635 640
His Met Lys Ser Cys Arg Arg Leu Leu Val Asp Ser Lys Val Ser Leu
645 650 655
Gly Ile Ala Gly Ile Leu Ile Val Leu Ser Ser Val Ala Cys Ser Leu
660 665 670
Gly Val Phe Ser Tyr Ile Gly Leu Pro Leu Thr Leu Ile Val Ile Glu
675 680 685
Val Ile Pro Phe Leu Val Leu Ala Val Gly Val Asp Asn Ile Phe Ile
690 695 700
Leu Val Gln Ala Tyr Gln Arg Asp Glu Arg Leu Gln Gly Glu Thr Leu
705 710 715 720
Asp Gln Gln Leu Gly Arg Val Leu Gly Glu Val Ala Pro Ser Met Phe
725 730 735
Leu Ser Ser Phe Ser Glu Thr Val Ala Phe Phe Leu Gly Ala Leu Ser
740 745 750
Val Met Pro Ala Val His Thr Phe Ser Leu Phe Ala Gly Leu Ala Val
755 760 765
Phe Ile Asp Phe Leu Leu Gln Ile Thr Cys Phe Val Ser Leu Leu Gly
770 775 780
Leu Asp Ile Lys Arg Gln Glu Lys Asn Arg Leu Asp Ile Phe Cys Cys
785 790 795 800
Val Arg Gly Ala Glu Asp Gly Thr Ser Val Gln Ala Ser Glu Ser Cys
805 810 815
Leu Phe Arg Phe Phe Lys Asn Ser Tyr Ser Pro Leu Leu Leu Lys Asp
820 825 830
Trp Met Arg Pro Ile Val Ile Ala Ile Phe Val Gly Val Leu Ser Phe
835 840 845
Ser Ile Ala Val Leu Asn Lys Val Asp Ile Gly Leu Asp Gln Ser Leu
850 855 860
Ser Met Pro Asp Asp Ser Tyr Met Val Asp Tyr Phe Lys Ser Ile Ser
865 870 875 880
Gln Tyr Leu His Ala Gly Pro Pro Val Tyr Phe Val Leu Glu Glu Gly
885 890 895
His Asp Tyr Thr Ser Ser Lys Gly Gln Asn Met Val Cys Gly Gly Met
900 905 910
Gly Cys Asn Asn Asp Ser Leu Val Gln Gln Ile Phe Asn Ala Ala Gln
915 920 925
Leu Asp Asn Tyr Thr Arg Ile Gly Phe Ala Pro Ser Ser Trp Ile Asp
930 935 940
Asp Tyr Phe Asp Trp Val Lys Pro Gln Ser Ser Cys Cys Arg Val Asp
945 950 955 960
Asn Ile Thr Asp Gln Phe Cys Asn Ala Ser Val Val Asp Pro Ala Cys
965 970 975
Val Arg Cys Arg Pro Leu Thr Pro Glu Gly Lys Gln Arg Pro Gln Gly
980 985 990
Gly Asp Phe Met Arg Phe Leu Pro Met Phe Leu Ser Asp Asn Pro Asn
995 1000 1005
Pro Lys Cys Gly Lys Gly Gly His Ala Ala Tyr Ser Ser Ala Val
1010 1015 1020
Asn Ile Leu Leu Gly His Gly Thr Arg Val Gly Ala Thr Tyr Phe
1025 1030 1035
Met Thr Tyr His Thr Val Leu Gln Thr Ser Ala Asp Phe Ile Asp
1040 1045 1050
Ala Leu Lys Lys Ala Arg Leu Ile Ala Ser Asn Val Thr Glu Thr
1055 1060 1065
Met Gly Ile Asn Gly Ser Ala Tyr Arg Val Phe Pro Tyr Ser Val
1070 1075 1080
Phe Tyr Val Phe Tyr Glu Gln Tyr Leu Thr Ile Ile Asp Asp Thr
1085 1090 1095
Ile Phe Asn Leu Gly Val Ser Leu Gly Ala Ile Phe Leu Val Thr
1100 1105 1110
Met Val Leu Leu Gly Cys Glu Leu Trp Ser Ala Val Ile Met Cys
1115 1120 1125
Ala Thr Ile Ala Met Val Leu Val Asn Met Phe Gly Val Met Trp
1130 1135 1140
Leu Trp Gly Ile Ser Leu Asn Ala Val Ser Leu Val Asn Leu Val
1145 1150 1155
Met Ser Cys Gly Ile Ser Val Glu Phe Cys Ser His Ile Thr Arg
1160 1165 1170
Ala Phe Thr Val Ser Met Lys Gly Ser Arg Val Glu Arg Ala Glu
1175 1180 1185
Glu Ala Leu Ala His Met Gly Ser Ser Val Phe Ser Gly Ile Thr
1190 1195 1200
Leu Thr Lys Phe Gly Gly Ile Val Val Leu Ala Phe Ala Lys Ser
1205 1210 1215
Gln Ile Phe Gln Ile Phe Tyr Phe Arg Met Tyr Leu Ala Met Val
1220 1225 1230
Leu Leu Gly Ala Thr His Gly Leu Ile Phe Leu Pro Val Leu Leu
1235 1240 1245
Ser Tyr Ile Gly Pro Ser Val Asn Lys Ala Lys Ser Cys Ala Thr
1250 1255 1260
Glu Glu Arg Tyr Lys Gly Thr Glu Arg Glu Arg Leu Leu Asn Phe
1265 1270 1275
<210> 135
<211> 1224
<212> DNA
<213> Homo sapiens
<400> 135
atgcgccgga acctgcgctt ggggccaagc tctggagctg acgcgcaggg gcaaggcgcc 60
ccgcgtcccg gactggcggc tccgcgcatg ctcctcccac cggcgtcgca ggcctcgaga 120
ggctccggaa gtactgggtg cagcctgatg gcgcaggagg tagacacggc acagggcgcc 180
gagatgcggc ggggcgcggg cgcggctcgg ggacgcgctt cctggtgctg ggccctggcg 240
ctgctttggc tcgcggtggt tccgggctgg tcccgggtct cgggcatccc ctcccggcgc 300
cactggccgg tgccctacaa gcgctttgac ttccgtccaa aacctgatcc ttattgtcaa 360
gctaagtata ctttctgtcc aactggctca cctatcccag ttatggaggg tgatgatgac 420
attgaagttt ttcgattaca agccccagta tgggaattta aatatggaga cctcctggga 480
cacttgaaaa ttatgcatga tgccattgga ttcagaagta cattaactgg caagaactac 540
acaatggaat ggtatgaact tttccaactt ggcaactgta catttcccca tctccgacct 600
gaaatggatg cccctttctg gtgtaatcaa ggcgctgcct gcttttttga gggaattgat 660
gatgttcact ggaaggaaaa tgggacatta gttcaagtag caactatatc aggaaacatg 720
ttcaaccaaa tggcaaagtg ggtgaaacag gacaatgaaa caggaattta ttatgagaca 780
tggaatgtaa aagccagccc agaaaagggg gcagagacat ggtttgattc ctacgactgt 840
tccaaatttg tgttaaggac ctttaacaag ttggctgaat ttggagcaga gttcaagaac 900
atagaaacca actatacaag aatatttctt tacagtggag aacctactta tctgggaaat 960
gaaacatctg tttttgggcc aacaggaaac aagactcttg gtttagccat aaaaagattt 1020
tattacccct tcaaaccaca tttgccaact aaagaatttc tgttgagtct cttgcaaatt 1080
tttgatgcag tgattgtgca caaacagttc tatttgtttt ataattttga atattggttt 1140
ttacctatga aattcccttt tattaaaata acatatgaag aaatcccttt acctatcaga 1200
aacaaaacac tctctggttt ataa 1224
<210> 136
<211> 358
<212> PRT
<213> Homo sapiens
<400> 136
Met Ala Gln Glu Val Asp Thr Ala Gln Gly Ala Glu Met Arg Arg Gly
1 5 10 15
Ala Gly Ala Ala Arg Gly Arg Ala Ser Trp Cys Trp Ala Leu Ala Leu
20 25 30
Leu Trp Leu Ala Val Val Pro Gly Trp Ser Arg Val Ser Gly Ile Pro
35 40 45
Ser Arg Arg His Trp Pro Val Pro Tyr Lys Arg Phe Asp Phe Arg Pro
50 55 60
Lys Pro Asp Pro Tyr Cys Gln Ala Lys Tyr Thr Phe Cys Pro Thr Gly
65 70 75 80
Ser Pro Ile Pro Val Met Glu Gly Asp Asp Asp Ile Glu Val Phe Arg
85 90 95
Leu Gln Ala Pro Val Trp Glu Phe Lys Tyr Gly Asp Leu Leu Gly His
100 105 110
Leu Lys Ile Met His Asp Ala Ile Gly Phe Arg Ser Thr Leu Thr Gly
115 120 125
Lys Asn Tyr Thr Met Glu Trp Tyr Glu Leu Phe Gln Leu Gly Asn Cys
130 135 140
Thr Phe Pro His Leu Arg Pro Glu Met Asp Ala Pro Phe Trp Cys Asn
145 150 155 160
Gln Gly Ala Ala Cys Phe Phe Glu Gly Ile Asp Asp Val His Trp Lys
165 170 175
Glu Asn Gly Thr Leu Val Gln Val Ala Thr Ile Ser Gly Asn Met Phe
180 185 190
Asn Gln Met Ala Lys Trp Val Lys Gln Asp Asn Glu Thr Gly Ile Tyr
195 200 205
Tyr Glu Thr Trp Asn Val Lys Ala Ser Pro Glu Lys Gly Ala Glu Thr
210 215 220
Trp Phe Asp Ser Tyr Asp Cys Ser Lys Phe Val Leu Arg Thr Phe Asn
225 230 235 240
Lys Leu Ala Glu Phe Gly Ala Glu Phe Lys Asn Ile Glu Thr Asn Tyr
245 250 255
Thr Arg Ile Phe Leu Tyr Ser Gly Glu Pro Thr Tyr Leu Gly Asn Glu
260 265 270
Thr Ser Val Phe Gly Pro Thr Gly Asn Lys Thr Leu Gly Leu Ala Ile
275 280 285
Lys Arg Phe Tyr Tyr Pro Phe Lys Pro His Leu Pro Thr Lys Glu Phe
290 295 300
Leu Leu Ser Leu Leu Gln Ile Phe Asp Ala Val Ile Val His Lys Gln
305 310 315 320
Phe Tyr Leu Phe Tyr Asn Phe Glu Tyr Trp Phe Leu Pro Met Lys Phe
325 330 335
Pro Phe Ile Lys Ile Thr Tyr Glu Glu Ile Pro Leu Pro Ile Arg Asn
340 345 350
Lys Thr Leu Ser Gly Leu
355
<210> 137
<211> 1692
<212> DNA
<213> Homo sapiens
<400> 137
atgggactcc aagcctgcct cctagggctc tttgccctca tcctctctgg caaatgcagt 60
tacagcccgg agcccgacca gcggaggacg ctgcccccag gctgggtgtc cctgggccgt 120
gcggaccctg aggaagagct gagtctcacc tttgccctga gacagcagaa tgtggaaaga 180
ctctcggagc tggtgcaggc tgtgtcggat cccagctctc ctcaatacgg aaaatacctg 240
accctagaga atgtggctga tctggtgagg ccatccccac tgaccctcca cacggtgcaa 300
aaatggctct tggcagccgg agcccagaag tgccattctg tgatcacaca ggactttctg 360
acttgctggc tgagcatccg acaagcagag ctgctgctcc ctggggctga gtttcatcac 420
tatgtgggag gacctacgga aacccatgtt gtaaggtccc cacatcccta ccagcttcca 480
caggccttgg ccccccatgt ggactttgtg gggggactgc accgttttcc cccaacatca 540
tccctgaggc aacgtcctga gccgcaggtg acagggactg taggcctgca tctgggggta 600
accccctctg tgatccgtaa gcgatacaac ttgacctcac aagacgtggg ctctggcacc 660
agcaataaca gccaagcctg tgcccagttc ctggagcagt atttccatga ctcagacctg 720
gctcagttca tgcgcctctt cggtggcaac tttgcacatc aggcatcagt agcccgtgtg 780
gttggacaac agggccgggg ccgggccggg attgaggcca gtctagatgt gcagtacctg 840
atgagtgctg gtgccaacat ctccacctgg gtctacagta gccctggccg gcatgaggga 900
caggagccct tcctgcagtg gctcatgctg ctcagtaatg agtcagccct gccacatgtg 960
catactgtga gctatggaga tgatgaggac tccctcagca gcgcctacat ccagcgggtc 1020
aacactgagc tcatgaaggc tgccgctcgg ggtctcaccc tgctcttcgc ctcaggtgac 1080
agtggggccg ggtgttggtc tgtctctgga agacaccagt tccgccctac cttccctgcc 1140
tccagcccct atgtcaccac agtgggaggc acatccttcc aggaaccttt cctcatcaca 1200
aatgaaattg ttgactatat cagtggtggt ggcttcagca atgtgttccc acggccttca 1260
taccaggagg aagctgtaac gaagttcctg agctctagcc cccacctgcc accatccagt 1320
tacttcaatg ccagtggccg tgcctaccca gatgtggctg cactttctga tggctactgg 1380
gtggtcagca acagagtgcc cattccatgg gtgtccggaa cctcggcctc tactccagtg 1440
tttgggggga tcctatcctt gatcaatgag cacaggatcc ttagtggccg cccccctctt 1500
ggctttctca acccaaggct ctaccagcag catggggcag gactctttga tgtaacccgt 1560
ggctgccatg agtcctgtct ggatgaagag gtagagggcc agggtttctg ctctggtcct 1620
ggctgggatc ctgtaacagg ctggggaaca cccaacttcc cagctttgct gaagactcta 1680
ctcaacccct ga 1692
<210> 138
<211> 563
<212> PRT
<213> Homo sapiens
<400> 138
Met Gly Leu Gln Ala Cys Leu Leu Gly Leu Phe Ala Leu Ile Leu Ser
1 5 10 15
Gly Lys Cys Ser Tyr Ser Pro Glu Pro Asp Gln Arg Arg Thr Leu Pro
20 25 30
Pro Gly Trp Val Ser Leu Gly Arg Ala Asp Pro Glu Glu Glu Leu Ser
35 40 45
Leu Thr Phe Ala Leu Arg Gln Gln Asn Val Glu Arg Leu Ser Glu Leu
50 55 60
Val Gln Ala Val Ser Asp Pro Ser Ser Pro Gln Tyr Gly Lys Tyr Leu
65 70 75 80
Thr Leu Glu Asn Val Ala Asp Leu Val Arg Pro Ser Pro Leu Thr Leu
85 90 95
His Thr Val Gln Lys Trp Leu Leu Ala Ala Gly Ala Gln Lys Cys His
100 105 110
Ser Val Ile Thr Gln Asp Phe Leu Thr Cys Trp Leu Ser Ile Arg Gln
115 120 125
Ala Glu Leu Leu Leu Pro Gly Ala Glu Phe His His Tyr Val Gly Gly
130 135 140
Pro Thr Glu Thr His Val Val Arg Ser Pro His Pro Tyr Gln Leu Pro
145 150 155 160
Gln Ala Leu Ala Pro His Val Asp Phe Val Gly Gly Leu His Arg Phe
165 170 175
Pro Pro Thr Ser Ser Leu Arg Gln Arg Pro Glu Pro Gln Val Thr Gly
180 185 190
Thr Val Gly Leu His Leu Gly Val Thr Pro Ser Val Ile Arg Lys Arg
195 200 205
Tyr Asn Leu Thr Ser Gln Asp Val Gly Ser Gly Thr Ser Asn Asn Ser
210 215 220
Gln Ala Cys Ala Gln Phe Leu Glu Gln Tyr Phe His Asp Ser Asp Leu
225 230 235 240
Ala Gln Phe Met Arg Leu Phe Gly Gly Asn Phe Ala His Gln Ala Ser
245 250 255
Val Ala Arg Val Val Gly Gln Gln Gly Arg Gly Arg Ala Gly Ile Glu
260 265 270
Ala Ser Leu Asp Val Gln Tyr Leu Met Ser Ala Gly Ala Asn Ile Ser
275 280 285
Thr Trp Val Tyr Ser Ser Pro Gly Arg His Glu Gly Gln Glu Pro Phe
290 295 300
Leu Gln Trp Leu Met Leu Leu Ser Asn Glu Ser Ala Leu Pro His Val
305 310 315 320
His Thr Val Ser Tyr Gly Asp Asp Glu Asp Ser Leu Ser Ser Ala Tyr
325 330 335
Ile Gln Arg Val Asn Thr Glu Leu Met Lys Ala Ala Ala Arg Gly Leu
340 345 350
Thr Leu Leu Phe Ala Ser Gly Asp Ser Gly Ala Gly Cys Trp Ser Val
355 360 365
Ser Gly Arg His Gln Phe Arg Pro Thr Phe Pro Ala Ser Ser Pro Tyr
370 375 380
Val Thr Thr Val Gly Gly Thr Ser Phe Gln Glu Pro Phe Leu Ile Thr
385 390 395 400
Asn Glu Ile Val Asp Tyr Ile Ser Gly Gly Gly Phe Ser Asn Val Phe
405 410 415
Pro Arg Pro Ser Tyr Gln Glu Glu Ala Val Thr Lys Phe Leu Ser Ser
420 425 430
Ser Pro His Leu Pro Pro Ser Ser Tyr Phe Asn Ala Ser Gly Arg Ala
435 440 445
Tyr Pro Asp Val Ala Ala Leu Ser Asp Gly Tyr Trp Val Val Ser Asn
450 455 460
Arg Val Pro Ile Pro Trp Val Ser Gly Thr Ser Ala Ser Thr Pro Val
465 470 475 480
Phe Gly Gly Ile Leu Ser Leu Ile Asn Glu His Arg Ile Leu Ser Gly
485 490 495
Arg Pro Pro Leu Gly Phe Leu Asn Pro Arg Leu Tyr Gln Gln His Gly
500 505 510
Ala Gly Leu Phe Asp Val Thr Arg Gly Cys His Glu Ser Cys Leu Asp
515 520 525
Glu Glu Val Glu Gly Gln Gly Phe Cys Ser Gly Pro Gly Trp Asp Pro
530 535 540
Val Thr Gly Trp Gly Thr Pro Asn Phe Pro Ala Leu Leu Lys Thr Leu
545 550 555 560
Leu Asn Pro
<210> 139
<211> 1236
<212> DNA
<213> Homo sapiens
<400> 139
atgctgctga agacagtgct cttgctggga catgtggccc aggtgctgat gctggacaat 60
gggctcctgc agacaccacc catgggctgg ctggcctggg aacgcttccg ctgcaacatt 120
aactgtgatg aggacccaaa gaactgcata agtgaacagc tcttcatgga gatggctgac 180
cggatggcac aggatggatg gcgggacatg ggctacacat acctcaacat tgatgactgc 240
tggatcggtg gtcgcgatgc cagtggccgc ctgatgccgg atcccaagcg cttccctcat 300
ggcattcctt tcctggctga ctacgttcac tccctgggcc tgaagttggg tatctacgcg 360
gacatgggca acttcacctg catgggttac ccaggcacca cactggacaa ggtggtccag 420
gatgctcaga ccttcgccga gtggaaggta gacatgctca agctggatgg ctgcttctcc 480
acccccgagg agcgggccca ggggtacccc aagatggctg ctgccctgaa tgccacaggc 540
cgccccatcg ccttctcctg cagctggcca gcctatgaag gcggcctccc cccaagggtg 600
aactacagtc tgctggcgga catctgcaac ctctggcgta actatgatga catccaggac 660
tcctggtgga gcgtgctctc catcctgaat tggttcgtgg agcaccagga catactgcag 720
ccagtggccg gccctgggca ctggaatgac cctgacatgc tgctcattgg gaactttggt 780
ctcagcttag agcaatcccg ggcccagatg gccctgtgga cggtgctggc agcccccctc 840
ttgatgtcca cagacctgcg taccatctcc gcccagaaca tggacattct gcagaatcca 900
ctcatgatca aaatcaacca ggatccctta ggcatccagg gacgcaggat tcacaaggaa 960
aaatctctca tcgaagtgta catgcggcct ctgtccaaca aggctagcgc cttagtcttc 1020
ttcagctgca ggaccgatat gccttatcgc taccactcct cccttggcca gctgaacttc 1080
accgggtctg tgatatatga ggcccaggac gtctactcag gtgacatcat cagtggcctc 1140
cgagatgaaa ccaacttcac agtgatcatc aacccttcag gggtagtgat gtggtacctg 1200
tatcccatca agaacctgga gatgtcccag cagtga 1236
<210> 140
<211> 411
<212> PRT
<213> Homo sapiens
<400> 140
Met Leu Leu Lys Thr Val Leu Leu Leu Gly His Val Ala Gln Val Leu
1 5 10 15
Met Leu Asp Asn Gly Leu Leu Gln Thr Pro Pro Met Gly Trp Leu Ala
20 25 30
Trp Glu Arg Phe Arg Cys Asn Ile Asn Cys Asp Glu Asp Pro Lys Asn
35 40 45
Cys Ile Ser Glu Gln Leu Phe Met Glu Met Ala Asp Arg Met Ala Gln
50 55 60
Asp Gly Trp Arg Asp Met Gly Tyr Thr Tyr Leu Asn Ile Asp Asp Cys
65 70 75 80
Trp Ile Gly Gly Arg Asp Ala Ser Gly Arg Leu Met Pro Asp Pro Lys
85 90 95
Arg Phe Pro His Gly Ile Pro Phe Leu Ala Asp Tyr Val His Ser Leu
100 105 110
Gly Leu Lys Leu Gly Ile Tyr Ala Asp Met Gly Asn Phe Thr Cys Met
115 120 125
Gly Tyr Pro Gly Thr Thr Leu Asp Lys Val Val Gln Asp Ala Gln Thr
130 135 140
Phe Ala Glu Trp Lys Val Asp Met Leu Lys Leu Asp Gly Cys Phe Ser
145 150 155 160
Thr Pro Glu Glu Arg Ala Gln Gly Tyr Pro Lys Met Ala Ala Ala Leu
165 170 175
Asn Ala Thr Gly Arg Pro Ile Ala Phe Ser Cys Ser Trp Pro Ala Tyr
180 185 190
Glu Gly Gly Leu Pro Pro Arg Val Asn Tyr Ser Leu Leu Ala Asp Ile
195 200 205
Cys Asn Leu Trp Arg Asn Tyr Asp Asp Ile Gln Asp Ser Trp Trp Ser
210 215 220
Val Leu Ser Ile Leu Asn Trp Phe Val Glu His Gln Asp Ile Leu Gln
225 230 235 240
Pro Val Ala Gly Pro Gly His Trp Asn Asp Pro Asp Met Leu Leu Ile
245 250 255
Gly Asn Phe Gly Leu Ser Leu Glu Gln Ser Arg Ala Gln Met Ala Leu
260 265 270
Trp Thr Val Leu Ala Ala Pro Leu Leu Met Ser Thr Asp Leu Arg Thr
275 280 285
Ile Ser Ala Gln Asn Met Asp Ile Leu Gln Asn Pro Leu Met Ile Lys
290 295 300
Ile Asn Gln Asp Pro Leu Gly Ile Gln Gly Arg Arg Ile His Lys Glu
305 310 315 320
Lys Ser Leu Ile Glu Val Tyr Met Arg Pro Leu Ser Asn Lys Ala Ser
325 330 335
Ala Leu Val Phe Phe Ser Cys Arg Thr Asp Met Pro Tyr Arg Tyr His
340 345 350
Ser Ser Leu Gly Gln Leu Asn Phe Thr Gly Ser Val Ile Tyr Glu Ala
355 360 365
Gln Asp Val Tyr Ser Gly Asp Ile Ile Ser Gly Leu Arg Asp Glu Thr
370 375 380
Asn Phe Thr Val Ile Ile Asn Pro Ser Gly Val Val Met Trp Tyr Leu
385 390 395 400
Tyr Pro Ile Lys Asn Leu Glu Met Ser Gln Gln
405 410
<210> 141
<211> 1488
<212> DNA
<213> Homo sapiens
<400> 141
atgaggtctc cggttcgaga cctggcccgg aacgatggcg aggagagcac ggaccgcacg 60
cctcttctac cgggcgcccc acgggccgaa gccgctccag tgtgctgctc tgctcgttac 120
aacttagcaa ttttggcctt ttttggtttc ttcattgtgt atgcattacg tgtgaatctg 180
agtgttgcgt tagtggatat ggtagattca aatacaactt tagaagataa tagaacttcc 240
aaggcgtgtc cagagcattc tgctcccata aaagttcatc ataatcaaac gggtaagaag 300
taccaatggg atgcagaaac tcaaggatgg attctcggtt ccttttttta tggctacatc 360
atcacacaga ttcctggagg atatgttgcc agcaaaatag gggggaaaat gctgctagga 420
tttgggatcc ttggcactgc tgtcctcacc ctgttcactc ccattgctgc agatttagga 480
gttggaccac tcattgtact cagagcacta gaaggactag gagagggtgt tacatttcca 540
gccatgcatg ccatgtggtc ttcttgggct ccccctcttg aaagaagcaa acttcttagc 600
atttcatatg caggagcaca gcttgggaca gtaatttctc ttcctctttc tggaataatt 660
tgctactata tgaattggac ttatgtcttc tacttttttg gtactattgg aatattttgg 720
tttcttttgt ggatctggtt agttagtgac acaccacaaa aacacaagag aatttcccat 780
tatgaaaagg aatacattct ttcatcatta agaaatcagc tttcttcaca gaagtcagtg 840
ccgtgggtac ccattttaaa atccctgcca ctttgggcta tcgtagttgc acacttttct 900
tacaactgga ctttttatac tttattgaca ttattgccta cttatatgaa ggagatccta 960
aggttcaatg ttcaagagaa tgggttttta tcttcattgc cttatttagg ctcttggtta 1020
tgtatgatcc tgtctggtca agctgctgac aatttaaggg caaaatggaa tttttcaact 1080
ttatgtgttc gcagaatttt tagccttata ggaatgattg gacctgcagt attcctggta 1140
gctgctggct tcattggctg tgattattct ttggccgttg ctttcctaac tatatcaaca 1200
acactgggag gcttttgctc ttctggattt agcatcaacc atctggatat tgctccttcg 1260
tatgctggta tcctcctggg catcacaaat acatttgcca ctattccagg aatggttggg 1320
cccgtcattg ctaaaagtct gacccctgat aacactgttg gagaatggca aaccgtgttc 1380
tatattgctg ctgctattaa tgtttttggt gccattttct ttacactatt cgccaaaggt 1440
gaagtacaaa actgggctct caatgatcac catggacaca gacactga 1488
<210> 142
<211> 495
<212> PRT
<213> Homo sapiens
<400> 142
Met Arg Ser Pro Val Arg Asp Leu Ala Arg Asn Asp Gly Glu Glu Ser
1 5 10 15
Thr Asp Arg Thr Pro Leu Leu Pro Gly Ala Pro Arg Ala Glu Ala Ala
20 25 30
Pro Val Cys Cys Ser Ala Arg Tyr Asn Leu Ala Ile Leu Ala Phe Phe
35 40 45
Gly Phe Phe Ile Val Tyr Ala Leu Arg Val Asn Leu Ser Val Ala Leu
50 55 60
Val Asp Met Val Asp Ser Asn Thr Thr Leu Glu Asp Asn Arg Thr Ser
65 70 75 80
Lys Ala Cys Pro Glu His Ser Ala Pro Ile Lys Val His His Asn Gln
85 90 95
Thr Gly Lys Lys Tyr Gln Trp Asp Ala Glu Thr Gln Gly Trp Ile Leu
100 105 110
Gly Ser Phe Phe Tyr Gly Tyr Ile Ile Thr Gln Ile Pro Gly Gly Tyr
115 120 125
Val Ala Ser Lys Ile Gly Gly Lys Met Leu Leu Gly Phe Gly Ile Leu
130 135 140
Gly Thr Ala Val Leu Thr Leu Phe Thr Pro Ile Ala Ala Asp Leu Gly
145 150 155 160
Val Gly Pro Leu Ile Val Leu Arg Ala Leu Glu Gly Leu Gly Glu Gly
165 170 175
Val Thr Phe Pro Ala Met His Ala Met Trp Ser Ser Trp Ala Pro Pro
180 185 190
Leu Glu Arg Ser Lys Leu Leu Ser Ile Ser Tyr Ala Gly Ala Gln Leu
195 200 205
Gly Thr Val Ile Ser Leu Pro Leu Ser Gly Ile Ile Cys Tyr Tyr Met
210 215 220
Asn Trp Thr Tyr Val Phe Tyr Phe Phe Gly Thr Ile Gly Ile Phe Trp
225 230 235 240
Phe Leu Leu Trp Ile Trp Leu Val Ser Asp Thr Pro Gln Lys His Lys
245 250 255
Arg Ile Ser His Tyr Glu Lys Glu Tyr Ile Leu Ser Ser Leu Arg Asn
260 265 270
Gln Leu Ser Ser Gln Lys Ser Val Pro Trp Val Pro Ile Leu Lys Ser
275 280 285
Leu Pro Leu Trp Ala Ile Val Val Ala His Phe Ser Tyr Asn Trp Thr
290 295 300
Phe Tyr Thr Leu Leu Thr Leu Leu Pro Thr Tyr Met Lys Glu Ile Leu
305 310 315 320
Arg Phe Asn Val Gln Glu Asn Gly Phe Leu Ser Ser Leu Pro Tyr Leu
325 330 335
Gly Ser Trp Leu Cys Met Ile Leu Ser Gly Gln Ala Ala Asp Asn Leu
340 345 350
Arg Ala Lys Trp Asn Phe Ser Thr Leu Cys Val Arg Arg Ile Phe Ser
355 360 365
Leu Ile Gly Met Ile Gly Pro Ala Val Phe Leu Val Ala Ala Gly Phe
370 375 380
Ile Gly Cys Asp Tyr Ser Leu Ala Val Ala Phe Leu Thr Ile Ser Thr
385 390 395 400
Thr Leu Gly Gly Phe Cys Ser Ser Gly Phe Ser Ile Asn His Leu Asp
405 410 415
Ile Ala Pro Ser Tyr Ala Gly Ile Leu Leu Gly Ile Thr Asn Thr Phe
420 425 430
Ala Thr Ile Pro Gly Met Val Gly Pro Val Ile Ala Lys Ser Leu Thr
435 440 445
Pro Asp Asn Thr Val Gly Glu Trp Gln Thr Val Phe Tyr Ile Ala Ala
450 455 460
Ala Ile Asn Val Phe Gly Ala Ile Phe Phe Thr Leu Phe Ala Lys Gly
465 470 475 480
Glu Val Gln Asn Trp Ala Leu Asn Asp His His Gly His Arg His
485 490 495
<210> 143
<211> 2010
<212> DNA
<213> Homo sapiens
<400> 143
atggctcggc gcggctggcg gcgggcaccc ctccgccgtg gcgtcggcag cagtccccga 60
gcccgcaggc tcatgcggcc cctttggttg ctcctcgcag tgggcgtctt tgactgggca 120
ggggcttcgg acggcggcgg cggagaggct agagccatgg acgaggagat cgtgtccgag 180
aagcaagccg aggagagcca ccggcaggac agcgccaacc tgctcatctt catcctgctg 240
ctcaccctca ccattctcac aatctggctc ttcaagcacc gccgggcccg cttcctgcac 300
gaaaccggcc tggctatgat ttatggtctt ttggtgggcc ttgtgcttcg gtatggcatt 360
catgttccga gtgatgtaaa taatgtgacc ctgagctgtg aagtgcagtc aagtccaact 420
accttactgg ttacttttga tccagaagta tttttcaaca tattacttcc tcctatcata 480
ttttatgcag gttatagcct gaaaaggaga catttttttc gaaatcttgg gtctatccta 540
gcatacgctt ttcttggaac agcaatttct tgtttcgtta ttgggtcaat aatgtatggc 600
tgtgtaacgc tgatgaaggt aacgggacaa cttgcaggag atttttactt tacagattgc 660
ctactgtttg gtgccattgt atcagcaact gatccagtga ctgttcttgc tatattccac 720
gagcttcaag ttgatgttga actctatgca cttctttttg gtgaaagtgt cctcaatgat 780
gctgttgcca tagtgctgtc ctcctcaata gtggcatacc agccagctgg agacaacagt 840
cacacctttg atgtcacagc gatgttcaag tctattggga tcttccttgg aatcttcagt 900
ggatcttttg caatgggtgc tgctactgga gtggtgacag ctttagtgac aaagttcacc 960
aaattacggg agttccagtt gttggagaca ggcctgttct tcttgatgtc ctggagtacc 1020
ttcctcttgg ctgaagcatg gggcttcaca ggtgtagttg cagtattgtt ttgtggcatc 1080
acacaagcac attatacgta taataatttg tccacggagt ctcagcatag aactaaacag 1140
ttgtttgagc ttctcaattt cttggcagag aatttcatct tctcctacat ggggctgaca 1200
ctgttcacct tccagaacca tgtctttaac ccaacatttg tagtaggagc atttgttgct 1260
attttcttgg gaagagctgc caatatttac cccttgtccc tcttacttaa tttgggtaga 1320
agaagtaaga ttggatcaaa ttttcaacac atgatgatgt ttgctggcct tcgtggtgca 1380
atggcatttg ccttggccat tcgagatact gccacttatg cacggcaaat gatgttcagc 1440
accacgcttc tgattgtgtt ttttaccgtg tgggtatttg gtggtggcac cactgcaatg 1500
ctgtcatgct tgcatatcag ggttggtgtt gattcagacc aagaacactt gggtgttcct 1560
gaaaatgaaa ggagaactac caaagcagag agtgcttggc ttttccggat gtggtacaac 1620
tttgatcata actatctgaa gcctctgctg acccacagcg ggcctccgct gacaacaaca 1680
ctccctgcct gctgtggacc catcgccagg tgcctcacca gcccccaggc ttacgaaaac 1740
caggaacagt tgaaagatga tgattctgat cttattctca atgatggtga catcagtttg 1800
acatatggag attctactgt gaacactgaa ccggccacat ccagcgcccc aaggagattt 1860
atgggaaaca gttctgaaga tgccttggat cgggagcttg catttgggga ccatgaactg 1920
gtcattcgag gaacacgcct ggttcttcca atggatgatt ctgaaccccc gctaaatttg 1980
ttagataata cgagacatgg tccagcctaa 2010
<210> 144
<211> 669
<212> PRT
<213> Homo sapiens
<400> 144
Met Ala Arg Arg Gly Trp Arg Arg Ala Pro Leu Arg Arg Gly Val Gly
1 5 10 15
Ser Ser Pro Arg Ala Arg Arg Leu Met Arg Pro Leu Trp Leu Leu Leu
20 25 30
Ala Val Gly Val Phe Asp Trp Ala Gly Ala Ser Asp Gly Gly Gly Gly
35 40 45
Glu Ala Arg Ala Met Asp Glu Glu Ile Val Ser Glu Lys Gln Ala Glu
50 55 60
Glu Ser His Arg Gln Asp Ser Ala Asn Leu Leu Ile Phe Ile Leu Leu
65 70 75 80
Leu Thr Leu Thr Ile Leu Thr Ile Trp Leu Phe Lys His Arg Arg Ala
85 90 95
Arg Phe Leu His Glu Thr Gly Leu Ala Met Ile Tyr Gly Leu Leu Val
100 105 110
Gly Leu Val Leu Arg Tyr Gly Ile His Val Pro Ser Asp Val Asn Asn
115 120 125
Val Thr Leu Ser Cys Glu Val Gln Ser Ser Pro Thr Thr Leu Leu Val
130 135 140
Thr Phe Asp Pro Glu Val Phe Phe Asn Ile Leu Leu Pro Pro Ile Ile
145 150 155 160
Phe Tyr Ala Gly Tyr Ser Leu Lys Arg Arg His Phe Phe Arg Asn Leu
165 170 175
Gly Ser Ile Leu Ala Tyr Ala Phe Leu Gly Thr Ala Ile Ser Cys Phe
180 185 190
Val Ile Gly Ser Ile Met Tyr Gly Cys Val Thr Leu Met Lys Val Thr
195 200 205
Gly Gln Leu Ala Gly Asp Phe Tyr Phe Thr Asp Cys Leu Leu Phe Gly
210 215 220
Ala Ile Val Ser Ala Thr Asp Pro Val Thr Val Leu Ala Ile Phe His
225 230 235 240
Glu Leu Gln Val Asp Val Glu Leu Tyr Ala Leu Leu Phe Gly Glu Ser
245 250 255
Val Leu Asn Asp Ala Val Ala Ile Val Leu Ser Ser Ser Ile Val Ala
260 265 270
Tyr Gln Pro Ala Gly Asp Asn Ser His Thr Phe Asp Val Thr Ala Met
275 280 285
Phe Lys Ser Ile Gly Ile Phe Leu Gly Ile Phe Ser Gly Ser Phe Ala
290 295 300
Met Gly Ala Ala Thr Gly Val Val Thr Ala Leu Val Thr Lys Phe Thr
305 310 315 320
Lys Leu Arg Glu Phe Gln Leu Leu Glu Thr Gly Leu Phe Phe Leu Met
325 330 335
Ser Trp Ser Thr Phe Leu Leu Ala Glu Ala Trp Gly Phe Thr Gly Val
340 345 350
Val Ala Val Leu Phe Cys Gly Ile Thr Gln Ala His Tyr Thr Tyr Asn
355 360 365
Asn Leu Ser Thr Glu Ser Gln His Arg Thr Lys Gln Leu Phe Glu Leu
370 375 380
Leu Asn Phe Leu Ala Glu Asn Phe Ile Phe Ser Tyr Met Gly Leu Thr
385 390 395 400
Leu Phe Thr Phe Gln Asn His Val Phe Asn Pro Thr Phe Val Val Gly
405 410 415
Ala Phe Val Ala Ile Phe Leu Gly Arg Ala Ala Asn Ile Tyr Pro Leu
420 425 430
Ser Leu Leu Leu Asn Leu Gly Arg Arg Ser Lys Ile Gly Ser Asn Phe
435 440 445
Gln His Met Met Met Phe Ala Gly Leu Arg Gly Ala Met Ala Phe Ala
450 455 460
Leu Ala Ile Arg Asp Thr Ala Thr Tyr Ala Arg Gln Met Met Phe Ser
465 470 475 480
Thr Thr Leu Leu Ile Val Phe Phe Thr Val Trp Val Phe Gly Gly Gly
485 490 495
Thr Thr Ala Met Leu Ser Cys Leu His Ile Arg Val Gly Val Asp Ser
500 505 510
Asp Gln Glu His Leu Gly Val Pro Glu Asn Glu Arg Arg Thr Thr Lys
515 520 525
Ala Glu Ser Ala Trp Leu Phe Arg Met Trp Tyr Asn Phe Asp His Asn
530 535 540
Tyr Leu Lys Pro Leu Leu Thr His Ser Gly Pro Pro Leu Thr Thr Thr
545 550 555 560
Leu Pro Ala Cys Cys Gly Pro Ile Ala Arg Cys Leu Thr Ser Pro Gln
565 570 575
Ala Tyr Glu Asn Gln Glu Gln Leu Lys Asp Asp Asp Ser Asp Leu Ile
580 585 590
Leu Asn Asp Gly Asp Ile Ser Leu Thr Tyr Gly Asp Ser Thr Val Asn
595 600 605
Thr Glu Pro Ala Thr Ser Ser Ala Pro Arg Arg Phe Met Gly Asn Ser
610 615 620
Ser Glu Asp Ala Leu Asp Arg Glu Leu Ala Phe Gly Asp His Glu Leu
625 630 635 640
Val Ile Arg Gly Thr Arg Leu Val Leu Pro Met Asp Asp Ser Glu Pro
645 650 655
Pro Leu Asn Leu Leu Asp Asn Thr Arg His Gly Pro Ala
660 665
<210> 145
<211> 2706
<212> DNA
<213> Homo sapiens
<400> 145
atggagccgc cgctcccggt cggagcccag ccgcttgcca ctgtcgaggg tatggagatg 60
aagggtcctc tccgggagcc ctgcgccctg accctagccc agaggaacgg gcaatatgag 120
ttaataatcc agttgcatga gaaggaacag catgttcaag atatcattcc tataaatagc 180
cacttcagat gtgttcaaga agcagaagaa actcttttga ttgacatagc ttctaacagt 240
ggctgcaaaa ttcgggttca gggggactgg atcagagagc gccgctttga aatccctgat 300
gaggaacact gtttgaagtt cctctcagct gtccttgctg ctcagaaagc tcagtcacag 360
cttcttgttc cagagcaaaa ggactcatct agctggtacc agaaattaga cactaaggac 420
aaaccttctg ttttttcagg gcttcttgga tttgaagaca atttttcttc tatgaatttg 480
gacaagaaaa taaattcaca aaatcagcct actgggattc atcgggaacc cccacctcca 540
cccttttcag tgaataaaat gcttccacgt gaaaaagaag cttctaacaa ggagcagccc 600
aaagtgacca acaccatgcg gaagctcttt gtaccaaata cccaatctgg gcagcgggag 660
ggtctcatca aacatatcct ggcaaagcga gagaaagaat atgtcaacat tcagactttc 720
agattttttg ttggaacttg gaatgtgaat ggccagtctc cagatagcgg gttagaacct 780
tggctgaact gtgatcccaa tcctcctgat atctactgca ttggattcca agaactggac 840
ttgagcacag aagccttctt ctactttgaa tctgtgaagg aacaagaatg gtccatggct 900
gtagagagag gtttgcattc caaagccaag tataagaaag ttcaactggt gcgccttgtt 960
gggatgatgc ttcttatatt tgccagaaag gatcagtgtc gatacattcg tgatattgct 1020
acagaaacag ttggaactgg aatcatgggg aaaatgggaa acaaaggtgg ggtagctgtg 1080
agatttgtat ttcacaacac caccttttgc attgtcaatt cccatctggc tgcacacgtg 1140
gaggactttg agagaaggaa tcaagattat aaggacattt gtgcgagaat gagttttgtg 1200
gtcccaaatc agaccctccc gcagttgaac atcatgaaac atgaggttgt catttggttg 1260
ggagatttga attatagact ttgcatgcct gatgccaatg aggtgaaaag tcttattaat 1320
aagaaagacc ttcagagact cttgaaattc gaccagctaa atattcagcg cacacagaaa 1380
aaagcttttg ttgacttcaa tgaaggggaa atcaagttca tccccactta taagtatgac 1440
tctaaaacag accggtggga ttccagtggg aaatgccggg ttccagcctg gtgtgaccga 1500
attctttgga gaggaacaaa tgttaatcag cttaattatc ggagtcacat ggaactgaaa 1560
accagcgacc acaagcctgt tagcgccctc ttccatattg gggtgaaggt tgtggatgaa 1620
cgaaggtacc ggaaagtctt tgaagatagt gtacgcatca tggacagaat ggaaaatgac 1680
ttccttcctt ccttagaact cagcaggagg gagtttgtgt ttgaaaatgt gaagtttcgg 1740
caactacaaa aggagaagtt ccagatcagc aacaatggac aggttccctg ccatttttct 1800
ttcatcccta aacttaatga cagccagtac tgcaagccat ggcttcgggc tgaacctttt 1860
gagggctact tggagccaaa tgagacagtg gacatttctc ttgatgtgta tgtcagcaaa 1920
gactctgtaa ccatcctgaa ctcgggagaa gataagattg aagatattct cgtccttcac 1980
ctggatcgag gcaaagatta cttcttgact atcagtggaa attacctccc aagttgtttt 2040
ggcacatcct tagaggctct gtgccgtatg aaaagaccaa tccgagaagt tcctgttacc 2100
aaactcatag acttggaaga agacagcttc ctagaaaagg agaaatccct tctgcaaatg 2160
gttcctttgg atgaaggtgc cagtgagaga ccccttcagg ttcccaagga gatctggctt 2220
ctagtagatc acctattcaa atacgcctgt caccaggagg acctgttcca gacccctgga 2280
atgcaggaag agctccagca gatcattgat tgtctggata ccagcattcc tgagacaatc 2340
cctggcagca accactctgt ggctgaagca ctgctcattt tcttggaagc cctgccagag 2400
ccagtcatct gttacgagct gtatcagcga tgtcttgact ctgcttatga tccccggatc 2460
tgccgacagg tgatctccca gcttccgaga tgccatagaa atgttttccg ttacttgatg 2520
gcattccttc gagaactctt aaaattctct gaatacaata gcgtcaatgc caacatgatc 2580
gctactctct tcactagtct tctcctgagg cctccaccca accttatggc aagacagact 2640
ccaagtgacc gccagcgtgc tattcagttc cttctgggct ttctgcttgg gagcgaagaa 2700
gactaa 2706
<210> 146
<211> 901
<212> PRT
<213> Homo sapiens
<400> 146
Met Glu Pro Pro Leu Pro Val Gly Ala Gln Pro Leu Ala Thr Val Glu
1 5 10 15
Gly Met Glu Met Lys Gly Pro Leu Arg Glu Pro Cys Ala Leu Thr Leu
20 25 30
Ala Gln Arg Asn Gly Gln Tyr Glu Leu Ile Ile Gln Leu His Glu Lys
35 40 45
Glu Gln His Val Gln Asp Ile Ile Pro Ile Asn Ser His Phe Arg Cys
50 55 60
Val Gln Glu Ala Glu Glu Thr Leu Leu Ile Asp Ile Ala Ser Asn Ser
65 70 75 80
Gly Cys Lys Ile Arg Val Gln Gly Asp Trp Ile Arg Glu Arg Arg Phe
85 90 95
Glu Ile Pro Asp Glu Glu His Cys Leu Lys Phe Leu Ser Ala Val Leu
100 105 110
Ala Ala Gln Lys Ala Gln Ser Gln Leu Leu Val Pro Glu Gln Lys Asp
115 120 125
Ser Ser Ser Trp Tyr Gln Lys Leu Asp Thr Lys Asp Lys Pro Ser Val
130 135 140
Phe Ser Gly Leu Leu Gly Phe Glu Asp Asn Phe Ser Ser Met Asn Leu
145 150 155 160
Asp Lys Lys Ile Asn Ser Gln Asn Gln Pro Thr Gly Ile His Arg Glu
165 170 175
Pro Pro Pro Pro Pro Phe Ser Val Asn Lys Met Leu Pro Arg Glu Lys
180 185 190
Glu Ala Ser Asn Lys Glu Gln Pro Lys Val Thr Asn Thr Met Arg Lys
195 200 205
Leu Phe Val Pro Asn Thr Gln Ser Gly Gln Arg Glu Gly Leu Ile Lys
210 215 220
His Ile Leu Ala Lys Arg Glu Lys Glu Tyr Val Asn Ile Gln Thr Phe
225 230 235 240
Arg Phe Phe Val Gly Thr Trp Asn Val Asn Gly Gln Ser Pro Asp Ser
245 250 255
Gly Leu Glu Pro Trp Leu Asn Cys Asp Pro Asn Pro Pro Asp Ile Tyr
260 265 270
Cys Ile Gly Phe Gln Glu Leu Asp Leu Ser Thr Glu Ala Phe Phe Tyr
275 280 285
Phe Glu Ser Val Lys Glu Gln Glu Trp Ser Met Ala Val Glu Arg Gly
290 295 300
Leu His Ser Lys Ala Lys Tyr Lys Lys Val Gln Leu Val Arg Leu Val
305 310 315 320
Gly Met Met Leu Leu Ile Phe Ala Arg Lys Asp Gln Cys Arg Tyr Ile
325 330 335
Arg Asp Ile Ala Thr Glu Thr Val Gly Thr Gly Ile Met Gly Lys Met
340 345 350
Gly Asn Lys Gly Gly Val Ala Val Arg Phe Val Phe His Asn Thr Thr
355 360 365
Phe Cys Ile Val Asn Ser His Leu Ala Ala His Val Glu Asp Phe Glu
370 375 380
Arg Arg Asn Gln Asp Tyr Lys Asp Ile Cys Ala Arg Met Ser Phe Val
385 390 395 400
Val Pro Asn Gln Thr Leu Pro Gln Leu Asn Ile Met Lys His Glu Val
405 410 415
Val Ile Trp Leu Gly Asp Leu Asn Tyr Arg Leu Cys Met Pro Asp Ala
420 425 430
Asn Glu Val Lys Ser Leu Ile Asn Lys Lys Asp Leu Gln Arg Leu Leu
435 440 445
Lys Phe Asp Gln Leu Asn Ile Gln Arg Thr Gln Lys Lys Ala Phe Val
450 455 460
Asp Phe Asn Glu Gly Glu Ile Lys Phe Ile Pro Thr Tyr Lys Tyr Asp
465 470 475 480
Ser Lys Thr Asp Arg Trp Asp Ser Ser Gly Lys Cys Arg Val Pro Ala
485 490 495
Trp Cys Asp Arg Ile Leu Trp Arg Gly Thr Asn Val Asn Gln Leu Asn
500 505 510
Tyr Arg Ser His Met Glu Leu Lys Thr Ser Asp His Lys Pro Val Ser
515 520 525
Ala Leu Phe His Ile Gly Val Lys Val Val Asp Glu Arg Arg Tyr Arg
530 535 540
Lys Val Phe Glu Asp Ser Val Arg Ile Met Asp Arg Met Glu Asn Asp
545 550 555 560
Phe Leu Pro Ser Leu Glu Leu Ser Arg Arg Glu Phe Val Phe Glu Asn
565 570 575
Val Lys Phe Arg Gln Leu Gln Lys Glu Lys Phe Gln Ile Ser Asn Asn
580 585 590
Gly Gln Val Pro Cys His Phe Ser Phe Ile Pro Lys Leu Asn Asp Ser
595 600 605
Gln Tyr Cys Lys Pro Trp Leu Arg Ala Glu Pro Phe Glu Gly Tyr Leu
610 615 620
Glu Pro Asn Glu Thr Val Asp Ile Ser Leu Asp Val Tyr Val Ser Lys
625 630 635 640
Asp Ser Val Thr Ile Leu Asn Ser Gly Glu Asp Lys Ile Glu Asp Ile
645 650 655
Leu Val Leu His Leu Asp Arg Gly Lys Asp Tyr Phe Leu Thr Ile Ser
660 665 670
Gly Asn Tyr Leu Pro Ser Cys Phe Gly Thr Ser Leu Glu Ala Leu Cys
675 680 685
Arg Met Lys Arg Pro Ile Arg Glu Val Pro Val Thr Lys Leu Ile Asp
690 695 700
Leu Glu Glu Asp Ser Phe Leu Glu Lys Glu Lys Ser Leu Leu Gln Met
705 710 715 720
Val Pro Leu Asp Glu Gly Ala Ser Glu Arg Pro Leu Gln Val Pro Lys
725 730 735
Glu Ile Trp Leu Leu Val Asp His Leu Phe Lys Tyr Ala Cys His Gln
740 745 750
Glu Asp Leu Phe Gln Thr Pro Gly Met Gln Glu Glu Leu Gln Gln Ile
755 760 765
Ile Asp Cys Leu Asp Thr Ser Ile Pro Glu Thr Ile Pro Gly Ser Asn
770 775 780
His Ser Val Ala Glu Ala Leu Leu Ile Phe Leu Glu Ala Leu Pro Glu
785 790 795 800
Pro Val Ile Cys Tyr Glu Leu Tyr Gln Arg Cys Leu Asp Ser Ala Tyr
805 810 815
Asp Pro Arg Ile Cys Arg Gln Val Ile Ser Gln Leu Pro Arg Cys His
820 825 830
Arg Asn Val Phe Arg Tyr Leu Met Ala Phe Leu Arg Glu Leu Leu Lys
835 840 845
Phe Ser Glu Tyr Asn Ser Val Asn Ala Asn Met Ile Ala Thr Leu Phe
850 855 860
Thr Ser Leu Leu Leu Arg Pro Pro Pro Asn Leu Met Ala Arg Gln Thr
865 870 875 880
Pro Ser Asp Arg Gln Arg Ala Ile Gln Phe Leu Leu Gly Phe Leu Leu
885 890 895
Gly Ser Glu Glu Asp
900
<210> 147
<211> 2724
<212> DNA
<213> Homo sapiens
<400> 147
atgcccacgg ccgccgcccc catcatcagc tcggtccaga agctggttct gtatgagact 60
agagctagat actttctagt tgggagcaat aatgcagaaa cgaaatatcg tgtcttgaag 120
attgatagaa cagaaccaaa agatttggtc ataattgatg acaggcatgt ctatactcaa 180
caagaagtaa gggaacttct tggccgcttg gatcttggaa atagaacaaa gatgggacag 240
aaaggatcct cgggcttatt tcgagcggtt tcagcttttg gtgttgtggg ttttgtcagg 300
ttcttagaag gctattatat tgtgttaata actaaaagga ggaagatggc ggatattgga 360
ggtcatgcaa tctataaggt cgaagataca aatatgatct atatacccaa tgattctgta 420
cgggttactc atcctgatga agctaggtat ctacgaatat ttcaaaatgt ggacctatct 480
agcaattttt actttagtta cagctatgat ttgtcccact cacttcaata taatctcact 540
gtcttgcgaa tgcccctgga gatgttaaag tcagaaatga cccagaatcg ccaagagagc 600
tttgacatct ttgaagatga aggattaatt acacaaggtg gaagcggggt atttgggatc 660
tgtagtgagc cttatatgaa atatgtatgg aatggtgaac ttctggatat aattaaaagt 720
actgtgcatc gtgactggct tttgtatatt attcatgggt tctgtgggca gtcaaagctg 780
ttgatctatg gacgaccagt gtatgtcact ctaatagcta gaagatccag taaatttgct 840
ggcacccgtt ttcttaaaag aggtgcaaac tgtgagggtg atgttgcaaa tgaagtggag 900
actgaacaaa tactctgcga tgcttctgtg atgtctttca ctgcaggaag ttattcttca 960
tatgtacaag ttagaggatc tgtgccctta tactggtctc aggacatttc aactatgatg 1020
cctaaaccac ctattacatt ggatcaggca gatccatttg cacatgtggc tgcccttcac 1080
tttgaccaga tgttccagag gtttggctct cccatcatca tcttgaattt agtgaaggaa 1140
cgagagaaaa gaaagcatga aagaattctg agtgaagaac ttgttgctgc tgtgacctat 1200
ctcaaccaat ttttgcctcc tgagcacact attgtttata ttccctggga catggccaag 1260
tataccaaaa gcaagctgtg taatgttctt gatcgactaa atgtgattgc agaaagtgtg 1320
gtgaagaaaa caggtttctt tgtaaaccgc cctgattctt actgtagcat tttgcggcca 1380
gatgaaaagt ggaatgaact aggaggatgt gtgattccca ctggtcgcct gcagactggc 1440
atccttcgaa ccaactgtgt ggactgttta gatcgcacca acacagcaca gtttatggtg 1500
ggaaaatgtg ctctggccta tcagctgtat tcactgggac tgattgacaa acctaatcta 1560
cagtttgata cagatgcagt taggttattt gaggaactct atgaagatca tggtgatacc 1620
ctatcccttc agtatggtgg ttctcaactt gttcatcgtg tgaaaaccta cagaaagata 1680
gcaccatgga cccagcactc caaagacatc atgcaaaccc tgtctagata ttacagcaat 1740
gctttttcag atgccgatag acaagattcc attaatctct tcctgggagt tttccatccc 1800
actgaaggga aacctcatct ctgggagctc ccaacagatt tttatttgca tcacaaaaat 1860
accatgagac ttttgccaac aagaagaagt tatacttact ggtggacacc agaggtgata 1920
aagcatttac cattgcccta tgatgaagtt atctgtgctg tgaacttaaa gaagttgata 1980
gtgaagaaat tccacaaata tgaagaagag attgatatcc acaatgagtt ctttcggcca 2040
tatgagttga gcagctttga tgataccttt tgcttggcta tgacaagctc agcacgtgac 2100
tttatgccta agaccgttgg aattgatcca agtccattta ctgtgcgtaa accagatgaa 2160
actggaaaat cagtattggg aaacaaaagc aatagagaag aagctgtatt acagcggaaa 2220
acggcagcca gcgccccgcc gccccccagc gaggaggctg tgtccagcag ctctgaggat 2280
gactctggga ctgatcggga agaagagggc tctgtgtctc agcgctccac tcccgtgaag 2340
atgactgatg caggagacag tgccaaagtg accgagaatg tggtccaacc catgaaggag 2400
ctatatggaa ttaacctctc agatggcctc tcagaagaag atttctccat ttattcaaga 2460
tttgttcagc tggggcagag tcaacataaa caagacaaga atagccagca gccctgttct 2520
aggtgctcag atggagttat aaaactaaca cccatctcgg ctttctcgca agataacatc 2580
tatgaagttc agcccccaag agtagacaga aaatctacag agatcttcca agcccacatc 2640
caggccagcc aaggtatcat gcagccccta ggaaaagagg actcctccat gtaccgagag 2700
tacatcagga accgctacct gtga 2724
<210> 148
<211> 907
<212> PRT
<213> Homo sapiens
<400> 148
Met Pro Thr Ala Ala Ala Pro Ile Ile Ser Ser Val Gln Lys Leu Val
1 5 10 15
Leu Tyr Glu Thr Arg Ala Arg Tyr Phe Leu Val Gly Ser Asn Asn Ala
20 25 30
Glu Thr Lys Tyr Arg Val Leu Lys Ile Asp Arg Thr Glu Pro Lys Asp
35 40 45
Leu Val Ile Ile Asp Asp Arg His Val Tyr Thr Gln Gln Glu Val Arg
50 55 60
Glu Leu Leu Gly Arg Leu Asp Leu Gly Asn Arg Thr Lys Met Gly Gln
65 70 75 80
Lys Gly Ser Ser Gly Leu Phe Arg Ala Val Ser Ala Phe Gly Val Val
85 90 95
Gly Phe Val Arg Phe Leu Glu Gly Tyr Tyr Ile Val Leu Ile Thr Lys
100 105 110
Arg Arg Lys Met Ala Asp Ile Gly Gly His Ala Ile Tyr Lys Val Glu
115 120 125
Asp Thr Asn Met Ile Tyr Ile Pro Asn Asp Ser Val Arg Val Thr His
130 135 140
Pro Asp Glu Ala Arg Tyr Leu Arg Ile Phe Gln Asn Val Asp Leu Ser
145 150 155 160
Ser Asn Phe Tyr Phe Ser Tyr Ser Tyr Asp Leu Ser His Ser Leu Gln
165 170 175
Tyr Asn Leu Thr Val Leu Arg Met Pro Leu Glu Met Leu Lys Ser Glu
180 185 190
Met Thr Gln Asn Arg Gln Glu Ser Phe Asp Ile Phe Glu Asp Glu Gly
195 200 205
Leu Ile Thr Gln Gly Gly Ser Gly Val Phe Gly Ile Cys Ser Glu Pro
210 215 220
Tyr Met Lys Tyr Val Trp Asn Gly Glu Leu Leu Asp Ile Ile Lys Ser
225 230 235 240
Thr Val His Arg Asp Trp Leu Leu Tyr Ile Ile His Gly Phe Cys Gly
245 250 255
Gln Ser Lys Leu Leu Ile Tyr Gly Arg Pro Val Tyr Val Thr Leu Ile
260 265 270
Ala Arg Arg Ser Ser Lys Phe Ala Gly Thr Arg Phe Leu Lys Arg Gly
275 280 285
Ala Asn Cys Glu Gly Asp Val Ala Asn Glu Val Glu Thr Glu Gln Ile
290 295 300
Leu Cys Asp Ala Ser Val Met Ser Phe Thr Ala Gly Ser Tyr Ser Ser
305 310 315 320
Tyr Val Gln Val Arg Gly Ser Val Pro Leu Tyr Trp Ser Gln Asp Ile
325 330 335
Ser Thr Met Met Pro Lys Pro Pro Ile Thr Leu Asp Gln Ala Asp Pro
340 345 350
Phe Ala His Val Ala Ala Leu His Phe Asp Gln Met Phe Gln Arg Phe
355 360 365
Gly Ser Pro Ile Ile Ile Leu Asn Leu Val Lys Glu Arg Glu Lys Arg
370 375 380
Lys His Glu Arg Ile Leu Ser Glu Glu Leu Val Ala Ala Val Thr Tyr
385 390 395 400
Leu Asn Gln Phe Leu Pro Pro Glu His Thr Ile Val Tyr Ile Pro Trp
405 410 415
Asp Met Ala Lys Tyr Thr Lys Ser Lys Leu Cys Asn Val Leu Asp Arg
420 425 430
Leu Asn Val Ile Ala Glu Ser Val Val Lys Lys Thr Gly Phe Phe Val
435 440 445
Asn Arg Pro Asp Ser Tyr Cys Ser Ile Leu Arg Pro Asp Glu Lys Trp
450 455 460
Asn Glu Leu Gly Gly Cys Val Ile Pro Thr Gly Arg Leu Gln Thr Gly
465 470 475 480
Ile Leu Arg Thr Asn Cys Val Asp Cys Leu Asp Arg Thr Asn Thr Ala
485 490 495
Gln Phe Met Val Gly Lys Cys Ala Leu Ala Tyr Gln Leu Tyr Ser Leu
500 505 510
Gly Leu Ile Asp Lys Pro Asn Leu Gln Phe Asp Thr Asp Ala Val Arg
515 520 525
Leu Phe Glu Glu Leu Tyr Glu Asp His Gly Asp Thr Leu Ser Leu Gln
530 535 540
Tyr Gly Gly Ser Gln Leu Val His Arg Val Lys Thr Tyr Arg Lys Ile
545 550 555 560
Ala Pro Trp Thr Gln His Ser Lys Asp Ile Met Gln Thr Leu Ser Arg
565 570 575
Tyr Tyr Ser Asn Ala Phe Ser Asp Ala Asp Arg Gln Asp Ser Ile Asn
580 585 590
Leu Phe Leu Gly Val Phe His Pro Thr Glu Gly Lys Pro His Leu Trp
595 600 605
Glu Leu Pro Thr Asp Phe Tyr Leu His His Lys Asn Thr Met Arg Leu
610 615 620
Leu Pro Thr Arg Arg Ser Tyr Thr Tyr Trp Trp Thr Pro Glu Val Ile
625 630 635 640
Lys His Leu Pro Leu Pro Tyr Asp Glu Val Ile Cys Ala Val Asn Leu
645 650 655
Lys Lys Leu Ile Val Lys Lys Phe His Lys Tyr Glu Glu Glu Ile Asp
660 665 670
Ile His Asn Glu Phe Phe Arg Pro Tyr Glu Leu Ser Ser Phe Asp Asp
675 680 685
Thr Phe Cys Leu Ala Met Thr Ser Ser Ala Arg Asp Phe Met Pro Lys
690 695 700
Thr Val Gly Ile Asp Pro Ser Pro Phe Thr Val Arg Lys Pro Asp Glu
705 710 715 720
Thr Gly Lys Ser Val Leu Gly Asn Lys Ser Asn Arg Glu Glu Ala Val
725 730 735
Leu Gln Arg Lys Thr Ala Ala Ser Ala Pro Pro Pro Pro Ser Glu Glu
740 745 750
Ala Val Ser Ser Ser Ser Glu Asp Asp Ser Gly Thr Asp Arg Glu Glu
755 760 765
Glu Gly Ser Val Ser Gln Arg Ser Thr Pro Val Lys Met Thr Asp Ala
770 775 780
Gly Asp Ser Ala Lys Val Thr Glu Asn Val Val Gln Pro Met Lys Glu
785 790 795 800
Leu Tyr Gly Ile Asn Leu Ser Asp Gly Leu Ser Glu Glu Asp Phe Ser
805 810 815
Ile Tyr Ser Arg Phe Val Gln Leu Gly Gln Ser Gln His Lys Gln Asp
820 825 830
Lys Asn Ser Gln Gln Pro Cys Ser Arg Cys Ser Asp Gly Val Ile Lys
835 840 845
Leu Thr Pro Ile Ser Ala Phe Ser Gln Asp Asn Ile Tyr Glu Val Gln
850 855 860
Pro Pro Arg Val Asp Arg Lys Ser Thr Glu Ile Phe Gln Ala His Ile
865 870 875 880
Gln Ala Ser Gln Gly Ile Met Gln Pro Leu Gly Lys Glu Asp Ser Ser
885 890 895
Met Tyr Arg Glu Tyr Ile Arg Asn Arg Tyr Leu
900 905
<210> 149
<211> 2241
<212> DNA
<213> Homo sapiens
<400> 149
atggacttct tggaggagcc aatccctggt gtagggacct atgatgattt caatacaatt 60
gattgggtga gagagaagtc tcgagaccgg gataggcacc gagagattac caataaaagc 120
aaagagtcaa catgggcctt aattcacagt gtgagtgatg ctttttccgg ctggttgttg 180
atgctcctta ttgggctttt atcaggttcg ttagctggtt tgatagacat ctctgctcat 240
tggatgacag acttaaaaga aggtatatgc acagggggat tctggtttaa ccatgaacat 300
tgttgctgga actctgagca tgtcaccttt gaagagagag acaaatgtcc agagtggaat 360
agttggtccc agcttatcat cagcacagat gagggagcct ttgcctacat agtcaattat 420
ttcatgtacg tcctctgggc tctcctattt gccttccttg ccgtatctct tgtcaaggtg 480
tttgcgcctt atgcctgtgg ctctggaatc cctgagataa aaactatctt gagtggtttc 540
attattaggg gctatttggg taagtggact ctggttatca aaaccatcac cttggtgctg 600
gcagtgtcat ctggcttgag cctgggcaaa gagggccctc tagtgcacgt ggcttgctgc 660
tgtgggaaca tcctgtgcca ctgcttcaac aaatacagga agaatgaagc caagcgcaga 720
gaggtcttgt cggctgcagc agcagctggt gtatctgtag cctttggagc acctataggt 780
ggagtattat tcagccttga agaggtcagc tactattttc ccctcaaaac attgtggcgt 840
tcattctttg ctgccttggt ggcagcattc actctacgct ccatcaatcc atttgggaac 900
agccgcctgg tactatttta tgtggagttt cacaccccat ggcatctctt tgagctcgtg 960
ccattcattc tgctgggcat atttggtggt ctgtggggag cactgtttat ccgcacaaac 1020
attgcctggt gtcggaagcg aaagaccacc cagttgggca agtatcctgt tatagaggta 1080
ctcgtcgtga cagccatcac tgccatcctg gctttcccca atgaatacac tcggatgagc 1140
acaagtgagc tcatttctga gctgtttaat gactgtggcc ttctggactc ctccaagctc 1200
tgtgattatg agaaccgttt caacacaagc aaagggggtg aactgcctga cagaccggct 1260
ggcgtgggag tctacagtgc aatgtggcag ctggctttaa cactcatact gaaaattgtc 1320
attactatat tcacctttgg catgaagatc ccttctggcc tctttatccc tagcatggct 1380
gttggtgcta tagcaggtcg acttctagga gtaggaatgg aacagctggc ttattaccac 1440
caggaatgga ccgtcttcaa tagctggtgt agtcagggag ctgattgcat cacccccggc 1500
ctttatgcaa tggttggggc tgcagcctgc ttaggtgggg tgactcggat gactgtttct 1560
cttgttgtca taatgtttga actgactggt ggcttagaat acatcgtgcc tctgatggct 1620
gcagccatga caagcaagtg ggtggcagat gctcttgggc gggagggcat ctatgatgcc 1680
cacatccgtc tcaatggata cccctttctt gaagccaaag aagagtttgc tcataagacc 1740
ctggcaatgg atgtgatgaa accccggaga aatgatcctt tgttgactgt ccttactcag 1800
gacagtatga ctgtggaaga tgtagagacc ataatcagtg aaaccactta cagtggcttc 1860
ccagtggtgg tatcccggga gtcccaaaga cttgtgggct ttgtcctccg aagagatctc 1920
attatttcaa ttgaaaatgc tcgaaagaaa caggatgggg ttgttagcac ttccatcatt 1980
tatttcacgg agcattctcc tccattgcca ccatacactc cacccactct aaagcttcgg 2040
aacatcctcg atctcagccc cttcactgtg actgacctta cacccatgga gatcgtagtg 2100
gatattttcc gaaagctggg actgcggcag tgcctggtta cacacaacgg gcgattgctt 2160
ggaatcatta ccaaaaagga tgtgttaaag catatagcac agatggcgaa ccaagatcct 2220
gattccattc tcttcaacta g 2241
<210> 150
<211> 746
<212> PRT
<213> Homo sapiens
<400> 150
Met Asp Phe Leu Glu Glu Pro Ile Pro Gly Val Gly Thr Tyr Asp Asp
1 5 10 15
Phe Asn Thr Ile Asp Trp Val Arg Glu Lys Ser Arg Asp Arg Asp Arg
20 25 30
His Arg Glu Ile Thr Asn Lys Ser Lys Glu Ser Thr Trp Ala Leu Ile
35 40 45
His Ser Val Ser Asp Ala Phe Ser Gly Trp Leu Leu Met Leu Leu Ile
50 55 60
Gly Leu Leu Ser Gly Ser Leu Ala Gly Leu Ile Asp Ile Ser Ala His
65 70 75 80
Trp Met Thr Asp Leu Lys Glu Gly Ile Cys Thr Gly Gly Phe Trp Phe
85 90 95
Asn His Glu His Cys Cys Trp Asn Ser Glu His Val Thr Phe Glu Glu
100 105 110
Arg Asp Lys Cys Pro Glu Trp Asn Ser Trp Ser Gln Leu Ile Ile Ser
115 120 125
Thr Asp Glu Gly Ala Phe Ala Tyr Ile Val Asn Tyr Phe Met Tyr Val
130 135 140
Leu Trp Ala Leu Leu Phe Ala Phe Leu Ala Val Ser Leu Val Lys Val
145 150 155 160
Phe Ala Pro Tyr Ala Cys Gly Ser Gly Ile Pro Glu Ile Lys Thr Ile
165 170 175
Leu Ser Gly Phe Ile Ile Arg Gly Tyr Leu Gly Lys Trp Thr Leu Val
180 185 190
Ile Lys Thr Ile Thr Leu Val Leu Ala Val Ser Ser Gly Leu Ser Leu
195 200 205
Gly Lys Glu Gly Pro Leu Val His Val Ala Cys Cys Cys Gly Asn Ile
210 215 220
Leu Cys His Cys Phe Asn Lys Tyr Arg Lys Asn Glu Ala Lys Arg Arg
225 230 235 240
Glu Val Leu Ser Ala Ala Ala Ala Ala Gly Val Ser Val Ala Phe Gly
245 250 255
Ala Pro Ile Gly Gly Val Leu Phe Ser Leu Glu Glu Val Ser Tyr Tyr
260 265 270
Phe Pro Leu Lys Thr Leu Trp Arg Ser Phe Phe Ala Ala Leu Val Ala
275 280 285
Ala Phe Thr Leu Arg Ser Ile Asn Pro Phe Gly Asn Ser Arg Leu Val
290 295 300
Leu Phe Tyr Val Glu Phe His Thr Pro Trp His Leu Phe Glu Leu Val
305 310 315 320
Pro Phe Ile Leu Leu Gly Ile Phe Gly Gly Leu Trp Gly Ala Leu Phe
325 330 335
Ile Arg Thr Asn Ile Ala Trp Cys Arg Lys Arg Lys Thr Thr Gln Leu
340 345 350
Gly Lys Tyr Pro Val Ile Glu Val Leu Val Val Thr Ala Ile Thr Ala
355 360 365
Ile Leu Ala Phe Pro Asn Glu Tyr Thr Arg Met Ser Thr Ser Glu Leu
370 375 380
Ile Ser Glu Leu Phe Asn Asp Cys Gly Leu Leu Asp Ser Ser Lys Leu
385 390 395 400
Cys Asp Tyr Glu Asn Arg Phe Asn Thr Ser Lys Gly Gly Glu Leu Pro
405 410 415
Asp Arg Pro Ala Gly Val Gly Val Tyr Ser Ala Met Trp Gln Leu Ala
420 425 430
Leu Thr Leu Ile Leu Lys Ile Val Ile Thr Ile Phe Thr Phe Gly Met
435 440 445
Lys Ile Pro Ser Gly Leu Phe Ile Pro Ser Met Ala Val Gly Ala Ile
450 455 460
Ala Gly Arg Leu Leu Gly Val Gly Met Glu Gln Leu Ala Tyr Tyr His
465 470 475 480
Gln Glu Trp Thr Val Phe Asn Ser Trp Cys Ser Gln Gly Ala Asp Cys
485 490 495
Ile Thr Pro Gly Leu Tyr Ala Met Val Gly Ala Ala Ala Cys Leu Gly
500 505 510
Gly Val Thr Arg Met Thr Val Ser Leu Val Val Ile Met Phe Glu Leu
515 520 525
Thr Gly Gly Leu Glu Tyr Ile Val Pro Leu Met Ala Ala Ala Met Thr
530 535 540
Ser Lys Trp Val Ala Asp Ala Leu Gly Arg Glu Gly Ile Tyr Asp Ala
545 550 555 560
His Ile Arg Leu Asn Gly Tyr Pro Phe Leu Glu Ala Lys Glu Glu Phe
565 570 575
Ala His Lys Thr Leu Ala Met Asp Val Met Lys Pro Arg Arg Asn Asp
580 585 590
Pro Leu Leu Thr Val Leu Thr Gln Asp Ser Met Thr Val Glu Asp Val
595 600 605
Glu Thr Ile Ile Ser Glu Thr Thr Tyr Ser Gly Phe Pro Val Val Val
610 615 620
Ser Arg Glu Ser Gln Arg Leu Val Gly Phe Val Leu Arg Arg Asp Leu
625 630 635 640
Ile Ile Ser Ile Glu Asn Ala Arg Lys Lys Gln Asp Gly Val Val Ser
645 650 655
Thr Ser Ile Ile Tyr Phe Thr Glu His Ser Pro Pro Leu Pro Pro Tyr
660 665 670
Thr Pro Pro Thr Leu Lys Leu Arg Asn Ile Leu Asp Leu Ser Pro Phe
675 680 685
Thr Val Thr Asp Leu Thr Pro Met Glu Ile Val Val Asp Ile Phe Arg
690 695 700
Lys Leu Gly Leu Arg Gln Cys Leu Val Thr His Asn Gly Arg Leu Leu
705 710 715 720
Gly Ile Ile Thr Lys Lys Asp Val Leu Lys His Ile Ala Gln Met Ala
725 730 735
Asn Gln Asp Pro Asp Ser Ile Leu Phe Asn
740 745
<210> 151
<211> 828
<212> DNA
<213> Homo sapiens
<400> 151
atgacagatg acaaagatgt gcttcgagat gtgtggtttg gacgaattcc aacttgtttc 60
acgctatatc aggatgagat aactgaaagg gaagcagaac catactattt gcttttgcca 120
agagtaagtt atttgacgtt ggtaactgac aaagtgaaaa agcactttca gaaggttatg 180
agacaagaag acattagtga gatatggttt gaatatgaag gcacaccact gaaatggcat 240
tatccaattg gtttgctatt tgatcttctt gcatcaagtt cagctcttcc ttggaacatc 300
acagtacatt ttaagagttt tccagaaaaa gaccttctgc actgtccatc taaggatgca 360
attgaagctc attttatgtc atgtatgaaa gaagctgatg ctttaaaaca taaaagtcaa 420
gtaatcaatg aaatgcagaa aaaagatcac aagcaactct ggatgggatt gcaaaatgac 480
agatttgacc agttttgggc catcaatcgg aaactcatgg aatatcctgc agaagaaaat 540
ggatttcgtt atatcccctt tagaatatat cagacaacga ctgaaagacc tttcattcag 600
aagctgtttc gtcctgtggc tgcagatgga cagttgcaca cactaggaga tctcctcaaa 660
gaagtttgtc cttctgctat tgatcctgaa gatggggaaa aaaagaatca agtgatgatt 720
catggaattg agccaatgtt ggaaacacct ctgcagtggc tgagtgaaca tctgagctac 780
ccggataatt ttcttcatat tagtatcatc ccacagccaa cagattga 828
<210> 152
<211> 275
<212> PRT
<213> Homo sapiens
<400> 152
Met Thr Asp Asp Lys Asp Val Leu Arg Asp Val Trp Phe Gly Arg Ile
1 5 10 15
Pro Thr Cys Phe Thr Leu Tyr Gln Asp Glu Ile Thr Glu Arg Glu Ala
20 25 30
Glu Pro Tyr Tyr Leu Leu Leu Pro Arg Val Ser Tyr Leu Thr Leu Val
35 40 45
Thr Asp Lys Val Lys Lys His Phe Gln Lys Val Met Arg Gln Glu Asp
50 55 60
Ile Ser Glu Ile Trp Phe Glu Tyr Glu Gly Thr Pro Leu Lys Trp His
65 70 75 80
Tyr Pro Ile Gly Leu Leu Phe Asp Leu Leu Ala Ser Ser Ser Ala Leu
85 90 95
Pro Trp Asn Ile Thr Val His Phe Lys Ser Phe Pro Glu Lys Asp Leu
100 105 110
Leu His Cys Pro Ser Lys Asp Ala Ile Glu Ala His Phe Met Ser Cys
115 120 125
Met Lys Glu Ala Asp Ala Leu Lys His Lys Ser Gln Val Ile Asn Glu
130 135 140
Met Gln Lys Lys Asp His Lys Gln Leu Trp Met Gly Leu Gln Asn Asp
145 150 155 160
Arg Phe Asp Gln Phe Trp Ala Ile Asn Arg Lys Leu Met Glu Tyr Pro
165 170 175
Ala Glu Glu Asn Gly Phe Arg Tyr Ile Pro Phe Arg Ile Tyr Gln Thr
180 185 190
Thr Thr Glu Arg Pro Phe Ile Gln Lys Leu Phe Arg Pro Val Ala Ala
195 200 205
Asp Gly Gln Leu His Thr Leu Gly Asp Leu Leu Lys Glu Val Cys Pro
210 215 220
Ser Ala Ile Asp Pro Glu Asp Gly Glu Lys Lys Asn Gln Val Met Ile
225 230 235 240
His Gly Ile Glu Pro Met Leu Glu Thr Pro Leu Gln Trp Leu Ser Glu
245 250 255
His Leu Ser Tyr Pro Asp Asn Phe Leu His Ile Ser Ile Ile Pro Gln
260 265 270
Pro Thr Asp
275
<210> 153
<211> 2112
<212> DNA
<213> Homo sapiens
<400> 153
atggcggcag ctacggggga tcctggactc tctaaactgc agtttgcccc ttttagtagt 60
gccttggatg ttgggttttg gcatgagttg acccagaaga agctgaacga gtatcggctg 120
gatgaagctc ccaaggacat taagggttat tactacaatg gtgactctgc tgggctgcca 180
gctcgcttaa cattggagtt cagtgctttt gacatgagtg ctcccacccc agcccgttgc 240
tgcccagcta ttggaacact gtataacacc aacacactcg agtctttcaa gactgcagat 300
aagaagctcc ttttggaaca agcagcaaat gagatatggg aatccataaa atcaggcact 360
gctcttgaaa accctgtact cctcaacaag ttcctcctct tgacatttgc agatctaaag 420
aagtaccact tctactattg gttttgctat cctgccctct gtcttccaga gagtttacct 480
ctcattcagg ggccagtggg tttggatcaa aggttttcac taaaacagat tgaagcacta 540
gagtgtgcat atgataatct ttgtcaaaca gaaggagtca cagctcttcc ttacttctta 600
atcaagtatg atgagaacat ggtgctggtt tccttgctta aacactacag tgatttcttc 660
caaggtcaaa ggacgaagat aacaattggt gtatatgatc cctgtaactt agcccagtac 720
cctggatggc ctttgaggaa ttttttggtc ctagcagccc acagatggag tagcagtttc 780
cagtctgttg aagttgtttg cttccgtgac cgtaccatgc agggggcgag agacgttgcc 840
cacagcatca tcttcgaagt gaagcttcca gaaatggcat ttagcccaga ttgtcctaaa 900
gcagttggat gggaaaagaa ccagaaagga ggcatgggac caaggatggt gaacctcagt 960
gaatgtatgg accctaaaag gttagctgag tcatcagtgg atctaaatct caaactgatg 1020
tgttggagat tggttcctac tttagacttg gacaaggttg tgtctgtcaa atgtctgctg 1080
cttggagccg gcaccttggg ttgcaatgta gctaggacgt tgatgggttg gggcgtgaga 1140
cacatcacat ttgtggacaa tgccaagatc tcctactcca atcctgtgag gcagcctctc 1200
tatgagtttg aagattgcct agggggtggt aagcccaagg ctctggcagc agcggaccgg 1260
ctccagaaaa tattccccgg tgtgaatgcc agaggattca acatgagcat acctatgcct 1320
gggcatccag tgaacttctc cagtgtcact ctggagcaag cccgcagaga tgtggagcaa 1380
ctggagcagc tcatcgaaag ccatgatgtc gtcttcctat tgatggacac cagggagagc 1440
cggtggcttc ctgccgtcat tgctgcaagc aagagaaagc tggtcatcaa tgctgctttg 1500
ggatttgaca catttgttgt catgagacat ggtctgaaga aaccaaagca gcaaggagct 1560
ggggacttgt gtccaaacca ccctgtggca tctgctgacc tcctgggctc atcgcttttt 1620
gccaacatcc ctggttacaa gcttggctgc tacttctgca atgatgtggt ggccccagga 1680
gattcaacca gagaccggac cttggaccag cagtgcactg tgagtcgtcc aggactggcc 1740
gtgattgcag gagccctggc cgtggaattg atggtatctg ttttgcagca tccagaaggg 1800
ggctatgcca ttgccagcag cagtgacgat cggatgaatg agcctccaac ctctcttggg 1860
cttgtgcctc accagatccg gggatttctt tcacggtttg ataatgtcct tcccgtcagc 1920
ctggcatttg acaaatgtac agcttgttct tccaaagttc ttgatcaata tgaacgagaa 1980
ggatttaact tcctagccaa ggtgtttaat tcttcacatt ccttcttaga agacttgact 2040
ggtcttacat tgctgcatca agaaacccaa gctgctgaga tctgggacat gagcgatgat 2100
gagaccatct ga 2112
<210> 154
<211> 703
<212> PRT
<213> Homo sapiens
<400> 154
Met Ala Ala Ala Thr Gly Asp Pro Gly Leu Ser Lys Leu Gln Phe Ala
1 5 10 15
Pro Phe Ser Ser Ala Leu Asp Val Gly Phe Trp His Glu Leu Thr Gln
20 25 30
Lys Lys Leu Asn Glu Tyr Arg Leu Asp Glu Ala Pro Lys Asp Ile Lys
35 40 45
Gly Tyr Tyr Tyr Asn Gly Asp Ser Ala Gly Leu Pro Ala Arg Leu Thr
50 55 60
Leu Glu Phe Ser Ala Phe Asp Met Ser Ala Pro Thr Pro Ala Arg Cys
65 70 75 80
Cys Pro Ala Ile Gly Thr Leu Tyr Asn Thr Asn Thr Leu Glu Ser Phe
85 90 95
Lys Thr Ala Asp Lys Lys Leu Leu Leu Glu Gln Ala Ala Asn Glu Ile
100 105 110
Trp Glu Ser Ile Lys Ser Gly Thr Ala Leu Glu Asn Pro Val Leu Leu
115 120 125
Asn Lys Phe Leu Leu Leu Thr Phe Ala Asp Leu Lys Lys Tyr His Phe
130 135 140
Tyr Tyr Trp Phe Cys Tyr Pro Ala Leu Cys Leu Pro Glu Ser Leu Pro
145 150 155 160
Leu Ile Gln Gly Pro Val Gly Leu Asp Gln Arg Phe Ser Leu Lys Gln
165 170 175
Ile Glu Ala Leu Glu Cys Ala Tyr Asp Asn Leu Cys Gln Thr Glu Gly
180 185 190
Val Thr Ala Leu Pro Tyr Phe Leu Ile Lys Tyr Asp Glu Asn Met Val
195 200 205
Leu Val Ser Leu Leu Lys His Tyr Ser Asp Phe Phe Gln Gly Gln Arg
210 215 220
Thr Lys Ile Thr Ile Gly Val Tyr Asp Pro Cys Asn Leu Ala Gln Tyr
225 230 235 240
Pro Gly Trp Pro Leu Arg Asn Phe Leu Val Leu Ala Ala His Arg Trp
245 250 255
Ser Ser Ser Phe Gln Ser Val Glu Val Val Cys Phe Arg Asp Arg Thr
260 265 270
Met Gln Gly Ala Arg Asp Val Ala His Ser Ile Ile Phe Glu Val Lys
275 280 285
Leu Pro Glu Met Ala Phe Ser Pro Asp Cys Pro Lys Ala Val Gly Trp
290 295 300
Glu Lys Asn Gln Lys Gly Gly Met Gly Pro Arg Met Val Asn Leu Ser
305 310 315 320
Glu Cys Met Asp Pro Lys Arg Leu Ala Glu Ser Ser Val Asp Leu Asn
325 330 335
Leu Lys Leu Met Cys Trp Arg Leu Val Pro Thr Leu Asp Leu Asp Lys
340 345 350
Val Val Ser Val Lys Cys Leu Leu Leu Gly Ala Gly Thr Leu Gly Cys
355 360 365
Asn Val Ala Arg Thr Leu Met Gly Trp Gly Val Arg His Ile Thr Phe
370 375 380
Val Asp Asn Ala Lys Ile Ser Tyr Ser Asn Pro Val Arg Gln Pro Leu
385 390 395 400
Tyr Glu Phe Glu Asp Cys Leu Gly Gly Gly Lys Pro Lys Ala Leu Ala
405 410 415
Ala Ala Asp Arg Leu Gln Lys Ile Phe Pro Gly Val Asn Ala Arg Gly
420 425 430
Phe Asn Met Ser Ile Pro Met Pro Gly His Pro Val Asn Phe Ser Ser
435 440 445
Val Thr Leu Glu Gln Ala Arg Arg Asp Val Glu Gln Leu Glu Gln Leu
450 455 460
Ile Glu Ser His Asp Val Val Phe Leu Leu Met Asp Thr Arg Glu Ser
465 470 475 480
Arg Trp Leu Pro Ala Val Ile Ala Ala Ser Lys Arg Lys Leu Val Ile
485 490 495
Asn Ala Ala Leu Gly Phe Asp Thr Phe Val Val Met Arg His Gly Leu
500 505 510
Lys Lys Pro Lys Gln Gln Gly Ala Gly Asp Leu Cys Pro Asn His Pro
515 520 525
Val Ala Ser Ala Asp Leu Leu Gly Ser Ser Leu Phe Ala Asn Ile Pro
530 535 540
Gly Tyr Lys Leu Gly Cys Tyr Phe Cys Asn Asp Val Val Ala Pro Gly
545 550 555 560
Asp Ser Thr Arg Asp Arg Thr Leu Asp Gln Gln Cys Thr Val Ser Arg
565 570 575
Pro Gly Leu Ala Val Ile Ala Gly Ala Leu Ala Val Glu Leu Met Val
580 585 590
Ser Val Leu Gln His Pro Glu Gly Gly Tyr Ala Ile Ala Ser Ser Ser
595 600 605
Asp Asp Arg Met Asn Glu Pro Pro Thr Ser Leu Gly Leu Val Pro His
610 615 620
Gln Ile Arg Gly Phe Leu Ser Arg Phe Asp Asn Val Leu Pro Val Ser
625 630 635 640
Leu Ala Phe Asp Lys Cys Thr Ala Cys Ser Ser Lys Val Leu Asp Gln
645 650 655
Tyr Glu Arg Glu Gly Phe Asn Phe Leu Ala Lys Val Phe Asn Ser Ser
660 665 670
His Ser Phe Leu Glu Asp Leu Thr Gly Leu Thr Leu Leu His Gln Glu
675 680 685
Thr Gln Ala Ala Glu Ile Trp Asp Met Ser Asp Asp Glu Thr Ile
690 695 700
<210> 155
<211> 7650
<212> DNA
<213> Homo sapiens
<400> 155
atgcttggaa ccggacctgc cgccgccacc accgctgcca ccacatctag caatgtgagc 60
gtcctgcagc agtttgccag tggcctaaag agccggaatg aggaaaccag ggccaaagcc 120
gccaaggagc tccagcacta tgtcaccatg gaactccgag agatgagtca agaggagtct 180
actcgcttct atgaccaact gaaccatcac atttttgaat tggtttccag ctcagatgcc 240
aatgagagga aaggtggcat cttggccata gctagcctca taggagtgga aggtgggaat 300
gccacccgaa ttggcagatt tgccaactat cttcggaacc tcctcccctc caatgaccca 360
gttgtcatgg aaatggcatc caaggccatt ggccgtcttg ccatggcagg ggacactttt 420
accgctgagt acgtggaatt tgaggtgaag cgagccctgg aatggctggg tgctgaccgc 480
aatgagggcc ggagacatgc agctgtcctg gttctccgtg agctggccat cagcgtccct 540
accttcttct tccagcaagt gcaacccttc tttgacaaca tttttgtggc cgtgtgggac 600
cccaaacagg ccatccgtga gggagctgta gccgcccttc gtgcctgtct gattctcaca 660
acccagcgtg agccgaagga gatgcagaag cctcagtggt acaggcacac atttgaagaa 720
gcagagaagg gatttgatga gaccttggcc aaagagaagg gcatgaatcg ggatgatcgg 780
atccatggag ccttgttgat ccttaacgag ctggtccgaa tcagcagcat ggagggagag 840
cgtctgagag aagaaatgga agaaatcaca cagcagcagc tggtacacga caagtactgc 900
aaagatctca tgggcttcgg aacaaaacct cgtcacatta cccccttcac cagtttccag 960
gctgtacagc cccagcagtc aaatgccttg gtggggctgc tggggtacag ctctcaccaa 1020
ggcctcatgg gatttgggac ctcccccagt ccagctaagt ccaccctggt ggagagccgg 1080
tgttgcagag acttgatgga ggagaaattt gatcaggtgt gccagtgggt gctgaaatgc 1140
aggaatagca agaactcgct gatccaaatg acaatcctta atttgttgcc ccgcttggct 1200
gcattccgac cttctgcctt cacagatacc cagtatctcc aagataccat gaaccatgtc 1260
ctaagctgtg tcaagaagga gaaggaacgt acagcggcct tccaagccct ggggctactt 1320
tctgtggctg tgaggtctga gtttaaggtc tatttgcctc gcgtgctgga catcatccga 1380
gcggccctgc ccccaaagga cttcgcccat aagaggcaga aggcaatgca ggtggatgcc 1440
acagtcttca cttgcatcag catgctggct cgagcaatgg ggccaggcat ccagcaggat 1500
atcaaggagc tgctggagcc catgctggca gtgggactaa gccctgccct cactgcagtg 1560
ctctacgacc tgagccgtca gattccacag ctaaagaagg acattcaaga tgggctactg 1620
aaaatgctgt ccctggtcct tatgcacaaa ccccttcgcc acccaggcat gcccaagggc 1680
ctggcccatc agctggcctc tcctggcctc acgaccctcc ctgaggccag cgatgtgggc 1740
agcatcactc ttgccctccg aacgcttggc agctttgaat ttgaaggcca ctctctgacc 1800
caatttgttc gccactgtgc ggatcatttc ctgaacagtg agcacaagga gatccgcatg 1860
gaggctgccc gcacctgctc ccgcctgctc acaccctcca tccacctcat cagtggccat 1920
gctcatgtgg ttagccagac cgcagtgcaa gtggtggcag atgtgcttag caaactgctc 1980
gtagttggga taacagatcc tgaccctgac attcgctact gtgtcttggc gtccctggac 2040
gagcgctttg atgcacacct ggcccaggcg gagaacttgc aggccttgtt tgtggctctg 2100
aatgaccagg tgtttgagat ccgggagctg gccatctgca ctgtgggccg actcagtagc 2160
atgaaccctg cctttgtcat gcctttcctg cgcaagatgc tcatccagat tttgacagag 2220
ttggagcaca gtgggattgg aagaatcaaa gagcagagtg cccgcatgct ggggcacctg 2280
gtctccaatg ccccccgact catccgcccc tacatggagc ctattctgaa ggcattaatt 2340
ttgaaactga aagatccaga ccctgatcca aacccaggtg tgatcaataa tgtcctggca 2400
acaataggag aattggcaca ggttagtggc ctggaaatga ggaaatgggt tgatgaactt 2460
tttattatca tcatggacat gctccaggat tcctctttgt tggccaaaag gcaggtggct 2520
ctgtggaccc tgggacagtt ggtggccagc actggctatg tagtagagcc ctacaggaag 2580
taccctactt tgcttgaggt gctactgaat tttctgaaga ctgagcagaa ccagggtaca 2640
cgcagagagg ccatccgtgt gttagggctt ttaggggctt tggatcctta caagcacaaa 2700
gtgaacattg gcatgataga ccagtcccgg gatgcctctg ctgtcagcct gtcagaatcc 2760
aagtcaagtc aggattcctc tgactatagc actagtgaaa tgctggtcaa catgggaaac 2820
ttgcctctgg atgagttcta cccagctgtg tccatggtgg ccctgatgcg gatcttccga 2880
gaccagtcac tctctcatca tcacaccatg gttgtccagg ccatcacctt catcttcaag 2940
tccctgggac tcaaatgtgt gcagttcctg ccccaggtca tgcccacgtt ccttaacgtc 3000
attcgagtct gtgatggggc catccgggaa tttttgttcc agcagctggg aatgttggtg 3060
tcctttgtga agagccacat cagaccttat atggatgaaa tagtcaccct catgagagaa 3120
ttctgggtca tgaacacctc aattcagagc acgatcattc ttctcattga gcaaattgtg 3180
gtagctcttg ggggtgaatt taagctctac ctgccccagc tgatcccaca catgctgcgt 3240
gtcttcatgc atgacaacag cccaggccgc attgtctcta tcaagttact ggctgcaatc 3300
cagctgtttg gcgccaacct ggatgactac ctgcatttac tgctgcctcc tattgttaag 3360
ttgtttgatg cccctgaagc tccactgcca tctcgaaagg cagcgctaga gactgtggac 3420
cgcctgacgg agtccctgga tttcactgac tatgcctccc ggatcattca ccctattgtt 3480
cgaacactgg accagagccc agaactgcgc tccacagcca tggacacgct gtcttcactt 3540
gtttttcagc tggggaagaa gtaccaaatt ttcattccaa tggtgaataa agttctggtg 3600
cgacaccgaa tcaatcatca gcgctatgat gtgctcatct gcagaattgt caagggatac 3660
acacttgctg atgaagagga ggatcctttg atttaccagc atcggatgct taggagtggc 3720
caaggggatg cattggctag tggaccagtg gaaacaggac ccatgaagaa actgcacgtc 3780
agcaccatca acctccaaaa ggcctggggc gctgccagga gggtctccaa agatgactgg 3840
ctggaatggc tgagacggct gagcctggag ctgctgaagg actcatcatc gccctccctg 3900
cgctcctgct gggccctggc acaggcctac aacccgatgg ccagggatct cttcaatgct 3960
gcatttgtgt cctgctggtc tgaactgaat gaagatcaac aggatgagct catcagaagc 4020
atcgagttgg ccctcacctc acaagacatc gctgaagtca cacagaccct cttaaacttg 4080
gctgaattca tggaacacag tgacaagggc cccctgccac tgagagatga caatggcatt 4140
gttctgctgg gtgagagagc tgccaagtgc cgagcatatg ccaaagcact acactacaaa 4200
gaactggagt tccagaaagg ccccacccct gccattctag aatctctcat cagcattaat 4260
aataagctac agcagccgga ggcagcggcc ggagtgttag aatatgccat gaaacacttt 4320
ggagagctgg agatccaggc tacctggtat gagaaactgc acgagtggga ggatgccctt 4380
gtggcctatg acaagaaaat ggacaccaac aaggacgacc cagagctgat gctgggccgc 4440
atgcgctgcc tcgaggcctt gggggaatgg ggtcaactcc accagcagtg ctgtgaaaag 4500
tggaccctgg ttaatgatga gacccaagcc aagatggccc ggatggctgc tgcagctgca 4560
tggggtttag gtcagtggga cagcatggaa gaatacacct gtatgatccc tcgggacacc 4620
catgatgggg cattttatag agctgtgctg gcactgcatc aggacctctt ctccttggca 4680
caacagtgca ttgacaaggc cagggacctg ctggatgctg aattaactgc gatggcagga 4740
gagagttaca gtcgggcata tggggccatg gtttcttgcc acatgctgtc cgagctggag 4800
gaggttatcc agtacaaact tgtccccgag cgacgagaga tcatccgcca gatctggtgg 4860
gagagactgc agggctgcca gcgtatcgta gaggactggc agaaaatcct tatggtgcgg 4920
tcccttgtgg tcagccctca tgaagacatg agaacctggc tcaagtatgc aagcctgtgc 4980
ggcaagagtg gcaggctggc tcttgctcat aaaactttag tgttgctcct gggagttgat 5040
ccgtctcggc aacttgacca tcctctgcca acagttcacc ctcaggtgac ctatgcctac 5100
atgaaaaaca tgtggaagag tgcccgcaag atcgatgcct tccagcacat gcagcatttt 5160
gtccagacca tgcagcaaca ggcccagcat gccatcgcta ctgaggacca gcagcataag 5220
caggaactgc acaagctcat ggcccgatgc ttcctgaaac ttggagagtg gcagctgaat 5280
ctacagggca tcaatgagag cacaatcccc aaagtgctgc agtactacag cgccgccaca 5340
gagcacgacc gcagctggta caaggcctgg catgcgtggg cagtgatgaa cttcgaagct 5400
gtgctacact acaaacatca gaaccaagcc cgcgatgaga agaagaaact gcgtcatgcc 5460
agcggggcca acatcaccaa cgccaccact gccgccacca cggccgccac tgccaccacc 5520
actgccagca ccgagggcag caacagtgag agcgaggccg agagcaccga gaacagcccc 5580
accccatcgc cgctgcagaa gaaggtcact gaggatctgt ccaaaaccct cctgatgtac 5640
acggtgcctg ccgtccaggg cttcttccgt tccatctcct tgtcacgagg caacaacctc 5700
caggatacac tcagagttct caccttatgg tttgattatg gtcactggcc agatgtcaat 5760
gaggccttag tggagggggt gaaagccatc cagattgata cctggctaca ggttatacct 5820
cagctcattg caagaattga tacgcccaga cccttggtgg gacgtctcat tcaccagctt 5880
ctcacagaca ttggtcggta ccacccccag gccctcatct acccactgac agtggcttct 5940
aagtctacca cgacagcccg gcacaatgca gccaacaaga ttctgaagaa catgtgtgag 6000
cacagcaaca ccctggtcca gcaggccatg atggtgagcg aggagctgat ccgagtggcc 6060
atcctctggc atgagatgtg gcatgaaggc ctggaagagg catctcgttt gtactttggg 6120
gaaaggaacg tgaaaggcat gtttgaggtg ctggagccct tgcatgctat gatggaacgg 6180
ggcccccaga ctctgaagga aacatccttt aatcaggcct atggtcgaga tttaatggag 6240
gcccaagagt ggtgcaggaa gtacatgaaa tcagggaatg tcaaggacct cacccaagcc 6300
tgggacctct attatcatgt gttccgacga atctcaaagc agctgcctca gctcacatcc 6360
ttagagctgc aatatgtttc cccaaaactt ctgatgtgcc gggaccttga attggctgtg 6420
ccaggaacat atgaccccaa ccagccaatc attcgcattc agtccatagc accgtctttg 6480
caagtcatca catccaagca gaggccccgg aaattgacac ttatgggcag caacggacat 6540
gagtttgttt tccttctaaa aggccatgaa gatctgcgcc aggatgagcg tgtgatgcag 6600
ctcttcggcc tggttaacac ccttctggcc aatgacccaa catctcttcg gaaaaacctc 6660
agcatccaga gatacgctgt catcccttta tcgaccaact cgggcctcat tggctgggtt 6720
ccccactgtg acacactgca cgccctcatc cgggactaca gggagaagaa gaagatcctt 6780
ctcaacatcg agcatcgcat catgttgcgg atggctccgg actatgacca cttgactctg 6840
atgcagaagg tggaggtgtt tgagcatgcc gtcaataata cagctgggga cgacctggcc 6900
aagctgctgt ggctgaaaag ccccagctcc gaggtgtggt ttgaccgaag aaccaattat 6960
acccgttctt tagcggtcat gtcaatggtt gggtatattt taggcctggg agatagacac 7020
ccatccaacc tgatgctgga ccgtctgagt gggaagatcc tgcacattga ctttggggac 7080
tgctttgagg ttgctatgac ccgagagaag tttccagaga agattccatt tagactaaca 7140
agaatgttga ccaatgctat ggaggttaca ggcctggatg gcaactacag aatcacatgc 7200
cacacagtga tggaggtgct gcgagagcac aaggacagtg tcatggccgt gctggaagcc 7260
tttgtctatg accccttgct gaactggagg ctgatggaca caaataccaa aggcaacaag 7320
cgatcccgaa cgaggacgga ttcctactct gctggccagt cagtcgaaat tttggacggt 7380
gtggaacttg gagagccagc ccataagaaa acggggacca cagtgccaga atctattcat 7440
tctttcattg gagacggttt ggtgaaacca gaggccctaa ataagaaagc tatccagatt 7500
attaacaggg ttcgagataa gctcactggt cgggacttct ctcatgatga cactttggat 7560
gttccaacgc aagttgagct gctcatcaaa caagcgacat cccatgaaaa cctctgccag 7620
tgctatattg gctggtgccc tttctggtaa 7650
<210> 156
<211> 2549
<212> PRT
<213> Homo sapiens
<400> 156
Met Leu Gly Thr Gly Pro Ala Ala Ala Thr Thr Ala Ala Thr Thr Ser
1 5 10 15
Ser Asn Val Ser Val Leu Gln Gln Phe Ala Ser Gly Leu Lys Ser Arg
20 25 30
Asn Glu Glu Thr Arg Ala Lys Ala Ala Lys Glu Leu Gln His Tyr Val
35 40 45
Thr Met Glu Leu Arg Glu Met Ser Gln Glu Glu Ser Thr Arg Phe Tyr
50 55 60
Asp Gln Leu Asn His His Ile Phe Glu Leu Val Ser Ser Ser Asp Ala
65 70 75 80
Asn Glu Arg Lys Gly Gly Ile Leu Ala Ile Ala Ser Leu Ile Gly Val
85 90 95
Glu Gly Gly Asn Ala Thr Arg Ile Gly Arg Phe Ala Asn Tyr Leu Arg
100 105 110
Asn Leu Leu Pro Ser Asn Asp Pro Val Val Met Glu Met Ala Ser Lys
115 120 125
Ala Ile Gly Arg Leu Ala Met Ala Gly Asp Thr Phe Thr Ala Glu Tyr
130 135 140
Val Glu Phe Glu Val Lys Arg Ala Leu Glu Trp Leu Gly Ala Asp Arg
145 150 155 160
Asn Glu Gly Arg Arg His Ala Ala Val Leu Val Leu Arg Glu Leu Ala
165 170 175
Ile Ser Val Pro Thr Phe Phe Phe Gln Gln Val Gln Pro Phe Phe Asp
180 185 190
Asn Ile Phe Val Ala Val Trp Asp Pro Lys Gln Ala Ile Arg Glu Gly
195 200 205
Ala Val Ala Ala Leu Arg Ala Cys Leu Ile Leu Thr Thr Gln Arg Glu
210 215 220
Pro Lys Glu Met Gln Lys Pro Gln Trp Tyr Arg His Thr Phe Glu Glu
225 230 235 240
Ala Glu Lys Gly Phe Asp Glu Thr Leu Ala Lys Glu Lys Gly Met Asn
245 250 255
Arg Asp Asp Arg Ile His Gly Ala Leu Leu Ile Leu Asn Glu Leu Val
260 265 270
Arg Ile Ser Ser Met Glu Gly Glu Arg Leu Arg Glu Glu Met Glu Glu
275 280 285
Ile Thr Gln Gln Gln Leu Val His Asp Lys Tyr Cys Lys Asp Leu Met
290 295 300
Gly Phe Gly Thr Lys Pro Arg His Ile Thr Pro Phe Thr Ser Phe Gln
305 310 315 320
Ala Val Gln Pro Gln Gln Ser Asn Ala Leu Val Gly Leu Leu Gly Tyr
325 330 335
Ser Ser His Gln Gly Leu Met Gly Phe Gly Thr Ser Pro Ser Pro Ala
340 345 350
Lys Ser Thr Leu Val Glu Ser Arg Cys Cys Arg Asp Leu Met Glu Glu
355 360 365
Lys Phe Asp Gln Val Cys Gln Trp Val Leu Lys Cys Arg Asn Ser Lys
370 375 380
Asn Ser Leu Ile Gln Met Thr Ile Leu Asn Leu Leu Pro Arg Leu Ala
385 390 395 400
Ala Phe Arg Pro Ser Ala Phe Thr Asp Thr Gln Tyr Leu Gln Asp Thr
405 410 415
Met Asn His Val Leu Ser Cys Val Lys Lys Glu Lys Glu Arg Thr Ala
420 425 430
Ala Phe Gln Ala Leu Gly Leu Leu Ser Val Ala Val Arg Ser Glu Phe
435 440 445
Lys Val Tyr Leu Pro Arg Val Leu Asp Ile Ile Arg Ala Ala Leu Pro
450 455 460
Pro Lys Asp Phe Ala His Lys Arg Gln Lys Ala Met Gln Val Asp Ala
465 470 475 480
Thr Val Phe Thr Cys Ile Ser Met Leu Ala Arg Ala Met Gly Pro Gly
485 490 495
Ile Gln Gln Asp Ile Lys Glu Leu Leu Glu Pro Met Leu Ala Val Gly
500 505 510
Leu Ser Pro Ala Leu Thr Ala Val Leu Tyr Asp Leu Ser Arg Gln Ile
515 520 525
Pro Gln Leu Lys Lys Asp Ile Gln Asp Gly Leu Leu Lys Met Leu Ser
530 535 540
Leu Val Leu Met His Lys Pro Leu Arg His Pro Gly Met Pro Lys Gly
545 550 555 560
Leu Ala His Gln Leu Ala Ser Pro Gly Leu Thr Thr Leu Pro Glu Ala
565 570 575
Ser Asp Val Gly Ser Ile Thr Leu Ala Leu Arg Thr Leu Gly Ser Phe
580 585 590
Glu Phe Glu Gly His Ser Leu Thr Gln Phe Val Arg His Cys Ala Asp
595 600 605
His Phe Leu Asn Ser Glu His Lys Glu Ile Arg Met Glu Ala Ala Arg
610 615 620
Thr Cys Ser Arg Leu Leu Thr Pro Ser Ile His Leu Ile Ser Gly His
625 630 635 640
Ala His Val Val Ser Gln Thr Ala Val Gln Val Val Ala Asp Val Leu
645 650 655
Ser Lys Leu Leu Val Val Gly Ile Thr Asp Pro Asp Pro Asp Ile Arg
660 665 670
Tyr Cys Val Leu Ala Ser Leu Asp Glu Arg Phe Asp Ala His Leu Ala
675 680 685
Gln Ala Glu Asn Leu Gln Ala Leu Phe Val Ala Leu Asn Asp Gln Val
690 695 700
Phe Glu Ile Arg Glu Leu Ala Ile Cys Thr Val Gly Arg Leu Ser Ser
705 710 715 720
Met Asn Pro Ala Phe Val Met Pro Phe Leu Arg Lys Met Leu Ile Gln
725 730 735
Ile Leu Thr Glu Leu Glu His Ser Gly Ile Gly Arg Ile Lys Glu Gln
740 745 750
Ser Ala Arg Met Leu Gly His Leu Val Ser Asn Ala Pro Arg Leu Ile
755 760 765
Arg Pro Tyr Met Glu Pro Ile Leu Lys Ala Leu Ile Leu Lys Leu Lys
770 775 780
Asp Pro Asp Pro Asp Pro Asn Pro Gly Val Ile Asn Asn Val Leu Ala
785 790 795 800
Thr Ile Gly Glu Leu Ala Gln Val Ser Gly Leu Glu Met Arg Lys Trp
805 810 815
Val Asp Glu Leu Phe Ile Ile Ile Met Asp Met Leu Gln Asp Ser Ser
820 825 830
Leu Leu Ala Lys Arg Gln Val Ala Leu Trp Thr Leu Gly Gln Leu Val
835 840 845
Ala Ser Thr Gly Tyr Val Val Glu Pro Tyr Arg Lys Tyr Pro Thr Leu
850 855 860
Leu Glu Val Leu Leu Asn Phe Leu Lys Thr Glu Gln Asn Gln Gly Thr
865 870 875 880
Arg Arg Glu Ala Ile Arg Val Leu Gly Leu Leu Gly Ala Leu Asp Pro
885 890 895
Tyr Lys His Lys Val Asn Ile Gly Met Ile Asp Gln Ser Arg Asp Ala
900 905 910
Ser Ala Val Ser Leu Ser Glu Ser Lys Ser Ser Gln Asp Ser Ser Asp
915 920 925
Tyr Ser Thr Ser Glu Met Leu Val Asn Met Gly Asn Leu Pro Leu Asp
930 935 940
Glu Phe Tyr Pro Ala Val Ser Met Val Ala Leu Met Arg Ile Phe Arg
945 950 955 960
Asp Gln Ser Leu Ser His His His Thr Met Val Val Gln Ala Ile Thr
965 970 975
Phe Ile Phe Lys Ser Leu Gly Leu Lys Cys Val Gln Phe Leu Pro Gln
980 985 990
Val Met Pro Thr Phe Leu Asn Val Ile Arg Val Cys Asp Gly Ala Ile
995 1000 1005
Arg Glu Phe Leu Phe Gln Gln Leu Gly Met Leu Val Ser Phe Val
1010 1015 1020
Lys Ser His Ile Arg Pro Tyr Met Asp Glu Ile Val Thr Leu Met
1025 1030 1035
Arg Glu Phe Trp Val Met Asn Thr Ser Ile Gln Ser Thr Ile Ile
1040 1045 1050
Leu Leu Ile Glu Gln Ile Val Val Ala Leu Gly Gly Glu Phe Lys
1055 1060 1065
Leu Tyr Leu Pro Gln Leu Ile Pro His Met Leu Arg Val Phe Met
1070 1075 1080
His Asp Asn Ser Pro Gly Arg Ile Val Ser Ile Lys Leu Leu Ala
1085 1090 1095
Ala Ile Gln Leu Phe Gly Ala Asn Leu Asp Asp Tyr Leu His Leu
1100 1105 1110
Leu Leu Pro Pro Ile Val Lys Leu Phe Asp Ala Pro Glu Ala Pro
1115 1120 1125
Leu Pro Ser Arg Lys Ala Ala Leu Glu Thr Val Asp Arg Leu Thr
1130 1135 1140
Glu Ser Leu Asp Phe Thr Asp Tyr Ala Ser Arg Ile Ile His Pro
1145 1150 1155
Ile Val Arg Thr Leu Asp Gln Ser Pro Glu Leu Arg Ser Thr Ala
1160 1165 1170
Met Asp Thr Leu Ser Ser Leu Val Phe Gln Leu Gly Lys Lys Tyr
1175 1180 1185
Gln Ile Phe Ile Pro Met Val Asn Lys Val Leu Val Arg His Arg
1190 1195 1200
Ile Asn His Gln Arg Tyr Asp Val Leu Ile Cys Arg Ile Val Lys
1205 1210 1215
Gly Tyr Thr Leu Ala Asp Glu Glu Glu Asp Pro Leu Ile Tyr Gln
1220 1225 1230
His Arg Met Leu Arg Ser Gly Gln Gly Asp Ala Leu Ala Ser Gly
1235 1240 1245
Pro Val Glu Thr Gly Pro Met Lys Lys Leu His Val Ser Thr Ile
1250 1255 1260
Asn Leu Gln Lys Ala Trp Gly Ala Ala Arg Arg Val Ser Lys Asp
1265 1270 1275
Asp Trp Leu Glu Trp Leu Arg Arg Leu Ser Leu Glu Leu Leu Lys
1280 1285 1290
Asp Ser Ser Ser Pro Ser Leu Arg Ser Cys Trp Ala Leu Ala Gln
1295 1300 1305
Ala Tyr Asn Pro Met Ala Arg Asp Leu Phe Asn Ala Ala Phe Val
1310 1315 1320
Ser Cys Trp Ser Glu Leu Asn Glu Asp Gln Gln Asp Glu Leu Ile
1325 1330 1335
Arg Ser Ile Glu Leu Ala Leu Thr Ser Gln Asp Ile Ala Glu Val
1340 1345 1350
Thr Gln Thr Leu Leu Asn Leu Ala Glu Phe Met Glu His Ser Asp
1355 1360 1365
Lys Gly Pro Leu Pro Leu Arg Asp Asp Asn Gly Ile Val Leu Leu
1370 1375 1380
Gly Glu Arg Ala Ala Lys Cys Arg Ala Tyr Ala Lys Ala Leu His
1385 1390 1395
Tyr Lys Glu Leu Glu Phe Gln Lys Gly Pro Thr Pro Ala Ile Leu
1400 1405 1410
Glu Ser Leu Ile Ser Ile Asn Asn Lys Leu Gln Gln Pro Glu Ala
1415 1420 1425
Ala Ala Gly Val Leu Glu Tyr Ala Met Lys His Phe Gly Glu Leu
1430 1435 1440
Glu Ile Gln Ala Thr Trp Tyr Glu Lys Leu His Glu Trp Glu Asp
1445 1450 1455
Ala Leu Val Ala Tyr Asp Lys Lys Met Asp Thr Asn Lys Asp Asp
1460 1465 1470
Pro Glu Leu Met Leu Gly Arg Met Arg Cys Leu Glu Ala Leu Gly
1475 1480 1485
Glu Trp Gly Gln Leu His Gln Gln Cys Cys Glu Lys Trp Thr Leu
1490 1495 1500
Val Asn Asp Glu Thr Gln Ala Lys Met Ala Arg Met Ala Ala Ala
1505 1510 1515
Ala Ala Trp Gly Leu Gly Gln Trp Asp Ser Met Glu Glu Tyr Thr
1520 1525 1530
Cys Met Ile Pro Arg Asp Thr His Asp Gly Ala Phe Tyr Arg Ala
1535 1540 1545
Val Leu Ala Leu His Gln Asp Leu Phe Ser Leu Ala Gln Gln Cys
1550 1555 1560
Ile Asp Lys Ala Arg Asp Leu Leu Asp Ala Glu Leu Thr Ala Met
1565 1570 1575
Ala Gly Glu Ser Tyr Ser Arg Ala Tyr Gly Ala Met Val Ser Cys
1580 1585 1590
His Met Leu Ser Glu Leu Glu Glu Val Ile Gln Tyr Lys Leu Val
1595 1600 1605
Pro Glu Arg Arg Glu Ile Ile Arg Gln Ile Trp Trp Glu Arg Leu
1610 1615 1620
Gln Gly Cys Gln Arg Ile Val Glu Asp Trp Gln Lys Ile Leu Met
1625 1630 1635
Val Arg Ser Leu Val Val Ser Pro His Glu Asp Met Arg Thr Trp
1640 1645 1650
Leu Lys Tyr Ala Ser Leu Cys Gly Lys Ser Gly Arg Leu Ala Leu
1655 1660 1665
Ala His Lys Thr Leu Val Leu Leu Leu Gly Val Asp Pro Ser Arg
1670 1675 1680
Gln Leu Asp His Pro Leu Pro Thr Val His Pro Gln Val Thr Tyr
1685 1690 1695
Ala Tyr Met Lys Asn Met Trp Lys Ser Ala Arg Lys Ile Asp Ala
1700 1705 1710
Phe Gln His Met Gln His Phe Val Gln Thr Met Gln Gln Gln Ala
1715 1720 1725
Gln His Ala Ile Ala Thr Glu Asp Gln Gln His Lys Gln Glu Leu
1730 1735 1740
His Lys Leu Met Ala Arg Cys Phe Leu Lys Leu Gly Glu Trp Gln
1745 1750 1755
Leu Asn Leu Gln Gly Ile Asn Glu Ser Thr Ile Pro Lys Val Leu
1760 1765 1770
Gln Tyr Tyr Ser Ala Ala Thr Glu His Asp Arg Ser Trp Tyr Lys
1775 1780 1785
Ala Trp His Ala Trp Ala Val Met Asn Phe Glu Ala Val Leu His
1790 1795 1800
Tyr Lys His Gln Asn Gln Ala Arg Asp Glu Lys Lys Lys Leu Arg
1805 1810 1815
His Ala Ser Gly Ala Asn Ile Thr Asn Ala Thr Thr Ala Ala Thr
1820 1825 1830
Thr Ala Ala Thr Ala Thr Thr Thr Ala Ser Thr Glu Gly Ser Asn
1835 1840 1845
Ser Glu Ser Glu Ala Glu Ser Thr Glu Asn Ser Pro Thr Pro Ser
1850 1855 1860
Pro Leu Gln Lys Lys Val Thr Glu Asp Leu Ser Lys Thr Leu Leu
1865 1870 1875
Met Tyr Thr Val Pro Ala Val Gln Gly Phe Phe Arg Ser Ile Ser
1880 1885 1890
Leu Ser Arg Gly Asn Asn Leu Gln Asp Thr Leu Arg Val Leu Thr
1895 1900 1905
Leu Trp Phe Asp Tyr Gly His Trp Pro Asp Val Asn Glu Ala Leu
1910 1915 1920
Val Glu Gly Val Lys Ala Ile Gln Ile Asp Thr Trp Leu Gln Val
1925 1930 1935
Ile Pro Gln Leu Ile Ala Arg Ile Asp Thr Pro Arg Pro Leu Val
1940 1945 1950
Gly Arg Leu Ile His Gln Leu Leu Thr Asp Ile Gly Arg Tyr His
1955 1960 1965
Pro Gln Ala Leu Ile Tyr Pro Leu Thr Val Ala Ser Lys Ser Thr
1970 1975 1980
Thr Thr Ala Arg His Asn Ala Ala Asn Lys Ile Leu Lys Asn Met
1985 1990 1995
Cys Glu His Ser Asn Thr Leu Val Gln Gln Ala Met Met Val Ser
2000 2005 2010
Glu Glu Leu Ile Arg Val Ala Ile Leu Trp His Glu Met Trp His
2015 2020 2025
Glu Gly Leu Glu Glu Ala Ser Arg Leu Tyr Phe Gly Glu Arg Asn
2030 2035 2040
Val Lys Gly Met Phe Glu Val Leu Glu Pro Leu His Ala Met Met
2045 2050 2055
Glu Arg Gly Pro Gln Thr Leu Lys Glu Thr Ser Phe Asn Gln Ala
2060 2065 2070
Tyr Gly Arg Asp Leu Met Glu Ala Gln Glu Trp Cys Arg Lys Tyr
2075 2080 2085
Met Lys Ser Gly Asn Val Lys Asp Leu Thr Gln Ala Trp Asp Leu
2090 2095 2100
Tyr Tyr His Val Phe Arg Arg Ile Ser Lys Gln Leu Pro Gln Leu
2105 2110 2115
Thr Ser Leu Glu Leu Gln Tyr Val Ser Pro Lys Leu Leu Met Cys
2120 2125 2130
Arg Asp Leu Glu Leu Ala Val Pro Gly Thr Tyr Asp Pro Asn Gln
2135 2140 2145
Pro Ile Ile Arg Ile Gln Ser Ile Ala Pro Ser Leu Gln Val Ile
2150 2155 2160
Thr Ser Lys Gln Arg Pro Arg Lys Leu Thr Leu Met Gly Ser Asn
2165 2170 2175
Gly His Glu Phe Val Phe Leu Leu Lys Gly His Glu Asp Leu Arg
2180 2185 2190
Gln Asp Glu Arg Val Met Gln Leu Phe Gly Leu Val Asn Thr Leu
2195 2200 2205
Leu Ala Asn Asp Pro Thr Ser Leu Arg Lys Asn Leu Ser Ile Gln
2210 2215 2220
Arg Tyr Ala Val Ile Pro Leu Ser Thr Asn Ser Gly Leu Ile Gly
2225 2230 2235
Trp Val Pro His Cys Asp Thr Leu His Ala Leu Ile Arg Asp Tyr
2240 2245 2250
Arg Glu Lys Lys Lys Ile Leu Leu Asn Ile Glu His Arg Ile Met
2255 2260 2265
Leu Arg Met Ala Pro Asp Tyr Asp His Leu Thr Leu Met Gln Lys
2270 2275 2280
Val Glu Val Phe Glu His Ala Val Asn Asn Thr Ala Gly Asp Asp
2285 2290 2295
Leu Ala Lys Leu Leu Trp Leu Lys Ser Pro Ser Ser Glu Val Trp
2300 2305 2310
Phe Asp Arg Arg Thr Asn Tyr Thr Arg Ser Leu Ala Val Met Ser
2315 2320 2325
Met Val Gly Tyr Ile Leu Gly Leu Gly Asp Arg His Pro Ser Asn
2330 2335 2340
Leu Met Leu Asp Arg Leu Ser Gly Lys Ile Leu His Ile Asp Phe
2345 2350 2355
Gly Asp Cys Phe Glu Val Ala Met Thr Arg Glu Lys Phe Pro Glu
2360 2365 2370
Lys Ile Pro Phe Arg Leu Thr Arg Met Leu Thr Asn Ala Met Glu
2375 2380 2385
Val Thr Gly Leu Asp Gly Asn Tyr Arg Ile Thr Cys His Thr Val
2390 2395 2400
Met Glu Val Leu Arg Glu His Lys Asp Ser Val Met Ala Val Leu
2405 2410 2415
Glu Ala Phe Val Tyr Asp Pro Leu Leu Asn Trp Arg Leu Met Asp
2420 2425 2430
Thr Asn Thr Lys Gly Asn Lys Arg Ser Arg Thr Arg Thr Asp Ser
2435 2440 2445
Tyr Ser Ala Gly Gln Ser Val Glu Ile Leu Asp Gly Val Glu Leu
2450 2455 2460
Gly Glu Pro Ala His Lys Lys Thr Gly Thr Thr Val Pro Glu Ser
2465 2470 2475
Ile His Ser Phe Ile Gly Asp Gly Leu Val Lys Pro Glu Ala Leu
2480 2485 2490
Asn Lys Lys Ala Ile Gln Ile Ile Asn Arg Val Arg Asp Lys Leu
2495 2500 2505
Thr Gly Arg Asp Phe Ser His Asp Asp Thr Leu Asp Val Pro Thr
2510 2515 2520
Gln Val Glu Leu Leu Ile Lys Gln Ala Thr Ser His Glu Asn Leu
2525 2530 2535
Cys Gln Cys Tyr Ile Gly Trp Cys Pro Phe Trp
2540 2545
<210> 157
<211> 1686
<212> DNA
<213> Homo sapiens
<400> 157
atggcaaata tgaatagtga ttctaggcat cttggcacct ctgaggtaga tcatgaaaga 60
gatcctggac ctatgaatat ccagtttgag ccatcggatc taagatccaa aaggcctttc 120
tgtatagagc ccacaaacat cgtgaatgtg aatcatgtca ttcagagggt tagtgaccat 180
gcctctgcca tgaacaagag aattcattac tacagccggc tcaccactcc tgcagacaag 240
gcactgattg ccccagacca tgtagttcca gctccagaag agtgctatgt gtatagtcca 300
ttgggctctg cttataaact tcaaagttac actgaaggat acggtaaaaa caccagttta 360
gtaaccattt ttatgatttg gaataccatg atgggaacat ctatactaag cattccttgg 420
ggcataaaac aggctggatt tactactgga atgtgtgtca tcatactgat gggcctttta 480
acactttatt gctgctacag agtagtgaaa tcacggacta tgatgttttc gttggatacc 540
actagctggg aatatccaga tgtctgcaga cattatttcg gctcctttgg gcagtggtcg 600
agtctccttt tctccttggt gtctctcatt ggagcaatga tagtttattg ggtgcttatg 660
tcaaattttc tttttaatac tggaaagttt atttttaatt ttattcatca cattaatgac 720
acagacacta tactgagtac caataatagc aaccctgtga tttgtccaag tgccgggagt 780
ggaggccatc ctgacaacag ctctatgatt ttctatgcca atgacacagg agcccaacag 840
tttgaaaagt ggtgggataa gtccaggaca gtcccctttt atcttgtagg gctcctcctc 900
ccactgctca atttcaagtc tccttcattt ttttcaaaat ttaatatcct aggcacagtg 960
tctgtccttt atttgatttt ccttgtcacc tttaaggctg ttcgcttggg atttcatttg 1020
gaatttcatt ggtttatacc aacagaattt tttgtaccag agataagatt tcagtttcca 1080
cagctgactg gagtgcttac ccttgctttt tttattcata attgtatcat cacactcttg 1140
aagaacaaca agaaacaaga aaacaatgtg agggacttgt gcattgctta tatgctggtg 1200
acattaactt atctctatat tggagtcctg gtttttgctt catttccttc accaccatta 1260
tccaaagatt gtattgagca gaatttttta gacaacttcc ctagcagtga caccctgtcc 1320
ttcattgcaa ggatattcct gctgttccag atgatgactg tatacccact cttaggctac 1380
ctggctcgtg tccagctttt gggccatatc ttcggtgaca tttatcctag cattttccat 1440
gtgctgattc ttaatctaat tattgtggga gctggagtga tcatggcctg tttctaccca 1500
aacataggag ggatcataag atattcagga gcagcatgtg gactggcctt tgtattcata 1560
tacccatctc tcatctatat aatttccctc caccaagaag agcgtctgac atggcctaaa 1620
ttaatcttcc acgttttcat catcattttg ggcgtggcta acctgattgt tcagtttttt 1680
atgtga 1686
<210> 158
<211> 561
<212> PRT
<213> Homo sapiens
<400> 158
Met Ala Asn Met Asn Ser Asp Ser Arg His Leu Gly Thr Ser Glu Val
1 5 10 15
Asp His Glu Arg Asp Pro Gly Pro Met Asn Ile Gln Phe Glu Pro Ser
20 25 30
Asp Leu Arg Ser Lys Arg Pro Phe Cys Ile Glu Pro Thr Asn Ile Val
35 40 45
Asn Val Asn His Val Ile Gln Arg Val Ser Asp His Ala Ser Ala Met
50 55 60
Asn Lys Arg Ile His Tyr Tyr Ser Arg Leu Thr Thr Pro Ala Asp Lys
65 70 75 80
Ala Leu Ile Ala Pro Asp His Val Val Pro Ala Pro Glu Glu Cys Tyr
85 90 95
Val Tyr Ser Pro Leu Gly Ser Ala Tyr Lys Leu Gln Ser Tyr Thr Glu
100 105 110
Gly Tyr Gly Lys Asn Thr Ser Leu Val Thr Ile Phe Met Ile Trp Asn
115 120 125
Thr Met Met Gly Thr Ser Ile Leu Ser Ile Pro Trp Gly Ile Lys Gln
130 135 140
Ala Gly Phe Thr Thr Gly Met Cys Val Ile Ile Leu Met Gly Leu Leu
145 150 155 160
Thr Leu Tyr Cys Cys Tyr Arg Val Val Lys Ser Arg Thr Met Met Phe
165 170 175
Ser Leu Asp Thr Thr Ser Trp Glu Tyr Pro Asp Val Cys Arg His Tyr
180 185 190
Phe Gly Ser Phe Gly Gln Trp Ser Ser Leu Leu Phe Ser Leu Val Ser
195 200 205
Leu Ile Gly Ala Met Ile Val Tyr Trp Val Leu Met Ser Asn Phe Leu
210 215 220
Phe Asn Thr Gly Lys Phe Ile Phe Asn Phe Ile His His Ile Asn Asp
225 230 235 240
Thr Asp Thr Ile Leu Ser Thr Asn Asn Ser Asn Pro Val Ile Cys Pro
245 250 255
Ser Ala Gly Ser Gly Gly His Pro Asp Asn Ser Ser Met Ile Phe Tyr
260 265 270
Ala Asn Asp Thr Gly Ala Gln Gln Phe Glu Lys Trp Trp Asp Lys Ser
275 280 285
Arg Thr Val Pro Phe Tyr Leu Val Gly Leu Leu Leu Pro Leu Leu Asn
290 295 300
Phe Lys Ser Pro Ser Phe Phe Ser Lys Phe Asn Ile Leu Gly Thr Val
305 310 315 320
Ser Val Leu Tyr Leu Ile Phe Leu Val Thr Phe Lys Ala Val Arg Leu
325 330 335
Gly Phe His Leu Glu Phe His Trp Phe Ile Pro Thr Glu Phe Phe Val
340 345 350
Pro Glu Ile Arg Phe Gln Phe Pro Gln Leu Thr Gly Val Leu Thr Leu
355 360 365
Ala Phe Phe Ile His Asn Cys Ile Ile Thr Leu Leu Lys Asn Asn Lys
370 375 380
Lys Gln Glu Asn Asn Val Arg Asp Leu Cys Ile Ala Tyr Met Leu Val
385 390 395 400
Thr Leu Thr Tyr Leu Tyr Ile Gly Val Leu Val Phe Ala Ser Phe Pro
405 410 415
Ser Pro Pro Leu Ser Lys Asp Cys Ile Glu Gln Asn Phe Leu Asp Asn
420 425 430
Phe Pro Ser Ser Asp Thr Leu Ser Phe Ile Ala Arg Ile Phe Leu Leu
435 440 445
Phe Gln Met Met Thr Val Tyr Pro Leu Leu Gly Tyr Leu Ala Arg Val
450 455 460
Gln Leu Leu Gly His Ile Phe Gly Asp Ile Tyr Pro Ser Ile Phe His
465 470 475 480
Val Leu Ile Leu Asn Leu Ile Ile Val Gly Ala Gly Val Ile Met Ala
485 490 495
Cys Phe Tyr Pro Asn Ile Gly Gly Ile Ile Arg Tyr Ser Gly Ala Ala
500 505 510
Cys Gly Leu Ala Phe Val Phe Ile Tyr Pro Ser Leu Ile Tyr Ile Ile
515 520 525
Ser Leu His Gln Glu Glu Arg Leu Thr Trp Pro Lys Leu Ile Phe His
530 535 540
Val Phe Ile Ile Ile Leu Gly Val Ala Asn Leu Ile Val Gln Phe Phe
545 550 555 560
Met
<210> 159
<211> 1050
<212> DNA
<213> Homo sapiens
<400> 159
atggcggcgg aggaggagga ggtggactct gccgacaccg gagagaggtc aggatggcta 60
actggttggc tccccacatg gtgccctacg tctatatcac accttaaaga agctgaagag 120
aagatgttaa aatgtgtgcc ttgcacatac aaaaaagaac ctgttcgtat atctaatgga 180
aataaaatat ggacactgaa gttctctcat aatatttcaa ataagactcc acttgtcctt 240
ctccatggtt ttggaggagg tcttgggctc tgggcactga attttggaga tctttgcacc 300
aacagacctg tctatgcttt tgacctattg ggttttggac gaagtagtag acccaggttt 360
gacagtgatg cagaagaagt ggagaatcag tttgtggaat ccattgaaga gtggagatgt 420
gccctaggat tggacaaaat gatcttgctt gggcacaacc taggtggatt cttggctgct 480
gcttactcgc tgaagtaccc atcaagggtt aatcatctca ttttagtgga gccttggggt 540
ttccctgaac gaccagacct tgctgatcaa gacagaccaa ttccagtttg gatcagagcc 600
ttgggagcag cattgactcc ctttaaccct ttagctggcc taaggattgc aggacccttt 660
ggtttaagtc tagtgcagcg tttaaggcct gatttcaaac gaaagtattc ttcaatgttc 720
gaagacgata ctgtgacaga atacatctac cactgtaatg tgcagactcc aagtggtgag 780
acagctttca agaatatgac tattccttat ggatgggcaa aaaggccaat gctccagcga 840
attggtaaaa tgcaccctga cattccagtt tcagtgatct ttggcgcccg atcctgcata 900
gatggcaatt ctggcaccag catccagtcc ttacgaccac attcatatgt gaagacaata 960
gctattcttg gggcaggaca ttatgtatat gcagatcaac cagaagaatt caaccagaaa 1020
gtaaaggaga tctgcgacac tgtggactga 1050
<210> 160
<211> 349
<212> PRT
<213> Homo sapiens
<400> 160
Met Ala Ala Glu Glu Glu Glu Val Asp Ser Ala Asp Thr Gly Glu Arg
1 5 10 15
Ser Gly Trp Leu Thr Gly Trp Leu Pro Thr Trp Cys Pro Thr Ser Ile
20 25 30
Ser His Leu Lys Glu Ala Glu Glu Lys Met Leu Lys Cys Val Pro Cys
35 40 45
Thr Tyr Lys Lys Glu Pro Val Arg Ile Ser Asn Gly Asn Lys Ile Trp
50 55 60
Thr Leu Lys Phe Ser His Asn Ile Ser Asn Lys Thr Pro Leu Val Leu
65 70 75 80
Leu His Gly Phe Gly Gly Gly Leu Gly Leu Trp Ala Leu Asn Phe Gly
85 90 95
Asp Leu Cys Thr Asn Arg Pro Val Tyr Ala Phe Asp Leu Leu Gly Phe
100 105 110
Gly Arg Ser Ser Arg Pro Arg Phe Asp Ser Asp Ala Glu Glu Val Glu
115 120 125
Asn Gln Phe Val Glu Ser Ile Glu Glu Trp Arg Cys Ala Leu Gly Leu
130 135 140
Asp Lys Met Ile Leu Leu Gly His Asn Leu Gly Gly Phe Leu Ala Ala
145 150 155 160
Ala Tyr Ser Leu Lys Tyr Pro Ser Arg Val Asn His Leu Ile Leu Val
165 170 175
Glu Pro Trp Gly Phe Pro Glu Arg Pro Asp Leu Ala Asp Gln Asp Arg
180 185 190
Pro Ile Pro Val Trp Ile Arg Ala Leu Gly Ala Ala Leu Thr Pro Phe
195 200 205
Asn Pro Leu Ala Gly Leu Arg Ile Ala Gly Pro Phe Gly Leu Ser Leu
210 215 220
Val Gln Arg Leu Arg Pro Asp Phe Lys Arg Lys Tyr Ser Ser Met Phe
225 230 235 240
Glu Asp Asp Thr Val Thr Glu Tyr Ile Tyr His Cys Asn Val Gln Thr
245 250 255
Pro Ser Gly Glu Thr Ala Phe Lys Asn Met Thr Ile Pro Tyr Gly Trp
260 265 270
Ala Lys Arg Pro Met Leu Gln Arg Ile Gly Lys Met His Pro Asp Ile
275 280 285
Pro Val Ser Val Ile Phe Gly Ala Arg Ser Cys Ile Asp Gly Asn Ser
290 295 300
Gly Thr Ser Ile Gln Ser Leu Arg Pro His Ser Tyr Val Lys Thr Ile
305 310 315 320
Ala Ile Leu Gly Ala Gly His Tyr Val Tyr Ala Asp Gln Pro Glu Glu
325 330 335
Phe Asn Gln Lys Val Lys Glu Ile Cys Asp Thr Val Asp
340 345
<210> 161
<211> 532
<212> DNA
<213> artificial sequence
<220>
<223> modified promoter sequence
<400> 161
gttacataac ttatggtaaa tggcctgcct ggctgactgc ccaatgaccc ctgcccaatg 60
atgtcaataa tgatgtatgt tcccatgtaa tgccaatagg gactttccat tgatgtcaat 120
gggtggagta tttatggtaa ctgcccactt ggcagtacat caagtgtatc atatgccaag 180
tatgccccct attgatgtca atgatggtaa atggcctgcc tggcattatg cccagtacat 240
gaccttatgg gactttccta cttggcagta catctatgta ttagtcattg ctattaccat 300
gggaattcac tagtggagaa gagcatgctt gagggctgag tgcccctcag tgggcagaga 360
gcacatggcc cacagtccct gagaagttgg ggggaggggt gggcaattga actggtgcct 420
agagaaggtg gggcttgggt aaactgggaa agtgatgtgg tgtactggct ccaccttttt 480
ccccagggtg ggggagaacc atatataagt gcagtagtct ctgtgaacat tc 532
<210> 162
<211> 1611
<212> DNA
<213> Homo sapiens
<400> 162
atggagtttt caagtccttc cagagaggaa tgtcccaagc ctttgagtag ggtaagcatc 60
atggctggca gcctcacagg tttgcttcta cttcaggcag tgtcgtgggc atcaggtgcc 120
cgcccctgca tccctaaaag cttcggctac agctcggtgg tgtgtgtctg caatgccaca 180
tactgtgact cctttgaccc cccgaccttt cctgcccttg gtaccttcag ccgctatgag 240
agtacacgca gtgggcgacg gatggagctg agtatggggc ccatccaggc taatcacacg 300
ggcacaggcc tgctactgac cctgcagcca gaacagaagt tccagaaagt gaagggattt 360
ggaggggcca tgacagatgc tgctgctctc aacatccttg ccctgtcacc ccctgcccaa 420
aatttgctac ttaaatcgta cttctctgaa gaaggaatcg gatataacat catccgggta 480
cccatggcca gctgtgactt ctccatccgc acctacacct atgcagacac ccctgatgat 540
ttccagttgc acaacttcag cctcccagag gaagatacca agctcaagat acccctgatt 600
caccgagccc tgcagttggc ccagcgtccc gtttcactcc ttgccagccc ctggacatca 660
cccacttggc tcaagaccaa tggagcggtg aatgggaagg ggtcactcaa gggacagccc 720
ggagacatct accaccagac ctgggccaga tactttgtga agttcctgga tgcctatgct 780
gagcacaagt tacagttctg ggcagtgaca gctgaaaatg agccttctgc tgggctgttg 840
agtggatacc ccttccagtg cctgggcttc acccctgaac atcagcgaga cttcattgcc 900
cgtgacctag gtcctaccct cgccaacagt actcaccaca atgtccgcct actcatgctg 960
gatgaccaac gcttgctgct gccccactgg gcaaaggtgg tactgacaga cccagaagca 1020
gctaaatatg ttcatggcat tgctgtacat tggtacctgg actttctggc tccagccaac 1080
gccaccctag gggagacaca ccgcctgttc cccaacacca tgctctttgc ctcagaggcc 1140
tgtgtgggct ccaagttctg ggagcagagt gtgcggctag gctcctggga tcgagggatg 1200
cagtacagcc acagcatcat cacgaacctc ctgtaccatg tggtcggctg gaccgactgg 1260
aaccttgccc tgaaccccga aggaggaccc aattgggtgc gtaactttgt cgacagtccc 1320
atcattgtag acatcaccaa ggacacgttt tacaaacagc ccatgttcta ccaccttggc 1380
cacttcagca agttcattcc tgagggctcc cagagagtgg ggctggttgc cagtcagaag 1440
aacgacctgg acgcagtggc actgatgcat cccgatggct ctgctgttgt ggtcgtgcta 1500
aaccgctcct ctaaggatgt gcctcttacc atcaaggatc ctgctgtggg cttcctggag 1560
acaatctcac ctggctactc cattcacacc tacctgtggc gtcgccagtg a 1611
<210> 163
<211> 1611
<212> DNA
<213> Homo sapiens
<400> 163
atggagtttt caagtccttc cagagaggaa tgtcccaagc ctttgagtag ggtaagcatc 60
atggctggca gcctcacagg tttgcttcta cttcaggcag tgtcgtgggc atcaggtgcc 120
cgcccctgca tccctaaaag cttcggctac agctcggtgg tgtgtgtctg caatgccaca 180
tactgtgact cctttgaccc cccgaccttt cctgcccttg gtaccttcag ccgctatgag 240
agtacacgca gtgggcgacg gatggagctg agtatggggc ccatccaggc taatcacacg 300
ggcacaggcc tgctactgac cctgcagcca gaacagaagt tccagaaagt gaagggattt 360
ggaggggcca tgacagatgc tgctgctctc aacatccttg ccctgtcacc ccctgcccaa 420
aatttgctac ttaaatcgta cttctctgaa gaaggaatcg gatataacat catccgggta 480
cccatggcca gctccgactt ctccatccgc acctacacct atgcagacac ccctgatgat 540
ttccagttgc acaacttcag cctcccagag gaagatacca agctcaagat acccctgatt 600
caccgagccc tgcagttggc ccagcgtccc gtttcactcc ttgccagccc ctggacatca 660
cccacttggc tcaagaccaa tggagcggtg aatgggaagg ggtcactcaa gggacagccc 720
ggagacatct accaccagac ctgggccaga tactttgtga agttcctgga tgcctatgct 780
gagcacaagt tacagttctg ggcagtgaca gctgaaaatg agccttctgc tgggctgttg 840
agtggatacc ccttccagtg cctgggcttc acccctgaac atcagcgaga cttcattgcc 900
cgtgacctag gtcctaccct cgccaacagt actcaccaca atgtccgcct actcatgctg 960
gatgaccaac gcttgctgct gccccactgg gcaaaggtgg tactgacaga cccagaagca 1020
gctaaatatg ttcatggcat tgctgtacat tggtacctgg actttctggc tccagccaaa 1080
gccaccctag gggagacaca ccgcctgttc cccaacacca tgctctttgc ctcagaggcc 1140
tgtgtgggct ccaagttctg ggagcagagt gtgcggctag gctcctggga tcgagggatg 1200
cagtacagcc acagcatcat cacgaacctc ctgtaccatg tggtcggctg gaccgactgg 1260
aaccttgccc tgaaccccga aggaggaccc aattgggtgc gtaactttgt cgacagtccc 1320
atcattgtag acatcaccaa ggacacgttt tacaaacagc ccatgttcta ccaccttggc 1380
cacttcagca agttcattcc tgagggctcc cagagagtgg ggctggttgc cagtcagaag 1440
aacgacctgg acgcagtggc actgatgcat cccgatggct ctgctgttgt ggtcgtgcta 1500
aaccgctcct ctaaggatgt gcctcttacc atcaaggatc ctgctgtggg cttcctggag 1560
acaatctcac ctggctactc cattcacacc tacctgtggc gtcgccagtg a 1611
<210> 164
<211> 952
<212> PRT
<213> Homo sapiens
<400> 164
Met Gly Val Arg His Pro Pro Cys Ser His Arg Leu Leu Ala Val Cys
1 5 10 15
Ala Leu Val Ser Leu Ala Thr Ala Ala Leu Leu Gly His Ile Leu Leu
20 25 30
His Asp Phe Leu Leu Val Pro Arg Glu Leu Ser Gly Ser Ser Pro Val
35 40 45
Leu Glu Glu Thr His Pro Ala His Gln Gln Gly Ala Ser Arg Pro Gly
50 55 60
Pro Arg Asp Ala Gln Ala His Pro Gly Arg Pro Arg Ala Val Pro Thr
65 70 75 80
Gln Cys Asp Val Pro Pro Asn Ser Arg Phe Asp Cys Ala Pro Asp Lys
85 90 95
Ala Ile Thr Gln Glu Gln Cys Glu Ala Arg Gly Cys Cys Tyr Ile Pro
100 105 110
Ala Lys Gln Gly Leu Gln Gly Ala Gln Met Gly Gln Pro Trp Cys Phe
115 120 125
Phe Pro Pro Ser Tyr Pro Ser Tyr Lys Leu Glu Asn Leu Ser Ser Ser
130 135 140
Glu Met Gly Tyr Thr Ala Thr Leu Thr Arg Thr Thr Pro Thr Phe Phe
145 150 155 160
Pro Lys Asp Ile Leu Thr Leu Arg Leu Asp Val Met Met Glu Thr Glu
165 170 175
Asn Arg Leu His Phe Thr Ile Lys Asp Pro Ala Asn Arg Arg Tyr Glu
180 185 190
Val Pro Leu Glu Thr Pro His Val His Ser Arg Ala Pro Ser Pro Leu
195 200 205
Tyr Ser Val Glu Phe Ser Glu Glu Pro Phe Gly Val Ile Val Arg Arg
210 215 220
Gln Leu Asp Gly Arg Val Leu Leu Asn Thr Thr Val Ala Pro Leu Phe
225 230 235 240
Phe Ala Asp Gln Phe Leu Gln Leu Ser Thr Ser Leu Pro Ser Gln Tyr
245 250 255
Ile Thr Gly Leu Ala Glu His Leu Ser Pro Leu Met Leu Ser Thr Ser
260 265 270
Trp Thr Arg Ile Thr Leu Trp Asn Arg Asp Leu Ala Pro Thr Pro Gly
275 280 285
Ala Asn Leu Tyr Gly Ser His Pro Phe Tyr Leu Ala Leu Glu Asp Gly
290 295 300
Gly Ser Ala His Gly Val Phe Leu Leu Asn Ser Asn Ala Met Asp Val
305 310 315 320
Val Leu Gln Pro Ser Pro Ala Leu Ser Trp Arg Ser Thr Gly Gly Ile
325 330 335
Leu Asp Val Tyr Ile Phe Leu Gly Pro Glu Pro Lys Ser Val Val Gln
340 345 350
Gln Tyr Leu Asp Val Val Gly Tyr Pro Phe Met Pro Pro Tyr Trp Gly
355 360 365
Leu Gly Phe His Leu Cys Arg Trp Gly Tyr Ser Ser Thr Ala Ile Thr
370 375 380
Arg Gln Val Val Glu Asn Met Thr Arg Ala His Phe Pro Leu Asp Val
385 390 395 400
Gln Trp Asn Asp Leu Asp Tyr Met Asp Ser Arg Arg Asp Phe Thr Phe
405 410 415
Asn Lys Asp Gly Phe Arg Asp Phe Pro Ala Met Val Gln Glu Leu His
420 425 430
Gln Gly Gly Arg Arg Tyr Met Met Ile Val Asp Pro Ala Ile Ser Ser
435 440 445
Ser Gly Pro Ala Gly Ser Tyr Arg Pro Tyr Asp Glu Gly Leu Arg Arg
450 455 460
Gly Val Phe Ile Thr Asn Glu Thr Gly Gln Pro Leu Ile Gly Lys Val
465 470 475 480
Trp Pro Gly Ser Thr Ala Phe Pro Asp Phe Thr Asn Pro Thr Ala Leu
485 490 495
Ala Trp Trp Glu Asp Met Val Ala Glu Phe His Asp Gln Val Pro Phe
500 505 510
Asp Gly Met Trp Ile Asp Met Asn Glu Pro Ser Asn Phe Ile Arg Gly
515 520 525
Ser Glu Asp Gly Cys Pro Asn Asn Glu Leu Glu Asn Pro Pro Tyr Val
530 535 540
Pro Gly Val Val Gly Gly Thr Leu Gln Ala Ala Thr Ile Cys Ala Ser
545 550 555 560
Ser His Gln Phe Leu Ser Thr His Tyr Asn Leu His Asn Leu Tyr Gly
565 570 575
Leu Thr Glu Ala Ile Ala Ser His Arg Ala Leu Val Lys Ala Arg Gly
580 585 590
Thr Arg Pro Phe Val Ile Ser Arg Ser Thr Phe Ala Gly His Gly Arg
595 600 605
Tyr Ala Gly His Trp Thr Gly Asp Val Trp Ser Ser Trp Glu Gln Leu
610 615 620
Ala Ser Ser Val Pro Glu Ile Leu Gln Phe Asn Leu Leu Gly Val Pro
625 630 635 640
Leu Val Gly Ala Asp Val Cys Gly Phe Leu Gly Asn Thr Ser Glu Glu
645 650 655
Leu Cys Val Arg Trp Thr Gln Leu Gly Ala Phe Tyr Pro Phe Met Arg
660 665 670
Asn His Asn Ser Leu Leu Ser Leu Pro Gln Glu Pro Tyr Ser Phe Ser
675 680 685
Glu Pro Ala Gln Gln Ala Met Arg Lys Ala Leu Thr Leu Arg Tyr Ala
690 695 700
Leu Leu Pro His Leu Tyr Thr Leu Phe His Gln Ala His Val Ala Gly
705 710 715 720
Glu Thr Val Ala Arg Pro Leu Phe Leu Glu Phe Pro Lys Asp Ser Ser
725 730 735
Thr Trp Thr Val Asp His Gln Leu Leu Trp Gly Glu Ala Leu Leu Ile
740 745 750
Thr Pro Val Leu Gln Ala Gly Lys Ala Glu Val Thr Gly Tyr Phe Pro
755 760 765
Leu Gly Thr Trp Tyr Asp Leu Gln Thr Val Pro Val Glu Ala Leu Gly
770 775 780
Ser Leu Pro Pro Pro Pro Ala Ala Pro Arg Glu Pro Ala Ile His Ser
785 790 795 800
Glu Gly Gln Trp Val Thr Leu Pro Ala Pro Leu Asp Thr Ile Asn Val
805 810 815
His Leu Arg Ala Gly Tyr Ile Ile Pro Leu Gln Gly Pro Gly Leu Thr
820 825 830
Thr Thr Glu Ser Arg Gln Gln Pro Met Ala Leu Ala Val Ala Leu Thr
835 840 845
Lys Gly Gly Glu Ala Arg Gly Glu Leu Phe Trp Asp Asp Gly Glu Ser
850 855 860
Leu Glu Val Leu Glu Arg Gly Ala Tyr Thr Gln Val Ile Phe Leu Ala
865 870 875 880
Arg Asn Asn Thr Ile Val Asn Glu Leu Val Arg Val Thr Ser Glu Gly
885 890 895
Ala Gly Leu Gln Leu Gln Lys Val Thr Val Leu Gly Val Ala Thr Ala
900 905 910
Pro Gln Gln Val Leu Ser Asn Gly Val Pro Val Ser Asn Phe Thr Tyr
915 920 925
Ser Pro Asp Thr Lys Val Leu Asp Ile Cys Val Ser Leu Leu Met Gly
930 935 940
Glu Gln Phe Leu Val Ser Trp Cys
945 950
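The listing above uses the three-letter residue notation with position markers on alternate lines. As a reading aid, the following minimal Python sketch collapses such text into the one-letter notation. The function name `listing_to_one_letter` and the sample string are illustrative assumptions, not part of the patent; only the 20 canonical residues are mapped, and lines made up solely of digits are treated as position markers.

```python
# Standard three-letter to one-letter amino acid codes (20 canonical residues).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def listing_to_one_letter(listing: str) -> str:
    """Collapse sequence-listing text into a one-letter string.

    Hypothetical helper: residue lines are converted, while lines that
    contain only digits (position markers) are skipped.
    """
    residues = []
    for line in listing.splitlines():
        tokens = line.split()
        # Skip blank lines and position-marker lines such as "945 950".
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        for token in tokens:
            # Raises KeyError for any token outside the 20-residue table.
            residues.append(THREE_TO_ONE[token])
    return "".join(residues)


# The last two lines of the listing above, used as sample input.
tail = """Glu Gln Phe Leu Val Ser Trp Cys
945 950"""
print(listing_to_one_letter(tail))  # prints EQFLVSWC
```

A stricter version might also check that each position marker matches the running residue count, which this sketch does not attempt.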
Claims (64)
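- A composition comprising a vector comprising a sequence encoding a promoter, a first polynucleotide encoding a lysosomal enzyme, and a second polynucleotide encoding a modified GlcNAc-1 phosphotransferase (GlcNAc-1 PTase),
wherein the promoter is capable of driving expression in a mammalian cell, and wherein the promoter is operably linked to the first polynucleotide and the second polynucleotide.
- The composition of claim 1, wherein the vector further comprises a sequence encoding an Internal Ribosomal Entry Site (IRES).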
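- The composition of claim 2, wherein the sequence encoding the IRES is positioned between the sequence encoding the lysosomal enzyme and the sequence encoding the modified GlcNAc-1 PTase.
- The composition of claim 2 or 3, wherein the vector comprises, from 5' to 3', the sequence encoding the modified GlcNAc-1 PTase, the sequence encoding the IRES, and the sequence encoding the lysosomal enzyme.
- The composition of claim 2 or 3, wherein the vector comprises, from 5' to 3', the sequence encoding the lysosomal enzyme, the sequence encoding the IRES, and the sequence encoding the modified GlcNAc-1 PTase.
- The composition of claim 1, wherein the vector further comprises a sequence encoding a cleavage site.
- The composition of claim 6, wherein the cleavage site comprises a sequence encoding a 2A self-cleaving peptide.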
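- The composition of any one of claims 1 to 7, wherein the vector is an expression vector.
- The composition of any one of claims 1 to 7, wherein the vector is a delivery vector.
- The composition of any one of claims 1 to 9, wherein the vector is a non-viral vector.
- The composition of any one of claims 1 to 10, wherein the vector is a viral vector.
- The composition of claim 11, wherein the vector is a lentiviral vector.
- The composition of claim 11, wherein the vector is an adenoviral vector or an adeno-associated virus (AAV) vector.
- The composition of claim 13, wherein the AAV vector comprises a serotype selected from the group consisting of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8 and AAV9.
- The composition of claim 13 or 14, wherein the AAV vector comprises a sequence encoding a capsid isolated or derived from one or more serotypes selected from the group consisting of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8 and AAV9.
- The composition of any one of claims 13 to 15, wherein the AAV vector comprises a sequence encoding at least one inverted terminal repeat (ITR) isolated or derived from one or more serotypes selected from the group consisting of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8 and AAV9.
- The composition of any one of claims 1 to 16, wherein the vector is a bicistronic vector.
- The composition of any one of claims 1 to 16, wherein the vector is a multicistronic vector.
- The composition of any one of claims 1 to 18, wherein the promoter comprises a constitutive promoter.
- The composition of claim 19, wherein the constitutive promoter comprises a cytomegalovirus (CMV) promoter.
- The composition of any one of claims 1 to 20, wherein the vector comprises the nucleic acid sequence of SEQ ID NO: 1.
- The composition of any one of claims 1 to 21, wherein the polynucleotide encoding the modified GlcNAc-1 phosphotransferase comprises the nucleic acid sequence of SEQ ID NO: 4.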
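- The composition of any one of claims 1 to 22, wherein the lysosomal enzyme is involved in at least one lysosomal storage disorder (LSD) listed in Table 1A, Table 1B or Table 1C.
- The composition of claim 23, wherein the lysosomal enzyme comprises at least one lysosomal enzyme listed in Table 1A, Table 1B or Table 1C.
- The composition of any one of claims 1 to 21 or 24, wherein the lysosomal enzyme is selected from the group consisting of β-glucocerebrosidase (GBA), galactosylceramidase (GALC), α-galactosidase (GLA), α-N-acetylglucosaminidase (NAGLU), acid α-glucosidase (GAA) and lysosomal acid α-mannosidase (LAMAN).
- The composition of any one of claims 1 to 21 or 24, wherein the lysosomal enzyme comprises β-glucocerebrosidase (GBA).
- The composition of claim 26, wherein the polynucleotide encoding the lysosomal enzyme comprises the nucleic acid sequence of SEQ ID NO: 5.
- The composition of any one of claims 1 to 21 or 24, wherein the lysosomal enzyme comprises galactosylceramidase (GALC).
- The composition of claim 28, wherein the polynucleotide encoding the lysosomal enzyme comprises the nucleic acid sequence of SEQ ID NO: 6.
- The composition of claim 29, wherein the polynucleotide encoding the lysosomal enzyme comprises the nucleic acid sequence of SEQ ID NO: 23.
- The composition of any one of claims 1 to 21 or 24, wherein the lysosomal enzyme comprises α-galactosidase (GLA).
- The composition of claim 31, wherein the polynucleotide encoding the lysosomal enzyme comprises the nucleic acid sequence of SEQ ID NO: 7.
- The composition of any one of claims 1 to 21 or 24, wherein the lysosomal enzyme comprises α-N-acetylglucosaminidase (NAGLU).
- The composition of claim 33, wherein the polynucleotide encoding the lysosomal enzyme comprises the nucleic acid sequence of SEQ ID NO: 8.
- The composition of any one of claims 1 to 21 or 24, wherein the lysosomal enzyme comprises acid α-glucosidase (GAA).
- The composition of claim 35, wherein the polynucleotide encoding the lysosomal enzyme comprises the nucleic acid sequence of SEQ ID NO: 9.
- The composition of any one of claims 1 to 21 or 24, wherein the lysosomal enzyme comprises lysosomal acid α-mannosidase (LAMAN).
- The composition of claim 37, wherein the polynucleotide encoding the lysosomal enzyme comprises the nucleic acid sequence of SEQ ID NO: 10.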
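- A method of treating a lysosomal storage disorder (LSD),
the method comprising administering to a subject an effective amount of the composition of any one of claims 1 to 38,
wherein the composition increases phosphorylation of a lysosomal enzyme involved in the LSD, thereby treating the LSD.
- The method of claim 39, wherein the subject exhibits a sign or symptom of the LSD.
- The method of claim 39 or 40, wherein the subject has been diagnosed with the LSD.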
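- A method of preventing the development or onset of a lysosomal storage disorder (LSD),
the method comprising administering to a subject an effective amount of the composition of any one of claims 1 to 38, wherein the composition increases phosphorylation of a lysosomal enzyme involved in the LSD, thereby preventing the development of the LSD in the subject.
- The method of claim 42, wherein the subject is at risk of the development or onset of the LSD.
- The method of claim 42 or 43, wherein the subject exhibits a sign or symptom of the LSD.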
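- A method of improving phosphorylation of a lysosomal enzyme involved in a lysosomal storage disorder (LSD),
the method comprising administering to a subject an effective amount of the composition of any one of claims 1 to 38, wherein the composition increases phosphorylation of the lysosomal enzyme.
- The method of claim 45, wherein the subject exhibits a sign or symptom of the LSD.
- The method of claim 45 or 46, wherein the subject is at risk of the development or onset of the LSD.
- The method of claim 45 or 46, wherein the subject has been diagnosed with the LSD.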
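- A method of improving phosphorylation of a lysosomal enzyme involved in a lysosomal storage disorder (LSD),
the method comprising contacting a cell with an effective amount of the composition according to any one of claims 1 to 38, wherein the composition increases phosphorylation of the lysosomal enzyme.
- The method of claim 49, wherein the cell is in vitro or ex vivo.
- The method of claim 49, wherein the cell is in vivo.
- The method of any one of claims 49 to 51, wherein the subject comprises the cell.
- The method of claim 52, wherein the subject exhibits a sign or symptom of the LSD.
- The method of claim 52 or 53, wherein the subject is at risk of the development or onset of the LSD.
- The method of claim 52 or 53, wherein the subject has been diagnosed with the LSD.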
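- The method of any one of claims 39 to 55, wherein the lysosomal enzyme is involved in at least one lysosomal storage disorder (LSD) listed in Table 1A, Table 1B or Table 1C.
- The method of any one of claims 39 to 56, wherein the lysosomal enzyme is at least one lysosomal enzyme listed in Table 1A, Table 1B or Table 1C.
- The method of any one of claims 39 to 56, wherein the lysosomal enzyme comprises one or more of β-glucocerebrosidase (GBA), galactosylceramidase (GALC), α-galactosidase (GLA), α-N-acetylglucosaminidase (NAGLU), acid α-glucosidase (GAA) and lysosomal acid α-mannosidase (LAMAN).
- The method of any one of claims 39 to 58, wherein the administration comprises a systemic route of administration.
- The method of claim 59, wherein the systemic route of administration is enteral, parenteral, oral, intramuscular (IM), subcutaneous (SC), intravenous (IV), intra-arterial (IA), intrathecal, intraspinal or intraventricular.
- The method of any one of claims 39 to 58, wherein the administration comprises a local route of administration.
- The method of any one of claims 39 to 61, wherein the subject is a human.
- The method of any one of claims 39 to 62, wherein the subject is male.
- The method of any one of claims 39 to 62, wherein the subject is female.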
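The two 5'-to-3' arrangements recited in claims 4 and 5, and the positional requirement of claim 3 (the IRES lying between the two coding sequences), can be illustrated with a small Python sketch. This is only a toy model under stated assumptions: the `Cassette` class, the element labels and the placement of the promoter at the 5' end are hypothetical and are not taken from the claims, which require only that the promoter be operably linked to both polynucleotides.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Cassette:
    """Hypothetical model of a vector cassette as labels in 5'->3' order."""
    elements: Tuple[str, ...]

    def ires_between_coding_sequences(self) -> bool:
        """True if the IRES lies between the enzyme and PTase sequences."""
        e = self.elements
        if not all(label in e for label in ("IRES", "ENZYME", "PTASE")):
            return False
        first, second = sorted((e.index("ENZYME"), e.index("PTASE")))
        return first < e.index("IRES") < second


# Order of claim 4: modified GlcNAc-1 PTase, IRES, lysosomal enzyme.
# (Promoter shown at the 5' end as an assumption for illustration.)
ptase_first = Cassette(("PROMOTER", "PTASE", "IRES", "ENZYME"))

# Order of claim 5: lysosomal enzyme, IRES, modified GlcNAc-1 PTase.
enzyme_first = Cassette(("PROMOTER", "ENZYME", "IRES", "PTASE"))

print(ptase_first.ires_between_coding_sequences())   # True
print(enzyme_first.ires_between_coding_sequences())  # True
```

Both orders satisfy the positional check, which reflects that claims 4 and 5 are alternative arrangements that each place the IRES between the two coding sequences.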
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201962869808P | 2019-07-02 | 2019-07-02 | |
US201962869781P | 2019-07-02 | 2019-07-02 | |
US62/869,781 | 2019-07-02 | | |
US62/869,808 | 2019-07-02 | | |
PCT/US2020/040770 WO2021003442A1 (en) | 2019-07-02 | 2020-07-02 | Vector compositions and methods of using same for treatment of lysosomal storage disorders |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20220069917A (ko) | 2022-05-27 |
Family
ID=72046993
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020227002362A KR20220069917A (ko) | 2019-07-02 | 2020-07-02 | Vector compositions and methods of using same for treatment of lysosomal storage disorders |
Country Status (10)
Country | Link |
---|---|
US (1) | US20220380800A1 (ko) |
EP (1) | EP3994253A1 (ko) |
JP (1) | JP2022538497A (ko) |
KR (1) | KR20220069917A (ko) |
CN (1) | CN114616000A (ko) |
AU (1) | AU2020298575A1 (ko) |
BR (1) | BR112021026866A2 (ko) |
CA (1) | CA3145662A1 (ko) |
TW (1) | TW202122579A (ko) |
WO (1) | WO2021003442A1 (ko) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113238053B (zh) * | 2021-04-30 | 2022-05-13 | West China Hospital, Sichuan University (四川大学华西医院) | Plasmid for detecting STAT3 dimerization |
WO2023150051A1 (en) * | 2022-02-04 | 2023-08-10 | M6P Therapeutics, Inc. | Compositions and methods of using two-promoter vector for treatment of lysosomal storage disorders |
WO2023150388A1 (en) * | 2022-02-07 | 2023-08-10 | M6P Therapeutics, Inc. | Compositions comprising glucocerebrocidase and methods of use thereof |
Family Cites Families (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5703055A (en) | 1989-03-21 | 1997-12-30 | Wisconsin Alumni Research Foundation | Generation of antibodies through lipid mediated DNA delivery |
US5399346A (en) | 1989-06-14 | 1995-03-21 | The United States Of America As Represented By The Department Of Health And Human Services | Gene therapy |
US5585362A (en) | 1989-08-22 | 1996-12-17 | The Regents Of The University Of Michigan | Adenovirus vectors for gene therapy |
US5350674A (en) | 1992-09-04 | 1994-09-27 | Becton, Dickinson And Company | Intrinsic factor - horse peroxidase conjugates and a method for increasing the stability thereof |
US5478745A (en) | 1992-12-04 | 1995-12-26 | University Of Pittsburgh | Recombinant viral vector system |
US6156303A (en) | 1997-06-11 | 2000-12-05 | University Of Washington | Adeno-associated virus (AAV) isolates and AAV vectors derived therefrom |
WO2001029058A1 (en) | 1999-10-15 | 2001-04-26 | University Of Massachusetts | Rna interference pathway genes as tools for targeted genetic interference |
US6326193B1 (en) | 1999-11-05 | 2001-12-04 | Cambria Biosciences, Llc | Insect control agent |
WO2001096584A2 (en) | 2000-06-12 | 2001-12-20 | Akkadix Corporation | Materials and methods for the control of nematodes |
NZ578982A (en) | 2001-11-13 | 2011-03-31 | Univ Pennsylvania | A method of detecting and/or identifying adeno-associated virus (AAV) sequences and isolating novel sequences identified thereby |
ES2975413T3 (es) | 2001-12-17 | 2024-07-05 | Univ Pennsylvania | Adeno-associated virus (AAV) serotype 8 sequences, vectors containing them, and uses thereof |
US6905856B2 (en) * | 2001-12-21 | 2005-06-14 | Genzyme Glycobiology Research Institute, Inc. | Soluble GlcNAc phosphotransferase |
ES2648241T3 (es) | 2003-09-30 | 2017-12-29 | The Trustees Of The University Of Pennsylvania | Adeno-associated virus (AAV) clades, sequences, vectors containing the same, and uses thereof |
SI2561069T1 (sl) * | 2010-04-23 | 2017-07-31 | Alexion Pharmaceuticals, Inc. | Lysosomal storage disease enzyme |
CA2956469A1 (en) * | 2014-08-11 | 2016-02-18 | Shire Human Genetic Therapies, Inc. | Mannose-6-phosphate bearing peptides fused to lysosomal enzymes |
CA3038598A1 (en) * | 2016-09-30 | 2018-04-05 | Washington University | Compositions comprising a modified glcnac-1-phosphotransferase and methods of use thereof |
-
2020
- 2020-07-02 CA CA3145662A patent/CA3145662A1/en active Pending
- 2020-07-02 JP JP2022500533A patent/JP2022538497A/ja active Pending
- 2020-07-02 WO PCT/US2020/040770 patent/WO2021003442A1/en unknown
- 2020-07-02 BR BR112021026866A patent/BR112021026866A2/pt unknown
- 2020-07-02 KR KR1020227002362A patent/KR20220069917A/ko unknown
- 2020-07-02 EP EP20754414.9A patent/EP3994253A1/en active Pending
- 2020-07-02 TW TW109122470A patent/TW202122579A/zh unknown
- 2020-07-02 AU AU2020298575A patent/AU2020298575A1/en active Pending
- 2020-07-02 CN CN202080061537.2A patent/CN114616000A/zh active Pending
- 2020-07-02 US US17/624,196 patent/US20220380800A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
AU2020298575A1 (en) | 2022-02-24 |
EP3994253A1 (en) | 2022-05-11 |
US20220380800A1 (en) | 2022-12-01 |
WO2021003442A1 (en) | 2021-01-07 |
BR112021026866A2 (pt) | 2022-03-03 |
TW202122579A (zh) | 2021-06-16 |
JP2022538497A (ja) | 2022-09-02 |
CA3145662A1 (en) | 2021-01-07 |
CN114616000A (zh) | 2022-06-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2022201329B2 (en) | | Genetically modified cells, tissues, and organs for treating disease |
CN114176043B (zh) | | Genetically modified cells, tissues, and organs for treating disease |
AU2023274083A1 (en) | | Compositions and methods for treating non-age-associated hearing impairment in a human subject |
AU2022200903B2 (en) | | Engineered Cascade components and Cascade complexes |
KR20200044793A (ko) | | Compositions and methods for delivery of AAV |
KR20220069917A (ko) | | Vector compositions and methods of using same for treatment of lysosomal storage disorders |
KR20200126997A (ko) | | Compositions and methods for treating non-age-associated hearing impairment in a human subject |
KR20210143897A (ko) | | Integration of nucleic acid constructs into eukaryotic cells with a transposase from Oryzias |
KR20220098384A (ko) | | Therapeutic adeno-associated virus comprising a liver-specific promoter for treating Pompe disease and lysosomal disorders |
KR20220038362A (ko) | | Recombinant Ad35 vectors and related gene therapy improvements |
KR20210068068A (ko) | | Frataxin expression constructs having engineered promoters and methods of use thereof |
KR20210144861A (ko) | | Transposition of nucleic acid constructs into eukaryotic genomes with a transposase from Amyelois |
KR20220157944A (ko) | | Compositions and methods for treating non-age-associated hearing impairment in a human subject |
KR20230093241A (ko) | | Compositions and methods for the treatment of neurological disorders associated with glucosylceramidase beta deficiency |
US20030032791A1 (en) | | Novel melanocortin-4 receptor sequences and screening assays to identify compounds useful in regulating animal appetite and metabolic rate |
CN112639108A (zh) | | Methods of treating non-syndromic sensorineural hearing loss |
KR20220033468A (ko) | | Cells, tissues, organs, and/or animals having one or more modified genes for enhanced xenograft survival and/or tolerance |
US6410236B1 (en) | | Correcting diastolic dysfunction in heart failure |
US20230295668A1 (en) | | Methods and compositions for integration of a dna construct |
CN113874512A (zh) | | Compositions and methods for inducing hair cell differentiation |
KR20230173074A (ko) | | Cells, tissues, organs, and animals having one or more modified genes for enhanced xenograft survival and tolerance |
RU2774631C1 (ru) | | Engineered Cascade components and Cascade complexes |
RU2827658C2 (ru) | | Engineered Cascade components and Cascade complexes |
US20230064326A1 (en) | | OPTOGENETIC COMPOSITIONS COMPRISING A CBh PROMOTER SEQUENCE AND METHODS FOR USE |
RU2817770C2 (ru) | | Integration of nucleic acid constructs into eukaryotic cells with a transposase from Oryzias |