TW202242124A - 靶向t細胞之aav載體 - Google Patents
靶向t細胞之aav載體 Download PDFInfo
- Publication number
- TW202242124A TW202242124A TW111101402A TW111101402A TW202242124A TW 202242124 A TW202242124 A TW 202242124A TW 111101402 A TW111101402 A TW 111101402A TW 111101402 A TW111101402 A TW 111101402A TW 202242124 A TW202242124 A TW 202242124A
- Authority
- TW
- Taiwan
- Prior art keywords
- gly
- pro
- ser
- ala
- asn
- Prior art date
Links
- 210000001744 T-lymphocyte Anatomy 0.000 title claims abstract description 80
- 239000013598 vector Substances 0.000 title claims abstract description 53
- 230000008685 targeting Effects 0.000 title description 25
- 210000004027 cell Anatomy 0.000 claims abstract description 200
- 108090000565 Capsid Proteins Proteins 0.000 claims abstract description 192
- 102100023321 Ceruloplasmin Human genes 0.000 claims abstract description 192
- 238000010361 transduction Methods 0.000 claims abstract description 139
- 230000026683 transduction Effects 0.000 claims abstract description 137
- 238000000034 method Methods 0.000 claims abstract description 66
- 241000702421 Dependoparvovirus Species 0.000 claims abstract description 17
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 207
- 239000013607 AAV vector Substances 0.000 claims description 128
- 150000007523 nucleic acids Chemical class 0.000 claims description 117
- 102000039446 nucleic acids Human genes 0.000 claims description 101
- 108020004707 nucleic acids Proteins 0.000 claims description 101
- 150000001413 amino acids Chemical class 0.000 claims description 66
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 61
- 108090000623 proteins and genes Proteins 0.000 claims description 58
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 40
- 230000014509 gene expression Effects 0.000 claims description 40
- 102000004169 proteins and genes Human genes 0.000 claims description 25
- 238000006467 substitution reaction Methods 0.000 claims description 20
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 19
- 230000001225 therapeutic effect Effects 0.000 claims description 17
- 238000000338 in vitro Methods 0.000 claims description 13
- 238000001727 in vivo Methods 0.000 claims description 13
- 239000000203 mixture Substances 0.000 claims description 13
- 238000011282 treatment Methods 0.000 claims description 10
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 9
- 239000013604 expression vector Substances 0.000 claims description 9
- 239000003937 drug carrier Substances 0.000 claims description 8
- 239000002253 acid Substances 0.000 claims description 6
- 239000003814 drug Substances 0.000 claims description 6
- 239000008194 pharmaceutical composition Substances 0.000 claims description 5
- 241000124008 Mammalia Species 0.000 claims description 4
- 125000003277 amino group Chemical group 0.000 claims description 3
- 150000001412 amines Chemical class 0.000 claims description 2
- 210000000805 cytoplasm Anatomy 0.000 claims 1
- 210000000234 capsid Anatomy 0.000 abstract description 141
- 241000700605 Viruses Species 0.000 abstract description 41
- 239000013603 viral vector Substances 0.000 description 124
- 235000001014 amino acid Nutrition 0.000 description 77
- 102000004196 processed proteins & peptides Human genes 0.000 description 77
- 230000003612 virological effect Effects 0.000 description 64
- 229940024606 amino acid Drugs 0.000 description 63
- 206010028980 Neoplasm Diseases 0.000 description 58
- 229920001184 polypeptide Polymers 0.000 description 52
- 241000972680 Adeno-associated virus - 6 Species 0.000 description 49
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 39
- 201000011510 cancer Diseases 0.000 description 38
- 201000010099 disease Diseases 0.000 description 35
- 108010050848 glycylleucine Proteins 0.000 description 35
- 230000002163 immunogen Effects 0.000 description 35
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 33
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 28
- 238000009126 molecular therapy Methods 0.000 description 28
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 24
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 23
- 108010079364 N-glycylalanine Proteins 0.000 description 23
- 108010057821 leucylproline Proteins 0.000 description 23
- 210000001519 tissue Anatomy 0.000 description 23
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 22
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 21
- 241000125945 Protoparvovirus Species 0.000 description 21
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 21
- 239000002245 particle Substances 0.000 description 21
- 108010051242 phenylalanylserine Proteins 0.000 description 21
- 235000018102 proteins Nutrition 0.000 description 21
- 108010061238 threonyl-glycine Proteins 0.000 description 21
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 20
- 125000000539 amino acid group Chemical group 0.000 description 20
- 239000000427 antigen Substances 0.000 description 20
- 108091007433 antigens Proteins 0.000 description 20
- 102000036639 antigens Human genes 0.000 description 20
- 108010015796 prolylisoleucine Proteins 0.000 description 20
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 19
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 19
- 108010087823 glycyltyrosine Proteins 0.000 description 19
- 241000702423 Adeno-associated virus - 2 Species 0.000 description 18
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 18
- 108091030071 RNAI Proteins 0.000 description 17
- 230000009368 gene silencing by RNA Effects 0.000 description 17
- 239000005090 green fluorescent protein Substances 0.000 description 17
- 208000015181 infectious disease Diseases 0.000 description 17
- 241000580270 Adeno-associated virus - 4 Species 0.000 description 16
- 241001634120 Adeno-associated virus - 5 Species 0.000 description 16
- 208000002267 Anti-neutrophil cytoplasmic antibody-associated vasculitis Diseases 0.000 description 16
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 16
- 108010077245 asparaginyl-proline Proteins 0.000 description 16
- 241000701161 unidentified adenovirus Species 0.000 description 16
- -1 α-neoenkephalin Substances 0.000 description 16
- 108700019146 Transgenes Proteins 0.000 description 15
- 230000004048 modification Effects 0.000 description 15
- 238000012986 modification Methods 0.000 description 15
- 241001164825 Adeno-associated virus - 8 Species 0.000 description 14
- 108020004414 DNA Proteins 0.000 description 14
- 229920002971 Heparan sulfate Polymers 0.000 description 14
- 230000028993 immune response Effects 0.000 description 14
- 108010077112 prolyl-proline Proteins 0.000 description 14
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 13
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 13
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 13
- 238000003780 insertion Methods 0.000 description 13
- 230000037431 insertion Effects 0.000 description 13
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 13
- 241001164823 Adeno-associated virus - 7 Species 0.000 description 12
- 241000649046 Adeno-associated virus 11 Species 0.000 description 12
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 12
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 12
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 12
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 12
- AYZAWXAPBAYCHO-CIUDSAMLSA-N Asn-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N AYZAWXAPBAYCHO-CIUDSAMLSA-N 0.000 description 12
- UFAQGGZUXVLONR-AVGNSLFASA-N Asp-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)O UFAQGGZUXVLONR-AVGNSLFASA-N 0.000 description 12
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 12
- BDHUXUFYNUOUIT-SRVKXCTJSA-N His-Asp-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BDHUXUFYNUOUIT-SRVKXCTJSA-N 0.000 description 12
- ARRIJPQRBWRNLT-DCAQKATOSA-N Leu-Met-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ARRIJPQRBWRNLT-DCAQKATOSA-N 0.000 description 12
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 12
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 12
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 12
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 12
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 12
- VPRHDRKAPYZMHL-SZMVWBNQSA-N Trp-Leu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 VPRHDRKAPYZMHL-SZMVWBNQSA-N 0.000 description 12
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 12
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 12
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 12
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 12
- 108010070944 alanylhistidine Proteins 0.000 description 12
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 12
- 108010089804 glycyl-threonine Proteins 0.000 description 12
- 108010025306 histidylleucine Proteins 0.000 description 12
- 230000001939 inductive effect Effects 0.000 description 12
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 12
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 12
- 210000002706 plastid Anatomy 0.000 description 12
- 102000005962 receptors Human genes 0.000 description 12
- 108020003175 receptors Proteins 0.000 description 12
- 230000010076 replication Effects 0.000 description 12
- 210000002845 virion Anatomy 0.000 description 12
- 241001655883 Adeno-associated virus - 1 Species 0.000 description 11
- 241000649045 Adeno-associated virus 10 Species 0.000 description 11
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 11
- BHTBAVZSZCQZPT-GUBZILKMSA-N Ala-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N BHTBAVZSZCQZPT-GUBZILKMSA-N 0.000 description 11
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 11
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 11
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 11
- RDLYUKRPEJERMM-XIRDDKMYSA-N Asn-Trp-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O RDLYUKRPEJERMM-XIRDDKMYSA-N 0.000 description 11
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 11
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 11
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 11
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 11
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 11
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 11
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 11
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 11
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 11
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 11
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 11
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 11
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 11
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 11
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 11
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 11
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 11
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 11
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 11
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 11
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 11
- UNLYPPYNDXHGDG-IHRRRGAJSA-N Phe-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UNLYPPYNDXHGDG-IHRRRGAJSA-N 0.000 description 11
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 11
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 11
- 108010079005 RDV peptide Proteins 0.000 description 11
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 11
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 11
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 11
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 11
- GCXFWAZRHBRYEM-NUMRIWBASA-N Thr-Gln-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O GCXFWAZRHBRYEM-NUMRIWBASA-N 0.000 description 11
- YRSOERSDNRSCBC-XIRDDKMYSA-N Trp-His-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CS)C(=O)O)N YRSOERSDNRSCBC-XIRDDKMYSA-N 0.000 description 11
- CYDVHRFXDMDMGX-KKUMJFAQSA-N Tyr-Asn-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O CYDVHRFXDMDMGX-KKUMJFAQSA-N 0.000 description 11
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 11
- BYAKMYBZADCNMN-JYJNAYRXSA-N Tyr-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYAKMYBZADCNMN-JYJNAYRXSA-N 0.000 description 11
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 11
- 108010092854 aspartyllysine Proteins 0.000 description 11
- 230000027455 binding Effects 0.000 description 11
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 11
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 11
- 108010077515 glycylproline Proteins 0.000 description 11
- 108010040030 histidinoalanine Proteins 0.000 description 11
- 108010092114 histidylphenylalanine Proteins 0.000 description 11
- 108010085325 histidylproline Proteins 0.000 description 11
- 108010034529 leucyl-lysine Proteins 0.000 description 11
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 11
- 230000000670 limiting effect Effects 0.000 description 11
- 238000004519 manufacturing process Methods 0.000 description 11
- 238000004806 packaging method and process Methods 0.000 description 11
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 11
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 11
- 230000010415 tropism Effects 0.000 description 11
- 108010045269 tryptophyltryptophan Proteins 0.000 description 11
- 241000649047 Adeno-associated virus 12 Species 0.000 description 10
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 10
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 10
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 10
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 10
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 10
- ASQYTJJWAMDISW-BPUTZDHNSA-N Arg-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N ASQYTJJWAMDISW-BPUTZDHNSA-N 0.000 description 10
- JQFJNGVSGOUQDH-XIRDDKMYSA-N Arg-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JQFJNGVSGOUQDH-XIRDDKMYSA-N 0.000 description 10
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 10
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 10
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 10
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 10
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 10
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 10
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 10
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 10
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 10
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 10
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 10
- 241000283690 Bos taurus Species 0.000 description 10
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 10
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 10
- IVCOYUURLWQDJQ-LPEHRKFASA-N Gln-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O IVCOYUURLWQDJQ-LPEHRKFASA-N 0.000 description 10
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 10
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 10
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 10
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 10
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 10
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 10
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 10
- QSQXZZCGPXQBPP-BQBZGAKWSA-N Gly-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)CN)C(=O)N[C@@H](CS)C(=O)O QSQXZZCGPXQBPP-BQBZGAKWSA-N 0.000 description 10
- AASLOGQZZKZWKH-SRVKXCTJSA-N His-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AASLOGQZZKZWKH-SRVKXCTJSA-N 0.000 description 10
- VUUFXXGKMPLKNH-BZSNNMDCSA-N His-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N VUUFXXGKMPLKNH-BZSNNMDCSA-N 0.000 description 10
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 10
- JTBFQNHKNRZJDS-SYWGBEHUSA-N Ile-Trp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C)C(=O)O)N JTBFQNHKNRZJDS-SYWGBEHUSA-N 0.000 description 10
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 10
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 10
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 10
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 10
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 10
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 10
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 10
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 10
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 10
- OTKQHDPECKUDSB-SZMVWBNQSA-N Met-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 OTKQHDPECKUDSB-SZMVWBNQSA-N 0.000 description 10
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 10
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 10
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 10
- VGVCNKSUVSZEIE-IHRRRGAJSA-N Pro-Phe-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O VGVCNKSUVSZEIE-IHRRRGAJSA-N 0.000 description 10
- GNADVDLLGVSXLS-ULQDDVLXSA-N Pro-Phe-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GNADVDLLGVSXLS-ULQDDVLXSA-N 0.000 description 10
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 10
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 10
- DMNANGOFEUVBRV-GJZGRUSLSA-N Pro-Trp-Gly Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(=O)O)C(=O)[C@@H]1CCCN1 DMNANGOFEUVBRV-GJZGRUSLSA-N 0.000 description 10
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 10
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 10
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 10
- YMAWDPHQVABADW-CIUDSAMLSA-N Ser-Gln-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O YMAWDPHQVABADW-CIUDSAMLSA-N 0.000 description 10
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 10
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 10
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 10
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 10
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 10
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 10
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 10
- QNTBGBCOEYNAPV-CWRNSKLLSA-N Trp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O QNTBGBCOEYNAPV-CWRNSKLLSA-N 0.000 description 10
- UJRIVCPPPMYCNA-HOCLYGCPSA-N Trp-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UJRIVCPPPMYCNA-HOCLYGCPSA-N 0.000 description 10
- HKIUVWMZYFBIHG-KKUMJFAQSA-N Tyr-Arg-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O HKIUVWMZYFBIHG-KKUMJFAQSA-N 0.000 description 10
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 10
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 10
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 10
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 10
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 10
- MGVYZTPLGXPVQB-CYDGBPFRSA-N Val-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MGVYZTPLGXPVQB-CYDGBPFRSA-N 0.000 description 10
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 10
- 108010069495 cysteinyltyrosine Proteins 0.000 description 10
- 238000012217 deletion Methods 0.000 description 10
- 230000037430 deletion Effects 0.000 description 10
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 10
- 210000002027 skeletal muscle Anatomy 0.000 description 10
- 241000202702 Adeno-associated virus - 3 Species 0.000 description 9
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 9
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 9
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 9
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 9
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 9
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 9
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 9
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 9
- PIWWUBYJNONVTJ-ZLUOBGJFSA-N Asn-Asp-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N PIWWUBYJNONVTJ-ZLUOBGJFSA-N 0.000 description 9
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 9
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 9
- HBHMVBGGHDMPBF-GARJFASQSA-N Cys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N HBHMVBGGHDMPBF-GARJFASQSA-N 0.000 description 9
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 9
- ULXXDWZMMSQBDC-ACZMJKKPSA-N Gln-Asp-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ULXXDWZMMSQBDC-ACZMJKKPSA-N 0.000 description 9
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 9
- MFORDNZDKAVNSR-SRVKXCTJSA-N Gln-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O MFORDNZDKAVNSR-SRVKXCTJSA-N 0.000 description 9
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 9
- UBRQJXFDVZNYJP-AVGNSLFASA-N Gln-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UBRQJXFDVZNYJP-AVGNSLFASA-N 0.000 description 9
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 9
- PMSDOVISAARGAV-FHWLQOOXSA-N Glu-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 PMSDOVISAARGAV-FHWLQOOXSA-N 0.000 description 9
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 9
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 9
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 9
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 9
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 9
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 9
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 9
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 9
- HVCRQRQPIIRNLY-IUCAKERBSA-N His-Gln-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N HVCRQRQPIIRNLY-IUCAKERBSA-N 0.000 description 9
- 241000725303 Human immunodeficiency virus Species 0.000 description 9
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 9
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 9
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 9
- 108010065920 Insulin Lispro Proteins 0.000 description 9
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 9
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 9
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 9
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 9
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 9
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 9
- GNLJXWBNLAIPEP-MELADBBJSA-N Lys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCCN)N)C(=O)O GNLJXWBNLAIPEP-MELADBBJSA-N 0.000 description 9
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 9
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 9
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 9
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 9
- HZLSUXCMSIBCRV-RVMXOQNASA-N Met-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N HZLSUXCMSIBCRV-RVMXOQNASA-N 0.000 description 9
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 9
- HXSUFWQYLPKEHF-IHRRRGAJSA-N Phe-Asn-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HXSUFWQYLPKEHF-IHRRRGAJSA-N 0.000 description 9
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 9
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 9
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 9
- FKFCKDROTNIVSO-JYJNAYRXSA-N Phe-Pro-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O FKFCKDROTNIVSO-JYJNAYRXSA-N 0.000 description 9
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 9
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 9
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 9
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 9
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 9
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 9
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 9
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 9
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 9
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 9
- GFDUZZACIWNMPE-KZVJFYERSA-N Thr-Ala-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O GFDUZZACIWNMPE-KZVJFYERSA-N 0.000 description 9
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 9
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 9
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 9
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 9
- GTNCSPKYWCJZAC-XIRDDKMYSA-N Trp-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GTNCSPKYWCJZAC-XIRDDKMYSA-N 0.000 description 9
- YXONONCLMLHWJX-SZMVWBNQSA-N Trp-Glu-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 YXONONCLMLHWJX-SZMVWBNQSA-N 0.000 description 9
- NOFFAYIYPAUNRM-HKUYNNGSSA-N Trp-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC2=CNC3=CC=CC=C32)N NOFFAYIYPAUNRM-HKUYNNGSSA-N 0.000 description 9
- SEXRBCGSZRCIPE-LYSGOOTNSA-N Trp-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O SEXRBCGSZRCIPE-LYSGOOTNSA-N 0.000 description 9
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 9
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 9
- YLRLHDFMMWDYTK-KKUMJFAQSA-N Tyr-Cys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 YLRLHDFMMWDYTK-KKUMJFAQSA-N 0.000 description 9
- QSFJHIRIHOJRKS-ULQDDVLXSA-N Tyr-Leu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QSFJHIRIHOJRKS-ULQDDVLXSA-N 0.000 description 9
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 9
- OKDNSNWJEXAMSU-IRXDYDNUSA-N Tyr-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 OKDNSNWJEXAMSU-IRXDYDNUSA-N 0.000 description 9
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 9
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 9
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 9
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 9
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 9
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 9
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 9
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 9
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 9
- 238000001415 gene therapy Methods 0.000 description 9
- 108010003700 lysyl aspartic acid Proteins 0.000 description 9
- 239000000047 product Substances 0.000 description 9
- 208000024891 symptom Diseases 0.000 description 9
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 8
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 8
- YWLDTBBUHZJQHW-KKUMJFAQSA-N Asp-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N YWLDTBBUHZJQHW-KKUMJFAQSA-N 0.000 description 8
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 8
- 241000271566 Aves Species 0.000 description 8
- 101710132601 Capsid protein Proteins 0.000 description 8
- 101710197658 Capsid protein VP1 Proteins 0.000 description 8
- 108010019670 Chimeric Antigen Receptors Proteins 0.000 description 8
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 8
- 101710118046 RNA-directed RNA polymerase Proteins 0.000 description 8
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 8
- 241000713311 Simian immunodeficiency virus Species 0.000 description 8
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 8
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 8
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 8
- 101710108545 Viral protein 1 Proteins 0.000 description 8
- 210000000188 diaphragm Anatomy 0.000 description 8
- 230000000694 effects Effects 0.000 description 8
- 230000006870 function Effects 0.000 description 8
- 210000004185 liver Anatomy 0.000 description 8
- 230000002265 prevention Effects 0.000 description 8
- 208000011580 syndromic disease Diseases 0.000 description 8
- 230000009885 systemic effect Effects 0.000 description 8
- 238000012546 transfer Methods 0.000 description 8
- 102400000068 Angiostatin Human genes 0.000 description 7
- 108010079709 Angiostatins Proteins 0.000 description 7
- 102000004127 Cytokines Human genes 0.000 description 7
- 108090000695 Cytokines Proteins 0.000 description 7
- 108010069091 Dystrophin Proteins 0.000 description 7
- 102000004627 Iduronidase Human genes 0.000 description 7
- 108010003381 Iduronidase Proteins 0.000 description 7
- ORRNBLTZBBESPN-HJWJTTGWSA-N Met-Ile-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ORRNBLTZBBESPN-HJWJTTGWSA-N 0.000 description 7
- 102000005789 Vascular Endothelial Growth Factors Human genes 0.000 description 7
- 108010019530 Vascular Endothelial Growth Factors Proteins 0.000 description 7
- FZCSTZYAHCUGEM-UHFFFAOYSA-N aspergillomarasmine B Natural products OC(=O)CNC(C(O)=O)CNC(C(O)=O)CC(O)=O FZCSTZYAHCUGEM-UHFFFAOYSA-N 0.000 description 7
- 230000008901 benefit Effects 0.000 description 7
- 230000007812 deficiency Effects 0.000 description 7
- 239000003623 enhancer Substances 0.000 description 7
- 239000012634 fragment Substances 0.000 description 7
- 210000003734 kidney Anatomy 0.000 description 7
- 210000004165 myocardium Anatomy 0.000 description 7
- 108010059929 phospholamban Proteins 0.000 description 7
- 230000001681 protective effect Effects 0.000 description 7
- 238000012384 transportation and delivery Methods 0.000 description 7
- 241001529453 unidentified herpesvirus Species 0.000 description 7
- 241000405344 Adeno-associated dependoparvovirus A Species 0.000 description 6
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 6
- 102000001039 Dystrophin Human genes 0.000 description 6
- 102100039939 Growth/differentiation factor 8 Human genes 0.000 description 6
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 6
- 108010056852 Myostatin Proteins 0.000 description 6
- 230000000692 anti-sense effect Effects 0.000 description 6
- 210000004556 brain Anatomy 0.000 description 6
- 230000001086 cytosolic effect Effects 0.000 description 6
- 230000036039 immunity Effects 0.000 description 6
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 6
- 102000006495 integrins Human genes 0.000 description 6
- 108010044426 integrins Proteins 0.000 description 6
- 102000005681 phospholamban Human genes 0.000 description 6
- 102000040430 polynucleotide Human genes 0.000 description 6
- 108091033319 polynucleotide Proteins 0.000 description 6
- 239000002157 polynucleotide Substances 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 210000000130 stem cell Anatomy 0.000 description 6
- 241000701931 Canine parvovirus Species 0.000 description 5
- 108091007741 Chimeric antigen receptor T cells Proteins 0.000 description 5
- 102000003951 Erythropoietin Human genes 0.000 description 5
- 108090000394 Erythropoietin Proteins 0.000 description 5
- HTTJABKRGRZYRN-UHFFFAOYSA-N Heparin Chemical compound OC1C(NC(=O)C)C(O)OC(COS(O)(=O)=O)C1OC1C(OS(O)(=O)=O)C(O)C(OC2C(C(OS(O)(=O)=O)C(OC3C(C(O)C(O)C(O3)C(O)=O)OS(O)(=O)=O)C(CO)O2)NS(O)(=O)=O)C(C(O)=O)O1 HTTJABKRGRZYRN-UHFFFAOYSA-N 0.000 description 5
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 5
- 101710081079 Minor spike protein H Proteins 0.000 description 5
- 238000013459 approach Methods 0.000 description 5
- 229940105423 erythropoietin Drugs 0.000 description 5
- 229930182470 glycoside Natural products 0.000 description 5
- 239000003102 growth factor Substances 0.000 description 5
- 210000002216 heart Anatomy 0.000 description 5
- 229960002897 heparin Drugs 0.000 description 5
- 210000004072 lung Anatomy 0.000 description 5
- 239000002773 nucleotide Substances 0.000 description 5
- 125000003729 nucleotide group Chemical group 0.000 description 5
- OXCMYAYHXIHQOA-UHFFFAOYSA-N potassium;[2-butyl-5-chloro-3-[[4-[2-(1,2,4-triaza-3-azanidacyclopenta-1,4-dien-5-yl)phenyl]phenyl]methyl]imidazol-4-yl]methanol Chemical compound [K+].CCCCC1=NC(Cl)=C(CO)N1CC1=CC=C(C=2C(=CC=CC=2)C2=N[N-]N=N2)C=C1 OXCMYAYHXIHQOA-UHFFFAOYSA-N 0.000 description 5
- 230000001105 regulatory effect Effects 0.000 description 5
- FSYKKLYZXJSNPZ-UHFFFAOYSA-N sarcosine Chemical compound C[NH2+]CC([O-])=O FSYKKLYZXJSNPZ-UHFFFAOYSA-N 0.000 description 5
- 238000002560 therapeutic procedure Methods 0.000 description 5
- 230000002103 transcriptional effect Effects 0.000 description 5
- 241000712461 unidentified influenza virus Species 0.000 description 5
- 102400001047 Endostatin Human genes 0.000 description 4
- 108010079505 Endostatins Proteins 0.000 description 4
- 208000024720 Fabry Disease Diseases 0.000 description 4
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 4
- 108010017544 Glucosylceramidase Proteins 0.000 description 4
- 102000004547 Glucosylceramidase Human genes 0.000 description 4
- 108090000723 Insulin-Like Growth Factor I Proteins 0.000 description 4
- 102000000588 Interleukin-2 Human genes 0.000 description 4
- 108010002350 Interleukin-2 Proteins 0.000 description 4
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 4
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 4
- 101710096786 Lysosomal acid alpha-glucosidase Proteins 0.000 description 4
- 206010027476 Metastases Diseases 0.000 description 4
- 108010025020 Nerve Growth Factor Proteins 0.000 description 4
- 101710163270 Nuclease Proteins 0.000 description 4
- 206010045261 Type IIa hyperlipidaemia Diseases 0.000 description 4
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 description 4
- 230000003110 anti-inflammatory effect Effects 0.000 description 4
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 4
- 108010060035 arginylproline Proteins 0.000 description 4
- UCMIRNVEIXFBKS-UHFFFAOYSA-N beta-alanine Chemical compound NCCC(O)=O UCMIRNVEIXFBKS-UHFFFAOYSA-N 0.000 description 4
- 230000001413 cellular effect Effects 0.000 description 4
- 230000002950 deficient Effects 0.000 description 4
- 208000035475 disorder Diseases 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 210000002919 epithelial cell Anatomy 0.000 description 4
- 210000003743 erythrocyte Anatomy 0.000 description 4
- 208000007345 glycogen storage disease Diseases 0.000 description 4
- 210000003494 hepatocyte Anatomy 0.000 description 4
- 230000002458 infectious effect Effects 0.000 description 4
- 230000002401 inhibitory effect Effects 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 210000003205 muscle Anatomy 0.000 description 4
- 208000008338 non-alcoholic fatty liver disease Diseases 0.000 description 4
- 210000000056 organ Anatomy 0.000 description 4
- 210000000496 pancreas Anatomy 0.000 description 4
- 108010012581 phenylalanylglutamate Proteins 0.000 description 4
- 108010031719 prolyl-serine Proteins 0.000 description 4
- 108010029020 prolylglycine Proteins 0.000 description 4
- 108010053725 prolylvaline Proteins 0.000 description 4
- 239000013608 rAAV vector Substances 0.000 description 4
- 230000009467 reduction Effects 0.000 description 4
- 238000012216 screening Methods 0.000 description 4
- 239000007787 solid Substances 0.000 description 4
- 239000007858 starting material Substances 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- 230000002463 transducing effect Effects 0.000 description 4
- 230000009261 transgenic effect Effects 0.000 description 4
- 241000701447 unidentified baculovirus Species 0.000 description 4
- 239000003981 vehicle Substances 0.000 description 4
- 241000958487 Adeno-associated virus 3B Species 0.000 description 3
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 3
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 3
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical group NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 3
- IGFJVXOATGZTHD-UHFFFAOYSA-N Arg-Phe-His Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccccc1)C(=O)NC(Cc2c[nH]cn2)C(=O)O IGFJVXOATGZTHD-UHFFFAOYSA-N 0.000 description 3
- 239000004475 Arginine Substances 0.000 description 3
- HUZGPXBILPMCHM-IHRRRGAJSA-N Asn-Arg-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HUZGPXBILPMCHM-IHRRRGAJSA-N 0.000 description 3
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 3
- JRVABKHPWDRUJF-UBHSHLNASA-N Asn-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N JRVABKHPWDRUJF-UBHSHLNASA-N 0.000 description 3
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 3
- 102100026189 Beta-galactosidase Human genes 0.000 description 3
- 241000701922 Bovine parvovirus Species 0.000 description 3
- 102100022641 Coagulation factor IX Human genes 0.000 description 3
- 108091026890 Coding region Proteins 0.000 description 3
- XLLSMEFANRROJE-GUBZILKMSA-N Cys-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XLLSMEFANRROJE-GUBZILKMSA-N 0.000 description 3
- 201000003883 Cystic fibrosis Diseases 0.000 description 3
- 102000053602 DNA Human genes 0.000 description 3
- 102100025907 Dyslexia-associated protein KIAA0319-like protein Human genes 0.000 description 3
- 102100031780 Endonuclease Human genes 0.000 description 3
- 108010042407 Endonucleases Proteins 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- 241000283073 Equus caballus Species 0.000 description 3
- 206010016654 Fibrosis Diseases 0.000 description 3
- 208000015872 Gaucher disease Diseases 0.000 description 3
- 102000034615 Glial cell line-derived neurotrophic factor Human genes 0.000 description 3
- 108091010837 Glial cell line-derived neurotrophic factor Proteins 0.000 description 3
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 3
- 208000032007 Glycogen storage disease due to acid maltase deficiency Diseases 0.000 description 3
- 206010053185 Glycogen storage disease type II Diseases 0.000 description 3
- 102000003886 Glycoproteins Human genes 0.000 description 3
- 108090000288 Glycoproteins Proteins 0.000 description 3
- 208000005176 Hepatitis C Diseases 0.000 description 3
- 241000175212 Herpesvirales Species 0.000 description 3
- PLCAEMGSYOYIPP-GUBZILKMSA-N His-Ser-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 PLCAEMGSYOYIPP-GUBZILKMSA-N 0.000 description 3
- 241000702617 Human parvovirus B19 Species 0.000 description 3
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 3
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 3
- 108090001061 Insulin Proteins 0.000 description 3
- 102000004877 Insulin Human genes 0.000 description 3
- 102000003996 Interferon-beta Human genes 0.000 description 3
- 108090000467 Interferon-beta Proteins 0.000 description 3
- 102000014150 Interferons Human genes 0.000 description 3
- 108010050904 Interferons Proteins 0.000 description 3
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 3
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 3
- 108010001831 LDL receptors Proteins 0.000 description 3
- 206010023927 Lassa fever Diseases 0.000 description 3
- 241000880493 Leptailurus serval Species 0.000 description 3
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 3
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 3
- 102100024640 Low-density lipoprotein receptor Human genes 0.000 description 3
- 206010025323 Lymphomas Diseases 0.000 description 3
- 102000004083 Lymphotoxin-alpha Human genes 0.000 description 3
- 108090000542 Lymphotoxin-alpha Proteins 0.000 description 3
- 102100033448 Lysosomal alpha-glucosidase Human genes 0.000 description 3
- 206010056886 Mucopolysaccharidosis I Diseases 0.000 description 3
- 108090000189 Neuropeptides Proteins 0.000 description 3
- 208000018737 Parkinson disease Diseases 0.000 description 3
- DJPXNKUDJKGQEE-BZSNNMDCSA-N Phe-Asp-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DJPXNKUDJKGQEE-BZSNNMDCSA-N 0.000 description 3
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 3
- 102000005262 Sulfatase Human genes 0.000 description 3
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 3
- 102100024276 Transcription factor SOX-3 Human genes 0.000 description 3
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 3
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 3
- 108020005202 Viral DNA Proteins 0.000 description 3
- 108010047495 alanylglycine Proteins 0.000 description 3
- QWCKQJZIFLGMSD-UHFFFAOYSA-N alpha-aminobutyric acid Chemical compound CCC(N)C(O)=O QWCKQJZIFLGMSD-UHFFFAOYSA-N 0.000 description 3
- 108010008355 arginyl-glutamine Proteins 0.000 description 3
- 230000009286 beneficial effect Effects 0.000 description 3
- 108010005774 beta-Galactosidase Proteins 0.000 description 3
- SQVRNKJHWKZAKO-UHFFFAOYSA-N beta-N-Acetyl-D-neuraminic acid Natural products CC(=O)NC1C(O)CC(O)(C(O)=O)OC1C(O)C(O)CO SQVRNKJHWKZAKO-UHFFFAOYSA-N 0.000 description 3
- 150000001720 carbohydrates Chemical class 0.000 description 3
- 235000014633 carbohydrates Nutrition 0.000 description 3
- 210000000349 chromosome Anatomy 0.000 description 3
- 210000004748 cultured cell Anatomy 0.000 description 3
- 206010012601 diabetes mellitus Diseases 0.000 description 3
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 3
- 230000004064 dysfunction Effects 0.000 description 3
- 210000002889 endothelial cell Anatomy 0.000 description 3
- 229940088598 enzyme Drugs 0.000 description 3
- 230000004761 fibrosis Effects 0.000 description 3
- 238000000684 flow cytometry Methods 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- 108010049041 glutamylalanine Proteins 0.000 description 3
- 201000004502 glycogen storage disease II Diseases 0.000 description 3
- 108010015792 glycyllysine Proteins 0.000 description 3
- 108010037850 glycylvaline Proteins 0.000 description 3
- 208000002672 hepatitis B Diseases 0.000 description 3
- 229940125396 insulin Drugs 0.000 description 3
- 229940079322 interferon Drugs 0.000 description 3
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 3
- 208000019423 liver disease Diseases 0.000 description 3
- 230000036210 malignancy Effects 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 230000009401 metastasis Effects 0.000 description 3
- 238000002703 mutagenesis Methods 0.000 description 3
- 231100000350 mutagenesis Toxicity 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 210000002569 neuron Anatomy 0.000 description 3
- 230000036961 partial effect Effects 0.000 description 3
- 230000001718 repressive effect Effects 0.000 description 3
- SQVRNKJHWKZAKO-OQPLDHBCSA-N sialic acid Chemical compound CC(=O)N[C@@H]1[C@@H](O)C[C@@](O)(C(O)=O)OC1[C@H](O)[C@H](O)CO SQVRNKJHWKZAKO-OQPLDHBCSA-N 0.000 description 3
- 108060007951 sulfatase Proteins 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- 238000002054 transplantation Methods 0.000 description 3
- 229960005486 vaccine Drugs 0.000 description 3
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 2
- FUOOLUPWFVMBKG-UHFFFAOYSA-N 2-Aminoisobutyric acid Chemical compound CC(C)(N)C(O)=O FUOOLUPWFVMBKG-UHFFFAOYSA-N 0.000 description 2
- RDFMDVXONNIGBC-UHFFFAOYSA-N 2-aminoheptanoic acid Chemical compound CCCCCC(N)C(O)=O RDFMDVXONNIGBC-UHFFFAOYSA-N 0.000 description 2
- 108010046716 3-Methyl-2-Oxobutanoate Dehydrogenase (Lipoamide) Proteins 0.000 description 2
- PECYZEOJVXMISF-UHFFFAOYSA-N 3-aminoalanine Chemical compound [NH3+]CC(N)C([O-])=O PECYZEOJVXMISF-UHFFFAOYSA-N 0.000 description 2
- 101710169336 5'-deoxyadenosine deaminase Proteins 0.000 description 2
- 108010059616 Activins Proteins 0.000 description 2
- 102000005606 Activins Human genes 0.000 description 2
- 102100036664 Adenosine deaminase Human genes 0.000 description 2
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 2
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 2
- ZEXDYVGDZJBRMO-ACZMJKKPSA-N Ala-Asn-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZEXDYVGDZJBRMO-ACZMJKKPSA-N 0.000 description 2
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 2
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 2
- 208000024827 Alzheimer disease Diseases 0.000 description 2
- 208000031277 Amaurotic familial idiocy Diseases 0.000 description 2
- VDBKFYYIBLXEIF-GUBZILKMSA-N Arg-Gln-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VDBKFYYIBLXEIF-GUBZILKMSA-N 0.000 description 2
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 2
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 2
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 2
- 102100031491 Arylsulfatase B Human genes 0.000 description 2
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 2
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 2
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 2
- VBVKSAFJPVXMFJ-CIUDSAMLSA-N Asp-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N VBVKSAFJPVXMFJ-CIUDSAMLSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 2
- LLRJPYJQNBMOOO-QEJZJMRPSA-N Asp-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N LLRJPYJQNBMOOO-QEJZJMRPSA-N 0.000 description 2
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 2
- 101000588395 Bacillus subtilis (strain 168) Beta-hexosaminidase Proteins 0.000 description 2
- 102000006734 Beta-Globulins Human genes 0.000 description 2
- 108010087504 Beta-Globulins Proteins 0.000 description 2
- 102100022548 Beta-hexosaminidase subunit alpha Human genes 0.000 description 2
- 102000007350 Bone Morphogenetic Proteins Human genes 0.000 description 2
- 108010007726 Bone Morphogenetic Proteins Proteins 0.000 description 2
- 208000003174 Brain Neoplasms Diseases 0.000 description 2
- 206010006187 Breast cancer Diseases 0.000 description 2
- 208000026310 Breast neoplasm Diseases 0.000 description 2
- 108091033409 CRISPR Proteins 0.000 description 2
- 108010051834 CTTHWGFTLC peptide Proteins 0.000 description 2
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 2
- 208000024172 Cardiovascular disease Diseases 0.000 description 2
- 108090000994 Catalytic RNA Proteins 0.000 description 2
- 102000053642 Catalytic RNA Human genes 0.000 description 2
- 102000000844 Cell Surface Receptors Human genes 0.000 description 2
- 108010001857 Cell Surface Receptors Proteins 0.000 description 2
- 206010009944 Colon cancer Diseases 0.000 description 2
- 102000007644 Colony-Stimulating Factors Human genes 0.000 description 2
- 108010071942 Colony-Stimulating Factors Proteins 0.000 description 2
- XIZWKXATMJODQW-KKUMJFAQSA-N Cys-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N XIZWKXATMJODQW-KKUMJFAQSA-N 0.000 description 2
- 102000016607 Diphtheria Toxin Human genes 0.000 description 2
- 108010053187 Diphtheria Toxin Proteins 0.000 description 2
- 206010013801 Duchenne Muscular Dystrophy Diseases 0.000 description 2
- 241001115402 Ebolavirus Species 0.000 description 2
- 101800003838 Epidermal growth factor Proteins 0.000 description 2
- 102400001368 Epidermal growth factor Human genes 0.000 description 2
- 108010076282 Factor IX Proteins 0.000 description 2
- 108010054218 Factor VIII Proteins 0.000 description 2
- 102000001690 Factor VIII Human genes 0.000 description 2
- 201000003542 Factor VIII deficiency Diseases 0.000 description 2
- 102000018233 Fibroblast Growth Factor Human genes 0.000 description 2
- 108050007372 Fibroblast Growth Factor Proteins 0.000 description 2
- 102000016970 Follistatin Human genes 0.000 description 2
- 108010014612 Follistatin Proteins 0.000 description 2
- 102000006575 G-Protein-Coupled Receptor Kinases Human genes 0.000 description 2
- 108010008959 G-Protein-Coupled Receptor Kinases Proteins 0.000 description 2
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 2
- HVQCEQTUSWWFOS-WDSKDSINSA-N Gln-Gly-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N HVQCEQTUSWWFOS-WDSKDSINSA-N 0.000 description 2
- PODFFOWWLUPNMN-DCAQKATOSA-N Gln-His-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PODFFOWWLUPNMN-DCAQKATOSA-N 0.000 description 2
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 2
- FALJZCPMTGJOHX-SRVKXCTJSA-N Gln-Met-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O FALJZCPMTGJOHX-SRVKXCTJSA-N 0.000 description 2
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 2
- KVQOVQVGVKDZNW-GUBZILKMSA-N Gln-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KVQOVQVGVKDZNW-GUBZILKMSA-N 0.000 description 2
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 2
- 102000053187 Glucuronidase Human genes 0.000 description 2
- 108010060309 Glucuronidase Proteins 0.000 description 2
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 2
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 2
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 2
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 229920002683 Glycosaminoglycan Polymers 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 101710154606 Hemagglutinin Proteins 0.000 description 2
- 102100021519 Hemoglobin subunit beta Human genes 0.000 description 2
- 208000009292 Hemophilia A Diseases 0.000 description 2
- 206010019860 Hereditary angioedema Diseases 0.000 description 2
- 102000016871 Hexosaminidase A Human genes 0.000 description 2
- 108010053317 Hexosaminidase A Proteins 0.000 description 2
- 208000017604 Hodgkin disease Diseases 0.000 description 2
- 208000021519 Hodgkin lymphoma Diseases 0.000 description 2
- 208000010747 Hodgkins lymphoma Diseases 0.000 description 2
- 101001076904 Homo sapiens Dyslexia-associated protein KIAA0319-like protein Proteins 0.000 description 2
- 101000599951 Homo sapiens Insulin-like growth factor I Proteins 0.000 description 2
- 101001076292 Homo sapiens Insulin-like growth factor II Proteins 0.000 description 2
- 101000685712 Homo sapiens Protein S100-A1 Proteins 0.000 description 2
- 101100369992 Homo sapiens TNFSF10 gene Proteins 0.000 description 2
- 208000030673 Homozygous familial hypercholesterolemia Diseases 0.000 description 2
- 244000309467 Human Coronavirus Species 0.000 description 2
- 108010091358 Hypoxanthine Phosphoribosyltransferase Proteins 0.000 description 2
- 102100029098 Hypoxanthine-guanine phosphoribosyltransferase Human genes 0.000 description 2
- 108010053927 Iduronate Sulfatase Proteins 0.000 description 2
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 2
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 2
- 208000026350 Inborn Genetic disease Diseases 0.000 description 2
- 102000004218 Insulin-Like Growth Factor I Human genes 0.000 description 2
- 102100037852 Insulin-like growth factor I Human genes 0.000 description 2
- 102100025947 Insulin-like growth factor II Human genes 0.000 description 2
- 102000006992 Interferon-alpha Human genes 0.000 description 2
- 108010047761 Interferon-alpha Proteins 0.000 description 2
- 102000008070 Interferon-gamma Human genes 0.000 description 2
- 108010074328 Interferon-gamma Proteins 0.000 description 2
- 102000004388 Interleukin-4 Human genes 0.000 description 2
- 108090000978 Interleukin-4 Proteins 0.000 description 2
- 102000015696 Interleukins Human genes 0.000 description 2
- 108010063738 Interleukins Proteins 0.000 description 2
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 2
- 241000710842 Japanese encephalitis virus Species 0.000 description 2
- 208000008839 Kidney Neoplasms Diseases 0.000 description 2
- JUQLUIFNNFIIKC-YFKPBYRVSA-N L-2-aminopimelic acid Chemical compound OC(=O)[C@@H](N)CCCCC(O)=O JUQLUIFNNFIIKC-YFKPBYRVSA-N 0.000 description 2
- AHLPHDHHMVZTML-BYPYZUCNSA-N L-Ornithine Chemical compound NCCC[C@H](N)C(O)=O AHLPHDHHMVZTML-BYPYZUCNSA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- FFFHZYDWPBMWHY-VKHMYHEASA-N L-homocysteine Chemical compound OC(=O)[C@@H](N)CCS FFFHZYDWPBMWHY-VKHMYHEASA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- SNDPXSYFESPGGJ-UHFFFAOYSA-N L-norVal-OH Natural products CCCC(N)C(O)=O SNDPXSYFESPGGJ-UHFFFAOYSA-N 0.000 description 2
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical compound CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 2
- 208000031671 Large B-Cell Diffuse Lymphoma Diseases 0.000 description 2
- 241000270322 Lepidosauria Species 0.000 description 2
- 208000009625 Lesch-Nyhan syndrome Diseases 0.000 description 2
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 2
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 2
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 2
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 2
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 2
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 2
- IPTUBUUIFRZMJK-ACRUOGEOSA-N Lys-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 IPTUBUUIFRZMJK-ACRUOGEOSA-N 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- 208000015439 Lysosomal storage disease Diseases 0.000 description 2
- 241001115401 Marburgvirus Species 0.000 description 2
- 102100022430 Melanocyte protein PMEL Human genes 0.000 description 2
- 101710151321 Melanostatin Proteins 0.000 description 2
- 102000012750 Membrane Glycoproteins Human genes 0.000 description 2
- 108010090054 Membrane Glycoproteins Proteins 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 102000015728 Mucins Human genes 0.000 description 2
- 108010063954 Mucins Proteins 0.000 description 2
- 206010056893 Mucopolysaccharidosis VII Diseases 0.000 description 2
- 206010048723 Multiple-drug resistance Diseases 0.000 description 2
- 102000016943 Muramidase Human genes 0.000 description 2
- 108010014251 Muramidase Proteins 0.000 description 2
- 102000003505 Myosin Human genes 0.000 description 2
- 108060008487 Myosin Proteins 0.000 description 2
- 108010027520 N-Acetylgalactosamine-4-Sulfatase Proteins 0.000 description 2
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- 102100023282 N-acetylglucosamine-6-sulfatase Human genes 0.000 description 2
- 108010023320 N-acetylglucosamine-6-sulfatase Proteins 0.000 description 2
- YPIGGYHFMKJNKV-UHFFFAOYSA-N N-ethylglycine Chemical compound CC[NH2+]CC([O-])=O YPIGGYHFMKJNKV-UHFFFAOYSA-N 0.000 description 2
- AKCRVYNORCOYQT-YFKPBYRVSA-N N-methyl-L-valine Chemical compound CN[C@@H](C(C)C)C(O)=O AKCRVYNORCOYQT-YFKPBYRVSA-N 0.000 description 2
- KSPIYJQBLVDRRI-UHFFFAOYSA-N N-methylisoleucine Chemical compound CCC(C)C(NC)C(O)=O KSPIYJQBLVDRRI-UHFFFAOYSA-N 0.000 description 2
- 102000015336 Nerve Growth Factor Human genes 0.000 description 2
- 102000007072 Nerve Growth Factors Human genes 0.000 description 2
- 102400000064 Neuropeptide Y Human genes 0.000 description 2
- 208000015914 Non-Hodgkin lymphomas Diseases 0.000 description 2
- AHLPHDHHMVZTML-UHFFFAOYSA-N Orn-delta-NH2 Natural products NCCCC(N)C(O)=O AHLPHDHHMVZTML-UHFFFAOYSA-N 0.000 description 2
- UTJLXEIPEHZYQJ-UHFFFAOYSA-N Ornithine Natural products OC(=O)C(C)CCCN UTJLXEIPEHZYQJ-UHFFFAOYSA-N 0.000 description 2
- 102000004132 Ornithine aminotransferases Human genes 0.000 description 2
- 108090000691 Ornithine aminotransferases Proteins 0.000 description 2
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 2
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 2
- 206010033128 Ovarian cancer Diseases 0.000 description 2
- 206010061535 Ovarian neoplasm Diseases 0.000 description 2
- 206010061902 Pancreatic neoplasm Diseases 0.000 description 2
- 108060005874 Parvalbumin Proteins 0.000 description 2
- 102000001675 Parvalbumin Human genes 0.000 description 2
- 241000701945 Parvoviridae Species 0.000 description 2
- 208000037581 Persistent Infection Diseases 0.000 description 2
- WFHRXJOZEXUKLV-IRXDYDNUSA-N Phe-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 WFHRXJOZEXUKLV-IRXDYDNUSA-N 0.000 description 2
- MYQCCQSMKNCNKY-KKUMJFAQSA-N Phe-His-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O)N MYQCCQSMKNCNKY-KKUMJFAQSA-N 0.000 description 2
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 2
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 2
- 201000011252 Phenylketonuria Diseases 0.000 description 2
- 108090000430 Phosphatidylinositol 3-kinases Proteins 0.000 description 2
- 102000003993 Phosphatidylinositol 3-kinases Human genes 0.000 description 2
- NQRYJNQNLNOLGT-UHFFFAOYSA-N Piperidine Chemical compound C1CCNCC1 NQRYJNQNLNOLGT-UHFFFAOYSA-N 0.000 description 2
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 2
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 2
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 2
- 101710176177 Protein A56 Proteins 0.000 description 2
- 102100023097 Protein S100-A1 Human genes 0.000 description 2
- 108010076504 Protein Sorting Signals Proteins 0.000 description 2
- 102000014128 RANK Ligand Human genes 0.000 description 2
- 108010025832 RANK Ligand Proteins 0.000 description 2
- 108020005067 RNA Splice Sites Proteins 0.000 description 2
- 102100030086 Receptor tyrosine-protein kinase erbB-2 Human genes 0.000 description 2
- 108700008625 Reporter Genes Proteins 0.000 description 2
- 206010039491 Sarcoma Diseases 0.000 description 2
- 108010077895 Sarcosine Proteins 0.000 description 2
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 2
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 2
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 2
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 2
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 2
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 2
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 2
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 2
- 241000270295 Serpentes Species 0.000 description 2
- 241000700584 Simplexvirus Species 0.000 description 2
- 108020004682 Single-Stranded DNA Proteins 0.000 description 2
- 201000001828 Sly syndrome Diseases 0.000 description 2
- 102000013275 Somatomedins Human genes 0.000 description 2
- 102000011971 Sphingomyelin Phosphodiesterase Human genes 0.000 description 2
- 108010061312 Sphingomyelin Phosphodiesterase Proteins 0.000 description 2
- 102000019197 Superoxide Dismutase Human genes 0.000 description 2
- 108010012715 Superoxide dismutase Proteins 0.000 description 2
- 101800001271 Surface protein Proteins 0.000 description 2
- 238000010459 TALEN Methods 0.000 description 2
- 108700012411 TNFSF10 Proteins 0.000 description 2
- 208000022292 Tay-Sachs disease Diseases 0.000 description 2
- UJQVSMNQMQHVRY-KZVJFYERSA-N Thr-Met-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UJQVSMNQMQHVRY-KZVJFYERSA-N 0.000 description 2
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 2
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 2
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 2
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 2
- 208000035317 Total hypoxanthine-guanine phosphoribosyl transferase deficiency Diseases 0.000 description 2
- WVHUFSCKCBQKJW-HKUYNNGSSA-N Trp-Gly-Tyr Chemical compound C([C@H](NC(=O)CNC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 WVHUFSCKCBQKJW-HKUYNNGSSA-N 0.000 description 2
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 2
- 102000000852 Tumor Necrosis Factor-alpha Human genes 0.000 description 2
- 102100024598 Tumor necrosis factor ligand superfamily member 10 Human genes 0.000 description 2
- 102100031988 Tumor necrosis factor ligand superfamily member 6 Human genes 0.000 description 2
- 108050002568 Tumor necrosis factor ligand superfamily member 6 Proteins 0.000 description 2
- NMKJPMCEKQHRPD-IRXDYDNUSA-N Tyr-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NMKJPMCEKQHRPD-IRXDYDNUSA-N 0.000 description 2
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 2
- LDKDSFQSEUOCOO-RPTUDFQQSA-N Tyr-Thr-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LDKDSFQSEUOCOO-RPTUDFQQSA-N 0.000 description 2
- 102000003425 Tyrosinase Human genes 0.000 description 2
- 108060008724 Tyrosinase Proteins 0.000 description 2
- 108091026823 U7 small nuclear RNA Proteins 0.000 description 2
- 208000007097 Urinary Bladder Neoplasms Diseases 0.000 description 2
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 description 2
- 208000002495 Uterine Neoplasms Diseases 0.000 description 2
- 206010046865 Vaccinia virus infection Diseases 0.000 description 2
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 2
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 2
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 2
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 2
- ZEBRMWPTJNHXAJ-JYJNAYRXSA-N Val-Phe-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O)N ZEBRMWPTJNHXAJ-JYJNAYRXSA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Chemical compound CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 101710185494 Zinc finger protein Proteins 0.000 description 2
- 102100023597 Zinc finger protein 816 Human genes 0.000 description 2
- 108020002494 acetyltransferase Proteins 0.000 description 2
- 102000005421 acetyltransferase Human genes 0.000 description 2
- 239000000488 activin Substances 0.000 description 2
- 208000009956 adenocarcinoma Diseases 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 102000015395 alpha 1-Antitrypsin Human genes 0.000 description 2
- 108010050122 alpha 1-Antitrypsin Proteins 0.000 description 2
- 229940024142 alpha 1-antitrypsin Drugs 0.000 description 2
- 108010030291 alpha-Galactosidase Proteins 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 208000007502 anemia Diseases 0.000 description 2
- 239000004037 angiogenesis inhibitor Substances 0.000 description 2
- 229940121369 angiogenesis inhibitor Drugs 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 206010003246 arthritis Diseases 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 208000036556 autosomal recessive T cell-negative B cell-negative NK cell-negative due to adenosine deaminase deficiency severe combined immunodeficiency Diseases 0.000 description 2
- 210000003719 b-lymphocyte Anatomy 0.000 description 2
- 229940000635 beta-alanine Drugs 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 210000001185 bone marrow Anatomy 0.000 description 2
- 229940112869 bone morphogenetic protein Drugs 0.000 description 2
- 210000004413 cardiac myocyte Anatomy 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 230000015861 cell surface binding Effects 0.000 description 2
- 210000003169 central nervous system Anatomy 0.000 description 2
- 208000029742 colonic neoplasm Diseases 0.000 description 2
- 239000013078 crystal Substances 0.000 description 2
- 210000000172 cytosol Anatomy 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 230000003111 delayed effect Effects 0.000 description 2
- 210000004443 dendritic cell Anatomy 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 206010012818 diffuse large B-cell lymphoma Diseases 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 238000001493 electron microscopy Methods 0.000 description 2
- 229940116977 epidermal growth factor Drugs 0.000 description 2
- 206010015037 epilepsy Diseases 0.000 description 2
- 108700020302 erbB-2 Genes Proteins 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 229960004222 factor ix Drugs 0.000 description 2
- 229960000301 factor viii Drugs 0.000 description 2
- 210000002950 fibroblast Anatomy 0.000 description 2
- 229930182830 galactose Natural products 0.000 description 2
- BTCSSZJGUNDROE-UHFFFAOYSA-N gamma-aminobutyric acid Chemical compound NCCCC(O)=O BTCSSZJGUNDROE-UHFFFAOYSA-N 0.000 description 2
- 229940044627 gamma-interferon Drugs 0.000 description 2
- 238000001476 gene delivery Methods 0.000 description 2
- 108091006104 gene-regulatory proteins Proteins 0.000 description 2
- 208000016361 genetic disease Diseases 0.000 description 2
- 238000010362 genome editing Methods 0.000 description 2
- 210000004602 germ cell Anatomy 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 150000002338 glycosides Chemical class 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- 239000000185 hemagglutinin Substances 0.000 description 2
- 210000003958 hematopoietic stem cell Anatomy 0.000 description 2
- 229920000669 heparin Polymers 0.000 description 2
- 208000006454 hepatitis Diseases 0.000 description 2
- 231100000283 hepatitis Toxicity 0.000 description 2
- 230000002519 immonomodulatory effect Effects 0.000 description 2
- 230000005847 immunogenicity Effects 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 210000004263 induced pluripotent stem cell Anatomy 0.000 description 2
- 239000003112 inhibitor Substances 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 229940028885 interleukin-4 Drugs 0.000 description 2
- 238000001990 intravenous administration Methods 0.000 description 2
- RGXCTRIQQODGIZ-UHFFFAOYSA-O isodesmosine Chemical compound OC(=O)C(N)CCCC[N+]1=CC(CCC(N)C(O)=O)=CC(CCC(N)C(O)=O)=C1CCCC(N)C(O)=O RGXCTRIQQODGIZ-UHFFFAOYSA-O 0.000 description 2
- 208000032839 leukemia Diseases 0.000 description 2
- 150000002632 lipids Chemical class 0.000 description 2
- 201000007270 liver cancer Diseases 0.000 description 2
- 210000005229 liver cell Anatomy 0.000 description 2
- 208000014018 liver neoplasm Diseases 0.000 description 2
- 201000005202 lung cancer Diseases 0.000 description 2
- 208000020816 lung neoplasm Diseases 0.000 description 2
- 230000002132 lysosomal effect Effects 0.000 description 2
- 229960000274 lysozyme Drugs 0.000 description 2
- 235000010335 lysozyme Nutrition 0.000 description 2
- 239000004325 lysozyme Substances 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 201000001441 melanoma Diseases 0.000 description 2
- 208000030159 metabolic disease Diseases 0.000 description 2
- 229910052751 metal Inorganic materials 0.000 description 2
- 239000002184 metal Substances 0.000 description 2
- 150000002739 metals Chemical class 0.000 description 2
- 230000001394 metastastic effect Effects 0.000 description 2
- 206010061289 metastatic neoplasm Diseases 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 201000002273 mucopolysaccharidosis II Diseases 0.000 description 2
- 208000022018 mucopolysaccharidosis type 2 Diseases 0.000 description 2
- 208000025919 mucopolysaccharidosis type 7 Diseases 0.000 description 2
- 210000000663 muscle cell Anatomy 0.000 description 2
- 230000001613 neoplastic effect Effects 0.000 description 2
- 229940053128 nerve growth factor Drugs 0.000 description 2
- 239000003900 neurotrophic factor Substances 0.000 description 2
- 206010053219 non-alcoholic steatohepatitis Diseases 0.000 description 2
- 108091027963 non-coding RNA Proteins 0.000 description 2
- 102000042567 non-coding RNA Human genes 0.000 description 2
- URPYMXQQVHTUDU-OFGSCBOVSA-N nucleopeptide y Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(N)=O)NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CNC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 URPYMXQQVHTUDU-OFGSCBOVSA-N 0.000 description 2
- 210000004940 nucleus Anatomy 0.000 description 2
- 229960003104 ornithine Drugs 0.000 description 2
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 230000003362 replicative effect Effects 0.000 description 2
- 230000000241 respiratory effect Effects 0.000 description 2
- 210000001525 retina Anatomy 0.000 description 2
- 230000002207 retinal effect Effects 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 108091092562 ribozyme Proteins 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- QFJCIRLUMZQUOT-HPLJOQBZSA-N sirolimus Chemical compound C1C[C@@H](O)[C@H](OC)C[C@@H]1C[C@@H](C)[C@H]1OC(=O)[C@@H]2CCCCN2C(=O)C(=O)[C@](O)(O2)[C@H](C)CC[C@H]2C[C@H](OC)/C(C)=C/C=C/C=C/[C@@H](C)C[C@@H](C)C(=O)[C@H](OC)[C@H](O)/C(C)=C/[C@@H](C)C(=O)C1 QFJCIRLUMZQUOT-HPLJOQBZSA-N 0.000 description 2
- 210000002363 skeletal muscle cell Anatomy 0.000 description 2
- 210000003491 skin Anatomy 0.000 description 2
- 210000002460 smooth muscle Anatomy 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 210000000952 spleen Anatomy 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- 230000004083 survival effect Effects 0.000 description 2
- 208000008732 thymoma Diseases 0.000 description 2
- 210000004881 tumor cell Anatomy 0.000 description 2
- VBEQCZHXXJYVRD-GACYYNSASA-N uroanthelone Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C(C)C)[C@@H](C)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)CNC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CS)NC(=O)CNC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O)C(C)C)[C@@H](C)CC)C1=CC=C(O)C=C1 VBEQCZHXXJYVRD-GACYYNSASA-N 0.000 description 2
- 206010046766 uterine cancer Diseases 0.000 description 2
- 238000002255 vaccination Methods 0.000 description 2
- 208000007089 vaccinia Diseases 0.000 description 2
- LYTCVQQGCSNFJU-LKGYBJPKSA-N α-bungarotoxin Chemical compound C(/[C@H]1O[C@H]2C[C@H]3O[C@@H](CC(=C)C=O)C[C@H](O)[C@]3(C)O[C@@H]2C[C@@H]1O[C@@H]1C2)=C/C[C@]1(C)O[C@H]1[C@@]2(C)O[C@]2(C)CC[C@@H]3O[C@@H]4C[C@]5(C)O[C@@H]6C(C)=CC(=O)O[C@H]6C[C@H]5O[C@H]4C[C@@H](C)[C@H]3O[C@H]2C1 LYTCVQQGCSNFJU-LKGYBJPKSA-N 0.000 description 2
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 description 1
- BJBUEDPLEOHJGE-UHFFFAOYSA-N (2R,3S)-3-Hydroxy-2-pyrolidinecarboxylic acid Natural products OC1CCNC1C(O)=O BJBUEDPLEOHJGE-UHFFFAOYSA-N 0.000 description 1
- DVVNPYAVFWSKAB-LURJTMIESA-N (2s)-2-(butylazaniumyl)propanoate Chemical group CCCCN[C@@H](C)C(O)=O DVVNPYAVFWSKAB-LURJTMIESA-N 0.000 description 1
- BVAUMRCGVHUWOZ-ZETCQYMHSA-N (2s)-2-(cyclohexylazaniumyl)propanoate Chemical compound OC(=O)[C@H](C)NC1CCCCC1 BVAUMRCGVHUWOZ-ZETCQYMHSA-N 0.000 description 1
- RWLSBXBFZHDHHX-VIFPVBQESA-N (2s)-2-(naphthalen-2-ylamino)propanoic acid Chemical compound C1=CC=CC2=CC(N[C@@H](C)C(O)=O)=CC=C21 RWLSBXBFZHDHHX-VIFPVBQESA-N 0.000 description 1
- MKSPBYRGLCNGRC-OEMOKZHXSA-N (2s)-2-[[(2s)-1-[(2s)-1-[(2s)-2-[[(2s)-2-[[(2s,3r)-2-[[(2s)-2-aminopropanoyl]amino]-3-hydroxybutanoyl]amino]-3-(1h-indol-3-yl)propanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carbonyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound O=C([C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)CC(C)C)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MKSPBYRGLCNGRC-OEMOKZHXSA-N 0.000 description 1
- PQFMROVJTOPVDF-JBDRJPRFSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-carboxypropanoyl]amino]-3-carboxypropanoyl]amino]-4-carboxybutanoyl]amino]butanedioic acid Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PQFMROVJTOPVDF-JBDRJPRFSA-N 0.000 description 1
- HOZBSSWDEKVXNO-BXRBKJIMSA-N (2s)-2-azanylbutanedioic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O.OC(=O)[C@@H](N)CC(O)=O HOZBSSWDEKVXNO-BXRBKJIMSA-N 0.000 description 1
- RRBGTUQJDFBWNN-MUGJNUQGSA-N (2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-2,6-diaminohexanoyl]amino]hexanoyl]amino]hexanoyl]amino]hexanoic acid Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O RRBGTUQJDFBWNN-MUGJNUQGSA-N 0.000 description 1
- LOGFVTREOLYCPF-KXNHARMFSA-N (2s,3r)-2-[[(2r)-1-[(2s)-2,6-diaminohexanoyl]pyrrolidine-2-carbonyl]amino]-3-hydroxybutanoic acid Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-KXNHARMFSA-N 0.000 description 1
- AGTSSZRZBSNTGQ-ITZCFHCWSA-N (2s,3r)-2-[[(2s)-2-[[(2s)-2-[[(2s)-6-amino-2-[[(2s)-2-[[(2s)-5-amino-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[2-[[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]amino]acetyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]-5-(diaminomet Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)NC(=O)CNC(=O)CNC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=CC=C1 AGTSSZRZBSNTGQ-ITZCFHCWSA-N 0.000 description 1
- NMWKYTGJWUAZPZ-WWHBDHEGSA-N (4S)-4-[[(4R,7S,10S,16S,19S,25S,28S,31R)-31-[[(2S)-2-[[(1R,6R,9S,12S,18S,21S,24S,27S,30S,33S,36S,39S,42R,47R,53S,56S,59S,62S,65S,68S,71S,76S,79S,85S)-47-[[(2S)-2-[[(2S)-4-amino-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-amino-3-methylbutanoyl]amino]-3-methylbutanoyl]amino]-3-hydroxypropanoyl]amino]-3-(1H-imidazol-4-yl)propanoyl]amino]-3-phenylpropanoyl]amino]-4-oxobutanoyl]amino]-3-carboxypropanoyl]amino]-18-(4-aminobutyl)-27,68-bis(3-amino-3-oxopropyl)-36,71,76-tribenzyl-39-(3-carbamimidamidopropyl)-24-(2-carboxyethyl)-21,56-bis(carboxymethyl)-65,85-bis[(1R)-1-hydroxyethyl]-59-(hydroxymethyl)-62,79-bis(1H-imidazol-4-ylmethyl)-9-methyl-33-(2-methylpropyl)-8,11,17,20,23,26,29,32,35,38,41,48,54,57,60,63,66,69,72,74,77,80,83,86-tetracosaoxo-30-propan-2-yl-3,4,44,45-tetrathia-7,10,16,19,22,25,28,31,34,37,40,49,55,58,61,64,67,70,73,75,78,81,84,87-tetracosazatetracyclo[40.31.14.012,16.049,53]heptaoctacontane-6-carbonyl]amino]-3-methylbutanoyl]amino]-7-(3-carbamimidamidopropyl)-25-(hydroxymethyl)-19-[(4-hydroxyphenyl)methyl]-28-(1H-imidazol-4-ylmethyl)-10-methyl-6,9,12,15,18,21,24,27,30-nonaoxo-16-propan-2-yl-1,2-dithia-5,8,11,14,17,20,23,26,29-nonazacyclodotriacontane-4-carbonyl]amino]-5-[[(2S)-1-[[(2S)-1-[[(2S)-3-carboxy-1-[[(2S)-1-[[(2S)-1-[[(1S)-1-carboxyethyl]amino]-4-methyl-1-oxopentan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-1-oxopropan-2-yl]amino]-1-oxopropan-2-yl]amino]-3-(1H-imidazol-4-yl)-1-oxopropan-2-yl]amino]-5-oxopentanoic acid Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CSSC[C@H](NC(=O)[C@@H](NC(=O)[C@@H]2CSSC[C@@H]3NC(=O)[C@H](Cc4ccccc4)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](Cc4c[nH]cn4)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]4CCCN4C(=O)[C@H](CSSC[C@H](NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](Cc4c[nH]cn4)NC(=O)[C@H](Cc4ccccc4)NC3=O)[C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](Cc3ccccc3)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N3CCC[C@H]3C(=O)N[C@@H](C)C(=O)N2)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc2ccccc2)NC(=O)[C@H](Cc2c[nH]cn2)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)C(C)C)[C@@H](C)O)C(C)C)C(=O)N[C@@H](Cc2c[nH]cn2)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](Cc2ccc(O)cc2)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1)C(=O)N[C@@H](C)C(O)=O NMWKYTGJWUAZPZ-WWHBDHEGSA-N 0.000 description 1
- KBSORCBSCSOBHH-IHJZLXGESA-N (4s)-5-[(2s)-2-[[(2s)-6-amino-1-[[(2s)-1-[[(2s)-1-[[(2s)-1-[[2-[[(2s)-1-amino-3-methyl-1-oxobutan-2-yl]amino]-2-oxoethyl]amino]-1-oxopropan-2-yl]amino]-3-carboxy-1-oxopropan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-1-oxohexan-2-yl]carbamoyl]pyrrolidin Chemical compound CC(C)[C@@H](C(N)=O)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KBSORCBSCSOBHH-IHJZLXGESA-N 0.000 description 1
- XABCFXXGZPWJQP-BYPYZUCNSA-N (S)-3-aminoadipic acid Chemical compound OC(=O)C[C@@H](N)CCC(O)=O XABCFXXGZPWJQP-BYPYZUCNSA-N 0.000 description 1
- UKAUYVFTDYCKQA-UHFFFAOYSA-N -2-Amino-4-hydroxybutanoic acid Natural products OC(=O)C(N)CCO UKAUYVFTDYCKQA-UHFFFAOYSA-N 0.000 description 1
- JHTPBGFVWWSHDL-UHFFFAOYSA-N 1,4-dichloro-2-isothiocyanatobenzene Chemical compound ClC1=CC=C(Cl)C(N=C=S)=C1 JHTPBGFVWWSHDL-UHFFFAOYSA-N 0.000 description 1
- OGNSCSPNOLGXSM-UHFFFAOYSA-N 2,4-diaminobutyric acid Chemical compound NCCC(N)C(O)=O OGNSCSPNOLGXSM-UHFFFAOYSA-N 0.000 description 1
- GMKMEZVLHJARHF-UHFFFAOYSA-N 2,6-diaminopimelic acid Chemical compound OC(=O)C(N)CCCC(N)C(O)=O GMKMEZVLHJARHF-UHFFFAOYSA-N 0.000 description 1
- HXUVTXPOZRFMOY-NSHDSACASA-N 2-[[(2s)-2-[[2-[(2-aminoacetyl)amino]acetyl]amino]-3-phenylpropanoyl]amino]acetic acid Chemical compound NCC(=O)NCC(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 HXUVTXPOZRFMOY-NSHDSACASA-N 0.000 description 1
- OYIFNHCXNCRBQI-UHFFFAOYSA-N 2-aminoadipic acid Chemical compound OC(=O)C(N)CCCC(O)=O OYIFNHCXNCRBQI-UHFFFAOYSA-N 0.000 description 1
- NYCRCTMDYITATC-UHFFFAOYSA-N 2-fluorophenylalanine Chemical compound OC(=O)C(N)CC1=CC=CC=C1F NYCRCTMDYITATC-UHFFFAOYSA-N 0.000 description 1
- LKKMLIBUAXYLOY-UHFFFAOYSA-N 3-Amino-1-methyl-5H-pyrido[4,3-b]indole Chemical compound N1C2=CC=CC=C2C2=C1C=C(N)N=C2C LKKMLIBUAXYLOY-UHFFFAOYSA-N 0.000 description 1
- XABCFXXGZPWJQP-UHFFFAOYSA-N 3-aminoadipic acid Chemical compound OC(=O)CC(N)CCC(O)=O XABCFXXGZPWJQP-UHFFFAOYSA-N 0.000 description 1
- BMNBFRJBYVIOAY-UHFFFAOYSA-N 4,7,8-trihydroxy-3,4-dihydro-2h-isoquinolin-2-ium-1-one;chloride Chemical compound [Cl-].OC1=CC=C2C(O)C[NH2+]C(=O)C2=C1O BMNBFRJBYVIOAY-UHFFFAOYSA-N 0.000 description 1
- XWHHYOYVRVGJJY-QMMMGPOBSA-N 4-fluoro-L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(F)C=C1 XWHHYOYVRVGJJY-QMMMGPOBSA-N 0.000 description 1
- 102100027271 40S ribosomal protein SA Human genes 0.000 description 1
- 108050007366 40S ribosomal protein SA Proteins 0.000 description 1
- QCVGEOXPDFCNHA-UHFFFAOYSA-N 5,5-dimethyl-2,4-dioxo-1,3-oxazolidine-3-carboxamide Chemical compound CC1(C)OC(=O)N(C(N)=O)C1=O QCVGEOXPDFCNHA-UHFFFAOYSA-N 0.000 description 1
- 102100030310 5,6-dihydroxyindole-2-carboxylic acid oxidase Human genes 0.000 description 1
- 101710163881 5,6-dihydroxyindole-2-carboxylic acid oxidase Proteins 0.000 description 1
- OOXNYFKPOPJIOT-UHFFFAOYSA-N 5-(3-bromophenyl)-7-(6-morpholin-4-ylpyridin-3-yl)pyrido[2,3-d]pyrimidin-4-amine;dihydrochloride Chemical compound Cl.Cl.C=12C(N)=NC=NC2=NC(C=2C=NC(=CC=2)N2CCOCC2)=CC=1C1=CC=CC(Br)=C1 OOXNYFKPOPJIOT-UHFFFAOYSA-N 0.000 description 1
- SQDAZGGFXASXDW-UHFFFAOYSA-N 5-bromo-2-(trifluoromethoxy)pyridine Chemical compound FC(F)(F)OC1=CC=C(Br)C=N1 SQDAZGGFXASXDW-UHFFFAOYSA-N 0.000 description 1
- YSMODUONRAFBET-UHFFFAOYSA-N 5-hydroxylysine Chemical group NCC(O)CCC(N)C(O)=O YSMODUONRAFBET-UHFFFAOYSA-N 0.000 description 1
- SLXKOJJOQWFEFD-UHFFFAOYSA-N 6-aminohexanoic acid Chemical compound NCCCCCC(O)=O SLXKOJJOQWFEFD-UHFFFAOYSA-N 0.000 description 1
- 208000030507 AIDS Diseases 0.000 description 1
- 102100024643 ATP-binding cassette sub-family D member 1 Human genes 0.000 description 1
- 241000208140 Acer Species 0.000 description 1
- 208000024893 Acute lymphoblastic leukemia Diseases 0.000 description 1
- 208000031261 Acute myeloid leukaemia Diseases 0.000 description 1
- 102100021305 Acyl-CoA:lysophosphatidylglycerol acyltransferase 1 Human genes 0.000 description 1
- 102100032534 Adenosine kinase Human genes 0.000 description 1
- 108010076278 Adenosine kinase Proteins 0.000 description 1
- 102100032158 Adenylate cyclase type 6 Human genes 0.000 description 1
- 101710194245 Adenylate cyclase type 6 Proteins 0.000 description 1
- 108060003345 Adrenergic Receptor Proteins 0.000 description 1
- 102000017910 Adrenergic receptor Human genes 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- FCXAUASCMJOFEY-NDKCEZKHSA-N Ala-Leu-Thr-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O FCXAUASCMJOFEY-NDKCEZKHSA-N 0.000 description 1
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 1
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 1
- DXTYEWAQOXYRHZ-KKXDTOCCSA-N Ala-Phe-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DXTYEWAQOXYRHZ-KKXDTOCCSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 1
- 201000011374 Alagille syndrome Diseases 0.000 description 1
- 208000022309 Alcoholic Liver disease Diseases 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 102000002572 Alpha-Globulins Human genes 0.000 description 1
- 108010068307 Alpha-Globulins Proteins 0.000 description 1
- 101710195183 Alpha-bungarotoxin Proteins 0.000 description 1
- 102100026277 Alpha-galactosidase A Human genes 0.000 description 1
- 241000702419 Ambidensovirus Species 0.000 description 1
- 108020005544 Antisense RNA Proteins 0.000 description 1
- 241000712891 Arenavirus Species 0.000 description 1
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- OCOZPTHLDVSFCZ-BPUTZDHNSA-N Arg-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N OCOZPTHLDVSFCZ-BPUTZDHNSA-N 0.000 description 1
- ALOVURZCXKYKJC-NAKRPEOUSA-N Arg-Asp-Gln-Ser Chemical compound N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O ALOVURZCXKYKJC-NAKRPEOUSA-N 0.000 description 1
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- BQBPFMNVOWDLHO-XIRDDKMYSA-N Arg-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N BQBPFMNVOWDLHO-XIRDDKMYSA-N 0.000 description 1
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 1
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 1
- FSPQNLYOFCXUCE-BPUTZDHNSA-N Arg-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FSPQNLYOFCXUCE-BPUTZDHNSA-N 0.000 description 1
- XMGVWQWEWWULNS-BPUTZDHNSA-N Arg-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XMGVWQWEWWULNS-BPUTZDHNSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- 102100021723 Arginase-1 Human genes 0.000 description 1
- 101710129000 Arginase-1 Proteins 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 1
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 1
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 1
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 1
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 1
- RAKKBBHMTJSXOY-XVYDVKMFSA-N Asn-His-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O RAKKBBHMTJSXOY-XVYDVKMFSA-N 0.000 description 1
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 1
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 1
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 1
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 1
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 1
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 1
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 1
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 1
- SYZWMVSXBZCOBZ-QXEWZRGKSA-N Asn-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N SYZWMVSXBZCOBZ-QXEWZRGKSA-N 0.000 description 1
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 1
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 1
- AMRANMVXQWXNAH-ZLUOBGJFSA-N Asp-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC(O)=O AMRANMVXQWXNAH-ZLUOBGJFSA-N 0.000 description 1
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 1
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 1
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 1
- MRYDJCIIVRXVGG-QEJZJMRPSA-N Asp-Trp-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O MRYDJCIIVRXVGG-QEJZJMRPSA-N 0.000 description 1
- IHZFGJLKDYINPV-XIRDDKMYSA-N Asp-Trp-His Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC(O)=O)N)C(O)=O)C1=CN=CN1 IHZFGJLKDYINPV-XIRDDKMYSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 108700016171 Aspartate ammonia-lyases Proteins 0.000 description 1
- 206010003571 Astrocytoma Diseases 0.000 description 1
- 206010003591 Ataxia Diseases 0.000 description 1
- 102000007372 Ataxin-1 Human genes 0.000 description 1
- 108010032963 Ataxin-1 Proteins 0.000 description 1
- 208000023275 Autoimmune disease Diseases 0.000 description 1
- 206010003827 Autoimmune hepatitis Diseases 0.000 description 1
- 102100035526 B melanoma antigen 1 Human genes 0.000 description 1
- 108700020463 BRCA1 Proteins 0.000 description 1
- 102000036365 BRCA1 Human genes 0.000 description 1
- 102000052609 BRCA2 Human genes 0.000 description 1
- 108700020462 BRCA2 Proteins 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 101000805768 Banna virus (strain Indonesia/JKT-6423/1980) mRNA (guanine-N(7))-methyltransferase Proteins 0.000 description 1
- 108060000903 Beta-catenin Proteins 0.000 description 1
- 102000015735 Beta-catenin Human genes 0.000 description 1
- 102400000748 Beta-endorphin Human genes 0.000 description 1
- 101800005049 Beta-endorphin Proteins 0.000 description 1
- 208000008439 Biliary Liver Cirrhosis Diseases 0.000 description 1
- 208000033222 Biliary cirrhosis primary Diseases 0.000 description 1
- 206010005003 Bladder cancer Diseases 0.000 description 1
- 108010039209 Blood Coagulation Factors Proteins 0.000 description 1
- 102000015081 Blood Coagulation Factors Human genes 0.000 description 1
- 208000019838 Blood disease Diseases 0.000 description 1
- 108010051479 Bombesin Proteins 0.000 description 1
- 102000013585 Bombesin Human genes 0.000 description 1
- 208000010392 Bone Fractures Diseases 0.000 description 1
- 101800004538 Bradykinin Proteins 0.000 description 1
- 102400000967 Bradykinin Human genes 0.000 description 1
- 102000004219 Brain-derived neurotrophic factor Human genes 0.000 description 1
- 108090000715 Brain-derived neurotrophic factor Proteins 0.000 description 1
- 208000007257 Budd-Chiari syndrome Diseases 0.000 description 1
- 101710155857 C-C motif chemokine 2 Proteins 0.000 description 1
- 108010029697 CD40 Ligand Proteins 0.000 description 1
- 102100032937 CD40 ligand Human genes 0.000 description 1
- 108091016585 CD44 antigen Proteins 0.000 description 1
- 102100037904 CD9 antigen Human genes 0.000 description 1
- 102100035344 Cadherin-related family member 1 Human genes 0.000 description 1
- 101100005789 Caenorhabditis elegans cdk-4 gene Proteins 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 102100025570 Cancer/testis antigen 1 Human genes 0.000 description 1
- 241000282465 Canis Species 0.000 description 1
- 101150044789 Cap gene Proteins 0.000 description 1
- KXDHJXZQYSOELW-UHFFFAOYSA-M Carbamate Chemical compound NC([O-])=O KXDHJXZQYSOELW-UHFFFAOYSA-M 0.000 description 1
- 206010007559 Cardiac failure congestive Diseases 0.000 description 1
- 102100026548 Caspase-8 Human genes 0.000 description 1
- 108090000538 Caspase-8 Proteins 0.000 description 1
- 102100035882 Catalase Human genes 0.000 description 1
- 108010053835 Catalase Proteins 0.000 description 1
- 206010008342 Cervix carcinoma Diseases 0.000 description 1
- 101000686790 Chaetoceros protobacilladnavirus 2 Replication-associated protein Proteins 0.000 description 1
- 102000000018 Chemokine CCL2 Human genes 0.000 description 1
- 241000684559 Chicken parvovirus Species 0.000 description 1
- 101000936911 Chionoecetes opilio Sarcoplasmic/endoplasmic reticulum calcium ATPase Proteins 0.000 description 1
- 101000864475 Chlamydia phage 1 Internal scaffolding protein VP3 Proteins 0.000 description 1
- 108010035563 Chloramphenicol O-acetyltransferase Proteins 0.000 description 1
- 229920001287 Chondroitin sulfate Polymers 0.000 description 1
- 102100026735 Coagulation factor VIII Human genes 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 206010010099 Combined immunodeficiency Diseases 0.000 description 1
- VPAXJOUATWLOPR-UHFFFAOYSA-N Conferone Chemical compound C1=CC(=O)OC2=CC(OCC3C4(C)CCC(=O)C(C)(C)C4CC=C3C)=CC=C21 VPAXJOUATWLOPR-UHFFFAOYSA-N 0.000 description 1
- 206010010317 Congenital absence of bile ducts Diseases 0.000 description 1
- 206010010456 Congenital emphysema Diseases 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 241000711573 Coronaviridae Species 0.000 description 1
- 108700002856 Coronavirus Envelope Proteins Proteins 0.000 description 1
- 102100030851 Cortistatin Human genes 0.000 description 1
- 229930185483 Cortistatin Natural products 0.000 description 1
- 208000012483 Crigler-Najjar syndrome type 1 Diseases 0.000 description 1
- 201000003075 Crimean-Congo hemorrhagic fever Diseases 0.000 description 1
- PLBJMUUEGBBHRH-ZLUOBGJFSA-N Cys-Ala-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLBJMUUEGBBHRH-ZLUOBGJFSA-N 0.000 description 1
- LBOLGUYQEPZSKM-YUMQZZPRSA-N Cys-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N LBOLGUYQEPZSKM-YUMQZZPRSA-N 0.000 description 1
- 108091000069 Cystinyl Aminopeptidase Proteins 0.000 description 1
- 102000000311 Cytosine Deaminase Human genes 0.000 description 1
- 108010080611 Cytosine Deaminase Proteins 0.000 description 1
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 description 1
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 102100035426 DnaJ homolog subfamily B member 7 Human genes 0.000 description 1
- 101100285903 Drosophila melanogaster Hsc70-2 gene Proteins 0.000 description 1
- 101100178718 Drosophila melanogaster Hsc70-4 gene Proteins 0.000 description 1
- 101100178723 Drosophila melanogaster Hsc70-5 gene Proteins 0.000 description 1
- 101100396994 Drosophila melanogaster Inos gene Proteins 0.000 description 1
- 101710205593 Dyslexia-associated protein KIAA0319-like protein Proteins 0.000 description 1
- UPEZCKBFRMILAV-JNEQICEOSA-N Ecdysone Natural products O=C1[C@H]2[C@@](C)([C@@H]3C([C@@]4(O)[C@@](C)([C@H]([C@H]([C@@H](O)CCC(O)(C)C)C)CC4)CC3)=C1)C[C@H](O)[C@H](O)C2 UPEZCKBFRMILAV-JNEQICEOSA-N 0.000 description 1
- 102000002322 Egg Proteins Human genes 0.000 description 1
- 108010000912 Egg Proteins Proteins 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 102100027723 Endogenous retrovirus group K member 6 Rec protein Human genes 0.000 description 1
- 108010049140 Endorphins Proteins 0.000 description 1
- 102000009025 Endorphins Human genes 0.000 description 1
- 102100038083 Endosialin Human genes 0.000 description 1
- 101710144543 Endosialin Proteins 0.000 description 1
- 101710121417 Envelope glycoprotein Proteins 0.000 description 1
- 241000713730 Equine infectious anemia virus Species 0.000 description 1
- 241001337814 Erysiphe glycines Species 0.000 description 1
- 241000121268 Erythroparvovirus Species 0.000 description 1
- 101000803553 Eumenes pomiformis Venom peptide 3 Proteins 0.000 description 1
- 206010016077 Factor IX deficiency Diseases 0.000 description 1
- 108010014173 Factor X Proteins 0.000 description 1
- 241000701925 Feline parvovirus Species 0.000 description 1
- 241000282324 Felis Species 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- NIGWMJHCCYYCSF-UHFFFAOYSA-N Fenclonine Chemical compound OC(=O)C(N)CC1=CC=C(Cl)C=C1 NIGWMJHCCYYCSF-UHFFFAOYSA-N 0.000 description 1
- 108090000386 Fibroblast Growth Factor 1 Proteins 0.000 description 1
- 102100031706 Fibroblast growth factor 1 Human genes 0.000 description 1
- 102100024785 Fibroblast growth factor 2 Human genes 0.000 description 1
- 108090000379 Fibroblast growth factor 2 Proteins 0.000 description 1
- 102000003972 Fibroblast growth factor 7 Human genes 0.000 description 1
- 108090000385 Fibroblast growth factor 7 Proteins 0.000 description 1
- 102100023593 Fibroblast growth factor receptor 1 Human genes 0.000 description 1
- 101710182386 Fibroblast growth factor receptor 1 Proteins 0.000 description 1
- 241000711950 Filoviridae Species 0.000 description 1
- 241000710831 Flavivirus Species 0.000 description 1
- PNNNRSAQSRJVSB-SLPGGIOYSA-N Fucose Natural products C[C@H](O)[C@@H](O)[C@H](O)[C@H](O)C=O PNNNRSAQSRJVSB-SLPGGIOYSA-N 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 102100039717 G antigen 1 Human genes 0.000 description 1
- 108010046649 GDNP peptide Proteins 0.000 description 1
- 108010033708 GFE-1 peptide Proteins 0.000 description 1
- 101150117028 GP gene Proteins 0.000 description 1
- 101710177291 Gag polyprotein Proteins 0.000 description 1
- 102100028496 Galactocerebrosidase Human genes 0.000 description 1
- 108010042681 Galactosylceramidase Proteins 0.000 description 1
- 102400001370 Galanin Human genes 0.000 description 1
- 101800002068 Galanin Proteins 0.000 description 1
- 102000004862 Gastrin releasing peptide Human genes 0.000 description 1
- 108090001053 Gastrin releasing peptide Proteins 0.000 description 1
- 208000009139 Gilbert Disease Diseases 0.000 description 1
- 208000022412 Gilbert syndrome Diseases 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 1
- OYTPNWYZORARHL-XHNCKOQMSA-N Gln-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N OYTPNWYZORARHL-XHNCKOQMSA-N 0.000 description 1
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 1
- QYTKAVBFRUGYAU-ACZMJKKPSA-N Gln-Asp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QYTKAVBFRUGYAU-ACZMJKKPSA-N 0.000 description 1
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 1
- CITDWMLWXNUQKD-FXQIFTODSA-N Gln-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CITDWMLWXNUQKD-FXQIFTODSA-N 0.000 description 1
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 1
- GPISLLFQNHELLK-DCAQKATOSA-N Gln-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N GPISLLFQNHELLK-DCAQKATOSA-N 0.000 description 1
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 1
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 1
- MWERYIXRDZDXOA-QEWYBTABSA-N Gln-Ile-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MWERYIXRDZDXOA-QEWYBTABSA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- DRNMNLKUUKKPIA-HTUGSXCWSA-N Gln-Phe-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)CCC(N)=O)C(O)=O DRNMNLKUUKKPIA-HTUGSXCWSA-N 0.000 description 1
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 1
- DOQUICBEISTQHE-CIUDSAMLSA-N Gln-Pro-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O DOQUICBEISTQHE-CIUDSAMLSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- OKARHJKJTKFQBM-ACZMJKKPSA-N Gln-Ser-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OKARHJKJTKFQBM-ACZMJKKPSA-N 0.000 description 1
- LGWNISYVKDNJRP-FXQIFTODSA-N Gln-Ser-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGWNISYVKDNJRP-FXQIFTODSA-N 0.000 description 1
- OKQLXOYFUPVEHI-CIUDSAMLSA-N Gln-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N OKQLXOYFUPVEHI-CIUDSAMLSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 1
- KGNSGRRALVIRGR-QWRGUYRKSA-N Gln-Tyr Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KGNSGRRALVIRGR-QWRGUYRKSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- 208000010055 Globoid Cell Leukodystrophy Diseases 0.000 description 1
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 1
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 1
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 1
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- AOCARQDSFTWWFT-DCAQKATOSA-N Glu-Met-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AOCARQDSFTWWFT-DCAQKATOSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- TWYFJOHWGCCRIR-DCAQKATOSA-N Glu-Pro-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYFJOHWGCCRIR-DCAQKATOSA-N 0.000 description 1
- BPCLDCNZBUYGOD-BPUTZDHNSA-N Glu-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 BPCLDCNZBUYGOD-BPUTZDHNSA-N 0.000 description 1
- XAXJIUAWAFVADB-VJBMBRPKSA-N Glu-Trp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XAXJIUAWAFVADB-VJBMBRPKSA-N 0.000 description 1
- QGAJQIGFFIQJJK-IHRRRGAJSA-N Glu-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QGAJQIGFFIQJJK-IHRRRGAJSA-N 0.000 description 1
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 1
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 1
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 1
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 1
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- HPAIKDPJURGQLN-KBPBESRZSA-N Gly-His-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 HPAIKDPJURGQLN-KBPBESRZSA-N 0.000 description 1
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 1
- BBTCXWTXOXUNFX-IUCAKERBSA-N Gly-Met-Arg Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O BBTCXWTXOXUNFX-IUCAKERBSA-N 0.000 description 1
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 1
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 1
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 1
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 1
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- 208000011476 Glycogen storage disease due to glucose-6-phosphatase deficiency type Ib Diseases 0.000 description 1
- 206010018464 Glycogen storage disease type I Diseases 0.000 description 1
- 239000000579 Gonadotropin-Releasing Hormone Substances 0.000 description 1
- 241001517118 Goose parvovirus Species 0.000 description 1
- 206010018691 Granuloma Diseases 0.000 description 1
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 1
- 102000009465 Growth Factor Receptors Human genes 0.000 description 1
- 108010009202 Growth Factor Receptors Proteins 0.000 description 1
- 108010051696 Growth Hormone Proteins 0.000 description 1
- 108020005004 Guide RNA Proteins 0.000 description 1
- 241000702620 H-1 parvovirus Species 0.000 description 1
- QXZGBUJJYSLZLT-UHFFFAOYSA-N H-Arg-Pro-Pro-Gly-Phe-Ser-Pro-Phe-Arg-OH Natural products NC(N)=NCCCC(N)C(=O)N1CCCC1C(=O)N1C(C(=O)NCC(=O)NC(CC=2C=CC=CC=2)C(=O)NC(CO)C(=O)N2C(CCC2)C(=O)NC(CC=2C=CC=CC=2)C(=O)NC(CCCN=C(N)N)C(O)=O)CCC1 QXZGBUJJYSLZLT-UHFFFAOYSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- 101000583961 Halorubrum pleomorphic virus 1 Matrix protein Proteins 0.000 description 1
- 206010019280 Heart failures Diseases 0.000 description 1
- 208000018565 Hemochromatosis Diseases 0.000 description 1
- 108010085682 Hemoglobin A Proteins 0.000 description 1
- 102000007513 Hemoglobin A Human genes 0.000 description 1
- 108091005880 Hemoglobin F Proteins 0.000 description 1
- 108091005904 Hemoglobin subunit beta Proteins 0.000 description 1
- 208000031220 Hemophilia Diseases 0.000 description 1
- 102000008055 Heparan Sulfate Proteoglycans Human genes 0.000 description 1
- 206010019708 Hepatic steatosis Diseases 0.000 description 1
- 208000002972 Hepatolenticular Degeneration Diseases 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- HTZKFIYQMHJWSQ-INTQDDNPSA-N His-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HTZKFIYQMHJWSQ-INTQDDNPSA-N 0.000 description 1
- ZJSMFRTVYSLKQU-DJFWLOJKSA-N His-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZJSMFRTVYSLKQU-DJFWLOJKSA-N 0.000 description 1
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 1
- JENKOCSDMSVWPY-SRVKXCTJSA-N His-Leu-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JENKOCSDMSVWPY-SRVKXCTJSA-N 0.000 description 1
- LNDVNHOSZQPJGI-AVGNSLFASA-N His-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CN=CN1 LNDVNHOSZQPJGI-AVGNSLFASA-N 0.000 description 1
- YEKYGQZUBCRNGH-DCAQKATOSA-N His-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CO)C(=O)O YEKYGQZUBCRNGH-DCAQKATOSA-N 0.000 description 1
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 1
- NBWATNYAUVSAEQ-ZEILLAHLSA-N His-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O NBWATNYAUVSAEQ-ZEILLAHLSA-N 0.000 description 1
- 101001042227 Homo sapiens Acyl-CoA:lysophosphatidylglycerol acyltransferase 1 Proteins 0.000 description 1
- 101000874316 Homo sapiens B melanoma antigen 1 Proteins 0.000 description 1
- 101000738354 Homo sapiens CD9 antigen Proteins 0.000 description 1
- 101000737767 Homo sapiens Cadherin-related family member 1 Proteins 0.000 description 1
- 101000856237 Homo sapiens Cancer/testis antigen 1 Proteins 0.000 description 1
- 101000914324 Homo sapiens Carcinoembryonic antigen-related cell adhesion molecule 5 Proteins 0.000 description 1
- 101000914321 Homo sapiens Carcinoembryonic antigen-related cell adhesion molecule 7 Proteins 0.000 description 1
- 101000911390 Homo sapiens Coagulation factor VIII Proteins 0.000 description 1
- 101000804114 Homo sapiens DnaJ homolog subfamily B member 7 Proteins 0.000 description 1
- 101000886137 Homo sapiens G antigen 1 Proteins 0.000 description 1
- 101000578784 Homo sapiens Melanoma antigen recognized by T-cells 1 Proteins 0.000 description 1
- 101000617725 Homo sapiens Pregnancy-specific beta-1-glycoprotein 2 Proteins 0.000 description 1
- 101000984042 Homo sapiens Protein lin-28 homolog A Proteins 0.000 description 1
- 101000687905 Homo sapiens Transcription factor SOX-2 Proteins 0.000 description 1
- 101000687911 Homo sapiens Transcription factor SOX-3 Proteins 0.000 description 1
- 206010020365 Homocystinuria Diseases 0.000 description 1
- 101150090950 Hsc70-1 gene Proteins 0.000 description 1
- 241000700588 Human alphaherpesvirus 1 Species 0.000 description 1
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 1
- 241000701806 Human papillomavirus Species 0.000 description 1
- 208000023105 Huntington disease Diseases 0.000 description 1
- 208000015178 Hurler syndrome Diseases 0.000 description 1
- 208000015204 Hurler-Scheie syndrome Diseases 0.000 description 1
- 102000006933 Hydroxymethyl and Formyl Transferases Human genes 0.000 description 1
- 108010072462 Hydroxymethyl and Formyl Transferases Proteins 0.000 description 1
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 1
- 208000034600 Hyperornithinemia-hyperammonemia-homocitrullinuria syndrome Diseases 0.000 description 1
- XQFRJNBWHJMXHO-RRKCRQDMSA-N IDUR Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(I)=C1 XQFRJNBWHJMXHO-RRKCRQDMSA-N 0.000 description 1
- 102000038455 IGF Type 1 Receptor Human genes 0.000 description 1
- 108010031794 IGF Type 1 Receptor Proteins 0.000 description 1
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 1
- HTDRTKMNJRRYOJ-SIUGBPQLSA-N Ile-Gln-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HTDRTKMNJRRYOJ-SIUGBPQLSA-N 0.000 description 1
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 1
- LEHPJMKVGFPSSP-ZQINRCPSSA-N Ile-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 LEHPJMKVGFPSSP-ZQINRCPSSA-N 0.000 description 1
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 1
- MSASLZGZQAXVFP-PEDHHIEDSA-N Ile-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N MSASLZGZQAXVFP-PEDHHIEDSA-N 0.000 description 1
- VEPIBPGLTLPBDW-URLPEUOOSA-N Ile-Phe-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VEPIBPGLTLPBDW-URLPEUOOSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- BJECXJHLUJXPJQ-PYJNHQTQSA-N Ile-Pro-His Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N BJECXJHLUJXPJQ-PYJNHQTQSA-N 0.000 description 1
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 1
- 206010061598 Immunodeficiency Diseases 0.000 description 1
- 241000711450 Infectious bronchitis virus Species 0.000 description 1
- 102000000589 Interleukin-1 Human genes 0.000 description 1
- 108010002352 Interleukin-1 Proteins 0.000 description 1
- 102000003777 Interleukin-1 beta Human genes 0.000 description 1
- 108090000193 Interleukin-1 beta Proteins 0.000 description 1
- 102000003814 Interleukin-10 Human genes 0.000 description 1
- 108090000174 Interleukin-10 Proteins 0.000 description 1
- 102000003815 Interleukin-11 Human genes 0.000 description 1
- 108090000177 Interleukin-11 Proteins 0.000 description 1
- 102000013462 Interleukin-12 Human genes 0.000 description 1
- 108010065805 Interleukin-12 Proteins 0.000 description 1
- 102000003816 Interleukin-13 Human genes 0.000 description 1
- 108090000176 Interleukin-13 Proteins 0.000 description 1
- 102000003810 Interleukin-18 Human genes 0.000 description 1
- 108090000171 Interleukin-18 Proteins 0.000 description 1
- 102000004125 Interleukin-1alpha Human genes 0.000 description 1
- 108010082786 Interleukin-1alpha Proteins 0.000 description 1
- 102100039064 Interleukin-3 Human genes 0.000 description 1
- 108010002386 Interleukin-3 Proteins 0.000 description 1
- 102100039897 Interleukin-5 Human genes 0.000 description 1
- 108010002616 Interleukin-5 Proteins 0.000 description 1
- 102000004889 Interleukin-6 Human genes 0.000 description 1
- 108090001005 Interleukin-6 Proteins 0.000 description 1
- 102100021592 Interleukin-7 Human genes 0.000 description 1
- 108010002586 Interleukin-7 Proteins 0.000 description 1
- 108090001007 Interleukin-8 Proteins 0.000 description 1
- 102400001355 Interleukin-8 Human genes 0.000 description 1
- 102000000585 Interleukin-9 Human genes 0.000 description 1
- 108010002335 Interleukin-9 Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 206010023126 Jaundice Diseases 0.000 description 1
- 208000012659 Joint disease Diseases 0.000 description 1
- 108010010994 K237 peptide Proteins 0.000 description 1
- 208000028226 Krabbe disease Diseases 0.000 description 1
- 108700021430 Kruppel-Like Factor 4 Proteins 0.000 description 1
- SNDPXSYFESPGGJ-BYPYZUCNSA-N L-2-aminopentanoic acid Chemical compound CCC[C@H](N)C(O)=O SNDPXSYFESPGGJ-BYPYZUCNSA-N 0.000 description 1
- QUOGESRFPZDMMT-UHFFFAOYSA-N L-Homoarginine Natural products OC(=O)C(N)CCCCNC(N)=N QUOGESRFPZDMMT-UHFFFAOYSA-N 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- 125000000998 L-alanino group Chemical group [H]N([*])[C@](C([H])([H])[H])([H])C(=O)O[H] 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- AGPKZVBTJJNPAG-UHNVWZDZSA-N L-allo-Isoleucine Chemical compound CC[C@@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-UHNVWZDZSA-N 0.000 description 1
- ZGUNAGUHMKGQNY-ZETCQYMHSA-N L-alpha-phenylglycine zwitterion Chemical compound OC(=O)[C@@H](N)C1=CC=CC=C1 ZGUNAGUHMKGQNY-ZETCQYMHSA-N 0.000 description 1
- RHGKLRLOHDJJDR-BYPYZUCNSA-N L-citrulline Chemical compound NC(=O)NCCC[C@H]([NH3+])C([O-])=O RHGKLRLOHDJJDR-BYPYZUCNSA-N 0.000 description 1
- 102100031413 L-dopachrome tautomerase Human genes 0.000 description 1
- 101710093778 L-dopachrome tautomerase Proteins 0.000 description 1
- SHZGCJCMOBCMKK-DHVFOXMCSA-N L-fucopyranose Chemical compound C[C@@H]1OC(O)[C@@H](O)[C@H](O)[C@@H]1O SHZGCJCMOBCMKK-DHVFOXMCSA-N 0.000 description 1
- QUOGESRFPZDMMT-YFKPBYRVSA-N L-homoarginine Chemical compound OC(=O)[C@@H](N)CCCCNC(N)=N QUOGESRFPZDMMT-YFKPBYRVSA-N 0.000 description 1
- UKAUYVFTDYCKQA-VKHMYHEASA-N L-homoserine Chemical compound OC(=O)[C@@H](N)CCO UKAUYVFTDYCKQA-VKHMYHEASA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 102000008192 Lactoglobulins Human genes 0.000 description 1
- 108010060630 Lactoglobulins Proteins 0.000 description 1
- 241000713666 Lentivirus Species 0.000 description 1
- 108010092277 Leptin Proteins 0.000 description 1
- 102000016267 Leptin Human genes 0.000 description 1
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 1
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 1
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 1
- LRKCBIUDWAXNEG-CSMHCCOUSA-N Leu-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRKCBIUDWAXNEG-CSMHCCOUSA-N 0.000 description 1
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 1
- HOMFINRJHIIZNJ-HOCLYGCPSA-N Leu-Trp-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O HOMFINRJHIIZNJ-HOCLYGCPSA-N 0.000 description 1
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 1
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 1
- 102100020872 Leucyl-cystinyl aminopeptidase Human genes 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- 208000019693 Lung disease Diseases 0.000 description 1
- NLOZZWJNIKKYSC-WDSOQIARSA-N Lys-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 NLOZZWJNIKKYSC-WDSOQIARSA-N 0.000 description 1
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 1
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 1
- MQMIRLVJXQNTRJ-SDDRHHMPSA-N Lys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O MQMIRLVJXQNTRJ-SDDRHHMPSA-N 0.000 description 1
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- XYLSGAWRCZECIQ-JYJNAYRXSA-N Lys-Tyr-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 XYLSGAWRCZECIQ-JYJNAYRXSA-N 0.000 description 1
- 108700041567 MDR Genes Proteins 0.000 description 1
- 241000282567 Macaca fascicularis Species 0.000 description 1
- 101710125418 Major capsid protein Proteins 0.000 description 1
- 208000030162 Maple syrup disease Diseases 0.000 description 1
- 201000005505 Measles Diseases 0.000 description 1
- 239000000637 Melanocyte-Stimulating Hormone Substances 0.000 description 1
- 108010007013 Melanocyte-Stimulating Hormones Proteins 0.000 description 1
- 102100028389 Melanoma antigen recognized by T-cells 1 Human genes 0.000 description 1
- 108010049137 Member 1 Subfamily D ATP Binding Cassette Transporter Proteins 0.000 description 1
- 102000018697 Membrane Proteins Human genes 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 1
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 1
- MDXAULHWGWETHF-SRVKXCTJSA-N Met-Arg-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCNC(N)=N MDXAULHWGWETHF-SRVKXCTJSA-N 0.000 description 1
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 1
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 1
- KLGIQJRMFHIGCQ-ZFWWWQNUSA-N Met-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCSC)C(=O)NCC(O)=O)=CNC2=C1 KLGIQJRMFHIGCQ-ZFWWWQNUSA-N 0.000 description 1
- HOTNHEUETJELDL-BPNCWPANSA-N Met-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N HOTNHEUETJELDL-BPNCWPANSA-N 0.000 description 1
- VYDLZDRMOFYOGV-TUAOUCFPSA-N Met-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N VYDLZDRMOFYOGV-TUAOUCFPSA-N 0.000 description 1
- 102000003792 Metallothionein Human genes 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 208000002678 Mucopolysaccharidoses Diseases 0.000 description 1
- 206010028095 Mucopolysaccharidosis IV Diseases 0.000 description 1
- 208000025915 Mucopolysaccharidosis type 6 Diseases 0.000 description 1
- 208000034578 Multiple myelomas Diseases 0.000 description 1
- 208000005647 Mumps Diseases 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- 101100310657 Mus musculus Sox1 gene Proteins 0.000 description 1
- 101100310645 Mus musculus Sox15 gene Proteins 0.000 description 1
- 206010028289 Muscle atrophy Diseases 0.000 description 1
- 241001503699 Muscovy duck parvovirus Species 0.000 description 1
- 108091057508 Myc family Proteins 0.000 description 1
- PQNASZJZHFPQLE-LURJTMIESA-N N(6)-methyl-L-lysine Chemical compound CNCCCC[C@H](N)C(O)=O PQNASZJZHFPQLE-LURJTMIESA-N 0.000 description 1
- OVRNDRQMDRJTHS-CBQIKETKSA-N N-Acetyl-D-Galactosamine Chemical compound CC(=O)N[C@H]1[C@@H](O)O[C@H](CO)[C@H](O)[C@@H]1O OVRNDRQMDRJTHS-CBQIKETKSA-N 0.000 description 1
- 108700026495 N-Myc Proto-Oncogene Proteins 0.000 description 1
- OVRNDRQMDRJTHS-UHFFFAOYSA-N N-acelyl-D-glucosamine Natural products CC(=O)NC1C(O)OC(CO)C(O)C1O OVRNDRQMDRJTHS-UHFFFAOYSA-N 0.000 description 1
- 108700043217 N-acetyl glutamate synthetase deficiency Proteins 0.000 description 1
- MBLBDJOUHNCFQT-UHFFFAOYSA-N N-acetyl-D-galactosamine Natural products CC(=O)NC(C=O)C(O)C(O)C(O)CO MBLBDJOUHNCFQT-UHFFFAOYSA-N 0.000 description 1
- OVRNDRQMDRJTHS-FMDGEEDCSA-N N-acetyl-beta-D-glucosamine Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-FMDGEEDCSA-N 0.000 description 1
- MBLBDJOUHNCFQT-LXGUWJNJSA-N N-acetylglucosamine Natural products CC(=O)N[C@@H](C=O)[C@@H](O)[C@H](O)[C@H](O)CO MBLBDJOUHNCFQT-LXGUWJNJSA-N 0.000 description 1
- 206010071092 N-acetylglutamate synthase deficiency Diseases 0.000 description 1
- OLNLSTNFRUFTLM-BYPYZUCNSA-N N-ethyl-L-asparagine Chemical compound CCN[C@H](C(O)=O)CC(N)=O OLNLSTNFRUFTLM-BYPYZUCNSA-N 0.000 description 1
- OLNLSTNFRUFTLM-UHFFFAOYSA-N N-ethylasparagine Chemical compound CCNC(C(O)=O)CC(N)=O OLNLSTNFRUFTLM-UHFFFAOYSA-N 0.000 description 1
- 108010065338 N-ethylglycine Proteins 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- KSPIYJQBLVDRRI-WDSKDSINSA-N N-methyl-L-isoleucine Chemical compound CC[C@H](C)[C@H](NC)C(O)=O KSPIYJQBLVDRRI-WDSKDSINSA-N 0.000 description 1
- 102100030124 N-myc proto-oncogene protein Human genes 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 101150118742 NP gene Proteins 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- RHGKLRLOHDJJDR-UHFFFAOYSA-N Ndelta-carbamoyl-DL-ornithine Natural products OC(=O)C(N)CCCNC(N)=O RHGKLRLOHDJJDR-UHFFFAOYSA-N 0.000 description 1
- 102100028782 Neprilysin Human genes 0.000 description 1
- 108090000028 Neprilysin Proteins 0.000 description 1
- 208000012902 Nervous system disease Diseases 0.000 description 1
- 208000025966 Neurological disease Diseases 0.000 description 1
- 102400001103 Neurotensin Human genes 0.000 description 1
- 101800001814 Neurotensin Proteins 0.000 description 1
- 208000014060 Niemann-Pick disease Diseases 0.000 description 1
- 102000008297 Nuclear Matrix-Associated Proteins Human genes 0.000 description 1
- 108010035916 Nuclear Matrix-Associated Proteins Proteins 0.000 description 1
- 108090001074 Nucleocapsid Proteins Proteins 0.000 description 1
- 102000011931 Nucleoproteins Human genes 0.000 description 1
- 108010061100 Nucleoproteins Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 108091008606 PDGF receptors Proteins 0.000 description 1
- 108091007960 PI3Ks Proteins 0.000 description 1
- 102100034640 PWWP domain-containing DNA repair factor 3A Human genes 0.000 description 1
- 108050007154 PWWP domain-containing DNA repair factor 3A Proteins 0.000 description 1
- 241000282577 Pan troglodytes Species 0.000 description 1
- 241001631646 Papillomaviridae Species 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 102000015731 Peptide Hormones Human genes 0.000 description 1
- 108010038988 Peptide Hormones Proteins 0.000 description 1
- 208000005764 Peripheral Arterial Disease Diseases 0.000 description 1
- 208000030831 Peripheral arterial occlusive disease Diseases 0.000 description 1
- 241000009328 Perro Species 0.000 description 1
- 201000005702 Pertussis Diseases 0.000 description 1
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 1
- OPEVYHFJXLCCRT-AVGNSLFASA-N Phe-Gln-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O OPEVYHFJXLCCRT-AVGNSLFASA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 description 1
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 1
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 1
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 1
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- 108010069013 Phenylalanine Hydroxylase Proteins 0.000 description 1
- 102100038223 Phenylalanine-4-hydroxylase Human genes 0.000 description 1
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 1
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 206010035226 Plasma cell myeloma Diseases 0.000 description 1
- 108010038512 Platelet-Derived Growth Factor Proteins 0.000 description 1
- 102000010780 Platelet-Derived Growth Factor Human genes 0.000 description 1
- 102000011653 Platelet-Derived Growth Factor Receptors Human genes 0.000 description 1
- 102100024266 Pneumadin Human genes 0.000 description 1
- 208000000474 Poliomyelitis Diseases 0.000 description 1
- 102100022019 Pregnancy-specific beta-1-glycoprotein 2 Human genes 0.000 description 1
- 208000004777 Primary Hyperoxaluria Diseases 0.000 description 1
- 208000012654 Primary biliary cholangitis Diseases 0.000 description 1
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 1
- DRVIASBABBMZTF-GUBZILKMSA-N Pro-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1 DRVIASBABBMZTF-GUBZILKMSA-N 0.000 description 1
- INXAPZFIOVGHSV-CIUDSAMLSA-N Pro-Asn-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 INXAPZFIOVGHSV-CIUDSAMLSA-N 0.000 description 1
- ZYBUKTMPPFQSHL-JYJNAYRXSA-N Pro-Asp-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZYBUKTMPPFQSHL-JYJNAYRXSA-N 0.000 description 1
- YKQNVTOIYFQMLW-IHRRRGAJSA-N Pro-Cys-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 YKQNVTOIYFQMLW-IHRRRGAJSA-N 0.000 description 1
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 1
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 1
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 1
- BODDREDDDRZUCF-QTKMDUPCSA-N Pro-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2)O BODDREDDDRZUCF-QTKMDUPCSA-N 0.000 description 1
- ZTMLZUNPFDGPKY-VKOGCVSHSA-N Pro-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 ZTMLZUNPFDGPKY-VKOGCVSHSA-N 0.000 description 1
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 1
- JUJCUYWRJMFJJF-AVGNSLFASA-N Pro-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 JUJCUYWRJMFJJF-AVGNSLFASA-N 0.000 description 1
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 1
- WLJYLAQSUSIQNH-GUBZILKMSA-N Pro-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@@H]1CCCN1 WLJYLAQSUSIQNH-GUBZILKMSA-N 0.000 description 1
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 1
- OOZJHTXCLJUODH-QXEWZRGKSA-N Pro-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 OOZJHTXCLJUODH-QXEWZRGKSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- 201000002150 Progressive familial intrahepatic cholestasis Diseases 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 241000169446 Promethis Species 0.000 description 1
- 206010060862 Prostate cancer Diseases 0.000 description 1
- 229940096437 Protein S Drugs 0.000 description 1
- 102000029301 Protein S Human genes 0.000 description 1
- 108010066124 Protein S Proteins 0.000 description 1
- 101710149951 Protein Tat Proteins 0.000 description 1
- 208000008425 Protein deficiency Diseases 0.000 description 1
- 102100025460 Protein lin-28 homolog A Human genes 0.000 description 1
- 102000004097 Protein phosphatase inhibitor 1 Human genes 0.000 description 1
- 108090000506 Protein phosphatase inhibitor 1 Proteins 0.000 description 1
- 102000016611 Proteoglycans Human genes 0.000 description 1
- 108010067787 Proteoglycans Proteins 0.000 description 1
- 102000008022 Proto-Oncogene Proteins c-met Human genes 0.000 description 1
- 108010089836 Proto-Oncogene Proteins c-met Proteins 0.000 description 1
- 108091008103 RNA aptamers Proteins 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 101710100968 Receptor tyrosine-protein kinase erbB-2 Proteins 0.000 description 1
- 206010038389 Renal cancer Diseases 0.000 description 1
- 208000017442 Retinal disease Diseases 0.000 description 1
- 101150107345 Rhag gene Proteins 0.000 description 1
- 241000713124 Rift Valley fever virus Species 0.000 description 1
- 102400000235 Rimorphin Human genes 0.000 description 1
- 101800001440 Rimorphin Proteins 0.000 description 1
- 108010083379 Sarcoglycans Proteins 0.000 description 1
- 102000006308 Sarcoglycans Human genes 0.000 description 1
- 201000002883 Scheie syndrome Diseases 0.000 description 1
- 101100150366 Schizosaccharomyces pombe (strain 972 / ATCC 24843) sks2 gene Proteins 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- CLKKNZQUQMZDGD-SRVKXCTJSA-N Ser-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CN=CN1 CLKKNZQUQMZDGD-SRVKXCTJSA-N 0.000 description 1
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 1
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 1
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 101710173693 Short transient receptor potential channel 1 Proteins 0.000 description 1
- 101710173694 Short transient receptor potential channel 2 Proteins 0.000 description 1
- 108091027967 Small hairpin RNA Proteins 0.000 description 1
- 108020004459 Small interfering RNA Proteins 0.000 description 1
- 102100038803 Somatotropin Human genes 0.000 description 1
- 102000005890 Spectrin Human genes 0.000 description 1
- 108010019965 Spectrin Proteins 0.000 description 1
- 208000009415 Spinocerebellar Ataxias Diseases 0.000 description 1
- 101000857870 Squalus acanthias Gonadoliberin Proteins 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- QAOWNCQODCNURD-UHFFFAOYSA-L Sulfate Chemical compound [O-]S([O-])(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-L 0.000 description 1
- 241000282898 Sus scrofa Species 0.000 description 1
- 108090000054 Syndecan-2 Proteins 0.000 description 1
- 102220564325 TIR domain-containing adapter molecule 2_S16E_mutation Human genes 0.000 description 1
- 108700012920 TNF Proteins 0.000 description 1
- 101710192266 Tegument protein VP22 Proteins 0.000 description 1
- 108010017842 Telomerase Proteins 0.000 description 1
- 108010055044 Tetanus Toxin Proteins 0.000 description 1
- 208000002903 Thalassemia Diseases 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 1
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 1
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 1
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 1
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 1
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- KRGDDWVBBDLPSJ-CUJWVEQBSA-N Thr-His-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O KRGDDWVBBDLPSJ-CUJWVEQBSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 1
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102100024270 Transcription factor SOX-2 Human genes 0.000 description 1
- 102000004887 Transforming Growth Factor beta Human genes 0.000 description 1
- 108090001012 Transforming Growth Factor beta Proteins 0.000 description 1
- 102000009618 Transforming Growth Factors Human genes 0.000 description 1
- 108010009583 Transforming Growth Factors Proteins 0.000 description 1
- 241000711484 Transmissible gastroenteritis virus Species 0.000 description 1
- HYVLNORXQGKONN-NUTKFTJISA-N Trp-Ala-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 HYVLNORXQGKONN-NUTKFTJISA-N 0.000 description 1
- VZBWRZGNEPBRDE-HZUKXOBISA-N Trp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N VZBWRZGNEPBRDE-HZUKXOBISA-N 0.000 description 1
- SSNGFWKILJLTQM-QEJZJMRPSA-N Trp-Gln-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SSNGFWKILJLTQM-QEJZJMRPSA-N 0.000 description 1
- NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 1
- RERRMBXDSFMBQE-ZFWWWQNUSA-N Trp-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RERRMBXDSFMBQE-ZFWWWQNUSA-N 0.000 description 1
- LVTKHGUGBGNBPL-UHFFFAOYSA-N Trp-P-1 Chemical compound N1C2=CC=CC=C2C2=C1C(C)=C(N)N=C2C LVTKHGUGBGNBPL-UHFFFAOYSA-N 0.000 description 1
- UIRPULWLRODAEQ-QEJZJMRPSA-N Trp-Ser-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 UIRPULWLRODAEQ-QEJZJMRPSA-N 0.000 description 1
- YCQXZDHDSUHUSG-FJHTZYQYSA-N Trp-Thr-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 YCQXZDHDSUHUSG-FJHTZYQYSA-N 0.000 description 1
- NMOIRIIIUVELLY-WDSOQIARSA-N Trp-Val-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)C(C)C)=CNC2=C1 NMOIRIIIUVELLY-WDSOQIARSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 108700025716 Tumor Suppressor Genes Proteins 0.000 description 1
- 102000044209 Tumor Suppressor Genes Human genes 0.000 description 1
- 102000015098 Tumor Suppressor Protein p53 Human genes 0.000 description 1
- 108010078814 Tumor Suppressor Protein p53 Proteins 0.000 description 1
- 206010054094 Tumour necrosis Diseases 0.000 description 1
- 206010067584 Type 1 diabetes mellitus Diseases 0.000 description 1
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 1
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 1
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 1
- RYSNTWVRSLCAJZ-RYUDHWBXSA-N Tyr-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RYSNTWVRSLCAJZ-RYUDHWBXSA-N 0.000 description 1
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 1
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 1
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 1
- CNNVVEPJTFOGHI-ACRUOGEOSA-N Tyr-Lys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNNVVEPJTFOGHI-ACRUOGEOSA-N 0.000 description 1
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 1
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 1
- PLVVHGFEMSDRET-IHPCNDPISA-N Tyr-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC3=CC=C(C=C3)O)N PLVVHGFEMSDRET-IHPCNDPISA-N 0.000 description 1
- SMUWZUSWMWVOSL-JYJNAYRXSA-N Tyr-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SMUWZUSWMWVOSL-JYJNAYRXSA-N 0.000 description 1
- 108091000117 Tyrosine 3-Monooxygenase Proteins 0.000 description 1
- 102000048218 Tyrosine 3-monooxygenases Human genes 0.000 description 1
- 108091026838 U1 spliceosomal RNA Proteins 0.000 description 1
- 102100027244 U4/U6.U5 tri-snRNP-associated protein 1 Human genes 0.000 description 1
- 101710155955 U4/U6.U5 tri-snRNP-associated protein 1 Proteins 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 1
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 1
- ZQGPWORGSNRQLN-NHCYSSNCSA-N Val-Asp-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZQGPWORGSNRQLN-NHCYSSNCSA-N 0.000 description 1
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 1
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 1
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 1
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 1
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 1
- NGXQOQNXSGOYOI-BQFCYCMXSA-N Val-Trp-Gln Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 NGXQOQNXSGOYOI-BQFCYCMXSA-N 0.000 description 1
- 108010003205 Vasoactive Intestinal Peptide Proteins 0.000 description 1
- 102400000015 Vasoactive intestinal peptide Human genes 0.000 description 1
- 108010004977 Vasopressins Proteins 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 108010067390 Viral Proteins Proteins 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 208000018839 Wilson disease Diseases 0.000 description 1
- 241000710772 Yellow fever virus Species 0.000 description 1
- 108010017070 Zinc Finger Nucleases Proteins 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 201000009628 adenosine deaminase deficiency Diseases 0.000 description 1
- 108060000200 adenylate cyclase Proteins 0.000 description 1
- 102000030621 adenylate cyclase Human genes 0.000 description 1
- 210000001789 adipocyte Anatomy 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 208000031045 adult-onset type II citrullinemia Diseases 0.000 description 1
- 239000000443 aerosol Substances 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 230000001476 alcoholic effect Effects 0.000 description 1
- 230000000735 allogeneic effect Effects 0.000 description 1
- 208000006682 alpha 1-Antitrypsin Deficiency Diseases 0.000 description 1
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 1
- UPEZCKBFRMILAV-UHFFFAOYSA-N alpha-Ecdysone Natural products C1C(O)C(O)CC2(C)C(CCC3(C(C(C(O)CCC(C)(C)O)C)CCC33O)C)C3=CC(=O)C21 UPEZCKBFRMILAV-UHFFFAOYSA-N 0.000 description 1
- 102000005840 alpha-Galactosidase Human genes 0.000 description 1
- 230000009435 amidation Effects 0.000 description 1
- 238000007112 amidation reaction Methods 0.000 description 1
- 229960002684 aminocaproic acid Drugs 0.000 description 1
- 206010002026 amyotrophic lateral sclerosis Diseases 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 230000001093 anti-cancer Effects 0.000 description 1
- 230000030741 antigen processing and presentation Effects 0.000 description 1
- 210000000612 antigen-presenting cell Anatomy 0.000 description 1
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 108010021908 aspartyl-aspartyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010048605 aspartyl-valyl-phenylalanyl-tyrosyl-prolyl-tyrosyl-prolyl-tyrosyl-alanyl-seryl-glycyl-serine Proteins 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 210000000270 basal cell Anatomy 0.000 description 1
- 210000003651 basophil Anatomy 0.000 description 1
- WTOFYLAWDLQMBZ-LURJTMIESA-N beta(2-thienyl)alanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CS1 WTOFYLAWDLQMBZ-LURJTMIESA-N 0.000 description 1
- WOPZMFQRCBYPJU-NTXHZHDSSA-N beta-endorphin Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CCSC)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)CNC(=O)CNC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)O)C1=CC=CC=C1 WOPZMFQRCBYPJU-NTXHZHDSSA-N 0.000 description 1
- 201000005271 biliary atresia Diseases 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 239000003114 blood coagulation factor Substances 0.000 description 1
- DNDCVAGJPBKION-DOPDSADYSA-N bombesin Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(N)=O)NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CC=1NC2=CC=CC=C2C=1)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H]1NC(=O)CC1)C(C)C)C1=CN=CN1 DNDCVAGJPBKION-DOPDSADYSA-N 0.000 description 1
- 210000000988 bone and bone Anatomy 0.000 description 1
- 210000002449 bone cell Anatomy 0.000 description 1
- QXZGBUJJYSLZLT-FDISYFBBSA-N bradykinin Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(=O)NCC(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CO)C(=O)N2[C@@H](CCC2)C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CCC1 QXZGBUJJYSLZLT-FDISYFBBSA-N 0.000 description 1
- 210000004958 brain cell Anatomy 0.000 description 1
- 229940077737 brain-derived neurotrophic factor Drugs 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000036952 cancer formation Effects 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 230000032823 cell division Effects 0.000 description 1
- 230000003915 cell function Effects 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 230000036978 cell physiology Effects 0.000 description 1
- 230000008614 cellular interaction Effects 0.000 description 1
- 201000010881 cervical cancer Diseases 0.000 description 1
- 210000003679 cervix uteri Anatomy 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000012412 chemical coupling Methods 0.000 description 1
- 238000002512 chemotherapy Methods 0.000 description 1
- 229940059329 chondroitin sulfate Drugs 0.000 description 1
- BJBUEDPLEOHJGE-IUYQGCFVSA-N cis-3-hydroxy-D-proline zwitterion Chemical compound O[C@H]1CCN[C@H]1C(O)=O BJBUEDPLEOHJGE-IUYQGCFVSA-N 0.000 description 1
- 208000025812 citrin deficiency Diseases 0.000 description 1
- 229960002173 citrulline Drugs 0.000 description 1
- 235000013477 citrulline Nutrition 0.000 description 1
- 238000005352 clarification Methods 0.000 description 1
- 238000010961 commercial manufacture process Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- JECGPMYZUFFYJW-UHFFFAOYSA-N conferone Natural products CC1=CCC2C(C)(C)C(=O)CCC2(C)C1COc3cccc4C=CC(=O)Oc34 JECGPMYZUFFYJW-UHFFFAOYSA-N 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 210000004087 cornea Anatomy 0.000 description 1
- 108010005430 cortistatin Proteins 0.000 description 1
- DDRPLNQJNRBRNY-WYYADCIBSA-N cortistatin-14 Chemical compound C([C@H]1C(=O)N[C@@H](CC=2C3=CC=CC=C3NC=2)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CSSC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N1)NC(=O)[C@H]1NCCC1)C(=O)N[C@@H](CCCCN)C(O)=O)=O)[C@H](O)C)C1=CC=CC=C1 DDRPLNQJNRBRNY-WYYADCIBSA-N 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- OOTFVKOQINZBBF-UHFFFAOYSA-N cystamine Chemical compound CCSSCCN OOTFVKOQINZBBF-UHFFFAOYSA-N 0.000 description 1
- 229940099500 cystamine Drugs 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 108010066070 cysteinyl-prolyl-isoleucyl-glutamyl-aspartyl-arginyl-prolyl-methionyl-cysteine (1-9) disulfide Proteins 0.000 description 1
- 230000016396 cytokine production Effects 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 210000001787 dendrite Anatomy 0.000 description 1
- 230000008021 deposition Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 238000009792 diffusion process Methods 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- 206010013023 diphtheria Diseases 0.000 description 1
- UPEZCKBFRMILAV-JMZLNJERSA-N ecdysone Chemical compound C1[C@@H](O)[C@@H](O)C[C@]2(C)[C@@H](CC[C@@]3([C@@H]([C@@H]([C@H](O)CCC(C)(C)O)C)CC[C@]33O)C)C3=CC(=O)[C@@H]21 UPEZCKBFRMILAV-JMZLNJERSA-N 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 235000014103 egg white Nutrition 0.000 description 1
- 210000000969 egg white Anatomy 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 108010078428 env Gene Products Proteins 0.000 description 1
- 210000003979 eosinophil Anatomy 0.000 description 1
- 102000052116 epidermal growth factor receptor activity proteins Human genes 0.000 description 1
- 108700015053 epidermal growth factor receptor activity proteins Proteins 0.000 description 1
- 210000003013 erythroid precursor cell Anatomy 0.000 description 1
- 102000015694 estrogen receptors Human genes 0.000 description 1
- 108010038795 estrogen receptors Proteins 0.000 description 1
- 229940012426 factor x Drugs 0.000 description 1
- 201000011110 familial lipoprotein lipase deficiency Diseases 0.000 description 1
- 229940126864 fibroblast growth factor Drugs 0.000 description 1
- 229940029303 fibroblast growth factor-1 Drugs 0.000 description 1
- 201000010073 fibrogenesis imperfecta ossium Diseases 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 230000022244 formylation Effects 0.000 description 1
- 238000006170 formylation reaction Methods 0.000 description 1
- 208000024386 fungal infectious disease Diseases 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 108700039708 galantide Proteins 0.000 description 1
- TZOJVPDIYKRJSM-GKPUQKAJSA-N galantide Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(N)=O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H]1N(CCC1)C(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)CN)[C@@H](C)O)C1=CC=CC=C1 TZOJVPDIYKRJSM-GKPUQKAJSA-N 0.000 description 1
- 229960003692 gamma aminobutyric acid Drugs 0.000 description 1
- 150000002270 gangliosides Chemical class 0.000 description 1
- PUBCCFNQJQKCNC-XKNFJVFFSA-N gastrin-releasingpeptide Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(N)=O)NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC=1N=CNC=1)NC(=O)[C@H](CC(N)=O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)CNC(=O)CNC(=O)CNC(=O)[C@H](C)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC(C)C)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)O)C(C)C)C1=CNC=N1 PUBCCFNQJQKCNC-XKNFJVFFSA-N 0.000 description 1
- 210000001035 gastrointestinal tract Anatomy 0.000 description 1
- 230000030279 gene silencing Effects 0.000 description 1
- 238000012226 gene silencing method Methods 0.000 description 1
- 238000010363 gene targeting Methods 0.000 description 1
- 102000034356 gene-regulatory proteins Human genes 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 230000009395 genetic defect Effects 0.000 description 1
- 230000002518 glial effect Effects 0.000 description 1
- 208000005017 glioblastoma Diseases 0.000 description 1
- 102000034238 globular proteins Human genes 0.000 description 1
- 108091005896 globular proteins Proteins 0.000 description 1
- 229930195712 glutamate Natural products 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 1
- 201000004541 glycogen storage disease I Diseases 0.000 description 1
- 208000005516 glycogen storage disease Ib Diseases 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 230000002710 gonadal effect Effects 0.000 description 1
- XLXSAKCOAKORKW-AQJXLSMYSA-N gonadorelin Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1[C@@H](CCC1)C(=O)NCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC=1N=CNC=1)NC(=O)[C@H]1NC(=O)CC1)C1=CC=C(O)C=C1 XLXSAKCOAKORKW-AQJXLSMYSA-N 0.000 description 1
- 229940035638 gonadotropin-releasing hormone Drugs 0.000 description 1
- 239000000122 growth hormone Substances 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 210000002064 heart cell Anatomy 0.000 description 1
- 208000014951 hematologic disease Diseases 0.000 description 1
- 208000018706 hematopoietic system disease Diseases 0.000 description 1
- 208000009429 hemophilia B Diseases 0.000 description 1
- 208000005252 hepatitis A Diseases 0.000 description 1
- 206010073071 hepatocellular carcinoma Diseases 0.000 description 1
- 231100000844 hepatocellular carcinoma Toxicity 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010046019 histidyl-glutamyl-tryptophyl-seryl-tyrosyl-leucyl-alanyl-prolyl-tyrosyl-prolyl-tryptophyl-phenylalanine Proteins 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 230000033444 hydroxylation Effects 0.000 description 1
- 238000005805 hydroxylation reaction Methods 0.000 description 1
- 229960002591 hydroxyproline Drugs 0.000 description 1
- 206010020718 hyperplasia Diseases 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 230000009851 immunogenic response Effects 0.000 description 1
- 229960003444 immunosuppressant agent Drugs 0.000 description 1
- 239000003018 immunosuppressive agent Substances 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000028709 inflammatory response Effects 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 108700027921 interferon tau Proteins 0.000 description 1
- 229940047124 interferons Drugs 0.000 description 1
- 229940076144 interleukin-10 Drugs 0.000 description 1
- 229940074383 interleukin-11 Drugs 0.000 description 1
- 229940117681 interleukin-12 Drugs 0.000 description 1
- 229940076264 interleukin-3 Drugs 0.000 description 1
- 229940100602 interleukin-5 Drugs 0.000 description 1
- 229940100601 interleukin-6 Drugs 0.000 description 1
- 229940100994 interleukin-7 Drugs 0.000 description 1
- 229940096397 interleukin-8 Drugs 0.000 description 1
- XKTZWUACRZHVAN-VADRZIEHSA-N interleukin-8 Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@@H](NC(C)=O)CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(=O)N1[C@H](CCC1)C(=O)N1[C@H](CCC1)C(=O)N[C@@H](C)C(=O)N[C@H](CC(O)=O)C(=O)N[C@H](CCC(O)=O)C(=O)N[C@H](CC(O)=O)C(=O)N[C@H](CC=1C=CC(O)=CC=1)C(=O)N[C@H](CO)C(=O)N1[C@H](CCC1)C(N)=O)C1=CC=CC=C1 XKTZWUACRZHVAN-VADRZIEHSA-N 0.000 description 1
- 229940118526 interleukin-9 Drugs 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 210000000936 intestine Anatomy 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000007913 intrathecal administration Methods 0.000 description 1
- VBUWHHLIZKOSMS-RIWXPGAOSA-N invicorp Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC=1NC=NC=1)C(C)C)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=C(O)C=C1 VBUWHHLIZKOSMS-RIWXPGAOSA-N 0.000 description 1
- 210000004153 islets of langerhan Anatomy 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 108010075476 isoleucyl-glutamyl-leucyl-leucyl-glutaminyl-alanyl-arginine Proteins 0.000 description 1
- 210000002510 keratinocyte Anatomy 0.000 description 1
- 201000010982 kidney cancer Diseases 0.000 description 1
- 210000003292 kidney cell Anatomy 0.000 description 1
- 229940039781 leptin Drugs 0.000 description 1
- NRYBAZVQPHGZNS-ZSOCWYAHSA-N leptin Chemical compound O=C([C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)CCSC)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CS)C(O)=O NRYBAZVQPHGZNS-ZSOCWYAHSA-N 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 239000006193 liquid solution Substances 0.000 description 1
- 239000006194 liquid suspension Substances 0.000 description 1
- 210000005265 lung cell Anatomy 0.000 description 1
- 210000001165 lymph node Anatomy 0.000 description 1
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 230000002934 lysing effect Effects 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- VWHRYODZTDMVSS-QMMMGPOBSA-N m-fluoro-L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC(F)=C1 VWHRYODZTDMVSS-QMMMGPOBSA-N 0.000 description 1
- 210000002540 macrophage Anatomy 0.000 description 1
- 208000002780 macular degeneration Diseases 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 230000003211 malignant effect Effects 0.000 description 1
- 208000015486 malignant pancreatic neoplasm Diseases 0.000 description 1
- 208000024393 maple syrup urine disease Diseases 0.000 description 1
- 210000002901 mesenchymal stem cell Anatomy 0.000 description 1
- CWWARWOPSKGELM-SARDKLJWSA-N methyl (2s)-2-[[(2s)-2-[[2-[[(2s)-2-[[(2s)-2-[[(2s)-5-amino-2-[[(2s)-5-amino-2-[[(2s)-1-[(2s)-6-amino-2-[[(2s)-1-[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carbonyl]amino]hexanoyl]pyrrolidine-2-carbonyl]amino]-5-oxopentanoyl]amino]-5 Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)OC)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCCCN)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CCCN=C(N)N)C1=CC=CC=C1 CWWARWOPSKGELM-SARDKLJWSA-N 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- XLTANAWLDBYGFU-UHFFFAOYSA-N methyllycaconitine hydrochloride Natural products C1CC(OC)C2(C3C4OC)C5CC(C(C6)OC)C(OC)C5C6(O)C4(O)C2N(CC)CC31COC(=O)C1=CC=CC=C1N1C(=O)CC(C)C1=O XLTANAWLDBYGFU-UHFFFAOYSA-N 0.000 description 1
- 201000003694 methylmalonic acidemia Diseases 0.000 description 1
- 108091061970 miR-26a stem-loop Proteins 0.000 description 1
- 108091070501 miRNA Proteins 0.000 description 1
- 239000002679 microRNA Substances 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000007431 microscopic evaluation Methods 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 230000004001 molecular interaction Effects 0.000 description 1
- SLZIZIJTGAYEKK-CIJSCKBQSA-N molport-023-220-247 Chemical compound C([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(N)=O)NC(=O)[C@H]1N(CCC1)C(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)CN)[C@@H](C)O)C1=CNC=N1 SLZIZIJTGAYEKK-CIJSCKBQSA-N 0.000 description 1
- 210000001616 monocyte Anatomy 0.000 description 1
- 229940051875 mucins Drugs 0.000 description 1
- 206010028093 mucopolysaccharidosis Diseases 0.000 description 1
- 208000005340 mucopolysaccharidosis III Diseases 0.000 description 1
- 208000000690 mucopolysaccharidosis VI Diseases 0.000 description 1
- 208000011045 mucopolysaccharidosis type 3 Diseases 0.000 description 1
- 208000010978 mucopolysaccharidosis type 4 Diseases 0.000 description 1
- 210000004400 mucous membrane Anatomy 0.000 description 1
- 201000006417 multiple sclerosis Diseases 0.000 description 1
- 208000010805 mumps infectious disease Diseases 0.000 description 1
- 230000037257 muscle growth Effects 0.000 description 1
- 201000000585 muscular atrophy Diseases 0.000 description 1
- 201000006938 muscular dystrophy Diseases 0.000 description 1
- YOHYSYJDKVYCJI-UHFFFAOYSA-N n-[3-[[6-[3-(trifluoromethyl)anilino]pyrimidin-4-yl]amino]phenyl]cyclopropanecarboxamide Chemical compound FC(F)(F)C1=CC=CC(NC=2N=CN=C(NC=3C=C(NC(=O)C4CC4)C=CC=3)C=2)=C1 YOHYSYJDKVYCJI-UHFFFAOYSA-N 0.000 description 1
- 229950006780 n-acetylglucosamine Drugs 0.000 description 1
- 210000000822 natural killer cell Anatomy 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 210000001178 neural stem cell Anatomy 0.000 description 1
- 208000015122 neurodegenerative disease Diseases 0.000 description 1
- PCJGZPGTCUMMOT-ISULXFBGSA-N neurotensin Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H]1NC(=O)CC1)C1=CC=C(O)C=C1 PCJGZPGTCUMMOT-ISULXFBGSA-N 0.000 description 1
- 230000000508 neurotrophic effect Effects 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 230000003472 neutralizing effect Effects 0.000 description 1
- 210000000440 neutrophil Anatomy 0.000 description 1
- 231100000252 nontoxic Toxicity 0.000 description 1
- 230000003000 nontoxic effect Effects 0.000 description 1
- 210000004248 oligodendroglia Anatomy 0.000 description 1
- 238000011275 oncology therapy Methods 0.000 description 1
- 201000007976 ornithine translocase deficiency Diseases 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 201000002528 pancreatic cancer Diseases 0.000 description 1
- 208000008443 pancreatic carcinoma Diseases 0.000 description 1
- 244000045947 parasite Species 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000007918 pathogenicity Effects 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 210000001428 peripheral nervous system Anatomy 0.000 description 1
- 238000002823 phage display Methods 0.000 description 1
- 239000008177 pharmaceutical agent Substances 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 108010012604 pneumadin Proteins 0.000 description 1
- 108010089520 pol Gene Products Proteins 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 201000004012 propionic acidemia Diseases 0.000 description 1
- 210000002307 prostate Anatomy 0.000 description 1
- 201000001514 prostate carcinoma Diseases 0.000 description 1
- 210000005267 prostate cell Anatomy 0.000 description 1
- 238000004445 quantitative analysis Methods 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- ZAHRKKWIAAJSAO-UHFFFAOYSA-N rapamycin Natural products COCC(O)C(=C/C(C)C(=O)CC(OC(=O)C1CCCCN1C(=O)C(=O)C2(O)OC(CC(OC)C(=CC=CC=CC(C)CC(C)C(=O)C)C)CCC2C)C(C)CC3CCC(O)C(C3)OC)C ZAHRKKWIAAJSAO-UHFFFAOYSA-N 0.000 description 1
- 108010043277 recombinant soluble CD4 Proteins 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 238000009256 replacement therapy Methods 0.000 description 1
- 230000004043 responsiveness Effects 0.000 description 1
- 210000003583 retinal pigment epithelium Anatomy 0.000 description 1
- 201000005404 rubella Diseases 0.000 description 1
- 229940043230 sarcosine Drugs 0.000 description 1
- 208000010157 sclerosing cholangitis Diseases 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 125000005629 sialic acid group Chemical group 0.000 description 1
- 125000002593 sialoyl group Chemical group 0.000 description 1
- 239000002924 silencing RNA Substances 0.000 description 1
- 229960002930 sirolimus Drugs 0.000 description 1
- 210000004927 skin cell Anatomy 0.000 description 1
- 239000004055 small Interfering RNA Substances 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 210000001057 smooth muscle myoblast Anatomy 0.000 description 1
- 210000000329 smooth muscle myocyte Anatomy 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 201000003624 spinocerebellar ataxia type 1 Diseases 0.000 description 1
- 210000004989 spleen cell Anatomy 0.000 description 1
- 210000001324 spliceosome Anatomy 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 210000002784 stomach Anatomy 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 230000019635 sulfation Effects 0.000 description 1
- 238000005670 sulfation reaction Methods 0.000 description 1
- 238000001356 surgical procedure Methods 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 238000013268 sustained release Methods 0.000 description 1
- 239000012730 sustained-release form Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 239000006188 syrup Substances 0.000 description 1
- 235000020357 syrup Nutrition 0.000 description 1
- 238000012385 systemic delivery Methods 0.000 description 1
- 101150047061 tag-72 gene Proteins 0.000 description 1
- 229940118376 tetanus toxin Drugs 0.000 description 1
- ZRKFYGHZFMAOKI-QMGMOQQFSA-N tgfbeta Chemical compound C([C@H](NC(=O)[C@H](C(C)C)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC)C(C)C)[C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O)C1=CC=C(O)C=C1 ZRKFYGHZFMAOKI-QMGMOQQFSA-N 0.000 description 1
- YSMODUONRAFBET-WHFBIAKZSA-N threo-5-hydroxy-L-lysine Chemical compound NC[C@@H](O)CC[C@H](N)C(O)=O YSMODUONRAFBET-WHFBIAKZSA-N 0.000 description 1
- 210000001541 thymus gland Anatomy 0.000 description 1
- 230000000699 topical effect Effects 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 108700012359 toxins Proteins 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 238000011820 transgenic animal model Methods 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 230000005740 tumor formation Effects 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 201000007972 tyrosinemia type I Diseases 0.000 description 1
- 230000004222 uncontrolled growth Effects 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 210000003932 urinary bladder Anatomy 0.000 description 1
- 201000005112 urinary bladder cancer Diseases 0.000 description 1
- 210000004291 uterus Anatomy 0.000 description 1
- 239000002525 vasculotropin inhibitor Substances 0.000 description 1
- 239000005526 vasoconstrictor agent Substances 0.000 description 1
- 230000029812 viral genome replication Effects 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
- 230000003442 weekly effect Effects 0.000 description 1
- 229940051021 yellow-fever virus Drugs 0.000 description 1
Images
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
- A61K48/005—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'active' part of the composition delivered, i.e. the nucleic acid delivered
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K45/00—Medicinal preparations containing active ingredients not provided for in groups A61K31/00 - A61K41/00
- A61K45/06—Mixtures of active ingredients without chemical characterisation, e.g. antiphlogistics and cardiaca
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N5/00—Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
- C12N5/06—Animal cells or tissues; Human cells or tissues
- C12N5/0602—Vertebrate cells
- C12N5/0634—Cells from the blood or the immune system
- C12N5/0636—T lymphocytes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2750/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
- C12N2750/00011—Details
- C12N2750/14011—Parvoviridae
- C12N2750/14111—Dependovirus, e.g. adenoassociated viruses
- C12N2750/14122—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2750/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
- C12N2750/00011—Details
- C12N2750/14011—Parvoviridae
- C12N2750/14111—Dependovirus, e.g. adenoassociated viruses
- C12N2750/14141—Use of virus, viral particle or viral elements as a vector
- C12N2750/14143—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2750/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
- C12N2750/00011—Details
- C12N2750/14011—Parvoviridae
- C12N2750/14111—Dependovirus, e.g. adenoassociated viruses
- C12N2750/14141—Use of virus, viral particle or viral elements as a vector
- C12N2750/14145—Special targeting system for viral vectors
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Biotechnology (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Biophysics (AREA)
- Virology (AREA)
- Medicinal Chemistry (AREA)
- Microbiology (AREA)
- Gastroenterology & Hepatology (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Public Health (AREA)
- Epidemiology (AREA)
- Animal Behavior & Ethology (AREA)
- Pharmacology & Pharmacy (AREA)
- Veterinary Medicine (AREA)
- Immunology (AREA)
- Hematology (AREA)
- Cell Biology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
- Medicines Containing Material From Animals Or Micro-Organisms (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
本發明提供變異體AAV衣殼蛋白及包含其之AAV衣殼及病毒載體。與天然AAV衣殼序列相比,本文所述之病毒載體在所關注靶細胞,諸如T細胞中可具有增加的轉導。本發明亦提供將本發明之病毒載體及病毒衣殼投與至細胞或有需要之患者的方法。
Description
本發明係關於來自腺相關病毒(AAV)之變異體衣殼蛋白及包含其之病毒衣殼及病毒載體。特定言之,本發明係關於變異體AAV衣殼蛋白及包含其之AAV衣殼,其可併入病毒載體中以賦予增強T細胞之活體內及/或離體細胞轉導的表型。
腺相關病毒(AAV)係一種小型單股DNA病毒,其屬於細小病毒科之依賴病毒屬。由於AAV能夠感染多種細胞及組織類型、缺乏致病性、免疫原性低以及能夠有效轉導非分裂細胞,因此AAV係用於基因療法之有前景的病毒載體。每種已知的AAV血清型均具有不同的感染特定細胞類型之能力。
人們對使用AAV靶向T細胞感興趣。例如,AAV靶向T細胞可用於基因療法,以預防、限制及/或逆轉T細胞耗竭。T細胞耗竭係在許多慢性感染及癌症期間出現的一種T細胞功能障礙狀態,且亦已被證明會降低CAR-T療法的有效性。然而,AAV通常不以高水準轉導T細胞。
因此,此項技術中需要能夠以增強的轉導效率靶向T細胞之經改良AAV載體。
本發明係關於包含一或多種轉導相關肽之腺相關病毒(AAV)衣殼蛋白,以及包含其之AAV衣殼及病毒載體。所揭示之轉導相關肽可增強AAV載體向所需細胞類型,諸如T細胞的細胞轉導。
本發明提供包含衣殼蛋白之重組腺相關病毒(AAV)載體,其中該衣殼蛋白包含具有SEQ ID NO: 17至23中任一者之序列的轉導相關肽。在一些實施方案中,衣殼蛋白包含與SEQ ID NO: 1具有至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列。在一些實施例中,轉導相關肽取代對應於SEQ ID NO: 1之胺基酸454-460的胺基酸。在一些實施例中,衣殼蛋白包含選自由以下組成之群的胺基酸序列:SEQ ID NO: 2、4、6、8、10、12及14,或與其至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致的序列。
本發明提供包含衣殼蛋白之重組AAV載體,其中衣殼蛋白包含SEQ ID NO: 1之序列,其中SEQ ID NO: 1之胺基酸454-460經包含序列X1-X2-X3-X4-X5-X6-X7 (SEQ ID NO: 24)之轉導相關肽取代。在一些實施例中,X1不為G,X2不為S,X3不為A,X4不為Q,X5不為N,X6不為K,及/或X7不為D。在一些實施例中,X1為H、M、A、Q、V或S。在一些實施例中,X2為A或T。在一些實施例中,X3為P或T。在一些實施例中,X4為R或D。在一些實施例中,X5為V、Q、C、S或D。在一些實施例中,X6為E、A或P。在一些實施例中,X7為E、G、N、T或A。在一些實施例中,X1為H,X2為A,X3為P,X4為R,X5為V,X6為E,且X7為E。在一些實施例中,X1為M,X2為A,X3為P,X4為R,X5為Q,X6為E,且X7為G。在一些實施例中,X1為H,X2為T,X3為T,X4為D,X5 為C,X6為A,且X7為N。在一些實施例中,X1為A,X2為A,X3為P,X4為R,X5為S,X6為E,且X7為T。在一些實施例中,X1為Q,X2為A,X3為P,X4為R,X5為Q,X6為E,且X7為G。在一些實施例中,X1為V,X2為A,X3為P,X4為R,X5 為D,X6為P,且X7為A。在一些實施例中,X1為S,X2為A,X3為P,X4為R,X5為S,X46為E,且X7為N。
在一些實施例中,衣殼蛋白包含與SEQ ID NO: 1具有至少約95%、至少約96%、至少約97%、至少約98%或至少約99%一致性的胺基酸序列。在一些實施例中,衣殼蛋白包含與SEQ ID NO: 1具有約99%一致性的胺基酸序列。在一些實施例中,衣殼蛋白包含選自由以下組成之群的胺基酸序列:SEQ ID NO: 2、4、6、8、10、12及14。
本發明提供包含衣殼蛋白之重組AAV載體,其中衣殼蛋白包含具有SEQ ID NO: 16之胺基酸序列的轉導相關肽,其中該轉導相關肽相對於SEQ ID NO: 1取代胺基酸454-460。在一些實施例中,轉導相關肽具有SEQ ID NO: 17-23中任一者之胺基酸序列。
本發明提供編碼具有SEQ ID NO: 2、4、6、8、10、12及14中任一者之序列之重組AAV衣殼蛋白的核酸。在一些實施例中,核酸包含選自由SEQ ID NO: 3、5、7、9、11、13及15組成之群的序列。在一些實施例中,核酸為DNA序列。在一些實施例中,核酸為RNA序列。本發明提供包含本文揭示之任一種核酸的表現載體。本發明進一步提供包含本文揭示之任一種核酸或本文揭示之任一種表現載體的細胞。
在一些實施例中,本文揭示之任一種重組AAV載體進一步包含經衣殼蛋白包裹之貨物核酸。在一些實施例中,貨物核酸編碼治療性蛋白質或治療性RNA。在一些實施例中,與不包含轉導相關肽之AAV載體相比,AAV載體展現增加的向細胞中之轉導。在一些實施例中,細胞為T細胞。在一些實施例中,與不包含轉導相關肽之AAV載體相比,AAV載體展現增加的向T細胞之細胞核中之轉導。在一些實施例中,與不包含轉導相關肽之AAV載體相比,AAV載體展現增加的向T細胞之胞質液中之轉導。
本發明提供組合物,其包含本文揭示之任一種重組AAV載體、本文揭示之任一種核酸、本文揭示之任一種表現載體或本文揭示之任一種細胞。本發明進一步提供醫藥組合物,其包含本文揭示之任一種細胞或本文揭示之任一種重組AAV載體;及醫藥學上可接受之載體。
本發明提供將AAV載體遞送至細胞中之方法,其包含使細胞與本文揭示之任一種AAV載體接觸。在一些實施例中,細胞接觸係在活體外、離體或活體內進行。在一些實施例中,細胞為T細胞。本發明提供治療有需要之個體的方法,其包含向該個體投與有效量的本文揭示之任一種AAV載體。本發明提供治療有需要之個體的方法,其包含向該個體投與已與本文揭示之任一種AAV載體離體接觸的細胞。在一些實施例中,個體為哺乳動物。在一些實施例中,個體為人類。本發明提供用作藥劑的本文揭示之任一種AAV載體。本發明亦提供本文揭示之任一種AAV載體,用於治療有需要之個體的方法。
此等及其他實施例將在下文所述之實施方式中更詳細地闡述。
相關申請案之交叉引用
本申請案主張2021年1月14日申請之美國臨時申請案第63/137,497號的優先權,該申請案之內容以引用之方式整體併入本文中。
電子提交之文字檔案之描述
本文電子提交之文字檔案的內容以引用方式整體併入:序列表之電腦可讀格式複本(檔案名稱:STRD_022_01WO Sequence Listing.txt,記錄日期為2022年1月4日,檔案大小為163.4千位元組)。
除非另外定義,否則本文所用之所有技術及科學術語均具有與本發明所屬技術領域之一般技術者通常所理解相同的含義。在本文之實施方式中使用之術語僅出於描述特定實施例之目的且不意欲進行限制。
本文提及之所有出版物、專利申請案、專利、GenBank或其他寄存編號及其他參考文獻出於所有目的以引用方式整體併入。
在本發明及所附申請專利範圍中,AAV衣殼蛋白中之所有胺基酸位置的指定係相對於VP1衣殼次單元編號。熟習此項技術者應理解,若將本文所述之修飾插入AAV cap基因中,則可能引起VP1、VP2及/或VP3衣殼次單元之修飾。替代地,衣殼次單元可獨立地表現以僅在衣殼次單元中之一或兩者(VP1、VP2、VP3、VP1 + VP2、VP1 + VP3或VP2 + VP3)中達成修飾。
定義
在本文之描述及所附申請專利範圍中使用以下術語:
除非上下文另有明確指示,否則單數形式「一(a/an)」及「該」亦欲包括複數形式。
此外,如本文所用之術語「約」在提及可量測之值(例如多核苷酸或多肽序列之長度、劑量、時間、溫度及諸如此類之量)時意欲涵蓋指定值之± 20%、± 10%、± 5%、± 1%、± 0.5%或甚至± 0.1%之變化。
同樣,如本文所用之「及/或」係指且涵蓋一或多個相關所列條目之任何及所有可能的組合,以及當在替代方案中解釋時缺少組合(「或」)。
除非上下文另有指示,否則本文所述之各種特徵可明確旨在以任何組合使用。
此外,本發明亦考慮在一些實施例中,可排除或省略本文所述之任一特徵或特徵組合。為進一步說明,例如,若說明書指示特定胺基酸可選自A、G、I、L及/或V,則該語言亦指示該胺基酸可選自該(等)胺基酸之任何子集,例如A、G、I或L;A、G、I或V;A或G;僅L等,如同此類子組合各自在本文中明確闡述一般。此外,此類語言亦指示,可放棄一或多個指定之胺基酸。舉例而言,在一些實施例中,胺基酸不為A、G或I;不為A;不為G或V等,如同每個該可能之放棄在本文中明確闡述一般。
如本文所用之術語「減少(reduce/reduces/reduction)」及類似術語意謂減少至少約10%、約15%、約20%、約25%、約35%、約50%、約75%、約80%、約85%、約90%、約95%、約97%或更多。
如本文所用之術語「增強(enhance/enhances/enhancement)」及類似術語指示增加至少約10%、約15%、約20%、約25%、約35%、約50%、約75%、約80%、約85%、約90%、約95%、約100%、約150%、約200%、約300%、約400%、約500%或更多。
如本文所用之術語「細小病毒」涵蓋細小病毒科(Parvoviridae),包括自主複製細小病毒及依賴病毒。自主細小病毒包括
原細小病毒屬 (Protoparvovirus)、
紅細小病毒屬 (Erythroparvovirus)、
博卡病毒屬 (Bocaparvirus)及
濃核病毒屬 (Densovirus)亞科之成員。例示性自主細小病毒包括但不限於小鼠微小病毒、牛細小病毒、犬細小病毒、雞細小病毒、貓泛白血球症病毒、貓細小病毒、鵝細小病毒、H1細小病毒、番鴨細小病毒、B19病毒及現在已知或後來發現之任何其他自主細小病毒。其他自主細小病毒為熟習此項技術者已知的。參見例如BERNARD N. FIELDS等人, VIROLOGY, 第2卷, 第69章(第4版, Lippincott-Raven Publishers;Cotmore等人Archives of Virology DOI 10.1007/s00705-013-1914-I)。術語「個體(subject)」、「個體(individual)」及「患者」在本文中可互換使用以指脊椎動物,諸如哺乳動物。哺乳動物可為例如小鼠、大鼠、兔、貓、狗、豬、羊、馬、非人靈長類動物(例如食蟹猴、黑猩猩)或人類。亦考慮活體內獲得或活體外培養之個體的組織、細胞或其衍生物。人類個體可為成人、青少年、兒童(2歲至14歲)、嬰兒(1個月至24個月)或新生兒(最多1個月)。在一些實施例中,成人為約65歲或以上、或約60歲或以上的老年人。在一些實施例中,個體為孕婦或打算懷孕的婦女。在一些實施例中,個體「需要」本文所述之方法。
如本文所用之術語「腺相關病毒」 (AAV)包括但不限於1型AAV、2型AAV、3型(包括3A型及3B型) AAV、4型AAV、5型AAV、6型AAV、7型AAV、8型AAV、9型AAV、10型AAV、11型AAV、12型AAV、13型AAV、rh32.33型AAV、rh8型AAV、rh10型AAV、rh74型AAV、hu.68型AAV、禽AAV、牛AAV、犬AAV、馬AAV、羊AAV、蛇AAV、髭頰蜥AAV、AAV2i8、AAV2g9、AAV-LK03、AAV7m8、AAV Anc80、AAV PHP.B及現在已知或後來發現之任何其他AAV。例如,參見BERNARD N. FIELDS等人,VIROLOGY,第2卷,第69章(第4版,Lippincott-Raven Publishers)。已鑑別出多種AAV血清型及演化支(例如,參見Gao等人(2004) J. Virology 78:6381-6388;Moris等人(2004) Virology 33-:375-383;及表2)。
如本文所用之術語「嵌合AAV」係指包含衣殼蛋白之AAV,該衣殼蛋白具有衍生自兩種或更多種不同AAV血清型之區、域及/或個別胺基酸。在一些實施例中,嵌合AAV包含衣殼蛋白,該衣殼蛋白包含衍生自第一AAV血清型之第一區及衍生自第二AAV血清型之第二區。在一些實施例中,嵌合AAV包含衣殼蛋白,該衣殼蛋白包含衍生自第一AAV血清型之第一區、衍生自第二AAV血清型之第二區及衍生自第三AAV血清型之第三區。在一些實施例中,嵌合AAV可包含衍生自AAV1、AAV2、AAV3、AAV4、AAV5、AAV6、AAV7、AAV8、AAV9、AAV10、AAV11及/或AAV12中之兩者或更多者的區、域、個別胺基酸。舉例而言,嵌合AAV可包括如下文(表1)所示之來自第一及第二AAV血清型之區、域及/或個別胺基酸,其中AAVX+Y指示包括衍生自AAVX及AAVY之序列的嵌合AAV。
表 1 :嵌合 AAV
藉由在一個衣殼蛋白中包括來自多種AAV血清型之個別胺基酸或區域,可獲得具有單獨衍生自多種AAV血清型之多種所需性質的衣殼蛋白。
各種AAV血清型及自主細小病毒之基因體序列以及天然末端重複(TR)、Rep蛋白及衣殼蛋白之序列為此項技術中已知的。此類序列可參見文獻或諸如GenBank之公共資料庫。例如,參見GenBank寄存編號NC_002077、NC_001401、NC_001729、NC_001863、NC_001829、NC_001 862、AAB95450.1、NC_000883、NC_001701、NC_001510、NC_006152、NC_006261、AF063497、U89790、AF043303、AF028705、AF028704、J02275、J01901、J02275、X01457、AF288061、AH009962、AY028226、AY028223、NC_001358、NC_001540、AF513851、AF513852、AY530579;其關於教示細小病毒以及AAV核酸及胺基酸序列之揭示內容以引用方式併入本文中。
亦參見例如Srivistava等人(1983) J. Virology 45:555;Chiorini等人(1998) J Virology 71:6823;Chiorini等人(1999) J. Virology 73: 1309;Bantel-Schaal等人(1999) J Virology 73:939;Xiao等人(1999) J Virology 73:3994;Muramatsu等人(1996) Virology 221:208;Shade等人(1986) J. Virol. 58:921;Gao等人(2002) Proc. Nat. Acad. Sci. USA 99:11854;Moris等人(2004) Virology 33:375-383;國際專利公開案WO 00/28061、WO 99/61601、WO 98/11244;及美國專利第6,156,303號;其關於教示細小病毒以及AAV核酸及胺基酸序列之揭示內容以引用方式併入本文中。亦參見表2。自主細小病毒及AAV之衣殼結構更詳細描述於BERNARD N. FIELDS等人,VIROLOGY,第2卷,第69及70章(第4版,Lippincott-Raven Publishers)中。亦參見AAV2 (Xie等人(2002) Proc. Nat. Acad. Sci. 99: 10405-10)、AAV9 (DiMattia等人(2012) J. Virol. 86:6947-6958)、AAV8 (Nam等人(2007) J. Virol. 81: 12260-12271)、AAV6 (Ng等人(2010) J. Virol. 84:12945-12957)、AAV5 (Govindasamy等人(2013) J. Virol. 87, 11187-11199)、AAV4 (Govindasamy等人(2006) J. Virol. 80:11556-11570)、AAV3B (Lerch等人(2010) Virology 403:26-36)、BPV (Kailasan等人(2015) J. Virol. 89:2603-2614)及CPV (Xie等人(1996) J. Mol. Biol. 6:497-520及Tsao等人(1991) Science 251:1456-64)之晶體結構的描述。
表 2
GenBank 寄存編號 | GenBank 寄存編號 | GenBank 寄存編號 | |||
完整基因體 | 演化支C | Rh57 | AY530569 | ||
腺相關病毒1 | NC_002077, AF063497 | Hu9 | AY530629 | Rh50 | AY530563 |
腺相關病毒2 | NC_001401 | Hu10 | AY530576 | Rh49 | AY530562 |
腺相關病毒3 | NC_001729 | Hu11 | AY530577 | Hu39 | AY530601 |
腺相關病毒3B | NC_001863 | Hu53 | AY530615 | Rh58 | AY530570 |
腺相關病毒4 | NC_001829 | Hu55 | AY530617 | Rh61 | AY530572 |
腺相關病毒5 | Y18065, AF085716 | Hu54 | AY530616 | Rh52 | AY530565 |
腺相關病毒6 | NC_001862, AAB95450.1 | Hu7 | AY530628 | Rh53 | AY530566 |
禽AAV ATCC VR-865 | AY186198, AY629583, NC_004828 | Hu18 | AY530583 | Rh51 | AY530564 |
禽AAV株DA-1 | NC_006263, AY629583 | Hu15 | AY530580 | Rh64 | AY530574 |
牛AAV | NC_005889, AY388617, AAR26465 | Hu16 | AY530581 | Rh43 | AY530560 |
AAV11 | AAT46339, AY631966 | Hu25 | AY530591 | AAV8 | AF513852 |
AAV12 | ABI16639, DQ813647 | Hu60 | AY530622 | Rh8 | AY242997 |
演化支A | Ch5 | AY243021 | Rh1 | AY530556 | |
AAV1 | NC_002077, AF063497 | Hu3 | AY530595 | 演化支F | |
AAV6 | NC_001862 | Hu1 | AY530575 | Hu14 (AAV9) | AY530579 |
Hu.48 | AY530611 | Hu4 | AY530602 | Hu31 | AY530596 |
Hu 43 | AY530606 | Hu2 | AY530585 | Hu32 | AY530597 |
Hu 44 | AY530607 | Hu61 | AY530623 | HSC1 | MI332400.1 |
Hu 46 | AY530609 | 演化支D | HSC2 | MI332401.1 | |
演化支B | Rh62 | AY530573 | HSC3 | MI332402.1 | |
Hu. 19 | AY530584 | Rh48 | AY530561 | HSC4 | MI332403.1 |
Hu. 20 | AY530586 | Rh54 | AY530567 | HSC5 | MI332405.1 |
Hu 23 | AY530589 | Rh55 | AY530568 | HSC6 | MI332404.1 |
Hu22 | AY530588 | Cy2 | AY243020 | HSC7 | MI332407.1 |
Hu24 | AY530590 | AAV7 | AF513851 | HSC8 | MI332408.1 |
Hu21 | AY530587 | Rh35 | AY243000 | HSC9 | MI332409.1 |
Hu27 | AY530592 | Rh37 | AY242998 | HSC11 | MI332406.1 |
Hu28 | AY530593 | Rh36 | AY242999 | HSC12 | MI332410.1 |
Hu 29 | AY530594 | Cy6 | AY243016 | HSC13 | MI332411.1 |
Hu63 | AY530624 | Cy4 | AY243018 | HSC14 | MI332412.1 |
Hu64 | AY530625 | Cy3 | AY243019 | HSC15 | MI332413.1 |
Hu13 | AY530578 | Cy5 | AY243017 | HSC16 | MI332414.1 |
Hu56 | AY530618 | Rh13 | AY243013 | HSC17 | MI332415.1 |
Hu57 | AY530619 | 演化支E | Hu68 | ||
Hu49 | AY530612 | Rh38 | AY530558 | 選殖分離物 | |
Hu58 | AY530620 | Hu66 | AY530626 | AAV5 | Y18065, AF085716 |
Hu34 | AY530598 | Hu42 | AY530605 | AAV 3 | NC_001729 |
Hu35 | AY530599 | Hu67 | AY530627 | AAV 3B | NC_001863 |
AAV2 | NC_001401 | Hu40 | AY530603 | AAV4 | NC_001829 |
Hu45 | AY530608 | Hu41 | AY530604 | Rh34 | AY243001 |
Hu47 | AY530610 | Hu37 | AY530600 | Rh33 | AY243002 |
Hu51 | AY530613 | Rh40 | AY530559 | Rh32 | AY243003 |
Hu52 | AY530614 | Rh2 | AY243007 | 其他 | |
Hu T41 | AY695378 | Bb1 | AY243023 | Rh74 | |
Hu S17 | AY695376 | Bb2 | AY243022 | 髭頰蜥AAV | |
Hu T88 | AY695375 | Rh10 | AY243015 | 蛇AAV | NC_006148.1 |
Hu T71 | AY695374 | Hu17 | AY530582 | ||
Hu T70 | AY695373 | Hu6 | AY530621 | ||
Hu T40 | AY695372 | Rh25 | AY530557 | ||
Hu T32 | AY695371 | Pi2 | AY530554 | ||
Hu T17 | AY695370 | Pi1 | AY530553 | ||
Hu LG15 | AY695377 | Pi3 | AY530555 |
重組AAV (rAAV)載體可使用病毒產生細胞株在培養物中產生。術語「病毒產生細胞(viral production cell)」、「病毒產生細胞株(viral production cell line)」或「病毒產生細胞(viral producer cell)」係指用於產生病毒載體之細胞。HEK293及239T細胞係常用的病毒產生細胞株。下表8列出用於各種病毒載體之例示性病毒產生細胞株。rAAV之產生通常需要在細胞中存在三種元件:1)側接有AAV反向末端重複(ITR)序列之轉殖基因,2) AAV rep及cap基因,及3)輔助病毒蛋白序列。此三種元件可提供於一或多種質體上,且轉染或轉導至細胞中。
表 8 :例示性病毒產生細胞株
病毒載體 | 例示性病毒產生細胞株 |
腺病毒 | HEK293、911、pTG6559、PER.C6、GH329、N52.E6、HeLa-E1、UR、VLI-293 |
腺相關病毒(AAV) | HEK293、Sf9 |
反轉錄病毒 | HEK293 |
慢病毒 | 293T |
如本文所用之術語「感染複數」或「MOI」係指與細胞接觸之病毒粒子數。例如,經培養細胞可以介於約1 × 10
2至約1 × 10
5個病毒粒子/細胞範圍內之MOI與AAV接觸。
如本文所用,術語「轉導」係指藉由病毒載體將核酸(例如轉殖基因)引入細胞中之過程。本文描述經修飾AAV衣殼蛋白(例如,變異體衣殼蛋白)及包含其之衣殼,其可併入病毒載體中以賦予增強
活體內或
離體細胞轉導的表型。如本文所用,「增強的轉導」、「增強的細胞轉導」及類似術語可指轉導增加約1.5倍至約100倍或更多倍。例如,轉導可增加至少1.5倍、至少2倍、至少3倍、至少4倍、至少5倍、至少10倍、至少20倍、至少30倍、至少40倍、至少50倍、至少60倍、至少70倍、至少80倍、至少90倍、至少100倍或更多倍。相對於野生型或天然AAV載體,可增強經修飾AAV (例如,包含衣殼變異體之AAV)的轉導。在一些實施例中,相對於在其他方面相同但缺少轉導相關肽之AAV載體,可增強包含轉導相關肽之AAV載體的轉導。
術語「轉殖基因」係指用於細胞轉導之任何核酸序列,該細胞可為
離體維持之細胞或生物體中之細胞。轉殖基因可為編碼序列、非編碼序列、cDNA、基因或其片段或部分、基因體序列、調控元件及諸如此類。「轉殖基因」生物體,諸如轉殖基因植物或轉殖基因動物係一種生物體,轉殖基因已被遞送或引入至其中且轉殖基因可在轉殖基因生物體中表現以產生產物,該產物之存在可在生物體中賦予作用(例如,治療或有益作用)及/或表型(例如,期望的或改變的表型)。
如本文所用之術語「向性」係指病毒優先進入某些細胞或組織中,隨後視情況表現(例如轉錄及視情況轉譯)細胞中之病毒基因體攜帶的序列,例如對於重組病毒,表現所關注異源核酸。
熟習此項技術者應理解,自病毒基因體轉錄異源核酸序列可能不會在不存在反式作用因子之情況下啟動,例如,對於誘導型啟動子或以其他方式調節的核酸序列。在rAAV基因體之情況下,來自病毒基因體之基因表現可來自穩定整合的原病毒、來自非整合的游離基因體,以及病毒在細胞內可採取之任何其他形式。
如本文所用之「全身向性」及「全身性轉導」 (及等效術語)指示,本發明之病毒衣殼或病毒載體分別對全身組織(例如腦、肺、骨骼肌、心臟、肝、腎及/或胰臟)展現向性或轉導。在一些實施例中,觀測到肌肉組織(例如骨骼肌、膈肌及心肌)之全身性轉導。在一些實施例中,達成骨骼肌組織之全身性轉導。舉例而言,在一些實施例中,轉導基本上全身所有的骨骼肌(但轉導之效率可因肌肉類型而不同)。在一些實施例中,達成四肢肌肉、心肌及膈肌之全身性轉導。視情況,病毒衣殼或病毒載體係經由全身性途徑(例如諸如靜脈內、關節內或淋巴內之全身性途徑)來投與。
替代地,在一些實施例中,衣殼或病毒載體係經局部遞送(例如至足墊、肌內、皮內、皮下、局部)。
除非另有指示,否則「有效轉導」或「有效向性」或類似術語可藉由參考適宜對照來確定(例如轉導或向性分別為對照之至少約50%、約60%、約70%、約80%、約85%、約90%、約95%或更多)。在一些實施例中,病毒載體有效地轉導T細胞、骨骼肌、心肌、膈肌、胰臟(包括β-胰島細胞)、脾、胃腸道(例如上皮及/或平滑肌)、中樞神經系統細胞、肺、關節細胞及/或腎或對其具有有效向性。適宜對照將取決於多種因素,包括期望向性特徵。在一些實施例中,適宜對照為野生型或天然病毒。
類似地,藉由參考適宜對照,可確定病毒是否「不能有效地轉導靶組織」或「對靶組織不具有效向性」,或類似術語。在一些實施例中,病毒載體不能有效地轉導肝、腎、性腺及/或生殖細胞(亦即對其不具有效向性)。在一些實施例中,組織(例如肝)之不期望轉導為期望靶組織(例如骨骼肌、膈肌、心肌及/或中樞神經系統細胞)之轉導水準的20%或更小、10%或更小、5%或更小、1%或更小、0.1%或更小。
如本文所用,除非另有說明,否則術語「多肽」包括肽及蛋白質。
「多核苷酸」為核苷酸鹼基之序列,且可為RNA、DNA或DNA-RNA雜合序列(包括天然存在及非天然存在之核苷酸二者),但在代表性實施例中為單股或雙股DNA序列。
如本文所用之「經分離」多核苷酸(例如「經分離DNA」或「經分離RNA」)意謂至少部分地與天然生物體或病毒之至少一些其他組分(例如細胞或病毒結構組分或通常發現與多核苷酸相關之其他多肽或核酸)分離的多核苷酸。在代表性實施例中,「經分離」核苷酸與起始材料相比富集至少約10倍、約100倍、約1000倍、約10,000倍或更多倍。
同樣,「經分離」多肽意謂至少部分地與天然生物體或病毒之至少一些其他組分(例如細胞或病毒結構組分或通常發現之與多肽相關之其他多肽或核酸)分離的多肽。在一些實施例中,「經分離」多肽與起始材料相比富集至少約10倍、約100倍、約1000倍、約10,000倍或更多倍。
如本文所用之「分離」或「純化」 (或語法等效物)病毒載體意謂病毒載體至少部分地與起始材料中之至少一些其他組分分離。在一些實施例中,「經分離」或「經純化」之病毒載體與起始材料相比富集至少約10倍、100倍、1000倍、10,000倍或更多倍。
如本文所用,術語「轉導相關肽」係指可併入AAV載體中以改變AAV載體向任何細胞中之轉導的短胺基酸序列。轉導相關肽可能對AAV載體的轉導產生任何影響。例如,在一些實施例中,轉導相關肽增加AAV載體向所關注靶細胞中之轉導。在一些實施例中,轉導相關肽降低AAV載體向未經靶向之細胞中之轉導。轉導相關肽可插入至現有AAV衣殼序列中(亦即在序列中產生淨添加的胺基酸),或其可替換AAV衣殼序列之現有部分(亦即不產生序列中之胺基酸數量的淨變化或減少)。
「治療性多肽」或「治療性蛋白質」為可緩解、減輕、預防、延遲及/或穩定由細胞或個體中蛋白質缺失或缺陷引起之症狀的多肽,及/或為以其他方式賦予個體益處(例如抗癌效應或移植存活性之改良)的多肽。
術語「治療(treat/treating/treatment of)」 (及其語法變化形式)意謂個體病況之嚴重程度降低、至少部分地改良或穩定,及/或至少一種臨床症狀達成一定程度之緩解、減輕、減少或穩定,及/或疾病或病症之進展有所延遲。術語「個體」及術語「患者」在本文中可互換使用。
術語「預防(prevent/preventing/prevention)」 (及其語法變化形式)係指預防及/或延遲個體之疾病、病症及/或臨床症狀之發作,及/或相對于在無本發明之方法之情況下會發生的疾病、病症及/或臨床症狀之發作嚴重程度有所降低。預防可為完全的,例如完全無疾病、病症及/或臨床症狀。預防亦可為部分的,使得個體之疾病、病症及/或臨床症狀之發生及/或發作之嚴重程度小於無本發明時會發生的情況。
如本文所用,「治療有效量」係指當投與至個體以治療疾病或疾病之至少一種臨床症狀時,足以影響對疾病或其症狀之此類治療的量。「治療有效量」可視例如疾病及/或疾病之症狀、疾病及/或疾病或病症之症狀之嚴重程度、欲治療患者之年齡、體重及/或健康狀況以及開處方醫師之判斷而變化。在任何給定情況下,合適之量可由熟習此項技術者確定,或能夠藉由常規實驗來確定。
如本文所用之術語「病毒載體」、「載體」或「基因遞送載體」係指病毒(例如AAV)粒子,其用作核酸遞送媒劑且包含包裝在病毒粒子內之載體基因體(例如病毒DNA [vDNA])。替代地,在一些情況下,術語「載體」可用於單獨指載體基因體/vDNA。
「腺相關病毒載體」或「AAV載體」通常包含AAV衣殼及經AAV衣殼包裹之核酸(例如包含轉殖基因之核酸)。「AAV衣殼」係一種近球形的蛋白質殼,其包含約60個以T = 1二十面體對稱性締合及排列的「AAV衣殼蛋白」(在本文中可互換地稱為「AAV衣殼蛋白次單元」或「衣殼蛋白」)。本文所述之AAV載體的AAV衣殼包含複數個AAV衣殼蛋白。當AAV載體描述為包含AAV衣殼蛋白時,應理解,AAV載體包含AAV衣殼,其中AAV衣殼包含一或多個AAV衣殼蛋白。術語「病毒樣粒子(viral-like particle/virus-like particle)」係指不含有包含轉移卡匣或轉殖基因之任何載體基因體或核酸的蛋白質衣殼。術語「AAV載體」、「AAV衣殼」及「AAV衣殼蛋白」有時在本文中可互換使用。基於上下文,一般熟習此項技術者將能夠容易地推斷出所用特定術語的含義。
在一些實施例中,AAV載體可含有包含「轉移卡匣」之核酸,亦即包含一或多個可藉由AAV遞送至細胞之序列的核酸。在一些實施例中,核酸為自互補的(亦即雙股)。在一些實施例中,核酸不為自互補的(亦即單股)。
「rAAV載體基因體」或「rAAV基因體」為包含一或多個異源核酸序列之AAV基因體(亦即vDNA)。rAAV載體通常僅需要順式反向末端重複序列(TR)來生成病毒。所有其他病毒序列係可有可無的且可以反式供應(Muzyczka, (1992) Curr. Topics Microbiol. Immunol. 158:97)。通常,rAAV載體基因體將僅保留一或多個TR序列以最大化可經載體有效包裝之轉殖基因的大小。結構及非結構蛋白編碼序列可以反式提供(例如自諸如質體之載體,或藉由將序列穩定整合至包裝細胞中)。在實施例中,rAAV載體基因體包含至少一個TR序列(例如AAV TR序列),視情況兩個TR (例如兩個AAV TR),其通常將在載體基因體之5'端及3’端且側接異源核酸,但無需與其鄰接。TR可彼此相同或不同。
術語「末端重複序列」或「TR」包括任何病毒末端重複序列或合成序列,其形成髮夾結構且起反向末端重複序列的作用(亦即,介導所需功能,諸如復制、病毒包裝、整合及/或原病毒救援及諸如此類)。TR可為AAV TR或非AAV TR。例如,非AAV TR序列,諸如其他細小病毒(例如犬細小病毒(CPV)、小鼠細小病毒(MVM)、人類細小病毒B-19)序列或任何其他適合之病毒序列(例如作為SV40複製起點之SV40髮夾)可用作TR,其可藉由截短、取代、缺失、插入及/或添加進一步修飾。此外,TR可為部分或完全合成的,諸如授予Samulski等人之美國專利第5,478,745號中所述之「雙D序列」。
「AAV末端重複序列」或「AAV TR」可來自任何AAV,包括但不限於血清型1、2、3、4、5、6、7、8、9、10、11、12、13或任何現在已知或後來發現之其他AAV (參見例如表2)。AAV末端重複序列不需要具有天然末端重複序列(例如,天然AAV TR序列可藉由插入、缺失、截短及/或錯義突變來改變),只要末端重複序列介導所需功能,例如復制、病毒包裝、整合及/或原病毒救援及諸如此類即可。
本發明之病毒載體另外可為如國際專利公開案WO00/28004及Chao等人(2000) Molecular Therapy 2:619中所述之「靶向」病毒載體(例如具有定向向性)及/或「雜合」細小病毒(亦即其中病毒TR及病毒衣殼來自不同的細小病毒)。
本發明之病毒載體另外可為如國際專利公開案WO 01/92551 (其揭示內容以全文引用方式併入本文中)中所述之雙螺旋細小病毒粒子。因此,在一些實施例中,雙股(雙螺旋)基因體可包裝至本發明之病毒衣殼中。
此外,病毒衣殼或基因體元件可含有其他修飾,包括插入、缺失及/或取代。
如本文所用之術語「胺基酸
」涵蓋任何天然胺基酸、其經修飾形式及合成胺基酸。
天然存在之左旋(L-)胺基酸顯示於表3中。
表 3 :胺基酸殘基及縮寫。
胺基酸殘基 | 縮寫 | |
三字母代碼 | 單字母代碼 | |
丙胺酸 | Ala | A |
精胺酸 | Arg | R |
天冬醯胺 | Asn | N |
天冬胺酸(天冬胺酸鹽) | Asp | D |
半胱胺酸 | Cys | C |
麩醯胺酸 | Gln | Q |
麩胺酸(麩胺酸鹽) | Glu | E |
甘胺酸 | Gly | G |
組胺酸 | His | H |
異白胺酸 | Ile | I |
白胺酸 | Leu | L |
離胺酸 | Lys | K |
甲硫胺酸 | Met | M |
苯丙胺酸 | Phe | F |
脯胺酸 | Pro | P |
絲胺酸 | Ser | S |
蘇胺酸 | Thr | T |
色胺酸 | Trp | W |
酪胺酸 | Tyr | Y |
纈胺酸 | Val | V |
替代地,胺基酸可為經修飾胺基酸殘基(非限制性實例顯示於表4中)及/或可為藉由轉譯後修飾(例如乙醯化、醯胺化、甲醯化、羥基化、甲基化、磷酸化或硫酸化)修飾之胺基酸。
表 4 :經修飾胺基酸殘基
經修飾胺基酸殘基 | 縮寫 |
胺基酸殘基衍生物 | |
2-胺基己二酸 | Aad |
3-胺基己二酸 | bAad |
β-丙胺酸,β-胺基丙酸 | bAla |
2-胺基丁酸 | Abu |
4-胺基丁酸,六氫吡啶酸 | 4Abu |
6-胺基己酸 | Acp |
2-胺基庚酸 | Ahe |
2-胺基異丁酸 | Aib |
3-胺基異丁酸 | bAib |
2-胺基庚二酸 | Apm |
第三丁基丙胺酸 | t-BuA |
瓜胺酸 | Cit |
環己基丙胺酸 | Cha |
2,4-二胺基丁酸 | Dbu |
鎖鏈素 | Des |
2,21-二胺基庚二酸 | Dpm |
2,3-二胺基丙酸 | Dpr |
N-乙基甘胺酸 | EtGly |
N-乙基天冬醯胺 | EtAsn |
高精胺酸 | hArg |
高半胱胺酸 | hCys |
高絲胺酸 | hSer |
羥離胺酸 | Hyl |
別-羥離胺酸 | aHyl |
3-羥脯胺酸 | 3Hyp |
4-羥脯胺酸 | 4Hyp |
異鎖鏈素 | Ide |
別-異白胺酸 | aIle |
甲硫胺酸亞碸 | MSO |
N-甲基甘胺酸,肌胺酸 | MeGly |
N-甲基異白胺酸 | MeIle |
6-N-甲基離胺酸 | MeLys |
N-甲基纈胺酸 | MeVal |
2-萘基丙胺酸 | 2-Nal |
正纈胺酸 | Nva |
正白胺酸 | Nle |
鳥胺酸 | Orn |
4-氯苯丙胺酸 | Phe(4-C1) |
2-氟苯丙胺酸 | Phe(2-F) |
3-氟苯丙胺酸 | Phe(3-F) |
4-氟苯丙胺酸 | Phe(4-F) |
苯基甘胺酸 | Phg |
β-2-噻吩基丙胺酸 | Thi |
此外,非天然存在之胺基酸可為「非天然」胺基酸(如Wang等人,Annu Rev Biophys Biomol Struct. 35:225-49 (2006)所述)。此等非天然胺基酸可有利地用於將所關注分子化學連接至AAV衣殼蛋白。
「主動免疫反應」或「主動免疫性
」之特徵在於「在遇到免疫原後宿主組織及細胞之參與。其涉及淋巴網狀組織中免疫勝任細胞之分化及增殖,從而導致抗體之合成或細胞介導之反應性之發展,或兩者皆有」。Herbert B. Herscowitz, Immunophysiology: Cell Function and Cellular Interactions in Antibody Formation, IMMUNOLOGY: BASIC PROCESSES 1 17 (Joseph A. Bellanti編,1985)。換言之,在藉由感染或接種暴露于免疫原後,宿主產生主動免疫反應。主動免疫可與被動免疫形成對比,被動免疫係經由將預成型物質(抗體、轉移因子、胸腺移植物、介白素-2)自經主動免疫之宿主轉移至非免疫宿主來獲得。
如本文所用之「保護性」免疫反應或「保護性」免疫性指示,免疫反應賦予個體一定之益處在於,其預防或減少疾病之發生。替代地,保護性免疫反應或保護性免疫性可用於治療及/或預防疾病,尤其癌症或腫瘤(例如藉由預防癌症或腫瘤形成、藉由使癌症或腫瘤消退及/或藉由預防轉移及/或藉由預防轉移性結節之生長)。保護效應可為完全的或部分的,只要治療之益處超過其任何缺點即可。
如本文所用之術語「癌症」涵蓋腫瘤形成癌症。同樣,術語「癌性組織」涵蓋腫瘤。「癌細胞抗原」涵蓋腫瘤抗原。
術語「癌症」在此項技術中具有其理解之含義,例如,有可能擴散至身體之遠端位點之組織不受控生長(亦即轉移)。例示性癌症包括但不限於黑色素瘤、腺癌、胸腺瘤、淋巴瘤(例如非霍奇金氏淋巴瘤、霍奇金氏淋巴瘤)、肉瘤、肺癌、肝癌、結腸癌、白血病、子宮癌、乳癌、前列腺癌、卵巢癌、子宮頸癌、膀胱癌、腎癌、胰臟癌、腦癌及現在已知或後來鑑別之任何其他癌症或惡性病況。在代表性實施例中,本發明提供治療及/或預防腫瘤形成癌症之方法。
術語「腫瘤」在此項技術中亦理解為例如多細胞生物體內未分化細胞之異常團塊。腫瘤可為惡性或良性的。在代表性實施例中,本文所揭示之方法用於預防及治療惡性腫瘤。
術語「治療癌症」、「癌症之治療」及等效術語欲指降低或至少部分地消除癌症之嚴重程度、及/或減緩及/或控制疾病之進展、及/或使疾病穩定。在一些實施例中,此等術語指示,防止或減少或至少部分地消除癌症之轉移、及/或防止或減少或至少部分地消除轉移性結節之生長。
術語「癌症之預防」或「預防癌症」及等效術語欲指該等方法至少部分地消除或減少及/或延遲癌症發作之發生率及/或嚴重程度。換言之,個體之癌症發作的可能性或概率可降低及/或個體之癌症發作可延遲。
經修飾 AAV 衣殼蛋白及包含其之衣殼
本發明提供AAV衣殼蛋白(VP1、VP2及/或VP3)變異體,以及包含其之病毒衣殼及病毒載體。各衣殼變異體包含一或多個轉導相關肽。轉導相關肽不存在於天然存在之AAV衣殼蛋白中,且在一些實施例中可賦予包含衣殼蛋白之AAV載體增強的向所關注靶細胞(例如T細胞)中之轉導。本文揭示之AAV衣殼蛋白變異體可為相對於現在已知或以後發現之任何AAV血清型之衣殼蛋白的變異體。在一些實施例中,AAV衣殼蛋白變異體為來自選自以下之AAV血清型之衣殼蛋白的變異體:AAV1、AAV2、AAV3、AAV4、AAV5、AAV6、AAV7、AAV8、AAV9、AAV10、AAV11、AAV12、AAVrh.8、AAVrh10、AAVrh32.33、AAVrh74、牛AAV及禽AAV。
a. AAV 衣殼蛋白之修飾
在一些實施例中,本文所述之轉導相關肽可賦予包含經修飾AAV衣殼蛋白之病毒載體一或多種所需特性,包括但不限於在
活體外、
活體內或
離體增強各種細胞類型(例如T細胞)中之細胞轉導。在一些實施例中,本發明之衣殼蛋白可併入AAV載體中。在一些實施例中,與野生型AAV或包含不包含轉導相關肽之AAV衣殼蛋白的AAV病毒粒子或AAV病毒載體相比,包含衣殼蛋白之AAV載體具有增強的細胞轉導(例如增強的T細胞轉導)。在一些實施例中,本發明之AAV病毒粒子或載體亦可逃避中和抗體。
本發明之轉導相關肽可取代野生型AAV衣殼蛋白之胺基酸序列,使得AAV衣殼蛋白序列中之胺基酸數目無淨增加或減少。在一些實施例中,與野生型AAV衣殼蛋白序列相比,用本發明之轉導相關肽取代野生型AAV衣殼蛋白之胺基酸序列可引起胺基酸之淨損失(例如缺失)。例如,轉導相關肽可取代來自以下任一种血清型之AAV衣殼蛋白中之一或多個胺基酸:AAV1、AAV2、AAV3、AAV4、AAV5、AAV6、AAV7、AAV8、AAV9、AAV10、AAV11、AAV12、AAVrh.8、AAVrh.10、AAVrh32.33、AAVrh74、牛AAV及禽AAV。在一些實施例中,本發明之轉導相關肽可插入至野生型AAV衣殼蛋白之胺基酸序列中,使得AAV衣殼蛋白序列中之胺基酸數目增加。
在一些實施例中,AAV衣殼蛋白之修飾使得天然AAV衣殼蛋白之一或多個胺基酸殘基經天然衣殼序列中不存在之胺基酸取代。在一些實施例中,AAV衣殼蛋白之修飾使得以下胺基酸殘基中之一或多者經天然衣殼蛋白序列中不存在之胺基酸取代:454、455、456、457、458、459及460,其中胺基酸編號係相對於野生型AAV6衣殼蛋白之VP1序列,或任何其他AAV血清型之衣殼蛋白中之對應殘基。在一些實施例中,AAV衣殼蛋白之修飾使得以下胺基酸殘基中之一或多者經缺失:454、455、456、457、458、459及460,其中胺基酸編號係相對於野生型AAV6衣殼蛋白之VP1序列,或任何其他AAV血清型之衣殼蛋白中之對應殘基。在一些實施例中,AAV衣殼蛋白之修飾使得胺基酸454、455、456、457、458、459及/或460中之一或多者相對於天然AAV6衣殼蛋白序列(SEQ ID NO: 1)之胺基酸序列經取代。
在一些實施例中,AAV衣殼蛋白包含序列X1-X2-X3-X4-X5-X6-X7 (SEQ ID NO: 24)之轉導相關肽。在一些實施方案中,AAV衣殼蛋白包含序列X1-X2-X3-X4-X5-X6-X7 (SEQ ID NO: 24)之轉導相關肽,其中衣殼蛋白為以下血清型中之任一者:AAV1、AAV2、AAV3、AAV4、AAV5、AAV6、AAV7、AAV8、AAV9、AAV10、AAV11、AAV12、AAVrh.8、AAVrh.10、AAVrh32.33、AAVrh74、牛AAV或禽AAV。在一些實施例中,包含選自SEQ ID NO: 1或25-34中任一者之胺基酸序列的AAV衣殼蛋白包含序列X1-X2-X3-X4-X5-X6-X7 ( SEQ ID NO: 24)之轉導相關肽。在一些實施例中,AAV衣殼蛋白包含天然AAV6衣殼蛋白序列(例如SEQ ID NO: 1)之序列,且進一步包含SEQ ID NO: 24之轉導相關肽。在一些實施例中,AAV衣殼蛋白包含與野生型AAV衣殼蛋白序列,諸如SEQ ID NO: 1或25-34之胺基酸序列具有至少約80%一致性,例如至少約85%、至少約90%、至少約95%、至少約96%、至少約97%、至少約98%、至少約99%、至少約99.5%或約100%一致性的胺基酸序列。在一些實施例中,本文揭示之AAV衣殼蛋白包含與SEQ ID NO: 1具有約99%一致性之胺基酸序列。
SEQ ID NO: 24之轉導相關肽可用於取代所揭示AAV衣殼蛋白之胺基酸序列中任何地方的一或多個胺基酸殘基。在一些實施例中,SEQ ID NO: 24之轉導相關肽可用於取代衣殼蛋白中之序列,其中該衣殼蛋白具有選自SEQ ID NO: 1及25-34中任一者之胺基酸序列。在一些實施例中,序列SEQ ID NO: 24之轉導相關肽可插入至本文揭示之AAV衣殼蛋白的胺基酸序列中。在一些實施例中,用序列SEQ ID NO: 24之轉導相關肽取代本文所述之一或多種AAV衣殼蛋白的天然序列可使得自AAV衣殼蛋白之序列中刪除一或多個胺基酸。在一些實施例中,衣殼蛋白可包含SEQ ID NO: 1之序列,除了 SEQ ID NO: 1之胺基酸454-460經包含序列SEQ ID NO: 24之轉導相關肽取代。在一些實施例中,SEQ ID NO: 24用於取代野生型AAV衣殼蛋白之序列,使得所得序列包含至少一個、兩個、三個等不存在於野生型序列中之個別胺基酸。
在一些實施例中,SEQ ID NO: 24包含一序列,其中X1不為G,X2不為S,X3不為A,X4不為Q,X5不為N,X6不為K,及/或X7不為D。在一些實施例中,X1為H、M、A、Q、V或S。在一些實施例中,X2為A或T。在一些實施例中,X3為P或T。在一些實施例中,X4為R或D。在一些實施例中,X5為V、Q、C、S或D。在一些實施例中,X6為E、A或P。在一些實施例中,X7為E、G、N、T或A。在一些實施例中,X1為H,X2為A,X3為P,X4為R,X5為V,X6為E,且X7為E。在一些實施例中,X1為M,X2為A,X3為P,X4為R,X5為Q,X6為E,且X7為G。在一些實施例中,X1為H,X2為T,X3為T,X4為D,X5為C,X6為A,且X7為N。在一些實施例中,X1為A,X2為A,X3為P,X4為R,X5為S,X6為E,在一些實施例中,X1為Q,X2為A,X3為P,X4為R,X5為Q,X6為E,且X7為G。在一些實施例中,X1為V,X2為A,X3為P,X4為R,X5為D,X6為P,X7為A。在一些實施例中,X1為S,X2為A,X3為P,X4為R,X5為S,X46為E,且X7為N。
在一些實施例中,轉導相關肽具有X1-X2-X3-X4-X5-X6-X7之胺基酸序列,其中X1 = H、M、Q、V或S;X2 = A或T;X3 = P或T;X4 = R或D;X5 = V、Q、C、S或D;X6 = E、A或P;且X7 = E、G、N、T或A (SEQ ID NO: 16)。在一些實施例中,轉導相關肽具有SEQ ID NO: 17-23中任一者之胺基酸序列。
在一些實施例中,AAV衣殼蛋白包含具有SEQ ID NO: 17-23中任一者之胺基酸序列的轉導相關肽。在一些實施例中,具有SEQ ID NO: 17-23中任一者之胺基酸序列的轉導相關肽取代AAV衣殼蛋白之一或多個胺基酸。本發明提供以下任一种血清型之AAV衣殼蛋白的變異體:AAV1、AAV2、AAV3、AAV4、AAV5、AAV6、AAV7、AAV8、AAV9、AAV10、AAV11、AAV12、AAVrh.8、AAVrh.10、AAVrh32.33、AAVrh74、牛AAV及禽AAV,其中AAV衣殼蛋白變異體包含胺基酸序列,該胺基酸序列包含具有SEQ ID NO: 17-23中任一者之胺基酸序列的轉導相關肽。在一些實施例中,AAV衣殼蛋白包含選自SEQ ID NO: 1及25-34 中任一者之胺基酸序列,但其中一或多個胺基酸經具有SEQ ID NO: 17-23中任一者之胺基酸序列的轉導相關肽取代。
在一些實施例中,具有SEQ ID NO:17-23中任一者之胺基酸序列的轉導相關肽取代以下血清型中任一者之AAV衣殼蛋白的一或多個胺基酸:AAV1、AAV2、AAV3、AAV4、AAV5、AAV6、AAV7、AAV8、AAV9、AAV10、AAV11、AAV12、AAVrh.8、AAVrh.10、AAVrh32.33、AAVrh74、牛AAV及禽AAV。在一些實施例中,具有SEQ ID NO: 17-23中任一者之胺基酸序列的轉導相關肽取代AAV衣殼蛋白之一或多個胺基酸,該AAV衣殼蛋白包含選自SEQ ID NO: 1及25-34中任一者之胺基酸序列。
在一些實施例中,天然AAV6衣殼蛋白(例如SEQ ID NO: 1)之胺基酸454-460經包含序列SEQ ID NO: 17-23中之任一者的轉導相關肽取代。在一些實施例中,天然AAV6衣殼蛋白(例如SEQ ID NO: 1)之胺基酸454-460經序列SEQ ID NO: 17之轉導相關肽取代。在一些實施例中,天然AAV6衣殼蛋白(例如SEQ ID NO: 1)之胺基酸454-460經序列SEQ ID NO: 18之轉導相關肽取代。在一些實施例中,天然AAV6衣殼蛋白(例如SEQ ID NO: 1)之胺基酸454-460經序列SEQ ID NO: 19之轉導相關肽取代。在一些實施例中,天然AAV6衣殼蛋白(例如SEQ ID NO: 1)之胺基酸454-460經序列SEQ ID NO: 20之轉導相關肽取代。在一些實施例中,天然AAV6衣殼蛋白(例如SEQ ID NO: 1)之胺基酸454-460經序列SEQ ID NO: 21之轉導相關肽取代。在一些實施例中,天然AAV6衣殼蛋白(例如SEQ ID NO: 1)之胺基酸454-460經序列SEQ ID NO: 22之轉導相關肽取代。在一些實施例中,天然AAV6衣殼蛋白(例如SEQ ID NO: 1)之胺基酸454-460經序列SEQ ID NO: 23之轉導相關肽取代。
在一些實施例中,AAV衣殼蛋白包含選自由SEQ ID NO: 2、4、6、8、10、12及14組成之群的胺基酸序列,或與其至少約80%一致之序列。例如,在一些實施例中,AAV衣殼蛋白包含與SEQ ID NO: 2、4、6、8、10、12或14中之任一者至少約85%、至少約90%、至少約95%、至少約96%、至少約97%、至少約98%、至少約99%一致、至少約99.5%或約100%一致的胺基酸序列。
b. AAV 衣殼蛋白之其他修飾
本發明考慮欲修飾之AAV衣殼蛋白可為天然存在之AAV衣殼蛋白(例如AAV2、AAV3a或3b、AAV4、AAV5、AAV6、AAV7、AAV8、AAV9、AAV10或AAV11衣殼蛋白或表2中所示之任一AAV),但不限於此。熟習此項技術者應理解,對AAV衣殼蛋白之多種操縱為此項技術中已知的,且本發明不限於天然存在之AAV衣殼蛋白之修飾。舉例而言,欲修飾之衣殼蛋白與天然AAV相比可能已具有變化(例如衍生自天然AAV衣殼蛋白,例如AAV2、AAV3a、AAV3b、AAV4、AAV5、AAV6、AAV7、AAV8、AAV9、AAV10、AAV11、AAV12或現在已知或後來發現之任何其他AAV)。在一些實施例中,衣殼蛋白可為工程化AAV,諸如AAV2i8、AAV2g9、AAV-LK03、AAV7m8、AAV Anc80、AAV PHP.B。此類AAV衣殼蛋白亦在本發明之範疇內。
在一些實施例中,AAV衣殼蛋白為嵌合的。例如,嵌合AAV衣殼蛋白可包含衍生自兩種或更多種AAV血清型或三種或更多種AAV血清型之序列。嵌合AAV衣殼蛋白可包含衍生自兩種或更多種以下AAV血清型之序列:AAV1、AAV2、AAV3、AAV4、AAV5、AAV6、AAV7、AAV8、AAV9、AAV10、AAV11、AAV12、AAVrh.8、AAVrh.10、AAVrh32.33、AAVrh74、牛AAV及禽AAV。
因此,在一些實施例中,欲修飾之AAV衣殼蛋白可衍生自天然AAV,但進一步包含插入及/或取代至衣殼蛋白中之一或多個外源序列(例如天然病毒之外源)及/或已藉由缺失一或多個胺基酸而改變。因此,當本文提及特定AAV衣殼蛋白(例如AAV1、AAV2、AAV3、AAV4、AAV5、AAV6、AAV7、AAV8、AAV9、AAV10或AAV11衣殼蛋白或來自表2中所示之任一AAV的衣殼蛋白等)時,其意欲涵蓋天然衣殼蛋白以及具有除本發明之修飾以外之變化的衣殼蛋白。此類變化包括取代、插入及/或缺失。在一些實施例中,與天然AAV衣殼蛋白序列相比,衣殼蛋白包含1個、2個、3個、4個、5個、6個、7個、8個、9個、10個、11個、12個、13個、14個、15個、16個、17個、18個、19個或20個、少於20個、少於30個、少於40個、少於50個、少於60個或少於70個插入其中之胺基酸(本發明之胺基酸序列取代除外)。在實施例中,與天然AAV衣殼蛋白序列相比,衣殼蛋白包含1個、2個、3個、4個、5個、6個、7個、8個、9個、10個、11個、12個、13個、14個、15個、16個、17個、18個、19個或20個、少於20個、少於30個、少於40個、少於50個、少於60個或少於70個胺基酸取代(根據本發明之轉導相關肽除外)。在一些實施例中,與天然AAV衣殼蛋白序列相比,衣殼蛋白包含1個、2個、3個、4個、5個、6個、7個、8個、9個、10個、11個、12個、13個、14個、15個、16個、17個、18個、19個或20個、少於20個、少於30個、少於40個、少於50個、少於60個或少於70個胺基酸之缺失(本發明之轉導相關肽除外)。
根據本發明對AAV衣殼蛋白之修飾為「選擇性」修飾。此方法與使用AAV血清型之間的整個次單元或大域交換之先前工作(例如,參見國際專利公開案WO 00/28004及Hauck等人(2003) J. Virology 77:2768-2774)有所不同。在一些實施例中,「選擇性」修飾產生少於或等於約20個、18個、15個、12個、10個、9個、8個、7個、6個、5個、4個或3個鄰接胺基酸之插入及/或取代及/或缺失。本發明之經修飾衣殼蛋白及衣殼可進一步包含現在已知或後來鑑別之任何其他修飾。在本文所述之實施例(其中胺基酸殘基經除存在於野生型或天然胺基酸序列中之胺基酸殘基外之任一胺基酸殘基取代)中,該任何其他胺基酸殘基可為此項技術中已知之任何天然或非天然胺基酸殘基(例如,參見表3及表4)。在一些實施例中,取代可為保守取代,且在一些實施例中,取代可為非保守取代。
如本文所述,來自多種AAV之衣殼蛋白之胺基酸序列及核酸序列為此項技術中已知的。因此,可容易地確定任何其他AAV之「對應於」天然AAV衣殼蛋白之胺基酸位置的胺基酸(例如藉由使用序列比對)。確定兩個或更多個胺基酸序列之間的序列相似性或一致性之方法為此項技術中已知的。序列相似性或一致性可使用此項技術中已知之標準技術確定,包括但不限於Smith及Waterman, Adv. Appl. Math. 2, 482 (1981)之局部序列一致性演算法;Needleman及Wunsch, J Mol. Biol. 48,443 (1970)之序列一致性比對演算法;Pearson及Lipman, Proc. Natl. Acad. Sci. USA 85,2444 (1988)之相似性搜尋方法;此等演算法(Wisconsin Genetics軟體包中之GAP、BESTFIT、FASTA及TFASTA, Genetics Computer Group, 575 Science Drive, Madison, WI)之計算實施;Devereux等人,Nucl. Acid Res. 12, 387-395 (1984)所述之最佳擬合序列程式或檢查。
另一適宜演算法為Altschul等人,J Mol. Biol. 215, 403-410, (1990)及Karlin等人,Proc. Natl. Acad. Sci. USA 90, 5873-5787 (1993)中所述之BLAST演算法。尤其適用之BLAST程式為自Altschul等人,Methods in Enzymology, 266, 460-480 (1996); blast.wustl/edu/blast/ README.html獲得之WU-BLAST-2程式。WU-BLAST-2利用若干搜尋參數,該等參數視情況設定為預設值。該等參數係動態值且由程式本身視特定序列之組成及正在搜索相關序列之特定資料庫之組成來建立;然而,可調整該等值以增加靈敏度。
此外,另一適用之演算法係如Altschul等人(1997) Nucleic Acids Res. 25, 3389-3402報導之空位化BLAST。
除非另有說明,否則在本發明中使用以下萬維網地址處可用之BLAST演算法進行百分比一致性計算:blast.ncbi.nlm.nih.gov/Blast.cgi。
c. 經修飾病毒衣殼
本發明亦提供包含至少一種本文揭示之變異體衣殼蛋白的病毒衣殼。在一些實施例中,病毒衣殼為細小病毒衣殼,其可進一步為自主細小病毒衣殼或依賴病毒衣殼。視情況,病毒衣殼為AAV衣殼。在一些實施例中,AAV衣殼為AAV1、AAV2、AAV3a、AAV3b、AAV4、AAV5、AAV6、AAV7、AAV8、AAV9、AAV10、AAV11、AAV12、AAVrh8、AAVrh10、AAVrh32.33、牛AAV衣殼、禽AAV衣殼或現在已知或後來鑑別之任何其他AAV。AAV血清型之非限制性清單顯示於表2中。本發明之AAV衣殼可為表2中所列出之任一AAV血清型或因一或多個插入、取代及/或缺失而衍生自前述任一者。經修飾的病毒衣殼可用作「衣殼媒劑」,如在例如美國專利第5,863,541號中所描述。可使用此項技術中已知之任何方法,例如藉由自桿狀病毒表現來產生根據本發明之病毒衣殼(Brown等人, (1994) Virology 198:477-488)。在一些實施例中,AAV衣殼包含約60種本文所述之變異體衣殼蛋白。
在一些實施例中,病毒衣殼可為靶向病毒衣殼,其包含靶向序列(例如取代或插入病毒衣殼中),該靶向序列將病毒衣殼定向至與存在於期望靶組織上之細胞表面分子相互作用(例如,參見國際專利公開案WO 00/28004及Hauck等人(2003) J. Virology 77:2768-2774);Shi等人,Human Gene Therapy 17:353-361 (2006) [描述在AAV衣殼次單元之位置520及/或584處插入整合素受體結合模體RGD];及美國專利第7,314,912號[描述在AAV2衣殼次單元之胺基酸位置447、534、573及587之後插入含有RGD模體之PI肽])。AAV衣殼次單元內之耐受插入之其他位置為此項技術中已知(例如Grifman等人,Molecular Therapy 3:964-975 (2001)所述之位置449及588)。
例如,本發明之病毒衣殼可對某些所關注靶組織(例如肝、骨骼肌、心臟、膈肌、腎、腦、胃、腸、皮膚、內皮細胞及/或肺)具有相對無效的向性。靶向序列可有利地併入此等低轉導載體中,藉此賦予病毒衣殼期望向性,及視情況對細胞,諸如T細胞之特定組織的選擇向性。包含靶向序列之AAV衣殼蛋白、衣殼及載體描述於例如國際專利公開案WO 00/28004中。作為另一實例,如Wang等人,Annu Rev Biophys Biomol Struct. 35:225-49 (2006)所述之一或多個非天然胺基酸可在正交位點併入本發明之AAV衣殼次單元中作為將低轉導載體重定向至期望靶組織之方式。該等非天然胺基酸可有利地用於將所關注分子化學連接至AAV衣殼蛋白,包括但不限於:聚糖(甘露糖-樹突細胞靶向);RGD、鈴蟾素或神經肽,用於靶向遞送至特定癌細胞類型;選自靶向特定細胞表面受體(例如生長因子受體、整合素及諸如此類)之噬菌體展示之RNA適配體或肽。化學修飾胺基酸之方法為此項技術中已知的(例如,參見Greg T. Hermanson, Bioconjugate Techniques,第1版,Academic Press, 1996)。在一些實施例中,靶向序列可為將感染定向至特定細胞類型之病毒衣殼序列(例如自主細小病毒衣殼序列、AAV衣殼序列或任何其他病毒衣殼序列)。
作為另一非限制性實例,可將肝素或硫酸乙醯肝素(HS)結合域(例如呼吸道融合病毒肝素結合結構域)插入或取代至通常不結合HS受體之衣殼次單元(例如AAV4、AAV5)中以賦予肝素及/或硫酸乙醯肝素與所得變異體之結合。此項技術中已知,HS/肝素結合係由精胺酸及/或離胺酸中富含之「鹼性小片段(basic patch)」介導。在例示性實施例中,可採用模體BXXB之序列(SEQ ID NO: 105),其中「B」為鹼性殘基且X為中性及/或疏水性的。作為非限制性實例,BXXB可為RGNR (SEQ ID NO: 106)。作為另一非限制性實例,用BXXB取代天然AAV2衣殼蛋白中之胺基酸位置262至265或另一AAV血清型之衣殼蛋白中之對應位置。
細小病毒B19使用紅血球糖苷酯作為其受體來感染原代紅血球先驅細胞(Brown等人(1993) Science 262: 114)。B19之結構已確定至8 Å解析度(Agbandje-McKenna等人(1994) Virology 203: 106)。B19衣殼之結合至紅血球糖苷酯之區域已定位於胺基酸399-406之間(Chapman等人(1993) Virology 194:419),其為β-桶結構E與F之間的環出區域(Chipman等人(1996) Proc. Nat. Acad. Sci. USA 93:7502)。因此,B19衣殼之紅血球糖苷酯受體結合域可取代至本發明之AAV衣殼蛋白中以使病毒衣殼或包含其之病毒載體靶向至紅血球。
在一些實施例中,外源靶向序列可為編碼改變包含經修飾AAV衣殼蛋白之病毒衣殼或病毒載體之向性之肽的任一胺基酸序列。在一些實施例中,靶向肽或蛋白質可為天然存在的,或替代地為完全或部分合成的。例示性靶向序列包括結合至細胞表面受體及醣蛋白之配位體及其他肽,例如ROD肽序列、緩激肽、激素、肽生長因子(例如表皮生長因子、神經生長因子、纖維母細胞生長因子、血小板源性生長因子、胰島素樣生長因子I及II等)、細胞介素、黑色素細胞刺激激素(例如a、β或γ)、神經肽及內啡肽及諸如此類,以及其保留使細胞靶向其同源受體之能力的片段。其他說明性肽及蛋白質包括物質P、角質細胞生長因子、神經肽Y、胃泌素釋放肽、介白素2、雞蛋清溶菌酶、促紅血球生成素、促性腺素釋放素、皮質抑素、β-內啡肽、白胺腦素、強啡肽B、α-新腦啡肽、血管收縮肽、促加壓素肽(pneumadin)、血管活性腸肽、神經調壓素、腸動素及如上文所述各者之片段。作為另一替代,可將毒素(例如破傷風毒素或蛇毒素(例如α-銀環蛇毒素)及諸如此類)之結合結構域取代至衣殼蛋白中作為靶向序列。在另一代表性實施例中,AAV衣殼蛋白可藉由如Cleves (Current Biology 7:R318 (1997))所述將「非典型」輸入/輸出信號肽(例如纖維母細胞生長因子-1及-2、介白素1、HIV-1 Tat蛋白、皰疹病毒VP22蛋白及諸如此類)取代至AAV衣殼蛋白中來修飾。亦涵蓋定向經特定細胞攝取之肽模體,例如FVFLP (SEQ ID NO: 104)肽模體觸發經肝細胞攝取。
可使用噬菌體展示技術以及此項技術中已知之其他技術來鑑別識別任一所關注細胞類型的肽。靶向序列可編碼靶向細胞表面結合位點之任一肽,包括受體(例如蛋白質、碳水化合物、醣蛋白或蛋白多醣)。細胞表面結合位點之實例包括但不限於硫酸乙醯肝素、硫酸軟骨素及其他糖胺聚糖、在黏蛋白上發現之唾液酸部分、醣蛋白及神經節苷酯、MHC 1醣蛋白、在膜醣蛋白上發現之碳水化合物組分,包括甘露糖、N-乙醯基-半乳糖胺、N-乙醯基-葡糖胺、岩藻醣、半乳糖及諸如此類。表7顯示適合靶向序列之其他非限制性實例。
表 7 :說明性靶向序列
Y*為磷酸基-Tyr
序列 | SEQ ID NO | 參考文獻 |
NSVRDL(G/S) | 35 | Muller等人,Nature Biotechnology 21: 1040-1046 (2003) |
PRSVTVP | 36 | Muller等人,Nature Biotechnology 21: 1040-1046 (2003) |
NSVSSX(S/A) | 37 | Muller等人,Nature Biotechnology 21: 1040-1046 (2003) |
NGR, NGRAHA | 38 | Grifman等人,Molecular Therapy 3:964-975 (2001) |
QPEHSST | 39 | Work等人,Molecular Therapy 13:683-693 (2006) |
VNTANST | 40 | Work等人,Molecular Therapy 13:683-693 (2006) |
HGPMQS | 41 | Work等人,Molecular Therapy 13:683-693 (2006) |
PHKPPLA | 42 | Work等人,Molecular Therapy 13:683-693 (2006) |
IKNNEMW | 43 | Work等人,Molecular Therapy 13:683-693 (2006) |
RNLDTPM | 44 | Work等人,Molecular Therapy 13:683-693 (2006) |
VDSHRQS | 45 | Work等人,Molecular Therapy 13:683-693 (2006) |
YDSKTKT | 46 | Work等人,Molecular Therapy 13:683-693 (2006) |
SQLPHQK | 47 | Work等人,Molecular Therapy 13:683-693 (2006) |
STMQQNT | 48 | Work等人,Molecular Therapy 13:683-693 (2006) |
TERYMTQ | 49 | Work等人,Molecular Therapy 13:683-693 (2006) |
QPEHSST | 50 | Work等人,Molecular Therapy 13:683-693 (2006) |
DASLSTS | 51 | Work等人,Molecular Therapy 13:683-693 (2006) |
DLPNKT | 52 | Work等人,Molecular Therapy 13:683-693 (2006) |
DLTAARL | 53 | Work等人,Molecular Therapy 13:683-693 (2006) |
EPHQFNY | 54 | Work等人,Molecular Therapy 13:683-693 (2006) |
EPQSNHT | 55 | Work等人,Molecular Therapy 13:683-693 (2006) |
MSSWPSQ | 56 | Work等人,Molecular Therapy 13:683-693 (2006) |
NPKHNAT | 57 | Work等人,Molecular Therapy 13:683-693 (2006) |
PDGMRTT | 58 | Work等人,Molecular Therapy 13:683-693 (2006) |
PNNNKTT | 59 | Work等人,Molecular Therapy 13:683-693 (2006) |
QSTTHDS | 60 | Work等人,Molecular Therapy 13:683-693 (2006) |
TGSKQKQ | 61 | Work等人,Molecular Therapy 13:683-693 (2006) |
SLKHQAL | 62 | Work等人,Molecular Therapy 13:683-693 (2006) |
SPIDGEQ | 63 | Work等人,Molecular Therapy 13:683-693 (2006) |
WIFPWIQL | 64 | Hajitou等人,TCM 16:80-88 (2006) |
CDCRGDCFC | 65 | Hajitou等人,TCM 16:80-88 (2006) |
CNGRC | 66 | Hajitou等人,TCM 16:80-88 (2006) |
CPRECES | 67 | Hajitou等人,TCM 16:80-88 (2006) |
CTTHWGFTLC | 68 | Hajitou等人,TCM 16:80-88 (2006) |
CGRRAGGSC | 69 | Hajitou等人,TCM 16:80-88 (2006) |
CKGGRAKDC | 70 | Hajitou等人,TCM 16:80-88 (2006) |
CVPELGHEC | 71 | Hajitou等人,TCM 16:80-88 (2006) |
CRRETAWAK | 72 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
VSWFSHRYSPFAVS | 73 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
GYRDGYAGPILYN | 74 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
XXXY*XXX | 75 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
Y*E/MNW | 76 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
RPLPPLP | 77 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
APPLPPR | 78 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
DVFYPYPYASGS | 79 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
MYWYPY | 80 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
DITWDQLWDLMK | 81 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
CWDD(G/L)WLC | 82 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
EWCEYLGGYLRCYA | 83 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
YXCXXGPXTWXCXP | 84 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
IEGPTLRQWLAARA | 85 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
LWXX(Y/W/F/H) | 86 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
XFXXYLW | 87 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
RWGLCD | 88 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
MSRPACPPNDKYE | 89 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
CLRSGRGC | 90 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
CHWMFSPWC | 91 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
WXXF | 92 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
CSSRLDAC | 93 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
CLPVASC | 94 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
CGFECVRQCPERC | 95 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
CVALCREACGEGC | 96 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
SWCEPGWCR | 97 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
YSGWGW | 98 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
GLSGGRS | 99 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
LMLPRAD | 100 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
CSCFRDVCC | 101 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
CRDVVSVIC | 102 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
CNGRC | 103 | Koivunen等人,J. Nucl. Med. 40:883-888 (1999) |
MARSGL | 104 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
MARAKE | 105 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
MSRTMS | 106 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
KCCYSL | 107 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
MYWGDSHWLQYWYE | 108 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
MQLPLAT | 109 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
EWLS | 110 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
SNEW | 111 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
TNYL | 112 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
WIFPWIQL | 113 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
WDLAWMFRLPVG | 114 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
CTVALPGGYVRVC | 115 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
CVPELGHEC | 116 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
CGRRAGGSC | 117 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
CVAYCIEHHCWTC | 118 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
CVFAHNYDYLVC | 119 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
CVFTSNYAFC | 120 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
VHSPNKK | 121 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
CDCRGDCFC | 122 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
CRGDGWC | 123 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
XRGCDX | 124 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
PXX(S/T) | 125 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
CTTHWGFTLC | 126 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
SGKGPRQITAL | 127 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
A(A/Q)(N/A)(L/Y)(T/V/M/R)(R/K) | 128 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
VYMSPF | 129 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
MQLPLAT | 130 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
ATWLPPR | 131 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
HTMYYHHYQHHL | 132 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
SEVGCRAGPLQWLCEKYFG | 133 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
CGLLPVGRPDRNVWRWLC | 134 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
CKGQCDRFKGLPWEC | 135 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
SGRSA | 136 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
WGFP | 137 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
LWXXAr | 138 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
XFXXYLW | 139 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
AEPMPHSLNFSQYLWYT | 140 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
WAY(W/F)SP | 141 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
IELLQAR | 142 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
DITWDQLWDLMK | 143 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
AYTKCSRQWRTCMTTH | 144 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
PQNSKIPGPTFLDPH | 145 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
SMEPALPDWWWKMFK | 146 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
ANTPCGPYTHDCPVKR | 147 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
TACHQHVRMVRP | 148 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
VPWMEPAYQRFL | 149 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
DPRATPGS | 150 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
FRPNRAQDYNTN | 151 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
CTKNSYLMC | 152 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
C(R/Q)L/RT(G/N)XXG(A/V)GC | 153 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
CPIEDRPMC | 154 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
HEWSYLAPYPWF | 155 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
MCPKHPLGC | 156 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
RMWPSSTVNLSAGRR | 157 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
SAKTAVSQRVWLPSHRGGEP | 158 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
KSREHVNNSACPSKRITAAL | 159 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
EGFR | 160 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
AGLGVR | 161 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
GTRQGHTMRLGVSDG | 162 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
IAGLATPGWSHWLAL | 163 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
SMSIARL | 164 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
HTFEPGV | 165 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
NTSLKRISNKR1RRK | 166 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
LRIKRKRRKRKKTRK | 167 | Newton及Deutscher, Phage Peptide Display in Handbook of Experimental Pharmacology,第145-163頁,Springer-Verlag, Berlin (2008) |
在一些實施例中,靶向序列可為可用於化學偶合(例如,可包含可經由其R基團化學偶合之精胺酸及/或離胺酸殘基)至靶向進入細胞中之另一分子的肽。在一些實施例中,本發明之AAV衣殼蛋白或病毒衣殼可包含如WO 2006/066066中所述之突變。舉例而言,衣殼蛋白可包含天然AAV2衣殼蛋白之胺基酸位置263、705、708及/或716之選擇性胺基酸取代或另一AAV血清型之衣殼蛋白中之對應變化。
另外或替代地,在一些實施例中,衣殼蛋白、病毒衣殼或載體包含緊跟在AAV2衣殼蛋白之胺基酸位置264之後的選擇性胺基酸插入或其他AAV之衣殼蛋白中之對應變化。「緊跟在胺基酸位置X之後」欲指插入緊跟在所指示胺基酸位置之後(例如「在胺基酸位置264之後」指示位置265處之點插入或例如位置265至268之較大插入等)。此外,在一些實施例中,本發明之衣殼蛋白、病毒衣殼或載體可包含例如PCT公開案第WO 2010/093784號(例如2i8)及/或PCT公開案第WO 2014/144229號(例如雙重聚醣)中所述之胺基酸修飾。
異源分子定義為並非天然存在於AAV感染中之彼等分子,例如不由野生型AAV基因體編碼之彼等分子。此外,治療上有用之分子可與嵌合病毒衣殼之外部締合以將分子轉移至宿主靶細胞中。此類締合之分子可包括DNA、RNA、有機小分子、金屬、碳水化合物、脂質及/或多肽。在本發明之一個實施例中,治療上有用之分子共價連接(亦即結合或化學偶合)至衣殼蛋白。共價連接分子之方法為熟習此項技術者已知。
c. 經修飾病毒載體
本發明提供包含本發明之衣殼蛋白變異體及衣殼的病毒載體。在一些實施例中,病毒載體為細小病毒載體(例如包含細小病毒衣殼及/或載體基因體),例如AAV載體(例如包含AAV衣殼及/或載體基因體)。在一些實施例中,病毒載體含有包含本發明之經修飾衣殼蛋白的經修飾AAV衣殼及載體基因體。
例如,在一些實施例中,病毒載體包含:(a)病毒衣殼(例如AAV衣殼),其包含本發明之衣殼蛋白變異體;及(b)核酸,其包含末端重複序列(例如AAV TR),其中包含末端重複序列之核酸經病毒衣殼包裹。核酸可視情況包含兩個末端重複序列(例如兩個AAV TR)。在代表性實施例中,病毒載體為包含編碼所關注多肽或功能性RNA之異源核酸的重組病毒載體。
AAV通常不以高水準轉導T細胞。相比之下,在一些實施例中,與野生型病毒載體或無衣殼蛋白變異體之病毒載體的轉導水準相比,本發明之病毒載體表現出增強的一或多種細胞類型(例如T細胞)及/或組織之轉導。在一些實施例中,與野生型或天然AAV病毒載體相比,AAV病毒載體具有增加的細胞轉導。在一些實施例中,與野生型或天然AAV病毒載體或不包含本文揭示之任一種衣殼蛋白變異體的AAV病毒載體相比,AAV病毒載體在一或多種細胞類型(例如T細胞)中具有增加的轉導。在一些實施例中,AAV病毒載體可具有增加的向造血幹細胞中之轉導。在一些實施例中,AAV病毒載體可在單核球、嗜鹼性球、嗜酸性球、嗜中性球、樹突狀細胞、巨噬細胞、B細胞、T細胞及/或自然殺手細胞中具有增加的轉導。在一些實施例中,AAV病毒載體可在衛星細胞、間充質乾細胞及/或基底細胞中具有增加的轉導。在一些實施例中,AAV病毒載體可在肺上皮細胞、肝細胞及/或骨骼肌細胞中具有增加的轉導。
AAV之已知受體及輔助受體包括硫酸乙酰肝素蛋白聚醣、整合素、O-連接之唾液酸、N-連接之唾液酸、AAV受體(AAVR、KIAA0319L)、肝細胞生長因子受體(c-Met)、CD9、FGFR-1、37/67-kDa層連結蛋白受體及血小板衍生生長因子受體。在實施例中,本發明之AAV病毒載體對此等受體及/或輔助受體中之一或多者具有增加的親和力。例如,在一些實施例中,與野生型或天然AAV病毒載體相比,AAV病毒載體具有增加的肝素及/或硫酸乙酰肝素結合。在一些實施例中,與野生型或天然AAV病毒載體相比,AAV病毒載體具有增加的唾液酸結合。在一些實施例中,與野生型或天然AAV病毒載體相比,AAV病毒載體具有增加的整合素結合。在一些實施例中,與野生型或天然AAV病毒載體相比,AAV病毒載體與包含α次單元及β次單元之整合素的結合增加。整合素可為例如α4β7、α4β1、α1β1、α2β1、αEβ7、αLβ2、α5β1、α5β6、α5β5、α5β8、α5β8、α3β1、α5β1、α11β1、α5β3、α11β3、αVβ3、αVβ5、αVβ6、αVβ8。
本發明亦提供編碼本發明之一或多種衣殼蛋白變異體(例如AAV衣殼蛋白變異體)或一或多種包含衣殼蛋白變異體之衣殼(例如AAV衣殼)的核苷酸序列或包含其之表現載體。在一些實施例中,核酸編碼具有SEQ ID NO: 2、4、6、8、10、12及14中任一者之序列的重組AAV衣殼蛋白。在一些實施例中,核酸包含選自由SEQ ID NO: 3、5、7、9、11、13及15組成之群的序列。核苷酸序列可為DNA序列或RNA序列。表現載體不受限制,且可為病毒載體(例如腺病毒、AAV、皰疹病毒、牛痘、痘病毒、桿狀病毒及諸如此類),或非病毒載體,諸如質體、噬菌體、YAC、BAC及諸如此類。本發明亦提供包含本發明之一或多個核苷酸序列或表現載體的細胞。細胞可為
活體外的、
離體的或
活體內的。
產生病毒載體之方法
本發明進一步提供產生本文揭示之病毒載體的方法。因此,在一些實施例中,本發明提供一種產生具有增加的細胞轉導(例如,增加的向T細胞之轉導)之AAV載體之方法,其包含:a)鑑別AAV衣殼蛋白上之表面暴露殘基;b)生成包含(a)中鑑別之表面暴露胺基酸殘基之胺基酸取代的AAV衣殼蛋白文庫;c)自(b)之AAV衣殼蛋白文庫中產生包含衣殼蛋白之AAV粒子;d)在可發生感染及復制之條件下使(c)之AAV粒子與細胞接觸;e)選擇能夠完成至少一個感染週期且複製至效價與對照AAV粒子相似或比其更高的AAV粒子。在一些實施例中,步驟(d)及(e)重複超過一次,例如2、3、4、5、6、7、8、9或10次。用於鑑別表面暴露殘基之方法的非限制性實例包括低溫電子顯微術。亦參見AAV2 (Xie等人(2002) Proc. Nat. Acad. Sci. 99: 10405-10)、AAV9 (DiMattia等人(2012) J. Virol. 86:6947-6958)、AAV8 (Nam等人(2007) J. Virol. 81: 12260-12271)、AAV6 (Ng等人(2010) J. Virol. 84:12945-12957)、AAV5 (Govindasamy等人(2013) J. Virol. 87, 11187-11199)、AAV4 (Govindasamy等人(2006) J. Virol. 80:11556-11570)、AAV3B (Lerch等人(2010) Virology 403:26-36)、BPV (Kailasan等人(2015) J. Virol. 89:2603-2614)及CPV (Xie等人(1996) J. Mol. Biol. 6:497-520及Tsao等人(1991) Science 251:1456-64)之晶體結構的描述。
解析及識別表面暴露殘基允許經由隨機、合理及/或簡併突變誘發對其進行後續修飾以產生AAV衣殼,該等衣殼可經由進一步選擇及/或篩選來鑑別。因此,在另一實施例中,本發明提供一種產生具有增加的細胞轉導(例如,增加的向T細胞之轉導)之AAV載體之方法,其包含:a)鑑別AAV衣殼蛋白上之表面暴露胺基酸殘基;b)藉由隨機、合理及/或簡併突變誘發生成包含(a)中鑑別之表面暴露胺基酸殘基之胺基酸取代的AAV衣殼蛋白;c)自(b)之AAV衣殼蛋白中產生包含衣殼蛋白之AAV粒子;d)在可發生感染及復制之條件下使(c)之AAV粒子與細胞接觸;及e)選擇能夠完成至少一個感染週期且複製至效價與對照AAV粒子相似或比其更高的AAV粒子。
藉由隨機、合理及/或簡併突變誘發生成包含表面暴露胺基酸殘基之胺基酸取代之AAV衣殼蛋白的方法為此項技術中已知的。此綜合方法提供一種可應用於修改任何AAV衣殼的平台技術。此平台技術之應用產生衍生自原始AAV衣殼模板之AAV變異體,該等變異體具有增強的轉導效率。作為一個優點及益處,此技術之應用將擴大有資格接受AAV載體基因療法之患者群組。
在一些實施例中,本發明提供一種產生病毒載體之方法,該方法包含向細胞提供:(a)包含至少一個TR序列(例如AAV TR序列)之核酸模板,及(b)足以使核酸模板複製並殼體化成AAV衣殼的AAV序列(例如編碼本發明之AAV衣殼的AAV rep序列及AAV cap序列)。視情況,核酸模板進一步包含至少一個異源核酸序列。在一些實施例中,核酸模板包含兩個AAV ITR序列,該兩個AAV ITR序列位於異源核酸序列(若存在)之5'及3',但無需與其直接鄰接。
核酸模板以及AAV rep及cap序列係在使得在細胞中產生包含包裝在AAV衣殼內之核酸模板之病毒載體的條件下提供。該方法可進一步包含自細胞收集病毒載體之步驟。病毒載體可自培養基及/或藉由溶解細胞來收集。細胞可為容許AAV病毒複製之細胞。可採用此項技術中已知之任一適宜細胞。在一些實施例中,細胞為哺乳動物細胞。在一些實施例中,細胞可為提供自複製缺陷型輔助病毒缺失之功能之反式互補包裝細胞株,例如293細胞或其他E1a反式互補細胞。
AAV複製及衣殼序列可藉由此項技術中已知之任一方法提供。目前方案通常在單一質體上表現AAV rep/cap基因。AAV複製及包裝序列無需一起提供,但其一起提供可能較為方便。AAV rep及/或cap序列可由任一病毒或非病毒載體提供。舉例而言,rep/cap序列可由雜合腺病毒或皰疹病毒載體提供(例如插入缺失腺病毒載體之E1a或E3區中)。亦可採用EBV載體來表現AAV cap及rep基因。此方法之一個優點在於,EBV載體為游離型,但在整個連續細胞分裂中將維持高複本數(亦即作為染色體外元件穩定整合至細胞中,稱為「基於EBV之核游離基因體」,參見Margolski, (1992) Curr. Top. Microbiol. Immun. 158:67)。作為另一替代方案,rep/cap序列可穩定併入細胞中。通常,AAV rep/cap序列將不由TR側接,以防止該等序列之拯救及/或包裝。
核酸模板可使用此項技術中已知之任一方法提供至細胞。舉例而言,模板可由非病毒(例如質體)或病毒載體供應。在一些實施例中,核酸模板係由皰疹病毒或腺病毒載體提供(例如插入缺失腺病毒之E1a或E3區中)。作為另一說明,Palombo等人(1998) J. Virology 72:5025闡述攜帶側接有AAV TR之報導基因之杆狀病毒載體。亦可採用EBV載體來遞送模板,如上文關於rep/cap基因所述。
在一些實施例中,核酸模板係藉由使rAAV病毒複製來提供。在一些實施例中,包含核酸模板之AAV原病毒穩定整合至細胞染色體中。為提高病毒效價,可向細胞提供促進生產性AAV感染之輔助病毒功能(例如腺病毒或皰疹病毒)。AAV複製所必需之輔助病毒序列為此項技術中已知。通常,該等序列將由輔助腺病毒或皰疹病毒載體提供。替代地,腺病毒或皰疹病毒序列可由另一非病毒或病毒載體提供,例如作為攜帶促進有效AAV產生之所有輔助基因之非感染性腺病毒微質體,如Ferrari等人(1997) Nature Med. 3: 1295及美國專利第6,040,183號及第6,093,570號所述。
此外,輔助病毒功能可由包裝細胞提供,其中輔助序列包埋於染色體中或維持為穩定的染色體外元件。通常,輔助病毒序列無法包裝至AAV病毒粒子中,例如不由TR側接。熟習此項技術者應理解,可能有利的是
在單一輔助構築體上提供AAV複製及衣殼序列以及輔助病毒序列(例如腺病毒序列)。此輔助構築體可為非病毒或病毒構築體。作為一個非限制性說明,輔助構築體可為包含AAV rep/cap基因之雜合腺病毒或雜合皰疹病毒。在一些實施例中,AAV rep/cap序列及腺病毒輔助序列係由單一腺病毒輔助載體供應。此載體另外可進一步包含核酸模板。AAV rep/cap序列及/或rAAV模板可插入腺病毒之缺失區域(例如E1a或E3區)中。
在一些實施例中,AAV rep/cap序列及腺病毒輔助序列係由單一腺病毒輔助載體供應。rAAV模板可例如作為質體模板提供。在一些實施例中,AAV rep/cap序列及腺病毒輔助序列係由單一腺病毒輔助載體提供,且rAAV模板整合至細胞中作為原病毒。替代地,rAAV模板係由作為染色體外元件(例如作為基於EBV之核游離基因體)維持在細胞內之EBV載體提供。
在一些實施例中,AAV rep/cap序列及腺病毒輔助序列係由單一輔助腺病毒提供。rAAV模板可提供為單獨的複製病毒載體。舉例而言,rAAV模板可由rAAV粒子或第二重組腺病毒粒子提供。根據前述方法,雜合腺病毒載體通常包含足以進行腺病毒複製及包裝之腺病毒5'及3’順式序列(亦即腺病毒末端重複序列及PAC序列)。AAV rep/cap序列及(若存在) rAAV模板包埋於腺病毒主鏈中且側接有5'及3’順式序列,以使得此等序列可包裝至腺病毒衣殼中。如上文所述,腺病毒輔助序列及AAV rep/cap序列通常不由TR側接,以使得此等序列不包裝至AAV病毒粒子中。Zhang等人((2001) Gene Ther. 18:704-12)描述包含腺病毒以及AAV rep及cap基因之嵌合輔助病毒。
皰疹病毒亦可用作AAV包裝方法中之輔助病毒。編碼AAV Rep蛋白之雜合皰疹病毒可有利地促進可縮放的AAV載體產生方案。已描述表現AAV-2 rep及cap基因之I型雜合單純皰疹病毒(HSV-1)載體(Conway等人(1999) Gene Therapy 6:986及WO 00/17377)。在一些實施例中,本發明之病毒載體可在使用杆狀病毒載體遞送rep/cap基因及rAAV模板之昆蟲細胞中產生,如例如Urabe等人(2002) Human Gene Therapy 13: 1935-43所述。
可藉由此項技術中已知之任一方法獲得不含污染之輔助病毒之AAV載體儲備液。舉例而言,AAV及輔助病毒可容易地基於大小來區分。AAV亦可基於對乙酰肝素受質之親和力自輔助病毒分離(Zolotukhin等人(1999) Gene Therapy 6:973)。可使用缺失的複製缺陷型輔助病毒,以使得任一污染之輔助病毒不能複製。在一些實施例中,可採用缺少晚期基因表現之輔助腺病毒,因為僅需要腺病毒早期基因表現來介導AAV病毒之包裝。晚期基因表現有缺陷之腺病毒突變體為此項技術中已知的(例如ts100K及ts149腺病毒突變體)。
重組病毒載體
本發明提供包含本文揭示之至少一種衣殼蛋白(例如AAV衣殼蛋白)或至少一種衣殼(例如AAV衣殼)的重組病毒載體(例如重組AAV載體),其中衣殼蛋白包含一或多種本文揭示之轉導相關肽。在一些實施例中,與野生型AAV載體或不包含轉導相關肽之AAV載體相比,AAV載體展現增加的向細胞,諸如T細胞中之轉導。在一些實施例中,與野生型AAV載體或不包含轉導相關肽之AAV載體相比,AAV載體展現增加的向T細胞之細胞核中之轉導。在一些實施例中,與野生型AAV載體或不包含轉導相關肽之AAV載體相比,AAV載體展現增加的向T細胞之胞質液中之轉導。
本發明之重組病毒載體可用於將核酸
活體外、
離體及
活體內遞送至細胞。可由經修飾病毒衣殼包裝且轉移至細胞中之分子包括異源DNA、RNA、多肽、有機小分子、金屬或其組合。具體而言,病毒載體可有利地用於將核酸遞送或轉移至動物細胞,包括哺乳動物細胞。因此,在一些實施例中,核酸(「貨物核酸」)可經本發明之衣殼蛋白包裹。在本發明之病毒載體中遞送之貨物核酸序列可為任何所關注異源核酸序列。
在一些實施例中,與由野生型AAV載體(諸如AAV6載體)或不包含本文揭示之轉導相關肽之AAV載體遞送之異源核酸的表現相比,由本文揭示之AAV載體遞送之異源核酸的表現增加。在一些實施例中,與由野生型AAV載體(諸如AAV6載體)或不包含本文揭示之轉導相關肽之AAV載體遞送之異源核酸的表現相比,由本文揭示之AAV載體遞送之異源核酸的表現增加至少約1.5倍,例如約2倍、2.5倍、3倍、3.5倍、4倍、4.5倍、5倍、6倍、7倍、8倍、9倍或10倍,包括位於其間之所有值及子範圍。在一些實施例中,與由野生型AAV載體(諸如AAV6載體)或不包含本文揭示之轉導相關肽之AAV載體遞送之異源核酸的表現相比,由本文揭示之AAV載體遞送之異源核酸的表現增加至少約10%,例如約20%、約30%、約40%、約50%、約60%、約70%、約80%、約90%或約100%,包括位於其間之所有值及子範圍。
所關注核酸包括編碼多肽,包括治療性(例如用於醫學或獸醫用途)或免疫原性(例如用於疫苗)多肽或RNA之核酸。在一些實施例中,貨物核酸編碼治療性蛋白質或治療性RNA。
治療性多肽可包括但不限於嵌合抗原受體(CAR)、ABCD1、β球蛋白(HBB)、血紅蛋白A、血紅蛋白F、囊腫纖維化跨膜調節蛋白(CFTR)、肌肉萎縮蛋白(包括微小肌肉萎縮蛋白及微肌肉萎縮蛋白,例如參見Vincent等人(1993) Nature Genetics 5: 130;美國專利公開案第2003/017131號;國際公開案WO/2008/088895;Wang等人,Proc. Natl. Acad. Sci. USA 97: 1 3714-13719 (2000);及Gregorevic等人,Mol. Ther. 16:657-64 (2008))、肌肉生長抑制素前肽、濾泡抑素、11型活化素可溶性受體、IGF-1、抗發炎多肽(例如IκB顯性突變體)、肌長蛋白、肌營養相關蛋白(Tinsley等人(1996) Nature 384:349)、微小肌營養相關蛋白、凝血因子(例如VIII因子、IX因子、X因子等)、促紅血球生成素、血管抑素、內皮抑素、過氧化氫酶、酪胺酸羥化酶、超氧化物歧化酶、瘦素、LDL受體、脂蛋白脂酶、鳥胺酸胺甲醯基轉移酶、β-球蛋白、a-球蛋白、血影蛋白、α-1-抗胰蛋白酶、腺苷去胺酶、次黃嘌呤鳥嘌呤磷酸核糖基轉移酶、β-葡萄糖腦苷脂酶、鞘磷脂酶、溶酶體己糖胺酶A、具支鏈酮酸去氫酶、RP65蛋白、細胞介素(例如α-干擾素、β-干擾素、γ-干擾素、介白素-2、介白素-4、顆粒球-巨噬細胞群落刺激因子、淋巴毒素及諸如此類)、肽生長因子、神經營養因子及激素(例如生長激素、胰島素、胰島素樣生長因子1及2、血小板源性生長因子、表皮生長因子、纖維母細胞生長因子、神經生長因子、神經營養因子-3及-4、腦源神經營養因子、骨形態發生蛋白[包括RANKL及VEGF]、神經膠質源性生長因子、轉化生長因子-α及-β及諸如此類)、溶酶體酸性α-葡萄糖苷酶、α-半乳糖苷酶A、受體(例如腫瘤壞死生長因子可溶性受體)、S100A1、小清蛋白、6型腺苷酸環化酶、調節鈣處置之分子(例如SERCA
2A、PP1及其片段之抑制劑1 [例如WO 2006/029319及WO 2007/100465])、實現2型G-蛋白偶聯受體激酶敲低之分子(例如截短的組成型活性bARKct)、抗發炎因子(例如IRAP)、抗肌肉生長抑制素蛋白、天冬醯轉移酶、單株抗體(包括單鏈單株抗體;例示性Mab係Herceptin
®Mab)、神經肽及其片段(例如加蘭肽(galanin)、神經肽Y (參見U.S. 7,071,172))、血管生成抑制劑,諸如血管抑制素及其他VEGF抑制劑(例如血管抑制素2 [參見WO JP2006/073052])。其他說明性異源核酸序列編碼自殺基因產物(例如胸苷激酶、胞嘧啶去胺酶、白喉毒素及腫瘤壞死因子)、增強或抑制宿主因子轉錄之蛋白質(例如連接至轉錄增強子或抑制元件之核酸酶死亡Cas9、連接至轉錄增強子或抑制元件之鋅指蛋白、連接至轉錄增強子或抑制元件之轉錄活化劑樣(TAL)效應物)、賦予癌症療法中所用之藥物抗性之蛋白質、腫瘤抑制基因產物(例如p53、Rb、Wt-1)、TRAIL、FAS配位體及在有需要之個體中具有治療效應之任何其他多肽。AAV載體亦可用於遞送單株抗體及抗體片段,例如針對肌肉生長抑制素之抗體或抗體片段(例如,參見Fang等人,Nature Biotechnology 23:584-590 (2005))。編碼多肽之異源核酸序列包括編碼報導基因多肽(例如酶)之彼等核酸序列。報導基因多肽為此項技術中已知且包括但不限於綠色螢光蛋白、β-半乳糖苷酶、鹼性磷酸酶、螢光素酶及氯黴素乙醯基轉移酶基因。視情況,異源核酸編碼分泌多肽(例如在其天然狀態下為分泌多肽或已工程化為分泌(例如藉由與如此項技術中已知之分泌信號序列可操作締合)的多肽)。
替代地,在本發明之一些實施例中,異源核酸可編碼反義核酸、核酶(例如如美國專利第5,877,022號中所述)、實現剪接體介導之剪接/隨機剪接之RNA (參見Puttaraju等人(1999) Nature Biotech. 17:246;美國專利第6,013,487號;美國專利第6,083,702號)、干擾RNA (RNAi,包括調介基因沉默之siRNA、shRNA或miRNA) (參見Sharp等人(2000) Science 287:2431)及其他非轉譯RNA,例如「嚮導」RNA (Gorman等人(1998) Proc. Nat. Acad. Sci. USA 95 :4929;授予Yuan等人之美國專利第5,869,248號)及諸如此類。例示性非轉譯RNA包括針對多重抗藥性(MDR)基因產物之RNAi (例如用於治療及/或預防腫瘤及/或投與心臟以預防化學療法之損傷)、針對肌肉生長抑制素之RNAi (例如用於杜興氏肌營養不良症(Duchenne muscular dystrophy))、針對VEGF之RNAi (例如用於治療及/或預防腫瘤)、針對受磷蛋白之RNAi (例如用於治療心血管疾病,例如參見Andino等人,J. Gene Med. 10: 132-142 (2008)及Li等人,Acta Pharmacol Sin. 26:51-55 (2005));受磷蛋白抑制性或顯性-陰性分子,例如受磷蛋白S 16E (例如用於治療心血管疾病,例如參見Hoshijima等人,Nat. Med. 8:864-871 (2002))、針對腺苷激酶之RNAi (例如用於癲癇)及針對病原體生物體及病毒之RNAi (例如B型及/或C型肝炎病毒、人類免疫缺失病毒、CMV、單純皰疹病毒、人類乳頭瘤病毒等)。
此外,可遞送指導選擇式剪接之核酸序列。為進行說明,與肌肉萎縮蛋白外顯子51之5'及/或3'剪接位點互補之反義序列(或其他抑制序列)可與U1或U7小核(sn) RNA啟動子結合遞送以誘導此外顯子之跳躍。舉例而言,包含位於反義/抑制性序列之5’之U1或U7 snRNA啟動子之DNA序列可在本發明之經修飾衣殼中包裝且遞送。
在一些實施例中,可遞送指導基因編輯之核酸序列。例如,核酸可編碼基因編輯分子,諸如嚮導RNA或核酸酶。在一些實施例中,核酸可編碼鋅指核酸酶、歸巢核酸內切酶、TALEN (轉錄活化劑樣效應物核酸酶)、NgAgo (農桿菌核酸內切酶)、SGN (結構導向核酸內切酶)或RGN (RNA導向核酸酶,諸如Cas9核酸酶或Cpf1核酸酶)。
病毒載體亦可包含與宿主染色體上之基因座共用同源性且與其重組之異源核酸。此方法可用於例如校正宿主細胞中之遺傳缺陷。
本發明亦提供表現例如用於疫苗接種之免疫原性多肽的病毒載體。核酸可編碼此項技術中已知之任何所關注免疫原,包括但不限於人類免疫缺失病毒(HIV)、猿猴免疫缺失病毒(SIV)、流行性感冒病毒、HIV或SIV gag蛋白、腫瘤抗原、癌症抗原、細菌抗原、病毒抗原及諸如此類之免疫原。
使用細小病毒作為疫苗載體為此項技術中已知(例如,參見Miyamura等人(1994) Proc. Nat. Acad. Sci USA 91:8507;授予Young等人之美國專利第5,916,563號、授予Mazzara等人之美國專利第5,905,040號、美國專利第5,882,652號、授予Samulski等人之美國專利第5,863,541號)。抗原可呈遞於細小病毒衣殼中。替代地,抗原可自引入重組載體基因體中之異源核酸表現。如本文所述及/或如此項技術中已知之任一所關注免疫原可由本發明之病毒載體提供。
免疫原性多肽可為適於引發免疫反應及/或保護個體免受感染及/或疾病之任一多肽,該感染及/或疾病包括但不限於微生物、細菌、原生動物、寄生蟲、真菌及/或病毒感染及疾病。舉例而言,免疫原性多肽可為正黏液病毒免疫原(例如流行性感冒病毒免疫原,例如流行性感冒病毒血球凝集素(HA)表面蛋白或流行性感冒病毒核蛋白或馬流行性感冒病毒免疫原)或慢病毒免疫原(例如馬感染性貧血病毒免疫原、猿猴免疫缺失病毒(SIV)免疫原或人類免疫缺失病毒(HIV)免疫原,例如HIV或SIV包膜GP 160蛋白、HIV或SIV基質/衣殼蛋白及HIV或SIV gag、pol及env基因產物)。免疫原性多肽亦可為沙狀病毒免疫原(例如拉沙熱病毒(Lassa fever virus)免疫原,例如拉沙熱病毒核衣殼蛋白及拉沙熱包膜醣蛋白)、痘病毒免疫原(例如牛痘病毒免疫原,例如牛痘LI或L8基因產物)、黃病毒免疫原(例如黃熱病毒免疫原或日本腦炎病毒(Japanese encephalitis virus)免疫原)、絲狀病毒免疫原(例如伊波拉病毒(Ebola virus)免疫原或馬堡病毒(Marburg virus)免疫原,例如NP及GP基因產物)、崩芽病毒免疫原(例如RVFV、CCHF及/或SFS病毒免疫原)或冠狀病毒免疫原(例如感染性人類冠狀病毒免疫原,例如人類冠狀病毒包膜醣蛋白或豬可傳播性胃腸炎病毒免疫原或禽感染性支氣管炎病毒免疫原)。免疫原性多肽另外可為脊髓灰質炎免疫原、皰疹免疫原(例如CMV、EBV、HSV免疫原)、腮腺炎免疫原、麻疹免疫原、風疹免疫原、白喉毒素或其他白喉免疫原、百日咳抗原、肝炎(例如A型肝炎、B型肝炎、C型肝炎等)免疫原及/或此項技術中現在已知或後來鑑別為免疫原之任何其他疫苗免疫原。
替代地,免疫原性多肽可為任何腫瘤或癌細胞抗原。視情況,腫瘤或癌症抗原係在癌細胞之表面上表現。
例示性癌細胞及腫瘤細胞抗原闡述於S.A. Rosenberg (Immunity 10:281 (1991))中。其他說明性癌症及腫瘤抗原包括但不限於:BRCA1基因產物、BRCA2基因產物、gp100、酪胺酸酶、GAGE- 1/2、BAGE、RAGE、LAGE、NY-ESO-1、CDK-4、β-連環蛋白、MUM-1、半胱天冬酶-8、ΚΙΑA0205、HPVE、SART-1、FRAME、p15、黑色素瘤腫瘤抗原(Kawakami等人(1994) Proc. Natl. Acad. Sci. USA 91:3515;Kawakami等人(1994) J. Exp. Med., 180:347;Kawakami等人(1994) Cancer Res. 54:3124)、MART-1、gp100、MAGE-1、MAGE-2、MAGE-3、CEA、TRP-1、TRP-2、P-15、酪胺酸酶(Brichard等人(1993) J Exp. Med. 178:489);HER-2/neu基因產物(美國專利第4.968.603號)、CA 125、LK26、FB5 (內皮唾液酸蛋白)、TAG 72、AFP、CA 19-9、NSE、DU-PAN-2、CA50、SPan-1、CA72-4、HCG、STN (唾液醯基Tn抗原)、c-erbB-2蛋白、PSA、L-CanAg、雌激素受體、牛乳球蛋白、p53腫瘤抑制蛋白(Levine, (1993) Ann. Rev. Biochem. 62:623);黏蛋白抗原(國際專利公開案第WO 90/05142號);端粒酶;核基質蛋白;前列腺酸磷酸酶;乳頭瘤病毒抗原;及/或現在已知或後來發現之與以下癌症相關之抗原:黑色素瘤、腺癌、胸腺瘤、淋巴瘤(例如非霍奇金氏淋巴瘤(non-Hodgkin's lymphoma)、霍奇金氏淋巴瘤)、肉瘤、肺癌、肝癌、結腸癌、白血病、子宮癌、乳癌、前列腺癌、卵巢癌、子宮頸癌、膀胱癌、腎癌、胰臟癌、腦癌及現在已知或後來鑑別之任何其他癌症或惡性病況或其轉移(例如,參見Rosenberg (1996) Ann. Rev. Med. 47:481-91)。
熟習此項技術者應理解,所關注異源核酸可與適當控制序列可操作地締合。舉例而言,異源核酸可與表現控制元件(例如轉錄/轉譯控制信號、複製起點、多聚腺苷酸化信號、內部核糖體進入位點(IRES)、啟動子及/或增強子及諸如此類)可操作地締合。
此外,調控相關異源核酸之表現可在轉錄後水準上、例如藉由根據選擇性阻斷特定位點之剪接活性(例如如WO 2006/119137中所述)之寡核苷酸、小分子及/或其他化合物之存在或不存在調控不同內含子之選擇性剪接來達成。
熟習此項技術者應了解,根據期望水準及組織特異性表現,可使用多種啟動子/增強子元件。啟動子/增強子可為組成型或誘導型,此視期望的表現模式而定。啟動子/增強子可為天然(native)或外源的且可為天然(natural)或合成序列。外源欲指在其中引入轉錄起始區之野生型宿主中未發現轉錄起始區。
在一些實施例中,啟動子/增強子元件對於靶細胞或欲治療之個體可為天然的。在代表性實施例中,啟動子/增強子元件對於異源核酸序列可為天然的。啟動子/增強子元件通常經選擇以使得其在所關注靶細胞中起作用。此外,在一些實施例中,啟動子/增強子元件為哺乳動物啟動子/增強子元件。啟動子/增強子元件可為組成型或誘導型。
誘導型表現控制元件通常在期望對異源核酸序列之表現提供調控之彼等應用中係有利的。用於基因遞送之誘導型啟動子/增強子元件可為組織特異性或較佳啟動子/增強子元件,且包括肌肉特異性或較佳啟動子/增強子元件(包括心肌、骨骼肌及/或平滑肌特異性或較佳啟動子/增強子元件)、神經組織特異性或較佳啟動子/增強子元件(包括腦特異性或較佳啟動子/增強子元件)、眼特異性或較佳啟動子/增強子元件(包括視網膜特異性啟動子/增強子元件及角膜特異性啟動子/增強子元件)、肝特異性或較佳啟動子/增強子元件、骨髓特異性或較佳啟動子/增強子元件、胰臟特異性或較佳啟動子/增強子元件、脾特異性或較佳啟動子/增強子元件及肺特異性或較佳啟動子/增強子元件。在一些實施例中,誘導型表現控制元件,諸如啟動子及/或增強子促進T細胞中之選擇性表現。其他誘導型啟動子/增強子元件包括激素誘導型元件及金屬誘導型元件。例示性誘導型啟動子/增強子元件包括但不限於Tet開/關元件、RU486誘導型啟動子、蛻皮激素誘導型啟動子、雷帕黴素(rapamycin)誘導型啟動子及金屬硫蛋白啟動子。
在實施例(其中異源核酸序列在靶細胞中轉錄且然後轉譯)中,通常包括特異性起始信號以有效轉譯插入之蛋白質編碼序列。可包括ATG起始密碼子及相鄰序列之此等外源轉譯控制序列可具有多種來源,天然及合成的。
醫藥組合物及使用方法
本發明亦提供包含本文揭示之AAV衣殼蛋白、AAV衣殼、病毒載體、核酸、表現載體及/或細胞中之至少一者的組合物。在一些實施例中,組合物進一步包含醫藥學上可接受之載劑。在一些實施例中,提供醫藥組合物,其包含醫藥學上可接受之載劑中之本發明之病毒載體及/或衣殼及/或衣殼蛋白及/或病毒粒子,以及視情況選用之其他藥劑、醫藥劑、穩定劑、緩衝劑、載劑、佐劑、稀釋劑等。對於注射,載劑通常將為液體。對於其他投與模式,載劑可為固體或液體。對於吸入投與,載劑將為可吸入的,且視情況可呈固體或液體微粒形式。「醫藥學上可接受之」意謂無毒或另外非所需之材料,亦即該材料可向個體投與而不會引起任何非所需生物效應。
本發明之病毒載體提供將異源核酸遞送至寬範圍之細胞(包括分裂細胞及非分裂細胞)中之手段。在一些實施例中,細胞為T細胞。病毒載體可用於將所關注核酸遞送至
活體外細胞,例如以產生
活體外多肽或用於
離體基因療法。病毒載體另外可用於將核酸遞送至有需要之個體,例如以表現免疫原性或治療性多肽或功能性RNA之方法中。以此方式,多肽或功能性RNA可在個體之
活體內產生。個體可能需要多肽之原因在於,個體缺乏該多肽。此外,該方法可實踐之原因在於,個體中多肽或功能性RNA之產生可賦予一些有益效應。在一些實施例中,該等方法包含在
活體外、
離體或
活體內細胞中表現多肽或RNA,且視情況自細胞中分離多肽或RNA。病毒載體亦可用於在經培養細胞或個體中產生所關注多肽或功能性RNA (例如使用個體作為生物反應器以產生多肽或觀察功能性RNA對個體之效應,例如結合篩選方法)。
本發明提供向個體投與本發明之病毒載體、病毒粒子及/或組合物中之任一者的方法。因此,本發明提供治療有需要之個體的方法,其包含向該個體投與有效量的本文揭示之任一種病毒載體(例如AAV載體)、任一種病毒粒子(例如AAV粒子)及/或任一種組合物。因此,本發明提供本文揭示之任一種病毒載體(例如AAV載體)、任一種病毒粒子(例如AAV粒子)及/或任一種組合物,其用作藥劑,及/或用於治療有需要之個體的方法。
在一些實施例中,本發明之病毒載體可用於遞送編碼多肽或功能性RNA之異源核酸以治療及/或預防遞送治療性多肽或功能性RNA有益之任何疾病狀態。在一些實施例中,疾病狀態與T細胞功能障礙或T細胞增加相關、有關或由其引起。在一些實施例中,疾病狀態包括但不限於:囊腫纖維化(囊腫纖維化跨膜調控蛋白)及其他肺病、血友病A (VIII因子)、血友病B (IX因子)、地中海貧血(β-球蛋白)、貧血(促紅血球生成素)及其他血液病症。阿茲海默氏病(Alzheimer's disease,GDF;腦啡肽酶)、多發性硬化(β-干擾素)、帕金森氏病(Parkinson's disease,神經膠質細胞株源性神經營養因子[GDNF])、亨廷頓氏病(Huntington's disease,移除重複之RNAi)、肌肉萎縮性脊髓側索硬化症、癲癇(加蘭肽、神經營養因子)及其他神經病症、癌症(內皮抑素、血管抑素、TRAIL、FAS配位體、細胞介素(包括干擾素);RNAi (包括針對VEGF或多重抗藥性基因產物之RNAi)、mir-26a [例如用於肝細胞癌])、糖尿病(胰島素)、肌營養不良(包括杜興氏肌肉萎縮症(Duchenne),肌肉萎縮蛋白、微小肌肉萎縮蛋白、胰島素樣生長因子I、肌聚糖[例如a、β、γ]、針對肌靜力肌肉生長抑制素前肽之RNAi、濾泡抑素、II型活化素可溶性受體、抗發炎多肽(例如IκB顯性突變體)、肌長蛋白、肌營養相關蛋白、微小肌營養相關蛋白、針對肌肉萎縮蛋白基因中之剪接接合處以誘導外顯子跳躍之反義或RNAi [例如,參見WO/2003/095647]、針對U7 snRNA以誘導外顯子跳躍之反義[例如,參見WO/2006/021724]及針對肌肉生長抑制素或肌肉生長抑制素前肽之抗體或抗體片段)及貝克氏、高歇氏病(葡萄糖腦苷脂酶)、赫爾勒病(a-L-艾杜糖醛酸酶)、腺苷去胺酶缺陷(腺苷去胺酶)、肝糖貯積病(例如法佈瑞氏病[a-半乳糖苷酶]及龐貝氏病[溶酶體酸性α-葡萄糖苷酶])及其他代謝失調、先天性肺氣腫(α-1-抗胰蛋白酶)、萊-尼二氏症候群(Lesch-Nyhan Syndrome,次黃嘌呤鳥嘌呤磷酸核糖基轉移酶)、尼曼-匹克病(鞘磷脂酶)、泰-薩二氏病(Tay-Sachs disease,溶酶體己糖胺酶A)、楓糖漿尿病(Maple Syrup Urine Disease,具支鏈酮酸去氫酶)、視網膜變性疾病(以及眼及視網膜之其他疾病;例如PDGF用於黃斑變性及/或血管抑制素或VEGF之其他抑制劑或治療/預防視網膜病症(例如在I型糖尿病中)之其他血管生成抑制劑)、實體器官(例如腦)之疾病(包括帕金森氏病[GDNF]、星細胞瘤[內皮抑素、血管抑素及/或針對VEGF之RNAi]、神經膠母細胞瘤[內皮抑素、血管抑素及/或針對VEGF之RNAi])、肝、腎、心臟(包括鬱血性心臟衰竭或外周動脈疾病(PAD)) (例如藉由遞送蛋白質磷酸酶抑制劑I (I-1)及其片段(例如IlC)、serca2a、調控受磷蛋白基因之鋅指蛋白、Barkct、[32-腎上腺素受體、2-腎上腺素受體激酶(BARK)、磷酸肌醇-3激酶(PI3激酶)、S100A1、小清蛋白、6型腺苷酸環化酶、實現2型G-蛋白偶聯受體激酶敲低之分子(例如截短的組成型活性bARKct);鈣調素(calsarcin)、針對受磷蛋白之RNAi;受磷蛋白抑制性或顯性-陰性分子(例如受磷蛋白S16E)等)、關節炎(胰島素樣生長因子)、關節病症(胰島素樣生長因子1及/或2)、內膜增生(例如藉由遞送enos、inos)、改良心臟移植之存活(超氧化物歧化酶)、AIDS (可溶性CD4)、肌肉消瘦(胰島素樣生長因子I)、腎缺陷(促紅血球生成素)、貧血(促紅血球生成素)、關節炎(抗發炎因子,例如I RAP及TNFa可溶性受體)、肝炎(a-干擾素)、LDL受體缺陷(LDL受體)、高胺血症(鳥胺酸胺甲醯基轉移酶)、克拉培氏病(Krabbe's disease,半乳糖腦苷脂酶)、巴騰氏病(Batten's disease)、脊腦共濟失調(包括SCA1、SCA2及SCA3)、苯丙酮尿(苯丙胺酸羥化酶)、自體免疫疾病及諸如此類。本發明可進一步用於器官移植後以增加移植成功率及/或減少器官移植或附加療法之負面副作用(例如藉由投與免疫抑制劑或抑制性核酸以阻斷細胞介素產生)。作為另一實例,骨形態發生蛋白(包括BNP 2、7等、RANKL及/或VEGF)可與骨同種異體移植物一起投與,例如在癌症患者骨折或手術移除之後。
在一些實施例中,本發明之病毒載體可用於遞送編碼多肽或功能性RNA之異源核酸以治療及/或預防肝疾病或病症。肝疾病或病症可為例如原發性膽汁性肝硬化、非酒精性脂肪肝病(NAFLD)、非酒精性脂肪性肝炎(NASH)、自體免疫肝炎、B型肝炎、C型肝炎、酒精性肝病、纖維化、黃疸、原發性硬化性膽道炎(PSC)、巴-克二氏症候群(Budd-Chiari syndrome)、血色素沈著症、威爾森氏病(Wilson’s disease)、酒精性纖維化、非酒精性纖維化、肝脂肪變性、吉伯特氏症候群(Gilbert’s syndrome)、膽道閉鎖、α-1-抗胰蛋白酶缺陷、阿拉吉歐症候群(alagille syndrome)、進行性家族性肝內膽汁淤積、血友病B、遺傳性血管性水腫(HAE)、同型接合家族性高膽固醇血症(HoFH)、異型接合家族性高膽固醇血症(HeFH)、馮吉爾克病(Von Gierke's Disease,GSD I)、血友病A、甲基丙二酸血症、丙酸血症、高胱胺酸尿症、苯丙酮尿(PKU)、1型酪胺酸血症、精胺酸酶1缺陷、精胺酸琥珀酸溶解酶缺陷、磷酸胺甲醯基酯合成酶1缺陷、1型瓜胺酸血症、檸檬素缺陷、1型克-納二氏症候群(Crigler-Najjar Syndrome Type 1)、胱胺酸病、法佈瑞氏病、1b型肝糖貯積病、LPL缺陷、N-乙醯麩胺酸合成酶缺陷、鳥胺酸胺甲醯基轉移酶缺陷、鳥胺酸移位酶缺陷、1型原發性高草酸尿症或ADA SCID。
本文所述之病毒載體亦可用於產生經誘導之多潛能幹細胞(iPS)。舉例而言,本發明之病毒載體可用於將幹細胞相關之核酸遞送至非多潛能細胞,例如成人纖維母細胞、皮膚細胞、肝細胞、腎細胞、脂肪細胞、心臟細胞、神經細胞、上皮細胞、內皮細胞及諸如此類。
與幹細胞相關之核酸編碼因子為此項技術中已知。該等與幹細胞及多潛能性相關之因子之非限制性實例包括Oct-3/4、SOX家族(例如SOX 1、SOX2、SOX3及/或SOX 15)、Klf家族(例如Klfl、KHZ Klf4及/或Klf5)、Myc家族(例如C-myc、L-myc及/或N-myc)、NANOG及/或LIN28。
在一些實施例中,本文揭示之經修飾載體可用於治療如本文所述之溶酶體貯積症,例如黏多糖沈積症(例如斯賴症候群(Sly syndrome) [β-葡萄糖醛酸苷酶]、赫爾勒症候群(Hurler Syndrome) [α-L-艾杜糖醛酸酶]、沙伊症候群(Scheie Syndrome) [α-L-艾杜糖醛酸酶]、赫爾勒-沙伊二氏症候群(Hurler-Scheie Syndrome)[α-L-艾杜糖醛酸酶]、亨特氏症候群(Hunter's Syndrome) [艾杜糖醛酸硫酸酯酶]、聖菲利柏氏症候群(Sanfilippo Syndrome) (A [乙醯肝素硫酸酯酶]、B [N-乙醯葡萄糖胺酶]、C [乙醯基-CoA:α-葡萄胺糖苷乙醯基轉移酶]、D [N-乙醯葡萄糖胺6-硫酸酯酶])、莫奇症候群(Morquio Syndrome) (A [半乳糖-6-硫酸硫酸酯酶]、B [β-半乳糖苷酶])、馬-拉二氏症候群(Maroteaux-Lamy Syndrome) [N-乙醯半乳糖胺-4-硫酸酯酶]等)、法佈瑞氏病(Fabry disease) (a-半乳糖苷酶)、高歇氏病(Gaucher's disease) (葡萄糖腦苷脂酶)或肝糖貯積症(例如龐貝氏病(Pompe disease);溶酶體酸性α-葡萄糖苷酶)。在一些實施例中,亦可實踐本發明以治療及/或預防代謝失調,例如糖尿病(例如胰島素)、血友病(例如IX因子或VIII因子)、溶酶體貯積症(例如黏多糖沈積症,例如斯賴症候群[β-葡萄糖醛酸苷酶]、赫爾勒症候群[α-L-艾杜糖醛酸酶]、沙伊症候群[α-L-艾杜糖醛酸酶]、赫爾勒-沙伊二氏症候群[α-L-艾杜糖醛酸酶]、亨特氏症候群[艾杜糖醛酸硫酸酯酶]、聖菲利柏氏症候群A [乙醯肝素硫酸酯酶]、B [N-乙醯葡萄糖胺酶]、C [乙醯基-CoA:α-葡萄胺糖苷乙醯基轉移酶]、D [N-乙醯葡萄糖胺6-硫酸酯酶]、莫奇症候群A [半乳糖硫酸酯硫酸酯酶]、B [β-半乳糖苷酶]、馬-拉二氏症候群[N-乙醯半乳糖胺-4-硫酸酯酶]等)、法佈瑞氏病(α-半乳糖苷酶)、高歇氏病(葡萄糖腦苷脂酶)或肝糖貯積症(例如龐貝氏病;溶酶體酸性α-葡萄糖苷酶)。
基因轉移對於理解疾病狀態及為該等疾病狀態提供療法具有重要用途。許多遺傳病之缺陷基因係已知且已經選殖。一般而言,上述疾病狀態分為兩類:缺陷狀態(通常為酶),其通常以隱性方式遺傳;及不平衡狀態,其可涉及調控或結構蛋白,且通常以顯性方式遺傳。對於缺陷狀態之疾病,可使用基因轉移將正常基因帶入受侵襲之組織中進行替代治療,以及使用反義突變創建該疾病之動物模型。對於不平衡的疾病狀態,可使用基因轉移在模型系統中創建疾病狀態,然後可將其用於對抗疾病狀態。因此,根據本發明之病毒載體容許治療及/或預防遺傳病。
根據本發明之病毒載體亦可用於將功能性RNA在活體外或活體內提供至細胞。功能性RNA可為例如非編碼RNA。在一些實施例中,功能性RNA在細胞中之表現可減少細胞對特定靶蛋白之表現。因此,可投與功能性RNA以減少有需要之個體中特定蛋白質之表現。在一些實施例中,功能性RNA在細胞中之表現可增加細胞對特定靶蛋白之表現。因此,可投與功能性RNA以增加有需要之個體中特定蛋白質之表現。在一些實施例中,功能性RNA之表現可調控細胞中特定靶RNA之剪接。因此,可投與功能性RNA以調控有需要之個體中特定RNA之剪接。在一些實施例中,細胞中功能性RNA之表現可調控細胞特定靶蛋白之功能。因此,可投與功能性RNA以調控有需要之個體中特定蛋白質之功能。亦可將功能性RNA活體外投與至細胞以調控基因表現及/或細胞生理學,例如以最佳化細胞或組織培養系統或篩選方法。
在一些實施例中,本文揭示之病毒載體可與細胞
離體接觸。在一些實施例中,細胞為T細胞,諸如活化T細胞。在一些實施例中,細胞(例如活化T細胞)係獲自個體,諸如人類患者。在一些實施例中,將與病毒載體接觸之細胞投與至有需要之個體。
在一些實施例中,病毒載體包含編碼嵌合抗原受體(CAR)之異源核酸。因此,在一些實施例中,病毒載體與T細胞之接觸引起嵌合抗原受體(CAR)的表現以產生CAR T細胞。因此,在一些實施例中,本發明提供製備CAR T細胞之方法,其包含使本文揭示之任一種病毒載體與T細胞
離體接觸。本發明進一步提供使用本文揭示之任一種方法產生的CAR T細胞,以及治療有需要之個體的方法,該等方法包含向個體投與本文揭示之CAR T細胞。在一些實施例中,CAR T細胞使用自同一個體獲得的T細胞(自體T細胞)產生,而在其他實施例中,CAR T細胞使用自健康供體個體獲得的T細胞(同種異體T細胞)產生。需要CAR T細胞投與之個體可由醫師或熟練的醫學從業者確定,且可能患有任何疾病,諸如癌症,例如急性淋巴球白血病(ALL)、瀰漫性大B細胞淋巴瘤(DLBCL)、霍奇金氏淋巴瘤、急性骨髓性白血病(AML)或多發性骨髓瘤。
T細胞耗竭係在許多慢性感染及癌症期間出現的一種T細胞功能障礙狀態,且亦已被證明會降低CAR-T療法的有效性。在一些實施例中,本文揭示之重組病毒載體用於基因治療方法(例如CAR-T治療方法)以預防、限制及/或逆轉T細胞耗竭。因此,本發明提供緩解、預防、限制及/或逆轉個體之T細胞耗竭的方法,其包含向該個體投與有效量的本文揭示之任一種病毒載體(例如AAV載體)、任一種病毒粒子(例如AAV粒子)及/或任一種組合物。
在一些實施例中,病毒載體包含編碼免疫原,諸如免疫原性多肽之異源核酸。因此,在一些實施例中,病毒載體與細胞之接觸引起免疫原之表現。在一些實施例中,可將細胞投與至個體,且因此使得在個體中誘導針對免疫原之免疫反應。在一些實施例中,引發保護性免疫反應。在一些實施例中,細胞為抗原呈遞細胞(例如樹突狀細胞)。在一些實施例中,已自個體取出細胞,將病毒載體引入該等細胞中,且接著將細胞投與回個體中。自個體取出細胞用於離體操縱、然後引入回個體中之方法為此項技術中已知的(例如,參見美國專利第5,399,346號)。替代地,可將重組病毒載體引入供體個體之細胞、經培養細胞或任何其他適宜來源之細胞中,且將細胞投與至有需要之個體(亦即「接受者」個體)。
在一些實施例中,可自患有癌症之個體取出細胞,且與表現根據本發明之癌細胞抗原的病毒載體接觸。然後將經修飾細胞投與至個體,由此引發針對癌細胞抗原之免疫反應。此方法可有利地用於無法在
活體內引起足夠免疫反應(亦即無法產生足量增強抗體)的免疫受損個體。替代地,癌症抗原可作為病毒衣殼之一部分表現,或以其他方式與病毒衣殼締合(例如如上文所述)。作為另一替代方案,可投與此項技術中已知之任何其他治療性核酸(例如RNAi)或多肽(例如細胞介素)來治療及/或預防癌症。
此項技術中已知,免疫調節細胞介素(例如α-干擾素、β-干擾素、γ-干擾素、ω-干擾素、τ-干擾素、介白素-1-α、介白素-1β、介白素-2、介白素-3、介白素-4、介白素5、介白素-6、介白素-7、介白素-8、介白素-9、介白素-10、介白素-11、介白素-12、介白素-13、介白素-14、介白素-18、B細胞生長因子、CD40配位體、腫瘤壞死因子-α、腫瘤壞死因子-β、單核球化學吸引蛋白-1、顆粒球-巨噬細胞群落刺激因子及淋巴毒素)可增強免疫反應。因此,免疫調節細胞介素(較佳CTL誘導性細胞介素)可與病毒載體結合投與個體。細胞介素可藉由此項技術中已知之任一方法來投與。可將外源細胞介素投與至個體,或替代地可使用適合載體將編碼細胞介素之核酸遞送至個體,且在活體內產生細胞介素。
另外,根據本發明之病毒載體可用於診斷及篩選方法中,其中所關注核酸在細胞培養系統或者轉殖基因動物模型中瞬時或穩定表現。
本發明之病毒載體亦可用於多種非治療目的,包括但不限於用於評估基因靶向、清除、轉錄、轉譯等的方案中,如熟習此項技術者將顯而易見。病毒載體亦可用於評估安全性(擴散、毒性、免疫原性等)之目的。例如,美國食品及藥物管理局在評估臨床療效之前將此類資料視為監管批准過程之一部分。
在一些實施例中,本發明之經修飾病毒衣殼可用於產生針對新型衣殼結構的抗體。在一些實施例中,可將外源胺基酸序列插入經修飾病毒衣殼中用於抗原呈遞至細胞,例如投與至個體以產生針對外源胺基酸序列之免疫反應。
在一些實施例中,可在投與遞送編碼所關注多肽或功能性RNA之核酸的病毒載體之前及/或與其同時(例如在彼此之數分鐘或數小時內)投與病毒衣殼以阻斷某些細胞位點。舉例而言,可遞送本發明之衣殼以阻斷肝細胞上之細胞受體,且可隨後或同時投與遞送載體,此可減少肝細胞之轉導,且增強其他靶標(例如骨骼肌、心肌及/或膈肌)之轉導。
劑量及投與方法
可以適當感染複數根據適用於特定靶細胞之標準轉導方法將病毒載體引入細胞中。欲投與之病毒載體之效價可根據靶細胞類型及數量以及特定病毒載體而變化,且可由熟習此項技術者在無過度實驗之情況下確定。在代表性實施例中,將至少約10
3個感染單位、視情況至少約10
5個感染單位引入細胞中。
其中引入病毒載體之細胞可為任一類型,包括但不限於T細胞、神經細胞(包括外周及中樞神經系統之細胞,具體而言腦細胞,例如神經元及寡突神經膠質細胞)、肺細胞、眼細胞(包括視網膜細胞、視網膜色素上皮細胞及角膜細胞)、上皮細胞(例如腸及呼吸上皮細胞)、肌肉細胞(例如骨骼肌細胞、心肌細胞、平滑肌細胞及/或膈肌細胞)、樹突細胞、胰臟細胞(包括胰島細胞)、肝細胞、心肌細胞、骨細胞(例如骨髓幹細胞)、造血幹細胞、脾細胞、角質細胞、纖維母細胞、內皮細胞、前列腺細胞、生殖細胞及諸如此類。在代表性實施例中,細胞可為任何先驅細胞。作為另一可能性,細胞可為幹細胞(例如神經幹細胞、肝幹細胞)。作為另一替代,細胞可為癌細胞或腫瘤細胞。此外,細胞可來自任何起源物種,如上文所指示。
適用於
離體核酸遞送之細胞係如上文所述。欲投與個體之細胞之劑量將根據個體之年齡、病況及物種、細胞類型、細胞表現之核酸、投與模式及諸如此類而變化。通常,在醫藥學上可接受之載劑中,每劑量將投與至少約10
2至約10
8個細胞或至少約10
3至約10
6個細胞。在一些實施例中,經病毒載體轉導之細胞係以治療有效量與醫藥載劑組合投與個體。
在一些實施例中,將病毒載體引入細胞中且可將細胞投與至個體以引發針對所遞送多肽(例如表現為轉殖基因或表現於衣殼中)之免疫原性反應。通常,投與一定量之表現免疫原性有效量之多肽的細胞與醫藥學上可接受之載劑的組合。「免疫原性有效量」係在投與醫藥調配物之個體中足以引發針對多肽之主動免疫反應之所表現多肽的量。在一些實施例中,該劑量足以產生保護性免疫反應。所賦予之保護程度無需為完全或永久的,只要投與免疫原性多肽之益處超過其任何缺點即可。因此,本發明提供一種將核酸投與至細胞之方法,該方法包含使細胞與本發明之病毒載體、病毒粒子及/或組合物接觸。
欲投與至個體之病毒載體及/或衣殼之劑量視投與模式、欲治療及/或預防之疾病或病況、個別個體之病況、具體病毒載體或衣殼及欲遞送之核酸及諸如此類而定,且可以常規方式確定。用於達成治療效應之例示性劑量為以下效價:至少約10
5、約10
6、約10
7、約10
8、約10
9、約10
10、約10
11、約10
12、約10
13、約10
14、約10
15個轉導單位、視情況約10
8-10
13個轉導單位。在一些實施例中,可採用超過一次投與(例如兩次、三次、四次或更多次投與)在不同時間間隔(例如每天、每週、每月、每年等)內達成所需基因表現量。
向有需要之人類個體或動物投與本發明之病毒載體、病毒粒子及/或衣殼可藉由此項技術中已知之任何方法來實施。視情況,病毒載體、病毒粒子及/或組合物係以治療有效劑量在醫藥學上可接受之載劑中遞送。在一些實施例中,遞送治療有效量之病毒載體、病毒粒子及/或衣殼。
例示性投與模式包括口服、直腸、經黏膜、鼻內、吸入(例如經由氣溶膠)、經頰(例如舌下)、陰道、鞘內、眼內、經皮、子宮內(或卵內)、胃腸外(例如靜脈內、皮下、皮內、肌內[包括投與骨骼肌、膈肌及/或心肌]、皮內、胸膜內、腦內及關節內)、局部(例如投與皮膚及黏膜表面,包括氣道表面及經皮)、淋巴內及諸如此類以及直接組織或器官注入(例如投與肝、骨骼肌、心肌、膈肌或腦)。投與亦可為投與腫瘤(例如在腫瘤或淋巴結中或附近)。在任何給定情況下,最適宜途徑將視所治療及/或預防之病況之性質及嚴重程度以及所用具體載體之性質而定。亦可實踐本發明以產生用於全身性遞送之非編碼RNA,例如反義RNA、RNAi或其他功能性RNA (例如核酶)。
可注射物可以習用形式製備,呈液體溶液或懸浮液、適合在注射前溶解或懸浮於液體中之固體形式或乳液形式。替代地,可以局部而非全身方式、例如以貯庫或持續釋放調配物之形式投與本發明之病毒載體及/或病毒衣殼。此外,病毒載體及/或病毒衣殼可黏附至手術可植入基質來遞送(例如如美國專利公開案第US-2004-0013645-A1號中所述)。
實例
本文包括之以下實例僅用於說明目的,不欲進行限制。如本文所用,術語STRD.201、STRD.202、STRD.203、STRD.204、STRD.205、STRD.206及STRD.207用於描述衣殼蛋白序列,且術語AAV-STRD.201、AAV-STRD.202、AAV-STRD.203、AAV-STRD.204、AAV-STRD.205、AAV-STRD.206及AAV-STRD.207用於描述包含衣殼蛋白之AAV載體。然而,術語STRD.201、STRD.202、STRD.203、STRD.204、STRD.205、STRD.206及STRD. 207可在一些上下文中用於描述包含所命名衣殼之AAV載體,如熟習此項技術者將顯而易見。
實例 1 :包含轉導相關肽之 AAV 衣殼蛋白變異體的進化
活體外進化過程用於製備AAV衣殼蛋白變異體,該等變異體在併入AAV載體中時增強載體向T細胞中之轉導。此過程之第一步涉及使用冷凍電子顯微鏡鑑別AAV衣殼表面上之表面暴露區域。然後使用簡併引子對AAV衣殼之表面暴露區域內選定之殘基進行突變誘發,各密碼子經核苷酸NNK取代且基因片段藉由吉布森組裝(Gibson assembly)及/或多步PCR組合在一起。在此,對SEQ ID NO: 1之胺基酸殘基454-460進行隨機突變誘發以生成重組衣殼基因序列文庫。此簡併文庫中之各基因選殖至野生型AAV基因體中以替代原始編碼Cap之DNA序列,從而產生質體文庫。然後用腺病毒輔助質體將質體文庫轉染至293生產細胞株中以生成AAV衣殼文庫。經由DNA定序證實AAV文庫之成功生成。
為鑑別可靶向且有效轉導T細胞之AAV載體,對上述AAV文庫進行多輪
活體外選擇。特定言之,進行第一輪向混合細胞群體中之轉導,然後進行兩輪向活化供體T細胞中之轉導。在各階段,病毒DNA均經純化、PCR擴增且反向選殖至AAV載體中,且用於下一輪選擇。WO 2019/195449、WO 2019/195423及WO 2019/195444中提供了用於組合工程化及選擇AAV載體之一般方法的更多細節,該等公開案之內容各自以引用方式整體併入本文中。三輪感染後,自經培養T細胞中分離出AAV粒子。特定言之,細胞經裂解且自T細胞之細胞核及細胞質部分純化病毒DNA,PCR擴增且反向選殖至如上所述之AAV載體中。
在實例1中描述之三輪選擇及進化之後,在T細胞之細胞核及細胞質部分中富集的AAV變異體經定序以鑑別單一AAV分離物。在圖5所示之氣泡圖中,氣泡大小與讀段數目成正比。對在細胞核部分(AAV.STRD-203、205)、細胞質部分(AAV.STRD-206、207)或細胞核及細胞質部分(AAV.STRD-201、202及204)中最富集之AAV變異體進行定序以鑑別胺基酸位置454-460處存在的胺基酸殘基。參見圖6及表5。此等結果表明,含有包含表5之轉導相關肽之變異體衣殼蛋白的重組AAV病毒粒子能夠有效地轉導T細胞。
表 5 :使用
活體外進化過程鑑別之轉導相關肽
實例 2 :包含轉導相關肽之 AAV 載體的可製造性
變異體AAV6衣殼蛋白 | 轉導增強肽 | SEQ ID NO: | 對應AAV6衣殼變異體之SEQ ID NO |
STRD-201 | HAPRVEE | 17 | 2 |
STRD-202 | MAPRQEG | 18 | 4 |
STRD-203 | HTTDCAN | 19 | 6 |
STRD-204 | AAPRSET | 20 | 8 |
STRD-205 | QAPRQEG | 21 | 10 |
STRD-206 | VAPRDPA | 22 | 12 |
STRD-207 | SAPRSEN | 23 | 14 |
為確定實例1中鑑別之各種AAV載體是否可在大規模系統中製造,根據標準方法產生AAV,且比較產量與野生型AAV6載體之產量。
根據標準三重轉染方案,在HEK293細胞中產生AAV。簡言之,用以下質體轉染細胞:(i)包含野生型AAV9衣殼序列或表5中所列之變異體衣殼序列的質體,(ii)包含5’ITR、轉殖基因及3’ ITR序列之質體;及(iii)包含AAV產生所需之輔助基因的質體。自細胞培養物之上清液中純化AAV。隨後,使用基於PCR之定量方法量測各AAV之產量。
如圖1及表6中所示,包含STRD-201衣殼序列之重組AAV載體(此處稱為「AAV.STRD-201」)之產量高於野生型AAV6之產量。此外,AAV.STRD-204、AAV.STRD-205、AAV.STRD-206及AAV.STRD-207之產量與野生型AAV6之產量相當。
此等資料證實,包含衣殼變異體蛋白之AAV載體適於商業製造。
表 6 :
實例 3 : 表徵 T 細胞中 AAV 變異體對 GFP 轉殖基因的表現
重組AAV | 體積(mL) | 效價(載體基因體(vg)/mL) | 產量(總vg) |
AAV6 | 2.80 | 1.97E+11 | 5.52E+11 |
AAV.STRD-201 | 3.25 | 1.77E+11 | 5.75E+11 |
AAV.STRD-202 | 2.25 | 1.31E+11 | 2.95E+11 |
AAV.STRD-203 | 2.25 | 2.64E+09 | 5.94E+09 |
AAV.STRD-204 | 2.50 | 1.40E+11 | 3.50E+11 |
AAV.STRD-205 | 2.50 | 1.68E+11 | 4.20E+11 |
AAV.STRD-206 | 2.50 | 1.26E+11 | 3.15E+11 |
AAV.STRD-207 | 2.50 | 1.48E+11 | 3.70E+11 |
重組AAV變異體AAV.STRD-201、AAV.STRD-202、AAV.STRD-204、AAV.STRD-205、AAV.STRD-206及AAV.STRD-207或攜帶GFP轉殖基因序列之野生型AAV6載體被轉導至經活化T細胞中。由於T細胞在擴增過程中結塊,因此在成像之前將細胞上下吸移或混合。藉由顯微鏡觀測GFP之表現且來自實驗之影像在圖2中示出。較高GFP表現表明病毒載體向T細胞中之更有效轉導。如自圖2中之影像可見,與野生型AAV6病毒載體相比,所有AAV變異體均顯示更亮的綠色螢光信號,且因此,GFP在活化T細胞中之表現更高。在重組AAV變異體中,AAV.STRD-201及AAV.STRD-207顯示特別增強的GFP表現,表明向T細胞中之轉導更加增強。為了進一步分析與野生型AAV6病毒載體相比AAV變異體之GFP表現量,對經AAV6載體或AAV.STRD-207變異體轉導之T細胞進行流動式細胞測量術,單獨使用T細胞作為陰性對照。如圖3C所示,與經AAV6親本載體轉導之細胞群體相比,經AAV.STRD-207變異體轉導之細胞比例增加顯示更高的GFP信號(由藍線上方的FITC信號指示)。經AAV變異體(AAV.STRD-201、AAV.STRD-202、AAV.STRD-204、AAV.STRD-205、AAV.STRD-206及AAV.STRD-207)轉導之細胞中的GFP表現在圖4中進一步定量,其顯示給定群體中GFP陽性細胞之% (用條形表示)以及該群體中GFP之平均強度(用線圖表示)。結果表明,與野生型AAV6相比,GFP陽性細胞數目的增加與經AAV變異體轉導之細胞中GFP信號平均強度的增加很好地對應,表明AAV變異體向T細胞中之轉導的增強使得T細胞中之GFP表現增加。
前述內容係對本發明之說明,且不應理解為限制本發明。本發明係由所附申請專利範圍限定,其中包括申請專利範圍之等效內容。
圖1示出與野生型AAV6相比,對於包含變異體衣殼之各種AAV載體,使用實例2中所述之製造方法獲得的總載體基因體(vg)體積產量。
圖2示出來自用野生型AAV6或包含指定AAV6衣殼變異體之AAV載體轉導之T細胞之顯微分析的影像。各AAV載體均包裝有一個GFP轉殖基因。如所指示,使用不同感染複數(MOI)在經AAV載體轉導細胞後獲得影像。
圖3A-3C示出用野生型AAV6或包含變異體衣殼之指定AAV轉導之T細胞之流動式細胞測量術分析的結果,各AAV均包裝有GFP轉殖基因。圖3A示出所測試細胞樣品之大小及粒度(亦即,前向散射及側向散射),從中鑑別了所關注細胞群體(在圖中圈出)。圖3B僅示出選擇用於分析之細胞群體的大小及粒度。圖3C示出對於所關注細胞群體量測之螢光(FITC)信號。與用野生型AAV6轉導之細胞相比,用包含STRD-207衣殼之AAV載體轉導之細胞的螢光增加。
圖4示出獲自用野生型AAV6或包含所指示衣殼變異體之各AAV進行之流動式細胞測量術實驗的GFP陽性T細胞百分比圖。T細胞源自兩個不同的人類供體(供體11及供體12)。如所指示,使用不同的MOI (供體12 T細胞為10,000、5,000及2,500,且供體11 T細胞為15,000、7,500及3,750)。
圖5A及5B為氣泡圖,其描繪在如實例1中所述之針對T細胞轉導的三輪進化及選擇之後,自活化T細胞之細胞核部分(圖5A)及細胞質部分(圖5B)獲得的包含變異體衣殼之個別AAV的分離物。各氣泡代表不同的衣殼蛋白胺基酸序列,其中氣泡之半徑與對應文庫中該變異體之讀段數目成正比。y軸代表絕對讀段數目。為便於可視化,資料沿x軸分佈。選擇優勢分離物進行定序分析。
圖6示出在富含T細胞之細胞核部分或細胞質部分之AAV載體中鑑別之轉導相關肽的序列。此等轉導相關肽位於衣殼蛋白之胺基酸464-456處,其中胺基酸編號對應於野生型AAV6 (SEQ ID NO: 1)。圖6中示出之序列從上至下依次對應於SEQ ID NO: 17-23。
<![CDATA[<110> 美商史崔德生物公司(StrideBio, Inc.)]]> <![CDATA[<120> 靶向T細胞之AAV載體]]> <![CDATA[<130> STRD-022/01WO 331843-2276]]> <![CDATA[<150> US 63/137,497 ]]> <![CDATA[<151> 2021-01-14]]> <![CDATA[<160> 167 ]]> <![CDATA[<170> PatentIn version 3.5]]> <![CDATA[<210> 1]]> <![CDATA[<211> 736]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 依賴細小病毒屬腺相關依賴細小病毒A (Dependoparvovirus Adeno-associated dependoparvovirus A)]]> <![CDATA[<400> 1]]> Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 1 5 10 15 Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 20 25 30 Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 35 40 45 Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 50 55 60 Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 65 70 75 80 Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 85 90 95 Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 100 105 110 Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 115 120 125 Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 130 135 140 Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 145 150 155 160 Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 165 170 175 Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 180 185 190 Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 195 200 205 Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 210 215 220 Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 225 230 235 240 Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 245 250 255 Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 260 265 270 Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 275 280 285 His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn 290 295 300 Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln 305 310 315 320 Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn Asn 325 330 335 Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu Pro 340 345 350 Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala 355 360 365 Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 370 375 380 Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 385 390 395 400 Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 405 410 415 Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp 420 425 430 Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg 435 440 445 Thr Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu Phe Ser 450 455 460 Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu Pro 465 470 475 480 Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp Asn 485 490 495 Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 500 505 510 Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His Lys 515 520 525 Asp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe Gly 530 535 540 Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met Ile 545 550 555 560 Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 565 570 575 Phe Gly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro Ala 580 585 590 Thr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gln 595 600 605 Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 610 615 620 Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 625 630 635 640 Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 645 650 655 Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile Thr 660 665 670 Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 675 680 685 Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser Asn 690 695 700 Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu 705 710 715 720 Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 725 730 735 <![CDATA[<210> 2]]> <![CDATA[<211> 736]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> CAM640-01衣殼]]> <![CDATA[<400> 2]]> Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 1 5 10 15 Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 20 25 30 Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 35 40 45 Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 50 55 60 Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 65 70 75 80 Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 85 90 95 Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 100 105 110 Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 115 120 125 Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 130 135 140 Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 145 150 155 160 Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 165 170 175 Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 180 185 190 Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 195 200 205 Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 210 215 220 Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 225 230 235 240 Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 245 250 255 Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 260 265 270 Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 275 280 285 His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn 290 295 300 Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln 305 310 315 320 Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn Asn 325 330 335 Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu Pro 340 345 350 Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala 355 360 365 Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 370 375 380 Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 385 390 395 400 Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 405 410 415 Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp 420 425 430 Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg 435 440 445 Thr Gln Asn Gln Ser His Ala Pro Arg Val Glu Glu Leu Leu Phe Ser 450 455 460 Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu Pro 465 470 475 480 Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp Asn 485 490 495 Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 500 505 510 Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His Lys 515 520 525 Asp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe Gly 530 535 540 Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met Ile 545 550 555 560 Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 565 570 575 Phe Gly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro Ala 580 585 590 Thr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gln 595 600 605 Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 610 615 620 Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 625 630 635 640 Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 645 650 655 Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile Thr 660 665 670 Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 675 680 685 Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser Asn 690 695 700 Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu 705 710 715 720 Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 725 730 735 <![CDATA[<210> 3]]> <![CDATA[<211> 2211]]> <![CDATA[<212> DNA]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> CAM640-01衣殼]]> <![CDATA[<400> 3]]> atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc 60 gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac 120 gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac 180 aagggggagc ccgtcaacgc ggcggatgca gcggccctcg agcacgacaa ggcctacgac 240 cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt 300 caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360 gccaagaaga gggttctcga accttttggt ctggttgagg aaggtgctaa gacggctcct 420 ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcattggc 480 aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag 540 tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct 600 actacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660 gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc 720 accaccagca cccgaacatg ggccttgccc acctataaca accacctcta caagcaaatc 780 tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg 840 gggtattttg atttcaacag attccactgc catttctcac cacgtgactg gcagcgactc 900 atcaacaaca attggggatt ccggcccaag agactcaact tcaagctctt caacatccaa 960 gtcaaggagg tcacgacgaa tgatggcgtc acgaccatcg ctaataacct taccagcacg 1020 gttcaagtct tctcggactc ggagtaccag ttgccgtacg tcctcggctc tgcgcaccag 1080 ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcagtacgg ctacctaacg 1140 ctcaacaatg gcagccaggc agtgggacgg tcatcctttt actgcctgga atatttccca 1200 tcgcagatgc tgagaacggg caataacttt accttcagct acaccttcga ggacgtgcct 1260 ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc tctcatcgac 1320 cagtacctgt attacctgaa cagaactcag aatcagtccc acgccccccg ggtggaggag 1380 ttgctgttta gccgggggtc tccagctggc atgtctgttc agcccaaaaa ctggctacct 1440 ggaccctgtt accggcagca gcgcgtttct aaaacaaaaa cagacaacaa caacagcaac 1500 tttacctgga ctggtgcttc aaaatataac cttaatgggc gtgaatctat aatcaaccct 1560 ggcactgcta tggcctcaca caaagacgac aaagacaagt tctttcccat gagcggtgtc 1620 atgatttttg gaaaggagag cgccggagct tcaaacactg cattggacaa tgtcatgatc 1680 acagacgaag aggaaatcaa agccactaac cccgtggcca ccgaaagatt tgggactgtg 1740 gcagtcaatc tccagagcag cagcacagac cctgcgaccg gagatgtgca tgttatggga 1800 gccttacctg gaatggtgtg gcaagacaga gacgtatacc tgcagggtcc tatttgggcc 1860 aaaattcctc acacggatgg acactttcac ccgtctcctc tcatgggcgg ctttggactt 1920 aagcacccgc ctcctcagat cctcatcaaa aacacgcctg ttcctgcgaa tcctccggca 1980 gagttttcgg ctacaaagtt tgcttcattc atcacccagt attccacagg acaagtgagc 2040 gtggagattg aatgggagct gcagaaagaa aacagcaaac gctggaatcc cgaagtgcag 2100 tatacatcta actatgcaaa atctgccaac gttgatttca ctgtggacaa caatggactt 2160 tatactgagc ctcgccccat tggcacccgt tacctcaccc gtcccctgta a 2211 <![CDATA[<210> 4]]> <![CDATA[<211> 736]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> CAM640-02衣殼]]> <![CDATA[<400> 4]]> Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 1 5 10 15 Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 20 25 30 Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 35 40 45 Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 50 55 60 Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 65 70 75 80 Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 85 90 95 Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 100 105 110 Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 115 120 125 Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 130 135 140 Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 145 150 155 160 Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 165 170 175 Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 180 185 190 Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 195 200 205 Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 210 215 220 Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 225 230 235 240 Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 245 250 255 Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 260 265 270 Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 275 280 285 His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn 290 295 300 Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln 305 310 315 320 Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn Asn 325 330 335 Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu Pro 340 345 350 Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala 355 360 365 Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 370 375 380 Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 385 390 395 400 Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 405 410 415 Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp 420 425 430 Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg 435 440 445 Thr Gln Asn Gln Ser Met Ala Pro Arg Gln Glu Gly Leu Leu Phe Ser 450 455 460 Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu Pro 465 470 475 480 Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp Asn 485 490 495 Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 500 505 510 Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His Lys 515 520 525 Asp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe Gly 530 535 540 Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met Ile 545 550 555 560 Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 565 570 575 Phe Gly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro Ala 580 585 590 Thr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gln 595 600 605 Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 610 615 620 Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 625 630 635 640 Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 645 650 655 Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile Thr 660 665 670 Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 675 680 685 Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser Asn 690 695 700 Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu 705 710 715 720 Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 725 730 735 <![CDATA[<210> 5]]> <![CDATA[<211> 2211]]> <![CDATA[<212> DNA]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> CAM640-02衣殼]]> <![CDATA[<400> 5]]> atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc 60 gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac 120 gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac 180 aagggggagc ccgtcaacgc ggcggatgca gcggccctcg agcacgacaa ggcctacgac 240 cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt 300 caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360 gccaagaaga gggttctcga accttttggt ctggttgagg aaggtgctaa gacggctcct 420 ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcattggc 480 aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag 540 tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct 600 actacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660 gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc 720 accaccagca cccgaacatg ggccttgccc acctataaca accacctcta caagcaaatc 780 tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg 840 gggtattttg atttcaacag attccactgc catttctcac cacgtgactg gcagcgactc 900 atcaacaaca attggggatt ccggcccaag agactcaact tcaagctctt caacatccaa 960 gtcaaggagg tcacgacgaa tgatggcgtc acgaccatcg ctaataacct taccagcacg 1020 gttcaagtct tctcggactc ggagtaccag ttgccgtacg tcctcggctc tgcgcaccag 1080 ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcagtacgg ctacctaacg 1140 ctcaacaatg gcagccaggc agtgggacgg tcatcctttt actgcctgga atatttccca 1200 tcgcagatgc tgagaacggg caataacttt accttcagct acaccttcga ggacgtgcct 1260 ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc tctcatcgac 1320 cagtacctgt attacctgaa cagaactcag aatcagtcca tggccccccg gcaggagggc 1380 ttgctgttta gccgggggtc tccagctggc atgtctgttc agcccaaaaa ctggctacct 1440 ggaccctgtt accggcagca gcgcgtttct aaaacaaaaa cagacaacaa caacagcaac 1500 tttacctgga ctggtgcttc aaaatataac cttaatgggc gtgaatctat aatcaaccct 1560 ggcactgcta tggcctcaca caaagacgac aaagacaagt tctttcccat gagcggtgtc 1620 atgatttttg gaaaggagag cgccggagct tcaaacactg cattggacaa tgtcatgatc 1680 acagacgaag aggaaatcaa agccactaac cccgtggcca ccgaaagatt tgggactgtg 1740 gcagtcaatc tccagagcag cagcacagac cctgcgaccg gagatgtgca tgttatggga 1800 gccttacctg gaatggtgtg gcaagacaga gacgtatacc tgcagggtcc tatttgggcc 1860 aaaattcctc acacggatgg acactttcac ccgtctcctc tcatgggcgg ctttggactt 1920 aagcacccgc ctcctcagat cctcatcaaa aacacgcctg ttcctgcgaa tcctccggca 1980 gagttttcgg ctacaaagtt tgcttcattc atcacccagt attccacagg acaagtgagc 2040 gtggagattg aatgggagct gcagaaagaa aacagcaaac gctggaatcc cgaagtgcag 2100 tatacatcta actatgcaaa atctgccaac gttgatttca ctgtggacaa caatggactt 2160 tatactgagc ctcgccccat tggcacccgt tacctcaccc gtcccctgta a 2211 <![CDATA[<210> 6]]> <![CDATA[<211> 736]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> CAM640-03衣殼]]> <![CDATA[<400> 6]]> Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 1 5 10 15 Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 20 25 30 Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 35 40 45 Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 50 55 60 Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 65 70 75 80 Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 85 90 95 Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 100 105 110 Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 115 120 125 Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 130 135 140 Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 145 150 155 160 Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 165 170 175 Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 180 185 190 Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 195 200 205 Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 210 215 220 Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 225 230 235 240 Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 245 250 255 Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 260 265 270 Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 275 280 285 His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn 290 295 300 Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln 305 310 315 320 Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn Asn 325 330 335 Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu Pro 340 345 350 Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala 355 360 365 Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 370 375 380 Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 385 390 395 400 Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 405 410 415 Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp 420 425 430 Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg 435 440 445 Thr Gln Asn Gln Ser His Thr Thr Asp Cys Ala Asn Leu Leu Phe Ser 450 455 460 Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu Pro 465 470 475 480 Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp Asn 485 490 495 Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 500 505 510 Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His Lys 515 520 525 Asp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe Gly 530 535 540 Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met Ile 545 550 555 560 Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 565 570 575 Phe Gly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro Ala 580 585 590 Thr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gln 595 600 605 Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 610 615 620 Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 625 630 635 640 Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 645 650 655 Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile Thr 660 665 670 Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 675 680 685 Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser Asn 690 695 700 Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu 705 710 715 720 Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 725 730 735 <![CDATA[<210> 7]]> <![CDATA[<211> 2211]]> <![CDATA[<212> DNA]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> CAM640-03衣殼]]> <![CDATA[<400> 7]]> atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc 60 gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac 120 gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac 180 aagggggagc ccgtcaacgc ggcggatgca gcggccctcg agcacgacaa ggcctacgac 240 cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt 300 caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360 gccaagaaga gggttctcga accttttggt ctggttgagg aaggtgctaa gacggctcct 420 ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcattggc 480 aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag 540 tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct 600 actacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660 gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc 720 accaccagca cccgaacatg ggccttgccc acctataaca accacctcta caagcaaatc 780 tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg 840 gggtattttg atttcaacag attccactgc catttctcac cacgtgactg gcagcgactc 900 atcaacaaca attggggatt ccggcccaag agactcaact tcaagctctt caacatccaa 960 gtcaaggagg tcacgacgaa tgatggcgtc acgaccatcg ctaataacct taccagcacg 1020 gttcaagtct tctcggactc ggagtaccag ttgccgtacg tcctcggctc tgcgcaccag 1080 ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcagtacgg ctacctaacg 1140 ctcaacaatg gcagccaggc agtgggacgg tcatcctttt actgcctgga atatttccca 1200 tcgcagatgc tgagaacggg caataacttt accttcagct acaccttcga ggacgtgcct 1260 ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc tctcatcgac 1320 cagtacctgt attacctgaa cagaactcag aatcagtccc acaccaccga ctgcgccaac 1380 ttgctgttta gccgggggtc tccagctggc atgtctgttc agcccaaaaa ctggctacct 1440 ggaccctgtt accggcagca gcgcgtttct aaaacaaaaa cagacaacaa caacagcaac 1500 tttacctgga ctggtgcttc aaaatataac cttaatgggc gtgaatctat aatcaaccct 1560 ggcactgcta tggcctcaca caaagacgac aaagacaagt tctttcccat gagcggtgtc 1620 atgatttttg gaaaggagag cgccggagct tcaaacactg cattggacaa tgtcatgatc 1680 acagacgaag aggaaatcaa agccactaac cccgtggcca ccgaaagatt tgggactgtg 1740 gcagtcaatc tccagagcag cagcacagac cctgcgaccg gagatgtgca tgttatggga 1800 gccttacctg gaatggtgtg gcaagacaga gacgtatacc tgcagggtcc tatttgggcc 1860 aaaattcctc acacggatgg acactttcac ccgtctcctc tcatgggcgg ctttggactt 1920 aagcacccgc ctcctcagat cctcatcaaa aacacgcctg ttcctgcgaa tcctccggca 1980 gagttttcgg ctacaaagtt tgcttcattc atcacccagt attccacagg acaagtgagc 2040 gtggagattg aatgggagct gcagaaagaa aacagcaaac gctggaatcc cgaagtgcag 2100 tatacatcta actatgcaaa atctgccaac gttgatttca ctgtggacaa caatggactt 2160 tatactgagc ctcgccccat tggcacccgt tacctcaccc gtcccctgta a 2211 <![CDATA[<210> 8]]> <![CDATA[<211> 736]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> CAM640-04衣殼]]> <![CDATA[<400> 8]]> Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 1 5 10 15 Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 20 25 30 Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 35 40 45 Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 50 55 60 Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 65 70 75 80 Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 85 90 95 Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 100 105 110 Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 115 120 125 Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 130 135 140 Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 145 150 155 160 Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 165 170 175 Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 180 185 190 Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 195 200 205 Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 210 215 220 Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 225 230 235 240 Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 245 250 255 Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 260 265 270 Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 275 280 285 His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn 290 295 300 Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln 305 310 315 320 Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn Asn 325 330 335 Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu Pro 340 345 350 Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala 355 360 365 Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 370 375 380 Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 385 390 395 400 Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 405 410 415 Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp 420 425 430 Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg 435 440 445 Thr Gln Asn Gln Ser Ala Ala Pro Arg Ser Glu Thr Leu Leu Phe Ser 450 455 460 Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu Pro 465 470 475 480 Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp Asn 485 490 495 Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 500 505 510 Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His Lys 515 520 525 Asp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe Gly 530 535 540 Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met Ile 545 550 555 560 Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 565 570 575 Phe Gly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro Ala 580 585 590 Thr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gln 595 600 605 Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 610 615 620 Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 625 630 635 640 Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 645 650 655 Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile Thr 660 665 670 Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 675 680 685 Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser Asn 690 695 700 Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu 705 710 715 720 Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 725 730 735 <![CDATA[<210> 9]]> <![CDATA[<211> 2211]]> <![CDATA[<212> DNA]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> CAM640-04衣殼]]> <![CDATA[<400> 9]]> atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc 60 gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac 120 gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac 180 aagggggagc ccgtcaacgc ggcggatgca gcggccctcg agcacgacaa ggcctacgac 240 cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt 300 caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360 gccaagaaga gggttctcga accttttggt ctggttgagg aaggtgctaa gacggctcct 420 ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcattggc 480 aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag 540 tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct 600 actacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660 gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc 720 accaccagca cccgaacatg ggccttgccc acctataaca accacctcta caagcaaatc 780 tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg 840 gggtattttg atttcaacag attccactgc catttctcac cacgtgactg gcagcgactc 900 atcaacaaca attggggatt ccggcccaag agactcaact tcaagctctt caacatccaa 960 gtcaaggagg tcacgacgaa tgatggcgtc acgaccatcg ctaataacct taccagcacg 1020 gttcaagtct tctcggactc ggagtaccag ttgccgtacg tcctcggctc tgcgcaccag 1080 ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcagtacgg ctacctaacg 1140 ctcaacaatg gcagccaggc agtgggacgg tcatcctttt actgcctgga atatttccca 1200 tcgcagatgc tgagaacggg caataacttt accttcagct acaccttcga ggacgtgcct 1260 ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc tctcatcgac 1320 cagtacctgt attacctgaa cagaactcag aatcagtccg ccgccccccg gtccgagacc 1380 ttgctgttta gccgggggtc tccagctggc atgtctgttc agcccaaaaa ctggctacct 1440 ggaccctgtt accggcagca gcgcgtttct aaaacaaaaa cagacaacaa caacagcaac 1500 tttacctgga ctggtgcttc aaaatataac cttaatgggc gtgaatctat aatcaaccct 1560 ggcactgcta tggcctcaca caaagacgac aaagacaagt tctttcccat gagcggtgtc 1620 atgatttttg gaaaggagag cgccggagct tcaaacactg cattggacaa tgtcatgatc 1680 acagacgaag aggaaatcaa agccactaac cccgtggcca ccgaaagatt tgggactgtg 1740 gcagtcaatc tccagagcag cagcacagac cctgcgaccg gagatgtgca tgttatggga 1800 gccttacctg gaatggtgtg gcaagacaga gacgtatacc tgcagggtcc tatttgggcc 1860 aaaattcctc acacggatgg acactttcac ccgtctcctc tcatgggcgg ctttggactt 1920 aagcacccgc ctcctcagat cctcatcaaa aacacgcctg ttcctgcgaa tcctccggca 1980 gagttttcgg ctacaaagtt tgcttcattc atcacccagt attccacagg acaagtgagc 2040 gtggagattg aatgggagct gcagaaagaa aacagcaaac gctggaatcc cgaagtgcag 2100 tatacatcta actatgcaaa atctgccaac gttgatttca ctgtggacaa caatggactt 2160 tatactgagc ctcgccccat tggcacccgt tacctcaccc gtcccctgta a 2211 <![CDATA[<210> 10]]> <![CDATA[<211> 736]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> CAM640-05衣殼]]> <![CDATA[<400> 10]]> Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 1 5 10 15 Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 20 25 30 Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 35 40 45 Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 50 55 60 Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 65 70 75 80 Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 85 90 95 Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 100 105 110 Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 115 120 125 Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 130 135 140 Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 145 150 155 160 Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 165 170 175 Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 180 185 190 Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 195 200 205 Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 210 215 220 Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 225 230 235 240 Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 245 250 255 Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 260 265 270 Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 275 280 285 His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn 290 295 300 Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln 305 310 315 320 Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn Asn 325 330 335 Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu Pro 340 345 350 Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala 355 360 365 Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 370 375 380 Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 385 390 395 400 Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 405 410 415 Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp 420 425 430 Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg 435 440 445 Thr Gln Asn Gln Ser Gln Ala Pro Arg Gln Glu Gly Leu Leu Phe Ser 450 455 460 Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu Pro 465 470 475 480 Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp Asn 485 490 495 Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 500 505 510 Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His Lys 515 520 525 Asp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe Gly 530 535 540 Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met Ile 545 550 555 560 Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 565 570 575 Phe Gly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro Ala 580 585 590 Thr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gln 595 600 605 Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 610 615 620 Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 625 630 635 640 Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 645 650 655 Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile Thr 660 665 670 Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 675 680 685 Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser Asn 690 695 700 Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu 705 710 715 720 Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 725 730 735 <![CDATA[<210> 11]]> <![CDATA[<211> 2211]]> <![CDATA[<212> DNA]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> CAM640-05衣殼]]> <![CDATA[<400> 11]]> atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc 60 gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac 120 gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac 180 aagggggagc ccgtcaacgc ggcggatgca gcggccctcg agcacgacaa ggcctacgac 240 cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt 300 caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360 gccaagaaga gggttctcga accttttggt ctggttgagg aaggtgctaa gacggctcct 420 ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcattggc 480 aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag 540 tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct 600 actacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660 gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc 720 accaccagca cccgaacatg ggccttgccc acctataaca accacctcta caagcaaatc 780 tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg 840 gggtattttg atttcaacag attccactgc catttctcac cacgtgactg gcagcgactc 900 atcaacaaca attggggatt ccggcccaag agactcaact tcaagctctt caacatccaa 960 gtcaaggagg tcacgacgaa tgatggcgtc acgaccatcg ctaataacct taccagcacg 1020 gttcaagtct tctcggactc ggagtaccag ttgccgtacg tcctcggctc tgcgcaccag 1080 ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcagtacgg ctacctaacg 1140 ctcaacaatg gcagccaggc agtgggacgg tcatcctttt actgcctgga atatttccca 1200 tcgcagatgc tgagaacggg caataacttt accttcagct acaccttcga ggacgtgcct 1260 ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc tctcatcgac 1320 cagtacctgt attacctgaa cagaactcag aatcagtccc aggccccccg gcaggagggc 1380 ttgctgttta gccgggggtc tccagctggc atgtctgttc agcccaaaaa ctggctacct 1440 ggaccctgtt accggcagca gcgcgtttct aaaacaaaaa cagacaacaa caacagcaac 1500 tttacctgga ctggtgcttc aaaatataac cttaatgggc gtgaatctat aatcaaccct 1560 ggcactgcta tggcctcaca caaagacgac aaagacaagt tctttcccat gagcggtgtc 1620 atgatttttg gaaaggagag cgccggagct tcaaacactg cattggacaa tgtcatgatc 1680 acagacgaag aggaaatcaa agccactaac cccgtggcca ccgaaagatt tgggactgtg 1740 gcagtcaatc tccagagcag cagcacagac cctgcgaccg gagatgtgca tgttatggga 1800 gccttacctg gaatggtgtg gcaagacaga gacgtatacc tgcagggtcc tatttgggcc 1860 aaaattcctc acacggatgg acactttcac ccgtctcctc tcatgggcgg ctttggactt 1920 aagcacccgc ctcctcagat cctcatcaaa aacacgcctg ttcctgcgaa tcctccggca 1980 gagttttcgg ctacaaagtt tgcttcattc atcacccagt attccacagg acaagtgagc 2040 gtggagattg aatgggagct gcagaaagaa aacagcaaac gctggaatcc cgaagtgcag 2100 tatacatcta actatgcaaa atctgccaac gttgatttca ctgtggacaa caatggactt 2160 tatactgagc ctcgccccat tggcacccgt tacctcaccc gtcccctgta a 2211 <![CDATA[<210> 12]]> <![CDATA[<211> 736]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> CAM640-06衣殼]]> <![CDATA[<400> 12]]> Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 1 5 10 15 Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 20 25 30 Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 35 40 45 Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 50 55 60 Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 65 70 75 80 Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 85 90 95 Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 100 105 110 Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 115 120 125 Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 130 135 140 Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 145 150 155 160 Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 165 170 175 Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 180 185 190 Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 195 200 205 Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 210 215 220 Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 225 230 235 240 Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 245 250 255 Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 260 265 270 Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 275 280 285 His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn 290 295 300 Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln 305 310 315 320 Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn Asn 325 330 335 Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu Pro 340 345 350 Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala 355 360 365 Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 370 375 380 Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 385 390 395 400 Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 405 410 415 Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp 420 425 430 Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg 435 440 445 Thr Gln Asn Gln Ser Val Ala Pro Arg Asp Pro Ala Leu Leu Phe Ser 450 455 460 Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu Pro 465 470 475 480 Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp Asn 485 490 495 Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 500 505 510 Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His Lys 515 520 525 Asp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe Gly 530 535 540 Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met Ile 545 550 555 560 Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 565 570 575 Phe Gly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro Ala 580 585 590 Thr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gln 595 600 605 Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 610 615 620 Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 625 630 635 640 Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 645 650 655 Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile Thr 660 665 670 Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 675 680 685 Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser Asn 690 695 700 Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu 705 710 715 720 Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 725 730 735 <![CDATA[<210> 13]]> <![CDATA[<211> 2211]]> <![CDATA[<212> DNA]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> CAM640-06衣殼]]> <![CDATA[<400> 13]]> atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc 60 gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac 120 gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac 180 aagggggagc ccgtcaacgc ggcggatgca gcggccctcg agcacgacaa ggcctacgac 240 cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt 300 caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360 gccaagaaga gggttctcga accttttggt ctggttgagg aaggtgctaa gacggctcct 420 ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcattggc 480 aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag 540 tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct 600 actacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660 gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc 720 accaccagca cccgaacatg ggccttgccc acctataaca accacctcta caagcaaatc 780 tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg 840 gggtattttg atttcaacag attccactgc catttctcac cacgtgactg gcagcgactc 900 atcaacaaca attggggatt ccggcccaag agactcaact tcaagctctt caacatccaa 960 gtcaaggagg tcacgacgaa tgatggcgtc acgaccatcg ctaataacct taccagcacg 1020 gttcaagtct tctcggactc ggagtaccag ttgccgtacg tcctcggctc tgcgcaccag 1080 ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcagtacgg ctacctaacg 1140 ctcaacaatg gcagccaggc agtgggacgg tcatcctttt actgcctgga atatttccca 1200 tcgcagatgc tgagaacggg caataacttt accttcagct acaccttcga ggacgtgcct 1260 ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc tctcatcgac 1320 cagtacctgt attacctgaa cagaactcag aatcagtccg tggccccccg ggaccccgcc 1380 ttgctgttta gccgggggtc tccagctggc atgtctgttc agcccaaaaa ctggctacct 1440 ggaccctgtt accggcagca gcgcgtttct aaaacaaaaa cagacaacaa caacagcaac 1500 tttacctgga ctggtgcttc aaaatataac cttaatgggc gtgaatctat aatcaaccct 1560 ggcactgcta tggcctcaca caaagacgac aaagacaagt tctttcccat gagcggtgtc 1620 atgatttttg gaaaggagag cgccggagct tcaaacactg cattggacaa tgtcatgatc 1680 acagacgaag aggaaatcaa agccactaac cccgtggcca ccgaaagatt tgggactgtg 1740 gcagtcaatc tccagagcag cagcacagac cctgcgaccg gagatgtgca tgttatggga 1800 gccttacctg gaatggtgtg gcaagacaga gacgtatacc tgcagggtcc tatttgggcc 1860 aaaattcctc acacggatgg acactttcac ccgtctcctc tcatgggcgg ctttggactt 1920 aagcacccgc ctcctcagat cctcatcaaa aacacgcctg ttcctgcgaa tcctccggca 1980 gagttttcgg ctacaaagtt tgcttcattc atcacccagt attccacagg acaagtgagc 2040 gtggagattg aatgggagct gcagaaagaa aacagcaaac gctggaatcc cgaagtgcag 2100 tatacatcta actatgcaaa atctgccaac gttgatttca ctgtggacaa caatggactt 2160 tatactgagc ctcgccccat tggcacccgt tacctcaccc gtcccctgta a 2211 <![CDATA[<210> 14]]> <![CDATA[<211> 736]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> CAM640-07衣殼]]> <![CDATA[<400> 14]]> Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 1 5 10 15 Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 20 25 30 Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 35 40 45 Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 50 55 60 Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 65 70 75 80 Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 85 90 95 Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 100 105 110 Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 115 120 125 Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 130 135 140 Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 145 150 155 160 Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 165 170 175 Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 180 185 190 Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 195 200 205 Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 210 215 220 Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 225 230 235 240 Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 245 250 255 Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 260 265 270 Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 275 280 285 His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn 290 295 300 Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln 305 310 315 320 Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn Asn 325 330 335 Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu Pro 340 345 350 Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala 355 360 365 Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 370 375 380 Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 385 390 395 400 Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 405 410 415 Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp 420 425 430 Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg 435 440 445 Thr Gln Asn Gln Ser Ser Ala Pro Arg Ser Glu Asn Leu Leu Phe Ser 450 455 460 Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu Pro 465 470 475 480 Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp Asn 485 490 495 Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 500 505 510 Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His Lys 515 520 525 Asp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe Gly 530 535 540 Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met Ile 545 550 555 560 Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 565 570 575 Phe Gly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro Ala 580 585 590 Thr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gln 595 600 605 Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 610 615 620 Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 625 630 635 640 Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 645 650 655 Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile Thr 660 665 670 Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 675 680 685 Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser Asn 690 695 700 Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu 705 710 715 720 Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 725 730 735 <![CDATA[<210> 15]]> <![CDATA[<211> 2211]]> <![CDATA[<212> DNA]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> CAM640-07衣殼]]> <![CDATA[<400> 15]]> atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc 60 gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac 120 gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac 180 aagggggagc ccgtcaacgc ggcggatgca gcggccctcg agcacgacaa ggcctacgac 240 cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt 300 caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360 gccaagaaga gggttctcga accttttggt ctggttgagg aaggtgctaa gacggctcct 420 ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcattggc 480 aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag 540 tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct 600 actacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660 gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc 720 accaccagca cccgaacatg ggccttgccc acctataaca accacctcta caagcaaatc 780 tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg 840 gggtattttg atttcaacag attccactgc catttctcac cacgtgactg gcagcgactc 900 atcaacaaca attggggatt ccggcccaag agactcaact tcaagctctt caacatccaa 960 gtcaaggagg tcacgacgaa tgatggcgtc acgaccatcg ctaataacct taccagcacg 1020 gttcaagtct tctcggactc ggagtaccag ttgccgtacg tcctcggctc tgcgcaccag 1080 ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcagtacgg ctacctaacg 1140 ctcaacaatg gcagccaggc agtgggacgg tcatcctttt actgcctgga atatttccca 1200 tcgcagatgc tgagaacggg caataacttt accttcagct acaccttcga ggacgtgcct 1260 ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc tctcatcgac 1320 cagtacctgt attacctgaa cagaactcag aatcagtcca gcgccccccg gagcgagaac 1380 ttgctgttta gccgggggtc tccagctggc atgtctgttc agcccaaaaa ctggctacct 1440 ggaccctgtt accggcagca gcgcgtttct aaaacaaaaa cagacaacaa caacagcaac 1500 tttacctgga ctggtgcttc aaaatataac cttaatgggc gtgaatctat aatcaaccct 1560 ggcactgcta tggcctcaca caaagacgac aaagacaagt tctttcccat gagcggtgtc 1620 atgatttttg gaaaggagag cgccggagct tcaaacactg cattggacaa tgtcatgatc 1680 acagacgaag aggaaatcaa agccactaac cccgtggcca ccgaaagatt tgggactgtg 1740 gcagtcaatc tccagagcag cagcacagac cctgcgaccg gagatgtgca tgttatggga 1800 gccttacctg gaatggtgtg gcaagacaga gacgtatacc tgcagggtcc tatttgggcc 1860 aaaattcctc acacggatgg acactttcac ccgtctcctc tcatgggcgg ctttggactt 1920 aagcacccgc ctcctcagat cctcatcaaa aacacgcctg ttcctgcgaa tcctccggca 1980 gagttttcgg ctacaaagtt tgcttcattc atcacccagt attccacagg acaagtgagc 2040 gtggagattg aatgggagct gcagaaagaa aacagcaaac gctggaatcc cgaagtgcag 2100 tatacatcta actatgcaaa atctgccaac gttgatttca ctgtggacaa caatggactt 2160 tatactgagc ctcgccccat tggcacccgt tacctcaccc gtcccctgta a 2211 <![CDATA[<210> 16]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 一致序列]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (1)..(1)]]> <![CDATA[<223> Xaa為His、Met、Gln、Val或Ser]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (2)..(2)]]> <![CDATA[<223> Xaa為Ala或Ser]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (3)..(3)]]> <![CDATA[<223> Xaa為Pro或Thr]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (4)..(4)]]> <![CDATA[<223> Xaa為Arg或Asp]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (5)..(5)]]> <![CDATA[<223> Xaa為Val、Gln、Cys、Ser或Asp]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (6)..(6)]]> <![CDATA[<223> Xaa為Glu、Ala或Pro]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (7)..(7)]]> <![CDATA[<223> Xaa為Glu、Gly、Asn、Thr或Ala]]> <![CDATA[<400> 16]]> Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1 5 <![CDATA[<210> 17]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> CAM640-01之454-460區]]> <![CDATA[<400> 17]]> His Ala Pro Arg Val Glu Glu 1 5 <![CDATA[<210> 18]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> CAM640-02之454-460區]]> <![CDATA[<400> 18]]> Met Ala Pro Arg Gln Glu Gly 1 5 <![CDATA[<210> 19]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> CAM640-03之454-460區]]> <![CDATA[<400> 19]]> His Thr Thr Asp Cys Ala Asn 1 5 <![CDATA[<210> 20]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> CAM640-04之454-460區]]> <![CDATA[<400> 20]]> Ala Ala Pro Arg Ser Glu Thr 1 5 <![CDATA[<210> 21]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> CAM640-05之454-460區]]> <![CDATA[<400> 21]]> Gln Ala Pro Arg Gln Glu Gly 1 5 <![CDATA[<210> 22]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> CAM640-06之454-460區]]> <![CDATA[<400> 22]]> Val Ala Pro Arg Asp Pro Ala 1 5 <![CDATA[<210> 23]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> CAM640-07之454-460區]]> <![CDATA[<400> 23]]> Ser Ala Pro Arg Ser Glu Asn 1 5 <![CDATA[<210> 24]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 454-460區]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (1)..(7)]]> <![CDATA[<223> Xaa為任何胺基酸]]> <![CDATA[<400> 24]]> Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1 5 <![CDATA[<210> 25]]> <![CDATA[<211> 736]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 依賴細小病毒屬腺相關依賴細小病毒A (Dependoparvovirus Adeno-associated dependoparvovirus A)]]> <![CDATA[<400> 25]]> Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 1 5 10 15 Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 20 25 30 Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 35 40 45 Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 50 55 60 Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 65 70 75 80 Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 85 90 95 Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 100 105 110 Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 115 120 125 Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 130 135 140 Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly 145 150 155 160 Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 165 170 175 Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 180 185 190 Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 195 200 205 Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 210 215 220 Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile 225 230 235 240 Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 245 250 255 Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 260 265 270 Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 275 280 285 His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn 290 295 300 Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln 305 310 315 320 Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn Asn 325 330 335 Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu Pro 340 345 350 Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala 355 360 365 Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 370 375 380 Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 385 390 395 400 Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 405 410 415 Glu Glu Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp 420 425 430 Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg 435 440 445 Thr Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu Phe Ser 450 455 460 Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu Pro 465 470 475 480 Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp Asn 485 490 495 Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 500 505 510 Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His Lys 515 520 525 Asp Asp Glu Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe Gly 530 535 540 Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met Ile 545 550 555 560 Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 565 570 575 Phe Gly Thr Val Ala Val Asn Phe Gln Ser Ser Ser Thr Asp Pro Ala 580 585 590 Thr Gly Asp Val His Ala Met Gly Ala Leu Pro Gly Met Val Trp Gln 595 600 605 Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 610 615 620 Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 625 630 635 640 Lys Asn Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 645 650 655 Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile Thr 660 665 670 Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 675 680 685 Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser Asn 690 695 700 Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu 705 710 715 720 Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu 725 730 735 <![CDATA[<210> 26]]> <![CDATA[<211> 735]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 依賴細小病毒屬腺相關依賴細小病毒A (Dependoparvovirus Adeno-associated dependoparvovirus A)]]> <![CDATA[<400> 26]]> Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 1 5 10 15 Glu Gly Ile Arg Gln Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 20 25 30 Lys Pro Ala Glu Arg His Lys Asp Asp Ser Arg Gly Leu Val Leu Pro 35 40 45 Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 50 55 60 Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 65 70 75 80 Arg Gln Leu Asp Ser Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 85 90 95 Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 100 105 110 Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 115 120 125 Leu Gly Leu Val Glu Glu Pro Val Lys Thr Ala Pro Gly Lys Lys Arg 130 135 140 Pro Val Glu His Ser Pro Val Glu Pro Asp Ser Ser Ser Gly Thr Gly 145 150 155 160 Lys Ala Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln Thr 165 170 175 Gly Asp Ala Asp Ser Val Pro Asp Pro Gln Pro Leu Gly Gln Pro Pro 180 185 190 Ala Ala Pro Ser Gly Leu Gly Thr Asn Thr Met Ala Thr Gly Ser Gly 195 200 205 Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 210 215 220 Ser Gly Asn Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val Ile 225 230 235 240 Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 245 250 255 Tyr Lys Gln Ile Ser Ser Gln Ser Gly Ala Ser Asn Asp Asn His Tyr 260 265 270 Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 275 280 285 Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn Trp 290 295 300 Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln Val 305 310 315 320 Lys Glu Val Thr Gln Asn Asp Gly Thr Thr Thr Ile Ala Asn Asn Leu 325 330 335 Thr Ser Thr Val Gln Val Phe Thr Asp Ser Glu Tyr Gln Leu Pro Tyr 340 345 350 Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala Asp 355 360 365 Val Phe Met Val Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 370 375 380 Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser 385 390 395 400 Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu 405 410 415 Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp Arg 420 425 430 Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser Arg Thr 435 440 445 Asn Thr Pro Ser Gly Thr Thr Thr Gln Ser Arg Leu Gln Phe Ser Gln 450 455 460 Ala Gly Ala Ser Asp Ile Arg Asp Gln Ser Arg Asn Trp Leu Pro Gly 465 470 475 480 Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Ser Ala Asp Asn Asn 485 490 495 Asn Ser Glu Tyr Ser Trp Thr Gly Ala Thr Lys Tyr His Leu Asn Gly 500 505 510 Arg Asp Ser Leu Val Asn Pro Gly Pro Ala Met Ala Ser His Lys Asp 515 520 525 Asp Glu Glu Lys Phe Phe Pro Gln Ser Gly Val Leu Ile Phe Gly Lys 530 535 540 Gln Gly Ser Glu Lys Thr Asn Val Asp Ile Glu Lys Val Met Ile Thr 545 550 555 560 Asp Glu Glu Glu Ile Arg Thr Thr Asn Pro Val Ala Thr Glu Gln Tyr 565 570 575 Gly Ser Val Ser Thr Asn Leu Gln Arg Gly Asn Arg Gln Ala Ala Thr 580 585 590 Ala Asp Val Asn Thr Gln Gly Val Leu Pro Gly Met Val Trp Gln Asp 595 600 605 Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His Thr 610 615 620 Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys 625 630 635 640 His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala Asn 645 650 655 Pro Ser Thr Thr Phe Ser Ala Ala Lys Phe Ala Ser Phe Ile Thr Gln 660 665 670 Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln Lys 675 680 685 Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn Tyr 690 695 700 Asn Lys Ser Val Asn Val Asp Phe Thr Val Asp Thr Asn Gly Val Tyr 705 710 715 720 Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 725 730 735 <![CDATA[<210> 27]]> <![CDATA[<211> 736]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 依賴細小病毒屬腺相關依賴細小病毒A (Dependoparvovirus Adeno-associated dependoparvovirus A)]]> <![CDATA[<400> 27]]> Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 1 5 10 15 Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Val Pro Gln Pro 20 25 30 Lys Ala Asn Gln Gln His Gln Asp Asn Arg Arg Gly Leu Val Leu Pro 35 40 45 Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 50 55 60 Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 65 70 75 80 Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 85 90 95 Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 100 105 110 Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Ile Leu Glu Pro 115 120 125 Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 130 135 140 Pro Val Asp Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Val Gly 145 150 155 160 Lys Ser Gly Lys Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln Thr 165 170 175 Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro 180 185 190 Ala Ala Pro Thr Ser Leu Gly Ser Asn Thr Met Ala Ser Gly Gly Gly 195 200 205 Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 210 215 220 Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile 225 230 235 240 Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 245 250 255 Tyr Lys Gln Ile Ser Ser Gln Ser Gly Ala Ser Asn Asp Asn His Tyr 260 265 270 Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 275 280 285 Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn Trp 290 295 300 Gly Phe Arg Pro Lys Lys Leu Ser Phe Lys Leu Phe Asn Ile Gln Val 305 310 315 320 Lys Glu Val Thr Gln Asn Asp Gly Thr Thr Thr Ile Ala Asn Asn Leu 325 330 335 Thr Ser Thr Val Gln Val Phe Thr Asp Ser Glu Tyr Gln Leu Pro Tyr 340 345 350 Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala Asp 355 360 365 Val Phe Met Val Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 370 375 380 Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser 385 390 395 400 Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Thr Phe Glu 405 410 415 Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp Arg 420 425 430 Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg Thr 435 440 445 Gln Gly Thr Thr Ser Gly Thr Thr Asn Gln Ser Arg Leu Leu Phe Ser 450 455 460 Gln Ala Gly Pro Gln Ser Met Ser Leu Gln Ala Arg Asn Trp Leu Pro 465 470 475 480 Gly Pro Cys Tyr Arg Gln Gln Arg Leu Ser Lys Thr Ala Asn Asp Asn 485 490 495 Asn Asn Ser Asn Phe Pro Trp Thr Ala Ala Ser Lys Tyr His Leu Asn 500 505 510 Gly Arg Asp Ser Leu Val Asn Pro Gly Pro Ala Met Ala Ser His Lys 515 520 525 Asp Asp Glu Glu Lys Phe Phe Pro Met His Gly Asn Leu Ile Phe Gly 530 535 540 Lys Glu Gly Thr Thr Ala Ser Asn Ala Glu Leu Asp Asn Val Met Ile 545 550 555 560 Thr Asp Glu Glu Glu Ile Arg Thr Thr Asn Pro Val Ala Thr Glu Gln 565 570 575 Tyr Gly Thr Val Ala Asn Asn Leu Gln Ser Ser Asn Thr Ala Pro Thr 580 585 590 Thr Arg Thr Val Asn Asp Gln Gly Ala Leu Pro Gly Met Val Trp Gln 595 600 605 Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 610 615 620 Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 625 630 635 640 Lys His Pro Pro Pro Gln Ile Met Ile Lys Asn Thr Pro Val Pro Ala 645 650 655 Asn Pro Pro Thr Thr Phe Ser Pro Ala Lys Phe Ala Ser Phe Ile Thr 660 665 670 Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 675 680 685 Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 690 695 700 Tyr Asn Lys Ser Val Asn Val Asp Phe Thr Val Asp Thr Asn Gly Val 705 710 715 720 Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 725 730 735 <![CDATA[<210> 28]]> <![CDATA[<211> 734]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 依賴細小病毒屬腺相關依賴細小病毒A (Dependoparvovirus Adeno-associated dependoparvovirus A)]]> <![CDATA[<400> 28]]> Met Thr Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser Glu 1 5 10 15 Gly Val Arg Glu Trp Trp Ala Leu Gln Pro Gly Ala Pro Lys Pro Lys 20 25 30 Ala Asn Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro Gly 35 40 45 Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro Val 50 55 60 Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp Gln 65 70 75 80 Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala Asp 85 90 95 Ala Glu Phe Gln Gln Arg Leu Gln Gly Asp Thr Ser Phe Gly Gly Asn 100 105 110 Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro Leu 115 120 125 Gly Leu Val Glu Gln Ala Gly Glu Thr Ala Pro Gly Lys Lys Arg Pro 130 135 140 Leu Ile Glu Ser Pro Gln Gln Pro Asp Ser Ser Thr Gly Ile Gly Lys 145 150 155 160 Lys Gly Lys Gln Pro Ala Lys Lys Lys Leu Val Phe Glu Asp Glu Thr 165 170 175 Gly Ala Gly Asp Gly Pro Pro Glu Gly Ser Thr Ser Gly Ala Met Ser 180 185 190 Asp Asp Ser Glu Met Arg Ala Ala Ala Gly Gly Ala Ala Val Glu Gly 195 200 205 Gly Gln Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asp Trp His Cys 210 215 220 Asp Ser Thr Trp Ser Glu Gly His Val Thr Thr Thr Ser Thr Arg Thr 225 230 235 240 Trp Val Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Arg Leu Gly Glu 245 250 255 Ser Leu Gln Ser Asn Thr Tyr Asn Gly Phe Ser Thr Pro Trp Gly Tyr 260 265 270 Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln 275 280 285 Arg Leu Ile Asn Asn Asn Trp Gly Met Arg Pro Lys Ala Met Arg Val 290 295 300 Lys Ile Phe Asn Ile Gln Val Lys Glu Val Thr Thr Ser Asn Gly Glu 305 310 315 320 Thr Thr Val Ala Asn Asn Leu Thr Ser Thr Val Gln Ile Phe Ala Asp 325 330 335 Ser Ser Tyr Glu Leu Pro Tyr Val Met Asp Ala Gly Gln Glu Gly Ser 340 345 350 Leu Pro Pro Phe Pro Asn Asp Val Phe Met Val Pro Gln Tyr Gly Tyr 355 360 365 Cys Gly Leu Val Thr Gly Asn Thr Ser Gln Gln Gln Thr Asp Arg Asn 370 375 380 Ala Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly 385 390 395 400 Asn Asn Phe Glu Ile Thr Tyr Ser Phe Glu Lys Val Pro Phe His Ser 405 410 415 Met Tyr Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Ile 420 425 430 Asp Gln Tyr Leu Trp Gly Leu Gln Ser Thr Thr Thr Gly Thr Thr Leu 435 440 445 Asn Ala Gly Thr Ala Thr Thr Asn Phe Thr Lys Leu Arg Pro Thr Asn 450 455 460 Phe Ser Asn Phe Lys Lys Asn Trp Leu Pro Gly Pro Ser Ile Lys Gln 465 470 475 480 Gln Gly Phe Ser Lys Thr Ala Asn Gln Asn Tyr Lys Ile Pro Ala Thr 485 490 495 Gly Ser Asp Ser Leu Ile Lys Tyr Glu Thr His Ser Thr Leu Asp Gly 500 505 510 Arg Trp Ser Ala Leu Thr Pro Gly Pro Pro Met Ala Thr Ala Gly Pro 515 520 525 Ala Asp Ser Lys Phe Ser Asn Ser Gln Leu Ile Phe Ala Gly Pro Lys 530 535 540 Gln Asn Gly Asn Thr Ala Thr Val Pro Gly Thr Leu Ile Phe Thr Ser 545 550 555 560 Glu Glu Glu Leu Ala Ala Thr Asn Ala Thr Asp Thr Asp Met Trp Gly 565 570 575 Asn Leu Pro Gly Gly Asp Gln Ser Asn Ser Asn Leu Pro Thr Val Asp 580 585 590 Arg Leu Thr Ala Leu Gly Ala Val Pro Gly Met Val Trp Gln Asn Arg 595 600 605 Asp Ile Tyr Tyr Gln Gly Pro Ile Trp Ala Lys Ile Pro His Thr Asp 610 615 620 Gly His Phe His Pro Ser Pro Leu Ile Gly Gly Phe Gly Leu Lys His 625 630 635 640 Pro Pro Pro Gln Ile Phe Ile Lys Asn Thr Pro Val Pro Ala Asn Pro 645 650 655 Ala Thr Thr Phe Ser Ser Thr Pro Val Asn Ser Phe Ile Thr Gln Tyr 660 665 670 Ser Thr Gly Gln Val Ser Val Gln Ile Asp Trp Glu Ile Gln Lys Glu 675 680 685 Arg Ser Lys Arg Trp Asn Pro Glu Val Gln Phe Thr Ser Asn Tyr Gly 690 695 700 Gln Gln Asn Ser Leu Leu Trp Ala Pro Asp Ala Ala Gly Lys Tyr Thr 705 710 715 720 Glu Pro Arg Ala Ile Gly Thr Arg Tyr Leu Thr His His Leu 725 730 <![CDATA[<210> 29]]> <![CDATA[<211> 724]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 依賴細小病毒屬腺相關依賴細小病毒A (Dependoparvovirus Adeno-associated dependoparvovirus A)]]> <![CDATA[<400> 29]]> Met Ser Phe Val Asp His Pro Pro Asp Trp Leu Glu Glu Val Gly Glu 1 5 10 15 Gly Leu Arg Glu Phe Leu Gly Leu Glu Ala Gly Pro Pro Lys Pro Lys 20 25 30 Pro Asn Gln Gln His Gln Asp Gln Ala Arg Gly Leu Val Leu Pro Gly 35 40 45 Tyr Asn Tyr Leu Gly Pro Gly Asn Gly Leu Asp Arg Gly Glu Pro Val 50 55 60 Asn Arg Ala Asp Glu Val Ala Arg Glu His Asp Ile Ser Tyr Asn Glu 65 70 75 80 Gln Leu Glu Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala Asp 85 90 95 Ala Glu Phe Gln Glu Lys Leu Ala Asp Asp Thr Ser Phe Gly Gly Asn 100 105 110 Leu Gly Lys Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro Phe 115 120 125 Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Thr Gly Lys Arg Ile 130 135 140 Asp Asp His Phe Pro Lys Arg Lys Lys Ala Arg Thr Glu Glu Asp Ser 145 150 155 160 Lys Pro Ser Thr Ser Ser Asp Ala Glu Ala Gly Pro Ser Gly Ser Gln 165 170 175 Gln Leu Gln Ile Pro Ala Gln Pro Ala Ser Ser Leu Gly Ala Asp Thr 180 185 190 Met Ser Ala Gly Gly Gly Gly Pro Leu Gly Asp Asn Asn Gln Gly Ala 195 200 205 Asp Gly Val Gly Asn Ala Ser Gly Asp Trp His Cys Asp Ser Thr Trp 210 215 220 Met Gly Asp Arg Val Val Thr Lys Ser Thr Arg Thr Trp Val Leu Pro 225 230 235 240 Ser Tyr Asn Asn His Gln Tyr Arg Glu Ile Lys Ser Gly Ser Val Asp 245 250 255 Gly Ser Asn Ala Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 260 265 270 Phe Asp Phe Asn Arg Phe His Ser His Trp Ser Pro Arg Asp Trp Gln 275 280 285 Arg Leu Ile Asn Asn Tyr Trp Gly Phe Arg Pro Arg Ser Leu Arg Val 290 295 300 Lys Ile Phe Asn Ile Gln Val Lys Glu Val Thr Val Gln Asp Ser Thr 305 310 315 320 Thr Thr Ile Ala Asn Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp 325 330 335 Asp Asp Tyr Gln Leu Pro Tyr Val Val Gly Asn Gly Thr Glu Gly Cys 340 345 350 Leu Pro Ala Phe Pro Pro Gln Val Phe Thr Leu Pro Gln Tyr Gly Tyr 355 360 365 Ala Thr Leu Asn Arg Asp Asn Thr Glu Asn Pro Thr Glu Arg Ser Ser 370 375 380 Phe Phe Cys Leu Glu Tyr Phe Pro Ser Lys Met Leu Arg Thr Gly Asn 385 390 395 400 Asn Phe Glu Phe Thr Tyr Asn Phe Glu Glu Val Pro Phe His Ser Ser 405 410 415 Phe Ala Pro Ser Gln Asn Leu Phe Lys Leu Ala Asn Pro Leu Val Asp 420 425 430 Gln Tyr Leu Tyr Arg Phe Val Ser Thr Asn Asn Thr Gly Gly Val Gln 435 440 445 Phe Asn Lys Asn Leu Ala Gly Arg Tyr Ala Asn Thr Tyr Lys Asn Trp 450 455 460 Phe Pro Gly Pro Met Gly Arg Thr Gln Gly Trp Asn Leu Gly Ser Gly 465 470 475 480 Val Asn Arg Ala Ser Val Ser Ala Phe Ala Thr Thr Asn Arg Met Glu 485 490 495 Leu Glu Gly Ala Ser Tyr Gln Val Pro Pro Gln Pro Asn Gly Met Thr 500 505 510 Asn Asn Leu Gln Gly Ser Asn Thr Tyr Ala Leu Glu Asn Thr Met Ile 515 520 525 Phe Asn Ser Gln Pro Ala Asn Pro Gly Thr Thr Ala Thr Tyr Leu Glu 530 535 540 Gly Asn Met Leu Ile Thr Ser Glu Ser Glu Thr Gln Pro Val Asn Arg 545 550 555 560 Val Ala Tyr Asn Val Gly Gly Gln Met Ala Thr Asn Asn Gln Ser Ser 565 570 575 Thr Thr Ala Pro Ala Thr Gly Thr Tyr Asn Leu Gln Glu Ile Val Pro 580 585 590 Gly Ser Val Trp Met Glu Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp 595 600 605 Ala Lys Ile Pro Glu Thr Gly Ala His Phe His Pro Ser Pro Ala Met 610 615 620 Gly Gly Phe Gly Leu Lys His Pro Pro Pro Met Met Leu Ile Lys Asn 625 630 635 640 Thr Pro Val Pro Gly Asn Ile Thr Ser Phe Ser Asp Val Pro Val Ser 645 650 655 Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Thr Val Glu Met Glu 660 665 670 Trp Glu Leu Lys Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln 675 680 685 Tyr Thr Asn Asn Tyr Asn Asp Pro Gln Phe Val Asp Phe Ala Pro Asp 690 695 700 Ser Thr Gly Glu Tyr Arg Thr Thr Arg Pro Ile Gly Thr Arg Tyr Leu 705 710 715 720 Thr Arg Pro Leu <![CDATA[<210> 30]]> <![CDATA[<211> 737]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 依賴細小病毒屬腺相關依賴細小病毒A (Dependoparvovirus Adeno-associated dependoparvovirus A)]]> <![CDATA[<400> 30]]> Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 1 5 10 15 Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 20 25 30 Lys Ala Asn Gln Gln Lys Gln Asp Asn Gly Arg Gly Leu Val Leu Pro 35 40 45 Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 50 55 60 Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 65 70 75 80 Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 85 90 95 Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 100 105 110 Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 115 120 125 Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Ala Lys Lys Arg 130 135 140 Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 145 150 155 160 Gly Lys Lys Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln 165 170 175 Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro 180 185 190 Pro Ala Ala Pro Ser Ser Val Gly Ser Gly Thr Val Ala Ala Gly Gly 195 200 205 Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn 210 215 220 Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 225 230 235 240 Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 245 250 255 Leu Tyr Lys Gln Ile Ser Ser Glu Thr Ala Gly Ser Thr Asn Asp Asn 260 265 270 Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 275 280 285 Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 290 295 300 Asn Trp Gly Phe Arg Pro Lys Lys Leu Arg Phe Lys Leu Phe Asn Ile 305 310 315 320 Gln Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn 325 330 335 Asn Leu Thr Ser Thr Ile Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu 340 345 350 Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro 355 360 365 Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn 370 375 380 Gly Ser Gln Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 385 390 395 400 Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr Ser 405 410 415 Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 420 425 430 Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ala 435 440 445 Arg Thr Gln Ser Asn Pro Gly Gly Thr Ala Gly Asn Arg Glu Leu Gln 450 455 460 Phe Tyr Gln Gly Gly Pro Ser Thr Met Ala Glu Gln Ala Lys Asn Trp 465 470 475 480 Leu Pro Gly Pro Cys Phe Arg Gln Gln Arg Val Ser Lys Thr Leu Asp 485 490 495 Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 500 505 510 Leu Asn Gly Arg Asn Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 515 520 525 His Lys Asp Asp Glu Asp Arg Phe Phe Pro Ser Ser Gly Val Leu Ile 530 535 540 Phe Gly Lys Thr Gly Ala Thr Asn Lys Thr Thr Leu Glu Asn Val Leu 545 550 555 560 Met Thr Asn Glu Glu Glu Ile Arg Pro Thr Asn Pro Val Ala Thr Glu 565 570 575 Glu Tyr Gly Ile Val Ser Ser Asn Leu Gln Ala Ala Asn Thr Ala Ala 580 585 590 Gln Thr Gln Val Val Asn Asn Gln Gly Ala Leu Pro Gly Met Val Trp 595 600 605 Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro 610 615 620 His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly 625 630 635 640 Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro 645 650 655 Ala Asn Pro Pro Glu Val Phe Thr Pro Ala Lys Phe Ala Ser Phe Ile 660 665 670 Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu 675 680 685 Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser 690 695 700 Asn Phe Glu Lys Gln Thr Gly Val Asp Phe Ala Val Asp Ser Gln Gly 705 710 715 720 Val Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn 725 730 735 Leu <![CDATA[<210> 31]]> <![CDATA[<211> 738]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 依賴細小病毒屬腺相關依賴細小病毒A (Dependoparvovirus Adeno-associated dependoparvovirus A)]]> <![CDATA[<400> 31]]> Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 1 5 10 15 Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 20 25 30 Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 35 40 45 Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 50 55 60 Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 65 70 75 80 Gln Gln Leu Gln Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 85 90 95 Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 100 105 110 Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 115 120 125 Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 130 135 140 Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 145 150 155 160 Gly Lys Lys Gly Gln Gln Pro Ala Arg Lys Arg Leu Asn Phe Gly Gln 165 170 175 Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro 180 185 190 Pro Ala Ala Pro Ser Gly Val Gly Pro Asn Thr Met Ala Ala Gly Gly 195 200 205 Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 210 215 220 Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 225 230 235 240 Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 245 250 255 Leu Tyr Lys Gln Ile Ser Asn Gly Thr Ser Gly Gly Ala Thr Asn Asp 260 265 270 Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 275 280 285 Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn 290 295 300 Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Ser Phe Lys Leu Phe Asn 305 310 315 320 Ile Gln Val Lys Glu Val Thr Gln Asn Glu Gly Thr Lys Thr Ile Ala 325 330 335 Asn Asn Leu Thr Ser Thr Ile Gln Val Phe Thr Asp Ser Glu Tyr Gln 340 345 350 Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe 355 360 365 Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn 370 375 380 Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 385 390 395 400 Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Thr Tyr 405 410 415 Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser 420 425 430 Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu 435 440 445 Ser Arg Thr Gln Thr Thr Gly Gly Thr Ala Asn Thr Gln Thr Leu Gly 450 455 460 Phe Ser Gln Gly Gly Pro Asn Thr Met Ala Asn Gln Ala Lys Asn Trp 465 470 475 480 Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Thr Gly 485 490 495 Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His 500 505 510 Leu Asn Gly Arg Asn Ser Leu Ala Asn Pro Gly Ile Ala Met Ala Thr 515 520 525 His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Asn Gly Ile Leu Ile 530 535 540 Phe Gly Lys Gln Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val 545 550 555 560 Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr 565 570 575 Glu Glu Tyr Gly Ile Val Ala Asp Asn Leu Gln Gln Gln Asn Thr Ala 580 585 590 Pro Gln Ile Gly Thr Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val 595 600 605 Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile 610 615 620 Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 625 630 635 640 Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val 645 650 655 Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ser Lys Leu Asn Ser Phe 660 665 670 Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu 675 680 685 Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr 690 695 700 Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu 705 710 715 720 Gly Val Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg 725 730 735 Asn Leu <![CDATA[<210> 32]]> <![CDATA[<211> 736]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 依賴細小病毒屬腺相關依賴細小病毒A (Dependoparvovirus Adeno-associated dependoparvovirus A)]]> <![CDATA[<400> 32]]> Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 1 5 10 15 Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Gln Pro 20 25 30 Lys Ala Asn Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro 35 40 45 Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 50 55 60 Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 65 70 75 80 Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 85 90 95 Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 100 105 110 Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 115 120 125 Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 130 135 140 Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile Gly 145 150 155 160 Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 165 170 175 Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro 180 185 190 Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly 195 200 205 Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser 210 215 220 Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile 225 230 235 240 Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 245 250 255 Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn 260 265 270 Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 275 280 285 Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 290 295 300 Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 305 310 315 320 Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 325 330 335 Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 340 345 350 Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro 355 360 365 Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 370 375 380 Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 385 390 395 400 Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Glu 405 410 415 Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 420 425 430 Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 435 440 445 Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 450 455 460 Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Asn Tyr Ile Pro 465 470 475 480 Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn 485 490 495 Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Asn 500 505 510 Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 515 520 525 Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 530 535 540 Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile 545 550 555 560 Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Ser 565 570 575 Tyr Gly Gln Val Ala Thr Asn His Gln Ser Ala Gln Ala Gln Ala Gln 580 585 590 Thr Gly Trp Val Gln Asn Gln Gly Ile Leu Pro Gly Met Val Trp Gln 595 600 605 Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 610 615 620 Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Met 625 630 635 640 Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 645 650 655 Asp Pro Pro Thr Ala Phe Asn Lys Asp Lys Leu Asn Ser Phe Ile Thr 660 665 670 Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 675 680 685 Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 690 695 700 Tyr Tyr Lys Ser Asn Asn Val Glu Phe Ala Val Asn Thr Glu Gly Val 705 710 715 720 Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 725 730 735 <![CDATA[<210> 33]]> <![CDATA[<211> 738]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 依賴細小病毒屬腺相關依賴細小病毒A (Dependoparvovirus Adeno-associated dependoparvovirus A)]]> <![CDATA[<400> 33]]> Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 1 5 10 15 Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 20 25 30 Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 35 40 45 Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 50 55 60 Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 65 70 75 80 Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 85 90 95 Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 100 105 110 Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 115 120 125 Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 130 135 140 Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 145 150 155 160 Gly Lys Lys Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln 165 170 175 Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro 180 185 190 Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 195 200 205 Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 210 215 220 Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 225 230 235 240 Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 245 250 255 Leu Tyr Lys Gln Ile Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 260 265 270 Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 275 280 285 Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn 290 295 300 Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 305 310 315 320 Ile Gln Val Lys Glu Val Thr Gln Asn Glu Gly Thr Lys Thr Ile Ala 325 330 335 Asn Asn Leu Thr Ser Thr Ile Gln Val Phe Thr Asp Ser Glu Tyr Gln 340 345 350 Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe 355 360 365 Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn 370 375 380 Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 385 390 395 400 Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 405 410 415 Gln Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser 420 425 430 Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu 435 440 445 Ser Arg Thr Gln Ser Thr Gly Gly Thr Ala Gly Thr Gln Gln Leu Leu 450 455 460 Phe Ser Gln Ala Gly Pro Asn Asn Met Ser Ala Gln Ala Lys Asn Trp 465 470 475 480 Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Leu Ser 485 490 495 Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 500 505 510 Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 515 520 525 His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 530 535 540 Phe Gly Lys Gln Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 545 550 555 560 Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr 565 570 575 Glu Gln Tyr Gly Val Val Ala Asp Asn Leu Gln Gln Gln Asn Ala Ala 580 585 590 Pro Ile Val Gly Ala Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val 595 600 605 Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile 610 615 620 Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 625 630 635 640 Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val 645 650 655 Pro Ala Asp Pro Pro Thr Thr Phe Ser Gln Ala Lys Leu Ala Ser Phe 660 665 670 Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu 675 680 685 Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr 690 695 700 Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Asp 705 710 715 720 Gly Thr Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg 725 730 735 Asn Leu <![CDATA[<210> 34]]> <![CDATA[<211> 733]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 依賴細小病毒屬腺相關依賴細小病毒A (Dependoparvovirus Adeno-associated dependoparvovirus A)]]> <![CDATA[<400> 34]]> Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 1 5 10 15 Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 20 25 30 Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro 35 40 45 Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 50 55 60 Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 65 70 75 80 Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 85 90 95 Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 100 105 110 Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 115 120 125 Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 130 135 140 Pro Leu Glu Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly Lys 145 150 155 160 Lys Gly Lys Gln Pro Ala Arg Lys Arg Leu Asn Phe Glu Glu Asp Thr 165 170 175 Gly Ala Gly Asp Gly Pro Pro Glu Gly Ser Asp Thr Ser Ala Met Ser 180 185 190 Ser Asp Ile Glu Met Arg Ala Ala Pro Gly Gly Asn Ala Val Asp Ala 195 200 205 Gly Gln Gly Ser Asp Gly Val Gly Asn Ala Ser Gly Asp Trp His Cys 210 215 220 Asp Ser Thr Trp Ser Glu Gly Lys Val Thr Thr Thr Ser Thr Arg Thr 225 230 235 240 Trp Val Leu Pro Thr Tyr Asn Asn His Leu Tyr Leu Arg Leu Gly Thr 245 250 255 Thr Ser Ser Ser Asn Thr Tyr Asn Gly Phe Ser Thr Pro Trp Gly Tyr 260 265 270 Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln 275 280 285 Arg Leu Ile Asn Asn Asn Trp Gly Leu Arg Pro Lys Ala Met Arg Val 290 295 300 Lys Ile Phe Asn Ile Gln Val Lys Glu Val Thr Thr Ser Asn Gly Glu 305 310 315 320 Thr Thr Val Ala Asn Asn Leu Thr Ser Thr Val Gln Ile Phe Ala Asp 325 330 335 Ser Ser Tyr Glu Leu Pro Tyr Val Met Asp Ala Gly Gln Glu Gly Ser 340 345 350 Leu Pro Pro Phe Pro Asn Asp Val Phe Met Val Pro Gln Tyr Gly Tyr 355 360 365 Cys Gly Ile Val Thr Gly Glu Asn Gln Asn Gln Thr Asp Arg Asn Ala 370 375 380 Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gln Met Leu Arg Thr Gly Asn 385 390 395 400 Asn Phe Glu Met Ala Tyr Asn Phe Glu Lys Val Pro Phe His Ser Met 405 410 415 Tyr Ala His Ser Gln Ser Leu Asp Arg Leu Met Asn Pro Leu Leu Asp 420 425 430 Gln Tyr Leu Trp His Leu Gln Ser Thr Thr Ser Gly Glu Thr Leu Asn 435 440 445 Gln Gly Asn Ala Ala Thr Thr Phe Gly Lys Ile Arg Ser Gly Asp Phe 450 455 460 Ala Phe Tyr Arg Lys Asn Trp Leu Pro Gly Pro Cys Val Lys Gln Gln 465 470 475 480 Arg Phe Ser Lys Thr Ala Ser Gln Asn Tyr Lys Ile Pro Ala Ser Gly 485 490 495 Gly Asn Ala Leu Leu Lys Tyr Asp Thr His Tyr Thr Leu Asn Asn Arg 500 505 510 Trp Ser Asn Ile Ala Pro Gly Pro Pro Met Ala Thr Ala Gly Pro Ser 515 520 525 Asp Gly Asp Phe Ser Asn Ala Gln Leu Ile Phe Pro Gly Pro Ser Val 530 535 540 Thr Gly Asn Thr Thr Thr Ser Ala Asn Asn Leu Leu Phe Thr Ser Glu 545 550 555 560 Glu Glu Ile Ala Ala Thr Asn Pro Arg Asp Thr Asp Met Phe Gly Gln 565 570 575 Ile Ala Asp Asn Asn Gln Asn Ala Thr Thr Ala Pro Ile Thr Gly Asn 580 585 590 Val Thr Ala Met Gly Val Leu Pro Gly Met Val Trp Gln Asn Arg Asp 595 600 605 Ile Tyr Tyr Gln Gly Pro Ile Trp Ala Lys Ile Pro His Ala Asp Gly 610 615 620 His Phe His Pro Ser Pro Leu Ile Gly Gly Phe Gly Leu Lys His Pro 625 630 635 640 Pro Pro Gln Ile Phe Ile Lys Asn Thr Pro Val Pro Ala Asn Pro Ala 645 650 655 Thr Thr Phe Thr Ala Ala Arg Val Asp Ser Phe Ile Thr Gln Tyr Ser 660 665 670 Thr Gly Gln Val Ala Val Gln Ile Glu Trp Glu Ile Glu Lys Glu Arg 675 680 685 Ser Lys Arg Trp Asn Pro Glu Val Gln Phe Thr Ser Asn Tyr Gly Asn 690 695 700 Gln Ser Ser Met Leu Trp Ala Pro Asp Thr Thr Gly Lys Tyr Thr Glu 705 710 715 720 Pro Arg Val Ile Gly Ser Arg Tyr Leu Thr Asn His Leu 725 730 <![CDATA[<210> 35]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (7)..(7)]]> <![CDATA[<223> Xaa為Gly或Ser]]> <![CDATA[<400> 35]]> Asn Ser Val Arg Asp Leu Xaa 1 5 <![CDATA[<210> 36]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 36]]> Pro Arg Ser Val Thr Val Pro 1 5 <![CDATA[<210> 37]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (6)..(6)]]> <![CDATA[<223> Xaa為任何胺基酸]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (7)..(7)]]> <![CDATA[<223> Xaa為Ser或Ala]]> <![CDATA[<400> 37]]> Asn Ser Val Ser Ser Xaa Xaa 1 5 <![CDATA[<210> 38]]> <![CDATA[<211> 6]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 38]]> Asn Gly Arg Ala His Ala 1 5 <![CDATA[<210> 39]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 39]]> Gln Pro Glu His Ser Ser Thr 1 5 <![CDATA[<210> 40]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 40]]> Val Asn Thr Ala Asn Ser Thr 1 5 <![CDATA[<210> 41]]> <![CDATA[<211> 6]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 41]]> His Gly Pro Met Gln Ser 1 5 <![CDATA[<210> 42]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 42]]> Pro His Lys Pro Pro Leu Ala 1 5 <![CDATA[<210> 43]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 43]]> Ile Lys Asn Asn Glu Met Trp 1 5 <![CDATA[<210> 44]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 44]]> Arg Asn Leu Asp Thr Pro Met 1 5 <![CDATA[<210> 45]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 45]]> Val Asp Ser His Arg Gln Ser 1 5 <![CDATA[<210> 46]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 46]]> Tyr Asp Ser Lys Thr Lys Thr 1 5 <![CDATA[<210> 47]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 47]]> Ser Gln Leu Pro His Gln Lys 1 5 <![CDATA[<210> 48]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 48]]> Ser Thr Met Gln Gln Asn Thr 1 5 <![CDATA[<210> 49]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 49]]> Thr Glu Arg Tyr Met Thr Gln 1 5 <![CDATA[<210> 50]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 50]]> Gln Pro Glu His Ser Ser Thr 1 5 <![CDATA[<210> 51]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 51]]> Asp Ala Ser Leu Ser Thr Ser 1 5 <![CDATA[<210> 52]]> <![CDATA[<211> 6]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 52]]> Asp Leu Pro Asn Lys Thr 1 5 <![CDATA[<210> 53]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 53]]> Asp Leu Thr Ala Ala Arg Leu 1 5 <![CDATA[<210> 54]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 54]]> Glu Pro His Gln Phe Asn Tyr 1 5 <![CDATA[<210> 55]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 55]]> Glu Pro Gln Ser Asn His Thr 1 5 <![CDATA[<210> 56]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 56]]> Met Ser Ser Trp Pro Ser Gln 1 5 <![CDATA[<210> 57]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 57]]> Asn Pro Lys His Asn Ala Thr 1 5 <![CDATA[<210> 58]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 58]]> Pro Asp Gly Met Arg Thr Thr 1 5 <![CDATA[<210> 59]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 59]]> Pro Asn Asn Asn Lys Thr Thr 1 5 <![CDATA[<210> 60]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 60]]> Gln Ser Thr Thr His Asp Ser 1 5 <![CDATA[<210> 61]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 61]]> Thr Gly Ser Lys Gln Lys Gln 1 5 <![CDATA[<210> 62]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 62]]> Ser Leu Lys His Gln Ala Leu 1 5 <![CDATA[<210> 63]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 63]]> Ser Pro Ile Asp Gly Glu Gln 1 5 <![CDATA[<210> 64]]> <![CDATA[<211> 8]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 64]]> Trp Ile Phe Pro Trp Ile Gln Leu 1 5 <![CDATA[<210> 65]]> <![CDATA[<211> 9]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 65]]> Cys Asp Cys Arg Gly Asp Cys Phe Cys 1 5 <![CDATA[<210> 66]]> <![CDATA[<211> 5]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 66]]> Cys Asn Gly Arg Cys 1 5 <![CDATA[<210> 67]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 67]]> Cys Pro Arg Glu Cys Glu Ser 1 5 <![CDATA[<210> 68]]> <![CDATA[<211> 10]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 68]]> Cys Thr Thr His Trp Gly Phe Thr Leu Cys 1 5 10 <![CDATA[<210> 69]]> <![CDATA[<211> 9]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 69]]> Cys Gly Arg Arg Ala Gly Gly Ser Cys 1 5 <![CDATA[<210> 70]]> <![CDATA[<211> 9]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 70]]> Cys Lys Gly Gly Arg Ala Lys Asp Cys 1 5 <![CDATA[<210> 71]]> <![CDATA[<211> 9]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 71]]> Cys Val Pro Glu Leu Gly His Glu Cys 1 5 <![CDATA[<210> 72]]> <![CDATA[<211> 9]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 72]]> Cys Arg Arg Glu Thr Ala Trp Ala Lys 1 5 <![CDATA[<210> 73]]> <![CDATA[<211> 14]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 73]]> Val Ser Trp Phe Ser His Arg Tyr Ser Pro Phe Ala Val Ser 1 5 10 <![CDATA[<210> 74]]> <![CDATA[<211> 13]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 74]]> Gly Tyr Arg Asp Gly Tyr Ala Gly Pro Ile Leu Tyr Asn 1 5 10 <![CDATA[<210> 75]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (1)..(3)]]> <![CDATA[<223> Xaa為任何胺基酸]]> <![CDATA[<220>]]> <![CDATA[<221> MOD_RES]]> <![CDATA[<222> (4)..(4)]]> <![CDATA[<223> Y為磷酸酪胺酸]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (5)..(7)]]> <![CDATA[<223> Xaa為任何胺基酸]]> <![CDATA[<400> 75]]> Xaa Xaa Xaa Tyr Xaa Xaa Xaa 1 5 <![CDATA[<210> 76]]> <![CDATA[<211> 4]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<220>]]> <![CDATA[<221> MOD_RES]]> <![CDATA[<222> (1)..(1)]]> <![CDATA[<223> Y為磷酸酪胺酸]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (2)..(2)]]> <![CDATA[<223> Xaa為Glu或Met]]> <![CDATA[<400> 76]]> Tyr Xaa Asn Trp 1 <![CDATA[<210> 77]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 77]]> Arg Pro Leu Pro Pro Leu Pro 1 5 <![CDATA[<210> 78]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 78]]> Ala Pro Pro Leu Pro Pro Arg 1 5 <![CDATA[<210> 79]]> <![CDATA[<211> 12]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 79]]> Asp Val Phe Tyr Pro Tyr Pro Tyr Ala Ser Gly Ser 1 5 10 <![CDATA[<210> 80]]> <![CDATA[<211> 6]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 80]]> Met Tyr Trp Tyr Pro Tyr 1 5 <![CDATA[<210> 81]]> <![CDATA[<211> 12]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 81]]> Asp Ile Thr Trp Asp Gln Leu Trp Asp Leu Met Lys 1 5 10 <![CDATA[<210> 82]]> <![CDATA[<211> 8]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (5)..(5)]]> <![CDATA[<223> Xaa為Gly或Leu]]> <![CDATA[<400> 82]]> Cys Trp Asp Asp Xaa Trp Leu Cys 1 5 <![CDATA[<210> 83]]> <![CDATA[<211> 14]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 83]]> Glu Trp Cys Glu Tyr Leu Gly Gly Tyr Leu Arg Cys Tyr Ala 1 5 10 <![CDATA[<210> 84]]> <![CDATA[<211> 14]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (2)..(2)]]> <![CDATA[<223> Xaa為任何胺基酸]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (4)..(5)]]> <![CDATA[<223> Xaa為任何胺基酸]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (8)..(8)]]> <![CDATA[<223> Xaa為任何胺基酸]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (11)..(11)]]> <![CDATA[<223> Xaa為任何胺基酸]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (13)..(13)]]> <![CDATA[<223> Xaa為任何胺基酸]]> <![CDATA[<400> 84]]> Tyr Xaa Cys Xaa Xaa Gly Pro Xaa Thr Trp Xaa Cys Xaa Pro 1 5 10 <![CDATA[<210> 85]]> <![CDATA[<211> 14]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 85]]> Ile Glu Gly Pro Thr Leu Arg Gln Trp Leu Ala Ala Arg Ala 1 5 10 <![CDATA[<210> 86]]> <![CDATA[<211> 5]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (3)..(4)]]> <![CDATA[<223> Xaa為任何胺基酸]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (5)..(5)]]> <![CDATA[<223> Xaa為Tyr、Trp、Phe或His]]> <![CDATA[<400> 86]]> Leu Trp Xaa Xaa Xaa 1 5 <![CDATA[<210> 87]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (1)..(1)]]> <![CDATA[<223> Xaa為任何胺基酸]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (3)..(4)]]> <![CDATA[<223> Xaa為任何胺基酸]]> <![CDATA[<400> 87]]> Xaa Phe Xaa Xaa Tyr Leu Trp 1 5 <![CDATA[<210> 88]]> <![CDATA[<211> 6]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 88]]> Arg Trp Gly Leu Cys Asp 1 5 <![CDATA[<210> 89]]> <![CDATA[<211> 13]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 89]]> Met Ser Arg Pro Ala Cys Pro Pro Asn Asp Lys Tyr Glu 1 5 10 <![CDATA[<210> 90]]> <![CDATA[<211> 8]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 90]]> Cys Leu Arg Ser Gly Arg Gly Cys 1 5 <![CDATA[<210> 91]]> <![CDATA[<211> 9]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 91]]> Cys His Trp Met Phe Ser Pro Trp Cys 1 5 <![CDATA[<210> 92]]> <![CDATA[<211> 4]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (2)..(3)]]> <![CDATA[<223> Xaa為任何胺基酸]]> <![CDATA[<400> 92]]> Trp Xaa Xaa Phe 1 <![CDATA[<210> 93]]> <![CDATA[<211> 8]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 93]]> Cys Ser Ser Arg Leu Asp Ala Cys 1 5 <![CDATA[<210> 94]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 94]]> Cys Leu Pro Val Ala Ser Cys 1 5 <![CDATA[<210> 95]]> <![CDATA[<211> 13]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 95]]> Cys Gly Phe Glu Cys Val Arg Gln Cys Pro Glu Arg Cys 1 5 10 <![CDATA[<210> 96]]> <![CDATA[<211> 13]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 96]]> Cys Val Ala Leu Cys Arg Glu Ala Cys Gly Glu Gly Cys 1 5 10 <![CDATA[<210> 97]]> <![CDATA[<211> 9]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 97]]> Ser Trp Cys Glu Pro Gly Trp Cys Arg 1 5 <![CDATA[<210> 98]]> <![CDATA[<211> 6]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 98]]> Tyr Ser Gly Trp Gly Trp 1 5 <![CDATA[<210> 99]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 99]]> Gly Leu Ser Gly Gly Arg Ser 1 5 <![CDATA[<210> 100]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 100]]> Leu Met Leu Pro Arg Ala Asp 1 5 <![CDATA[<210> 101]]> <![CDATA[<211> 9]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 101]]> Cys Ser Cys Phe Arg Asp Val Cys Cys 1 5 <![CDATA[<210> 102]]> <![CDATA[<211> 9]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 102]]> Cys Arg Asp Val Val Ser Val Ile Cys 1 5 <![CDATA[<210> 103]]> <![CDATA[<211> 5]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 103]]> Cys Asn Gly Arg Cys 1 5 <![CDATA[<210> 104]]> <![CDATA[<211> 6]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 104]]> Met Ala Arg Ser Gly Leu 1 5 <![CDATA[<210> 105]]> <![CDATA[<211> 6]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 105]]> Met Ala Arg Ala Lys Glu 1 5 <![CDATA[<210> 106]]> <![CDATA[<211> 6]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 106]]> Met Ser Arg Thr Met Ser 1 5 <![CDATA[<210> 107]]> <![CDATA[<211> 6]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 107]]> Lys Cys Cys Tyr Ser Leu 1 5 <![CDATA[<210> 108]]> <![CDATA[<211> 14]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 108]]> Met Tyr Trp Gly Asp Ser His Trp Leu Gln Tyr Trp Tyr Glu 1 5 10 <![CDATA[<210> 109]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 109]]> Met Gln Leu Pro Leu Ala Thr 1 5 <![CDATA[<210> 110]]> <![CDATA[<211> 4]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 110]]> Glu Trp Leu Ser 1 <![CDATA[<210> 111]]> <![CDATA[<211> 4]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 111]]> Ser Asn Glu Trp 1 <![CDATA[<210> 112]]> <![CDATA[<211> 4]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 112]]> Thr Asn Tyr Leu 1 <![CDATA[<210> 113]]> <![CDATA[<211> 8]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 113]]> Trp Ile Phe Pro Trp Ile Gln Leu 1 5 <![CDATA[<210> 114]]> <![CDATA[<211> 12]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 114]]> Trp Asp Leu Ala Trp Met Phe Arg Leu Pro Val Gly 1 5 10 <![CDATA[<210> 115]]> <![CDATA[<211> 13]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 115]]> Cys Thr Val Ala Leu Pro Gly Gly Tyr Val Arg Val Cys 1 5 10 <![CDATA[<210> 116]]> <![CDATA[<211> 9]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 116]]> Cys Val Pro Glu Leu Gly His Glu Cys 1 5 <![CDATA[<210> 117]]> <![CDATA[<211> 9]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 117]]> Cys Gly Arg Arg Ala Gly Gly Ser Cys 1 5 <![CDATA[<210> 118]]> <![CDATA[<211> 13]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 118]]> Cys Val Ala Tyr Cys Ile Glu His His Cys Trp Thr Cys 1 5 10 <![CDATA[<210> 119]]> <![CDATA[<211> 12]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 119]]> Cys Val Phe Ala His Asn Tyr Asp Tyr Leu Val Cys 1 5 10 <![CDATA[<210> 120]]> <![CDATA[<211> 10]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 120]]> Cys Val Phe Thr Ser Asn Tyr Ala Phe Cys 1 5 10 <![CDATA[<210> 121]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 121]]> Val His Ser Pro Asn Lys Lys 1 5 <![CDATA[<210> 122]]> <![CDATA[<211> 9]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 122]]> Cys Asp Cys Arg Gly Asp Cys Phe Cys 1 5 <![CDATA[<210> 123]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 123]]> Cys Arg Gly Asp Gly Trp Cys 1 5 <![CDATA[<210> 124]]> <![CDATA[<211> 6]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (1)..(1)]]> <![CDATA[<223> Xaa為任何胺基酸]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (6)..(6)]]> <![CDATA[<223> Xaa為任何胺基酸]]> <![CDATA[<400> 124]]> Xaa Arg Gly Cys Asp Xaa 1 5 <![CDATA[<210> 125]]> <![CDATA[<211> 4]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (2)..(3)]]> <![CDATA[<223> Xaa為任何胺基酸]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (4)..(4)]]> <![CDATA[<223> Xaa為Ser或Thr]]> <![CDATA[<400> 125]]> Pro Xaa Xaa Xaa 1 <![CDATA[<210> 126]]> <![CDATA[<211> 10]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 126]]> Cys Thr Thr His Trp Gly Phe Thr Leu Cys 1 5 10 <![CDATA[<210> 127]]> <![CDATA[<211> 11]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 127]]> Ser Gly Lys Gly Pro Arg Gln Ile Thr Ala Leu 1 5 10 <![CDATA[<210> 128]]> <![CDATA[<211> 6]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (2)..(2)]]> <![CDATA[<223> Xaa為Ala或Gln]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (3)..(3)]]> <![CDATA[<223> Xaa為Asn或Ala]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (4)..(4)]]> <![CDATA[<223> Xaa為Leu或Tyr]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (5)..(5)]]> <![CDATA[<223> Xaa為Thr、Val、Met或Arg]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (6)..(6)]]> <![CDATA[<223> Xaa為Arg或Lys]]> <![CDATA[<400> 128]]> Ala Xaa Xaa Xaa Xaa Xaa 1 5 <![CDATA[<210> 129]]> <![CDATA[<211> 6]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 129]]> Val Tyr Met Ser Pro Phe 1 5 <![CDATA[<210> 130]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 130]]> Met Gln Leu Pro Leu Ala Thr 1 5 <![CDATA[<210> 131]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 131]]> Ala Thr Trp Leu Pro Pro Arg 1 5 <![CDATA[<210> 132]]> <![CDATA[<211> 12]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 132]]> His Thr Met Tyr Tyr His His Tyr Gln His His Leu 1 5 10 <![CDATA[<210> 133]]> <![CDATA[<211> 19]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 133]]> Ser Glu Val Gly Cys Arg Ala Gly Pro Leu Gln Trp Leu Cys Glu Lys 1 5 10 15 Tyr Phe Gly <![CDATA[<210> 134]]> <![CDATA[<211> 18]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 134]]> Cys Gly Leu Leu Pro Val Gly Arg Pro Asp Arg Asn Val Trp Arg Trp 1 5 10 15 Leu Cys <![CDATA[<210> 135]]> <![CDATA[<211> 15]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 135]]> Cys Lys Gly Gln Cys Asp Arg Phe Lys Gly Leu Pro Trp Glu Cys 1 5 10 15 <![CDATA[<210> 136]]> <![CDATA[<211> 5]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 136]]> Ser Gly Arg Ser Ala 1 5 <![CDATA[<210> 137]]> <![CDATA[<211> 4]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 137]]> Trp Gly Phe Pro 1 <![CDATA[<210> 138]]> <![CDATA[<211> 6]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (3)..(4)]]> <![CDATA[<223> Xaa為任何胺基酸]]> <![CDATA[<400> 138]]> Leu Trp Xaa Xaa Ala Arg 1 5 <![CDATA[<210> 139]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (1)..(1)]]> <![CDATA[<223> Xaa為任何胺基酸]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (3)..(4)]]> <![CDATA[<223> Xaa為任何胺基酸]]> <![CDATA[<400> 139]]> Xaa Phe Xaa Xaa Tyr Leu Trp 1 5 <![CDATA[<210> 140]]> <![CDATA[<211> 17]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 140]]> Ala Glu Pro Met Pro His Ser Leu Asn Phe Ser Gln Tyr Leu Trp Tyr 1 5 10 15 Thr <![CDATA[<210> 141]]> <![CDATA[<211> 6]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (4)..(4)]]> <![CDATA[<223> Xaa為Trp或Phe]]> <![CDATA[<400> 141]]> Trp Ala Tyr Xaa Ser Pro 1 5 <![CDATA[<210> 142]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 142]]> Ile Glu Leu Leu Gln Ala Arg 1 5 <![CDATA[<210> 143]]> <![CDATA[<211> 12]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 143]]> Asp Ile Thr Trp Asp Gln Leu Trp Asp Leu Met Lys 1 5 10 <![CDATA[<210> 144]]> <![CDATA[<211> 16]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 144]]> Ala Tyr Thr Lys Cys Ser Arg Gln Trp Arg Thr Cys Met Thr Thr His 1 5 10 15 <![CDATA[<210> 145]]> <![CDATA[<211> 15]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 145]]> Pro Gln Asn Ser Lys Ile Pro Gly Pro Thr Phe Leu Asp Pro His 1 5 10 15 <![CDATA[<210> 146]]> <![CDATA[<211> 15]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 146]]> Ser Met Glu Pro Ala Leu Pro Asp Trp Trp Trp Lys Met Phe Lys 1 5 10 15 <![CDATA[<210> 147]]> <![CDATA[<211> 16]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 147]]> Ala Asn Thr Pro Cys Gly Pro Tyr Thr His Asp Cys Pro Val Lys Arg 1 5 10 15 <![CDATA[<210> 148]]> <![CDATA[<211> 12]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 148]]> Thr Ala Cys His Gln His Val Arg Met Val Arg Pro 1 5 10 <![CDATA[<210> 149]]> <![CDATA[<211> 12]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 149]]> Val Pro Trp Met Glu Pro Ala Tyr Gln Arg Phe Leu 1 5 10 <![CDATA[<210> 150]]> <![CDATA[<211> 8]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 150]]> Asp Pro Arg Ala Thr Pro Gly Ser 1 5 <![CDATA[<210> 151]]> <![CDATA[<211> 12]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 151]]> Phe Arg Pro Asn Arg Ala Gln Asp Tyr Asn Thr Asn 1 5 10 <![CDATA[<210> 152]]> <![CDATA[<211> 9]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 152]]> Cys Thr Lys Asn Ser Tyr Leu Met Cys 1 5 <![CDATA[<210> 153]]> <![CDATA[<211> 11]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (2)..(2)]]> <![CDATA[<223> Xaa為Arg或Gln]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (3)..(3)]]> <![CDATA[<223> Xaa為Leu或Arg]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (5)..(5)]]> <![CDATA[<223> Xaa為Gly或Asn]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (6)..(7)]]> <![CDATA[<223> Xaa為任何胺基酸]]> <![CDATA[<220>]]> <![CDATA[<221> SITE]]> <![CDATA[<222> (9)..(9)]]> <![CDATA[<223> Xaa為Ala或Val]]> <![CDATA[<400> 153]]> Cys Xaa Xaa Thr Xaa Xaa Xaa Gly Xaa Gly Cys 1 5 10 <![CDATA[<210> 154]]> <![CDATA[<211> 9]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 154]]> Cys Pro Ile Glu Asp Arg Pro Met Cys 1 5 <![CDATA[<210> 155]]> <![CDATA[<211> 12]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 155]]> His Glu Trp Ser Tyr Leu Ala Pro Tyr Pro Trp Phe 1 5 10 <![CDATA[<210> 156]]> <![CDATA[<211> 9]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 156]]> Met Cys Pro Lys His Pro Leu Gly Cys 1 5 <![CDATA[<210> 157]]> <![CDATA[<211> 15]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 157]]> Arg Met Trp Pro Ser Ser Thr Val Asn Leu Ser Ala Gly Arg Arg 1 5 10 15 <![CDATA[<210> 158]]> <![CDATA[<211> 20]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 158]]> Ser Ala Lys Thr Ala Val Ser Gln Arg Val Trp Leu Pro Ser His Arg 1 5 10 15 Gly Gly Glu Pro 20 <![CDATA[<210> 159]]> <![CDATA[<211> 20]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 159]]> Lys Ser Arg Glu His Val Asn Asn Ser Ala Cys Pro Ser Lys Arg Ile 1 5 10 15 Thr Ala Ala Leu 20 <![CDATA[<210> 160]]> <![CDATA[<211> 4]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 160]]> Glu Gly Phe Arg 1 <![CDATA[<210> 161]]> <![CDATA[<211> 6]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 161]]> Ala Gly Leu Gly Val Arg 1 5 <![CDATA[<210> 162]]> <![CDATA[<211> 15]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 162]]> Gly Thr Arg Gln Gly His Thr Met Arg Leu Gly Val Ser Asp Gly 1 5 10 15 <![CDATA[<210> 163]]> <![CDATA[<211> 15]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 163]]> Ile Ala Gly Leu Ala Thr Pro Gly Trp Ser His Trp Leu Ala Leu 1 5 10 15 <![CDATA[<210> 164]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 164]]> Ser Met Ser Ile Ala Arg Leu 1 5 <![CDATA[<210> 165]]> <![CDATA[<211> 7]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 165]]> His Thr Phe Glu Pro Gly Val 1 5 <![CDATA[<210> 166]]> <![CDATA[<211> 14]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 166]]> Asn Thr Ser Leu Lys Arg Ile Ser Asn Lys Arg Arg Arg Lys 1 5 10 <![CDATA[<210> 167]]> <![CDATA[<211> 15]]> <![CDATA[<212> PRT]]> <![CDATA[<213> 人工序列(Artificial Sequence)]]> <![CDATA[<220>]]> <![CDATA[<223> 靶向肽序列]]> <![CDATA[<400> 167]]> Leu Arg Ile Lys Arg Lys Arg Arg Lys Arg Lys Lys Thr Arg Lys 1 5 10 15
Claims (48)
- 一種包含衣殼蛋白之重組腺相關病毒(AAV)載體,其中該衣殼蛋白包含具有SEQ ID NO: 17至23中任一者之序列的轉導相關肽。
- 如請求項1之重組AAV載體,其中該衣殼蛋白包含與SEQ ID NO: 1具有至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性之胺基酸序列。
- 如請求項1或2之重組AAV載體,其中該轉導相關肽取代對應於SEQ ID NO: 1之胺基酸454-460的胺基酸。
- 如請求項1之重組AAV載體,其中該衣殼蛋白包含選自由以下組成之群的胺基酸序列:SEQ ID NO: 2、4、6、8、10、12及14,或與其至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致的序列。
- 一種包含衣殼蛋白之重組AAV載體,其中該衣殼蛋白包含SEQ ID NO: 1之序列,其中SEQ ID NO: 1之胺基酸454-460經包含序列X1-X2-X3-X4-X5-X6-X7 (SEQ ID NO: 24)之轉導相關肽取代。
- 如請求項5之重組AAV載體,其中X1不為G,X2不為S,X3不為A,X4不為Q,X5不為N,X6不為K,及/或X7不為D。
- 如請求項5至6中任一項之重組AAV載體,其中X1為H、M、A、Q、V或S。
- 如請求項5至7中任一項之重組AAV載體,其中X2為A或T。
- 如請求項5至8中任一項之重組AAV載體,其中X3為P或T。
- 如請求項5至9中任一項之重組AAV載體,其中X4為R或D。
- 如請求項5至10中任一項之重組AAV載體,其中X5為V、Q、C、S或D。
- 如請求項5至11中任一項之重組AAV載體,其中X6為E、A或P。
- 如請求項5至12中任一項之重組AAV載體,其中X7為E、G、N、T或A。
- 如請求項5之重組AAV載體,其中X1為H,X2為A,X3為P,X4為R,X5為V,X6為E,且X7為E。
- 如請求項5之重組AAV載體,其中X1為M,X2為A,X3為P,X4為R,X5為Q,X6為E,且X7為G。
- 如請求項5之重組AAV載體,其中X1為H,X2為T,X3為T,X4為D,X5為C,X6為A,且X7為N。
- 如請求項5之重組AAV載體,其中X1為A,X2為A,X3為P,X4為R,X5為S,X6為E,且X7為T。
- 如請求項5之重組AAV載體,其中X1為Q,X2為A,X3為P,X4為R,X5為Q,X6為E,且X7為G。
- 如請求項5之重組AAV載體,其中X1為V,X2為A,X3為P,X4為R,X5為D,X6為P,且X7為A。
- 如請求項5之重組AAV載體,其中X1為S,X2為A,X3為P,X4為R,X5為S,X46為E,且X7為N。
- 如請求項5之重組AAV載體,其中該衣殼蛋白包含與SEQ ID NO: 1具有至少約95%、至少約96%、至少約97%、至少約98%或至少約99%一致性之胺基酸序列。
- 如請求項21之重組AAV載體,其中該衣殼蛋白包含與SEQ ID NO: 1具有約99%一致性之胺基酸序列。
- 如請求項5之重組AAV載體,其中該衣殼蛋白包含選自由SEQ ID NO: 2、4、6、8、10、12及14組成之群的胺基酸序列。
- 一種包含衣殼蛋白之重組AAV載體,其中該衣殼蛋白包含具有SEQ ID NO: 16之胺基酸序列的轉導相關肽,其中該轉導相關肽相對於SEQ ID NO: 1取代胺基酸454-460。
- 如請求項24之重組AAV載體,其中該轉導相關肽具有SEQ ID NO: 17-23中任一者之胺基酸序列。
- 一種核酸,其編碼具有SEQ ID NO: 2、4、6、8、10、12及14中任一者之序列的重組AAV衣殼蛋白。
- 如請求項26之核酸,其中該核酸包含選自由SEQ ID NO: 3、5、7、9、11、13及15組成之群的序列。
- 如請求項26或27之核酸,其中該核酸為DNA序列。
- 如請求項26或27之核酸,其中該核酸為RNA序列。
- 一種表現載體,其包含如請求項26至29中任一項之核酸。
- 一種細胞,其包含如請求項26至29中任一項之核酸或如請求項30之表現載體。
- 如請求項1至25中任一項之重組AAV載體,其進一步包含經該衣殼蛋白包裹的貨物核酸。
- 如請求項32之重組AAV載體,其中該貨物核酸編碼治療性蛋白質或治療性RNA。
- 如請求項32至33中任一項之重組AAV載體,其中與不包含該轉導相關肽之AAV載體相比,該AAV載體展現增加的向細胞中之轉導。
- 如請求項34之AAV載體,其中該細胞為T細胞。
- 如請求項35之AAV載體,其中與不包含該轉導相關肽之AAV載體相比,該AAV載體展現增加的向T細胞之細胞核中之轉導。
- 如請求項35之AAV載體,其中與不包含該轉導相關肽之AAV載體相比,該AAV載體展現增加的向T細胞之胞質液中之轉導。
- 一種組合物,其包含如請求項1至25或32至37中任一項之重組AAV載體、如請求項26至29中任一項之核酸、如請求項30之表現載體或如請求項31之細胞。
- 一種醫藥組合物,其包含如請求項31之細胞或如請求項1至25或32至37中任一項之重組AAV載體;及醫藥學上可接受之載劑。
- 一種將AAV載體遞送至細胞中之方法,其包含使該細胞與如請求項1至25或32至37中任一項之AAV載體接觸。
- 如請求項40之方法,其中該細胞之該接觸係在 活體外、離體或 活體內進行。
- 如請求項40或41之方法,其中該細胞為T細胞。
- 一種治療有需要之個體的方法,其包含向該個體投與有效量的如請求項1至25或32至37中任一項之AAV載體。
- 一種治療有需要之個體的方法,其包含向該個體投與已與如請求項1至25或32至37中任一項之AAV載體 離體接觸的細胞。
- 如請求項43或44之方法,其中該個體為哺乳動物。
- 如請求項45之方法,其中該個體為人類。
- 如請求項1至25或32至37中任一項之AAV載體,其用作藥劑。
- 如請求項1至25或32至37中任一項之AAV載體,其用於治療有需要之個體的方法。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202163137497P | 2021-01-14 | 2021-01-14 | |
US63/137,497 | 2021-01-14 |
Publications (1)
Publication Number | Publication Date |
---|---|
TW202242124A true TW202242124A (zh) | 2022-11-01 |
Family
ID=81328090
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
TW111101402A TW202242124A (zh) | 2021-01-14 | 2022-01-13 | 靶向t細胞之aav載體 |
Country Status (10)
Country | Link |
---|---|
US (1) | US20240123085A1 (zh) |
EP (1) | EP4277920A1 (zh) |
JP (1) | JP2024503091A (zh) |
KR (1) | KR20230135093A (zh) |
CN (1) | CN117203222A (zh) |
AR (1) | AR124651A1 (zh) |
AU (1) | AU2022208037A1 (zh) |
CA (1) | CA3204794A1 (zh) |
TW (1) | TW202242124A (zh) |
WO (1) | WO2022155482A1 (zh) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2023060264A1 (en) | 2021-10-08 | 2023-04-13 | Dyno Therapeutics, Inc. | Capsid variants and methods of using the same |
WO2024124019A2 (en) * | 2022-12-07 | 2024-06-13 | Ginkgo Bioworks, Inc. | Aav vectors targeting hematopoietic stem cells |
Family Cites Families (40)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0322417A1 (en) | 1986-09-08 | 1989-07-05 | Applied Biotechnology, Inc. | Empty viral capsid vaccines |
US4968603A (en) | 1986-12-31 | 1990-11-06 | The Regents Of The University Of California | Determination of status in neoplastic disease |
EP0442926A1 (en) | 1988-11-10 | 1991-08-28 | Imperial Cancer Research Technology Limited | Polypeptides |
US5916563A (en) | 1988-11-14 | 1999-06-29 | United States Of America | Parvovirus protein presenting capsids |
US5399346A (en) | 1989-06-14 | 1995-03-21 | The United States Of America As Represented By The Department Of Health And Human Services | Gene therapy |
ES2026826A6 (es) | 1991-03-26 | 1992-05-01 | Ercros Sa | Procedimiento para la produccion de una vacuna subunidad contra el parvovirus canino y otros virus relacionados. |
US5478745A (en) | 1992-12-04 | 1995-12-26 | University Of Pittsburgh | Recombinant viral vector system |
US5869248A (en) | 1994-03-07 | 1999-02-09 | Yale University | Targeted cleavage of RNA using ribonuclease P targeting and cleavage sequences |
US6204059B1 (en) | 1994-06-30 | 2001-03-20 | University Of Pittsburgh | AAV capsid vehicles for molecular transfer |
US5599706A (en) | 1994-09-23 | 1997-02-04 | Stinchcomb; Dan T. | Ribozymes targeted to apo(a) mRNA |
US6093570A (en) | 1995-06-07 | 2000-07-25 | The University Of North Carolina At Chapel Hill | Helper virus-free AAV production |
US6040183A (en) | 1995-06-07 | 2000-03-21 | University Of North Carloina At Chapel Hill | Helper virus-free AAV production |
CA2240494C (en) | 1995-12-15 | 2007-03-13 | Lloyd G. Mitchell | Therapeutic molecules generated by trans-splicing |
US6083702A (en) | 1995-12-15 | 2000-07-04 | Intronn Holdings Llc | Methods and compositions for use in spliceosome mediated RNA trans-splicing |
AU4645697A (en) | 1996-09-11 | 1998-04-02 | Government Of The United States Of America, As Represented By The Secretary Of The Department Of Health And Human Services, The | Aav4 vector and uses thereof |
US6156303A (en) | 1997-06-11 | 2000-12-05 | University Of Washington | Adeno-associated virus (AAV) isolates and AAV vectors derived therefrom |
ATE402254T1 (de) | 1998-05-28 | 2008-08-15 | Us Gov Health & Human Serv | Aav5 vektoren und deren verwendung |
NZ511171A (en) | 1998-09-22 | 2004-02-27 | Univ Florida | Methods for large-scale production of recombinant AAV vectors |
PT1127150E (pt) | 1998-11-05 | 2007-08-22 | Univ Pennsylvania | ''sequências de ácido nucleico do vírus adeno associado do serotipo 1, vectores e células hospedeiras que as contêm'' |
ES2340230T3 (es) | 1998-11-10 | 2010-05-31 | University Of North Carolina At Chapel Hill | Vectores viricos y sus procedimientos de preparacion y administracion. |
US7314912B1 (en) | 1999-06-21 | 2008-01-01 | Medigene Aktiengesellschaft | AAv scleroprotein, production and use thereof |
WO2001092551A2 (en) | 2000-06-01 | 2001-12-06 | University Of North Carolina At Chapel Hill | Duplexed parvovirus vectors |
US7201898B2 (en) | 2000-06-01 | 2007-04-10 | The University Of North Carolina At Chapel Hill | Methods and compounds for controlled release of recombinant parvovirus vectors |
US6623729B2 (en) | 2001-07-09 | 2003-09-23 | Korea Advanced Institute Of Science And Technology | Process for preparing sustained release micelle employing conjugate of anticancer drug and biodegradable polymer |
WO2003093295A2 (en) | 2002-04-30 | 2003-11-13 | University Of North Carolina At Chapel Hill | Secretion signal vectors |
ITRM20020253A1 (it) | 2002-05-08 | 2003-11-10 | Univ Roma | Molecole chimeriche di snrna con sequenze antisenso per le giunzioni di splicing del gene della distrofina e applicazioni terapeutiche. |
FR2874384B1 (fr) | 2004-08-17 | 2010-07-30 | Genethon | Vecteur viral adeno-associe pour realiser du saut d'exons dans un gene codant une proteine a domaines dispensables |
JP4346526B2 (ja) | 2004-08-31 | 2009-10-21 | 株式会社東芝 | 半導体集積回路装置 |
WO2006029319A2 (en) | 2004-09-09 | 2006-03-16 | The General Hospital Corporation | Modulating phosphatase activity in cardiac cells |
NZ555830A (en) | 2004-12-15 | 2009-01-31 | Univ North Carolina | Chimeric vectors |
JP2008539698A (ja) | 2005-04-29 | 2008-11-20 | ザ・ユニヴァーシティ・オヴ・ノース・キャロライナ・アト・チャペル・ヒル | 転写後レベルでの核酸発現調節のための方法および組成物 |
US8283151B2 (en) * | 2005-04-29 | 2012-10-09 | The United States Of America, As Represented By The Secretary, Department Of Health And Human Services | Isolation, cloning and characterization of new adeno-associated virus (AAV) serotypes |
EP2441770A1 (en) | 2006-02-10 | 2012-04-18 | The University of Cincinnati | Phosphatase inhibitor protein-1 as a regulator of cardiac function |
CN101711164B (zh) | 2007-01-18 | 2014-06-04 | 密苏里-哥伦比亚大学 | 可修复肌纤维膜中的神经元型一氧化氮合酶的合成小/微-抗肌萎缩蛋白基因 |
EP2396343B1 (en) | 2009-02-11 | 2017-05-17 | The University of North Carolina At Chapel Hill | Modified virus vectors and methods of making and using the same |
AU2014227766B2 (en) | 2013-03-15 | 2018-10-04 | The University Of North Carolina At Chapel Hill | Methods and compositions for dual glycan binding AAV vectors |
CN109923211A (zh) * | 2016-09-08 | 2019-06-21 | 蓝鸟生物公司 | Pd-1归巢核酸内切酶变体、组合物及使用方法 |
MX2020010466A (es) | 2018-04-03 | 2021-01-08 | Vectores de virus que evitan anticuerpos. | |
EP3773743A1 (en) | 2018-04-03 | 2021-02-17 | Stridebio, Inc. | Virus vectors for targeting ophthalmic tissues |
BR112020020266A2 (pt) | 2018-04-03 | 2021-01-19 | Stridebio, Inc. | Vetores de vírus com evasão de anticorpos |
-
2022
- 2022-01-13 TW TW111101402A patent/TW202242124A/zh unknown
- 2022-01-14 JP JP2023542960A patent/JP2024503091A/ja active Pending
- 2022-01-14 AR ARP220100072A patent/AR124651A1/es unknown
- 2022-01-14 AU AU2022208037A patent/AU2022208037A1/en active Pending
- 2022-01-14 KR KR1020237026711A patent/KR20230135093A/ko unknown
- 2022-01-14 EP EP22703750.4A patent/EP4277920A1/en active Pending
- 2022-01-14 CA CA3204794A patent/CA3204794A1/en active Pending
- 2022-01-14 WO PCT/US2022/012542 patent/WO2022155482A1/en active Application Filing
- 2022-01-14 CN CN202280016202.8A patent/CN117203222A/zh active Pending
-
2023
- 2023-07-12 US US18/221,211 patent/US20240123085A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
AU2022208037A1 (en) | 2023-07-20 |
CA3204794A1 (en) | 2022-07-21 |
WO2022155482A1 (en) | 2022-07-21 |
KR20230135093A (ko) | 2023-09-22 |
AR124651A1 (es) | 2023-04-19 |
WO2022155482A9 (en) | 2022-12-22 |
EP4277920A1 (en) | 2023-11-22 |
JP2024503091A (ja) | 2024-01-24 |
CN117203222A (zh) | 2023-12-08 |
US20240123085A1 (en) | 2024-04-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20240209031A1 (en) | Methods and compositions for gene transfer across the vasculature | |
US10907176B2 (en) | Methods and compositions for targeted gene transfer | |
US11981914B2 (en) | Recombinant adeno-associated virus vectors | |
US11718862B2 (en) | Methods and compositions for circular RNA molecules | |
US11976096B2 (en) | Antibody-evading virus vectors | |
US9409953B2 (en) | Viral vectors with modified transduction profiles and methods of making and using the same | |
CA3095179A1 (en) | Antibody-evading virus vectors | |
US11905523B2 (en) | Adeno-associated viral vectors for treatment of Niemann-Pick Disease type-C | |
CA3094465A1 (en) | Virus vectors for targeting ophthalmic tissues | |
NZ555830A (en) | Chimeric vectors | |
US20240123085A1 (en) | Aav vectors targeting t-cells | |
WO2024124019A2 (en) | Aav vectors targeting hematopoietic stem cells | |
TW202342759A (zh) | 重組腺相關病毒載體及其使用方法 |