CN101287838A - Rabies virus compositions and methods - Google Patents
Rabies virus compositions and methods
- Publication number
- CN101287838A CNA2006800383144A CN200680038314A
- Authority
- CN
- China
- Prior art keywords
- leu
- ser
- arg
- val
- ile
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 241000711798 Rabies lyssavirus Species 0.000 title claims abstract description 173
- 239000000203 mixture Substances 0.000 title claims abstract description 74
- 238000000034 method Methods 0.000 title claims abstract description 66
- 239000013598 vector Substances 0.000 title description 10
- 241000700605 Viruses Species 0.000 claims abstract description 205
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 65
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 59
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 59
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 21
- 108020004414 DNA Proteins 0.000 claims description 113
- 239000002773 nucleotide Substances 0.000 claims description 90
- 125000003729 nucleotide group Chemical group 0.000 claims description 90
- 239000013612 plasmid Substances 0.000 claims description 78
- 108090000623 proteins and genes Proteins 0.000 claims description 78
- 206010037742 Rabies Diseases 0.000 claims description 57
- 241000282326 Felis catus Species 0.000 claims description 42
- 230000014509 gene expression Effects 0.000 claims description 41
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 41
- 230000003321 amplification Effects 0.000 claims description 40
- 239000012634 fragment Substances 0.000 claims description 37
- 230000002068 genetic effect Effects 0.000 claims description 34
- 239000002299 complementary DNA Substances 0.000 claims description 33
- 241001465754 Metazoa Species 0.000 claims description 27
- 238000001890 transfection Methods 0.000 claims description 27
- 101710137500 T7 RNA polymerase Proteins 0.000 claims description 25
- 238000002360 preparation method Methods 0.000 claims description 25
- 108010077850 Nuclear Localization Signals Proteins 0.000 claims description 23
- 108091033319 polynucleotide Proteins 0.000 claims description 19
- 102000040430 polynucleotide Human genes 0.000 claims description 19
- 239000002157 polynucleotide Substances 0.000 claims description 19
- 208000015181 infectious disease Diseases 0.000 claims description 18
- 235000018102 proteins Nutrition 0.000 claims description 18
- 229960003127 rabies vaccine Drugs 0.000 claims description 17
- 238000010839 reverse transcription Methods 0.000 claims description 12
- 108090000994 Catalytic RNA Proteins 0.000 claims description 11
- 102000053642 Catalytic RNA Human genes 0.000 claims description 11
- 230000004044 response Effects 0.000 claims description 11
- 108091092562 ribozyme Proteins 0.000 claims description 11
- 108090001102 Hammerhead ribozyme Proteins 0.000 claims description 10
- 208000037262 Hepatitis delta Diseases 0.000 claims description 10
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 10
- 239000003937 drug carrier Substances 0.000 claims description 10
- 241000724709 Hepatitis delta virus Species 0.000 claims description 9
- 230000008676 import Effects 0.000 claims description 8
- 239000008194 pharmaceutical composition Substances 0.000 claims description 8
- 230000002238 attenuated effect Effects 0.000 claims description 7
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 claims description 7
- 241000958531 Lycalopex culpaeus Species 0.000 claims description 6
- 108020004635 Complementary DNA Proteins 0.000 claims description 4
- 101900236200 Rabies virus Nucleoprotein Proteins 0.000 claims description 4
- 101900061860 Rabies virus Phosphoprotein Proteins 0.000 claims description 4
- 229940124861 Rabies virus vaccine Drugs 0.000 claims description 4
- 239000000969 carrier Substances 0.000 claims description 4
- 239000002671 adjuvant Substances 0.000 claims description 3
- 238000010804 cDNA synthesis Methods 0.000 claims description 3
- 239000003981 vehicle Substances 0.000 claims description 3
- 101710153593 Albumin A Proteins 0.000 claims description 2
- 241000282461 Canis lupus Species 0.000 claims description 2
- 108700037791 Rabies virus L Proteins 0.000 claims description 2
- 241000555745 Sciuridae Species 0.000 claims description 2
- 230000002441 reversible effect Effects 0.000 abstract description 35
- 230000002163 immunogen Effects 0.000 abstract description 8
- 108091005461 Nucleic proteins Proteins 0.000 abstract 1
- 210000004027 cell Anatomy 0.000 description 108
- 108091034117 Oligonucleotide Proteins 0.000 description 72
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 53
- 229940024606 amino acid Drugs 0.000 description 46
- 235000001014 amino acid Nutrition 0.000 description 45
- 150000001413 amino acids Chemical class 0.000 description 44
- 108090000765 processed proteins & peptides Proteins 0.000 description 41
- 230000003612 virological effect Effects 0.000 description 39
- 230000035897 transcription Effects 0.000 description 38
- 238000013518 transcription Methods 0.000 description 38
- 108090000288 Glycoproteins Proteins 0.000 description 37
- 241000699666 Mus <mouse, genus> Species 0.000 description 34
- 102000003886 Glycoproteins Human genes 0.000 description 33
- 108700026244 Open Reading Frames Proteins 0.000 description 33
- 229920001184 polypeptide Polymers 0.000 description 32
- 102000004196 processed proteins & peptides Human genes 0.000 description 32
- 101150082239 G gene Proteins 0.000 description 31
- 229960005486 vaccine Drugs 0.000 description 29
- 239000000523 sample Substances 0.000 description 28
- 238000011081 inoculation Methods 0.000 description 27
- 230000008859 change Effects 0.000 description 25
- 238000009396 hybridization Methods 0.000 description 24
- 230000003308 immunostimulating effect Effects 0.000 description 23
- 108010050848 glycylleucine Proteins 0.000 description 20
- 238000011084 recovery Methods 0.000 description 19
- 241000701022 Cytomegalovirus Species 0.000 description 18
- 101100148606 Caenorhabditis elegans pst-1 gene Proteins 0.000 description 17
- 238000003757 reverse transcription PCR Methods 0.000 description 17
- 238000006243 chemical reaction Methods 0.000 description 16
- 201000010099 disease Diseases 0.000 description 16
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 16
- 108010061238 threonyl-glycine Proteins 0.000 description 16
- 108091028043 Nucleic acid sequence Proteins 0.000 description 15
- 238000004458 analytical method Methods 0.000 description 15
- 230000001419 dependent effect Effects 0.000 description 15
- 238000011161 development Methods 0.000 description 15
- 230000018109 developmental process Effects 0.000 description 15
- 239000005090 green fluorescent protein Substances 0.000 description 15
- 230000008521 reorganization Effects 0.000 description 15
- 238000012360 testing method Methods 0.000 description 15
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 14
- 108091029795 Intergenic region Proteins 0.000 description 14
- 238000003752 polymerase chain reaction Methods 0.000 description 14
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 13
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 13
- 239000000427 antigen Substances 0.000 description 13
- 108091007433 antigens Proteins 0.000 description 13
- 102000036639 antigens Human genes 0.000 description 13
- 230000006870 function Effects 0.000 description 13
- 108020004999 messenger RNA Proteins 0.000 description 13
- 230000004048 modification Effects 0.000 description 13
- 238000012986 modification Methods 0.000 description 13
- 210000004940 nucleus Anatomy 0.000 description 13
- 230000001717 pathogenic effect Effects 0.000 description 13
- 230000008034 disappearance Effects 0.000 description 12
- 238000011160 research Methods 0.000 description 12
- 230000036039 immunity Effects 0.000 description 11
- 238000007834 ligase chain reaction Methods 0.000 description 11
- 230000027455 binding Effects 0.000 description 10
- 108010034529 leucyl-lysine Proteins 0.000 description 10
- 101150062031 L gene Proteins 0.000 description 9
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 9
- 241000711828 Lyssavirus Species 0.000 description 9
- 101150084044 P gene Proteins 0.000 description 9
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 9
- 108010047857 aspartylglycine Proteins 0.000 description 9
- 230000000295 complement effect Effects 0.000 description 9
- 230000000875 corresponding effect Effects 0.000 description 9
- 108010015792 glycyllysine Proteins 0.000 description 9
- 108010092114 histidylphenylalanine Proteins 0.000 description 9
- 239000000463 material Substances 0.000 description 9
- 239000000047 product Substances 0.000 description 9
- -1 ttacks Species 0.000 description 9
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 8
- 102000004190 Enzymes Human genes 0.000 description 8
- 108090000790 Enzymes Proteins 0.000 description 8
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 8
- 108060003951 Immunoglobulin Proteins 0.000 description 8
- 241000880493 Leptailurus serval Species 0.000 description 8
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 8
- 108010081734 Ribonucleoproteins Proteins 0.000 description 8
- 102000004389 Ribonucleoproteins Human genes 0.000 description 8
- 101150052859 Slc9a1 gene Proteins 0.000 description 8
- 108020005038 Terminator Codon Proteins 0.000 description 8
- 108010013835 arginine glutamate Proteins 0.000 description 8
- 108010077245 asparaginyl-proline Proteins 0.000 description 8
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 8
- 210000004556 brain Anatomy 0.000 description 8
- 230000034994 death Effects 0.000 description 8
- 238000013461 design Methods 0.000 description 8
- 241001493065 dsRNA viruses Species 0.000 description 8
- 238000005516 engineering process Methods 0.000 description 8
- 229940088598 enzyme Drugs 0.000 description 8
- 102000018358 immunoglobulin Human genes 0.000 description 8
- 239000007924 injection Substances 0.000 description 8
- 238000002347 injection Methods 0.000 description 8
- 238000007918 intramuscular administration Methods 0.000 description 8
- 108010057821 leucylproline Proteins 0.000 description 8
- 239000007788 liquid Substances 0.000 description 8
- 108010017391 lysylvaline Proteins 0.000 description 8
- 238000010369 molecular cloning Methods 0.000 description 8
- BASFCYQUMIYNBI-UHFFFAOYSA-N platinum Chemical compound [Pt] BASFCYQUMIYNBI-UHFFFAOYSA-N 0.000 description 8
- 239000006228 supernatant Substances 0.000 description 8
- 230000004083 survival effect Effects 0.000 description 8
- 238000013519 translation Methods 0.000 description 8
- 241000699800 Cricetinae Species 0.000 description 7
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 7
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 7
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 7
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 7
- 238000012408 PCR amplification Methods 0.000 description 7
- 241000711841 Rabies virus ERA Species 0.000 description 7
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 7
- 125000000539 amino acid group Chemical group 0.000 description 7
- 230000000890 antigenic effect Effects 0.000 description 7
- 150000001875 compounds Chemical class 0.000 description 7
- 239000003814 drug Substances 0.000 description 7
- 108010049041 glutamylalanine Proteins 0.000 description 7
- 238000004519 manufacturing process Methods 0.000 description 7
- 238000002703 mutagenesis Methods 0.000 description 7
- 231100000350 mutagenesis Toxicity 0.000 description 7
- 239000002245 particle Substances 0.000 description 7
- 108010012581 phenylalanylglutamate Proteins 0.000 description 7
- 108010029020 prolylglycine Proteins 0.000 description 7
- 108091008146 restriction endonucleases Proteins 0.000 description 7
- 208000024891 symptom Diseases 0.000 description 7
- 210000001519 tissue Anatomy 0.000 description 7
- 230000009385 viral infection Effects 0.000 description 7
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 6
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 6
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 6
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 6
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 6
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 6
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 6
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 6
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 6
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 6
- 102000011931 Nucleoproteins Human genes 0.000 description 6
- 108010061100 Nucleoproteins Proteins 0.000 description 6
- 108091081024 Start codon Proteins 0.000 description 6
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 6
- 108010068265 aspartyltyrosine Proteins 0.000 description 6
- 239000003153 chemical reaction reagent Substances 0.000 description 6
- 230000002596 correlated effect Effects 0.000 description 6
- 230000009849 deactivation Effects 0.000 description 6
- 230000000694 effects Effects 0.000 description 6
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 6
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 6
- 108010087823 glycyltyrosine Proteins 0.000 description 6
- 230000012010 growth Effects 0.000 description 6
- 108010025306 histidylleucine Proteins 0.000 description 6
- 239000011159 matrix material Substances 0.000 description 6
- 108010056582 methionylglutamic acid Proteins 0.000 description 6
- 229940126578 oral vaccine Drugs 0.000 description 6
- 238000012856 packing Methods 0.000 description 6
- 210000002966 serum Anatomy 0.000 description 6
- 108010048818 seryl-histidine Proteins 0.000 description 6
- 108010026333 seryl-proline Proteins 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- 238000002965 ELISA Methods 0.000 description 5
- 101150066002 GFP gene Proteins 0.000 description 5
- 241001200922 Gagata Species 0.000 description 5
- XNOWYPDMSLSRKP-GUBZILKMSA-N Glu-Met-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O XNOWYPDMSLSRKP-GUBZILKMSA-N 0.000 description 5
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 5
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 5
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 5
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 5
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 5
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 5
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 5
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 5
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 5
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 5
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 5
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 5
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 5
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 5
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 5
- FIFDDJFLNVAVMS-RHYQMDGZSA-N Thr-Leu-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O FIFDDJFLNVAVMS-RHYQMDGZSA-N 0.000 description 5
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 5
- 108010044940 alanylglutamine Proteins 0.000 description 5
- 108010093581 aspartyl-proline Proteins 0.000 description 5
- 108010092854 aspartyllysine Proteins 0.000 description 5
- 238000010276 construction Methods 0.000 description 5
- 230000001276 controlling effect Effects 0.000 description 5
- 238000010586 diagram Methods 0.000 description 5
- 238000009472 formulation Methods 0.000 description 5
- 230000004927 fusion Effects 0.000 description 5
- 230000036541 health Effects 0.000 description 5
- 230000028993 immune response Effects 0.000 description 5
- 230000003053 immunization Effects 0.000 description 5
- 238000002649 immunization Methods 0.000 description 5
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 5
- 108010012058 leucyltyrosine Proteins 0.000 description 5
- 230000007246 mechanism Effects 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 229930182817 methionine Natural products 0.000 description 5
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 5
- 230000001681 protective effect Effects 0.000 description 5
- 108010051110 tyrosyl-lysine Proteins 0.000 description 5
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 4
- 229920000936 Agarose Polymers 0.000 description 4
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 4
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 4
- 108010032595 Antibody Binding Sites Proteins 0.000 description 4
- 239000004475 Arginine Substances 0.000 description 4
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 description 4
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 4
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 4
- 108020004705 Codon Proteins 0.000 description 4
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 4
- 102000053602 DNA Human genes 0.000 description 4
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 4
- RQNYYRHRKSVKAB-GUBZILKMSA-N Glu-Cys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O RQNYYRHRKSVKAB-GUBZILKMSA-N 0.000 description 4
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 4
- OMOZPGCHVWOXHN-BQBZGAKWSA-N Gly-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)CN OMOZPGCHVWOXHN-BQBZGAKWSA-N 0.000 description 4
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 4
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 4
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 4
- 101710091977 Hydrophobin Proteins 0.000 description 4
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 4
- 108010065920 Insulin Lispro Proteins 0.000 description 4
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 4
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 4
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 4
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 4
- YWYQSLOTVIRCFE-SRVKXCTJSA-N Leu-His-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O YWYQSLOTVIRCFE-SRVKXCTJSA-N 0.000 description 4
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 4
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 4
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 4
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 4
- 241000124008 Mammalia Species 0.000 description 4
- ACYHZNZHIZWLQF-BQBZGAKWSA-N Met-Asn-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ACYHZNZHIZWLQF-BQBZGAKWSA-N 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 4
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 4
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 4
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 4
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 4
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 4
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 4
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 4
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 4
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 4
- OBAMASZCXDIXSS-SZMVWBNQSA-N Trp-Glu-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N OBAMASZCXDIXSS-SZMVWBNQSA-N 0.000 description 4
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 4
- 239000006035 Tryptophane Substances 0.000 description 4
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 4
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 4
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 4
- 239000002253 acid Substances 0.000 description 4
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 4
- 108010005233 alanylglutamic acid Proteins 0.000 description 4
- 230000000840 anti-viral effect Effects 0.000 description 4
- 238000013459 approach Methods 0.000 description 4
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 4
- 108010062796 arginyllysine Proteins 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 239000011230 binding agent Substances 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 4
- 210000004369 blood Anatomy 0.000 description 4
- 239000008280 blood Substances 0.000 description 4
- 239000003795 chemical substances by application Substances 0.000 description 4
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 4
- 230000002950 deficient Effects 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 230000029087 digestion Effects 0.000 description 4
- 238000001962 electrophoresis Methods 0.000 description 4
- 230000002708 enhancing effect Effects 0.000 description 4
- 238000011156 evaluation Methods 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 238000012268 genome sequencing Methods 0.000 description 4
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 4
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 4
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- 108010077515 glycylproline Proteins 0.000 description 4
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 4
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 4
- 108010040030 histidinoalanine Proteins 0.000 description 4
- 108010018006 histidylserine Proteins 0.000 description 4
- 229910052739 hydrogen Inorganic materials 0.000 description 4
- 239000001257 hydrogen Substances 0.000 description 4
- 238000007917 intracranial administration Methods 0.000 description 4
- 108010078274 isoleucylvaline Proteins 0.000 description 4
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 4
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 4
- 108010051242 phenylalanylserine Proteins 0.000 description 4
- 230000002265 prevention Effects 0.000 description 4
- 108010070643 prolylglutamic acid Proteins 0.000 description 4
- 108010090894 prolylleucine Proteins 0.000 description 4
- 108010053725 prolylvaline Proteins 0.000 description 4
- 230000000644 propagated effect Effects 0.000 description 4
- 230000008707 rearrangement Effects 0.000 description 4
- 230000009467 reduction Effects 0.000 description 4
- 230000001105 regulatory effect Effects 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 4
- 230000009870 specific binding Effects 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 238000004448 titration Methods 0.000 description 4
- 231100000419 toxicity Toxicity 0.000 description 4
- 230000001988 toxicity Effects 0.000 description 4
- 229960004799 tryptophan Drugs 0.000 description 4
- 108010080629 tryptophan-leucine Proteins 0.000 description 4
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 4
- 108010073969 valyllysine Proteins 0.000 description 4
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 3
- XAGIMRPOEJSYER-CIUDSAMLSA-N Ala-Cys-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XAGIMRPOEJSYER-CIUDSAMLSA-N 0.000 description 3
- LBFXVAXPDOBRKU-LKTVYLICSA-N Ala-His-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LBFXVAXPDOBRKU-LKTVYLICSA-N 0.000 description 3
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 3
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 3
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 3
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 3
- DIIGDGJKTMLQQW-IHRRRGAJSA-N Arg-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N DIIGDGJKTMLQQW-IHRRRGAJSA-N 0.000 description 3
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 3
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 3
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 3
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 3
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 3
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 3
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 3
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 3
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 3
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 3
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 3
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 3
- UXIYYUMGFNSGBK-XPUUQOCRSA-N Cys-Gly-Val Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O UXIYYUMGFNSGBK-XPUUQOCRSA-N 0.000 description 3
- 230000004543 DNA replication Effects 0.000 description 3
- 239000006144 Dulbecco's modified Eagle's medium Substances 0.000 description 3
- 206010015548 Euthanasia Diseases 0.000 description 3
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 3
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 3
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 3
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 3
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 3
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 3
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 3
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 3
- ZALGPUWUVHOGAE-GVXVVHGQSA-N Glu-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZALGPUWUVHOGAE-GVXVVHGQSA-N 0.000 description 3
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 3
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 3
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 3
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 3
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 3
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 3
- 239000004471 Glycine Substances 0.000 description 3
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 3
- JWTKVPMQCCRPQY-SRVKXCTJSA-N His-Asn-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JWTKVPMQCCRPQY-SRVKXCTJSA-N 0.000 description 3
- JJHWJUYYTWYXPL-PYJNHQTQSA-N His-Ile-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CN=CN1 JJHWJUYYTWYXPL-PYJNHQTQSA-N 0.000 description 3
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 3
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 3
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 3
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 3
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 3
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 3
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 3
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 3
- CRYJOCSSSACEAA-VKOGCVSHSA-N Ile-Trp-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N CRYJOCSSSACEAA-VKOGCVSHSA-N 0.000 description 3
- 108700005091 Immunoglobulin Genes Proteins 0.000 description 3
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 3
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 3
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 3
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 3
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 3
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 3
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 3
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 3
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 3
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 3
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 3
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 3
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 3
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 3
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 3
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 3
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 3
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 3
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 3
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 3
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 3
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 3
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 3
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 3
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 3
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 3
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 3
- 241000266847 Mephitidae Species 0.000 description 3
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 3
- 241000699670 Mus sp. Species 0.000 description 3
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 3
- XUYPXLNMDZIRQH-LURJTMIESA-N N-acetyl-L-methionine Chemical compound CSCC[C@@H](C(O)=O)NC(C)=O XUYPXLNMDZIRQH-LURJTMIESA-N 0.000 description 3
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 3
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 3
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 3
- OPEVYHFJXLCCRT-AVGNSLFASA-N Phe-Gln-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O OPEVYHFJXLCCRT-AVGNSLFASA-N 0.000 description 3
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 3
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 3
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 3
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 3
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 3
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 3
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 3
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 3
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 3
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 3
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 3
- CWHJIJJSDGEHNS-MYLFLSLOSA-N Senegenin Chemical compound C1[C@H](O)[C@H](O)[C@@](C)(C(O)=O)[C@@H]2CC[C@@]3(C)C(CC[C@]4(CCC(C[C@H]44)(C)C)C(O)=O)=C4[C@@H](CCl)C[C@@H]3[C@]21C CWHJIJJSDGEHNS-MYLFLSLOSA-N 0.000 description 3
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 3
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 3
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 3
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 3
- YXGCIEUDOHILKR-IHRRRGAJSA-N Ser-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CO)N YXGCIEUDOHILKR-IHRRRGAJSA-N 0.000 description 3
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 3
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 3
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 3
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 3
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 3
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 3
- GYKDRHDMGQUZPU-MGHWNKPDSA-N Tyr-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GYKDRHDMGQUZPU-MGHWNKPDSA-N 0.000 description 3
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 3
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 3
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 3
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 3
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 3
- 108010081404 acein-2 Proteins 0.000 description 3
- 108010047495 alanylglycine Proteins 0.000 description 3
- 108010087924 alanylproline Proteins 0.000 description 3
- 230000006907 apoptotic process Effects 0.000 description 3
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 3
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 3
- 210000003169 central nervous system Anatomy 0.000 description 3
- 238000005520 cutting process Methods 0.000 description 3
- 108010060199 cysteinylproline Proteins 0.000 description 3
- 238000003745 diagnosis Methods 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 239000007850 fluorescent dye Substances 0.000 description 3
- 235000013305 food Nutrition 0.000 description 3
- 229960002989 glutamic acid Drugs 0.000 description 3
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 3
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 3
- 230000009851 immunogenic response Effects 0.000 description 3
- 230000005764 inhibitory process Effects 0.000 description 3
- 210000003734 kidney Anatomy 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 108010009298 lysylglutamic acid Proteins 0.000 description 3
- 108010054155 lysyllysine Proteins 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 108010068488 methionylphenylalanine Proteins 0.000 description 3
- 230000003472 neutralizing effect Effects 0.000 description 3
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 3
- 238000005457 optimization Methods 0.000 description 3
- 230000001575 pathological effect Effects 0.000 description 3
- 108010031719 prolyl-serine Proteins 0.000 description 3
- 108010015796 prolylisoleucine Proteins 0.000 description 3
- 230000000717 retained effect Effects 0.000 description 3
- 238000000926 separation method Methods 0.000 description 3
- 239000000725 suspension Substances 0.000 description 3
- 239000009871 tenuigenin Substances 0.000 description 3
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 3
- 108010038745 tryptophylglycine Proteins 0.000 description 3
- 238000005406 washing Methods 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- NTUPOKHATNSWCY-PMPSAXMXSA-N (2s)-2-[[(2s)-1-[(2r)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C([C@@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=CC=C1 NTUPOKHATNSWCY-PMPSAXMXSA-N 0.000 description 2
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 2
- VKWMGUNWDFIWNW-UHFFFAOYSA-N 2-chloro-1,1-dioxo-1,2-benzothiazol-3-one Chemical compound C1=CC=C2S(=O)(=O)N(Cl)C(=O)C2=C1 VKWMGUNWDFIWNW-UHFFFAOYSA-N 0.000 description 2
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 2
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 2
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 2
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 2
- FRFDXQWNDZMREB-ACZMJKKPSA-N Ala-Cys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRFDXQWNDZMREB-ACZMJKKPSA-N 0.000 description 2
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 2
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 2
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 2
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 2
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 2
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 2
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 2
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 2
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 2
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 2
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 2
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 2
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 2
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 2
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 2
- 108020005544 Antisense RNA Proteins 0.000 description 2
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 2
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 2
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 2
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 2
- OANWAFQRNQEDSY-DCAQKATOSA-N Arg-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N OANWAFQRNQEDSY-DCAQKATOSA-N 0.000 description 2
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 2
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 2
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 2
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 2
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 2
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 2
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 2
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 2
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 2
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 2
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 2
- OKKMBOSPBDASEP-CYDGBPFRSA-N Arg-Ile-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O OKKMBOSPBDASEP-CYDGBPFRSA-N 0.000 description 2
- CFGHCPUPFHWMCM-FDARSICLSA-N Arg-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N CFGHCPUPFHWMCM-FDARSICLSA-N 0.000 description 2
- NOZYDJOPOGKUSR-AVGNSLFASA-N Arg-Leu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O NOZYDJOPOGKUSR-AVGNSLFASA-N 0.000 description 2
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 2
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 2
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 2
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 2
- IGFJVXOATGZTHD-UHFFFAOYSA-N Arg-Phe-His Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccccc1)C(=O)NC(Cc2c[nH]cn2)C(=O)O IGFJVXOATGZTHD-UHFFFAOYSA-N 0.000 description 2
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 2
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 2
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 2
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 2
- WCZXPVPHUMYLMS-VEVYYDQMSA-N Arg-Thr-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O WCZXPVPHUMYLMS-VEVYYDQMSA-N 0.000 description 2
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 2
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 2
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 2
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 2
- YJRORCOAFUZVKA-FXQIFTODSA-N Asn-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N YJRORCOAFUZVKA-FXQIFTODSA-N 0.000 description 2
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 2
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 2
- CQMQJWRCRQSBAF-BPUTZDHNSA-N Asn-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N CQMQJWRCRQSBAF-BPUTZDHNSA-N 0.000 description 2
- CUQUEHYSSFETRD-ACZMJKKPSA-N Asn-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N CUQUEHYSSFETRD-ACZMJKKPSA-N 0.000 description 2
- WQSCVMQDZYTFQU-FXQIFTODSA-N Asn-Cys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WQSCVMQDZYTFQU-FXQIFTODSA-N 0.000 description 2
- ZPMNECSEJXXNBE-CIUDSAMLSA-N Asn-Cys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZPMNECSEJXXNBE-CIUDSAMLSA-N 0.000 description 2
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 2
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 2
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 2
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 2
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 2
- ICDDSTLEMLGSTB-GUBZILKMSA-N Asn-Met-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ICDDSTLEMLGSTB-GUBZILKMSA-N 0.000 description 2
- OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 description 2
- BSBNNPICFPXDNH-SRVKXCTJSA-N Asn-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N BSBNNPICFPXDNH-SRVKXCTJSA-N 0.000 description 2
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 description 2
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 2
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 2
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 2
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 2
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 2
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 2
- IPPFAOCLQSGHJV-WFBYXXMGSA-N Asn-Trp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O IPPFAOCLQSGHJV-WFBYXXMGSA-N 0.000 description 2
- DAYDURRBMDCCFL-AAEUAGOBSA-N Asn-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N DAYDURRBMDCCFL-AAEUAGOBSA-N 0.000 description 2
- MLJZMGIXXMTEPO-UBHSHLNASA-N Asn-Trp-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O MLJZMGIXXMTEPO-UBHSHLNASA-N 0.000 description 2
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 2
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 2
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 2
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 2
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 2
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 2
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 2
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 2
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 2
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 2
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 2
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 2
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- KFAFUJMGHVVYRC-DCAQKATOSA-N Asp-Leu-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O KFAFUJMGHVVYRC-DCAQKATOSA-N 0.000 description 2
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 2
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 2
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 2
- IOXWDLNHXZOXQP-FXQIFTODSA-N Asp-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N IOXWDLNHXZOXQP-FXQIFTODSA-N 0.000 description 2
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 2
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 2
- LIQNMKIBMPEOOP-IHRRRGAJSA-N Asp-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)O)N LIQNMKIBMPEOOP-IHRRRGAJSA-N 0.000 description 2
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 2
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 2
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 2
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 2
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 2
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 2
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 2
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 2
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 2
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 2
- 241000020089 Atacta Species 0.000 description 2
- 241000283707 Capra Species 0.000 description 2
- GEEXORWTBTUOHC-FXQIFTODSA-N Cys-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N GEEXORWTBTUOHC-FXQIFTODSA-N 0.000 description 2
- LKUCSUGWHYVYLP-GHCJXIJMSA-N Cys-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N LKUCSUGWHYVYLP-GHCJXIJMSA-N 0.000 description 2
- HEPLXMBVMCXTBP-QWRGUYRKSA-N Cys-Phe-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O HEPLXMBVMCXTBP-QWRGUYRKSA-N 0.000 description 2
- FTTZLFIEUQHLHH-BWBBJGPYSA-N Cys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O FTTZLFIEUQHLHH-BWBBJGPYSA-N 0.000 description 2
- UGPCUUWZXRMCIJ-KKUMJFAQSA-N Cys-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CS)N UGPCUUWZXRMCIJ-KKUMJFAQSA-N 0.000 description 2
- LPBUBIHAVKXUOT-FXQIFTODSA-N Cys-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N LPBUBIHAVKXUOT-FXQIFTODSA-N 0.000 description 2
- YTMBNLHIDIKJIU-HCXYKTFWSA-N D-Arginyl-L-arginyl-D-glutaminyl-L-phenylalanine Chemical compound NC(=N)NCCC[C@@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](CCC(O)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YTMBNLHIDIKJIU-HCXYKTFWSA-N 0.000 description 2
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 2
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 102100031780 Endonuclease Human genes 0.000 description 2
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 2
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 2
- CKNUKHBRCSMKMO-XHNCKOQMSA-N Gln-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O CKNUKHBRCSMKMO-XHNCKOQMSA-N 0.000 description 2
- RMOCFPBLHAOTDU-ACZMJKKPSA-N Gln-Asn-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RMOCFPBLHAOTDU-ACZMJKKPSA-N 0.000 description 2
- ZRXBYKAOFHLTDN-GUBZILKMSA-N Gln-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N ZRXBYKAOFHLTDN-GUBZILKMSA-N 0.000 description 2
- GPISLLFQNHELLK-DCAQKATOSA-N Gln-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N GPISLLFQNHELLK-DCAQKATOSA-N 0.000 description 2
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 2
- XWIBVSAEUCAAKF-GVXVVHGQSA-N Gln-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N XWIBVSAEUCAAKF-GVXVVHGQSA-N 0.000 description 2
- TWTWUBHEWQPMQW-ZPFDUUQYSA-N Gln-Ile-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWTWUBHEWQPMQW-ZPFDUUQYSA-N 0.000 description 2
- MWERYIXRDZDXOA-QEWYBTABSA-N Gln-Ile-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MWERYIXRDZDXOA-QEWYBTABSA-N 0.000 description 2
- XZLLTYBONVKGLO-SDDRHHMPSA-N Gln-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XZLLTYBONVKGLO-SDDRHHMPSA-N 0.000 description 2
- WHVLABLIJYGVEK-QEWYBTABSA-N Gln-Phe-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WHVLABLIJYGVEK-QEWYBTABSA-N 0.000 description 2
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 2
- KVQOVQVGVKDZNW-GUBZILKMSA-N Gln-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KVQOVQVGVKDZNW-GUBZILKMSA-N 0.000 description 2
- XMWNHGKDDIFXQJ-NWLDYVSISA-N Gln-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XMWNHGKDDIFXQJ-NWLDYVSISA-N 0.000 description 2
- RBSKVTZUFMIWFU-XEGUGMAKSA-N Gln-Trp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O RBSKVTZUFMIWFU-XEGUGMAKSA-N 0.000 description 2
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 2
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 2
- QZQYITIKPAUDGN-GVXVVHGQSA-N Gln-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QZQYITIKPAUDGN-GVXVVHGQSA-N 0.000 description 2
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 2
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 2
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 2
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 description 2
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 2
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 2
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 2
- PBFGQTGPSKWHJA-QEJZJMRPSA-N Glu-Asp-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PBFGQTGPSKWHJA-QEJZJMRPSA-N 0.000 description 2
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 2
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 2
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- LYCDZGLXQBPNQU-WDSKDSINSA-N Glu-Gly-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O LYCDZGLXQBPNQU-WDSKDSINSA-N 0.000 description 2
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 2
- XOFYVODYSNKPDK-AVGNSLFASA-N Glu-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XOFYVODYSNKPDK-AVGNSLFASA-N 0.000 description 2
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 2
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 2
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 2
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 2
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 2
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 2
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 2
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 2
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 2
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 2
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 2
- HQOGXFLBAKJUMH-CIUDSAMLSA-N Glu-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N HQOGXFLBAKJUMH-CIUDSAMLSA-N 0.000 description 2
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 2
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 2
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 2
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 2
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 2
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 2
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 2
- HGJREIGJLUQBTJ-SZMVWBNQSA-N Glu-Trp-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O HGJREIGJLUQBTJ-SZMVWBNQSA-N 0.000 description 2
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 2
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 2
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 2
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 2
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 2
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 2
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 2
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 2
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 2
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 2
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 2
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 2
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 2
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 2
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 2
- ICUTTWWCDIIIEE-BQBZGAKWSA-N Gly-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN ICUTTWWCDIIIEE-BQBZGAKWSA-N 0.000 description 2
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 2
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 2
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 2
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 2
- WSWWTQYHFCBKBT-DVJZZOLTSA-N Gly-Thr-Trp Chemical compound C[C@@H](O)[C@H](NC(=O)CN)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O WSWWTQYHFCBKBT-DVJZZOLTSA-N 0.000 description 2
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 2
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 2
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 2
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 2
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 2
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 2
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 2
- SVHKVHBPTOMLTO-DCAQKATOSA-N His-Arg-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SVHKVHBPTOMLTO-DCAQKATOSA-N 0.000 description 2
- UCDWNBFOZCZSNV-AVGNSLFASA-N His-Arg-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O UCDWNBFOZCZSNV-AVGNSLFASA-N 0.000 description 2
- MWWOPNQSBXEUHO-ULQDDVLXSA-N His-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 MWWOPNQSBXEUHO-ULQDDVLXSA-N 0.000 description 2
- OBTMRGFRLJBSFI-GARJFASQSA-N His-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O OBTMRGFRLJBSFI-GARJFASQSA-N 0.000 description 2
- LIEIYPBMQJLASB-SRVKXCTJSA-N His-Gln-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LIEIYPBMQJLASB-SRVKXCTJSA-N 0.000 description 2
- WGHJXSONOOTTCZ-JYJNAYRXSA-N His-Glu-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WGHJXSONOOTTCZ-JYJNAYRXSA-N 0.000 description 2
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 2
- SYIPVNMWBZXKMU-HJPIBITLSA-N His-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N SYIPVNMWBZXKMU-HJPIBITLSA-N 0.000 description 2
- PMWSGVRIMIFXQH-KKUMJFAQSA-N His-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 PMWSGVRIMIFXQH-KKUMJFAQSA-N 0.000 description 2
- CKRJBQJIGOEKMC-SRVKXCTJSA-N His-Lys-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CKRJBQJIGOEKMC-SRVKXCTJSA-N 0.000 description 2
- FBCURAVMSXNOLP-JYJNAYRXSA-N His-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N FBCURAVMSXNOLP-JYJNAYRXSA-N 0.000 description 2
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 2
- YEKYGQZUBCRNGH-DCAQKATOSA-N His-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CO)C(=O)O YEKYGQZUBCRNGH-DCAQKATOSA-N 0.000 description 2
- XSEAJSPAOTZXJE-IHPCNDPISA-N His-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)[C@H](CC4=CN=CN4)N XSEAJSPAOTZXJE-IHPCNDPISA-N 0.000 description 2
- GRRNUXAQVGOGFE-UHFFFAOYSA-N Hygromycin-B Natural products OC1C(NC)CC(N)C(O)C1OC1C2OC3(C(C(O)C(O)C(C(N)CO)O3)O)OC2C(O)C(CO)O1 GRRNUXAQVGOGFE-UHFFFAOYSA-N 0.000 description 2
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 2
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 2
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 2
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 2
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 2
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 2
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 2
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 2
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 2
- CTHAJJYOHOBUDY-GHCJXIJMSA-N Ile-Cys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N CTHAJJYOHOBUDY-GHCJXIJMSA-N 0.000 description 2
- SYVMEYAPXRRXAN-MXAVVETBSA-N Ile-Cys-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N SYVMEYAPXRRXAN-MXAVVETBSA-N 0.000 description 2
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 2
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 2
- ZGGWRNBSBOHIGH-HVTMNAMFSA-N Ile-Gln-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZGGWRNBSBOHIGH-HVTMNAMFSA-N 0.000 description 2
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 2
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 2
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 2
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 2
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 2
- UFRXVQGGPNSJRY-CYDGBPFRSA-N Ile-Met-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N UFRXVQGGPNSJRY-CYDGBPFRSA-N 0.000 description 2
- IMRKCLXPYOIHIF-ZPFDUUQYSA-N Ile-Met-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IMRKCLXPYOIHIF-ZPFDUUQYSA-N 0.000 description 2
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 2
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 2
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 2
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 2
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 2
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 2
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 2
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 2
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 2
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 2
- PMAOIIWHZHAPBT-HJPIBITLSA-N Ile-Tyr-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N PMAOIIWHZHAPBT-HJPIBITLSA-N 0.000 description 2
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 2
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 2
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 2
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 2
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 2
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- 101710128836 Large T antigen Proteins 0.000 description 2
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 2
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 2
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 2
- CNNQBZRGQATKNY-DCAQKATOSA-N Leu-Arg-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N CNNQBZRGQATKNY-DCAQKATOSA-N 0.000 description 2
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 2
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 2
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 2
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 2
- CUXRXAIAVYLVFD-ULQDDVLXSA-N Leu-Arg-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUXRXAIAVYLVFD-ULQDDVLXSA-N 0.000 description 2
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 2
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 2
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 2
- GBDMISNMNXVTNV-XIRDDKMYSA-N Leu-Asp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GBDMISNMNXVTNV-XIRDDKMYSA-N 0.000 description 2
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 2
- QKIBIXAQKAFZGL-GUBZILKMSA-N Leu-Cys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QKIBIXAQKAFZGL-GUBZILKMSA-N 0.000 description 2
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 2
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 2
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 2
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 2
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 2
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 2
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 2
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 2
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 2
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 2
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 2
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 2
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 2
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 2
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 2
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 2
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 2
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 2
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 2
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 2
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 2
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 2
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 2
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 2
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 2
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 2
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 2
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 2
- URHJPNHRQMQGOZ-RHYQMDGZSA-N Leu-Thr-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O URHJPNHRQMQGOZ-RHYQMDGZSA-N 0.000 description 2
- WBRJVRXEGQIDRK-XIRDDKMYSA-N Leu-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 WBRJVRXEGQIDRK-XIRDDKMYSA-N 0.000 description 2
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 2
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 2
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 2
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 2
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 2
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 2
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 2
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 2
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 2
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 2
- PXHCFKXNSBJSTQ-KKUMJFAQSA-N Lys-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)O PXHCFKXNSBJSTQ-KKUMJFAQSA-N 0.000 description 2
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 2
- BYEBKXRNDLTGFW-CIUDSAMLSA-N Lys-Cys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O BYEBKXRNDLTGFW-CIUDSAMLSA-N 0.000 description 2
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 2
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 2
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 2
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 2
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 2
- AEIIJFBQVGYVEV-YESZJQIVSA-N Lys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCCN)N)C(=O)O AEIIJFBQVGYVEV-YESZJQIVSA-N 0.000 description 2
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 description 2
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 2
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 2
- LKDXINHHSWFFJC-SRVKXCTJSA-N Lys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N LKDXINHHSWFFJC-SRVKXCTJSA-N 0.000 description 2
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 2
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 2
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 2
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 2
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 2
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 2
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 2
- IIPHCNKHEZYSNE-DCAQKATOSA-N Met-Arg-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O IIPHCNKHEZYSNE-DCAQKATOSA-N 0.000 description 2
- DCHHUGLTVLJYKA-FXQIFTODSA-N Met-Asn-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DCHHUGLTVLJYKA-FXQIFTODSA-N 0.000 description 2
- UAPZLLPGGOOCRO-IHRRRGAJSA-N Met-Asn-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N UAPZLLPGGOOCRO-IHRRRGAJSA-N 0.000 description 2
- IHITVQKJXQQGLJ-LPEHRKFASA-N Met-Asn-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N IHITVQKJXQQGLJ-LPEHRKFASA-N 0.000 description 2
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 2
- RPEPZINUYHUBKG-FXQIFTODSA-N Met-Cys-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O RPEPZINUYHUBKG-FXQIFTODSA-N 0.000 description 2
- AVTWKENDGGUWDC-BQBZGAKWSA-N Met-Cys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O AVTWKENDGGUWDC-BQBZGAKWSA-N 0.000 description 2
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 2
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 2
- WNJXJJSGUXAIQU-UFYCRDLUSA-N Met-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 WNJXJJSGUXAIQU-UFYCRDLUSA-N 0.000 description 2
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 2
- SOAYQFDWEIWPPR-IHRRRGAJSA-N Met-Ser-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SOAYQFDWEIWPPR-IHRRRGAJSA-N 0.000 description 2
- RIIFMEBFDDXGCV-VEVYYDQMSA-N Met-Thr-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O RIIFMEBFDDXGCV-VEVYYDQMSA-N 0.000 description 2
- WSPQHZOMTFFWGH-XGEHTFHBSA-N Met-Thr-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(O)=O WSPQHZOMTFFWGH-XGEHTFHBSA-N 0.000 description 2
- NSMXRFMGZYTFEX-KJEVXHAQSA-N Met-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCSC)N)O NSMXRFMGZYTFEX-KJEVXHAQSA-N 0.000 description 2
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 2
- 241000725171 Mokola lyssavirus Species 0.000 description 2
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 108700001237 Nucleic Acid-Based Vaccines Proteins 0.000 description 2
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 2
- 102000057297 Pepsin A Human genes 0.000 description 2
- 108090000284 Pepsin A Proteins 0.000 description 2
- JVTMTFMMMHAPCR-UBHSHLNASA-N Phe-Ala-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JVTMTFMMMHAPCR-UBHSHLNASA-N 0.000 description 2
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 2
- BRDYYVQTEJVRQT-HRCADAONSA-N Phe-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BRDYYVQTEJVRQT-HRCADAONSA-N 0.000 description 2
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 2
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 2
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 2
- IDUCUXTUHHIQIP-SOUVJXGZSA-N Phe-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O IDUCUXTUHHIQIP-SOUVJXGZSA-N 0.000 description 2
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 2
- CSDMCMITJLKBAH-SOUVJXGZSA-N Phe-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O CSDMCMITJLKBAH-SOUVJXGZSA-N 0.000 description 2
- PMKIMKUGCSVFSV-CQDKDKBSSA-N Phe-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PMKIMKUGCSVFSV-CQDKDKBSSA-N 0.000 description 2
- BWTKUQPNOMMKMA-FIRPJDEBSA-N Phe-Ile-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BWTKUQPNOMMKMA-FIRPJDEBSA-N 0.000 description 2
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 2
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 2
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 2
- OXKJSGGTHFMGDT-UFYCRDLUSA-N Phe-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C1=CC=CC=C1 OXKJSGGTHFMGDT-UFYCRDLUSA-N 0.000 description 2
- OWSLLRKCHLTUND-BZSNNMDCSA-N Phe-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OWSLLRKCHLTUND-BZSNNMDCSA-N 0.000 description 2
- TXJJXEXCZBHDNA-ACRUOGEOSA-N Phe-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N TXJJXEXCZBHDNA-ACRUOGEOSA-N 0.000 description 2
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 2
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 2
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 2
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 2
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 2
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 2
- YDUGVDGFKNXFPL-IXOXFDKPSA-N Phe-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YDUGVDGFKNXFPL-IXOXFDKPSA-N 0.000 description 2
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 2
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 2
- 230000010748 Photoabsorption Effects 0.000 description 2
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 2
- WPQKSRHDTMRSJM-CIUDSAMLSA-N Pro-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 WPQKSRHDTMRSJM-CIUDSAMLSA-N 0.000 description 2
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 2
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 2
- GQLOZEMWEBDEAY-NAKRPEOUSA-N Pro-Cys-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GQLOZEMWEBDEAY-NAKRPEOUSA-N 0.000 description 2
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 2
- VPFGPKIWSDVTOY-SRVKXCTJSA-N Pro-Glu-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O VPFGPKIWSDVTOY-SRVKXCTJSA-N 0.000 description 2
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 2
- QGOZJLYCGRYYRW-KKUMJFAQSA-N Pro-Glu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QGOZJLYCGRYYRW-KKUMJFAQSA-N 0.000 description 2
- AQSMZTIEJMZQEC-DCAQKATOSA-N Pro-His-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O AQSMZTIEJMZQEC-DCAQKATOSA-N 0.000 description 2
- LPGSNRSLPHRNBW-AVGNSLFASA-N Pro-His-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 LPGSNRSLPHRNBW-AVGNSLFASA-N 0.000 description 2
- KWMUAKQOVYCQJQ-ZPFDUUQYSA-N Pro-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 KWMUAKQOVYCQJQ-ZPFDUUQYSA-N 0.000 description 2
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 2
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 2
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 2
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 2
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 2
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 2
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 2
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 2
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 2
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 2
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 2
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 2
- MDAWMJUZHBQTBO-XGEHTFHBSA-N Pro-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1)O MDAWMJUZHBQTBO-XGEHTFHBSA-N 0.000 description 2
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 2
- QHSSUIHLAIWXEE-IHRRRGAJSA-N Pro-Tyr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O QHSSUIHLAIWXEE-IHRRRGAJSA-N 0.000 description 2
- 108010009460 RNA Polymerase II Proteins 0.000 description 2
- 102000009572 RNA Polymerase II Human genes 0.000 description 2
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 241000711931 Rhabdoviridae Species 0.000 description 2
- 108091028664 Ribonucleotide Proteins 0.000 description 2
- 241000315672 SARS coronavirus Species 0.000 description 2
- 229920002684 Sepharose Polymers 0.000 description 2
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 2
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 2
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 2
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 2
- RZUOXAKGNHXZTB-GUBZILKMSA-N Ser-Arg-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O RZUOXAKGNHXZTB-GUBZILKMSA-N 0.000 description 2
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 2
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 2
- DSSOYPJWSWFOLK-CIUDSAMLSA-N Ser-Cys-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O DSSOYPJWSWFOLK-CIUDSAMLSA-N 0.000 description 2
- RFBKULCUBJAQFT-BIIVOSGPSA-N Ser-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CO)N)C(=O)O RFBKULCUBJAQFT-BIIVOSGPSA-N 0.000 description 2
- BRIZMMZEYSAKJX-QEJZJMRPSA-N Ser-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N BRIZMMZEYSAKJX-QEJZJMRPSA-N 0.000 description 2
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 2
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 2
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 2
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 2
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 2
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 2
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 2
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 2
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 2
- YMDNFPNTIPQMJP-NAKRPEOUSA-N Ser-Ile-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O YMDNFPNTIPQMJP-NAKRPEOUSA-N 0.000 description 2
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 2
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 2
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 2
- GVIGVIOEYBOTCB-XIRDDKMYSA-N Ser-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC(C)C)C(O)=O)=CNC2=C1 GVIGVIOEYBOTCB-XIRDDKMYSA-N 0.000 description 2
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 2
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 2
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 2
- JUTGONBTALQWMK-NAKRPEOUSA-N Ser-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N JUTGONBTALQWMK-NAKRPEOUSA-N 0.000 description 2
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 2
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 2
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 2
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 2
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 2
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 2
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 2
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 2
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 2
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 2
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 2
- BCAVNDNYOGTQMQ-AAEUAGOBSA-N Ser-Trp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O BCAVNDNYOGTQMQ-AAEUAGOBSA-N 0.000 description 2
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 2
- SGZVZUCRAVSPKQ-FXQIFTODSA-N Ser-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N SGZVZUCRAVSPKQ-FXQIFTODSA-N 0.000 description 2
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 2
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- 108020004459 Small interfering RNA Proteins 0.000 description 2
- 101710172711 Structural protein Proteins 0.000 description 2
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 2
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 2
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 2
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 2
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 2
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 2
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 2
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 2
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 2
- KWQBJOUOSNJDRR-XAVMHZPKSA-N Thr-Cys-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N)O KWQBJOUOSNJDRR-XAVMHZPKSA-N 0.000 description 2
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 2
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 2
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 2
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 2
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 2
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 2
- YUOCMLNTUZAGNF-KLHWPWHYSA-N Thr-His-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N)O YUOCMLNTUZAGNF-KLHWPWHYSA-N 0.000 description 2
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 2
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 2
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 2
- UXUAZXWKIGPUCH-RCWTZXSCSA-N Thr-Met-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O UXUAZXWKIGPUCH-RCWTZXSCSA-N 0.000 description 2
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 2
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 2
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 2
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 2
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 2
- YGZWVPBHYABGLT-KJEVXHAQSA-N Thr-Pro-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YGZWVPBHYABGLT-KJEVXHAQSA-N 0.000 description 2
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 2
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 2
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 2
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 2
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 2
- BJJRNAVDQGREGC-HOUAVDHOSA-N Thr-Trp-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O BJJRNAVDQGREGC-HOUAVDHOSA-N 0.000 description 2
- ZEJBJDHSQPOVJV-UAXMHLISSA-N Thr-Trp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZEJBJDHSQPOVJV-UAXMHLISSA-N 0.000 description 2
- BZTSQFWJNJYZSX-JRQIVUDYSA-N Thr-Tyr-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O BZTSQFWJNJYZSX-JRQIVUDYSA-N 0.000 description 2
- OMRWDMWXRWTQIU-YJRXYDGGSA-N Thr-Tyr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N)O OMRWDMWXRWTQIU-YJRXYDGGSA-N 0.000 description 2
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 2
- YOPQYBJJNSIQGZ-JNPHEJMOSA-N Thr-Tyr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 YOPQYBJJNSIQGZ-JNPHEJMOSA-N 0.000 description 2
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 2
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- 108700009124 Transcription Initiation Site Proteins 0.000 description 2
- 108700029229 Transcriptional Regulatory Elements Proteins 0.000 description 2
- PEYSVKMXSLPQRU-FJHTZYQYSA-N Trp-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O PEYSVKMXSLPQRU-FJHTZYQYSA-N 0.000 description 2
- HYNAKPYFEYJMAS-XIRDDKMYSA-N Trp-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HYNAKPYFEYJMAS-XIRDDKMYSA-N 0.000 description 2
- XKKBFNPJFZLTMY-CWRNSKLLSA-N Trp-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O XKKBFNPJFZLTMY-CWRNSKLLSA-N 0.000 description 2
- UDCHKDYNMRJYMI-QEJZJMRPSA-N Trp-Glu-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UDCHKDYNMRJYMI-QEJZJMRPSA-N 0.000 description 2
- YTZYHKOSHOXTHA-TUSQITKMSA-N Trp-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=3C4=CC=CC=C4NC=3)CC(C)C)C(O)=O)=CNC2=C1 YTZYHKOSHOXTHA-TUSQITKMSA-N 0.000 description 2
- KWTRGSQOQHZKIA-PMVMPFDFSA-N Trp-Lys-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CCCCN)C(O)=O)C1=CC=C(O)C=C1 KWTRGSQOQHZKIA-PMVMPFDFSA-N 0.000 description 2
- WHJVRIBYQWHRQA-NQCBNZPSSA-N Trp-Phe-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=CC=C1 WHJVRIBYQWHRQA-NQCBNZPSSA-N 0.000 description 2
- KBKTUNYBNJWFRL-UBHSHLNASA-N Trp-Ser-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 KBKTUNYBNJWFRL-UBHSHLNASA-N 0.000 description 2
- GEGYPBOPIGNZIF-CWRNSKLLSA-N Trp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O GEGYPBOPIGNZIF-CWRNSKLLSA-N 0.000 description 2
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 2
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 2
- JBBYKPZAPOLCPK-JYJNAYRXSA-N Tyr-Arg-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O JBBYKPZAPOLCPK-JYJNAYRXSA-N 0.000 description 2
- OEVJGIHPQOXYFE-SRVKXCTJSA-N Tyr-Asn-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OEVJGIHPQOXYFE-SRVKXCTJSA-N 0.000 description 2
- BARBHMSSVWPKPZ-IHRRRGAJSA-N Tyr-Asp-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BARBHMSSVWPKPZ-IHRRRGAJSA-N 0.000 description 2
- NGALWFGCOMHUSN-AVGNSLFASA-N Tyr-Gln-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NGALWFGCOMHUSN-AVGNSLFASA-N 0.000 description 2
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 2
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 2
- SFSZDJHNAICYSD-PMVMPFDFSA-N Tyr-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CC4=CC=C(C=C4)O)N SFSZDJHNAICYSD-PMVMPFDFSA-N 0.000 description 2
- HVPPEXXUDXAPOM-MGHWNKPDSA-N Tyr-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HVPPEXXUDXAPOM-MGHWNKPDSA-N 0.000 description 2
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 2
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 2
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 2
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 2
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 2
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 2
- IEWKKXZRJLTIOV-AVGNSLFASA-N Tyr-Ser-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O IEWKKXZRJLTIOV-AVGNSLFASA-N 0.000 description 2
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 2
- WOCYUGQDXPTQPY-FXQIFTODSA-N Val-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N WOCYUGQDXPTQPY-FXQIFTODSA-N 0.000 description 2
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 2
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 2
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 2
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 2
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 2
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 2
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 2
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 2
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 2
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 2
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 2
- OXGVAUFVTOPFFA-XPUUQOCRSA-N Val-Gly-Cys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OXGVAUFVTOPFFA-XPUUQOCRSA-N 0.000 description 2
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 2
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 2
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 2
- CHWRZUGUMAMTFC-IHRRRGAJSA-N Val-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CNC=N1 CHWRZUGUMAMTFC-IHRRRGAJSA-N 0.000 description 2
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 2
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 2
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 2
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 2
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 2
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 2
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 2
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 2
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 2
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 2
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 2
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 2
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 2
- AYHNXCJKBLYVOA-KSZLIROESA-N Val-Trp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N AYHNXCJKBLYVOA-KSZLIROESA-N 0.000 description 2
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 2
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 2
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 2
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 2
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 2
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 2
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 2
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 2
- 108020000999 Viral RNA Proteins 0.000 description 2
- 108010084455 Zeocin Proteins 0.000 description 2
- 230000002378 acidificating effect Effects 0.000 description 2
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine group Chemical group [C@@H]1([C@H](O)[C@H](O)[C@@H](CO)O1)N1C=NC=2C(N)=NC=NC12 OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 2
- 230000002411 adverse Effects 0.000 description 2
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 239000003708 ampul Substances 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 2
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 2
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 2
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010036533 arginylvaline Proteins 0.000 description 2
- 125000003118 aryl group Chemical group 0.000 description 2
- 235000009582 asparagine Nutrition 0.000 description 2
- 229960001230 asparagine Drugs 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 235000003704 aspartic acid Nutrition 0.000 description 2
- 108010021908 aspartyl-aspartyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 2
- 239000012472 biological sample Substances 0.000 description 2
- 239000011575 calcium Substances 0.000 description 2
- 244000309466 calf Species 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- CRQQGFGUEAVUIL-UHFFFAOYSA-N chlorothalonil Chemical compound ClC1=C(Cl)C(C#N)=C(Cl)C(C#N)=C1Cl CRQQGFGUEAVUIL-UHFFFAOYSA-N 0.000 description 2
- 230000000052 comparative effect Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 238000005859 coupling reaction Methods 0.000 description 2
- 108010004073 cysteinylcysteine Proteins 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- 238000004043 dyeing Methods 0.000 description 2
- 230000012202 endocytosis Effects 0.000 description 2
- 210000001163 endosome Anatomy 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 238000011049 filling Methods 0.000 description 2
- 238000006062 fragmentation reaction Methods 0.000 description 2
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 238000001415 gene therapy Methods 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 2
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 2
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 2
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- GRRNUXAQVGOGFE-NZSRVPFOSA-N hygromycin B Chemical compound O[C@@H]1[C@@H](NC)C[C@@H](N)[C@H](O)[C@H]1O[C@H]1[C@H]2O[C@@]3([C@@H]([C@@H](O)[C@@H](O)[C@@H](C(N)CO)O3)O)O[C@H]2[C@@H](O)[C@@H](CO)O1 GRRNUXAQVGOGFE-NZSRVPFOSA-N 0.000 description 2
- 229940097277 hygromycin b Drugs 0.000 description 2
- 230000005847 immunogenicity Effects 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 230000002458 infectious effect Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 239000012212 insulator Substances 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 238000010255 intramuscular injection Methods 0.000 description 2
- 239000007927 intramuscular injection Substances 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 108010091871 leucylmethionine Proteins 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 239000002502 liposome Substances 0.000 description 2
- 210000003712 lysosome Anatomy 0.000 description 2
- 230000001868 lysosomic effect Effects 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- HQKMJHAJHXVSDF-UHFFFAOYSA-L magnesium stearate Chemical compound [Mg+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O HQKMJHAJHXVSDF-UHFFFAOYSA-L 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- 108010085203 methionylmethionine Proteins 0.000 description 2
- 230000005012 migration Effects 0.000 description 2
- 238000013508 migration Methods 0.000 description 2
- 229940126619 mouse monoclonal antibody Drugs 0.000 description 2
- 239000010813 municipal solid waste Substances 0.000 description 2
- 231100000252 nontoxic Toxicity 0.000 description 2
- 230000003000 nontoxic effect Effects 0.000 description 2
- 238000007899 nucleic acid hybridization Methods 0.000 description 2
- 238000011330 nucleic acid test Methods 0.000 description 2
- 229940023146 nucleic acid vaccine Drugs 0.000 description 2
- 239000002751 oligonucleotide probe Substances 0.000 description 2
- 230000008520 organization Effects 0.000 description 2
- 230000007918 pathogenicity Effects 0.000 description 2
- 229940111202 pepsin Drugs 0.000 description 2
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 2
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 2
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 2
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 2
- 108010084572 phenylalanyl-valine Proteins 0.000 description 2
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 2
- CWCMIVBLVUHDHK-ZSNHEYEWSA-N phleomycin D1 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC[C@@H](N=1)C=1SC=C(N=1)C(=O)NCCCCNC(N)=N)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C CWCMIVBLVUHDHK-ZSNHEYEWSA-N 0.000 description 2
- 239000013600 plasmid vector Substances 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 2
- 239000011541 reaction mixture Substances 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 239000002336 ribonucleotide Substances 0.000 description 2
- 125000002652 ribonucleotide group Chemical group 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 238000002864 sequence alignment Methods 0.000 description 2
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 2
- 230000003584 silencer Effects 0.000 description 2
- 238000004088 simulation Methods 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 230000004936 stimulating effect Effects 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 230000008093 supporting effect Effects 0.000 description 2
- 230000001225 therapeutic effect Effects 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 210000001541 thymus gland Anatomy 0.000 description 2
- 230000005026 transcription initiation Effects 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 239000012096 transfection reagent Substances 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 108010077037 tyrosyl-tyrosyl-phenylalanine Proteins 0.000 description 2
- 239000012646 vaccine adjuvant Substances 0.000 description 2
- 229940124931 vaccine adjuvant Drugs 0.000 description 2
- 230000001018 virulence Effects 0.000 description 2
- BLSQLHNBWJLIBQ-OZXSUGGESA-N (2R,4S)-terconazole Chemical compound C1CN(C(C)C)CCN1C(C=C1)=CC=C1OC[C@@H]1O[C@@](CN2N=CN=C2)(C=2C(=CC(Cl)=CC=2)Cl)OC1 BLSQLHNBWJLIBQ-OZXSUGGESA-N 0.000 description 1
- FDKWRPBBCBCIGA-REOHCLBHSA-N (2r)-2-azaniumyl-3-$l^{1}-selanylpropanoate Chemical compound [Se]C[C@H](N)C(O)=O FDKWRPBBCBCIGA-REOHCLBHSA-N 0.000 description 1
- SADYNMDJGAWAEW-JKQORVJESA-N (2s)-2-[[(2s)-3-carboxy-2-[[(2s)-2-[[(2s)-2,6-diaminohexanoyl]amino]-3-methylbutanoyl]amino]propanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN SADYNMDJGAWAEW-JKQORVJESA-N 0.000 description 1
- WACNXHCZHTVBJM-UHFFFAOYSA-N 1,2,3,4,5-pentafluorobenzene Chemical compound FC1=CC(F)=C(F)C(F)=C1F WACNXHCZHTVBJM-UHFFFAOYSA-N 0.000 description 1
- YEJQWBFDKKTPNO-UHFFFAOYSA-N 2-[[2-[[1-(2-amino-3-methylbutanoyl)pyrrolidine-2-carbonyl]amino]acetyl]amino]-3-methylbutanoic acid Chemical compound CC(C)C(N)C(=O)N1CCCC1C(=O)NCC(=O)NC(C(C)C)C(O)=O YEJQWBFDKKTPNO-UHFFFAOYSA-N 0.000 description 1
- 125000001622 2-naphthyl group Chemical group [H]C1=C([H])C([H])=C2C([H])=C(*)C([H])=C([H])C2=C1[H] 0.000 description 1
- 241000023308 Acca Species 0.000 description 1
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- FSBCNCKIQZZASN-GUBZILKMSA-N Ala-Arg-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O FSBCNCKIQZZASN-GUBZILKMSA-N 0.000 description 1
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 1
- OQCPATDFWYYDDX-HGNGGELXSA-N Ala-Gln-His Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OQCPATDFWYYDDX-HGNGGELXSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- FDAZDMAFZYTHGS-XVYDVKMFSA-N Ala-His-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FDAZDMAFZYTHGS-XVYDVKMFSA-N 0.000 description 1
- LTSBJNNXPBBNDT-HGNGGELXSA-N Ala-His-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)O LTSBJNNXPBBNDT-HGNGGELXSA-N 0.000 description 1
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 1
- FAJIYNONGXEXAI-CQDKDKBSSA-N Ala-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 FAJIYNONGXEXAI-CQDKDKBSSA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- OPZJWMJPCNNZNT-DCAQKATOSA-N Ala-Leu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N OPZJWMJPCNNZNT-DCAQKATOSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- UWIQWPWWZUHBAO-ZLIFDBKOSA-N Ala-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)CC(C)C)C(O)=O)=CNC2=C1 UWIQWPWWZUHBAO-ZLIFDBKOSA-N 0.000 description 1
- RGQCNKIDEQJEBT-CQDKDKBSSA-N Ala-Leu-Tyr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RGQCNKIDEQJEBT-CQDKDKBSSA-N 0.000 description 1
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 1
- NEBFIUZIGRTIFY-BJDJZHNGSA-N Ala-Met-Ser-Arg Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N NEBFIUZIGRTIFY-BJDJZHNGSA-N 0.000 description 1
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 1
- XAXHGSOBFPIRFG-LSJOCFKGSA-N Ala-Pro-His Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XAXHGSOBFPIRFG-LSJOCFKGSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- WZGZDOXCDLLTHE-SYWGBEHUSA-N Ala-Trp-Ile Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 WZGZDOXCDLLTHE-SYWGBEHUSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- 101100162403 Arabidopsis thaliana ALEU gene Proteins 0.000 description 1
- BHSYMWWMVRPCPA-CYDGBPFRSA-N Arg-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N BHSYMWWMVRPCPA-CYDGBPFRSA-N 0.000 description 1
- HJWQFFYRVFEWRM-SRVKXCTJSA-N Arg-Arg-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O HJWQFFYRVFEWRM-SRVKXCTJSA-N 0.000 description 1
- PVSNBTCXCQIXSE-JYJNAYRXSA-N Arg-Arg-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PVSNBTCXCQIXSE-JYJNAYRXSA-N 0.000 description 1
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 1
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- XTGGTAWGUFXJSV-NAKRPEOUSA-N Arg-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N XTGGTAWGUFXJSV-NAKRPEOUSA-N 0.000 description 1
- BJNUAWGXPSHQMJ-DCAQKATOSA-N Arg-Gln-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O BJNUAWGXPSHQMJ-DCAQKATOSA-N 0.000 description 1
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 1
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 1
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 1
- PCQXGEUALSFGIA-WDSOQIARSA-N Arg-His-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PCQXGEUALSFGIA-WDSOQIARSA-N 0.000 description 1
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 1
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 1
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 1
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 1
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 1
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 1
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- JPAWCMXVNZPJLO-IHRRRGAJSA-N Arg-Ser-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JPAWCMXVNZPJLO-IHRRRGAJSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- KSHJMDSNSKDJPU-QTKMDUPCSA-N Arg-Thr-His Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KSHJMDSNSKDJPU-QTKMDUPCSA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- WTFIFQWLQXZLIZ-UMPQAUOISA-N Arg-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O WTFIFQWLQXZLIZ-UMPQAUOISA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- DRDWXKWUSIKKOB-PJODQICGSA-N Arg-Trp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O DRDWXKWUSIKKOB-PJODQICGSA-N 0.000 description 1
- UGJLILSJKSBVIR-ZFWWWQNUSA-N Arg-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)NCC(O)=O)=CNC2=C1 UGJLILSJKSBVIR-ZFWWWQNUSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- 102000003916 Arrestin Human genes 0.000 description 1
- 108090000328 Arrestin Proteins 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- QQEWINYJRFBLNN-DLOVCJGASA-N Asn-Ala-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QQEWINYJRFBLNN-DLOVCJGASA-N 0.000 description 1
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 1
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 1
- SPIPSJXLZVTXJL-ZLUOBGJFSA-N Asn-Cys-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O SPIPSJXLZVTXJL-ZLUOBGJFSA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 1
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 1
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 1
- XLHLPYFMXGOASD-CIUDSAMLSA-N Asn-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLHLPYFMXGOASD-CIUDSAMLSA-N 0.000 description 1
- MOHUTCNYQLMARY-GUBZILKMSA-N Asn-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MOHUTCNYQLMARY-GUBZILKMSA-N 0.000 description 1
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 1
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 1
- RAUPFUCUDBQYHE-AVGNSLFASA-N Asn-Phe-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RAUPFUCUDBQYHE-AVGNSLFASA-N 0.000 description 1
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 1
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 1
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 1
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 1
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 1
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 1
- MUWDILPCTSMUHI-ZLUOBGJFSA-N Asp-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)O MUWDILPCTSMUHI-ZLUOBGJFSA-N 0.000 description 1
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 1
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 1
- OEUQMKNNOWJREN-AVGNSLFASA-N Asp-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N OEUQMKNNOWJREN-AVGNSLFASA-N 0.000 description 1
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 1
- ZEDBMCPXPIYJLW-XHNCKOQMSA-N Asp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZEDBMCPXPIYJLW-XHNCKOQMSA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 1
- SAKCBXNPWDRWPE-BQBZGAKWSA-N Asp-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N SAKCBXNPWDRWPE-BQBZGAKWSA-N 0.000 description 1
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 1
- DWBZEJHQQIURML-IMJSIDKUSA-N Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O DWBZEJHQQIURML-IMJSIDKUSA-N 0.000 description 1
- NTQDELBZOMWXRS-IWGUZYHVSA-N Asp-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O NTQDELBZOMWXRS-IWGUZYHVSA-N 0.000 description 1
- KCOPOPKJRHVGPE-AQZXSJQPSA-N Asp-Thr-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O KCOPOPKJRHVGPE-AQZXSJQPSA-N 0.000 description 1
- KNDCWFXCFKSEBM-AVGNSLFASA-N Asp-Tyr-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O KNDCWFXCFKSEBM-AVGNSLFASA-N 0.000 description 1
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 1
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 1
- 241000295638 Australian bat lyssavirus Species 0.000 description 1
- 241000271566 Aves Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 241000282470 Canis latrans Species 0.000 description 1
- 101100056797 Canis lupus familiaris SAG gene Proteins 0.000 description 1
- 108010077544 Chromatin Proteins 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- PKNIZMPLMSKROD-BIIVOSGPSA-N Cys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N PKNIZMPLMSKROD-BIIVOSGPSA-N 0.000 description 1
- WDQXKVCQXRNOSI-GHCJXIJMSA-N Cys-Asp-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WDQXKVCQXRNOSI-GHCJXIJMSA-N 0.000 description 1
- MBILEVLLOHJZMG-FXQIFTODSA-N Cys-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MBILEVLLOHJZMG-FXQIFTODSA-N 0.000 description 1
- YZKOXEJTLWZOQL-GUBZILKMSA-N Cys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N YZKOXEJTLWZOQL-GUBZILKMSA-N 0.000 description 1
- PQHYZJPCYRDYNE-QWRGUYRKSA-N Cys-Gly-Phe Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PQHYZJPCYRDYNE-QWRGUYRKSA-N 0.000 description 1
- MRVSLWQRNWEROS-SVSWQMSJSA-N Cys-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CS)N MRVSLWQRNWEROS-SVSWQMSJSA-N 0.000 description 1
- KSMSFCBQBQPFAD-GUBZILKMSA-N Cys-Pro-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 KSMSFCBQBQPFAD-GUBZILKMSA-N 0.000 description 1
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 1
- DQGIAOGALAQBGK-BWBBJGPYSA-N Cys-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O DQGIAOGALAQBGK-BWBBJGPYSA-N 0.000 description 1
- IXPSSIBVVKSOIE-SRVKXCTJSA-N Cys-Ser-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O IXPSSIBVVKSOIE-SRVKXCTJSA-N 0.000 description 1
- JIVJQYNNAYFXDG-LKXGYXEUSA-N Cys-Thr-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JIVJQYNNAYFXDG-LKXGYXEUSA-N 0.000 description 1
- MJOYUXLETJMQGG-IHRRRGAJSA-N Cys-Tyr-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MJOYUXLETJMQGG-IHRRRGAJSA-N 0.000 description 1
- PCRVDEANNSYGTA-IHRRRGAJSA-N Cys-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CS)CC1=CC=C(O)C=C1 PCRVDEANNSYGTA-IHRRRGAJSA-N 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- FDKWRPBBCBCIGA-UWTATZPHSA-N D-Selenocysteine Natural products [Se]C[C@@H](N)C(O)=O FDKWRPBBCBCIGA-UWTATZPHSA-N 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 241001269238 Data Species 0.000 description 1
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 1
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 1
- SHIBSTMRCDJXLN-UHFFFAOYSA-N Digoxigenin Natural products C1CC(C2C(C3(C)CCC(O)CC3CC2)CC2O)(O)C2(C)C1C1=CC(=O)OC1 SHIBSTMRCDJXLN-UHFFFAOYSA-N 0.000 description 1
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 1
- 241001635598 Enicostema Species 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- 101001065501 Escherichia phage MS2 Lysis protein Proteins 0.000 description 1
- 108700039887 Essential Genes Proteins 0.000 description 1
- 241001520680 European bat lyssavirus Species 0.000 description 1
- 108010046276 FLP recombinase Proteins 0.000 description 1
- 108091006027 G proteins Proteins 0.000 description 1
- 102000030782 GTP binding Human genes 0.000 description 1
- 108091000058 GTP-Binding Proteins 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- UWZLBXOBVKRUFE-HGNGGELXSA-N Gln-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N UWZLBXOBVKRUFE-HGNGGELXSA-N 0.000 description 1
- DDNIZQDYXDENIT-FXQIFTODSA-N Gln-Glu-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N DDNIZQDYXDENIT-FXQIFTODSA-N 0.000 description 1
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 1
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 1
- ICDIMQAMJGDHSE-GUBZILKMSA-N Gln-His-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O ICDIMQAMJGDHSE-GUBZILKMSA-N 0.000 description 1
- TYRMVTKPOWPZBC-SXNHZJKMSA-N Gln-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)N)N TYRMVTKPOWPZBC-SXNHZJKMSA-N 0.000 description 1
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 1
- XBWGJWXGUNSZAT-CIUDSAMLSA-N Gln-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N XBWGJWXGUNSZAT-CIUDSAMLSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- GHAXJVNBAKGWEJ-AVGNSLFASA-N Gln-Ser-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GHAXJVNBAKGWEJ-AVGNSLFASA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- SRZLHYPAOXBBSB-HJGDQZAQSA-N Glu-Arg-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SRZLHYPAOXBBSB-HJGDQZAQSA-N 0.000 description 1
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- ZQYZDDXTNQXUJH-CIUDSAMLSA-N Glu-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)N ZQYZDDXTNQXUJH-CIUDSAMLSA-N 0.000 description 1
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 1
- TWYFJOHWGCCRIR-DCAQKATOSA-N Glu-Pro-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYFJOHWGCCRIR-DCAQKATOSA-N 0.000 description 1
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- ZAPFAWQHBOHWLL-GUBZILKMSA-N Glu-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N ZAPFAWQHBOHWLL-GUBZILKMSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- TZXOPHFCAATANZ-QEJZJMRPSA-N Glu-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N TZXOPHFCAATANZ-QEJZJMRPSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- VHPVBPCCWVDGJL-IRIUXVKKSA-N Glu-Thr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VHPVBPCCWVDGJL-IRIUXVKKSA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- PYUCNHJQQVSPGN-BQBZGAKWSA-N Gly-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)CN=C(N)N PYUCNHJQQVSPGN-BQBZGAKWSA-N 0.000 description 1
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 1
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- SJLKKOZFHSJJAW-YUMQZZPRSA-N Gly-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN SJLKKOZFHSJJAW-YUMQZZPRSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 1
- UOAVQQRILDGZEN-SRVKXCTJSA-N His-Asp-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UOAVQQRILDGZEN-SRVKXCTJSA-N 0.000 description 1
- WGVPDSNCHDEDBP-KKUMJFAQSA-N His-Asp-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WGVPDSNCHDEDBP-KKUMJFAQSA-N 0.000 description 1
- NNBWMLHQXBTIIT-HVTMNAMFSA-N His-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N NNBWMLHQXBTIIT-HVTMNAMFSA-N 0.000 description 1
- WEIYKCOEVBUJQC-JYJNAYRXSA-N His-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WEIYKCOEVBUJQC-JYJNAYRXSA-N 0.000 description 1
- IWXMHXYOACDSIA-PYJNHQTQSA-N His-Ile-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O IWXMHXYOACDSIA-PYJNHQTQSA-N 0.000 description 1
- UXSATKFPUVZVDK-KKUMJFAQSA-N His-Lys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N UXSATKFPUVZVDK-KKUMJFAQSA-N 0.000 description 1
- SLFSYFJKSIVSON-SRVKXCTJSA-N His-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SLFSYFJKSIVSON-SRVKXCTJSA-N 0.000 description 1
- WHKLDLQHSYAVGU-ACRUOGEOSA-N His-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WHKLDLQHSYAVGU-ACRUOGEOSA-N 0.000 description 1
- XIGFLVCAVQQGNS-IHRRRGAJSA-N His-Pro-His Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 XIGFLVCAVQQGNS-IHRRRGAJSA-N 0.000 description 1
- CHIAUHSHDARFBD-ULQDDVLXSA-N His-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 CHIAUHSHDARFBD-ULQDDVLXSA-N 0.000 description 1
- FHKZHRMERJUXRJ-DCAQKATOSA-N His-Ser-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 FHKZHRMERJUXRJ-DCAQKATOSA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- JVEKQAYXFGIISZ-HOCLYGCPSA-N His-Trp-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JVEKQAYXFGIISZ-HOCLYGCPSA-N 0.000 description 1
- CSRRMQFXMBPSIL-SIXJUCDHSA-N His-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CN=CN3)N CSRRMQFXMBPSIL-SIXJUCDHSA-N 0.000 description 1
- LNVILFYCPVOHPV-IHPCNDPISA-N His-Trp-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O LNVILFYCPVOHPV-IHPCNDPISA-N 0.000 description 1
- QLBXWYXMLHAREM-PYJNHQTQSA-N His-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N QLBXWYXMLHAREM-PYJNHQTQSA-N 0.000 description 1
- 108010033040 Histones Proteins 0.000 description 1
- 101000911390 Homo sapiens Coagulation factor VIII Proteins 0.000 description 1
- 206010053317 Hydrophobia Diseases 0.000 description 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 1
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 1
- HLYBGMZJVDHJEO-CYDGBPFRSA-N Ile-Arg-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HLYBGMZJVDHJEO-CYDGBPFRSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 1
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 1
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 1
- AMSYMDIIIRJRKZ-HJPIBITLSA-N Ile-His-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AMSYMDIIIRJRKZ-HJPIBITLSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- KBAPKNDWAGVGTH-IGISWZIWSA-N Ile-Ile-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KBAPKNDWAGVGTH-IGISWZIWSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 1
- WVUDHMBJNBWZBU-XUXIUFHCSA-N Ile-Lys-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N WVUDHMBJNBWZBU-XUXIUFHCSA-N 0.000 description 1
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 1
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 1
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 1
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- WKSHBPRUIRGWRZ-KCTSRDHCSA-N Ile-Trp-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N WKSHBPRUIRGWRZ-KCTSRDHCSA-N 0.000 description 1
- RTSQPLLOYSGMKM-DSYPUSFNSA-N Ile-Trp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N RTSQPLLOYSGMKM-DSYPUSFNSA-N 0.000 description 1
- YBHKCXNNNVDYEB-SPOWBLRKSA-N Ile-Trp-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N YBHKCXNNNVDYEB-SPOWBLRKSA-N 0.000 description 1
- HQLSBZFLOUHQJK-STECZYCISA-N Ile-Tyr-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HQLSBZFLOUHQJK-STECZYCISA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 1
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 241000446313 Lamella Species 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 1
- XYUBOFCTGPZFSA-WDSOQIARSA-N Leu-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 XYUBOFCTGPZFSA-WDSOQIARSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- YORLGJINWYYIMX-KKUMJFAQSA-N Leu-Cys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YORLGJINWYYIMX-KKUMJFAQSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 1
- FEHQLKKBVJHSEC-SZMVWBNQSA-N Leu-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FEHQLKKBVJHSEC-SZMVWBNQSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- AUNMOHYWTAPQLA-XUXIUFHCSA-N Leu-Met-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AUNMOHYWTAPQLA-XUXIUFHCSA-N 0.000 description 1
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 1
- YRWCPXOFBKTCFY-NUTKFTJISA-N Lys-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N YRWCPXOFBKTCFY-NUTKFTJISA-N 0.000 description 1
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 1
- NLOZZWJNIKKYSC-WDSOQIARSA-N Lys-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 NLOZZWJNIKKYSC-WDSOQIARSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 1
- HIIZIQUUHIXUJY-GUBZILKMSA-N Lys-Asp-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HIIZIQUUHIXUJY-GUBZILKMSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 1
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 1
- DKTNGXVSCZULPO-YUMQZZPRSA-N Lys-Gly-Cys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O DKTNGXVSCZULPO-YUMQZZPRSA-N 0.000 description 1
- ZASPELYMPSACER-HOCLYGCPSA-N Lys-Gly-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZASPELYMPSACER-HOCLYGCPSA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 1
- GFWLIJDQILOEPP-HSCHXYMDSA-N Lys-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N GFWLIJDQILOEPP-HSCHXYMDSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- GOVDTWNJCBRRBJ-DCAQKATOSA-N Lys-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N GOVDTWNJCBRRBJ-DCAQKATOSA-N 0.000 description 1
- PIXVFCBYEGPZPA-JYJNAYRXSA-N Lys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N PIXVFCBYEGPZPA-JYJNAYRXSA-N 0.000 description 1
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 1
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 1
- BVRNWWHJYNPJDG-XIRDDKMYSA-N Lys-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N BVRNWWHJYNPJDG-XIRDDKMYSA-N 0.000 description 1
- NROQVSYLPRLJIP-PMVMPFDFSA-N Lys-Trp-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NROQVSYLPRLJIP-PMVMPFDFSA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 1
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 1
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 1
- AHZNUGRZHMZGFL-GUBZILKMSA-N Met-Arg-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCNC(N)=N AHZNUGRZHMZGFL-GUBZILKMSA-N 0.000 description 1
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 1
- DNDVVILEHVMWIS-LPEHRKFASA-N Met-Asp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DNDVVILEHVMWIS-LPEHRKFASA-N 0.000 description 1
- GPVLSVCBKUCEBI-KKUMJFAQSA-N Met-Gln-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GPVLSVCBKUCEBI-KKUMJFAQSA-N 0.000 description 1
- HHCOOFPGNXKFGR-HJGDQZAQSA-N Met-Gln-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HHCOOFPGNXKFGR-HJGDQZAQSA-N 0.000 description 1
- FYRUJIJAUPHUNB-IUCAKERBSA-N Met-Gly-Arg Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N FYRUJIJAUPHUNB-IUCAKERBSA-N 0.000 description 1
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- MNGBICITWAPGAS-BPUTZDHNSA-N Met-Ser-Trp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MNGBICITWAPGAS-BPUTZDHNSA-N 0.000 description 1
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108091061960 Naked DNA Proteins 0.000 description 1
- 208000009869 Neu-Laxova syndrome Diseases 0.000 description 1
- 241000526636 Nipah henipavirus Species 0.000 description 1
- 108091092724 Noncoding DNA Proteins 0.000 description 1
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 1
- 108090001074 Nucleocapsid Proteins Proteins 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- 108010019644 Oligodendrocyte Transcription Factor 2 Proteins 0.000 description 1
- 102100026058 Oligodendrocyte transcription factor 2 Human genes 0.000 description 1
- 108090000526 Papain Proteins 0.000 description 1
- 241000009328 Perro Species 0.000 description 1
- DPUOLKQSMYLRDR-UBHSHLNASA-N Phe-Arg-Ala Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 DPUOLKQSMYLRDR-UBHSHLNASA-N 0.000 description 1
- MPGJIHFJCXTVEX-KKUMJFAQSA-N Phe-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O MPGJIHFJCXTVEX-KKUMJFAQSA-N 0.000 description 1
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 1
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 1
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 1
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 1
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 1
- YEEFZOKPYOUXMX-KKUMJFAQSA-N Phe-Gln-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O YEEFZOKPYOUXMX-KKUMJFAQSA-N 0.000 description 1
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 1
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 1
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 1
- ZKSLXIGKRJMALF-MGHWNKPDSA-N Phe-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N ZKSLXIGKRJMALF-MGHWNKPDSA-N 0.000 description 1
- VADLTGVIOIOKGM-BZSNNMDCSA-N Phe-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 VADLTGVIOIOKGM-BZSNNMDCSA-N 0.000 description 1
- SPXWRYVHOZVYBU-ULQDDVLXSA-N Phe-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N SPXWRYVHOZVYBU-ULQDDVLXSA-N 0.000 description 1
- HTXVATDVCRFORF-MGHWNKPDSA-N Phe-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N HTXVATDVCRFORF-MGHWNKPDSA-N 0.000 description 1
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 1
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 1
- KPEIBEPEUAZWNS-ULQDDVLXSA-N Phe-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KPEIBEPEUAZWNS-ULQDDVLXSA-N 0.000 description 1
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 1
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 1
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 1
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 1
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 1
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 1
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- 108010039918 Polylysine Proteins 0.000 description 1
- 108010076039 Polyproteins Proteins 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- OYEUSRAZOGIDBY-JYJNAYRXSA-N Pro-Arg-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OYEUSRAZOGIDBY-JYJNAYRXSA-N 0.000 description 1
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 1
- RETPETNFPLNLRV-JYJNAYRXSA-N Pro-Asn-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O RETPETNFPLNLRV-JYJNAYRXSA-N 0.000 description 1
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 1
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 1
- RYJRPPUATSKNAY-STECZYCISA-N Pro-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@@H]2CCCN2 RYJRPPUATSKNAY-STECZYCISA-N 0.000 description 1
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 1
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 1
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- VPBQDHMASPJHGY-JYJNAYRXSA-N Pro-Trp-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CO)C(=O)O VPBQDHMASPJHGY-JYJNAYRXSA-N 0.000 description 1
- CWZUFLWPEFHWEI-IHRRRGAJSA-N Pro-Tyr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O CWZUFLWPEFHWEI-IHRRRGAJSA-N 0.000 description 1
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 1
- 206010036790 Productive cough Diseases 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 229940096437 Protein S Drugs 0.000 description 1
- 206010061926 Purulence Diseases 0.000 description 1
- 238000010240 RT-PCR analysis Methods 0.000 description 1
- 101100029566 Rattus norvegicus Rabggta gene Proteins 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 101100532512 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) SAG1 gene Proteins 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 1
- QGMLKFGTGXWAHF-IHRRRGAJSA-N Ser-Arg-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGMLKFGTGXWAHF-IHRRRGAJSA-N 0.000 description 1
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 1
- WXWDPFVKQRVJBJ-CIUDSAMLSA-N Ser-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N WXWDPFVKQRVJBJ-CIUDSAMLSA-N 0.000 description 1
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 1
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 1
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 1
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- HMRAQFJFTOLDKW-GUBZILKMSA-N Ser-His-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMRAQFJFTOLDKW-GUBZILKMSA-N 0.000 description 1
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 1
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- LPSKHZWBQONOQJ-XIRDDKMYSA-N Ser-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N LPSKHZWBQONOQJ-XIRDDKMYSA-N 0.000 description 1
- KJKQUQXDEKMPDK-FXQIFTODSA-N Ser-Met-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O KJKQUQXDEKMPDK-FXQIFTODSA-N 0.000 description 1
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 1
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- NVNPWELENFJOHH-CIUDSAMLSA-N Ser-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)N NVNPWELENFJOHH-CIUDSAMLSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 1
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 1
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- HAUVENOGHPECML-BPUTZDHNSA-N Ser-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 HAUVENOGHPECML-BPUTZDHNSA-N 0.000 description 1
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 1
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 1
- 241000251131 Sphyrna Species 0.000 description 1
- 101710198474 Spike protein Proteins 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 230000005867 T cell response Effects 0.000 description 1
- 241000143014 T7virus Species 0.000 description 1
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 1
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 1
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- DGOJNGCGEYOBKN-BWBBJGPYSA-N Thr-Cys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)O DGOJNGCGEYOBKN-BWBBJGPYSA-N 0.000 description 1
- UZJDBCHMIQXLOQ-HEIBUPTGSA-N Thr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O UZJDBCHMIQXLOQ-HEIBUPTGSA-N 0.000 description 1
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 1
- FKIGTIXHSRNKJU-IXOXFDKPSA-N Thr-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CN=CN1 FKIGTIXHSRNKJU-IXOXFDKPSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 1
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 1
- QFCQNHITJPRQTB-IEGACIPQSA-N Thr-Lys-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O QFCQNHITJPRQTB-IEGACIPQSA-N 0.000 description 1
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- VGNLMPBYWWNQFS-ZEILLAHLSA-N Thr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O VGNLMPBYWWNQFS-ZEILLAHLSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 1
- VMSSYINFMOFLJM-KJEVXHAQSA-N Thr-Tyr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCSC)C(=O)O)N)O VMSSYINFMOFLJM-KJEVXHAQSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- 241000218636 Thuja Species 0.000 description 1
- 241000710914 Totivirus Species 0.000 description 1
- 102100023935 Transmembrane glycoprotein NMB Human genes 0.000 description 1
- YEGMNOHLZNGOCG-UBHSHLNASA-N Trp-Asn-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YEGMNOHLZNGOCG-UBHSHLNASA-N 0.000 description 1
- MHNHRNHJMXAVHZ-AAEUAGOBSA-N Trp-Asn-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N MHNHRNHJMXAVHZ-AAEUAGOBSA-N 0.000 description 1
- HABYQJRYDKEVOI-IHPCNDPISA-N Trp-His-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CCCCN)C(=O)O)N HABYQJRYDKEVOI-IHPCNDPISA-N 0.000 description 1
- KRCPXGSWDOGHAM-XIRDDKMYSA-N Trp-Lys-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O KRCPXGSWDOGHAM-XIRDDKMYSA-N 0.000 description 1
- SUEGAFMNTXXNLR-WFBYXXMGSA-N Trp-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O SUEGAFMNTXXNLR-WFBYXXMGSA-N 0.000 description 1
- BOBZBMOTRORUPT-XIRDDKMYSA-N Trp-Ser-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 BOBZBMOTRORUPT-XIRDDKMYSA-N 0.000 description 1
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 1
- HSVPZJLMPLMPOX-BPNCWPANSA-N Tyr-Arg-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O HSVPZJLMPLMPOX-BPNCWPANSA-N 0.000 description 1
- HTHCZRWCFXMENJ-KKUMJFAQSA-N Tyr-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HTHCZRWCFXMENJ-KKUMJFAQSA-N 0.000 description 1
- GFZQWWDXJVGEMW-ULQDDVLXSA-N Tyr-Arg-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GFZQWWDXJVGEMW-ULQDDVLXSA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- XMNDQSYABVWZRK-BZSNNMDCSA-N Tyr-Asn-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XMNDQSYABVWZRK-BZSNNMDCSA-N 0.000 description 1
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 1
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 1
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 1
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 1
- LTSIAOZUVISRAQ-QWRGUYRKSA-N Tyr-Gly-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O LTSIAOZUVISRAQ-QWRGUYRKSA-N 0.000 description 1
- JKUZFODWJGEQAP-KBPBESRZSA-N Tyr-Gly-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O JKUZFODWJGEQAP-KBPBESRZSA-N 0.000 description 1
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- CWVHKVVKAQIJKY-ACRUOGEOSA-N Tyr-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N CWVHKVVKAQIJKY-ACRUOGEOSA-N 0.000 description 1
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N7/00—Viruses; Bacteriophages; Compositions thereof; Preparation or purification thereof
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
- A61K39/205—Rhabdoviridae, e.g. rabies virus
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
- A61P31/14—Antivirals for RNA viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P37/00—Drugs for immunological or allergic disorders
- A61P37/02—Immunomodulators
- A61P37/04—Immunostimulants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/525—Virus
- A61K2039/5254—Virus avirulent or attenuated
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2760/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
- C12N2760/00011—Details
- C12N2760/20011—Rhabdoviridae
- C12N2760/20111—Lyssavirus, e.g. rabies virus
- C12N2760/20134—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2760/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
- C12N2760/00011—Details
- C12N2760/20011—Rhabdoviridae
- C12N2760/20111—Lyssavirus, e.g. rabies virus
- C12N2760/20141—Use of virus, viral particle or viral elements as a vector
- C12N2760/20143—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2760/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
- C12N2760/00011—Details
- C12N2760/20011—Rhabdoviridae
- C12N2760/20111—Lyssavirus, e.g. rabies virus
- C12N2760/20161—Methods of inactivation or attenuation
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Virology (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Immunology (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Pharmacology & Pharmacy (AREA)
- Veterinary Medicine (AREA)
- Public Health (AREA)
- Animal Behavior & Ethology (AREA)
- Biochemistry (AREA)
- Epidemiology (AREA)
- Mycology (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Tropical Medicine & Parasitology (AREA)
- Plant Pathology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Communicable Diseases (AREA)
- Oncology (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
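Rabies virus compositions and methods are provided. The full-length sequence of the rabies virus strain Evelyn-Rokitnicki-Abelseth (ERA) is disclosed. A reverse genetics system for producing recombinant ERA virus and derivatives thereof is provided, along with compositions comprising ERA and/or ERA-derived virus strains, nucleic acids and/or proteins. In some instances, the compositions are immunogenic compositions useful for pre-exposure or post-exposure treatment of rabies virus.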
Description
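Cross-reference to related applications
This application claims the benefit of U.S. Provisional Application No. 60/727,038, filed October 14, 2005, which is incorporated herein by reference in its entirety.
Statement of government support
This invention was made by the Centers for Disease Control and Prevention, an agency of the United States Government. Therefore, the U.S. Government has certain rights in this invention.
Technical field
This disclosure relates to the field of virology. More specifically, this disclosure relates to compositions useful for protecting mammals from rabies virus infection and to methods for producing immunogenic compositions.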
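Background of the invention
Rabies remains one of the most feared infectious diseases affecting humans and animals, despite significant scientific advances in its prevention and control. Rabies presents as a different problem in different parts of the world. In the United States, rabies reservoirs exist in many wildlife species, including raccoons, skunks, foxes and bats (Rupprecht et al., Emerg. Infect. Dis. 1(4):107-114, 1995). Outbreaks of rabies infection in these terrestrial mammals have been found over broad geographic areas across the United States. For example, raccoon rabies affects an area of more than 1 million square kilometers from Florida to Maine. Although wildlife rabies persists in developed countries, progress has been made in controlling and eliminating it through oral immunization of wildlife.
Nonetheless, rabies remains a major threat to public health and continues to cause 50,000 to 60,000 human deaths each year (World Health Organization, April 2003). Humans are infected with rabies virus primarily through bites from rabid domestic or wild animals. In developing countries, dogs are responsible for about 94% of human rabies deaths. Canine rabies remains enzootic in most countries of Africa, Asia and South America, and in these countries dogs are responsible for most of the human deaths caused by the disease. Controlling rabies virus infection in domestic and wild animals therefore not only reduces mortality in these animals but also reduces the risk of human exposure.
Rabies virus is transmitted through broken skin by the bite or scratch of an infected animal. Exposure to rabies virus results in its penetration of peripheral, unmyelinated nerve endings, followed by spread through retrograde axonal transport, with replication occurring exclusively in neurons, and finally arrival in the central nervous system (CNS). Infection of the CNS causes cellular dysfunction and death (Rupprecht & Dietzschold, Lab Invest. 57:603, 1987). Because rabies virus spreads directly from cell to cell, it largely evades immune recognition (Clark & Prabhakar, Rabies, In: Olson et al., eds., Comparative Pathology of Viral Disease, 2:165, Boca Raton, FL, CRC Press, 1985).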
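Rabies virus (RV) is a rhabdovirus, a non-segmented RNA virus with negative-sense polarity. Within the family Rhabdoviridae, rabies virus is the prototype of the genus Lyssavirus. RV is composed of two major structural components: a nucleocapsid or ribonucleoprotein (RNP), and an envelope in the form of a bilayer membrane surrounding the RNP core. The infectious component of all rhabdoviruses is the RNP core, which consists of the negative-strand RNA genome encapsidated by nucleoprotein (N) in combination with RNA-dependent RNA polymerase (L) and phosphoprotein (P). The membrane surrounding the RNP contains two proteins: the transmembrane glycoprotein (G) and the matrix (M) protein, which is located on the inner side of the membrane. The viral genome thus encodes these five proteins: the three proteins of the RNP (N, L and P), the matrix protein (M) and the glycoprotein (G).
The molecular determinants of pathogenicity of different rabies virus strains have not been fully elucidated. RV pathogenicity has been attributed to multigenic events (Yamada et al., Microbiol. Immunol. 50:25-32, 2006). For example, some positions in the RV genome, if mutated, affect viral transcription or replication and reduce virulence. Mutations at serine residue 389 of the phosphorylation site in the N gene (Wu et al., J. Virol. 76:4153-4161, 2002) or in the GDN core sequence of the highly conserved C motif in the L gene (Schnell and Conzelmann, Virol. 214:522-530, 1995) dramatically reduce RV transcription and replication.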
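The G protein, also known as the spike protein, is involved in cell attachment and membrane fusion of RV. The amino acid region at positions 330 to 340 of the G protein (referred to as antigenic site III) has been identified as important for the virulence of certain RV strains. Several studies support the view that the pathogenicity of fixed RV strains is determined by the presence of arginine or lysine at amino acid residue 333 of the glycoprotein (Dietzschold et al., Proc. Natl. Acad. Sci. USA 80:70-74, 1983; Tuffereau et al., Virol. 172:206-212, 1989).
This phenomenon appears to apply at least to fixed rabies viruses such as the CVS, ERA, PV, SAD-B19 and HEP-Flury strains (Anilionis et al., Nature 294:275-278, 1981; Morimoto et al., Virol. 173:465-477, 1989). For example, rabies vaccine viruses having an amino acid other than Arg at position 333 of the glycoprotein are described in WO 00/32755 (describing RV mutants in which all three nucleotides of the G protein Arg333 codon are altered compared with the parent virus, such that Arg at position 333 is replaced with another amino acid); European Patent 350398 (describing the avirulent RV mutant SAG1, derived from the Bern SAD strain of RV, in which Arg at position 333 of the glycoprotein has been replaced by Ser); and European Patent Application 583998 (describing the attenuated RV mutant SAG2, in which Arg at position 333 of the G protein is replaced by Glu).
Other strains, such as the RC-HL strain, have an arginine residue at position 333 of G but do not cause lethal infection in adult mice (Ito et al., Microbiol. Immunol. 38:479-482, 1994; Ito et al., J. Virol. 75:9121-9128, 2001). Thus, the entire G may contribute to the virulence of RV, although the determinants or regions have not previously been identified. The G gene encodes the only protein that induces virus-neutralizing antibodies. At least three states of the RV glycoprotein are known: the native state (N), which is responsible for receptor binding; the active hydrophobic state (A), which is required for the initial step of the membrane fusion process (Gaudin, J. Cell Biol. 150:601-612, 2000); and the fusion-inactive conformation (I). Correct folding and maturation of G play an important role in immune recognition. The three potential glycosylation sites in the ERA G ectodomain are at residues Asn37, Asn247 and Asn319 (Wojczyk et al., Glycobiology 8:121-130, 1998). Lack of glycosylation of G not only affects conformation but also inhibits presentation of the protein at the cell surface. Elucidating the molecular determinants responsible for rabies virus pathogenicity therefore presents a complex problem.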
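Summary of the invention
Disclosed herein is the complete sequence of the Evelyn-Rokitnicki-Abelseth (ERA) fixed vaccine strain of rabies virus, together with methods for sequencing this and other lyssavirus strains.
Also described is a reverse genetics system for rabies virus, exemplified in particular with the rabies virus strain ERA. Virus recovery is facilitated by use of a T7 RNA polymerase that contains an eight-amino-acid nuclear localization signal (NLS) at its N terminus. In addition to the parental ERA strain, several derivative viruses are described, including ERA- (psi-region deleted), ERAgreen1 (green fluorescent protein gene inserted in the psi-region), ERAgreen2 (green fluorescent protein gene inserted in the intergenic region between the phosphoprotein and matrix protein genes), ERA2g (containing an extra copy of the glycoprotein in the psi-region), ERAg3 (with a mutation at amino acid 333 of the glycoprotein), ERA2g3 (with an extra copy of the glycoprotein, altered at amino acid 333, in the psi-region), ERA-G (in which the glycoprotein has been deleted), ERAgm (M and G genes switched in the genome), and ERAgmg (two copies of G in the rearranged ERAgm construct). An extra transcription unit was incorporated into the ERA virus genome for efficient expression of open reading frames (ORFs). By optimizing the propagation conditions described herein, recovered viruses reached titers of more than 10^9 ffu/mL in bioreactors or stationary tissue culture flasks.
Also disclosed is an engineered cell line that constitutively expresses the ERA glycoprotein. This cell line, designated BSR-G, is used for the production of recombinant rabies viruses, including attenuated and/or replication-deficient viruses.
The foregoing and other objects, features and advantages of the invention will become more apparent from the following detailed description.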
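Brief description of the drawings
FIG. 1A. Schematic of the ERA transcription plasmid. The positions of the hammerhead ribozyme and the antigenomic ERA genome are shown diagrammatically. The relative positions of the N, P, M, G and L proteins are shown in the 5′ to 3′ direction.
FIG. 1B. Schematic of the construction of the full-length ERA rabies virus genomic cDNA plasmid pTMF. The RT-PCR products (fragments F1 and F2) and the restriction enzyme recognition sites (Nhe1, Kpn1, Blp1, Pst1 and Not1) are shown (not drawn to scale). The bar on the left shows RdRz, the hammerhead ribozyme; the bar on the right shows HDVRz, the hepatitis delta virus ribozyme. The symbol ◆ indicates that a Kpn1 or Pst1 site was deleted; vertical arrows indicate Nhe1 or Not1 sites that remain intact.
FIG. 2. Schematic of the proposed mechanism of autogene action of NLST7 RNA polymerase from the pNLST7 plasmid. The complex of DNA and transfection reagent is taken up into cells by endocytosis. Most of the DNA released from lysosomes and endosomes remains in the cytoplasm of the cell. A limited amount of plasmid is transferred to the nucleus: 1) the NLST7 gene is transcribed by cellular RNA polymerase II from the CMV immediate early promoter; 2) mature NLST7 mRNA is transported from the nucleus to the cytoplasm for synthesis of NLST7 RNA polymerase; 3) newly synthesized NLST7 RNA polymerase is transferred to the nucleus, while a trace amount of NLST7 remains in the cytoplasm; 4) NLST7 RNA polymerase initiates transcription from the pT7 promoter. Through post-transcriptional modification, additional NLST7 mRNA is produced for protein synthesis, thereby increasing the efficiency of virus recovery.
FIG. 3. Schematic of ten derived ERA virus genomes. The sizes of the individual genes are not drawn to scale. The symbol "*" marks the mutation of G at amino acid residue 333, and Ψ is the Psi-region.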
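FIG. 4A. Analysis of the recovery of ERA-G virus in transfected cells and of its spread and growth in the BHK-G cell line. In A, ERA-G virus foci remained restricted even 7 days after transfection with the plasmids used for virus recovery. In B, the recovered ERA-G virus did not spread after passage in normal BSR cells; only individual cells were stained by DFA. In C, ERA-G virus grew well in the constitutive BHK-G cell line.
FIG. 4B. Analysis of G expression in the BHK-G cell line. Expression of ERA rabies virus G in the cytoplasm of the stable cell line BHK-G was shown by indirect fluorescent staining.
FIG. 4C. Northern blot analysis, using a G probe, of G mRNA in cells infected with ERA-G virus. Lane 2 shows that G gene mRNA, but not viral genomic RNA, was detected in BHK-G cells infected with ERA-G virus. Lane 1 is a total RNA control from BHK-G cells infected with ERA- rabies virus, in which both G mRNA and viral genomic RNA were detected.
FIG. 5. One-step virus growth curves. All recovered rabies virus ERA strains grew to 10^9 or 10^10 ffu/mL, whereas ERA-G reached only 10^7 ffu/mL.
FIG. 6. Green foci in BSR cells infected with ERAgreen1/ERAgreen2 rabies viruses. Trans1 is the transcription unit incorporated at the intergenic region between Psi and L. Trans2 is the transcription unit at the intergenic region between P and M. Both ERAgreen2 and ERAgreen1 stably expressed GFP protein in virus-infected BSR cells, but after infection green foci appeared 48 hours earlier with ERAgreen2 than with ERAgreen1.
FIG. 7. Analysis of G mRNA expression in double-G and G/M-rearranged ERA rabies viruses. In a Northern blot using a G probe, densitometric measurement of G mRNA in ERA2g (lane 1), ERAgm (lane 3) and ERA2g3 (lane 4) showed increased mRNA levels compared with cells infected with ERA- virus (lane 2). Ratios were calculated taking ERA- virus as 100%.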
图8A.通过体内接种重组ERA和衍生物诱导的发病率。三周龄小鼠肌内接种8种恢复病毒。在接种后10天,在ERA、ERA-和ERAgreen1组中,50%、50%和20%的小鼠分别显示狂犬病的相应临床症状,但没有死亡。其他组中没有观察到不良征象。
图8B.接种重组ERA和衍生物的小鼠中攻击后的存活率。从图8A中所示的试验中存活的小鼠用Texas犬/山狗(coyote)狂犬病病毒进行肌内攻击。在攻击后5天,在ERA和ERA-组中,40和62%的小鼠分别显示狂犬病征象并被处安乐死。在所有其他组中,没有观察到狂犬病征象。
图8C.脑内接种重组ERA和ERAg3病毒后的存活。三周龄小鼠分别脑内接种ERA和ERAg3病毒株。ERA组中所有小鼠在接种后15天死亡,而在ERAg3组中,所有小鼠都存活,没有临床症状。
图8D.乳鼠脑内接种后的存活。两日龄的乳鼠分别脑内接种ERAg3和ERA-G病毒构建体。ERAg3组中所有小鼠死亡,而ERA-G组中没有小鼠死亡。
图8E.接种重组ERA和衍生病毒的小鼠的中和抗体滴度。利用RFFIT测定小鼠中和抗体滴度,在病毒接种组中范围从每ml1.36到5.61IU。
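FIG. 9A. Survival after infection with Alabama bat rabies virus. Hamsters were inoculated with live Alabama bat rabies virus and then given post-exposure treatment with ERAg3 virus, or with rabies immune globulin and a commercially available inactivated RV vaccine. Survival was assessed over a period of more than 3 months.
FIG. 9B. Survival after infection with Thai Street dog rabies virus. Hamsters were inoculated with live Alabama bat rabies virus and then given post-exposure treatment with ERAg3 virus, or with rabies immune globulin and a commercially available inactivated RV vaccine. Survival was assessed over a period of more than 3 months.
FIG. 9C. Survival after infection with Texas coyote rabies virus. Hamsters were inoculated with live Alabama bat rabies virus and then given post-exposure treatment with ERAg3 virus, or with rabies immune globulin and a commercially available inactivated RV vaccine. Survival was assessed over a period of more than 3 months.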
序列表
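The nucleic acid and amino acid sequences listed in the accompanying sequence listing are shown using standard letter abbreviations for nucleotide bases and three-letter codes for amino acids, as defined in 37 C.F.R. 1.822. Only one strand of each nucleic acid sequence is shown, but the complementary strand is understood to be included by any reference to the displayed strand, unless the context makes clear that only one strand is intended. Where appropriate, it is understood that a sequence shown as DNA can be converted to RNA by replacing thymine residues with uracil.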
SEQ ID NO:1.ERA CDC野生型病毒,11,931个核苷酸
1-58核苷酸,前导区
71-1420核苷酸,N基因
1514-2404核苷酸,P基因
2496-3101核苷酸,M基因
3317-4888核苷酸,G基因
4964-5362核苷酸,Psi-区域
5417-11797核苷酸,L基因
11862-11931核苷酸,Trailer区
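SEQ ID NO:2. ERA CDC, nucleotides 71 to 1420: N protein, 450 aa.
SEQ ID NO:3. ERA CDC, nucleotides 1514 to 2404: P protein, 297 aa.
SEQ ID NO:4. ERA CDC, nucleotides 2496 to 3101: M protein, 202 aa.
SEQ ID NO:5. ERA CDC, nucleotides 3317 to 4888: G protein, 524 aa.
SEQ ID NO:6. ERA CDC, nucleotides 5417 to 11797: L protein, 2127 aa.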
SEQ ID NO:7.通过反向遗传系统恢复的重组ERA(rERA)有11,930个核苷酸。在重组ERA反向遗传系统中,野生型ERA株中G基因和psi-区之间的特异性poly(A8)tract被突变为poly(A7)tract,作为序列标记物。因此,rERA比野生型ERA少一个核苷酸。所有其他序列信息都是完全相同的。
SEQ ID NO:8.ERAg3株(11,930个核苷酸),G蛋白中的氨基酸(333Aa)已经改变;相应的核酸在4370到4372位。
SEQ ID NO:9.ERA-(11,577个核苷酸),无psi(假-基因)区;额外转录单位已经导入核苷酸4950到5008位。
SEQ ID NO:10.ERA-2G(13,150个核苷酸),该株具有G基因的两个拷贝;第二个拷贝在4988到6559位插入。
SEQ ID NO:11.ERAgreen(12,266个核苷酸),该株在4993到5673位包含GFP的编码序列;细胞或者组织感染后在紫外光下显示绿色。
SEQ ID NO:12.ERA-G(10,288个核苷酸),该株不含G基因。
SEQ ID NO:13.ERA-2g3(13,150个核苷酸);该株具有G基因的两个拷贝(其中第二个在4988到6559位),两个均是在氨基酸333被取代(对应于所示序列中核苷酸位置4370-4372和6041-6043)。
SEQ ID NO:14.ERA-pt(11,976个核苷酸,P基因后2469到2521位处具有一个额外的转录单位)。
SEQ ID NO:15.ERA-pt-GFP(12,662个核苷酸,在P基因后2505到3185位插入GFP基因)。
SEQ ID NO:16.ERAgm(11,914个核苷酸)G和M基因的位置分别与G在2505-4076位和M在4122-4727位互换。
SEQ ID NO:17.ERAg3m(11,914个核苷酸)G和M基因的位置分别与G在2505-4076位和M在4122-4727位互换。G基因在氨基酸333位突变。
SEQ ID NO:18.ERAgmg(13,556个核苷酸),该株在2505-4076位和4943-6514位具有G基因的两个拷贝,在4122-4727位侧翼带有M基因。
SEQ ID NO:19.锤头核酶的前10个核苷酸对应于狂犬病病毒ERA基因组的5′端。
SEQ ID NO:20.核苷酸序列编码SV40T抗原核定位信号(NLS)。
SEQ ID NO:21-23.人工Kozak序列。
SEQ ID NO:24-57.合成寡核苷酸。
SEQ ID NO:58.在氨基酸333位突变的G蛋白的氨基酸序列(从Arg到Glu)。
SEQ ID NO:59-65.合成寡核苷酸。
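The coordinates in the sequence listing above can be cross-checked with simple arithmetic. The sketch below confirms that each open reading frame length in SEQ ID NO:1 is a whole number of codons matching the stated protein length, and locates the codon for glycoprotein residue 333. One assumption is not stated in the text: residue 333 is taken to be numbered in the mature glycoprotein after a 19-residue signal peptide. That offset is used here only because it reproduces the nucleotide positions 4370-4372 and 6041-6043 given in the listing. The function and variable names are illustrative.

```python
# ORF coordinates (1-based, inclusive) and protein lengths as listed for SEQ ID NO:1-6.
ORFS = {
    "N": (71, 1420, 450),
    "P": (1514, 2404, 297),
    "M": (2496, 3101, 202),
    "G": (3317, 4888, 524),
    "L": (5417, 11797, 2127),
}

# Assumption (not stated in the text): G residue numbers refer to the mature
# protein, i.e. after removal of a 19-residue signal peptide.
SIGNAL_PEPTIDE_LEN = 19


def orf_codons(start, end):
    """Return the number of codons in an ORF given inclusive coordinates."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"ORF length {length} is not a multiple of 3")
    return length // 3


def codon_position(orf_start, residue, leader_len=0):
    """Return (first, last) nucleotide of the codon for a 1-based residue."""
    first = orf_start + 3 * (leader_len + residue - 1)
    return first, first + 2


if __name__ == "__main__":
    for name, (start, end, aa) in ORFS.items():
        print(name, orf_codons(start, end), "codons; listed length", aa, "aa")
    # G gene at its native position (ERAg3, SEQ ID NO:8)
    print(codon_position(3317, 333, SIGNAL_PEPTIDE_LEN))
    # Second G copy starting at 4988 (ERA-2g3, SEQ ID NO:13)
    print(codon_position(4988, 333, SIGNAL_PEPTIDE_LEN))
```

With these inputs, the five ORF lengths agree with the listed amino acid counts, and both copies of G place codon 333 at the positions quoted for SEQ ID NO:8 and SEQ ID NO:13.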
具体内容
I.介绍
病毒性人畜共患病很难预防。一种主要范例是通过口服免疫控制野生动物狂犬病。所有当前得到许可的口服狂犬病疫苗基于一个共同来源。Evelyn-Rokitnicki-Abelseth(ERA)的固定狂犬病病毒(RV)来源于Street-Alabama-Dufferin(SAD)株,1935年首先从阿拉巴马(USA)的一条疯狗中分离。在小鼠脑、幼仓鼠肾(BHK)细胞和鸡胚中进行SAD RV的多次传代后,获得ERA株。ERA在BHK细胞中的重复克隆最终获得B-19克隆,其被命名为SAD-B19,用于疫苗研究。通过反向遗传恢复的第一个RV株是SAD-B19。尽管SAD-B19和ERA RV来自相同来源,但是在不同动物的口服疫苗研究中观察到不同的结果。例如,ERA在臭鼬或者浣熊中口服不诱导明显的中和抗体,而SAD-B19诱导。为了阐明这两种RV株之间的潜在差异,需要一种用于ERA RV株的反向遗传系统。
反向遗传学提出了一种按照指定路线修饰RNA病毒的可行途径。一种用于狂犬病病毒原始株的反向遗传系统在1994年成功建立(Schnell等人,The EMBO J.13,4195-4203,1994)。在十年间,已经对该系统作出改进,导致病毒恢复的效率增加。这种增加的效率有助于阐明病毒致病性、蛋白-蛋白以及蛋白-RNA相互作用。
在狂犬病病毒基因组内,已经认为一些区域包含重要的信号,例如病毒远端启动子区,核蛋白包装,RNA依赖性RNA聚合酶L转录起始位点,多腺苷酸化和终止位点。这些信号对于确保病毒的有效恢复和设计额外的转录单位是非常重要的,所述额外转录单位用于将外源的开放阅读框(ORF)接纳到狂犬病病毒基因组中。
本公开提供一种有效的反向遗传系统,并描述其产生ERA株病毒的变异体的用途。此处描述的修饰获得适合用于接纳ORF表达和疫苗开发的候选株。
反向遗传系统由一组质粒组成。第一种质粒包括ERA病毒cRNA。为了在转录的病毒cDNA中产生可靠的病毒反基因组末端,ERA基因组在cDNA3′端侧翼为锤头核酶,5′端侧翼为丁型肝炎病毒核酶。反基因组盒与细菌噬菌体T7转录起始信号融合,这还任选在巨细胞病毒(CMV)立即早期启动子的控制下。
该系统还包括大量辅助质粒,所述辅助质粒编码参与病毒包装的蛋白。例如,该系统通常包括编码病毒核蛋白(N)、磷蛋白(P)、RNA依赖性聚合酶(L)、和任选的病毒糖蛋白(G)的辅助质粒。该系统还包括编码噬菌体T7 RNA聚合酶(T7)的质粒,其可以通过添加核定位信号(NLS)进行修饰,以增加转染细胞的细胞核中T7聚合酶的表达。T7RNA聚合酶表达质粒被构建为“自身基因”,其在转染到细胞中之后转录全长的病毒反基因组cRNA用于核蛋白包装。
反向遗传系统可用于设计和制备用于狂犬病病毒治疗(接触前和/或后)的免疫原性组合物,和用于制备表达外源开放阅读框(ORF)的狂犬病病毒ERA载体。例如,额外转录单位可以被设计、检测并在Psi-区和/或磷蛋白(P)-基质(M)蛋白基因间区域处整合到ERA基因组中。基本上任何感兴趣的ORF都可以在ERA载体的环境中表达,包括编码病毒抗原和其他病原体的ORF,例如其他狂犬病病毒属的抗原,以及用于表达其他治疗感兴趣的蛋白。
因此,此处公开的方法和组合物可用于设计和制备狂犬病病毒免疫原性组合物,包括适合作为疫苗的组合物,所述疫苗用于狂犬病病毒接触前和/或后的治疗。
II.缩写
ADE 抗体依赖性增强
Ag-ELISA 抗原-捕获ELISA
DNA 脱氧核糖核酸
ERA 狂犬病病毒株Evelyn-Rokitnicki-Abelseth
ELISA 酶联免疫吸附分析
G 糖蛋白
i.c. 脑内
IFA 间接免疫-荧光分析
i.m. 肌内
L RNA依赖性RNA聚合酶
M 基质蛋白
mAb 单克隆抗体
N 核蛋白
ORF 开放阅读框
P 磷蛋白
PCR 聚合酶链式反应
RACE cDNA末端5’快速扩增
RNA 核糖核酸
RNP 核糖核蛋白
RT-PCR 逆转录-聚合酶链式反应
RV 狂犬病病毒
trans1 额外转录单位1
trans2 额外转录单位2
III.术语
除非另有解释,所有此处使用的技术和科学术语的含义与本发明所属领域的普通技术人员通常所理解的相同。类似地,除非另作注解,技术术语根据常规用法使用。分子生物学常见术语的定义可以参见Benjamin Lewin,基因V(Genes V),Oxford University Press出版,1994(ISBN 0-19-854287-9);Kendrew等人(编著),分子生物学全编(TheEncyclopedia of Molecular Biology),Blackwell Science Ltd.出版,1994(ISBN 0-632-02182-9);和Robert A.Meyers(编著),分子生物学和生物技术:综合参考(Molecular Biology and Biotechnology:a ComprehensiveDesk Reference),VCH Publishers,Inc.出版,1995(ISBN 1-56081-569-8)。
单数术语“a”、“an”和“the”包括复数指代,除非上下文另有明确指示。类似地,单词“或”意在包括“和”,除非上下文另有明确指示。因此“包括A或者B”意思是包括A,或者B,或者A和B。还应了解,对于核酸或者多肽给出的所有碱基大小或者氨基酸大小,和分子量或者分子质量值,都是近似的,提供来说明。
为了有助于浏览本发明的不同实施方式,提供下列专用术语的解释:
佐剂:非特异性增强针对抗原的免疫应答的物质。Singh等人综述了在人类中使用的疫苗佐剂的发展(Nat.Biotechnol.17:1075-1081,1999),在其出版时公开了铝盐和MF59微乳剂是仅有的被批准用于人的疫苗佐剂。
扩增:核酸分子的扩增(例如,DNA或者RNA分子)是指一种实验技术的使用,该技术增加样品中核酸分子的拷贝数。一个扩增的实例是聚合酶链式反应(PCR),其中在允许引物与样品中核酸模版杂交的条件下,样品接触寡核苷酸引物对。引物在合适的条件下延伸,从模板解离,再退火,延伸,再解离以扩增核酸的拷贝数。扩增产物可以利用电泳、限制性核酸内切酶切割模式、寡核苷酸杂交或者连接、和/或核酸测序这类技术进行表征。
扩增方法的其他实例包括链置换扩增,如美国专利5,744,311所公开;无转录等温扩增,如美国专利6,033,881所公开;修复链式反应扩增,如WO 90/01069所公开;连接酶链式反应扩增,如EP-A-320,308所公开;间隙填补连接酶链式反应扩增,如美国专利5,427,930所公开;和NASBATM无RNA转录扩增,如美国专利6,025,134所公开。扩增方法可以改变,包括例如通过其他的步骤或者将扩增与另一个方案联合。
动物:活的多细胞脊椎生物体,包括例如哺乳动物和鸟类的范畴。术语哺乳动物包括人和非人哺乳动物。类似地,术语“对象”包括人类和兽类对象,例如,人,非人灵长类,狗,猫,马,和牛。
抗体:一种蛋白(或者蛋白复合物),包括基本上由免疫球蛋白基因或者免疫球蛋白基因片段编码的一种或者多种多肽。识别的免疫球蛋白基因包括κ、λ、α、γ、δ、ε和μ恒定区基因,以及众多的免疫球蛋白可变区基因。轻链被分为κ或者λ。重链被分为γ、μ、α、δ或者ε,分别依次定义免疫球蛋白类型,IgG,IgM,IgA,IgD和IgE。
基本的免疫球蛋白(抗体)结构单位通常是四聚体。每个四聚体由多肽链的两个相同对组成,每个对具有一个“轻”(大约25kDa)链和一个“重”(大约50-70kDa)链。每条链的N-末端确定一个大约100到110或者更多氨基酸的可变区,主要负责抗原识别。术语“可变轻链”(VL)和“可变重链”(VH)分别指代这些轻链和重链。
此处使用的术语“抗体”包括完整的免疫球蛋白以及许多明确表征的片段。例如,结合靶蛋白(或者在蛋白或者融合蛋白内的表位)的Fabs,Fvs和单链Fvs(SCFvs)也是所述蛋白(或者表位)的特异结合剂。这些抗体片段如下所示:(1)Fab,通过用木瓜蛋白酶消化整个抗体产生的包含抗体分子的单价抗原结合片段的片段,产生完整的轻链和一种重链的一部分;(2)Fab′,通过用胃蛋白酶处理整个抗体然后还原获得的抗体分子片段,产生完整的轻链和一部分重链;每个抗体分子得到两个Fab′片段;(3)(Fab′)2,通过用胃蛋白酶处理整个抗体而没有后续的还原得到的抗体片段;(4)F(ab′)2,通过两个二硫键连接在一起的两个Fab′片段的二聚体;(5)Fv,包含轻链可变区和重链可变区、表达为两条链的遗传工程片段;和(6)单链抗体,一种包含轻链可变区和重链可变区的遗传工程分子,通过合适的多肽接头连接为基因融合的单链分子。制备这些片段的方法是常规的(参见,例如,Harlow和Lane,抗体应用:实验室手册(Using Antibodies:ALaboratory Manual),CSHL,New York,1999)。
本公开方法和组合物中使用的抗体可以是单克隆或者多克隆的。仅仅举例来说,单克隆抗体可以根据Kohler和Milstein (Nature256:495-97,1975)的传统方法或者其衍生方法从鼠类杂交瘤制备。单克隆抗体制备的详细步骤描述于Harlow和Lane,抗体应用:实验室手册(Using Antibodies:A Laboratory Manual),CSHL,New York,1999。
抗体结合亲合力:单一抗体结合位点和配体(例如,抗原或者表位)之间结合的强度。抗体结合位点X对配体Y的亲合力表示为解离常数(Kd),这是占据溶液中存在的一半X结合位点所需的Y浓度。较小(Kd)表明X和Y之间较强或者较高的亲合力相互作用,并且占据位点需要的配体浓度较低。通常,抗体结合亲合力受到互补位识别的表位中一个或多个氨基酸的改变、修饰和/或取代的影响。
在一个实例中,在Ag-ELISA分析中通过终点滴定测量抗体结合亲合力。如果与未改变的表位相比,针对修饰/取代表位的特异抗体的终点滴度相差至少4倍,例如至少10倍,至少100倍或者更大,则通过修饰和/或取代互补位识别的表位中的一个或者多个氨基酸,抗体结合亲合力显著降低(或者可测量地降低)。
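The two quantitative statements above (Kd as the ligand concentration that occupies half of the binding sites, and a change of at least 4-fold in endpoint titer as the threshold for a significant change in binding) can be sketched as follows. The occupancy formula assumes simple one-site equilibrium binding with the free ligand concentration as input; that model and the function names are illustrative assumptions, not part of the disclosure.

```python
def fraction_bound(ligand_conc, kd):
    """Fraction of binding sites occupied under a one-site equilibrium model.

    ligand_conc and kd must be in the same units (for example mol/L).
    """
    if ligand_conc < 0 or kd <= 0:
        raise ValueError("concentration must be >= 0 and Kd must be > 0")
    return ligand_conc / (kd + ligand_conc)


def titer_fold_change(titer_unmodified, titer_modified):
    """Fold difference between two endpoint titers, always >= 1."""
    if titer_unmodified <= 0 or titer_modified <= 0:
        raise ValueError("titers must be positive")
    high = max(titer_unmodified, titer_modified)
    low = min(titer_unmodified, titer_modified)
    return high / low


def affinity_significantly_changed(titer_unmodified, titer_modified, threshold=4.0):
    """Apply the 'at least 4-fold' criterion described in the text."""
    return titer_fold_change(titer_unmodified, titer_modified) >= threshold


if __name__ == "__main__":
    # At a ligand concentration equal to Kd, half of the sites are occupied.
    print(fraction_bound(1e-9, 1e-9))
    # Endpoint titers of 1:1600 versus 1:400 differ 4-fold.
    print(affinity_significantly_changed(1600, 400))
```

Passing `threshold=10.0` or `threshold=100.0` gives the stricter criteria that the text also mentions.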
抗原:能够刺激动物抗体产生或者T细胞应答的化合物、组合物或者物质,包括注射或吸收到动物中的组合物。抗原与特异性体液或者细胞免疫的产物反应,包括那些被异源免疫原诱导的产物。在一个实施方式中,抗原是病毒抗原。
减毒:在活病毒的情况下,例如狂犬病病毒,如果病毒感染细胞或者对象的能力和/或其致病的能力降低(例如,消除),则该病毒被减毒。通常,在给具有免疫能力的对象施用后,减毒病毒至少保留一些引发免疫应答的能力。在一些情况下,减毒病毒能够引发保护性免疫应答而不引起任何感染的征象或者症状。
结合或者稳定结合:如果足量的寡核苷酸形成碱基对或者与其靶核酸杂交,则寡核苷酸与靶核酸结合或者稳定结合,以允许对所述结合的检测。通过靶:寡核苷酸复合物的物理或者功能特性可以检测结合。靶和寡核苷酸之间的结合可以利用本领域技术人员已知的任何方法进行检测,包括功能或者物理结合分析。通过检测结合是否对生物合成过程例如基因表达、DNA复制、转录、翻译等等有可观察到的影响,可以对结合进行功能检测。
检测DNA或者RNA互补链结合的物理方法是本领域公知的,这类方法包括DNase I或者化学足迹法,凝胶迁移和亲合力切割分析,Northern印迹,Southern印迹,点印迹,和光吸收检测方法。例如,一种广泛使用的方法,因为它如此简单并且稳定,包括当温度缓慢升高时,在220到300nm处观察溶液光吸收的变化,所述溶液包含寡核苷酸(或者类似物)和靶核酸。如果寡核苷酸或者类似物与其靶结合,当寡核苷酸(或者类似物)和靶解离或者熔解时,在特征温度处的吸收突然增加。
寡聚物和其靶核酸之间的结合经常表征为温度(Tm),在该温度下,50%的寡聚物从其靶熔解。相对于具有较低Tm的复合物,较高的Tm表示更强或者更稳定的复合物。
cDNA(互补DNA):一段DNA,缺失内部、非编码段(内含子)和确定转录的调控序列。在实验室中通过从细胞提取的信使RNA逆转录,合成cDNA。
电泳:电泳是在电场影响下,带电荷的溶质或者颗粒在液体介质中的迁移。电泳分离被广泛用于高分子分析。特别重要的是蛋白和核酸序列的鉴定。这类分离可基于大小或者电荷的差异。核苷酸序列带有相同的电荷,因此根据大小差异进行分离。电泳可以在无支持的液体介质中进行(例如,毛细管电泳),但更常见的是液体介质穿行过固相支持介质。最广泛使用的支持介质是凝胶,例如,聚丙烯酰胺和琼脂糖凝胶。
筛分凝胶(例如,琼脂糖)阻碍分子的流动。凝胶孔径决定能够自由流过凝胶的分子大小。当分子大小增加时,穿过凝胶的时间值也增加。结果,小分子比大分子更快地通过凝胶,因此在给定时间段内比更大的分子从加样区前进得更远。这类凝胶用于核苷酸序列基于大小的分离。
线性DNA片段迁移通过琼脂糖凝胶,其迁移率与它们分子量的log10成反比。通过利用具有不同琼脂糖浓度的凝胶,能够分辨不同大小的DNA片段。较高浓度的琼脂糖有利于小DNA的分离,而低琼脂糖浓度能够分辨更大的DNA。
表位:抗原决定簇。这些是特定化学基团,例如分子上的连续或者非连续的肽序列,所述基团是抗原性的,即引发特异性免疫应答。基于抗体的三维结构和匹配(或者同源)的表位三维结构,抗体结合特定抗原表位。
“取代表位”包括在表位中的至少一种结构取代,例如一个氨基酸取代另一个。
杂交:寡核苷酸和它们的类似物通过互补碱基之间的氢键合杂交,包括Watson-Crick,Hoogsteen或者反向Hoogsteen氢键合。通常,核酸由含氮碱基组成,所述含氮碱基是嘧啶(胞嘧啶(C),尿嘧啶(U),和胸腺嘧啶(T))或者嘌呤(腺嘌呤(A)和鸟嘌呤(G))。这些含氮碱基在嘧啶和嘌呤之间形成氢键,嘧啶与嘌呤的成键称为“碱基配对”。更具体地,A与T或者U形成氢键,G与C形成键。“互补”是指在不同的核酸序列或者相同核酸序列的两个不同区域之间发生的碱基配对。
“可特异杂交”和“特异互补”是表明互补性的足够程度使得在寡核苷酸(或者其类似物)和DNA或者RNA靶之间发生稳定和特异结合的术语。寡核苷酸或者寡核苷酸类似物无需与其靶序列100%互补以便可特异杂交。当寡核苷酸或者类似物与靶DNA或者RNA分子的结合干扰靶DNA或者RNA的正常功能时,寡核苷酸或者类似物是可特异杂交的,在需要特异结合的情况下,例如在体内分析或者系统中的生理条件下,存在足够程度的互补性以避免寡核苷酸或者类似物与非靶序列的非特异结合。这类结合称为特异杂交。
根据所选杂交方法和组合物的性质和杂交核酸序列的长度,杂交条件导致的特定严谨程度是不同的。通常,杂交温度和杂交缓冲液的离子强度(尤其是Na+和/或Mg++浓度)将决定杂交的严谨性,尽管洗涤时间也影响严谨性。关于达到特定严谨程度所需的杂交条件的计算论述于Sambrook等人(编著),分子克隆:实验室手册第二版1-3册(Molecular Cloning:A Laboratory Manual,2nded.,vol.1-3),9和11章节,Cold Spring Harbor Laboratory Press,Cold Spring Harbor,NY,1989,;和Ausubel等人分子生物学简编第四版(Short Protocols in MolecularBiology,4thed.),John Wiley & Sons,Inc.,1999。
对于本公开来说,“严谨条件”包括如下条件,即在该条件下杂交只在杂交分子和靶序列之间的错配小于25%的情况下发生。为了更精确的定义,“严谨条件”可以分解为特定水平的严谨性。因此,此处使用的“适度严谨”条件是指序列错配大于25%的分子不会杂交的条件;“中等严谨”条件是指错配大于15%的分子不会杂交的条件,而“高度严谨”条件是指错配大于10%的序列不会杂交的条件。“非常高度的严谨”条件是指错配大于6%的序列不会杂交的条件。
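The stringency levels defined above are mismatch thresholds, so they can be written as a small lookup. The sketch below computes the percent mismatch between two pre-aligned sequences of equal length and reports the stringency levels under which hybridization could still occur. It is only a bookkeeping aid for the definitions in the text; it does not model temperature, ionic strength or wash conditions, and the function names are illustrative.

```python
# Maximum percent mismatch tolerated at each stringency level, as defined in the text.
STRINGENCY_LIMITS = [
    ("moderate", 25.0),
    ("medium", 15.0),
    ("high", 10.0),
    ("very high", 6.0),
]


def mismatch_percent(seq_a, seq_b):
    """Percent of mismatched positions between two aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be non-empty and of equal length")
    mismatches = sum(a != b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * mismatches / len(seq_a)


def permitted_levels(mismatch):
    """Stringency levels at which a duplex with this mismatch can still hybridize."""
    return [name for name, limit in STRINGENCY_LIMITS if mismatch <= limit]


if __name__ == "__main__":
    pct = mismatch_percent("ACGTACGTAC", "ACGTACGTAA")  # one mismatch in ten
    print(pct, permitted_levels(pct))
```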
“特异杂交”是指当所述序列存在于复杂混合物(例如,总细胞DNA或者RNA)中时,分子只与或者基本上只与特定核苷酸序列结合、形成双链或者杂交。特异杂交也可能在不同严谨条件下发生。
免疫刺激组合物:此处使用的术语是指用于刺激或者引发脊椎动物特异性免疫应答(或者免疫原性应答)的组合物。免疫刺激组合物可以是蛋白抗原或者用于表达蛋白抗原的质粒载体。在一些实施方式中,免疫原性应答是保护性的或者提供保护性免疫,从而它使脊椎动物能够更好抵抗来自免疫刺激组合物针对的生物体的感染或者疾病进展。
不希望受到特定理论的约束,认为免疫刺激组合物诱导的免疫原性应答可能起于抗体的产生,所述抗体对免疫刺激组合物提供的一种或者多种表位特异。或者,该应答可能包括基于T-辅助细胞或者细胞毒性细胞的应答,所述应答针对免疫刺激组合物提供的一种或者多种表位。所有这3种应答可能源于幼稚或者记忆细胞。这类免疫刺激组合物的一个特定实例是疫苗。
在一些实施方式中,免疫刺激组合物的“有效量”或者“免疫-刺激量”是当施用于对象时足够引起可检测的免疫应答的量。这类应答可能包括,例如,对免疫刺激组合物提供的一种或者多种表位特异的抗体的产生。或者,该应答可能包括基于T-辅助细胞或者CTL的应答,所述应答针对免疫刺激组合物提供的一种或者多种表位。所有这3种应答可能源于幼稚或者记忆细胞。在其他实施方式中,免疫刺激组合物的“保护有效量”是当施用于对象时足够赋予对象保护性免疫的量。
抑制或者治疗疾病:例如在处于患病风险的对象中抑制疾病或者病症的完全发展。疾病的一个特定实例是狂犬病。“治疗”是指疾病或病理状态开始发展后,改善其征象或者症状或者病理病症的治疗介入。就疾病、病理状况或者症状而言,此处使用的术语“改善”是指任意可观察到的治疗的有益作用。有益作用可以证明,例如,通过在易感对象中疾病临床症状的延迟发作,疾病的一些或者全部临床症状的严重程度减轻,疾病的进展减慢,疾病的复发次数减少,对象的整体健康或者康复改善,或者通过本领域公知的对特定疾病特异的其他参数。
分离的:“分离的”或者“纯化的”生物组分(例如核酸,肽,蛋白,蛋白复合物,或者颗粒)已经从组分天然存在的生物体细胞中的其他生物学组分中,即其他染色体和染色体外DNA和RNA和蛋白中,被基本上分离、分离产生或者纯化。因此“分离的”或者“纯化的”核酸、肽和蛋白包括通过标准纯化方法纯化的核酸和蛋白。该术语还包含在宿主细胞中利用重组表达制备的核酸、肽和蛋白,以及化学合成的核酸或者蛋白。术语“分离的”或者“纯化的”并不需要绝对的纯度;它只是用来作为一个相对术语。因此,例如,分离的生物学组分是其中生物学组分比在细胞内其自然环境或者其他生产容器中的生物学组分更富集的生物学组分。优选的,制品被纯化使生物学组分代表制品的全部生物学组分含量的至少50%,例如至少70%,至少90%,至少95%或者更多。
标记:一种可检测的化合物或者组合物,其与另一个分子直接或者间接结合以促进该分子的检测。标记的特定、非限制实例包括荧光标签,酶连接物,和放射性同位素。
核酸分子:核苷酸的多聚形式,包括RNA、cDNA、基因组DNA和上述的合成形成和混合聚合物的有义和反义链。核苷酸是指核糖核苷酸、脱氧核苷酸或者任一种类型核苷酸的修饰形式。此处使用的术语“核酸分子”与“核酸”和“多核苷酸”同义。核酸分子通常至少长10个碱基,除非另作说明。该术语包括DNA的单链和双链形式。多核苷酸可以包括连接在一起天然存在的或者修饰的核苷酸的任一种或者两者,通过天然存在和/或非天然存在的核苷酸连接进行连接。
寡核苷酸:一种核酸分子,通常包括300个碱基或者以下的长度。尤其是,该术语经常是指单链脱氧核糖核苷酸,但它也可以指单链或者双链核糖核苷酸,RNA:DNA杂交和双链DNA。术语“寡核苷酸”还包括寡聚核苷(oligonucleosides)(也就是说,去除磷酸盐的寡核苷酸)和任何其他的有机碱基聚合物。
一些实例中,寡核苷酸长度为大约10到大约90个碱基,例如,长度为12,13,14,15,16,17,18,19或20个碱基。其它寡核苷酸长度为大约25,大约30,大约35,大约40,大约45,大约50,大约55,大约60个碱基,大约65个碱基,大约70个碱基,大约75个碱基或者大约80个碱基。寡核苷酸可以是单链的,例如,用作探针或引物,或者可以是双链的,例如,用于突变体基因的构建。寡核苷酸可以是有义或反义寡核苷酸。寡核苷酸可以按照上述关于核酸分子的论述来修饰。寡核苷酸能够从现有的核酸来源(例如,基因组或cDNA)获得,但是也能够是合成的(例如,通过实验室或者体外寡核苷酸合成来生产)。
开放阅读框(ORF):编码氨基酸的一系列核苷酸三联体(密码子),没有任何内部终止密码子。这些序列通常可翻译为肽/多肽/蛋白/多蛋白。
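The following codons (shown as RNA) are recognized in the art as being used interchangeably to encode each specific amino acid or termination: alanine (Ala or A) GCU, GCC, GCA, or GCG; arginine (Arg or R) CGU, CGC, CGA, CGG, AGA, or AGG; asparagine (Asn or N) AAU or AAC; aspartic acid (Asp or D) GAU or GAC; cysteine (Cys or C) UGU or UGC; glutamic acid (Glu or E) GAA or GAG; glutamine (Gln or Q) CAA or CAG; glycine (Gly or G) GGU, GGC, GGA, or GGG; histidine (His or H) CAU or CAC; isoleucine (Ile or I) AUU, AUC, or AUA; leucine (Leu or L) UUA, UUG, CUU, CUC, CUA, or CUG; lysine (Lys or K) AAA or AAG; methionine (Met or M) AUG; phenylalanine (Phe or F) UUU or UUC; proline (Pro or P) CCU, CCC, CCA, or CCG; serine (Ser or S) UCU, UCC, UCA, UCG, AGU, or AGC; termination codon UAA (ochre), UAG (amber), or UGA (opal); threonine (Thr or T) ACU, ACC, ACA, or ACG; tyrosine (Tyr or Y) UAU or UAC; tryptophan (Trp or W) UGG; and valine (Val or V) GUU, GUC, GUA, or GUG. In each case, the corresponding DNA codon has T in place of U.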
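The codon assignments above form the standard genetic code, with alanine encoded by GCU, GCC, GCA and GCG. The sketch below builds a lookup table from those assignments, checks that all 64 codons are covered exactly once, and translates an open reading frame up to the first stop codon. DNA input is handled by replacing T with U, as the text describes. The function and variable names are illustrative.

```python
# Codons (RNA) per amino acid, keyed by one-letter code; "*" marks termination.
CODONS_BY_AA = {
    "A": "GCU GCC GCA GCG",
    "R": "CGU CGC CGA CGG AGA AGG",
    "N": "AAU AAC",
    "D": "GAU GAC",
    "C": "UGU UGC",
    "E": "GAA GAG",
    "Q": "CAA CAG",
    "G": "GGU GGC GGA GGG",
    "H": "CAU CAC",
    "I": "AUU AUC AUA",
    "L": "UUA UUG CUU CUC CUA CUG",
    "K": "AAA AAG",
    "M": "AUG",
    "F": "UUU UUC",
    "P": "CCU CCC CCA CCG",
    "S": "UCU UCC UCA UCG AGU AGC",
    "T": "ACU ACC ACA ACG",
    "Y": "UAU UAC",
    "W": "UGG",
    "V": "GUU GUC GUA GUG",
    "*": "UAA UAG UGA",
}

# Invert to codon -> amino acid and make sure no codon is listed twice.
CODON_TO_AA = {}
for aa, codons in CODONS_BY_AA.items():
    for codon in codons.split():
        if codon in CODON_TO_AA:
            raise ValueError(f"codon {codon} assigned twice")
        CODON_TO_AA[codon] = aa
assert len(CODON_TO_AA) == 64


def translate(seq):
    """Translate an RNA or DNA ORF, stopping at the first stop codon."""
    rna = seq.upper().replace(" ", "").replace("T", "U")
    protein = []
    for i in range(0, len(rna) - len(rna) % 3, 3):
        aa = CODON_TO_AA[rna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("AUGCCAAAAUAA"))
```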
可操作地连接:当第一核酸序列置于与第二核酸序列的功能关系时,第一核酸序列与第二核酸序列可操作地连接。例如,启动子可操作地连接到编码序列表示启动子影响编码序列的转录或者表达。通常,可操作地连接的DNA序列是连续的,并且必要时在相同阅读框内连接两个蛋白编码区域。如果存在内含子,可操作连接的DNA序列可能不是连续的。
互补位:抗体的一部分,负责抗体与抗原上的抗原决定簇(表位)的结合。
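Pharmaceutically acceptable carriers: The pharmaceutically acceptable carriers useful in this disclosure are conventional. Remington's Pharmaceutical Sciences, by E. W. Martin, Mack Publishing Co., Easton, 15th Edition (1975), describes compositions and formulations suitable for pharmaceutical delivery of one or more therapeutic compounds or molecules, such as one or more SARS-CoV nucleic acid molecules, proteins, or antibodies that bind these proteins, and additional pharmaceutical agents.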
通常,载体的性质将取决于所使用的特定给药方式。例如,肠胃外制剂通常包括注射液,所述注射液包括药学上和生理上可接受的液体,例如水、生理盐水、平衡盐溶液、水性右旋糖、甘油等作为介质。对于固体组合物(例如,粉剂,丸剂,片剂,或者胶囊形式),常规的无毒固相载体可以包括,例如,药物级的甘露醇,乳糖,淀粉,或者硬脂酸镁。除生物中性载体外,要给药的药物组合物可以包含少量无毒的辅助物质,例如湿润或者乳化剂,防腐剂,和pH缓冲剂等等,例如乙酸钠或者脱水山梨醇单月桂酸酯。
多肽:一种聚合物,其中单体是通过酰胺键连接在一起的氨基酸残基。当氨基酸是α-氨基酸时,可以使用L-光学异构体或者D-光学异构体,对许多生物用途来说L-异构体是优选的。此处使用的术语“多肽”或者“蛋白”旨在包括任意氨基酸分子并包括修饰的氨基酸分子。术语“多肽”特别旨在涵盖天然存在的蛋白,以及重组或者合成产生的那些蛋白。
保守氨基酸取代是指当进行取代时,最少干扰最初蛋白的性质,也就是说,蛋白的结构、尤其功能是保守的,不会被这种取代显著改变。保守取代的实例如下所示。
保守取代通常保持(a)取代区域内多肽骨架的结构,例如,片层或者螺旋构象,(b)分子在靶位点的电荷或者疏水性,或者(c)侧链的体积。
根据它们的侧链,通常将氨基酸分类为一种或者多种类型,包括极性,疏水性,酸性,碱性和芳香族的。极性氨基酸的实例包括那些具有侧链功能基团例如羟基、巯基和酰胺的氨基酸,以及酸性和碱性氨基酸。极性氨基酸包括,不限于,天门冬酰胺,半胱氨酸,谷氨酰胺,组氨酸,硒代半胱氨酸,丝氨酸,苏氨酸,色氨酸和酪氨酸。疏水性或者非极性氨基酸的实例包括那些具有非极性脂肪族侧链的残基的氨基酸,例如,不限于,亮氨酸,异亮氨酸,缬氨酸,甘氨酸,丙氨酸,脯氨酸,甲硫氨酸和苯丙氨酸。碱性氨基酸残基的实例包括那些碱性侧链的残基,例如氨基或者胍基基团。碱性氨基酸残基包括,不限于,精氨酸,同聚赖氨酸和赖氨酸。酸性氨基酸残基的实例包括那些具有酸性侧链功能基团的残基,例如羧基。酸性氨基酸残基包括,不限于,天冬氨酸和谷氨酸。芳香族氨基酸包括那些具有芳香族侧链基团的氨基酸。芳香族氨基酸的实例包括,不限于,联苯基丙氨酸,组氨酸,2-萘基丙氨酸(napthylalananine),五氟苯丙氨酸,苯丙氨酸,色氨酸和酪氨酸。注意到一些氨基酸被分到多于一个组,例如,组氨酸,色氨酸和酪氨酸被分类为极性和芳香族氨基酸。被分类到上述各组的其他氨基酸是本领域普通技术人员已知的。
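The substitutions generally expected to produce the greatest changes in protein properties are non-conservative, for instance changes in which (a) a hydrophilic residue, e.g., seryl or threonyl, is substituted for (or by) a hydrophobic residue, e.g., leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue having an electropositive side chain, e.g., lysyl, arginyl, or histidyl, is substituted for (or by) an electronegative residue, e.g., glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g., phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine.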
探针和引物:探针包括连接于可检测的标记物或者其他报告分子的分离核酸分子。典型标记物包括放射性同位素,酶底物,辅因子,配体,化学发光或者荧光剂,半抗原和酶。标记方法和选择适合于不同目的的标记物的指导论述于,例如Sambrook等人编著),分子克隆:实验室手册第二版1-3册(Molecular Cloning:A LaboratoryManual,2nded.,vol.1-3),Cold Spring Harbor Laboratory Press,ColdSpring Harbor,NY,1989和Ausubel等人的分子生物学简编第四版(Short Protocols in Molecular Biology,4thed.),John Wiley & Sons,Inc.,1999。
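Primers are short nucleic acid molecules, for instance DNA oligonucleotides 6 nucleotides or more in length, for example oligonucleotides that hybridize to contiguous complementary nucleotides or to a sequence to be amplified. Longer DNA oligonucleotides may be about 10, 12, 15, 20, 25, 30 or 50 nucleotides or more in length. Primers can be annealed to a complementary target DNA strand by nucleic acid hybridization to form a hybrid between the primer and the target DNA strand, and then extended along the target DNA strand by a DNA polymerase. Primer pairs can be used for amplification of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR) or other nucleic acid amplification methods known in the art. Other examples of amplification include strand displacement amplification, as disclosed in U.S. Patent No. 5,744,311; transcription-free isothermal amplification, as disclosed in U.S. Patent No. 6,033,881; repair chain reaction amplification, as disclosed in WO 90/01069; ligase chain reaction amplification, as disclosed in EP-A-320 308; gap filling ligase chain reaction amplification, as disclosed in U.S. Patent No. 5,427,930; and NASBA™ RNA transcription-free amplification, as disclosed in U.S. Patent No. 6,025,134.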
制备和使用核酸探针和引物的方法描述于例如Sambrook等人(编著)的分子克隆:实验室手册第二版1-3册(Molecular Cloning:ALaboratory Manual,2nd ed.,vol.1-3),Cold Spring Harbor LaboratoryPress,Cold Spring Harbor,NY,1989;Ausubel等人的分子生物学简编第四版(Short Protocols in Molecular Biology,4th ed).,John Wiley &Sons,Inc.,1999;和Innis等人的PCR教程-方法和应用指导(PCRProtocols,AGuide to Methods and Applications),Academic Press,Inc.,San Diego,CA,1990。扩增引物对可以来源于已知序列,例如,通过利用用于该目的的计算机程序例如Primer(Version 0.5,1991,Whitehead Institute for Biomedical Research,Cambridge,MA)。本领域普通技术人员了解特定探针或者引物的特异性随其长度而增加。因此,为了获得更高的特异性,可以选择探针和引物包括靶核苷酸序列的至少20,25,30,35,40,45,50或者更多个连续核苷酸。
蛋白:一种生物分子,尤其是多肽,由基因表达并由氨基酸组成。
纯化的:术语“纯化的”并不需要绝对的纯度;它用作一个相对术语。因此,例如,纯化的蛋白制品是其中目标蛋白比其在细胞内自然环境中更纯的蛋白制品。通常,纯化蛋白制品使得所述蛋白占制品总蛋白含量的至少50%。
重组核酸:一种核酸分子,其不是天然存在的或者具有一个由序列的两个本来分离的片段人工组合制备的序列。这种人工组合通过化学合成或者,更常见的,通过核酸分离片段的人工处理实现,例如,通过遗传工程技术,例如描述于Sambrook等人(编著)的分子克隆:实验室手册第二版1-3册(Molecular Cloning:A Laboratory Manual,2nd ed.,vol.1-3),Cold Spring Harbor Laboratory Press,Cold SpringHarbor,NY,1989。术语重组包括只是通过天然核酸分子一部分的添加、取代或者缺失改变的核酸。
调控序列或者元件:这些术语通常是指影响或者控制基因表达的一类DNA序列。该术语包括的是启动子,增强子,基因座控制区(LCR),绝缘子/临界元件,沉默子,基质结合区(MAR,还称为支架附着区),阻遏子,翻译终止子,复制起点,着丝粒,和减数分裂重组热点。启动子是邻近基因5′端的DNA序列,作为DNA依赖性RNA聚合酶的结合位点,从这里起始转录。增强子是提高从启动子起始的转录水平的控制元件,通常与增强子方向或者与启动子的距离无关。对与它们连接的基因的表达,LCRs提供组织特异的和短暂的调节。LCR功能和它们与基因的相对位置无关,但取决于拷贝数。据认为它们作用于打开核小体结构,从而其他因子可以结合DNA。LCR还可能影响复制时间和起点的使用。绝缘子(亦称临界元件)是通过阻断周围染色质的作用,阻止基因转录活化(或者灭活)的DNA序列。沉默子和阻遏子是抑制基因表达的控制元件;它们对基因的作用与它们的方向或者与基因的距离无关。MAR是结合核支架的DNA内序列;它们能够通过将染色体分离成调控结构域影响转录。据认为MAR介导染色体内高级的环结构。转录终止子是基因邻近区域,在此RNA聚合酶从模板释放。复制起点是在DNA合成或者细胞分裂的复制期基因组开始DNA复制过程的区域。减数分裂重组热点是基因组比减数分裂期间平均值更频繁重组的区域。
复制子:体内作为DNA复制的自主、自我复制单位的任意遗传元件(例如,质粒,染色体,病毒)。
样品:代表整体的一部分、一块或者片段。该术语包括任意物质,包括例如从动物、植物或者环境获得的样品。
“环境样品”包括从室内或者室外环境中无生命的物体或者储库(reservoir)中获得的样品。环境样品包括,但不限于:土壤,水,灰尘和空气样品;大体积样品,包括建筑材料,家具和垃圾填埋物;和其他储库样品,例如动物垃圾,收获的谷物和食品。
“生物样品”是从植物或者动物对象获得的样品。此处使用的生物样品包括可用于检测对象病毒感染的所有样品,包括但不限于:细胞,组织和体液,例如血液;血液的衍生物和部分(例如血清);提取的胆汁;活组织切片或者手术去除的组织,包括例如,未固定的、冷冻的、福尔马林固定的和/或埋入石蜡的组织;泪液;乳汁;皮肤刮擦物;表面冲洗物;尿液;痰液;脑脊液;前列腺液;脓;骨髓抽吸物;BAL;唾液;子宫颈拭子;阴道拭子;和口咽洗涤物。
序列同一性:两个核酸序列或者两个氨基酸序列之间的相似性,根据序列之间的相似性表达,还称为序列同一性。序列同一性经常根据百分比同一性(或者相似性或者同源性)来测量;百分比越高,两个序列越相似。
用于比较的序列比对方法是本领域公知的。不同的程序和比对算法描述于:Smith和Waterman(Adv.Appl.Math.,2:482,1981);Needleman和Wunsch(J.Mol.Biol.,48:443,1970);Pearson和Lipman(Proc.Natl.Acad.Sci.,85:2444,1988);Higgins和Sharp(Gene,73:237-44,1988);Higgins和Sharp(CABIOS,5:151-53,1989);Corpet等人(Nuc.Acids Res.,16:10881-90,1988);Huang等人(Comp.Appls.Biosci.,8:155-65,1992);和Pearson等人(Meth.Mol.Biol.,24:307-31,1994)。Altschul等人(Nature Genet.,6:119-29,1994)提出了序列比对方法和同源性计算的详细考虑因素。
比对工具ALIGN(Myers和Miller,CABIOS 4:11-17,1989)或者LFASTA(Pearson和Lipman,1988)可用于进行序列比较(InternetProgram1996,W.R.Pearson and the University of Virginia,“fasta20u63”version 2.0u63,出厂日期1996年12月)。ALIGN比较彼此的完整序列,而LFASTA比较局部相似的区域。在因特网的NCSA站点上现有这些比对工具和它们各自的教学软件。或者,为了比较大于大约30个氨基酸的氨基酸序列,可以使用“Blast 2 sequences”功能,用默认BLOSUM62矩阵设置为默认参数,(空位开放罚分11,每个残基空位罚分1)。当比对短肽(少于约30个氨基酸)时,应使用“Blast2sequences”功能进行比对,采用PAM30矩阵设为默认参数(开放空位9,延伸空位1罚分)。BLAST序列比较系统是可用的,例如,来自NCBI站点;还可参见Altschul等人,J.Mol.Biol.,215:403-10,1990;Gish和States,Nature Genet.,3:266-72,1993;Madden等人,Meth.Enzymol.,266:131-41,1996;Altschul等人,Nucleic Acids Res.,25:3389-402,1997;和Zhang和Madden,Genome Res.,7:649-56,1997.
在一些情况下蛋白的直向同源物(等同于其他种类的蛋白)表征为具有超过75%的序列同一性,所述序列同一性利用设置到默认参数的ALIGN,与特定蛋白的氨基酸序列进行全长比对算出。当利用这种方法评估时,与参照序列具有更高相似性的蛋白将显示同一性的百分比增加,例如至少80%,至少85%,至少90%,至少92%,至少95%,或者至少98%的序列同一性。此外,可以对已公开的融合蛋白的一个或者两个结合结构域的全长比较序列同一性。
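Percent identity over a full-length alignment, and the 75% ortholog threshold mentioned above, reduce to counting matching columns. The sketch below takes two sequences that have already been aligned (for example by ALIGN or BLAST, which are not reimplemented here) and uses one common convention: identical residues divided by the number of alignment columns, ignoring columns where both sequences have a gap. Other tools use other denominators, so this convention and the function names are assumptions made for illustration.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two pre-aligned sequences of equal length."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Drop columns in which both sequences carry a gap character.
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("alignment has no informative columns")
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


def meets_ortholog_threshold(aligned_a, aligned_b, threshold=75.0):
    """True if identity exceeds the threshold (greater than 75% by default)."""
    return percent_identity(aligned_a, aligned_b) > threshold


if __name__ == "__main__":
    print(percent_identity("MKVL", "MKIL"))
    print(meets_ortholog_threshold("MKVLA", "MKVLG"))
```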
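When sequence identity is compared over significantly less than the entire sequence, homologs will typically possess at least 80% sequence identity over short windows of 10-20 amino acids, and may possess at least 85%, at least 90%, at least 95%, or at least 99% sequence identity, depending on their similarity to the reference sequence. Sequence identity over such short windows can be determined using LFASTA; methods are described at the NCSA site. One of skill in the art will appreciate that these sequence identity ranges are provided for guidance only; it is entirely possible that strongly significant homologs could be obtained that fall outside the ranges provided. Similar homology concepts apply to nucleic acids as are described for proteins. An alternative indication that two nucleic acid molecules are closely related is that the two molecules hybridize to each other under stringent conditions.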
因为遗传密码的简并性,不显示高度同一性的核酸序列仍可能编码相似的氨基酸序列。应了解利用这种简并性可以造成核酸序列的变化以产生多重核酸序列,所述多重核酸序列各编码基本上相同的蛋白。
特异结合剂:一种基本上只结合指定靶的试剂。因此蛋白特异结合剂基本上只结合指定蛋白,或者蛋白内的特定区域。此处使用的蛋白特异结合剂包括基本上结合特定多肽的抗体和其他试剂。抗体可以是对多肽特异的单克隆或者多克隆抗体,以及其免疫有效部分(“片段”)。
通过使用或者调节常规步骤,可以轻易地确定特异试剂基本上只结合特异多肽。利用Western印迹步骤的合适的体外分析实例包括IFA和Ag-ELISA,在许多标准文本中有描述,包括Harlow和Lane,抗体应用:实验室手册(Using Antibodies:A Laboratory Manual),CSHL,NewYork,1999。
转化:“转化”细胞是其中通过分子生物学技术已经导入核酸分子的细胞。该术语包括可以将核酸分子导入这类细胞的所有技术,包括利用病毒载体转染,利用质粒载体转化,和通过电穿孔导入裸DNA,脂质体转染,和粒子枪加速。
载体:作为导入宿主细胞的核酸分子,从而产生转化的宿主细胞。载体可以包括允许其在宿主细胞中复制的核酸序列,例如复制起点(参与启动DNA合成的DNA序列)。载体还可以包括一种或者多种选择性标志基因和其他本领域已知的遗传元件。
病毒:在活细胞内复制的微小感染性生物体。病毒通常基本上由一个蛋白被膜包围的单核酸核组成,具有只在活细胞内复制的能力。“病毒复制”是通过至少一个病毒生活周期发生的其它病毒的产生。病毒可以摧毁宿主细胞的正常功能,造成细胞表现为病毒决定的方式。例如,病毒感染可能导致细胞产生细胞因子,或者对细胞因子应答,而未感染的细胞通常不会这样。
尽管与此处描述的相似或者等同的方法和材料可用于本发明的实践或者检验,但合适的方法和材料在下面描述。所有此处提及的出版物、专利申请、专利和其他参考文献在此完整引入作为参考。在不一致的情况下,以本说明书为准,包括术语的解释。此外,材料、方法和实例只是说明性的,而不是为了限制。
IV.数个实施方式的概述
此处第一个实施方式提供的是重组狂犬病病毒基因组,包括SEQID NO:1(全长ERA序列)所示的核酸。还提供该基因组编码的分离的狂犬病病毒蛋白,包括包含下列序列所示的氨基酸序列的特定蛋白:SEQ ID NO:2(N蛋白);SEQ ID NO:3(P蛋白);SEQ ID NO:4(M蛋白);SEQ ID NO:5(G蛋白);或者SEQ ID NO:6(L蛋白);和编码这类蛋白的分离核酸分子。举例来说,这类分离的核酸分子包括下列所示的核苷酸序列:SEQ ID NO:1的核苷酸71-1423(N蛋白);SEQ ID NO:1的核苷酸1511-2407(P蛋白);SEQ ID NO:1的核苷酸2491-3104(M蛋白);SEQ ID NO:1的核苷酸3318-4892(G蛋白);或者SEQ ID NO:1(L蛋白)的核苷酸5418-11,801。
还提供具有SEQ ID NO:7所示核酸序列的重组病毒基因组,其不同于SEQ ID NO:1,因为在G基因和psi-区之间的polyA tract上缺失一个腺苷残基。SEQ ID NO:7还编码如SEQ ID NO:2-6所示的蛋白。
还提供ERA病毒株衍生物的基因组,如SEQ ID NO:8-18所示。在某些实施方式中,基因组存在于载体中,例如质粒。
另一个描述的实施方式是使用此处描述的方法测序全长狂犬病病毒基因组的系统。还描述了用于异源蛋白表达的病毒载体系统。
另一个实施方式提供包括此处提供的一种或者多种核酸分子,或者一种或者多种蛋白的组合物。任选的,这类组合物包含药学上可接受的载体,佐剂,或者其两种或更多种的组合。
还提供在对象中引发针对抗原表位的免疫应答的方法,包括给对象导入包括此处描述的核苷酸、肽或者多肽的组合物,从而在对象中引发免疫应答。
本公开的另一个方面涉及用于产生重组狂犬病病毒的载体系统。该载体系统包括第一载体(转录载体),包含全长狂犬病病毒反基因组DNA(或者其衍生物),和一组辅助载体,包含编码至少一种狂犬病病毒株ERA蛋白的核酸。转染宿主细胞中载体的表达引起活的重组狂犬病病毒的产生。在某些实施方式中,反基因组DNA是ERA株(例如,SEQ ID NO:1或者SEQ ID NO:7)或者其衍生物,例如SEQ IDNO:8-18中的一种。在某些实施例中,载体是质粒。
为了促进全长病毒RNA的恢复,转录载体可以包括,按5′到3′方向:锤头核酶;狂犬病病毒反基因组cDNA;和丁型肝炎病毒核酶。选择锤头核酶的核苷酸与狂犬病病毒的反义基因组序列互补。反基因组cDNA的转录受至少一个CMV启动子和噬菌体T7RNA聚合酶启动子的转录调控,通常是在这两个启动子的控制下。
辅助载体通常包括包含编码狂犬病病毒N蛋白的多核苷酸序列的载体;包含编码狂犬病病毒P蛋白的多核苷酸序列的载体;包含编码狂犬病病毒M蛋白的多核苷酸序列的载体;包含编码狂犬病病毒L蛋白的多核苷酸序列的载体;和包含编码噬菌体T7RNA聚合酶的多核苷酸序列的载体。在一个实施方式中,T7RNA聚合酶包括核定位信号(NLS)。任选的,载体系统还包括包含编码狂犬病病毒G蛋白的多核苷酸序列的载体。
编码狂犬病病毒P、M、L或G蛋白或者T7聚合酶的一种或者多种多核苷酸序列的转录受CMV启动子和T7启动子二者的转录调控。相反,编码狂犬病病毒N蛋白的多核苷酸序列的转录受T7启动子的转录调控,并且转录是帽-非依赖性的(cap-independent)。
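Another embodiment is a live rabies vaccine comprising a recombinant rabies virus genome provided herein. Examples of such recombinant rabies genomes include the sequence shown as ERA G333 (SEQ ID NO:8); the sequence shown as ERA 2G (SEQ ID NO:10); and the sequence shown herein as ERA 2G333 (SEQ ID NO:13). Optionally, the rabies vaccine is attenuated.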
还提供一种产生活狂犬病病毒(例如,用于免疫原性组合物,例如疫苗)的方法,通过将载体系统导入宿主细胞。在载体系统转染到合适的宿主细胞中后,活的和任选减毒的病毒被恢复。通过这类方法产生的活狂犬病疫苗的制备和给药也是本文所预期的。
还公开了一种接种对象抗狂犬病的方法,该方法包括向对象施用有效量的根据提供的说明所述的活狂犬病疫苗,使对象的细胞感染狂犬病疫苗,其中在对象中产生抗-狂犬病免疫应答。在一个实施方式中,对象是人。在另一个实施方式中,对象是非人动物。例如,在有些情况下非人动物是猫,狗,大鼠,小鼠,蝙蝠,狐狸,浣熊,松鼠,负鼠,山狗或者狼。
在某些实施方式中,狂犬病疫苗肠道给药。例如,在一些情况下肠道给药包括口服给药。口服给药包括例如通过为接种野生动物群接种而设计的食物诱饵进行给药。
还提供包括所述活狂犬病疫苗(例如,减毒的活狂犬病疫苗)和药学上可接受的载体或者赋形剂的药物组合物。
V.测序完整狂犬病病毒属基因组的方法
为了便利全长ERA基因组的测序,开发了一种对全长负链RNA病毒进行测序的方法。该方法适合于狂犬病病毒属的测序,例如狂犬病病毒,以及其他负链RNA病毒。狂犬病病毒是单负链RNA病毒,具有大约12kb的基因组,范围在11,918(澳大利亚蝙蝠狂犬病病毒属)和11,940(Mokola病毒)个碱基之间。GENBANK中提供的狂犬病病毒核酸序列主要集中于编码下述蛋白的序列--核衣壳蛋白(N),糖蛋白(G),磷蛋白(P)和基质蛋白(M)基因,它们接近基因组的3′端。现有的种系发生分析主要基于N和G基因。但是,对于关系较远的狂犬病病毒株,RNA依赖性RNA聚合酶(L)基因是用于种系发生分析的最合适候选者。令人遗憾地,公共基因数据库中很少提供L基因序列。此外,据认为狂犬病病毒末端的前导区和trailer区对病毒转录和复制(的调节)非常重要。这些可能是例如用于核蛋白包装的保守区域或者L/P蛋白的结合位点。在前导-N,N-P,P-M,M-G,假-基因区和G-L中的基因间区域还作为病毒转录启始的信号。因此,不仅编码区域,而且病毒基因组内的非编码区域,都可以用于种系发生分析或者进化研究。利用此处提供的全基因组测序方法,这些序列都可以更容易地进行分析。
该方法包括单步逆转录和二步克隆进合适的载体。该方法在载体中产生容易测序的基因组,无需进行易出错的反复RT-PCR反应。利用在狂犬病基因组末端发现的反向重复(和其他狂犬病病毒属的基因组),已经设计出通用引物,并在此处描述用于此处所述的快速全基因组测序步骤。
狂犬病病毒的前导和trailer区包含用于病毒转录和复制的信号。基于对GenBank提供的基因组序列的分析,在狂犬病病毒或者狂犬病相关病毒包括Mokola病毒中,末端11个核苷酸是严格保守的。此处提供的测序方法原理基于末端11个互补核苷酸。因为这两种11个核苷酸的序列是互补的,它们不能用于后续PCR反应。应了解,具有反向重复的其他病毒利用对应于所述重复的引物能够类似地进行扩增。该11个反基因组有义核苷酸被设计为用于纯化的ERA基因组的逆转录引物,其完整性通过大小比较和Northern印迹进行验证。利用N,P,M,G,L基因探针和11个核苷酸作为寡核苷酸探针,其只结合基因组RNA而不是病毒mRNA,通过Northern印迹确认全基因组cDNA。
利用精心设计的保守末端序列对应引物,完全可以在一个反应中逆转录狂犬病病毒全基因组,条件是病毒基因组制备物的质量要高。
ERA序列与SAD序列密切相关,SAD序列是其衍生物。这并不奇怪,因为在1970年代ERA从CDC送到瑞士,在其被寄到德国前,在那里研究人员对它进行改造以便在细胞中生长,在德国它进一步衍生,并且衍生物在1990年左右被完全测序。迄今为止,根据血清交叉保护和遗传研究,已经将狂犬病和狂犬病相关病毒分为7种不同类型:典型狂犬病病毒1型(包括ERA),2型(Lagos蝙蝠),3型(Mokola),4型(Duvenhage),5型(欧洲蝙蝠狂犬病病毒属[EBL]I),6型(EBLII)和7型(澳大利亚蝙蝠病毒)。序列分析在下列领域起着重要作用:系统发生学、进化研究、基因功能预测研究和其他相关领域,包括定位病毒转录和复制调控区域,因此生物信息学趋向于潜在的治疗药物。
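As one of ordinary skill in the art will appreciate, with the development of reverse transcription polymerase chain reaction (RT-PCR) technology, it is now relatively easy to reverse transcribe RNA of 12 kb or more into cDNA in a single reaction. Under optimized conditions, PCR can amplify targets larger than 30 kb in a single reaction.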
利用此处提供的产生全长病毒基因组序列的方法,尤其是狂犬病基因组序列,分析不同病毒株现在已成为现实。利用得到的全长基因组还能够有效设计减毒病毒,例如用于免疫刺激组合物和疫苗的免疫作用或者生产。
不存在“通用”狂犬病病毒基因组,但这些基因组是相关的。在不同类型中相似性从60%到100%。一些区域,例如L基因,似乎更保守,而其他区域,例如不编码多肽的psi区,是更可变化的。不仅狂犬病和狂犬病相关病毒漂变,而且任意RNA病毒也将随时间变化。病毒如何改变和出现仍是未解决的问题。因此,全基因组序列分析对于进化、致病性和基因功能研究是重要的。
就涉及全基因组测序而言,此处描述的该系统是用于狂犬病病毒的第一个。相信其适合其他RNA病毒,尤其在狂犬病病毒属中。目前,对于狂犬病病毒种系发生研究,科学家只利用N,P或者G基因,它们在被感染的细胞或者组织中是最丰富的。已知对于关系较远的株的比较,包括大半基因组的L基因可能是一个理想的候选位点,应该被使用。令人遗憾地,由于只有非常有限的资料可以利用,这类进化比较是不可能的,更不用说全基因组序列了。还是对于病毒转录和复制研究来说,据推测位于基因组3′和5′末端的前导和trailer区起着重要作用。基因间区域还是病毒反式和顺式研究的信号。所有这些数据是相当有限的,因为它们不包括在mRNA内。只有全基因组序列可以提供这个水平的必要信息。全基因组测序不仅可用于疫苗开发,它还适用于基本的病毒转录和复制研究。它还可应用于开发siRNA和基因治疗。
VI.ERA基因组测序
利用此处描述的方法,已经产生ERA狂犬病病毒基因组的唯一序列。该序列显示于SEQ ID NO:1。在基因组下列位置编码ERA狂犬病病毒的5种蛋白(SEQ ID NO:2-6):N,71-1423;P,1511-2407;M,2491-3104;G,3318-4892;和L,5418-11801。ERA和SAD-B19之间的同源性分别是:N 99.56%,P 98.65%,M 96.53%,G 99.05%和L 99.20%。ERA和SAD-B19之间的一个特定差异是G和假基因之间的基因间区域,SAD-B19G转录终止/多腺苷酸化信号被破坏。
ERA狂犬病病毒全基因组序列是利用反向遗传学进行疫苗开发和致病性研究的先决条件。
VII.用于产生病毒的优化系统
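Examples 6 and 7 provide an optimized set of conditions for producing ERA virus, with titers as high as 10^10 ffu/ml. In a bioreactor, recovered virus can grow to approximately 10^9 to 10^10 ffu/ml. This high level of production is of utmost importance for oral vaccine development, because it allows sufficient vaccine material to be produced within a reasonable time and with a reasonable allocation of resources.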
对于亲代和重组ERA株来说,提供的生长条件可以稳定产生这种高病毒滴度。这些生产数据对于潜在的狂犬病口服疫苗开发非常重要。
VIII.用于产生G-病毒的BSR-G细胞系
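Although RV strains lacking the G protein have previously been rescued from BHK cells, this had not been possible with ERA strain virus lacking the G protein. After intracerebral or intramuscular inoculation of mice with ERA-G, no mice died or showed any signs of rabies.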
只有补充糖蛋白,ERA-G(不含糖蛋白)才能在细胞中生长。另外,突变的病毒不能传播。为了帮助ERA-G生长,建立了BSR-G细胞系,其组成性表达ERA糖蛋白。在下面的实施例中描述该细胞系的产生。该细胞系用于RV株的恢复,例如在缺乏G下难以恢复的ERA-G,以及用于优化其他株的恢复。
IX.用于工程化狂犬病病毒疫苗和异源蛋白表达的反向遗传系统
直接通过分子生物学方法不易操纵RNA。传统的RNA病毒疫苗来自天然减毒的分离物,其难以控制并造成不可预测的结果。反向遗传技术使操纵RNA病毒成为DNA成为可能,其可以根据精心设计被突变、切除或者重建。每个基因功能都可以仔细、独立和一致地研究,这有利于疫苗开发。反向遗传学涉及将RNA病毒基因组逆转录成cDNA,并且克隆入载体,例如质粒。在转染宿主细胞后,载体被转录成RNA,被结构蛋白质包装,所述结构蛋白质还可以由质粒提供。包装的RNA形成核糖核蛋白复合物,这导致可以被恢复的病毒颗粒。
尽管已经公开了用于狂犬病病毒(RV)反向遗传学的3个系统(Schnell等人,The EMBO J.13,4195-4203,1994;Inoue等人,J.Virol.Method.107,229-236,2003;Ito等人,Microbiol.Immunol.47,613-617,2003),但这些系统不易适应其他株。目前,即使当病毒株之间是紧密相关的,也没有狂犬病病毒株借助不同病毒株的辅助质粒而恢复。因此,对于任意特异病毒株突变或者疫苗开发来说,必须开发一种特定处理系统。
ERA株是用于狂犬病口服疫苗开发的合适候选物,但其残余致病性是明显的。在1970年代间,对ERA RV进行了广泛的疫苗开发(Black和Lawson,Can.J.Comp.Med.44:169-176,1980;Charlton,和Casey,Can.J.Vet.Res.20:168-172,1978;Lawson,和Crawley,Can.J.Vet.Res.36:339-344,1972)。ERA和SAD-B19均起源于SAD。在初期口服疫苗试验中,SAD-B19在浣熊和臭鼬中均是有效的,而ERA不是。此外,在动物试验中证实,ERA杀死脑内(i.c.)给药的两周龄小鼠。这些观察引起这两种RV株之间关系和微细改变的潜在影响的疑问。根据全病毒基因组序列比较,ERA和SAD-B19共享极高的核苷酸同一性和氨基酸同源性。为了阐明狂犬病病毒这些高度相关株的免疫原性和致病性的遗传基础,开发出一种针对ERA的有效反向遗传系统,其不同于先前报道的针对狂犬病病毒的反向遗传系统。
此处公开的狂犬病反向遗传系统可用于各种目的,包括:(1)以指定方式减毒ERA病毒用于疫苗开发;(2)产生ERA病毒载体用于表达异源ORF(例如,在治疗组合物的情况下,例如疫苗和基因治疗);(3)确定ERA RV发病机理的遗传基础;和(4)确定ERA和SAD病毒之间遗传差异的生物学影响。
反向遗传系统具有下列特征中的一些或者全部,利用示范性ERA株反基因组cDNA,示意性图示于图1A。
该系统基于一个全长转录质粒外加多个辅助质粒(例如,5个辅助质粒)。辅助质粒编码N、P、L蛋白,和任选的G蛋白,以及T7聚合酶。尽管G蛋白不是病毒拯救必需的,但当包括在转染中时,它改善病毒恢复效率或者病毒出芽。
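Transcription involves cellular DNA-dependent RNA polymerase II, which is already present in mammalian cells, and T7 RNA polymerase, which is supplied by the pNLST7 plasmid. Together, the two polymerases make virus recovery both efficient and reproducible.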
在转录质粒中,锤头和丁型肝炎病毒核酶在狂犬病病毒(例如,ERA株)反基因组cDNA的侧翼,通过转录能够产生可靠的反基因组vRNA的5′和3′末端。将锤头序列的前10个核苷酸设计成与反义基因组序列的前10个核苷酸互补。例如,对于ERA反基因cDNA来说,锤头序列的前10个核苷酸是:TGTTAAGCGT(SEQ ID NO:19)。
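The design rule above, in which the first 10 nucleotides of the hammerhead sequence are complementary to the first 10 nucleotides of the antigenome, can be checked with a reverse complement. The sketch below takes the first 10 nucleotides of the 5′ antigenome terminus from the Le5 primer quoted in Example 1 (ACGCTTAACAA, SEQ ID NO:24) and compares the result with SEQ ID NO:19. Treating "complementary" as the antiparallel reverse complement is an interpretation, adopted here because it reproduces the listed sequence. Names are illustrative.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


# 5' antigenome terminus, taken from the Le5 primer (SEQ ID NO:24).
ANTIGENOME_5PRIME_11 = "ACGCTTAACAA"
# First 10 nucleotides of the hammerhead ribozyme (SEQ ID NO:19).
HAMMERHEAD_FIRST_10 = "TGTTAAGCGT"

if __name__ == "__main__":
    designed = reverse_complement(ANTIGENOME_5PRIME_11[:10])
    print(designed, designed == HAMMERHEAD_FIRST_10)
```

The same function could be applied to the terminus of another lyssavirus strain to propose the matching hammerhead arm for that strain.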
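Two modified T7 RNA polymerase constructs have been established that support virus recovery more efficiently than the wild-type T7 RNA polymerase used previously. In the first, the initial ATG of the T7 RNA polymerase has been mutated to AT. The second carries an eight-amino-acid nuclear localization signal (NLS) derived from the SV40 virus large T antigen, fused after the first ATG of the parental T7: ATG CCA AAA AAG AAG AGA AAG GTA GAA (SEQ ID NO:20). The eight codons following the ATG encode the NLS. Addition of the NLS causes the T7 RNA polymerase to be located predominantly in the nucleus. In the proposed transfection mechanism for the NLS-modified plasmid, the DNA/transfection reagent complex binds to the cell surface. The complex is taken up into endosomes/lysosomes by endocytosis, and the DNA is released into the cytosol. In the absence of the NLS, most of the transfected plasmid remains in the cytosol, and only a small percentage of the released DNA reaches the nucleus, where it is transcribed into RNA. After protein synthesis, the NLS T7 RNA polymerase is transported back into the nucleus, where the helper plasmids (carrying the T7/CMV promoters) are transcribed by both NLS T7 and cellular polymerase II. Consequently, more mRNA from the helper plasmids and more cRNA from the full-length pTMF or its derivatives are synthesized, resulting in highly efficient virus recovery.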
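Translating the N-terminal sequence quoted above (SEQ ID NO:20) shows where the eight-residue NLS sits relative to the initiating methionine. The sketch below uses a deliberately minimal codon table that contains only the seven distinct codons occurring in that sequence; the names are illustrative.

```python
# Minimal DNA codon table: only the codons present in SEQ ID NO:20.
CODON_TO_AA = {
    "ATG": "M",
    "CCA": "P",
    "AAA": "K",
    "AAG": "K",
    "AGA": "R",
    "GTA": "V",
    "GAA": "E",
}

# N-terminal coding sequence of the NLS-modified T7 polymerase, as quoted in the text.
NLS_T7_N_TERMINUS = "ATG CCA AAA AAG AAG AGA AAG GTA GAA"


def translate(spaced_codons):
    """Translate a whitespace-separated list of DNA codons."""
    return "".join(CODON_TO_AA[codon] for codon in spaced_codons.upper().split())


if __name__ == "__main__":
    peptide = translate(NLS_T7_N_TERMINUS)
    nls = peptide[1:]  # residues following the initiating methionine
    print(peptide, nls, len(nls))
```

The residues after the methionine, PKKKRKVE, account for the eight amino acids attributed to the NLS.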
在NLST7通过CMV启动子的初始表达后,NLST7聚合酶结合pT7进行NLST7基因的转录。通过在细胞核中转录物的修饰,更多NLST7mRNA被合成,引起NLST7聚合酶的更多表达。NLST7聚合酶以及全长反基因组转录单位的pT7受NLST7聚合酶的控制,其充当“自身基因”。NLST7RNA聚合酶的自身基因机理图示于图2。在T7RNA聚合酶在细胞核中表达后,转染的T7构建体继续转录全长RNA模板用于N蛋白包装和/或L蛋白结合,增强病毒恢复效率。
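The T7 polymerase plasmid and all other plasmids, except pTN, the plasmid encoding the N protein, are under the control of both CMV and T7 transcriptional regulatory elements. The nucleic acid encoding the N protein is under the control of the T7 promoter and is translated in a cap-independent manner via an IRES (internal ribosome entry site). If all plasmids are cloned under the control of the CMV promoter, cellular RNA polymerase II alone can support RV recovery. In the ERA reverse genetics system disclosed herein, only pTN is controlled by the T7 promoter alone and translated in a cap-independent manner; all other constructs are under the control of both CMV and T7 transcriptional regulatory elements. In RV, N synthesis is typically abundant, and the ratio of N, P and L is approximately 50:25:1. To mimic wild-type viral transcription and assembly in RV reverse genetics, N expression should be the highest. With the aid of the NLS T7 polymerase and the IRES mode of translation, the N protein is expressed efficiently after plasmid transfection. Because mammalian cells contain no T7 transcription initiation signals, this arrangement reduces competition with host housekeeping genes for transcription and increases T7 transcription efficiency.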
为了增强病毒蛋白的产生,可以构建辅助质粒以掺入Kozak序列,为了每种蛋白编码序列的翻译效率,所述Kozak序列已经优化。示范性的优化Kozak序列显示于表2。
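Table 2: Optimized Kozak sequences.
Construct | Promoter | Kozak context | SEQ ID NO: | Special features |
pTMF | CMV/T7 | n/a | n/a | HamRZ/HdvRZ at the termini |
pTN | T7/IRES | ACCACCATGG | SEQ ID NO:21 | n/a |
pMP | CMV/T7 | ACCACCATGA | SEQ ID NO:22 | n/a |
pMG | CMV/T7 | ACCACCATGG | SEQ ID NO:21 | n/a |
pML | CMV/T7 | ACCACCATGC | SEQ ID NO:23 | n/a |
pNLST7 | CMV/T7 | ACCACCATGA | SEQ ID NO:22 | 8-amino-acid NLS |
CMV/T7 indicates that the CMV promoter precedes the pT7 promoter. HamRZ denotes the hammerhead ribozyme, and HdvRZ denotes the hepatitis delta virus ribozyme. pTMF is the full-length transcription plasmid; pTN, pMP, pMG, pML and pNLST7 are helper plasmids.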
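The Kozak sequences in Table 2 share the upstream context ACCACC and differ only in the nucleotide that follows the ATG. The sketch below splits each sequence into its upstream context, the base at position -3 and the base at position +4, using the conventional numbering in which the A of the ATG is +1. That numbering convention is general background rather than something stated in the text, and the names are illustrative.

```python
# Kozak contexts per helper plasmid, copied from Table 2.
KOZAK = {
    "pTN": "ACCACCATGG",
    "pMP": "ACCACCATGA",
    "pMG": "ACCACCATGG",
    "pML": "ACCACCATGC",
    "pNLST7": "ACCACCATGA",
}


def kozak_context(seq):
    """Split a Kozak sequence into upstream context, -3 base and +4 base."""
    seq = seq.upper()
    start = seq.find("ATG")
    # Need at least three bases upstream and one base downstream of the ATG.
    if start < 3 or start + 3 >= len(seq):
        raise ValueError("sequence lacks an ATG with enough flanking bases")
    return {
        "upstream": seq[:start],
        "minus3": seq[start - 3],
        "plus4": seq[start + 3],
    }


if __name__ == "__main__":
    for plasmid, seq in KOZAK.items():
        print(plasmid, kozak_context(seq))
```

In every construct the -3 position is a purine (A), while the +4 position varies with the first base of the second codon of the encoded protein.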
Five days after transfection in the ERA reverse genetics system, rescued virus reliably and reproducibly grew to 10^7 ffu/ml without further amplification.
X.衍生病毒
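The complete mechanism of rabies virus pathogenicity has not been fully characterized, which makes rational vaccine design problematic. For example, the RV glycoprotein appears to play a role in both the pathogenicity and the immunogenicity of rabies virus. Mutations (for example at position 333 of the glycoprotein) produce viruses that do not cause lethal infection in adult mice (Ito et al., Microbiol. Immunol. 38:479-482, 1994; Ito et al., J. Virol. 75:9121-9128, 2001). However, overexpression of the RV glycoprotein has been shown to cause apoptosis and an enhanced antiviral immune response (Faber et al., J. Virol. 76:3374-3381, 2002). ERA strains with a modified G protein (e.g., deleted or carrying amino acid substitutions) may therefore be particularly suitable strains for vaccine development.
Using the reverse genetics system disclosed herein, recombinant rabies viruses with favorable properties can be designed. In addition to the parental ERA strain, exemplary recombinant viruses disclosed herein include ERA without the Psi-region (ERA-), ERAgreen1 (green fluorescent protein gene inserted in the Psi-region), ERAgreen2 (green fluorescent protein gene cloned in the P-M intergenic region), ERA2g (an extra copy of G in the Psi-region), ERAg3 (G mutated at amino acid 333), ERA2g3 (an extra copy of mutated G in the Psi-region), ERAgm (M and G genes exchanged in the genome), and ERAgmg (two copies of G in the rearranged ERAgm construct). These exemplary strains are illustrated schematically in FIG. 3.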
具有去除和/或突变的糖蛋白的修饰株特别适合用作免疫原性组合物,用于狂犬病病毒的接触前和后的治疗,因为这类病毒不能在细胞之间传播和造成疾病。此外,修饰的病毒例如ERA2g3,其因为编码突变糖蛋白的序列的加倍而过表达G蛋白,被预测增强凋亡并引发增强的抗病毒免疫应答。
例如,小鼠脑内和肌内接种G缺失(ERA-G)后,没有观察到不良事件。此外,ERA-G保护小鼠免受RV街毒株的致命攻击。因此,ERA-G看起来是用于疫苗开发的ERA的更安全的株。此外,ERA G氨基酸333位的精氨酸突变为谷氨酸(从核苷酸AGA到GAG,和在ERAg3和ERA2g3株中一样),产生一种减毒病毒。减毒作用通过动物接种试验得到证实。因为RV G的过表达引起凋亡和抗病毒免疫应答的增强,减毒病毒例如具有多拷贝G的ERA2g3作为疫苗候选物特别有利。
此处描述的用于狂犬病疫苗开发的系统不限于G基因的修饰,而是可类似地应用于每种病毒蛋白。为了促进修饰不同蛋白组分的系统化方法,根据此处提供的序列数据,通过反向遗传学可以解决致病性的详细定位。
此处描述的反向遗传系统还使狂犬病病毒载体系统能够用于外源(异源)的基因表达。所述的非限制性实施方式是基于ERA病毒。在整合到ERA RV基因组后,此处显示的额外转录单位是在两个不同位置起作用的。在一个实施方式中,额外转录单位整合到psi区的位置(trans 1)中。在另一个实施方式中,额外转录单位被插入RV P-M基因间区域。
在单链负性RNA病毒中,基因组3′末端序列主要作为转录启动子,而基因组5′末端序列作为复制启动子(Conzelmann和Schnell,J.Virol.68:713-719,1994;Finke等人,J.Virol.71:7281-7288,1997)。因此,trans2占据了导致比trans1更强转录的位置,所述转录驱动ORF表达。因此,此处公开的载体可用于调节异源ORF的表达到所需水平,只需通过选择ORF插入载体的位置。例如,当需要蛋白的高水平表达时,trans 2通常是异源ORF插入的理想位置。类似地,如果需要更适中水平的表达,异源ORF可以插入trans 1。无需过度实验,本领域技术人员就能确定对于每个ORF和特定应用来说的最适表达水平。
因此,此处提供的病毒载体是用于外源基因插入和表达的优异构建体,正如此处根据绿色荧光蛋白基因的表达所证实的那样。尽管所公开的载体的功用和效果是针对GFP进行证实的,但是应注意到该载体同样适用于表达感兴趣的任何基因或者ORF。
如所述,此处提供的基于狂犬病的异源表达系统可用于表达任何外源(异源)蛋白。尤其预期,举例来说,这类异源基因来自于另一种病原生物,例如其他致病性病毒,例如SARS病毒,Nipah病毒,等等。此外,公开的载体可以用于输送其他治疗基因,包括例如,编码具有治疗价值的蛋白或者功能RNA分子,例如siRNAs。
XI.药物和免疫刺激组合物及其用途
药物组合物包括减毒的或者固定的拯救病毒,包括至少一个病毒表位的病毒核酸序列或者病毒多肽也包括在本公开中。这些药物组合物包括治疗有效量的一种或者多种活性化合物,例如减毒或者固定病毒,包含至少一个病毒表位的病毒多肽,或者一种或者多种编码这些多肽的核酸分子,与药学上可接受的载体结合。预期在某些实施方式中,包括多个病毒表位的病毒核酸序列或者病毒多肽将用于制备本公开的药物组合物。
此处公开的是适合用作免疫刺激组合物的物质,所述组合物用于病毒感染的抑制或者治疗(接触前或者接触后),例如,狂犬病病毒感染。
在一个实施方式中,免疫刺激组合物包含减毒的或者固定的拯救(重组)病毒。在另一个实施方式中,组合物包含分离的或者重组的病毒多肽,所述多肽包括至少一种病毒表位(例如狂犬病病毒G蛋白)。在另一个实施方式中,免疫刺激组合物包含一种核酸载体,所述载体包括至少一种此处描述的病毒核酸分子,或者包括编码至少一种病毒表位的核酸序列。在一个特定的、非限制性实施例中,编码至少一种病毒表位的核酸序列在一个翻译单位中表达,例如那些描述于公开的PCT申请PCT/US99/12298和PCT/US02/10764(两篇均在此完整引入)。
免疫刺激病毒、病毒多肽、编码这类多肽的构建体或者载体,与药学上可接受的载体或者赋形剂组合,用于作为免疫刺激组合物施用于人或者动物对象。
免疫原性制剂可以方便地作为单位剂型,并利用常规药物技术制备。这类技术包括将活性成分和药物载体或者赋形剂结合的步骤。通常,通过将活性成分与液体载体均匀和紧密地结合来制备制剂。适于肠胃外给药的制剂包括含水和不含水的无菌的注射溶液,其可以包括抗氧化剂,缓冲液,抑菌剂和溶质,所述溶质使制剂与目标接受者的血液等渗;以及含水和不含水的无菌悬浮液,其可以包括悬浮剂和增稠剂。制剂可以是单位剂量或者多剂量容器,例如,密封的安瓿和小瓶,并可以保存于冷冻干燥(冻干)条件,只需使用前即刻添加无菌液体载体,例如,注射用水。从本领域普通技术人员通常使用的无菌粉剂、颗粒和片剂可以制备即用注射溶液和悬浮液。
在某些实施方式中,单位剂量制剂是包括给药成分的剂量或单位,或者其合适的部分的制剂。应了解除了上面特别提及的成分外,此处包括的制剂还可以包含本领域普通技术人员经常使用的其他药剂。
此处提供的组合物,包括那些用作免疫刺激组合物的,可以通过不同的途径给药,例如口服,包括颊和舌下,直肠的,肠胃外,气雾剂,鼻,肌内,皮下,真皮内和局部。它们可以不同形式给药,包括但不限于溶液,乳液和悬浮液,微球,颗粒,微粒,纳米颗粒和脂质体。
根据给药途径,给药的体积将不同。举例来说,肌肉注射可以为大约0.1ml到大约1.0ml。本领域普通技术人员将了解不同给药途径的合适体积。
免疫刺激化合物(例如,疫苗)领域的较为近期的发展是直接注射编码肽抗原的核酸分子(概述于Janeway & Travers,Immunobiology:The Immune System In Health and Disease,13.25页,Garland Publishing,Inc.,New York,1997;和McDonnell & Askari,N.Engl.J.Med.334:42-45,1996)。包括此处所述核酸分子的载体或者包括编码病毒多肽的核酸序列的载体可以在这类DNA免疫方法中使用,所述病毒多肽包括至少一个病毒表位。
因此,此处使用的术语“免疫刺激组合物”还包括核酸疫苗,其中编码病毒多肽的核酸分子在药物组合物中施用于对象,所述病毒多肽包括至少一个病毒表位。对于基因免疫来说,本领域技术人员已知的合适输送方法包括直接肌肉注射质粒DNA(Wolff等人,Hum.Mol.Genet.1:363,1992),输送与特定蛋白载体复合的DNA(Wu等人,J.Biol.Chem.264:16985,1989),DNA与磷酸钙的共沉淀(Benvenisty和Reshef,Proc.Natl.Acad.Sci.83:9551,1986),DNA在脂质体中的包裹(Kaneda等人,Science 243:375,1989),粒子轰击(Tang等人,Nature356:152,1992;Eisenbraun等人,DNA Cell Biol.12:791,1993),和利用克隆逆转录病毒载体进行体内感染(Seeger等人,Proc.Natl.Acad.Sci.81:5849,1984)。类似地,核酸疫苗制品可以通过病毒载体给药。
选择各剂量免疫刺激组合物中免疫刺激化合物的量为诱导免疫刺激或者免疫保护应答,而没有显著的不良副作用。这种量将根据所使用的特异免疫原以及它如何呈递而不同。初始注射可以从大约1μg到大约1mg,一些实施方式从大约10μg到大约800μg,其他实施方式从大约25μg到大约500μg。在免疫刺激组合物的初始给药后,对象可以接受一次或者数次加强给药,充分间隔开。加强给药可以从大约1μg到大约1mg,其他实施方式从大约10μg到大约750μg,还有的其他从大约50μg到大约500μg。间隔1-5年的周期性加强,例如3年,可以是合乎需要的,以保持所需水平的保护性免疫。
还预期提供的免疫刺激分子和组合物可以间接施用于对象,通过体外首先刺激细胞,然后受刺激的细胞施用于对象以引发免疫应答。此外,药物或者免疫刺激组合物或者治疗方法可以与其他疗法联合给药。
含有免疫刺激组合物的食物-诱饵的制备也是本领域普通技术人员已知的。例如,含有活RV疫苗的食物诱饵的制备公开于Wandeler等人(Rev.Infect.Dis.10(suppl.4):649-653,1988),Aubert等人(pp.219-243,in Lyssaviruses(Rupprecht等人,编著),Springer-Verlag,NewYork,1994),和Fu等人(pp.607-617,in New Generation Vaccines(2ndEdit.)(Levine等人,编著),Marcel Dekker,Inc.,New York,1997),各文献的完整公开均在此引入作为参考。
XII.试剂盒
此处还提供用于病毒感染检测和/或诊断的试剂盒,例如感染狂犬病病毒或者其他狂犬病病毒属。此处提供的分析试剂盒的实例是作为抗原的重组病毒多肽(或者其片段)和作为第二抗体的酶联抗人抗体。这类试剂盒的实例还可以包括一种或者多种酶底物。如果来自对象的样品包含抗病毒特异性蛋白的抗体,则这类试剂盒可用于检测。在这类试剂盒中,合适量的病毒多肽(或者其片段)提供于一个或者多个容器中,或者附着在底物上。例如,病毒多肽可以提供在水溶液中或者作为冷冻干燥的或者冻干的粉剂。提供病毒多肽的容器可以是任何常规容器,即能够保持供给形式,例如,微量离心管,安瓿或者瓶子。
试剂盒提供的各多肽的量可以是任何合适的量,并可以取决于该产品针对的市场。例如,如果试剂盒适合于研究或者临床用途,提供的各多肽的量将可能是足以进行数次分析的量。确定合适量的一般性指导可以参见,例如,Ausubel等人(编著),分子生物学简编(ShortProtocols in Molecular Biology),John Wiley and Sons,New York,NY,1999和Harlow和Lane,抗体应用:实验室手册(Using Antibodies:ALaboratory Manual),CSHL,New York,1999。
提供下列实施例以说明某些特定特征和/或实施方式。这些实施例不应解释为将本发明限制于所述的特定特征或者实施方式上。
实施例
实施例1:ERA RV的测序
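This example describes a method for sequencing the full-length genome of a rhabdovirus, in this case a rabies virus.
Rabies virus strain ERA was obtained from the CDC archive and propagated in baby hamster kidney (BHK-21) cells. After 4 days of infection in a 37°C, 5% CO2 incubator, the virus was harvested and purified. Briefly, the cell supernatant was collected and centrifuged at 2,000 rpm for 15 minutes to remove cell debris. The clarified supernatant was further centrifuged at 18,000 rpm for 1 hour. The pellet was resuspended in PBS and used for rabies genomic RNA extraction.
Total RNA was extracted from ERA-infected BHK-21 cells with Trizol reagent (GIBCO Invitrogen) according to the protocol recommended by the manufacturer. ERA genomic RNA was purified from the concentrated ERA virus supernatant with the High Pure Viral RNA Kit from Roche.
The integrity of the purified ERA genomic RNA was verified by gel electrophoresis and by Northern blot with N, P, G, and M hybridization probes. Briefly, 5 μg of genomic RNA was loaded onto a denaturing RNA gel and transferred to a nylon membrane for hybridization. Probes were labeled with the Dig DNA labeling kit from Roche according to the manufacturer's instructions.
The 11 conserved nucleotides at the 5′ end of the rabies virus antigenome were designed as the primer for reverse transcription. The RT reaction was carried out with a first-strand cDNA synthesis kit from Invitrogen. Complete cDNA from the ERA genome was confirmed by Northern blot, using hybridization with the N, P, M, and G probes and with the 11 conserved nucleotides as a digoxigenin-labeled oligonucleotide probe.
Two sets of primers were selected for PCR reactions that amplify the whole ERA genome in two contiguous fragments. One set consisted of the 11 nucleotides at the 5′ antigenomic terminus, Le5: ACGCTTAACAA (SEQ ID NO: 24), and BLp3: GTCGCTTGCTAAGCACTCCTGGTA (SEQ ID NO: 25). The other set contained the 11 complementary nucleotides at the 5′ genomic terminus, Le3: TGCGAATTGTT (SEQ ID NO: 26), and BLp5: GAGTGCTTAGCAAGCGACCT (SEQ ID NO: 27). The BLp3 and BLp5 primers are located in a relatively conserved region of the rabies virus genome.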
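The relationship between the four primers quoted above can be checked with a few lines of Python. This is only an illustrative sketch: the helper names (`complement`, `reverse_complement`, `shared_overlap`) are hypothetical and are not part of the disclosed method. It tests whether Le3 is the base-by-base complement of Le5, and whether BLp3 and BLp5 anneal to opposite strands of the same site, which is what would let the two PCR fragments share a junction.

```python
# Primer sequences as quoted in Example 1 (SEQ ID NOs: 24-27).
LE5 = "ACGCTTAACAA"
LE3 = "TGCGAATTGTT"
BLP3 = "GTCGCTTGCTAAGCACTCCTGGTA"
BLP5 = "GAGTGCTTAGCAAGCGACCT"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(seq: str) -> str:
    """Base-by-base complement, same left-to-right order."""
    return seq.upper().translate(_COMPLEMENT)


def reverse_complement(seq: str) -> str:
    """Complement read in the opposite direction."""
    return complement(seq)[::-1]


def shared_overlap(a: str, b: str) -> str:
    """Longest suffix of `a` that is also a prefix of `b`."""
    for n in range(min(len(a), len(b)), 0, -1):
        if a.endswith(b[:n]):
            return b[:n]
    return ""


if __name__ == "__main__":
    print("Le3 is the complement of Le5:", complement(LE5) == LE3)
    junction = shared_overlap(reverse_complement(BLP5), BLP3)
    print("Junction shared by BLp5 (other strand) and BLp3:", junction)
```

If the overlap is non-empty, the two primers target the same stretch of the genome from opposite strands, consistent with the description of two contiguous fragments.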
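PCR fragments were purified and cloned into the TOPO vector purchased from Invitrogen. Sequencing was performed on an ABI 310 sequencer, and the sequences were assembled with Accelrys BioEdit software or SeqMerge software in the GCG environment.
The complete aligned sequence of the ERA genome is shown in SEQ ID NO: 1. With reference to SEQ ID NO: 1, Table 3 provides the positions of the individual protein coding sequences. The amino acid sequences of the N, P, M, G, and L proteins are provided in SEQ ID NOs: 2 to 6, respectively.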
Table 3: Positions of protein coding sequences of rabies virus ERA strain
Gene/Genome | NT | Position in rERA sequence |
ERA | 11930 | 1-11930 |
N | 1412 | 71-1423 |
P | 962 | 1511-2407 |
M | 789 | 2491-3104 |
G | 1647 | 3318-4892 |
Psi-region | 398 | |
L | 6445 | 5418-11801 |
Leader | 58 | |
Trailer | 70 | |
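As a sanity check on coding-sequence coordinates, the sketch below uses the CDS feature ranges annotated for SEQ ID NO: 1 in the sequence listing (the `<221>CDS` entries), not the ranges in the table above, which differ from them by a few nucleotides. It computes each inclusive span and confirms that it is a whole number of codons. The dictionary and function names are hypothetical.

```python
# CDS ranges (1-based, inclusive) from the <221>CDS features of SEQ ID NO: 1.
CDS_FEATURES = {
    "N": (71, 1420),
    "P": (1514, 2404),
    "M": (2496, 3101),
    "G": (3317, 4888),
    "L": (5417, 11797),
}


def cds_summary(features):
    """Return {gene: (length_nt, codon_count)} for inclusive ranges."""
    summary = {}
    for gene, (start, end) in features.items():
        length = end - start + 1  # inclusive coordinates
        if length % 3 != 0:
            raise ValueError(f"{gene}: {length} nt is not a multiple of 3")
        summary[gene] = (length, length // 3)
    return summary


if __name__ == "__main__":
    for gene, (nt, codons) in cds_summary(CDS_FEATURES).items():
        print(f"{gene}: {nt} nt, {codons} codons")
```

The codon counts can be compared with the residue numbering printed under the translated sequence in the listing, where the N protein ends at residue 450.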
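This method can be used for rabies and rabies-related viruses. There are at least 7 putative types of rabies and rabies-related viruses. The sequencing method provided can also be applied to other negative-strand RNA viruses, because nearly all negative-strand RNA virus genomes have about 12 conserved nucleotides at both termini, which can similarly serve as primers for RT-PCR. The primers will of course differ between viral species, and one of ordinary skill can determine the specific primer sequences based on the teachings herein.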
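Example 2: Construction of plasmids for a rabies virus reverse genetics system
This example describes the design and development of a reverse genetics system for rabies virus. Rabies virus strain ERA was obtained from the ATCC and prepared as described (Wu et al., J. Virol. 76, 4153-4161, 2002). To obtain full-length viral genomic cDNA, BSR cells (a clone of baby hamster kidney, BHK, cells) were infected with ERA strain virus and grown in Dulbecco's minimal essential medium supplemented with 10% fetal bovine serum. Supernatants were collected and centrifuged at 22,000 g for 1 hour. The virus pellets were collected for viral genomic RNA purification with an RNA virus extraction kit purchased from Qiagen (Valencia, CA), according to the manufacturer's instructions. The integrity of the viral genomic RNA was confirmed by gel electrophoresis. Viral genomic cDNA was transcribed with a first-strand cDNA synthesis kit from Invitrogen (Carlsbad, CA). The reverse transcription (RT) reaction mixture was used for polymerase chain reaction (PCR) amplification to synthesize the full-length viral genomic cDNA and the N, P, G, and L genes, respectively. To assemble the full-length viral genomic cDNA, the pTMF plasmid was constructed in 4 sequential steps, as illustrated schematically in FIG. 1B. Superscript III reverse transcriptase and high-fidelity Platinum Pfx polymerase (Invitrogen, Carlsbad, CA) were used for cDNA transcript synthesis and the subsequent PCR amplifications. For the reverse transcription reaction, 1 μg of purified genomic RNA was used in the RT reaction mixture, which was incubated at 50°C for 80 minutes and then heated at 85°C for 5 minutes to inactivate Superscript III. After the RT reaction, 1 unit of RNase H was added to digest the template RNA in the cDNA-RNA hybrids.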
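To generate the full-length viral genomic cDNA, two overlapping fragments were amplified by RT-PCR as follows. Fragment 1 (F1) was RT-PCR amplified with primers Le5-Kpn (CCGGGTACCACGCTTAACAACCAGATCAAAGA; SEQ ID NO: 28, containing a Kpn1 recognition site) and Le3-Blp (TAGGTCGCTTGCTAAGCACTCCTGGTAGGAC; SEQ ID NO: 29, containing a Blp1 recognition site). Fragment 2 (F2) was RT-PCR amplified with primers Tr5-Blp (GTCCTACCAGGAGTGCTTAGCAAGCGACCTA; SEQ ID NO: 30, containing a Blp1 recognition site) and Tr3-Pst (AAAACTGCAGACGCTTAACAAATAAACAACAAAA; SEQ ID NO: 31, containing a Pst1 recognition site). After successful synthesis of the two fragments, F1 digested with the Kpn1 and Blp1 restriction enzymes was gel purified and cloned into the pBluescriptIISK(+) phagemid (Stratagene, La Jolla, CA) to form the pSKF1 plasmid. The gel-purified F2 fragment, cut with Blp1 and Pst1, was then cloned into the pSKF1 plasmid to form the full-length viral antigenomic cDNA. A hammerhead ribozyme (Oligo1, CAAGGCTAGCTGTTAAGCGTCTGATGAGTCCGTGAGGACGAAACTATAGGAAAGGAATTCCTATAGTCGGTACCACGCT; SEQ ID NO: 32, containing Nhe1 and Kpn1 recognition sites; Oligo2, AGCGTGGTACCGACTATAGGAATTCCTTTCCTATAGTTTCGTCCTCACGGACTCATCAGACGCTTAACAGCTAGCCTTG; SEQ ID NO: 33, containing Kpn1 and Nhe1 recognition sites) was synthesized with an Nhe1 recognition site at its 5′ end and a Kpn1 site at its 3′ end. It was fused ahead of the 5′ end of the F1 fragment. A hepatitis delta virus ribozyme (Oligo3, GACCTGCAGGGGTCGGCATGGCATCTCCACCTCCTCGCGGTCCGACCTGGGCATCCGAAGGAGGACGCACGTCCACTCGGATGGCTAAGGGAGGGCGCGGCCGCACTC; SEQ ID NO: 34, containing Pst1 and Not1 recognition sites; Oligo4, GAGTGCGGCCGCGCCCTCCCTTAGCCATCCGAGTGGACGTGCGTCCTCCTTCGGATGCCCAGGTCGGACCGCGAGGAGGTGGAGATGCCATGCCGACCCCTGCAGGTC; SEQ ID NO: 35, containing Not1 and Pst1 recognition sites) (Symons, Annu. Rev. Biochem. 61:641-671, 1992) was synthesized with a Pst1 site at its 5′ end and a Not1 site at its 3′ end, and was fused to the 3′ end of the F2 fragment. The connective Kpn1 recognition site between the hammerhead ribozyme and the F1 fragment, and the Pst1 site between the F2 fragment and the hepatitis delta virus ribozyme, were removed by site-directed mutagenesis. The full-length viral antigenomic cDNA was thus sandwiched between the hammerhead and hepatitis delta virus ribozymes. It was excised and cloned into the pBluescriptIISK(+) phagemid to make the pSKF construct. The complete viral antigenomic cDNA with the two ribozymes was fused downstream of the T7 transcription initiation site, under the control of the CMV immediate-early promoter in the pcDNA3.1/Neo(+) plasmid (Invitrogen, Carlsbad, CA). This last step completed construction of the pTMF plasmid.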
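Because typographic marking of the restriction sites does not survive in plain text, the sites can be located programmatically. The following sketch scans the four F1/F2 primers for the enzymes named in this example. The recognition sequences in `SITES` are the commonly cited ones for these enzymes and are an assumption here, since the text does not spell them out; `find_sites` is a hypothetical helper, and positions are 1-based.

```python
import re

# Commonly cited recognition sequences (assumed, not stated in the text).
# Blp1 tolerates any base at the central position, written as a regex class.
SITES = {
    "Kpn1": "GGTACC",
    "Blp1": "GCT[ACGT]AGC",
    "Pst1": "CTGCAG",
    "Nhe1": "GCTAGC",
    "Not1": "GCGGCCGC",
}

# Primers as quoted in Example 2 (SEQ ID NOs: 28-31).
PRIMERS = {
    "Le5-Kpn": "CCGGGTACCACGCTTAACAACCAGATCAAAGA",
    "Le3-Blp": "TAGGTCGCTTGCTAAGCACTCCTGGTAGGAC",
    "Tr5-Blp": "GTCCTACCAGGAGTGCTTAGCAAGCGACCTA",
    "Tr3-Pst": "AAAACTGCAGACGCTTAACAAATAAACAACAAAA",
}


def find_sites(seq: str) -> dict:
    """Map enzyme name -> list of 1-based start positions found in seq."""
    hits = {}
    for enzyme, pattern in SITES.items():
        positions = [m.start() + 1 for m in re.finditer(pattern, seq.upper())]
        if positions:
            hits[enzyme] = positions
    return hits


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, find_sites(seq))
```

The same function can be applied to the ribozyme oligonucleotides (SEQ ID NOs: 32-35) to confirm which end of each carries the Nhe1, Kpn1, Pst1, or Not1 site.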
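The wild-type ERA viral genome contains a polyA tract of 8 residues (polyA8) in the intergenic region between G and Psi. To distinguish rescued ERA (rERA) virus from the parental strain, a stretch of 7 As (polyA7) was introduced into the pTMF construct by deleting one A, in place of the original polyA8. After rERA virus was recovered, RT-PCR was performed, and subsequent sequence data confirmed the presence of the introduced polyA7 sequence marker.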
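The polyA8 versus polyA7 marker lends itself to a simple sequence check. The sketch below is a minimal illustration under stated assumptions: `WT_WINDOW` is a short window taken from SEQ ID NO: 1 just downstream of the G stop codon, and the helper names are hypothetical. It measures the longest run of A residues and shows the effect of deleting one A.

```python
import re

# Window from SEQ ID NO: 1 downstream of the G coding sequence; the run of
# eight A residues is written explicitly to make the count unambiguous.
WT_WINDOW = "ggggttctttttg" + "a" * 8 + "cctgggttcaatagt"


def longest_a_run(seq: str):
    """Return (0-based start, length) of the longest run of 'a' in seq."""
    runs = list(re.finditer(r"a+", seq.lower()))
    if not runs:
        raise ValueError("sequence contains no A residues")
    best = max(runs, key=lambda m: len(m.group()))
    return best.start(), len(best.group())


def shorten_a_run(seq: str, by: int = 1) -> str:
    """Delete `by` residues from the longest A run (the marker edit)."""
    start, length = longest_a_run(seq)
    if by >= length:
        raise ValueError("cannot delete the whole tract")
    return seq[:start] + seq[start + by:]


if __name__ == "__main__":
    marked = shorten_a_run(WT_WINDOW)
    print("wild-type tract length:", longest_a_run(WT_WINDOW)[1])
    print("marker tract length:", longest_a_run(marked)[1])
```

In practice the same length check would be run on the sequenced RT-PCR product from recovered virus rather than on a hand-typed window.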
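pTN plasmid: The N gene was amplified by RT-PCR with primers 5N (ACCACCATGGATGCCGACAAGATTG; SEQ ID NO: 36, containing an Nco1 recognition site and the start codon) and 3N (GGCCCATGGTTATGAGTCACTCGAATATGTCTT; SEQ ID NO: 37, containing an Nco1 recognition site and the stop codon), and cloned into the pCITE-2a(+) (Cap-Independent Translation Enhancer) plasmid (Novagen, Madison, WI).
pMP plasmid: The P gene was amplified by RT-PCR with primers 5P (TTGGTACCACCATGAGCAAGATCTTTGTCAATC; SEQ ID NO: 38, containing a Kpn1 recognition site and the start codon) and 3P (GGAGAGGAATTCTTAGCAAGATGTATAGCGATTC; SEQ ID NO: 39, containing an EcoR1 recognition site and the stop codon), and cloned into the pcDNA3.1/Neo(+) plasmid.
pMG plasmid: The G gene was amplified by RT-PCR with primers 5G (TTGGTACCACCATGGTTCCTCAGGCTCTCCTG; SEQ ID NO: 40, containing a Kpn1 recognition site and the start codon) and 3G (AAAACTGCAGTCACAGTCTGGTCTCACCCCCAC; SEQ ID NO: 41, containing a Pst1 recognition site and the stop codon), and cloned into the pcDNA3.1/Neo(+) plasmid.
pML plasmid: The L gene was amplified by RT-PCR with primers 5L (ACCGCTAGCACCACCATGCTCGATCCTGGAGAGGTC; SEQ ID NO: 42, containing an Nhe1 recognition site and the start codon) and 3L (AAAACTGCAGTCACAGGCAACTGTAGTCTAGTAG; SEQ ID NO: 43, containing a Pst1 recognition site and the stop codon), and cloned into the pcDNA3.1/Neo(+) plasmid.
pT7 plasmid: Genomic DNA was extracted from bacterial strain BL-21 (Novagen, Madison, WI) with the DNeasy Tissue Kit (Qiagen, Valencia, CA) according to the manufacturer's instructions. The T7 RNA polymerase gene was amplified by PCR from the purified genomic DNA with primers 5T7 (TCGCTAGCACCACCATGAACACGATTAACATCGCTAAG; SEQ ID NO: 44, containing an Nhe1 recognition site and the start codon) and 3T7 (GATGAATTCTTACGCGAACGCGAAGTCCGACTC; SEQ ID NO: 45, containing an EcoR1 recognition site and the stop codon), and cloned into the pcDNA3.1/Neo(+) plasmid.
pNLST7 plasmid: An 8-amino-acid nuclear localization signal (NLS) derived from SV40 large T antigen was added to the N-terminus of the T7 RNA polymerase by PCR amplification, with the pT7 plasmid as the template and primers 5T7NLS (TCGCTAGCCACCATGCCAAAAAAGAAGAGAAAGGTAGAAAACACGATTAACATCGCTAAGAAC; SEQ ID NO: 46, containing the NLS) and 3T7. The amplified fragment, designated NLST7, was cloned into pcDNA3.1/Neo(+) to form the pNLST7 construct.
pGFP plasmid: The Monster green fluorescent protein (GFP) plasmid phMGFP was purchased from Promega (Madison, WI). The GFP gene was amplified by PCR with primers GFP5 (AAAACTGCAGGCCACCATGGGCGTGATCAAG; SEQ ID NO: 47, containing a Pst1 recognition site and the start codon) and GFP3 (CCGCTCGGTACCTATTAGCCGGCCTGGCGGG; SEQ ID NO: 48, containing a Kpn1 recognition site and the stop codon), and cloned into the pcDNA3.1/Neo(+) plasmid.
All plasmid constructs were sequenced at least 3 times to confirm the absence of unexpected mutations or deletions after cloning, site-directed mutagenesis, or gene deletion. In addition, the presence of the marker sequence between the glycoprotein and Psi-region, consisting of a polyA tract of 7 adenosine residues rather than the 8 residues observed in the wild-type ERA genome, was confirmed.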
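Example 3: T7 RNA polymerase expression in BSR cells
This example demonstrates that adding a nuclear localization signal to the bacteriophage T7 RNA polymerase directs expression of the polymerase to the nucleus of transfected cells. Transfection of BSR cells was performed as described by Wu et al. (J. Virol. 76, 4153-4161, 2002). Briefly, BSR cells at nearly 80% confluence in a 6-well plate were transfected with 0.5 μg per well of either the pT7 or the pNLST7 plasmid. At 48 hours after transfection, the cells were fixed with 80% chilled acetone for 1 hour and dried at room temperature. Mouse monoclonal antibody against T7 RNA polymerase and goat anti-mouse IgG-FITC conjugate were added in turn, with washes, in a two-step indirect fluorescence staining procedure. Results were recorded after UV microscopy. The T7 RNA polymerase expressed from pT7, which lacks a nuclear localization signal, was observed mainly in the cytoplasm, whereas the NLST7 polymerase, which contains the nuclear localization signal, was present mainly in the nucleus. These results indicate that addition of the NLS effectively targets T7 RNA polymerase to the nucleus of transfected cells.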
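Example 4: Establishment of a BSR cell line constitutively expressing ERA glycoprotein
This example describes the design and production of a BHK cell line that constitutively expresses ERA glycoprotein. The BHK cell line expressing ERA glycoprotein was constructed with the Flp-In™ system (Invitrogen, Carlsbad, CA). Briefly, before transfection, Flp-In™-BHK cells (containing a single integrated Flp recombination target site) were grown to approximately 20% confluence in one 6-well plate and maintained in regular DMEM medium supplemented with 100 μg/ml Zeocin. The ERA G gene was amplified by PCR with primers EF5G5 (CACCATGGTTCCTCAGGCTCTCCTG; SEQ ID NO: 49) and EF5G3 (TCACAGTCTGGTCTCACCCCCAC; SEQ ID NO: 50), with the pMG plasmid as the template, and cloned into the pEF5/FRT/V5-D-TOPO vector (Invitrogen, Carlsbad, CA) to create the pEFG construct. The pOG44 plasmid, which expresses Flp recombinase, was co-transfected with pEFG into the Flp-In™-BHK cells at a ratio of 10:1. After transfection, the cells were maintained in DMEM without Zeocin but containing 400 μg/ml hygromycin B. After 48 hours, the cells were split so that they would be no more than 20% confluent the next day. The cells were grown in hygromycin B selective medium at 37°C for approximately 1 week. Expression of the target ERA G was detected by indirect fluorescence staining with human anti-G monoclonal antibody and goat anti-human IgG-FITC conjugate. The cell line constitutively expressing G was designated BHK-G and was used to grow ERA-G virus.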
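Example 5: Defined modifications of the Evelyn-Rokitnicki-Abelseth (ERA) strain of rabies virus
In addition to the parental ERA virus strain described above, derivative virus strains were developed with the reverse genetics system disclosed herein. Several exemplary modified viruses were produced, namely ERA- (deletion of the complete psi-region), ERAgreen1 (green fluorescent protein gene inserted into the psi-region of the ERA viral genome), ERAgreen2 (green fluorescent protein gene inserted into the intergenic region between the phosphoprotein and matrix protein genes), ERA2g (containing an extra copy of the glycoprotein in the psi-region), ERAg3 (with a mutation at amino acid 333 of the glycoprotein), ERA2g3 (with an extra copy of the glycoprotein mutated at aa333 in the psi-region), ERA-G (glycoprotein deleted), ERAgm (M and G genes switched in the genome), and ERAgmg (two copies of G in the rearranged ERAgm construct). These derivatives are illustrated schematically in FIG. 3. By optimizing the growth conditions as described, all of the rescued viruses could reach viral titers of 10^9 to 10^10 ffu/ml in tissue culture flasks and bioreactors.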
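Gene deletion and site-directed mutagenesis in the reverse genetics system
Deletion of the Psi-region of the rabies virus ERA genome
The complete Psi-region of the rabies virus ERA genome was deleted as follows. The 3′ Δψ fragment was amplified by PCR with pTMF as the template and primers 5Δψ (CCCTCTGCAGTTTGGTACCGTCGAGAAAAAAACATTAGATCAGAAG; SEQ ID NO: 51, containing Pst1 and Kpn1 recognition sites) and Le3-Blp, and cloned into the pCR-BluntII-TOPO vector (Invitrogen, Carlsbad, CA) to construct the pPΔ5ψ plasmid. With the same template, the 5′ Δψ fragment was amplified by PCR with primers SnaB5 (ATGAACTTTCTACGTAAGATAGTG; SEQ ID NO: 52, containing a SnaB1 recognition site) and 3Δψ (CAAACTGCAGAGGGGTGTTAGTTTTTTTCAAAAAGAACCCCCCAAG; SEQ ID NO: 53, containing a Pst1 recognition site) and was then cloned into the pPΔ5ψ plasmid to complete construction of the pPΔψ plasmid. The fragment recovered by SnaB1 and Pst1 restriction digestion of the pPΔψ plasmid replaced its counterpart in the pSKF construct to make the pSKFΔψ plasmid. The whole DNA fragment containing the ERA genomic cDNA, obtained by digesting the pSKFΔψ plasmid with Nhe1 and Not1, was recloned into the pcDNA3.1/Neo(+) plasmid to complete construction of pTMFΔψ. To verify the rescued Psi-deleted strain, designated ERA-, primers covering the Psi-region were used in RT-PCR with total RNA from ERA- infected BSR cells. A 400 bp fragment corresponding to the Psi-region was amplified only from rERA virus, not from ERA-. Sequence data confirmed the complete deletion of the Psi-region.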
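Deletion of the glycoprotein gene in the rabies virus ERA genome:
The 5′ gΔψ fragment was amplified by PCR with pSKF as the template and primers SnaB5 and 3Δg (CAAACTGCAGAGGGGTGTTAGTTTTTTTCACATCCAAGAGGATC; SEQ ID NO: 54). After digestion with the SnaB1 and Pst1 restriction enzymes, the recovered fragment was cloned to replace its counterpart in the pSKFΔψ construct. With the same template, the 3′ gΔψ fragment was amplified by PCR with primers 5Δg (CCTCTGCAGTTTGGTACCTTGAAAAAAACCTGGGTTCAATAG; SEQ ID NO: 55) and Le3-Blp, and was then cloned into the modified pSKFΔψ to replace its counterpart. The final fragment, recovered from the G-deleted pSKFΔψ by cutting with the SnaB1 and Blp1 restriction enzymes, was recloned into the pcDNA3.1/Neo(+) plasmid to form the pTMFΔg construct for virus recovery.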
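Site-directed mutagenesis of the glycoprotein gene:
Site-directed mutagenesis to introduce a three-nucleotide change from AGA to GAG at amino acid position 333 of the glycoprotein was performed as previously described (Wu et al., J. Virol. 76:4153-4161, 2002). The primers in the mutagenesis reaction were M5G: CTCACTACAAGTCAGTCGAGACTTGGAATGAGATC (SEQ ID NO: 56, carrying the three mutated nucleotides) and M3G: GACTGACTTTGAGTGAGCATCGGCTTCCATCAAGG (SEQ ID NO: 57). For the recovered strain (ERAg3), the three-nucleotide change from AGA to GAG at amino acid position 333 (aa333) was confirmed by sequencing after RT-PCR with primers 5G and 3G. After confirmation by DNA sequencing, the mutated G was cloned back into the pTMF plasmid to make the pTMFg3 construct for virus recovery. The glycoprotein encoded by this mutated G gene is shown in SEQ ID NO: 58.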
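The three-nucleotide change can be located by aligning the M5G primer against the corresponding wild-type stretch of the G gene. In this sketch, `WT_WINDOW` is the 35-nucleotide window of SEQ ID NO: 1 that the primer covers, and the two-entry codon table uses the standard genetic code; both the window selection and the helper names are illustrative assumptions rather than part of the described protocol.

```python
# Wild-type window of the G gene from SEQ ID NO: 1 and the M5G primer
# (SEQ ID NO: 56), written 5' to 3'.
WT_WINDOW = "CTCACTACAAGTCAGTCAGAACTTGGAATGAGATC"
M5G = "CTCACTACAAGTCAGTCGAGACTTGGAATGAGATC"

# Only the two codons involved, per the standard genetic code.
CODON_TABLE = {"AGA": "Arg", "GAG": "Glu"}


def mismatches(a: str, b: str) -> list:
    """0-based positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return [i for i, (x, y) in enumerate(zip(a, b)) if x != y]


def describe_change(wild_type: str, mutant: str) -> str:
    """Summarize a contiguous substitution as 'old(aa) -> new(aa)'."""
    diff = mismatches(wild_type, mutant)
    start, end = diff[0], diff[-1] + 1
    old, new = wild_type[start:end], mutant[start:end]
    return f"{old} ({CODON_TABLE[old]}) -> {new} ({CODON_TABLE[new]})"


if __name__ == "__main__":
    print("mismatch positions:", mismatches(WT_WINDOW, M5G))
    print("change:", describe_change(WT_WINDOW, M5G))
```

The flanking 17 and 15 matched nucleotides on either side of the changed codon are what allow the primer to anneal to the wild-type template during mutagenesis.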
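Integration of exogenous ORFs into the ERA rabies virus genome
To express exogenous ORFs in RV, an extra transcription unit with Pst1 and Kpn1 recognition sites was created and integrated at the Psi-region or at the P-M intergenic region, respectively. Briefly, to create the extra transcription unit at the Psi-region, the same steps were followed, except that for the 5′ Δψ fragment amplification step the 3Δψ primer was changed to 3Δψcis: CCAAACTGCAGCGAAAGGAGGGGTGTTAGTTTTTTTCATGATGAACCCCCCAAGGGGAGG (SEQ ID NO: 59). The final construct, without the Psi-region but with the extra transcription unit, was designated pMTFΔψcis. GFP, ERA G, or G mutated at amino acid residue 333 was cloned into this unit to form the pMTFgfp1, pMTF2g, pMTFg3, and pMTF2g3 constructs, respectively, for virus rescue.
To integrate an extra transcription unit into the P-M intergenic region, the cisp5 fragment was amplified with pMTF as the template and primers cis55: GACTCACTATAGGGAGACCCAAGCTGGCTAGCTGTTAAG (SEQ ID NO: 60) and cis53: CCAAACTGCAGCGAAAGGAGGGGTGTTAGTTTTTTTCATGTTGACTTTAGGACATCTCGG (SEQ ID NO: 61), and cloned to replace its counterpart in the pMTF plasmid. The cisp3 fragment was amplified and cloned in a similar way with primers cis35: CCTTTCGCTGCAGTTTGGTACCGTCGAGAAAAAAACAGGCAACACCACTGATAAAATGAAC (SEQ ID NO: 62) and cis33: CCTCCCCTTCAAGAGGGCCCCTGGAATCAG (SEQ ID NO: 63). After the cisp5 and cisp3 fragments were assembled together, the final construct was designated pMTFcisp and was used to accept ORFs. The recombinant construct containing the GFP gene was designated pTMFgfp2 for virus recovery.
To produce the ERA derivative designated ERAgm, in which the order of the glycoprotein and matrix protein coding sequences is reversed, the glycoprotein gene was deleted as described above. The G gene (amplified as disclosed above) was then inserted between the P and M genes to yield a rabies virus genome in the order N-P-G-M-L. Similarly, the same strategy was used to produce the ERAg3m derivative, in which the glycoprotein carries the three-nucleotide mutation (from AGA to GAG) at amino acid residue 333, by substituting the G gene produced by the site-directed mutagenesis described above. To produce the ERAgmg construct, an extra copy of the glycoprotein gene was inserted between the P and M genes to yield a rabies virus genome in the order N-P-G-M-G-L.
An extra transcription unit was modified and integrated into two different regions of the ERA genome, namely the psi-region and the P-M intergenic region. When heterologous ORFs were integrated into these transcription units, designated trans 1 and trans 2 respectively, the encoded products were efficiently produced.
The sequence of the transcription unit is: CTAACACCCCTCCTTTCGCTGCAGTTTGGTACCGTCGAGAAAAAAA (SEQ ID NO: 64, containing Pst1 and Kpn1 sites).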
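Example 6: Recovery of parental and derivative viruses
This example describes recovery of the parental ERA virus and exemplary derivatives with the reverse genetics system disclosed herein. BSR cells at nearly 80% confluence in 6-well plates were transfected, according to the protocol recommended by the manufacturer, with TransIT-LT1 reagent (Mirus, Madison, WI) and 3 μg/well of the viral full-length transcription plasmid pTMF (or pTMFΔψ, pTMFg3, pTMF2g, pTMF2g3, pTMFgfp1, pTMFgfp2, pTMFΔg, pTMFgm, or pTMFgmg, respectively) together with 5 helper plasmids: pTN (1 μg/well), pMP (0.5 μg/well), pML (0.5 μg/well), pMG (0.5 μg/well), and pNLST7 (1 μg/well). Four days after transfection, 1 ml of fresh BSR cell suspension (approximately 5×10^5 cells) was added to each well. The cells were incubated at 37°C, 5% CO2 for 3 days. Cell supernatants were collected for virus titration.
To titrate the recovered virus, monolayers of BSR cells in LAB-TEK 8-well plates (Naperville, IL) were infected with 10-fold serial dilutions of the virus supernatant and incubated at 37°C, 0.5% CO2 for 48 hours. The cells were fixed in 80% chilled acetone at room temperature for 1 hour and stained with FITC-labeled anti-rabies virus N monoclonal antibody at 37°C for 30 minutes. After three rinses of the plates with PBS, stained foci were counted by direct fluorescence microscopy. Details of the direct RV fluorescent assay (DFA) can be found on the World Wide Web at cdc.gov/ncidod/dvrd/rabies/professional/publications/DFA-diagnosis/DFA_protocol.htm.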
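A focus count from a 10-fold dilution series is usually converted to a titer in focus-forming units per millilitre (ffu/ml) by dividing by the dilution and the inoculum volume. The text does not give the inoculum volume or the counting rule used, so the numbers below are purely hypothetical and the function is a generic sketch of the arithmetic, not the assay protocol.

```python
def titer_ffu_per_ml(foci: int, dilution_exponent: int, inoculum_ml: float) -> float:
    """Titer from one counted well of a 10-fold dilution series.

    foci              -- stained foci counted in the well
    dilution_exponent -- n for a 10^-n dilution of the supernatant
    inoculum_ml       -- volume of diluted virus added to the well (assumed)
    """
    if foci < 0 or dilution_exponent < 0:
        raise ValueError("foci and dilution_exponent must be non-negative")
    if inoculum_ml <= 0:
        raise ValueError("inoculum_ml must be positive")
    # Undo the dilution, then scale from the inoculum volume to 1 ml.
    return foci * 10 ** dilution_exponent / inoculum_ml


if __name__ == "__main__":
    # Hypothetical reading: 50 foci at the 10^-5 dilution, 0.5 ml inoculum.
    print(f"{titer_ffu_per_ml(50, 5, 0.5):.1e} ffu/ml")
```

Wells with too few or too many foci to count reliably would normally be excluded before applying this formula; that selection step is left out of the sketch.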
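As shown in Table 3, all viruses except ERA-G were recovered at high titer from cultured BSR cells. Surprisingly, rearrangement and switching of the G gene with the M gene did not hinder recovery of the recombinant derivative ERA viruses. Rearrangement of the G gene in the RV genome was previously considered unfeasible, because overexpression of G protein causes cell death (Faber et al., J. Virol. 76:3374-3381, 2002). These results demonstrate, however, that rearrangement is possible in the ERA strain. RV gene shuffling may therefore be possible not only for the G gene but also for other genes.
After plasmid transfection following the same procedure used to rescue the other viral constructs, ERA-G (without G) virus was recovered, but after the first round of transfection the viral foci were very limited and restricted to local areas. Even after incubation at 37°C, 5% CO2 for 1 week, the rescued virus could not spread further to neighboring healthy BSR cells (FIG. 4A). Healthy BSR cells infected with the above transfection supernatant showed single-cell staining in the DFA test, which indicates that the recovered virus was unable to spread. To amplify ERA-G virus, a BHK cell line constitutively expressing ERA G (designated BHK-G) was established as described in Example 4. A pool of G-expressing BHK cells was selected by indirect fluorescence assay screening and maintained for amplification of ERA-G virus (FIG. 4B). With the aid of the BHK-G cell line, ERA-G virus grew to 10^7 ffu/ml. Total RNA was extracted from BHK-G cells infected with ERA-G virus and analyzed by Northern blot with a G gene probe (FIG. 4C). The G gene was absent from the viral genomic RNA, but G mRNA, which came from the infected supporting BHK-G cells, was detected. In purified ERA-G viral genomic RNA, no hybridization signal was detected with the G probe, indicating deletion of the G gene from the ERA genome.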
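Example 7: Growth of rescued ERA virus and its derivatives to high titer in a bioreactor
In oral vaccine development, high virus titers are typically required to elicit reliable immunity after administration. This example demonstrates that ERA virus and its derivatives can be grown to high titers in bioreactors at volumes suitable for commercial scale-up. All 10 rescued ERA viruses were amplified in a CELLine AD1000 bioreactor (IBS Integra Bioscience, Chur, Switzerland) to titers ranging from 10^7 to 10^10 ffu/ml. Briefly, BSR cells were transfected with the exemplary antigenome transcription vectors and helper vectors as described above. Cells were inoculated at a concentration of 10^6 cells/ml, at a multiplicity of infection of 1 virion per cell, in one tenth of the bioreactor vessel volume. The transfected cells were grown at 37°C, 5% CO2 in DMEM supplemented with 10% fetal bovine serum. Supernatant was harvested every 3 to 5 days, for 2 to 3 harvests. The defective ERA-G grew less well than the other viruses, reaching only 10^8 ffu/ml (Table 3 and FIG. 5).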
Table 3. Full-length plasmid constructs and corresponding rescued viruses
Plasmid construct | Rescued virus | Titer from cultured cells (ffu/ml) | Titer in bioreactor (ffu/ml) |
pTMF | rERA | 5×10^7 | 3×10^10 |
pTMFΔψ | ERA- | 6.3×10^7 | 3.2×10^10 |
pTMFg3 | ERAg3 | 3×10^6 | 1.8×10^9 |
pTMFgfp1 | ERAgreen1 | 3.5×10^6 | 5.6×10^9 |
pTMFgfp2 | ERAgreen2 | 2×10^7 | 6.2×10^9 |
pTMF2g | ERA2g | 1.6×10^6 | 3.9×10^9 |
pTMF2g3 | ERA2g3 | 8×10^7 | 4.6×10^9 |
pTMFΔg | ERA-G | 1.2×10^2 | 1.5×10^7 |
pTMFgm | ERAgm | 5.31×10^6 | 1.9×10^9 |
pTMFgmg | ERAgmg | 3.1×10^6 | 1.2×10^9 |
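The gain from moving each virus into the bioreactor can be read off the table as a fold increase. The sketch below simply transcribes the two titer columns and computes the ratio and its base-10 logarithm. The function names are hypothetical, and no statistical claim is implied, since the table reports single values per virus.

```python
import math

# (titer from cultured cells, titer in bioreactor), both in ffu/ml,
# transcribed from the table of rescued viruses above.
TITERS = {
    "rERA": (5e7, 3e10),
    "ERA-": (6.3e7, 3.2e10),
    "ERAg3": (3e6, 1.8e9),
    "ERAgreen1": (3.5e6, 5.6e9),
    "ERAgreen2": (2e7, 6.2e9),
    "ERA2g": (1.6e6, 3.9e9),
    "ERA2g3": (8e7, 4.6e9),
    "ERA-G": (1.2e2, 1.5e7),
    "ERAgm": (5.31e6, 1.9e9),
    "ERAgmg": (3.1e6, 1.2e9),
}


def fold_increase(virus: str) -> float:
    """Bioreactor titer divided by cultured-cell titer."""
    culture, bioreactor = TITERS[virus]
    return bioreactor / culture


def log10_gain(virus: str) -> float:
    """Same ratio expressed in log10 units."""
    return math.log10(fold_increase(virus))


def lowest_bioreactor_titer() -> str:
    """Name of the virus with the lowest bioreactor titer."""
    return min(TITERS, key=lambda v: TITERS[v][1])


if __name__ == "__main__":
    for virus in TITERS:
        print(f"{virus}: {fold_increase(virus):.3g}-fold, "
              f"{log10_gain(virus):.2f} log10")
    print("lowest bioreactor titer:", lowest_bioreactor_titer())
```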
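Example 8: Expression of exogenous proteins from an extra transcription unit in rabies virus
This example demonstrates expression of recombinant proteins from heterologous ORFs inserted into a rabies virus vector. In this example, the ERA virus vector serves as a prototype rabies virus vector. To construct ERA virus as a vector for accepting ORFs, the conserved RV transcription unit between the N and P genes was modified and introduced into the ERA genome at two different locations: 1) at the psi-region (trans 1), and 2) at the P-M intergenic region (trans 2). The transcription unit was designed with two unique restriction enzyme recognition sites to facilitate introduction of heterologous polynucleotide sequences: (TTTTTTTGATTGTGGGGAGGAAAGCGACGTCAAACCATGGCAGCTCTTTTTTT; SEQ ID NO: 65, containing Pst1 and Kpn1 sites).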
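The transcription unit is quoted in two forms in this document, SEQ ID NO: 64 and SEQ ID NO: 65. A quick check suggests they describe the same unit on opposite strands: the base-by-base complement of SEQ ID NO: 64 appears inside SEQ ID NO: 65. The sketch below performs that check; the helper name is hypothetical, and the interpretation (antigenome sense versus genome sense written left to right) is an inference from the sequences, not a statement in the text.

```python
# Transcription-unit sequences as quoted in Examples 5 and 8.
SEQ_ID_64 = "CTAACACCCCTCCTTTCGCTGCAGTTTGGTACCGTCGAGAAAAAAA"
SEQ_ID_65 = "TTTTTTTGATTGTGGGGAGGAAAGCGACGTCAAACCATGGCAGCTCTTTTTTT"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(seq: str) -> str:
    """Base-by-base complement, same left-to-right order."""
    return seq.upper().translate(_COMPLEMENT)


if __name__ == "__main__":
    other_strand = complement(SEQ_ID_64)
    offset = SEQ_ID_65.find(other_strand)  # -1 if not contained
    print("complement of SEQ ID NO: 64:", other_strand)
    print("found in SEQ ID NO: 65 at 0-based offset:", offset)
    # On this strand the Pst1 (CTGCAG) and Kpn1 (GGTACC) sites read as
    # their complements, GACGTC and CCATGG.
    print("GACGTC present:", "GACGTC" in SEQ_ID_65)
    print("CCATGG present:", "CCATGG" in SEQ_ID_65)
```

This also explains why a direct search of SEQ ID NO: 65 for CTGCAG or GGTACC finds nothing, even though the unit carries both sites.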
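In a first example, the GFP gene was cloned into this unit for virus recovery, because GFP expression can be observed directly under a UV microscope while the transfected BSR cells are still incubating. Expression of GFP protein was directly visible by fluorescence microscopy with an excitation filter of 470±20 nm. ERAgreen2-infected cells (GFP gene inserted after the P gene in the RV genome, trans 2) showed clear green foci 3 days after plasmid transfection, whereas ERAgreen1 (GFP gene inserted after the G gene in the "traditional" Ψ-region, trans 1) did not show obvious green foci until 5 days after transfection (FIG. 6). The introduced transcription unit was functional at both locations in the RV genome, although expression and accumulation were evidently faster when GFP was expressed from trans 2. These results therefore also indicate that the expression level of a heterologous ORF can be modulated by choosing the transcription unit into which the ORF is cloned.
In other examples, 1) an extra copy of ERA G, or 2) an extra copy of ERA G with an amino acid substitution at position 333, was integrated into the ERA viral genome. The successfully rescued viruses were designated ERA2g and ERA2g3, respectively. Because quantifying viral G expression was not practical, the relative increase in G expression levels in ERA2g- and ERA2g3-infected cells was confirmed by Northern blot with a G probe. Briefly, the ERA G gene probe was labeled with the Dig DNA labeling kit (Roche, Indianapolis, IN), imaged with the Dig nucleic acid detection kit (Roche, Indianapolis, IN), and measured by density spectrophotometry (FIG. 7). The tandemly linked G genes in the recovered viruses were also confirmed by RT-PCR with the 5G and 3G primers. A major band representing a single G copy was observed at 1.5 kb. In addition, a second, weaker band was observed at approximately 3.0 kb, indicating two Gs arranged in tandem.
These results demonstrate that introduction of transcription units into the ERA genome can be used to express a variety of heterologous proteins from introduced ORFs. Moreover, expression of the protein encoded by a heterologous ORF is modulated by the position at which the ORF is inserted. ERA virus is therefore a broadly applicable vector for recombinant protein expression.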
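Example 9: In vivo immune response to engineered viruses
This example demonstrates the in vivo effects of vaccination with engineered ERA virus and exemplary derivatives. All animal care and experimental procedures followed the CDC Institutional Animal Care and Use Guidelines. Eighty 3-week-old mice were divided into 8 groups of 10 and given recovered virus intramuscularly (i.m.) (10^6 ffu of virus per mouse). Ten healthy mice were kept as uninfected mock controls. For the ERA and ERAg3 constructs, another group of ten 3-week-old mice received an additional intracerebral (i.c.) injection of the same dose of virus. In 2-day-old suckling mice, only intracerebral inoculation with the same dose of ERAg3 and ERA-G virus was performed. Animals were checked daily for illness. All animals were euthanized by CO2 intoxication, and their brains were removed for rabies virus diagnosis. Ten days after infection, blood was collected by the retro-orbital route, and sera were obtained for neutralizing antibody analysis by the standard rapid fluorescent focus inhibition test (RFFIT) (Smith et al., Bulletin of the World Health Organization 48:535-541, 1973). One month after infection, surviving animals were challenged with a lethal dose of street rabies virus (dog/coyote salivary gland homogenate) (Orciari et al., Vaccine 19:4511-4518, 2001).
Mouse monoclonal antibody against rabies virus G (Mab 52311) is maintained at the CDC (Hamir et al., Vet Rec. 136, 295-296, 1995), and FITC-conjugated anti-N monoclonal antibody was purchased from Centocor (Horsham, PA). Monoclonal antibody against T7 RNA polymerase was from Novagen (Madison, WI). Goat anti-mouse IgG-FITC conjugate was purchased from Sigma-Aldrich (St. Louis, MO). Anti-rabies virus G monoclonal antibody (Mab 1-909) is maintained at the CDC, and goat anti-human IgG-FITC conjugate was purchased from Sigma-Aldrich (St. Louis, MO).
Among 3-week-old mice inoculated intramuscularly with the 8 different viral constructs, 50% of the mice inoculated with ERA (rERA) or ERA-, and 20% of the mice inoculated with ERAgreen1, showed mild neurological signs 19 days after inoculation. None of the other groups showed any signs suggestive of rabies virus infection (FIG. 8A). Sera were collected before challenge for neutralizing antibody titration. ERA2g (5.60 IU) and ERA2g3 (5.61 IU) elicited higher titers than the single-copy G virus constructs (FIG. 8E). Mice that survived 1 month after inoculation were challenged with lethal dog/coyote street virus (0.05 ml, maintained at the CDC for standard animal challenge tests). In the ERA and ERA- groups, 40 to 62% of the mice, respectively, showed mild signs of rabies and were then euthanized. All other groups survived without any signs of rabies (FIG. 8B). In the i.c. groups, 3-week-old mice survived inoculation with ERAg3 but died after injection with ERA (FIG. 8C). The ERA-G construct did not kill 2-day-old suckling mice, whereas ERAg3 was virulent enough to kill all infected suckling mice (FIG. 8D). Exemplary antibody titers are shown in Table 4.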
Table 4: Production of rabies-specific antibodies
Group | Mean titer |
ERA | 433 |
G333 | 468 |
2G | 560 |
2G333 | 561 |
-PSI | 490 |
GFP | 437 |
Ggreen | 833 |
Gminus | 136 |
Controls | <1/5 |
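These data demonstrate that all of the ERA-based viruses are capable of eliciting an immune response after inoculation. As expected, the parental ERA virus is virulent and causes substantial morbidity and mortality in infected animals. In contrast, the various exemplary derivatives elicit a protective immune response when mice are inoculated before challenge.
In addition to the pre-exposure evaluation described above, the ability of ERA virus derivatives to elicit a protective immune response after infection with virulent rabies virus was examined. Briefly, groups of hamsters (n=9 per group) were infected with one of 3 different rabies virus strains and given either recombinant vaccine (ERA-g333) or rabies immune globulin plus inactivated commercial rabies vaccine. As shown in FIGS. 9A-C, approximately 80-100% of control animals died, whereas approximately 60-100% of vaccinated animals survived. These results demonstrate that post-exposure administration of derivative rabies viruses can substantially protect against different rabies virus strains.
In view of the many possible embodiments to which the principles of the disclosed invention may be applied, it should be recognized that the illustrated embodiments are only preferred examples of the invention and should not be taken as limiting its scope. Rather, the scope of the invention is defined by the appended claims. We therefore claim as our invention all that comes within the scope and spirit of these claims.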
Sequence Listing
<110>The Government of the United States of America as represented by the Secretary of the Department of Health and Human Services, Centers for Disease Control and Prevention
<120>Rabies Virus Compositions and Methods
<130>SCT081535-00
<150>60/727,038
<151>2005-10-14
<160>65
<170>PatentIn version 3.3
<210>1
<211>11931
<212>DNA
<213>rabies virus
<220>
<221>misc_feature
<222>(1)..(58)
<223>Leader Region
<220>
<221>CDS
<222>(71)..(1420)
<223>N gene
<220>
<221>CDS
<222>(1514)..(2404)
<223>P gene
<220>
<221>CDS
<222>(2496)..(3101)
<223>M gene
<220>
<221>CDS
<222>(3317)..(4888)
<223>G gene
<220>
<221>misc_feature
<222>(4964)..(5362)
<223>Psi region
<220>
<221>CDS
<222>(5417)..(11797)
<223>L gene
<220>
<221>misc_feature
<222>(11862)..(11931)
<223>Trailer region
<400>1
acgcttaaca accagatcaa agaaaaaaca gacattgtca attgcaaagc aaaaatgtaa 60
cacccctaca atg gat gcc gac aag att gta ttc aaa gtc aat aat cag 109
Met Asp Ala Asp Lys Ile Val Phe Lys Val Asn Asn Gln
1 5 10
gtg gtc tct ttg aag cct gag att atc gtg gat caa cat gag tac aag 157
Val Val Ser Leu Lys Pro Glu Ile Ile Val Asp Gln His Glu Tyr Lys
15 20 25
tac cct gcc atc aaa gat ttg aaa aag ccc tgt ata acc cta gga aag 205
Tyr Pro Ala Ile Lys Asp Leu Lys Lys Pro Cys Ile Thr Leu Gly Lys
30 35 40 45
gct ccc gat tta aat aaa gca tac aag tca gtt ttg tca ggc atg agc 253
Ala Pro Asp Leu Asn Lys Ala Tyr Lys Ser Val Leu Ser Gly Met Ser
50 55 60
gcc gcc aaa ctt gat cct gac gat gta tgt tcc tat ttg gca gcg gca 301
Ala Ala Lys Leu Asp Pro Asp Asp Val Cys Ser Tyr Leu Ala Ala Ala
65 70 75
atg cag ttt ttt gag ggg aca tgt ccg gaa gac tgg acc agc tat gga 349
Met Gln Phe Phe Glu Gly Thr Cys Pro Glu Asp Trp Thr Ser Tyr Gly
80 85 90
atc gtg att gca cga aaa gga gat aag atc acc cca ggt tct ctg gtg 397
Ile Val Ile Ala Arg Lys Gly Asp Lys Ile Thr Pro Gly Ser Leu Val
95 100 105
gag ata aaa cgt act gat gta gaa ggg aat tgg gct ctg aca gga ggc 445
Glu Ile Lys Arg Thr Asp Val Glu Gly Asn Trp Ala Leu Thr Gly Gly
110 115 120 125
atg gaa ctg aca aga gac ccc act gtc cct gag cat gcg tcc tta gtc 493
Met Glu Leu Thr Arg Asp Pro Thr Val Pro Glu His Ala Ser Leu Val
130 135 140
ggt ctt ctc ttg agt ctg tat agg ttg agc aaa ata tcc ggg caa aac 541
Gly Leu Leu Leu Ser Leu Tyr Arg Leu Ser Lys Ile Ser Gly Gln Asn
145 150 155
act ggt aac tat aag aca aac att gca gac agg ata gag cag att ttt 589
Thr Gly Asn Tyr Lys Thr Asn Ile Ala Asp Arg Ile Glu Gln Ile Phe
160 165 170
gag aca gcc cct ttt gtt aaa atc gtg gaa cac cat act cta atg aca 637
Glu Thr Ala Pro Phe Val Lys Ile Val Glu His His Thr Leu Met Thr
175 180 185
act cac aaa atg tgt gct aat tgg agt act ata cca aac ttc aga ttt 685
Thr His Lys Met Cys Ala Asn Trp Ser Thr Ile Pro Asn Phe Arg Phe
190 195 200 205
ttg gcc gga acc tat gac atg ttt ttc tcc cgg att gag cat cta tat 733
Leu Ala Gly Thr Tyr Asp Met Phe Phe Ser Arg Ile Glu His Leu Tyr
210 215 220
tca gca atc aga gtg ggc aca gtt gtc act gct tat gaa gac tgt tca 781
Ser Ala Ile Arg Val Gly Thr Val Val Thr Ala Tyr Glu Asp Cys Ser
225 230 235
gga ctg gta tca ttt act ggg ttc ata aaa caa atc aat ctc acc gct 829
Gly Leu Val Ser Phe Thr Gly Phe Ile Lys Gln Ile Asn Leu Thr Ala
240 245 250
aga gag gca ata cta tat ttc ttc cac aag aac ttt gag gaa gag ata 877
Arg Glu Ala Ile Leu Tyr Phe Phe His Lys Asn Phe Glu Glu Glu Ile
255 260 265
aga aga atg ttt gag cca ggg cag gag aca gct gtt cct cac tct tat 925
Arg Arg Met Phe Glu Pro Gly Gln Glu Thr Ala Val Pro His Ser Tyr
270 275 280 285
ttc atc cac ttc cgt tca cta ggc ttg agt ggg aaa tct cct tat tca 973
Phe Ile His Phe Arg Ser Leu Gly Leu Ser Gly Lys Ser Pro Tyr Ser
290 295 300
tca aat gct gtt ggt cac gtg ttc aat ctc att cac ttt gta gga tgc 1021
Ser Asn Ala Val Gly His Val Phe Asn Leu Ile His Phe Val Gly Cys
305 310 315
tat atg ggt caa gtc aga tcc cta aat gca acg gtt att gct gca tgt 1069
Tyr Met Gly Gln Val Arg Ser Leu Asn Ala Thr Val Ile Ala Ala Cys
320 325 330
gct cct cat gaa atg tct gtt cta ggg ggc tat ctg gga gag gaa ttc 1117
Ala Pro His Glu Met Ser Val Leu Gly Gly Tyr Leu Gly Glu Glu Phe
335 340 345
ttc ggg aaa ggg aca ttt gaa aga aga ttc ttc aga gat gag aaa gaa 1165
Phe Gly Lys Gly Thr Phe Glu Arg Arg Phe Phe Arg Asp Glu Lys Glu
350 355 360 365
ctt caa gaa tac gag gcg gct gaa ctg aca aag act gac gta gca ctg 1213
Leu Gln Glu Tyr Glu Ala Ala Glu Leu Thr Lys Thr Asp Val Ala Leu
370 375 380
gca gat gat gga act gtc aac tct gac gac gag gac tac ttc tca ggt 1261
Ala Asp Asp Gly Thr Val Asn Ser Asp Asp Glu Asp Tyr Phe Ser Gly
385 390 395
gaa acc aga agt ccg gag gct gtt tat act cga atc atg atg aat gga 1309
Glu Thr Arg Ser Pro Glu Ala Val Tyr Thr Arg Ile Met Met Asn Gly
400 405 410
ggt cga cta aag aga tct cac ata cgg aga tat gtc tca gtc agt tcc 1357
Gly Arg Leu Lys Arg Ser His Ile Arg Arg Tyr Val Ser Val Ser Ser
415 420 425
aat cat caa gcc cgt cca aac tca ttc gcc gag ttt cta aac aag aca 1405
Asn His Gln Ala Arg Pro Asn Ser Phe Ala Glu Phe Leu Asn Lys Thr
430 435 440 445
tat tcg agt gac tca taagaagttg aacaacaaaa tgccggaaat ctacggattg 1460
Tyr Ser Ser Asp Ser
450
tgtatatcca tcatgaaaaa aactaacacc cctcctttcg aaccatccca aac atg 1516
Met
agc aag atc ttt gtc aat cct agt gct att aga gcc ggt ctg gcc gat 1564
Ser Lys Ile Phe Val Asn Pro Ser Ala Ile Arg Ala Gly Leu Ala Asp
455 460 465
ctt gag atg gct gaa gaa act gtt gat ctg atc aat aga aat atc gaa 1612
Leu Glu Met Ala Glu Glu Thr Val Asp Leu Ile Asn Arg Asn Ile Glu
470 475 480
gac aat cag gct cat ctc caa ggg gaa ccc ata gaa gtg gac aat ctc 1660
Asp Asn Gln Ala His Leu Gln Gly Glu Pro Ile Glu Val Asp Asn Leu
485 490 495
cct gag gat atg ggg cga ctt cac ctg gat gat gga aaa tcg ccc aac 1708
Pro Glu Asp Met Gly Arg Leu His Leu Asp Asp Gly Lys Ser Pro Asn
500 505 510 515
cct ggt gag atg gcc aag gtg gga gaa ggc aag tat cga gag gac ttt 1756
Pro Gly Glu Met Ala Lys Val Gly Glu Gly Lys Tyr Arg Glu Asp Phe
520 525 530
cag atg gat gaa gga gag gat ctt agc ttc ctg ttc cag tca tac ctg 1804
Gln Met Asp Glu Gly Glu Asp Leu Ser Phe Leu Phe Gln Ser Tyr Leu
535 540 545
gaa aat gtt gga gtc caa ata gtc aga caa atg agg tca gga gag aga 1852
Glu Asn Val Gly Val Gln Ile Val Arg Gln Met Arg Ser Gly Glu Arg
550 555 560
ttt ctc aag ata tgg tca cag acc gta gaa gag att ata tcc tat gtc 1900
Phe Leu Lys Ile Trp Ser Gln Thr Val Glu Glu Ile Ile Ser Tyr Val
565 570 575
gcg gtc aac ttt ccc aac cct cca gga aag tct tca gag gat aaa tca 1948
Ala Val Asn Phe Pro Asn Pro Pro Gly Lys Ser Ser Glu Asp Lys Ser
580 585 590 595
acc cag act act ggc cga gag ctc aag aag gag aca aca ccc act cct 1996
Thr Gln Thr Thr Gly Arg Glu Leu Lys Lys Glu Thr Thr Pro Thr Pro
600 605 610
tct cag aga gaa agc caa tca tcg aaa gcc agg atg gcg gct caa att 2044
Ser Gln Arg Glu Ser Gln Ser Ser Lys Ala Arg Met Ala Ala Gln Ile
615 620 625
gct tct ggc cct cca gcc ctt gaa tgg tcg gcc acc aat gaa gag gat 2092
Ala Ser Gly Pro Pro Ala Leu Glu Trp Ser Ala Thr Asn Glu Glu Asp
630 635 640
gat cta tca gtg gag gct gag atc gct cac cag att gca gaa agt ttc 2140
Asp Leu Ser Val Glu Ala Glu Ile Ala His Gln Ile Ala Glu Ser Phe
645 650 655
tcc aaa aaa tat aag ttt ccc tct cga tcc tca ggg ata ctc ttg tat 2188
Ser Lys Lys Tyr Lys Phe Pro Ser Arg Ser Ser Gly Ile Leu Leu Tyr
660 665 670 675
aat ttt gag caa ttg aaa atg aac ctt gat gat ata gtt aaa gag gca 2236
Asn Phe Glu Gln Leu Lys Met Asn Leu Asp Asp Ile Val Lys Glu Ala
680 685 690
aaa aat gta cca ggt gtg acc cgt tta gcc cat gac ggg tcc aaa ctc 2284
Lys Asn Val Pro Gly Val Thr Arg Leu Ala His Asp Gly Ser Lys Leu
695 700 705
ccc cta aga tgt gta ctg gga tgg gtc gct ttg gcc aac cct aag aaa 2332
Pro Leu Arg Cys Val Leu Gly Trp Val Ala Leu Ala Asn Pro Lys Lys
710 715 720
ttc cag ttg tta gtc gaa tcc gac aag ctg agt aaa atc atg caa gat 2380
Phe Gln Leu Leu Val Glu Ser Asp Lys Leu Ser Lys Ile Met Gln Asp
725 730 735
gac ttg aat cgc tat aca tct tgc taaccgaacc tctccactca gtccctctag 2434
Asp Leu Asn Arg Tyr Thr Ser Cys
740 745
acaataaagt ccgagatgtc ctaaagtcaa catgaaaaaa acaggcaaca ccactgataa 2494
a atg aac ttt cta cgt aag ata gtg aaa aat tgc agg gac gag gac act 2543
Met Asn Phe Leu Arg Lys Ile Val Lys Asn Cys Arg Asp Glu Asp Thr
750 755 760
caa aaa ccc tct ccc gtg tca gcc cct ctg gat gac gat gac ttg tgg 2591
Gln Lys Pro Ser Pro Val Ser Ala Pro Leu Asp Asp Asp Asp Leu Trp
765 770 775
ctt cca ccc cct gaa tac gtc ccg ctg aaa gaa ctt aca agc aag aag 2639
Leu Pro Pro Pro Glu Tyr Val Pro Leu Lys Glu Leu Thr Ser Lys Lys
780 785 790 795
aac atg agg aac ttt tgt atc aac gga ggg gtt aaa gtg tgt agc ccg 2687
Asn Met Arg Asn Phe Cys Ile Asn Gly Gly Val Lys Val Cys Ser Pro
800 805 810
aat ggt tac tcg ttc agg atc ctg cgg cac att ctg aaa tca ttc gac 2735
Asn Gly Tyr Ser Phe Arg Ile Leu Arg His Ile Leu Lys Ser Phe Asp
815 820 825
gag ata tat tct ggg aat cat agg atg atc ggg tta gcc aaa gta gtt 2783
Glu Ile Tyr Ser Gly Asn His Arg Met Ile Gly Leu Ala Lys Val Val
830 835 840
att gga ctg gct ttg tca gga tct cca gtc cct gag ggc atg aac tgg 2831
Ile Gly Leu Ala Leu Ser Gly Ser Pro Val Pro Glu Gly Met Asn Trp
845 850 855
gta tac aaa ttg agg aga acc ttt atc ttc cag tgg gct gat tcc agg 2879
Val Tyr Lys Leu Arg Arg Thr Phe Ile Phe Gln Trp Ala Asp Ser Arg
860 865 870 875
ggc cct ctt gaa ggg gag gag ttg gaa tac tct cag gag atc act tgg 2927
Gly Pro Leu Glu Gly Glu Glu Leu Glu Tyr Ser Gln Glu Ile Thr Trp
880 885 890
gat gat gat act gag ttc gtc gga ttg caa ata aga gtg att gca aaa 2975
Asp Asp Asp Thr Glu Phe Val Gly Leu Gln Ile Arg Val Ile Ala Lys
895 900 905
cag tgt cat atc cag ggc aga atc tgg tgt atc aac atg aac ccg aga 3023
Gln Cys His Ile Gln Gly Arg Ile Trp Cys Ile Asn Met Asn Pro Arg
910 915 920
gca tgt caa cta tgg tct gac atg tct ctt cag aca caa agg tcc gaa 3071
Ala Cys Gln Leu Trp Ser Asp Met Ser Leu Gln Thr Gln Arg Ser Glu
925 930 935
gag gac aaa gat tcc tct ctg ctt cta gaa taatcagatt atatcccgca 3121
Glu Asp Lys Asp Ser Ser Leu Leu Leu Glu
940 945
aatttatcac ttgtttacct ctggaggaga gaacatatgg gctcaactcc aacccttggg 3181
agcaatataa caaaaaacat gttatggtgc cattaaaccg ctgcatttca tcaaagtcaa 3241
gttgattacc tttacatttt gatcctcttg gatgtgaaaa aaactattaa catccctcaa 3301
aagactcaag gaaag atg gtt cct cag gct ctc ctg ttt gta ccc ctt ctg 3352
Met Val Pro Gln Ala Leu Leu Phe Val Pro Leu Leu
950 955 960
gtt ttt cca ttg tgt ttt ggg aaa ttc cct att tac acg ata cca gac 3400
Val Phe Pro Leu Cys Phe Gly Lys Phe Pro Ile Tyr Thr Ile Pro Asp
965 970 975
aag ctt ggt ccc tgg agc ccg att gac ata cat cac ctc agc tgc cca 3448
Lys Leu Gly Pro Trp Ser Pro Ile Asp Ile His His Leu Ser Cys Pro
980 985 990
aac aat ttg gta gtg gag gac gaa gga tgc acc aac ctg tca ggg ttc 3496
Asn Asn Leu Val Val Glu Asp Glu Gly Cys Thr Asn Leu Ser Gly Phe
995 1000 1005
tcc tac atg gaa ctt aaa gtt gga tac atc tta gcc ata aaa atg 3541
Ser Tyr Met Glu Leu Lys Val Gly Tyr Ile Leu Ala Ile Lys Met
1010 1015 1020
aac ggg ttc act tgc aca ggc gtt gtg acg gag gct gaa acc tat 3586
Asn Gly Phe Thr Cys Thr Gly Val Val Thr Glu Ala Glu Thr Tyr
1025 1030 1035
act aac ttc gtt ggt tat gtc aca acc acg ttc aaa aga aag cat 3631
Thr Asn Phe Val Gly Tyr Val Thr Thr Thr Phe Lys Arg Lys His
1040 1045 1050
ttc cgc cca aca cca gat gca tgt aga gcc gcg tac aac tgg aag 3676
Phe Arg Pro Thr Pro Asp Ala Cys Arg Ala Ala Tyr Asn Trp Lys
1055 1060 1065
atg gcc ggt gac ccc aga tat gaa gag tct cta cac aat ccg tac 3721
Met Ala Gly Asp Pro Arg Tyr Glu Glu Ser Leu His Asn Pro Tyr
1070 1075 1080
cct gac tac cac tgg ctt cga act gta aaa acc acc aag gag tct 3766
Pro Asp Tyr His Trp Leu Arg Thr Val Lys Thr Thr Lys Glu Ser
1085 1090 1095
ctc gtt atc ata tct cca agt gtg gca gat ttg gac cca tat gac 3811
Leu Val Ile Ile Ser Pro Ser Val Ala Asp Leu Asp Pro Tyr Asp
1100 1105 1110
aga tcc ctt cac tcg agg gtc ttc cct agc ggg aag tgc tca gga 3856
Arg Ser Leu His Ser Arg Val Phe Pro Ser Gly Lys Cys Ser Gly
1115 1120 1125
gta gcg gtg tct tct acc tac tgc tcc act aac cac gat tac acc 3901
Val Ala Val Ser Ser Thr Tyr Cys Ser Thr Asn His Asp Tyr Thr
1130 1135 1140
att tgg atg ccc gag aat ccg aga cta ggg atg tct tgt gac att 3946
Ile Trp Met Pro Glu Asn Pro Arg Leu Gly Met Ser Cys Asp Ile
1145 1150 1155
ttt acc aat agt agg ggg aag aga gca tcc aaa ggg agt gag act 3991
Phe Thr Asn Ser Arg Gly Lys Arg Ala Ser Lys Gly Ser Glu Thr
1160 1165 1170
tgc ggc ttt gta gat gaa aga ggc cta tat aag tct tta aaa gga 4036
Cys Gly Phe Val Asp Glu Arg Gly Leu Tyr Lys Ser Leu Lys Gly
1175 1180 1185
gca tgc aaa ctc aag tta tgt gga gtt cta gga ctt aga ctt atg 4081
Ala Cys Lys Leu Lys Leu Cys Gly Val Leu Gly Leu Arg Leu Met
1190 1195 1200
gat gga aca tgg gtc gcg atg caa aca tca aat gaa acc aaa tgg 4126
Asp Gly Thr Trp Val Ala Met Gln Thr Ser Asn Glu Thr Lys Trp
1205 1210 1215
tgc ccc ccc gat cag ttg gtg aac ctg cac gac ttt cgc tca gac 4171
Cys Pro Pro Asp Gln Leu Val Asn Leu His Asp Phe Arg Ser Asp
1220 1225 1230
gaa att gag cac ctt gtt gta gag gag ttg gtc agg aag aga gag 4216
Glu Ile Glu His Leu Val Val Glu Glu Leu Val Arg Lys Arg Glu
1235 1240 1245
gag tgt ctg gat gca cta gag tcc atc atg aca acc aag tca gtg 4261
Glu Cys Leu Asp Ala Leu Glu Ser Ile Met Thr Thr Lys Ser Val
1250 1255 1260
agt ttc aga cgt ccc agt cat tta aga aaa ctt gtc cct ggg ttt 4306
Ser Phe Arg Arg Pro Ser His Leu Arg Lys Leu Val Pro Gly Phe
1265 1270 1275
gga aaa gca tat acc ata ttc aac aag acc ttg atg gaa gcc gat 4351
Gly Lys Ala Tyr Thr Ile Phe Asn Lys Thr Leu Met Glu Ala Asp
1280 1285 1290
gct cac tac aag tca gtc aga act tgg aat gag atc ctc cct tca 4396
Ala His Tyr Lys Ser Val Arg Thr Trp Asn Glu Ile Leu Pro Ser
1295 1300 1305
aaa ggg tgt tta aga gtt ggg ggg agg tgt cat cct cat gtg aac 4441
Lys Gly Cys Leu Arg Val Gly Gly Arg Cys His Pro His Val Asn
1310 1315 1320
ggg gtg ttt ttc aat ggt ata ata tta gga cct gac ggc aat gtc 4486
Gly Val Phe Phe Asn Gly Ile Ile Leu Gly Pro Asp Gly Asn Val
1325 1330 1335
tta atc cca gag atg caa tca tcc ctc ctc cag caa cat atg gag 4531
Leu Ile Pro Glu Met Gln Ser Ser Leu Leu Gln Gln His Met Glu
1340 1345 1350
ttg ttg gaa tcc tcg gtt atc ccc ctt gtg cac ccc ctg gca gac 4576
Leu Leu Glu Ser Ser Val Ile Pro Leu Val His Pro Leu Ala Asp
1355 1360 1365
ccg tct acc gtt ttc aag gac ggt gac gag gct gag gat ttt gtt 4621
Pro Ser Thr Val Phe Lys Asp Gly Asp Glu Ala Glu Asp Phe Val
1370 1375 1380
gaa gtt cac ctt ccc gat gtg cac aat cag gtc tca gga gtt gac 4666
Glu Val His Leu Pro Asp Val His Asn Gln Val Ser Gly Val Asp
1385 1390 1395
ttg ggt ctc ccg aac tgg ggg aag tat gta tta ctg agt gca ggg 4711
Leu Gly Leu Pro Asn Trp Gly Lys Tyr Val Leu Leu Ser Ala Gly
1400 1405 1410
gcc ctg act gcc ttg atg ttg ata att ttc ctg atg aca tgt tgt 4756
Ala Leu Thr Ala Leu Met Leu Ile Ile Phe Leu Met Thr Cys Cys
1415 1420 1425
aga aga gtc aat cga tca gaa cct acg caa cac aat ctc aga ggg 4801
Arg Arg Val Asn Arg Ser Glu Pro Thr Gln His Asn Leu Arg Gly
1430 1435 1440
aca ggg agg gag gtg tca gtc act ccc caa agc ggg aag atc ata 4846
Thr Gly Arg Glu Val Ser Val Thr Pro Gln Ser Gly Lys Ile Ile
1445 1450 1455
tct tca tgg gaa tca cac aag agt ggg ggt gag acc aga ctg 4888
Ser Ser Trp Glu Ser His Lys Ser Gly Gly Glu Thr Arg Leu
1460 1465 1470
tgaggactgg ccgtcctttc aacgatccaa gtcctgaaga tcacctcccc ttggggggtt 4948
ctttttgaaa aaaaacctgg gttcaatagt cctcctcgaa ctccatgcaa ctgggtagat 5008
tcaagagtca tgagattttc attaatcctc tcagttgatc aagcaagatc atgtagattc 5068
tcataatagg ggagatcttc tagcagtttc agtgactaac ggtactttca ttctccagga 5128
actgacacca acagttgtag acaaaccacg gggtgtctcg ggtgactctg tgcttgggca 5188
cagacaaagg tcatggtgtg ttccatgata gcggactcag gatgagttaa ttgagagagg 5248
cagtcttcct cccgtgaagg acataagcag tagctcacaa tcatcccgcg tctcagcaaa 5308
gtgtgcataa ttataaagtg ctgggtcatc taagcttttc agtcgagaaa aaaacattag 5368
atcagaagaa caactggcaa cacttctcaa cctgagacct acttcaag atg ctc gat 5425
Met Leu Asp
1475
cct gga gag gtc tat gat gac cct att gac cca atc gag tta gag 5470
Pro Gly Glu Val Tyr Asp Asp Pro Ile Asp Pro Ile Glu Leu Glu
1480 1485 1490
gat gaa ccc aga gga acc ccc act gtc ccc aac atc ttg agg aac 5515
Asp Glu Pro Arg Gly Thr Pro Thr Val Pro Asn Ile Leu Arg Asn
1495 1500 1505
tct gac tac aat ctc aac tct cct ttg ata gaa gat cct gct aga 5560
Ser Asp Tyr Asn Leu Asn Ser Pro Leu Ile Glu Asp Pro Ala Arg
1510 1515 1520
cta atg tta gaa tgg tta aaa aca ggg aat aga cct tat cgg atg 5605
Leu Met Leu Glu Trp Leu Lys Thr Gly Asn Arg Pro Tyr Arg Met
1525 1530 1535
act cta aca gac aat tgc tcc agg tct ttc aga gtt ttg aaa gat 5650
Thr Leu Thr Asp Asn Cys Ser Arg Ser Phe Arg Val Leu Lys Asp
1540 1545 1550
tat ttc aag aag gta gat ttg ggt tct ctc aag gtg ggc gga atg 5695
Tyr Phe Lys Lys Val Asp Leu Gly Ser Leu Lys Val Gly Gly Met
1555 1560 1565
gct gca cag tca atg att tct ctc tgg tta tat ggt gcc cac tct 5740
Ala Ala Gln Ser Met Ile Ser Leu Trp Leu Tyr Gly Ala His Ser
1570 1575 1580
gaa tcc aac agg agc cgg aga tgt ata aca gac ttg gcc cat ttc 5785
Glu Ser Asn Arg Ser Arg Arg Cys Ile Thr Asp Leu Ala His Phe
1585 1590 1595
tat tcc aag tcg tcc ccc ata gag aag ctg ttg aat ctc acg cta 5830
Tyr Ser Lys Ser Ser Pro Ile Glu Lys Leu Leu Asn Leu Thr Leu
1600 1605 1610
gga aat aga ggg ctg aga atc ccc cca gag gga gtg tta agt tgc 5875
Gly Asn Arg Gly Leu Arg Ile Pro Pro Glu Gly Val Leu Ser Cys
1615 1620 1625
ctt gag agg gtt gat tat gat aat gca ttt gga agg tat ctt gcc 5920
Leu Glu Arg Val Asp Tyr Asp Asn Ala Phe Gly Arg Tyr Leu Ala
1630 1635 1640
aac acg tat tcc tct tac ttg ttc ttc cat gta atc acc tta tac 5965
Asn Thr Tyr Ser Ser Tyr Leu Phe Phe His Val Ile Thr Leu Tyr
1645 1650 1655
atg aac gcc cta gac tgg gat gaa gaa aag acc atc cta gca tta 6010
Met Asn Ala Leu Asp Trp Asp Glu Glu Lys Thr Ile Leu Ala Leu
1660 1665 1670
tgg aaa gat tta acc tca gtg gac atc ggg aag gac ttg gta aag 6055
Trp Lys Asp Leu Thr Ser Val Asp Ile Gly Lys Asp Leu Val Lys
1675 1680 1685
ttc aaa gac caa ata tgg gga ctg ccg atc gtg aca aag gac ttt 6100
Phe Lys Asp Gln Ile Trp Gly Leu Pro Ile Val Thr Lys Asp Phe
1690 1695 1700
gtt tac tcc caa agt tcc aat tgt ctt ttt gac aga aac tac aca 6145
Val Tyr Ser Gln Ser Ser Asn Cys Leu Phe Asp Arg Asn Tyr Thr
1705 1710 1715
ctt atg cta aaa gaa ctt ttc ttg tct cgc ttc aac tcc tta atg 6190
Leu Met Leu Lys Glu Leu Phe Leu Ser Arg Phe Asn Ser Leu Met
1720 1725 1730
gtc ttg ctc tct ccc cca gag ccc cga tac tca gat gac ttg ata 6235
Val Leu Leu Ser Pro Pro Glu Pro Arg Tyr Ser Asp Asp Leu Ile
1735 1740 1745
tct caa cta tgc cag ctg tac att gct ggg gat caa gtc ttg tct 6280
Ser Gln Leu Cys Gln Leu Tyr Ile Ala Gly Asp Gln Val Leu Ser
1750 1755 1760
atg tgt gga aac tcc ggc tat gaa gtc atc aaa ata ttg gag cca 6325
Met Cys Gly Asn Ser Gly Tyr Glu Val Ile Lys Ile Leu Glu Pro
1765 1770 1775
tat gtc gtg aat agt tta gtc cag aga gca gaa aag ttt agg cct 6370
Tyr Val Val Asn Ser Leu Val Gln Arg Ala Glu Lys Phe Arg Pro
1780 1785 1790
ctc att cat tcc ttg gga gac ttt cct gta ttt ata aaa gac aag 6415
Leu Ile His Ser Leu Gly Asp Phe Pro Val Phe Ile Lys Asp Lys
1795 1800 1805
gta agt caa ctt gaa gag acg ttc ggt ccc tgt gca aga agg ttc 6460
Val Ser Gln Leu Glu Glu Thr Phe Gly Pro Cys Ala Arg Arg Phe
1810 1815 1820
ttt agg gct ctg gat caa ttc gac aac ata cat gac ttg gtt ttt 6505
Phe Arg Ala Leu Asp Gln Phe Asp Asn Ile His Asp Leu Val Phe
1825 1830 1835
gtg tat ggc tgt tac agg cat tgg ggg cac cca tat ata gat tat 6550
Val Tyr Gly Cys Tyr Arg His Trp Gly His Pro Tyr Ile Asp Tyr
1840 1845 1850
cga aag ggt ctg tca aaa cta tat gat cag gtt cac att aaa aaa 6595
Arg Lys Gly Leu Ser Lys Leu Tyr Asp Gln Val His Ile Lys Lys
1855 1860 1865
gtg ata gat aag tcc tac cag gag tgc tta gca agc gac cta gcc 6640
Val Ile Asp Lys Ser Tyr Gln Glu Cys Leu Ala Ser Asp Leu Ala
1870 1875 1880
agg agg atc ctt aga tgg ggt ttt gat aag tac tcc aag tgg tat 6685
Arg Arg Ile Leu Arg Trp Gly Phe Asp Lys Tyr Ser Lys Trp Tyr
1885 1890 1895
ctg gat tca aga ttc cta gcc cga gac cac ccc ttg act ccc tat 6730
Leu Asp Ser Arg Phe Leu Ala Arg Asp His Pro Leu Thr Pro Tyr
1900 1905 1910
atc aaa acc caa aca tgg cca ccc aaa cat att gta gac ttg gtg 6775
Ile Lys Thr Gln Thr Trp Pro Pro Lys His Ile Val Asp Leu Val
1915 1920 1925
ggg gat aca tgg cac aag ctc ccg atc acg cag atc ttt gag att 6820
Gly Asp Thr Trp His Lys Leu Pro Ile Thr Gln Ile Phe Glu Ile
1930 1935 1940
cct gaa tca atg gat ccg tca gaa ata ttg gat gac aaa tca cat 6865
Pro Glu Ser Met Asp Pro Ser Glu Ile Leu Asp Asp Lys Ser His
1945 1950 1955
tct ttc acc aga acg aga cta gct tct tgg ctg tca gaa aac cga 6910
Ser Phe Thr Arg Thr Arg Leu Ala Ser Trp Leu Ser Glu Asn Arg
1960 1965 1970
ggg gga cct gtt cct agc gaa aaa gtt att atc acg gcc ctg tct 6955
Gly Gly Pro Val Pro Ser Glu Lys Val Ile Ile Thr Ala Leu Ser
1975 1980 1985
aag ccg cct gtc aat ccc cga gag ttt ctg agg tct ata gac ctc 7000
Lys Pro Pro Val Asn Pro Arg Glu Phe Leu Arg Ser Ile Asp Leu
1990 1995 2000
gga gga ttg cca gat gaa gac ttg ata att ggc ctc aag cca aag 7045
Gly Gly Leu Pro Asp Glu Asp Leu Ile Ile Gly Leu Lys Pro Lys
2005 2010 2015
gaa cgg gaa ttg aag att gaa ggt cga ttc ttt gct cta atg tca 7090
Glu Arg Glu Leu Lys Ile Glu Gly Arg Phe Phe Ala Leu Met Ser
2020 2025 2030
tgg aat cta aga ttg tat ttt gtc atc act gaa aaa ctc ttg gcc 7135
Trp Asn Leu Arg Leu Tyr Phe Val Ile Thr Glu Lys Leu Leu Ala
2035 2040 2045
aac tac atc ttg cca ctt ttt gac gcg ctg act atg aca gac aac 7180
Asn Tyr Ile Leu Pro Leu Phe Asp Ala Leu Thr Met Thr Asp Asn
2050 2055 2060
ctg aac aag gtg ttt aaa aag ctg atc gac agg gtc acc ggg caa 7225
Leu Asn Lys Val Phe Lys Lys Leu Ile Asp Arg Val Thr Gly Gln
2065 2070 2075
ggg ctt ttg gac tat tca agg gtc aca tat gca ttt cac ctg gac 7270
Gly Leu Leu Asp Tyr Ser Arg Val Thr Tyr Ala Phe His Leu Asp
2080 2085 2090
tat gaa aag tgg aac aac cat caa aga tta gag tca aca gag gat 7315
Tyr Glu Lys Trp Asn Asn His Gln Arg Leu Glu Ser Thr Glu Asp
2095 2100 2105
gta ttt tct gtc cta gat caa gtg ttt gga ttg aag aga gtg ttt 7360
Val Phe Ser Val Leu Asp Gln Val Phe Gly Leu Lys Arg Val Phe
2110 2115 2120
tct aga aca cac gag ttt ttt caa aag gcc tgg atc tat tat tca 7405
Ser Arg Thr His Glu Phe Phe Gln Lys Ala Trp Ile Tyr Tyr Ser
2125 2130 2135
gac aga tca gac ctc atc ggg tta cgg gag gat caa ata tac tgc 7450
Asp Arg Ser Asp Leu Ile Gly Leu Arg Glu Asp Gln Ile Tyr Cys
2140 2145 2150
tta gat gcg tcc aac ggc cca acc tgt tgg aat ggc cag gat ggc 7495
Leu Asp Ala Ser Asn Gly Pro Thr Cys Trp Asn Gly Gln Asp Gly
2155 2160 2165
ggg cta gaa ggc tta cgg cag aag ggc tgg agt cta gtc agc tta 7540
Gly Leu Glu Gly Leu Arg Gln Lys Gly Trp Ser Leu Val Ser Leu
2170 2175 2180
ttg atg ata gat aga gaa tct caa atc agg aac aca aga acc aaa 7585
Leu Met Ile Asp Arg Glu Ser Gln Ile Arg Asn Thr Arg Thr Lys
2185 2190 2195
ata cta gct caa gga gac aac cag gtt tta tgt ccg aca tat atg 7630
Ile Leu Ala Gln Gly Asp Asn Gln Val Leu Cys Pro Thr Tyr Met
2200 2205 2210
ttg tcg cca ggg cta tct caa gag ggg ctc ctc tat gaa ttg gag 7675
Leu Ser Pro Gly Leu Ser Gln Glu Gly Leu Leu Tyr Glu Leu Glu
2215 2220 2225
aga ata tca agg aat gca ctt tcg ata tac aga gcc gtc gag gaa 7720
Arg Ile Ser Arg Asn Ala Leu Ser Ile Tyr Arg Ala Val Glu Glu
2230 2235 2240
ggg gca tct aag cta ggg ctg atc acc aag aaa gaa gag acc atg 7765
Gly Ala Ser Lys Leu Gly Leu Ile Thr Lys Lys Glu Glu Thr Met
2245 2250 2255
tgt agt tat gac ttc ctc atc tat gga aaa acc cct ttg ttt aga 7810
Cys Ser Tyr Asp Phe Leu Ile Tyr Gly Lys Thr Pro Leu Phe Arg
2260 2265 2270
ggt aac ata ttg gtg cct gag tcc aaa aga tgg gcc aga gtc tct 7855
Gly Asn Ile Leu Val Pro Glu Ser Lys Arg Trp Ala Arg Val Ser
2275 2280 2285
tgc gtc tct aat gac caa ata gtc aac ctc gcc aat ata atg tcg 7900
Cys Val Ser Asn Asp Gln Ile Val Asn Leu Ala Asn Ile Met Ser
2290 2295 2300
aca gtg tcc acc aat gcg cta aca gtg gca caa cac tct caa tct 7945
Thr Val Ser Thr Asn Ala Leu Thr Val Ala Gln His Ser Gln Ser
2305 2310 2315
ttg atc aaa ccg atg ggg gat ttt ctg ctc atg tca gta cag gca 7990
Leu Ile Lys Pro Met Gly Asp Phe Leu Leu Met Ser Val Gln Ala
2320 2325 2330
gtc ttt cac tac ctg cta ttt agc cca atc tta aag gga aga gtt 8035
Val Phe His Tyr Leu Leu Phe Ser Pro Ile Leu Lys Gly Arg Val
2335 2340 2345
tac aag att ctg agc gct gaa ggg gat agc ttt ctc cta gcc atg 8080
Tyr Lys Ile Leu Ser Ala Glu Gly Asp Ser Phe Leu Leu Ala Met
2350 2355 2360
tca agg ata atc tat cta gat cct tct ttg gga ggg gta tct gga 8125
Ser Arg Ile Ile Tyr Leu Asp Pro Ser Leu Gly Gly Val Ser Gly
2365 2370 2375
atg tcc ctc gga aga ttc cat ata cga cag ttc tca gac cct gtc 8170
Met Ser Leu Gly Arg Phe His Ile Arg Gln Phe Ser Asp Pro Val
2380 2385 2390
tct gaa ggg tta tcc ttc tgg aga gag atc tgg tta agc tcc cac 8215
Ser Glu Gly Leu Ser Phe Trp Arg Glu Ile Trp Leu Ser Ser His
2395 2400 2405
gag tcc tgg gtt cac gcg ttg tgt caa gag gct gga aac cca gat 8260
Glu Ser Trp Val His Ala Leu Cys Gln Glu Ala Gly Asn Pro Asp
2410 2415 2420
ctt gga gag aga aca ctc gag agc ttc act cgc ctt cta gaa gat 8305
Leu Gly Glu Arg Thr Leu Glu Ser Phe Thr Arg Leu Leu Glu Asp
2425 2430 2435
cct acc acc tta aat atc aga gga ggg gcc agt cct acc att cta 8350
Pro Thr Thr Leu Asn Ile Arg Gly Gly Ala Ser Pro Thr Ile Leu
2440 2445 2450
ctc aag gat gca atc aga aag gct tta tat gac gag gtg gac aag 8395
Leu Lys Asp Ala Ile Arg Lys Ala Leu Tyr Asp Glu Val Asp Lys
2455 2460 2465
gtg gag aat tca gag ttt cga gag gca atc ctg ttg tcc aag acc 8440
Val Glu Asn Ser Glu Phe Arg Glu Ala Ile Leu Leu Ser Lys Thr
2470 2475 2480
cat aga gat aat ttt ata ctc ttc tta aca tct gtt gag cct ctg 8485
His Arg Asp Asn Phe Ile Leu Phe Leu Thr Ser Val Glu Pro Leu
2485 2490 2495
ttt cct cga ttt ctc agt gag cta ttc agt tcg tct ttt ttg gga 8530
Phe Pro Arg Phe Leu Ser Glu Leu Phe Ser Ser Ser Phe Leu Gly
2500 2505 2510
atc ccc gag tca atc att gga ttg ata caa aac tcc cga acg ata 8575
Ile Pro Glu Ser Ile Ile Gly Leu Ile Gln Asn Ser Arg Thr Ile
2515 2520 2525
aga agg cag ttt aga aag agt ctc tca aaa act tta gaa gaa tcc 8620
Arg Arg Gln Phe Arg Lys Ser Leu Ser Lys Thr Leu Glu Glu Ser
2530 2535 2540
ttc tac aac tca gag atc cac ggg att agt cgg atg acc cag aca 8665
Phe Tyr Asn Ser Glu Ile His Gly Ile Ser Arg Met Thr Gln Thr
2545 2550 2555
cct cag agg gtt ggg ggg gtg tgg cct tgc tct tca gag agg gca 8710
Pro Gln Arg Val Gly Gly Val Trp Pro Cys Ser Ser Glu Arg Ala
2560 2565 2570
gat cta ctt agg gag atc tct tgg gga aga aaa gtg gta ggc acg 8755
Asp Leu Leu Arg Glu Ile Ser Trp Gly Arg Lys Val Val Gly Thr
2575 2580 2585
aca gtt cct cac cct tct gag atg ttg ggg tta ctt ccc aag tcc 8800
Thr Val Pro His Pro Ser Glu Met Leu Gly Leu Leu Pro Lys Ser
2590 2595 2600
tct att tct tgc act tgt gga gca aca gga gga ggc aat cct aga 8845
Ser Ile Ser Cys Thr Cys Gly Ala Thr Gly Gly Gly Asn Pro Arg
2605 2610 2615
gtt tct gta tca gta ctc ccg tcc ttt gat cag tca ttt ttt tca 8890
Val Ser Val Ser Val Leu Pro Ser Phe Asp Gln Ser Phe Phe Ser
2620 2625 2630
cga ggc ccc cta aag ggg tac ttg ggc tcg tcc acc tct atg tcg 8935
Arg Gly Pro Leu Lys Gly Tyr Leu Gly Ser Ser Thr Ser Met Ser
2635 2640 2645
acc cag cta ttc cat gca tgg gaa aaa gtc act aat gtt cat gtg 8980
Thr Gln Leu Phe His Ala Trp Glu Lys Val Thr Asn Val His Val
2650 2655 2660
gtg aag aga gct cta tcg tta aaa gaa tct ata aac tgg ttc att 9025
Val Lys Arg Ala Leu Ser Leu Lys Glu Ser Ile Asn Trp Phe Ile
2665 2670 2675
act aga gat tcc aac ttg gct caa gct cta att agg aac att atg 9070
Thr Arg Asp Ser Asn Leu Ala Gln Ala Leu Ile Arg Asn Ile Met
2680 2685 2690
tct ctg aca ggc cct gat ttc cct cta gag gag gcc cct gtc ttc 9115
Ser Leu Thr Gly Pro Asp Phe Pro Leu Glu Glu Ala Pro Val Phe
2695 2700 2705
aaa agg acg ggg tca gcc ttg cat agg ttc aag tct gcc aga tac 9160
Lys Arg Thr Gly Ser Ala Leu His Arg Phe Lys Ser Ala Arg Tyr
2710 2715 2720
agc gaa gga ggg tat tct tct gtc tgc ccg aac ctc ctc tct cat 9205
Ser Glu Gly Gly Tyr Ser Ser Val Cys Pro Asn Leu Leu Ser His
2725 2730 2735
att tct gtt agt aca gac acc atg tct gat ttg acc caa gac ggg 9250
Ile Ser Val Ser Thr Asp Thr Met Ser Asp Leu Thr Gln Asp Gly
2740 2745 2750
aag aac tac gat ttc atg ttc cag cca ttg atg ctt tat gca cag 9295
Lys Asn Tyr Asp Phe Met Phe Gln Pro Leu Met Leu Tyr Ala Gln
2755 2760 2765
aca tgg aca tca gag ctg gta cag aga gac aca agg cta aga gac 9340
Thr Trp Thr Ser Glu Leu Val Gln Arg Asp Thr Arg Leu Arg Asp
2770 2775 2780
tct acg ttt cat tgg cac ctc cga tgc aac agg tgt gtg aga ccc 9385
Ser Thr Phe His Trp His Leu Arg Cys Asn Arg Cys Val Arg Pro
2785 2790 2795
att gac gac gtg acc ctg gag acc tct cag atc ttc gag ttt ccg 9430
Ile Asp Asp Val Thr Leu Glu Thr Ser Gln Ile Phe Glu Phe Pro
2800 2805 2810
gat gtg tcg aaa aga ata tcc aga atg gtt tct ggg gct gtg cct 9475
Asp Val Ser Lys Arg Ile Ser Arg Met Val Ser Gly Ala Val Pro
2815 2820 2825
cac ttc cag agg ctt ccc gat atc cgt ctg aga cca gga gat ttt 9520
His Phe Gln Arg Leu Pro Asp Ile Arg Leu Arg Pro Gly Asp Phe
2830 2835 2840
gaa tct cta agc ggt aga gaa aag tct cac cat atc gga tca gct 9565
Glu Ser Leu Ser Gly Arg Glu Lys Ser His His Ile Gly Ser Ala
2845 2850 2855
cag ggg ctc tta tac tca atc tta gtg gca att cac gac tca gga 9610
Gln Gly Leu Leu Tyr Ser Ile Leu Val Ala Ile His Asp Ser Gly
2860 2865 2870
tac aat gat gga acc atc ttc cct gcc aac ata tac ggc aag gtt 9655
Tyr Asn Asp Gly Thr Ile Phe Pro Ala Asn Ile Tyr Gly Lys Val
2875 2880 2885
tcc cct aga gac tat ttg aga ggg ctc gca agg gga gta ttg ata 9700
Ser Pro Arg Asp Tyr Leu Arg Gly Leu Ala Arg Gly Val Leu Ile
2890 2895 2900
gga tcc tcg att tgc ttc ttg aca aga atg aca aat atc aat att 9745
Gly Ser Ser Ile Cys Phe Leu Thr Arg Met Thr Asn Ile Asn Ile
2905 2910 2915
aat aga cct ctt gaa ttg atc tca ggg gta atc tca tat att ctc 9790
Asn Arg Pro Leu Glu Leu Ile Ser Gly Val Ile Ser Tyr Ile Leu
2920 2925 2930
ctg agg cta gat aac cat ccc tcc ttg tac ata atg ctc aga gaa 9835
Leu Arg Leu Asp Asn His Pro Ser Leu Tyr Ile Met Leu Arg Glu
2935 2940 2945
ccg tct ctt aga gga gag ata ttt tct atc cct cag aaa atc ccc 9880
Pro Ser Leu Arg Gly Glu Ile Phe Ser Ile Pro Gln Lys Ile Pro
2950 2955 2960
gcc gct tat cca acc act atg aaa gaa ggc aac aga tca atc ttg 9925
Ala Ala Tyr Pro Thr Thr Met Lys Glu Gly Asn Arg Ser Ile Leu
2965 2970 2975
tgt tat ctc caa cat gtg cta cgc tat gag cga gag ata atc acg 9970
Cys Tyr Leu Gln His Val Leu Arg Tyr Glu Arg Glu Ile Ile Thr
2980 2985 2990
gcg tct cca gag aat gac tgg cta tgg atc ttt tca gac ttt aga 10015
Ala Ser Pro Glu Asn Asp Trp Leu Trp Ile Phe Ser Asp Phe Arg
2995 3000 3005
agt gcc aaa atg acg tac cta acc ctc att act tac cag tct cat 10060
Ser Ala Lys Met Thr Tyr Leu Thr Leu Ile Thr Tyr Gln Ser His
3010 3015 3020
ctt cta ctc cag agg gtt gag aga aac cta tct aag agt atg aga 10105
Leu Leu Leu Gln Arg Val Glu Arg Asn Leu Ser Lys Ser Met Arg
3025 3030 3035
gat aac ctg cga caa ttg agt tcc ttg atg agg cag gtg ctg ggc 10150
Asp Asn Leu Arg Gln Leu Ser Ser Leu Met Arg Gln Val Leu Gly
3040 3045 3050
ggg cac gga gaa gat acc tta gag tca gac gac aac att caa cga 10195
Gly His Gly Glu Asp Thr Leu Glu Ser Asp Asp Asn Ile Gln Arg
3055 3060 3065
ctg cta aaa gac tct tta cga agg aca aga tgg gtg gat caa gag 10240
Leu Leu Lys Asp Ser Leu Arg Arg Thr Arg Trp Val Asp Gln Glu
3070 3075 3080
gtg cgc cat gca gct aga acc atg act gga gat tac agc ccc aac 10285
Val Arg His Ala Ala Arg Thr Met Thr Gly Asp Tyr Ser Pro Asn
3085 3090 3095
aag aag gtg tcc cgt aag gta gga tgt tca gaa tgg gtc tgc tct 10330
Lys Lys Val Ser Arg Lys Val Gly Cys Ser Glu Trp Val Cys Ser
3100 3105 3110
gct caa cag gtt gca gtc tct acc tca gca aac ccg gcc cct gtc 10375
Ala Gln Gln Val Ala Val Ser Thr Ser Ala Asn Pro Ala Pro Val
3115 3120 3125
tcg gag ctt gac ata agg gcc ctc tct aag agg ttc cag aac cct 10420
Ser Glu Leu Asp Ile Arg Ala Leu Ser Lys Arg Phe Gln Asn Pro
3130 3135 3140
ttg atc tcg ggc ttg aga gtg gtt cag tgg gca acc ggt gct cat 10465
Leu Ile Ser Gly Leu Arg Val Val Gln Trp Ala Thr Gly Ala His
3145 3150 3155
tat aag ctt aag cct att cta gat gat ctc aat gtt ttc ccc tct 10510
Tyr Lys Leu Lys Pro Ile Leu Asp Asp Leu Asn Val Phe Pro Ser
3160 3165 3170
ctc tgc ctt gta gtt ggg gac ggg tca ggg ggg ata tca agg gca 10555
Leu Cys Leu Val Val Gly Asp Gly Ser Gly Gly Ile Ser Arg Ala
3175 3180 3185
gtc ctc aac atg ttt cca gat gcc aag ctt gtg ttc aac agt ctc 10600
Val Leu Asn Met Phe Pro Asp Ala Lys Leu Val Phe Asn Ser Leu
3190 3195 3200
tta gag gtg aat gac ctg atg gct tcc gga aca cat cca ctg cct 10645
Leu Glu Val Asn Asp Leu Met Ala Ser Gly Thr His Pro Leu Pro
3205 3210 3215
cct tca gca atc atg agg gga gga aat ggt atc gtc tcc aga gtg 10690
Pro Ser Ala Ile Met Arg Gly Gly Asn Gly Ile Val Ser Arg Val
3220 3225 3230
ata gat ttt gac tca atc tgg gaa aaa ccg tcc gac ttg aga aac 10735
Ile Asp Phe Asp Ser Ile Trp Glu Lys Pro Ser Asp Leu Arg Asn
3235 3240 3245
ttg gca acc tgg aaa tac ttc cag tca gtc caa aag cag gtc aac 10780
Leu Ala Thr Trp Lys Tyr Phe Gln Ser Val Gln Lys Gln Val Asn
3250 3255 3260
atg tcc tat gac ctc att att tgc gat gca gaa gtt act gac att 10825
Met Ser Tyr Asp Leu Ile Ile Cys Asp Ala Glu Val Thr Asp Ile
3265 3270 3275
gca tct atc aac cgg ata acc ctg tta atg tcc gat ttt gca ttg 10870
Ala Ser Ile Asn Arg Ile Thr Leu Leu Met Ser Asp Phe Ala Leu
3280 3285 3290
tct ata gat gga cca ctc tat ttg gtc ttc aaa act tat ggg act 10915
Ser Ile Asp Gly Pro Leu Tyr Leu Val Phe Lys Thr Tyr Gly Thr
3295 3300 3305
atg cta gta aat cca aac tac aag gct att caa cac ctg tca aga 10960
Met Leu Val Asn Pro Asn Tyr Lys Ala Ile Gln His Leu Ser Arg
3310 3315 3320
gcg ttc ccc tcg gtc aca ggg ttt atc acc caa gta act tcg tct 11005
Ala Phe Pro Ser Val Thr Gly Phe Ile Thr Gln Val Thr Ser Ser
3325 3330 3335
ttt tca tct gag ctc tac ctc cga ttc tcc aaa cga ggg aag ctt 11050
Phe Ser Ser Glu Leu Tyr Leu Arg Phe Ser Lys Arg Gly Lys Leu
3340 3345 3350
ttc aga gat gct gag tac ttg acc tct tcc acc ctt cga gaa atg 11095
Phe Arg Asp Ala Glu Tyr Leu Thr Ser Ser Thr Leu Arg Glu Met
3355 3360 3365
agc ctt gtg tta ttc aat tgt agc agc ccc aag agt gag atg cag 11140
Ser Leu Val Leu Phe Asn Cys Ser Ser Pro Lys Ser Glu Met Gln
3370 3375 3380
aga gct cgt tcc ttg aac tat cag gat ctt gtg aga gga ttt cct 11185
Arg Ala Arg Ser Leu Asn Tyr Gln Asp Leu Val Arg Gly Phe Pro
3385 3390 3395
gaa gaa atc ata tca aat cct tac aat gag atg atc ata act ctg 11230
Glu Glu Ile Ile Ser Asn Pro Tyr Asn Glu Met Ile Ile Thr Leu
3400 3405 3410
att gac agt gat gta gaa tct ttt cta gtc cac aag atg gtt gat 11275
Ile Asp Ser Asp Val Glu Ser Phe Leu Val His Lys Met Val Asp
3415 3420 3425
gat ctt gag tta cag agg gga act ctg tct aaa gtg gct atc att 11320
Asp Leu Glu Leu Gln Arg Gly Thr Leu Ser Lys Val Ala Ile Ile
3430 3435 3440
ata gcc atc atg ata gtt ttc tcc aac aga gtc ttc aac gtt tcc 11365
Ile Ala Ile Met Ile Val Phe Ser Asn Arg Val Phe Asn Val Ser
3445 3450 3455
aaa ccc cta act gac ccc ttg ttc tat cca ccg tct gat ccc aaa 11410
Lys Pro Leu Thr Asp Pro Leu Phe Tyr Pro Pro Ser Asp Pro Lys
3460 3465 3470
atc ctg agg cac ttc aac ata tgt cgc agt act atg atg tat cta 11455
Ile Leu Arg His Phe Asn Ile Cys Arg Ser Thr Met Met Tyr Leu
3475 3480 3485
tct act gct tta ggt gac gtc cct agc ttc gca aga ctt cac gac 11500
Ser Thr Ala Leu Gly Asp Val Pro Ser Phe Ala Arg Leu His Asp
3490 3495 3500
ctg tat aac aga cct ata act tat tac ttc aga aag caa ttc att 11545
Leu Tyr Asn Arg Pro Ile Thr Tyr Tyr Phe Arg Lys Gln Phe Ile
3505 3510 3515
cga ggg aac gtt tat cta tct tgg agt tgg tcc aac gac acc tca 11590
Arg Gly Asn Val Tyr Leu Ser Trp Ser Trp Ser Asn Asp Thr Ser
3520 3525 3530
gtg ttc aaa agg gta gcc tgt aat tct agc ctg agt ctg tca tct 11635
Val Phe Lys Arg Val Ala Cys Asn Ser Ser Leu Ser Leu Ser Ser
3535 3540 3545
cac tgg atc agg ttg att tac aag ata gtg aag gct acc aga ctc 11680
His Trp Ile Arg Leu Ile Tyr Lys Ile Val Lys Ala Thr Arg Leu
3550 3555 3560
gtt ggc agc atc aag gat cta tcc aga gaa gtg gaa aga cac ctt 11725
Val Gly Ser Ile Lys Asp Leu Ser Arg Glu Val Glu Arg His Leu
3565 3570 3575
cat agg tac aac agg tgg atc acc cta gag gat atc aga tct aga 11770
His Arg Tyr Asn Arg Trp Ile Thr Leu Glu Asp Ile Arg Ser Arg
3580 3585 3590
tca tcc cta cta gac tac agt tgc ctg tgaaccggat actcctggaa 11817
Ser Ser Leu Leu Asp Tyr Ser Cys Leu
3595 3600
gcctgcccat gctaagactc ttgtgtgatg tatcttgaaa aaaacaagat cctaaatctg 11877
aacctttggt tgtttgattg tttttctcat ttttgttgtt tatttgttaa gcgt 11931
<210>2
<211>450
<212>PRT
<213>rabies virus
<400>2
Met Asp Ala Asp Lys Ile Val Phe Lys Val Asn Asn Gln Val Val Ser
1 5 10 15
Leu Lys Pro Glu Ile Ile Val Asp Gln His Glu Tyr Lys Tyr Pro Ala
20 25 30
Ile Lys Asp Leu Lys Lys Pro Cys Ile Thr Leu Gly Lys Ala Pro Asp
35 40 45
Leu Asn Lys Ala Tyr Lys Ser Val Leu Ser Gly Met Ser Ala Ala Lys
50 55 60
Leu Asp Pro Asp Asp Val Cys Ser Tyr Leu Ala Ala Ala Met Gln Phe
65 70 75 80
Phe Glu Gly Thr Cys Pro Glu Asp Trp Thr Ser Tyr Gly Ile Val Ile
85 90 95
Ala Arg Lys Gly Asp Lys Ile Thr Pro Gly Ser Leu Val Glu Ile Lys
100 105 110
Arg Thr Asp Val Glu Gly Asn Trp Ala Leu Thr Gly Gly Met Glu Leu
115 120 125
Thr Arg Asp Pro Thr Val Pro Glu His Ala Ser Leu Val Gly Leu Leu
130 135 140
Leu Ser Leu Tyr Arg Leu Ser Lys Ile Ser Gly Gln Asn Thr Gly Asn
145 150 155 160
Tyr Lys Thr Asn Ile Ala Asp Arg Ile Glu Gln Ile Phe Glu Thr Ala
165 170 175
Pro Phe Val Lys Ile Val Glu His His Thr Leu Met Thr Thr His Lys
180 185 190
Met Cys Ala Asn Trp Ser Thr Ile Pro Asn Phe Arg Phe Leu Ala Gly
195 200 205
Thr Tyr Asp Met Phe Phe Ser Arg Ile Glu His Leu Tyr Ser Ala Ile
210 215 220
Arg Val Gly Thr Val Val Thr Ala Tyr Glu Asp Cys Ser Gly Leu Val
225 230 235 240
Ser Phe Thr Gly Phe Ile Lys Gln Ile Asn Leu Thr Ala Arg Glu Ala
245 250 255
Ile Leu Tyr Phe Phe His Lys Asn Phe Glu Glu Glu Ile Arg Arg Met
260 265 270
Phe Glu Pro Gly Gln Glu Thr Ala Val Pro His Ser Tyr Phe Ile His
275 280 285
Phe Arg Ser Leu Gly Leu Ser Gly Lys Ser Pro Tyr Ser Ser Asn Ala
290 295 300
Val Gly His Val Phe Asn Leu Ile His Phe Val Gly Cys Tyr Met Gly
305 310 315 320
Gln Val Arg Ser Leu Asn Ala Thr Val Ile Ala Ala Cys Ala Pro His
325 330 335
Glu Met Ser Val Leu Gly Gly Tyr Leu Gly Glu Glu Phe Phe Gly Lys
340 345 350
Gly Thr Phe Glu Arg Arg Phe Phe Arg Asp Glu Lys Glu Leu Gln Glu
355 360 365
Tyr Glu Ala Ala Glu Leu Thr Lys Thr Asp Val Ala Leu Ala Asp Asp
370 375 380
Gly Thr Val Asn Ser Asp Asp Glu Asp Tyr Phe Ser Gly Glu Thr Arg
385 390 395 400
Ser Pro Glu Ala Val Tyr Thr Arg Ile Met Met Asn Gly Gly Arg Leu
405 410 415
Lys Arg Ser His Ile Arg Arg Tyr Val Ser Val Ser Ser Asn His Gln
420 425 430
Ala Arg Pro Asn Ser Phe Ala Glu Phe Leu Asn Lys Thr Tyr Ser Ser
435 440 445
Asp Ser
450
<210>3
<211>297
<212>PRT
<213>rabies virus
<400>3
Met Ser Lys Ile Phe Val Asn Pro Ser Ala Ile Arg Ala Gly Leu Ala
1 5 10 15
Asp Leu Glu Met Ala Glu Glu Thr Val Asp Leu Ile Asn Arg Asn Ile
20 25 30
Glu Asp Asn Gln Ala His Leu Gln Gly Glu Pro Ile Glu Val Asp Asn
35 40 45
Leu Pro Glu Asp Met Gly Arg Leu His Leu Asp Asp Gly Lys Ser Pro
50 55 60
Asn Pro Gly Glu Met Ala Lys Val Gly Glu Gly Lys Tyr Arg Glu Asp
65 70 75 80
Phe Gln Met Asp Glu Gly Glu Asp Leu Ser Phe Leu Phe Gln Ser Tyr
85 90 95
Leu Glu Asn Val Gly Val Gln Ile Val Arg Gln Met Arg Ser Gly Glu
100 105 110
Arg Phe Leu Lys Ile Trp Ser Gln Thr Val Glu Glu Ile Ile Ser Tyr
115 120 125
Val Ala Val Asn Phe Pro Asn Pro Pro Gly Lys Ser Ser Glu Asp Lys
130 135 140
Ser Thr Gln Thr Thr Gly Arg Glu Leu Lys Lys Glu Thr Thr Pro Thr
145 150 155 160
Pro Ser Gln Arg Glu Ser Gln Ser Ser Lys Ala Arg Met Ala Ala Gln
165 170 175
Ile Ala Ser Gly Pro Pro Ala Leu Glu Trp Ser Ala Thr Asn Glu Glu
180 185 190
Asp Asp Leu Ser Val Glu Ala Glu Ile Ala His Gln Ile Ala Glu Ser
195 200 205
Phe Ser Lys Lys Tyr Lys Phe Pro Ser Arg Ser Ser Gly Ile Leu Leu
210 215 220
Tyr Asn Phe Glu Gln Leu Lys Met Asn Leu Asp Asp Ile Val Lys Glu
225 230 235 240
Ala Lys Asn Val Pro Gly Val Thr Arg Leu Ala His Asp Gly Ser Lys
245 250 255
Leu Pro Leu Arg Cys Val Leu Gly Trp Val Ala Leu Ala Asn Pro Lys
260 265 270
Lys Phe Gln Leu Leu Val Glu Ser Asp Lys Leu Ser Lys Ile Met Gln
275 280 285
Asp Asp Leu Asn Arg Tyr Thr Ser Cys
290 295
<210>4
<211>202
<212>PRT
<213>rabies virus
<400>4
Met Asn Phe Leu Arg Lys Ile Val Lys Asn Cys Arg Asp Glu Asp Thr
1 5 10 15
Gln Lys Pro Ser Pro Val Ser Ala Pro Leu Asp Asp Asp Asp Leu Trp
20 25 30
Leu Pro Pro Pro Glu Tyr Val Pro Leu Lys Glu Leu Thr Ser Lys Lys
35 40 45
Asn Met Arg Asn Phe Cys Ile Asn Gly Gly Val Lys Val Cys Ser Pro
50 55 60
Asn Gly Tyr Ser Phe Arg Ile Leu Arg His Ile Leu Lys Ser Phe Asp
65 70 75 80
Glu Ile Tyr Ser Gly Asn His Arg Met Ile Gly Leu Ala Lys Val Val
85 90 95
Ile Gly Leu Ala Leu Ser Gly Ser Pro Val Pro Glu Gly Met Asn Trp
100 105 110
Val Tyr Lys Leu Arg Arg Thr Phe Ile Phe Gln Trp Ala Asp Ser Arg
115 120 125
Gly Pro Leu Glu Gly Glu Glu Leu Glu Tyr Ser Gln Glu Ile Thr Trp
130 135 140
Asp Asp Asp Thr Glu Phe Val Gly Leu Gln Ile Arg Val Ile Ala Lys
145 150 155 160
Gln Cys His Ile Gln Gly Arg Ile Trp Cys Ile Asn Met Asn Pro Arg
165 170 175
Ala Cys Gln Leu Trp Ser Asp Met Ser Leu Gln Thr Gln Arg Ser Glu
180 185 190
Glu Asp Lys Asp Ser Ser Leu Leu Leu Glu
195 200
<210>5
<211>524
<212>PRT
<213>rabies virus
<400>5
Met Val Pro Gln Ala Leu Leu Phe Val Pro Leu Leu Val Phe Pro Leu
1 5 10 15
Cys Phe Gly Lys Phe Pro Ile Tyr Thr Ile Pro Asp Lys Leu Gly Pro
20 25 30
Trp Ser Pro Ile Asp Ile His His Leu Ser Cys Pro Asn Asn Leu Val
35 40 45
Val Glu Asp Glu Gly Cys Thr Asn Leu Ser Gly Phe Ser Tyr Met Glu
50 55 60
Leu Lys Val Gly Tyr Ile Leu Ala Ile Lys Met Asn Gly Phe Thr Cys
65 70 75 80
Thr Gly Val Val Thr Glu Ala Glu Thr Tyr Thr Asn Phe Val Gly Tyr
85 90 95
Val Thr Thr Thr Phe Lys Arg Lys His Phe Arg Pro Thr Pro Asp Ala
100 105 110
Cys Arg Ala Ala Tyr Asn Trp Lys Met Ala Gly Asp Pro Arg Tyr Glu
115 120 125
Glu Ser Leu His Asn Pro Tyr Pro Asp Tyr His Trp Leu Arg Thr Val
130 135 140
Lys Thr Thr Lys Glu Ser Leu Val Ile Ile Ser Pro Ser Val Ala Asp
145 150 155 160
Leu Asp Pro Tyr Asp Arg Ser Leu His Ser Arg Val Phe Pro Ser Gly
165 170 175
Lys Cys Ser Gly Val Ala Val Ser Ser Thr Tyr Cys Ser Thr Asn His
180 185 190
Asp Tyr Thr Ile Trp Met Pro Glu Asn Pro Arg Leu Gly Met Ser Cys
195 200 205
Asp Ile Phe Thr Asn Ser Arg Gly Lys Arg Ala Ser Lys Gly Ser Glu
210 215 220
Thr Cys Gly Phe Val Asp Glu Arg Gly Leu Tyr Lys Ser Leu Lys Gly
225 230 235 240
Ala Cys Lys Leu Lys Leu Cys Gly Val Leu Gly Leu Arg Leu Met Asp
245 250 255
Gly Thr Trp Val Ala Met Gln Thr Ser Asn Glu Thr Lys Trp Cys Pro
260 265 270
Pro Asp Gln Leu Val Asn Leu His Asp Phe Arg Ser Asp Glu Ile Glu
275 280 285
His Leu Val Val Glu Glu Leu Val Arg Lys Arg Glu Glu Cys Leu Asp
290 295 300
Ala Leu Glu Ser Ile Met Thr Thr Lys Ser Val Ser Phe Arg Arg Pro
305 310 315 320
Ser His Leu Arg Lys Leu Val Pro Gly Phe Gly Lys Ala Tyr Thr Ile
325 330 335
Phe Asn Lys Thr Leu Met Glu Ala Asp Ala His Tyr Lys Ser Val Arg
340 345 350
Thr Trp Asn Glu Ile Leu Pro Ser Lys Gly Cys Leu Arg Val Gly Gly
355 360 365
Arg Cys His Pro His Val Asn Gly Val Phe Phe Asn Gly Ile Ile Leu
370 375 380
Gly Pro Asp Gly Asn Val Leu Ile Pro Glu Met Gln Ser Ser Leu Leu
385 390 395 400
Gln Gln His Met Glu Leu Leu Glu Ser Ser Val Ile Pro Leu Val His
405 410 415
Pro Leu Ala Asp Pro Ser Thr Val Phe Lys Asp Gly Asp Glu Ala Glu
420 425 430
Asp Phe Val Glu Val His Leu Pro Asp Val His Asn Gln Val Ser Gly
435 440 445
Val Asp Leu Gly Leu Pro Asn Trp Gly Lys Tyr Val Leu Leu Ser Ala
450 455 460
Gly Ala Leu Thr Ala Leu Met Leu Ile Ile Phe Leu Met Thr Cys Cys
465 470 475 480
Arg Arg Val Asn Arg Ser Glu Pro Thr Gln His Asn Leu Arg Gly Thr
485 490 495
Gly Arg Glu Val Ser Val Thr Pro Gln Ser Gly Lys Ile Ile Ser Ser
500 505 510
Trp Glu Ser His Lys Ser Gly Gly Glu Thr Arg Leu
515 520
<210>6
<211>2127
<212>PRT
<213>rabies virus
<400>6
Met Leu Asp Pro Gly Glu Val Tyr Asp Asp Pro Ile Asp Pro Ile Glu
1 5 10 15
Leu Glu Asp Glu Pro Arg Gly Thr Pro Thr Val Pro Asn Ile Leu Arg
20 25 30
Asn Ser Asp Tyr Asn Leu Asn Ser Pro Leu Ile Glu Asp Pro Ala Arg
35 40 45
Leu Met Leu Glu Trp Leu Lys Thr Gly Asn Arg Pro Tyr Arg Met Thr
50 55 60
Leu Thr Asp Asn Cys Ser Arg Ser Phe Arg Val Leu Lys Asp Tyr Phe
65 70 75 80
Lys Lys Val Asp Leu Gly Ser Leu Lys Val Gly Gly Met Ala Ala Gln
85 90 95
Ser Met Ile Ser Leu Trp Leu Tyr Gly Ala His Ser Glu Ser Asn Arg
100 105 110
Ser Arg Arg Cys Ile Thr Asp Leu Ala His Phe Tyr Ser Lys Ser Ser
115 120 125
Pro Ile Glu Lys Leu Leu Asn Leu Thr Leu Gly Asn Arg Gly Leu Arg
130 135 140
Ile Pro Pro Glu Gly Val Leu Ser Cys Leu Glu Arg Val Asp Tyr Asp
145 150 155 160
Asn Ala Phe Gly Arg Tyr Leu Ala Asn Thr Tyr Ser Ser Tyr Leu Phe
165 170 175
Phe His Val Ile Thr Leu Tyr Met Asn Ala Leu Asp Trp Asp Glu Glu
180 185 190
Lys Thr Ile Leu Ala Leu Trp Lys Asp Leu Thr Ser Val Asp Ile Gly
195 200 205
Lys Asp Leu Val Lys Phe Lys Asp Gln Ile Trp Gly Leu Pro Ile Val
210 215 220
Thr Lys Asp Phe Val Tyr Ser Gln Ser Ser Asn Cys Leu Phe Asp Arg
225 230 235 240
Asn Tyr Thr Leu Met Leu Lys Glu Leu Phe Leu Ser Arg Phe Asn Ser
245 250 255
Leu Met Val Leu Leu Ser Pro Pro Glu Pro Arg Tyr Ser Asp Asp Leu
260 265 270
Ile Ser Gln Leu Cys Gln Leu Tyr Ile Ala Gly Asp Gln Val Leu Ser
275 280 285
Met Cys Gly Asn Ser Gly Tyr Glu Val Ile Lys Ile Leu Glu Pro Tyr
290 295 300
Val Val Asn Ser Leu Val Gln Arg Ala Glu Lys Phe Arg Pro Leu Ile
305 310 315 320
His Ser Leu Gly Asp Phe Pro Val Phe Ile Lys Asp Lys Val Ser Gln
325 330 335
Leu Glu Glu Thr Phe Gly Pro Cys Ala Arg Arg Phe Phe Arg Ala Leu
340 345 350
Asp Gln Phe Asp Asn Ile His Asp Leu Val Phe Val Tyr Gly Cys Tyr
355 360 365
Arg His Trp Gly His Pro Tyr Ile Asp Tyr Arg Lys Gly Leu Ser Lys
370 375 380
Leu Tyr Asp Gln Val His Ile Lys Lys Val Ile Asp Lys Ser Tyr Gln
385 390 395 400
Glu Cys Leu Ala Ser Asp Leu Ala Arg Arg Ile Leu Arg Trp Gly Phe
405 410 415
Asp Lys Tyr Ser Lys Trp Tyr Leu Asp Ser Arg Phe Leu Ala Arg Asp
420 425 430
His Pro Leu Thr Pro Tyr Ile Lys Thr Gln Thr Trp Pro Pro Lys His
435 440 445
Ile Val Asp Leu Val Gly Asp Thr Trp His Lys Leu Pro Ile Thr Gln
450 455 460
Ile Phe Glu Ile Pro Glu Ser Met Asp Pro Ser Glu Ile Leu Asp Asp
465 470 475 480
Lys Ser His Ser Phe Thr Arg Thr Arg Leu Ala Ser Trp Leu Ser Glu
485 490 495
Asn Arg Gly Gly Pro Val Pro Ser Glu Lys Val Ile Ile Thr Ala Leu
500 505 510
Ser Lys Pro Pro Val Asn Pro Arg Glu Phe Leu Arg Ser Ile Asp Leu
515 520 525
Gly Gly Leu Pro Asp Glu Asp Leu Ile Ile Gly Leu Lys Pro Lys Glu
530 535 540
Arg Glu Leu Lys Ile Glu Gly Arg Phe Phe Ala Leu Met Ser Trp Asn
545 550 555 560
Leu Arg Leu Tyr Phe Val Ile Thr Glu Lys Leu Leu Ala Asn Tyr Ile
565 570 575
Leu Pro Leu Phe Asp Ala Leu Thr Met Thr Asp Asn Leu Asn Lys Val
580 585 590
Phe Lys Lys Leu Ile Asp Arg Val Thr Gly Gln Gly Leu Leu Asp Tyr
595 600 605
Ser Arg Val Thr Tyr Ala Phe His Leu Asp Tyr Glu Lys Trp Asn Asn
610 615 620
His Gln Arg Leu Glu Ser Thr Glu Asp Val Phe Ser Val Leu Asp Gln
625 630 635 640
Val Phe Gly Leu Lys Arg Val Phe Ser Arg Thr His Glu Phe Phe Gln
645 650 655
Lys Ala Trp Ile Tyr Tyr Ser Asp Arg Ser Asp Leu Ile Gly Leu Arg
660 665 670
Glu Asp Gln Ile Tyr Cys Leu Asp Ala Ser Asn Gly Pro Thr Cys Trp
675 680 685
Asn Gly Gln Asp Gly Gly Leu Glu Gly Leu Arg Gln Lys Gly Trp Ser
690 695 700
Leu Val Ser Leu Leu Met Ile Asp Arg Glu Ser Gln Ile Arg Asn Thr
705 710 715 720
Arg Thr Lys Ile Leu Ala Gln Gly Asp Asn Gln Val Leu Cys Pro Thr
725 730 735
Tyr Met Leu Ser Pro Gly Leu Ser Gln Glu Gly Leu Leu Tyr Glu Leu
740 745 750
Glu Arg Ile Ser Arg Asn Ala Leu Ser Ile Tyr Arg Ala Val Glu Glu
755 760 765
Gly Ala Ser Lys Leu Gly Leu Ile Thr Lys Lys Glu Glu Thr Met Cys
770 775 780
Ser Tyr Asp Phe Leu Ile Tyr Gly Lys Thr Pro Leu Phe Arg Gly Asn
785 790 795 800
Ile Leu Val Pro Glu Ser Lys Arg Trp Ala Arg Val Ser Cys Val Ser
805 810 815
Asn Asp Gln Ile Val Asn Leu Ala Asn Ile Met Ser Thr Val Ser Thr
820 825 830
Asn Ala Leu Thr Val Ala Gln His Ser Gln Ser Leu Ile Lys Pro Met
835 840 845
Gly Asp Phe Leu Leu Met Ser Val Gln Ala Val Phe His Tyr Leu Leu
850 855 860
Phe Ser Pro Ile Leu Lys Gly Arg Val Tyr Lys Ile Leu Ser Ala Glu
865 870 875 880
Gly Asp Ser Phe Leu Leu Ala Met Ser Arg Ile Ile Tyr Leu Asp Pro
885 890 895
Ser Leu Gly Gly Val Ser Gly Met Ser Leu Gly Arg Phe His Ile Arg
900 905 910
Gln Phe Ser Asp Pro Val Ser Glu Gly Leu Ser Phe Trp Arg Glu Ile
915 920 925
Trp Leu Ser Ser His Glu Ser Trp Val His Ala Leu Cys Gln Glu Ala
930 935 940
Gly Asn Pro Asp Leu Gly Glu Arg Thr Leu Glu Ser Phe Thr Arg Leu
945 950 955 960
Leu Glu Asp Pro Thr Thr Leu Asn Ile Arg Gly Gly Ala Ser Pro Thr
965 970 975
Ile Leu Leu Lys Asp Ala Ile Arg Lys Ala Leu Tyr Asp Glu Val Asp
980 985 990
Lys Val Glu Asn Ser Glu Phe Arg Glu Ala Ile Leu Leu Ser Lys Thr
995 1000 1005
His Arg Asp Asn Phe Ile Leu Phe Leu Thr Ser Val Glu Pro Leu
1010 1015 1020
Phe Pro Arg Phe Leu Ser Glu Leu Phe Ser Ser Ser Phe Leu Gly
1025 1030 1035
Ile Pro Glu Ser Ile Ile Gly Leu Ile Gln Asn Ser Arg Thr Ile
1040 1045 1050
Arg Arg Gln Phe Arg Lys Ser Leu Ser Lys Thr Leu Glu Glu Ser
1055 1060 1065
Phe Tyr Asn Ser Glu Ile His Gly Ile Ser Arg Met Thr Gln Thr
1070 1075 1080
Pro Gln Arg Val Gly Gly Val Trp Pro Cys Ser Ser Glu Arg Ala
1085 1090 1095
Asp Leu Leu Arg Glu Ile Ser Trp Gly Arg Lys Val Val Gly Thr
1100 1105 1110
Thr Val Pro His Pro Ser Glu Met Leu Gly Leu Leu Pro Lys Ser
1115 1120 1125
Ser Ile Ser Cys Thr Cys Gly Ala Thr Gly Gly Gly Asn Pro Arg
1130 1135 1140
Val Ser Val Ser Val Leu Pro Ser Phe Asp Gln Ser Phe Phe Ser
1145 1150 1155
Arg Gly Pro Leu Lys Gly Tyr Leu Gly Ser Ser Thr Ser Met Ser
1160 1165 1170
Thr Gln Leu Phe His Ala Trp Glu Lys Val Thr Asn Val His Val
1175 1180 1185
Val Lys Arg Ala Leu Ser Leu Lys Glu Ser Ile Asn Trp Phe Ile
1190 1195 1200
Thr Arg Asp Ser Asn Leu Ala Gln Ala Leu Ile Arg Asn Ile Met
1205 1210 1215
Ser Leu Thr Gly Pro Asp Phe Pro Leu Glu Glu Ala Pro Val Phe
1220 1225 1230
Lys Arg Thr Gly Ser Ala Leu His Arg Phe Lys Ser Ala Arg Tyr
1235 1240 1245
Ser Glu Gly Gly Tyr Ser Ser Val Cys Pro Asn Leu Leu Ser His
1250 1255 1260
Ile Ser Val Ser Thr Asp Thr Met Ser Asp Leu Thr Gln Asp Gly
1265 1270 1275
Lys Asn Tyr Asp Phe Met Phe Gln Pro Leu Met Leu Tyr Ala Gln
1280 1285 1290
Thr Trp Thr Ser Glu Leu Val Gln Arg Asp Thr Arg Leu Arg Asp
1295 1300 1305
Ser Thr Phe His Trp His Leu Arg Cys Asn Arg Cys Val Arg Pro
1310 1315 1320
Ile Asp Asp Val Thr Leu Glu Thr Ser Gln Ile Phe Glu Phe Pro
1325 1330 1335
Asp Val Ser Lys Arg Ile Ser Arg Met Val Ser Gly Ala Val Pro
1340 1345 1350
His Phe Gln Arg Leu Pro Asp Ile Arg Leu Arg Pro Gly Asp Phe
1355 1360 1365
Glu Ser Leu Ser Gly Arg Glu Lys Ser His His Ile Gly Ser Ala
1370 1375 1380
Gln Gly Leu Leu Tyr Ser Ile Leu Val Ala Ile His Asp Ser Gly
1385 1390 1395
Tyr Asn Asp Gly Thr Ile Phe Pro Ala Asn Ile Tyr Gly Lys Val
1400 1405 1410
Ser Pro Arg Asp Tyr Leu Arg Gly Leu Ala Arg Gly Val Leu Ile
1415 1420 1425
Gly Ser Ser Ile Cys Phe Leu Thr Arg Met Thr Asn Ile Asn Ile
1430 1435 1440
Asn Arg Pro Leu Glu Leu Ile Ser Gly Val Ile Ser Tyr Ile Leu
1445 1450 1455
Leu Arg Leu Asp Asn His Pro Ser Leu Tyr Ile Met Leu Arg Glu
1460 1465 1470
Pro Ser Leu Arg Gly Glu Ile Phe Ser Ile Pro Gln Lys Ile Pro
1475 1480 1485
Ala Ala Tyr Pro Thr Thr Met Lys Glu Gly Asn Arg Ser Ile Leu
1490 1495 1500
Cys Tyr Leu Gln His Val Leu Arg Tyr Glu Arg Glu Ile Ile Thr
1505 1510 1515
Ala Ser Pro Glu Asn Asp Trp Leu Trp Ile Phe Ser Asp Phe Arg
1520 1525 1530
Ser Ala Lys Met Thr Tyr Leu Thr Leu Ile Thr Tyr Gln Ser His
1535 1540 1545
Leu Leu Leu Gln Arg Val Glu Arg Asn Leu Ser Lys Ser Met Arg
1550 1555 1560
Asp Asn Leu Arg Gln Leu Ser Ser Leu Met Arg Gln Val Leu Gly
1565 1570 1575
Gly His Gly Glu Asp Thr Leu Glu Ser Asp Asp Asn Ile Gln Arg
1580 1585 1590
Leu Leu Lys Asp Ser Leu Arg Arg Thr Arg Trp Val Asp Gln Glu
1595 1600 1605
Val Arg His Ala Ala Arg Thr Met Thr Gly Asp Tyr Ser Pro Asn
1610 1615 1620
Lys Lys Val Ser Arg Lys Val Gly Cys Ser Glu Trp Val Cys Ser
1625 1630 1635
Ala Gln Gln Val Ala Val Ser Thr Ser Ala Asn Pro Ala Pro Val
1640 1645 1650
Ser Glu Leu Asp Ile Arg Ala Leu Ser Lys Arg Phe Gln Asn Pro
1655 1660 1665
Leu Ile Ser Gly Leu Arg Val Val Gln Trp Ala Thr Gly Ala His
1670 1675 1680
Tyr Lys Leu Lys Pro Ile Leu Asp Asp Leu Asn Val Phe Pro Ser
1685 1690 1695
Leu Cys Leu Val Val Gly Asp Gly Ser Gly Gly Ile Ser Arg Ala
1700 1705 1710
Val Leu Asn Met Phe Pro Asp Ala Lys Leu Val Phe Asn Ser Leu
1715 1720 1725
Leu Glu Val Asn Asp Leu Met Ala Ser Gly Thr His Pro Leu Pro
1730 1735 1740
Pro Ser Ala Ile Met Arg Gly Gly Asn Gly Ile Val Ser Arg Val
1745 1750 1755
Ile Asp Phe Asp Ser Ile Trp Glu Lys Pro Ser Asp Leu Arg Asn
1760 1765 1770
Leu Ala Thr Trp Lys Tyr Phe Gln Ser Val Gln Lys Gln Val Asn
1775 1780 1785
Met Ser Tyr Asp Leu Ile Ile Cys Asp Ala Glu Val Thr Asp Ile
1790 1795 1800
Ala Ser Ile Asn Arg Ile Thr Leu Leu Met Ser Asp Phe Ala Leu
1805 1810 1815
Ser Ile Asp Gly Pro Leu Tyr Leu Val Phe Lys Thr Tyr Gly Thr
1820 1825 1830
Met Leu Val Asn Pro Asn Tyr Lys Ala Ile Gln His Leu Ser Arg
1835 1840 1845
Ala Phe Pro Ser Val Thr Gly Phe Ile Thr Gln Val Thr Ser Ser
1850 1855 1860
Phe Ser Ser Glu Leu Tyr Leu Arg Phe Ser Lys Arg Gly Lys Leu
1865 1870 1875
Phe Arg Asp Ala Glu Tyr Leu Thr Ser Ser Thr Leu Arg Glu Met
1880 1885 1890
Ser Leu Val Leu Phe Asn Cys Ser Ser Pro Lys Ser Glu Met Gln
1895 1900 1905
Arg Ala Arg Ser Leu Asn Tyr Gln Asp Leu Val Arg Gly Phe Pro
1910 1915 1920
Glu Glu Ile Ile Ser Asn Pro Tyr Asn Glu Met Ile Ile Thr Leu
1925 1930 1935
Ile Asp Ser Asp Val Glu Ser Phe Leu Val His Lys Met Val Asp
1940 1945 1950
Asp Leu Glu Leu Gln Arg Gly Thr Leu Ser Lys Val Ala Ile Ile
1955 1960 1965
Ile Ala Ile Met Ile Val Phe Ser Asn Arg Val Phe Asn Val Ser
1970 1975 1980
Lys Pro Leu Thr Asp Pro Leu Phe Tyr Pro Pro Ser Asp Pro Lys
1985 1990 1995
Ile Leu Arg His Phe Asn Ile Cys Arg Ser Thr Met Met Tyr Leu
2000 2005 2010
Ser Thr Ala Leu Gly Asp Val Pro Ser Phe Ala Arg Leu His Asp
2015 2020 2025
Leu Tyr Asn Arg Pro Ile Thr Tyr Tyr Phe Arg Lys Gln Phe Ile
2030 2035 2040
Arg Gly Asn Val Tyr Leu Ser Trp Ser Trp Ser Asn Asp Thr Ser
2045 2050 2055
Val Phe Lys Arg Val Ala Cys Asn Ser Ser Leu Ser Leu Ser Ser
2060 2065 2070
His Trp Ile Arg Leu Ile Tyr Lys Ile Val Lys Ala Thr Arg Leu
2075 2080 2085
Val Gly Ser Ile Lys Asp Leu Ser Arg Glu Val Glu Arg His Leu
2090 2095 2100
His Arg Tyr Asn Arg Trp Ile Thr Leu Glu Asp Ile Arg Ser Arg
2105 2110 2115
Ser Ser Leu Leu Asp Tyr Ser Cys Leu
2120 2125
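The residue rows above use three-letter amino acid codes with position rows in between. As a minimal sketch (not part of the filed listing), the following Python converts such rows to one-letter codes; the function name `to_one_letter` is hypothetical, and the mapping covers only the 20 standard residues.

```python
import re

# Standard three-letter to one-letter amino acid codes (20 common residues).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(row: str) -> str:
    """Convert one listing row of three-letter codes to one-letter codes.

    Tokens are found by pattern (capital letter plus two lowercase letters),
    so position rows made only of digits yield an empty string and codes
    that run together without a space are still split correctly.
    An unknown code raises KeyError instead of being silently dropped.
    """
    tokens = re.findall(r"[A-Z][a-z]{2}", row)
    return "".join(THREE_TO_ONE[token] for token in tokens)


# Final residue row of the protein sequence above.
print(to_one_letter("Ser Ser Leu Leu Asp Tyr Ser Cys Leu"))  # SSLLDYSCL
```

Applying the function row by row and joining the results gives a one-letter string suitable for alignment tools.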
<210>7
<211>11930
<212>DNA
<213>artificial sequence
<220>
<223>Recombinant ERA rabies virus genome
<220>
<221>misc_feature
<222>(1)..(58)
<223>Leader region
<220>
<221>misc_feature
<222>(71)..(1420)
<220>
<221>misc_feature
<222>(1514)..(2404)
<220>
<221>misc_feature
<222>(2496)..(3101)
<220>
<221>misc_feature
<222>(3317)..(4888)
<220>
<221>misc_feature
<222>(4963)..(5361)
<223>Psi region
<220>
<221>misc_feature
<222>(5416)..(11796)
<220>
<221>misc_feature
<222>(11861)..(11930)
<223>Trailer region
<400>7
acgcttaaca accagatcaa agaaaaaaca gacattgtca attgcaaagc aaaaatgtaa 60
cacccctaca atggatgccg acaagattgt attcaaagtc aataatcagg tggtctcttt 120
gaagcctgag attatcgtgg atcaacatga gtacaagtac cctgccatca aagatttgaa 180
aaagccctgt ataaccctag gaaaggctcc cgatttaaat aaagcataca agtcagtttt 240
gtcaggcatg agcgccgcca aacttgatcc tgacgatgta tgttcctatt tggcagcggc 300
aatgcagttt tttgagggga catgtccgga agactggacc agctatggaa tcgtgattgc 360
acgaaaagga gataagatca ccccaggttc tctggtggag ataaaacgta ctgatgtaga 420
agggaattgg gctctgacag gaggcatgga actgacaaga gaccccactg tccctgagca 480
tgcgtcctta gtcggtcttc tcttgagtct gtataggttg agcaaaatat ccgggcaaaa 540
cactggtaac tataagacaa acattgcaga caggatagag cagatttttg agacagcccc 600
ttttgttaaa atcgtggaac accatactct aatgacaact cacaaaatgt gtgctaattg 660
gagtactata ccaaacttca gatttttggc cggaacctat gacatgtttt tctcccggat 720
tgagcatcta tattcagcaa tcagagtggg cacagttgtc actgcttatg aagactgttc 780
aggactggta tcatttactg ggttcataaa acaaatcaat ctcaccgcta gagaggcaat 840
actatatttc ttccacaaga actttgagga agagataaga agaatgtttg agccagggca 900
ggagacagct gttcctcact cttatttcat ccacttccgt tcactaggct tgagtgggaa 960
atctccttat tcatcaaatg ctgttggtca cgtgttcaat ctcattcact ttgtaggatg 1020
ctatatgggt caagtcagat ccctaaatgc aacggttatt gctgcatgtg ctcctcatga 1080
aatgtctgtt ctagggggct atctgggaga ggaattcttc gggaaaggga catttgaaag 1140
aagattcttc agagatgaga aagaacttca agaatacgag gcggctgaac tgacaaagac 1200
tgacgtagca ctggcagatg atggaactgt caactctgac gacgaggact acttctcagg 1260
tgaaaccaga agtccggagg ctgtttatac tcgaatcatg atgaatggag gtcgactaaa 1320
gagatctcac atacggagat atgtctcagt cagttccaat catcaagccc gtccaaactc 1380
attcgccgag tttctaaaca agacatattc gagtgactca taagaagttg aacaacaaaa 1440
tgccggaaat ctacggattg tgtatatcca tcatgaaaaa aactaacacc cctcctttcg 1500
aaccatccca aacatgagca agatctttgt caatcctagt gctattagag ccggtctggc 1560
cgatcttgag atggctgaag aaactgttga tctgatcaat agaaatatcg aagacaatca 1620
ggctcatctc caaggggaac ccatagaagt ggacaatctc cctgaggata tggggcgact 1680
tcacctggat gatggaaaat cgcccaaccc tggtgagatg gccaaggtgg gagaaggcaa 1740
gtatcgagag gactttcaga tggatgaagg agaggatctt agcttcctgt tccagtcata 1800
cctggaaaat gttggagtcc aaatagtcag acaaatgagg tcaggagaga gatttctcaa 1860
gatatggtca cagaccgtag aagagattat atcctatgtc gcggtcaact ttcccaaccc 1920
tccaggaaag tcttcagagg ataaatcaac ccagactact ggccgagagc tcaagaagga 1980
gacaacaccc actccttctc agagagaaag ccaatcatcg aaagccagga tggcggctca 2040
aattgcttct ggccctccag cccttgaatg gtcggccacc aatgaagagg atgatctatc 2100
agtggaggct gagatcgctc accagattgc agaaagtttc tccaaaaaat ataagtttcc 2160
ctctcgatcc tcagggatac tcttgtataa ttttgagcaa ttgaaaatga accttgatga 2220
tatagttaaa gaggcaaaaa atgtaccagg tgtgacccgt ttagcccatg acgggtccaa 2280
actcccccta agatgtgtac tgggatgggt cgctttggcc aaccctaaga aattccagtt 2340
gttagtcgaa tccgacaagc tgagtaaaat catgcaagat gacttgaatc gctatacatc 2400
ttgctaaccg aacctctcca ctcagtccct ctagacaata aagtccgaga tgtcctaaag 2460
tcaacatgaa aaaaacaggc aacaccactg ataaaatgaa ctttctacgt aagatagtga 2520
aaaattgcag ggacgaggac actcaaaaac cctctcccgt gtcagcccct ctggatgacg 2580
atgacttgtg gcttccaccc cctgaatacg tcccgctgaa agaacttaca agcaagaaga 2640
acatgaggaa cttttgtatc aacggagggg ttaaagtgtg tagcccgaat ggttactcgt 2700
tcaggatcct gcggcacatt ctgaaatcat tcgacgagat atattctggg aatcatagga 2760
tgatcgggtt agccaaagta gttattggac tggctttgtc aggatctcca gtccctgagg 2820
gcatgaactg ggtatacaaa ttgaggagaa cctttatctt ccagtgggct gattccaggg 2880
gccctcttga aggggaggag ttggaatact ctcaggagat cacttgggat gatgatactg 2940
agttcgtcgg attgcaaata agagtgattg caaaacagtg tcatatccag ggcagaatct 3000
ggtgtatcaa catgaacccg agagcatgtc aactatggtc tgacatgtct cttcagacac 3060
aaaggtccga agaggacaaa gattcctctc tgcttctaga ataatcagat tatatcccgc 3120
aaatttatca cttgtttacc tctggaggag agaacatatg ggctcaactc caacccttgg 3180
gagcaatata acaaaaaaca tgttatggtg ccattaaacc gctgcatttc atcaaagtca 3240
agttgattac ctttacattt tgatcctctt ggatgtgaaa aaaactatta acatccctca 3300
aaagactcaa ggaaagatgg ttcctcaggc tctcctgttt gtaccccttc tggtttttcc 3360
attgtgtttt gggaaattcc ctatttacac gataccagac aagcttggtc cctggagccc 3420
gattgacata catcacctca gctgcccaaa caatttggta gtggaggacg aaggatgcac 3480
caacctgtca gggttctcct acatggaact taaagttgga tacatcttag ccataaaaat 3540
gaacgggttc acttgcacag gcgttgtgac ggaggctgaa acctatacta acttcgttgg 3600
ttatgtcaca accacgttca aaagaaagca tttccgccca acaccagatg catgtagagc 3660
cgcgtacaac tggaagatgg ccggtgaccc cagatatgaa gagtctctac acaatccgta 3720
ccctgactac cactggcttc gaactgtaaa aaccaccaag gagtctctcg ttatcatatc 3780
tccaagtgtg gcagatttgg acccatatga cagatccctt cactcgaggg tcttccctag 3840
cgggaagtgc tcaggagtag cggtgtcttc tacctactgc tccactaacc acgattacac 3900
catttggatg cccgagaatc cgagactagg gatgtcttgt gacattttta ccaatagtag 3960
ggggaagaga gcatccaaag ggagtgagac ttgcggcttt gtagatgaaa gaggcctata 4020
taagtcttta aaaggagcat gcaaactcaa gttatgtgga gttctaggac ttagacttat 4080
ggatggaaca tgggtcgcga tgcaaacatc aaatgaaacc aaatggtgcc cccccgatca 4140
gttggtgaac ctgcacgact ttcgctcaga cgaaattgag caccttgttg tagaggagtt 4200
ggtcaggaag agagaggagt gtctggatgc actagagtcc atcatgacaa ccaagtcagt 4260
gagtttcaga cgtcccagtc atttaagaaa acttgtccct gggtttggaa aagcatatac 4320
catattcaac aagaccttga tggaagccga tgctcactac aagtcagtca gaacttggaa 4380
tgagatcctc ccttcaaaag ggtgtttaag agttgggggg aggtgtcatc ctcatgtgaa 4440
cggggtgttt ttcaatggta taatattagg acctgacggc aatgtcttaa tcccagagat 4500
gcaatcatcc ctcctccagc aacatatgga gttgttggaa tcctcggtta tcccccttgt 4560
gcaccccctg gcagacccgt ctaccgtttt caaggacggt gacgaggctg aggattttgt 4620
tgaagttcac cttcccgatg tgcacaatca ggtctcagga gttgacttgg gtctcccgaa 4680
ctgggggaag tatgtattac tgagtgcagg ggccctgact gccttgatgt tgataatttt 4740
cctgatgaca tgttgtagaa gagtcaatcg atcagaacct acgcaacaca atctcagagg 4800
gacagggagg gaggtgtcag tcactcccca aagcgggaag atcatatctt catgggaatc 4860
acacaagagt gggggtgaga ccagactgtg aggactggcc gtcctttcaa cgatccaagt 4920
cctgaagatc acctcccctt ggggggttct ttttgaaaaa aacctgggtt caatagtcct 4980
cctcgaactc catgcaactg ggtagattca agagtcatga gattttcatt aatcctctca 5040
gttgatcaag caagatcatg tagattctca taatagggga gatcttctag cagtttcagt 5100
gactaacggt actttcattc tccaggaact gacaccaaca gttgtagaca aaccacgggg 5160
tgtctcgggt gactctgtgc ttgggcacag acaaaggtca tggtgtgttc catgatagcg 5220
gactcaggat gagttaattg agagaggcag tcttcctccc gtgaaggaca taagcagtag 5280
ctcacaatca tcccgcgtct cagcaaagtg tgcataatta taaagtgctg ggtcatctaa 5340
gcttttcagt cgagaaaaaa acattagatc agaagaacaa ctggcaacac ttctcaacct 5400
gagacctact tcaagatgct cgatcctgga gaggtctatg atgaccctat tgacccaatc 5460
gagttagagg atgaacccag aggaaccccc actgtcccca acatcttgag gaactctgac 5520
tacaatctca actctccttt gatagaagat cctgctagac taatgttaga atggttaaaa 5580
acagggaata gaccttatcg gatgactcta acagacaatt gctccaggtc tttcagagtt 5640
ttgaaagatt atttcaagaa ggtagatttg ggttctctca aggtgggcgg aatggctgca 5700
cagtcaatga tttctctctg gttatatggt gcccactctg aatccaacag gagccggaga 5760
tgtataacag acttggccca tttctattcc aagtcgtccc ccatagagaa gctgttgaat 5820
ctcacgctag gaaatagagg gctgagaatc cccccagagg gagtgttaag ttgccttgag 5880
agggttgatt atgataatgc atttggaagg tatcttgcca acacgtattc ctcttacttg 5940
ttcttccatg taatcacctt atacatgaac gccctagact gggatgaaga aaagaccatc 6000
ctagcattat ggaaagattt aacctcagtg gacatcggga aggacttggt aaagttcaaa 6060
gaccaaatat ggggactgcc gatcgtgaca aaggactttg tttactccca aagttccaat 6120
tgtctttttg acagaaacta cacacttatg ctaaaagaac ttttcttgtc tcgcttcaac 6180
tccttaatgg tcttgctctc tcccccagag ccccgatact cagatgactt gatatctcaa 6240
ctatgccagc tgtacattgc tggggatcaa gtcttgtcta tgtgtggaaa ctccggctat 6300
gaagtcatca aaatattgga gccatatgtc gtgaatagtt tagtccagag agcagaaaag 6360
tttaggcctc tcattcattc cttgggagac tttcctgtat ttataaaaga caaggtaagt 6420
caacttgaag agacgttcgg tccctgtgca agaaggttct ttagggctct ggatcaattc 6480
gacaacatac atgacttggt ttttgtgtat ggctgttaca ggcattgggg gcacccatat 6540
atagattatc gaaagggtct gtcaaaacta tatgatcagg ttcacattaa aaaagtgata 6600
gataagtcct accaggagtg cttagcaagc gacctagcca ggaggatcct tagatggggt 6660
tttgataagt actccaagtg gtatctggat tcaagattcc tagcccgaga ccaccccttg 6720
actccctata tcaaaaccca aacatggcca cccaaacata ttgtagactt ggtgggggat 6780
acatggcaca agctcccgat cacgcagatc tttgagattc ctgaatcaat ggatccgtca 6840
gaaatattgg atgacaaatc acattctttc accagaacga gactagcttc ttggctgtca 6900
gaaaaccgag ggggacctgt tcctagcgaa aaagttatta tcacggccct gtctaagccg 6960
cctgtcaatc cccgagagtt tctgaggtct atagacctcg gaggattgcc agatgaagac 7020
ttgataattg gcctcaagcc aaaggaacgg gaattgaaga ttgaaggtcg attctttgct 7080
ctaatgtcat ggaatctaag attgtatttt gtcatcactg aaaaactctt ggccaactac 7140
atcttgccac tttttgacgc gctgactatg acagacaacc tgaacaaggt gtttaaaaag 7200
ctgatcgaca gggtcaccgg gcaagggctt ttggactatt caagggtcac atatgcattt 7260
cacctggact atgaaaagtg gaacaaccat caaagattag agtcaacaga ggatgtattt 7320
tctgtcctag atcaagtgtt tggattgaag agagtgtttt ctagaacaca cgagtttttt 7380
caaaaggcct ggatctatta ttcagacaga tcagacctca tcgggttacg ggaggatcaa 7440
atatactgct tagatgcgtc caacggccca acctgttgga atggccagga tggcgggcta 7500
gaaggcttac ggcagaaggg ctggagtcta gtcagcttat tgatgataga tagagaatct 7560
caaatcagga acacaagaac caaaatacta gctcaaggag acaaccaggt tttatgtccg 7620
acatatatgt tgtcgccagg gctatctcaa gaggggctcc tctatgaatt ggagagaata 7680
tcaaggaatg cactttcgat atacagagcc gtcgaggaag gggcatctaa gctagggctg 7740
atcaccaaga aagaagagac catgtgtagt tatgacttcc tcatctatgg aaaaacccct 7800
ttgtttagag gtaacatatt ggtgcctgag tccaaaagat gggccagagt ctcttgcgtc 7860
tctaatgacc aaatagtcaa cctcgccaat ataatgtcga cagtgtccac caatgcgcta 7920
acagtggcac aacactctca atctttgatc aaaccgatgg gggattttct gctcatgtca 7980
gtacaggcag tctttcacta cctgctattt agcccaatct taaagggaag agtttacaag 8040
attctgagcg ctgaagggga tagctttctc ctagccatgt caaggataat ctatctagat 8100
ccttctttgg gaggggtatc tggaatgtcc ctcggaagat tccatatacg acagttctca 8160
gaccctgtct ctgaagggtt atccttctgg agagagatct ggttaagctc ccacgagtcc 8220
tgggttcacg cgttgtgtca agaggctgga aacccagatc ttggagagag aacactcgag 8280
agcttcactc gccttctaga agatcctacc accttaaata tcagaggagg ggccagtcct 8340
accattctac tcaaggatgc aatcagaaag gctttatatg acgaggtgga caaggtggag 8400
aattcagagt ttcgagaggc aatcctgttg tccaagaccc atagagataa ttttatactc 8460
ttcttaacat ctgttgagcc tctgtttcct cgatttctca gtgagctatt cagttcgtct 8520
tttttgggaa tccccgagtc aatcattgga ttgatacaaa actcccgaac gataagaagg 8580
cagtttagaa agagtctctc aaaaacttta gaagaatcct tctacaactc agagatccac 8640
gggattagtc ggatgaccca gacacctcag agggttgggg gggtgtggcc ttgctcttca 8700
gagagggcag atctacttag ggagatctct tggggaagaa aagtggtagg cacgacagtt 8760
cctcaccctt ctgagatgtt ggggttactt cccaagtcct ctatttcttg cacttgtgga 8820
gcaacaggag gaggcaatcc tagagtttct gtatcagtac tcccgtcctt tgatcagtca 8880
tttttttcac gaggccccct aaaggggtac ttgggctcgt ccacctctat gtcgacccag 8940
ctattccatg catgggaaaa agtcactaat gttcatgtgg tgaagagagc tctatcgtta 9000
aaagaatcta taaactggtt cattactaga gattccaact tggctcaagc tctaattagg 9060
aacattatgt ctctgacagg ccctgatttc cctctagagg aggcccctgt cttcaaaagg 9120
acggggtcag ccttgcatag gttcaagtct gccagataca gcgaaggagg gtattcttct 9180
gtctgcccga acctcctctc tcatatttct gttagtacag acaccatgtc tgatttgacc 9240
caagacggga agaactacga tttcatgttc cagccattga tgctttatgc acagacatgg 9300
acatcagagc tggtacagag agacacaagg ctaagagact ctacgtttca ttggcacctc 9360
cgatgcaaca ggtgtgtgag acccattgac gacgtgaccc tggagacctc tcagatcttc 9420
gagtttccgg atgtgtcgaa aagaatatcc agaatggttt ctggggctgt gcctcacttc 9480
cagaggcttc ccgatatccg tctgagacca ggagattttg aatctctaag cggtagagaa 9540
aagtctcacc atatcggatc agctcagggg ctcttatact caatcttagt ggcaattcac 9600
gactcaggat acaatgatgg aaccatcttc cctgccaaca tatacggcaa ggtttcccct 9660
agagactatt tgagagggct cgcaagggga gtattgatag gatcctcgat ttgcttcttg 9720
acaagaatga caaatatcaa tattaataga cctcttgaat tgatctcagg ggtaatctca 9780
tatattctcc tgaggctaga taaccatccc tccttgtaca taatgctcag agaaccgtct 9840
cttagaggag agatattttc tatccctcag aaaatccccg ccgcttatcc aaccactatg 9900
aaagaaggca acagatcaat cttgtgttat ctccaacatg tgctacgcta tgagcgagag 9960
ataatcacgg cgtctccaga gaatgactgg ctatggatct tttcagactt tagaagtgcc 10020
aaaatgacgt acctaaccct cattacttac cagtctcatc ttctactcca gagggttgag 10080
agaaacctat ctaagagtat gagagataac ctgcgacaat tgagttcctt gatgaggcag 10140
gtgctgggcg ggcacggaga agatacctta gagtcagacg acaacattca acgactgcta 10200
aaagactctt tacgaaggac aagatgggtg gatcaagagg tgcgccatgc agctagaacc 10260
atgactggag attacagccc caacaagaag gtgtcccgta aggtaggatg ttcagaatgg 10320
gtctgctctg ctcaacaggt tgcagtctct acctcagcaa acccggcccc tgtctcggag 10380
cttgacataa gggccctctc taagaggttc cagaaccctt tgatctcggg cttgagagtg 10440
gttcagtggg caaccggtgc tcattataag cttaagccta ttctagatga tctcaatgtt 10500
ttcccctctc tctgccttgt agttggggac gggtcagggg ggatatcaag ggcagtcctc 10560
aacatgtttc cagatgccaa gcttgtgttc aacagtctct tagaggtgaa tgacctgatg 10620
gcttccggaa cacatccact gcctccttca gcaatcatga ggggaggaaa tggtatcgtc 10680
tccagagtga tagattttga ctcaatctgg gaaaaaccgt ccgacttgag aaacttggca 10740
acctggaaat acttccagtc agtccaaaag caggtcaaca tgtcctatga cctcattatt 10800
tgcgatgcag aagttactga cattgcatct atcaaccgga taaccctgtt aatgtccgat 10860
tttgcattgt ctatagatgg accactctat ttggtcttca aaacttatgg gactatgcta 10920
gtaaatccaa actacaaggc tattcaacac ctgtcaagag cgttcccctc ggtcacaggg 10980
tttatcaccc aagtaacttc gtctttttca tctgagctct acctccgatt ctccaaacga 11040
gggaagcttt tcagagatgc tgagtacttg acctcttcca cccttcgaga aatgagcctt 11100
gtgttattca attgtagcag ccccaagagt gagatgcaga gagctcgttc cttgaactat 11160
caggatcttg tgagaggatt tcctgaagaa atcatatcaa atccttacaa tgagatgatc 11220
ataactctga ttgacagtga tgtagaatct tttctagtcc acaagatggt tgatgatctt 11280
gagttacaga ggggaactct gtctaaagtg gctatcatta tagccatcat gatagttttc 11340
tccaacagag tcttcaacgt ttccaaaccc ctaactgacc ccttgttcta tccaccgtct 11400
gatcccaaaa tcctgaggca cttcaacata tgtcgcagta ctatgatgta tctatctact 11460
gctttaggtg acgtccctag cttcgcaaga cttcacgacc tgtataacag acctataact 11520
tattacttca gaaagcaatt cattcgaggg aacgtttatc tatcttggag ttggtccaac 11580
gacacctcag tgttcaaaag ggtagcctgt aattctagcc tgagtctgtc atctcactgg 11640
atcaggttga tttacaagat agtgaaggct accagactcg ttggcagcat caaggatcta 11700
tccagagaag tggaaagaca ccttcatagg tacaacaggt ggatcaccct agaggatatc 11760
agatctagat catccctact agactacagt tgcctgtgaa ccggatactc ctggaagcct 11820
gcccatgcta agactcttgt gtgatgtatc ttgaaaaaaa caagatccta aatctgaacc 11880
tttggttgtt tgattgtttt tctcattttt gttgtttatt tgttaagcgt 11930
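The <222> fields of SEQ ID NO: 7 give 1-based, inclusive coordinates, so the length of a feature is end minus start plus one. The sketch below, which is an illustration and not part of the filed listing, tabulates those lengths. The coordinates and labels are copied from the feature table; features without a <223> entry are left unlabelled, and the names `FEATURES` and `span_length` are hypothetical.

```python
# Declared sequence length from the <211> field of SEQ ID NO: 7.
DECLARED_LENGTH = 11930

# (start, end, label) copied from the <222>/<223> fields; None = no <223> given.
FEATURES = [
    (1, 58, "Leader region"),
    (71, 1420, None),
    (1514, 2404, None),
    (2496, 3101, None),
    (3317, 4888, None),
    (4963, 5361, "Psi region"),
    (5416, 11796, None),
    (11861, 11930, "Trailer region"),
]


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


for start, end, label in FEATURES:
    name = label or "(unlabelled)"
    print(f"{start:>6}..{end:<6} {span_length(start, end):>5} nt  {name}")

# The last feature should end exactly at the declared sequence length.
assert FEATURES[-1][1] == DECLARED_LENGTH
```

The same calculation applies to the feature tables of the other nucleotide entries once their coordinates are substituted.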
<210>8
<211>11930
<212>DNA
<213>artificial sequence
<220>
<223>Recombinant ERAg3 rabies virus genome
<220>
<221>misc_feature
<222>(71)..(1420)
<220>
<221>misc_feature
<222>(1514)..(2404)
<220>
<221>misc_feature
<222>(2496)..(3101)
<220>
<221>misc_feature
<222>(3317)..(4888)
<220>
<221>misc_feature
<222>(5416)..(11796)
<400>8
acgcttaaca accagatcaa agaaaaaaca gacattgtca attgcaaagc aaaaatgtaa 60
cacccctaca atggatgccg acaagattgt attcaaagtc aataatcagg tggtctcttt 120
gaagcctgag attatcgtgg atcaacatga gtacaagtac cctgccatca aagatttgaa 180
aaagccctgt ataaccctag gaaaggctcc cgatttaaat aaagcataca agtcagtttt 240
gtcaggcatg agcgccgcca aacttgatcc tgacgatgta tgttcctatt tggcagcggc 300
aatgcagttt tttgagggga catgtccgga agactggacc agctatggaa tcgtgattgc 360
acgaaaagga gataagatca ccccaggttc tctggtggag ataaaacgta ctgatgtaga 420
agggaattgg gctctgacag gaggcatgga actgacaaga gaccccactg tccctgagca 480
tgcgtcctta gtcggtcttc tcttgagtct gtataggttg agcaaaatat ccgggcaaaa 540
cactggtaac tataagacaa acattgcaga caggatagag cagatttttg agacagcccc 600
ttttgttaaa atcgtggaac accatactct aatgacaact cacaaaatgt gtgctaattg 660
gagtactata ccaaacttca gatttttggc cggaacctat gacatgtttt tctcccggat 720
tgagcatcta tattcagcaa tcagagtggg cacagttgtc actgcttatg aagactgttc 780
aggactggta tcatttactg ggttcataaa acaaatcaat ctcaccgcta gagaggcaat 840
actatatttc ttccacaaga actttgagga agagataaga agaatgtttg agccagggca 900
ggagacagct gttcctcact cttatttcat ccacttccgt tcactaggct tgagtgggaa 960
atctccttat tcatcaaatg ctgttggtca cgtgttcaat ctcattcact ttgtaggatg 1020
ctatatgggt caagtcagat ccctaaatgc aacggttatt gctgcatgtg ctcctcatga 1080
aatgtctgtt ctagggggct atctgggaga ggaattcttc gggaaaggga catttgaaag 1140
aagattcttc agagatgaga aagaacttca agaatacgag gcggctgaac tgacaaagac 1200
tgacgtagca ctggcagatg atggaactgt caactctgac gacgaggact acttctcagg 1260
tgaaaccaga agtccggagg ctgtttatac tcgaatcatg atgaatggag gtcgactaaa 1320
gagatctcac atacggagat atgtctcagt cagttccaat catcaagccc gtccaaactc 1380
attcgccgag tttctaaaca agacatattc gagtgactca taagaagttg aacaacaaaa 1440
tgccggaaat ctacggattg tgtatatcca tcatgaaaaa aactaacacc cctcctttcg 1500
aaccatccca aacatgagca agatctttgt caatcctagt gctattagag ccggtctggc 1560
cgatcttgag atggctgaag aaactgttga tctgatcaat agaaatatcg aagacaatca 1620
ggctcatctc caaggggaac ccatagaagt ggacaatctc cctgaggata tggggcgact 1680
tcacctggat gatggaaaat cgcccaaccc tggtgagatg gccaaggtgg gagaaggcaa 1740
gtatcgagag gactttcaga tggatgaagg agaggatctt agcttcctgt tccagtcata 1800
cctggaaaat gttggagtcc aaatagtcag acaaatgagg tcaggagaga gatttctcaa 1860
gatatggtca cagaccgtag aagagattat atcctatgtc gcggtcaact ttcccaaccc 1920
tccaggaaag tcttcagagg ataaatcaac ccagactact ggccgagagc tcaagaagga 1980
gacaacaccc actccttctc agagagaaag ccaatcatcg aaagccagga tggcggctca 2040
aattgcttct ggccctccag cccttgaatg gtcggccacc aatgaagagg atgatctatc 2100
agtggaggct gagatcgctc accagattgc agaaagtttc tccaaaaaat ataagtttcc 2160
ctctcgatcc tcagggatac tcttgtataa ttttgagcaa ttgaaaatga accttgatga 2220
tatagttaaa gaggcaaaaa atgtaccagg tgtgacccgt ttagcccatg acgggtccaa 2280
actcccccta agatgtgtac tgggatgggt cgctttggcc aaccctaaga aattccagtt 2340
gttagtcgaa tccgacaagc tgagtaaaat catgcaagat gacttgaatc gctatacatc 2400
ttgctaaccg aacctctcca ctcagtccct ctagacaata aagtccgaga tgtcctaaag 2460
tcaacatgaa aaaaacaggc aacaccactg ataaaatgaa ctttctacgt aagatagtga 2520
aaaattgcag ggacgaggac actcaaaaac cctctcccgt gtcagcccct ctggatgacg 2580
atgacttgtg gcttccaccc cctgaatacg tcccgctgaa agaacttaca agcaagaaga 2640
acatgaggaa cttttgtatc aacggagggg ttaaagtgtg tagcccgaat ggttactcgt 2700
tcaggatcct gcggcacatt ctgaaatcat tcgacgagat atattctggg aatcatagga 2760
tgatcgggtt agccaaagta gttattggac tggctttgtc aggatctcca gtccctgagg 2820
gcatgaactg ggtatacaaa ttgaggagaa cctttatctt ccagtgggct gattccaggg 2880
gccctcttga aggggaggag ttggaatact ctcaggagat cacttgggat gatgatactg 2940
agttcgtcgg attgcaaata agagtgattg caaaacagtg tcatatccag ggcagaatct 3000
ggtgtatcaa catgaacccg agagcatgtc aactatggtc tgacatgtct cttcagacac 3060
aaaggtccga agaggacaaa gattcctctc tgcttctaga ataatcagat tatatcccgc 3120
aaatttatca cttgtttacc tctggaggag agaacatatg ggctcaactc caacccttgg 3180
gagcaatata acaaaaaaca tgttatggtg ccattaaacc gctgcatttc atcaaagtca 3240
agttgattac ctttacattt tgatcctctt ggatgtgaaa aaaactatta acatccctca 3300
aaagactcaa ggaaagatgg ttcctcaggc tctcctgttt gtaccccttc tggtttttcc 3360
attgtgtttt gggaaattcc ctatttacac gataccagac aagcttggtc cctggagccc 3420
gattgacata catcacctca gctgcccaaa caatttggta gtggaggacg aaggatgcac 3480
caacctgtca gggttctcct acatggaact taaagttgga tacatcttag ccataaaaat 3540
gaacgggttc acttgcacag gcgttgtgac ggaggctgaa acctatacta acttcgttgg 3600
ttatgtcaca accacgttca aaagaaagca tttccgccca acaccagatg catgtagagc 3660
cgcgtacaac tggaagatgg ccggtgaccc cagatatgaa gagtctctac acaatccgta 3720
ccctgactac cactggcttc gaactgtaaa aaccaccaag gagtctctcg ttatcatatc 3780
tccaagtgtg gcagatttgg acccatatga cagatccctt cactcgaggg tcttccctag 3840
cgggaagtgc tcaggagtag cggtgtcttc tacctactgc tccactaacc acgattacac 3900
catttggatg cccgagaatc cgagactagg gatgtcttgt gacattttta ccaatagtag 3960
ggggaagaga gcatccaaag ggagtgagac ttgcggcttt gtagatgaaa gaggcctata 4020
taagtcttta aaaggagcat gcaaactcaa gttatgtgga gttctaggac ttagacttat 4080
ggatggaaca tgggtcgcga tgcaaacatc aaatgaaacc aaatggtgcc cccccgatca 4140
gttggtgaac ctgcacgact ttcgctcaga cgaaattgag caccttgttg tagaggagtt 4200
ggtcaggaag agagaggagt gtctggatgc actagagtcc atcatgacaa ccaagtcagt 4260
gagtttcaga cgtcccagtc atttaagaaa acttgtccct gggtttggaa aagcatatac 4320
catattcaac aagaccttga tggaagccga tgctcactac aagtcagtcg agacttggaa 4380
tgagatcctc ccttcaaaag ggtgtttaag agttgggggg aggtgtcatc ctcatgtgaa 4440
cggggtgttt ttcaatggta taatattagg acctgacggc aatgtcttaa tcccagagat 4500
gcaatcatcc ctcctccagc aacatatgga gttgttggaa tcctcggtta tcccccttgt 4560
gcaccccctg gcagacccgt ctaccgtttt caaggacggt gacgaggctg aggattttgt 4620
tgaagttcac cttcccgatg tgcacaatca ggtctcagga gttgacttgg gtctcccgaa 4680
ctgggggaag tatgtattac tgagtgcagg ggccctgact gccttgatgt tgataatttt 4740
cctgatgaca tgttgtagaa gagtcaatcg atcagaacct acgcaacaca atctcagagg 4800
gacagggagg gaggtgtcag tcactcccca aagcgggaag atcatatctt catgggaatc 4860
acacaagagt gggggtgaga ccagactgtg aggactggcc gtcctttcaa ctatccaagt 4920
cctgaagatc acctcccctt ggggggttct ttttgaaaaa aacctgggtt caatagtcct 4980
cctcgaactc catgcaactg ggtagattca agagtcatga gattttcatt aatcctctca 5040
gttgatcaag caagatcatg tagattctca taatagggga gatcttctag cagtttcagt 5100
gactaacggt actttcattc tccaggaact gacaccaaca gttgtagaca aaccacgggg 5160
tgtctcgggt gactctgtgc ttgggcacag acaaaggtca tggtgtgttc catgatagcg 5220
gactcaggat gagttaattg agagaggcag tcttcctccc gtgaaggaca taagcagtag 5280
ctcacaatca tcccgcgtct cagcaaagtg tgcataatta taaagtgctg ggtcatctaa 5340
gcttttcagt cgagaaaaaa acattagatc agaagaacaa ctggcaacac ttctcaacct 5400
gagacctact tcaagatgct cgatcctgga gaggtctatg atgaccctat tgacccaatc 5460
gagttagagg atgaacccag aggaaccccc actgtcccca acatcttgag gaactctgac 5520
tacaatctca actctccttt gatagaagat cctgctagac taatgttaga atggttaaaa 5580
acagggaata gaccttatcg gatgactcta acagacaatt gctccaggtc tttcagagtt 5640
ttgaaagatt atttcaagaa ggtagatttg ggttctctca aggtgggcgg aatggctgca 5700
cagtcaatga tttctctctg gttatatggt gcccactctg aatccaacag gagccggaga 5760
tgtataacag acttggccca tttctattcc aagtcgtccc ccatagagaa gctgttgaat 5820
ctcacgctag gaaatagagg gctgagaatc cccccagagg gagtgttaag ttgccttgag 5880
agggttgatt atgataatgc atttggaagg tatcttgcca acacgtattc ctcttacttg 5940
ttcttccatg taatcacctt atacatgaac gccctagact gggatgaaga aaagaccatc 6000
ctagcattat ggaaagattt aacctcagtg gacatcggga aggacttggt aaagttcaaa 6060
gaccaaatat ggggactgcc gatcgtgaca aaggactttg tttactccca aagttccaat 6120
tgtctttttg acagaaacta cacacttatg ctaaaagaac ttttcttgtc tcgcttcaac 6180
tccttaatgg tcttgctctc tcccccagag ccccgatact cagatgactt gatatctcaa 6240
ctatgccagc tgtacattgc tggggatcaa gtcttgtcta tgtgtggaaa ctccggctat 6300
gaagtcatca aaatattgga gccatatgtc gtgaatagtt tagtccagag agcagaaaag 6360
tttaggcctc tcattcattc cttgggagac tttcctgtat ttataaaaga caaggtaagt 6420
caacttgaag agacgttcgg tccctgtgca agaaggttct ttagggctct ggatcaattc 6480
gacaacatac atgacttggt ttttgtgtat ggctgttaca ggcattgggg gcacccatat 6540
atagattatc gaaagggtct gtcaaaacta tatgatcagg ttcacattaa aaaagtgata 6600
gataagtcct accaggagtg cttagcaagc gacctagcca ggaggatcct tagatggggt 6660
tttgataagt actccaagtg gtatctggat tcaagattcc tagcccgaga ccaccccttg 6720
actccttata tcaaaaccca aacatggcca cccaaacata ttgtagactt ggtgggggat 6780
acatggcaca agctcccgat cacgcagatc tttgagattc ctgaatcaat ggatccgtca 6840
gaaatattgg atgacaaatc acattctttc accagaacga gactagcttc ttggctgtca 6900
gaaaaccgag ggggacctgt tcctagcgaa aaagttatta tcacggccct gtctaagccg 6960
cctgtcaatc cccgagagtt tctgaggtct atagacctcg gaggattgcc agatgaagac 7020
ttgataattg gcctcaagcc aaaggaacgg gaattgaaga ttgaaggtcg attctttgct 7080
ctaatgtcat ggaatctaag attgtatttt gtcatcactg aaaaactctt ggccaactac 7140
atcttgccac tttttgacgc gctgactatg acagacaacc tgaacaaggt gtttaaaaag 7200
ctgatcgaca gggtcaccgg gcaagggctt ttggactatt caagggtcac atatgcattt 7260
cacctggact atgaaaagtg gaacaaccat caaagattag agtcaacaga ggatgtattt 7320
tctgtcctag atcaagtgtt tggattgaag agagtgtttt ctagaacaca cgagtttttt 7380
caaaaggcct ggatctatta ttcagacaga tcagacctca tcgggttacg ggaggatcaa 7440
atatactgct tagatgcgtc caacggccca acctgttgga atggccagga tggcgggcta 7500
gaaggcttac ggcagaaggg ctggagtcta gtcagcttat tgatgataga tagagaatct 7560
caaatcagga acacaagaac caaaatacta gctcaaggag acaaccaggt tttatgtccg 7620
acatatatgt tgtcgccagg gctatctcaa gaggggctcc tctatgaatt ggagagaata 7680
tcaaggaatg cactttcgat atacagagcc gtcgaggaag gggcatctaa gctagggctg 7740
atcatcaaga aagaagagac catgtgtagt tatgacttcc tcatctatgg aaaaacccct 7800
ttgtttagag gtaacatatt ggtgcctgag tccaaaagat gggccagagt ctcttgcgtc 7860
tctaatgacc aaatagtcaa cctcgccaat ataatgtcga cagtgtccac caatgcgcta 7920
acagtggcac aacactctca atctttgatc aaaccgatga gggattttct gctcatgtca 7980
gtacaggcag tctttcacta cctgctattt agcccaatct taaagggaag agtttacaag 8040
attctgagcg ctgaagggga tagctttctc ctagccatgt caaggataat ctatctagat 8100
ccttctttgg gaggggtatc tggaatgtcc ctcggaagat tccatatacg acagttctca 8160
gaccctgtct ctgaagggtt atccttctgg agagagatct ggttaagctc ccacgagtcc 8220
tggattcacg cgttgtgtca agaggctgga aacccagatc ttggagagag aacactcgag 8280
agcttcactc gccttctaga agatcctacc accttaaata tcagaggagg ggccagtcct 8340
accattctac tcaaggatgc aatcagaaag gctttatatg acgaggtgga caaggtggag 8400
aattcagagt ttcgagaggc aatcctgttg tccaagaccc atagagataa ttttatactc 8460
ttcttaacat ctgttgagcc tctgtttcct cgatttctca gtgagctatt cagttcgtct 8520
tttttgggaa tccccgagtc aatcattgga ttgatacaaa actcccgaac gataagaagg 8580
cagtttagaa agagtctctc aaaaacttta gaagaatcct tctacaactc agagatccac 8640
gggattagtc ggatgaccca gacacctcag agggttgggg gggtgtggcc ttgctcttca 8700
gagagggcag atctacttag ggagatctct tggggaagaa aagtggtagg cacgacagtt 8760
cctcaccctt ctgagatgtt ggggttactt cccaagtcct ctatttcttg cacttgtgga 8820
gcaacaggag gaggcaatcc tagagtttct gtatcagtac tcccgtcctt tgatcagtca 8880
tttttttcac gaggccccct aaaggggtac ttgggctcgt ccacctctat gtcgacccag 8940
ctattccatg catgggaaaa agtcactaat gttcatgtgg tgaagagagc tctatcgtta 9000
aaagaatcta taaactggtt cattactaga gattccaact tggctcaagc tctaattagg 9060
aacattatgt ctctgacagg ccctgatttc cctctagagg aggcccctgt cttcaaaagg 9120
acggggtcag ccttgcatag gttcaagtct gccagataca gcgaaggagg gtattcttct 9180
gtctgcccga acctcctctc tcatatttct gttagtacag acaccatgtc tgatttgacc 9240
caagacggga agaactacga tttcatgttc cagccattga tgctttatgc acagacatgg 9300
acatcagagc tggtacagag agacacaagg ctaagagact ctacgtttca ttggcacctc 9360
cgatgcaaca ggtgtgtgag acccattgac gacgtgaccc tggagacctc tcagatcttc 9420
gagtttccgg atgtgtcgaa aagaatatcc agaatggttt ctggggctgt gcctcacttc 9480
cagaggcttc ccgatatccg tctgagacca ggagattttg aatctctaag cggtagagaa 9540
aagtctcacc atatcggatc agctcagggg ctcttatact caatcttagt ggcaattcac 9600
gactcaggat acaatgatgg aaccatcttc cctgtcaaca tatacgacaa ggtttcccct 9660
agagactatt tgagagggct cgcaagggga gtattgatag gatcctcgat ttgcttcttg 9720
acaagaatga caaatatcaa tattaataga cctcttgaat tgatctcagg ggtaatctca 9780
tatattctcc tgaggctaga taaccatccc tccttgtaca taatgctcag agaaccgtct 9840
cttagaggag agatattttc tatccctcag aaaatccccg ccgcttatcc aaccactatg 9900
aaagaaggca acagatcaat cttgtgttat ctccaacatg tgctacgcta tgagcgagag 9960
ataatcacgg cgtctccaga gaatgactgg ctatggatct tttcagactt tagaagtgcc 10020
aaaatgacgt acctaaccct cattacttac cagtctcatc ttctactcca gagggttgag 10080
agaaacctat ctaagagtat gagagataac ctgcgacaat tgagttcctt gatgaggcag 10140
gtgctgggcg ggcacggaga agatacctta gagtcagacg acaacattca acgactgcta 10200
aaagactctt tacgaaggac aagatgggtg gatcaagagg tgcgccatgc agctagaacc 10260
atgactggag attacagccc caacaagaag gtgtcccgta aggtaggatg ttcagaatgg 10320
gtctgctctg ctcaacaggt tgcagtctct acctcagcaa acccggcccc tgtctcggag 10380
cttgacataa gggccctctc taagaggttc cagaaccctt tgatctcggg cttgagagtg 10440
gttcagtggg caaccggtgc tcattataag cttaagccta ttctagatga tctcaatgtt 10500
ttcccatctc tctgccttgt agttggggac gggtcagggg ggatatcaag ggcagtcctc 10560
aacatgtttc cagatgccaa gcttgtgttc aacagtctct tagaggtgaa tgacctgatg 10620
gcttccggaa cacatccact gcctccttca gcaatcatga ggggaggaaa tgatatcgtc 10680
tccagagtga tagattttga ctcaatctgg gaaaaaccgt ccgacttgag aaacttggca 10740
acctggaaat acttccagtc agtccaaaag caggtcaaca tgtcctatga cctcattatt 10800
tgcgatgcag aagttactga cattgcatct atcaaccgga taaccctgtt aatgtccgat 10860
tttgcattgt ctatagatgg accactctat ttggtcttca aaacttatgg gactatgcta 10920
gtaaatccaa actacaaggc tattcaacac ctgtcaagag cgttcccctc ggtcacaggg 10980
tttatcaccc aagtaacttc gtctttttca tctgagctct acctccgatt ctccaaacga 11040
gggaagtttt tcagagatgc tgagtacttg acctcttcca cccttcgaga aatgagcctt 11100
gtgttattca attgtagcag ccccaagagt gagatgcaga gagctcgttc cttgaactat 11160
caggatcttg tgagaggatt tcctgaagaa atcatatcaa atccttacaa tgagatgatc 11220
ataactctga ttgacagtga tgtagaatct tttctagtcc acaagatggt tgatgatctt 11280
gagttacaga ggggaactct gtctaaagtg gctatcatta tagccatcat gatagttttc 11340
tccaacagag tcttcaacgt ttccaaaccc ctaactgacc ccttgttcta tccaccgtct 11400
gatcccaaaa tcctgaggca cttcaacata tgttgcagta ctatgatgta tctatctact 11460
gctttaggtg acgtccctag cttcgcaaga cttcacgacc tgtataacag acctataact 11520
tattacttca gaaagcaatt cattcgaggg aacgtttatc tatcttggag ttggtccaac 11580
gacacctcag tgttcaaaag ggtagcctgt aattctagcc tgagtctgtc atctcactgg 11640
atcaggttga tttacaagat agtgaagact accagactcg ttggcagcat caaggatcta 11700
tccagagaag tggaaagaca ccttcatagg tacaacaggt ggatcaccct agaggatatc 11760
agatctagat catccctact agactacagt tgcctgtgaa ccggatactc ctggaagcct 11820
gcccatgcta agactcttgt gtgatgtatc ttgaaaaaaa caagatccta aatctgaacc 11880
tttggttgtt tgattgtttt tctcattttt gttgtttatt tgttaagcgt 11930
<210>9
<211>11577
<212>DNA
<213>artificial sequence
<220>
<223>Recombinant ERA-rabies virus genome
<220>
<221>misc_feature
<222>(71)..(1420)
<220>
<221>misc_feature
<222>(1514)..(2404)
<220>
<221>misc_feature
<222>(2496)..(3101)
<220>
<221>misc_feature
<222>(3317)..(4888)
<220>
<221>misc_feature
<222>(5063)..(11443)
<400>9
acgcttaaca accagatcaa agaaaaaaca gacattgtca attgcaaagc aaaaatgtaa 60
cacccctaca atggatgccg acaagattgt attcaaagtc aataatcagg tggtctcttt 120
gaagcctgag attatcgtgg atcaacatga gtacaagtac cctgccatca aagatttgaa 180
aaagccctgt ataaccctag gaaaggctcc cgatttaaat aaagcataca agtcagtttt 240
gtcaggcatg agcgccgcca aacttgatcc tgacgatgta tgttcctatt tggcagcggc 300
aatgcagttt tttgagggga catgtccgga agactggacc agctatggaa tcgtgattgc 360
acgaaaagga gataagatca ccccaggttc tctggtggag ataaaacgta ctgatgtaga 420
agggaattgg gctctgacag gaggcatgga actgacaaga gaccccactg tccctgagca 480
tgcgtcctta gtcggtcttc tcttgagtct gtataggttg agcaaaatat ccgggcaaaa 540
cactggtaac tataagacaa acattgcaga caggatagag cagatttttg agacagcccc 600
ttttgttaaa atcgtggaac accatactct aatgacaact cacaaaatgt gtgctaattg 660
gagtactata ccaaacttca gatttttggc cggaacctat gacatgtttt tctcccggat 720
tgagcatcta tattcagcaa tcagagtggg cacagttgtc actgcttatg aagactgttc 780
aggactggta tcatttactg ggttcataaa acaaatcaat ctcaccgcta gagaggcaat 840
actatatttc ttccacaaga actttgagga agagataaga agaatgtttg agccagggca 900
ggagacagct gttcctcact cttatttcat ccacttccgt tcactaggct tgagtgggaa 960
atctccttat tcatcaaatg ctgttggtca cgtgttcaat ctcattcact ttgtaggatg 1020
ctatatgggt caagtcagat ccctaaatgc aacggttatt gctgcatgtg ctcctcatga 1080
aatgtctgtt ctagggggct atctgggaga ggaattcttc gggaaaggga catttgaaag 1140
aagattcttc agagatgaga aagaacttca agaatacgag gcggctgaac tgacaaagac 1200
tgacgtagca ctggcagatg atggaactgt caactctgac gacgaggact acttctcagg 1260
tgaaaccaga agtccggagg ctgtttatac tcgaatcatg atgaatggag gtcgactaaa 1320
gagatctcac atacggagat atgtctcagt cagttccaat catcaagccc gtccaaactc 1380
attcgccgag tttctaaaca agacatattc gagtgactca taagaagttg aataacaaaa 1440
tgccggaaat ctacggattg tgtatatcca tcatgaaaaa aactaacacc cctcctttcg 1500
aaccatccca aacatgagca agatctttgt caatcctagt gctattagag ccggtctggc 1560
cgatcttgag atggctgaag aaactgttga tctgatcaat agaaatatcg aagacaatca 1620
ggctcatctc caaggggaac ccatagaagt ggacaatctc cctgaggata tggggcgact 1680
tcacctggat gatggaaaat cgcccaaccc tggtgagatg gccaaggtgg gagaaggcaa 1740
gtatcgagag gactttcaga tggatgaagg agaggatcct agcttcctgt tccagtcata 1800
cctggaaaat gttggagtcc aaatagtcag acaaatgagg tcaggagaga gatttctcaa 1860
gatatggtca cagaccgtag aagagattat atcctatgtc gcggtcaact ttcccaaccc 1920
tccaggaaag tcttcagagg ataaatcaac ccagactact ggccgagagc tcaagaagga 1980
gacaacaccc actccttctc agagagaaag ccaatcatcg aaagccagga tggcggctca 2040
aattgcttct ggccctccag cccttgaatg gtcggccacc aatgaagagg atgatctatc 2100
agtggaggct gagatcgctc accagattgc agaaagtttc tccaaaaaat ataagtttcc 2160
ctctcgatcc tcagggatac tcttgtataa ttttgagcaa ttgaaaatga accttgatga 2220
tatagttaaa gaggcaaaaa atgtaccagg tgtgacccgt ttagcccatg acgggtccaa 2280
actcccccta agatgtgtac tgggatgggt cgctttggcc aactctaaga aattccagtt 2340
gttagtcgaa tccgacaagc tgagtaaaat catgcaagat gacttgaatc gctatacatc 2400
ttgctaaccg aacctctcca ctcagtccct ctagacaata aagtccgaga tgtcctaaag 2460
tcaacatgaa aaaaacaggc aacaccactg ataaaatgaa ctttctacgt aagatagtga 2520
aaaattgcag ggacgaggac actcaaaaac cctctcccgt gtcagcccct ctggatgacg 2580
atgacttgtg gcttccaccc cctgaatacg tcccgctgaa agaacttaca agcaagaaga 2640
acatgaggaa cttttgtatc aacggagggg ttaaagtgtg tagcccgaat ggttactcgt 2700
tcaggatcct gcggcacatt ctgaaatcat tcgacgagat atattctggg aatcatagga 2760
tgatcgggtt agtcaaagta gttattggac tggctttgtc aggatctcca gtccctgagg 2820
gcatgaactg ggtatacaaa ttgaggagaa cctttatctt ccagtgggct gattccaggg 2880
gccctcttga aggggaggag ttggaatact ctcaggagat cacttgggat gatgatactg 2940
agttcgtcgg attgcaaata agagtgattg caaaacagtg tcatatccag ggcagaatct 3000
ggtgtatcaa catgaacccg agagcatgtc aactatggtc tgacatgtct cttcagacac 3060
aaaggtccga agaggacaaa gattcctctc tgcttctaga ataatcagat tatatcccgc 3120
aaatttatca cttgtttacc tctggaggag agaacatatg ggctcaactc caacccttgg 3180
gagcaatata acaaaaaaca tgttatggtg ccattaaacc gctgcatttc atcaaagtca 3240
agttgattac ctttacattt tgatcctctt ggatgtgaaa aaaactatta acatccctca 3300
aaagactcaa ggaaagatgg ttcctcaggc tctcctgttt gtaccccttc tggtttttcc 3360
attgtgtttt gggaaattcc ctatttacac gataccagac aagcttggtc cctggagccc 3420
gattgacata catcacctca gctgcccaaa caatttggta gtggaggacg aaggatgcac 3480
caacctgtca gggttctcct acatggaact taaagttgga tacatcttag ccataaaaat 3540
gaacgggttc acttgcacag gcgttgtgac ggaggctgaa acctacacta acttcgttgg 3600
ttatgtcaca accacgttca aaagaaagca tttccgccca acaccagatg catgtagagc 3660
cgcgtacaac tggaagatgg ccggtgaccc cagatatgaa gagtctctac acaatccgta 3720
ccctgactac cactggcttc gaactgtaaa aaccaccaag gagtctctcg ttatcatatc 3780
tccaagtgtg gcagatttgg acccatatga cagatccctt cactcgaggg tcttccctag 3840
cgggaagtgc tcaggagtag cggtgtcttc tacctactgc tccactaacc acgattacac 3900
catttggatg cccgagaatc cgagactagg gatgtcttgt gacattttta ccaatagtag 3960
agggaagaga gcatccaaag ggagtgagac ttgcggcttt gtagatgaaa gaggcctata 4020
taagtcttta aaaggagcat gcaaactcaa gttatgtgga gttctaggac ttagacttat 4080
ggatggaaca tgggtcgcga tgcaaacatc aaatgaaacc aaatggtgcc ctcccgatca 4140
gttggtgaac ctgcacgact ttcgctcaga cgaaattgag caccttgttg tagaggagtt 4200
ggtcaggaag agagaggagt gtctggatgc actagagtcc atcatgacaa ccaagtcagt 4260
gagtttcaga cgtctcagtc atttaagaaa acttgtccct gggtttggaa aagcatatac 4320
catattcaac aagaccttga tggaagccga tgctcactac aagtcagtca gaacttggaa 4380
tgagatcctc ccttcaaaag ggtgtttaag agttgggggg aggtgtcatc ctcatgtgaa 4440
cggggtgttt ttcaatggta taatattagg acctgacggc aatgtcttaa tcccagagat 4500
gcaatcatcc ctcctccagc aacatatgga gttgttggaa tcctcggtta tcccccttgt 4560
gcaccccctg gcagacccgt ctaccgtttt caaggacggt gacgaggctg aggattttgt 4620
tgaagttcac cttcccgatg tgcacaatca ggtctcagga gttgacttgg gtctcccgaa 4680
ctgggggaag tatgtattac tgagtgcagg ggccctgact gccttgatgt tgataatttt 4740
cctgatgaca tgttgtagaa gagtcaatcg atcagaacct acgcaacaca atctcagagg 4800
gacagggagg gaggtgtcag tcactcccca aagcgggaag atcatatctt catgggaatc 4860
acacaagagt gggggtgaga ccagactgtg aggactggcc gtcctttcaa ctatccaagt 4920
cctgaagatc acctcccctt ggggggttca tcatgaaaaa aactaacacc cctcctttcg 4980
ctgcagtttg gtaccgtcga gaaaaaaaca ttagatcaga agaacaactg gcaacacttc 5040
tcaacctgag acctacttca agatgctcga tcctggagag gtctatgatg accctattga 5100
cccaatcgag ttagaggatg aacccagagg aacccccact gtccccaaca tcttgaggaa 5160
ctctgactac aatctcaact ctcctttgat agaagatcct gctagactaa tgttagaatg 5220
gttaaaaaca gggaatagac cttatcggat gactctaaca gacaattgct ccaggtcttt 5280
cagagttttg aaagattatt tcaagaaggt agatttgggt tctctcaagg tgggcggaat 5340
ggctgcacag tcaatgattt ctctctggtt atatggtgcc cactctgaat ccaacaggag 5400
ccggagatgt ataacagact tggcccattt ctattccaag tcgtccccca tagagaagct 5460
gttgaatctc acgctaggaa atagagggct gagaatcccc ccagagggag tgttaagttg 5520
ccttgagagg gttgattatg ataatgcatt tggaaggtat cttgccaaca cgtattcctc 5580
ttacttgttc ttccatgtaa tcaccttata catgaacgcc ctagactggg atgaagaaaa 5640
gaccatccta gcattatgga aagatttaac ctcagtggac atcgggaagg acttggtaaa 5700
gttcaaagac caaatatggg gactgctgat cgtgacaaag gactttgttt actcccaaag 5760
ttccaattgt ctttttgaca gaaactacac acttatgcta aaagatcttt tcttgtctcg 5820
cttcaactcc ttaatggtct tgctctctcc cccagagccc cgatactcag atgacttgat 5880
atctcaacta tgccagctgt acattgctgg ggatcaagtc ttgtctatgt gtggaaactc 5940
cggctatgaa gtcatcaaaa tattggagcc atatgtcgtg aatagtttag tccagagagc 6000
agaaaagttt aggcctctca ttcattcctt gggagacttt cctgtattta taaaagacaa 6060
ggtaagtcaa cttgaagaga cgttcggtcc ctgtgcaaga aggttcttta gggctctgga 6120
tcaattcgac aacatacatg acttggtttt tgtgtatggc tgttacaggc attgggggca 6180
cccatatata gattatcgaa agggtctgtc aaaactatat gatcaggttc acattaaaaa 6240
agtgatagat aagtcctacc aggagtgctt agcaagcgac ctagccagga ggatccttag 6300
atggggtttt gataagtact ccaagtggta tctggattca agattcctag cccgagacca 6360
ccccttgact ccttatatca aaacccaaac atggccaccc aaacatattg tagacttggt 6420
gggggataca tggcacaagc tcccgatcac gcagatcttt gagattcctg aatcaatgga 6480
tccgtcagaa atattggatg acaaatcaca ttctttcacc agaacgagac tagcttcttg 6540
gctgtcagaa aaccgagggg gacctgttcc tagcgaaaaa gttattatca cggccctgtc 6600
taagccgcct gtcaatcccc gagagtttct gaggtctata gacctcggag gattgccaga 6660
tgaagacttg ataattggcc tcaagccaaa ggaacgggaa ttgaagattg aaggtcgatt 6720
ctttgctcta atgtcatgga atctaagatt gtattttgtc atcactgaaa aactcttggc 6780
caactacatc ttgccacttt ttgacgcgct gactatgaca gacaacctga acaaggtgtt 6840
taaaaagctg atcgacaggg tcaccgggca agggcttttg gactattcaa gggtcacata 6900
tgcatttcac ctggactatg aaaagtggaa caaccatcaa agattagagt caacagagga 6960
tgtattttct gtcctagatc aagtgtttgg attgaagaga gtgttttcta gaacacacga 7020
gttttttcaa aaggcctgga tctattattc agacagatca gacctcatcg ggttacggga 7080
ggatcaaata tactgcttag atgcgtccaa cggcccaacc tgttggaatg gccaggatgg 7140
cgggctagaa ggcttacggc agaagggctg gagtctagtc agcttattga tgatagatag 7200
agaatctcaa atcaggaaca caagaaccaa aatactagct caaggagaca accaggtttt 7260
atgtccgaca tatatgttgt cgccagggct atctcaagag gggctcctct atgaattgga 7320
gagaatatca aggaatgcac tttcgatata cagagccgtc gaggaagggg catctaagct 7380
agggctgatc atcaagaaag aagagaccat gtgtagttat gacttcctca tctatggaaa 7440
aacccctttg tttagaggta acatattggt gcctgagtcc aaaagatggg ccagagtctc 7500
ttgcgtctct aatgaccaaa tagtcaacct cgccaatata atgtcgacag tgtccaccaa 7560
tgcgctaaca gtggcacaac actctcaatc tttgatcaaa ccgatgaggg attttctgct 7620
catgtcagta caggcagtct ttcactacct gctatttagc ccaatcttaa agggaagagt 7680
ttacaagatt ctgagcgctg aaggggatag ctttctccta gccatgtcaa ggataatcta 7740
tctagatcct tctttgggag gggtatctgg aatgtccctc ggaagattcc atatacgaca 7800
gttctcagac cctgtctctg aagggttatc cttctggaga gagatctggt taagctccca 7860
cgagtcctgg attcacgcgt tgtgtcaaga ggctggaaac ccagatcttg gagagagaac 7920
actcgagagc ttcactcgcc ttctagaaga tcctaccacc ttaaatatca gaggaggggc 7980
cagtcctacc attctactca aggatgcaat cagaaaggct ttatatgacg aggtggacaa 8040
ggtggagaat tcagagtttc gagaggcaat cctgttgtcc aagacccata gagataattt 8100
tatactcttc ttaacatctg ttgagcctct gtttcctcga tttctcagtg agctattcag 8160
ttcgtctttt ttgggaatcc ccgagtcaat cattggattg atacaaaact cccgaacgat 8220
aagaaggcag tttagaaaga gtctctcaaa aactttagaa gaatccttct acaactcaga 8280
gatccacggg attagtcgga tgacccagac acctcagagg gttggggggg tgtggccttg 8340
ctcttcagag agggcagatc tacttaggga gatctcttgg ggaagaaaag tggtaggcac 8400
gacagttcct cacccttctg agatgttggg gttacttccc aagtcctcta tttcttgcac 8460
ttgtggagca acaggaggag gcaatcctag agtttctgta tcagtactcc cgtcctttga 8520
tcagtcattt ttttcacgag gccccctaaa ggggtacttg ggctcgtcca cctctatgtc 8580
gacccagcta ttccatgcat gggaaaaagt cactaatgtt catgtggtga agagagctct 8640
atcgttaaaa gaatctataa actggttcat tactagagat tccaacttgg ctcaagctct 8700
aattaggaac attatgtctc tgacaggccc tgatttccct ctagaggagg cccctgtctt 8760
caaaaggacg gggtcagcct tgcataggtt caagtctgcc agatacagcg aaggagggta 8820
ttcttctgtc tgcccgaacc tcctctctca tatttctgtt agtacagaca ccatgtctga 8880
tttgacccaa gacgggaaga actacgattt catgttccag ccattgatgc tttatgcaca 8940
gacatggaca tcagagctgg tacagagaga cacaaggcta agagactcta cgtttcattg 9000
gcacctccga tgcaacaggt gtgtgagacc cattgacgac gtgaccctgg agacctctca 9060
gatcttcgag tttccggatg tgtcgaaaag aatatccaga atggtttctg gggctgtgcc 9120
tcacttccag aggcttcccg atatccgtct gagaccagga gattttgaat ctctaagcgg 9180
tagagaaaag tctcaccata tcggatcagc tcaggggctc ttatactcaa tcttagtggc 9240
aattcacgac tcaggataca atgatggaac catcttccct gtcaacatat acgacaaggt 9300
ttcccctaga gactatttga gagggctcgc aaggggagta ttgataggat cctcgatttg 9360
cttcttgaca agaatgacaa atatcaatat taatagacct cttgaattga tctcaggggt 9420
aatctcatat attctcctga ggctagataa ccatccctcc ttgtacataa tgctcagaga 9480
accgtctctt agaggagaga tattttctat ccctcagaaa atccccgccg cttatccaac 9540
cactatgaaa gaaggcaaca gatcaatctt gtgttatctc caacatgtgc tacgctatga 9600
gcgagagata atcacggcgt ctccagagaa tgactggcta tggatctttt cagactttag 9660
aagtgccaaa atgacgtacc taaccctcat tacttaccag tctcatcttc tactccagag 9720
ggttgagaga aacctatcta agagtatgag agataacctg cgacaattga gttccttgat 9780
gaggcaggtg ctgggcgggc acggagaaga taccttagag tcagacgaca acattcaacg 9840
actgctaaaa gactctttac gaaggacaag atgggtggat caagaggtgc gccatgcagc 9900
tagaaccatg actggagatt acagccccaa caagaaggtg tcccgtaagg taggatgttc 9960
agaatgggtc tgctctgctc aacaggttgc agtctctacc tcagcaaacc cggcccctgt 10020
ctcggagctt gacataaggg ccctctctaa gaggttccag aaccctttga tctcgggctt 10080
gagagtggtt cagtgggcaa ccggtgctca ttataagctt aagcctattc tagatgatct 10140
caatgttttc ccatctctct gccttgtagt tggggacggg tcagggggga tatcaagggc 10200
agtcctcaac atgtttccag atgccaagct tgtgttcaac agtctcttag aggtgaatga 10260
cctgatggct tccggaacac atccactgcc tccttcagca atcatgaggg gaggaaatga 10320
tatcgtctcc agagtgatag attttgactc aatctgggaa aaaccgtccg acttgagaaa 10380
cttggcaacc tggaaatact tccagtcagt ccaaaagcag gtcaacatgt cctatgacct 10440
cattatttgc gatgcagaag ttactgacat tgcatctatc aaccggataa ccctgttaat 10500
gtccgatttt gcattgtcta tagatggacc actctatttg gtcttcaaaa cttatgggac 10560
tatgctagta aatccaaact acaaggctat tcaacacctg tcaagagcgt tcccctcggt 10620
cacagggttt atcacccaag taacttcgtc tttttcatct gagctctacc tccgattctc 10680
caaacgaggg aagtttttca gagatgctga gtacttgacc tcttccaccc ttcgagaaat 10740
gagccttgtg ttattcaatt gtagcagccc caagagtgag atgcagagag ctcgttcctt 10800
gaactatcag gatcttgtga gaggatttcc tgaagaaatc atatcaaatc cttacaatga 10860
gatgatcata actctgattg acagtgatgt agaatctttt ctagtccaca agatggttga 10920
tgatcttgag ttacagaggg gaactctgtc taaagtggct atcattatag ccatcatgat 10980
agttttctcc aacagagtct tcaacgtttc caaaccccta actgacccct tgttctatcc 11040
accgtctgat cccaaaatcc tgaggcactt caacatatgt tgcagtacta tgatgtatct 11100
atctactgct ttaggtgacg tccctagctt cgcaagactt cacgacctgt ataacagacc 11160
tataacttat tacttcagaa agcaattcat tcgagggaac gtttatctat cttggagttg 11220
gtccaacgac acctcagtgt tcaaaagggt agcctgtaat tctagcctga gtctgtcatc 11280
tcactggatc aggttgattt acaagatagt gaagactacc agactcgttg gcagcatcaa 11340
ggatctatcc agagaagtgg aaagacacct tcataggtac aacaggtgga tcaccctaga 11400
ggatatcaga tctagatcat ccctactaga ctacagttgc ctgtgatccg gatactcctg 11460
gaagcctgcc catgctaaga ctcttgtgtg atgtatcttg aaaaaaacaa gatcctaaat 11520
ctgaaccttt ggttgtttga ttgtttttct catttttgtt gtttatttgt taagcgt 11577
<210>10
<211>13150
<212>DNA
<213>artificial sequence
<220>
<223>Recombinant ERA-2G rabies virus genome
<220>
<221>misc_feature
<222>(71)..(1420)
<220>
<221>misc_feature
<222>(1514)..(2404)
<220>
<221>misc_feature
<222>(2496)..(3101)
<220>
<221>misc_feature
<222>(3317)..(4888)
<220>
<221>misc_feature
<222>(4988)..(6559)
<220>
<221>misc_feature
<222>(6636)..(13016)
<400>10
acgcttaaca accagatcaa agaaaaaaca gacattgtca attgcaaagc aaaaatgtaa 60
cacccctaca atggatgccg acaagattgt attcaaagtc aataatcagg tggtctcttt 120
gaagcctgag attatcgtgg atcaacatga gtacaagtac cctgccatca aagatttgaa 180
aaagccctgt ataaccctag gaaaggctcc cgatttaaat aaagcataca agtcagtttt 240
gtcaggcatg agcgccgcca aacttgatcc tgacgatgta tgttcctatt tggcagcggc 300
aatgcagttt tttgagggga catgtccgga agactggacc agctatggaa tcgtgattgc 360
acgaaaagga gataagatca ccccaggttc tctggtggag ataaaacgta ctgatgtaga 420
agggaattgg gctctgacag gaggcatgga actgacaaga gaccccactg tccctgagca 480
tgcgtcctta gtcggtcttc tcttgagtct gtataggttg agcaaaatat ccgggcaaaa 540
cactggtaac tataagacaa acattgcaga caggatagag cagatttttg agacagcccc 600
ttttgttaaa atcgtggaac accatactct aatgacaact cacaaaatgt gtgctaattg 660
gagtactata ccaaacttca gatttttggc cggaacctat gacatgtttt tctcccggat 720
tgagcatcta tattcagcaa tcagagtggg cacagttgtc actgcttatg aagactgttc 780
aggactggta tcatttactg ggttcataaa acaaatcaat ctcaccgcta gagaggcaat 840
actatatttc ttccacaaga actttgagga agagataaga agaatgtttg agccagggca 900
ggagacagct gttcctcact cttatttcat ccacttccgt tcactaggct tgagtgggaa 960
atctccttat tcatcaaatg ctgttggtca cgtgttcaat ctcattcact ttgtaggatg 1020
ctatatgggt caagtcagat ccctaaatgc aacggttatt gctgcatgtg ctcctcatga 1080
aatgtctgtt ctagggggct atctgggaga ggaattcttc gggaaaggga catttgaaag 1140
aagattcttc agagatgaga aagaacttca agaatacgag gcggctgaac tgacaaagac 1200
tgacgtagca ctggcagatg atggaactgt caactctgac gacgaggact acttctcagg 1260
tgaaaccaga agtccggagg ctgtttatac tcgaatcatg atgaatggag gtcgactaaa 1320
gagatctcac atacggagat atgtctcagt cagttccaat catcaagccc gtccaaactc 1380
attcgccgag tttctaaaca agacatattc gagtgactca taagaagttg aataacaaaa 1440
tgccggaaat ctacggattg tgtatatcca tcatgaaaaa aactaacacc cctcctttcg 1500
aaccatccca aacatgagca agatctttgt caatcctagt gctattagag ccggtctggc 1560
cgatcttgag atggctgaag aaactgttga tctgatcaat agaaatatcg aagacaatca 1620
ggctcatctc caaggggaac ccatagaagt ggacaatctc cctgaggata tggggcgact 1680
tcacctggat gatggaaaat cgcccaaccc tggtgagatg gccaaggtgg gagaaggcaa 1740
gtatcgagag gactttcaga tggatgaagg agaggatcct agcttcctgt tccagtcata 1800
cctggaaaat gttggagtcc aaatagtcag acaaatgagg tcaggagaga gatttctcaa 1860
gatatggtca cagaccgtag aagagattat atcctatgtc gcggtcaact ttcccaaccc 1920
tccaggaaag tcttcagagg ataaatcaac ccagactact ggccgagagc tcaagaagga 1980
gacaacaccc actccttctc agagagaaag ccaatcatcg aaagccagga tggcggctca 2040
aattgcttct ggccctccag cccttgaatg gtcggccacc aatgaagagg atgatctatc 2100
agtggaggct gagatcgctc accagattgc agaaagtttc tccaaaaaat ataagtttcc 2160
ctctcgatcc tcagggatac tcttgtataa ttttgagcaa ttgaaaatga accttgatga 2220
tatagttaaa gaggcaaaaa atgtaccagg tgtgacccgt ttagcccatg acgggtccaa 2280
actcccccta agatgtgtac tgggatgggt cgctttggcc aactctaaga aattccagtt 2340
gttagtcgaa tccgacaagc tgagtaaaat catgcaagat gacttgaatc gctatacatc 2400
ttgctaaccg aacctctcca ctcagtccct ctagacaata aagtccgaga tgtcctaaag 2460
tcaacatgaa aaaaacaggc aacaccactg ataaaatgaa ctttctacgt aagatagtga 2520
aaaattgcag ggacgaggac actcaaaaac cctctcccgt gtcagcccct ctggatgacg 2580
atgacttgtg gcttccaccc cctgaatacg tcccgctgaa agaacttaca agcaagaaga 2640
acatgaggaa cttttgtatc aacggagggg ttaaagtgtg tagcccgaat ggttactcgt 2700
tcaggatcct gcggcacatt ctgaaatcat tcgacgagat atattctggg aatcatagga 2760
tgatcgggtt agtcaaagta gttattggac tggctttgtc aggatctcca gtccctgagg 2820
gcatgaactg ggtatacaaa ttgaggagaa cctttatctt ccagtgggct gattccaggg 2880
gccctcttga aggggaggag ttggaatact ctcaggagat cacttgggat gatgatactg 2940
agttcgtcgg attgcaaata agagtgattg caaaacagtg tcatatccag ggcagaatct 3000
ggtgtatcaa catgaacccg agagcatgtc aactatggtc tgacatgtct cttcagacac 3060
aaaggtccga agaggacaaa gattcctctc tgcttctaga ataatcagat tatatcccgc 3120
aaatttatca cttgtttacc tctggaggag agaacatatg ggctcaactc caacccttgg 3180
gagcaatata acaaaaaaca tgttatggtg ccattaaacc gctgcatttc atcaaagtca 3240
agttgattac ctttacattt tgatcctctt ggatgtgaaa aaaactatta acatccctca 3300
aaagactcaa ggaaagatgg ttcctcaggc tctcctgttt gtaccccttc tggtttttcc 3360
attgtgtttt gggaaattcc ctatttacac gataccagac aagcttggtc cctggagccc 3420
gattgacata catcacctca gctgcccaaa caatttggta gtggaggacg aaggatgcac 3480
caacctgtca gggttctcct acatggaact taaagttgga tacatcttag ccataaaaat 3540
gaacgggttc acttgcacag gcgttgtgac ggaggctgaa acctacacta acttcgttgg 3600
ttatgtcaca accacgttca aaagaaagca tttccgccca acaccagatg catgtagagc 3660
cgcgtacaac tggaagatgg ccggtgaccc cagatatgaa gagtctctac acaatccgta 3720
ccctgactac cactggcttc gaactgtaaa aaccaccaag gagtctctcg ttatcatatc 3780
tccaagtgtg gcagatttgg acccatatga cagatccctt cactcgaggg tcttccctag 3840
cgggaagtgc tcaggagtag cggtgtcttc tacctactgc tccactaacc acgattacac 3900
catttggatg cccgagaatc cgagactagg gatgtcttgt gacattttta ccaatagtag 3960
agggaagaga gcatccaaag ggagtgagac ttgcggcttt gtagatgaaa gaggcctata 4020
taagtcttta aaaggagcat gcaaactcaa gttatgtgga gttctaggac ttagacttat 4080
ggatggaaca tgggtcgcga tgcaaacatc aaatgaaacc aaatggtgcc ctcccgatca 4140
gttggtgaac ctgcacgact ttcgctcaga cgaaattgag caccttgttg tagaggagtt 4200
ggtcaggaag agagaggagt gtctggatgc actagagtcc atcatgacaa ccaagtcagt 4260
gagtttcaga cgtctcagtc atttaagaaa acttgtccct gggtttggaa aagcatatac 4320
catattcaac aagaccttga tggaagccga tgctcactac aagtcagtca gaacttggaa 4380
tgagatcctc ccttcaaaag ggtgtttaag agttgggggg aggtgtcatc ctcatgtgaa 4440
cggggtgttt ttcaatggta taatattagg acctgacggc aatgtcttaa tcccagagat 4500
gcaatcatcc ctcctccagc aacatatgga gttgttggaa tcctcggtta tcccccttgt 4560
gcaccccctg gcagacccgt ctaccgtttt caaggacggt gacgaggctg aggattttgt 4620
tgaagttcac cttcccgatg tgcacaatca ggtctcagga gttgacttgg gtctcccgaa 4680
ctgggggaag tatgtattac tgagtgcagg ggccctgact gccttgatgt tgataatttt 4740
cctgatgaca tgttgtagaa gagtcaatcg atcagaacct acgcaacaca atctcagagg 4800
gacagggagg gaggtgtcag tcactcccca aagcgggaag atcatatctt catgggaatc 4860
acacaagagt gggggtgaga ccagactgtg aggactggcc gtcctttcaa ctatccaagt 4920
cctgaagatc acctcccctt ggggggttca tcatgaaaaa aactaacacc cctcctttcg 4980
ctgcaggatg gttcctcagg ctctcctgtt tgtacccctt ctggtttttc cattgtgttt 5040
tgggaaattc cctatttaca cgataccaga caagcttggt ccctggagcc cgattgacat 5100
acatcacctc agctgcccaa acaatttggt agtggaggac gaaggatgca ccaacctgtc 5160
agggttctcc tacatggaac ttaaagttgg atacatctta gccataaaaa tgaacgggtt 5220
cacttgcaca ggcgttgtga cggaggctga aacctacact aacttcgttg gttatgtcac 5280
aaccacgttc aaaagaaagc atttccgccc aacaccagat gcatgtagag ccgcgtacaa 5340
ctggaagatg gccggtgacc ccagatatga agagtctcta cacaatccgt accctgacta 5400
ccactggctt cgaactgtaa aaaccaccaa ggagtctctc gttatcatat ctccaagtgt 5460
ggcagatttg gacccatatg acagatccct tcactcgagg gtcttcccta gcgggaagtg 5520
ctcaggagta gcggtgtctt ctacctactg ctccactaac cacgattaca ccatttggat 5580
gcccgagaat ccgagactag ggatgtcttg tgacattttt accaatagta gagggaagag 5640
agcatccaaa gggagtgaga cttgcggctt tgtagatgaa agaggcctat ataagtcttt 5700
aaaaggagca tgcaaactca agttatgtgg agttctagga cttagactta tggatggaac 5760
atgggtcgcg atgcaaacat caaatgaaac caaatggtgc cctcccgatc agttggtgaa 5820
cctgcacgac tttcgctcag acgaaattga gcaccttgtt gtagaggagt tggtcaggaa 5880
gagagaggag tgtctggatg cactagagtc catcatgaca accaagtcag tgagtttcag 5940
acgtctcagt catttaagaa aacttgtccc tgggtttgga aaagcatata ccatattcaa 6000
caagaccttg atggaagccg atgctcacta caagtcagtc agaacttgga atgagatcct 6060
cccttcaaaa gggtgtttaa gagttggggg gaggtgtcat cctcatgtga acggggtgtt 6120
tttcaatggt ataatattag gacctgacgg caatgtctta atcccagaga tgcaatcatc 6180
cctcctccag caacatatgg agttgttgga atcctcggtt atcccccttg tgcaccccct 6240
ggcagacccg tctaccgttt tcaaggacgg tgacgaggct gaggattttg ttgaagttca 6300
ccttcccgat gtgcacaatc aggtctcagg agttgacttg ggtctcccga actgggggaa 6360
gtatgtatta ctgagtgcag gggccctgac tgccttgatg ttgataattt tcctgatgac 6420
atgttgtaga agagtcaatc gatcagaacc tacgcaacac aatctcagag ggacagggag 6480
ggaggtgtca gtcactcccc aaagcgggaa gatcatatct tcatgggaat cacacaagag 6540
tgggggtgag accagactgt gaggtaccgt cgagaaaaaa acattagatc agaagaacaa 6600
ctggcaacac ttctcaacct gagacctact tcaagatgct cgatcctgga gaggtctatg 6660
atgaccctat tgacccaatc gagttagagg atgaacccag aggaaccccc actgtcccca 6720
acatcttgag gaactctgac tacaatctca actctccttt gatagaagat cctgctagac 6780
taatgttaga atggttaaaa acagggaata gaccttatcg gatgactcta acagacaatt 6840
gctccaggtc tttcagagtt ttgaaagatt atttcaagaa ggtagatttg ggttctctca 6900
aggtgggcgg aatggctgca cagtcaatga tttctctctg gttatatggt gcccactctg 6960
aatccaacag gagccggaga tgtataacag acttggccca tttctattcc aagtcgtccc 7020
ccatagagaa gctgttgaat ctcacgctag gaaatagagg gctgagaatc cccccagagg 7080
gagtgttaag ttgccttgag agggttgatt atgataatgc atttggaagg tatcttgcca 7140
acacgtattc ctcttacttg ttcttccatg taatcacctt atacatgaac gccctagact 7200
gggatgaaga aaagaccatc ctagcattat ggaaagattt aacctcagtg gacatcggga 7260
aggacttggt aaagttcaaa gaccaaatat ggggactgct gatcgtgaca aaggactttg 7320
tttactccca aagttccaat tgtctttttg acagaaacta cacacttatg ctaaaagatc 7380
ttttcttgtc tcgcttcaac tccttaatgg tcttgctctc tcccccagag ccccgatact 7440
cagatgactt gatatctcaa ctatgccagc tgtacattgc tggggatcaa gtcttgtcta 7500
tgtgtggaaa ctccggctat gaagtcatca aaatattgga gccatatgtc gtgaatagtt 7560
tagtccagag agcagaaaag tttaggcctc tcattcattc cttgggagac tttcctgtat 7620
ttataaaaga caaggtaagt caacttgaag agacgttcgg tccctgtgca agaaggttct 7680
ttagggctct ggatcaattc gacaacatac atgacttggt ttttgtgtat ggctgttaca 7740
ggcattgggg gcacccatat atagattatc gaaagggtct gtcaaaacta tatgatcagg 7800
ttcacattaa aaaagtgata gataagtcct accaggagtg cttagcaagc gacctagcca 7860
ggaggatcct tagatggggt tttgataagt actccaagtg gtatctggat tcaagattcc 7920
tagcccgaga ccaccccttg actccttata tcaaaaccca aacatggcca cccaaacata 7980
ttgtagactt ggtgggggat acatggcaca agctcccgat cacgcagatc tttgagattc 8040
ctgaatcaat ggatccgtca gaaatattgg atgacaaatc acattctttc accagaacga 8100
gactagcttc ttggctgtca gaaaaccgag ggggacctgt tcctagcgaa aaagttatta 8160
tcacggccct gtctaagccg cctgtcaatc cccgagagtt tctgaggtct atagacctcg 8220
gaggattgcc agatgaagac ttgataattg gcctcaagcc aaaggaacgg gaattgaaga 8280
ttgaaggtcg attctttgct ctaatgtcat ggaatctaag attgtatttt gtcatcactg 8340
aaaaactctt ggccaactac atcttgccac tttttgacgc gctgactatg acagacaacc 8400
tgaacaaggt gtttaaaaag ctgatcgaca gggtcaccgg gcaagggctt ttggactatt 8460
caagggtcac atatgcattt cacctggact atgaaaagtg gaacaaccat caaagattag 8520
agtcaacaga ggatgtattt tctgtcctag atcaagtgtt tggattgaag agagtgtttt 8580
ctagaacaca cgagtttttt caaaaggcct ggatctatta ttcagacaga tcagacctca 8640
tcgggttacg ggaggatcaa atatactgct tagatgcgtc caacggccca acctgttgga 8700
atggccagga tggcgggcta gaaggcttac ggcagaaggg ctggagtcta gtcagcttat 8760
tgatgataga tagagaatct caaatcagga acacaagaac caaaatacta gctcaaggag 8820
acaaccaggt tttatgtccg acatatatgt tgtcgccagg gctatctcaa gaggggctcc 8880
tctatgaatt ggagagaata tcaaggaatg cactttcgat atacagagcc gtcgaggaag 8940
gggcatctaa gctagggctg atcatcaaga aagaagagac catgtgtagt tatgacttcc 9000
tcatctatgg aaaaacccct ttgtttagag gtaacatatt ggtgcctgag tccaaaagat 9060
gggccagagt ctcttgcgtc tctaatgacc aaatagtcaa cctcgccaat ataatgtcga 9120
cagtgtccac caatgcgcta acagtggcac aacactctca atctttgatc aaaccgatga 9180
gggattttct gctcatgtca gtacaggcag tctttcacta cctgctattt agcccaatct 9240
taaagggaag agtttacaag attctgagcg ctgaagggga tagctttctc ctagccatgt 9300
caaggataat ctatctagat ccttctttgg gaggggtatc tggaatgtcc ctcggaagat 9360
tccatatacg acagttctca gaccctgtct ctgaagggtt atccttctgg agagagatct 9420
ggttaagctc ccacgagtcc tggattcacg cgttgtgtca agaggctgga aacccagatc 9480
ttggagagag aacactcgag agcttcactc gccttctaga agatcctacc accttaaata 9540
tcagaggagg ggccagtcct accattctac tcaaggatgc aatcagaaag gctttatatg 9600
acgaggtgga caaggtggag aattcagagt ttcgagaggc aatcctgttg tccaagaccc 9660
atagagataa ttttatactc ttcttaacat ctgttgagcc tctgtttcct cgatttctca 9720
gtgagctatt cagttcgtct tttttgggaa tccccgagtc aatcattgga ttgatacaaa 9780
actcccgaac gataagaagg cagtttagaa agagtctctc aaaaacttta gaagaatcct 9840
tctacaactc agagatccac gggattagtc ggatgaccca gacacctcag agggttgggg 9900
gggtgtggcc ttgctcttca gagagggcag atctacttag ggagatctct tggggaagaa 9960
aagtggtagg cacgacagtt cctcaccctt ctgagatgtt ggggttactt cccaagtcct 10020
ctatttcttg cacttgtgga gcaacaggag gaggcaatcc tagagtttct gtatcagtac 10080
tcccgtcctt tgatcagtca tttttttcac gaggccccct aaaggggtac ttgggctcgt 10140
ccacctctat gtcgacccag ctattccatg catgggaaaa agtcactaat gttcatgtgg 10200
tgaagagagc tctatcgtta aaagaatcta taaactggtt cattactaga gattccaact 10260
tggctcaagc tctaattagg aacattatgt ctctgacagg ccctgatttc cctctagagg 10320
aggcccctgt cttcaaaagg acggggtcag ccttgcatag gttcaagtct gccagataca 10380
gcgaaggagg gtattcttct gtctgcccga acctcctctc tcatatttct gttagtacag 10440
acaccatgtc tgatttgacc caagacggga agaactacga tttcatgttc cagccattga 10500
tgctttatgc acagacatgg acatcagagc tggtacagag agacacaagg ctaagagact 10560
ctacgtttca ttggcacctc cgatgcaaca ggtgtgtgag acccattgac gacgtgaccc 10620
tggagacctc tcagatcttc gagtttccgg atgtgtcgaa aagaatatcc agaatggttt 10680
ctggggctgt gcctcacttc cagaggcttc ccgatatccg tctgagacca ggagattttg 10740
aatctctaag cggtagagaa aagtctcacc atatcggatc agctcagggg ctcttatact 10800
caatcttagt ggcaattcac gactcaggat acaatgatgg aaccatcttc cctgtcaaca 10860
tatacgacaa ggtttcccct agagactatt tgagagggct cgcaagggga gtattgatag 10920
gatcctcgat ttgcttcttg acaagaatga caaatatcaa tattaataga cctcttgaat 10980
tgatctcagg ggtaatctca tatattctcc tgaggctaga taaccatccc tccttgtaca 11040
taatgctcag agaaccgtct cttagaggag agatattttc tatccctcag aaaatccccg 11100
ccgcttatcc aaccactatg aaagaaggca acagatcaat cttgtgttat ctccaacatg 11160
tgctacgcta tgagcgagag ataatcacgg cgtctccaga gaatgactgg ctatggatct 11220
tttcagactt tagaagtgcc aaaatgacgt acctaaccct cattacttac cagtctcatc 11280
ttctactcca gagggttgag agaaacctat ctaagagtat gagagataac ctgcgacaat 11340
tgagttcctt gatgaggcag gtgctgggcg ggcacggaga agatacctta gagtcagacg 11400
acaacattca acgactgcta aaagactctt tacgaaggac aagatgggtg gatcaagagg 11460
tgcgccatgc agctagaacc atgactggag attacagccc caacaagaag gtgtcccgta 11520
aggtaggatg ttcagaatgg gtctgctctg ctcaacaggt tgcagtctct acctcagcaa 11580
acccggcccc tgtctcggag cttgacataa gggccctctc taagaggttc cagaaccctt 11640
tgatctcggg cttgagagtg gttcagtggg caaccggtgc tcattataag cttaagccta 11700
ttctagatga tctcaatgtt ttcccatctc tctgccttgt agttggggac gggtcagggg 11760
ggatatcaag ggcagtcctc aacatgtttc cagatgccaa gcttgtgttc aacagtctct 11820
tagaggtgaa tgacctgatg gcttccggaa cacatccact gcctccttca gcaatcatga 11880
ggggaggaaa tgatatcgtc tccagagtga tagattttga ctcaatctgg gaaaaaccgt 11940
ccgacttgag aaacttggca acctggaaat acttccagtc agtccaaaag caggtcaaca 12000
tgtcctatga cctcattatt tgcgatgcag aagttactga cattgcatct atcaaccgga 12060
taaccctgtt aatgtccgat tttgcattgt ctatagatgg accactctat ttggtcttca 12120
aaacttatgg gactatgcta gtaaatccaa actacaaggc tattcaacac ctgtcaagag 12180
cgttcccctc ggtcacaggg tttatcaccc aagtaacttc gtctttttca tctgagctct 12240
acctccgatt ctccaaacga gggaagtttt tcagagatgc tgagtacttg acctcttcca 12300
cccttcgaga aatgagcctt gtgttattca attgtagcag ccccaagagt gagatgcaga 12360
gagctcgttc cttgaactat caggatcttg tgagaggatt tcctgaagaa atcatatcaa 12420
atccttacaa tgagatgatc ataactctga ttgacagtga tgtagaatct tttctagtcc 12480
acaagatggt tgatgatctt gagttacaga ggggaactct gtctaaagtg gctatcatta 12540
tagccatcat gatagttttc tccaacagag tcttcaacgt ttccaaaccc ctaactgacc 12600
ccttgttcta tccaccgtct gatcccaaaa tcctgaggca cttcaacata tgttgcagta 12660
ctatgatgta tctatctact gctttaggtg acgtccctag cttcgcaaga cttcacgacc 12720
tgtataacag acctataact tattacttca gaaagcaatt cattcgaggg aacgtttatc 12780
tatcttggag ttggtccaac gacacctcag tgttcaaaag ggtagcctgt aattctagcc 12840
tgagtctgtc atctcactgg atcaggttga tttacaagat agtgaagact accagactcg 12900
ttggcagcat caaggatcta tccagagaag tggaaagaca ccttcatagg tacaacaggt 12960
ggatcaccct agaggatatc agatctagat catccctact agactacagt tgcctgtgat 13020
ccggatactc ctggaagcct gcccatgcta agactcttgt gtgatgtatc ttgaaaaaaa 13080
caagatccta aatctgaacc tttggttgtt tgattgtttt tctcattttt gttgtttatt 13140
tgttaagcgt 13150
<210>11
<211>12266
<212>DNA
<213>artificial sequence
<220>
<223>Recombinant ERAgreen rabies virus genome
<220>
<221>misc_feature
<222>(71)..(1420)
<220>
<221>misc_feature
<222>(1514)..(2404)
<220>
<221>misc_feature
<222>(2496)..(3101)
<220>
<221>misc_feature
<222>(3317)..(4888)
<220>
<221>misc_feature
<222>(4993)..(5673)
<220>
<221>misc_feature
<222>(5752)..(12132)
<400>11
acgcttaaca accagatcaa agaaaaaaca gacattgtca attgcaaagc aaaaatgtaa 60
cacccctaca atggatgccg acaagattgt attcaaagtc aataatcagg tggtctcttt 120
gaagcctgag attatcgtgg atcaacatga gtacaagtac cctgccatca aagatttgaa 180
aaagccctgt ataaccctag gaaaggctcc cgatttaaat aaagcataca agtcagtttt 240
gtcaggcatg agcgccgcca aacttgatcc tgacgatgta tgttcctatt tggcagcggc 300
aatgcagttt tttgagggga catgtccgga agactggacc agctatggaa tcgtgattgc 360
acgaaaagga gataagatca ccccaggttc tctggtggag ataaaacgta ctgatgtaga 420
agggaattgg gctctgacag gaggcatgga actgacaaga gaccccactg tccctgagca 480
tgcgtcctta gtcggtcttc tcttgagtct gtataggttg agcaaaatat ccgggcaaaa 540
cactggtaac tataagacaa acattgcaga caggatagag cagatttttg agacagcccc 600
ttttgttaaa atcgtggaac accatactct aatgacaact cacaaaatgt gtgctaattg 660
gagtactata ccaaacttca gatttttggc cggaacctat gacatgtttt tctcccggat 720
tgagcatcta tattcagcaa tcagagtggg cacagttgtc actgcttatg aagactgttc 780
aggactggta tcatttactg ggttcataaa acaaatcaat ctcaccgcta gagaggcaat 840
actatatttc ttccacaaga actttgagga agagataaga agaatgtttg agccagggca 900
ggagacagct gttcctcact cttatttcat ccacttccgt tcactaggct tgagtgggaa 960
atctccttat tcatcaaatg ctgttggtca cgtgttcaat ctcattcact ttgtaggatg 1020
ctatatgggt caagtcagat ccctaaatgc aacggttatt gctgcatgtg ctcctcatga 1080
aatgtctgtt ctagggggct atctgggaga ggaattcttc gggaaaggga catttgaaag 1140
aagattcttc agagatgaga aagaacttca agaatacgag gcggctgaac tgacaaagac 1200
tgacgtagca ctggcagatg atggaactgt caactctgac gacgaggact acttctcagg 1260
tgaaaccaga agtccggagg ctgtttatac tcgaatcatg atgaatggag gtcgactaaa 1320
gagatctcac atacggagat atgtctcagt cagttccaat catcaagccc gtccaaactc 1380
attcgccgag tttctaaaca agacatattc gagtgactca taagaagttg aataacaaaa 1440
tgccggaaat ctacggattg tgtatatcca tcatgaaaaa aactaacacc cctcctttcg 1500
aaccatccca aacatgagca agatctttgt caatcctagt gctattagag ccggtctggc 1560
cgatcttgag atggctgaag aaactgttga tctgatcaat agaaatatcg aagacaatca 1620
ggctcatctc caaggggaac ccatagaagt ggacaatctc cctgaggata tggggcgact 1680
tcacctggat gatggaaaat cgcccaaccc tggtgagatg gccaaggtgg gagaaggcaa 1740
gtatcgagag gactttcaga tggatgaagg agaggatcct agcttcctgt tccagtcata 1800
cctggaaaat gttggagtcc aaatagtcag acaaatgagg tcaggagaga gatttctcaa 1860
gatatggtca cagaccgtag aagagattat atcctatgtc gcggtcaact ttcccaaccc 1920
tccaggaaag tcttcagagg ataaatcaac ccagactact ggccgagagc tcaagaagga 1980
gacaacaccc actccttctc agagagaaag ccaatcatcg aaagccagga tggcggctca 2040
aattgcttct ggccctccag cccttgaatg gtcggccacc aatgaagagg atgatctatc 2100
agtggaggct gagatcgctc accagattgc agaaagtttc tccaaaaaat ataagtttcc 2160
ctctcgatcc tcagggatac tcttgtataa ttttgagcaa ttgaaaatga accttgatga 2220
tatagttaaa gaggcaaaaa atgtaccagg tgtgacccgt ttagcccatg acgggtccaa 2280
actcccccta agatgtgtac tgggatgggt cgctttggcc aactctaaga aattccagtt 2340
gttagtcgaa tccgacaagc tgagtaaaat catgcaagat gacttgaatc gctatacatc 2400
ttgctaaccg aacctctcca ctcagtccct ctagacaata aagtccgaga tgtcctaaag 2460
tcaacatgaa aaaaacaggc aacaccactg ataaaatgaa ctttctacgt aagatagtga 2520
aaaattgcag ggacgaggac actcaaaaac cctctcccgt gtcagcccct ctggatgacg 2580
atgacttgtg gcttccaccc cctgaatacg tcccgctgaa agaacttaca agcaagaaga 2640
acatgaggaa cttttgtatc aacggagggg ttaaagtgtg tagcccgaat ggttactcgt 2700
tcaggatcct gcggcacatt ctgaaatcat tcgacgagat atattctggg aatcatagga 2760
tgatcgggtt agtcaaagta gttattggac tggctttgtc aggatctcca gtccctgagg 2820
gcatgaactg ggtatacaaa ttgaggagaa cctttatctt ccagtgggct gattccaggg 2880
gccctcttga aggggaggag ttggaatact ctcaggagat cacttgggat gatgatactg 2940
agttcgtcgg attgcaaata agagtgattg caaaacagtg tcatatccag ggcagaatct 3000
ggtgtatcaa catgaacccg agagcatgtc aactatggtc tgacatgtct cttcagacac 3060
aaaggtccga agaggacaaa gattcctctc tgcttctaga ataatcagat tatatcccgc 3120
aaatttatca cttgtttacc tctggaggag agaacatatg ggctcaactc caacccttgg 3180
gagcaatata acaaaaaaca tgttatggtg ccattaaacc gctgcatttc atcaaagtca 3240
agttgattac ctttacattt tgatcctctt ggatgtgaaa aaaactatta acatccctca 3300
aaagactcaa ggaaagatgg ttcctcaggc tctcctgttt gtaccccttc tggtttttcc 3360
attgtgtttt gggaaattcc ctatttacac gataccagac aagcttggtc cctggagccc 3420
gattgacata catcacctca gctgcccaaa caatttggta gtggaggacg aaggatgcac 3480
caacctgtca gggttctcct acatggaact taaagttgga tacatcttag ccataaaaat 3540
gaacgggttc acttgcacag gcgttgtgac ggaggctgaa acctacacta acttcgttgg 3600
ttatgtcaca accacgttca aaagaaagca tttccgccca acaccagatg catgtagagc 3660
cgcgtacaac tggaagatgg ccggtgaccc cagatatgaa gagtctctac acaatccgta 3720
ccctgactac cactggcttc gaactgtaaa aaccaccaag gagtctctcg ttatcatatc 3780
tccaagtgtg gcagatttgg acccatatga cagatccctt cactcgaggg tcttccctag 3840
cgggaagtgc tcaggagtag cggtgtcttc tacctactgc tccactaacc acgattacac 3900
catttggatg cccgagaatc cgagactagg gatgtcttgt gacattttta ccaatagtag 3960
agggaagaga gcatccaaag ggagtgagac ttgcggcttt gtagatgaaa gaggcctata 4020
taagtcttta aaaggagcat gcaaactcaa gttatgtgga gttctaggac ttagacttat 4080
ggatggaaca tgggtcgcga tgcaaacatc aaatgaaacc aaatggtgcc ctcccgatca 4140
gttggtgaac ctgcacgact ttcgctcaga cgaaattgag caccttgttg tagaggagtt 4200
ggtcaggaag agagaggagt gtctggatgc actagagtcc atcatgacaa ccaagtcagt 4260
gagtttcaga cgtctcagtc atttaagaaa acttgtccct gggtttggaa aagcatatac 4320
catattcaac aagaccttga tggaagccga tgctcactac aagtcagtca gaacttggaa 4380
tgagatcctc ccttcaaaag ggtgtttaag agttgggggg aggtgtcatc ctcatgtgaa 4440
cggggtgttt ttcaatggta taatattagg acctgacggc aatgtcttaa tcccagagat 4500
gcaatcatcc ctcctccagc aacatatgga gttgttggaa tcctcggtta tcccccttgt 4560
gcaccccctg gcagacccgt ctaccgtttt caaggacggt gacgaggctg aggattttgt 4620
tgaagttcac cttcccgatg tgcacaatca ggtctcagga gttgacttgg gtctcccgaa 4680
ctgggggaag tatgtattac tgagtgcagg ggccctgact gccttgatgt tgataatttt 4740
cctgatgaca tgttgtagaa gagtcaatcg atcagaacct acgcaacaca atctcagagg 4800
gacagggagg gaggtgtcag tcactcccca aagcgggaag atcatatctt catgggaatc 4860
acacaagagt gggggtgaga ccagactgtg aggactggcc gtcctttcaa ctatccaagt 4920
cctgaagatc acctcccctt ggggggttca tcatgaaaaa aactaacacc cctcctttcg 4980
ctgcaggcca ccatgggcgt gatcaagccc gacatgaaga tcaagctgcg gatggagggc 5040
gccgtgaacg gccacaaatt cgtgatcgag ggcgacggga aaggcaagcc ctttgagggt 5100
aagcagacta tggacctgac cgtgatcgag ggcgcccccc tgcccttcgc ttatgacatt 5160
ctcaccaccg tgttcgacta cggtaaccgt gtcttcgcca agtaccccaa ggacatccct 5220
gactacttca agcagacctt ccccgagggc tactcgtggg agcgaagcat gacatacgag 5280
gaccagggaa tctgtatcgc tacaaacgac atcaccatga tgaagggtgt ggacgactgc 5340
ttcgtgtaca aaatccgctt cgacggggtc aacttccctg ctaatggccc ggtgatgcag 5400
cgcaagaccc taaagtggga gcccagtacc gagaagatgt acgtgcggga cggcgtactg 5460
aagggcgatg ttaatatggc actgctcttg gagggaggcg gccactaccg ctgcgacttc 5520
aagaccacct acaaagccaa gaaggtggtg cagcttcccg actaccactt cgtggaccac 5580
cgcatcgaga tcgtgagcca cgacaaggac tacaacaaag tcaagctgta cgagcacgcc 5640
gaagcccaca gcggactacc ccgccaggcc ggctaatagg taccgtcgag aaaaaaacat 5700
tagatcagaa gaacaactgg caacacttct caacctgaga cctacttcaa gatgctcgat 5760
cctggagagg tctatgatga ccctattgac ccaatcgagt tagaggatga acccagagga 5820
acccccactg tccccaacat cttgaggaac tctgactaca atctcaactc tcctttgata 5880
gaagatcctg ctagactaat gttagaatgg ttaaaaacag ggaatagacc ttatcggatg 5940
actctaacag acaattgctc caggtctttc agagttttga aagattattt caagaaggta 6000
gatttgggtt ctctcaaggt gggcggaatg gctgcacagt caatgatttc tctctggtta 6060
tatggtgccc actctgaatc caacaggagc cggagatgta taacagactt ggcccatttc 6120
tattccaagt cgtcccccat agagaagctg ttgaatctca cgctaggaaa tagagggctg 6180
agaatccccc cagagggagt gttaagttgc cttgagaggg ttgattatga taatgcattt 6240
ggaaggtatc ttgccaacac gtattcctct tacttgttct tccatgtaat caccttatac 6300
atgaacgccc tagactggga tgaagaaaag accatcctag cattatggaa agatttaacc 6360
tcagtggaca tcgggaagga cttggtaaag ttcaaagacc aaatatgggg actgctgatc 6420
gtgacaaagg actttgttta ctcccaaagt tccaattgtc tttttgacag aaactacaca 6480
cttatgctaa aagatctttt cttgtctcgc ttcaactcct taatggtctt gctctctccc 6540
ccagagcccc gatactcaga tgacttgata tctcaactat gccagctgta cattgctggg 6600
gatcaagtct tgtctatgtg tggaaactcc ggctatgaag tcatcaaaat attggagcca 6660
tatgtcgtga atagtttagt ccagagagca gaaaagttta ggcctctcat tcattccttg 6720
ggagactttc ctgtatttat aaaagacaag gtaagtcaac ttgaagagac gttcggtccc 6780
tgtgcaagaa ggttctttag ggctctggat caattcgaca acatacatga cttggttttt 6840
gtgtatggct gttacaggca ttgggggcac ccatatatag attatcgaaa gggtctgtca 6900
aaactatatg atcaggttca cattaaaaaa gtgatagata agtcctacca ggagtgctta 6960
gcaagcgacc tagccaggag gatccttaga tggggttttg ataagtactc caagtggtat 7020
ctggattcaa gattcctagc ccgagaccac cccttgactc cttatatcaa aacccaaaca 7080
tggccaccca aacatattgt agacttggtg ggggatacat ggcacaagct cccgatcacg 7140
cagatctttg agattcctga atcaatggat ccgtcagaaa tattggatga caaatcacat 7200
tctttcacca gaacgagact agcttcttgg ctgtcagaaa accgaggggg acctgttcct 7260
agcgaaaaag ttattatcac ggccctgtct aagccgcctg tcaatccccg agagtttctg 7320
aggtctatag acctcggagg attgccagat gaagacttga taattggcct caagccaaag 7380
gaacgggaat tgaagattga aggtcgattc tttgctctaa tgtcatggaa tctaagattg 7440
tattttgtca tcactgaaaa actcttggcc aactacatct tgccactttt tgacgcgctg 7500
actatgacag acaacctgaa caaggtgttt aaaaagctga tcgacagggt caccgggcaa 7560
gggcttttgg actattcaag ggtcacatat gcatttcacc tggactatga aaagtggaac 7620
aaccatcaaa gattagagtc aacagaggat gtattttctg tcctagatca agtgtttgga 7680
ttgaagagag tgttttctag aacacacgag ttttttcaaa aggcctggat ctattattca 7740
gacagatcag acctcatcgg gttacgggag gatcaaatat actgcttaga tgcgtccaac 7800
ggcccaacct gttggaatgg ccaggatggc gggctagaag gcttacggca gaagggctgg 7860
agtctagtca gcttattgat gatagataga gaatctcaaa tcaggaacac aagaaccaaa 7920
atactagctc aaggagacaa ccaggtttta tgtccgacat atatgttgtc gccagggcta 7980
tctcaagagg ggctcctcta tgaattggag agaatatcaa ggaatgcact ttcgatatac 8040
agagccgtcg aggaaggggc atctaagcta gggctgatca tcaagaaaga agagaccatg 8100
tgtagttatg acttcctcat ctatggaaaa acccctttgt ttagaggtaa catattggtg 8160
cctgagtcca aaagatgggc cagagtctct tgcgtctcta atgaccaaat agtcaacctc 8220
gccaatataa tgtcgacagt gtccaccaat gcgctaacag tggcacaaca ctctcaatct 8280
ttgatcaaac cgatgaggga ttttctgctc atgtcagtac aggcagtctt tcactacctg 8340
ctatttagcc caatcttaaa gggaagagtt tacaagattc tgagcgctga aggggatagc 8400
tttctcctag ccatgtcaag gataatctat ctagatcctt ctttgggagg ggtatctgga 8460
atgtccctcg gaagattcca tatacgacag ttctcagacc ctgtctctga agggttatcc 8520
ttctggagag agatctggtt aagctcccac gagtcctgga ttcacgcgtt gtgtcaagag 8580
gctggaaacc cagatcttgg agagagaaca ctcgagagct tcactcgcct tctagaagat 8640
cctaccacct taaatatcag aggaggggcc agtcctacca ttctactcaa ggatgcaatc 8700
agaaaggctt tatatgacga ggtggacaag gtggagaatt cagagtttcg agaggcaatc 8760
ctgttgtcca agacccatag agataatttt atactcttct taacatctgt tgagcctctg 8820
tttcctcgat ttctcagtga gctattcagt tcgtcttttt tgggaatccc cgagtcaatc 8880
attggattga tacaaaactc ccgaacgata agaaggcagt ttagaaagag tctctcaaaa 8940
actttagaag aatccttcta caactcagag atccacggga ttagtcggat gacccagaca 9000
cctcagaggg ttgggggggt gtggccttgc tcttcagaga gggcagatct acttagggag 9060
atctcttggg gaagaaaagt ggtaggcacg acagttcctc acccttctga gatgttgggg 9120
ttacttccca agtcctctat ttcttgcact tgtggagcaa caggaggagg caatcctaga 9180
gtttctgtat cagtactccc gtcctttgat cagtcatttt tttcacgagg ccccctaaag 9240
gggtacttgg gctcgtccac ctctatgtcg acccagctat tccatgcatg ggaaaaagtc 9300
actaatgttc atgtggtgaa gagagctcta tcgttaaaag aatctataaa ctggttcatt 9360
actagagatt ccaacttggc tcaagctcta attaggaaca ttatgtctct gacaggccct 9420
gatttccctc tagaggaggc ccctgtcttc aaaaggacgg ggtcagcctt gcataggttc 9480
aagtctgcca gatacagcga aggagggtat tcttctgtct gcccgaacct cctctctcat 9540
atttctgtta gtacagacac catgtctgat ttgacccaag acgggaagaa ctacgatttc 9600
atgttccagc cattgatgct ttatgcacag acatggacat cagagctggt acagagagac 9660
acaaggctaa gagactctac gtttcattgg cacctccgat gcaacaggtg tgtgagaccc 9720
attgacgacg tgaccctgga gacctctcag atcttcgagt ttccggatgt gtcgaaaaga 9780
atatccagaa tggtttctgg ggctgtgcct cacttccaga ggcttcccga tatccgtctg 9840
agaccaggag attttgaatc tctaagcggt agagaaaagt ctcaccatat cggatcagct 9900
caggggctct tatactcaat cttagtggca attcacgact caggatacaa tgatggaacc 9960
atcttccctg tcaacatata cgacaaggtt tcccctagag actatttgag agggctcgca 10020
aggggagtat tgataggatc ctcgatttgc ttcttgacaa gaatgacaaa tatcaatatt 10080
aatagacctc ttgaattgat ctcaggggta atctcatata ttctcctgag gctagataac 10140
catccctcct tgtacataat gctcagagaa ccgtctctta gaggagagat attttctatc 10200
cctcagaaaa tccccgccgc ttatccaacc actatgaaag aaggcaacag atcaatcttg 10260
tgttatctcc aacatgtgct acgctatgag cgagagataa tcacggcgtc tccagagaat 10320
gactggctat ggatcttttc agactttaga agtgccaaaa tgacgtacct aaccctcatt 10380
acttaccagt ctcatcttct actccagagg gttgagagaa acctatctaa gagtatgaga 10440
gataacctgc gacaattgag ttccttgatg aggcaggtgc tgggcgggca cggagaagat 10500
accttagagt cagacgacaa cattcaacga ctgctaaaag actctttacg aaggacaaga 10560
tgggtggatc aagaggtgcg ccatgcagct agaaccatga ctggagatta cagccccaac 10620
aagaaggtgt cccgtaaggt aggatgttca gaatgggtct gctctgctca acaggttgca 10680
gtctctacct cagcaaaccc ggcccctgtc tcggagcttg acataagggc cctctctaag 10740
aggttccaga accctttgat ctcgggcttg agagtggttc agtgggcaac cggtgctcat 10800
tataagctta agcctattct agatgatctc aatgttttcc catctctctg ccttgtagtt 10860
ggggacgggt caggggggat atcaagggca gtcctcaaca tgtttccaga tgccaagctt 10920
gtgttcaaca gtctcttaga ggtgaatgac ctgatggctt ccggaacaca tccactgcct 10980
ccttcagcaa tcatgagggg aggaaatgat atcgtctcca gagtgataga ttttgactca 11040
atctgggaaa aaccgtccga cttgagaaac ttggcaacct ggaaatactt ccagtcagtc 11100
caaaagcagg tcaacatgtc ctatgacctc attatttgcg atgcagaagt tactgacatt 11160
gcatctatca accggataac cctgttaatg tccgattttg cattgtctat agatggacca 11220
ctctatttgg tcttcaaaac ttatgggact atgctagtaa atccaaacta caaggctatt 11280
caacacctgt caagagcgtt cccctcggtc acagggttta tcacccaagt aacttcgtct 11340
ttttcatctg agctctacct ccgattctcc aaacgaggga agtttttcag agatgctgag 11400
tacttgacct cttccaccct tcgagaaatg agccttgtgt tattcaattg tagcagcccc 11460
aagagtgaga tgcagagagc tcgttccttg aactatcagg atcttgtgag aggatttcct 11520
gaagaaatca tatcaaatcc ttacaatgag atgatcataa ctctgattga cagtgatgta 11580
gaatcttttc tagtccacaa gatggttgat gatcttgagt tacagagggg aactctgtct 11640
aaagtggcta tcattatagc catcatgata gttttctcca acagagtctt caacgtttcc 11700
aaacccctaa ctgacccctt gttctatcca ccgtctgatc ccaaaatcct gaggcacttc 11760
aacatatgtt gcagtactat gatgtatcta tctactgctt taggtgacgt ccctagcttc 11820
gcaagacttc acgacctgta taacagacct ataacttatt acttcagaaa gcaattcatt 11880
cgagggaacg tttatctatc ttggagttgg tccaacgaca cctcagtgtt caaaagggta 11940
gcctgtaatt ctagcctgag tctgtcatct cactggatca ggttgattta caagatagtg 12000
aagactacca gactcgttgg cagcatcaag gatctatcca gagaagtgga aagacacctt 12060
cataggtaca acaggtggat caccctagag gatatcagat ctagatcatc cctactagac 12120
tacagttgcc tgtgatccgg atactcctgg aagcctgccc atgctaagac tcttgtgtga 12180
tgtatcttga aaaaaacaag atcctaaatc tgaacctttg gttgtttgat tgtttttctc 12240
atttttgttg tttatttgtt aagcgt 12266
<210>12
<211>10288
<212>DNA
<213>artificial sequence
<220>
<223>Recombinant ERA-G rabies virus
<220>
<221>misc_feature
<222>(71)..(1420)
<220>
<221>misc_feature
<222>(1514)..(2404)
<220>
<221>misc_feature
<222>(2496)..(3101)
<220>
<221>misc_feature
<222>(3774)..(10154)
<400>12
acgcttaaca accagatcaa agaaaaaaca gacattgtca attgcaaagc aaaaatgtaa 60
cacccctaca atggatgccg acaagattgt attcaaagtc aataatcagg tggtctcttt 120
gaagcctgag attatcgtgg atcaacatga gtacaagtac cctgccatca aagatttgaa 180
aaagccctgt ataaccctag gaaaggctcc cgatttaaat aaagcataca agtcagtttt 240
gtcaggcatg agcgccgcca aacttgatcc tgacgatgta tgttcctatt tggcagcggc 300
aatgcagttt tttgagggga catgtccgga agactggacc agctatggaa tcgtgattgc 360
acgaaaagga gataagatca ccccaggttc tctggtggag ataaaacgta ctgatgtaga 420
agggaattgg gctctgacag gaggcatgga actgacaaga gaccccactg tccctgagca 480
tgcgtcctta gtcggtcttc tcttgagtct gtataggttg agcaaaatat ccgggcaaaa 540
cactggtaac tataagacaa acattgcaga caggatagag cagatttttg agacagcccc 600
ttttgttaaa atcgtggaac accatactct aatgacaact cacaaaatgt gtgctaattg 660
gagtactata ccaaacttca gatttttggc cggaacctat gacatgtttt tctcccggat 720
tgagcatcta tattcagcaa tcagagtggg cacagttgtc actgcttatg aagactgttc 780
aggactggta tcatttactg ggttcataaa acaaatcaat ctcaccgcta gagaggcaat 840
actatatttc ttccacaaga actttgagga agagataaga agaatgtttg agccagggca 900
ggagacagct gttcctcact cttatttcat ccacttccgt tcactaggct tgagtgggaa 960
atctccttat tcatcaaatg ctgttggtca cgtgttcaat ctcattcact ttgtaggatg 1020
ctatatgggt caagtcagat ccctaaatgc aacggttatt gctgcatgtg ctcctcatga 1080
aatgtctgtt ctagggggct atctgggaga ggaattcttc gggaaaggga catttgaaag 1140
aagattcttc agagatgaga aagaacttca agaatacgag gcggctgaac tgacaaagac 1200
tgacgtagca ctggcagatg atggaactgt caactctgac gacgaggact acttctcagg 1260
tgaaaccaga agtccggagg ctgtttatac tcgaatcatg atgaatggag gtcgactaaa 1320
gagatctcac atacggagat atgtctcagt cagttccaat catcaagccc gtccaaactc 1380
attcgccgag tttctaaaca agacatattc gagtgactca taagaagttg aataacaaaa 1440
tgccggaaat ctacggattg tgtatatcca tcatgaaaaa aactaacacc cctcctttcg 1500
aaccatccca aacatgagca agatctttgt caatcctagt gctattagag ccggtctggc 1560
cgatcttgag atggctgaag aaactgttga tctgatcaat agaaatatcg aagacaatca 1620
ggctcatctc caaggggaac ccatagaagt ggacaatctc cctgaggata tggggcgact 1680
tcacctggat gatggaaaat cgcccaaccc tggtgagatg gccaaggtgg gagaaggcaa 1740
gtatcgagag gactttcaga tggatgaagg agaggatcct agcttcctgt tccagtcata 1800
cctggaaaat gttggagtcc aaatagtcag acaaatgagg tcaggagaga gatttctcaa 1860
gatatggtca cagaccgtag aagagattat atcctatgtc gcggtcaact ttcccaaccc 1920
tccaggaaag tcttcagagg ataaatcaac ccagactact ggccgagagc tcaagaagga 1980
gacaacaccc actccttctc agagagaaag ccaatcatcg aaagccagga tggcggctca 2040
aattgcttct ggccctccag cccttgaatg gtcggccacc aatgaagagg atgatctatc 2100
agtggaggct gagatcgctc accagattgc agaaagtttc tccaaaaaat ataagtttcc 2160
ctctcgatcc tcagggatac tcttgtataa ttttgagcaa ttgaaaatga accttgatga 2220
tatagttaaa gaggcaaaaa atgtaccagg tgtgacccgt ttagcccatg acgggtccaa 2280
actcccccta agatgtgtac tgggatgggt cgctttggcc aactctaaga aattccagtt 2340
gttagtcgaa tccgacaagc tgagtaaaat catgcaagat gacttgaatc gctatacatc 2400
ttgctaaccg aacctctcca ctcagtccct ctagacaata aagtccgaga tgtcctaaag 2460
tcaacatgaa aaaaacaggc aacaccactg ataaaatgaa ctttctacgt aagatagtga 2520
aaaattgcag ggacgaggac actcaaaaac cctctcccgt gtcagcccct ctggatgacg 2580
atgacttgtg gcttccaccc cctgaatacg tcccgctgaa agaacttaca agcaagaaga 2640
acatgaggaa cttttgtatc aacggagggg ttaaagtgtg tagcccgaat ggttactcgt 2700
tcaggatcct gcggcacatt ctgaaatcat tcgacgagat atattctggg aatcatagga 2760
tgatcgggtt agtcaaagta gttattggac tggctttgtc aggatctcca gtccctgagg 2820
gcatgaactg ggtatacaaa ttgaggagaa cctttatctt ccagtgggct gattccaggg 2880
gccctcttga aggggaggag ttggaatact ctcaggagat cacttgggat gatgatactg 2940
agttcgtcgg attgcaaata agagtgattg caaaacagtg tcatatccag ggcagaatct 3000
ggtgtatcaa catgaacccg agagcatgtc aactatggtc tgacatgtct cttcagacac 3060
aaaggtccga agaggacaaa gattcctctc tgcttctaga ataatcagat tatatcccgc 3120
aaatttatca cttgtttacc tctggaggag agaacatatg ggctcaactc caacccttgg 3180
gagcaatata acaaaaaaca tgttatggtg ccattaaacc gctgcatttc atcaaagtca 3240
agttgattac ctttacattt tgatcctctt ggatgtgaaa aaaactaaca cccctctgca 3300
gtttggtacc ttgaaaaaaa cctgggttca atagtcctcc ttgaactcca tgcaactggg 3360
tagattcaag agtcatgaga ttttcattaa tcctctcagt tgatcaagca agatcatgta 3420
gattctcata ataggggaga tcttctagca gtttcagtga ctaacggtac tttcattctc 3480
caggaactga caccaacagt tgtagacaaa ccacggggtg tctcgggtga ctctgtgctt 3540
gggcacagac aaaggtcatg gtgtgttcca tgatagcgga ctcaggatga gttaattgag 3600
agaggcagtc ttcctcccgt gaaggacata agcagtagct cacaatcatc tcgcgtctca 3660
gcaaagtgtg cataattata aagtgctggg tcatctaagc ttttcagtcg agaaaaaaac 3720
attagatcag aagaacaact ggcaacactt ctcaacctga gacctacttc aagatgctcg 3780
atcctggaga ggtctatgat gaccctattg acccaatcga gttagaggat gaacccagag 3840
gaacccccac tgtccccaac atcttgagga actctgacta caatctcaac tctcctttga 3900
tagaagatcc tgctagacta atgttagaat ggttaaaaac agggaataga ccttatcgga 3960
tgactctaac agacaattgc tccaggtctt tcagagtttt gaaagattat ttcaagaagg 4020
tagatttggg ttctctcaag gtgggcggaa tggctgcaca gtcaatgatt tctctctggt 4080
tatatggtgc ccactctgaa tccaacagga gccggagatg tataacagac ttggcccatt 4140
tctattccaa gtcgtccccc atagagaagc tgttgaatct cacgctagga aatagagggc 4200
tgagaatccc cccagaggga gtgttaagtt gccttgagag ggttgattat gataatgcat 4260
ttggaaggta tcttgccaac acgtattcct cttacttgtt cttccatgta atcaccttat 4320
acatgaacgc cctagactgg gatgaagaaa agaccatcct agcattatgg aaagatttaa 4380
cctcagtgga catcgggaag gacttggtaa agttcaaaga ccaaatatgg ggactgctga 4440
tcgtgacaaa ggactttgtt tactcccaaa gttccaattg tctttttgac agaaactaca 4500
cacttatgct aaaagatctt ttcttgtctc gcttcaactc cttaatggtc ttgctctctc 4560
ccccagagcc ccgatactca gatgacttga tatctcaact atgccagctg tacattgctg 4620
gggatcaagt cttgtctatg tgtggaaact ccggctatga agtcatcaaa atattggagc 4680
catatgtcgt gaatagttta gtccagagag cagaaaagtt taggcctctc attcattcct 4740
tgggagactt tcctgtattt ataaaagaca aggtaagtca acttgaagag acgttcggtc 4800
cctgtgcaag aaggttcttt agggctctgg atcaattcga caacatacat gacttggttt 4860
ttgtgtatgg ctgttacagg cattgggggc acccatatat agattatcga aagggtctgt 4920
caaaactata tgatcaggtt cacattaaaa aagtgataga taagtcctac caggagtgct 4980
tagcaagcga cctagccagg aggatcctta gatggggttt tgataagtac tccaagtggt 5040
atctggattc aagattccta gcccgagacc accccttgac tccttatatc aaaacccaaa 5100
catggccacc caaacatatt gtagacttgg tgggggatac atggcacaag ctcccgatca 5160
cgcagatctt tgagattcct gaatcaatgg atccgtcaga aatattggat gacaaatcac 5220
attctttcac cagaacgaga ctagcttctt ggctgtcaga aaaccgaggg ggacctgttc 5280
ctagcgaaaa agttattatc acggccctgt ctaagccgcc tgtcaatccc cgagagtttc 5340
tgaggtctat agacctcgga ggattgccag atgaagactt gataattggc ctcaagccaa 5400
aggaacggga attgaagatt gaaggtcgat tctttgctct aatgtcatgg aatctaagat 5460
tgtattttgt catcactgaa aaactcttgg ccaactacat cttgccactt tttgacgcgc 5520
tgactatgac agacaacctg aacaaggtgt ttaaaaagct gatcgacagg gtcaccgggc 5580
aagggctttt ggactattca agggtcacat atgcatttca cctggactat gaaaagtgga 5640
acaaccatca aagattagag tcaacagagg atgtattttc tgtcctagat caagtgtttg 5700
gattgaagag agtgttttct agaacacacg agttttttca aaaggcctgg atctattatt 5760
cagacagatc agacctcatc gggttacggg aggatcaaat atactgctta gatgcgtcca 5820
acggcccaac ctgttggaat ggccaggatg gcgggctaga aggcttacgg cagaagggct 5880
ggagtctagt cagcttattg atgatagata gagaatctca aatcaggaac acaagaacca 5940
aaatactagc tcaaggagac aaccaggttt tatgtccgac atatatgttg tcgccagggc 6000
tatctcaaga ggggctcctc tatgaattgg agagaatatc aaggaatgca ctttcgatat 6060
acagagccgt cgaggaaggg gcatctaagc tagggctgat catcaagaaa gaagagacca 6120
tgtgtagtta tgacttcctc atctatggaa aaaccccttt gtttagaggt aacatattgg 6180
tgcctgagtc caaaagatgg gccagagtct cttgcgtctc taatgaccaa atagtcaacc 6240
tcgccaatat aatgtcgaca gtgtccacca atgcgctaac agtggcacaa cactctcaat 6300
ctttgatcaa accgatgagg gattttctgc tcatgtcagt acaggcagtc tttcactacc 6360
tgctatttag cccaatctta aagggaagag tttacaagat tctgagcgct gaaggggata 6420
gctttctcct agccatgtca aggataatct atctagatcc ttctttggga ggggtatctg 6480
gaatgtccct cggaagattc catatacgac agttctcaga ccctgtctct gaagggttat 6540
ccttctggag agagatctgg ttaagctccc acgagtcctg gattcacgcg ttgtgtcaag 6600
aggctggaaa cccagatctt ggagagagaa cactcgagag cttcactcgc cttctagaag 6660
atcctaccac cttaaatatc agaggagggg ccagtcctac cattctactc aaggatgcaa 6720
tcagaaaggc tttatatgac gaggtggaca aggtggagaa ttcagagttt cgagaggcaa 6780
tcctgttgtc caagacccat agagataatt ttatactctt cttaacatct gttgagcctc 6840
tgtttcctcg atttctcagt gagctattca gttcgtcttt tttgggaatc cccgagtcaa 6900
tcattggatt gatacaaaac tcccgaacga taagaaggca gtttagaaag agtctctcaa 6960
aaactttaga agaatccttc tacaactcag agatccacgg gattagtcgg atgacccaga 7020
cacctcagag ggttgggggg gtgtggcctt gctcttcaga gagggcagat ctacttaggg 7080
agatctcttg gggaagaaaa gtggtaggca cgacagttcc tcacccttct gagatgttgg 7140
ggttacttcc caagtcctct atttcttgca cttgtggagc aacaggagga ggcaatccta 7200
gagtttctgt atcagtactc ccgtcctttg atcagtcatt tttttcacga ggccccctaa 7260
aggggtactt gggctcgtcc acctctatgt cgacccagct attccatgca tgggaaaaag 7320
tcactaatgt tcatgtggtg aagagagctc tatcgttaaa agaatctata aactggttca 7380
ttactagaga ttccaacttg gctcaagctc taattaggaa cattatgtct ctgacaggcc 7440
ctgatttccc tctagaggag gcccctgtct tcaaaaggac ggggtcagcc ttgcataggt 7500
tcaagtctgc cagatacagc gaaggagggt attcttctgt ctgcccgaac ctcctctctc 7560
atatttctgt tagtacagac accatgtctg atttgaccca agacgggaag aactacgatt 7620
tcatgttcca gccattgatg ctttatgcac agacatggac atcagagctg gtacagagag 7680
acacaaggct aagagactct acgtttcatt ggcacctccg atgcaacagg tgtgtgagac 7740
ccattgacga cgtgaccctg gagacctctc agatcttcga gtttccggat gtgtcgaaaa 7800
gaatatccag aatggtttct ggggctgtgc ctcacttcca gaggcttccc gatatccgtc 7860
tgagaccagg agattttgaa tctctaagcg gtagagaaaa gtctcaccat atcggatcag 7920
ctcaggggct cttatactca atcttagtgg caattcacga ctcaggatac aatgatggaa 7980
ccatcttccc tgtcaacata tacgacaagg tttcccctag agactatttg agagggctcg 8040
caaggggagt attgatagga tcctcgattt gcttcttgac aagaatgaca aatatcaata 8100
ttaatagacc tcttgaattg atctcagggg taatctcata tattctcctg aggctagata 8160
accatccctc cttgtacata atgctcagag aaccgtctct tagaggagag atattttcta 8220
tccctcagaa aatccccgcc gcttatccaa ccactatgaa agaaggcaac agatcaatct 8280
tgtgttatct ccaacatgtg ctacgctatg agcgagagat aatcacggcg tctccagaga 8340
atgactggct atggatcttt tcagacttta gaagtgccaa aatgacgtac ctaaccctca 8400
ttacttacca gtctcatctt ctactccaga gggttgagag aaacctatct aagagtatga 8460
gagataacct gcgacaattg agttccttga tgaggcaggt gctgggcggg cacggagaag 8520
ataccttaga gtcagacgac aacattcaac gactgctaaa agactcttta cgaaggacaa 8580
gatgggtgga tcaagaggtg cgccatgcag ctagaaccat gactggagat tacagcccca 8640
acaagaaggt gtcccgtaag gtaggatgtt cagaatgggt ctgctctgct caacaggttg 8700
cagtctctac ctcagcaaac ccggcccctg tctcggagct tgacataagg gccctctcta 8760
agaggttcca gaaccctttg atctcgggct tgagagtggt tcagtgggca accggtgctc 8820
attataagct taagcctatt ctagatgatc tcaatgtttt cccatctctc tgccttgtag 8880
ttggggacgg gtcagggggg atatcaaggg cagtcctcaa catgtttcca gatgccaagc 8940
ttgtgttcaa cagtctctta gaggtgaatg acctgatggc ttccggaaca catccactgc 9000
ctccttcagc aatcatgagg ggaggaaatg atatcgtctc cagagtgata gattttgact 9060
caatctggga aaaaccgtcc gacttgagaa acttggcaac ctggaaatac ttccagtcag 9120
tccaaaagca ggtcaacatg tcctatgacc tcattatttg cgatgcagaa gttactgaca 9180
ttgcatctat caaccggata accctgttaa tgtccgattt tgcattgtct atagatggac 9240
cactctattt ggtcttcaaa acttatggga ctatgctagt aaatccaaac tacaaggcta 9300
ttcaacacct gtcaagagcg ttcccctcgg tcacagggtt tatcacccaa gtaacttcgt 9360
ctttttcatc tgagctctac ctccgattct ccaaacgagg gaagtttttc agagatgctg 9420
agtacttgac ctcttccacc cttcgagaaa tgagccttgt gttattcaat tgtagcagcc 9480
ccaagagtga gatgcagaga gctcgttcct tgaactatca ggatcttgtg agaggatttc 9540
ctgaagaaat catatcaaat ccttacaatg agatgatcat aactctgatt gacagtgatg 9600
tagaatcttt tctagtccac aagatggttg atgatcttga gttacagagg ggaactctgt 9660
ctaaagtggc tatcattata gccatcatga tagttttctc caacagagtc ttcaacgttt 9720
ccaaacccct aactgacccc ttgttctatc caccgtctga tcccaaaatc ctgaggcact 9780
tcaacatatg ttgcagtact atgatgtatc tatctactgc tttaggtgac gtccctagct 9840
tcgcaagact tcacgacctg tataacagac ctataactta ttacttcaga aagcaattca 9900
ttcgagggaa cgtttatcta tcttggagtt ggtccaacga cacctcagtg ttcaaaaggg 9960
tagcctgtaa ttctagcctg agtctgtcat ctcactggat caggttgatt tacaagatag 10020
tgaagactac cagactcgtt ggcagcatca aggatctatc cagagaagtg gaaagacacc 10080
ttcataggta caacaggtgg atcaccctag aggatatcag atctagatca tccctactag 10140
actacagttg cctgtgatcc ggatactcct ggaagcctgc ccatgctaag actcttgtgt 10200
gatgtatctt gaaaaaaaca agatcctaaa tctgaacctt tggttgtttg attgtttttc 10260
tcatttttgt tgtttatttg ttaagcgt 10288
<210>13
<211>13150
<212>DNA
<213>artificial sequence
<220>
<223>Recombinant ERA-2g3 rabies virus genome
<220>
<221>misc_feature
<222>(71)..(1420)
<220>
<221>misc_feature
<222>(1514)..(2404)
<220>
<221>misc_feature
<222>(2496)..(3101)
<220>
<221>misc_feature
<222>(3317)..(4888)
<220>
<221>misc_feature
<222>(4988)..(6559)
<220>
<221>misc_feature
<222>(6636)..(13016)
<400>13
acgcttaaca accagatcaa agaaaaaaca gacattgtca attgcaaagc aaaaatgtaa 60
cacccctaca atggatgccg acaagattgt attcaaagtc aataatcagg tggtctcttt 120
gaagcctgag attatcgtgg atcaacatga gtacaagtac cctgccatca aagatttgaa 180
aaagccctgt ataaccctag gaaaggctcc cgatttaaat aaagcataca agtcagtttt 240
gtcaggcatg agcgccgcca aacttgatcc tgacgatgta tgttcctatt tggcagcggc 300
aatgcagttt tttgagggga catgtccgga agactggacc agctatggaa tcgtgattgc 360
acgaaaagga gataagatca ccccaggttc tctggtggag ataaaacgta ctgatgtaga 420
agggaattgg gctctgacag gaggcatgga actgacaaga gaccccactg tccctgagca 480
tgcgtcctta gtcggtcttc tcttgagtct gtataggttg agcaaaatat ccgggcaaaa 540
cactggtaac tataagacaa acattgcaga caggatagag cagatttttg agacagcccc 600
ttttgttaaa atcgtggaac accatactct aatgacaact cacaaaatgt gtgctaattg 660
gagtactata ccaaacttca gatttttggc cggaacctat gacatgtttt tctcccggat 720
tgagcatcta tattcagcaa tcagagtggg cacagttgtc actgcttatg aagactgttc 780
aggactggta tcatttactg ggttcataaa acaaatcaat ctcaccgcta gagaggcaat 840
actatatttc ttccacaaga actttgagga agagataaga agaatgtttg agccagggca 900
ggagacagct gttcctcact cttatttcat ccacttccgt tcactaggct tgagtgggaa 960
atctccttat tcatcaaatg ctgttggtca cgtgttcaat ctcattcact ttgtaggatg 1020
ctatatgggt caagtcagat ccctaaatgc aacggttatt gctgcatgtg ctcctcatga 1080
aatgtctgtt ctagggggct atctgggaga ggaattcttc gggaaaggga catttgaaag 1140
aagattcttc agagatgaga aagaacttca agaatacgag gcggctgaac tgacaaagac 1200
tgacgtagca ctggcagatg atggaactgt caactctgac gacgaggact acttctcagg 1260
tgaaaccaga agtccggagg ctgtttatac tcgaatcatg atgaatggag gtcgactaaa 1320
gagatctcac atacggagat atgtctcagt cagttccaat catcaagccc gtccaaactc 1380
attcgccgag tttctaaaca agacatattc gagtgactca taagaagttg aataacaaaa 1440
tgccggaaat ctacggattg tgtatatcca tcatgaaaaa aactaacacc cctcctttcg 1500
aaccatccca aacatgagca agatctttgt caatcctagt gctattagag ccggtctggc 1560
cgatcttgag atggctgaag aaactgttga tctgatcaat agaaatatcg aagacaatca 1620
ggctcatctc caaggggaac ccatagaagt ggacaatctc cctgaggata tggggcgact 1680
tcacctggat gatggaaaat cgcccaaccc tggtgagatg gccaaggtgg gagaaggcaa 1740
gtatcgagag gactttcaga tggatgaagg agaggatcct agcttcctgt tccagtcata 1800
cctggaaaat gttggagtcc aaatagtcag acaaatgagg tcaggagaga gatttctcaa 1860
gatatggtca cagaccgtag aagagattat atcctatgtc gcggtcaact ttcccaaccc 1920
tccaggaaag tcttcagagg ataaatcaac ccagactact ggccgagagc tcaagaagga 1980
gacaacaccc actccttctc agagagaaag ccaatcatcg aaagccagga tggcggctca 2040
aattgcttct ggccctccag cccttgaatg gtcggccacc aatgaagagg atgatctatc 2100
agtggaggct gagatcgctc accagattgc agaaagtttc tccaaaaaat ataagtttcc 2160
ctctcgatcc tcagggatac tcttgtataa ttttgagcaa ttgaaaatga accttgatga 2220
tatagttaaa gaggcaaaaa atgtaccagg tgtgacccgt ttagcccatg acgggtccaa 2280
actcccccta agatgtgtac tgggatgggt cgctttggcc aactctaaga aattccagtt 2340
gttagtcgaa tccgacaagc tgagtaaaat catgcaagat gacttgaatc gctatacatc 2400
ttgctaaccg aacctctcca ctcagtccct ctagacaata aagtccgaga tgtcctaaag 2460
tcaacatgaa aaaaacaggc aacaccactg ataaaatgaa ctttctacgt aagatagtga 2520
aaaattgcag ggacgaggac actcaaaaac cctctcccgt gtcagcccct ctggatgacg 2580
atgacttgtg gcttccaccc cctgaatacg tcccgctgaa agaacttaca agcaagaaga 2640
acatgaggaa cttttgtatc aacggagggg ttaaagtgtg tagcccgaat ggttactcgt 2700
tcaggatcct gcggcacatt ctgaaatcat tcgacgagat atattctggg aatcatagga 2760
tgatcgggtt agtcaaagta gttattggac tggctttgtc aggatctcca gtccctgagg 2820
gcatgaactg ggtatacaaa ttgaggagaa cctttatctt ccagtgggct gattccaggg 2880
gccctcttga aggggaggag ttggaatact ctcaggagat cacttgggat gatgatactg 2940
agttcgtcgg attgcaaata agagtgattg caaaacagtg tcatatccag ggcagaatct 3000
ggtgtatcaa catgaacccg agagcatgtc aactatggtc tgacatgtct cttcagacac 3060
aaaggtccga agaggacaaa gattcctctc tgcttctaga ataatcagat tatatcccgc 3120
aaatttatca cttgtttacc tctggaggag agaacatatg ggctcaactc caacccttgg 3180
gagcaatata acaaaaaaca tgttatggtg ccattaaacc gctgcatttc atcaaagtca 3240
agttgattac ctttacattt tgatcctctt ggatgtgaaa aaaactatta acatccctca 3300
aaagactcaa ggaaagatgg ttcctcaggc tctcctgttt gtaccccttc tggtttttcc 3360
attgtgtttt gggaaattcc ctatttacac gataccagac aagcttggtc cctggagccc 3420
gattgacata catcacctca gctgcccaaa caatttggta gtggaggacg aaggatgcac 3480
caacctgtca gggttctcct acatggaact taaagttgga tacatcttag ccataaaaat 3540
gaacgggttc acttgcacag gcgttgtgac ggaggctgaa acctacacta acttcgttgg 3600
ttatgtcaca accacgttca aaagaaagca tttccgccca acaccagatg catgtagagc 3660
cgcgtacaac tggaagatgg ccggtgaccc cagatatgaa gagtctctac acaatccgta 3720
ccctgactac cactggcttc gaactgtaaa aaccaccaag gagtctctcg ttatcatatc 3780
tccaagtgtg gcagatttgg acccatatga cagatccctt cactcgaggg tcttccctag 3840
cgggaagtgc tcaggagtag cggtgtcttc tacctactgc tccactaacc acgattacac 3900
catttggatg cccgagaatc cgagactagg gatgtcttgt gacattttta ccaatagtag 3960
agggaagaga gcatccaaag ggagtgagac ttgcggcttt gtagatgaaa gaggcctata 4020
taagtcttta aaaggagcat gcaaactcaa gttatgtgga gttctaggac ttagacttat 4080
ggatggaaca tgggtcgcga tgcaaacatc aaatgaaacc aaatggtgcc ctcccgatca 4140
gttggtgaac ctgcacgact ttcgctcaga cgaaattgag caccttgttg tagaggagtt 4200
ggtcaggaag agagaggagt gtctggatgc actagagtcc atcatgacaa ccaagtcagt 4260
gagtttcaga cgtctcagtc atttaagaaa acttgtccct gggtttggaa aagcatatac 4320
catattcaac aagaccttga tggaagccga tgctcactac aagtcagtcg agacttggaa 4380
tgagatcctc ccttcaaaag ggtgtttaag agttgggggg aggtgtcatc ctcatgtgaa 4440
cggggtgttt ttcaatggta taatattagg acctgacggc aatgtcttaa tcccagagat 4500
gcaatcatcc ctcctccagc aacatatgga gttgttggaa tcctcggtta tcccccttgt 4560
gcaccccctg gcagacccgt ctaccgtttt caaggacggt gacgaggctg aggattttgt 4620
tgaagttcac cttcccgatg tgcacaatca ggtctcagga gttgacttgg gtctcccgaa 4680
ctgggggaag tatgtattac tgagtgcagg ggccctgact gccttgatgt tgataatttt 4740
cctgatgaca tgttgtagaa gagtcaatcg atcagaacct acgcaacaca atctcagagg 4800
gacagggagg gaggtgtcag tcactcccca aagcgggaag atcatatctt catgggaatc 4860
acacaagagt gggggtgaga ccagactgtg aggactggcc gtcctttcaa ctatccaagt 4920
cctgaagatc acctcccctt ggggggttca tcatgaaaaa aactaacacc cctcctttcg 4980
ctgcaggatg gttcctcagg ctctcctgtt tgtacccctt ctggtttttc cattgtgttt 5040
tgggaaattc cctatttaca cgataccaga caagcttggt ccctggagcc cgattgacat 5100
acatcacctc agctgcccaa acaatttggt agtggaggac gaaggatgca ccaacctgtc 5160
agggttctcc tacatggaac ttaaagttgg atacatctta gccataaaaa tgaacgggtt 5220
cacttgcaca ggcgttgtga cggaggctga aacctacact aacttcgttg gttatgtcac 5280
aaccacgttc aaaagaaagc atttccgccc aacaccagat gcatgtagag ccgcgtacaa 5340
ctggaagatg gccggtgacc ccagatatga agagtctcta cacaatccgt accctgacta 5400
ccactggctt cgaactgtaa aaaccaccaa ggagtctctc gttatcatat ctccaagtgt 5460
ggcagatttg gacccatatg acagatccct tcactcgagg gtcttcccta gcgggaagtg 5520
ctcaggagta gcggtgtctt ctacctactg ctccactaac cacgattaca ccatttggat 5580
gcccgagaat ccgagactag ggatgtcttg tgacattttt accaatagta gagggaagag 5640
agcatccaaa gggagtgaga cttgcggctt tgtagatgaa agaggcctat ataagtcttt 5700
aaaaggagca tgcaaactca agttatgtgg agttctagga cttagactta tggatggaac 5760
atgggtcgcg atgcaaacat caaatgaaac caaatggtgc cctcccgatc agttggtgaa 5820
cctgcacgac tttcgctcag acgaaattga gcaccttgtt gtagaggagt tggtcaggaa 5880
gagagaggag tgtctggatg cactagagtc catcatgaca accaagtcag tgagtttcag 5940
acgtctcagt catttaagaa aacttgtccc tgggtttgga aaagcatata ccatattcaa 6000
caagaccttg atggaagccg atgctcacta caagtcagtc gagacttgga atgagatcct 6060
cccttcaaaa gggtgtttaa gagttggggg gaggtgtcat cctcatgtga acggggtgtt 6120
tttcaatggt ataatattag gacctgacgg caatgtctta atcccagaga tgcaatcatc 6180
cctcctccag caacatatgg agttgttgga atcctcggtt atcccccttg tgcaccccct 6240
ggcagacccg tctaccgttt tcaaggacgg tgacgaggct gaggattttg ttgaagttca 6300
ccttcccgat gtgcacaatc aggtctcagg agttgacttg ggtctcccga actgggggaa 6360
gtatgtatta ctgagtgcag gggccctgac tgccttgatg ttgataattt tcctgatgac 6420
atgttgtaga agagtcaatc gatcagaacc tacgcaacac aatctcagag ggacagggag 6480
ggaggtgtca gtcactcccc aaagcgggaa gatcatatct tcatgggaat cacacaagag 6540
tgggggtgag accagactgt gaggtaccgt cgagaaaaaa acattagatc agaagaacaa 6600
ctggcaacac ttctcaacct gagacctact tcaagatgct cgatcctgga gaggtctatg 6660
atgaccctat tgacccaatc gagttagagg atgaacccag aggaaccccc actgtcccca 6720
acatcttgag gaactctgac tacaatctca actctccttt gatagaagat cctgctagac 6780
taatgttaga atggttaaaa acagggaata gaccttatcg gatgactcta acagacaatt 6840
gctccaggtc tttcagagtt ttgaaagatt atttcaagaa ggtagatttg ggttctctca 6900
aggtgggcgg aatggctgca cagtcaatga tttctctctg gttatatggt gcccactctg 6960
aatccaacag gagccggaga tgtataacag acttggccca tttctattcc aagtcgtccc 7020
ccatagagaa gctgttgaat ctcacgctag gaaatagagg gctgagaatc cccccagagg 7080
gagtgttaag ttgccttgag agggttgatt atgataatgc atttggaagg tatcttgcca 7140
acacgtattc ctcttacttg ttcttccatg taatcacctt atacatgaac gccctagact 7200
gggatgaaga aaagaccatc ctagcattat ggaaagattt aacctcagtg gacatcggga 7260
aggacttggt aaagttcaaa gaccaaatat ggggactgct gatcgtgaca aaggactttg 7320
tttactccca aagttccaat tgtctttttg acagaaacta cacacttatg ctaaaagatc 7380
ttttcttgtc tcgcttcaac tccttaatgg tcttgctctc tcccccagag ccccgatact 7440
cagatgactt gatatctcaa ctatgccagc tgtacattgc tggggatcaa gtcttgtcta 7500
tgtgtggaaa ctccggctat gaagtcatca aaatattgga gccatatgtc gtgaatagtt 7560
tagtccagag agcagaaaag tttaggcctc tcattcattc cttgggagac tttcctgtat 7620
ttataaaaga caaggtaagt caacttgaag agacgttcgg tccctgtgca agaaggttct 7680
ttagggctct ggatcaattc gacaacatac atgacttggt ttttgtgtat ggctgttaca 7740
ggcattgggg gcacccatat atagattatc gaaagggtct gtcaaaacta tatgatcagg 7800
ttcacattaa aaaagtgata gataagtcct accaggagtg cttagcaagc gacctagcca 7860
ggaggatcct tagatggggt tttgataagt actccaagtg gtatctggat tcaagattcc 7920
tagcccgaga ccaccccttg actccttata tcaaaaccca aacatggcca cccaaacata 7980
ttgtagactt ggtgggggat acatggcaca agctcccgat cacgcagatc tttgagattc 8040
ctgaatcaat ggatccgtca gaaatattgg atgacaaatc acattctttc accagaacga 8100
gactagcttc ttggctgtca gaaaaccgag ggggacctgt tcctagcgaa aaagttatta 8160
tcacggccct gtctaagccg cctgtcaatc cccgagagtt tctgaggtct atagacctcg 8220
gaggattgcc agatgaagac ttgataattg gcctcaagcc aaaggaacgg gaattgaaga 8280
ttgaaggtcg attctttgct ctaatgtcat ggaatctaag attgtatttt gtcatcactg 8340
aaaaactctt ggccaactac atcttgccac tttttgacgc gctgactatg acagacaacc 8400
tgaacaaggt gtttaaaaag ctgatcgaca gggtcaccgg gcaagggctt ttggactatt 8460
caagggtcac atatgcattt cacctggact atgaaaagtg gaacaaccat caaagattag 8520
agtcaacaga ggatgtattt tctgtcctag atcaagtgtt tggattgaag agagtgtttt 8580
ctagaacaca cgagtttttt caaaaggcct ggatctatta ttcagacaga tcagacctca 8640
tcgggttacg ggaggatcaa atatactgct tagatgcgtc caacggccca acctgttgga 8700
atggccagga tggcgggcta gaaggcttac ggcagaaggg ctggagtcta gtcagcttat 8760
tgatgataga tagagaatct caaatcagga acacaagaac caaaatacta gctcaaggag 8820
acaaccaggt tttatgtccg acatatatgt tgtcgccagg gctatctcaa gaggggctcc 8880
tctatgaatt ggagagaata tcaaggaatg cactttcgat atacagagcc gtcgaggaag 8940
gggcatctaa gctagggctg atcatcaaga aagaagagac catgtgtagt tatgacttcc 9000
tcatctatgg aaaaacccct ttgtttagag gtaacatatt ggtgcctgag tccaaaagat 9060
gggccagagt ctcttgcgtc tctaatgacc aaatagtcaa cctcgccaat ataatgtcga 9120
cagtgtccac caatgcgcta acagtggcac aacactctca atctttgatc aaaccgatga 9180
gggattttct gctcatgtca gtacaggcag tctttcacta cctgctattt agcccaatct 9240
taaagggaag agtttacaag attctgagcg ctgaagggga tagctttctc ctagccatgt 9300
caaggataat ctatctagat ccttctttgg gaggggtatc tggaatgtcc ctcggaagat 9360
tccatatacg acagttctca gaccctgtct ctgaagggtt atccttctgg agagagatct 9420
ggttaagctc ccacgagtcc tggattcacg cgttgtgtca agaggctgga aacccagatc 9480
ttggagagag aacactcgag agcttcactc gccttctaga agatcctacc accttaaata 9540
tcagaggagg ggccagtcct accattctac tcaaggatgc aatcagaaag gctttatatg 9600
acgaggtgga caaggtggag aattcagagt ttcgagaggc aatcctgttg tccaagaccc 9660
atagagataa ttttatactc ttcttaacat ctgttgagcc tctgtttcct cgatttctca 9720
gtgagctatt cagttcgtct tttttgggaa tccccgagtc aatcattgga ttgatacaaa 9780
actcccgaac gataagaagg cagtttagaa agagtctctc aaaaacttta gaagaatcct 9840
tctacaactc agagatccac gggattagtc ggatgaccca gacacctcag agggttgggg 9900
gggtgtggcc ttgctcttca gagagggcag atctacttag ggagatctct tggggaagaa 9960
aagtggtagg cacgacagtt cctcaccctt ctgagatgtt ggggttactt cccaagtcct 10020
ctatttcttg cacttgtgga gcaacaggag gaggcaatcc tagagtttct gtatcagtac 10080
tcccgtcctt tgatcagtca tttttttcac gaggccccct aaaggggtac ttgggctcgt 10140
ccacctctat gtcgacccag ctattccatg catgggaaaa agtcactaat gttcatgtgg 10200
tgaagagagc tctatcgtta aaagaatcta taaactggtt cattactaga gattccaact 10260
tggctcaagc tctaattagg aacattatgt ctctgacagg ccctgatttc cctctagagg 10320
aggcccctgt cttcaaaagg acggggtcag ccttgcatag gttcaagtct gccagataca 10380
gcgaaggagg gtattcttct gtctgcccga acctcctctc tcatatttct gttagtacag 10440
acaccatgtc tgatttgacc caagacggga agaactacga tttcatgttc cagccattga 10500
tgctttatgc acagacatgg acatcagagc tggtacagag agacacaagg ctaagagact 10560
ctacgtttca ttggcacctc cgatgcaaca ggtgtgtgag acccattgac gacgtgaccc 10620
tggagacctc tcagatcttc gagtttccgg atgtgtcgaa aagaatatcc agaatggttt 10680
ctggggctgt gcctcacttc cagaggcttc ccgatatccg tctgagacca ggagattttg 10740
aatctctaag cggtagagaa aagtctcacc atatcggatc agctcagggg ctcttatact 10800
caatcttagt ggcaattcac gactcaggat acaatgatgg aaccatcttc cctgtcaaca 10860
tatacgacaa ggtttcccct agagactatt tgagagggct cgcaagggga gtattgatag 10920
gatcctcgat ttgcttcttg acaagaatga caaatatcaa tattaataga cctcttgaat 10980
tgatctcagg ggtaatctca tatattctcc tgaggctaga taaccatccc tccttgtaca 11040
taatgctcag agaaccgtct cttagaggag agatattttc tatccctcag aaaatccccg 11100
ccgcttatcc aaccactatg aaagaaggca acagatcaat cttgtgttat ctccaacatg 11160
tgctacgcta tgagcgagag ataatcacgg cgtctccaga gaatgactgg ctatggatct 11220
tttcagactt tagaagtgcc aaaatgacgt acctaaccct cattacttac cagtctcatc 11280
ttctactcca gagggttgag agaaacctat ctaagagtat gagagataac ctgcgacaat 11340
tgagttcctt gatgaggcag gtgctgggcg ggcacggaga agatacctta gagtcagacg 11400
acaacattca acgactgcta aaagactctt tacgaaggac aagatgggtg gatcaagagg 11460
tgcgccatgc agctagaacc atgactggag attacagccc caacaagaag gtgtcccgta 11520
aggtaggatg ttcagaatgg gtctgctctg ctcaacaggt tgcagtctct acctcagcaa 11580
acccggcccc tgtctcggag cttgacataa gggccctctc taagaggttc cagaaccctt 11640
tgatctcggg cttgagagtg gttcagtggg caaccggtgc tcattataag cttaagccta 11700
ttctagatga tctcaatgtt ttcccatctc tctgccttgt agttggggac gggtcagggg 11760
ggatatcaag ggcagtcctc aacatgtttc cagatgccaa gcttgtgttc aacagtctct 11820
tagaggtgaa tgacctgatg gcttccggaa cacatccact gcctccttca gcaatcatga 11880
ggggaggaaa tgatatcgtc tccagagtga tagattttga ctcaatctgg gaaaaaccgt 11940
ccgacttgag aaacttggca acctggaaat acttccagtc agtccaaaag caggtcaaca 12000
tgtcctatga cctcattatt tgcgatgcag aagttactga cattgcatct atcaaccgga 12060
taaccctgtt aatgtccgat tttgcattgt ctatagatgg accactctat ttggtcttca 12120
aaacttatgg gactatgcta gtaaatccaa actacaaggc tattcaacac ctgtcaagag 12180
cgttcccctc ggtcacaggg tttatcaccc aagtaacttc gtctttttca tctgagctct 12240
acctccgatt ctccaaacga gggaagtttt tcagagatgc tgagtacttg acctcttcca 12300
cccttcgaga aatgagcctt gtgttattca attgtagcag ccccaagagt gagatgcaga 12360
gagctcgttc cttgaactat caggatcttg tgagaggatt tcctgaagaa atcatatcaa 12420
atccttacaa tgagatgatc ataactctga ttgacagtga tgtagaatct tttctagtcc 12480
acaagatggt tgatgatctt gagttacaga ggggaactct gtctaaagtg gctatcatta 12540
tagccatcat gatagttttc tccaacagag tcttcaacgt ttccaaaccc ctaactgacc 12600
ccttgttcta tccaccgtct gatcccaaaa tcctgaggca cttcaacata tgttgcagta 12660
ctatgatgta tctatctact gctttaggtg acgtccctag cttcgcaaga cttcacgacc 12720
tgtataacag acctataact tattacttca gaaagcaatt cattcgaggg aacgtttatc 12780
tatcttggag ttggtccaac gacacctcag tgttcaaaag ggtagcctgt aattctagcc 12840
tgagtctgtc atctcactgg atcaggttga tttacaagat agtgaagact accagactcg 12900
ttggcagcat caaggatcta tccagagaag tggaaagaca ccttcatagg tacaacaggt 12960
ggatcaccct agaggatatc agatctagat catccctact agactacagt tgcctgtgat 13020
ccggatactc ctggaagcct gcccatgcta agactcttgt gtgatgtatc ttgaaaaaaa 13080
caagatccta aatctgaacc tttggttgtt tgattgtttt tctcattttt gttgtttatt 13140
tgttaagcgt 13150
<210>14
<211>11976
<212>DNA
<213>Artificial Sequence
<220>
<223>Recombinant ERA-pt rabies virus genome
<220>
<221>misc_feature
<222>(71)..(1420)
<220>
<221>misc_feature
<222>(1514)..(2404)
<220>
<221>misc_feature
<222>(2542)..(3147)
<220>
<221>misc_feature
<222>(3363)..(4934)
<220>
<221>misc_feature
<222>(5462)..(11842)
<400>14
acgcttaaca accagatcaa agaaaaaaca gacattgtca attgcaaagc aaaaatgtaa 60
cacccctaca atggatgccg acaagattgt attcaaagtc aataatcagg tggtctcttt 120
gaagcctgag attatcgtgg atcaacatga gtacaagtac cctgccatca aagatttgaa 180
aaagccctgt ataaccctag gaaaggctcc cgatttaaat aaagcataca agtcagtttt 240
gtcaggcatg agcgccgcca aacttgatcc tgacgatgta tgttcctatt tggcagcggc 300
aatgcagttt tttgagggga catgtccgga agactggacc agctatggaa tcgtgattgc 360
acgaaaagga gataagatca ccccaggttc tctggtggag ataaaacgta ctgatgtaga 420
agggaattgg gctctgacag gaggcatgga actgacaaga gaccccactg tccctgagca 480
tgcgtcctta gtcggtcttc tcttgagtct gtataggttg agcaaaatat ccgggcaaaa 540
cactggtaac tataagacaa acattgcaga caggatagag cagatttttg agacagcccc 600
ttttgttaaa atcgtggaac accatactct aatgacaact cacaaaatgt gtgctaattg 660
gagtactata ccaaacttca gatttttggc cggaacctat gacatgtttt tctcccggat 720
tgagcatcta tattcagcaa tcagagtggg cacagttgtc actgcttatg aagactgttc 780
aggactggta tcatttactg ggttcataaa acaaatcaat ctcaccgcta gagaggcaat 840
actatatttc ttccacaaga actttgagga agagataaga agaatgtttg agccagggca 900
ggagacagct gttcctcact cttatttcat ccacttccgt tcactaggct tgagtgggaa 960
atctccttat tcatcaaatg ctgttggtca cgtgttcaat ctcattcact ttgtaggatg 1020
ctatatgggt caagtcagat ccctaaatgc aacggttatt gctgcatgtg ctcctcatga 1080
aatgtctgtt ctagggggct atctgggaga ggaattcttc gggaaaggga catttgaaag 1140
aagattcttc agagatgaga aagaacttca agaatacgag gcggctgaac tgacaaagac 1200
tgacgtagca ctggcagatg atggaactgt caactctgac gacgaggact acttctcagg 1260
tgaaaccaga agtccggagg ctgtttatac tcgaatcatg atgaatggag gtcgactaaa 1320
gagatctcac atacggagat atgtctcagt cagttccaat catcaagccc gtccaaactc 1380
attcgccgag tttctaaaca agacatattc gagtgactca taagaagttg aataacaaaa 1440
tgccggaaat ctacggattg tgtatatcca tcatgaaaaa aactaacacc cctcctttcg 1500
aaccatccca aacatgagca agatctttgt caatcctagt gctattagag ccggtctggc 1560
cgatcttgag atggctgaag aaactgttga tctgatcaat agaaatatcg aagacaatca 1620
ggctcatctc caaggggaac ccatagaagt ggacaatctc cctgaggata tggggcgact 1680
tcacctggat gatggaaaat cgcccaaccc tggtgagatg gccaaggtgg gagaaggcaa 1740
gtatcgagag gactttcaga tggatgaagg agaggatcct agcttcctgt tccagtcata 1800
cctggaaaat gttggagtcc aaatagtcag acaaatgagg tcaggagaga gatttctcaa 1860
gatatggtca cagaccgtag aagagattat atcctatgtc gcggtcaact ttcccaaccc 1920
tccaggaaag tcttcagagg ataaatcaac ccagactact ggccgagagc tcaagaagga 1980
gacaacaccc actccttctc agagagaaag ccaatcatcg aaagccagga tggcggctca 2040
aattgcttct ggccctccag cccttgaatg gtcggccacc aatgaagagg atgatctatc 2100
agtggaggct gagatcgctc accagattgc agaaagtttc tccaaaaaat ataagtttcc 2160
ctctcgatcc tcagggatac tcttgtataa ttttgagcaa ttgaaaatga accttgatga 2220
tatagttaaa gaggcaaaaa atgtaccagg tgtgacccgt ttagcccatg acgggtccaa 2280
actcccccta agatgtgtac tgggatgggt cgctttggcc aactctaaga aattccagtt 2340
gttagtcgaa tccgacaagc tgagtaaaat catgcaagat gacttgaatc gctatacatc 2400
ttgctaaccg aacctctcca ctcagtccct ctagacaata aagtccgaga tgtcctaaag 2460
tcaacatgaa aaaaactaac acccctcctt tcgctgcagt ttggtaccgt cgagaaaaaa 2520
acaggcaaca ccactgataa aatgaacttt ctacgtaaga tagtgaaaaa ttgcagggac 2580
gaggacactc aaaaaccctc tcccgtgtca gcccctctgg atgacgatga cttgtggctt 2640
ccaccccctg aatacgtccc gctgaaagaa cttacaagca agaagaacat gaggaacttt 2700
tgtatcaacg gaggggttaa agtgtgtagc ccgaatggtt actcgttcag gatcctgcgg 2760
cacattctga aatcattcga cgagatatat tctgggaatc ataggatgat cgggttagtc 2820
aaagtagtta ttggactggc tttgtcagga tctccagtcc ctgagggcat gaactgggta 2880
tacaaattga ggagaacctt tatcttccag tgggctgatt ccaggggccc tcttgaaggg 2940
gaggagttgg aatactctca ggagatcact tgggatgatg atactgagtt cgtcggattg 3000
caaataagag tgattgcaaa acagtgtcat atccagggca gaatctggtg tatcaacatg 3060
aacccgagag catgtcaact atggtctgac atgtctcttc agacacaaag gtccgaagag 3120
gacaaagatt cctctctgct tctagaataa tcagattata tcccgcaaat ttatcacttg 3180
tttacctctg gaggagagaa catatgggct caactccaac ccttgggagc aatataacaa 3240
aaaacatgtt atggtgccat taaaccgctg catttcatca aagtcaagtt gattaccttt 3300
acattttgat cctcttggat gtgaaaaaaa ctattaacat ccctcaaaag actcaaggaa 3360
agatggttcc tcaggctctc ctgtttgtac cccttctggt ttttccattg tgttttggga 3420
aattccctat ttacacgata ccagacaagc ttggtccctg gagcccgatt gacatacatc 3480
acctcagctg cccaaacaat ttggtagtgg aggacgaagg atgcaccaac ctgtcagggt 3540
tctcctacat ggaacttaaa gttggataca tcttagccat aaaaatgaac gggttcactt 3600
gcacaggcgt tgtgacggag gctgaaacct acactaactt cgttggttat gtcacaacca 3660
cgttcaaaag aaagcatttc cgcccaacac cagatgcatg tagagccgcg tacaactgga 3720
agatggccgg tgaccccaga tatgaagagt ctctacacaa tccgtaccct gactaccact 3780
ggcttcgaac tgtaaaaacc accaaggagt ctctcgttat catatctcca agtgtggcag 3840
atttggaccc atatgacaga tcccttcact cgagggtctt ccctagcggg aagtgctcag 3900
gagtagcggt gtcttctacc tactgctcca ctaaccacga ttacaccatt tggatgcccg 3960
agaatccgag actagggatg tcttgtgaca tttttaccaa tagtagaggg aagagagcat 4020
ccaaagggag tgagacttgc ggctttgtag atgaaagagg cctatataag tctttaaaag 4080
gagcatgcaa actcaagtta tgtggagttc taggacttag acttatggat ggaacatggg 4140
tcgcgatgca aacatcaaat gaaaccaaat ggtgccctcc cgatcagttg gtgaacctgc 4200
acgactttcg ctcagacgaa attgagcacc ttgttgtaga ggagttggtc aggaagagag 4260
aggagtgtct ggatgcacta gagtccatca tgacaaccaa gtcagtgagt ttcagacgtc 4320
tcagtcattt aagaaaactt gtccctgggt ttggaaaagc atataccata ttcaacaaga 4380
ccttgatgga agccgatgct cactacaagt cagtcgagac ttggaatgag atcctccctt 4440
caaaagggtg tttaagagtt ggggggaggt gtcatcctca tgtgaacggg gtgtttttca 4500
atggtataat attaggacct gacggcaatg tcttaatccc agagatgcaa tcatccctcc 4560
tccagcaaca tatggagttg ttggaatcct cggttatccc ccttgtgcac cccctggcag 4620
acccgtctac cgttttcaag gacggtgacg aggctgagga ttttgttgaa gttcaccttc 4680
ccgatgtgca caatcaggtc tcaggagttg acttgggtct cccgaactgg gggaagtatg 4740
tattactgag tgcaggggcc ctgactgcct tgatgttgat aattttcctg atgacatgtt 4800
gtagaagagt caatcgatca gaacctacgc aacacaatct cagagggaca gggagggagg 4860
tgtcagtcac tccccaaagc gggaagatca tatcttcatg ggaatcacac aagagtgggg 4920
gtgagaccag actgtgagga ctggccgtcc tttcaactat ccaagtcctg aagatcacct 4980
ccccttgggg ggttcttttt gaaaaaaacc tgggttcaat agtcctcctt gaactccatg 5040
caactgggta gattcaagag tcatgagatt ttcattaatc ctctcagttg atcaagcaag 5100
atcatgtaga ttctcataat aggggagatc ttctagcagt ttcagtgact aacggtactt 5160
tcattctcca ggaactgaca ccaacagttg tagacaaacc acggggtgtc tcgggtgact 5220
ctgtgcttgg gcacagacaa aggtcatggt gtgttccatg atagcggact caggatgagt 5280
taattgagag aggcagtctt cctcccgtga aggacataag cagtagctca caatcatctc 5340
gcgtctcagc aaagtgtgca taattataaa gtgctgggtc atctaagctt ttcagtcgag 5400
aaaaaaacat tagatcagaa gaacaactgg caacacttct caacctgaga cctacttcaa 5460
gatgctcgat cctggagagg tctatgatga ccctattgac ccaatcgagt tagaggatga 5520
acccagagga acccccactg tccccaacat cttgaggaac tctgactaca atctcaactc 5580
tcctttgata gaagatcctg ctagactaat gttagaatgg ttaaaaacag ggaatagacc 5640
ttatcggatg actctaacag acaattgctc caggtctttc agagttttga aagattattt 5700
caagaaggta gatttgggtt ctctcaaggt gggcggaatg gctgcacagt caatgatttc 5760
tctctggtta tatggtgccc actctgaatc caacaggagc cggagatgta taacagactt 5820
ggcccatttc tattccaagt cgtcccccat agagaagctg ttgaatctca cgctaggaaa 5880
tagagggctg agaatccccc cagagggagt gttaagttgc cttgagaggg ttgattatga 5940
taatgcattt ggaaggtatc ttgccaacac gtattcctct tacttgttct tccatgtaat 6000
caccttatac atgaacgccc tagactggga tgaagaaaag accatcctag cattatggaa 6060
agatttaacc tcagtggaca tcgggaagga cttggtaaag ttcaaagacc aaatatgggg 6120
actgctgatc gtgacaaagg actttgttta ctcccaaagt tccaattgtc tttttgacag 6180
aaactacaca cttatgctaa aagatctttt cttgtctcgc ttcaactcct taatggtctt 6240
gctctctccc ccagagcccc gatactcaga tgacttgata tctcaactat gccagctgta 6300
cattgctggg gatcaagtct tgtctatgtg tggaaactcc ggctatgaag tcatcaaaat 6360
attggagcca tatgtcgtga atagtttagt ccagagagca gaaaagttta ggcctctcat 6420
tcattccttg ggagactttc ctgtatttat aaaagacaag gtaagtcaac ttgaagagac 6480
gttcggtccc tgtgcaagaa ggttctttag ggctctggat caattcgaca acatacatga 6540
cttggttttt gtgtatggct gttacaggca ttgggggcac ccatatatag attatcgaaa 6600
gggtctgtca aaactatatg atcaggttca cattaaaaaa gtgatagata agtcctacca 6660
ggagtgctta gcaagcgacc tagccaggag gatccttaga tggggttttg ataagtactc 6720
caagtggtat ctggattcaa gattcctagc ccgagaccac cccttgactc cttatatcaa 6780
aacccaaaca tggccaccca aacatattgt agacttggtg ggggatacat ggcacaagct 6840
cccgatcacg cagatctttg agattcctga atcaatggat ccgtcagaaa tattggatga 6900
caaatcacat tctttcacca gaacgagact agcttcttgg ctgtcagaaa accgaggggg 6960
acctgttcct agcgaaaaag ttattatcac ggccctgtct aagccgcctg tcaatccccg 7020
agagtttctg aggtctatag acctcggagg attgccagat gaagacttga taattggcct 7080
caagccaaag gaacgggaat tgaagattga aggtcgattc tttgctctaa tgtcatggaa 7140
tctaagattg tattttgtca tcactgaaaa actcttggcc aactacatct tgccactttt 7200
tgacgcgctg actatgacag acaacctgaa caaggtgttt aaaaagctga tcgacagggt 7260
caccgggcaa gggcttttgg actattcaag ggtcacatat gcatttcacc tggactatga 7320
aaagtggaac aaccatcaaa gattagagtc aacagaggat gtattttctg tcctagatca 7380
agtgtttgga ttgaagagag tgttttctag aacacacgag ttttttcaaa aggcctggat 7440
ctattattca gacagatcag acctcatcgg gttacgggag gatcaaatat actgcttaga 7500
tgcgtccaac ggcccaacct gttggaatgg ccaggatggc gggctagaag gcttacggca 7560
gaagggctgg agtctagtca gcttattgat gatagataga gaatctcaaa tcaggaacac 7620
aagaaccaaa atactagctc aaggagacaa ccaggtttta tgtccgacat atatgttgtc 7680
gccagggcta tctcaagagg ggctcctcta tgaattggag agaatatcaa ggaatgcact 7740
ttcgatatac agagccgtcg aggaaggggc atctaagcta gggctgatca tcaagaaaga 7800
agagaccatg tgtagttatg acttcctcat ctatggaaaa acccctttgt ttagaggtaa 7860
catattggtg cctgagtcca aaagatgggc cagagtctct tgcgtctcta atgaccaaat 7920
agtcaacctc gccaatataa tgtcgacagt gtccaccaat gcgctaacag tggcacaaca 7980
ctctcaatct ttgatcaaac cgatgaggga ttttctgctc atgtcagtac aggcagtctt 8040
tcactacctg ctatttagcc caatcttaaa gggaagagtt tacaagattc tgagcgctga 8100
aggggatagc tttctcctag ccatgtcaag gataatctat ctagatcctt ctttgggagg 8160
ggtatctgga atgtccctcg gaagattcca tatacgacag ttctcagacc ctgtctctga 8220
agggttatcc ttctggagag agatctggtt aagctcccac gagtcctgga ttcacgcgtt 8280
gtgtcaagag gctggaaacc cagatcttgg agagagaaca ctcgagagct tcactcgcct 8340
tctagaagat cctaccacct taaatatcag aggaggggcc agtcctacca ttctactcaa 8400
ggatgcaatc agaaaggctt tatatgacga ggtggacaag gtggagaatt cagagtttcg 8460
agaggcaatc ctgttgtcca agacccatag agataatttt atactcttct taacatctgt 8520
tgagcctctg tttcctcgat ttctcagtga gctattcagt tcgtcttttt tgggaatccc 8580
cgagtcaatc attggattga tacaaaactc ccgaacgata agaaggcagt ttagaaagag 8640
tctctcaaaa actttagaag aatccttcta caactcagag atccacggga ttagtcggat 8700
gacccagaca cctcagaggg ttgggggggt gtggccttgc tcttcagaga gggcagatct 8760
acttagggag atctcttggg gaagaaaagt ggtaggcacg acagttcctc acccttctga 8820
gatgttgggg ttacttccca agtcctctat ttcttgcact tgtggagcaa caggaggagg 8880
caatcctaga gtttctgtat cagtactccc gtcctttgat cagtcatttt tttcacgagg 8940
ccccctaaag gggtacttgg gctcgtccac ctctatgtcg acccagctat tccatgcatg 9000
ggaaaaagtc actaatgttc atgtggtgaa gagagctcta tcgttaaaag aatctataaa 9060
ctggttcatt actagagatt ccaacttggc tcaagctcta attaggaaca ttatgtctct 9120
gacaggccct gatttccctc tagaggaggc ccctgtcttc aaaaggacgg ggtcagcctt 9180
gcataggttc aagtctgcca gatacagcga aggagggtat tcttctgtct gcccgaacct 9240
cctctctcat atttctgtta gtacagacac catgtctgat ttgacccaag acgggaagaa 9300
ctacgatttc atgttccagc cattgatgct ttatgcacag acatggacat cagagctggt 9360
acagagagac acaaggctaa gagactctac gtttcattgg cacctccgat gcaacaggtg 9420
tgtgagaccc attgacgacg tgaccctgga gacctctcag atcttcgagt ttccggatgt 9480
gtcgaaaaga atatccagaa tggtttctgg ggctgtgcct cacttccaga ggcttcccga 9540
tatccgtctg agaccaggag attttgaatc tctaagcggt agagaaaagt ctcaccatat 9600
cggatcagct caggggctct tatactcaat cttagtggca attcacgact caggatacaa 9660
tgatggaacc atcttccctg tcaacatata cgacaaggtt tcccctagag actatttgag 9720
agggctcgca aggggagtat tgataggatc ctcgatttgc ttcttgacaa gaatgacaaa 9780
tatcaatatt aatagacctc ttgaattgat ctcaggggta atctcatata ttctcctgag 9840
gctagataac catccctcct tgtacataat gctcagagaa ccgtctctta gaggagagat 9900
attttctatc cctcagaaaa tccccgccgc ttatccaacc actatgaaag aaggcaacag 9960
atcaatcttg tgttatctcc aacatgtgct acgctatgag cgagagataa tcacggcgtc 10020
tccagagaat gactggctat ggatcttttc agactttaga agtgccaaaa tgacgtacct 10080
aaccctcatt acttaccagt ctcatcttct actccagagg gttgagagaa acctatctaa 10140
gagtatgaga gataacctgc gacaattgag ttccttgatg aggcaggtgc tgggcgggca 10200
cggagaagat accttagagt cagacgacaa cattcaacga ctgctaaaag actctttacg 10260
aaggacaaga tgggtggatc aagaggtgcg ccatgcagct agaaccatga ctggagatta 10320
cagccccaac aagaaggtgt cccgtaaggt aggatgttca gaatgggtct gctctgctca 10380
acaggttgca gtctctacct cagcaaaccc ggcccctgtc tcggagcttg acataagggc 10440
cctctctaag aggttccaga accctttgat ctcgggcttg agagtggttc agtgggcaac 10500
cggtgctcat tataagctta agcctattct agatgatctc aatgttttcc catctctctg 10560
ccttgtagtt ggggacgggt caggggggat atcaagggca gtcctcaaca tgtttccaga 10620
tgccaagctt gtgttcaaca gtctcttaga ggtgaatgac ctgatggctt ccggaacaca 10680
tccactgcct ccttcagcaa tcatgagggg aggaaatgat atcgtctcca gagtgataga 10740
ttttgactca atctgggaaa aaccgtccga cttgagaaac ttggcaacct ggaaatactt 10800
ccagtcagtc caaaagcagg tcaacatgtc ctatgacctc attatttgcg atgcagaagt 10860
tactgacatt gcatctatca accggataac cctgttaatg tccgattttg cattgtctat 10920
agatggacca ctctatttgg tcttcaaaac ttatgggact atgctagtaa atccaaacta 10980
caaggctatt caacacctgt caagagcgtt cccctcggtc acagggttta tcacccaagt 11040
aacttcgtct ttttcatctg agctctacct ccgattctcc aaacgaggga agtttttcag 11100
agatgctgag tacttgacct cttccaccct tcgagaaatg agccttgtgt tattcaattg 11160
tagcagcccc aagagtgaga tgcagagagc tcgttccttg aactatcagg atcttgtgag 11220
aggatttcct gaagaaatca tatcaaatcc ttacaatgag atgatcataa ctctgattga 11280
cagtgatgta gaatcttttc tagtccacaa gatggttgat gatcttgagt tacagagggg 11340
aactctgtct aaagtggcta tcattatagc catcatgata gttttctcca acagagtctt 11400
caacgtttcc aaacccctaa ctgacccctt gttctatcca ccgtctgatc ccaaaatcct 11460
gaggcacttc aacatatgtt gcagtactat gatgtatcta tctactgctt taggtgacgt 11520
ccctagcttc gcaagacttc acgacctgta taacagacct ataacttatt acttcagaaa 11580
gcaattcatt cgagggaacg tttatctatc ttggagttgg tccaacgaca cctcagtgtt 11640
caaaagggta gcctgtaatt ctagcctgag tctgtcatct cactggatca ggttgattta 11700
caagatagtg aagactacca gactcgttgg cagcatcaag gatctatcca gagaagtgga 11760
aagacacctt cataggtaca acaggtggat caccctagag gatatcagat ctagatcatc 11820
cctactagac tacagttgcc tgtgatccgg atactcctgg aagcctgccc atgctaagac 11880
tcttgtgtga tgtatcttga aaaaaacaag atcctaaatc tgaacctttg gttgtttgat 11940
tgtttttctc atttttgttg tttatttgtt aagcgt 11976
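The feature coordinates given for SEQ ID NO: 14 can be checked with a short script. The Python sketch below is not part of the original listing; the names `parse_location` and `summarize_features` are hypothetical helpers introduced for illustration only. It reads the five <222> location lines as 1-based inclusive coordinates, confirms that each range lies within the <211> length of 11976, and reports whether each range spans a whole number of codons. That is what one would expect if the ranges mark coding regions, although the listing itself labels them only as misc_feature.

```python
import re

# <222> location lines copied from the SEQ ID NO: 14 feature table.
FEATURE_LINES = [
    "<222>(71)..(1420)",
    "<222>(1514)..(2404)",
    "<222>(2542)..(3147)",
    "<222>(3363)..(4934)",
    "<222>(5462)..(11842)",
]
SEQ_LENGTH = 11976  # value of <211> for SEQ ID NO: 14

# Matches "<222>(start)..(end)" and captures both coordinates.
LOCATION_RE = re.compile(r"<222>\s*\((\d+)\)\.\.\((\d+)\)")


def parse_location(line):
    """Return (start, end) from a <222> line, as 1-based inclusive positions."""
    match = LOCATION_RE.fullmatch(line.strip())
    if match is None:
        raise ValueError(f"not a <222> location line: {line!r}")
    return int(match.group(1)), int(match.group(2))


def summarize_features(lines, seq_length):
    """Compute the length and codon count of each feature range."""
    rows = []
    for line in lines:
        start, end = parse_location(line)
        if not 1 <= start <= end <= seq_length:
            raise ValueError(f"range {start}..{end} falls outside 1..{seq_length}")
        length = end - start + 1  # inclusive coordinates
        rows.append(
            {
                "start": start,
                "end": end,
                "length": length,
                "codons": length // 3,
                "in_frame": length % 3 == 0,
            }
        )
    return rows


FEATURES = summarize_features(FEATURE_LINES, SEQ_LENGTH)

if __name__ == "__main__":
    for row in FEATURES:
        print(row)
```

Under these assumptions, the five ranges are 1350, 891, 606, 1572 and 6381 nucleotides long, and each is a multiple of three.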
<210>15
<211>12662
<212>DNA
<213>Artificial Sequence
<220>
<223>Recombinant ERA-pt-GFP rabies virus genome
<220>
<221>misc_feature
<222>(71)..(1420)
<220>
<221>misc_feature
<222>(1514)..(2404)
<220>
<221>misc_feature
<222>(2505)..(3185)
<220>
<221>misc_feature
<222>(3228)..(3833)
<220>
<221>misc_feature
<222>(4049)..(5620)
<220>
<221>misc_feature
<222>(6148)..(12528)
<400>15
acgcttaaca accagatcaa agaaaaaaca gacattgtca attgcaaagc aaaaatgtaa 60
cacccctaca atggatgccg acaagattgt attcaaagtc aataatcagg tggtctcttt 120
gaagcctgag attatcgtgg atcaacatga gtacaagtac cctgccatca aagatttgaa 180
aaagccctgt ataaccctag gaaaggctcc cgatttaaat aaagcataca agtcagtttt 240
gtcaggcatg agcgccgcca aacttgatcc tgacgatgta tgttcctatt tggcagcggc 300
aatgcagttt tttgagggga catgtccgga agactggacc agctatggaa tcgtgattgc 360
acgaaaagga gataagatca ccccaggttc tctggtggag ataaaacgta ctgatgtaga 420
agggaattgg gctctgacag gaggcatgga actgacaaga gaccccactg tccctgagca 480
tgcgtcctta gtcggtcttc tcttgagtct gtataggttg agcaaaatat ccgggcaaaa 540
cactggtaac tataagacaa acattgcaga caggatagag cagatttttg agacagcccc 600
ttttgttaaa atcgtggaac accatactct aatgacaact cacaaaatgt gtgctaattg 660
gagtactata ccaaacttca gatttttggc cggaacctat gacatgtttt tctcccggat 720
tgagcatcta tattcagcaa tcagagtggg cacagttgtc actgcttatg aagactgttc 780
aggactggta tcatttactg ggttcataaa acaaatcaat ctcaccgcta gagaggcaat 840
actatatttc ttccacaaga actttgagga agagataaga agaatgtttg agccagggca 900
ggagacagct gttcctcact cttatttcat ccacttccgt tcactaggct tgagtgggaa 960
atctccttat tcatcaaatg ctgttggtca cgtgttcaat ctcattcact ttgtaggatg 1020
ctatatgggt caagtcagat ccctaaatgc aacggttatt gctgcatgtg ctcctcatga 1080
aatgtctgtt ctagggggct atctgggaga ggaattcttc gggaaaggga catttgaaag 1140
aagattcttc agagatgaga aagaacttca agaatacgag gcggctgaac tgacaaagac 1200
tgacgtagca ctggcagatg atggaactgt caactctgac gacgaggact acttctcagg 1260
tgaaaccaga agtccggagg ctgtttatac tcgaatcatg atgaatggag gtcgactaaa 1320
gagatctcac atacggagat atgtctcagt cagttccaat catcaagccc gtccaaactc 1380
attcgccgag tttctaaaca agacatattc gagtgactca taagaagttg aataacaaaa 1440
tgccggaaat ctacggattg tgtatatcca tcatgaaaaa aactaacacc cctcctttcg 1500
aaccatccca aacatgagca agatctttgt caatcctagt gctattagag ccggtctggc 1560
cgatcttgag atggctgaag aaactgttga tctgatcaat agaaatatcg aagacaatca 1620
ggctcatctc caaggggaac ccatagaagt ggacaatctc cctgaggata tggggcgact 1680
tcacctggat gatggaaaat cgcccaaccc tggtgagatg gccaaggtgg gagaaggcaa 1740
gtatcgagag gactttcaga tggatgaagg agaggatcct agcttcctgt tccagtcata 1800
cctggaaaat gttggagtcc aaatagtcag acaaatgagg tcaggagaga gatttctcaa 1860
gatatggtca cagaccgtag aagagattat atcctatgtc gcggtcaact ttcccaaccc 1920
tccaggaaag tcttcagagg ataaatcaac ccagactact ggccgagagc tcaagaagga 1980
gacaacaccc actccttctc agagagaaag ccaatcatcg aaagccagga tggcggctca 2040
aattgcttct ggccctccag cccttgaatg gtcggccacc aatgaagagg atgatctatc 2100
agtggaggct gagatcgctc accagattgc agaaagtttc tccaaaaaat ataagtttcc 2160
ctctcgatcc tcagggatac tcttgtataa ttttgagcaa ttgaaaatga accttgatga 2220
tatagttaaa gaggcaaaaa atgtaccagg tgtgacccgt ttagcccatg acgggtccaa 2280
actcccccta agatgtgtac tgggatgggt cgctttggcc aactctaaga aattccagtt 2340
gttagtcgaa tccgacaagc tgagtaaaat catgcaagat gacttgaatc gctatacatc 2400
ttgctaaccg aacctctcca ctcagtccct ctagacaata aagtccgaga tgtcctaaag 2460
tcaacatgaa aaaaactaac acccctcctt tcgctgcagc caccatgggc gtgatcaagc 2520
ccgacatgaa gatcaagctg cggatggagg gcgccgtgaa cggccacaaa ttcgtgatcg 2580
agggcgacgg gaaaggcaag ccctttgagg gtaagcagac tatggacctg accgtgatcg 2640
agggcgcccc cctgcccttc gcttatgaca ttctcaccac cgtgttcgac tacggtaacc 2700
gtgtcttcgc caagtacccc aaggacatcc ctgactactt caagcagacc ttccccgagg 2760
gctactcgtg ggagcgaagc atgacatacg aggaccaggg aatctgtatc gctacaaacg 2820
acatcaccat gatgaagggt gtggacgact gcttcgtgta caaaatccgc ttcgacgggg 2880
tcaacttccc tgctaatggc ccggtgatgc agcgcaagac cctaaagtgg gagcccagta 2940
ccgagaagat gtacgtgcgg gacggcgtac tgaagggcga tgttaatatg gcactgctct 3000
tggagggagg cggccactac cgctgcgact tcaagaccac ctacaaagcc aagaaggtgg 3060
tgcagcttcc cgactaccac ttcgtggacc accgcatcga gatcgtgagc cacgacaagg 3120
actacaacaa agtcaagctg tacgagcacg ccgaagccca cagcggacta ccccgccagg 3180
ccggctaagg taccgtcgag aaaaaaacag gcaacaccac tgataaaatg aactttctac 3240
gtaagatagt gaaaaattgc agggacgagg acactcaaaa accctctccc gtgtcagccc 3300
ctctggatga cgatgacttg tggcttccac cccctgaata cgtcccgctg aaagaactta 3360
caagcaagaa gaacatgagg aacttttgta tcaacggagg ggttaaagtg tgtagcccga 3420
atggttactc gttcaggatc ctgcggcaca ttctgaaatc attcgacgag atatattctg 3480
ggaatcatag gatgatcggg ttagtcaaag tagttattgg actggctttg tcaggatctc 3540
cagtccctga gggcatgaac tgggtataca aattgaggag aacctttatc ttccagtggg 3600
ctgattccag gggccctctt gaaggggagg agttggaata ctctcaggag atcacttggg 3660
atgatgatac tgagttcgtc ggattgcaaa taagagtgat tgcaaaacag tgtcatatcc 3720
agggcagaat ctggtgtatc aacatgaacc cgagagcatg tcaactatgg tctgacatgt 3780
ctcttcagac acaaaggtcc gaagaggaca aagattcctc tctgcttcta gaataatcag 3840
attatatccc gcaaatttat cacttgttta cctctggagg agagaacata tgggctcaac 3900
tccaaccctt gggagcaata taacaaaaaa catgttatgg tgccattaaa ccgctgcatt 3960
tcatcaaagt caagttgatt acctttacat tttgatcctc ttggatgtga aaaaaactat 4020
taacatccct caaaagactc aaggaaagat ggttcctcag gctctcctgt ttgtacccct 4080
tctggttttt ccattgtgtt ttgggaaatt ccctatttac acgataccag acaagcttgg 4140
tccctggagc ccgattgaca tacatcacct cagctgccca aacaatttgg tagtggagga 4200
cgaaggatgc accaacctgt cagggttctc ctacatggaa cttaaagttg gatacatctt 4260
agccataaaa atgaacgggt tcacttgcac aggcgttgtg acggaggctg aaacctacac 4320
taacttcgtt ggttatgtca caaccacgtt caaaagaaag catttccgcc caacaccaga 4380
tgcatgtaga gccgcgtaca actggaagat ggccggtgac cccagatatg aagagtctct 4440
acacaatccg taccctgact accactggct tcgaactgta aaaaccacca aggagtctct 4500
cgttatcata tctccaagtg tggcagattt ggacccatat gacagatccc ttcactcgag 4560
ggtcttccct agcgggaagt gctcaggagt agcggtgtct tctacctact gctccactaa 4620
ccacgattac accatttgga tgcccgagaa tccgagacta gggatgtctt gtgacatttt 4680
taccaatagt agagggaaga gagcatccaa agggagtgag acttgcggct ttgtagatga 4740
aagaggccta tataagtctt taaaaggagc atgcaaactc aagttatgtg gagttctagg 4800
acttagactt atggatggaa catgggtcgc gatgcaaaca tcaaatgaaa ccaaatggtg 4860
ccctcccgat cagttggtga acctgcacga ctttcgctca gacgaaattg agcaccttgt 4920
tgtagaggag ttggtcagga agagagagga gtgtctggat gcactagagt ccatcatgac 4980
aaccaagtca gtgagtttca gacgtctcag tcatttaaga aaacttgtcc ctgggtttgg 5040
aaaagcatat accatattca acaagacctt gatggaagcc gatgctcact acaagtcagt 5100
cgagacttgg aatgagatcc tcccttcaaa agggtgttta agagttgggg ggaggtgtca 5160
tcctcatgtg aacggggtgt ttttcaatgg tataatatta ggacctgacg gcaatgtctt 5220
aatcccagag atgcaatcat ccctcctcca gcaacatatg gagttgttgg aatcctcggt 5280
tatccccctt gtgcaccccc tggcagaccc gtctaccgtt ttcaaggacg gtgacgaggc 5340
tgaggatttt gttgaagttc accttcccga tgtgcacaat caggtctcag gagttgactt 5400
gggtctcccg aactggggga agtatgtatt actgagtgca ggggccctga ctgccttgat 5460
gttgataatt ttcctgatga catgttgtag aagagtcaat cgatcagaac ctacgcaaca 5520
caatctcaga gggacaggga gggaggtgtc agtcactccc caaagcggga agatcatatc 5580
ttcatgggaa tcacacaaga gtgggggtga gaccagactg tgaggactgg ccgtcctttc 5640
aactatccaa gtcctgaaga tcacctcccc ttggggggtt ctttttgaaa aaaacctggg 5700
ttcaatagtc ctccttgaac tccatgcaac tgggtagatt caagagtcat gagattttca 5760
ttaatcctct cagttgatca agcaagatca tgtagattct cataataggg gagatcttct 5820
agcagtttca gtgactaacg gtactttcat tctccaggaa ctgacaccaa cagttgtaga 5880
caaaccacgg ggtgtctcgg gtgactctgt gcttgggcac agacaaaggt catggtgtgt 5940
tccatgatag cggactcagg atgagttaat tgagagaggc agtcttcctc ccgtgaagga 6000
cataagcagt agctcacaat catctcgcgt ctcagcaaag tgtgcataat tataaagtgc 6060
tgggtcatct aagcttttca gtcgagaaaa aaacattaga tcagaagaac aactggcaac 6120
acttctcaac ctgagaccta cttcaagatg ctcgatcctg gagaggtcta tgatgaccct 6180
attgacccaa tcgagttaga ggatgaaccc agaggaaccc ccactgtccc caacatcttg 6240
aggaactctg actacaatct caactctcct ttgatagaag atcctgctag actaatgtta 6300
gaatggttaa aaacagggaa tagaccttat cggatgactc taacagacaa ttgctccagg 6360
tctttcagag ttttgaaaga ttatttcaag aaggtagatt tgggttctct caaggtgggc 6420
ggaatggctg cacagtcaat gatttctctc tggttatatg gtgcccactc tgaatccaac 6480
aggagccgga gatgtataac agacttggcc catttctatt ccaagtcgtc ccccatagag 6540
aagctgttga atctcacgct aggaaataga gggctgagaa tccccccaga gggagtgtta 6600
agttgccttg agagggttga ttatgataat gcatttggaa ggtatcttgc caacacgtat 6660
tcctcttact tgttcttcca tgtaatcacc ttatacatga acgccctaga ctgggatgaa 6720
gaaaagacca tcctagcatt atggaaagat ttaacctcag tggacatcgg gaaggacttg 6780
gtaaagttca aagaccaaat atggggactg ctgatcgtga caaaggactt tgtttactcc 6840
caaagttcca attgtctttt tgacagaaac tacacactta tgctaaaaga tcttttcttg 6900
tctcgcttca actccttaat ggtcttgctc tctcccccag agccccgata ctcagatgac 6960
ttgatatctc aactatgcca gctgtacatt gctggggatc aagtcttgtc tatgtgtgga 7020
aactccggct atgaagtcat caaaatattg gagccatatg tcgtgaatag tttagtccag 7080
agagcagaaa agtttaggcc tctcattcat tccttgggag actttcctgt atttataaaa 7140
gacaaggtaa gtcaacttga agagacgttc ggtccctgtg caagaaggtt ctttagggct 7200
ctggatcaat tcgacaacat acatgacttg gtttttgtgt atggctgtta caggcattgg 7260
gggcacccat atatagatta tcgaaagggt ctgtcaaaac tatatgatca ggttcacatt 7320
aaaaaagtga tagataagtc ctaccaggag tgcttagcaa gcgacctagc caggaggatc 7380
cttagatggg gttttgataa gtactccaag tggtatctgg attcaagatt cctagcccga 7440
gaccacccct tgactcctta tatcaaaacc caaacatggc cacccaaaca tattgtagac 7500
ttggtggggg atacatggca caagctcccg atcacgcaga tctttgagat tcctgaatca 7560
atggatccgt cagaaatatt ggatgacaaa tcacattctt tcaccagaac gagactagct 7620
tcttggctgt cagaaaaccg agggggacct gttcctagcg aaaaagttat tatcacggcc 7680
ctgtctaagc cgcctgtcaa tccccgagag tttctgaggt ctatagacct cggaggattg 7740
ccagatgaag acttgataat tggcctcaag ccaaaggaac gggaattgaa gattgaaggt 7800
cgattctttg ctctaatgtc atggaatcta agattgtatt ttgtcatcac tgaaaaactc 7860
ttggccaact acatcttgcc actttttgac gcgctgacta tgacagacaa cctgaacaag 7920
gtgtttaaaa agctgatcga cagggtcacc gggcaagggc ttttggacta ttcaagggtc 7980
acatatgcat ttcacctgga ctatgaaaag tggaacaacc atcaaagatt agagtcaaca 8040
gaggatgtat tttctgtcct agatcaagtg tttggattga agagagtgtt ttctagaaca 8100
cacgagtttt ttcaaaaggc ctggatctat tattcagaca gatcagacct catcgggtta 8160
cgggaggatc aaatatactg cttagatgcg tccaacggcc caacctgttg gaatggccag 8220
gatggcgggc tagaaggctt acggcagaag ggctggagtc tagtcagctt attgatgata 8280
gatagagaat ctcaaatcag gaacacaaga accaaaatac tagctcaagg agacaaccag 8340
gttttatgtc cgacatatat gttgtcgcca gggctatctc aagaggggct cctctatgaa 8400
ttggagagaa tatcaaggaa tgcactttcg atatacagag ccgtcgagga aggggcatct 8460
aagctagggc tgatcatcaa gaaagaagag accatgtgta gttatgactt cctcatctat 8520
ggaaaaaccc ctttgtttag aggtaacata ttggtgcctg agtccaaaag atgggccaga 8580
gtctcttgcg tctctaatga ccaaatagtc aacctcgcca atataatgtc gacagtgtcc 8640
accaatgcgc taacagtggc acaacactct caatctttga tcaaaccgat gagggatttt 8700
ctgctcatgt cagtacaggc agtctttcac tacctgctat ttagcccaat cttaaaggga 8760
agagtttaca agattctgag cgctgaaggg gatagctttc tcctagccat gtcaaggata 8820
atctatctag atccttcttt gggaggggta tctggaatgt ccctcggaag attccatata 8880
cgacagttct cagaccctgt ctctgaaggg ttatccttct ggagagagat ctggttaagc 8940
tcccacgagt cctggattca cgcgttgtgt caagaggctg gaaacccaga tcttggagag 9000
agaacactcg agagcttcac tcgccttcta gaagatccta ccaccttaaa tatcagagga 9060
ggggccagtc ctaccattct actcaaggat gcaatcagaa aggctttata tgacgaggtg 9120
gacaaggtgg agaattcaga gtttcgagag gcaatcctgt tgtccaagac ccatagagat 9180
aattttatac tcttcttaac atctgttgag cctctgtttc ctcgatttct cagtgagcta 9240
ttcagttcgt cttttttggg aatccccgag tcaatcattg gattgataca aaactcccga 9300
acgataagaa ggcagtttag aaagagtctc tcaaaaactt tagaagaatc cttctacaac 9360
tcagagatcc acgggattag tcggatgacc cagacacctc agagggttgg gggggtgtgg 9420
ccttgctctt cagagagggc agatctactt agggagatct cttggggaag aaaagtggta 9480
ggcacgacag ttcctcaccc ttctgagatg ttggggttac ttcccaagtc ctctatttct 9540
tgcacttgtg gagcaacagg aggaggcaat cctagagttt ctgtatcagt actcccgtcc 9600
tttgatcagt catttttttc acgaggcccc ctaaaggggt acttgggctc gtccacctct 9660
atgtcgaccc agctattcca tgcatgggaa aaagtcacta atgttcatgt ggtgaagaga 9720
gctctatcgt taaaagaatc tataaactgg ttcattacta gagattccaa cttggctcaa 9780
gctctaatta ggaacattat gtctctgaca ggccctgatt tccctctaga ggaggcccct 9840
gtcttcaaaa ggacggggtc agccttgcat aggttcaagt ctgccagata cagcgaagga 9900
gggtattctt ctgtctgccc gaacctcctc tctcatattt ctgttagtac agacaccatg 9960
tctgatttga cccaagacgg gaagaactac gatttcatgt tccagccatt gatgctttat 10020
gcacagacat ggacatcaga gctggtacag agagacacaa ggctaagaga ctctacgttt 10080
cattggcacc tccgatgcaa caggtgtgtg agacccattg acgacgtgac cctggagacc 10140
tctcagatct tcgagtttcc ggatgtgtcg aaaagaatat ccagaatggt ttctggggct 10200
gtgcctcact tccagaggct tcccgatatc cgtctgagac caggagattt tgaatctcta 10260
agcggtagag aaaagtctca ccatatcgga tcagctcagg ggctcttata ctcaatctta 10320
gtggcaattc acgactcagg atacaatgat ggaaccatct tccctgtcaa catatacgac 10380
aaggtttccc ctagagacta tttgagaggg ctcgcaaggg gagtattgat aggatcctcg 10440
atttgcttct tgacaagaat gacaaatatc aatattaata gacctcttga attgatctca 10500
ggggtaatct catatattct cctgaggcta gataaccatc cctccttgta cataatgctc 10560
agagaaccgt ctcttagagg agagatattt tctatccctc agaaaatccc cgccgcttat 10620
ccaaccacta tgaaagaagg caacagatca atcttgtgtt atctccaaca tgtgctacgc 10680
tatgagcgag agataatcac ggcgtctcca gagaatgact ggctatggat cttttcagac 10740
tttagaagtg ccaaaatgac gtacctaacc ctcattactt accagtctca tcttctactc 10800
cagagggttg agagaaacct atctaagagt atgagagata acctgcgaca attgagttcc 10860
ttgatgaggc aggtgctggg cgggcacgga gaagatacct tagagtcaga cgacaacatt 10920
caacgactgc taaaagactc tttacgaagg acaagatggg tggatcaaga ggtgcgccat 10980
gcagctagaa ccatgactgg agattacagc cccaacaaga aggtgtcccg taaggtagga 11040
tgttcagaat gggtctgctc tgctcaacag gttgcagtct ctacctcagc aaacccggcc 11100
cctgtctcgg agcttgacat aagggccctc tctaagaggt tccagaaccc tttgatctcg 11160
ggcttgagag tggttcagtg ggcaaccggt gctcattata agcttaagcc tattctagat 11220
gatctcaatg ttttcccatc tctctgcctt gtagttgggg acgggtcagg ggggatatca 11280
agggcagtcc tcaacatgtt tccagatgcc aagcttgtgt tcaacagtct cttagaggtg 11340
aatgacctga tggcttccgg aacacatcca ctgcctcctt cagcaatcat gaggggagga 11400
aatgatatcg tctccagagt gatagatttt gactcaatct gggaaaaacc gtccgacttg 11460
agaaacttgg caacctggaa atacttccag tcagtccaaa agcaggtcaa catgtcctat 11520
gacctcatta tttgcgatgc agaagttact gacattgcat ctatcaaccg gataaccctg 11580
ttaatgtccg attttgcatt gtctatagat ggaccactct atttggtctt caaaacttat 11640
gggactatgc tagtaaatcc aaactacaag gctattcaac acctgtcaag agcgttcccc 11700
tcggtcacag ggtttatcac ccaagtaact tcgtcttttt catctgagct ctacctccga 11760
ttctccaaac gagggaagtt tttcagagat gctgagtact tgacctcttc cacccttcga 11820
gaaatgagcc ttgtgttatt caattgtagc agccccaaga gtgagatgca gagagctcgt 11880
tccttgaact atcaggatct tgtgagagga tttcctgaag aaatcatatc aaatccttac 11940
aatgagatga tcataactct gattgacagt gatgtagaat cttttctagt ccacaagatg 12000
gttgatgatc ttgagttaca gaggggaact ctgtctaaag tggctatcat tatagccatc 12060
atgatagttt tctccaacag agtcttcaac gtttccaaac ccctaactga ccccttgttc 12120
tatccaccgt ctgatcccaa aatcctgagg cacttcaaca tatgttgcag tactatgatg 12180
tatctatcta ctgctttagg tgacgtccct agcttcgcaa gacttcacga cctgtataac 12240
agacctataa cttattactt cagaaagcaa ttcattcgag ggaacgttta tctatcttgg 12300
agttggtcca acgacacctc agtgttcaaa agggtagcct gtaattctag cctgagtctg 12360
tcatctcact ggatcaggtt gatttacaag atagtgaaga ctaccagact cgttggcagc 12420
atcaaggatc tatccagaga agtggaaaga caccttcata ggtacaacag gtggatcacc 12480
ctagaggata tcagatctag atcatcccta ctagactaca gttgcctgtg atccggatac 12540
tcctggaagc ctgcccatgc taagactctt gtgtgatgta tcttgaaaaa aacaagatcc 12600
taaatctgaa cctttggttg tttgattgtt tttctcattt ttgttgttta tttgttaagc 12660
gt 12662
<210>16
<211>11914
<212>DNA
<213>artificial sequence
<220>
<223>Recombinant ERAgm rabies virus genome
<220>
<221>misc_feature
<222>(71)..(1420)
<220>
<221>misc_feature
<222>(1514)..(2404)
<220>
<221>misc_feature
<222>(2505)..(4076)
<220>
<221>misc_feature
<222>(4122)..(4727)
<220>
<221>misc_feature
<222>(5400)..(11780)
<400>16
acgcttaaca accagatcaa agaaaaaaca gacattgtca attgcaaagc aaaaatgtaa 60
cacccctaca atggatgccg acaagattgt attcaaagtc aataatcagg tggtctcttt 120
gaagcctgag attatcgtgg atcaacatga gtacaagtac cctgccatca aagatttgaa 180
aaagccctgt ataaccctag gaaaggctcc cgatttaaat aaagcataca agtcagtttt 240
gtcaggcatg agcgccgcca aacttgatcc tgacgatgta tgttcctatt tggcagcggc 300
aatgcagttt tttgagggga catgtccgga agactggacc agctatggaa tcgtgattgc 360
acgaaaagga gataagatca ccccaggttc tctggtggag ataaaacgta ctgatgtaga 420
agggaattgg gctctgacag gaggcatgga actgacaaga gaccccactg tccctgagca 480
tgcgtcctta gtcggtcttc tcttgagtct gtataggttg agcaaaatat ccgggcaaaa 540
cactggtaac tataagacaa acattgcaga caggatagag cagatttttg agacagcccc 600
ttttgttaaa atcgtggaac accatactct aatgacaact cacaaaatgt gtgctaattg 660
gagtactata ccaaacttca gatttttggc cggaacctat gacatgtttt tctcccggat 720
tgagcatcta tattcagcaa tcagagtggg cacagttgtc actgcttatg aagactgttc 780
aggactggta tcatttactg ggttcataaa acaaatcaat ctcaccgcta gagaggcaat 840
actatatttc ttccacaaga actttgagga agagataaga agaatgtttg agccagggca 900
ggagacagct gttcctcact cttatttcat ccacttccgt tcactaggct tgagtgggaa 960
atctccttat tcatcaaatg ctgttggtca cgtgttcaat ctcattcact ttgtaggatg 1020
ctatatgggt caagtcagat ccctaaatgc aacggttatt gctgcatgtg ctcctcatga 1080
aatgtctgtt ctagggggct atctgggaga ggaattcttc gggaaaggga catttgaaag 1140
aagattcttc agagatgaga aagaacttca agaatacgag gcggctgaac tgacaaagac 1200
tgacgtagca ctggcagatg atggaactgt caactctgac gacgaggact acttctcagg 1260
tgaaaccaga agtccggagg ctgtttatac tcgaatcatg atgaatggag gtcgactaaa 1320
gagatctcac atacggagat atgtctcagt cagttccaat catcaagccc gtccaaactc 1380
attcgccgag tttctaaaca agacatattc gagtgactca taagaagttg aataacaaaa 1440
tgccggaaat ctacggattg tgtatatcca tcatgaaaaa aactaacacc cctcctttcg 1500
aaccatccca aacatgagca agatctttgt caatcctagt gctattagag ccggtctggc 1560
cgatcttgag atggctgaag aaactgttga tctgatcaat agaaatatcg aagacaatca 1620
ggctcatctc caaggggaac ccatagaagt ggacaatctc cctgaggata tggggcgact 1680
tcacctggat gatggaaaat cgcccaaccc tggtgagatg gccaaggtgg gagaaggcaa 1740
gtatcgagag gactttcaga tggatgaagg agaggatcct agcttcctgt tccagtcata 1800
cctggaaaat gttggagtcc aaatagtcag acaaatgagg tcaggagaga gatttctcaa 1860
gatatggtca cagaccgtag aagagattat atcctatgtc gcggtcaact ttcccaaccc 1920
tccaggaaag tcttcagagg ataaatcaac ccagactact ggccgagagc tcaagaagga 1980
gacaacaccc actccttctc agagagaaag ccaatcatcg aaagccagga tggcggctca 2040
aattgcttct ggccctccag cccttgaatg gtcggccacc aatgaagagg atgatctatc 2100
agtggaggct gagatcgctc accagattgc agaaagtttc tccaaaaaat ataagtttcc 2160
ctctcgatcc tcagggatac tcttgtataa ttttgagcaa ttgaaaatga accttgatga 2220
tatagttaaa gaggcaaaaa atgtaccagg tgtgacccgt ttagcccatg acgggtccaa 2280
actcccccta agatgtgtac tgggatgggt cgctttggcc aactctaaga aattccagtt 2340
gttagtcgaa tccgacaagc tgagtaaaat catgcaagat gacttgaatc gctatacatc 2400
ttgctaaccg aacctctcca ctcagtccct ctagacaata aagtccgaga tgtcctaaag 2460
tcaacatgaa aaaaactaac acccctcctt tcgctgcagc caccatggtt cctcaggctc 2520
tcctgtttgt accccttctg gtttttccat tgtgttttgg gaaattccct atttacacga 2580
taccagacaa gcttggtccc tggagcccga ttgacataca tcacctcagc tgcccaaaca 2640
atttggtagt ggaggacgaa ggatgcacca acctgtcagg gttctcctac atggaactta 2700
aagttggata catcttagcc ataaaaatga acgggttcac ttgcacaggc gttgtgacgg 2760
aggctgaaac ctacactaac ttcgttggtt atgtcacaac cacgttcaaa agaaagcatt 2820
tccgcccaac accagatgca tgtagagccg cgtacaactg gaagatggcc ggtgacccca 2880
gatatgaaga gtctctacac aatccgtacc ctgactacca ctggcttcga actgtaaaaa 2940
ccaccaagga gtctctcgtt atcatatctc caagtgtggc agatttggac ccatatgaca 3000
gatcccttca ctcgagggtc ttccctagcg ggaagtgctc aggagtagcg gtgtcttcta 3060
cctactgctc cactaaccac gattacacca tttggatgcc cgagaatccg agactaggga 3120
tgtcttgtga catttttacc aatagtagag ggaagagagc atccaaaggg agtgagactt 3180
gcggctttgt agatgaaaga ggcctatata agtctttaaa aggagcatgc aaactcaagt 3240
tatgtggagt tctaggactt agacttatgg atggaacatg ggtcgcgatg caaacatcaa 3300
atgaaaccaa atggtgccct cccgatcagt tggtgaacct gcacgacttt cgctcagacg 3360
aaattgagca ccttgttgta gaggagttgg tcaggaagag agaggagtgt ctggatgcac 3420
tagagtccat catgacaacc aagtcagtga gtttcagacg tctcagtcat ttaagaaaac 3480
ttgtccctgg gtttggaaaa gcatatacca tattcaacaa gaccttgatg gaagccgatg 3540
ctcactacaa gtcagtcaga acttggaatg agatcctccc ttcaaaaggg tgtttaagag 3600
ttggggggag gtgtcatcct catgtgaacg gggtgttttt caatggtata atattaggac 3660
ctgacggcaa tgtcttaatc ccagagatgc aatcatccct cctccagcaa catatggagt 3720
tgttggaatc ctcggttatc ccccttgtgc accccctggc agacccgtct accgttttca 3780
aggacggtga cgaggctgag gattttgttg aagttcacct tcccgatgtg cacaatcagg 3840
tctcaggagt tgacttgggt ctcccgaact gggggaagta tgtattactg agtgcagggg 3900
ccctgactgc cttgatgttg ataattttcc tgatgacatg ttgtagaaga gtcaatcgat 3960
cagaacctac gcaacacaat ctcagaggga cagggaggga ggtgtcagtc actccccaaa 4020
gcgggaagat catatcttca tgggaatcac acaagagtgg gggtgagacc agactgtgat 4080
ttggtaccgt cgagaaaaaa acaggcaaca ccactgataa aatgaacttt ctacgtaaga 4140
tagtgaaaaa ttgcagggac gaggacactc aaaaaccctc tcccgtgtca gcccctctgg 4200
atgacgatga cttgtggctt ccaccccctg aatacgtccc gctgaaagaa cttacaagca 4260
agaagaacat gaggaacttt tgtatcaacg gaggggttaa agtgtgtagc ccgaatggtt 4320
actcgttcag gatcctgcgg cacattctga aatcattcga cgagatatat tctgggaatc 4380
ataggatgat cgggttagtc aaagtagtta ttggactggc tttgtcagga tctccagtcc 4440
ctgagggcat gaactgggta tacaaattga ggagaacctt tatcttccag tgggctgatt 4500
ccaggggccc tcttgaaggg gaggagttgg aatactctca ggagatcact tgggatgatg 4560
atactgagtt cgtcggattg caaataagag tgattgcaaa acagtgtcat atccagggca 4620
gaatctggtg tatcaacatg aacccgagag catgtcaact atggtctgac atgtctcttc 4680
agacacaaag gtccgaagag gacaaagatt cctctctgct tctagaataa tcagattata 4740
tcccgcaaat ttatcacttg tttacctctg gaggagagaa catatgggct caactccaac 4800
ccttgggagc aatataacaa aaaacatgtt atggtgccat taaaccgctg catttcatca 4860
aagtcaagtt gattaccttt acattttgat cctcttggat gtgaaaaaaa ctaacacccc 4920
tctgcagttt ggtaccttga aaaaaacctg ggttcaatag tcctccttga actccatgca 4980
actgggtaga ttcaagagtc atgagatttt cattaatcct ctcagttgat caagcaagat 5040
catgtagatt ctcataatag gggagatctt ctagcagttt cagtgactaa cggtactttc 5100
attctccagg aactgacacc aacagttgta gacaaaccac ggggtgtctc gggtgactct 5160
gtgcttgggc acagacaaag gtcatggtgt gttccatgat agcggactca ggatgagtta 5220
attgagagag gcagtcttcc tcccgtgaag gacataagca gtagctcaca atcatctcgc 5280
gtctcagcaa agtgtgcata attataaagt gctgggtcat ctaagctttt cagtcgagaa 5340
aaaaacatta gatcagaaga acaactggca acacttctca acctgagacc tacttcaaga 5400
tgctcgatcc tggagaggtc tatgatgacc ctattgaccc aatcgagtta gaggatgaac 5460
ccagaggaac ccccactgtc cccaacatct tgaggaactc tgactacaat ctcaactctc 5520
ctttgataga agatcctgct agactaatgt tagaatggtt aaaaacaggg aatagacctt 5580
atcggatgac tctaacagac aattgctcca ggtctttcag agttttgaaa gattatttca 5640
agaaggtaga tttgggttct ctcaaggtgg gcggaatggc tgcacagtca atgatttctc 5700
tctggttata tggtgcccac tctgaatcca acaggagccg gagatgtata acagacttgg 5760
cccatttcta ttccaagtcg tcccccatag agaagctgtt gaatctcacg ctaggaaata 5820
gagggctgag aatcccccca gagggagtgt taagttgcct tgagagggtt gattatgata 5880
atgcatttgg aaggtatctt gccaacacgt attcctctta cttgttcttc catgtaatca 5940
ccttatacat gaacgcccta gactgggatg aagaaaagac catcctagca ttatggaaag 6000
atttaacctc agtggacatc gggaaggact tggtaaagtt caaagaccaa atatggggac 6060
tgctgatcgt gacaaaggac tttgtttact cccaaagttc caattgtctt tttgacagaa 6120
actacacact tatgctaaaa gatcttttct tgtctcgctt caactcctta atggtcttgc 6180
tctctccccc agagccccga tactcagatg acttgatatc tcaactatgc cagctgtaca 6240
ttgctgggga tcaagtcttg tctatgtgtg gaaactccgg ctatgaagtc atcaaaatat 6300
tggagccata tgtcgtgaat agtttagtcc agagagcaga aaagtttagg cctctcattc 6360
attccttggg agactttcct gtatttataa aagacaaggt aagtcaactt gaagagacgt 6420
tcggtccctg tgcaagaagg ttctttaggg ctctggatca attcgacaac atacatgact 6480
tggtttttgt gtatggctgt tacaggcatt gggggcaccc atatatagat tatcgaaagg 6540
gtctgtcaaa actatatgat caggttcaca ttaaaaaagt gatagataag tcctaccagg 6600
agtgcttagc aagcgaccta gccaggagga tccttagatg gggttttgat aagtactcca 6660
agtggtatct ggattcaaga ttcctagccc gagaccaccc cttgactcct tatatcaaaa 6720
cccaaacatg gccacccaaa catattgtag acttggtggg ggatacatgg cacaagctcc 6780
cgatcacgca gatctttgag attcctgaat caatggatcc gtcagaaata ttggatgaca 6840
aatcacattc tttcaccaga acgagactag cttcttggct gtcagaaaac cgagggggac 6900
ctgttcctag cgaaaaagtt attatcacgg ccctgtctaa gccgcctgtc aatccccgag 6960
agtttctgag gtctatagac ctcggaggat tgccagatga agacttgata attggcctca 7020
agccaaagga acgggaattg aagattgaag gtcgattctt tgctctaatg tcatggaatc 7080
taagattgta ttttgtcatc actgaaaaac tcttggccaa ctacatcttg ccactttttg 7140
acgcgctgac tatgacagac aacctgaaca aggtgtttaa aaagctgatc gacagggtca 7200
ccgggcaagg gcttttggac tattcaaggg tcacatatgc atttcacctg gactatgaaa 7260
agtggaacaa ccatcaaaga ttagagtcaa cagaggatgt attttctgtc ctagatcaag 7320
tgtttggatt gaagagagtg ttttctagaa cacacgagtt ttttcaaaag gcctggatct 7380
attattcaga cagatcagac ctcatcgggt tacgggagga tcaaatatac tgcttagatg 7440
cgtccaacgg cccaacctgt tggaatggcc aggatggcgg gctagaaggc ttacggcaga 7500
agggctggag tctagtcagc ttattgatga tagatagaga atctcaaatc aggaacacaa 7560
gaaccaaaat actagctcaa ggagacaacc aggttttatg tccgacatat atgttgtcgc 7620
cagggctatc tcaagagggg ctcctctatg aattggagag aatatcaagg aatgcacttt 7680
cgatatacag agccgtcgag gaaggggcat ctaagctagg gctgatcatc aagaaagaag 7740
agaccatgtg tagttatgac ttcctcatct atggaaaaac ccctttgttt agaggtaaca 7800
tattggtgcc tgagtccaaa agatgggcca gagtctcttg cgtctctaat gaccaaatag 7860
tcaacctcgc caatataatg tcgacagtgt ccaccaatgc gctaacagtg gcacaacact 7920
ctcaatcttt gatcaaaccg atgagggatt ttctgctcat gtcagtacag gcagtctttc 7980
actacctgct atttagccca atcttaaagg gaagagttta caagattctg agcgctgaag 8040
gggatagctt tctcctagcc atgtcaagga taatctatct agatccttct ttgggagggg 8100
tatctggaat gtccctcgga agattccata tacgacagtt ctcagaccct gtctctgaag 8160
ggttatcctt ctggagagag atctggttaa gctcccacga gtcctggatt cacgcgttgt 8220
gtcaagaggc tggaaaccca gatcttggag agagaacact cgagagcttc actcgccttc 8280
tagaagatcc taccacctta aatatcagag gaggggccag tcctaccatt ctactcaagg 8340
atgcaatcag aaaggcttta tatgacgagg tggacaaggt ggagaattca gagtttcgag 8400
aggcaatcct gttgtccaag acccatagag ataattttat actcttctta acatctgttg 8460
agcctctgtt tcctcgattt ctcagtgagc tattcagttc gtcttttttg ggaatccccg 8520
agtcaatcat tggattgata caaaactccc gaacgataag aaggcagttt agaaagagtc 8580
tctcaaaaac tttagaagaa tccttctaca actcagagat ccacgggatt agtcggatga 8640
cccagacacc tcagagggtt gggggggtgt ggccttgctc ttcagagagg gcagatctac 8700
ttagggagat ctcttgggga agaaaagtgg taggcacgac agttcctcac ccttctgaga 8760
tgttggggtt acttcccaag tcctctattt cttgcacttg tggagcaaca ggaggaggca 8820
atcctagagt ttctgtatca gtactcccgt cctttgatca gtcatttttt tcacgaggcc 8880
ccctaaaggg gtacttgggc tcgtccacct ctatgtcgac ccagctattc catgcatggg 8940
aaaaagtcac taatgttcat gtggtgaaga gagctctatc gttaaaagaa tctataaact 9000
ggttcattac tagagattcc aacttggctc aagctctaat taggaacatt atgtctctga 9060
caggccctga tttccctcta gaggaggccc ctgtcttcaa aaggacgggg tcagccttgc 9120
ataggttcaa gtctgccaga tacagcgaag gagggtattc ttctgtctgc ccgaacctcc 9180
tctctcatat ttctgttagt acagacacca tgtctgattt gacccaagac gggaagaact 9240
acgatttcat gttccagcca ttgatgcttt atgcacagac atggacatca gagctggtac 9300
agagagacac aaggctaaga gactctacgt ttcattggca cctccgatgc aacaggtgtg 9360
tgagacccat tgacgacgtg accctggaga cctctcagat cttcgagttt ccggatgtgt 9420
cgaaaagaat atccagaatg gtttctgggg ctgtgcctca cttccagagg cttcccgata 9480
tccgtctgag accaggagat tttgaatctc taagcggtag agaaaagtct caccatatcg 9540
gatcagctca ggggctctta tactcaatct tagtggcaat tcacgactca ggatacaatg 9600
atggaaccat cttccctgtc aacatatacg acaaggtttc ccctagagac tatttgagag 9660
ggctcgcaag gggagtattg ataggatcct cgatttgctt cttgacaaga atgacaaata 9720
tcaatattaa tagacctctt gaattgatct caggggtaat ctcatatatt ctcctgaggc 9780
tagataacca tccctccttg tacataatgc tcagagaacc gtctcttaga ggagagatat 9840
tttctatccc tcagaaaatc cccgccgctt atccaaccac tatgaaagaa ggcaacagat 9900
caatcttgtg ttatctccaa catgtgctac gctatgagcg agagataatc acggcgtctc 9960
cagagaatga ctggctatgg atcttttcag actttagaag tgccaaaatg acgtacctaa 10020
ccctcattac ttaccagtct catcttctac tccagagggt tgagagaaac ctatctaaga 10080
gtatgagaga taacctgcga caattgagtt ccttgatgag gcaggtgctg ggcgggcacg 10140
gagaagatac cttagagtca gacgacaaca ttcaacgact gctaaaagac tctttacgaa 10200
ggacaagatg ggtggatcaa gaggtgcgcc atgcagctag aaccatgact ggagattaca 10260
gccccaacaa gaaggtgtcc cgtaaggtag gatgttcaga atgggtctgc tctgctcaac 10320
aggttgcagt ctctacctca gcaaacccgg cccctgtctc ggagcttgac ataagggccc 10380
tctctaagag gttccagaac cctttgatct cgggcttgag agtggttcag tgggcaaccg 10440
gtgctcatta taagcttaag cctattctag atgatctcaa tgttttccca tctctctgcc 10500
ttgtagttgg ggacgggtca ggggggatat caagggcagt cctcaacatg tttccagatg 10560
ccaagcttgt gttcaacagt ctcttagagg tgaatgacct gatggcttcc ggaacacatc 10620
cactgcctcc ttcagcaatc atgaggggag gaaatgatat cgtctccaga gtgatagatt 10680
ttgactcaat ctgggaaaaa ccgtccgact tgagaaactt ggcaacctgg aaatacttcc 10740
agtcagtcca aaagcaggtc aacatgtcct atgacctcat tatttgcgat gcagaagtta 10800
ctgacattgc atctatcaac cggataaccc tgttaatgtc cgattttgca ttgtctatag 10860
atggaccact ctatttggtc ttcaaaactt atgggactat gctagtaaat ccaaactaca 10920
aggctattca acacctgtca agagcgttcc cctcggtcac agggtttatc acccaagtaa 10980
cttcgtcttt ttcatctgag ctctacctcc gattctccaa acgagggaag tttttcagag 11040
atgctgagta cttgacctct tccacccttc gagaaatgag ccttgtgtta ttcaattgta 11100
gcagccccaa gagtgagatg cagagagctc gttccttgaa ctatcaggat cttgtgagag 11160
gatttcctga agaaatcata tcaaatcctt acaatgagat gatcataact ctgattgaca 11220
gtgatgtaga atcttttcta gtccacaaga tggttgatga tcttgagtta cagaggggaa 11280
ctctgtctaa agtggctatc attatagcca tcatgatagt tttctccaac agagtcttca 11340
acgtttccaa acccctaact gaccccttgt tctatccacc gtctgatccc aaaatcctga 11400
ggcacttcaa catatgttgc agtactatga tgtatctatc tactgcttta ggtgacgtcc 11460
ctagcttcgc aagacttcac gacctgtata acagacctat aacttattac ttcagaaagc 11520
aattcattcg agggaacgtt tatctatctt ggagttggtc caacgacacc tcagtgttca 11580
aaagggtagc ctgtaattct agcctgagtc tgtcatctca ctggatcagg ttgatttaca 11640
agatagtgaa gactaccaga ctcgttggca gcatcaagga tctatccaga gaagtggaaa 11700
gacaccttca taggtacaac aggtggatca ccctagagga tatcagatct agatcatccc 11760
tactagacta cagttgcctg tgatccggat actcctggaa gcctgcccat gctaagactc 11820
ttgtgtgatg tatcttgaaa aaaacaagat cctaaatctg aacctttggt tgtttgattg 11880
tttttctcat ttttgttgtt tatttgttaa gcgt 11914
<210>17
<211>11914
<212>DNA
<213>artificial sequence
<220>
<223>Recombinant ERAg3m rabies virus genome
<220>
<221>misc_feature
<222>(71)..(1420)
<220>
<221>misc_feature
<222>(1514)..(2404)
<220>
<221>misc_feature
<222>(2505)..(4076)
<220>
<221>misc_feature
<222>(4122)..(4727)
<220>
<221>misc_feature
<222>(5400)..(11780)
<400>17
acgcttaaca accagatcaa agaaaaaaca gacattgtca attgcaaagc aaaaatgtaa 60
cacccctaca atggatgccg acaagattgt attcaaagtc aataatcagg tggtctcttt 120
gaagcctgag attatcgtgg atcaacatga gtacaagtac cctgccatca aagatttgaa 180
aaagccctgt ataaccctag gaaaggctcc cgatttaaat aaagcataca agtcagtttt 240
gtcaggcatg agcgccgcca aacttgatcc tgacgatgta tgttcctatt tggcagcggc 300
aatgcagttt tttgagggga catgtccgga agactggacc agctatggaa tcgtgattgc 360
acgaaaagga gataagatca ccccaggttc tctggtggag ataaaacgta ctgatgtaga 420
agggaattgg gctctgacag gaggcatgga actgacaaga gaccccactg tccctgagca 480
tgcgtcctta gtcggtcttc tcttgagtct gtataggttg agcaaaatat ccgggcaaaa 540
cactggtaac tataagacaa acattgcaga caggatagag cagatttttg agacagcccc 600
ttttgttaaa atcgtggaac accatactct aatgacaact cacaaaatgt gtgctaattg 660
gagtactata ccaaacttca gatttttggc cggaacctat gacatgtttt tctcccggat 720
tgagcatcta tattcagcaa tcagagtggg cacagttgtc actgcttatg aagactgttc 780
aggactggta tcatttactg ggttcataaa acaaatcaat ctcaccgcta gagaggcaat 840
actatatttc ttccacaaga actttgagga agagataaga agaatgtttg agccagggca 900
ggagacagct gttcctcact cttatttcat ccacttccgt tcactaggct tgagtgggaa 960
atctccttat tcatcaaatg ctgttggtca cgtgttcaat ctcattcact ttgtaggatg 1020
ctatatgggt caagtcagat ccctaaatgc aacggttatt gctgcatgtg ctcctcatga 1080
aatgtctgtt ctagggggct atctgggaga ggaattcttc gggaaaggga catttgaaag 1140
aagattcttc agagatgaga aagaacttca agaatacgag gcggctgaac tgacaaagac 1200
tgacgtagca ctggcagatg atggaactgt caactctgac gacgaggact acttctcagg 1260
tgaaaccaga agtccggagg ctgtttatac tcgaatcatg atgaatggag gtcgactaaa 1320
gagatctcac atacggagat atgtctcagt cagttccaat catcaagccc gtccaaactc 1380
attcgccgag tttctaaaca agacatattc gagtgactca taagaagttg aataacaaaa 1440
tgccggaaat ctacggattg tgtatatcca tcatgaaaaa aactaacacc cctcctttcg 1500
aaccatccca aacatgagca agatctttgt caatcctagt gctattagag ccggtctggc 1560
cgatcttgag atggctgaag aaactgttga tctgatcaat agaaatatcg aagacaatca 1620
ggctcatctc caaggggaac ccatagaagt ggacaatctc cctgaggata tggggcgact 1680
tcacctggat gatggaaaat cgcccaaccc tggtgagatg gccaaggtgg gagaaggcaa 1740
gtatcgagag gactttcaga tggatgaagg agaggatcct agcttcctgt tccagtcata 1800
cctggaaaat gttggagtcc aaatagtcag acaaatgagg tcaggagaga gatttctcaa 1860
gatatggtca cagaccgtag aagagattat atcctatgtc gcggtcaact ttcccaaccc 1920
tccaggaaag tcttcagagg ataaatcaac ccagactact ggccgagagc tcaagaagga 1980
gacaacaccc actccttctc agagagaaag ccaatcatcg aaagccagga tggcggctca 2040
aattgcttct ggccctccag cccttgaatg gtcggccacc aatgaagagg atgatctatc 2100
agtggaggct gagatcgctc accagattgc agaaagtttc tccaaaaaat ataagtttcc 2160
ctctcgatcc tcagggatac tcttgtataa ttttgagcaa ttgaaaatga accttgatga 2220
tatagttaaa gaggcaaaaa atgtaccagg tgtgacccgt ttagcccatg acgggtccaa 2280
actcccccta agatgtgtac tgggatgggt cgctttggcc aactctaaga aattccagtt 2340
gttagtcgaa tccgacaagc tgagtaaaat catgcaagat gacttgaatc gctatacatc 2400
ttgctaaccg aacctctcca ctcagtccct ctagacaata aagtccgaga tgtcctaaag 2460
tcaacatgaa aaaaactaac acccctcctt tcgctgcagc caccatggtt cctcaggctc 2520
tcctgtttgt accccttctg gtttttccat tgtgttttgg gaaattccct atttacacga 2580
taccagacaa gcttggtccc tggagtccga ttgacataca tcacctcagc tgcccaaaca 2640
atttggtagt ggaggacgaa ggatgcacca acctgtcagg gttctcctac atggaactta 2700
aagttggata catcttagcc ataaaagtga acgggttcac ttgcacaggc gttgtgacgg 2760
aggctgaaac ctacactaac ttcgttggtt atgtcacaac cacgttcaaa agaaagcatt 2820
tccgcccaac accagatgca tgtagagccg cgtacaactg gaagatggcc ggtgacccca 2880
gatatgaaga gtctctacac aatccgtacc ctgactaccg ctggcttcga actgtaaaaa 2940
ccaccaagga gtctctcgtt atcatatctc caagtgtggc agatttggac ccatatgaca 3000
gatcccttca ctcgagggtc ttccctagcg ggaagtgctc aggagtagcg gtgtcttcta 3060
cctactgctc cactaaccac gattacacca tttggatgcc cgagaatccg agactaggga 3120
tgtcttgtga catttttacc aatagtagag ggaagagagc atccaaaggg agtgagactt 3180
gcggctttgt agatgaaaga ggcctatata agtctttaaa aggagcatgc aaactcaagt 3240
tatgtggagt tctaggactt agacttatgg atggaacatg ggtctcgatg caaacatcaa 3300
atgaaaccaa atggtgccct cccgataagt tggtgaacct gcacgacttt cgctcagacg 3360
aaattgagca ccttgttgta gaggagttgg tcaggaagag agaggagtgt ctggatgcac 3420
tagagtccat catgacaacc aagtcagtga gtttcagacg tctcagtcat ttaagaaaac 3480
ttgtccctgg gtttggaaaa gcatatacca tattcaacaa gaccttgatg gaagccgatg 3540
ctcactacaa gtcagtcgag acttggaatg agatcctccc ttcaaaaggg tgtttaagag 3600
ttggggggag gtgtcatcct catgtgaacg gggtgttttt caatggtata atattaggac 3660
ctgacggcaa tgtcttaatc ccagagatgc aatcatccct cctccagcaa catatggagt 3720
tgttggaatc ctcggttatc ccccttgtgc accccctggc agacccgtct accgttttca 3780
aggacggtga cgaggctgag gattttgttg aagttcacct tcccgatgtg cacaatcagg 3840
tctcaggagt tgacttgggt ctcccgaact gggggaagta tgtattactg agtgcagggg 3900
ccctgactgc cttgatgttg ataattttcc tgatgacatg ttgtagaaga gtcaatcgat 3960
cagaacctac gcaacacaat ctcagaggga cagggaggga ggtgtcagtc actccccaaa 4020
gcgggaagat catatcttca tgggaatcac acaagagtgg gggtgagacc agactgtaat 4080
ttggtaccgt cgagaaaaaa acaggcaaca ccactgataa aatgaacttt ctacgtaaga 4140
tagtgaaaaa ttgcagggac gaggacactc aaaaaccctc tcccgtgtca gcccctctgg 4200
atgacgatga cttgtggctt ccaccccctg aatacgtccc gctgaaagaa cttacaagca 4260
agaagaacat gaggaacttt tgtatcaacg gaggggttaa agtgtgtagc ccgaatggtt 4320
actcgttcag gatcctgcgg cacattctga aatcattcga cgagatatat tctgggaatc 4380
ataggatgat cgggttagtc aaagtagtta ttggactggc tttgtcagga tctccagtcc 4440
ctgagggcat gaactgggta tacaaattga ggagaacctt tatcttccag tgggctgatt 4500
ccaggggccc tcttgaaggg gaggagttgg aatactctca ggagatcact tgggatgatg 4560
atactgagtt cgtcggattg caaataagag tgattgcaaa acagtgtcat atccagggca 4620
gaatctggtg tatcaacatg aacccgagag catgtcaact atggtctgac atgtctcttc 4680
agacacaaag gtccgaagag gacaaagatt cctctctgct tctagaataa tcagattata 4740
tcccgcaaat ttatcacttg tttacctctg gaggagagaa catatgggct caactccaac 4800
ccttgggagc aatataacaa aaaacatgtt atggtgccat taaaccgctg catttcatca 4860
aagtcaagtt gattaccttt acattttgat cctcttggat gtgaaaaaaa ctaacacccc 4920
tctgcagttt ggtaccttga aaaaaacctg ggttcaatag tcctccttga actccatgca 4980
actgggtaga ttcaagagtc atgagatttt cattaatcct ctcagttgat caagcaagat 5040
catgtagatt ctcataatag gggagatctt ctagcagttt cagtgactaa cggtactttc 5100
attctccagg aactgacacc aacagttgta gacaaaccac ggggtgtctc gggtgactct 5160
gtgcttgggc acagacaaag gtcatggtgt gttccatgat agcggactca ggatgagtta 5220
attgagagag gcagtcttcc tcccgtgaag gacataagca gtagctcaca atcatctcgc 5280
gtctcagcaa agtgtgcata attataaagt gctgggtcat ctaagctttt cagtcgagaa 5340
aaaaacatta gatcagaaga acaactggca acacttctca acctgagacc tacttcaaga 5400
tgctcgatcc tggagaggtc tatgatgacc ctattgaccc aatcgagtta gaggatgaac 5460
ccagaggaac ccccactgtc cccaacatct tgaggaactc tgactacaat ctcaactctc 5520
ctttgataga agatcctgct agactaatgt tagaatggtt aaaaacaggg aatagacctt 5580
atcggatgac tctaacagac aattgctcca ggtctttcag agttttgaaa gattatttca 5640
agaaggtaga tttgggttct ctcaaggtgg gcggaatggc tgcacagtca atgatttctc 5700
tctggttata tggtgcccac tctgaatcca acaggagccg gagatgtata acagacttgg 5760
cccatttcta ttccaagtcg tcccccatag agaagctgtt gaatctcacg ctaggaaata 5820
gagggctgag aatcccccca gagggagtgt taagttgcct tgagagggtt gattatgata 5880
atgcatttgg aaggtatctt gccaacacgt attcctctta cttgttcttc catgtaatca 5940
ccttatacat gaacgcccta gactgggatg aagaaaagac catcctagca ttatggaaag 6000
atttaacctc agtggacatc gggaaggact tggtaaagtt caaagaccaa atatggggac 6060
tgctgatcgt gacaaaggac tttgtttact cccaaagttc caattgtctt tttgacagaa 6120
actacacact tatgctaaaa gatcttttct tgtctcgctt caactcctta atggtcttgc 6180
tctctccccc agagccccga tactcagatg acttgatatc tcaactatgc cagctgtaca 6240
ttgctgggga tcaagtcttg tctatgtgtg gaaactccgg ctatgaagtc atcaaaatat 6300
tggagccata tgtcgtgaat agtttagtcc agagagcaga aaagtttagg cctctcattc 6360
attccttggg agactttcct gtatttataa aagacaaggt aagtcaactt gaagagacgt 6420
tcggtccctg tgcaagaagg ttctttaggg ctctggatca attcgacaac atacatgact 6480
tggtttttgt gtatggctgt tacaggcatt gggggcaccc atatatagat tatcgaaagg 6540
gtctgtcaaa actatatgat caggttcaca ttaaaaaagt gatagataag tcctaccagg 6600
agtgcttagc aagcgaccta gccaggagga tccttagatg gggttttgat aagtactcca 6660
agtggtatct ggattcaaga ttcctagccc gagaccaccc cttgactcct tatatcaaaa 6720
cccaaacatg gccacccaaa catattgtag acttggtggg ggatacatgg cacaagctcc 6780
cgatcacgca gatctttgag attcctgaat caatggatcc gtcagaaata ttggatgaca 6840
aatcacattc tttcaccaga acgagactag cttcttggct gtcagaaaac cgagggggac 6900
ctgttcctag cgaaaaagtt attatcacgg ccctgtctaa gccgcctgtc aatccccgag 6960
agtttctgag gtctatagac ctcggaggat tgccagatga agacttgata attggcctca 7020
agccaaagga acgggaattg aagattgaag gtcgattctt tgctctaatg tcatggaatc 7080
taagattgta ttttgtcatc actgaaaaac tcttggccaa ctacatcttg ccactttttg 7140
acgcgctgac tatgacagac aacctgaaca aggtgtttaa aaagctgatc gacagggtca 7200
ccgggcaagg gcttttggac tattcaaggg tcacatatgc atttcacctg gactatgaaa 7260
agtggaacaa ccatcaaaga ttagagtcaa cagaggatgt attttctgtc ctagatcaag 7320
tgtttggatt gaagagagtg ttttctagaa cacacgagtt ttttcaaaag gcctggatct 7380
attattcaga cagatcagac ctcatcgggt tacgggagga tcaaatatac tgcttagatg 7440
cgtccaacgg cccaacctgt tggaatggcc aggatggcgg gctagaaggc ttacggcaga 7500
agggctggag tctagtcagc ttattgatga tagatagaga atctcaaatc aggaacacaa 7560
gaaccaaaat actagctcaa ggagacaacc aggttttatg tccgacatat atgttgtcgc 7620
cagggctatc tcaagagggg ctcctctatg aattggagag aatatcaagg aatgcacttt 7680
cgatatacag agccgtcgag gaaggggcat ctaagctagg gctgatcatc aagaaagaag 7740
agaccatgtg tagttatgac ttcctcatct atggaaaaac ccctttgttt agaggtaaca 7800
tattggtgcc tgagtccaaa agatgggcca gagtctcttg cgtctctaat gaccaaatag 7860
tcaacctcgc caatataatg tcgacagtgt ccaccaatgc gctaacagtg gcacaacact 7920
ctcaatcttt gatcaaaccg atgagggatt ttctgctcat gtcagtacag gcagtctttc 7980
actacctgct atttagccca atcttaaagg gaagagttta caagattctg agcgctgaag 8040
gggatagctt tctcctagcc atgtcaagga taatctatct agatccttct ttgggagggg 8100
tatctggaat gtccctcgga agattccata tacgacagtt ctcagaccct gtctctgaag 8160
ggttatcctt ctggagagag atctggttaa gctcccacga gtcctggatt cacgcgttgt 8220
gtcaagaggc tggaaaccca gatcttggag agagaacact cgagagcttc actcgccttc 8280
tagaagatcc taccacctta aatatcagag gaggggccag tcctaccatt ctactcaagg 8340
atgcaatcag aaaggcttta tatgacgagg tggacaaggt ggagaattca gagtttcgag 8400
aggcaatcct gttgtccaag acccatagag ataattttat actcttctta acatctgttg 8460
agcctctgtt tcctcgattt ctcagtgagc tattcagttc gtcttttttg ggaatccccg 8520
agtcaatcat tggattgata caaaactccc gaacgataag aaggcagttt agaaagagtc 8580
tctcaaaaac tttagaagaa tccttctaca actcagagat ccacgggatt agtcggatga 8640
cccagacacc tcagagggtt gggggggtgt ggccttgctc ttcagagagg gcagatctac 8700
ttagggagat ctcttgggga agaaaagtgg taggcacgac agttcctcac ccttctgaga 8760
tgttggggtt acttcccaag tcctctattt cttgcacttg tggagcaaca ggaggaggca 8820
atcctagagt ttctgtatca gtactcccgt cctttgatca gtcatttttt tcacgaggcc 8880
ccctaaaggg gtacttgggc tcgtccacct ctatgtcgac ccagctattc catgcatggg 8940
aaaaagtcac taatgttcat gtggtgaaga gagctctatc gttaaaagaa tctataaact 9000
ggttcattac tagagattcc aacttggctc aagctctaat taggaacatt atgtctctga 9060
caggccctga tttccctcta gaggaggccc ctgtcttcaa aaggacgggg tcagccttgc 9120
ataggttcaa gtctgccaga tacagcgaag gagggtattc ttctgtctgc ccgaacctcc 9180
tctctcatat ttctgttagt acagacacca tgtctgattt gacccaagac gggaagaact 9240
acgatttcat gttccagcca ttgatgcttt atgcacagac atggacatca gagctggtac 9300
agagagacac aaggctaaga gactctacgt ttcattggca cctccgatgc aacaggtgtg 9360
tgagacccat tgacgacgtg accctggaga cctctcagat cttcgagttt ccggatgtgt 9420
cgaaaagaat atccagaatg gtttctgggg ctgtgcctca cttccagagg cttcccgata 9480
tccgtctgag accaggagat tttgaatctc taagcggtag agaaaagtct caccatatcg 9540
gatcagctca ggggctctta tactcaatct tagtggcaat tcacgactca ggatacaatg 9600
atggaaccat cttccctgtc aacatatacg acaaggtttc ccctagagac tatttgagag 9660
ggctcgcaag gggagtattg ataggatcct cgatttgctt cttgacaaga atgacaaata 9720
tcaatattaa tagacctctt gaattgatct caggggtaat ctcatatatt ctcctgaggc 9780
tagataacca tccctccttg tacataatgc tcagagaacc gtctcttaga ggagagatat 9840
tttctatccc tcagaaaatc cccgccgctt atccaaccac tatgaaagaa ggcaacagat 9900
caatcttgtg ttatctccaa catgtgctac gctatgagcg agagataatc acggcgtctc 9960
cagagaatga ctggctatgg atcttttcag actttagaag tgccaaaatg acgtacctaa 10020
ccctcattac ttaccagtct catcttctac tccagagggt tgagagaaac ctatctaaga 10080
gtatgagaga taacctgcga caattgagtt ccttgatgag gcaggtgctg ggcgggcacg 10140
gagaagatac cttagagtca gacgacaaca ttcaacgact gctaaaagac tctttacgaa 10200
ggacaagatg ggtggatcaa gaggtgcgcc atgcagctag aaccatgact ggagattaca 10260
gccccaacaa gaaggtgtcc cgtaaggtag gatgttcaga atgggtctgc tctgctcaac 10320
aggttgcagt ctctacctca gcaaacccgg cccctgtctc ggagcttgac ataagggccc 10380
tctctaagag gttccagaac cctttgatct cgggcttgag agtggttcag tgggcaaccg 10440
gtgctcatta taagcttaag cctattctag atgatctcaa tgttttccca tctctctgcc 10500
ttgtagttgg ggacgggtca ggggggatat caagggcagt cctcaacatg tttccagatg 10560
ccaagcttgt gttcaacagt ctcttagagg tgaatgacct gatggcttcc ggaacacatc 10620
cactgcctcc ttcagcaatc atgaggggag gaaatgatat cgtctccaga gtgatagatt 10680
ttgactcaat ctgggaaaaa ccgtccgact tgagaaactt ggcaacctgg aaatacttcc 10740
agtcagtcca aaagcaggtc aacatgtcct atgacctcat tatttgcgat gcagaagtta 10800
ctgacattgc atctatcaac cggataaccc tgttaatgtc cgattttgca ttgtctatag 10860
atggaccact ctatttggtc ttcaaaactt atgggactat gctagtaaat ccaaactaca 10920
aggctattca acacctgtca agagcgttcc cctcggtcac agggtttatc acccaagtaa 10980
cttcgtcttt ttcatctgag ctctacctcc gattctccaa acgagggaag tttttcagag 11040
atgctgagta cttgacctct tccacccttc gagaaatgag ccttgtgtta ttcaattgta 11100
gcagccccaa gagtgagatg cagagagctc gttccttgaa ctatcaggat cttgtgagag 11160
gatttcctga agaaatcata tcaaatcctt acaatgagat gatcataact ctgattgaca 11220
gtgatgtaga atcttttcta gtccacaaga tggttgatga tcttgagtta cagaggggaa 11280
ctctgtctaa agtggctatc attatagcca tcatgatagt tttctccaac agagtcttca 11340
acgtttccaa acccctaact gaccccttgt tctatccacc gtctgatccc aaaatcctga 11400
ggcacttcaa catatgttgc agtactatga tgtatctatc tactgcttta ggtgacgtcc 11460
ctagcttcgc aagacttcac gacctgtata acagacctat aacttattac ttcagaaagc 11520
aattcattcg agggaacgtt tatctatctt ggagttggtc caacgacacc tcagtgttca 11580
aaagggtagc ctgtaattct agcctgagtc tgtcatctca ctggatcagg ttgatttaca 11640
agatagtgaa gactaccaga ctcgttggca gcatcaagga tctatccaga gaagtggaaa 11700
gacaccttca taggtacaac aggtggatca ccctagagga tatcagatct agatcatccc 11760
tactagacta cagttgcctg tgatccggat actcctggaa gcctgcccat gctaagactc 11820
ttgtgtgatg tatcttgaaa aaaacaagat cctaaatctg aacctttggt tgtttgattg 11880
tttttctcat ttttgttgtt tatttgttaa gcgt 11914
<210>18
<211>13556
<212>DNA
<213>artificial sequence
<220>
<223>Recombinant ERAgmg rabies virus genome
<220>
<221>misc_feature
<222>(71)..(1420)
<220>
<221>misc_feature
<222>(1514)..(2404)
<220>
<221>misc_feature
<222>(2505)..(4076)
<220>
<221>misc_feature
<222>(4122)..(4727)
<220>
<221>misc_feature
<222>(4943)..(6514)
<220>
<221>misc_feature
<222>(7042)..(13422)
<400>18
acgcttaaca accagatcaa agaaaaaaca gacattgtca attgcaaagc aaaaatgtaa 60
cacccctaca atggatgccg acaagattgt attcaaagtc aataatcagg tggtctcttt 120
gaagcctgag attatcgtgg atcaacatga gtacaagtac cctgccatca aagatttgaa 180
aaagccctgt ataaccctag gaaaggctcc cgatttaaat aaagcataca agtcagtttt 240
gtcaggcatg agcgccgcca aacttgatcc tgacgatgta tgttcctatt tggcagcggc 300
aatgcagttt tttgagggga catgtccgga agactggacc agctatggaa tcgtgattgc 360
acgaaaagga gataagatca ccccaggttc tctggtggag ataaaacgta ctgatgtaga 420
agggaattgg gctctgacag gaggcatgga actgacaaga gaccccactg tccctgagca 480
tgcgtcctta gtcggtcttc tcttgagtct gtataggttg agcaaaatat ccgggcaaaa 540
cactggtaac tataagacaa acattgcaga caggatagag cagatttttg agacagcccc 600
ttttgttaaa atcgtggaac accatactct aatgacaact cacaaaatgt gtgctaattg 660
gagtactata ccaaacttca gatttttggc cggaacctat gacatgtttt tctcccggat 720
tgagcatcta tattcagcaa tcagagtggg cacagttgtc actgcttatg aagactgttc 780
aggactggta tcatttactg ggttcataaa acaaatcaat ctcaccgcta gagaggcaat 840
actatatttc ttccacaaga actttgagga agagataaga agaatgtttg agccagggca 900
ggagacagct gttcctcact cttatttcat ccacttccgt tcactaggct tgagtgggaa 960
atctccttat tcatcaaatg ctgttggtca cgtgttcaat ctcattcact ttgtaggatg 1020
ctatatgggt caagtcagat ccctaaatgc aacggttatt gctgcatgtg ctcctcatga 1080
aatgtctgtt ctagggggct atctgggaga ggaattcttc gggaaaggga catttgaaag 1140
aagattcttc agagatgaga aagaacttca agaatacgag gcggctgaac tgacaaagac 1200
tgacgtagca ctggcagatg atggaactgt caactctgac gacgaggact acttctcagg 1260
tgaaaccaga agtccggagg ctgtttatac tcgaatcatg atgaatggag gtcgactaaa 1320
gagatctcac atacggagat atgtctcagt cagttccaat catcaagccc gtccaaactc 1380
attcgccgag tttctaaaca agacatattc gagtgactca taagaagttg aataacaaaa 1440
tgccggaaat ctacggattg tgtatatcca tcatgaaaaa aactaacacc cctcctttcg 1500
aaccatccca aacatgagca agatctttgt caatcctagt gctattagag ccggtctggc 1560
cgatcttgag atggctgaag aaactgttga tctgatcaat agaaatatcg aagacaatca 1620
ggctcatctc caaggggaac ccatagaagt ggacaatctc cctgaggata tggggcgact 1680
tcacctggat gatggaaaat cgcccaaccc tggtgagatg gccaaggtgg gagaaggcaa 1740
gtatcgagag gactttcaga tggatgaagg agaggatcct agcttcctgt tccagtcata 1800
cctggaaaat gttggagtcc aaatagtcag acaaatgagg tcaggagaga gatttctcaa 1860
gatatggtca cagaccgtag aagagattat atcctatgtc gcggtcaact ttcccaaccc 1920
tccaggaaag tcttcagagg ataaatcaac ccagactact ggccgagagc tcaagaagga 1980
gacaacaccc actccttctc agagagaaag ccaatcatcg aaagccagga tggcggctca 2040
aattgcttct ggccctccag cccttgaatg gtcggccacc aatgaagagg atgatctatc 2100
agtggaggct gagatcgctc accagattgc agaaagtttc tccaaaaaat ataagtttcc 2160
ctctcgatcc tcagggatac tcttgtataa ttttgagcaa ttgaaaatga accttgatga 2220
tatagttaaa gaggcaaaaa atgtaccagg tgtgacccgt ttagcccatg acgggtccaa 2280
actcccccta agatgtgtac tgggatgggt cgctttggcc aactctaaga aattccagtt 2340
gttagtcgaa tccgacaagc tgagtaaaat catgcaagat gacttgaatc gctatacatc 2400
ttgctaaccg aacctctcca ctcagtccct ctagacaata aagtccgaga tgtcctaaag 2460
tcaacatgaa aaaaactaac acccctcctt tcgctgcagc caccatggtt cctcaggctc 2520
tcctgtttgt accccttctg gtttttccat tgtgttttgg gaaattccct atttacacga 2580
taccagacaa gcttggtccc tggagcccga ttgacataca tcacctcagc tgcccaaaca 2640
atttggtagt ggaggacgaa ggatgcacca acctgtcagg gttctcctac atggaactta 2700
aagttggata catcttagcc ataaaaatga acgggttcac ttgcacaggc gttgtgacgg 2760
aggctgaaac ctacactaac ttcgttggtt atgtcacaac cacgttcaaa agaaagcatt 2820
tccgcccaac accagatgca tgtagagccg cgtacaactg gaagatggcc ggtgacccca 2880
gatatgaaga gtctctacac aatccgtacc ctgactacca ctggcttcga actgtaaaaa 2940
ccaccaagga gtctctcgtt atcatatctc caagtgtggc agatttggac ccatatgaca 3000
gatcccttca ctcgagggtc ttccctagcg ggaagtgctc aggagtagcg gtgtcttcta 3060
cctactgctc cactaaccac gattacacca tttggatgcc cgagaatccg agactaggga 3120
tgtcttgtga catttttacc aatagtagag ggaagagagc atccaaaggg agtgagactt 3180
gcggctttgt agatgaaaga ggcctatata agtctttaaa aggagcatgc aaactcaagt 3240
tatgtggagt tctaggactt agacttatgg atggaacatg ggtcgcgatg caaacatcaa 3300
atgaaaccaa atggtgccct cccgatcagt tggtgaacct gcacgacttt cgctcagacg 3360
aaattgagca ccttgttgta gaggagttgg tcaggaagag agaggagtgt ctggatgcac 3420
tagagtccat catgacaacc aagtcagtga gtttcagacg tctcagtcat ttaagaaaac 3480
ttgtccctgg gtttggaaaa gcatatacca tattcaacaa gaccttgatg gaagccgatg 3540
ctcactacaa gtcagtcaga acttggaatg agatcctccc ttcaaaaggg tgtttaagag 3600
ttggggggag gtgtcatcct catgtgaacg gggtgttttt caatggtata atattaggac 3660
ctgacggcaa tgtcttaatc ccagagatgc aatcatccct cctccagcaa catatggagt 3720
tgttggaatc ctcggttatc ccccttgtgc accccctggc agacccgtct accgttttca 3780
aggacggtga cgaggctgag gattttgttg aagttcacct tcccgatgtg cacaatcagg 3840
tctcaggagt tgacttgggt ctcccgaact gggggaagta tgtattactg agtgcagggg 3900
ccctgactgc cttgatgttg ataattttcc tgatgacatg ttgtagaaga gtcaatcgat 3960
cagaacctac gcaacacaat ctcagaggga cagggaggga ggtgtcagtc actccccaaa 4020
gcgggaagat catatcttca tgggaatcac acaagagtgg gggtgagacc agactgtgat 4080
ttggtaccgt cgagaaaaaa acaggcaaca ccactgataa aatgaacttt ctacgtaaga 4140
tagtgaaaaa ttgcagggac gaggacactc aaaaaccctc tcccgtgtca gcccctctgg 4200
atgacgatga cttgtggctt ccaccccctg aatacgtccc gctgaaagaa cttacaagca 4260
agaagaacat gaggaacttt tgtatcaacg gaggggttaa agtgtgtagc ccgaatggtt 4320
actcgttcag gatcctgcgg cacattctga aatcattcga cgagatatat tctgggaatc 4380
ataggatgat cgggttagtc aaagtagtta ttggactggc tttgtcagga tctccagtcc 4440
ctgagggcat gaactgggta tacaaattga ggagaacctt tatcttccag tgggctgatt 4500
ccaggggccc tcttgaaggg gaggagttgg aatactctca ggagatcact tgggatgatg 4560
atactgagtt cgtcggattg caaataagag tgattgcaaa acagtgtcat atccagggca 4620
gaatctggtg tatcaacatg aacccgagag catgtcaact atggtctgac atgtctcttc 4680
agacacaaag gtccgaagag gacaaagatt cctctctgct tctagaataa tcagattata 4740
tcccgcaaat ttatcacttg tttacctctg gaggagagaa catatgggct caactccaac 4800
ccttgggagc aatataacaa aaaacatgtt atggtgccat taaaccgctg catttcatca 4860
aagtcaagtt gattaccttt acattttgat cctcttggat gtgaaaaaaa ctattaacat 4920
ccctcaaaag actcaaggaa agatggttcc tcaggctctc ctgtttgtac cccttctggt 4980
ttttccattg tgttttggga aattccctat ttacacgata ccagacaagc ttggtccctg 5040
gagcccgatt gacatacatc acctcagctg cccaaacaat ttggtagtgg aggacgaagg 5100
atgcaccaac ctgtcagggt tctcctacat ggaacttaaa gttggataca tcttagccat 5160
aaaaatgaac gggttcactt gcacaggcgt tgtgacggag gctgaaacct acactaactt 5220
cgttggttat gtcacaacca cgttcaaaag aaagcatttc cgcccaacac cagatgcatg 5280
tagagccgcg tacaactgga agatggccgg tgaccccaga tatgaagagt ctctacacaa 5340
tccgtaccct gactaccact ggcttcgaac tgtaaaaacc accaaggagt ctctcgttat 5400
catatctcca agtgtggcag atttggaccc atatgacaga tcccttcact cgagggtctt 5460
ccctagcggg aagtgctcag gagtagcggt gtcttctacc tactgctcca ctaaccacga 5520
ttacaccatt tggatgcccg agaatccgag actagggatg tcttgtgaca tttttaccaa 5580
tagtagaggg aagagagcat ccaaagggag tgagacttgc ggctttgtag atgaaagagg 5640
cctatataag tctttaaaag gagcatgcaa actcaagtta tgtggagttc taggacttag 5700
acttatggat ggaacatggg tcgcgatgca aacatcaaat gaaaccaaat ggtgccctcc 5760
cgatcagttg gtgaacctgc acgactttcg ctcagacgaa attgagcacc ttgttgtaga 5820
ggagttggtc aggaagagag aggagtgtct ggatgcacta gagtccatca tgacaaccaa 5880
gtcagtgagt ttcagacgtc tcagtcattt aagaaaactt gtccctgggt ttggaaaagc 5940
atataccata ttcaacaaga ccttgatgga agccgatgct cactacaagt cagtcagaac 6000
ttggaatgag atcctccctt caaaagggtg tttaagagtt ggggggaggt gtcatcctca 6060
tgtgaacggg gtgtttttca atggtataat attaggacct gacggcaatg tcttaatccc 6120
agagatgcaa tcatccctcc tccagcaaca tatggagttg ttggaatcct cggttatccc 6180
ccttgtgcac cccctggcag acccgtctac cgttttcaag gacggtgacg aggctgagga 6240
ttttgttgaa gttcaccttc ccgatgtgca caatcaggtc tcaggagttg acttgggtct 6300
cccgaactgg gggaagtatg tattactgag tgcaggggcc ctgactgcct tgatgttgat 6360
aattttcctg atgacatgtt gtagaagagt caatcgatca gaacctacgc aacacaatct 6420
cagagggaca gggagggagg tgtcagtcac tccccaaagc gggaagatca tatcttcatg 6480
ggaatcacac aagagtgggg gtgagaccag actgtgagga ctggccgtcc tttcaactat 6540
ccaagtcctg aagatcacct ccccttgggg ggttcttttt gaaaaaaacc tgggttcaat 6600
agtcctcctt gaactccatg caactgggta gattcaagag tcatgagatt ttcattaatc 6660
ctctcagttg atcaagcaag atcatgtaga ttctcataat aggggagatc ttctagcagt 6720
ttcagtgact aacggtactt tcattctcca ggaactgaca ccaacagttg tagacaaacc 6780
acggggtgtc tcgggtgact ctgtgcttgg gcacagacaa aggtcatggt gtgttccatg 6840
atagcggact caggatgagt taattgagag aggcagtctt cctcccgtga aggacataag 6900
cagtagctca caatcatctc gcgtctcagc aaagtgtgca taattataaa gtgctgggtc 6960
atctaagctt ttcagtcgag aaaaaaacat tagatcagaa gaacaactgg caacacttct 7020
caacctgaga cctacttcaa gatgctcgat cctggagagg tctatgatga ccctattgac 7080
ccaatcgagt tagaggatga acccagagga acccccactg tccccaacat cttgaggaac 7140
tctgactaca atctcaactc tcctttgata gaagatcctg ctagactaat gttagaatgg 7200
ttaaaaacag ggaatagacc ttatcggatg actctaacag acaattgctc caggtctttc 7260
agagttttga aagattattt caagaaggta gatttgggtt ctctcaaggt gggcggaatg 7320
gctgcacagt caatgatttc tctctggtta tatggtgccc actctgaatc caacaggagc 7380
cggagatgta taacagactt ggcccatttc tattccaagt cgtcccccat agagaagctg 7440
ttgaatctca cgctaggaaa tagagggctg agaatccccc cagagggagt gttaagttgc 7500
cttgagaggg ttgattatga taatgcattt ggaaggtatc ttgccaacac gtattcctct 7560
tacttgttct tccatgtaat caccttatac atgaacgccc tagactggga tgaagaaaag 7620
accatcctag cattatggaa agatttaacc tcagtggaca tcgggaagga cttggtaaag 7680
ttcaaagacc aaatatgggg actgctgatc gtgacaaagg actttgttta ctcccaaagt 7740
tccaattgtc tttttgacag aaactacaca cttatgctaa aagatctttt cttgtctcgc 7800
ttcaactcct taatggtctt gctctctccc ccagagcccc gatactcaga tgacttgata 7860
tctcaactat gccagctgta cattgctggg gatcaagtct tgtctatgtg tggaaactcc 7920
ggctatgaag tcatcaaaat attggagcca tatgtcgtga atagtttagt ccagagagca 7980
gaaaagttta ggcctctcat tcattccttg ggagactttc ctgtatttat aaaagacaag 8040
gtaagtcaac ttgaagagac gttcggtccc tgtgcaagaa ggttctttag ggctctggat 8100
caattcgaca acatacatga cttggttttt gtgtatggct gttacaggca ttgggggcac 8160
ccatatatag attatcgaaa gggtctgtca aaactatatg atcaggttca cattaaaaaa 8220
gtgatagata agtcctacca ggagtgctta gcaagcgacc tagccaggag gatccttaga 8280
tggggttttg ataagtactc caagtggtat ctggattcaa gattcctagc ccgagaccac 8340
cccttgactc cttatatcaa aacccaaaca tggccaccca aacatattgt agacttggtg 8400
ggggatacat ggcacaagct cccgatcacg cagatctttg agattcctga atcaatggat 8460
ccgtcagaaa tattggatga caaatcacat tctttcacca gaacgagact agcttcttgg 8520
ctgtcagaaa accgaggggg acctgttcct agcgaaaaag ttattatcac ggccctgtct 8580
aagccgcctg tcaatccccg agagtttctg aggtctatag acctcggagg attgccagat 8640
gaagacttga taattggcct caagccaaag gaacgggaat tgaagattga aggtcgattc 8700
tttgctctaa tgtcatggaa tctaagattg tattttgtca tcactgaaaa actcttggcc 8760
aactacatct tgccactttt tgacgcgctg actatgacag acaacctgaa caaggtgttt 8820
aaaaagctga tcgacagggt caccgggcaa gggcttttgg actattcaag ggtcacatat 8880
gcatttcacc tggactatga aaagtggaac aaccatcaaa gattagagtc aacagaggat 8940
gtattttctg tcctagatca agtgtttgga ttgaagagag tgttttctag aacacacgag 9000
ttttttcaaa aggcctggat ctattattca gacagatcag acctcatcgg gttacgggag 9060
gatcaaatat actgcttaga tgcgtccaac ggcccaacct gttggaatgg ccaggatggc 9120
gggctagaag gcttacggca gaagggctgg agtctagtca gcttattgat gatagataga 9180
gaatctcaaa tcaggaacac aagaaccaaa atactagctc aaggagacaa ccaggtttta 9240
tgtccgacat atatgttgtc gccagggcta tctcaagagg ggctcctcta tgaattggag 9300
agaatatcaa ggaatgcact ttcgatatac agagccgtcg aggaaggggc atctaagcta 9360
gggctgatca tcaagaaaga agagaccatg tgtagttatg acttcctcat ctatggaaaa 9420
acccctttgt ttagaggtaa catattggtg cctgagtcca aaagatgggc cagagtctct 9480
tgcgtctcta atgaccaaat agtcaacctc gccaatataa tgtcgacagt gtccaccaat 9540
gcgctaacag tggcacaaca ctctcaatct ttgatcaaac cgatgaggga ttttctgctc 9600
atgtcagtac aggcagtctt tcactacctg ctatttagcc caatcttaaa gggaagagtt 9660
tacaagattc tgagcgctga aggggatagc tttctcctag ccatgtcaag gataatctat 9720
ctagatcctt ctttgggagg ggtatctgga atgtccctcg gaagattcca tatacgacag 9780
ttctcagacc ctgtctctga agggttatcc ttctggagag agatctggtt aagctcccac 9840
gagtcctgga ttcacgcgtt gtgtcaagag gctggaaacc cagatcttgg agagagaaca 9900
ctcgagagct tcactcgcct tctagaagat cctaccacct taaatatcag aggaggggcc 9960
agtcctacca ttctactcaa ggatgcaatc agaaaggctt tatatgacga ggtggacaag 10020
gtggagaatt cagagtttcg agaggcaatc ctgttgtcca agacccatag agataatttt 10080
atactcttct taacatctgt tgagcctctg tttcctcgat ttctcagtga gctattcagt 10140
tcgtcttttt tgggaatccc cgagtcaatc attggattga tacaaaactc ccgaacgata 10200
agaaggcagt ttagaaagag tctctcaaaa actttagaag aatccttcta caactcagag 10260
atccacggga ttagtcggat gacccagaca cctcagaggg ttgggggggt gtggccttgc 10320
tcttcagaga gggcagatct acttagggag atctcttggg gaagaaaagt ggtaggcacg 10380
acagttcctc acccttctga gatgttgggg ttacttccca agtcctctat ttcttgcact 10440
tgtggagcaa caggaggagg caatcctaga gtttctgtat cagtactccc gtcctttgat 10500
cagtcatttt tttcacgagg ccccctaaag gggtacttgg gctcgtccac ctctatgtcg 10560
acccagctat tccatgcatg ggaaaaagtc actaatgttc atgtggtgaa gagagctcta 10620
tcgttaaaag aatctataaa ctggttcatt actagagatt ccaacttggc tcaagctcta 10680
attaggaaca ttatgtctct gacaggccct gatttccctc tagaggaggc ccctgtcttc 10740
aaaaggacgg ggtcagcctt gcataggttc aagtctgcca gatacagcga aggagggtat 10800
tcttctgtct gcccgaacct cctctctcat atttctgtta gtacagacac catgtctgat 10860
ttgacccaag acgggaagaa ctacgatttc atgttccagc cattgatgct ttatgcacag 10920
acatggacat cagagctggt acagagagac acaaggctaa gagactctac gtttcattgg 10980
cacctccgat gcaacaggtg tgtgagaccc attgacgacg tgaccctgga gacctctcag 11040
atcttcgagt ttccggatgt gtcgaaaaga atatccagaa tggtttctgg ggctgtgcct 11100
cacttccaga ggcttcccga tatccgtctg agaccaggag attttgaatc tctaagcggt 11160
agagaaaagt ctcaccatat cggatcagct caggggctct tatactcaat cttagtggca 11220
attcacgact caggatacaa tgatggaacc atcttccctg tcaacatata cgacaaggtt 11280
tcccctagag actatttgag agggctcgca aggggagtat tgataggatc ctcgatttgc 11340
ttcttgacaa gaatgacaaa tatcaatatt aatagacctc ttgaattgat ctcaggggta 11400
atctcatata ttctcctgag gctagataac catccctcct tgtacataat gctcagagaa 11460
ccgtctctta gaggagagat attttctatc cctcagaaaa tccccgccgc ttatccaacc 11520
actatgaaag aaggcaacag atcaatcttg tgttatctcc aacatgtgct acgctatgag 11580
cgagagataa tcacggcgtc tccagagaat gactggctat ggatcttttc agactttaga 11640
agtgccaaaa tgacgtacct aaccctcatt acttaccagt ctcatcttct actccagagg 11700
gttgagagaa acctatctaa gagtatgaga gataacctgc gacaattgag ttccttgatg 11760
aggcaggtgc tgggcgggca cggagaagat accttagagt cagacgacaa cattcaacga 11820
ctgctaaaag actctttacg aaggacaaga tgggtggatc aagaggtgcg ccatgcagct 11880
agaaccatga ctggagatta cagccccaac aagaaggtgt cccgtaaggt aggatgttca 11940
gaatgggtct gctctgctca acaggttgca gtctctacct cagcaaaccc ggcccctgtc 12000
tcggagcttg acataagggc cctctctaag aggttccaga accctttgat ctcgggcttg 12060
agagtggttc agtgggcaac cggtgctcat tataagctta agcctattct agatgatctc 12120
aatgttttcc catctctctg ccttgtagtt ggggacgggt caggggggat atcaagggca 12180
gtcctcaaca tgtttccaga tgccaagctt gtgttcaaca gtctcttaga ggtgaatgac 12240
ctgatggctt ccggaacaca tccactgcct ccttcagcaa tcatgagggg aggaaatgat 12300
atcgtctcca gagtgataga ttttgactca atctgggaaa aaccgtccga cttgagaaac 12360
ttggcaacct ggaaatactt ccagtcagtc caaaagcagg tcaacatgtc ctatgacctc 12420
attatttgcg atgcagaagt tactgacatt gcatctatca accggataac cctgttaatg 12480
tccgattttg cattgtctat agatggacca ctctatttgg tcttcaaaac ttatgggact 12540
atgctagtaa atccaaacta caaggctatt caacacctgt caagagcgtt cccctcggtc 12600
acagggttta tcacccaagt aacttcgtct ttttcatctg agctctacct ccgattctcc 12660
aaacgaggga agtttttcag agatgctgag tacttgacct cttccaccct tcgagaaatg 12720
agccttgtgt tattcaattg tagcagcccc aagagtgaga tgcagagagc tcgttccttg 12780
aactatcagg atcttgtgag aggatttcct gaagaaatca tatcaaatcc ttacaatgag 12840
atgatcataa ctctgattga cagtgatgta gaatcttttc tagtccacaa gatggttgat 12900
gatcttgagt tacagagggg aactctgtct aaagtggcta tcattatagc catcatgata 12960
gttttctcca acagagtctt caacgtttcc aaacccctaa ctgacccctt gttctatcca 13020
ccgtctgatc ccaaaatcct gaggcacttc aacatatgtt gcagtactat gatgtatcta 13080
tctactgctt taggtgacgt ccctagcttc gcaagacttc acgacctgta taacagacct 13140
ataacttatt acttcagaaa gcaattcatt cgagggaacg tttatctatc ttggagttgg 13200
tccaacgaca cctcagtgtt caaaagggta gcctgtaatt ctagcctgag tctgtcatct 13260
cactggatca ggttgattta caagatagtg aagactacca gactcgttgg cagcatcaag 13320
gatctatcca gagaagtgga aagacacctt cataggtaca acaggtggat caccctagag 13380
gatatcagat ctagatcatc cctactagac tacagttgcc tgtgatccgg atactcctgg 13440
aagcctgccc atgctaagac tcttgtgtga tgtatcttga aaaaaacaag atcctaaatc 13500
tgaacctttg gttgtttgat tgtttttctc atttttgttg tttatttgtt aagcgt 13556
<210>19
<211>10
<212>DNA
<213>artificial sequence
<220>
<223>hammerhead terminus
<400>19
tgttaagcgt 10
<210>20
<211>27
<212>DNA
<213>artificial sequence
<220>
<223>synthetic oligonucleotide encoding SV40 nuclear localization signal
<400>20
atgccaaaaa agaagagaaa ggtagaa 27
<210>21
<211>10
<212>DNA
<213>artificial sequence
<220>
<223>Artificial Kozak sequence
<400>21
accaccatgg 10
<210>22
<211>10
<212>DNA
<213>artificial sequence
<220>
<223>Artificial Kozak sequence
<400>22
accaccatga 10
<210>23
<211>10
<212>DNA
<213>artificial sequence
<220>
<223>Artificial Kozak sequence
<400>23
accaccatgc 10
<210>24
<211>11
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer Le5
<400>24
acgcttaaca a 11
<210>25
<211>24
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer Blp5
<400>25
gtcgcttgct aagcactcct ggta 24
<210>26
<211>11
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer Le3
<400>26
tgcgaattgt t 11
<210>27
<211>24
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer Blp5
<400>27
ccaggagtgc ttagcaagcg acct 24
<210>28
<211>32
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer Le5-Kpn
<400>28
ccgggtacca cgcttaacaa ccagatcaaa ga 32
<210>29
<211>31
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer Le3-Blp
<400>29
taggtcgctt gctaagcact cctggtagga c 31
<210>30
<211>31
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer Tr5-Blp
<400>30
gtcctaccag gagtgcttag caagcgacct a 31
<210>31
<211>34
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer Tr3-Pst
<400>31
aaaactgcag acgcttaaca aataaacaac aaaa 34
<210>32
<211>79
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer HH1
<400>32
caaggctagc tgttaagcgt ctgatgagtc cgtgaggacg aaactatagg aaaggaattc 60
ctatagtcgg taccacgct 79
<210>33
<211>79
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer HH2
<400>33
agcgtggtac cgactatagg aattcctttc ctatagtttc gtcctcacgg actcatcaga 60
cgcttaacag ctagccttg 79
<210>34
<211>108
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer HDV3
<400>34
gacctgcagg ggtcggcatg gcatctccac ctcctcgcgg tccgacctgg gcatccgaag 60
gaggacgcac gtccactcgg atggctaagg gagggcgcgg ccgcactc 108
<210>35
<211>108
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer HDV4
<400>35
gagtgcggcc gcgccctccc ttagccatcc gagtggacgt gcgtcctcct tcggatgccc 60
aggtcggacc gcgaggaggt ggagatgcca tgccgacccc tgcaggtc 108
<210>36
<211>25
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer 5N
<400>36
accaccatgg atgccgacaa gattg 25
<210>37
<211>33
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer 3N
<400>37
ggcccatggt tatgagtcac tcgaatatgt ctt 33
<210>38
<211>33
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer 5P
<400>38
ttggtaccac catgagcaag atctttgtca atc 33
<210>39
<211>34
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer 3P
<400>39
ggagaggaat tcttagcaag atgtatagcg attc 34
<210>40
<211>32
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer 5G
<400>40
ttggtaccac catggttcct caggctctcc tg 32
<210>41
<211>33
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer 3G
<400>41
aaaactgcag tcacagtctg gtctcacccc cac 33
<210>42
<211>36
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer 5L
<400>42
accgctagca ccaccatgct cgatcctgga gaggtc 36
<210>43
<211>34
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer 3L
<400>43
aaaactgcag tcacaggcaa ctgtagtcta gtag 34
<210>44
<211>38
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer 5T7
<400>44
tcgctagcac caccatgaac acgattaaca tcgctaag 38
<210>45
<211>33
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer 3T7
<400>45
gatgaattct tacgcgaacg cgaagtccga ctc 33
<210>46
<211>63
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer 5T7NLS
<400>46
tcgctagcca ccatgccaaa aaagaagaga aaggtagaaa acacgattaa catcgctaag 60
aac 63
<210>47
<211>31
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer GFP5
<400>47
aaaactgcag gccaccatgg gcgtgatcaa g 31
<210>48
<211>31
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer GFP3
<400>48
ccgctcggta cctattagcc ggcctggcgg g 31
<210>49
<211>25
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer EF5G5
<400>49
caccatggtt cctcaggctc tcctg 25
<210>50
<211>23
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer EF5G3
<400>50
tcacagtctg gtctcacccc cac 23
<210>51
<211>46
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer 5delpsi
<400>51
ccctctgcag tttggtaccg tcgagaaaaa aacattagat cagaag 46
<210>52
<211>24
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer SnaB5
<400>52
atgaactttc tacgtaagat agtg 24
<210>53
<211>46
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer 3delpsi
<400>53
caaactgcag aggggtgtta gtttttttca aaaagaaccc cccaag 46
<210>54
<211>44
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer 3deltag
<400>54
caaactgcag aggggtgtta gtttttttca catccaagag gatc 44
<210>55
<211>42
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer 5deltag
<400>55
cctctgcagt ttggtacctt gaaaaaaacc tgggttcaat ag 42
<210>56
<211>35
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer M5G
<400>56
ctcactacaa gtcagtcgag acttggaatg agatc 35
<210>57
<211>35
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer M3G
<400>57
gactgacttt gagtgagcat cggcttccat caagg 35
<210>58
<211>524
<212>PRT
<213>artificial sequence
<220>
<223>Mutated G protein Aa333
<400>58
Met Val Pro Gln Ala Leu Leu Phe Val Pro Leu Leu Val Phe Pro Leu
1 5 10 15
Cys Phe Gly Lys Phe Pro Ile Tyr Thr Ile Pro Asp Lys Leu Gly Pro
20 25 30
Trp Ser Pro Ile Asp Ile His His Leu Ser Cys Pro Asn Asn Leu Val
35 40 45
Val Glu Asp Glu Gly Cys Thr Asn Leu Ser Gly Phe Ser Tyr Met Glu
50 55 60
Leu Lys Val Gly Tyr Ile Leu Ala Ile Lys Met Asn Gly Phe Thr Cys
65 70 75 80
Thr Gly Val Val Thr Glu Ala Glu Thr Tyr Thr Asn Phe Val Gly Tyr
85 90 95
Val Thr Thr Thr Phe Lys Arg Lys His Phe Arg Pro Thr Pro Asp Ala
100 105 110
Cys Arg Ala Ala Tyr Asn Trp Lys Met Ala Gly Asp Pro Arg Tyr Glu
115 120 125
Glu Ser Leu His Asn Pro Tyr Pro Asp Tyr His Trp Leu Arg Thr Val
130 135 140
Lys Thr Thr Lys Glu Ser Leu Val Ile Ile Ser Pro Ser Val Ala Asp
145 150 155 160
Leu Asp Pro Tyr Asp Arg Ser Leu His Ser Arg Val Phe Pro Ser Gly
165 170 175
Lys Cys Ser Gly Val Ala Val Ser Ser Thr Tyr Cys Ser Thr Asn His
180 185 190
Asp Tyr Thr Ile Trp Met Pro Glu Asn Pro Arg Leu Gly Met Ser Cys
195 200 205
Asp Ile Phe Thr Asn Ser Arg Gly Lys Arg Ala Ser Lys Gly Ser Glu
210 215 220
Thr Cys Gly Phe Val Asp Glu Arg Gly Leu Tyr Lys Ser Leu Lys Gly
225 230 235 240
Ala Cys Lys Leu Lys Leu Cys Gly Val Leu Gly Leu Arg Leu Met Asp
245 250 255
Gly Thr Trp Val Ala Met Gln Thr Ser Asn Glu Thr Lys Trp Cys Pro
260 265 270
Pro Asp Gln Leu Val Asn Leu His Asp Phe Arg Ser Asp Glu Ile Glu
275 280 285
His Leu Val Val Glu Glu Leu Val Arg Lys Arg Glu Glu Cys Leu Asp
290 295 300
Ala Leu Glu Ser Ile Met Thr Thr Lys Ser Val Ser Phe Arg Arg Pro
305 310 315 320
Ser His Leu Arg Lys Leu Val Pro Gly Phe Gly Lys Ala Tyr Thr Ile
325 330 335
Phe Asn Lys Thr Leu Met Glu Ala Asp Ala His Tyr Lys Ser Val Glu
340 345 350
Thr Trp Asn Glu Ile Leu Pro Ser Lys Gly Cys Leu Arg Val Gly Gly
355 360 365
Arg Cys His Pro His Val Asn Gly Val Phe Phe Asn Gly Ile Ile Leu
370 375 380
Gly Pro Asp Gly Asn Val Leu Ile Pro Glu Met Gln Ser Ser Leu Leu
385 390 395 400
Gln Gln His Met Glu Leu Leu Glu Ser Ser Val Ile Pro Leu Val His
405 410 415
Pro Leu Ala Asp Pro Ser Thr Val Phe Lys Asp Gly Asp Glu Ala Glu
420 425 430
Asp Phe Val Glu Val His Leu Pro Asp Val His Asn Gln Val Ser Gly
435 440 445
Val Asp Leu Gly Leu Pro Asn Trp Gly Lys Tyr Val Leu Leu Ser Ala
450 455 460
Gly Ala Leu Thr Ala Leu Met Leu Ile Ile Phe Leu Met Thr Cys Cys
465 470 475 480
Arg Arg Val Asn Arg Ser Glu Pro Thr Gln His Asn Leu Arg Gly Thr
485 490 495
Gly Arg Glu Val Ser Val Thr Pro Gln Ser Gly Lys Ile Ile Ser Ser
500 505 510
Trp Glu Ser His Lys Ser Gly Gly Glu Thr Arg Leu
515 520
<210>59
<211>60
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer 3psicis
<400>59
ccaaactgca gcgaaaggag gggtgttagt ttttttcatg atgaaccccc caaggggagg 60
<210>60
<211>39
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer cis55
<400>60
gactcactat agggagaccc aagctggcta gctgttaag 39
<210>61
<211>60
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer cis53
<400>61
ccaaactgca gcgaaaggag gggtgttagt ttttttcatg ttgactttag gacatctcgg 60
<210>62
<211>61
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer cis35
<400>62
cctttcgctg cagtttggta ccgtcgagaa aaaaacaggc aacaccactg ataaaatgaa 60
c 61
<210>63
<211>30
<212>DNA
<213>artificial sequence
<220>
<223>cis33
<400>63
cctccccttc aagagggccc ctggaatcag 30
<210>64
<211>46
<212>DNA
<213>artificial sequence
<220>
<223>oligonucleotide primer TU1
<400>64
ctaacacccc tcctttcgct gcagtttggt accgtcgaga aaaaaa 46
<210>65
<211>53
<212>DNA
<213>artificial sequence
<220>
<223>TU2
<400>65
tttttttgat tgtggggagg aaagcgacgt caaaccatgg cagctctttt ttt 53
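The relationships between some of the listed oligonucleotides can be checked mechanically. The following is a minimal Python sketch, not part of the filed listing: the helper names (`clean`, `reverse_complement`, `feature_length`) are hypothetical and introduced only for illustration. It checks that primer HH2 (SEQ ID NO:33) is the reverse complement of HH1 (SEQ ID NO:32), that the hammerhead terminus (SEQ ID NO:19) is the reverse complement of the first ten nucleotides of primer Le5 (SEQ ID NO:24), and that the six `<222>` feature spans of SEQ ID NO:18 each cover a whole number of codons. The listing does not label those features, so reading them as coding regions is an assumption.

```python
import re

# Lowercase DNA complement table (the listing uses lowercase a/c/g/t).
COMPLEMENT = str.maketrans("acgt", "tgca")


def clean(seq: str) -> str:
    """Drop spaces, digits and anything else that is not a/c/g/t."""
    return re.sub(r"[^acgt]", "", seq.lower())


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a listing-style DNA string."""
    return clean(seq).translate(COMPLEMENT)[::-1]


def feature_length(location: str) -> int:
    """Length in nucleotides of a '(start)..(end)' location (inclusive)."""
    match = re.fullmatch(r"\((\d+)\)\.\.\((\d+)\)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    start, end = map(int, match.groups())
    return end - start + 1


# Sequences copied from the listing above.
HAMMERHEAD_TERMINUS = "tgttaagcgt"   # SEQ ID NO:19
LE5 = "acgcttaaca a"                 # SEQ ID NO:24
HH1 = (                              # SEQ ID NO:32
    "caaggctagc tgttaagcgt ctgatgagtc cgtgaggacg aaactatagg aaaggaattc "
    "ctatagtcgg taccacgct"
)
HH2 = (                              # SEQ ID NO:33
    "agcgtggtac cgactatagg aattcctttc ctatagtttc gtcctcacgg actcatcaga "
    "cgcttaacag ctagccttg"
)
# <222> locations given for SEQ ID NO:18.
SEQ18_FEATURES = [
    "(71)..(1420)", "(1514)..(2404)", "(2505)..(4076)",
    "(4122)..(4727)", "(4943)..(6514)", "(7042)..(13422)",
]

if __name__ == "__main__":
    print("HH2 is reverse complement of HH1:",
          reverse_complement(HH1) == clean(HH2))
    print("SEQ 19 pairs with start of Le5:",
          reverse_complement(HAMMERHEAD_TERMINUS) == clean(LE5)[:10])
    for location in SEQ18_FEATURES:
        n = feature_length(location)
        print(location, n, "nt, whole codons:", n % 3 == 0)
```

Two of the SEQ ID NO:18 spans are 1572 nucleotides long, that is 524 codons, which would be consistent with the 524-residue G protein of SEQ ID NO:58 if those spans are read as G open reading frames without a stop codon.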
Claims (40)
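1. A vector system, comprising:
a first vector containing full-length rabies virus antigenomic DNA, wherein the full-length antigenomic DNA is selected from full-length rabies virus antigenomic DNA or a derivative thereof; and
a plurality of helper vectors containing nucleic acids encoding at least one rabies virus strain ERA protein,
wherein expression of the plurality of vectors in a transfected host cell results in production of a recombinant rabies virus.
2. The vector system of claim 1, wherein the full-length antigenomic DNA comprises ERA strain antigenomic DNA or a derivative thereof.
3. The vector system of claim 2, wherein the full-length antigenomic DNA is selected from SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, or SEQ ID NO:18.
4. The vector system of any one of claims 1-3, wherein the vectors are plasmids.
5. The vector system of any one of claims 1-4, wherein the first vector comprises, in the 5′ to 3′ direction: a hammerhead ribozyme; a rabies virus antigenomic cDNA; and a hepatitis delta virus ribozyme, wherein a plurality of nucleotides of the hammerhead ribozyme are complementary to the antisense genomic sequence of the rabies virus.
6. The vector system of claim 5, wherein transcription of the antigenomic cDNA is under the transcriptional control of at least one of a CMV promoter and a bacteriophage T7 RNA polymerase promoter.
7. The vector system of claim 6, wherein transcription of the antigenomic cDNA is under the transcriptional control of both a CMV promoter and a bacteriophage T7 RNA polymerase promoter.
8. The vector system of any one of claims 1-4, wherein the plurality of helper vectors comprises:
a vector containing a polynucleotide sequence encoding rabies virus N protein;
a vector containing a polynucleotide sequence encoding rabies virus P protein;
a vector containing a polynucleotide sequence encoding rabies virus M protein;
a vector containing a polynucleotide sequence encoding rabies virus L protein; and
a vector containing a polynucleotide sequence encoding bacteriophage T7 RNA polymerase.
9. The vector system of claim 8, further comprising a vector containing a polynucleotide sequence encoding rabies virus G protein.
10. The vector system of claim 8 or 9, wherein the T7 RNA polymerase comprises a nuclear localization signal (NLS).
11. The vector system of claim 10, wherein transcription of the polynucleotide sequence encoding rabies virus N protein is under the transcriptional control of a T7 promoter, and wherein the transcription is cap-independent.
12. The vector system of any one of claims 8-11, wherein transcription of one or more of the polynucleotide sequences encoding rabies virus P, M, L, or G protein or T7 polymerase is under the transcriptional control of both a CMV promoter and a T7 promoter.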
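13. A recombinant viral genome, comprising the nucleic acid set forth in SEQ ID NO:1.
14. A recombinant viral genome, or a derivative thereof, comprising the nucleic acid set forth in SEQ ID NO:7.
15. The recombinant viral genome of claim 14, wherein the derivative viral genome comprises SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, or SEQ ID NO:18.
16. The recombinant viral genome of any one of claims 13-15, further comprising a vector.
17. The recombinant viral genome of claim 16, wherein the vector is a plasmid.
18. A recombinant virus, comprising the genome of claim 14 or 15.
19. The recombinant virus of claim 18, wherein the virus is an attenuated virus.
20. A live rabies vaccine comprising at least one recombinant rabies virus genome, wherein the at least one recombinant rabies genome comprises:
the sequence set forth in SEQ ID NO:8;
the sequence set forth in SEQ ID NO:10; or
the sequence set forth in SEQ ID NO:13.
21. The rabies vaccine of claim 20, which is attenuated.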
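22. An isolated protein, comprising an amino acid sequence set forth in:
SEQ ID NO:2 (N protein);
SEQ ID NO:3 (P protein);
SEQ ID NO:4 (M protein);
SEQ ID NO:5 (G protein);
SEQ ID NO:6 (L protein); or
SEQ ID NO:58 (G protein Aa333).
23. An isolated nucleic acid molecule encoding any one of the proteins according to claim 22.
24. The isolated nucleic acid molecule of claim 23, comprising a nucleotide sequence set forth in:
a) nucleotides 71-1423 of SEQ ID NO:1 (N protein);
b) nucleotides 1511-2407 of SEQ ID NO:1 (P protein);
c) nucleotides 2491-3104 of SEQ ID NO:1 (M protein);
d) nucleotides 3318-4892 of SEQ ID NO:1 (G protein);
e) nucleotides 5418-11,801 of SEQ ID NO:1 (L protein); or
f) a nucleotide sequence that differs from one of a) to e) only due to the degeneracy of the genetic code.
25. The isolated nucleic acid molecule of claim 23, comprising the nucleotide sequence set forth in nucleotides 3317-4888 of SEQ ID NO:8, or a nucleotide sequence that differs therefrom only due to the degeneracy of the genetic code.
26. A composition, comprising at least one isolated protein of claim 22.
27. The composition of claim 26, further comprising a pharmaceutically acceptable carrier, an adjuvant, or a combination of two or more thereof.
28. A method of eliciting an immune response against an antigenic epitope in a subject, comprising introducing the composition of claim 26 into the subject, thereby eliciting an immune response in the subject.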
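29. A method for sequencing a full-length rabies virus genome, comprising:
reverse transcribing the rabies virus genome to produce a complementary DNA strand;
amplifying a first portion and a second portion of the complementary DNA strand to produce first and second amplified rabies virus genome fragments;
cloning the first and second amplified rabies virus genome fragments into a vector to produce a contiguous rabies virus antigenome; and
sequencing the full-length rabies virus antigenome.
30. A method of making a live rabies virus vaccine, comprising introducing the vector system of any one of claims 8-19 into a host cell, and recovering live recombinant rabies virus.
31. The method of claim 30, wherein the recovered live recombinant rabies virus is suitable for use as a live rabies virus vaccine.
32. A live rabies virus vaccine prepared by the method of claim 31.
33. A method of vaccinating a subject against rabies, comprising administering to the subject an effective amount of the live rabies vaccine of claim 20 or claim 32, such that cells of the subject are infected with the rabies vaccine, wherein an anti-rabies immune response is produced in the subject.
34. The method of claim 33, wherein the subject is a human.
35. The method of claim 33, wherein the subject is a non-human animal.
36. The method of claim 35, wherein the non-human animal is a cat, dog, rat, mouse, bat, fox, raccoon, squirrel, opossum, coyote, or wolf.
37. The method of any one of claims 34-36, wherein the administering comprises oral administration.
38. The method of claim 37, wherein the oral administration comprises administration via a food bait designed to vaccinate a wild animal population.
39. A pharmaceutical composition comprising the live rabies vaccine of claim 20 or claim 32 and a pharmaceutically acceptable carrier or excipient.
40. The pharmaceutical composition of claim 39, wherein the rabies vaccine is attenuated.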
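Claims 24 and 25 identify coding regions by nucleotide ranges within SEQ ID NO:1 and SEQ ID NO:8. The following is a minimal sketch, not part of the patent, of how such ranges could be tabulated and applied to a sequence string. It assumes the ranges are 1-based and inclusive; the names `Region`, `CLAIM_24_REGIONS`, `CLAIM_25_REGION`, and `summarize` are hypothetical, and the sequences themselves are not reproduced here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    """A nucleotide range as written in the claims (assumed 1-based, inclusive)."""
    label: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive range, so both endpoints are counted.
        return self.end - self.start + 1

    def extract(self, sequence: str) -> str:
        # Convert the 1-based inclusive range to a Python slice.
        return sequence[self.start - 1:self.end]


# Ranges copied from claim 24 (positions in SEQ ID NO:1).
CLAIM_24_REGIONS = [
    Region("N", 71, 1423),
    Region("P", 1511, 2407),
    Region("M", 2491, 3104),
    Region("G", 3318, 4892),
    Region("L", 5418, 11801),
]

# Range copied from claim 25 (positions in SEQ ID NO:8).
CLAIM_25_REGION = Region("G (SEQ ID NO:8)", 3317, 4888)


def summarize(regions):
    """Return (label, length, length divisible by 3) for each region."""
    return [(r.label, r.length, r.length % 3 == 0) for r in regions]


if __name__ == "__main__":
    for label, length, whole_codons in summarize(CLAIM_24_REGIONS + [CLAIM_25_REGION]):
        print(f"{label}: {length} nt, whole number of codons: {whole_codons}")
```

Given the actual sequence listing as a string, `Region.extract` would return the claimed sub-sequence. The divisibility flag is only a quick consistency check: a range whose length is not a whole number of codons is worth comparing against the sequence listing before relying on it.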
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510811024.XA CN105441483B (zh) | 2005-10-14 | 2006-10-13 | Rabies virus compositions and methods |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US72703805P | 2005-10-14 | 2005-10-14 | |
US60/727,038 | 2005-10-14 | ||
PCT/US2006/040134 WO2007047459A1 (en) | 2005-10-14 | 2006-10-13 | Rabies virus vector systems and compositions and methods thereof |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510811024.XA Division CN105441483B (zh) | Rabies virus compositions and methods | 2005-10-14 | 2006-10-13 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101287838A (zh) | 2008-10-15 |
CN101287838B (zh) | 2016-03-30 |
Family
ID=37775467
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200680038314.4A Active CN101287838B (zh) | Rabies virus compositions and methods | 2005-10-14 | 2006-10-13 |
Country Status (9)
Country | Link |
---|---|
US (2) | US7863041B2 (zh) |
EP (2) | EP1945780B1 (zh) |
CN (1) | CN101287838B (zh) |
AU (1) | AU2006304268B2 (zh) |
BR (1) | BRPI0617373B1 (zh) |
CA (2) | CA2943039C (zh) |
HK (2) | HK1223968A1 (zh) |
WO (1) | WO2007047459A1 (zh) |
ZA (1) | ZA200803131B (zh) |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008054544A2 (en) * | 2006-05-22 | 2008-05-08 | Immune Disease Institute, Inc. | Method for delivery across the blood brain barrier |
JP5733976B2 (ja) | 2007-03-30 | 2015-06-10 | The Research Foundation for The State University of New York | Attenuated viruses useful for vaccines |
WO2010033337A1 (en) * | 2008-09-17 | 2010-03-25 | The Government of the United States of America as represented by the Secretary of the Department | Rabies virus-based recombinant immunocontraceptive compositions and methods of use |
WO2013081571A2 (en) * | 2010-01-14 | 2013-06-06 | The Government Of The United States Of America, As Represented By The Secretary, Department Of Health And Human Services, Centers For Disease Control And Prevention | Isolated lyssavirus nucleic acid and protein sequences |
JP6091435B2 (ja) * | 2011-02-22 | 2017-03-08 | California Institute of Technology | Delivery of proteins using adeno-associated virus (AAV) vectors |
CN103906843B (zh) * | 2011-06-08 | 2016-12-07 | Vishwas Joshi | Two plasmid mammalian expression system |
CA2838614A1 (en) | 2011-06-08 | 2013-04-04 | Vishwas Joshi | Two plasmid mammalian expression system |
US8689255B1 (en) | 2011-09-07 | 2014-04-01 | Imdb.Com, Inc. | Synchronizing video content with extrinsic data |
US9889192B2 (en) | 2012-10-01 | 2018-02-13 | Thomas Jefferson University | Immunization with rabies virus vector expressing foreign protein antigen |
EP3065711A4 (en) | 2013-11-05 | 2017-06-21 | Takara Bio USA, Inc. | Dry transfection compositions and methods for making and using the same |
RU2626605C2 (ru) * | 2015-11-25 | 2017-07-28 | Federal State Budgetary Institution of Science Engelhardt Institute of Molecular Biology of the Russian Academy of Sciences (IMB RAS) | Genetic (recombinant) DNA construct containing a codon-optimized gene of the rabies virus glycoprotein (G protein) with a consensus amino acid sequence compiled taking into account the amino acid sequences of the G protein isolated from rabies virus strains circulating in the territory of the Russian Federation |
KR102334083B1 (ko) * | 2020-05-04 | 2021-12-03 | Republic of Korea | Recombinant rabies virus expressing a fluorescent protein and antibody detection method using the same |
Family Cites Families (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4040904A (en) * | 1975-07-21 | 1977-08-09 | Slater Eban A | Novel rabies virus vaccine and processes |
US4393201A (en) * | 1981-11-04 | 1983-07-12 | The Wistar Institute | DNA Which codes for glycoprotein of era-strain rabies virus |
AU622426B2 (en) | 1987-12-11 | 1992-04-09 | Abbott Laboratories | Assay using template-dependent nucleic acid probe reorganization |
FR2633832B1 (fr) | 1988-07-05 | 1991-05-31 | Virbac | Avirulent anti-rabies vaccine |
EP0425563B1 (en) | 1988-07-20 | 1996-05-15 | David Segev | Process for amplifying and detecting nucleic acid sequences |
US5427930A (en) | 1990-01-26 | 1995-06-27 | Abbott Laboratories | Amplification of target nucleic acids using gap filling ligase chain reaction |
FR2693655B1 (fr) | 1992-07-20 | 1994-10-14 | Virbac | Avirulent anti-rabies vaccine |
WO1994027639A1 (en) * | 1993-05-26 | 1994-12-08 | Thomas Jefferson University | Methods for treating post-exposure rabies and anti-rabies compositions |
WO1995009249A1 (en) | 1993-09-30 | 1995-04-06 | Thomas Jefferson University | Oral vaccination of mammals |
US5648211A (en) | 1994-04-18 | 1997-07-15 | Becton, Dickinson And Company | Strand displacement amplification using thermophilic enzymes |
KR100392551B1 (ko) | 1994-07-15 | 2003-10-30 | Akzo Nobel N.V. | Use of RNA polymerase to improve nucleic acid amplification methods |
PT702085E (pt) | 1994-07-18 | 2004-04-30 | Karl Klaus Conzelmann | Recombinant infectious non-segmented negative-strand RNA virus |
WO1996010400A1 (en) * | 1994-09-30 | 1996-04-11 | The Uab Research Foundation | Gene therapy vectors and vaccines based on non-segmented negatives stranded rna viruses |
AT402203B (de) | 1995-06-13 | 1997-03-25 | Himmler Gottfried Dipl Ing Dr | Method for the transcription-free amplification of nucleic acids |
DK0780475T4 (da) | 1995-08-09 | 2006-10-23 | Schweiz Serum & Impfinst | Method for the production of infectious negative-strand RNA viruses |
WO2002036170A2 (en) * | 2000-11-03 | 2002-05-10 | Oxford Biomedica (Uk) Limited | Vector system for transducing the positive neurons |
ATE356202T1 (de) | 1998-11-27 | 2007-03-15 | Intervet Int Bv | Attenuated stable rabies virus mutant and live vaccines |
US20030224017A1 (en) * | 2002-03-06 | 2003-12-04 | Samal Siba K. | Recombinant Newcastle disease viruses useful as vaccines or vaccine vectors |
US7074413B2 (en) * | 2000-03-23 | 2006-07-11 | Thomas Jefferson University | Genetically engineered rabies recombinant vaccine for immunization of stray dogs and wildlife |
DE60208854T2 (de) | 2001-04-23 | 2006-08-17 | Akzo Nobel N.V. | Attenuated recombinant stable rabies virus mutants and live vaccines |
EP1434862A4 (en) * | 2001-07-20 | 2005-04-06 | Univ Georgia Res Found | NUCLEOPROTEIC MUTATION-RELATED VIRUS AT A PHOSPHORYLATION SITE FOR OBTAINING A VACCINE AGAINST RABIES AND GENE THERAPY IN THE CENTRAL NERVOUS SYSTEM |
CA2463090C (en) * | 2001-10-10 | 2012-06-12 | Thomas Jefferson University | Recombinant rabies vaccine and methods of preparation and use |
US9912298B2 (en) | 2014-05-13 | 2018-03-06 | Skyworks Solutions, Inc. | Systems and methods related to linear load modulated power amplifiers |
2006
- 2006-10-13 WO PCT/US2006/040134 patent/WO2007047459A1/en active Application Filing
- 2006-10-13 US US12/090,083 patent/US7863041B2/en active Active
- 2006-10-13 BR BRPI0617373-0A patent/BRPI0617373B1/pt active IP Right Grant
- 2006-10-13 CN CN200680038314.4A patent/CN101287838B/zh active Active
- 2006-10-13 CA CA2943039A patent/CA2943039C/en active Active
- 2006-10-13 CA CA2624768A patent/CA2624768C/en active Active
- 2006-10-13 EP EP06836301.9A patent/EP1945780B1/en active Active
- 2006-10-13 EP EP11155879.7A patent/EP2351843B1/en active Active
- 2006-10-13 AU AU2006304268A patent/AU2006304268B2/en active Active
2008
- 2008-04-09 ZA ZA2008/03131A patent/ZA200803131B/en unknown
- 2008-12-17 HK HK16111244.3A patent/HK1223968A1/zh unknown
- 2008-12-17 HK HK08113710.4A patent/HK1123326A1/zh unknown
2010
- 2010-11-30 US US12/956,949 patent/US8865461B2/en active Active
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
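CN101463355B (zh) * | 2009-01-15 | 2011-07-13 | National Institute for Viral Disease Control and Prevention, Chinese Center for Disease Control and Prevention | Complete genome sequence of rabies street virus strain HN10, and preparation method and use thereof |
CN101586120B (zh) * | 2009-07-15 | 2011-06-15 | Harbin Veterinary Research Institute, Chinese Academy of Agricultural Sciences | Reverse genetics system for rabies virus Flury-LEP vaccine strain and LEP green fluorescent protein recombinant virus vector |
CN102741399A (zh) * | 2009-10-20 | 2012-10-17 | Novartis AG | Improved reverse genetics methods for virus rescue |
CN103068985A (zh) * | 2010-06-24 | 2013-04-24 | The Government of the United States of America, as represented by the Secretary, Department of Health and Human Services, Centers for Disease Control and Prevention | Pan-lyssavirus vaccines against rabies |
CN103881985A (zh) * | 2014-02-19 | 2014-06-25 | 北京中联康生物科技有限公司 | Rabies virus ERA attenuated vaccine mutant strain, preparation method thereof, and live rabies vaccine |
CN103877570A (zh) * | 2014-02-19 | 2014-06-25 | 北京中联康生物科技有限公司 | Live rabies vaccine and method for preparing oral live rabies vaccine |
CN103877570B (zh) * | 2014-02-19 | 2016-01-20 | 北京中联康生物科技有限公司 | Live rabies vaccine and method for preparing oral live rabies vaccine |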
Also Published As
Publication number | Publication date |
---|---|
EP2351843A3 (en) | 2011-11-09 |
US7863041B2 (en) | 2011-01-04 |
WO2007047459A1 (en) | 2007-04-26 |
ZA200803131B (en) | 2018-11-28 |
US20110070264A1 (en) | 2011-03-24 |
HK1123326A1 (zh) | 2009-06-12 |
US20080274130A1 (en) | 2008-11-06 |
CA2624768A1 (en) | 2007-04-26 |
EP1945780B1 (en) | 2015-09-16 |
CA2943039C (en) | 2018-11-06 |
WO2007047459A8 (en) | 2007-09-20 |
AU2006304268A2 (en) | 2008-06-12 |
BRPI0617373B1 (pt) | 2022-04-05 |
EP1945780A1 (en) | 2008-07-23 |
EP2351843A2 (en) | 2011-08-03 |
CA2624768C (en) | 2016-10-04 |
CN101287838B (zh) | 2016-03-30 |
CA2943039A1 (en) | 2007-04-26 |
EP2351843B1 (en) | 2013-06-12 |
HK1223968A1 (zh) | 2017-08-11 |
AU2006304268A1 (en) | 2007-04-26 |
AU2006304268B2 (en) | 2011-09-29 |
BRPI0617373A2 (pt) | 2013-01-01 |
US8865461B2 (en) | 2014-10-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101287838A (zh) | Rabies virus compositions and methods | |
Bendahmane et al. | Display of epitopes on the surface of tobacco mosaic virus: impact of charge and isoelectric point of the epitope on virus-host interactions | |
JP3816126B2 (ja) | Recombinant infectious non-segmented negative-strand RNA virus | |
AU2017249424A1 (en) | Recombinant arterivirus replicon systems and uses thereof | |
KR101745029B1 (ko) | Recombinant avian paramyxovirus vaccine and methods for making and using the same | |
KR19990067271A (ko) | Recombinant Sendai virus | |
Shoji et al. | Generation and characterization of P gene-deficient rabies virus | |
KR20230111189A (ko) | Reprogrammable IscB nucleases and uses thereof | |
Morimoto et al. | Characterization of P gene-deficient rabies virus: propagation, pathogenicity and antigenicity | |
CN110951699A (zh) | Recombinant rabies virus expressing canine distemper virus structural protein and use thereof | |
US20160114026A1 (en) | Pan-lyssavirus vaccines against rabies | |
US20020009458A1 (en) | DNA sequences, molecules, vectors and vaccines for feline calicivirus disease and methods for producing and using same | |
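CN105441483B (zh) | Rabies virus compositions and methods | |
KR20100121823A (ko) | Recombinant viral vector using bovine enterovirus and method for producing the same | |
JP2005065596A (ja) | Replication-deficient rabies virus | |
CN108642019A (zh) | Recombinant rabies oral vaccine strain resistant to gastrointestinal degradation and preparation method thereof | |
CN114807232A (zh) | Construction of a West Nile virus infectious clone and use thereof | |
KR20230005191A (ko) | Defective interfering virus genome | |
MX2008004860A (es) | Rabies virus vector systems and compositions and methods thereof | |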
Kemdirim | Molecular cloning of the polymerase genes of influenza B virus: complete nucleotide sequence of the virus genome RNA segment encoding the PBI protein | |
MXPA99011361A (en) | Dna sequences, molecules, vectors and vaccines for feline calicivirus disease and methods for producing and using same |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
REG | Reference to a national code | Ref country code: HK; Ref legal event code: DE; Ref document number: 1123326; Country of ref document: HK |
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
REG | Reference to a national code | Ref country code: HK; Ref legal event code: GR; Ref document number: 1123326; Country of ref document: HK |