CN1261403A - 新的肺炎衣原体表面蛋白 - Google Patents
新的肺炎衣原体表面蛋白 Download PDFInfo
- Publication number
- CN1261403A CN1261403A CN98806428A CN98806428A CN1261403A CN 1261403 A CN1261403 A CN 1261403A CN 98806428 A CN98806428 A CN 98806428A CN 98806428 A CN98806428 A CN 98806428A CN 1261403 A CN1261403 A CN 1261403A
- Authority
- CN
- China
- Prior art keywords
- seq
- ser
- gly
- thr
- leu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 241001647372 Chlamydia pneumoniae Species 0.000 title claims abstract description 129
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 114
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 42
- 208000015181 infectious disease Diseases 0.000 claims abstract description 41
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract description 15
- 238000003745 diagnosis Methods 0.000 claims abstract description 11
- 150000007523 nucleic acids Chemical group 0.000 claims description 60
- 238000012360 testing method Methods 0.000 claims description 44
- 235000018102 proteins Nutrition 0.000 claims description 38
- 108020004707 nucleic acids Proteins 0.000 claims description 29
- 102000039446 nucleic acids Human genes 0.000 claims description 29
- 241000124008 Mammalia Species 0.000 claims description 26
- 238000000034 method Methods 0.000 claims description 26
- 238000003752 polymerase chain reaction Methods 0.000 claims description 22
- 238000002405 diagnostic procedure Methods 0.000 claims description 18
- 206010035664 Pneumonia Diseases 0.000 claims description 14
- 101710116435 Outer membrane protein Proteins 0.000 claims description 12
- 230000003053 immunization Effects 0.000 claims description 10
- 238000002649 immunization Methods 0.000 claims description 9
- 239000000203 mixture Substances 0.000 claims description 9
- 241000498849 Chlamydiales Species 0.000 claims description 7
- 238000009007 Diagnostic Kit Methods 0.000 claims description 7
- 239000002773 nucleotide Substances 0.000 claims description 7
- 125000003729 nucleotide group Chemical group 0.000 claims description 7
- 210000002808 connective tissue Anatomy 0.000 claims description 5
- 230000003321 amplification Effects 0.000 claims description 4
- 238000001514 detection method Methods 0.000 claims description 4
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 4
- 230000008827 biological function Effects 0.000 claims description 3
- 229960005486 vaccine Drugs 0.000 abstract description 21
- 108091028043 Nucleic acid sequence Proteins 0.000 abstract description 15
- 108010052285 Membrane Proteins Proteins 0.000 abstract description 3
- 230000007170 pathology Effects 0.000 abstract description 3
- 230000001717 pathogenic effect Effects 0.000 abstract description 2
- 230000000241 respiratory effect Effects 0.000 abstract description 2
- 244000052769 pathogen Species 0.000 abstract 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 139
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 92
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 85
- 102220369445 c.668T>C Human genes 0.000 description 76
- 102220023256 rs387907547 Human genes 0.000 description 68
- 235000001014 amino acid Nutrition 0.000 description 66
- 150000001413 amino acids Chemical class 0.000 description 66
- 102220023257 rs387907546 Human genes 0.000 description 43
- 102220023258 rs387907548 Human genes 0.000 description 42
- 102220369447 c.1352G>A Human genes 0.000 description 36
- 108020004414 DNA Proteins 0.000 description 33
- 210000002966 serum Anatomy 0.000 description 28
- 238000006243 chemical reaction Methods 0.000 description 23
- 230000009182 swimming Effects 0.000 description 23
- 102220369446 c.1274G>A Human genes 0.000 description 21
- 241000699666 Mus <mouse, genus> Species 0.000 description 20
- 239000000523 sample Substances 0.000 description 20
- 241000283973 Oryctolagus cuniculus Species 0.000 description 18
- 108090000765 processed proteins & peptides Proteins 0.000 description 18
- 239000000427 antigen Substances 0.000 description 17
- 108091007433 antigens Proteins 0.000 description 17
- 102000036639 antigens Human genes 0.000 description 17
- 210000004027 cell Anatomy 0.000 description 17
- 239000000047 product Substances 0.000 description 16
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 14
- 241000894006 Bacteria Species 0.000 description 13
- 241000606153 Chlamydia trachomatis Species 0.000 description 13
- 229940038705 chlamydia trachomatis Drugs 0.000 description 13
- 230000000875 corresponding effect Effects 0.000 description 13
- 241001647378 Chlamydia psittaci Species 0.000 description 12
- 230000008859 change Effects 0.000 description 11
- 239000012634 fragment Substances 0.000 description 11
- 238000003119 immunoblot Methods 0.000 description 11
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 9
- 238000004458 analytical method Methods 0.000 description 9
- 230000004927 fusion Effects 0.000 description 9
- 210000001519 tissue Anatomy 0.000 description 9
- 239000000020 Nitrocellulose Substances 0.000 description 8
- 229920001220 nitrocellulos Polymers 0.000 description 8
- 238000002965 ELISA Methods 0.000 description 7
- 230000036039 immunity Effects 0.000 description 7
- 239000012528 membrane Substances 0.000 description 7
- 229920001184 polypeptide Polymers 0.000 description 7
- 102000004196 processed proteins & peptides Human genes 0.000 description 7
- 108010005774 beta-Galactosidase Proteins 0.000 description 6
- 238000006062 fragmentation reaction Methods 0.000 description 6
- 210000004379 membrane Anatomy 0.000 description 6
- IXJYMUFPNFFKIB-FMONCPFKSA-N pomp protocol Chemical compound S=C1N=CNC2=C1NC=N2.O=C1C=C[C@]2(C)[C@H]3C(=O)C[C@](C)([C@@](CC4)(O)C(=O)CO)[C@@H]4[C@@H]3CCC2=C1.C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1.C([C@H](C[C@]1(C(=O)OC)C=2C(=C3C([C@]45[C@H]([C@@]([C@H](OC(C)=O)[C@]6(CC)C=CCN([C@H]56)CC4)(O)C(=O)OC)N3C=O)=CC=2)OC)C[C@@](C2)(O)CC)N2CCC2=C1NC1=CC=CC=C21 IXJYMUFPNFFKIB-FMONCPFKSA-N 0.000 description 6
- 101710105759 Major outer membrane porin Proteins 0.000 description 5
- 101710164702 Major outer membrane protein Proteins 0.000 description 5
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 5
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 5
- 238000010367 cloning Methods 0.000 description 5
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 5
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 5
- 230000005847 immunogenicity Effects 0.000 description 5
- 210000004072 lung Anatomy 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 102220004457 rs11567847 Human genes 0.000 description 5
- 108010061238 threonyl-glycine Proteins 0.000 description 5
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 4
- CGUYGMFQZCYJSG-DCAQKATOSA-N Met-Lys-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CGUYGMFQZCYJSG-DCAQKATOSA-N 0.000 description 4
- LCPUWQLULVXROY-RHYQMDGZSA-N Met-Lys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LCPUWQLULVXROY-RHYQMDGZSA-N 0.000 description 4
- 241000699670 Mus sp. Species 0.000 description 4
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 4
- 239000012722 SDS sample buffer Substances 0.000 description 4
- 239000004480 active ingredient Substances 0.000 description 4
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 4
- 102000005936 beta-Galactosidase Human genes 0.000 description 4
- 239000002158 endotoxin Substances 0.000 description 4
- 239000013604 expression vector Substances 0.000 description 4
- 238000013467 fragmentation Methods 0.000 description 4
- 108010078144 glutaminyl-glycine Proteins 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- 210000003000 inclusion body Anatomy 0.000 description 4
- 229920006008 lipopolysaccharide Polymers 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 239000012188 paraffin wax Substances 0.000 description 4
- 239000002245 particle Substances 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 230000004044 response Effects 0.000 description 4
- 238000000926 separation method Methods 0.000 description 4
- 239000000725 suspension Substances 0.000 description 4
- 238000005406 washing Methods 0.000 description 4
- 241000606069 Chlamydiaceae Species 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 3
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 3
- 102100037850 Interferon gamma Human genes 0.000 description 3
- 108010074328 Interferon-gamma Proteins 0.000 description 3
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 3
- 241000880493 Leptailurus serval Species 0.000 description 3
- 241000204031 Mycoplasma Species 0.000 description 3
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- 206010057190 Respiratory tract infections Diseases 0.000 description 3
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 3
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 3
- 230000001154 acute effect Effects 0.000 description 3
- 239000002671 adjuvant Substances 0.000 description 3
- 238000011109 contamination Methods 0.000 description 3
- 238000010790 dilution Methods 0.000 description 3
- 239000012895 dilution Substances 0.000 description 3
- 238000004043 dyeing Methods 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 108020001507 fusion proteins Proteins 0.000 description 3
- 239000000499 gel Substances 0.000 description 3
- 230000003834 intracellular effect Effects 0.000 description 3
- 108010051242 phenylalanylserine Proteins 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 238000010186 staining Methods 0.000 description 3
- 239000013598 vector Substances 0.000 description 3
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 2
- 206010003210 Arteriosclerosis Diseases 0.000 description 2
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 2
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 2
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 2
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 2
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 2
- RRJOQIBQVZDVCW-SRVKXCTJSA-N Cys-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N RRJOQIBQVZDVCW-SRVKXCTJSA-N 0.000 description 2
- 238000011238 DNA vaccination Methods 0.000 description 2
- 208000001860 Eye Infections Diseases 0.000 description 2
- 108010010803 Gelatin Proteins 0.000 description 2
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 2
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 2
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 2
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 2
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 2
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 2
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 2
- 102000013462 Interleukin-12 Human genes 0.000 description 2
- 108010065805 Interleukin-12 Proteins 0.000 description 2
- 102000000588 Interleukin-2 Human genes 0.000 description 2
- 108010002350 Interleukin-2 Proteins 0.000 description 2
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 2
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 2
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 2
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 2
- 102000018697 Membrane Proteins Human genes 0.000 description 2
- HOZNVKDCKZPRER-XUXIUFHCSA-N Met-Lys-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HOZNVKDCKZPRER-XUXIUFHCSA-N 0.000 description 2
- 101150063378 OMP gene Proteins 0.000 description 2
- 208000037581 Persistent Infection Diseases 0.000 description 2
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 2
- 206010036790 Productive cough Diseases 0.000 description 2
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 2
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 2
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 2
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 2
- BQCADISMDOOEFD-UHFFFAOYSA-N Silver Chemical compound [Ag] BQCADISMDOOEFD-UHFFFAOYSA-N 0.000 description 2
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 2
- NBIIPOKZPUGATB-BWBBJGPYSA-N Thr-Ser-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O NBIIPOKZPUGATB-BWBBJGPYSA-N 0.000 description 2
- WMBFONUKQXGLMU-WDSOQIARSA-N Trp-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WMBFONUKQXGLMU-WDSOQIARSA-N 0.000 description 2
- NMOIRIIIUVELLY-WDSOQIARSA-N Trp-Val-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)C(C)C)=CNC2=C1 NMOIRIIIUVELLY-WDSOQIARSA-N 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- GVRKWABULJAONN-VQVTYTSYSA-N Val-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVRKWABULJAONN-VQVTYTSYSA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 230000000890 antigenic effect Effects 0.000 description 2
- 208000011775 arteriosclerosis disease Diseases 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 208000006673 asthma Diseases 0.000 description 2
- AIYUHDOJVYHVIT-UHFFFAOYSA-M caesium chloride Chemical compound [Cl-].[Cs+] AIYUHDOJVYHVIT-UHFFFAOYSA-M 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 238000005859 coupling reaction Methods 0.000 description 2
- YPHMISFOHDHNIV-FSZOTQKASA-N cycloheximide Chemical compound C1[C@@H](C)C[C@H](C)C(=O)[C@@H]1[C@H](O)CC1CC(=O)NC(=O)C1 YPHMISFOHDHNIV-FSZOTQKASA-N 0.000 description 2
- 239000000975 dye Substances 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 208000011323 eye infectious disease Diseases 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 239000008273 gelatin Substances 0.000 description 2
- 229920000159 gelatin Polymers 0.000 description 2
- 235000019322 gelatine Nutrition 0.000 description 2
- 235000011852 gelatine desserts Nutrition 0.000 description 2
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 2
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 238000010166 immunofluorescence Methods 0.000 description 2
- 230000002163 immunogen Effects 0.000 description 2
- 230000008676 import Effects 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 208000030603 inherited susceptibility to asthma Diseases 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 238000011081 inoculation Methods 0.000 description 2
- 238000007918 intramuscular administration Methods 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- WSFSSNUMVMOOMR-NJFSPNSNSA-N methanone Chemical compound O=[14CH2] WSFSSNUMVMOOMR-NJFSPNSNSA-N 0.000 description 2
- 238000011587 new zealand white rabbit Methods 0.000 description 2
- 229910052759 nickel Inorganic materials 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 230000009257 reactivity Effects 0.000 description 2
- 230000008521 reorganization Effects 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 239000012723 sample buffer Substances 0.000 description 2
- 229910052709 silver Inorganic materials 0.000 description 2
- 239000004332 silver Substances 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 238000006557 surface reaction Methods 0.000 description 2
- 229960004799 tryptophan Drugs 0.000 description 2
- 102000003390 tumor necrosis factor Human genes 0.000 description 2
- QFVHZQCOUORWEI-UHFFFAOYSA-N 4-[(4-anilino-5-sulfonaphthalen-1-yl)diazenyl]-5-hydroxynaphthalene-2,7-disulfonic acid Chemical compound C=12C(O)=CC(S(O)(=O)=O)=CC2=CC(S(O)(=O)=O)=CC=1N=NC(C1=CC=CC(=C11)S(O)(=O)=O)=CC=C1NC1=CC=CC=C1 QFVHZQCOUORWEI-UHFFFAOYSA-N 0.000 description 1
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 1
- 101710134784 Agnoprotein Proteins 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 1
- 108010040956 Ala-Asp-Glu-Leu Proteins 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- BDQNLQSWRAPHGU-DLOVCJGASA-N Ala-Phe-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N BDQNLQSWRAPHGU-DLOVCJGASA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- BJFKXBOBGVWFCT-GUBZILKMSA-N Ala-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)C(C)C)C(O)=O BJFKXBOBGVWFCT-GUBZILKMSA-N 0.000 description 1
- 206010002198 Anaphylactic reaction Diseases 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- QEHMMRSQJMOYNO-DCAQKATOSA-N Arg-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QEHMMRSQJMOYNO-DCAQKATOSA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 1
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 1
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- SUEIIIFUBHDCCS-PBCZWWQYSA-N Asn-His-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUEIIIFUBHDCCS-PBCZWWQYSA-N 0.000 description 1
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 1
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 1
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- PUUPMDXIHCOPJU-HJGDQZAQSA-N Asn-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O PUUPMDXIHCOPJU-HJGDQZAQSA-N 0.000 description 1
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 1
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- 241000020089 Atacta Species 0.000 description 1
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 241000606161 Chlamydia Species 0.000 description 1
- 241001674218 Chlamydia pecorum Species 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- NQSUTVRXXBGVDQ-LKXGYXEUSA-N Cys-Asn-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NQSUTVRXXBGVDQ-LKXGYXEUSA-N 0.000 description 1
- IDZDFWJNPOOOHE-KKUMJFAQSA-N Cys-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N IDZDFWJNPOOOHE-KKUMJFAQSA-N 0.000 description 1
- CMYVIUWVYHOLRD-ZLUOBGJFSA-N Cys-Ser-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CMYVIUWVYHOLRD-ZLUOBGJFSA-N 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 229930182566 Gentamicin Natural products 0.000 description 1
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 1
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 1
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 1
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 1
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 1
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- LJXWZPHEMJSNRC-KBPBESRZSA-N Gly-Gln-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LJXWZPHEMJSNRC-KBPBESRZSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 1
- SJLKKOZFHSJJAW-YUMQZZPRSA-N Gly-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN SJLKKOZFHSJJAW-YUMQZZPRSA-N 0.000 description 1
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 1
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- 108010004889 Heat-Shock Proteins Proteins 0.000 description 1
- KNNSUUOHFVVJOP-GUBZILKMSA-N His-Glu-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N KNNSUUOHFVVJOP-GUBZILKMSA-N 0.000 description 1
- GUXQAPACZVVOKX-AVGNSLFASA-N His-Lys-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GUXQAPACZVVOKX-AVGNSLFASA-N 0.000 description 1
- LQGCNWWLGGMTJO-ULQDDVLXSA-N His-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N LQGCNWWLGGMTJO-ULQDDVLXSA-N 0.000 description 1
- 108010033040 Histones Proteins 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 1
- CWJQMCPYXNVMBS-STECZYCISA-N Ile-Arg-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N CWJQMCPYXNVMBS-STECZYCISA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 1
- JLWLMGADIQFKRD-QSFUFRPTSA-N Ile-His-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CN=CN1 JLWLMGADIQFKRD-QSFUFRPTSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 1
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 1
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 1
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- RMJWFINHACYKJI-SIUGBPQLSA-N Ile-Tyr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RMJWFINHACYKJI-SIUGBPQLSA-N 0.000 description 1
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- GOFJOGXGMPHOGL-DCAQKATOSA-N Leu-Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C GOFJOGXGMPHOGL-DCAQKATOSA-N 0.000 description 1
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 1
- 108090001030 Lipoproteins Proteins 0.000 description 1
- 102000004895 Lipoproteins Human genes 0.000 description 1
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 1
- ALSRJRIWBNENFY-DCAQKATOSA-N Lys-Arg-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O ALSRJRIWBNENFY-DCAQKATOSA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 1
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 1
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 1
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 1
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 1
- OBVHKUFUDCPZDW-JYJNAYRXSA-N Met-Arg-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OBVHKUFUDCPZDW-JYJNAYRXSA-N 0.000 description 1
- AHZNUGRZHMZGFL-GUBZILKMSA-N Met-Arg-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCNC(N)=N AHZNUGRZHMZGFL-GUBZILKMSA-N 0.000 description 1
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 1
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 1
- PCTFVQATEGYHJU-FXQIFTODSA-N Met-Ser-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O PCTFVQATEGYHJU-FXQIFTODSA-N 0.000 description 1
- WXJLBSXNUHIGSS-OSUNSFLBSA-N Met-Thr-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WXJLBSXNUHIGSS-OSUNSFLBSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 1
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 1
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 1
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 1
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 1
- KNYPNEYICHHLQL-ACRUOGEOSA-N Phe-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 KNYPNEYICHHLQL-ACRUOGEOSA-N 0.000 description 1
- HBXAOEBRGLCLIW-AVGNSLFASA-N Phe-Ser-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HBXAOEBRGLCLIW-AVGNSLFASA-N 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- AGTHXWTYCLLYMC-FHWLQOOXSA-N Phe-Tyr-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 AGTHXWTYCLLYMC-FHWLQOOXSA-N 0.000 description 1
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 1
- 235000008331 Pinus X rigitaeda Nutrition 0.000 description 1
- 235000011613 Pinus brutia Nutrition 0.000 description 1
- 241000018646 Pinus brutia Species 0.000 description 1
- 206010035673 Pneumonia chlamydial Diseases 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 1
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 1
- SXMSEHDMNIUTSP-DCAQKATOSA-N Pro-Lys-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SXMSEHDMNIUTSP-DCAQKATOSA-N 0.000 description 1
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- BVRBCQBUNGAWFP-KKUMJFAQSA-N Pro-Tyr-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O BVRBCQBUNGAWFP-KKUMJFAQSA-N 0.000 description 1
- 101710124584 Probable DNA-binding protein Proteins 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 206010037151 Psittacosis Diseases 0.000 description 1
- 239000012980 RPMI-1640 medium Substances 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 206010070834 Sensitisation Diseases 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 1
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- XERQKTRGJIKTRB-CIUDSAMLSA-N Ser-His-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CN=CN1 XERQKTRGJIKTRB-CIUDSAMLSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 1
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 1
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 1
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 1
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 1
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 1
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 1
- ZEJBJDHSQPOVJV-UAXMHLISSA-N Thr-Trp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZEJBJDHSQPOVJV-UAXMHLISSA-N 0.000 description 1
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 1
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 1
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- WSGPBCAGEGHKQJ-BBRMVZONSA-N Trp-Gly-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WSGPBCAGEGHKQJ-BBRMVZONSA-N 0.000 description 1
- UQHPXCFAHVTWFU-BVSLBCMMSA-N Trp-Phe-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UQHPXCFAHVTWFU-BVSLBCMMSA-N 0.000 description 1
- 239000006035 Tryptophane Substances 0.000 description 1
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 1
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 1
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 1
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 1
- 206010046306 Upper respiratory tract infection Diseases 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- IBIDRSSEHFLGSD-YUMQZZPRSA-N Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-YUMQZZPRSA-N 0.000 description 1
- NMPXRFYMZDIBRF-ZOBUZTSGSA-N Val-Asn-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NMPXRFYMZDIBRF-ZOBUZTSGSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 1
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 1
- PYXQBKJPHNCTNW-CYDGBPFRSA-N Val-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N PYXQBKJPHNCTNW-CYDGBPFRSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- GJNDXQBALKCYSZ-RYUDHWBXSA-N Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 GJNDXQBALKCYSZ-RYUDHWBXSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 1
- STTYIMSDIYISRG-UHFFFAOYSA-N Valyl-Serine Chemical compound CC(C)C(N)C(=O)NC(CO)C(O)=O STTYIMSDIYISRG-UHFFFAOYSA-N 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010066829 alanyl-glutamyl-aspartylprolyine Proteins 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- KLOHDWPABZXLGI-YWUHCJSESA-M ampicillin sodium Chemical compound [Na+].C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C([O-])=O)(C)C)=CC=CC=C1 KLOHDWPABZXLGI-YWUHCJSESA-M 0.000 description 1
- 230000036783 anaphylactic response Effects 0.000 description 1
- 208000003455 anaphylaxis Diseases 0.000 description 1
- 244000037640 animal pathogen Species 0.000 description 1
- 239000005557 antagonist Substances 0.000 description 1
- 230000005875 antibody response Effects 0.000 description 1
- 101150010487 are gene Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 239000010839 body fluid Substances 0.000 description 1
- 210000000621 bronchi Anatomy 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 239000002775 capsule Substances 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 229910002091 carbon monoxide Inorganic materials 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 210000002421 cell wall Anatomy 0.000 description 1
- 230000030570 cellular localization Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 230000001684 chronic effect Effects 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 108091036078 conserved sequence Proteins 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 239000006059 cover glass Substances 0.000 description 1
- 238000005336 cracking Methods 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 238000013016 damping Methods 0.000 description 1
- 230000009849 deactivation Effects 0.000 description 1
- 239000003405 delayed action preparation Substances 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000012377 drug delivery Methods 0.000 description 1
- 230000005611 electricity Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000004945 emulsification Methods 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 210000000981 epithelium Anatomy 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 108091008053 gene clusters Proteins 0.000 description 1
- 229960002518 gentamicin Drugs 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 235000011187 glycerol Nutrition 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 210000004408 hybridoma Anatomy 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 229940124644 immune regulator Drugs 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 238000012744 immunostaining Methods 0.000 description 1
- 201000001371 inclusion conjunctivitis Diseases 0.000 description 1
- 230000002458 infectious effect Effects 0.000 description 1
- 230000008595 infiltration Effects 0.000 description 1
- 238000001764 infiltration Methods 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 238000010253 intravenous injection Methods 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 238000011005 laboratory method Methods 0.000 description 1
- 230000003902 lesion Effects 0.000 description 1
- 230000029226 lipidation Effects 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 244000000010 microbial pathogen Species 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 210000004400 mucous membrane Anatomy 0.000 description 1
- 210000003097 mucus Anatomy 0.000 description 1
- 210000000107 myocyte Anatomy 0.000 description 1
- 238000007857 nested PCR Methods 0.000 description 1
- IPSIPYMEZZPCPY-UHFFFAOYSA-N new fuchsin Chemical compound [Cl-].C1=CC(=[NH2+])C(C)=CC1=C(C=1C=C(C)C(N)=CC=1)C1=CC=C(N)C(C)=C1 IPSIPYMEZZPCPY-UHFFFAOYSA-N 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 201000000901 ornithosis Diseases 0.000 description 1
- 239000006174 pH buffer Substances 0.000 description 1
- 210000001322 periplasm Anatomy 0.000 description 1
- 239000000546 pharmaceutical excipient Substances 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 239000006187 pill Substances 0.000 description 1
- 210000002381 plasma Anatomy 0.000 description 1
- 239000011505 plaster Substances 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 235000019633 pungent taste Nutrition 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 238000003127 radioimmunoassay Methods 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000000630 rising effect Effects 0.000 description 1
- 239000012266 salt solution Substances 0.000 description 1
- 150000003839 salts Chemical group 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000007789 sealing Methods 0.000 description 1
- 238000004062 sedimentation Methods 0.000 description 1
- 230000008313 sensitization Effects 0.000 description 1
- 230000000405 serological effect Effects 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 210000003802 sputum Anatomy 0.000 description 1
- 208000024794 sputum Diseases 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 230000004936 stimulating effect Effects 0.000 description 1
- 238000010254 subcutaneous injection Methods 0.000 description 1
- 239000007929 subcutaneous injection Substances 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 239000000829 suppository Substances 0.000 description 1
- 239000003826 tablet Substances 0.000 description 1
- 108091035539 telomere Proteins 0.000 description 1
- 102000055501 telomere Human genes 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 206010044325 trachoma Diseases 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- 210000005239 tubule Anatomy 0.000 description 1
- 238000004879 turbidimetry Methods 0.000 description 1
- 108010087967 type I signal peptidase Proteins 0.000 description 1
- 238000002525 ultrasonication Methods 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 238000002255 vaccination Methods 0.000 description 1
- 229940124856 vaccine component Drugs 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 239000000080 wetting agent Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/295—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Chlamydiales (O)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P11/00—Drugs for disorders of the respiratory system
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P37/00—Drugs for immunological or allergic disorders
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/12—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from bacteria
- C07K16/1203—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from bacteria from Gram-negative bacteria
- C07K16/125—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from bacteria from Gram-negative bacteria from Chlamydiales (O)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Medicinal Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biochemistry (AREA)
- Molecular Biology (AREA)
- Genetics & Genomics (AREA)
- Biophysics (AREA)
- Immunology (AREA)
- Animal Behavior & Ethology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Gastroenterology & Hepatology (AREA)
- General Chemical & Material Sciences (AREA)
- Pharmacology & Pharmacy (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Engineering & Computer Science (AREA)
- Pulmonology (AREA)
- Peptides Or Proteins (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Investigating Or Analysing Biological Materials (AREA)
Abstract
本发明涉及对来自人呼吸道病原体肺炎衣原体的、编码大小约为89—101kDa和56—57kDa(较佳的约为89.6—100.3kDa和约为56.1kDa)的表面膜蛋白的基因家族成员的鉴定。本发明涉及新的DNA序列,对应蛋白质的推定的氨基酸序列,以及该DNA序列和蛋白在病理学和流行病学上诊断由肺炎衣原体引起的感染的用途,以及作为疫苗组分的用途。
Description
本发明涉及人呼吸道病原体肺炎衣原体(Chlamydia pneumoniae)的、编码大小约为89-101kDa和56-57kDa(较佳的约为89.6-100.3kDa和约为56.1kDa)的表面(surfaceexposed)膜蛋白的基因家族成员的鉴定。本发明涉及新的DNA序列,其对应蛋白质的推定的氨基酸序列,以及该DNA序列和蛋白在病理学和流行病学上诊断由肺炎衣原体引起的感染和作为疫苗组分的用途。
总体背景
肺炎衣原体是专性胞内细菌(Christiansen和Birkelund(1992);Grayston等人,(1986))。它具有和革兰阴性菌一样的细胞壁结构,具有外膜、周质间隙和胞质膜。用洗涤剂十二烷基肌氨酸钠可以纯化革兰阴性菌的外膜。该组分被命名为“外膜复合物(OMC)”(Caldwell等人,(1981))。肺炎衣原体的COMC(衣原体外膜复合物)含有四组蛋白质:经SDS-PAGE测得的98kDa高分子量蛋白、62/60kDa的富含半胱氨酸的外膜蛋白2(Omp2)的两条条带、38kDa的主要外膜蛋白(MOMP)、以及12kDa的低分子量脂蛋白Omp3。Omp2/Omp3和MOMP蛋白存在于所有衣原体种类的COMC中,已经从沙眼衣原体(C.trachomtis)、鹦鹉热衣原体(C.psittaci)和肺炎衣原体中克隆出这些基因。然而编码肺炎衣原体COMC中98kDa蛋白的基因还未被鉴别或克隆。
肺炎衣原体血清学和鉴定的现有状况
肺炎衣原体是一种专性胞内细菌,属于衣原体属,该衣原体属可以分为四种:沙眼衣原体、肺炎衣原体、鹦鹉热衣原体和C.pecorum。四个种类的共同之处是它们在细胞内的专性生长,且它们具有胞外感染性颗粒(原体,EB)和胞内复制形式(网状体,RB)的双相生命周期。另外,衣原体种类的特征是在人类感染中高度免疫原性的共同脂多糖(LPS)表位。沙眼衣原体引起人眼感染(沙眼)和生殖感染。鹦鹉热衣原体是一类可变的动物病原体,其中的禽株偶尔会感染人并引起严重的肺炎鸟疫。第一个肺炎衣原体分离物从眼感染获得,但是它被归类为非典型(non-typable)的衣原体。在芬兰肺炎流行性爆发时,发现患者在衣原体属特异性测试(淋巴肉芽肿皮肤试验抗原测试)中有阳性反应,且患者表现非典型的衣原体分离物的滴度升高。在西雅图上呼吸道感染爆发时获得了类似的分离物,该衣原体分离物归类为一种新的种类肺炎衣原体(Grayston等人,1989)。另外,有人指出肺炎衣原体涉及到粥样动脉硬化病变发展和刺激性支气管哮喘(Kuo等人,1995)。这两种病症被认为是由于慢性感染或过敏反应或两者共同引起的。
肺炎衣原体感染的诊断
诊断肺炎衣原体引起的急性呼吸道感染是困难的。对患者样品的肺炎衣原体培养是不敏感的,甚至是选择合适的组织培养细胞未分离时也如此。因此Campbell等人(1992)开发出一种肺炎衣原体特异性的聚合酶链反应(PCR)。
尽管在一些研究中已经用该PCR方法检测了肺炎衣原体,但是该方法是否适用于所有的临床场合仍有争论。其原因是,急性呼吸道感染中携带肺炎衣原体的细胞还未确定,且预计会有慢性携带状态,只是不知道它们存在于哪些器官和细胞中。另外,PCR测试很难进行,因为这些细菌的产量很低,且患者样品中存在抑制性的物质。因此,开发出敏感的、特异性的血清型诊断方法来检测急性和慢性感染是将会有很大价值。衣原体感染的血清学诊断方法目前是基于属特异性的测试(如淋巴肉芽肿皮肤试验抗原测试和ELISA(测定LPS的抗体)),或是更特异性的种属测试(用微量免疫荧光法(Micro-IF)测定针对纯化的EB的抗体)(Wang等人,1970)。然而,微量免疫荧光法要用显微镜来读数,且为了确保有正确的读数,必须将结果与用作抗原(由于与共有的LPS表位的交叉反应抗体)的沙眼衣原体的结果比较。因此,正如Kuo等人(1995)年所表述的,“本领域中主要需要一种用于临床实验室的测试感染的迅速、可靠实验室方法”,本领域迫切需要开发出用于针对肺炎衣原体的种特异性诊断方法。另外,肺炎衣原体可能涉及粥样动脉硬化和支气管哮喘显然也使得开发有效的疫苗有正当的理由。
发明详细公开
本发明的目的是提供有效诊断肺炎衣原体感染的方法以及开发出有效抵抗该微生物感染的疫苗。因此,本发明涉及种特异性的哺乳动物(如人)体内肺炎衣原体感染的诊断测试方法,所述测试是根据对抗体进行检测,该抗体针对大小约为89-101kDa和56-57kDa(较佳的约为89.6-100.3kDa和56.1kDa)(推导的氨基酸序列的大小范围为100.3至89.6,除大小为56.1的Omp13外)的表面膜蛋白,或是基于对编码该蛋白或其变体或亚序列的核酸片段进行检测。本发明还涉及本发明蛋白的氨基酸序列,其变体和亚序列,以及编码这些蛋白或其变体或亚序列的核酸片段。本发明还涉及针对本发明蛋白的抗体。本发明还涉及本发明的核酸片段和蛋白在诊断肺炎衣原体以及抗肺炎衣原体疫苗中的用途。
在本发明公开前,只有很少的肺炎衣原体基因被测序。它们主要是编码已知的沙眼衣原体同系物(MOMP、Omp2、Omp3、Kdo转移酶)的基因、热休克蛋白基因GroEl/Es和Dnak(一种核酸酶P同系物)和编码功能未知的76kDa蛋白的基因。为何至今只有如此少的基因被克隆的原因是因为从宿主细胞纯化后能获得的肺炎衣原体的产量非常低。在这种纯化后,DNA必须从EB中纯化出,而在此步骤肺炎衣原体DNA容易被宿主细胞DNA污染。除了这些固有的困难外,培育肺炎衣原体并用DNA技术产生很少数量(几个μ)DNA的表达文库非常困难。自1993年起已经知道(Melgosa等人,1993),98kDa的蛋白存在于肺炎衣原体的OMC中。尽管Melgosa提到98kDa的蛋白条带是肺炎衣原体OMC的一部分,但是其基因序列以及推导的氨基酸序列一直未被确定。该文中只描述了用SDS-PAGE大致分离出的肺炎衣原体蛋白的条带。然而,在本发明前,编码该蛋白的基因一直未被确定。发现98kDa蛋白与人血清只有很弱的反应或没有反应(Campbell等人,1990),在本发明者的工作之前,还没有认识到89-101kDa蛋白是表面外露的或它们实际上有免疫原性。在该报道中描述了许多人血清样品能与SDS-PAGE中迁移至98kDa的肺炎衣原体蛋白反应。该蛋白没有进一步作特性分析,因此它与本申请没有冲突。
Halme等人1997年描述了92-98kDa肺炎衣原体蛋白中存在人T细胞表位。从总衣原体蛋白的SDS-PAGE中洗脱出该蛋白,但是该蛋白的身份没有被确定。
用抗体来筛选表达文库是克隆编码蛋白的抗原性部分基因片段众所周知的方法。然而,由于患者的血清没有表现出与98kDa蛋白有明显的反应,因此不可能用患者血清来克隆该蛋白。
已知由发明者获得的单克隆抗体能与肺炎衣原体表面上的构象表位反应,它们还与肺炎衣原体OMC反应(经免疫电子显微镜测得)(Christiansen等人,1994)。另外,98kDa蛋白是肺炎衣原体OMC中唯一未知的蛋白(Melgosa等人,1993)。本发明者选择采用非常规的步骤来克隆编码至今未知的98kDa蛋白的基因:纯化肺炎衣原体OMC,在免疫接种前用SDS处理抗原破坏免疫原性构象表位。从而获得对免疫原性较低的线性表位的抗体(PAB150)。这提供了获得能检测该蛋白的抗血清的可能性,并表明编码本发明的89-101kDa和56kDa蛋白的基因家族能在重组体大肠杆菌的菌落印迹中检测到。
肺炎衣原体感染的小鼠产生了针对发明者鉴定并命名为Omp4-15的抗体,但是不识别通常用于SDS-PAGE和免疫印迹的经SDS处理热变性的抗原。但是,如果抗原没有热变性,则可见到强反应。因此,如果在人感染中见到类似的反应,则本发明的抗原很有可能在血清诊断测试中具有不可估量的用途,并且很可能用作预防感染的疫苗。
通过产生针对肺炎衣原体COMC的抗体,获得与所有蛋白反应的多克隆抗体(PAB150)。用该抗体鉴别肺炎衣原体DNA表达文库中编码89.6-101.3kDa和56.1kDa的基因。与本发明有关的一个问题是在肺炎衣原体中发现了包含许多类似基因的一个家族。因此,需要大量不同的克隆来鉴别诸片段簇。本发明者成功克隆并测序以上四个基因只是因为用SDS-变性抗原产生的家兔抗体含有针对位于该蛋白家族不同成员中的许多不同表位的抗体。通过该方法,一个基因已被完全测序,第二个基因除远部外已测序,并获得了两个附加基因的更短的片段。为了获得附加基因的DNA序列并搜寻该基因家族的更多成员,用测序基因衍生的引物和数据库中已经公开基因的引物进行长范围(long range)PCR。该方法检测到属于该家族的另外8个基因。这些基因位于两个基因簇中:Omp12,11,10,5,4,13和14在一个簇中,Omp6,7,8,9和15在第二个簇中。从Omp4,5,6,7,8,9,10,11和13获得全序列,并获得Omp12,14部分序列。Omp13是1545核苷酸的截短的基因,全长基因的其余部分为2526(Omp7)至2838(Omp15)的核苷酸。推导的氨基酸序列揭示,推定的多肽为89.6至100.3kDa,除Omp13为56.1kDa外。与所有序列相比,推导的氨基酸序列的排列表现出最大相同性为49%(Omp5/Omp9)。除了Omp13外,最低的同源性是Omp7与其它任何氨基酸序列的相同性不到34%。Omp13与所有其它序列的同源性评分为29-32%。
在本文中,SEQ ID NO:1和2对应于Omp4,SEQ ID NO:3和4对应于Omp5,SEQ ID NO:5和6对应于Omp6,SEQ ID NO:7和8对应于Omp7,SEQ ID NO:9和10对应于Omp8,SEQ ID NO:11和12对应于Omp9,SEQ ID NO:13和14对应于Omp10,SEQ ID NO:15和16对应于Omp11,SEQ ID NO:17和18对应于Omp12,SEQ ID NO:19和20对应于Omp13,SEQ ID NO:21和22对应于Omp14,SEQ ID NO:23和24对应于Omp15。
下面列出了本发明的Omp蛋白估计的大小。Omp4的大小为98.9kDa,Omp5的估计大小为97.2kDa,Omp6的估计大小为100.3kDa,Omp7的估计大小为89.7kDa,Omp8的估计大小为90.0kDa,Omp9的估计大小为96.7kDa,Omp10的估计大小为98.4kDa,Omp11的估计大小为97.6kDa,Omp13的估计大小为56.1kDa,Omp12和Omp14为一部分。
另外,SEQ ID NO:25是SEQ ID NO:3的亚序列,SEQ ID NO:26是SEQ ID NO:4的亚序列,SEQ ID NO:27是SEQ ID NO:5的亚序列,SEQ ID NO:28是SEQ ID NO:6的亚序列,SEQ ID NO:29是SEQ ID NO:7的亚序列,SEQ ID NO:30是SEQ ID NO:8的亚序列。
部分omp蛋白以融合蛋白形式表达并产生了针对这些蛋白的小鼠多克隆单特异性抗体。这些抗体在免疫荧光和免疫电镜法中与肺炎衣原体的表面反应。这第一次表明肺炎衣原体中89-101kDa和56-57kDa蛋白家族包含表面的外膜蛋白。这一重要发现导致这样一个认识,即89-101kDa和56-57kDa肺炎衣原体蛋白家族成员是开发血清学诊断测试肺炎衣原体、以及开发抵抗肺炎衣原体感染的疫苗(根据采用这些蛋白)的优秀的候选物。另外,这些蛋白可用作流行病学标记,针对这些蛋白的多克隆单特异性血清可用来检测人组织中的肺炎衣原体或用来检测组织培养物中的肺炎衣原体分离物。另外,编码89-101kDa和56-57kDa(如89.6-100.3kDa和56.1蛋白家族)的基因可用来开发基于核酸检测/扩增的种特异性诊断测试。
将全长Omp14克隆到允许Omp4多肽表达的表达载体系统中。用该多肽作为抗原来免疫家兔。由于该蛋白在变性条件下纯化,因此其抗体不与肺炎衣原体天然表面反应,但是在采用纯化的肺炎衣原体EB作为抗原的免疫印迹中,它和98kDa蛋白反应。另外,抗体在试验性感染小鼠的石蜡包埋肺组织切片中反应。
本发明较宽的方面涉及种特异性地诊断测试肺炎衣原体感染哺乳动物(如人)的方法,所述测试方法包括检测患者体内或较佳的是患者样品中是否存在针对分子量为89-101kDa或56-57kDa的肺炎衣原体外膜蛋白的抗体,或检测是否存在编码所述外膜蛋白或其片段的核酸片段。
在本申请的内容中,术语“患者样品”应指患者(如人患者)的一定量的血清、血浆、粘液、组织、咳出物、强迫咳出的痰液或支气管吸出物、尿液、脑脊髓液、粥样动脉硬化病灶、粘膜拭子(mucosal swap)、来自所述患者的组织培养的细胞、或以任何方式源自所述患者的物质。本发明的人体内测试包括本领域中已知的皮肤测试,如真皮内测试,例如类似于Mantaux测试。在对于测试非常敏感的某些患者(通常情况是儿童)体内,测试可以是非侵入型的,例如用膏药在皮肤表面测试。
在本文中,术语89-101kDa蛋白指通常在肺炎衣原体外膜中的蛋白,其在SDS-PAGE中所见的一条或多条条带,表观分子量基本上在89-101kDa范围内。根据推导的氨基酸序列,分子大小在89.6-100.3kDa之间不等。
基于利用属于本申请书公开的基因家族的基因的种特异性血清诊断测试在本发明的范围内。
本发明的较佳实例涉及本发明的种特异性诊断测试,其中外膜蛋白具有选自SEQID NO:2、SEQ ID NO:4、SEQ ID NO:6、SEQ ID NO:8、SEQ ID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18、SEQ ID NO:20、SEQ ID NO:22和SEQ ID NO:24的序列。
当采用本发明的蛋白时,术语“变体”应理解成表现出与本发明的某一种蛋白的相似性低于100%的氨基酸序列。变体序列的大小可以和与之比较的序列相同或不同。变体通常宜表现出序列相似性至少为50%,较佳的至少60%,更佳的至少70%,例如至少80%、90%、95%或98%。
关于本发明蛋白序列的术语“序列相似性”指本发明蛋白和长度相同或不同的对比蛋白中相同的和保守性变化的氨基酸残基(在位置和类型上)的百分数。关于本发明蛋白序列的术语“序列相同性”指本发明蛋白和长度相同或不同的对比蛋白中在位置和类型上相同的氨基酸的百分数。
在本发明的范围内,本发明的一种蛋白的亚序列是指取自SEQ ID NO:2、SEQ IDNO:4、SEQ ID NO:6、SEQ ID NO:8、SEQ ID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18、SEQ ID NO:20、SEQ ID NO:22或SEQ ID NO:24的氨基酸残基的连续序列。一个亚序列通常包含至少100个氨基酸,较佳的至少80个氨基酸,更佳的至少70个氨基酸,如50个氨基酸。它甚至可以短达10-50个氨基酸,例如20-40个氨基酸,例如约30个氨基酸。亚序列通常表现出的序列同源性至少为50%,较佳的至少为60%,更佳的至少70%,例如至少80%,如至少90%,95%或98%。
本发明的诊断测试所包括的免疫试验选自直接或间接的EIA(如ELISA)、免疫印迹技术(如Western印迹)、放射免疫试验和其它任何非酶联抗体结合试验或诸如荧光、凝集或沉淀反应的程序,以及比浊法。
本发明的一个较佳实例涉及本发明的种特异性诊断测试,所述测试包括ELISA,其中检测样品中针对本发明蛋白或其片段的抗体。
本发明的一个较佳的实例是以检测样品中针对本发明蛋白的抗体为基础的ELISA。该ELISA可采用本发明的蛋白或其变体(即抗原)作为包被试剂。ELISA通常根据本领域熟知的标准方法(例如David Lane Harlow编辑的“抗体实验手册”(ColdSpring Habor laboratories,1988)中所述的方法,该文纳入本文作参考)来进行。
重组蛋白可用按照下文实施例中所述的方法获得的DNA序列来生产。这些DNA序列(包括本发明基因家族中每个基因的整个编码区)将被克隆到表达载体中,从该表达载体中可以纯化获得推导的蛋白序列。在ELISA中用单克隆和多克隆抗体以及实验性感染的小鼠的血清和人患者的血清分析纯化获得的蛋白的反应性。
从实验性感染的小鼠血清的反应性知道,它主要识别的是非线性表位。因此,可以考虑用本领域中已知的不同形式的纯化方案来分析不连续表位的存在,和分析人免疫应答是否也针对这类表位。
本发明的较佳实例涉及本发明的种特异性诊断测试,其中核酸片段具有选自SEQID NO:1、SEQ ID NO:3、SEQ ID NO:5、SEQ ID NO:7、SEQ ID NO:9、SEQ ID NO:11、SEQ ID NO:13、SEQ ID NO:15、SEQ ID NO:17、SEQ ID NO:19、SEQ ID NO:21和SEQ ID NO:23的序列。
关于本发明的核酸片段,术语“变体”应理解成表现出的序列同源性低于100%的核酸序列。变体序列的大小可以和与之比较的序列相同或不同。变体通常宜表现出序列同源性至少为50%,较佳的至少60%,更佳的至少70%,例如至少80%、90%、95%或98%。
关于本发明的核酸片段,术语“序列同源性”指本发明的核酸片段中的核酸在位置和类型上与长度相同或不同的序列对比时相匹配核酸百分数。
为了获得关于本发明每个基因总体分布的信息,对于所有可获得的肺炎衣原体分离物上的每个基因进行PCR。这将提供关于本发明的基因或核酸片段的总体变化性的信息。对可变区测序。可用PCR从患者样品扩增出流行病学基因的可变部分。不变的部分可用来PCR扩增和分析用于诊断测试的可能性。预计,如果发现可变性,则可变区的PCR可用于流行病学。不变区的PCR可用作种特异性诊断测试。用编码所有已知分离物中不变蛋白的基因作为PCR的靶来制备编码功能未知蛋白的基因。
本发明的特别佳的实例涉及本发明的诊断测试,其中利用核酸扩增、较佳的是聚合酶链反应(PCR)来检测核酸片段。
以PCR为基础的测试在本发明的范围内,该测试的目的是检测本发明的核酸片段或其变体。PCR测试通常根据本领域熟知的方法来进行,其通常包括能检测和区别本发明的核酸片段的PCR测试。较佳的是定量的竞争性PCR测试或嵌套式PCR测试。本发明的PCR测试通常根据EPB540588、EPA586112、EPA643140或EPA669401中详细描述的方法来设计,这些专利均纳入本文作参考。
在本发明的范围内,本发明的一种核酸片段的变体和亚序列指取自SEQ ID NO:1、SEQ ID NO:3、SEQ ID NO:5、SEQ ID NO:7、SEQ ID NO:9、SEQ ID NO:11、SEQ ID NO:13、SEQ ID NO:15、SEQ ID NO:17、SEQ ID NO:19、SEQ ID NO:21和SEQ ID NO:23的连续延伸核酸。变体或亚序列宜包含至少100个核酸,较佳的至少80个核酸,更佳的至少70个核酸,例如至少50个核酸。它甚至可以短达10-50个核酸,例如20-40个核酸,例如约30个核酸。亚序列通常表现出至少30%的序列同源性,较佳的为至少60%,更佳的为至少70%,例如至少80%,如至少90%、95%或98%。亚序列越短,所需的同源性越高。因此,100个核酸或更少的亚序列必须表现出至少80%的同源性。
本发明的一个非常重要的方面涉及本发明衍生自肺炎衣原体的蛋白,该蛋白具有的氨基酸序列选自SEQ ID NO:2、SEQ ID NO:4、SEQ ID NO:6、SEQ IDNO:8、SEQ ID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18、SEQ ID NO:20、SEQ ID NO:22和SEQ ID NO:24,具有的序列相似性至少为50%,较佳的至少60%,更佳的至少70%,例如至少80%,如至少90%、95%或98%,并具有类似的生物功能。
术语“类似的生物功能”指该蛋白表现出类似于可从肺炎衣原体膜蛋白衍生获得的蛋白的性质特征。这些蛋白包括GGAI的重复基序(至少2个,较佳的至少3个重复序列)和/或保守的色氨酸(w)位置。
比较编码Omp4-15的基因的DNA序列,结果表明,单个基因之间的总体相似性范围在43-55%之间。比较Omp4-15的氨基酸序列结果表明相同性为34-49%,相似性为53-64%。同源性一般沿推导的氨基酸整个长度分散散布。然而,从图8A-J看出,存在一些区域其同源性更明确。这见于重复序列中,其中序列GGAI在基因中重复4-7次。令人感兴趣的是,对于编码四个氨基酸GGAI的序列,DNA同源性并不保守。这可能表明了蛋白该部分的功能性作用,并表明该重复结构并不是由基因复制而产生的。除了四个氨基酸的重复序列GGAI外,氨基酸400-490的区域所具有的同源性要高于该蛋白的其它部分,在所有序列中均有保守序列FYDPI。功能上相似性进一步表明,氨基酸色氨酸(W)在该蛋白C端部分4-6个局部区域完全保守。
由于本发明的基因和推导的氨基酸序列没有一个相同,因此下列内容在本发明的范围内:单特异性抗体的生产,用所述抗体来鉴别哪个肺炎衣原体蛋白被表达,用所述抗体鉴别所述肺炎衣原体蛋白在发育生命周期的何时被表达,用所述抗体鉴定所述肺炎衣原体蛋白的精确的细胞定位。用针对本发明蛋白的单特异性抗体来测定所述蛋白的哪个部分是外露的,肺炎衣原体COMC中的蛋白是怎样相互作用的,这些也在本发明的范围内。
本发明较佳的实例涉及包含本发明蛋白质亚序列的多肽,所述亚序列包含序列GGAI。本发明其它较佳的实例涉及包含本发明蛋白亚序列的多肽,所述亚序列包含序列FSGE。
本发明的多肽的长度通常为至少6个氨基酸,较佳的至少15个氨基酸,较佳的至少20个氨基酸,较佳的至少25个氨基酸,较佳的至少30个氨基酸,较佳的至少35个氨基酸,较佳的至少40个氨基酸,较佳的至少45个氨基酸,较佳的至少50个氨基酸,较佳的至少55个氨基酸,较佳的至少100个氨基酸。
本发明非常重要的一方面涉及本发明衍生自肺炎衣原体的核酸片段,其变体和亚序列。
本发明另一个重要的方面涉及针对本发明蛋白的抗体,这些抗体包括针对具有选自序列SEQ ID NO:2、SEQ ID NO:4、SEQ ID NO:6、SEQ ID NO:8、SEQ ID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18、SEQ ID NO:20、SEQ ID NO:22和SEQ ID NO:24的蛋白的多克隆单特异性抗体和单克隆抗体。
本发明的一个非常重要的方面涉及用来诊断哺乳动物(如人)感染肺炎衣原体的诊断试剂盒,所述试剂盒包含的一种或多种蛋白具有选自SEQ ID NO:2、SEQ ID NO:4、SEQ ID NO:6、SEQ ID NO:8、SEQ ID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18、SEQ ID NO:20、SEQ ID NO:22和SEQ ID NO:24的氨基酸序列。
本发明另一非常重要的方面涉及用来诊断哺乳动物(如人)感染肺炎衣原体的诊断试剂盒,所述试剂盒包含针对具有选自SEQ ID NO:2、SEQ ID NO:4、SEQ ID NO:6、SEQ ID NO:8、SEQ ID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18、SEQ ID NO:20、SEQ ID NO:22和SEQ ID NO:24的氨基酸序列的蛋白的抗体。本发明的诊断试剂盒中包括的抗体可以是多克隆抗体或单克隆抗体或其混合物。
本发明还有一个重要的方面涉及用来诊断哺乳动物(如人)感染肺炎衣原体的诊断试剂盒,所述试剂盒包含具有选自SEQ ID NO:1、SEQ ID NO:3、SEQ ID NO:5、SEQ ID NO:7、SEQ ID NO:9、SEQ ID NO:11、SEQ ID NO:13、SEQ ID NO:15、SEQID NO:17、SEQ ID NO:19、SEQ ID NO:21和SEQ ID NO:23的序列的一种或多种核酸片段。
本发明一方面涉及一种用于免疫接种哺乳动物(如人)抵抗肺炎衣原体的组合物,所述组合物包含具有选自SEQ ID NO:2、SEQ ID NO:4、SEQ ID NO:6、SEQ IDNO:8、SEQ ID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18、SEQ ID NO:20、SEQ ID NO:22和SEQ ID NO:24的氨基酸序列的一种或多种蛋白质。
预计本发明的蛋白在预防哺乳动物(如人)感染肺炎衣原体上有重要作用。因此,通常用重组技术产生本发明的蛋白(包括其变体和亚序列),然后用作免疫接种哺乳动物(如家兔)的抗原。随后,用组织培养试验分析免疫获得的超级免疫血清抵抗肺炎衣原体的保护作用。另外,可以考虑生产单克隆抗体(通常用标准的杂交瘤技术),并分析其抵抗肺炎衣原体感染的保护作用。
预计在本发明的蛋白(包括所述蛋白的亚序列)中可以找到特别感兴趣的和免疫原性表位。较佳的是用包含本发明蛋白的这些亚序列的多肽来免疫哺乳动物(如人)以抵抗肺炎衣原体。
本发明的一个重要的方面涉及下述蛋白在诊断哺乳动物(如人)受肺炎衣原体感染中的应用,该蛋白具有的序列选自SEQ ID NO:2、SEQ ID NO:4、SEQ ID NO:6、SEQ ID NO:8、SEQ ID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18、SEQ ID NO:20、SEQ ID NO:22和SEQ ID NO:24。
本发明的一个较佳的实例涉及非变性形式的本发明蛋白在诊断哺乳动物(如人)受肺炎衣原体感染中的应用。
本发明的一个非常重要的方面涉及下述蛋白在免疫哺乳动物(如人)抵抗肺炎衣原体中的应用,该蛋白具有的序列选自SEQ ID NO:2、SEQ ID NO:4、SEQ ID NO:6、SEQ ID NO:8、SEQ ID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18、SEQ ID NO:20、SEQ ID NO:22和SEQ ID NO:24。
本发明的一个较佳的实例涉及非变性形式的本发明蛋白在免疫哺乳动物(如人)抵抗肺炎衣原体中的应用。
本发明的一个非常重要的方面涉及下述核酸片段在免疫哺乳动物(如人)抵抗肺炎衣原体中的应用,该核酸片段具有的核苷酸序列选自SEQ ID NO:1、SEQ ID NO:3、SEQ ID NO:5、SEQ ID NO:7、SEQ ID NO:9、SEQ ID NO:11、SEQ ID NO:13、SEQ ID NO:15、SEQ ID NO:17、SEQ ID NO:19、SEQ ID NO:21和SEQ ID NO:23。
预计可以开发出一类用基因枪接种小鼠抗肺炎衣原体的疫苗。通常,含有本发明的核酸片段、核酸片段组合的不同基因构建物可用于基因枪方法。随后分析小鼠产生的体液和细胞免疫应答以及在攻击后其抵抗肺炎衣原体感染的保护作用。
鉴于此,本发明还涉及本发明蛋白作为药物(疫苗)的用途以及用来制备抵抗肺炎衣原体感染的疫苗的用途。
含有蛋白序列作为活性组分的疫苗的制备方法是本领域中所熟知的,例子有美国专利4,608,251;4,601,903;4,599,231;4,599,230;4,596,792和4,578,770,所有这些文献均纳入本文作参考。通常,这些疫苗被制成液体溶液或悬浮液形式的可注射剂;还可制成适合在注射液体前溶于溶液或悬浮液的固体形式。制剂还可乳化。活性免疫组分通常和药学上可接受的并与活性组分相容的赋形剂混合。合适的赋形剂例如是水、盐水、葡萄糖、甘油、乙醇等以及它们的组合。另外,如果需要,疫苗还可含有少量辅助物质如润湿剂或乳化剂、pH缓冲剂、或增强疫苗效果的佐剂。
疫苗通常是经肠胃外注射给予,例如皮下注射或肌内注射。适用于其它给药方式的其它制剂包括栓剂以及在一些场合下的口服制剂。这些组合物的形式是溶液、悬浮液、片剂、丸剂、胶囊剂、缓释制剂或粉末,含有10-95%(较佳的为25-70%)的活性组分,并任选地含有合适的载体。
蛋白序列可以本领域已知的中性或盐形式配制入疫苗内。疫苗以相容于剂型的方式给予,其量在治疗上是有效的并且有免疫原性。给药量取决于待治疗的对象。合适的剂量范围为每次接种约几百微克活性组分,较佳的范围约为0.1-1000微克。如果疫苗还包含本领域已知的佐剂物质,则可增强免疫应答。其它可能性包括采用免疫调节物质如淋巴因子(如IFN-γ、IL-2和IL-12)或合成的IFN-γ诱导剂如与上述佐剂组合的聚I∶C。
通过下述方式还可产生活疫苗:将编码本发明的蛋白片段或蛋白的至少一个核酸片段导入非病原性微生物中,使蛋白片段或蛋白在微生物表面表达(例如包括膜锚着部分的融合蛋白形式或携带允许锚着到膜上的脂质化(lipidation)信号的略经修饰的蛋白或蛋白片段形式)。技术人员知道怎样使相关表达系统适合此目的。
本发明另一部分依据的事实是,最近的研究揭示,克隆到在载体中的在真核细胞内的非复制性的DNA片段可通过(例如)肌内注射或经皮肤给药(所谓的“基因枪”方法)导入动物(包括人)体内。DNA被肌细胞摄取,感兴趣的基因被在真核细胞内起作用的启动子(例如病毒启动子)表达,随后基因产物刺激免疫系统。Ulmer等人在1993年回顾了这些新发现的方法,该文纳入本文作参考。
因此,编码本发明蛋白的核酸片段可用来实现体内表达抗原,即核酸片段可用作所谓的DNA疫苗。因此,本发明还涉及一种疫苗,它包含编码本发明的蛋白片段或蛋白的核酸片段,该疫苗使已给予疫苗的哺乳动物(如人)在体内表达抗原,表达的抗原量能有效地使哺乳动物(如人)对体内肺炎衣原体感染的抵抗力大大增加。
通过将编码表达产物的基因和编码能调节免疫应答的蛋白的DNA片段一起给予,可以增强这种“DNA疫苗”的效率。例如,通过给予两种分开的DNA片段或给予包含在同一载体中的两种DNA片段,编码淋巴因子前体或淋巴因子(例如IFN-γ、IL-2或IL-12)的基因可以和编码免疫原性蛋白片段或蛋白的基因一同给予。还有一个可能性是给予包含多个核苷酸序列的DNA片段,其中每个核苷酸序列编码本文公开的蛋白片段和蛋白的相对表位,用这些表位的宽谱来实现连续致敏免疫应答。
下列实验为非限制性实施例,目的是为了描述本发明的某些特征和实例。
附图图例
图1显示了负染色的纯化肺炎衣原体EB(A)和纯化的OMC(B)的电镜结果。
图2显示了纯化的EB和OMC银染的15%SDS-PAGE结果。泳道1,纯化的肺炎衣原体EB;泳道2,肺炎衣原体OMC;泳道3,纯化的沙眼衣原体EB;和泳道4,沙眼衣原体OMC。
图3显示了经10%SDS-PAGE分离、转移到硝酸纤维素膜并和家兔抗肺炎衣原体OME血清反应的免疫印迹。
图4显示了经家兔抗肺炎衣原体血清检测的重组pEX的考马斯蓝染色7.5%SDS-PAGE结果。箭头表示117kDaβ半乳糖苷酶蛋白的定位。
图5显示了经菌落印迹检测、7.5%SDS-PAGE分离、转移到硝酸纤维素膜上并和家兔抗肺炎衣原体OMC反应的重组体pEX克隆的免疫印迹。泳道1,海蓝色(seablue)分子量标准。泳道2-6,于42℃培育以诱导产生β半乳糖苷酶融合蛋白的pEX克隆。
图6显示了Omp4和Omp5的序列策略。箭头表示用来测序的引物。
图7。肺炎衣原体omp基因。基因排列在两个簇中。在第1簇中发现Omp12、11、10、5、4、13和14。在第2簇中发现Omp6、7、8、9和15。
图8A-J显示了肺炎衣原体4-15的序列对比,采用GCG软件包中的pile up程序。
图9显示了肺炎衣原体感染的HeLa细胞感染后72小时与针对pEX3-36融合蛋白的小鼠单特异性抗血清反应的免疫荧光。pEX3-36是Omp5基因的一部分。
图10显示了肺炎衣原体EB的免疫印迹。泳道1-3,在SDS-样品缓冲液中加热至100℃,泳道4-6未加热。泳道1和家兔抗肺炎衣原体OMC反应;泳道2和4是前血清(pre-serum);泳道3和5是多克隆家兔抗pEX1-1融合蛋白;泳道6是单克隆抗体26.1。
图11显示了肺炎衣原体的免疫印迹。泳道1-4,在SDS-缓冲液中加热至100℃,泳道5-6未加热。与107CFU的肺炎衣原体感染后14天的C57-黑色小鼠的血清反应。泳道1和5,小鼠1;泳道2和6小鼠2;泳道3和5,小鼠3;泳道4和8,小鼠4。
图12显示了小鼠肺组织的免疫组织学分析,肺炎包涵体存在于支气管上皮组织和肺胞壁中(箭头所指处)。
实施例1
编码98/95 kDa肺炎衣原体COMC蛋白的基因的克隆
肺炎衣原体EB和COMC的纯化
在HeLa细胞中培育肺炎衣原体。培育按照Miyashita和Matsumoto(1992)的说明书进行,只是略作修改,以100,000Xg离心上清液和后一次沉淀以及混浊的底层。通过1000xg离心30分钟,使微生物附着于HeLa细胞,然后在5%CO2气氛下在含有5%胎牛血清(FCS,Gibco BRL,Germany目录号10106.169)庆大霉素的RPMI1640培养基(Gibco BRL,Germany目录号51 800-27)中37℃培育2小时。将培养基换成另外还含有1毫克/毫升放线菌酮的培养基。培育48小时后,从培养中取出盖玻片,用对肺炎衣原体有特异性的抗体(单克隆抗体26.1)(Christiansen等人,1994)和对种沙眼衣原体有特异性的单克隆抗体(单克隆抗体32.3,Loke diagnostics,ArhusDenmark)测试包涵体,以确保没有发生沙眼衣原体污染。通过Hoechst染色并在BEa和BEg培养基(Freund等人,1979)中培养来测试HeLa细胞中有无支原体污染。另外,通过在BEa和BEg培养基中培育来测试肺炎衣原体原种中有无支原体污染。在培养物或细胞中没有检测到沙眼衣原体、支原体或细菌污染。感染后72小时,以PBS洗涤单层,在PBS中用橡胶刮棒打松细胞,用超声法从宿主细胞中释放出衣原体。在不连续密度梯度上纯化肺炎衣原体EB和RB(Miyashita等人,1992)。用负染色和电子显微镜确认衣原体EB的纯度(图1),只检测到符合肺炎衣原体EB结构的大小为0.3-0.5mm的颗粒。如Caldwell等人(1981)中所述的那样,对纯化的衣原体EB进行十二烷基肌氨酸钠提取,改动之处在于用简单的超声破碎来悬浮COMC。用电子镜和负染色测试纯化的COMC(图1),从中看到折叠的外膜复合物。
SDS-PAGE分析纯化的EB和COMC
在15%SDS-聚丙烯酰胺凝胶上分离纯化的EB和肺炎衣原体OMC中的蛋白,对凝胶银染(图2),在泳道1中看到纯化的EB含有100/95 kDa的主要蛋白和38kDa的蛋白,在纯化的COMC(泳道2)中这两种蛋白组也是主要的。另外,分子量为62/60kDa、55kDa和12kDa的蛋白已经在COMC制备物中增浓。在比较纯化的肺炎衣原体EB和纯化的沙眼衣原体EB(泳道3)时,看到沙眼衣原体EB中主要的蛋白是主要外膜蛋白(MOMP),它也是沙眼衣原体COMC制备物(泳道4)中的主要条带,在该制备物中看到60/62kDa的Omp2和12kDa的Omp3。然而没有检测到如同肺炎衣原体COMC制备物的大小为100/95kDa的主要条带。
抗肺炎衣原体COMC的家兔多克隆抗体的产生
为了确保产生在免疫印迹和菌落印迹中识别所有肺炎衣原体蛋白的家兔抗体,将10微克COMC抗原溶解在20微升SDS样品缓冲液中,然后分装在5个小管中。用1毫升PBS和1毫升Freund不完全佐剂(Difco laboratories,USA目录号0639-606)进一步稀释溶解的抗原,并注射新西兰白兔四头肌内。以间隔一周时间肌内注射三次,再过三周,静脉内注射稀释在1毫升PBS中的溶解的COMC蛋白,两周后重复该步骤。在开始免疫后的11周后,从家兔获得血清。用SDS-PAGE分离纯化的肺炎衣原体EB,将蛋白电转移到硝酸纤维素膜上。用多克隆COMC抗体封闭膜并免疫染色(图3)。该血清识别EB制备物中大小为100/95,60和38kDa的蛋白。这与外膜蛋白的大小相符。
COMC蛋白的克隆
由于在HeLa细胞中培育肺炎衣原体,因此EB制备物中可能存在污染性的宿主细胞DNA。因此,用DNA酶处理纯化的EB制备物以除去污染性DNA。然后用CsCl梯度离心纯化肺炎衣原体DNA。用Sau3A部分消化肺炎衣原体DNA,将含有大小约为0.5-0.4kb的DNA片段的级分克隆到表达载体系统pEX(Boehringer,Germany目录号1034766,1034774,1034782)中。pEX载体系统具有β半乳糖苷酶基因,β-半乳糖苷酶基因的3′端有多个克隆位点。基因的表达由PR启动子调节,因此可以通过将温度从32℃升高到42℃来诱导蛋白的表达。将重组细菌菌落转移到硝酸纤维素膜上,将温度升高至42℃2小时。将硝酸纤维素膜置于滤膜上浸泡在5%SDS中,使细菌裂解。用针对肺炎衣原体COMC的多克隆抗体检测表达外膜蛋白的菌落。将阳性克隆培育在悬浮液中并于42℃诱导2小时。用SDS-PAGE分析克隆的蛋白全貌,观察到诱导的β半乳糖苷酶的大小增加(图4)。另外,将蛋白电泳转移到硝酸纤维素膜上,确认与针对COMC的多克隆血清的反应(图5)。
阳性COMC克隆的测序
为了对pEX克隆进行特性分析,对插入的肺炎衣原体DNA测序。在GenEmbl数据库中针对原核序列搜索所得DNA序列。搜索结果鉴定了6个克隆是Omp2基因的部分,2个克隆是Omp3基因的部分,2个克隆是MOMP基因的部分,提示COMC蛋白已经被成功克隆。另外,获得32个克隆,其中含有在GenEmbl数据库中未发现的DNA序列。然而,这些序列可能以两个4-6个克隆的毗连群(contic)簇集,三个克隆是相同的。发现另外19个克隆与该毗连群不重叠(图7)。为了获得关于基因的更多的序列信息,用BamHI限制性酶整个消化肺炎衣原体DNA,将片段克隆到载体pBluescript中。将连接的DNA电转化到大肠杆菌XL1-Blue中,并在含有氨苄青霉素的平板上选择。将重组细菌菌落转移到硝酸纤维素膜上,用pEX1-1克隆的插入物作为探针进行菌落杂交。发现了含有4.5kb的单个BamHI片段的一个克隆,用Southern印迹确认此克隆和探针杂交。用各约为300bp的合成引物对此克隆的插入物进行双向测序。BamHI片段的序列使得其能连接pEX克隆的两个毗连群。和pEX克隆一起总共能装配出编码两个新COMC蛋白的6.5kb DNA序列(图6)。
采用来自已知Omp基因和来自其它已知基因的引物在纯化的肺炎衣原体DNA上进行PCR,获得另外的序列。对获得的PCR产物进行测序。序列结构显示在图7中。检测到另外8个Omp基因。推导的氨基酸序列的序列对比显示在图8A和B中。
DNA序列的分析
DNA序列编码大小为89.6-100.3kDa的Omp4-15蛋白(Omp13为56.1kDa)。Omp4和Omp5以相反方向转录。可能的终止结构位于Omp4下游。Omp5基因的3′端没有被克隆,因为该基因内存在BamHI限制性酶切位点。用GCG软件包(Wisconsin软件包,8.1-UNIX版,1995年8月,序列分析软件包)的gap程序比较Omp4和Omp5的翻译的DNA序列。两个基因的氨基酸相同性为41%(相似性为61%),在Omp4中,信号肽酶1的可能的断裂位点在氨基酸17位,而在Omp5中在氨基酸25位。当将其它两个pEX克隆编码的氨基酸序列与Omp4和Omp5相比时,它们与这些基因也具有氨基酸同源性。发现两个克隆与Omp4和Omp5蛋白中的同一区域有同源性。因此,pEX克隆必定源自另两个基因。因此将这些基因命名为Omp6和Omp7。对其它基因进行类似的分析。与Omp4和Omp5中所见相反,其它推定的Omp蛋白均没有信号肽断裂位点。
实施例2
针对pEX融合蛋白以及加上Omp4的全长重组的多克隆单特异性抗体
为了调查Omp4-7蛋白的拓扑结构,从每个基因中选出代表性的pEX克隆。诱导出β-半乳糖苷酶/Omp的融合蛋白,如同包涵体那样部分纯化。用该抗原以一周的间隔对Balb/c小时肌内免疫三次,6周后获得小鼠血清。用肺炎衣原体感染HeLa细胞。感染72小时后用3.7%甲醛固定单层。由于外膜蛋白被甲醛广泛交联,因此该处理使得抗体不能透过衣原体外膜。用0.2% Triton X100渗透HeLa细胞,用PBS洗涤单层,然后用20%(v/v)FCS培育,以灭活甲醛游离基团。用含20%(v/v)FCS的PBS稀释小鼠血清1∶100,并和单层一起培育半小时。用PBS洗涤单层,加入偶联FITCH的第二家兔抗小鼠血清反应半小时,洗涤单层并固定。一些抗体与包涵体中的EB强烈反应(图9)。尽管作了甲醛固定,但是也不能排除EB表面因处理而改变,这样抗体能到达Omp4-7。因此,用针对克隆pEX3-36的抗体进行免疫电子显微镜法确认反应。将纯化的肺炎衣原体EB吸收到碳涂覆的镍栅格上。吸收后,用PBS洗涤栅格,用溶解在PBS中的0.5%卵白蛋白封闭。用相同缓冲液1∶100稀释抗体,并培育30分钟。用PBS洗涤栅格。在栅格中加入用含1%明胶的PBS稀释的偶联10nm胶体金的家兔抗小鼠Ig,放置半小时。栅格用含1%明胶的PBS洗3次,再用PBS洗3次,栅格用0.7%磷钨酸反染色(contrastain)。在40kV的Jeol 1010电镜中分析栅格。看到金颗粒覆盖了纯化的EB的表面。由于肺炎衣原体EB在纯化或和抗体反应条件下没有接触洗涤剂或固定,因此这些结果表明克隆的蛋白具有外露的表位。
针对Omp4的多克隆单特异性抗体
用含有LIC位点的引物作PCR扩增Omp4基因,将PCR产物克隆到pET-30 LIC载体(Novagen)中。通过IPTG诱导合成来表达组氨酸标记的融合蛋白,并在镍柱上纯化。用纯化的Omp4蛋白免疫家兔(6次,每次8微克)。
用针对重组体Omp4的家兔多克隆抗体检测石蜡包埋切片中的肺炎衣原体
在经鼻感染后3天,获得肺炎衣原体感染的小鼠的肺。将组织样品固定在4%甲醛中,用石蜡包埋,切片并在染色前去除石蜡。使切片和以TBS 1∶200稀释的家兔血清室温培育30分钟。在TBS中洗涤两次后,用1∶300稀释于TBS中的第二抗体(生物素化山羊抗家兔抗体)培育切片,然后用TBS洗涤两次。用链霉亲和素-生物素复合物(streptABCompelx/AP,Dako)对切片染色30分钟,洗涤,并在镜检下用chromagen+new fuchsin(Vector Laboratories)显影。用苏木素对切片反染色,用显微镜分析。
用超免疫单特异性家兔抗血清进行免疫印迹分析
用含有LTC位点的引物PCR扩增pEX1-1克隆的插入物。因此,PCR产物能插入pET-32 LIC载体(Novagen,UK目录号69076-1)中。因此,pEX1-1克隆的插入物序列在新载体中以融合蛋白形式表达,由pET-32 LIC载体编码的融合蛋白部分具有一排6个组氨酸残基。在该载体中诱导融合蛋白的表达,由于组氨酸残基对二价阳离子有高亲合力,因此该融合蛋白可在变性条件下在Ni2+柱上纯化。用纯化的蛋白免疫新西兰白色家兔。肌内免疫6次、静脉内免疫2次后,获得家兔血清。将纯化的肺炎衣原体EB溶解在SDS样品缓冲液中。一半样品在样品缓冲液中加热至100℃,而另一半样品则不加热。用SDS-PAGE分离样品,将蛋白转移到硝酸纤维素膜上,使血清和条带反应。对于加热至100℃的样品,血清识别约98kDa的高分子量条带。这与Omp5的预计大小相符,pEX1-1克隆是其一部分,然而,当抗体与未加热EB条带反应时,图形是不同的。现在发现了大小为75kDa的条带,另外在条带上方还观察到较弱的条带(图1a)。这些数据证明,Omp5需要在SDS-样品缓冲液中煮沸来完全变性,并迁移至从基因产物预计的大小处。当样品未煮沸时,蛋白没有完全变性,更少的SDS与蛋白结合,它具有更具球形的结构,其在丙烯酰胺凝胶中会迁移至更远处。条带图形看上去与用单克隆抗体(MAb26.1)的结果相同(泳道6)(我们以前已经描述过(Christiansen等人,1994)),即与肺炎衣原体EB表面反应,但是抗体不与免疫印迹中SDS完全变性的肺炎衣原体EB反应。
C57黑色小鼠的实验性感染
由于了解到没有煮沸的Omp4-7蛋白的迁移发生改变,我们选择在小鼠受实验性感染后分析针对肺炎衣原体EB的抗体。为了从肺炎衣原体引起的感染获得抗体,在微量乙醚麻醉下用107 CFI肺炎衣原体鼻内接种C57黑色小鼠。感染14天后获得血清样品,分析肺的病理学变化。在两个小鼠的肺切片中观察到严重的肺炎,而在第3只小鼠中只发现有少量改变。1∶100稀释小鼠血清,并和溶解在煮沸和没有煮沸的样品缓冲液中的纯化的EB反应。在已加热至100℃的制备物中,来自两只小鼠的血清与60/62kDa条带以及55kDa的较弱条带反应强烈,但是对于大小为Omp4-7的蛋白没有观察到反应(图11)。然而,当血清和未加热过的制备物反应时,它们和大小约为75kDa的宽条带均有强反应。这与未加热制备物中Omp4-7蛋白的大小相符。因此,可以归结为,肺炎衣原体感染后抗体识别的Omp4-7蛋白的表位是不连续的表位,因为抗原完全变性完全破坏了表位。在未加热样品中观察到的75kD蛋白不是Omp2(如Omp2特异性抗体免疫印迹所表明)。
实施例3
肺炎衣原体Omp4-7和鹦鹉热衣原体推定的外膜蛋白(POMP)的比较
Longbottom等人1996年已经公开了鹦鹉热衣原体的98-90kDa的部分序列。他们已经将该家族中5个基因的全序列输入了EMBL数据库。他们已将基因命名为“推定的外膜蛋白”(POMP),因为还未确定它们的精确位置。该家族由两个完全相同的基因组成,这两个基因与这些基因高度同源。他们计算出分子大小为90和91kDa。第5个基因编码98kDa的蛋白。用GCG软件包的pileup程序比较肺炎衣原体Omp4-7蛋白的序列和鹦鹉热衣原体POMP蛋白的序列。氨基酸同源性在51-63%范围内。发现肺炎衣原体Omp4-5蛋白与鹦鹉热衣原体的98kDaPOMP蛋白最相关。令人感兴趣的是,与其它鹦鹉热衣原体基因相比,98kDa的鹦鹉热衣原体POMP蛋白和肺炎衣原体基因更相关。GGAI的重复序列在98kDa POMP蛋白中是保守的,但是在90和91kDa鹦鹉热衣原体POMP蛋白中只有3个GGAI重复序列。对于鹦鹉热衣原体,已经显示,针对这些蛋白的抗体似乎对感染有保护作用。
参考文献
1.Caldwell,H.D.,J.Kronhout和J.Schacheter,感染免疫学(Infect.Immun.)31,1161-1176(1981).
2.Campbell,L.A.,M.P.Melgosa,D.J.Hamilton,C.-C.Kuo和J.T.Grayston,临床微生物杂志(J.Clinical Microbiol.),30,434-439(1992).
3.Christiansen,G.,和S.Birkelund.欧洲微生物(Eur.Microbiol).1∶24-29(1992).
4.Christiansen,G.,L.Φstergaard,和S.Birkelund,第8次人感染国际讨论会,Orfila等人编辑,173-176页,1994.
5.Grayston,J.T.,Kuo,C.-C.,Campbell,L.A.,和Vang,S.-P.Int.J.Syst.Bacteriol.39,88-90(1989).
6.Grayston,J.T.,C.-C.Kuo,S.-P.Wang和J.Altman.1986.N.Engl.J.Med.315,161-168(1986).
7.Kuo,C.C.,L.A.Jackson,L.A.Campbell和J.T.Grayston.临床微生物综述(Clin.Microbiol.Rev.)8,451-461(1995).
8.Longbottom,D.,M.Russell,G.E.Jones,A.Lainson,和A.J.Herring.FEMSMicrobiol.Lett.142,277-281(1996).
9.Melgosa,M.P.,C.-C.Kuo和L.A.Campbell,FEMS Microbiol.Lett.112,199-204(1993).
10.Campbell,L.A.,C.-C.Kuo,S.P.Wang和J.T.Grayston.临床微生物杂志,28,1261-1264(1990).
11.Halme,S.,P.Saikku和H.-M.Surcel.Scand.J.Immunol.45,378-384(1997).
12.Miyashita,N.和A.Matsumoto.临床微生物杂志30,2911-2916(1992).
13.Wang,S.P.和J.T.Grayston,Am.J.Ophtalmol.70,367-374(1970).
14.Freund,E.A.,H.Erno和R.M.Lemcke.支原体的鉴定,377-443页,I.Norris和J.R.Bergen;微生物学方法(Method in Microbiology)13卷,A.P.Inc.London 1979.
序列表
(1)一般信息
(i)申请人
(A)姓名:Svend Birkelund
(B)街道:Dept.of Medical Microbiology and Immunology,
University of Arhus
(C)城市:Arhus C
(D)省份:
(E)国家:Denmark
(F)邮政编码(邮编):8000
(ii)发明名称:肺炎衣原体抗原
(iii)序列数目:30
(iv)计算机可读形式:
(A)记录介质类型:软盘
(B)计算机:IBM兼容型
(C)操作系统:DOS
(D)软件:FastSEQ for Windows Version 2.0
(v)本申请资料:
(A)申请号:
(2)SEQ ID NO:1的信息:
(i)序列特征:
(A)长度:3200碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ix)特征:
(A)名称/关键字:编码序列
(B)位置:205...2987
(D)其它信息:
(xi)序列描述:SEQ ID NO:1:CAATGTCGAA GAGAGCACTA ACCAGGAAAA TTGCGATTTC ATAAACCCAC TTTATTATTA 60AATTCTTACT TGCGTCATAT AAAATAGAAA ACTCAGAGAG TCAAGATAAA AATTCTTGAC 120AGCTGTTTTG TCATCTTTAA CTTGATTTAC TTATTTTGTT TCTATATTGA TGCGAATAGT 180TCTCTAAAAAA ACAAAAAGCAT TACC ATG AAG ACT TCG ATT CCT TGG GTT TTA 231
Met Lys Thr Ser Ile Pro Trp Val Leu
1 5GTT TCC TCC GTG TTA GCT TTC TCA TGT CAC CTA CAG TCA CTA GCT AAC 279Val Ser Ser Val Leu Ala Phe Ser Cys His Leu Gln Ser Leu Ala Asn10 15 20 25GAG GAA CTT TTA TCA CCT GAT GAT AGC TTT AAT GGA AAT ATC GAT TCA 327Glu Glu Leu Leu Ser Pro Asp Asp Ser Phe Asn Gly Asn Ile Asp Ser
30 35 40GGA ACG TTT ACT CCA AAA ACT TCA GCC ACA ACA TAT TCT CTA ACA GGA 375Gly Thr Phe Thr Pro Lys Thr Ser Ala Thr Thr Tyr Ser Leu Thr Gly
45 50 55GAT GTC TTC TTT TAC GAG CCT GGA AAA GGC ACT CCC TTA TCT GAC AGT 423Asp Val Phe Phe Tyr Glu Pro Gly Lys Gly Thr Pro Leu Ser Asp Ser
60 65 70TGT TTT AAG CAA ACC ACG GAC AAT CTT ACC TTC TTG GGG AAC GGT CAT 471Cys Phe Lys Gln Thr Thr Asp Asn Leu Thr Phe Leu Gly Asn Gly His
75 80 85AGC TTA ACG TTT GGC TTT ATA GAT GCT GGC ACT CAT GCA GGT GCT GCT 519Ser Leu Thr Phe Gly Phe Ile Asp Ala Gly Thr His Ala Gly Ala Ala90 95 100 105GCA TCT ACA ACA GCA AAT AAG AAT CTT ACC TTC TCA GGG TTT TCC TTA 567Ala Ser Thr Thr Ala Asn Lys Asn Leu Thr Phe Ser Gly Phe Ser Leu
110 115 120CTG AGT TTT GAT TCC TCT CCT AGC ACA ACG GTT ACT ACA GGT CAG GGA 615Leu Ser Phe Asp Ser Ser Pro Ser Thr Thr Val Thr Thr Gly Gln Gly
125 130 135ACG CTT TCC TCA GCA GGA GGC GTA AAT TTA GAA AAT ATT CGT AAA CTT 663Thr Leu Ser Ser Ala Gly Gly Val Asn Leu Glu Asn Ile Arg Lys Leu
140 145 150GTA GTT GCT GGG AAT TTT TCT ACT GCA GAT GGT GGA GCT ATC AAA GGA 711Val Val Ala Gly Asn Phe Ser Thr Ala Asp Gly Gly Ala Ile Lys Gly
155 160 165GCG TCT TTC CTT TTA ACT GGC ACT TCT GGA GAT GCT CTT TTT AGT AAC 759Ala Ser Phe Leu Leu Thr Gly Thr Ser Gly Asp Ala Leu Phe Ser Asn170 175 180 185AAC TCT TCA TCA ACA AAG GGA GGA GCA ATT GCT ACT ACA GCA GGC GCT 807Asn Ser Ser Ser Thr Lys Gly Gly Ala Ile Ala Thr Thr Ala Gly Ala
190 195 200CGC ATA GCA AAT AAC ACA GGT TAT GTT AGA TTC CTA TCT AAC ATA GCG 855Arg Ile Ala Asn Asn Thr Gly Tyr Val Arg Phe Leu Ser Asn Ile Ala
205 210 215TCT ACG TCA GGA GGC GCT ATC GAT GAT GAA GGC ACG TCG ATA CTA TCG 903Ser Thr Ser Gly Gly Ala Ile Asp Asp Glu Gly Thr Ser Ile Leu Ser
220 225 230AAC AAC AAA TTT CTA TAT TTT GAA GGG AAT GCA GCG AAA ACT ACT GGC 951Asn Asn Lys Phe Leu Tyr Phe Glu Gly Asn Ala Ala Lys Thr Thr Gly
235 240 245GGT GCG ATC TGC AAC ACC AAG GCG AGT GGA TCT CCT GAA CTG ATA ATC 999Gly Ala Ile Cys Asn Thr Lys Ala Ser Gly Ser Pro Glu Leu Ile Ile250 255 260 265TCT AAC AAT AAG ACT CTG ATC TTT GCT TCA AAC GTA GCA GAA ACA AGC 1047Ser Asn Asn Lys Thr Leu Ile Phe Ala Ser Asn Val Ala Glu Thr Ser
270 275 280GGT GGC GCC ATC CAT GCT AAA AAG CTA GCC CTT TCC TCT GGA GGC TTT 1095Gly Gly Ala Ile His Ala Lys Lys Leu Ala Leu Ser Ser Gly Gly Phe
285 290 295ACA GAG TTT CTA CGA AAT AAT GTC TCA TCA GCA ACT CCT AAG GGG GGT 1143Thr Glu Phe Leu Arg Asn Asn Val Ser Ser Ala Thr Pro Lys Gly Gly
300 305 310GCT ATC AGC ATC GAT GCC TCA GGA GAG CTC AGT CTT TCT GCA GAG ACA 1191Ala Ile Ser Ile Asp Ala Ser Gly Glu Leu Ser Leu Ser Ala Glu Thr
315 320 325GGA AAC ATT ACC TTT GTA AGA AAT ACC CTT ACA ACA ACC GGA AGT ACC 1239Gly Asn Ile Thr Phe Val Arg Asn Thr Leu Thr Thr Thr Gly Ser Thr330 335 340 345GAT ACT CCT AAA CGT AAT GCG ATC AAC ATA GGA AGT AAC GGG AAA TTC 1287Asp Thr Pro Lys Arg Asn Ala Ile Asn Ile Gly Ser Asn Gly Lys Phe
350 355 360ACG GAA TTA CGG GCT GCT AAA AAT CAT ACA ATT TTC TTC TAT GAT CCC 1335Thr Glu Leu Arg Ala Ala Lys Asn His Thr Ile Phe Phe Tyr Asp Pro
365 370 375ATC ACT TCA GAA GGA ACC TCA TCA GAC GTA TTG AAG ATA AAT AAC GGC 1383Ile Thr Ser Glu Gly Thr Ser Ser Asp Val Leu Lys Ile Asn Asn Gly
380 385 390TCT GCG GGA GCT CTC AAT CCA TAT CAA GGA ACG ATT CTA TTT TCT GGA 1431Ser Ala Gly Ala Leu Asn Pro Tyr Gln Gly Thr Ile Leu Phe Ser Gly
395 400 405GAA ACC CTA ACA GCA GAT GAA CTT AAA GTT GCT GAC AAT TTA AAA TCT 1479Glu Thr Leu Thr Ala Asp Glu Leu Lys Val Ala Asp Asn Leu Lys Ser410 415 420 425TCA TTC ACG CAG CCA GTC TCC CTA TCC GGA GGA AAG TTA TTG CTA CAA 1527Ser Phe Thr Gln Pro Val Ser Leu Ser Gly Gly Lys Leu Leu Leu Gln
430 435 440AAG GGA GTC ACT TTA GAG AGC ACG AGC TTC TCT CAA GAG GCC GGT TCT 1575Lys Gly Val Thr Leu Glu Ser Thr Ser Phe Ser Gln Glu Ala Gly Ser
445 450 455CTC CTC GGC ATG GAT TCA GGA ACG ACA TTA TCA ACT ACA GCT GGG AGT 1623Leu Leu Gly Met Asp Ser Gly Thr Thr Leu Ser Thr Thr Ala Gly Ser
460 465 470ATT ACA ATC ACG AAC CTA GGA ATC AAT GTT GAC TCC TTA GGT CTT AAG 1671Ile Thr Ile Thr Asn Leu Gly Ile Asn Val Asp Ser Leu Gly Leu Lys
475 480 485CAG CCC GTC AGC CTA ACA GCA AAA GGT GCT TCA AAT AAA GTG ATC GTA 1719Gln Pro Val Ser Leu Thr Ala Lys Gly Ala Ser Asn Lys Val Ile Val490 495 500 505TCT GGG AAG CTC AAC CTG ATT GAT ATT GAA GGG AAC ATT TAT GAA AGT 1767Ser Gly Lys Leu Asn Leu Ile Asp Ile Glu Gly Asn Ile Tyr Glu Ser
510 515 520CAT ATG TTC AGC CAT GAC CAG CTC TTC TCT CTA TTA AAA ATC ACG GTT 1815His Met Phe Ser His Asp Gln Leu Phe Ser Leu Leu Lys Ile Thr Val
525 530 535GAT GCT GAT GTT GAT ACT AAC GTT GAC ATC AGC AGC CTT ATC CCT GTT 1863Asp Ala Asp Val Asp Thr Asn Val Asp Ile Ser Ser Leu Ile Pro Val
540 545 550CCT GCT GAG GAT CCT AAT TCA GAA TAC GGA TTC CAA GGA CAA TGG AAT 1911Pro Ala Glu Asp Pro Asn Ser Glu Tyr Gly Phe Gln Gly Gln Trp Asn
555 560 565GTT AAT TGG ACT ACG GAT ACA GCT ACA AAT ACA AAA GAG GCC ACG GCA 1959Val Asn Trp Thr Thr Asp Thr Ala Thr Asn Thr Lys Glu Ala Thr Ala570 575 580 585ACT TGG ACC AAA ACA GGA TTT GTT CCC AGC CCC GAA AGA AAA TCT GCG 2007Thr Trp Thr Lys Thr Gly Phe Val Pro Ser Pro Glu Arg Lys Ser Ala
590 595 600TTA GTA TGC AAT ACC CTA TGG GGA GTC TTT ACT GAC ATT CGC TCT CTG 2055Leu Val Cys Asn Thr Leu Trp Gly Val Phe Thr Asp Ile Arg Ser Leu
605 610 615CAA CAG CTT GTA GAG ATC GGC GCA ACT GGT ATG GAA CAC AAA CAA GGT 2103Gln Gln Leu Val Glu Ile Gly Ala Thr Gly Met Glu His Lys Gln Gly
620 625 630TTC TGG GTT TCC TCC ATG ACG AAC TTC CTG CAT AAG ACT GGA GAT GAA 2151Phe Trp Val Ser Ser Met Thr Asn Phe Leu His Lys Thr Gly Asp Glu
635 640 645AAT CGC AAA GGC TTC CGT CAT ACC TCT GGA GGC TAC GTC ATC GGT GGA 2199Asn Arg Lys Gly Phe Arg His Thr Ser Gly Gly Tyr Val Ile Gly Gly650 655 660 665AGT GCT CAC ACT CCT AAA GAC GAC CTA TTT ACC TTT GCG TTC TGC CAT 2247Ser Ala His Thr Pro Lys Asp Asp Leu Phe Thr Phe Ala Phe Cys His
670 675 680CTC TTT GCT AGA GAC AAA GAT TGT TTT ATC GCT CAC AAC AAC TCT AGA 2295Leu Phe Ala Arg Asp Lys Asp Cys Phe Ile Ala His Asn Asn Ser Arg
685 690 695ACC TAC GGT GGA ACT TTA TTC TTC AAG CAC TCT CAT ACC CTA CAA CCC 2343Thr Tyr Gly Gly Thr Leu Phe Phe Lys His Ser His Thr Leu Gln Pro
700 705 710CAA AAC TAT TTG AGA TTA GGA AGA GCA AAG TTT TCT GAA TCA GCT ATA 2391Gln Asn Tyr Leu Arg Leu Gly Arg Ala Lys Phe Ser Glu Ser Ala Ile
715 720 725GAA AAA TTC CCT AGG GAA ATT CCC CTA GCC TTG GAT GTC CAA GTT TCG 2439Glu Lys Phe Pro Arg Glu Ile Pro Leu Ala Leu Asp Val Gln Val Ser730 735 740 745TTC AGC CAT TCA GAC AAC CGT ATG GAA ACG CAC TAT ACC TCA TTG CCA 2487Phe Ser His Ser Asp Asn Arg Met Glu Thr His Tyr Thr Ser Leu Pro
750 755 760GAA TCC GAA GGT TCT TGG AGC AAC GAG TGT ATA GCT GGT GGT ATC GGC 2535Glu Ser Glu Gly Ser Trp Ser Asn Glu Cys Ile Ala Gly Gly Ile Gly
765 770 775CTA GAC CTT CCT TTT GTT CTT TCC AAC CCA CAT CCT CTT TTC AAG ACC 2583Leu Asp Leu Pro Phe Val Leu Ser Asn Pro His Pro Leu Phe Lys Thr
780 785 790TTC ATT CCA CAG ATG AAA GTC GAA ATG GTT TAT GTA TCA CAA AAT AGC 2631Phe Ile Pro Gln Met Lys Val Glu Met Val Tyr Val Ser Gln Asn Ser
795 800 805TTC TTC GAA AGC TCT AGT GAT GGC CGT GGT TTT AGT ATT GGA AGG CTG 2679Phe Phe Glu Ser Ser Ser Asp Gly Arg Gly Phe Ser Ile Gly Arg Leu810 815 820 825CTT AAC CTC TCG ATT CCT GTG GGT GCG AAA TTC GTG CAG GGG GAT ATC 2727Leu Asn Leu Ser Ile Pro Val Gly Ala Lys Phe Val Gln Gly Asp Ile
830 835 840GGA GAT TCC TAC ACC TAT GAT CTC TCA GGA TTC TTT GTT TCC GAT GTC 2775Gly Asp Ser Tyr Thr Tyr Asp Leu Ser Gly Phe Phe Val Ser Asp Val
845 850 855TAT CGT AAC AAT CCC CAA TCT ACA GCG ACT CTT GTG ATG AGC CCA GAC 2823Tyr Arg Asn Asn Pro Gln Ser Thr Ala Thr Leu Val Met Ser Pro Asp
860 865 870TCT TGG AAA ATT CGC GGT GGC AAT CTT TCA AGA CAG GCA TTT TTA CTG 2871Ser Trp Lys Ile Arg Gly Gly Asn Leu Ser Arg Gln Ala Phe Leu Leu
875 880 885AGG GGT AGC AAC AAC TAC GTC TAC AAC TCC AAT TGT GAG CTC TTC GGA 2919Arg Gly Ser Asn Asn Tyr Val Tyr Asn Ser Asn Cys Glu Leu Phe Gly890 895 900 905CAT TAC GCT ATG GAA CTC CGT GGA TCT TCA AGG AAC TAC AAT GTA GAT 2967His Tyr Ala Met Glu Leu Arg Gly Ser Ser Arg Asn Tyr Asn Val Asp
910 915 920GTT GGT ACC AAA CTC CGA TT CTAGATTGCT AAAACTCCCT AGTTCTTCTA GGGAG 3022Val Gly Thr Lys Leu Arg Phe
925TTTTCTCATA CTTTTAGGGA AATATTTGCT ATAGGGAATG CTTTCCTTGC AAACTGTAAA 3082AAATAACATT TGTCCCTCTT CAAAAAAGAT TTCTTTTAAT AATTTCTAGT TATAATTTTA 3142TTTTAAAAAC AGTTAAATAA TTAATAGACA ATAATCTATT CTTATTGACT TCTTTTTT 3200
(2)SEQ ID NO:2的信息:
(i)序列特征:
(A)长度:928氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:蛋白质
(v)片段类型:内部
(xi)序列描述:SEQ ID NO:2:Met Lys Thr Ser Ile Pro Trp Val Leu Val Ser Ser Val Leu Ala Phe1 5 10 15Ser Cys His Leu Gln Ser Leu Ala Asn Glu Glu Leu Leu Ser Pro Asp
20 25 30Asp Ser Phe Asn Gly Asn Ile Asp Ser Gly Thr Phe Thr Pro Lys Thr
35 40 45Ser Ala Thr Thr Tyr Ser Leu Thr Gly Asp Val Phe Phe Tyr Glu Pro
50 55 60Gly Lys Gly Thr Pro Leu Ser Asp Ser Cys Phe Lys Gln Thr Thr Asp65 70 75 80Asn Leu Thr Phe Leu Gly Asn Gly His Ser Leu Thr Phe Gly Phe Ile
85 90 95Asp Ala Gly Thr His Ala Gly Ala Ala Ala Ser Thr Thr Ala Asn Lys
100 105 110Asn Leu Thr Phe Ser Gly Phe Ser Leu Leu Ser Phe Asp Ser Ser Pro
115 120 125Ser Thr Thr Val Thr Thr Gly Gln Gly Thr Leu Ser Ser Ala Gly Gly130 135 140Val Asn Leu Glu Asn Ile Arg Lys Leu Val Val Ala Gly Asn Phe Ser145 150 155 160Thr Ala Asp Gly Gly Ala Ile Lys Gly Ala Ser Phe Leu Leu Thr Gly
165 170 175Thr Ser Gly Asp Ala Leu Phe Ser Asn Asn Ser Ser Ser Thr Lys Gly
180 185 190Gly Ala Ile Ala Thr Thr Ala Gly Ala Arg Ile Ala Asn Asn Thr Gly
195 200 205Tyr Val Arg Phe Leu Ser Asn Ile Ala Ser Thr Ser Gly Gly Ala Ile
210 215 220Asp Asp Glu Gly Thr Ser Ile Leu Ser Asn Asn Lys Phe Leu Tyr Phe225 230 235 240Glu Gly Asn Ala Ala Lys Thr Thr Gly Gly Ala Ile Cys Asn Thr Lys
245 250 255Ala Ser Gly Ser Pro Glu Leu Ile Ile Ser Asn Asn Lys Thr Leu Ile
260 265 270Phe Ala Ser Asn Val Ala Glu Thr Ser Gly Gly Ala Ile His Ala Lys
275 280 285Lys Leu Ala Leu Ser Ser Gly Gly Phe Thr Glu Phe Leu Arg Asn Asn
290 295 300Val Ser Ser Ala Thr Pro Lys Gly Gly Ala Ile Ser Ile Asp Ala Ser305 310 315 320Gly Glu Leu Ser Leu Ser Ala Glu Thr Gly Asn Ile Thr Phe Val Arg
325 330 335Asn Thr Leu Thr Thr Thr Gly Ser Thr Asp Thr Pro Lys Arg Asn Ala
340 345 350Ile Asn Ile Gly Ser Asn Gly Lys Phe Thr Glu Leu Arg Ala Ala Lys
355 360 365Asn His Thr Ile Phe Phe Tyr Asp Pro Ile Thr Ser Glu Gly Thr Ser
370 375 380Ser Asp Val Leu Lys Ile Asn Asn Gly Ser Ala Gly Ala Leu Asn Pro385 390 395 400Tyr Gln Gly Thr Ile Leu Phe Ser Gly Glu Thr Leu Thr Ala Asp Glu
405 410 415Leu Lys Val Ala Asp Asn Leu Lys Ser Ser Phe Thr Gln Pro Val Ser
420 425 430Leu Ser Gly Gly Lys Leu Leu Leu Gln Lys Gly Val Thr Leu Glu Ser
435 440 445Thr Ser Phe Ser Gln Glu Ala Gly Ser Leu Leu Gly Met Asp Ser Gly
450 455 460Thr Thr Leu Ser Thr Thr Ala Gly Ser Ile Thr Ile Thr Asn Leu Gly465 470 475 480Ile Asn Val Asp Ser Leu Gly Leu Lys Gln Pro Val Ser Leu Thr Ala
485 490 495Lys Gly Ala Ser Asn Lys Val Ile Val Ser Gly Lys Leu Asn Leu Ile
500 505 510Asp Ile Glu Gly Asn Ile Tyr Glu Ser His Met Phe Ser His Asp Gln
515 520 525Leu Phe Ser Leu Leu Lys Ile Thr Val Asp Ala Asp Val Asp Thr Asn
530 535 540Val Asp Ile Ser Ser Leu Ile Pro Val Pro Ala Glu Asp Pro Asn Ser545 550 555 560Glu Tyr Gly Phe Gln Gly Gln Trp Asn Val Asn Trp Thr Thr Asp Thr
565 570 575Ala Thr Asn Thr Lys Glu Ala Thr Ala Thr Trp Thr Lys Thr Gly Phe
580 585 590Val Pro Ser Pro Glu Arg Lys Ser Ala Leu Val Cys Asn Thr Leu Trp
595 600 605Gly Val Phe Thr Asp Ile Arg Ser Leu Gln Gln Leu Val Glu Ile Gly
610 615 620Ala Thr Gly Met Glu His Lys Gln Gly Phe Trp Val Ser Ser Met Thr625 630 635 640Asn Phe Leu His Lys Thr Gly Asp Glu Asn Arg Lys Gly Phe Arg His
645 650 655Thr Ser Gly Gly Tyr Val Ile Gly Gly Ser Ala His Thr Pro Lys Asp
660 665 670Asp Leu Phe Thr Phe Ala Phe Cys His Leu Phe Ala Arg Asp Lys Asp
675 680 685Cys Phe Ile Ala His Asn Asn Ser Arg Thr Tyr Gly Gly Thr Leu Phe
690 695 700Phe Lys His Ser His Thr Leu Gln Pro Gln Asn Tyr Leu Arg Leu Gly705 710 715 720Arg Ala Lys Phe Ser Glu Ser Ala Ile Glu Lys Phe Pro Arg Glu Ile
725 730 735Pro Leu Ala Leu Asp Val Gln Val Ser Phe Ser His Ser Asp Asn Arg
740 745 750Met Glu Thr His Tyr Thr Ser Leu Pro Glu Ser Glu Gly Ser Trp Ser
755 760 765Asn Glu Cys Ile Ala Gly Gly Ile Gly Leu Asp Leu Pro Phe Val Leu
770 775 780Ser Asn Pro His Pro Leu Phe Lys Thr Phe Ile Pro Gln Met Lys Val785 790 795 800Glu Met Val Tyr Val Ser Gln Asn Ser Phe Phe Glu Ser Ser Ser Asp
805 810 815Gly Arg Gly Phe Ser Ile Gly Arg Leu Leu Asn Leu Ser Ile Pro Val
820 825 830Gly Ala Lys Phe Val Gln Gly Asp Ile Gly Asp Ser Tyr Thr Tyr Asp
835 840 845Leu Ser Gly Phe Phe Val Ser Asp Val Tyr Arg Asn Asn Pro Gln Ser
850 855 860Thr Ala Thr Leu Val Met Ser Pro Asp Ser Trp Lys Ile Arg Gly Gly865 870 875 880Asn Leu Ser Arg Gln Ala Phe Leu Leu Arg Gly Ser Asn Asn Tyr Val
885 890 895Tyr Asn Ser Asn Cys Glu Leu Phe Gly His Tyr Ala Met Glu Leu Arg
900 905 910Gly Ser Ser Arg Asn Tyr Asn Val Asp Val Gly Thr Lys Leu Arg Phe
915 920 925
(2)SEQ ID NO:3的信息:
(i)序列特征:
(A)长度:2815碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:基因组DNA
(xi)序列描述:SEQ ID NO:3:ATGAAATCGC AATTTTCCTG GTTAGTGCTC TCTTCGACAT TGGCATGTTT TACTAGTTGT 60TCCACTGTTT TTGCTGCAAC TGCTGAAAAT ATAGGCCCCT CTGATAGCTT TGACGGAAGT 120ACTAACACAG GCACCTATAC TCCTAAAAAT ACGACTACTG GAATAGACTA TACTCTGACA 180GGAGATATAA CTCTGCAAAA CCTTGGGGAT TCGGCAGCTT TAACGAAGGG TTGTTTTTCT 240GACACTACGG AATCTTTAAG CTTTGCCGGT AAGGGGTACT CACTTTCTTT TTTAAATATT 300AAGTCTAGTG CTGAAGGCGC AGCACTTTCT GTTACAACTG ATAAAAATCT GTCGCTAACA 360GGATTTTCGA GTCTTACTTT CTTAGCGGCC CCATCATCGG TAATCACAAC CCCCTCAGGA 420AAAGGTGCAG TTAAATGTGG AGGGGATCTT ACATTTGATA ACAATGGAAC TATTTTATTT 480AAACAAGATT ACTGTGAGGA AAATGGCGGA GCCATTTCTA CCAAGAATCT TTCTTTGAAA 540AACAGCACGG GATCGATTTC TTTTGAAGGG AATAAATCGA GCGCAACAGG GAAAAAAGGT 600GGGGCTATTT GTGCTACTGG TACTGTAGAT ATTACAAATA ATACGGCTCC TACCCTCTTC 660TCGAACAATA TTGCTGAAGC TGCAGGTGGA GCTATAAATA GCACAGGAAA CTGTACAATT 720ACAGGGAATA CGTCTCTTGT ATTTTCTGAA AATAGTGTGA CAGCGACCGC AGGAAATGGA 780GGAGCTCTTT CTGGAGATGC CGATGTTACC ATATCTGGGA ATCAGAGTGT AACTTTCTCA 840GGAAACCAAG CTGTAGCTAA TGGCGGAGCC ATTTATGCTA AGAAGCTTAC ACTGGCTTCC 900GGGGGGGGGG GGGGTATCTC CTTTTCTAAC AATATAGTCC AAGGTACCAC TGCAGGTAAT 960GGTGGAGCCA TTTCTATACT GGCAGCTGGA GAGTGTAGTC TTTCAGCAGA AGCAGGGGAC 1020ATTACCTTCA ATGGGAATGC CATTGTTGCA ACTACACCAC AAACTACAAA AAGAAATTCT 1080ATTGACATAG GATCTACTGC AAAGATCACG AATTTACGTG CAATATCTGG GCATAGCATC 1140TTTTTCTACG ATCCGATTAC TGCTAATACG GCTGCGGATT CTACAGATAC TTTAAATCTC 1200AATAAGGCTG ATGCAGGTAA TAGTACAGAT TATAGTGGGT CGATTGTTTT TTCTGGTGAA 1260AAGCTCTCTG AAGATGAAGC AAAAGTTGCA GACAACCTCA CTTCTACGCT GAAGCAGCCT 1320GTAACTCTAA CTGCAGGAAA TTTAGTACTT AAACGTGGTG TCACTCTCGA TACGAAAGGC 1380TTTACTCAGA CCGCGGGTTC CTCTGTTATT ATGGATGCGG GCACAACGTT AAAAGCAAGT 1440ACAGAGGAGG TCACTTTAAC AGGTCTTTCC ATTCCTGTAG ACTCTTTAGG CGAGGGTAAG 1500AAAGTTGTAA TTGCTGCTTC TGCAGCAAGT AAAAATGTAG CCCTTAGTGG TCCGATTCTT 1560CTTTTGGATA ACCAAGGGAA TGCTTATGAA AATCACGACT TAGGAAAAAC TCAAGACTTT 1620TCATTTGTGC AGCTCTCTGC TCTGGGTACT GCAACAACTA CAGATGTTCC AGCGGTTCCT 1680ACAGTAGCAA CTCCTACGCA CTATGGGTAT CAAGGTACTT GGGGAATGAC TTGGGTTGAT 1740GATACCGCAA GCACTCCAAA GACTAAGACA GCGACATTAG CTTGGACCAA TACAGGCTAC 1800CTTCCGAATC CTGAGCGTCA AGGACCTTTA GTTCCTAATA GCCTTTGGGG ATCTTTTTCA 1860GACATCCAAG CGATTCAAGG TGTCATAGAG AGAAGTGCTT TGACTCTTTG TTCAGATCGA 1920GGCTTCTGGG CTGCGGGAGT CGCCAATTTC TTAGATAAAG ATAAGAAAGG GGAAAAACGC 1980AAATACCGTC ATAAATCTGG TGGATATGCT ATCGGAGGTG CAGCGCAAAC TTGTTCTGAA 2040AACTTAATTA GCTTTGCCTT TTGCCAACTC TTTGGTAGCG ATAAAGATTT CTTAGTCGCT 2100AAAAATCATA CTGATACCTA TGCAGGAGCC TTCTATATCC AACACATTAC AGAATGTAGT 2160GGGTTCATAG GTTGTCTCTT AGATAAACTT CCTGGCTCTT GGAGTCATAA ACCCCTCGTT 2220TTAGAAGGGC AGCTCGCTTA TAGCCACGTC AGTAATGATC TGAAGACAAA GTATACTGCG 2280TATCCTGAGG TGAAAGGTTC TTGGGGGAAT AATGCTTTTA ACATGATGTT GGGAGCTTCT 2340TCTCATTCTT ATCCTGAATA CCTGCATTGT TTTGATACCT ATGCTCCATA CATCAAACTG 2400AATCTGACCT ATATACGTCA GGACAGCTTC TCGGAGAAAG GTACAGAAGG AAGATCTTTT 2460GATGACAGCA ACCTCTTCAA TTTATCTTTG CCTATAGGGG TGAAGTTTGA GAAGTTCTCT 2520GATTGTAATG ACTTTTCTTA TGATCTGACT TTATCCTATG TTCCTGATCT TATCCGCAAT 2580GATCCCAAAT GCACTACAGC ACTTGTAATC AGCGGAGCCT CTTGGGAAAC TTATGCCAAT 2640AACTTAGCAC GACAGGCCTT GCAAGTGCGT GCAGGCAGTC ACTACGCCTT CTCTCCTATG 2700TTTGAAGTGC TCGGCCAGTT TGTCTTTGAA GTTCGTGGAT CCTCACGGAT TTATAATGTA 2760GATCTTGGGG GTAAGTTCCA ATTCTAGGAG CGTCTCTCAT GTCTCAGAAA TTCTG 2815
(2)SEQ ID NO:4的信息:
(i)序列特征:
(A)长度:928氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:肽
(xi)序列描述:SEQ ID NO:4:Met Lys Ser Gln Phe Ser Trp Leu Val Leu Ser Ser Thr Leu Ala Cys1 5 10 15Phe Thr Ser Cys Ser Thr ValPhe Ala Ala Thr Ala Glu Asn Ile Gly
20 25 30Pro Ser Asp Ser Phe Asp Gly Ser Thr Asn Thr Gly Thr Tyr Thr Pro
35 40 45Lys Asn Thr Thr Thr Gly Ile Asp Tyr Thr Leu Thr Gly Asp Ile Thr
50 55 60Leu Gln Asn Leu Gly Asp Ser Ala Ala Leu Thr Lys Gly Cys Phe Ser65 70 75 80Asp Thr Thr Glu Ser Leu Ser Phe Ala Gly Lys Gly Tyr Ser Leu Ser
85 90 95Phe Leu Asn Ile Lys Ser Ser Ala Glu Gly Ala Ala Leu Ser Val Thr
100 105 110Thr Asp Lys Asn Leu Ser Leu Thr Gly Phe Ser Ser Leu Thr Phe Leu
115 120 125Ala Ala Pro Ser Ser Val Ile Thr Thr Pro Ser Gly Lys Gly Ala Val
130 135 140Lys Cys Gly Gly Asp Leu Thr Phe Asp Asn Asn Gly Thr Ile Leu Phe145 150 155 160Lys Gln Asp Tyr Cys Glu Glu Asn Gly Gly Ala Ile Ser Thr Lys Asn
165 170 175Leu Ser Leu Lys Asn Ser Thr Gly Ser Ile Ser Phe Glu Gly Asn Lys
180 185 190Ser Ser Ala Thr Gly Lys Lys Gly Gly Ala Ile Cys Ala Thr Gly Thr
195 200 205Val Asp Ile Thr Asn Asn Thr Ala Pro Thr Leu Phe Ser Asn Asn Ile
210 215 220Ala Glu Ala Ala Gly Gly Ala Ile Asn Ser Thr Gly Asn Cys Thr Ile225 230 235 240Thr Gly Asn Thr Ser Leu Val Phe Ser Glu Asn Ser Val Thr Ala Thr
245 250 255Ala Gly Asn Gly Gly Ala Leu Ser Gly Asp Ala Asp Val Thr Ile Ser
260 265 270Gly Asn Gln Ser Val Thr Phe Ser Gly Asn Gln Ala Val Ala Asn Gly
275 280 285Gly Ala Ile Tyr Ala Lys Lys Leu Thr Leu Ala Ser Gly Gly Gly Gly
290 295 300Gly Ile Ser Phe Ser Asn Asn He Val Gln Gly Thr Thr Ala Gly Asn305 310 315 320Gly Gly Ala Ile Ser Ile Leu Ala Ala Gly Gh Cys Ser Leu Ser Ala
325 330 335Gh Ala Gly Asp Ile Thr Phe Asn Gly Asn Ala Ile Val Ala Thr Thr
340 345 350Pro Gln Thr Thr Lys Arg Asn Ser Ile Asp Ile Gly Ser Thr Ala Lys
355 360 365Ile Thr Asn Leu Arg Ala Ile Ser Gly His Ser Ile Phe Phe Tyr Asp
370 375 380Pro Ile Thr Ala Asn Thr Ala Ala Asp Ser Thr Asp Thr Leu Asn Leu385 390 395 400Asn Lys Ala Asp Ala Gly Asn Ser Thr Asp Tyr Ser Gly Ser Ile Val
405 410 415Phe Ser Gly Glu Lys Leu Ser Glu Asp Glu Ala Lys Val Ala Asp Asn
420 425 430Leu Thr Ser Thr Leu Lys Gln Pro ValThr Leu Thr Ala Gly Asn Leu
435 440 445Val Leu Lys Arg Gly Val Thr Leu Asp Thr Lys Gly Phe Thr Gln Thr
450 455 460Ala Gly Ser Ser Val Ile Met Asp Ala Gly Thr Thr Leu Lys Ala Ser465 470 475 480Thr Glu Glu Val Thr Leu Thr Gly Leu Ser Ile Pro Val Asp Ser Leu
485 490 495Gly Glu Gly Lys Lys Val Val Ile Ala Ala Ser Ala Ala Ser Lys Asn
500 505 510Val Ala Leu Ser Gly Pro Ile Leu Leu Leu Asp Asn Gln Gly Asn Ala
515 520 525Tyr Glu Asn His Asp Leu Gly Lys Thr Gln Asp Phe Ser Phe Val Gln
530 535 540Leu Ser Ala Leu Gly Thr Ala Thr Thr Thr Asp Val Pro Ala Val Pro545 550 555 560Thr Val Ala Thr Pro Thr His Tyr Gly Tyr Gln Gly Thr Trp Gly Met
565 570 575Thr Trp Val Asp Asp Thr Ala Ser Thr Pro Lys Thr Lys Thr Ala Thr
580 585 590Leu Ala Trp Thr Asn Thr Gly Tyr Leu Pro Asn Pro Glu Arg Gln Gly
595 600 605Pro Leu Val Pro Asn Ser Leu Trp Gly Ser Phe Ser Asp Ile Gln Ala
610 615 620Ile Gln Gly Val Ile Glu Arg Ser Ala Leu Thr Leu Cys Ser Asp Arg625 630 635 640Gly Phe Trp Ala Ala Gly Val Ala Asn Phe Leu Asp Lys Asp Lys Lys
645 650 655Gly Glu Lys Arg Lys Tyr Arg His Lys Ser Gly Gly Tyr Ala Ile Gly
660 665 670Gly Ala Ala Gln Thr Cys Ser Glu Asn Leu Ile Ser Phe Ala Phe Cys
675 680 685Gln Leu Phe Gly Ser Asp Lys Asp Phe Leu Val Ala Lys Asn His Thr
690 695 700Asp Thr Tyr Ala Gly Ala Phe Tyr Ile Gln His Ile Thr Glu Cys Ser705 710 715 720Gly Phe Ile Gly Cys Leu Leu Asp Lys Leu Pro Gly Ser Trp Ser His
725 730 735Lys Pro Leu Val Leu Glu Gly Gln Leu Ala Tyr Ser His Val Ser Asn
740 745 750Asp Leu Lys Thr Lys Tyr Thr Ala Tyr Pro Glu Val Lys Gly Ser Trp
755 760 765Gly Asn Asn Ala Phe Asn Met Met Leu Gly Ala Ser Ser His Ser Tyr
770 775 780Pro Glu Tyr Leu His Cys Phe Asp Thr Tyr Ala Pro Tyr Ile Lys Leu785 790 795 800Asn Leu Thr Tyr Ile Arg Gln Asp Ser Phe Ser Glu Lys Gly Thr Glu
805 810 815Gly Arg Ser Phe Asp Asp Ser Asn Leu Phe Asn Leu Ser Leu Pro Ile
820 825 830Gly Val Lys Phe Glu Lys Phe Ser Asp Cys Asn Asp Phe Ser Tyr Asp
835 840 845Leu Thr Leu Ser Tyr Val Pro Asp Leu Ile Arg Asn Asp Pro Lys Cys
850 855 860Thr Thr Ala Leu Val Ile Ser Gly Ala Ser Trp Glu Thr Tyr Ala Asn865 870 875 880Asn Leu Ala Arg Gln Ala Leu Gln Val Arg Ala Gly Ser His Tyr Ala
885 890 895Phe Ser Pro Met Phe Glu Val Leu Gly Gln Phe ValPhe Glu Val Arg
900 905 910Gly Ser Ser Arg Ile Tyr Asn Val Asp Leu Gly Gly Lys Phe Gln Phe
915 920 925
(2)SEQ ID NO:5的信息:
(i)序列特征:
(A)长度:3052碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:基因组DNA
(xi)序列描述:SEQ ID NO:5:ATGCGATTTT CGCTCTGCGG ATTTCCTCTA GTTTTTTCTT TAACATTGCT CTCAGTCTTC 60GACACTTCTT TGAGTGCTAC TACGATTTCT TTAACCCCAG AAGATAGTTT TCATGGAGAT 120AGTCAGAATG CAGAACGTTC TTATAATGTT CAAGCTGGGG ATGTCTATAG CCTTACTGGT 180GATGTCTCAA TATCTAACGT CGATAACTCT GCATTAAATA AAGCCTGCTT CAATGTGACC 240TCAGGAAGTG TGACGTTCGC AGGAAATCAT CATGGGTTAT ATTTTAATAA TATTTCCTCA 300GGAACTACAA AGGAAGGGGC TGTACTTTGT TGCCAAGATC CTCAAGCAAC GGCACGTTTT 360TCTGGGTTCT CCACGCTCTC TTTTATTCAG AGCCCCGGAG ATATTAAAGA ACAGGGATGT 420CTCTATTCAA AAAATGCACT TATGCTCTTA AACAATTATG TAGTGCGTTT TGAACAAAAC 480CAAAGTAAGA CTAAAGGCGG AGCTATTAGT GGGGCGAATG TTACTATAGT AGGCAACTAC 540GATTCCGTCT CTTTCTATCA GAATGCAGCC ACTTTTGGAG GTGCTATCCA TTCTTCAGGT 600CCCCTACAGA TTGCAGTAAA TCAGGCAGAG ATAAGATTTG CACAAAATAC TGCCAAGAAT 660GGTTCTGGAG GGGCTTTGTA CTCCGATGGT GATATTGATA TTGATCAGAA TGCTTATGTT 720CTATTTCGAG AAAATGAGGC ATTGACTACT GCTATAGGTA AGGGAGGGGC TGTCTGTTGT 780CTTCCCACTT CAGGAAGTAG TACTCCAGTT CCTATTGTGA CTTTCTCTGA CAATAAACAG 840TTAGTCTTTG AAAGAAACCA TTCCATAATG GGTGGCGGAG CCATTTATGC TAGGAAACTT 900AGCATCTCTT CAGGAGGTCC TACTCTATTT ATCAATAATA TATCATATGC AAATTCGCAA 960AATTTAGGTG GAGCTATTGC CATTGATACT GGAGGGGAGA TCAGTTTATC AGCAGAGAAA 1020GGAACAATTA CATTCCAAGG AAACCGGACG AGCTTACCGT TTTTGAATGG CATCCATCTT 1080TTACAAAATG CTAAATTCCT GAAATTACAG GCGAGAAATG GATGCTCTAT AGAATTTTAT 1140GATCCTATTA CTTCTGAAGC AGATGGGTCT ACCCAATTGA ATATCAACGG AGATCCTAAA 1200AATAAAGAGT ACACAGGGAC CATACTCTTT TCTGGAGAAA AGAGTCTAGC AAACGATCCT 1260AGGGATTTTA AATCTACAAT CCCTCAGAAC GTCAACCTGT CTGCAGGATA CTTAGTTATT 1320AAAGAGGGGG CCGAAGTCAC AGTTTCAAAA TTCACGCAGT CTCCAGGATC GCATTTAGTT 1380TTAGATTTAG GAACCAAACT GATAGCCTCT AAGGAAGACA TTGCCATCAC AGGCCTCGCG 1440ATAGATATAG ATAGCTTAAG CTCATCCTCA ACAGCAGCTG TTATTAAAGC AAACACCGCA 1500AATAAACAGA TATCCGTGAC GGACTCTATA GAACTTATCT CGCCTACTGG CAATGCCTAT 1560GAAGATCTCA GAATGAGAAA TTCACAGACG TTCCCTCTGC TCTCTTTAGA GCCTGGAGCC 1620GGGGGTAGTG TGACTGTAAC TGCTGGAGAT TTCCTACCGG TAAGTCCCCA TTATGGTTTT 1680CAAGGCAATT GGAAATTAGC TTGGACAGGA ACTGGAAACA AAGTTGGAGA ATTCTTCTGG 1740GATAAAATAA ATTATAAGCC TAGACCTGAA AAAGAAGGAA ATTTAGTTCC TAATATCTTG 1800TGGGGGAATG CTGTAAATGT CAGATCCTTA ATGCAGGTTC AAGAGACCCA TGCATCGAGC 1860TTACAGACAG ATCGAGGGCT GTGGATCGAT GGAATTGGGA ATTTCTTCCA TGTATCTGCC 1920TCCGAAGACA ATATAAGGTA CCGTCATAAC AGCGGTGGAT ATGTTCTATC TGTAAATAAT 1980GAGATCACAC CTAAGCACTA TACTTCGATG GCATTTTCCC AACTCTTTAG TAGAGACAAG 2040GACTATGCGG TTTCCAACAA CGAATACAGA ATGTATTTAG GATCGTATCT CTATCAATAT 2100ACAACCTCCC TAGGGAATAT TTTCCGTTAT GCTTCGCGTA ACCCTAATGT AAACGTCGGG 2160ATTCTCTCAA GAAGGTTTCT TCAAAATCCT CTTATGATTT TTCATTTTTT GTGTGCTTAT 2220GGTCATGCCA CCAATGATAT GAAAACAGAC TACGCAAATT TCCCTATGGT GAAAAACAGC 2280TGGAGAAACA ATTGTTGGGC TATAGAGTGC GGAGGGAGCA TGCCTCTATT GGTATTTGAG 2340AACGGAAGAC TTTTCCAAGG TGCCATCCCA TTTATGAAAC TACAATTAGT TTATGCTTAT 2400CAGGGAGATT TCAAAGAGAC GACTGCAGAT GGCCGTAGAT TTAGTAATGG GAGTTTAACA 2460TCGATTTCTG TACCTCTAGG CATACGCTTT GAGAAGCTGG CACTTTCTCA GGATGTACTC 2520TATGACTTTA GTTTCTCCTA TATTCCTGAT ATTTTCCGTA AGGATCCCTC ATGTGAAGCT 2580GCTCTGGTGA TTAGCGGAGA CTCCTGGCTT GTTCCGGCAG CACACGTATC AAGACATGCT 2640TTTGTAGGGA GTGGAACGGG TCGGTATCAC TTTAACGACT ATACTGAGCT CTTATGTCGA 2700GGAAGTATAG AATGCCGCCC CCATGCTAGG AATTATAATA TAAACTGTGG AAGCAAATTT 2760CGTTTTTAGA AGGTTTCCAT TGCCTGTGTG GTTCCGGATC TTAACTATAA ATCCTGGACT 2820ATGGATCATA GGCATTGGGT TTCTCGAACT TGTGTGGAGA ATAACGACAT TTTATATGCA 2880TAACGGAATA CTCGTATCAC CTCAGCCCCT AGAGACATTC TTTAGGGGTT CTTTATTTGT 2940CTAAACTTCG TATTTTATCG AGAATCCTTT ACGTTCTTGG TTTGCTTGTC TCCGAGGAGT 3000TCTCTAACGA ATCATAGGGA TTCCAGGGTT CTGTTCCTTG AGTCCTTTGG CA 3052
(2)SEQ ID NO:6的信息:
(i)序列特征:
(A)长度:922氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:肽
(xi)序列描述:SEQ ID NO:6:Met Arg Phe Ser Leu Cys Gly Phe Pro Leu ValPhe Ser Leu Thr Leu1 5 10 15Leu Ser Val Phe Asp Thr Ser Leu Ser Ala Thr Thr Ile Ser Leu Thr
20 25 30Pro Glu Asp Ser Phe His Gly Asp Ser Gln Asn Ala Glu Arg Ser Tyr
35 40 45Asn Val Gln Ala Gly Asp Val Tyr Ser Leu Thr Gly Asp Val Ser Ile
50 55 60Ser Asn Val Asp Asn Ser Ala Leu Asn Lys Ala Cys Phe Asn Val Thr65 70 75 80Ser Gly Ser Val Thr Phe Ala Gly Asn His His Gly Leu Tyr Phe Asn
85 90 95Asn Ile Ser Ser Gly Thr Thr Lys Glu Gly Ala Val Leu Cys Cys Gln
100 105 110Asp Pro Gln Ala Thr Ala Arg Phe Ser Gly Phe Ser Thr Leu Ser Phe
115 120 125Ile Gln Ser Pro Gly Asp Ile Lys Glu Gln Gly Cys Leu Tyr Ser Lys
130 135 140Asn Ala Leu Met Leu Leu Asn Asn Tyr Val Val Arg Phe Glu Gln Asn145 150 155 160Gln Ser Lys Thr Lys Gly Gly Ala Ile Ser Gly Ala Asn Val Thr Ile
165 170 175Val Gly Asn Tyr Asp Ser Val Ser Phe Tyr Gln Asn Ala Ala Thr Phe
180 185 190Gly Gly Ala Ile His Ser Ser Gly Pro Leu Gln Ile Ala Val Asn Gln
195 200 205Ala Glu Ile Arg Phe Ala Gln Asn Thr Ala Lys Asn Gly Ser Gly Gly
210 215 220Ala Leu Tyr Ser Asp Gly Asp Ile Asp Ile Asp Gln Asn Ala Tyr Val225 230 235 240Leu Phe Arg Glu Asn Glu Ala Leu Thr Thr Ala Ile Gly Lys Gly Gly
245 250 255Ala Val Cys Cys Leu Pro Thr Ser Gly Ser Ser Thr Pro Val Pro Ile
260 265 270Val Thr Phe Ser Asp Asn Lys Gln Leu Val Phe Glu Arg Asn His Ser
275 280 285Ile Met Gly Gly Gly Ala Ile Tyr Ala Arg Lys Leu Ser Ile Ser Ser
290 295 300Gly Gly Pro Thr Leu Phe Ile Asn Asn Ile Ser Tyr Ala Asn Ser Gln305 310 315 320Asn Leu Gly Gly Ala Ile Ala Ile Asp Thr Gly Gly Glu Ile Ser Leu
325 330 335Ser Ala Glu Lys Gly Thr Ile Thr Phe Gln Gly Asn Arg Thr Ser Leu
340 345 350Pro Phe Leu Asn Gly Ile His Leu Leu Gln Asn Ala Lys Phe Leu Lys
355 360 365Leu Gln Ala Arg Asn Gly Cys Ser Ile Glu Phe Tyr Asp Pro Ile Thr
370 375 380Ser Glu Ala Asp Gly Ser Thr Gln Leu Asn Ile Asn Gly Asp Pro Lys385 390 395 400Asn Lys Glu Tyr Thr Gly Thr Ile Leu Phe Ser Gly Glu Lys Ser Leu
405 410 415Ala Asn Asp Pro Arg Asp Phe Lys Ser Thr Ile Pro Gln Asn Val Asn
420 425 430Leu Ser Ala Gly Tyr Leu Val Ile Lys Glu Gly Ala Glu Val Thr Val
435 440 445Ser Lys Phe Thr Gln Ser Pro Gly Ser His Leu Val Leu Asp Leu Gly
450 455 460Thr Lys Leu Ile Ala Ser Lys Glu Asp Ile Ala Ile Thr Gly Leu Ala465 470 475 480Ile Asp Ile Asp Ser Leu Ser Ser Ser Ser Thr Ala Ala Val Ile Lys
485 490 495Ala Asn Thr Ala Asn Lys Gln Ile Ser Val Thr Asp Ser Ile Glu Leu
500 505 510Ile Ser Pro Thr Gly Asn Ala Tyr Glu Asp Leu Arg Met Arg Asn Ser
515 520 525Gln Thr Phe Pro Leu Leu Ser Leu Glu Pro Gly Ala Gly Gly Ser Val
530 535 540Thr Val Thr Ala Gly Asp Phe Leu Pro Val Ser Pro His Tyr Gly Phe545 550 555 560Gln Gly Asn Trp Lys Leu Ala Trp Thr Gly Thr Gly Asn Lys Val Gly
565 570 575Glu Phe Phe Trp Asp Lys Ile Asn Tyr Lys Pro Arg Pro Glu Lys Glu
580 585 590Gly Asn Leu Val Pro Asn Ile Leu Trp Gly Asn Ala Val Asn Val Arg
595 600 605Ser Leu Met Gln Val Gln Glu Thr His Ala Ser Ser Leu Gln Thr Asp
610 615 620Arg Gly Leu Trp Ile Asp Gly Ile Gly Asn Phe Phe His ValSer Ala625 630 635 640Ser Glu Asp Asn Ile Arg Tyr Arg His Asn Ser Gly Gly Tyr Val Leu
645 650 655Ser Val Asn Asn Glu Ile Thr Pro Lys His Tyr Thr Ser Met Ala Phe
660 665 670Ser Gln Leu Phe Ser Arg Asp Lys Asp Tyr Ala Val Ser Asn Asn Glu
675 680 685Tyr Arg Met Tyr Leu Gly Ser Tyr Leu Tyr Gln Tyr Thr Thr Ser Leu
690 695 700Gly Asn Ile Phe Arg Tyr Ala Ser Arg Asn Pro Asn Val Asn Val Gly705 710 715 720Ile Leu Ser Arg Arg Phe Leu Gln Asn Pro Leu Met Ile Phe His Phe
725 730 735Leu Cys Ala Tyr Gly His Ala Thr Asn Asp Met Lys Thr Asp Tyr Ala
740 745 750Asn Phe Pro Met Val Lys Asn Ser Trp Arg Asn Asn Cys Trp Ala Ile
755 760 765Glu Cys Gly Gly Ser Met Pro Leu Leu Val Phe Glu Asn Gly Arg Leu
770 775 780Phe Gln Gly Ala Ile Pro Phe Met Lys Leu Gln Leu Val Tyr Ala Tyr785 790 795 800Gln Gly Asp Phe Lys Glu Thr Thr Ala Asp Gly Arg Arg Phe Ser Asn
805 810 815Gly Ser Leu Thr Ser Ile Ser Val Pro Leu Gly Ile Arg Phe Glu Lys
820 825 830Leu Ala Leu Ser Gln Asp Val Leu Tyr Asp Phe Ser Phe Ser Tyr Ile
835 840 845Pro Asp Ile Phe Arg Lys Asp Pro Ser Cys Glu Ala Ala Leu Val Ile
850 855 860Ser Gly Asp Ser Trp Leu Val Pro Ala Ala His Val Ser Arg His Ala865 870 875 880Phe Val Gly Ser Gly Thr Gly Arg Tyr His Phe Asn Asp Tyr Thr Glu
885 890 895Leu Leu Cys Arg Gly Ser Ile Glu Cys Arg Pro His Ala Arg Asn Tyr
900 905 910Asn Ile Asn Cys Gly Ser Lys Phe Arg Phe
915 920
(2)SEQ ID NO:7的信息:
(i)序列特征:
(A)长度:2526碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:基因组DNA
(xi)序列描述:SEQ ID NO:7:ATGAAGATTC CACTCCGCTT TTTATTGATA TCATTAGTAC CTACGCTTTC TATGTCGAAT 60TTATTAGGAG CTGCTACTAC CGAAGAGCTA TCGGCTAGCA ATAGCTTCGA TGGAACTACA 120TCAACAACAA GCTTTTCTAG TAAAACATCA TCGGCTACAG ATGGCACCAA TTATGTTTTT 180AAAGATTCTG TAGTTATAGA AAATGTACCC AAAACAGGGG AAACTCAGTC TACTAGTTGT 240TTTAAAAATG ACGCTGCAGC TGGAGATCTA AATTTCTTAG GAGGGGGATT TTCTTTCACA 300TTTAGCAATA TCGATGCAAC CACGGCTTCT GGAGCTGCTA TTGGAAGTGA AGCAGCTAAT 360AAGACAGTCA CGTTATCAGG ATTTTCGGCA CTTTCTTTTC TTAAATCCCC AGCAAGTACA 420GTGACTAATG GATTGGGAGC TATCAATGTT AAAGGGAATT TAAGCCTATT GGATAATGAT 480AAGGTATTGA TTCAGGACAA TTTCTCAACA GGAGATGGCG GAGCAATTAA TTGTGCAGGC 540TCCTTGAAGA TCGCAAACAA TAAGTCCCTT TCTTTTATTG GAAATAGTTC TTCAACACGT 600GGCGGAGCGA TTCATACCAA AAACCTCACA CTATCTTCTG GTGGGGAAAC TCTATTTCAG 660GGGAATACAG CGCCTACGGC TGCTGGTAAA GGAGGTGCTA TCGCGATTGC AGACTCTGGC 720ACCCTATCCA TTTCTGGAGA CAGTGGCGAC ATTATCTTTG AAGGCAATAC GATAGGAGCT 780ACAGGAACCG TCTCTCATAG TGCTATTGAT TTAGGAACTA GCGCTAAGAT AACTGCGTTA 840CGTGCTGCGC AAGGACATAC GATATACTTT TATGATCCGA TTACTGTAAC AGGATCGACA 900TCTGTTGCTG ATGCTCTCAA TATTAATAGC CCTGATACTG GAGATAACAA AGAGTATACG 960GGAACCATAG TCTTTTCTGG AGAGAAGCTC ACGGAGGCAG AAGCTAAAGA TGAGAAGAAC 1020CGCACTTCTA AATTACTTCA AAATGTTGCT TTTAAAAATG GGACTGTAGT TTTAAAAGGT 1080GATGTCGTTT TAAGTGCGAA CGGTTTCTCT CAGGATGCAA ACTCTAAGTT GATTATGGAT 1140TTAGGGACGT CGTTGGTTGC AAACACCGAA AGTATCGAGT TAACGAATTT GGAAATTAAT 1200ATAGACTCTC TCAGGAACGG GAAAAAGATA AAACTCAGTG CTGCCACAGC TCAGAAAGAT 1260ATTCGTATAG ATCGTCCTGT TGTACTGGCA ATTAGCGATG AGAGTTTTTA TCAAAATGGC 1320TTTTTGAATG AGGACCATTC CTATGATGGG ATTCTTGAGT TAGATGCTGG GAAAGACATC 1380GTGATTTCTG CAGATTCTCG CAGTATAAAT GCTGTACAAT CTCCGTATGG CTATCAGGGA 1440AAGTGGACAA TCAATTGGTC TACTGATGAT AAGAAAGCTA CGGTTTCTTG GGCAAAGCAA 1500AGTTTTAATC CCACTGCTGA GCAGGAGGCT CCGTTAGTTC CTAATCTTCT TTGGGGTTCT 1560TTTATAGATG TTCGTCCCTT CCAAAATTTT ATAGAGCTAG GTACTGAAGG TGCTCCTTAC 1620GAAAAGAGAT TTTGGGTTGC AGGCATTTCC AATGTTTTGC ATAGGAGCGG TCGTGAAAAT 1680CAAAGGAAAT TCCGTCATGT GAGTGGAGGT GCTGTAGTAG GTGCTAGCAC GAGGATGCCG 1740GGTGGTGATA CCTTGTCTCT GGGTTTTGCT CAGCTCTTTG CGCGTGACAA AGACTACTTT 1800ATGAATACCA ATTTCGCAAA GACCTACGCA GGATCTTTAC GTTTGCAGCA CGATGCTTCC 1860CTATACTCTG TGGTGAGTAT CCTTTTAGGA GAGGGAGGAC TCCGCGAGAT CCTGTTGCCT 1920TATGTTTCCA AGACTCTGCC GTGCTCTTTC TATGGGCAGC TTAGCTACGG CCATACGGAT 1980CATCGCATGA AGACCGAGTC TCTACCCCCC CCCCCCCCGA CGCTCTCGAC GGATCATACT 2040TCTTGGGGAG GATATGTCTG GGCTGGAGAG CTGGGAACTC GAGTTGCTGT TGAAAATACC 2100AGCGGCAGAG GATTTTTCCG AGAGTACACT CCATTTGTAA AAGTCCAAGC TGTTTACTCG 2160CGCCAAGATA GCTTTGTTGA ACTAGGAGCT ATCAGTCGTG ATTTTAGTGA TTCGCATCTT 2220TATAACCTTG CGATTCCTCT TGGAATCAAG TTAGAGAAAC GGTTTGCAGA GCAATATTAT 2280CATGTTGTAG CGATGTATTC TCCAGATGTT TGTCGTAGTA ACCCCAAATG TACGACTACC 2340CTACTTTCCA ACCAAGGGAG TTGGAAGACC AAAGGTTCGA ACTTAGCAAG ACAGGCTGGT 2400ATTGTTCAGG CCTCAGGTTT TCGATGTTTG GGAGCTGCAG CAGAGCTTTT CGGGAACTTT 2460GGCTTTGAAT GGCGGGGATC TTCTCGTAGC TATAATGTAG ATGCGGGTAG CAAAATCAAA 2520TTTTAG 2526
(2)SEQ ID NO:8的信息:
(i)序列特征:
(A)长度:841氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:肽
(xi)序列描述:SEQ ID NO:8:Met Lys Ile Pro Leu Arg Phe Leu Leu Ile Ser Leu Val Pro Thr Leu1 5 10 15Ser Met Ser Asn Leu Leu Gly Ala Ala Thr Thr Glu Glu Leu Ser Ala
20 25 30Ser Asn Ser Phe Asp Gly Thr Thr Ser Thr Thr Ser Phe Ser Ser Lys
35 40 45Thr Ser Ser Ala Thr Asp Gly Thr Asn Tyr Val Phe Lys Asp Ser Val
50 55 60Val Ile Glu Asn Val Pro Lys Thr Gly Glu Thr Gln Ser Thr Ser Cys65 70 75 80Phe Lys Asn Asp Ala Ala Ala Gly Asp Leu Asn Phe Leu Gly Gly Gly
85 90 95Phe Ser Phe Thr Phe Ser Asn Ile Asp Ala Thr Thr Ala Ser Gly Ala
100 105 110Ala Ile Gly Ser Glu Ala Ala Asn Lys Thr Val Thr Leu Ser Gly Phe
115 120 125Ser Ala Leu Ser Phe Leu Lys Ser Pro Ala Ser Thr Val Thr Asn Gly
130 135 140Leu Gly Ala Ile Asn Val Lys Gly Asn Leu Ser Leu Leu Asp Asn Asp145 150 155 160Lys Val Leu Ile Gln Asp Asn Phe Ser Thr Gly Asp Gly Gly Ala Ile
165 170 175Asn Cys Ala Gly Ser Leu Lys Ile Ala Asn Asn Lys Ser Leu Ser Phe
180 185 190Ile Gly Asn Ser Ser Ser Thr Arg Gly Gly Ala Ile His Thr Lys Asn
195 200 205Leu Thr Leu Ser Ser Gly Gly Glu Thr Leu Phe Gln Gly Asn Thr Ala
210 215 220Pro Thr Ala Ala Gly Lys Gly Gly Ala Ile Ala Ile Ala Asp Ser Gly225 230 235 240Thr Leu Ser Ile Ser Gly Asp Ser Gly Asp Ile Ile Phe Glu Gly Asn
245 250 255Thr Ile Gly Ala Thr Gly Thr Val Ser His Ser Ala Ile Asp Leu Gly
260 265 270Thr Ser Ala Lys Ile Thr Ala Leu Arg Ala Ala Gln Gly His Thr Ile
275 280 285Tyr Phe Tyr Asp Pro Ile Thr Val Thr Gly Ser Thr Ser Val Ala Asp
290 295 300Ala Leu Asn Ile Asn Ser Pro Asp Thr Gly Asp Asn Lys Glu Tyr Thr305 310 315 320Gly Thr Ile Val Phe Ser Gly Glu Lys Leu Thr Glu Ala Glu Ala Lys
325 330 335Asp Glu Lys Asn Arg Thr Ser Lys Leu Leu Gln Asn Val Ala Phe Lys
340 345 350Asn Gly Thr Val Val Leu Lys Gly Asp Val Val Leu Ser Ala Asn Gly
355 360 365Phe Ser Gln Asp Ala Asn Ser Lys Leu Ile Met Asp Leu Gly Thr Ser
370 375 380Leu Val Ala Asn Thr Glu Ser Ile Glu Leu Thr Asn Leu Glu Ile Asn385 390 395 400Ile Asp Ser Leu Arg Asn Gly Lys Lys Ile Lys Leu Ser Ala Ala Thr
405 410 415Ala Gln Lys Asp Ile Arg Ile Asp Arg Pro Val Val Leu Ala Ile Ser
420 425 430Asp Glu Ser Phe Tyr Gln Asn Gly Phe Leu Asn Glu Asp His Ser Tyr
435 440 445Asp Gly Ile Leu Glu Leu Asp Ala Gly Lys Asp Ile Val Ile Ser Ala
450 455 460Asp Ser Arg Ser Ile Asn Ala Val Gln Ser Pro Tyr Gly Tyr Gln Gly465 470 475 480Lys Trp Thr Ile Asn Trp Ser Thr Asp Asp Lys Lys Ala Thr Val Ser
485 490 495Trp Ala Lys Gln Ser Phe Asn Pro Thr Ala Glu Gln Glu Ala Pro Leu
500 505 510Val Pro Asn Leu Leu Trp Gly Ser Phe Ile Asp Val Arg Pro Phe Gln
515 520 525Asn Phe Ile Glu Leu Gly Thr Glu Gly Ala Pro Tyr Glu Lys Arg Phe
530 535 540Trp Val Ala Gly Ile Ser Asn Val Leu His Arg Ser Gly Arg Glu Asn545 550 555 560Gln Arg Lys Phe Arg His Val Ser Gly Gly Ala Val Val Gly Ala Ser
565 570 575Thr Arg Met Pro Gly Gly Asp Thr Leu Ser Leu Gly Phe Ala Gln Leu
580 585 590Phe Ala Arg Asp Lys Asp Tyr Phe Met Asn Thr Asn Phe Ala Lys Thr
595 600 605Tyr Ala Gly Ser Leu Arg Leu Gln His Asp Ala Ser Leu Tyr Ser Val
610 615 620Val Ser Ile Leu Leu Gly Glu Gly Gly Leu Arg Glu Ile Leu Leu Pro625 630 635 640Tyr Val Ser Lys Thr Leu Pro Cys Ser Phe Tyr Gly Gln Leu Ser Tyr
645 650 655Gly His Thr Asp His Arg Met Lys Thr Glu Ser Leu Pro Pro Pro Pro
660 665 670Pro Thr Leu Ser Thr Asp His Thr Ser Trp Gly Gly Tyr Val Trp Ala
675 680 685Gly Glu Leu Gly Thr Arg Val Ala Val Glu Asn Thr Ser Gly Arg Gly
690 695 700Phe Phe Arg Glu Tyr Thr Pro Phe Val Lys Val Gln Ala Val Tyr Ser705 710 715 720Arg Gln Asp Ser Phe Val Glu Leu Gly Ala Ile Ser Arg Asp Phe Ser
725 730 735Asp Ser His Leu Tyr Asn Leu Ala Ile Pro Leu Gly Ile Lys Leu Glu
740 745 750Lys Arg Phe Ala Glu Gln Tyr Tyr His Val Val Ala Met Tyr Ser Pro
755 760 765Asp Val Cys Arg Ser Asn Pro Lys Cys Thr Thr Thr Leu Leu Ser Asn
770 775 780Gln Gly Ser Trp Lys Thr Lys Gly Ser Asn Leu Ala Arg Gln Ala Gly785 790 795 800Ile Val Gln Ala Ser Gly Phe Arg Ser Leu Gly Ala Ala Ala Glu Leu
805 810 815Phe Gly Asn Phe Gly Phe Glu Trp Arg Gly Ser Ser Arg Ser Tyr Asn
820 825 830Val Asp Ala Gly Ser Lys Ile Lys Phe
835 840
(2)SEQ ID NO:9的信息:
(i)序列特征:
(A)长度:2787碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:基因组DNA
(xi)序列描述:SEQ ID NO:9:ATGAAGTCTT CTTTCCCCAA GTTTGTATTT TCTACATTTG CTATTTTCCC TTTGTCTATG 60ATTGCTACCG AGACAGTTTT GGATTCAAGT GCGAGTTTCG ATGGGAATAA AAATGGTAAT 120TTTTCAGTTC GTGAGAGTCA GGAAGATGCT GGAACTACCT ACCTATTTAA GGGAAATGTC 180ACTCTAGAAA ATATTCCTGG AACAGGCACA GCAATCACAA AAAGCTGTTT TAACAACACT 240AAGGGCGATT TGACTTTCAC AGGTAACGGG AACTCTCTAT TGTTCCAAAC GGTGGATGCA 300GGGACTGTAG CAGGGGCTGC TGTTAACAGC AGCGTGGTAG ATAAATCTAC CACGTTTATA 360GGGTTTTCTT CGCTATCTTT TATTGCGTCT CCTGGAAGTT CGATAACTAC CGGCAAAGGA 420GCCGTTAGCT GCTCTACGGG TAGCTTGAAG TTTGACAAAA ATGTCAGTTT GCTCTTCAGC 480AAAAACTTTT CAACGGATAA TGGCGGTGCT ATCACCGCAA AAACTCTTTC ATTAACAGGG 540ACTACAATGT CAGCTCTGTT TTCTGAAAAT ACCTCCTCAA AGAAAGGCGG AGCCATTCAG 600ACTTCCGATG CCCTTACCAT TACTGGAAAC CAAGGGGAAG TCTCTTTTTC TGACAATACT 660TCTTCGGATT CTGGAGCTGC AATTTTTACA GAAGCCTCGG TGACTATTTC TAATAATGCT 720AAAGTTTCCT TTATTGACAA TAAGGTCACA GGAGCGAGCT CCTCAACAAC GGGGGATATG 780TCAGGAGGTG CTATCTGTGC TTATAAAACT AGTACAGATA CTAAGGTCAC CCTCACTGGA 840AATCAGATGT TACTCTTCAG CAACAATACA TCGACAACAG CGGGAGGAGC TATCTATGTG 900AAAAAGCTCG AACTGGCTTC CGGAGGACTT ACCCTATTCA GTAGAAATAG TGTCAATGGA 960GGTACAGCTC CTAAAGGTGG AGCCATAGCT ATCGAAGATA GTGGGGAATT GAGTTTATCC 1020GCCGATAGTG GTGACATTGT CTTTTTAGGG AATACAGTCA CTTCTACTAC TCCTGGGACG 1080AATAGAAGTA GTATCGACTT AGGAACGAGT GCAAAGATGA CAGCTTTGCG TTCTGCTGCT 1140GGTAGAGCCA TCTACTTCTA TGATCCCATA ACTACAGGAT CTTCCACAAC AGTTACAGAT 1200GTCTTAAAAG TTAATGAGAC TCCGGCAGAT TCTGCACTAC AATATACAGG GAACATCATC 1260TTCACAGGAG AAAAGTTATC AGAGACAGAG GCCGCAGATT CTAAAAATCT TACTTCGAAG 1320CTACTACAGC CTGTAACTCT TTCAGGAGGT ACTCTATCTT TAAAACATGG AGTGACTCTG 1380CAGACTCAGG CATTCACTCA ACAGGCAGAT TCTCGTCTCG AAATGGACGT AGGAACTACT 1440CTAGAACCTG CTGATACTAG CACCATAAAC AATTTGGTCA TTAACATCAG TTCTATAGAC 1500GGTGCAAAGA AGGCAAAAAT AGAAACCAAA GCTACGTCAA AAAATCTGAC TTTATCTGGA 1560ACCATCACTT TATTGGACCC GACGGGCACG TTTTATGAAA ATCATAGTTT AAGAAATCCT 1620CAGTCCTACG ACATCTTAGA GCTCAAAGCT TCTGGAACTG TAACAAGCAC CGCAGTGACT 1680CCAGATCCTA TAATGGGTGA GAAATTCCAT TACGGCTATC AGGGAACTTG GGGCCCAATT 1740GTTTGGGGGA CAGGGGCTTC TACGACTGCA ACCTTCAACT GGACTAAAAC TGGCTATATT 1800CCTAATCCCG AGCGTATCGG CTCTTTAGTC CCTAATAGCT TATGGAATGC ATTTATAGAT 1860ATTAGCTCTC TCCATTATCT TATGGAGACT GCAAACGAAG GGTTGCAGGG AGACCGTGCT 1920TTTTGGTGTG CTGGATTATC TAACTTCTTC CATAAGGATA GTACAAAAAC ACGACGCGGG 1980TTTCGCCATT TGAGTGGCGG TTATGTCATA GGAGGAAACC TACATACTTG TTCAGATAAG 2040ATTCTTAGTG CTGCATTTTG TCAGCTCTTT GGAAGAGATA GAGACTACTT TGTAGCTAAG 2100AATCAAGGTA CAGTCTACGG AGGAACTCTC TATTACCAGC ACAACGAAAC CTATATCTCT 2160CTTCCTTGCA AACTACGGCC TTGTTCGTTG TCTTATGTTC CTACAGAGAT TCCTGTTCTC 2220TTTTCAGGAA ACCTTAGCTA CACCCATACG GATAACGATC TGAAAACCAA GTATACAACA 2280TATCCTACTG TTAAAGGAAG CTGGGGGAAT GATAGTTTCG CTTTAGAATT CGGTGGAAGA 2340GCTCCGATTT GCTTAGATGA AAGTGCTCTA TTTGAGCAGT ACATGCCCTT CATGAAATTG 2400CAGTTTGTCT ATGCACATCA GGAAGGTTTT AAAGAACAGG GAACAGAAGC TCGTGAATTT 2460GGAAGTAGCC GTCTTGTGAA TCTTGCCTTA CCTATCGGGA TCCGATTTGA TAAGGAATCA 2520GACTGCCAAG ATGCAACGTA CAATCTAACT CTTGGTTATA CTGTGGATCT TGTTCGTAGT 2580AACCCCGACT GTACGACAAC ACTGCGAATT AGCGGTGATT CTTGGAAAAC CTTCGGTACG 2640AATTTGGCAA GACAAGCTTT AGTCCTTCGT GCAGGGAACC ATTTTTGCTT TAACTCAAAT 2700TTTGAAGCCT TTAGCCAATT TTCTTTTGAA TTGCGTGGGT CATCTCGCAA TTACAATGTA 2760GACTTAGGAG CAAAATACCA ATTCTAA 2787
(2)SEQ ID NO:10的信息:
(i)序列特征:
(A)长度:928氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:肽
(xi)序列描述:SEQ ID NO:10:Met Lys Ser Ser Phe Pro Lys Phe Val Phe Ser Thr Phe Ala Ile Phe1 5 10 15Pro Leu Ser Met Ile Ala Thr Glu Thr Val Leu Asp Ser Ser Ala Ser
20 25 30Phe Asp Gly Asn Lys Asn Gly Asn Phe Ser Val Arg Glu Ser Gln Glu
35 40 45Asp Ala Gly Thr Thr Tyr Leu Phe Lys Gly Asn Val Thr Leu Glu Asn
50 55 60Ile Pro Gly Thr Gly Thr Ala Ile Thr Lys Ser Cys Phe Asn Asn Thr65 70 75 80Lys Gly Asp Leu Thr Phe Thr Gly Asn Gly Asn Ser Leu Leu Phe Gln
85 90 95Thr Val Asp Ala Gly Thr Val Ala Gly Ala Ala Val Asn Ser Ser Val
100 105 110Val Asp Lys Ser Thr Thr Phe Ile Gly Phe Ser Ser Leu Ser Phe Ile
115 120 125Ala Ser Pro Gly Ser Ser Ile Thr Thr Gly Lys Gly Ala Val Ser Cys
130 135 140Ser Thr Gly Ser Leu Lys Phe Asp Lys Asn Val Ser Leu Leu Phe Ser145 150 155 160Lys Asn Phe Ser Thr Asp Asn Gly Gly Ala Ile Thr Ala Lys Thr Leu
165 170 175Ser Leu Thr Gly Thr Thr Met Ser Ala Leu Phe Ser Glu Asn Thr Ser
180 185 190Ser Lys Lys Gly Gly Ala Ile Gln Thr Ser Asp Ala Leu Thr Ile Thr
195 200 205Gly Asn Gln Gly Glu Val Ser Phe Ser Asp Asn Thr Ser Ser Asp Ser
210 215 220Gly Ala Ala Ile Phe Thr Glu Ala Ser Val Thr Ile Ser Asn Asn Ala225 230 235 240Lys Val Ser Phe Ile Asp Asn Lys Val Thr Gly Ala Ser Ser Ser Thr
245 250 255Thr Gly Asp Met Ser Gly Gly Ala Ile Cys Ala Tyr Lys Thr Ser Thr
260 265 270Asp Thr Lys Val Thr Leu Thr Gly Asn Gln Met Leu Leu Phe Ser Asn
275 280 285Asn Thr Ser Thr Thr Ala Gly Gly Ala Ile Tyr Val Lys Lys Leu Glu
290 295 300Leu Ala Ser Gly Gly Leu Thr Leu Phe Ser Arg Asn Ser Val Asn Gly305 310 315 320Gly Thr Ala Pro Lys Gly Gly Ala Ile Ala Ile Glu Asp Ser Gly Glu
325 330 335Leu Ser Leu Ser Ala Asp Ser Gly Asp Ile Val Phe Leu Gly Asn Thr
340 345 350Val Thr Ser Thr Thr Pro Gly Thr Asn Arg Ser Ser Ile Asp Leu Gly
355 360 365Thr Ser Ala Lys Met Thr Ala Leu Arg Ser Ala Ala Gly Arg Ala Ile
370 375 380Tyr Phe Tyr Asp Pro Ile Thr Thr Gly Ser Ser Thr Thr Val Thr Asp385 390 395 400Val Leu Lys Val Asn Glu Thr Pro Ala Asp Ser Ala Leu Gln Tyr Thr
405 410 415Gly Asn Ile Ile Phe Thr Gly Glu Lys Leu Ser Glu Thr Glu Ala Ala
420 425 430AspSer Lys Asn Leu Thr Ser Lys Leu Leu Gln Pro Val Thr Leu Ser
435 440 445Gly Gly Thr Leu Ser Leu Lys His Gly Val Thr Leu Gln Thr Gln Ala
450 455 460Phe Thr Gln Gln Ala Asp Ser Arg Leu Glu Met Asp Val Gly Thr Thr465 470 475 480Leu Glu Pro Ala Asp Thr Ser Thr Ile Asn Asn Leu Val Ile Asn Ile
485 490 495Ser Ser Ile Asp Gly Ala Lys Lys Ala Lys Ile Glu Thr Lys Ala Thr
500 505 510Ser Lys Asn Leu Thr Leu Ser Gly Thr Ile Thr Leu Leu Asp Pro Thr
515 520 525Gly Thr Phe Tyr Glu Asn His Ser Leu Arg Asn Pro Gln Ser Tyr Asp
530 535 540Ile Leu Glu Leu Lys Ala Ser Gly Thr Val Thr Ser Thr Ala Val Thr545 550 555 560Pro Asp Pro Ile Met Gly Glu Lys Phe His Tyr Gly Tyr Gln Gly Thr
565 570 575Trp Gly Pro Ile Val Trp Gly Thr Gly Ala Ser Thr Thr Ala Thr Phe
580 585 590Asn Trp Thr Lys Thr Gly Tyr Ile Pro Asn Pro Glu Arg Ile Gly Ser
595 600 605Leu Val Pro Asn Ser Leu Trp Asn Ala Phe Ile Asp Ile Ser Ser Leu
610 615 620His Tyr Leu Met Glu Thr Ala Asn Glu Gly Leu Gln Gly Asp Arg Ala625 630 635 640Phe Trp Cys Ala Gly Leu Ser Asn Phe Phe His Lys Asp Ser Thr Lys
645 650 655Thr Arg Arg Gly Phe Arg His Leu Ser Gly Gly Tyr Val Ile Gly Gly
660 665 670Asn Leu His Thr Cys Ser Asp Lys Ile Leu Ser Ala Ala Phe Cys Gln
675 680 685Leu Phe Gly Arg Asp Arg Asp Tyr Phe Val Ala Lys Asn Gln Gly Thr
690 695 700Val Tyr Gly Gly Thr Leu Tyr Tyr Gln His Asn Glu Thr Tyr Ile Ser705 710 715 720Leu Pro Cys Lys Leu Arg Pro Cys Ser Leu Ser Tyr Val Pro Thr Glu
725 730 735Ile Pro Val Leu Phe Ser Gly Asn Leu Ser Tyr Thr His Thr Asp Asn
740 745 750Asp Leu Lys Thr Lys Tyr Thr Thr Tyr Pro Thr Val Lys Gly Ser Trp
755 760 765Gly Asn Asp Ser Phe Ala Leu Glu Phe Gly Gly Arg Ala Pro Ile Cys
770 775 780Leu Asp Glu Ser Ala Leu Phe Glu Gln Tyr Met Pro Phe Met Lys Leu785 790 795 800Gln Phe Val Tyr Ala His Gln Glu Gly Phe Lys Glu Gln Gly Thr Glu
805 810 815Ala Arg Glu Phe Gly Ser Ser Arg Leu Val Asn Leu Ala Leu Pro Ile
820 825 830Gly Ile Arg Phe Asp Lys Glu Ser Asp Cys Gln Asp Ala Thr Tyr Asn
835 840 845Leu Thr Leu Gly Tyr Thr Val Asp Leu ValArg Ser Asn Pro Asp Cys
850 855 860Thr Thr Thr Leu Arg Ile Ser Gly Asp Ser Trp Lys Thr Phe Gly Thr865 870 875 880Asn Leu Ala Arg Gln Ala Leu Val Leu Arg Ala Gly Asn His Phe Cys
885 890 895Phe Asn Ser Asn Phe Glu Ala Phe Ser Gln Phe Ser Phe Glu Leu Arg
900 905 910Gly Ser Ser Arg Asn Tyr Asn Val Asp Leu Gly Ala Lys Tyr Gln Phe
915 920 925
(2)SEQ ID NO:11的信息:
(i)序列特征:
(A)长度:2757碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:基因组DNA
(xi)序列描述:SEQ ID NO:11:ATGAGATCGT CTTTTTCCTT GTTATTAATA TCTTCATCTC TAGCCTTTCC TCTCTTAATG 60AGTGTTTCTG CAGATGCTGC CGATCTCACA TTAGGGAGTC GTGACAGTTA TAATGGTGAT 120ACAAGCACCA CAGAATTTAC TCCTAAAGCG GCAACTTCTG ATGCTAGTGG CACGACCTAT 180ATTCTCGATG GGGATGTCTC GATAAGCCAA GCAGGGAAAC AAACGAGCTT AACCACAAGT 240TGTTTTTCTA ACACTGCAGG AAATCTTACC TTCTTAGGGA ACGGATTTTC TCTTCATTTT 300GACAATATTA TTTCGTCTAC TGTTGCAGGT GTTGTTGTTA GCAATACAGC AGCTTCTGGG 360ATTACGAAAT TCTCAGGATT TTCAACTCTT CGGATGCTTG CAGCTCCTAG GACCACAGGT 420AAAGGAGCCA TTAAAATTAC CGATGGTCTG GTGTTTGAGA GTATAGGGAA TCTTGACCAA 480AATGAAAATG CCTCTAGTGA AAATGGGGGA GCCATCAATA CGAAGACTTT GTCTTTGACT 540GGGAGTACGC GGTTTGTAGC GTTCCTTGGC AATAGCTCGT CGCAACAAGG GGGAGCGATC 600TATGCTTCTG GTGACTCTGT GATTTCTGAG AATGCAGGAA TCTTGAGCTT CGGAAACAAC 660AGTGCGACAA CATCAGGAGG CGCGATCTCT GCTGAAGGGA ACCTTGTGAT CTCCAATAAC 720CAAAATATCT TTTTCGATGG CTGCAAAGCA ACTACAAATG GCGGAGCTAT TGATTGTAAC 780AAAGCAGGGG CGAACCCAGA CCCTATCTTG ACTCTTTCAG GAAATGAGAG CCTGCATTTT 840CTGAATAACA CAGCAGGAAA TAGTGGAGGT GCGATTTATA CCAAAAAATT GGTGTTATCC 900TCAGGACGAG GAGGAGTGTT ATTTTCTAAC AACAAAGCTG CGAATGCTAC TCCTAAAGGA 960GGGGCAATTG CGATTCTAGA TTCTGGAGAG ATTAGCATTT CTGCAGATCT CGGCAATATC 1020ATTTTCGAGG GCAATACTAC GAGCACTACA GGAAGTCCTG CGAGTGTGAC CAGAAATGCT 1080ATAGATCTTG CATCGAATGC AAAATTTTTA AATCTCCGAG CGACTCGGGG AAATAAAGTT 1140ATTTTCTATG ATCCTATCAC GAGCTCAGGA GCTACTGATA AGCTCTCTTT GAATAAAGCT 1200GACGCAGGAT CTGGAAATAC CTATGAAGGC TACATCGTTT TCTCTGGAGA GAAACTCTCA 1260GAAGAGGAAC TTAAGAAACC TGACAATCTG AAGTCTACAT TTACACAGGC TGTAGAGCTT 1320GCTGCAGGTG CCTTAGTATT GAAAGATGGA GTGACTGTAG TTGCAAATAC TATAACGCAG 1380GTCGAGGGAT CGAAAGTCGT TATGGATGGA GGGACTACTT TTGAGGCAAG CGCTGAGGGG 1440GTCACTCTCA ATGGCCTAGC CATTAATATA GATTCCTTAG ATGGGACAAA TAAAGCTATC 1500ATTAAGGCGA CGGCAGCAAG TAAGGATGTT GCCTTATCAG GGCCTATCAT GCTTGTAGAT 1560GCTCAGGGGA ACTATTATGA GCATCATAAT CTCAGTCAAC AGCAGGTCTT TCCTTTAATA 1620GAGCTTTCTG CACAAGGAAC GATGACTACT ACAGATATCC CCGATACCCC AATTCTAAAT 1680ACTACGAATC ACTATGGGTA TCAAGGAACT GGAATAATTG TTTGGGTCGA CGATGCAACT 1740GCAAAAACAA AAAATGCTAC CTTAACTTGG ACTAAAACAG GATACAAGCC GAATCCAGAA 1800CGTCAGGGAC CTTTGGTTCC TAATAGCCTG TGGGGTTCTT TTGTCGATGT CCGCTCCATT 1860CAGAGCCTCA TGGACCGGAG CACAAGTTCG TTATCTTCGT CAACAAATTT GTGGGTATCA 1920GGAATCGCGG ACTTTTTGCA TGAAGATCAG AAAGGAAACC AACGTAGTTA TCGTCATTCT 1980AGCGCGGGTT ATGCATTAGG AGGAGGATTC TTCACGGCTT CTGAAAATTT CTTTAATTTT 2040GCTTTTTGTC AGCTTTTTGG CTACGACAAG GACCATCTTG TGGCTAAGAA CCATACCCAT 2100GTATATGCAG GGGCAATGAG TTACCGACAC CTCGGAGAGT CTAAGACCCT CGCTAAGATT 2160TTGTCAGGAA ATTCTGACTC CCTACCTTTT GTCTTCAATG CTCGGTTTGC TTATGGCCAT 2220ACCGACAATA ACATGACCAC AAAGTACACT GGCTATTCTC CTGTTAAGGG AAGCTGGGGA 2280AATGATGCCT TCGGTATAGA ATGTGGAGGA GCTATCCCGG TAGTTGCTTC AGGACGTCGG 2340TCTTGGGTGG ATACCCACAC GCCATTTCTA AACCTAGAGA TGATCTATGC ACATCAGAAT 2400GACTTTAAGG AAAACGGCAC AGAAGGCCGT TCTTTCCAAA GTGAAGACCT CTTCAATCTA 2460GCGGTTCCTG TAGGGATAAA ATTTGAGAAA TTCTCCGATA AGTCTACGTA TGATCTCTCC 2520ATAGCTTACG TTCCCGATGT GATTCGTAAT GATCCAGGCT GCACGACAAC TCTTATGGTT 2580TCTGGGGATT CTTGGTCGAC ATGTGGTACA AGCTTGTCTA GACAAGCTCT TCTTGTACGT 2640GCTGGAAATC ATCATGCCTT TGCTTCAAAC TTTGAAGTTT TCAGTCAGTT TGAAGTCGAG 2700TTGCGAGGTT CTTCTCGTAG CTATGCTATC GATCTTGGAG GAAGATTCGG ATTTTAA 2757
(2)SEQ ID NO:12的信息:
(i)序列特征:
(A)长度:918氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:肽
(xi)序列描述:SEQ ID NO:12:Met Arg Ser Ser Phe Ser Leu Leu Leu Ile Ser Ser Ser Leu Ala Phe1 5 10 15Pro Leu Leu Met Ser Val Ser Ala Asp Ala Ala Asp Leu Thr Leu Gly
20 25 30Ser Arg Asp Ser Tyr Asn Gly Asp Thr Ser Thr Thr Glu Phe Thr Pro
35 40 45Lys Ala Ala Thr Ser Asp Ala Ser Gly Thr Thr Tyr Ile Leu Asp Gly
50 55 60Asp Val Ser Ile Ser Gln Ala Gly Lys Gln Thr Ser Leu Thr Thr Ser65 70 75 80Cys Phe Ser Asn Thr Ala Gly Asn Leu Thr Phe Leu Gly Asn Gly Phe
85 90 95Ser Leu His Phe Asp Asn Ile Ile Ser Ser Thr Val Ala Gly Val Val
100 105 110Val Ser Asn Thr Ala Ala Ser Gly Ile Thr Lys Phe Ser Gly Phe Ser
115 120 125Thr Leu Arg Met Leu Ala Ala Pro Arg Thr Thr Gly Lys Gly Ala Ile
130 135 140Lys Ile Thr Asp Gly Leu Val Phe Glu Ser Ile Gly Asn Leu Asp Gln145 150 155 160Asn Glu Asn Ala Ser Ser Glu Asn Gly Gly Ala Ile Asn Thr Lys Thr
165 170 175Leu Ser Leu Thr Gly Ser Thr Arg Phe Val Ala Phe Leu Gly Asn Ser
180 185 190Ser Ser Gln Gln Gly Gly Ala Ile Tyr Ala Ser Gly Asp Ser Val Ile
195 200 205Ser Glu Asn Ala Gly Ile Leu Ser Phe Gly Asn Asn Ser Ala Thr Thr
210 215 220Ser Gly Gly Ala Ile Ser Ala Glu Gly Asn Leu Val Ile Ser Asn Asn225 230 235 240Gln Asn Ile Phe Phe Asp Gly Cys Lys Ala Thr Thr Asn Gly Gly Ala
245 250 255Ile Asp Cys Asn Lys Ala Gly Ala Asn Pro Asp Pro Ile Leu Thr Leu
260 265 270Ser Gly Asn Glu Ser Leu His Phe Leu Asn Asn Thr Ala Gly Asn Ser
275 280 285Gly Gly Ala Ile Tyr Thr Lys Lys Leu Val Leu Ser Ser Gly Arg Gly
290 295 300Gly Val Leu Phe Ser Asn Asn Lys Ala Ala Asn Ala Thr Pro Lys Gly305 310 315 320Gly Ala Ile Ala Ile Leu Asp Ser Gly Glu Ile Ser Ile Ser Ala Asp
325 330 335Leu Gly Asn Ile Ile Phe Glu Gly Asn Thr Thr Ser Thr Thr Gly Ser
340 345 350Pro Ala Ser Val Thr Arg Asn Ala Ile Asp Leu Ala Ser Asn Ala Lys
355 360 365Phe Leu Asn Leu Arg Ala Thr Arg Gly Asn Lys Val Ile Phe Tyr Asp
370 375 380Pro Ile Thr Ser Ser Gly Ala Thr Asp Lys Leu Ser Leu Asn Lys Ala385 390 395 400Asp Ala Gly Ser Gly Asn Thr Tyr Glu Gly Tyr Ile Val Phe Ser Gly
405 410 415Glu Lys Leu Ser Glu Glu Glu Leu Lys Lys Pro Asp Asn Leu Lys Ser
420 425 430Thr Phe Thr Gln Ala Val Glu Leu Ala Ala Gly Ala Leu Val Leu Lys
435 440 445Asp Gly Val Thr Val Val Ala Asn Thr Ile Thr Gln Val Glu Gly Ser
450 455 460Lys Val Val Met Asp Gly Gly Thr Thr Phe Glu Ala Ser Ala Glu Gly465 470 475 480Val Thr Leu Asn Gly Leu Ala Ile Asn Ile Asp Ser Leu Asp Gly Thr
485 490 495Asn Lys Ala Ile Ile Lys Ala Thr Ala Ala Ser Lys Asp Val Ala Leu
500 505 510Ser Gly Pro Ile Met Leu Val Asp Ala Gln Gly Asn Tyr Tyr Glu His
515 520 525His Asn Leu Ser Gln Gln Gln Val Phe Pro Leu Ile Glu Leu Ser Ala
530 535 540Gln Gly Thr Met Thr Thr Thr Asp Ile Pro Asp Thr Pro Ile Leu Asn545 550 555 560Thr Thr Asn His Tyr Gly Tyr Gln Gly Thr Gly Ile Ile Val Trp Val
565 570 575Asp Asp Ala Thr Ala Lys Thr Lys Asn Ala Thr Leu Thr Trp Thr Lys
580 585 590Thr Gly Tyr Lys Pro Asn Pro Glu Arg Gln Gly Pro Leu Val Pro Asn
595 600 605Ser Leu Trp Gly Ser Phe Val Asp Val Arg Ser Ile Gln Ser Leu Met
610 615 620Asp Arg Ser Thr Ser Ser Leu Ser Ser Ser Thr Asn Leu Trp Val Ser625 630 635 640Gly Ile Ala Asp Phe Leu His Glu Asp Gln Lys Gly Asn Gln Arg Ser
645 650 655Tyr Arg His Ser Ser Ala Gly Tyr Ala Leu Gly Gly Gly Phe Phe Thr
660 665 670Ala Ser Glu Asn Phe Phe Asn Phe Ala Phe Cys Gln Leu Phe Gly Tyr
675 680 685Asp Lys Asp His Leu Val Ala Lys Asn His Thr His Val Tyr Ala Gly
690 695 700AIa Met Ser Tyr Arg His Leu Gly Glu Ser Lys Thr Leu Ala Lys Ile705 710 715 720Leu Ser Gly Asn Ser Asp Ser Leu Pro Phe Val Phe Asn Ala Arg Phe
725 730 735Ala Tyr Gly His Thr Asp Asn Asn Met Thr Thr Lys Tyr Thr Gly Tyr
740 745 750Ser Pro Val Lys Gly Ser Trp Gly Asn Asp Ala Phe Gly Ile Glu Cys
755 760 765Gly Gly Ala Ile Pro Val Val Ala Ser Gly Arg Arg Ser Trp Val Asp
770 775 780Thr His Thr Pro Phe Leu Asn Leu Glu Met Ile Tyr Ala His Gln Asn785 790 795 800Asp Phe Lys Glu Asn Gly Thr Glu Gly Arg Ser Phe Gln Ser Glu Asp
805 810 815Leu Phe Asn Leu Ala Val Pro Val Gly Ile Lys Phe Glu Lys Phe Ser
820 825 830Asp Lys Ser Thr Tyr Asp Leu Ser Ile Ala Tyr Val Pro Asp Val Ile
835 840 845Arg Asn Asp Pro Gly Cys Thr Thr Thr Leu Met Val Ser Gly Asp Ser
850 855 860Trp Ser Thr Cys Gly Thr Ser Leu Ser Arg Gln Ala Leu Leu Val Arg865 870 875 880Ala Gly Asn His His Ala Phe Ala Ser Asn Phe Glu Val Phe Ser Gln
885 890 895Phe Glu Val Glu Leu Arg Gly Ser Ser Arg Ser Tyr Ala Ile Asp Leu
900 905 910Gly Gly Arg Phe Gly Phe
915
(2)SEQ ID NO:13的信息:
(i)序列特征:
(A)长度:2787碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:基因组DNA
(xi)序列描述:SEQ ID NO:13:ATGAAATCCT CTCTTCATTG GTTTGTAATC TCGTCATCTT TAGCACTTCC CTTGTCACTA 60AATTTCTCTG CGTTTGCTGC TGTTGTTGAA ATCAATCTAG GACCTACCAA TAGCTTCTCT 120GGACCAGGAA CCTACACTCC TCCAGCCCAA ACAACAAATG CAGATGGAAC TATCTATAAT 180CTAACAGGGG ATGTCTCAAT CACCAATGCA GGATCTCCGA CAGCTCTAAC CGCTTCCTGC 240TTTAAAGAAA CTACTGGGAA TCTTTCTTTC CAAGGCCACG GCTACCAATT TCTCCTACAA 300AATATCGATG CGGGAGCGAA CTGTACCTTT ACCAATACAG CTGCAAATAA GCTTCTCTCC 360TTTTCAGGAT TCTCCTATTT GTCACTAATA CAAACCACGA ATGCTACCAC AGGAACAGGA 420GCCATCAAGT CCACAGGAGC TTGTTCTATT CAGTCGAACT ATAGTTGCTA CTTTGGCCAA 480AACTTTTCTA ATGACAATGG AGGCGCCCTC CAAGGCAGCT CTATCAGTCT ATCGCTAAAC 540CCCAACCTAA CGTTTGCCAA AAACAAAGCA ACGCAAAAAG GGGGTGCCCT CTATTCCACG 600GGAGGGATTA CAATTAACAA TACGTTAAAC TCAGCATCAT TTTCTGAAAA TACCGCGGCG 660AACAATGGCG GAGCCATTTA CACGGAAGCT AGCAGTTTTA TTAGCAGCAA CAAAGCAATT 720AGCTTTATAA ACAATAGTGT GACCGCAACC TCAGCTACAG GGGGAGCCAT TTACTGTAGT 780AGTACATCAG CCCCCAAACC AGTCTTAACT CTATCAGACA ACGGGGAACT GAACTTTATA 840GGAAATACAG CAATTACTAG TGGTGGGGCG ATTTATACTG ACAATCTAGT TCTTTCTTCT 900GGAGGACCTA CGCTTTTTAA AAACAACTCT GCTATAGATA CTGCAGCTCC CTTAGGAGGA 960GCAATTGCGA TTGCTGACTC TGGATCTTTG AGTCTTTCGG CTCTTGGTGG AGACATCACT 1020TTTGAAGGAA ACACAGTAGT CAAAGGAGCT TCTTCGAGTC AGACCACTAC CAGAAATTCT 1080ATTAACATCG GAAACACCAA TGCTAAGATT GTACAGCTGC GAGCCTCTCA AGGCAATACT 1140ATCTACTTCT ATGATCCTAT AACAACTAAC CATACTGCAG CTCTCTCAGA TGCTCTAAAC 1200TTAAATGGTC CTGACCTTGC AGGGAATGCT GCATATCAAG GAACCATCGT ATTTTCTGGA 1260GAGAAGCTCT CGGAAGCAGA AGCTGCAGAA GCTGATAATC TCAAATCTAC AATTCAGCAA 1320CCTCTAACTC TTGCGGGAGG GCAACTCTCT CTTAAATCAG GAGTCACTCT AGTTGCTAAG 1380TCCTTTTCGC AATCTCCGGG CTCTACCCTC CTCATGGATG CAGGGACCAC ATTAGAAACC 1440GCTGATGGGA TCACTATCAA TAATCTTGTT CTCAATGTAG ATTCCTTAAA AGAGACCAAG 1500AAGGCTACGC TAAAAGCAAC ACAAGCAAGT CAGACAGTCA CTTTATCTGG ATCGCTCTCT 1560CTTGTAGATC CTTCTGGAAA TGTCTACGAA GATGTCTCTT GGAATAACCC TCAAGTCTTT 1620TCTTGTCTCA CTCTTACTGC TGACGACCCC GCGAATATTC ACATCACAGA CTTAGCTGCT 1680GATCCCCTAG AAAAAAATCC TATCCATTGG GGATACCAAG GGAATTGGGC ATTATCTTGG 1740CAAGAGGATA CTGCGACTAA ATCCAAAGCA GCGACTCTTA CCTGGACAAA AACAGGATAC 1800AATCCGAATC CTGAGCGTCG TGGAACCTTA GTTGCTAACA CGCTATGGGG ATCCTTTGTT 1860GATGTGCGCT CCATACAACA GCTTGTAGCC ACTAAAGTAC GCCAATCTCA AGAAACTCGC 1920GGCATCTGGT GTGAAGGGAT CTCGAACTTC TTCCATAAAG ATAGCACGAA GATAAATAAA 1980GGTTTTCGCC ACATAAGTGC AGGTTATGTT GTAGGAGCGA CTACAACATT AGCTTCTGAT 2040AATCTTATCA CTGCAGCCTT CTGCCAATTA TTCGGGAAAG ATAGAGATCA CTTTATAAAT 2100AAAAATAGAG CTTCTGCCTA TGCAGCTTCT CTCCATCTCC AGCATCTAGC GACCTTGTCT 2160TCTCCAAGCT TGTTACGCTA CCTTCCTGGA TCTGAAAGTG AGCAGCCTGT CCTCTTTGAT 2220GCTCAGATCA GCTATATCTA TAGTAAAAAT ACTATGAAAA CCTATTACAC CCAAGCACCA 2280AAGGGAGAGA GCTCGTGGTA TAATGACGGT TGCGCTCTGG AACTTGCGAG CTCCCTACCA 2340CACACTGCTT TAAGCCATGA GGGTCTCTTC CACGCGTATT TTCCTTTCAT CAAAGTAGAA 2400GCTTCGTACA TACACCAAGA TAGCTTCAAA GAACGTAATA CTACCTTGGT ACGATCTTTC 2460GATAGCGGTG ATTTAATTAA CGTCTCTGTG CCTATTGGAA TTACCTTCGA GAGATTCTCG 2520AGAAACGAGC GTGCGTCTTA CGAAGCTACT GTCATCTACG TTGCCGATGT CTATCGTAAG 2580AATCCTGACT GCACGACAGC TCTCCTAATC AACAATACCT CGTGGAAAAC TACAGGAACG 2640AATCTCTCAA GACAAGCTGG TATCGGAAGA GCAGGGATCT TTTATGCCTT CTCTCCAAAT 2700CTTGAGGTCA CAAGTAACCT ATCTATGGAA ATTCGTGGAT CTTCACGCAG CTACAATGCA 2760GATCTTGGAG GTAAGTTCCA GTTCTAA 2787
(2)SEQ ID NO:14的信息:
(i)序列特征:
(A)长度:928氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:肽
(xi)序列描述:SEQ ID NO:14:Met Lys Ser Ser Leu His Trp Phe Val Ile Ser Ser Ser Leu Ala Leu1 5 10 15Pro Leu Ser Leu Asn Phe Ser Ala Phe Ala Ala ValVal Glu Ile Asn
20 25 30Leu Gly Pro Thr Asn Ser Phe Ser Gly Pro Gly Thr Tyr Thr Pro Pro
35 40 45Ala Gln Thr Thr Asn Ala Asp Gly Thr Ile Tyr Asn Leu Thr Gly Asp
50 55 60Val Ser Ile Thr Asn Ala Gly Ser Pro Thr Ala Leu Thr Ala Ser Cys65 70 75 80Phe Lys Glu Thr Thr Gly Asn Leu Ser Phe Gln Gly His Gly Tyr Gln
85 90 95Phe Leu Leu Gln Asn Ile Asp Ala Gly Ala Asn Cys Thr Phe Thr Asn
100 105 110Thr Ala Ala Asn Lys Leu Leu Ser Phe Ser Gly Phe Ser Tyr Leu Ser
115 120 125Leu Ile Gln Thr Thr Asn Ala Thr Thr Gly Thr Gly Ala Ile Lys Ser
130 135 140Thr Gly Ala Cys Ser Ile Gln Ser Asn Tyr Ser Cys Tyr Phe Gly Gln145 150 155 160Asn Phe Ser Asn Asp Asn Gly Gly Ala Leu Gln Gly Ser Ser Ile Ser
165 170 175Leu Ser Leu Asn Pro Asn Leu Thr Phe Ala Lys Asn Lys Ala Thr Gln
180 185 190Lys Gly Gly Ala Leu Tyr Ser Thr Gly Gly lle Thr Ile Asn Asn Thr
195 200 205Leu Asn Ser Ala Ser Phe Ser Glu Asn Thr Ala Ala Asn Asn Gly Gly
210 215 220Ala Ile Tyr Thr Glu Ala Ser Ser Phe Ile Ser Ser Asn Lys Ala Ile225 230 235 240Ser Phe Ile Asn Asn Ser Val Thr Ala Thr Ser Ala Thr Gly Gly Ala
245 250 255lle Tyr Cys Ser Ser Thr Ser Ala Pro Lys Pro Val Leu Thr Leu Ser
260 265 270Asp Asn Gly Glu Leu Asn Phe Ile Gly Asn Thr Ala Ile Thr Ser Gly
275 280 285Gly Ala Ile Tyr Thr Asp Asn Leu Val Leu Ser Ser Gly Gly Pro Thr
290 295 300Leu Phe Lys Asn Asn Ser Ala Ile Asp Thr Ala Ala Pro Leu Gly Gly305 310 315 320Ala Ile Ala Ile Ala Asp Ser Gly Ser Leu Ser Leu Ser Ala Leu Gly
325 330 335Gly Asp Ile Thr Phe Glu Gly Asn Thr Val Val Lys Gly Ala Ser Ser
340 345 350Ser Gln Thr Thr Thr Arg Asn Ser Ile Asn Ile Gly Asn Thr Asn Ala
355 360 365Lys Ile Val Gln Leu Arg Ala Ser Gln Gly Asn Thr Ile Tyr Phe Tyr
370 375 380Asp Pro Ile Thr Thr Asn His Thr Ala Ala Leu Ser Asp Ala Leu Asn385 390 395 400Leu Asn Gly Pro Asp Leu Ala Gly Asn Pro Ala Tyr Gln Gly Thr Ile
405 410 415Val Phe Ser Gly Glu Lys Leu Ser Glu Ala Glu Ala Ala Glu Ala Asp
420 425 430Asn Leu Lys Ser Thr Ile Gln Gln Pro Leu Thr Leu Ala Gly Gly Gln
435 440 445Leu Ser Leu Lys Ser Gly Val Thr Leu Val Ala Lys Ser Phe Ser Gln
450 455 460Ser Pro Gly Ser Thr Leu Leu Met Asp Ala Gly Thr Thr Leu Glu Thr465 470 475 480Ala Asp Gly Ile Thr Ile Asn Asn Leu Val Leu Asn Val Asp Ser Leu
485 490 495Lys Glu Thr Lys Lys Ala Thr Leu Lys Ala Thr Gln Ala Ser Gln Thr
500 505 510Val Thr Leu Ser Gly Ser Leu Ser Leu Val Asp Pro Ser Gly Asn Val
515 520 525Tyr Glu Asp Val Ser Trp Asn Asn Pro Gln Val Phe Ser Cys Leu Thr
530 535 540Leu Thr Ala Asp Asp Pro Ala Asn Ile His Ile Thr Asp Leu Ala Ala545 550 555 560Asp Pro Leu Glu Lys Asn Pro Ile His Trp Gly Tyr Gln Gly Asn Trp
565 570 575Ala Leu Ser Trp Gln Glu Asp Thr Ala Thr Lys Ser Lys Ala Ala Thr
580 585 590Leu Thr Trp Thr Lys Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg Gly
595 600 605Thr Leu Val Ala Asn Thr Leu Trp Gly Ser Phe Val Asp Val Arg Ser
610 615 620Ile Gln Gln Leu Val Ala Thr Lys Val Arg Gln Ser Gln Glu Thr Arg625 630 635 640Gly Ile Trp Cys Glu Gly Ile Ser Asn Phe Phe His Lys Asp Ser Thr
645 650 655Lys Ile Asn Lys Gly Phe Arg His Ile Ser Ala Gly Tyr Val Val Gly
660 665 670Ala Thr Thr Thr Leu Ala Ser Asp Asn Leu Ile Thr Ala Ala Phe Cys
675 680 685Gln Leu Phe Gly Lys Asp Arg Asp His Phe Ile Asn Lys Asn Arg Ala
690 695 700Ser Ala Tyr Ala Ala Ser Leu His Leu Gln His Leu Ala Thr Leu Ser705 710 715 720Ser Pro Ser Leu Leu Arg Tyr Leu Pro Gly Ser Glu Ser Glu Gln Pro
725 730 735Val Leu Phe Asp Ala Gln Ile Ser Tyr Ile Tyr Ser Lys Asn Thr Met
740 745 750Lys Thr Tyr Tyr Thr Gln Ala Pro Lys Gly Glu Ser Ser Trp Tyr Asn
755 760 765Asp Gly Cys Ala Leu Glu Leu Ala Ser Ser Leu Pro His Thr Ala Leu
770 775 780Ser His Glu Gly Leu Phe His Ala Tyr Phe Pro Phe Ile Lys Val Glu785 790 795 800Ala Ser Tyr Ile His Gln Asp Ser Phe Lys Glu Arg Asn Thr Thr Leu
805 810 815Val Arg Ser Phe Asp Ser Gly Asp Leu Ile Asn Val Ser Val Pro Ile
820 825 830Gly Ile Thr Phe Glu Arg Phe Ser Arg Asn Glu Arg Ala Ser Tyr Glu
835 840 845Ala Thr Val Ile Tyr Val Ala Asp Val Tyr Arg Lys Asn Pro Asp Cys
850 855 860Thr Thr Ala Leu Leu Ile Asn Asn Thr Ser Trp Lys Thr Thr Gly Thr865 870 875 880Asn Leu Ser Arg Gln Ala Gly Ile Gly Arg Ala Gly Ile Phe Tyr Ala
885 890 895Phe Ser Pro Asn Leu Glu Val Thr Ser Asn Leu Ser Met Glu Ile Arg
900 905 910Gly Ser Ser Arg Ser Tyr Asn Ala Asp Leu Gly Gly Lys Phe Gln Phe
915 920 925
(2)SEQ ID NO:15的信息:
(i)序列特征:
(A)长度:2793碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:基因组DNA
(xi)序列描述:SEQ ID NO:15:ATGAAAATAC CCTTGCACAA ACTCCTGATC TCTTCGACTC TTGTCACTCC CATTCTATTG 60AGCATTGCAA CTTACGGAGC AGATGCTTCT TTATCCCCTA CAGATAGCTT TGATGGAGCG 120GGCGGCTCTA CATTTACTCC AAAATCTACA GCAGATGCCA ATGGAACGAA CTATGTCTTA 180TCAGGAAATG TCTATATAAA CGATGCTGGG AAAGGCACAG CATTAACAGG CTGCTGCTTT 240ACAGAAACTA CGGGTGATCT GACATTTACT GGAAAGGGAT ACTCATTTTC ATTCAACACG 300GTAGATGCGG GTTCGAATGC AGGAGCTGCG GCAAGCACAA CTGCTGATAA AGCCCTAACA 360TTCACAGGAT TTTCTAACCT TTCCTTCATT GCAGCTCCTG GAACTACAGT TGCTTCAGGA 420AAAAGTACTT TAAGTTCTGC AGGAGCCTTA AATCTTACCG ATAATGGAAC GATTCTCTTT 480AGCCAAAACG TCTCCAATGA AGCTAATAAC AATGGCGGAG CGATCACCAC AAAAACTCTT 540TCTATTTCTG GGAATACCTC TTCTATAACC TTCACTAGTA ATAGCGCAAA AAAATTAGGT 600GGAGCGATCT ATAGCTCTGC GGCTGCAAGT ATTTCAGGAA ACACCGGCCA GTTAGTCTTT 660ATGAATAATA AAGGAGAAAC TGGGGGCGGG GCTCTGGGCT TTGAAGCCAG CTCCTCGATT 720ACTCAAAATA GCTCCCTTTT CTTCTCTGGA AACACTGCAA CAGATGCTGC AGGCAAGGGC 780GGGGCCATTT ATTGTGAAAA AACAGGAGAG ACTCCTACTC TTACTATCTC TGGAAATAAA 840AGTCTGACCT TCGCCGAGAA CTCTTCAGTA ACTCAAGGCG GAGCAATCTG TGCCCATGGT 900CTAGATCTTT CCGCTGCTGG CCCTACCCTA TTTTCAAATA ATAGATGCGG GAACACAGCT 960GCAGGCAAGG GCGGCGCTAT TGCAATTGCC GACTCTGGAT CTTTAAGTCT CTCTGCAAAT 1020CAAGGAGACA TCACGTTCCT TGGCAACACT CTAACCTCAA CCTCCGCGCC AACATCGACA 1080CGGAATGCTA TCTACCTGGG ATCGTCAGCA AAAATTACGA ACTTAAGGGC AGCCCAAGGC 1140CAATCTATCT ATTTCTATGA TCCGATTGCA TCTAACACCA CAGGAGCTTC AGACGTTCTG 1200ACCATCAACC AACCGGATAG CAACTCGCCT TTAGATTATT CAGGAACGAT TGTATTTTCT 1260GGGGAAAAGC TCTCTGCAGA TGAAGCGAAA GCTGCTGATA ACTTCACATC TATATTAAAG 1320CAACCATTGG CTCTAGCCTC TGGAACCTTA GCACTCAAAG GAAATGTCGA GTTAGATGTC 1380AATGGTTTCA CACAGACTGA AGGCTCTACA CTCCTCATGC AACCAGGAAC AAAGCTCAAA 1440GCAGATACTG AAGCTATCAG TCTTACCAAA CTTGTCGTTG ATCTTTCTGC CTTAGAGGGA 1500AATAAGAGTG TGTCCATTGA AACAGCAGGA GCCAACAAAA CTATAACTCT AACCTCTCCT 1560CTTGTTTTCC AAGATAGTAG CGGCAATTTT TATGAAAGCC ATACGATAAA CCAAGCCTTC 1620ACGCAGCCTT TGGTGGTATT CACTGCTGCT ACTGCTGCTA GCGATATTTA TATCGATGCG 1680CTTCTCACTT CTCCAGTACA AACTCCAGAA CCTCATTACG GGTATCAGGG ACATTGGGAA 1740GCCACTTGGG CAGACACATC AACTGCAAAA TCAGGAACTA TGACTTGGGT AACTACGGGC 1800TACAACCCTA ATCCTGAGCG TAGAGCTTCC GTAGTTCCCG ATTCATTATG GGCATCCTTT 1860ACTGACATTC GCACTCTACA GCAGATCATG ACATCTCAAG CGAATAGTAT CTATCAGCAA 1920CGAGGACTCT GGGCATCAGG AACTGCGAAT TTCTTCCATA AGGATAAATC AGGAACTAAC 1980CAAGCATTCC GACATAAAAG CTACGGCTAT ATTGTTGGAG GAAGTGCTGA AGATTTTTCT 2040GAAAATATCT TCAGTGTAGC TTTCTGCCAG CTCTTCGGTA AAGATAAAGA CCTGTTTATA 2100GTTGAAAATA CCTCTCATAA CTATTTAGCG TCGCTATACC TGCAACATCG AGCATTCCTA 2160GGAGGACTTC CCATGCCCTC ATTTGGAAGT ATCACCGACA TGCTGAAAGA TATTCCTCTC 2220ATTTTGAATG CCCAGCTAAG CTACAGCTAC ACTAAAAATG ATATGGATAC TCGCTATACT 2280TCCTATCCTG AAGCTCAAGG TTCTTGGACC AATAATTCTG GGGCTCTAGA GCTCGGAGGA 2340TCTCTGGCTC TATATCTCCC TAAAGAAGCA CCGTTCTTCC AGGGATATTT CCCCTTCTTA 2400AAGTTCCAGG CAGTCTACAG CCGCCAACAA AACTTTAAAG AGAGTGGCGC TGAAGCCCGT 2460GCTTTTGATG ATGGAGACCT AGTGAACTGC TCTATCCCTG TCGGCATTCG GTTAGAAAAA 2520ATCTCCGAAG ATGAAAAAAA TAATTTCGAG ATTTCTCTAG CCAACATTGG TGATGTGTAT 2580CGTAAAAATC CCCGTTCGCG TACTTCTCTA ATGGTCAGTG GAGCCTCTTG GACTTCGCTA 2640TGTAAAAACC TCGCACGACA AGCCTTCTTA GCAAGTGCTG GAAGCCATCT GACTCTCTCC 2700CCTCATGTAG AACTCTCTGG GGAAGCTGCT TATGAGCTTC GTGGCTCAGC ACACATCTAC 2760AATGTAGATT GTGGGCTAAG ATACTCATTC TAG 2793
(2)SEQ ID NO:16的信息:
(i)序列特征:
(A)长度:930氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:肽
(xi)序列描述:SEQ ID NO:16:Met Lys Ile Pro Leu His Lys Leu Leu Ile Ser Ser Thr Leu Val Thr1 5 10 15Pro Ile Leu Leu Ser Ile Ala Thr Tyr Gly Ala Asp Ala Ser Leu Ser
20 25 30Pro Thr Asp Ser Phe Asp Gly Ala Gly Gly Ser Thr Phe Thr Pro Lys
35 40 45Ser Thr Ala Asp Ala Asn Gly Thr Asn Tyr Val Leu Ser Gly Asn Val
50 55 60Tyr Ile Asn Asp Ala Gly Lys Gly Thr Ala Leu Thr Gly Cys Cys Phe65 70 75 80Thr Glu Thr Thr Gly Asp Leu Thr Phe Thr Gly Lys Gly Tyr Ser Phe
85 90 95Ser Phe Asn Thr Val Asp Ala Gly Ser Asn Ala Gly Ala Ala Ala Ser
100 105 110Thr Thr Ala Asp Lys Ala Leu Thr Phe Thr Gly Phe Ser Asn Leu Ser
115 120 125Phe Ile Ala Ala Pro Gly Thr Thr Val Ala Ser Gly Lys Ser Thr Leu
130 135 140Ser Ser Ala Gly Ala Leu Asn Leu Thr Asp Asn Gly Thr Ile Leu Phe145 150 155 160Ser Gln Asn Val Ser Asn Glu Ala Asn Asn Asn Gly Gly Ala Ile Thr
165 170 175Thr Lys Thr Leu Ser Ile Ser Gly Asn Thr Ser Ser Ile Thr Phe Thr
180 185 190Ser Asn Ser Ala Lys Lys Leu Gly Gly Ala Ile Tyr Ser Ser Ala Ala
195 200 205Ala Ser Ile Ser Gly Asn Thr Gly Gln Leu Val Phe Met Asn Asn Lys
210 215 220Gly Glu Thr Gly Gly Gly Ala Leu Gly Phe Glu Ala Ser Ser Ser Ile225 230 235 240Thr Gln Asn Ser Ser Leu Phe Phe Ser Gly Asn Thr Ala Thr Asp Ala
245 250 255Ala Gly Lys Gly Gly Ala Ile Tyr Cys Glu Lys Thr Gly Glu Thr Pro
260 265 270Thr Leu Thr Ile Ser Gly Asn Lys Ser Leu Thr Phe Ala Glu Asn Ser
275 280 285Ser Val Thr Gln Gly Gly Ala Ile Cys Ala His Gly Leu Asp Leu Ser
290 295 300Ala Ala Gly Pro Thr Leu Phe Ser Asn Asn Arg Cys Gly Asn Thr Ala305 310 315 320Ala Gly Lys Gly Gly Ala Ile Ala Ile Ala Asp Ser Gly Ser Leu Ser
325 330 335Leu Ser Ala Asn Gln Gly Asp Ile Thr Phe Leu Gly Asn Thr Leu Thr
340 345 350Ser Thr Ser Ala Pro Thr Ser Thr Arg Asn Ala Ile Tyr Leu Gly Ser
355 360 365Ser Ala Lys Ile Thr Asn Leu Arg Ala Ala Gln Gly Gln Ser Ile Tyr
370 375 380Phe Tyr Asp Pro Ile Ala Ser Asn Thr Thr Gly Ala Ser Asp Val Leu385 390 395 400Thr Ile Asn Gln Pro Asp Ser Asn Ser Pro Leu Asp Tyr Ser Gly Thr
405 410 415Ile Val Phe Ser Gly Glu Lys Leu Ser Ala Asp Glu Ala Lys Ala Ala
420 425 430Asp Asn Phe Thr Ser Ile Leu Lys Gln Pro Leu Ala Leu Ala Ser Gly
435 440 445Thr Leu Ala Leu Lys Gly Asn Val Glu Leu Asp Val Asn Gly Phe Thr
450 455 460Gln Thr Glu Gly Ser Thr Leu Leu Met Gln Pro Gly Thr Lys Leu Lys465 470 475 480Ala Asp Thr Glu Ala Ile Ser Leu Thr Lys Leu Val Val Asp Leu Ser
485 490 495Ala Leu Glu Gly Asn Lys Ser Val Ser Ile Glu Thr Ala Gly Ala Asn
500 505 510Lys Thr Ile Thr Leu Thr Ser Pro Leu Val Phe Gln Asp Ser Ser Gly
515 520 525Asn Phe Tyr Glu Ser His Thr Ile Asn Gln Ala Phe Thr Gln Pro Leu
530 535 540Val Val Phe Thr Ala Ala Thr Ala Ala Ser Asp Ile Tyr Ile Asp Ala545 550 555 560Leu Leu Thr Ser Pro Val Gln Thr Pro Glu Pro His Tyr Gly Tyr Gln
565 570 575Gly His Trp Glu Ala Thr Trp Ala Asp Thr Ser Thr Ala Lys Ser Gly
580 585 590Thr Met Thr Trp Val Thr Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg
595 600 605Ala Ser Val Val Pro Asp Ser Leu Trp Ala Ser Phe Thr Asp Ile Arg
610 615 620Thr Leu Gln Gln Ile Met Thr Ser Gln Ala Asn Ser Ile Tyr Gln Gln625 630 635 640Arg Gly Leu Trp Ala Ser Gly Thr Ala Asn Phe Phe His Lys Asp Lys
645 650 655Ser Gly Thr Asn Gln Ala Phe Arg His Lys Ser Tyr Gly Tyr Ile Val
660 665 670Gly Gly Ser Ala Glu Asp Phe Ser Glu Asn Ile Phe Ser Val Ala Phe
675 680 685Cys Gln Leu Phe Gly Lys Asp Lys Asp Leu Phe Ile Val Glu Asn Thr
690 695 700Ser His Asn Tyr Leu Ala Ser Leu Tyr Leu Gln His Arg Ala Phe Leu705 710 715 720Gly Gly Leu Pro Met Pro Ser Phe Gly Ser Ile Thr Asp Met Leu Lys
725 730 735Asp Ile Pro Leu Ile Leu Asn Ala Gln Leu Ser Tyr Ser Tyr Thr Lys
740 745 750Asn Asp Met Asp Thr Arg Tyr Thr Ser Tyr Pro Glu Ala Gln Gly Ser
755 760 765Trp Thr Asn Asn Ser Gly Ala Leu Glu Leu Gly Gly Ser Leu Ala Leu
770 775 780Tyr Leu Pro Lys Glu Ala Pro Phe Phe Gln Gly Tyr Phe Pro Phe Leu785 790 795 800Lys Phe Gln Ala Val Tyr Ser Arg Gln Gln Asn Phe Lys Glu Ser Gly
805 810 815Ala Glu Ala Arg Ala Phe Asp Asp Gly Asp Leu Val Asn Cys Ser Ile
820 825 830Pro Val Gly Ile Arg Leu Glu Lys Ile Ser Glu Asp Glu Lys Asn Asn
835 840 845Phe Glu Ile Ser Leu Ala Asn Ile Gly Asp Val Tyr Arg Lys Asn Pro
850 855 860Arg Ser Arg Thr Ser Leu Met Val Ser Gly Ala Ser Trp Thr Ser Leu865 870 875 880Cys Lys Asn Leu Ala Arg Gln Ala Phe Leu Ala Ser Ala Gly Ser His
885 890 895Leu Thr Leu Ser Pro His Val Glu Leu Ser Gly Glu Ala Ala Tyr Glu
900 905 910Leu Arg Gly Ser Ala His Ile Tyr Asn Val Asp Cys Gly Leu Arg Tyr
915 920 925Ser Phe
930
(2)SEQ ID NO:17的信息:
(i)序列特征:
(A)长度:840碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:基因组DNA
(xi)序列描述:SEQ ID NO:17:GAAGACAATA TAAGGTACCG TCATAACAGC GGGGGTTATG CACTAGGGAT CACAGCAACA 60ACTCCTGCCG AGGATCAGCT TACTTTTGCC TTCTGCCAGC TCTTTGCTAG AGATCGCAAT 120CATATTACAG GTAAGAACCA CGGAGATACT TACGGTGCCT CTTTGTATTT CCACCATACA 180GAAGGGCTCT TCGACATCGC CAATTTCCTC TGGGGAAAAG CAACCCGAGC TCCCTGGGTG 240CTCTCTGAGA TCTCCCAGAT CATTCCTTTA TCGTTCGATG CTAAATTCAG TTATCTCCAT 300ACAGACAACC ACATGAAGAC ATATTATACC GATAACTCTA TCATCAAGGG TTCTTGGAGA 360AACGATGCCT TCTGTGCAGA TCTTGGAGCT AGCCTGCCTT TTGTTATTTC CGTTCCGTAT 420CTTCTGAAAG AAGTCGAACC TTTTGTCAAA GTACAGTATA TCTATGCGCA TCAGCAAGAC 480TTCTACGAGC GTCATGCTGA AGGACGCGCT TTCAATAAAA GCGAGCTTAT CAACGTAGAG 540ATTCCTATAG GCGTCACCTT CGAAAGAGAC TCAAAATCAG AAAAGGGAAC TTACGATCTT 600ACTCTTATGT ATATACTCGA TGCTTACCGA CGCAATCCTA AATGTCAAAC TTCCCTAATA 660GCTAGCGATG CTAACTGGAT GGCCTATGGT ACCAACCTCG CACGACAAGG TTTTTCTGTT 720CGTGCTGCGA ACCATTTCCA AGTGAACCCC CACATGGAAA TCTTCGGTCA ATTCGCTTTT 780GAAGTACGAA GTTCTTCACG AAATTATAAT ACAAACCTAG GCTCTAAGTT TTGTTTCTAG 840
(2)SEQ ID NO:18的信息:
(i)序列特征:
(A)长度:279氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:肽
(xi)序列描述:SEQ ID NO:18:Glu Asp Asn Ile Arg Tyr Arg His Asn Ser Gly Gly Tyr Ala Leu Gly1 5 10 15Ile Thr Ala Thr Thr Pro Ala Glu Asp Gln Leu Thr Phe Ala Phe Cys
20 25 30Gln Leu Phe Ala Arg Asp Arg Asn His Ile Thr Gly Lys Asn His Gly
35 40 45Asp Thr Tyr Gly Ala Ser Leu Tyr Phe His His Thr Glu Gly Leu Phe
50 55 60Asp Ile Ala Asn Phe Leu Trp Gly Lys Ala Thr Arg Ala Pro Trp Val65 70 75 80Leu Ser Glu Ile Ser Gln Ile Ile Pro Leu Ser Phe Asp Ala Lys Phe
85 90 95Ser Tyr Leu His Thr Asp Asn His Met Lys Thr Tyr Tyr Thr Asp Asn
100 105 110Ser Ile Ile Lys Gly Ser Trp Arg Asn Asp Ala Phe Cys Ala Asp Leu
115 120 125Gly Ala Ser Leu Pro Phe Val Ile Ser Val Pro Tyr Leu Leu Lys Glu
130 135 140Val Glu Pro Phe Val Lys Val Gln Tyr Ile Tyr Ala His Gln Gln Asp145 150 155 160Phe Tyr Glu Arg His Ala Glu Gly Arg Ala Phe Asn Lys Ser Glu Leu
165 170 175lle Asn Val Glu Ile Pro Ile Gly Val Thr Phe Glu Arg Asp Ser Lys
180 185 190Ser Glu Lys Gly Thr Tyr Asp Leu Thr Leu Met Tyr Ile Leu Asp Ala
195 200 205Tyr Arg Arg Asn Pro Lys Cys Gln Thr Ser Leu Ile Ala Ser Asp Ala
210 215 220Asn Trp Met Ala Tyr Gly Thr Asn Leu Ala Arg Gln Gly Phe Ser Val225 230 235 240Arg Ala Ala Asn His Phe Gln Val Asn Pro His Met Glu Ile Phe Gly
245 250 255Gln Phe Ala Phe Glu Val Arg Ser Ser Ser Arg Asn Tyr Asn Thr Asn
260 265 270Leu Gly Ser Lys Phe Cys Phe
275
(2)SEQ ID NO:19的信息:
(i)序列特征:
(A)长度:1545碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:基因组DNA
(xi)序列描述:SEQ ID NO:19:ATGACCATAC TTCGAAATTT TCTTACCTGC TCGGCTTTAT TCCTCGCTCT CCCTGCAGCA 60GCACAAGTTG TATATCTTCA TGAAAGTGAT GGTTATAACG GTGCTATCAA TAATAAAAGC 120TTAGAACCTA AAATTACCTG TTATCCAGAA GGAACTTCTT ACATCTTTCT AGATGACGTG 180AGGATTTCCA ACGTTAAGCA TGATCAAGAA GATGCTGGGG TTTTTATAAA TCGATCTGGG 240AATCTTTTTT TCATGGGCAA CCGTTGCAAC TTCACTTTTC ACAACCTTAT GACCGAGGGT 300TTTGGCGCTG CCATTTCGAA CCGCGTTGGA GACACCACTC TCACTCTCTC TAATTTTTCT 360TACTTAACGT TCACCTCAGC ACCTCTACTA CCTCAAGGAC AAGGAGCGAT TTATAGTCTT 420GGTTCCGTGA TGATCGAAAA TAGTGAGGAA GTGACTTTCT GTGGGAACTA CTCTTCGTGG 480AGTGGAGCTG CGATTTATAC TCCCTACCTT TTAGGTTCTA AGGCGAGTCG TCCTTCAGTA 540AATCTCAGCG GGAACCGCTA CCTGGTGTTT AGAGACTATG TGAGCCAAGG TTATGGCGGC 600GCCGTATCTA CCCACAATCT CACACTCACG ACTCGAGGAC CTTCGTGTTT TGAAAATAAT 660CATGCTTATC ATGACGTGAA TAGTAATGGA GGAGCCATTG CCATTGCTCC TGGAGGATCG 720ATCTCTATAT CCGTGAAAAG CGGAGATCTC ATCTTCAAAG GAAATACAGC ATCACAAGAC 780GGAAATACAA TACACAACTC CATCCATCTG CAATCTGGAG CACAGTTTAA GAACCTACGT 840GCTGTTTCAG AATCCGGAGT TTATTTCTAT GATCCTATAA GCCATAGCGA GTCGCATAAA 900ATTACAGATC TTGTAATCAA TGCTCCTGAA GGAAAGGAAA CTTATGAAGG AACAATTAGC 960TTCTCAGGAC TATGCCTGGA TGATCATGAA GTTTGTGCGG AAAATCTTAC TTCCACAATC 1020CTACAAGATG TCACATTAGC AGGAGGAACT CTCTCTCTAT CGGATGGGGT TACCTTGCAA 1080CTGCATTCTT TTAAGCAGGA AGCAAGCTCT ACGCTTACTA TGTCTCCAGG AACCACTCTG 1140CTCTGCTCAG GAGATGCTCG GGTTCAGAAT CTGCACATCC TGATTGAAGA TACCGACAAC 1200TTTGTTCCTG TAAGGATTCG CGCCGAGGAC AAGGATGCTC TTGTCTCATT AGAAAAACTT 1260AAAGTTGCCT TTGAGGCTTA TTGGTCCGTC TATGACTTTC CTCAATTTAA GGAAGCCTTT 1320ACGATTCCTC TTCTTGAACT TCTAGGGCCT TCTTTTGACA GTCTTCTCCT AGGGGAGACC 1380ACTTTGGAGA GAACCCAAGT CACAACAGAG AATGACGCCG TTCGAGGTTT CTGGTCCCTA 1440AGCTGGGAAG AGTACCCCCC TTCTCTGGAT AAAGACAGAA GGATCACACC AACTAAGAAA 1500ACTGTTTTCC TCACTTGGAA TCCTGAGATC ACTTCTACGC CATAA 1545
(2)SEQ ID NO:20的信息:
(i)序列特征:
(A)长度:514氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:肽
(xi)序列描述:SEQ ID NO:20:Met Thr Ile Leu Arg Asn Phe Leu Thr Cys Ser Ala Leu Phe Leu Ala1 5 10 15Leu Pro Ala Ala Ala Gln Val Val Tyr Leu His Glu Ser Asp Gly Tyr
20 25 30Asn Gly Ala Ile Asn Asn Lys Ser Leu Glu Pro Lys Ile Thr Cys Tyr
35 40 45Pro Glu Gly Thr Ser Tyr Ile Phe Leu Asp Asp Val Arg Ile Ser Asn
50 55 60Val Lys His Asp Gln Glu Asp Ala Gly Val Phe Ile Asn Arg Ser Gly65 70 75 80Asn Leu Phe Phe Met Gly Asn Arg Cys Asn Phe Thr Phe His Asn Leu
85 90 95Met Thr Glu Gly Phe Gly Ala Ala Ile Ser Asn Arg Val Gly Asp Thr
100 105 110Thr Leu Thr Leu Ser Asn Phe Ser Tyr Leu Thr Phe Thr Ser Ala Pro
115 120 125Leu Leu Pro Gln Gly Gln Gly Ala Ile Tyr Ser Leu Gly Ser Val Met
130 135 140Ile Glu Asn Ser Glu Glu Val Thr Phe Cys Gly Asn Tyr Ser Ser Trp145 150 155 160Ser Gly Ala Ala Ile Tyr Thr Pro Tyr Leu Leu Gly Ser Lys Ala Ser
165 170 175Arg Pro Ser Val Asn Leu Ser Gly Asn Arg Tyr Leu Val Phe Arg Asp
180 185 190Tyr Val Ser Gln Gly Tyr Gly Gly Ala Val Ser Thr His Asn Leu Thr
195 200 205Leu Thr Thr Arg Gly Pro Ser Cys Phe Glu Asn Asn His Ala Tyr His
210 215 220Asp Val Asn Ser Asn Gly Gly Ala Ile Ala Ile Ala Pro Gly Gly Ser225 230 235 240Ile Ser Ile Ser Val Lys Ser Gly Asp Leu Ile Phe Lys Gly Asn Thr
245 250 255Ala Ser Gln Asp Gly Asn Thr Ile His Asn Ser Ile His Leu Gln Ser
260 265 270Gly Ala Gln Phe Lys Asn Leu Arg Ala Val Ser Glu Ser Gly Val Tyr
275 280 285Phe Tyr Asp Pro Ile Ser His Ser Glu Ser His Lys Ile Thr Asp Leu
290 295 300Val Ile Asn Ala Pro Glu Gly Lys Glu Thr Tyr Glu Gly Thr Ile Ser305 310 315 320Phe Ser Gly Leu Cys Leu Asp Asp His Glu Val Cys Ala Glu Asn Leu
325 330 335Thr Ser Thr Ile Leu Gln Asp Val Thr Leu Ala Gly Gly Thr Leu Ser
340 345 350Leu Ser Asp Gly Val Thr Leu Gln Leu His Ser Phe Lys Gln Glu Ala
355 360 365Ser Ser Thr Leu Thr Met Ser Pro Gly Thr Thr Leu Leu Cys Ser Gly
370 375 380Asp Ala Arg Val Gln Asn Leu His Ile Leu Ile Glu Asp Thr Asp Asn385 390 395 400Phe Val Pro Val Arg Ile Arg Ala Glu Asp Lys Asp Ala Leu Val Ser
405 410 415Leu Glu Lys Leu Lys Val Ala Phe Glu Ala Tyr Trp Ser Val Tyr Asp
420 425 430Phe Pro Gln Phe Lys Glu Ala Phe Thr Ile Pro Leu Leu Glu Leu Leu
435 440 445Gly Pro Ser Phe Asp Ser Leu Leu Leu Gly Glu Thr Thr Leu Glu Arg
450 455 460Thr Gln Val Thr Thr Glu Asn Asp Ala Val Arg Gly Phe Trp Ser Leu465 470 475 480Ser Trp Glu Glu Tyr Pro Pro Ser Leu Asp Lys Asp Arg Arg Ile Thr
485 490 495Pro Thr Lys Lys Thr Val Phe Leu Thr Trp Asn Pro Glu Ile Thr Ser
500 505 510Thr Pro
(2)SEQ ID NO:21的信息:
(i)序列特征:
(A)长度:787碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:基因组DNA
(xi)序列描述:SEQ ID NO:21:ATGAAAACGT CTATTCGTAA GTTCTTAATT TCTACCACAC TGGCGCCATG TTTTGCTTCA 60ACAGCGTTTA CTGTAGAAGT TATCATGCCT TCCGAGAACT TTGATGGATC GAGTGGGAAG 120ATTTTTCCTT ACACAACACT TTCTGATCCT AGAGGGACAC TCTGTATTTT TTCAGGGGAT 180CTCTACATTG CGAATCTTGA TAATGCCATA TCCAGAACCT CTTCCAGTTG CTTTAGCAAT 240AGGGCGGGAG CACTACAAAT CTTAGGAAAA GGTGGGGTTT TCTCCTTCTT AAATATCCGT 300TCTTCAGCTG ACGGAGCCGC GATTAGTAGT GTAATCACCC AAAATCCTGA ACTATGTCCC 360TTGAGTTTTT CAGGATTTAG TCAGATGATC TTCGATAACT GTGAATCTTT GACTTCAGAT 420ACCTCAGCGA GTAATGTCAT ACCTCACGCA TCGGCGATTT ACGCTACAAC GCCCATGCTC 480TTTACAAACA ATGACTCCAT ACTATTCCAA TACAACCGTT CTGCAGGATT TGGAGCTGCC 540ATTCGAGGCA CAAGCATCAC AATAGAAAAT ACGAAAAAGA GCCTTCTCTT TAATGGTAAT 600GGATCCATCT CTAATGGAGG GGCCCTCACG GGATCTGCAG CGATCAACCT CATCAACAAT 660AGCGCTCCTG TGATTTTCTC AACGAATGCT ACAGGGATCT ATGGTGGGGC TATTTACCTT 720ACCGGAGGAT CTATGCTCAC CTCTGGGAAC CTCTCAGGAG TCTTGTTCGT TTATAATAGC 780TCGCGCT 787
(2)SEQ ID NO:22的信息:
(i)序列特征:
(A)长度:262氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:肽
(xi)序列描述:SEQ ID NO:22:Met Lys Thr Ser Ile Arg Lys Phe Leu Ile Ser Thr Thr Leu Ala Pro1 5 10 15Cys Phe Ala Ser Thr Ala Phe Thr Val Glu Val Ile Met Pro Ser Glu
20 25 30Asn Phe Asp Gly Ser Ser Gly Lys Ile Phe Pro Tyr Thr Thr Leu Ser
35 40 45Asp Pro Arg Gly Thr Leu Cys Ile Phe Ser Gly Asp Leu Tyr Ile Ala
50 55 60Asn Leu Asp Asn Ala Ile Ser Arg Thr Ser Ser Ser Cys Phe Ser Asn65 70 75 80Arg Ala Gly Ala Leu Gln Ile Leu Gly Lys Gly Gly Val Phe Ser Phe
85 90 95Leu Asn Ile Arg Ser Ser Ala Asp Gly Ala Ala Ile Ser Ser Val Ile
100 105 110Thr Gln Asn Pro Glu Leu Cys Pro Leu Ser Phe Ser Gly Phe Ser Gln
115 120 125Met Ile Phe Asp Asn Cys Glu Ser Leu Thr Ser Asp Thr Ser Ala Ser
130 135 140Asn Val Ile Pro His Ala Ser Ala Ile Tyr Ala Thr Thr Pro Met Leu145 150 155 160Phe Thr Asn Asn Asp Ser Ile Leu Phe Gln Tyr Asn Arg Ser Ala Gly
165 170 175Phe Gly Ala Ala Ile Arg Gly Thr Ser Ile Thr Ile Glu Asn Thr Lys
180 185 190Lys Ser Leu Leu Phe Asn Gly Asn Gly Ser Ile Ser Asn Gly Gly Ala
195 200 205Leu Thr Gly Ser Ala Ala Ile Asn Leu Ile Asn Asn Ser Ala Pro Val
210 215 220Ile Phe Ser Thr Asn Ala Thr Gly Ile Tyr Gly Gly Ala Ile Tyr Leu225 230 235 240Thr Gly Gly Ser Met Leu Thr Ser Gly Asn Leu Ser Gly Val Leu Phe
245 250 255Val Tyr Asn Ser Ser Arg
260
(2)SEQ ID NO:23的信息:
(i)序列特征:
(A)长度:2838碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:基因组DNA
(xi)序列描述:SEQ ID NO:23:ATGAAGACTT CAGTTTCTAT GTTGTTGGCC CTGCTTTGCT CGGGGGCTAG CTCTATTGTA 60CTCCATGCCG CAACCACTCC ACTAAATCCT GAAGATGGGT TTATTGGGGA GGGCAATACA 120AATACTTTTT CTCCGAAATC TACAACGGAT GCTGCAGGAA CTACCTACTC TCTCACAGGA 180GAGGTTCTGT TTATAGATCC GGGGAAAGGT GGTTCAATTA CAGGAACTTG CTTTGTAGAA 240ACTGCTGGCG ATCTTACATT TTTAGGTAAT GGAAATACCC TAAAGTTCCT GTCGGTAGAT 300GCAGGTGCTA ATATCGCGGT TGCTCATGTA CAAGGAAGTA AGAATTTAAG CTTCACAGAT 360TTCCTTTCTC TGGTGATCAC AGAATCTCCA AAATCCGCTG TTAGTACAGG AAAAGGTAGC 420CTAGTCAGTT CAGGTGCAGT CCAACTGCAA GATATAAACA CTCTAGTTCT TACAAGCAAT 480GCCTCTGTCG AAGATGGTGG CGTGATTAAA GGAAACTCCT GCTTGATTCA GGGAATCAAA 540AATAGTGCGA TTTTTGGACA AAATACATCT TCGAAAAAAG GAGGGGCGAT CTCCACGACT 600CAAGGACTCA CCATAGAGAA TAACTTAGGG ACGCTAAAGT TCAATGAAAA CAAAGCAGTG 660ACCTCAGGAG GCGCCTTAGA TTTAGGAGCC GCGTCTACAT TCACTGCGAA CCATGAGTTG 720ATATTTTCAC AAAATAAGAC TTCTGGGAAT GCTGCAAATG GCGGAGCCAT AAATTGCTCA 780GGCGACCTAA CATTTACTGA TAACACTTCT TTGTTACTTC AAGAAAATAG CACAATGCAG 840GATGGTGGAG CTTTGTGTAG CACAGGAACC ATAAGCATTA CCGGTAGTGA TTCTATCAAT 900GTGATAGGAA ATACTTCAGG ACAAAAAGGA GGAGCGATTT CTGCAGCTTC TCTCAAGATT 960TTGGGAGGGC AGGGAGGCGC TCTCTTTTCT AATAACGTAG TGACTCATGC CACCCCTCTA 1020GGAGGTGCCA TTTTTATCAA CACAGGAGGA TCCTTGCAGC TCTTCACTCA AGGAGGGGAT 1080ATCGTATTCG AGGGGAATCA GGTCACTACA ACAGCTCCAA ATGCTACCAC TAAGAGAAAT 1140GTAATTCACC TCGAGAGCAC CGCGAAGTGG ACGGGACTTG CTGCAAGTCA AGGTAACGCT 1200ATCTATTTCT ATGATCCCAT TACCACCAAC GATACGGGAG CAAGCGATAA CTTACGTATC 1260AATGAGGTCA GTGCAAATCA AAAGCTCTCG GGATCTATAG TATTTTCTGG AGAGAGATTG 1320TCGACAGCAG AAGCTATAGC TGAAAATCTT ACTTCGAGGA TCAACCAGCC TGTCACTTTA 1380GTAGAGGGGA GCTTAGAACT TAAACAGGGA GTGACCTTGA TCACACAAGG ATTCTCGCAG 1440GAGCCAGAAT CCACGCTTCT TTTGGATTTG GGGACCTCAT TACAAGCTTC TACAGAAGAT 1500ATCGTCATCA CAAATTCATC TATAAATGCC GATACCATTT ACGGAAAGAA TCCAATCAAT 1560ATTGTAGCTT CAGCAGCGAA TAAGAACATT ACCCTAACAG GAACCTTAGC ACTTGTAAAT 1620GCAGATGGAG CTTTGTATGA GAACCATACC TTGCAAGACT CTCAAGATTA TAGCTTTGTA 1680AAGTTATCTC CAGGAGCGGG AGGGACTATA ATTACTCAAG ATGCTTCTCA GAAGCTTCTT 1740GAAGTAGCTC CTTCTAGACC ACATTATGGC TATCAAGGAC ATTGGAATGT GCAAGTCATC 1800CCAGGAACGG GAACTCAACC GAGCCAGGCA AATTTAGAAT GGGTGCGGAC AGGATACCTT 1860CCGAATCCCG AACGGCAAGG ATTTTTAGTT CCCAATAGCC TGTGGGGTTC TTTTGTTGAT 1920CAGCGTGCTA TCCAAGAAAT CATGGTAAAT AGTAGCCAAA TCTTATGTCA GGAACGGGGA 1980GTCTGGGGAG CTGGAATTGC TAATTTCCTA CATAGAGATA AAATTAATGA GCACGGCTAT 2040CGCCATAGCG GTGTCGGTTA TCTTGTGGGA GTTGGCACTC ATGCTTTTTC TGATGCTACG 2100ATAAATGCGG CTTTTTGCCA GCTCTTCAGT AGAGATAAAG ACTACGTAGT ATCCAAAAAT 2160CATGGAACTA GCTACTCAGG GGTCGTATTT CTTGAGGATA CCCTAGAGTT TAGAAGTCCA 2220CAGGGATTCT ATACTGATAG CTCCTCAGAA GCTTGCTGTA ACCAAGTCGT CACTATAGAT 2280ATGCAGTTGT CTTACAGCCA TAGAAATAAT GATATGAAAA CCAAATACAC GACATATCCA 2340GAAGCTCAGG GATCTTGGGC AAATGATGTT TTTGGTCTTG AGTTTGGAGC GACTACATAC 2400TACTACCCTA ACAGTACTTT TTTATTTGAT TACTACTCTC CGTTTCTCAG GCTGCAGTGC 2460ACCTATGCTC ACCAGGAAGA CTTCAAAGAG ACAGGAGGTG AGGTTCGTCA CTTTACTAGC 2520GGAGATCTTT TCAATTTAGC AGTTCCTATT GGCGTGAAGT TTGAGAGATT TTCAGACTGT 2580AAAAGGGGAT CTrATGAACT TACCCTTGCT TATGTTCCTG ATGTGATTCG CAAAGATCCC 2640AAGAGCACGG CAACATTGGC TAGTGGAGCT ACGTGGAGCA CCCACGGAAA CAATCTCTCC 2700AGACAAGGAT TACAACTGCG TTTAGGGAAC CACTGTCTCA TAAATCCTGG AATTGAGGTG 2760TTCAGTCACG GAGCTATTGA ATTGCGGGGA TCCTCTCGTA ATTATAACAT CAATCTCGGG 2820GGTAAATACC GATTTTAA 2838
(2)SEQ ID NO:24的信息:
(i)序列特征:
(A)长度:946氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:肽
(xi)序列描述:SEQ ID NO:24:Met Lys Thr Ser Val Ser Met Leu Leu Ala Leu Leu Cys Ser Gly Ala1 5 10 15Ser Ser Ile Val Leu His Ala Ala Thr Thr Pro Leu Asn Pro Glu Asp
20 25 30Gly Phe Ile Gly Glu Gly Asn Thr Asn Thr Phe Ser Pro Lys Ser Thr
35 40 45Thr Asp Ala Ala Gly Thr Thr Tyr Ser Leu Thr Gly Glu Val Leu Phe
50 55 60Ile Asp Pro Gly Lys Gly Gly Ser Ile Thr Gly Thr Cys Phe Val Glu65 70 75 80Thr Ala Gly Asp Leu Thr Phe Leu Gly Asn Gly Asn Thr Leu Lys Phe
85 90 95Leu Ser Val Asp Ala Gly Ala Asn Ile Ala Val Ala His Val Gln Gly
100 105 110Ser Lys Asn Leu Ser Phe Thr Asp Phe Leu Ser Leu Val Ile Thr Glu
115 120 125Ser Pro Lys Ser Ala Val Ser Thr Gly Lys Gly Ser Leu Val Ser Ser
130 135 140Gly Ala Val Gln Leu Gln Asp Ile Asn Thr Leu Val Leu Thr Ser Asn145 150 155 160Ala Ser Val Glu Asp Gly Gly Val Ile Lys Gly Asn Ser Cys Leu Ile
165 170 175Gln Gly Ile Lys Asn Ser Ala Ile Phe Gly Gln Asn Thr Ser Ser Lys
180 185 190Lys Gly Gly Ala Ile Ser Thr Thr Gln Gly Leu Thr Ile Glu Asn Asn
195 200 205Leu Gly Thr Leu Lys Phe Asn Glu Asn Lys Ala Val Thr Ser Gly Gly
210 215 220Ala Leu Asp Leu Gly Ala Ala Ser Thr Phe Thr Ala Asn His Glu Leu225 230 235 240Ile Phe Ser Gln Asn Lys Thr Ser Gly Asn Ala Ala Asn Gly Gly Ala
245 250 255Ile Asn Cys Ser Gly Asp Leu Thr Phe Thr Asp Asn Thr Ser Leu Leu
260 265 270Leu Gln Glu Asn Ser Thr Met Gln Asp Gly Gly Ala Leu Cys Ser Thr
275 280 285Gly Thr Ile Ser Ile Thr Gly Ser Asp Ser Ile Asn Val Ile Gly Asn
290 295 300Thr Ser Gly Gln Lys Gly Gly Ala Ile Ser Ala Ala Ser Leu Lys Ile305 310 315 320Leu Gly Gly Gln Gly Gly Ala Leu Phe Ser Asn Asn Val Val Thr His
325 330 335Ala Thr Pro Leu Gly Gly Ala Ile Phe Ile Asn Thr Gly Gly Ser Leu
340 345 350Gln Leu Phe Thr Gln Gly Gly Asp Ile Val Phe Glu Gly Asn Gln Val
355 360 365Thr Thr Thr Ala Pro Asn Ala Thr Thr Lys Arg Asn Val Ile His Leu
370 375 380Glu Ser Thr Ala Lys Trp Thr Gly Leu Ala Ala Ser Gln Gly Asn Ala385 390 395 400Ile Tyr Phe Tyr Asp Pro Ile Thr Thr Asn Asp Thr Gly Ala Ser Asp
405 410 415Asn Leu Arg Ile Asn Glu Val Ser Ala Asn Gln Lys Leu Ser Gly Ser
420 425 430Ile Val Phe Ser Gly Glu Arg Leu Ser Thr Ala Glu Ala Ile Ala Glu
435 440 445Asn Leu Thr Ser Arg Ile Asn Gln Pro Val Thr Leu Val Glu Gly Ser
450 455 460Leu Glu Leu Lys Gln Gly Val Thr Leu Ile Thr Gln Gly Phe Ser Gln465 470 475 480Glu Pro Glu Ser Thr Leu Leu Leu Asp Leu Gly Thr Ser Leu Gln Ala
485 490 495Ser Thr Glu Asp Ile Val Ile Thr Asn Ser Ser Ile Asn Ala Asp Thr
500 505 510Ile Tyr Gly Lys Asn Pro Ile Asn Ile Val Ala Ser Ala Ala Asn Lys
515 520 525Asn Ile Thr Leu Thr Gly Thr Leu Ala Leu Val Asn Ala Asp Gly Ala
530 535 540Leu Tyr Glu Asn His Thr Leu Gln Asp Ser Gln Asp Tyr Ser Phe Val545 550 555 560Lys Leu Ser Pro Gly Ala Gly Gly Thr Ile Ile Thr Gln Asp Ala Ser
565 570 575Gln Lys Leu Leu Glu Val Ala Pro Ser Arg Pro His Tyr Gly Tyr Gln
580 585 590Gly His Trp Asn Val Gln Val Ile Pro Gly Thr Gly Thr Gln Pro Ser
595 600 605Gln Ala Asn Leu Glu Trp Val Arg Thr Gly Tyr Leu Pro Asn Pro Glu
610 615 620Arg Gln Gly Phe Leu Val Pro Asn Ser Leu Trp Gly Ser Phe Val Asp625 630 635 640Gln Arg Ala Ile Gln Glu Ile Met Val Asn Ser Ser Gln Ile Leu Cys
645 650 655Gln Glu Arg Gly Val Trp Gly Ala Gly Ile Ala Asn Phe Leu His Arg
660 665 670Asp Lys Ile Asn Glu His Gly Tyr Arg His Ser Gly Val Gly Tyr Leu
675 680 685Val Gly Val Gly Thr His Ala Phe Ser Asp Ala Thr Ile Asn Ala Ala
690 695 700Phe Cys Gln Leu Phe Ser Arg Asp Lys Asp Tyr Val Val Ser Lys Asn705 710 715 720His Gly Thr Ser Tyr Ser Gly Val Val Phe Leu Glu Asp Thr Leu Glu
725 730 735Phe Arg Ser Pro Gln Gly Phe Tyr Thr Asp Ser Ser Ser Glu Ala Cys
740 745 750Cys Asn Gln Val Val Thr Ile Asp Met Gln Leu Ser Tyr Ser His Arg
755 760 765Asn Asn Asp Met Lys Thr Lys Tyr Thr Thr Tyr Pro Glu Ala Gln Gly
770 775 780Ser Trp Ala Asn Asp Val Phe Gly Leu Glu Phe Gly Ala Thr Thr Tyr785 790 795 800Tyr Tyr Pro Asn Ser Thr Phe Leu Phe Asp Tyr Tyr Ser Pro Phe Leu
805 810 815Arg Leu Gln Cys Thr Tyr Ala His Gln Glu Asp Phe Lys Glu Thr Gly
820 825 830Gly Glu Val Arg His Phe Thr Ser Gly Asp Leu Phe Asn Leu Ala Val
835 840 845Pro IIe Gly Val Lys Phe Glu Arg Phe Ser Asp Cys Lys Arg Gly Ser
850 855 860Tyr Glu Leu Thr Leu Ala Tyr Val Pro Asp Val Ile Arg Lys Asp Pro865 870 875 880Lys Ser Thr Ala Thr Leu Ala Ser Gly Ala Thr Trp Ser Thr His Gly
885 890 895Asn Asn Leu Ser Arg Gln Gly Leu Gln Leu Arg Leu Gly Asn His Cys
900 905 910Leu Ile Asn Pro Gly Ile Glu Val Phe Ser His Gly Ala Ile Glu Leu
915 920 925Arg Gly Ser Ser Arg Asn Tyr Asn Ile Asn Leu Gly Gly Lys Tyr Arg
930 935 940Phe945
(2)SEQ ID NO:25的信息:
(i)序列特征:
(A)长度:3000碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ix)特征:
(A)名称/关键字:编码序列
(B)位置:259...3000
(D)其它信息:
(xi)序列描述:SEQ ID NO:25:ATCAGGTGAT AAAAGTTCCT CGTTAGCTAG TGACTGTAGG TGACATGAGA AAGCTAACAC 60GGAGGAAACT AAAACCCAAG GAATCGAAGT CTTCATGGTA ATGCTTTTGT TTTTTAGAGA 120ACTATTCGCA TCAATATAGA AACAAAATAA GTAAATCAAG TTAAAGATGA CAAAACAGCT 180GTCAAGAATT TTTATCTTGA CTCTCTGAGT TTTCTATTTT ATATGACGCA AGTAAGAATT 240TAATAATAAA GTGGGTTT ATG AAA TCG CAA TTT TCC TGG TTA GTG CTC TCT 291
Met Lys Ser Gln Phe Ser Trp Leu Val Leu Ser
1 5 10TCG ACA TTG GCA TGT TTT ACT AGT TGT TCC ACT GTT TTT GCT GCA ACT 339Ser Thr Leu Ala Cys Phe Thr Ser Cys Ser Thr Val Phe Ala Ala Thr
15 20 25GCT GAA AAT ATA GGC CCC TCT GAT AGC TTT GAC GGA AGT ACT AAC ACA 387Ala Glu Asn Ile Gly Pro Ser Asp Ser Phe Asp Gly Ser Thr Asn Thr
30 35 40GGC ACC TAT ACT CCT AAA AAT ACG ACT ACT GGA ATA GAC TAT ACT CTG 435Gly Thr Tyr Thr Pro Lys Asn Thr Thr Thr Gly Ile Asp Tyr Thr Leu
45 50 55ACA GGA GAT ATA ACT CTG CAA AAC CTT GGG GAT TCG GCA GCT TTA ACG 483Thr Gly Asp Ile Thr Leu Gln Asn Leu Gly Asp Ser Ala Ala Leu Thr60 65 70 75AAG GGT TGT TTT TCT GAC ACT ACG GAA TCT TTA AGC TTT GCC GGT AAG 531Lys Gly Cys Phe Ser Asp Thr Thr Glu Ser Leu Ser Phe Ala Gly Lys
80 85 90GGG TAC TCA CTT TCT TTT TTA AAT ATT AAG TCT AGT GCT GAA GGC GCA 579Gly Tyr Ser Leu Ser Phe Leu Asn Ile Lys Ser Ser Ala Glu Gly Ala
95 100 105GCA CTT TCT GTT ACA ACT GAT AAA AAT CTG TCG CTA ACA GGA TTT TCG 627Ala Leu Ser Val Thr Thr Asp Lys Asn Leu Ser Leu Thr Gly Phe Ser
110 115 120AGT CTT ACT TTC TTA GCG GCC CCA TCA TCG GTA ATC ACA ACC CCC TCA 675Ser Leu Thr Phe Leu Ala Ala Pro Ser Ser Val Ile Thr Thr Pro Ser
125 130 135GGA AAA GGT GCA GTT AAA TGT GGA GGG GAT CTT ACA TTT GAT AAC AAT 723Gly Lys Gly Ala Val Lys Cys Gly Gly Asp Leu Thr Phe Asp Asn Asn140 145 150 155GGA ACT ATT TTA TTT AAA CAA GAT TAC TGT GAG GAA AAT GGC GGA GCC 771Gly Thr Ile Leu Phe Lys Gln Asp Tyr Cys Glu Glu Asn Gly Gly Ala
160 165 170ATT TCT ACC AAG AAT CTT TCT TTG AAA AAC AGC ACG GGA TCG ATT TCT 819Ile Ser Thr Lys Asn Leu Ser Leu Lys Asn Ser Thr Gly Ser Ile Ser
175 180 185TTT GAA GGG AAT AAA TCG AGC GCA ACA GGG AAA AAA GGT GGG GCT ATT 867Phe Glu Gly Asn Lys Ser Ser Ala Thr Gly Lys Lys Gly Gly Ala Ile
190 195 200TGT GCT ACT GGT ACT GTA GAT ATT ACA AAT AAT ACG GCT CCT ACC CTC 915Cys Ala Thr Gly Thr Val Asp Ile Thr Asn Asn Thr Ala Pro Thr Leu
205 210 215TTC TCG AAC AAT ATT GCT GAA GCT GCA GGT GGA GCT ATA AAT AGC ACA 963Phe Ser Asn Asn Ile Ala Glu Ala Ala Gly Gly Ala Ile Asn Ser Thr220 225 230 235GGA AAC TGT ACA ATT ACA GGG AAT ACG TCT CTT GTA TTT TCT GAA AAT 1011Gly Asn Cys Thr Ile Thr Gly Asn Thr Ser Leu Val Phe Ser Glu Asn
240 245 250AGT GTG ACA GCG ACC GCA GGA AAT GGA GGA GCT CTT TCT GGA GAT GCC 1059Ser Val Thr Ala Thr Ala Gly Asn Gly Gly Ala Leu Ser Gly Asp Ala
255 260 265GAT GTT ACC ATA TCT GGG AAT CAG AGT GTA ACT TTC TCA GGA AAC CAA 1107Asp Val Thr Ile Ser Gly Asn Gln Ser Val Thr Phe Ser Gly Asn Gln
270 275 280GCT GTA GCT AAT GGC GGA GCC ATT TAT GCT AAG AAG CTT ACA CTG GCT 1155Ala Val Ala Asn Gly Gly Ala Ile Tyr Ala Lys Lys Leu Thr Leu Ala
285 290 295TCC GGG GGG GGG GGG GGT ATC TCC TTT TCT AAC AAT ATA GTC CAA GGT 1203Ser Gly Gly Gly Gly Gly Ile Ser Phe Ser Asn Asn Ile Val Gln Gly300 305 310 315ACC ACT GCA GGT AAT GGT GGA GCC ATT TCT ATA CTG GCA GCT GGA GAG 1251Thr Thr Ala Gly Asn Gly Gly Ala Ile Ser Ile Leu Ala Ala Gly Glu
320 325 330TGT AGT CTT TCA GCA GAA GCA GGG GAC ATT ACC TTC AAT GGG AAT GCC 1299Cys Ser Leu Ser Ala Glu Ala Gly Asp Ile Thr Phe Asn Gly Asn Ala
335 340 345ATT GTT GCA ACT ACA CCA CAA ACT ACA AAA AGA AAT TCT ATT GAC ATA 1347Ile Val Ala Thr Thr Pro Gln Thr Thr Lys Arg Asn Ser Ile Asp Ile
350 355 360GGA TCT ACT GCA AAG ATC ACG AAT TTA CGT GCA ATA TCT GGG CAT AGC 1395Gly Ser Thr Ala Lys Ile Thr Asn Leu Arg Ala Ile Ser Gly His Ser
365 370 375ATC TTT TTC TAC GAT CCG ATT ACT GCT AAT ACG GCT GCG GAT TCT ACA 1443Ile Phe Phe Tyr Asp Pro Ile Thr Ala Asn Thr Ala Ala Asp Ser Thr380 385 390 395GAT ACT TTA AAT CTC AAT AAG GCT GAT GCA GGT AAT AGT ACA GAT TAT 1491Asp Thr Leu Asn Leu Asn Lys Ala Asp Ala Gly Asn Ser Thr Asp Tyr
400 405 410AGT GGG TCG ATT GTT TTT TCT GGT GAA AAG CTC TCT GAA GAT GAA GCA 1539Ser Gly Ser Ile Val Phe Ser Gly Glu Lys Leu Ser Glu Asp Glu Ala
415 420 425AAA GTT GCA GAC AAC CTC ACT TCT ACG CTG AAG CAG CCT GTA ACT CTA 1587Lys Val Ala Asp Asn Leu Thr Ser Thr Leu Lys Gln Pro Val Thr Leu
430 435 440ACT GCA GGA AAT TTA GTA CTT AAA CGT GGT GTC ACT CTC GAT ACG AAA 1635Thr Ala Gly Asn Leu Val Leu Lys Arg Gly Val Thr Leu Asp Thr Lys
445 450 455GGC TTT ACT CAG ACC GCG GGT TCC TCT GTT ATT ATG GAT GCG GGC ACA 1683Gly Phe Thr Gln Thr Ala Gly Ser Ser Val Ile Met Asp Ala Gly Thr460 465 470 475ACG TTA AAA GCA AGT ACA GAG GAG GTC ACT TTA ACA GGT CTT TCC ATT 1731Thr Leu Lys Ala Ser Thr Glu Glu Val Thr Leu Thr Gly Leu Ser Ile
480 485 490CCT GTA GAC TCT TTA GGC GAG GGT AAG AAA GTT GTA ATT GCT GCT TCT 1779Pro Val Asp Ser Leu Gly Glu Gly Lys Lys Val Val Ile Ala Ala Ser
495 500 505GCA GCA AGT AAA AAT GTA GCC CTT AGT GGT CCG ATT CTT CTT TTG GAT 1827Ala Ala Ser Lys Asn Val Ala Leu Ser Gly Pro Ile Leu Leu Leu Asp
510 515 520AAC CAA GGG AAT GCT TAT GAA AAT CAC GAC TTA GGA AAA ACT CAA GAC 1875Asn Gln Gly Asn Ala Tyr Glu Asn His Asp Leu Gly Lys Thr Gln Asp
525 530 535TTT TCA TTT GTG CAG CTC TCT GCT CTG GGT ACT GCA ACA ACT ACA GAT 1923Phe Ser Phe Val Gln Leu Ser Ala Leu Gly Thr Ala Thr Thr Thr Asp540 545 550 555GTT CCA GCG GTT CCT ACA GTA GCA ACT CCT ACG CAC TAT GGG TAT CAA 1971Val Pro Ala Val Pro Thr Val Ala Thr Pro Thr His Tyr Gly Tyr Gln
560 565 570GGT ACT TGG GGA ATG ACT TGG GTT GAT GAT ACC GCA AGC ACT CCA AAG 2019Gly Thr Trp Gly Met Thr Trp Val Asp Asp Thr Ala Ser Thr Pro Lys
575 580 585ACT AAG ACA GCG ACA TTA GCT TGG ACC AAT ACA GGC TAC CTT CCG AAT 2067Thr Lys Thr Ala Thr Leu Ala Trp Thr Asn Thr Gly Tyr Leu Pro Asn
590 595 600CCT GAG CGT CAA GGA CCT TTA GTT CCT AAT AGC CTT TGG GGA TCT TTT 2115Pro Glu Arg Gln Gly Pro Leu Val Pro Asn Ser Leu Trp Gly Ser Phe
605 610 615TCA GAC ATC CAA GCG ATT CAA GGT GTC ATA GAG AGA AGT GCT TTG ACT 2163Ser Asp Ile Gln Ala Ile Gln Gly Val Ile Glu Arg Ser Ala Leu Thr620 625 630 635CTT TGT TCA GAT CGA GGC TTC TGG GCT GCG GGA GTC GCC AAT TTC TTA 2211Leu Cys Ser Asp Arg Gly Phe Trp Ala Ala Gly Val Ala Asn Phe Leu
640 645 650GAT AAA GAT AAG AAA GGG GAA AAA CGC AAA TAC CGT CAT AAA TCT GGT 2259Asp Lys Asp Lys Lys Gly Glu Lys Arg Lys Tyr Arg His Lys Ser Gly
655 660 665GGA TAT GCT ATC GGA GGT GCA GCG CAA ACT TGT TCT GAA AAC TTA ATT 2307Gly Tyr Ala Ile Gly Gly Ala Ala Gln Thr Cys Ser Glu Asn Leu Ile
670 675 680AGC TTT GCC TTT TGC CAA CTC TTT GGT AGC GAT AAA GAT TTC TTA GTC 2355Ser Phe Ala Phe Cys Gln Leu Phe Gly Ser Asp Lys Asp Phe Leu Val
685 690 695GCT AAA AAT CAT ACT GAT ACC TAT GCA GGA GCC TTC TAT ATC CAA CAC 2403Ala Lys Asn His Thr Asp Thr Tyr Ala Gly Ala Phe Tyr Ile Gln His700 705 710 715ATT ACA GAA TGT AGT GGG TTC ATA GGT TGT CTC TTA GAT AAA CTT CCT 2451Ile Thr Glu Cys Ser Gly Phe Ile Gly Cys Leu Leu Asp Lys Leu Pro
720 725 730GGC TCT TGG AGT CAT AAA CCC CTC GTT TTA GAA GGG CAG CTC GCT TAT 2499Gly Ser Trp Ser His Lys Pro Leu Val Leu Glu Gly Gln Leu Ala Tyr
735 740 745AGC CAC GTC AGT AAT GAT CTG AAG ACA AAG TAT ACT GCG TAT CCT GAG 2547Ser His Val Ser Asn Asp Leu Lys Thr Lys Tyr Thr Ala Tyr Pro Glu
750 755 760GTG AAA GGT TCT TGG GGG AAT AAT GCT TTT AAC ATG ATG TTG GGA GCT 2595Val Lys Gly Ser Trp Gly Asn Asn Ala Phe Asn Met Met Leu Gly Ala
765 770 775TCT TCT CAT TCT TAT CCT GAA TAC CTG CAT TGT TTT GAT ACC TAT GCT 2643Ser Ser His Ser Tyr Pro Glu Tyr Leu His Cys Phe Asp Thr Tyr Ala780 785 790 795CCA TAC ATC AAA CTG AAT CTG ACC TAT ATA CGT CAG GAC AGC TTC TCG 2691Pro Tyr Ile Lys Leu Asn Leu Thr Tyr Ile Arg Gln Asp Ser Phe Ser
800 805 810GAG AAA GGT ACA GAA GGA AGA TCT TTT GAT GAC AGC AAC CTC TTC AAT 2739Glu Lys Gly Thr Glu Gly Arg Ser Phe Asp Asp Ser Asn Leu Phe Asn
815 820 825TTA TCT TTG CCT ATA GGG GTG AAG TTT GAG AAG TTC TCT GAT TGT AAT 2787Leu Ser Leu Pro Ile Gly Val Lys Phe Glu Lys Phe Ser Asp Cys Asn
830 835 840GAC TTT TCT TAT GAT CTG ACT TTA TCC TAT GTT CCT GAT CTT ATC CGC 2835Asp Phe Ser Tyr Asp Leu Thr Leu Ser Tyr Val Pro Asp Leu Ile Arg
845 850 855AAT GAT CCC AAA TGC ACT ACA GCA CTT GTA ATC AGC GGA GCC TCT TGG 2883Asn Asp Pro Lys Cys Thr Thr Ala Leu Val Ile Ser Gly Ala Ser Trp860 865 870 875GAA ACT TAT GCC AAT AAC TTA GCA CGA CAG GCC TTG CAA GTG CGT GCA 2931Glu Thr Tyr Ala Asn Asn Leu Ala Arg Gln Ala Leu Gln Val Arg Ala
880 885 890GGC AGT CAC TAC GCC TTC TCT CCT ATG TTT GAA GTG CTC GGC CAG TTT 2979Gly Ser His Tyr Ala Phe Ser Pro Met Phe Glu Val Leu Gly Gln Phe
895 900 905GTC TTT GAA GTT CGT GGA TCC 3000Val Phe Glu Val Arg Gly Ser
910
(2)SEQ ID NO:26的信息:
(i)序列特征:
(A)长度:914氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:蛋白质
(v)片段类型:内部
(xi)序列描述:SEQ ID NO:26:Met Lys Ser Gln Phe Ser Trp Leu Val Leu Ser Ser Thr Leu Ala Cys1 5 10 15Phe Thr Ser Cys Ser Thr Val Phe Ala Ala Thr Ala Glu Asn Ile Gly
20 25 30Pro Ser Asp Ser Phe Asp Gly Ser Thr Asn Thr Gly Thr Tyr Thr Pro
35 40 45Lys Asn Thr Thr Thr Gly Ile Asp Tyr Thr Leu Thr Gly Asp Ile Thr
50 55 60Leu Gln Asn Leu Gly Asp Ser Ala Ala Leu Thr Lys Gly Cys Phe Ser65 70 75 80Asp Thr Thr Glu Ser Leu Ser Phe Ala Gly Lys Gly Tyr Ser Leu Ser
85 90 95Phe Leu Asn Ile Lys Ser Ser Ala Glu Gly Ala Ala Leu Ser Val Thr
100 105 110Thr Asp Lys Asn Leu Ser Leu Thr Gly Phe Ser Ser Leu Thr Phe Leu
115 120 125Ala Ala Pro Ser Ser Val Ile Thr Thr Pro Ser Gly Lys Gly Ala Val130 135 140Lys Cys Gly Gly Asp Leu Thr Phe Asp Asn Asn Gly Thr Ile Leu Phe145 150 155 160Lys Gln Asp Tyr Cys Glu Glu Asn Gly Gly Ala Ile Ser Thr Lys Asn
165 170 175Leu Ser Leu Lys Asn Ser Thr Gly Ser Ile Ser Phe Glu Gly Asn Lys
180 185 190Ser Ser Ala Thr Gly Lys Lys Gly Gly Ala Ile Cys Ala Thr Gly Thr
195 200 205Val Asp Ile Thr Asn Asn Thr Ala Pro Thr Leu Phe Ser Asn Asn Ile
210 215 220Ala Glu Ala Ala Gly Gly Ala Ile Asn Ser Thr Gly Asn Cys Thr Ile225 230 235 240Thr Gly Asn Thr Ser Leu Val Phe Ser Glu Asn Ser Val Thr Ala Thr
245 250 255Ala Gly Asn Gly Gly Ala Leu Ser Gly Asp Ala Asp Val Thr Ile Ser
260 265 270Gly Asn Gln Ser Val Thr Phe Ser Gly Asn Gln Ala Val Ala Asn Gly
275 280 285Gly Ala Ile Tyr Ala Lys Lys Leu Thr Leu Ala Ser Gly Gly Gly Gly
290 295 300Gly Ile Ser Phe Ser Asn Asn Ile Val Gln Gly Thr Thr Ala Gly Asn305 310 315 320Gly Gly Ala Ile Ser Ile Leu Ala Ala Gly Glu Cys Ser Leu Ser Ala
325 330 335Glu Ala Gly Asp Ile Thr Phe Asn Gly Asn Ala Ile Val Ala Thr Thr
340 345 350Pro Gln Thr Thr Lys Arg Asn Ser Ile Asp Ile Gly Ser Thr Ala Lys
355 360 365Ile Thr Asn Leu Arg Ala Ile Ser Gly His Ser Ile Phe Phe Tyr Asp
370 375 380Pro Ile Thr Ala Asn Thr Ala Ala Asp Ser Thr Asp Thr Leu Asn Leu385 390 395 400Asn Lys Ala Asp Ala Gly Asn Ser Thr Asp Tyr Ser Gly Ser Ile Val
405 410 415Phe Ser Gly Glu Lys Leu Ser Glu Asp Glu Ala Lys Val Ala Asp Asn
420 425 430Leu Thr Ser Thr Leu Lys Gln Pro ValThr Leu Thr Ala Gly Asn Leu
435 440 445Val Leu Lys Arg Gly Val Thr Leu Asp Thr Lys Gly Phe Thr Gln Thr
450 455 460Ala Gly Ser Ser Val Ile Met Asp Ala Gly Thr Thr Leu Lys Ala Ser465 470 475 480Thr Glu Glu Val Thr Leu Thr Gly Leu Ser Ile Pro Val Asp Ser Leu
485 490 495Gly Glu Gly Lys Lys Val Val Ile Ala Ala Ser Ala Ala Ser Lys Asn
500 505 510Val Ala Leu Ser Gly Pro Ile Leu Leu Leu Asp Asn Gln Gly Asn Ala
515 520 525Tyr Glu Asn His Asp Leu Gly Lys Thr Gln Asp Phe Ser Phe Val Gln
530 535 540Leu Ser Ala Leu Gly Thr Ala Thr Thr Thr Asp Val Pro Ala Val Pro545 550 555 560Thr Val Ala Thr Pro Thr His Tyr Gly Tyr Gln Gly Thr Trp Gly Met
565 570 575Thr Trp Val Asp Asp Thr Ala Ser Thr Pro Lys Thr Lys Thr Ala Thr
580 585 590Leu Ala Trp Thr Asn Thr Gly Tyr Leu Pro Asn Pro Glu Arg Gln Gly
595 600 605Pro Leu Val Pro Asn Ser Leu Trp Gly Ser Phe Ser Asp Ile Gln Ala
610 615 620Ile Gln Gly Val Ile Glu Arg Ser Ala Leu Thr Leu Cys Ser Asp Arg625 630 635 640Gly Phe Trp Ala Ala Gly Val Ala Asn Phe Leu Asp Lys Asp Lys Lys
645 650 655Gly Glu Lys Arg Lys Tyr Arg His Lys Ser Gly Gly Tyr Ala Ile Gly
660 665 670Gly Ala Ala Gln Thr Cys Ser Glu Asn Leu Ile Ser Phe Ala Phe Cys
675 680 685Gln Leu Phe Gly Ser Asp Lys Asp Phe Leu Val Ala Lys Asn His Thr
690 695 700Asp Thr Tyr Ala Gly Ala Phe Tyr Ile Gln His Ile Thr Glu Cys Ser705 710 715 720Gly Phe Ile Gly Cys Leu Leu Asp Lys Leu Pro Gly Ser Trp Ser His
725 730 735Lys Pro Leu Val Leu Glu Gly Gln Leu Ala Tyr Ser His Val Ser Asn
740 745 750Asp Leu Lys Thr Lys Tyr Thr Ala Tyr Pro Glu Val Lys Gly Ser Trp
755 760 765Gly Asn Asn Ala Phe Asn Met Met Leu Gly Ala Ser Ser His Ser Tyr
770 775 780Pro Glu Tyr Leu His Cys Phe Asp Thr Tyr Ala Pro Tyr Ile Lys Leu785 790 795 800Asn Leu Thr Tyr Ile Arg Gln Asp Ser Phe Ser Glu Lys Gly Thr Glu
805 810 815Gly Arg Ser Phe Asp Asp Ser Asn Leu Phe Asn Leu Ser Leu Pro Ile
820 825 830Gly Val Lys Phe Glu Lys Phe Ser Asp Cys Asn Asp Phe Ser Tyr Asp
835 840 845Leu Thr Leu Ser Tyr Val Pro Asp Leu Ile Arg Asn Asp Pro Lys Cys
850 855 860Thr Thr Ala Leu Val Ile Ser Gly Ala Ser Trp Glu Thr Tyr Ala Asn865 870 875 880Asn Leu Ala Arg Gln Ala Leu Gln Val Arg Ala Gly Ser His Tyr Ala
885 890 895Phe Ser Pro Met Phe Glu Val Leu Gly Gln Phe Val Phe Glu Val Arg
900 905 910Gly Ser
(2)SEQ ID N0:27的信息:
(i)序列特征:
(A)长度:1200碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ix)特征:
(A)名称/关键字:编码序列
(B)位置:1...1200
(D)其它信息:
(xi)序列描述:SEQ ID NO:27:GAT CCT AAA AAT AAA GAG TAC ACA GGG ACC ATA CTC TTT TCT GGA GAA 48Asp Pro Lys Asn Lys Glu Tyr Thr Gly Thr Ile Leu Phe Ser Gly Glu1 5 10 15AAG AGT CTA GCA AAC GAT CCT AGG GAT TTT AAA TCT ACA ATC CCT CAG 96Lys Ser Leu Ala Asn Asp Pro Arg Asp Phe Lys Ser Thr Ile Pro Gln
20 25 30AAC GTC AAC CTG TCT GCA GGA TAC TTA GTT ATT AAA GAG GGG GCC GAA 144Asn Val Asn Leu Ser Ala Gly Tyr Leu Val Ile Lys Glu Gly Ala Glu
35 40 45GTC ACA GTT TCA AAA TTC ACG CAG TCT CCA GGA TCG CAT TTA GTT TTA 192Val Thr Val Ser Lys Phe Thr Gln Ser Pro Gly Ser His Leu Val Leu
50 55 60GAT TTA GGA ACC AAA CTG ATA GCC TCT AAG GAA GAC ATT GCC ATC ACA 240Asp Leu Gly Thr Lys Leu Ile Ala Ser Lys Glu Asp Ile Ala Ile Thr65 70 75 80GGC CTC GCG ATA GAT ATA GAT AGC TTA AGC TCA TCC TCA ACA GCA GCT 288Gly Leu Ala Ile Asp Ile Asp Ser Leu Ser Ser Ser Ser Thr Ala Ala
85 90 95GTT ATT AAA GCA AAC ACC GCA AAT AAA CAG ATA TCC GTG ACG GAC TCT 336Val Ile Lys Ala Asn Thr Ala Asn Lys Gln Ile Ser Val Thr Asp Ser
100 105 110ATA GAA CTT ATC TCG CCT ACT GGC AAT GCC TAT GAA GAT CTC AGA ATG 384Ile Glu Leu Ile Ser Pro Thr Gly Asn Ala Tyr Glu Asp Leu Arg Met
115 120 125AGA AAT TCA CAG ACG TTC CCT CTG CTC TCT TTA GAG CCT GGA GCC GGG 432Arg Asn Ser Gln Thr Phe Pro Leu Leu Ser Leu Glu Pro Gly Ala Gly
130 135 140GGT AGT GTG ACT GTA ACT GCT GGA GAT TTC CTA CCG GTA AGT CCC CAT 480Gly Ser Val Thr Val Thr Ala Gly Asp Phe Leu Pro Val Ser Pro His145 150 155 160TAT GGT TTT CAA GGC AAT TGG AAA TTA GCT TGG ACA GGA ACT GGA AAC 528Tyr Gly Phe Gln Gly Asn Trp Lys Leu Ala Trp Thr Gly Thr Gly Asn
165 170 175AAA GTT GGA GAA TTC TTC TGG GAT AAA ATA AAT TAT AAG CCT AGA CCT 576Lys Val Gly Glu Phe Phe Trp Asp Lys Ile Asn Tyr Lys Pro Arg Pro
180 185 190GAA AAA GAA GGA AAT TTA GTT CCT AAT ATC TTG TGG GGG AAT GCT GTA 624Glu Lys Glu Gly Asn Leu Val Pro Asn Ile Leu Trp Gly Asn Ala Val
195 200 205AAT GTC AGA TCC TTA ATG CAG GTT CAA GAG ACC CAT GCA TCG AGC TTA 672Asn Val Arg Ser Leu Met Gln Val Gln Glu Thr His Ala Ser Ser Leu
210 215 220CAG ACA GAT CGA GGG CTG TGG ATC GAT GGA ATT GGG AAT TTC TTC CAT 720Gln Thr Asp Arg Gly Leu Trp Ile Asp Gly Ile Gly Asn Phe Phe His225 230 235 240GTA TCT GCC TCC GAA GAC AAT ATA AGG TAC CGT CAT AAC AGC GGT GGA 768Val Ser Ala Ser Glu Asp Asn Ile Arg Tyr Arg His Asn Ser Gly Gly
245 250 255TAT GTT CTA TCT GTA AAT AAT GAG ATC ACA CCT AAG CAC TAT ACT TCG 816Tyr Val Leu Ser Val Asn Asn Glu Ile Thr Pro Lys His Tyr Thr Ser
260 265 270ATG GCA TTT TCC CAA CTC TTT AGT AGA GAC AAA GAC TAT GCG GTT TCC 864Met Ala Phe Ser Gln Leu Phe Ser Arg Asp Lys Asp Tyr Ala Val Ser
275 280 285AAC AAC GAA TAC AGA ATG TAT TTA GGA TCG TAT CTC TAT CAA TAT ACA 912Asn Asn Glu Tyr Arg Met Tyr Leu Gly Ser Tyr Leu Tyr Gln Tyr Thr
290 295 300ACC TCC CTA GGG AAT ATT TTC CGT TAT GCT TCG CGT AAC CCT AAT GTA 960Thr Ser Leu Gly Asn Ile Phe Arg Tyr Ala Ser Arg Asn Pro Asn Val305 310 315 320AAC GTC GGG ATT CTC TCA AGA AGG TTT CTT CAA AAT CCT CTT ATG ATT 1008Asn Val Gly Ile Leu Ser Arg Arg Phe Leu Gln Asn Pro Leu Met Ile
325 330 335TTT CAT TTT TTG TGT GCT TAT GGT CAT GCC ACC AAT GAT ATG AAA ACA 1056Phe His Phe Leu Cys Ala Tyr Gly His Ala Thr Asn Asp Met Lys Thr
340 345 350GAC TAC GCA AAT TTC CCT ATG GTG AAA AAC AGC TGG AGA AAC AAT TGT 1104Asp Tyr Ala Asn Phe Pro Met Val Lys Asn Ser Trp Arg Asn Asn Cys
355 360 365TGG GCT ATA AAA TGC GGA GGG AGC ATG CCT CTA TTG GTA TTT GAA AAC 1152Trp Ala Ile Lys Cys Gly Gly Ser Met Pro Leu Leu Val Phe Glu Asn
370 375 380GGA AAA CTT TTC CAA GGT GCC ATC CCA TTT ATG AAA CTA CAA TTA GTT 1200Gly Lys Leu Phe Gln Gly Ala Ile Pro Phe Met Lys Leu Gln Leu Val385 390 395 400
(2)SEQ ID NO:28的信息:
(i)序列特征:
(A)长度:400氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:蛋白质
(v)片段类型:内部
(xi)序列描述:SEQ ID NO:28:Asp Pro Lys Asn Lys Glu Tyr Thr Gly Thr Ile Leu Phe Ser Gly Glu1 5 10 15Lys Ser Leu Ala Asn Asp Pro Arg Asp Phe Lys Ser Thr Ile Pro Gln
20 25 30Asn Val Asn Leu Ser Ala Gly Tyr Leu Val Ile Lys Glu Gly Ala Glu
35 40 45Val Thr Val Ser Lys Phe Thr Gln Ser Pro Gly Ser His Leu Val Leu
50 55 60Asp Leu Gly Thr Lys Leu Ile Ala Ser Lys Glu Asp Ile Ala Ile Thr65 70 75 80Gly Leu Ala Ile Asp Ile Asp Ser Leu Ser Ser Ser Ser Thr Ala Ala
85 90 95Val Ile Lys Ala Asn Thr Ala Asn Lys Gln Ile Ser Val Thr Asp Ser
100 105 110Ile Glu Leu Ile Ser Pro Thr Gly Asn Ala Tyr Glu Asp Leu Arg Met
115 120 125Arg Asn Ser Gln Thr Phe Pro Leu Leu Ser Leu Glu Pro Gly Ala Gly
130 135 140Gly Ser Val Thr Val Thr Ala Gly Asp Phe Leu Pro Val Ser Pro His145 150 155 160Tyr Gly Phe Gln Gly Asn Trp Lys Leu Ala Trp Thr Gly Thr Gly Asn
165 170 175Lys Val Gly Glu Phe Phe Trp Asp Lys Ile Asn Tyr Lys Pro Arg Pro
180 185 190Glu Lys Glu Gly Asn Leu Val Pro Asn Ile Leu Trp Gly Asn Ala Val
195 200 205Asn Val Arg Ser Leu Met Gln Val Gln Glu Thr His Ala Ser Ser Leu
210 215 220Gln Thr Asp Arg Gly Leu Trp Ile Asp Gly Ile Gly Asn Phe Phe His225 230 235 240Val Ser Ala Ser Glu Asp Asn Ile Arg Tyr Arg His Asn Ser Gly Gly
245 250 255Tyr Val Leu Ser Val Asn Asn Glu Ile Thr Pro Lys His Tyr Thr Ser
260 265 270Met Ala Phe Ser Gln Leu Phe Ser Arg Asp Lys Asp Tyr Ala Val Ser
275 280 285Asn Asn Glu Tyr Arg Met Tyr Leu Gly Ser Tyr Leu Tyr Gln Tyr Thr
290 295 300Thr Ser Leu Gly Asn Ile Phe Arg Tyr Ala Ser Arg Asn Pro Asn Val305 310 315 320Asn Val Gly Ile Leu Ser Arg Arg Phe Leu Gln Asn Pro Leu Met Ile
325 330 335Phe His Phe Leu Cys Ala Tyr Gly His Ala Thr Asn Asp Met Lys Thr
340 345 350Asp Tyr Ala Asn Phe Pro Met Val Lys Asn Ser Trp Arg Asn Asn Cys
355 360 365Trp Ala Ile Lys Cys Gly Gly Ser Met Pro Leu Leu Val Phe Glu Asn
370 375 380Gly Lys Leu Phe Gln Gly Ala Ile Pro Phe Met Lys Leu Gln Leu Val385 390 395 400
(2)SEQ ID NO:29的信息:
(i)序列特征:
(A)长度:1830碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:cDNA
(ix)特征:
(A)名称/关键字:编码序列
(B)位置:1...1830
(D)其它信息:
(xi)序列描述:SEQ ID NO:29:GAT CTC ACA TTA GGG AGT CGT GAC AGT TAT AAT GGT GAT ACA AGC ACC 48Asp Leu Thr Leu Gly Ser Arg Asp Ser Tyr Asn Gly Asp Thr Ser Thr1 5 10 15ACA GAA TTT ACT CCT AAA GCG GCA ACT TCT GAT GCT AGT GGC ACG ACC 96Thr Glu Phe Thr Pro Lys Ala Ala Thr Ser Asp Ala Ser Gly Thr Thr
20 25 30TAT ATT CTC GAT GGG GAT GTC TCG ATA AGC CAA GCA GGG AAA CAA ACG 144Tyr Ile Leu Asp Gly Asp Val Ser Ile Ser Gln Ala Gly Lys Gln Thr
35 40 45AGC TTA ACC ACA AGT TGT TTT TCT AAC ACT GCA GGA AAT CTT ACC TTC 192Ser Leu Thr Thr Ser Cys Phe Ser Asn Thr Ala Gly Asn Leu Thr Phe
50 55 60TTA GGG AAC GGA TTT TCT CTT CAT TTT GAC AAT ATT ATT TCG TCT ACT 240Leu Gly Asn Gly Phe Ser Leu His Phe Asp Asn Ile Ile Ser Ser Thr65 70 75 80GTT GCA GGT GTT GTT GTT AGC AAT ACA GCA GCT TCT GGG ATT ACG AAA 288Val Ala Gly Val Val Val Ser Asn Thr Ala Ala Ser Gly Ile Thr Lys
85 90 95TTC TCA GGA TTT TCA ACT CTT CGG ATG CTT GCA GCT CCT AGG ACC ACA 336Phe Ser Gly Phe Ser Thr Leu Arg Met Leu Ala Ala Pro Arg Thr Thr
100 105 110GGT AAA GGA GCC ATT AAA ATT ACC GAT GGT CTG GTG TTT GAG AGT ATA 384Gly Lys Gly Ala Ile Lys Ile Thr Asp Gly Leu Val Phe Glu Ser Ile
115 120 125GGG AAT CTT GAT CCG ATT ACT GTA ACA GGA TCG ACA TCT GTT GCT GAT 432Gly Asn Leu Asp Pro Ile Thr Val Thr Gly Ser Thr Ser Val Ala Asp
130 135 140GCT CTC AAT ATT AAT AGC CCT GAT ACT GGA GAT AAC AAA GAG TAT ACG 480Ala Leu Asn Ile Asn Ser Pro Asp Thr Gly Asp Asn Lys Glu Tyr Thr145 150 155 160GGA ACC ATA GTC TTT TCT GGA GAG AAG CTC ACG GAG GCA GAA GCT AAA 528Gly Thr Ile Val Phe Ser Gly Glu Lys Leu Thr Glu Ala Glu Ala Lys
165 170 175GAT GAG AAG AAC CGC ACT TCT AAA TTA CTT CAA AAT GTT GCT TTT AAA 576Asp Glu Lys Asn Arg Thr Ser Lys Leu Leu Gln Asn Val Ala Phe Lys
180 185 190AAT GGG ACT GTA GTT TTA AAA GGT GAT GTC GTT TTA AGT GCG AAC GGT 624Asn Gly Thr Val Val Leu Lys Gly Asp Val Val Leu Ser Ala Asn Gly
195 200 205TTC TCT CAG GAT GCA AAC TCT AAG TTG ATT ATG GAT TTA GGG ACG TCG 672Phe Ser Gln Asp Ala Asn Ser Lys Leu Ile Met Asp Leu Gly Thr Ser
210 215 220TTG GTT GCA AAC ACC GAA AGT ATC GAG TTA ACG AAT TTG GAA ATT AAT 720Leu Val Ala Asn Thr Glu Ser Ile Glu Leu Thr Asn Leu Glu Ile Asn225 230 235 240ATA GAC TCT CTC AGG AAC GGG AAA AAG ATA AAA CTC AGT GCT GCC ACA 768Ile Asp Ser Leu Arg Asn Gly Lys Lys Ile Lys Leu Ser Ala Ala Thr
245 250 255GCT CAG AAA GAT ATT CGT ATA GAT CGT CCT GTT GTA CTG GCA ATT AGC 816Ala Gln Lys Asp Ile Arg Ile Asp Arg Pro Val Val Leu Ala Ile Ser
260 265 270GAT GAG AGT TTT TAT CAA AAT GGC TTT TTG AAT GAG GAC CAT TCC TAT 864Asp Glu Ser Phe Tyr Gln Asn Gly Phe Leu Asn Glu Asp His Ser Tyr
275 280 285GAT GGG ATT CTT GAG TTA GAT GCT GGG AAA GAC ATC GTG ATT TCT GCA 912Asp Gly Ile Leu Glu Leu Asp Ala Gly Lys Asp Ile Val Ile Ser Ala
290 295 300GAT TCT CGC AGT ATA GAT GCT GTA CAA TCT CCG TAT GGC TAT CAG GGA 960Asp Ser Arg Ser Ile Asp Ala Val Gln Ser Pro Tyr Gly Tyr Gln Gly305 310 315 320AAG TGG ACG ATC AAT TGG TCT ACT GAT GAT AAG AAA GCT ACG GTT TCT 1008Lys Trp Thr Ile Asn Trp Ser Thr Asp Asp Lys Lys Ala Thr Val Ser
325 330 335TGG GCG AAG CAG AGT TTT AAT CCC ACT GCT GAG CAG GAG GCT CCG TTA 1056Trp Ala Lys Gln Ser Phe Asn Pro Thr Ala Glu Gln Glu Ala Pro Leu
340 345 350GTT CCT AAT CTT CTT TGG GGT TCT TTT ATA GAT GTT CGT TCC TTC CAG 1104Val Pro Asn Leu Leu Trp Gly Ser Phe Ile Asp Val Arg Ser Phe Gln
355 360 365AAT TTT ATA GAG CTA GGT ACT GAA GGT GCT CCT TAC GAA AAG AGA TTT 1152Asn Phe Ile Glu Leu Gly Thr Glu Gly Ala Pro Tyr Glu Lys Arg Phe
370 375 380TGG GTT GCA GGC ATT TCC AAT GTT TTG CAT AGG AGC GGT CGT GAA AAT 1200Trp Val Ala Gly Ile Ser Asn Val Leu His Arg Ser Gly Arg Glu Asn385 390 395 400CAA AGG AAA TTC CGT CAT GTG AGT GGA GGT GCT GTA GTA GGT GCT AGC 1248Gln Arg Lys Phe Arg His Val Ser Gly Gly Ala Val Val Gly Ala Ser
405 410 415ACG AGG ATG CCG GGT GGT GAT ACC TTG TCT CTG GGT TTT GCT CAG CTC 1296Thr Arg Met Pro Gly Gly Asp Thr Leu Ser Leu Gly Phe Ala Gln Leu
420 425 430TTT GCG CGT GAC AAA GAC TAC TTT ATG AAT ACC AAT TTC GCA AAG ACC 1344Phe Ala Arg Asp Lys Asp Tyr Phe Met Asn Thr Asn Phe Ala Lys Thr
435 440 445TAC GCA GGA TCT TTA CGT TTG CAG CAC GAT GCT TCC CTA TAC TCT GTG 1392Tyr Ala Gly Ser Leu Arg Leu Gln His Asp Ala Ser Leu Tyr Ser Val
450 455 460GTG AGT ATC CTT TTA GGA GAG GGA GGA CTC CGC GAG ATC CTG TTG CCT 1440Val Ser Ile Leu Leu Gly Glu Gly Gly Leu Arg Glu Ile Leu Leu Pro465 470 475 480TAT GTT TCC AAT ACT CTG CCG TGC TCT TTC TAT GGG CAG CTT AGC TAC 1488Tyr Val Ser Asn Thr Leu Pro Cys Ser Phe Tyr Gly Gln Leu Ser Tyr
485 490 495GGC CAT ACG GAT CAT CGC ATG AAG ACC GAG TCT CTA CCC CCC CCC CCC 1536Gly His Thr Asp His Arg Met Lys Thr Glu Ser Leu Pro Pro Pro Pro
500 505 510CCG ACG CTC TCG ACG GAT CAT ACT TCT TGG GGA GGA TAT GTC TGG GCT 1584Pro Thr Leu Ser Thr Asp His Thr Ser Trp Gly Gly Tyr Val Trp Ala
515 520 525GGA GAG CTG GGA ACT CGA GTT GCT GTT GAA AAT ACC AGC GGC AGA GGA 1632Gly Glu Leu Gly Thr Arg Val Ala Val Glu Asn Thr Ser Gly Arg Gly
530 535 540TTT TTC CGA GAG TAC ACT CCA TTT GTA AAA GTC CAA GCT GTT TAC TCG 1680Phe Phe Arg Glu Tyr Thr Pro Phe Val Lys Val Gln Ala Val Tyr Ser545 550 555 560CGC CAA GAT AGC TTT GTT GAA CTA GGA GCT ATC AGT CGT GAT TTT AGT 1728Arg Gln Asp Ser Phe Val Glu Leu Gly Ala Ile Ser Arg Asp Phe Ser
565 570 575GAT TCG CAT CTT TAT AAC CTT GCG ATT CCT CTT GGA ATC AAG TTA GAG 1776Asp Ser His Leu Tyr Asn Leu Ala Ile Pro Leu Gly Ile Lys Leu Glu
580 585 590AAA CGG TTT GCA GAG CAA TAT TAT CAT GTT GTT GCG ATG TAT TCT CCA 1824Lys Arg Phe Ala Glu Gln Tyr Tyr His Val Val Ala Met Tyr Ser Pro
595 600 605GAT GTT 1830Asp Val
610
(2)SEQ ID NO:30的信息:
(i)序列特征:
(A)长度:610氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ii)分子类型:蛋白质
(v)片段类型:内部
(xi)序列描述:SEQ ID NO:30:Asp Leu Thr Leu Gly Ser Arg Asp Ser Tyr Asn Gly Asp Thr Ser Thr1 5 10 15Thr Glu Phe Thr Pro Lys Ala Ala Thr Ser Asp Ala Ser Gly Thr Thr
20 25 30Tyr Ile Leu Asp Gly Asp Val Ser Ile Ser Gln Ala Gly Lys Gln Thr
35 40 45Ser Leu Thr Thr Ser Cys Phe Ser Asn Thr Ala Gly Asn Leu Thr Phe
50 55 60Leu Gly Asn Gly Phe Ser Leu His Phe Asp Asn Ile Ile Ser Ser Thr65 70 75 80Val Ala Gly Val Val Val Ser Asn Thr Ala Ala Ser Gly Ile Thr Lys
85 90 95Phe Ser Gly Phe Ser Thr Leu Arg Met Leu Ala Ala Pro Arg Thr Thr
100 105 110Gly Lys Gly Ala Ile Lys Ile Thr Asp Gly Leu Val Phe Glu Ser Ile
115 120 125Gly Asn Leu Asp Pro Ile Thr Val Thr Gly Ser Thr Ser Val Ala Asp
130 135 140Ala Leu Asn Ile Asn Ser Pro Asp Thr Gly Asp Asn Lys Glu Tyr Thr145 150 155 160Gly Thr Ile Val Phe Ser Gly Glu Lys Leu Thr Glu Ala Glu Ala Lys
165 170 175Asp Glu Lys Asn Arg Thr Ser Lys Leu Leu Gln Asn Val Ala Phe Lys
180 185 190Asn Gly Thr Val Val Leu Lys Gly Asp Val Val Leu Ser Ala Asn Gly
195 200 205Phe Ser Gln Asp Ala Asn Ser Lys Leu Ile Met Asp Leu Gly Thr Ser
210 215 220Leu Val Ala Asn Thr Glu Ser Ile Glu Leu Thr Asn Leu Glu Ile Asn225 230 235 240Ile Asp Ser Leu Arg Asn Gly Lys Lys Ile Lys Leu Ser Ala Ala Thr
245 250 255Ala Gln Lys Asp Ile Arg Ile Asp Arg Pro Val Val Leu Ala Ile Ser
260 265 270Asp Glu Ser Phe Tyr Gln Asn Gly Phe Leu Asn Glu Asp His Ser Tyr
275 280 285Asp Gly Ile Leu Glu Leu Asp Ala Gly Lys Asp Ile Val Ile Ser Ala
290 295 300Asp Ser Arg Ser Ile Asp Ala Val Gln Ser Pro Tyr Gly Tyr Gln Gly305 310 315 320Lys Trp Thr Ile Asn Trp Ser Thr Asp Asp Lys Lys Ala Thr Val Ser
325 330 335Trp Ala Lys Gln Ser Phe Asn Pro Thr Ala Glu Gln Glu Ala Pro Leu
340 345 350Val Pro Asn Leu Leu Trp Gly Ser Phe Ile Asp Val Arg Ser Phe Gln
355 360 365Asn Phe Ile Glu Leu Gly Thr Glu Gly Ala Pro Tyr Glu Lys Arg Phe
370 375 380Trp Val Ala Gly Ile Ser Asn Val Leu His Arg Ser Gly Arg Glu Asn385 390 395 400Gln Arg Lys Phe Arg His Val Ser Gly Gly Ala Val Val Gly Ala Ser
405 410 415Thr Arg Met Pro Gly Gly Asp Thr Leu Ser Leu Gly Phe Ala Gln Leu
420 425 430Phe Ala Arg Asp Lys Asp Tyr Phe Met Asn Thr Asn Phe Ala Lys Thr
435 440 445Tyr Ala Gly Ser Leu Arg Leu Gln His Asp Ala Ser Leu Tyr Ser Val
450 455 460Val Ser Ile Leu Leu Gly Glu Gly Gly Leu Arg Glu Ile Leu Leu Pro465 470 475 480Tyr Val Ser Asn Thr Leu Pro Cys Ser Phe Tyr Gly Gln Leu Ser Tyr
485 490 495Gly His Thr Asp His Arg Met Lys Thr Glu Ser Leu Pro Pro Pro Pro
500 505 510Pro Thr Leu Ser Thr Asp His Thr Ser Trp Gly Gly Tyr Val Trp Ala
515 520 525Gly Glu Leu Gly Thr Arg Val Ala Val Glu Asn Thr Ser Gly Arg Gly
530 535 540Phe Phe Arg Glu Tyr Thr Pro Phe Val Lys Val Gln Ala Val Tyr Ser545 550 555 560Arg Gln Asp Ser Phe Val Glu Leu Gly Ala Ile Ser Arg Asp Phe Ser
565 570 575Asp Ser His Leu Tyr Asn Leu Ala Ile Pro Leu Gly Ile Lys Leu Glu
580 585 590Lys Arg Phe Ala Glu Gln Tyr Tyr His Val Val Ala Met Tyr Ser Pro
595 600 605Asp Val
610
Claims (17)
1.一种鉴定哺乳动物如人感染肺炎衣原体的种特异性诊断测试方法,所述测试方法包括检测患者体内或患者样品内是否有针对肺炎衣原体外膜的一种或多种蛋白的抗体存在,所述蛋白的分子量为100.3-89.6kDa或56.1kDa,或检测编码所述外膜蛋白的核酸片段是否存在。
2.根据权利要求1所述的诊断测试方法,其中外膜蛋白具有SEQ ID NO:2、SEQID NO:4、SEQ ID NO:6、SEQ ID NO:8、SEQ ID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18、SEQ ID NO:20、SEQ ID NO:22或SEQ ID NO:24所示的序列,或其变体或亚序列。
3.根据权利要求1所述的诊断测试方法,其中核酸片段具有SEQ ID NO:1、SEQID NO:3、SEQ ID NO:5、SEQ ID NO:7、SEQ ID NO:9、SEQ ID NO:11、SEQ ID NO:13、SEQ ID NO:15、SEQ ID NO:17、SEQ ID NO:19、SEQ ID NO:21或SEQ ID NO:23所示的序列或其变体或亚序列。
4.根据权利要求3所述的诊断测试方法,其中用核酸扩增检测核酸片段。
5.根据权利要求4所述的诊断测试方法,其中用聚合酶链反应检测核酸片段。
6.一种衍生自肺炎衣原体的核酸片段,它包含SEQ ID NO:1、SEQ ID NO:3、SEQ ID NO:5、SEQ ID NO:7、SEQ ID NO:9、SEQ ID NO:11、SEQ ID NO:13、SEQID NO:15、SEQ ID NO:17、SEQ ID NO:19、SEQ ID NO:21或SEQ ID NO:23所示的核苷酸序列,或与任一所述序列的序列同源性至少为50%的所述核苷酸序列的变体或亚序列。
7.一种衍生自肺炎衣原体的蛋白质,它具有SEQ ID NO:2、SEQ ID NO:4、SEQID NO:6、SEQ ID NO:8、SEQ ID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ IDNO:16、SEQ ID NO:18、SEQ ID NO:20、SEQ ID NO:22或SEQ ID NO:24所示的氨基酸序列,或序列同源性至少为50%且具有类似生物功能的它的变体或亚序列。
8.一种多克隆单特异性抗体,它针对具有SEQ ID NO:2、SEQ ID NO:4、SEQ IDNO:6、SEQ ID NO:8、SEQ ID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18、SEQ ID NO:20、SEQ ID NO:22或SEQ ID NO:24所示序列的蛋白或其变体或亚序列。
9.一种用来诊断哺乳动物如人感染肺炎衣原体的诊断试剂盒,所述试剂盒包含具有SEQ ID NO:2、SEQ ID NO:4、SEQ ID NO:6、SEQ ID NO:8、SEQ ID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18、SEQ ID NO:20、SEQ ID NO:22或SEQ ID NO:24所示氨基酸序列的蛋白质或其变体或亚序列。
10.一种用来诊断哺乳动物如人感染肺炎衣原体的诊断试剂盒,所述试剂盒包含抗体,该抗体针对具有SEQ ID NO:2、SEQ ID NO:4、SEQ ID NO:6、SEQ ID NO:8、SEQ ID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18、SEQ ID NO:20、SEQ ID NO:22或SEQ ID NO:24所示氨基酸序列的蛋白或其变体或亚序列。
11.一种用来诊断哺乳动物如人感染肺炎衣原体的诊断试剂盒,所述试剂盒包含具有SEQ ID NO:1、SEQ ID NO:3、SEQ ID NO:5、SEQ ID NO:7、SEQ ID NO:9、SEQ ID NO:11、SEQ ID NO:13、SEQ ID NO:15、SEQ ID NO:17、SEQ ID NO:19、SEQ ID NO:21或SEQ ID NO:23所示序列的核酸片段或其变体或亚序列。
12.一种用来免疫接种哺乳动物如人来抵抗肺炎衣原体的组合物,所述组合物包含具有SEQ ID NO:2、SEQ ID NO:4、SEQ ID NO:6、SEQ ID NO:8、SEQ ID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18、SEQ ID NO:20、SEQ ID NO:22或SEQ ID NO:24所示氨基酸序列的蛋白质或其变体或亚序列。
13.具有SEQ ID NO:2、SEQ ID NO:4、SEQ ID NO:6、SEQ ID NO:8、SEQ IDNO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18、SEQ IDNO:20、SEQ ID NO:22或SEQ ID NO:24所示序列的蛋白质或其变体或亚序列在诊断哺乳动物如人感染肺炎衣原体中的应用。
14.非变性形式的具有SEQ ID NO:2、SEQ ID NO:4、SEQ ID NO:6、SEQ ID NO:8、SEQ ID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18、SEQ ID NO:20、SEQ ID NO:22或SEQ ID NO:24所示序列的蛋白质或其变体或亚序列在诊断哺乳动物如人感染肺炎衣原体中的应用。
15.具有SEQ ID NO:2、SEQ ID NO:4、SEQ ID NO:6、SEQ ID NO:8、SEQ IDNO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18、SEQ IDNO:20、SEQ ID NO:22或SEQ ID NO:24所示序列的蛋白质或其变体或亚序列在免疫接种哺乳动物如人以抵抗肺炎衣原体中的应用。
16.非变性形式的具有SEQ ID NO:2、SEQ ID NO:4、SEQ ID NO:6、SEQ ID NO:8、SEQ ID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18、SEQ ID NO:20、SEQ ID NO:22或SEQ ID NO:24所示序列的蛋白质或其变体或亚序列在免疫接种哺乳动物如人以抵抗肺炎衣原体中的应用。
17.核酸片段或其变体或亚序列在免疫接种哺乳动物如人以抵抗肺炎衣原体中的应用,其中核酸片段具有SEQ ID NO:1、SEQ ID NO:3、SEQ ID NO:5、SEQ ID NO:7、SEQ ID NO:9、SEQ ID NO:11、SEQ ID NO:13、SEQ ID NO:15、SEQ ID NO:17、SEQ ID NO:19、SEQ ID NO:21或SEQ ID NO:23所示序列,所述变体或亚序列与所述任一序列的序列同源性至少为50%。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DK0744/97 | 1997-06-23 | ||
DK0744/1997 | 1997-06-23 | ||
DK74497 | 1997-06-23 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1261403A true CN1261403A (zh) | 2000-07-26 |
CN1249233C CN1249233C (zh) | 2006-04-05 |
Family
ID=8097044
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB988064286A Expired - Fee Related CN1249233C (zh) | 1997-06-23 | 1998-06-19 | 肺炎衣原体表面蛋白 |
Country Status (13)
Country | Link |
---|---|
US (2) | US7264941B1 (zh) |
EP (1) | EP1007685B1 (zh) |
JP (1) | JP4160640B2 (zh) |
CN (1) | CN1249233C (zh) |
AT (1) | ATE323164T1 (zh) |
AU (1) | AU749382B2 (zh) |
BR (1) | BR9810288A (zh) |
CA (1) | CA2294651A1 (zh) |
DE (1) | DE69834194T2 (zh) |
DK (1) | DK1007685T3 (zh) |
ES (1) | ES2264810T3 (zh) |
PT (1) | PT1007685E (zh) |
WO (1) | WO1998058953A2 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113785198A (zh) * | 2019-05-06 | 2021-12-10 | 阿罗塞尔公司 | 呼吸道感染检测和分类 |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6710033B1 (en) | 1996-08-14 | 2004-03-23 | Vanderbilt University | Methods and treatment of multiple sclerosis |
US6890526B2 (en) | 1997-05-06 | 2005-05-10 | Vanderbilt University | Methods and reagents for the treatment of multiple sclerosis |
EP1129202A2 (en) * | 1998-10-28 | 2001-09-05 | Aventis Pasteur Limited | Chlamydia antigenes and corresponding dna fragments and uses thereof |
US6649370B1 (en) * | 1998-10-28 | 2003-11-18 | Aventis Pasteur Limited | Chlamydia antigens and corresponding DNA fragments and uses thereof |
US6872814B2 (en) * | 1998-10-29 | 2005-03-29 | Aventis Pasteur Limited | Chlamydia antigens and corresponding DNA fragments and uses thereof |
JP2002531095A (ja) * | 1998-12-01 | 2002-09-24 | アベンティス、パストゥール、リミテッド | クラミジア抗原および対応するdna断片ならびにその使用 |
AU3790900A (en) | 1998-12-01 | 2000-06-19 | Aventis Pasteur Limited | Chlamydia antigens and corresponding DNA fragments and uses thereof |
EP2172214A3 (en) | 1999-03-12 | 2011-03-16 | Aventis Pasteur Limited | Chlamydia antigens and corresponding DNA fragments and uses thereof |
DE60036698T2 (de) * | 1999-05-03 | 2008-07-24 | Sanofi Pasteur Ltd., Toronto | Chlamydia-antigene und entsprechende dna-fragmente und deren verwendungen |
NZ517952A (en) * | 1999-09-20 | 2004-01-30 | Aventis Pasteur | Chlamydia antigens and corresponding DNA fragments and uses thereof |
US20010048927A1 (en) | 2000-02-01 | 2001-12-06 | Richard Stephens | Porin B (PorB) as a therapeutic target for prevention and treatment of infection by chlamydia |
CA2414884A1 (en) * | 2000-07-03 | 2002-01-10 | Chiron S.P.A. | Immunisation against chlamydia pneumoniae |
US7105171B2 (en) | 2002-03-07 | 2006-09-12 | The Regents Of The University Of California | Porin B (PorB) as a therapeutic target for prevention and treatment of infection by Chlamydia |
US11091792B2 (en) * | 2016-05-04 | 2021-08-17 | Children's Hospital & Research Center At Oakland | Rapid extraction of nucleic acids from clinical samples for downstream applications |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU692889B2 (en) * | 1994-08-03 | 1998-06-18 | Hitachi Chemical Company, Ltd. | Monoclonal antibodies having a reactivity specific to Chlamydia pneumoniae, method for production of the monoclonal antibodies, such antibody producing cells, method for production of the cells, method for detection and/or measurement of Chlamydia pneumoniae, reagents for the method, method and agents for diagnosis of chlamydia pneumoniae infection |
WO1996009320A1 (en) | 1994-09-20 | 1996-03-28 | Hitachi Chemical Company, Ltd. | Antigenic polypeptide of chlamydia pneumoniae |
US6248329B1 (en) * | 1998-06-01 | 2001-06-19 | Ramaswamy Chandrashekar | Parasitic helminth cuticlin nucleic acid molecules and uses thereof |
-
1998
- 1998-06-19 CA CA002294651A patent/CA2294651A1/en not_active Abandoned
- 1998-06-19 DK DK98928179T patent/DK1007685T3/da active
- 1998-06-19 AU AU80119/98A patent/AU749382B2/en not_active Ceased
- 1998-06-19 ES ES98928179T patent/ES2264810T3/es not_active Expired - Lifetime
- 1998-06-19 PT PT98928179T patent/PT1007685E/pt unknown
- 1998-06-19 EP EP98928179A patent/EP1007685B1/en not_active Expired - Lifetime
- 1998-06-19 DE DE69834194T patent/DE69834194T2/de not_active Expired - Lifetime
- 1998-06-19 US US09/446,677 patent/US7264941B1/en not_active Expired - Fee Related
- 1998-06-19 AT AT98928179T patent/ATE323164T1/de not_active IP Right Cessation
- 1998-06-19 JP JP50361399A patent/JP4160640B2/ja not_active Expired - Fee Related
- 1998-06-19 WO PCT/DK1998/000266 patent/WO1998058953A2/en active IP Right Grant
- 1998-06-19 BR BR9810288-5A patent/BR9810288A/pt not_active Application Discontinuation
- 1998-06-19 CN CNB988064286A patent/CN1249233C/zh not_active Expired - Fee Related
-
2007
- 2007-07-12 US US11/826,125 patent/US20080226679A1/en not_active Abandoned
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113785198A (zh) * | 2019-05-06 | 2021-12-10 | 阿罗塞尔公司 | 呼吸道感染检测和分类 |
Also Published As
Publication number | Publication date |
---|---|
ES2264810T3 (es) | 2007-01-16 |
US20080226679A1 (en) | 2008-09-18 |
AU8011998A (en) | 1999-01-04 |
WO1998058953A2 (en) | 1998-12-30 |
EP1007685B1 (en) | 2006-04-12 |
DE69834194T2 (de) | 2007-03-29 |
CN1249233C (zh) | 2006-04-05 |
PT1007685E (pt) | 2006-08-31 |
AU749382B2 (en) | 2002-06-27 |
EP1007685A2 (en) | 2000-06-14 |
CA2294651A1 (en) | 1998-12-30 |
DK1007685T3 (da) | 2006-08-14 |
ATE323164T1 (de) | 2006-04-15 |
DE69834194D1 (de) | 2006-05-24 |
BR9810288A (pt) | 2000-09-19 |
JP2002510970A (ja) | 2002-04-09 |
WO1998058953A3 (en) | 1999-03-18 |
JP4160640B2 (ja) | 2008-10-01 |
US7264941B1 (en) | 2007-09-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1284965A (zh) | 脑膜炎奈瑟氏球菌的新表面蛋白 | |
CN1261403A (zh) | 新的肺炎衣原体表面蛋白 | |
CN1268637C (zh) | 衣原体蛋白、其基因序列及用途 | |
CN1117149C (zh) | 用于免疫治疗和诊断结核病的化合物和方法 | |
CN1222618C (zh) | 卡他摩拉克菌外膜蛋白-106多肽及其基因序列和用途 | |
CN1235555A (zh) | 治疗和诊断分枝杆菌感染的化合物和方法 | |
CN1437653A (zh) | 抗原多肽 | |
CN1251611A (zh) | 粘膜炎莫拉菌的uspa1和uspa2抗原 | |
CN1192241A (zh) | Hsp70家族的链球菌热休克蛋白 | |
CN1317014A (zh) | 检测微生物的抗体 | |
CN1312723A (zh) | 诊断结核病的化合物和方法 | |
CN1200146A (zh) | 用于结核病诊断的化合物和方法 | |
CN1440419A (zh) | 治疗和预防细菌感染的化合物和方法 | |
CN1263436A (zh) | 在螺杆菌属基因组中鉴定编码新型螺杆菌属多肽的多核苷酸 | |
CN1309707A (zh) | 来自脑膜炎奈瑟氏球菌的basb029多核苷酸和多酞 | |
CN1351653A (zh) | 抗细菌疫苗组合物 | |
CN1336957A (zh) | 来自脑膜炎奈瑟氏球菌的basb006多核苷酸和多肽 | |
CN1235513A (zh) | 与幽门螺杆菌及其疫苗组合物有关的核酸和氨基酸序列 | |
CN1427892A (zh) | 毒力基因、蛋白及其用途 | |
CN1350585A (zh) | 脑膜炎奈瑟氏菌多肽basb052 | |
CN1856505A (zh) | 分泌型衣原体多肽、其编码多核苷酸及其治疗和诊断用途 | |
CN1204253C (zh) | 奈瑟氏球菌乳铁蛋白结合蛋白 | |
CN1245419C (zh) | 脑膜炎奈瑟氏球菌(Neisseria meningitidis)的多核苷酸和多肽BASB033及其用途 | |
CN1330714A (zh) | 毒力基因和蛋白质及其用途 | |
CN1342088A (zh) | 包含胆碱结合蛋白an-末端截取物的氨基酸的多肽、由该多肽衍生的疫苗及其应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C19 | Lapse of patent right due to non-payment of the annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |