CN1312723A - 诊断结核病的化合物和方法 - Google Patents
诊断结核病的化合物和方法 Download PDFInfo
- Publication number
- CN1312723A CN1312723A CN99809541A CN99809541A CN1312723A CN 1312723 A CN1312723 A CN 1312723A CN 99809541 A CN99809541 A CN 99809541A CN 99809541 A CN99809541 A CN 99809541A CN 1312723 A CN1312723 A CN 1312723A
- Authority
- CN
- China
- Prior art keywords
- seq
- sequence
- polypeptide
- ala
- dna sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 201000008827 tuberculosis Diseases 0.000 title claims abstract description 145
- 238000000034 method Methods 0.000 title claims description 86
- 238000003745 diagnosis Methods 0.000 title description 11
- 150000001875 compounds Chemical class 0.000 title description 4
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 227
- 229920001184 polypeptide Polymers 0.000 claims abstract description 197
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 194
- 241000187479 Mycobacterium tuberculosis Species 0.000 claims abstract description 190
- 239000000427 antigen Substances 0.000 claims abstract description 181
- 108091007433 antigens Proteins 0.000 claims abstract description 181
- 102000036639 antigens Human genes 0.000 claims abstract description 181
- 108020004414 DNA Proteins 0.000 claims abstract description 74
- 230000000890 antigenic effect Effects 0.000 claims abstract description 61
- 238000001514 detection method Methods 0.000 claims abstract description 48
- 239000012472 biological sample Substances 0.000 claims abstract description 39
- 150000001413 amino acids Chemical class 0.000 claims description 274
- 235000001014 amino acid Nutrition 0.000 claims description 272
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 218
- 210000002966 serum Anatomy 0.000 claims description 130
- 108090000623 proteins and genes Proteins 0.000 claims description 94
- 102000004169 proteins and genes Human genes 0.000 claims description 77
- 235000018102 proteins Nutrition 0.000 claims description 75
- 238000012360 testing method Methods 0.000 claims description 69
- 239000000523 sample Substances 0.000 claims description 64
- 230000004927 fusion Effects 0.000 claims description 59
- 208000015181 infectious disease Diseases 0.000 claims description 51
- 238000003752 polymerase chain reaction Methods 0.000 claims description 29
- 239000002773 nucleotide Substances 0.000 claims description 26
- 125000003729 nucleotide group Chemical group 0.000 claims description 26
- 210000004369 blood Anatomy 0.000 claims description 25
- 239000008280 blood Substances 0.000 claims description 25
- 239000003795 chemical substances by application Substances 0.000 claims description 24
- 230000000295 complement effect Effects 0.000 claims description 24
- 239000013615 primer Substances 0.000 claims description 23
- 239000003155 DNA primer Substances 0.000 claims description 21
- 239000007790 solid phase Substances 0.000 claims description 21
- 108020005187 Oligonucleotide Probes Proteins 0.000 claims description 16
- 239000002751 oligonucleotide probe Substances 0.000 claims description 16
- 210000004027 cell Anatomy 0.000 claims description 15
- 238000009396 hybridization Methods 0.000 claims description 14
- 125000006853 reporter group Chemical group 0.000 claims description 14
- 239000007767 bonding agent Substances 0.000 claims description 13
- 238000009007 Diagnostic Kit Methods 0.000 claims description 12
- 239000003755 preservative agent Substances 0.000 claims description 8
- 230000002335 preservative effect Effects 0.000 claims description 8
- 241000894007 species Species 0.000 claims description 8
- 206010036790 Productive cough Diseases 0.000 claims description 7
- 239000000463 material Substances 0.000 claims description 7
- 210000003802 sputum Anatomy 0.000 claims description 7
- 208000024794 sputum Diseases 0.000 claims description 7
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 claims description 6
- 239000013604 expression vector Substances 0.000 claims description 6
- 210000002381 plasma Anatomy 0.000 claims description 6
- 101710166488 6 kDa early secretory antigenic target Proteins 0.000 claims description 5
- 102000004190 Enzymes Human genes 0.000 claims description 5
- 108090000790 Enzymes Proteins 0.000 claims description 5
- 210000001175 cerebrospinal fluid Anatomy 0.000 claims description 5
- 210000003296 saliva Anatomy 0.000 claims description 5
- 210000002700 urine Anatomy 0.000 claims description 5
- 241000588724 Escherichia coli Species 0.000 claims description 4
- 230000003321 amplification Effects 0.000 claims description 4
- 230000005847 immunogenicity Effects 0.000 claims description 4
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 4
- 239000004033 plastic Substances 0.000 claims description 4
- 229920003023 plastic Polymers 0.000 claims description 4
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 3
- 229960002685 biotin Drugs 0.000 claims description 3
- 235000020958 biotin Nutrition 0.000 claims description 3
- 239000011616 biotin Substances 0.000 claims description 3
- 239000004816 latex Substances 0.000 claims description 3
- 229920000126 latex Polymers 0.000 claims description 3
- 101710186708 Agglutinin Proteins 0.000 claims description 2
- 101710146024 Horcolin Proteins 0.000 claims description 2
- 101710189395 Lectin Proteins 0.000 claims description 2
- 101710179758 Mannose-specific lectin Proteins 0.000 claims description 2
- 101710150763 Mannose-specific lectin 1 Proteins 0.000 claims description 2
- 101710150745 Mannose-specific lectin 2 Proteins 0.000 claims description 2
- 101710120037 Toxin CcdB Proteins 0.000 claims description 2
- 239000000910 agglutinin Substances 0.000 claims description 2
- 210000004962 mammalian cell Anatomy 0.000 claims description 2
- 238000003259 recombinant expression Methods 0.000 claims description 2
- 210000005253 yeast cell Anatomy 0.000 claims description 2
- 229920002160 Celluloid Polymers 0.000 claims 2
- 239000008187 granular material Substances 0.000 claims 1
- 239000000693 micelle Substances 0.000 claims 1
- 239000003153 chemical reaction reagent Substances 0.000 abstract description 6
- 102000053602 DNA Human genes 0.000 abstract 1
- 238000005034 decoration Methods 0.000 abstract 1
- 108020004707 nucleic acids Proteins 0.000 description 146
- 102000039446 nucleic acids Human genes 0.000 description 146
- 150000007523 nucleic acids Chemical class 0.000 description 146
- 239000002585 base Substances 0.000 description 145
- 125000003275 alpha amino acid group Chemical group 0.000 description 117
- 239000002299 complementary DNA Substances 0.000 description 93
- 102220023258 rs387907548 Human genes 0.000 description 60
- 102220369447 c.1352G>A Human genes 0.000 description 52
- 102220369446 c.1274G>A Human genes 0.000 description 48
- 230000009257 reactivity Effects 0.000 description 38
- 102220369445 c.668T>C Human genes 0.000 description 35
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 30
- 230000008521 reorganization Effects 0.000 description 30
- 238000012216 screening Methods 0.000 description 30
- 238000006243 chemical reaction Methods 0.000 description 26
- 102220023257 rs387907546 Human genes 0.000 description 26
- 241000193830 Bacillus <bacterium> Species 0.000 description 24
- 238000005516 engineering process Methods 0.000 description 22
- 102220023256 rs387907547 Human genes 0.000 description 22
- 238000000746 purification Methods 0.000 description 21
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 19
- 108010074328 Interferon-gamma Proteins 0.000 description 18
- 102100037850 Interferon gamma Human genes 0.000 description 17
- 101000702488 Rattus norvegicus High affinity cationic amino acid transporter 1 Proteins 0.000 description 17
- 239000006166 lysate Substances 0.000 description 17
- 238000002360 preparation method Methods 0.000 description 16
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 15
- 241000283973 Oryctolagus cuniculus Species 0.000 description 15
- 230000029087 digestion Effects 0.000 description 15
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 14
- 210000001744 T-lymphocyte Anatomy 0.000 description 14
- 238000010276 construction Methods 0.000 description 14
- 101100224410 Solanum tuberosum DPEP gene Proteins 0.000 description 13
- 239000001963 growth medium Substances 0.000 description 13
- 230000001580 bacterial effect Effects 0.000 description 12
- 108091034117 Oligonucleotide Proteins 0.000 description 11
- 238000009795 derivation Methods 0.000 description 11
- 230000000694 effects Effects 0.000 description 11
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 10
- 125000000729 N-terminal amino-acid group Chemical group 0.000 description 10
- 230000002441 reversible effect Effects 0.000 description 10
- 230000008878 coupling Effects 0.000 description 9
- 238000010168 coupling process Methods 0.000 description 9
- 238000010790 dilution Methods 0.000 description 9
- 239000012895 dilution Substances 0.000 description 9
- 239000003480 eluent Substances 0.000 description 9
- 238000005406 washing Methods 0.000 description 9
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 9
- UTGQNNCQYDRXCH-UHFFFAOYSA-N N,N'-diphenyl-1,4-phenylenediamine Chemical compound C=1C=C(NC=2C=CC=CC=2)C=CC=1NC1=CC=CC=C1 UTGQNNCQYDRXCH-UHFFFAOYSA-N 0.000 description 8
- 108700026244 Open Reading Frames Proteins 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 8
- 238000005859 coupling reaction Methods 0.000 description 8
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 8
- 239000012634 fragment Substances 0.000 description 8
- 210000003819 peripheral blood mononuclear cell Anatomy 0.000 description 8
- 241000725303 Human immunodeficiency virus Species 0.000 description 7
- 238000010521 absorption reaction Methods 0.000 description 7
- 238000011156 evaluation Methods 0.000 description 7
- 230000036039 immunity Effects 0.000 description 7
- 239000012528 membrane Substances 0.000 description 7
- 239000000203 mixture Substances 0.000 description 7
- 239000011780 sodium chloride Substances 0.000 description 7
- 230000009182 swimming Effects 0.000 description 7
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 6
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 6
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 6
- 239000007983 Tris buffer Substances 0.000 description 6
- 108010087924 alanylproline Proteins 0.000 description 6
- 108010068380 arginylarginine Proteins 0.000 description 6
- 239000000872 buffer Substances 0.000 description 6
- 230000007850 degeneration Effects 0.000 description 6
- 238000002405 diagnostic procedure Methods 0.000 description 6
- 201000010099 disease Diseases 0.000 description 6
- 238000002474 experimental method Methods 0.000 description 6
- 230000001939 inductive effect Effects 0.000 description 6
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 6
- 108010029020 prolylglycine Proteins 0.000 description 6
- 238000011160 research Methods 0.000 description 6
- 102220004457 rs11567847 Human genes 0.000 description 6
- 230000003248 secreting effect Effects 0.000 description 6
- 239000006228 supernatant Substances 0.000 description 6
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 6
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 5
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 5
- 101100084404 Mus musculus Prodh gene Proteins 0.000 description 5
- 241000186362 Mycobacterium leprae Species 0.000 description 5
- 102100023870 YLP motif-containing protein 1 Human genes 0.000 description 5
- 229960000190 bacillus calmette–guérin vaccine Drugs 0.000 description 5
- 239000012141 concentrate Substances 0.000 description 5
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 5
- 238000004128 high performance liquid chromatography Methods 0.000 description 5
- 238000002372 labelling Methods 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 4
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 4
- 239000007989 BIS-Tris Propane buffer Substances 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 4
- 241000283707 Capra Species 0.000 description 4
- 101100117387 Catharanthus roseus DPAS gene Proteins 0.000 description 4
- 238000001712 DNA sequencing Methods 0.000 description 4
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 4
- 101000649210 Homo sapiens Skin-specific protein 32 Proteins 0.000 description 4
- 241001465754 Metazoa Species 0.000 description 4
- 101100170937 Mus musculus Dnmt1 gene Proteins 0.000 description 4
- 241000186359 Mycobacterium Species 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- 239000000020 Nitrocellulose Substances 0.000 description 4
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 4
- 102100027923 Skin-specific protein 32 Human genes 0.000 description 4
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 4
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 4
- 238000000502 dialysis Methods 0.000 description 4
- 108020001507 fusion proteins Proteins 0.000 description 4
- 102000037865 fusion proteins Human genes 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- 230000012010 growth Effects 0.000 description 4
- 210000002540 macrophage Anatomy 0.000 description 4
- 239000002609 medium Substances 0.000 description 4
- 238000010369 molecular cloning Methods 0.000 description 4
- 238000012544 monitoring process Methods 0.000 description 4
- 238000002703 mutagenesis Methods 0.000 description 4
- 231100000350 mutagenesis Toxicity 0.000 description 4
- 229920001220 nitrocellulos Polymers 0.000 description 4
- 230000036961 partial effect Effects 0.000 description 4
- 230000035945 sensitivity Effects 0.000 description 4
- 238000000926 separation method Methods 0.000 description 4
- 238000010008 shearing Methods 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 208000024891 symptom Diseases 0.000 description 4
- 238000013519 translation Methods 0.000 description 4
- 238000011282 treatment Methods 0.000 description 4
- 229960001005 tuberculin Drugs 0.000 description 4
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 3
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 3
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 3
- LJRPYAZQQWHEEV-FXQIFTODSA-N Asp-Gln-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O LJRPYAZQQWHEEV-FXQIFTODSA-N 0.000 description 3
- BKOIIURTQAJHAT-GUBZILKMSA-N Asp-Pro-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 BKOIIURTQAJHAT-GUBZILKMSA-N 0.000 description 3
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 3
- 108010090461 DFG peptide Proteins 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 3
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 3
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 3
- 102000002812 Heat-Shock Proteins Human genes 0.000 description 3
- 108010004889 Heat-Shock Proteins Proteins 0.000 description 3
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 3
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 3
- 241000880493 Leptailurus serval Species 0.000 description 3
- 241000699670 Mus sp. Species 0.000 description 3
- 241000186366 Mycobacterium bovis Species 0.000 description 3
- 241001467552 Mycobacterium bovis BCG Species 0.000 description 3
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 3
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 3
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 3
- 206010035226 Plasma cell myeloma Diseases 0.000 description 3
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 3
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 3
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 3
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 3
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 3
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 3
- 208000036981 active tuberculosis Diseases 0.000 description 3
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 108010047495 alanylglycine Proteins 0.000 description 3
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 3
- 238000005571 anion exchange chromatography Methods 0.000 description 3
- 108010047857 aspartylglycine Proteins 0.000 description 3
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 3
- 230000004663 cell proliferation Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 239000000975 dye Substances 0.000 description 3
- 238000002330 electrospray ionisation mass spectrometry Methods 0.000 description 3
- 239000000706 filtrate Substances 0.000 description 3
- 108010049041 glutamylalanine Proteins 0.000 description 3
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 3
- 108010089804 glycyl-threonine Proteins 0.000 description 3
- 108010037850 glycylvaline Proteins 0.000 description 3
- 108010085325 histidylproline Proteins 0.000 description 3
- 210000004408 hybridoma Anatomy 0.000 description 3
- 230000001900 immune effect Effects 0.000 description 3
- 238000003119 immunoblot Methods 0.000 description 3
- 230000002163 immunogen Effects 0.000 description 3
- 108010078274 isoleucylvaline Proteins 0.000 description 3
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 3
- 210000004072 lung Anatomy 0.000 description 3
- 108010034507 methionyltryptophan Proteins 0.000 description 3
- 238000005497 microtitration Methods 0.000 description 3
- 201000000050 myeloid neoplasm Diseases 0.000 description 3
- 238000001556 precipitation Methods 0.000 description 3
- 108010031719 prolyl-serine Proteins 0.000 description 3
- 108010053725 prolylvaline Proteins 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 238000004366 reverse phase liquid chromatography Methods 0.000 description 3
- 238000004007 reversed phase HPLC Methods 0.000 description 3
- 230000000405 serological effect Effects 0.000 description 3
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 3
- 210000004988 splenocyte Anatomy 0.000 description 3
- 238000010561 standard procedure Methods 0.000 description 3
- 230000000638 stimulation Effects 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 229940104230 thymidine Drugs 0.000 description 3
- GMRQFYUYWCNGIN-UHFFFAOYSA-N 1,25-Dihydroxy-vitamin D3' Natural products C1CCC2(C)C(C(CCCC(C)(C)O)C)CCC2C1=CC=C1CC(O)CC(O)C1=C GMRQFYUYWCNGIN-UHFFFAOYSA-N 0.000 description 2
- PXFBZOLANLWPMH-UHFFFAOYSA-N 16-Epiaffinine Natural products C1C(C2=CC=CC=C2N2)=C2C(=O)CC2C(=CC)CN(C)C1C2CO PXFBZOLANLWPMH-UHFFFAOYSA-N 0.000 description 2
- UAIUNKRWKOVEES-UHFFFAOYSA-N 3,3',5,5'-tetramethylbenzidine Chemical compound CC1=C(N)C(C)=CC(C=2C=C(C)C(N)=C(C)C=2)=C1 UAIUNKRWKOVEES-UHFFFAOYSA-N 0.000 description 2
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 2
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 2
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- CCUAQNUWXLYFRA-IMJSIDKUSA-N Ala-Asn Chemical compound C[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC(N)=O CCUAQNUWXLYFRA-IMJSIDKUSA-N 0.000 description 2
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 2
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 2
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 2
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 2
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 2
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- ATRRKUHOCOJYRX-UHFFFAOYSA-N Ammonium bicarbonate Chemical compound [NH4+].OC([O-])=O ATRRKUHOCOJYRX-UHFFFAOYSA-N 0.000 description 2
- 229910000013 Ammonium bicarbonate Inorganic materials 0.000 description 2
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 2
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 2
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 2
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 2
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 2
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 2
- GVPSCJQLUGIKAM-GUBZILKMSA-N Asp-Arg-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GVPSCJQLUGIKAM-GUBZILKMSA-N 0.000 description 2
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 2
- ZSJFGGSPCCHMNE-LAEOZQHASA-N Asp-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N ZSJFGGSPCCHMNE-LAEOZQHASA-N 0.000 description 2
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 2
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 2
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 102100021277 Beta-secretase 2 Human genes 0.000 description 2
- 101710150190 Beta-secretase 2 Proteins 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- 241000700199 Cavia porcellus Species 0.000 description 2
- VEXZGXHMUGYJMC-UHFFFAOYSA-M Chloride anion Chemical compound [Cl-] VEXZGXHMUGYJMC-UHFFFAOYSA-M 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 2
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 2
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 2
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 2
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 2
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 2
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 2
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 2
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 2
- XLCZWMJPVGRWHJ-KQXIARHKSA-N Ile-Glu-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N XLCZWMJPVGRWHJ-KQXIARHKSA-N 0.000 description 2
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 2
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 2
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 2
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 2
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 2
- SPSSJSICDYYTQN-HJGDQZAQSA-N Met-Thr-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O SPSSJSICDYYTQN-HJGDQZAQSA-N 0.000 description 2
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical class OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 2
- BZLVMXJERCGZMT-UHFFFAOYSA-N Methyl tert-butyl ether Chemical compound COC(C)(C)C BZLVMXJERCGZMT-UHFFFAOYSA-N 0.000 description 2
- 101100260702 Mus musculus Tinagl1 gene Proteins 0.000 description 2
- 241000187482 Mycobacterium avium subsp. paratuberculosis Species 0.000 description 2
- 241001049988 Mycobacterium tuberculosis H37Ra Species 0.000 description 2
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 2
- 241001597008 Nomeidae Species 0.000 description 2
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 2
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 2
- 239000004793 Polystyrene Substances 0.000 description 2
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 2
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 2
- LCUOTSLIVGSGAU-AVGNSLFASA-N Pro-His-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LCUOTSLIVGSGAU-AVGNSLFASA-N 0.000 description 2
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 2
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 description 2
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 2
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 2
- ATUOYWHBWRKTHZ-UHFFFAOYSA-N Propane Chemical compound CCC ATUOYWHBWRKTHZ-UHFFFAOYSA-N 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 2
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 2
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 2
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 2
- GQPQJNMVELPZNQ-GBALPHGKSA-N Thr-Ser-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O GQPQJNMVELPZNQ-GBALPHGKSA-N 0.000 description 2
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 2
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 2
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 2
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 2
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 2
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 2
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 2
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 2
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 2
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 2
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 2
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 2
- 239000002671 adjuvant Substances 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 239000003513 alkali Substances 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 235000012538 ammonium bicarbonate Nutrition 0.000 description 2
- 239000001099 ammonium carbonate Substances 0.000 description 2
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 2
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 2
- 235000011130 ammonium sulphate Nutrition 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- 230000008485 antagonism Effects 0.000 description 2
- 230000001458 anti-acid effect Effects 0.000 description 2
- 101150088826 arg1 gene Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 229940098773 bovine serum albumin Drugs 0.000 description 2
- 239000007853 buffer solution Substances 0.000 description 2
- 229960005084 calcitriol Drugs 0.000 description 2
- GMRQFYUYWCNGIN-NKMMMXOESA-N calcitriol Chemical compound C1(/[C@@H]2CC[C@@H]([C@]2(CCC1)C)[C@@H](CCCC(C)(C)O)C)=C\C=C1\C[C@@H](O)C[C@H](O)C1=C GMRQFYUYWCNGIN-NKMMMXOESA-N 0.000 description 2
- 239000004202 carbamide Substances 0.000 description 2
- 235000013877 carbamide Nutrition 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 239000011248 coating agent Substances 0.000 description 2
- 238000000576 coating method Methods 0.000 description 2
- 230000002860 competitive effect Effects 0.000 description 2
- 239000000039 congener Substances 0.000 description 2
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 2
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 2
- 238000006073 displacement reaction Methods 0.000 description 2
- 239000012153 distilled water Substances 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 201000006674 extrapulmonary tuberculosis Diseases 0.000 description 2
- 125000000524 functional group Chemical group 0.000 description 2
- 239000007789 gas Substances 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 239000003365 glass fiber Substances 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010081551 glycylphenylalanine Proteins 0.000 description 2
- 238000010438 heat treatment Methods 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- FDGQSTZJBFJUBT-UHFFFAOYSA-N hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 description 2
- 150000002460 imidazoles Chemical class 0.000 description 2
- 230000003053 immunization Effects 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 238000011081 inoculation Methods 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 238000005342 ion exchange Methods 0.000 description 2
- 230000002147 killing effect Effects 0.000 description 2
- 101150066555 lacZ gene Proteins 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 238000004811 liquid chromatography Methods 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 210000004379 membrane Anatomy 0.000 description 2
- 238000013508 migration Methods 0.000 description 2
- 230000005012 migration Effects 0.000 description 2
- 235000013336 milk Nutrition 0.000 description 2
- 239000008267 milk Substances 0.000 description 2
- 210000004080 milk Anatomy 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 239000012071 phase Substances 0.000 description 2
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
- 201000010098 pleural tuberculosis Diseases 0.000 description 2
- 229920002223 polystyrene Polymers 0.000 description 2
- 229920002981 polyvinylidene fluoride Polymers 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 230000004936 stimulating effect Effects 0.000 description 2
- 238000007920 subcutaneous administration Methods 0.000 description 2
- 239000001117 sulphuric acid Substances 0.000 description 2
- 235000011149 sulphuric acid Nutrition 0.000 description 2
- 230000001225 therapeutic effect Effects 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 108010029384 tryptophyl-histidine Proteins 0.000 description 2
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 2
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 2
- 239000003643 water by type Substances 0.000 description 2
- 230000010148 water-pollination Effects 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 1
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- SEFVRKXJJPMVHQ-YUMQZZPRSA-N (2s)-2-[[2-[[(2s)-2-[(2-aminoacetyl)amino]-5-(diaminomethylideneamino)pentanoyl]amino]acetyl]amino]butanedioic acid Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O SEFVRKXJJPMVHQ-YUMQZZPRSA-N 0.000 description 1
- 125000003088 (fluoren-9-ylmethoxy)carbonyl group Chemical group 0.000 description 1
- NWUYHJFMYQTDRP-UHFFFAOYSA-N 1,2-bis(ethenyl)benzene;1-ethenyl-2-ethylbenzene;styrene Chemical compound C=CC1=CC=CC=C1.CCC1=CC=CC=C1C=C.C=CC1=CC=CC=C1C=C NWUYHJFMYQTDRP-UHFFFAOYSA-N 0.000 description 1
- VYMPLPIFKRHAAC-UHFFFAOYSA-N 1,2-ethanedithiol Chemical compound SCCS VYMPLPIFKRHAAC-UHFFFAOYSA-N 0.000 description 1
- AZQWKYJCGOJGHM-UHFFFAOYSA-N 1,4-benzoquinone Chemical compound O=C1C=CC(=O)C=C1 AZQWKYJCGOJGHM-UHFFFAOYSA-N 0.000 description 1
- BMGMINKVTPDDRZ-UHFFFAOYSA-N 2-acetamido-n-[1-[[5-(diaminomethylideneamino)-1-oxopentan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]-4-methylpentanamide;n-[1-[[5-(diaminomethylideneamino)-1-oxopentan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]-4-methyl-2-(propanoylamino)pentanamide Chemical compound CC(C)CC(NC(C)=O)C(=O)NC(CC(C)C)C(=O)NC(C=O)CCCN=C(N)N.CCC(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(=O)NC(C=O)CCCN=C(N)N BMGMINKVTPDDRZ-UHFFFAOYSA-N 0.000 description 1
- 102100025230 2-amino-3-ketobutyrate coenzyme A ligase, mitochondrial Human genes 0.000 description 1
- BFSVOASYOCHEOV-UHFFFAOYSA-N 2-diethylaminoethanol Chemical compound CCN(CC)CCO BFSVOASYOCHEOV-UHFFFAOYSA-N 0.000 description 1
- YRNWIFYIFSBPAU-UHFFFAOYSA-N 4-[4-(dimethylamino)phenyl]-n,n-dimethylaniline Chemical compound C1=CC(N(C)C)=CC=C1C1=CC=C(N(C)C)C=C1 YRNWIFYIFSBPAU-UHFFFAOYSA-N 0.000 description 1
- TVZGACDUOSZQKY-LBPRGKRZSA-N 4-aminofolic acid Chemical compound C1=NC2=NC(N)=NC(N)=C2N=C1CNC1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 TVZGACDUOSZQKY-LBPRGKRZSA-N 0.000 description 1
- 102100038222 60 kDa heat shock protein, mitochondrial Human genes 0.000 description 1
- 208000030507 AIDS Diseases 0.000 description 1
- 108010042708 Acetylmuramyl-Alanyl-Isoglutamine Proteins 0.000 description 1
- 108010087522 Aeromonas hydrophilia lipase-acyltransferase Proteins 0.000 description 1
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- XAEWTDMGFGHWFK-IMJSIDKUSA-N Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O XAEWTDMGFGHWFK-IMJSIDKUSA-N 0.000 description 1
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- NJIFPLAJSVUQOZ-JBDRJPRFSA-N Ala-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C)N NJIFPLAJSVUQOZ-JBDRJPRFSA-N 0.000 description 1
- CXZFXHGJJPVUJE-CIUDSAMLSA-N Ala-Cys-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O)N CXZFXHGJJPVUJE-CIUDSAMLSA-N 0.000 description 1
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 1
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- UWIQWPWWZUHBAO-ZLIFDBKOSA-N Ala-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)CC(C)C)C(O)=O)=CNC2=C1 UWIQWPWWZUHBAO-ZLIFDBKOSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 1
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 1
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- HIIJOGIBQXHFKE-HHKYUTTNSA-N Ala-Thr-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O HIIJOGIBQXHFKE-HHKYUTTNSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- ZDILXFDENZVOTL-BPNCWPANSA-N Ala-Val-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDILXFDENZVOTL-BPNCWPANSA-N 0.000 description 1
- 108700028369 Alleles Proteins 0.000 description 1
- 241000024188 Andala Species 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- OMLWNBVRVJYMBQ-YUMQZZPRSA-N Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OMLWNBVRVJYMBQ-YUMQZZPRSA-N 0.000 description 1
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 1
- JGDGLDNAQJJGJI-AVGNSLFASA-N Arg-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N JGDGLDNAQJJGJI-AVGNSLFASA-N 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- YUIGJDNAGKJLDO-JYJNAYRXSA-N Arg-Arg-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YUIGJDNAGKJLDO-JYJNAYRXSA-N 0.000 description 1
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 1
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- ZEAYJGRKRUBDOB-GARJFASQSA-N Arg-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZEAYJGRKRUBDOB-GARJFASQSA-N 0.000 description 1
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 1
- QAXCZGMLVICQKS-SRVKXCTJSA-N Arg-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QAXCZGMLVICQKS-SRVKXCTJSA-N 0.000 description 1
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 1
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 1
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 1
- CFGHCPUPFHWMCM-FDARSICLSA-N Arg-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N CFGHCPUPFHWMCM-FDARSICLSA-N 0.000 description 1
- ZDBWKBCKYJGKGP-DCAQKATOSA-N Arg-Leu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O ZDBWKBCKYJGKGP-DCAQKATOSA-N 0.000 description 1
- NIUDXSFNLBIWOB-DCAQKATOSA-N Arg-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NIUDXSFNLBIWOB-DCAQKATOSA-N 0.000 description 1
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 1
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 1
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 1
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 1
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 1
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- NZQFXJKVNUZYAG-BPUTZDHNSA-N Arg-Trp-Cys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CS)C(O)=O)=CNC2=C1 NZQFXJKVNUZYAG-BPUTZDHNSA-N 0.000 description 1
- XMGVWQWEWWULNS-BPUTZDHNSA-N Arg-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XMGVWQWEWWULNS-BPUTZDHNSA-N 0.000 description 1
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 1
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- UTSMXMABBPFVJP-SZMVWBNQSA-N Arg-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UTSMXMABBPFVJP-SZMVWBNQSA-N 0.000 description 1
- 240000003291 Armoracia rusticana Species 0.000 description 1
- 206010003445 Ascites Diseases 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- YJRORCOAFUZVKA-FXQIFTODSA-N Asn-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N YJRORCOAFUZVKA-FXQIFTODSA-N 0.000 description 1
- WVCJSDCHTUTONA-FXQIFTODSA-N Asn-Asp-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WVCJSDCHTUTONA-FXQIFTODSA-N 0.000 description 1
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 1
- XWFPGQVLOVGSLU-CIUDSAMLSA-N Asn-Gln-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XWFPGQVLOVGSLU-CIUDSAMLSA-N 0.000 description 1
- COUZKSSMBFADSB-AVGNSLFASA-N Asn-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N COUZKSSMBFADSB-AVGNSLFASA-N 0.000 description 1
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 1
- CDGHMJJJHYKMPA-DLOVCJGASA-N Asn-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)N)N CDGHMJJJHYKMPA-DLOVCJGASA-N 0.000 description 1
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 1
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 1
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 1
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 1
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 1
- YBMUFUWSMIKJQA-GUBZILKMSA-N Asp-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N YBMUFUWSMIKJQA-GUBZILKMSA-N 0.000 description 1
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 1
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 1
- HSWYMWGDMPLTTH-FXQIFTODSA-N Asp-Glu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HSWYMWGDMPLTTH-FXQIFTODSA-N 0.000 description 1
- ZEDBMCPXPIYJLW-XHNCKOQMSA-N Asp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZEDBMCPXPIYJLW-XHNCKOQMSA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- KPNUCOPMVSGRCR-DCAQKATOSA-N Asp-His-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KPNUCOPMVSGRCR-DCAQKATOSA-N 0.000 description 1
- ILQCHXURSRRIRY-YUMQZZPRSA-N Asp-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N ILQCHXURSRRIRY-YUMQZZPRSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- RRUWMFBLFLUZSI-LPEHRKFASA-N Asp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N RRUWMFBLFLUZSI-LPEHRKFASA-N 0.000 description 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- IHZFGJLKDYINPV-XIRDDKMYSA-N Asp-Trp-His Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC(O)=O)N)C(O)=O)C1=CN=CN1 IHZFGJLKDYINPV-XIRDDKMYSA-N 0.000 description 1
- XQFLFQWOBXPMHW-NHCYSSNCSA-N Asp-Val-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XQFLFQWOBXPMHW-NHCYSSNCSA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 1
- 108010055400 Aspartate kinase Proteins 0.000 description 1
- BVKZGUZCCUSVTD-UHFFFAOYSA-L Carbonate Chemical compound [O-]C([O-])=O BVKZGUZCCUSVTD-UHFFFAOYSA-L 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 108010058432 Chaperonin 60 Proteins 0.000 description 1
- 101710098119 Chaperonin GroEL 2 Proteins 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- 241000557626 Corvus corax Species 0.000 description 1
- 206010011224 Cough Diseases 0.000 description 1
- YFXFOZPXVFPBDH-VZFHVOOUSA-N Cys-Ala-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)CS)C(O)=O YFXFOZPXVFPBDH-VZFHVOOUSA-N 0.000 description 1
- UKVGHFORADMBEN-GUBZILKMSA-N Cys-Arg-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UKVGHFORADMBEN-GUBZILKMSA-N 0.000 description 1
- XGIAHEUULGOZHH-GUBZILKMSA-N Cys-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N XGIAHEUULGOZHH-GUBZILKMSA-N 0.000 description 1
- VNLYIYOYUNGURO-ZLUOBGJFSA-N Cys-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N VNLYIYOYUNGURO-ZLUOBGJFSA-N 0.000 description 1
- ZVNFONSZVUBRAV-CIUDSAMLSA-N Cys-Gln-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)CN=C(N)N ZVNFONSZVUBRAV-CIUDSAMLSA-N 0.000 description 1
- OXFOKRAFNYSREH-BJDJZHNGSA-N Cys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N OXFOKRAFNYSREH-BJDJZHNGSA-N 0.000 description 1
- KXUKWRVYDYIPSQ-CIUDSAMLSA-N Cys-Leu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUKWRVYDYIPSQ-CIUDSAMLSA-N 0.000 description 1
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 1
- VIOQRFNAZDMVLO-NRPADANISA-N Cys-Val-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIOQRFNAZDMVLO-NRPADANISA-N 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 229930182566 Gentamicin Natural products 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 1
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 1
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 1
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 1
- DLOHWQXXGMEZDW-CIUDSAMLSA-N Gln-Arg-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DLOHWQXXGMEZDW-CIUDSAMLSA-N 0.000 description 1
- SSWAFVQFQWOJIJ-XIRDDKMYSA-N Gln-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N SSWAFVQFQWOJIJ-XIRDDKMYSA-N 0.000 description 1
- ZRXBYKAOFHLTDN-GUBZILKMSA-N Gln-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N ZRXBYKAOFHLTDN-GUBZILKMSA-N 0.000 description 1
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 1
- GPISLLFQNHELLK-DCAQKATOSA-N Gln-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N GPISLLFQNHELLK-DCAQKATOSA-N 0.000 description 1
- RBWKVOSARCFSQQ-FXQIFTODSA-N Gln-Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O RBWKVOSARCFSQQ-FXQIFTODSA-N 0.000 description 1
- QFJPFPCSXOXMKI-BPUTZDHNSA-N Gln-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N QFJPFPCSXOXMKI-BPUTZDHNSA-N 0.000 description 1
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 1
- BLOXULLYFRGYKZ-GUBZILKMSA-N Gln-Glu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BLOXULLYFRGYKZ-GUBZILKMSA-N 0.000 description 1
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 1
- NXPXQIZKDOXIHH-JSGCOSHPSA-N Gln-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N NXPXQIZKDOXIHH-JSGCOSHPSA-N 0.000 description 1
- KHGGWBRVRPHFMH-PEFMBERDSA-N Gln-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHGGWBRVRPHFMH-PEFMBERDSA-N 0.000 description 1
- GIVHPCWYVWUUSG-HVTMNAMFSA-N Gln-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GIVHPCWYVWUUSG-HVTMNAMFSA-N 0.000 description 1
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 1
- DQLVHRFFBQOWFL-JYJNAYRXSA-N Gln-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)O DQLVHRFFBQOWFL-JYJNAYRXSA-N 0.000 description 1
- FTTHLXOMDMLKKW-FHWLQOOXSA-N Gln-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTTHLXOMDMLKKW-FHWLQOOXSA-N 0.000 description 1
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 1
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 1
- UWMDGPFFTKDUIY-HJGDQZAQSA-N Gln-Pro-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWMDGPFFTKDUIY-HJGDQZAQSA-N 0.000 description 1
- NSEKYCAADBNQFE-XIRDDKMYSA-N Gln-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 NSEKYCAADBNQFE-XIRDDKMYSA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 1
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 1
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 1
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- TWYFJOHWGCCRIR-DCAQKATOSA-N Glu-Pro-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYFJOHWGCCRIR-DCAQKATOSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- ZALGPUWUVHOGAE-GVXVVHGQSA-N Glu-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZALGPUWUVHOGAE-GVXVVHGQSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- 102100036263 Glutamyl-tRNA(Gln) amidotransferase subunit C, mitochondrial Human genes 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- BULIVUZUDBHKKZ-WDSKDSINSA-N Gly-Gln-Asn Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BULIVUZUDBHKKZ-WDSKDSINSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- LJXWZPHEMJSNRC-KBPBESRZSA-N Gly-Gln-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LJXWZPHEMJSNRC-KBPBESRZSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- TVDHVLGFJSHPAX-UWVGGRQHSA-N Gly-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 TVDHVLGFJSHPAX-UWVGGRQHSA-N 0.000 description 1
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 1
- YHYDTTUSJXGTQK-UWVGGRQHSA-N Gly-Met-Leu Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(C)C)C(O)=O YHYDTTUSJXGTQK-UWVGGRQHSA-N 0.000 description 1
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- FXTUGWXZTFMTIV-GJZGRUSLSA-N Gly-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN FXTUGWXZTFMTIV-GJZGRUSLSA-N 0.000 description 1
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- 102000018932 HSP70 Heat-Shock Proteins Human genes 0.000 description 1
- 108010027992 HSP70 Heat-Shock Proteins Proteins 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 1
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 1
- MBSSHYPAEHPSGY-LSJOCFKGSA-N His-Ala-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O MBSSHYPAEHPSGY-LSJOCFKGSA-N 0.000 description 1
- IDNNYVGVSZMQTK-IHRRRGAJSA-N His-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N IDNNYVGVSZMQTK-IHRRRGAJSA-N 0.000 description 1
- IIVZNQCUUMBBKF-GVXVVHGQSA-N His-Gln-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 IIVZNQCUUMBBKF-GVXVVHGQSA-N 0.000 description 1
- JIUYRPFQJJRSJB-QWRGUYRKSA-N His-His-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JIUYRPFQJJRSJB-QWRGUYRKSA-N 0.000 description 1
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 1
- GHAFKUCRIVBLDJ-IHRRRGAJSA-N His-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N GHAFKUCRIVBLDJ-IHRRRGAJSA-N 0.000 description 1
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 1
- ULRFSEJGSHYLQI-YESZJQIVSA-N His-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O ULRFSEJGSHYLQI-YESZJQIVSA-N 0.000 description 1
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 1
- MUENHEQLLUDKSC-PMVMPFDFSA-N His-Tyr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CNC=N1 MUENHEQLLUDKSC-PMVMPFDFSA-N 0.000 description 1
- 101001001786 Homo sapiens Glutamyl-tRNA(Gln) amidotransferase subunit C, mitochondrial Proteins 0.000 description 1
- 101000869690 Homo sapiens Protein S100-A8 Proteins 0.000 description 1
- 108010064711 Homoserine dehydrogenase Proteins 0.000 description 1
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 1
- UGQMRVRMYYASKQ-UHFFFAOYSA-N Hypoxanthine nucleoside Natural products OC1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 UGQMRVRMYYASKQ-UHFFFAOYSA-N 0.000 description 1
- 108700039609 IRW peptide Proteins 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- ASCFJMSGKUIRDU-ZPFDUUQYSA-N Ile-Arg-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O ASCFJMSGKUIRDU-ZPFDUUQYSA-N 0.000 description 1
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 1
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 1
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 1
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 1
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 1
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 1
- BALLIXFZYSECCF-QEWYBTABSA-N Ile-Gln-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N BALLIXFZYSECCF-QEWYBTABSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- QQVXERGIFIRCGW-NAKRPEOUSA-N Ile-Ser-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N QQVXERGIFIRCGW-NAKRPEOUSA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 108010015268 Integration Host Factors Proteins 0.000 description 1
- 102000008070 Interferon-gamma Human genes 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 1
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- QPRQGENIBFLVEB-BJDJZHNGSA-N Leu-Ala-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QPRQGENIBFLVEB-BJDJZHNGSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 1
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- IDGRADDMTTWOQC-WDSOQIARSA-N Leu-Trp-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IDGRADDMTTWOQC-WDSOQIARSA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- 108010011078 Leupeptins Proteins 0.000 description 1
- 108090001030 Lipoproteins Proteins 0.000 description 1
- 102000004895 Lipoproteins Human genes 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 1
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 1
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 1
- KFSALEZVQJYHCE-AVGNSLFASA-N Lys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N KFSALEZVQJYHCE-AVGNSLFASA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- 241000282553 Macaca Species 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- QGQGAIBGTUJRBR-NAKRPEOUSA-N Met-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC QGQGAIBGTUJRBR-NAKRPEOUSA-N 0.000 description 1
- CWFYZYQMUDWGTI-GUBZILKMSA-N Met-Arg-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O CWFYZYQMUDWGTI-GUBZILKMSA-N 0.000 description 1
- IVCPHARVJUYDPA-FXQIFTODSA-N Met-Asn-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IVCPHARVJUYDPA-FXQIFTODSA-N 0.000 description 1
- FWTBMGAKKPSTBT-GUBZILKMSA-N Met-Gln-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FWTBMGAKKPSTBT-GUBZILKMSA-N 0.000 description 1
- XKJUFUPCHARJKX-UWVGGRQHSA-N Met-Gly-His Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 XKJUFUPCHARJKX-UWVGGRQHSA-N 0.000 description 1
- RXWPLVRJQNWXRQ-IHRRRGAJSA-N Met-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CNC=N1 RXWPLVRJQNWXRQ-IHRRRGAJSA-N 0.000 description 1
- OCRSGGIJBDUXHU-WDSOQIARSA-N Met-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 OCRSGGIJBDUXHU-WDSOQIARSA-N 0.000 description 1
- HOZNVKDCKZPRER-XUXIUFHCSA-N Met-Lys-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HOZNVKDCKZPRER-XUXIUFHCSA-N 0.000 description 1
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 1
- VBGGTAPDGFQMKF-AVGNSLFASA-N Met-Lys-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O VBGGTAPDGFQMKF-AVGNSLFASA-N 0.000 description 1
- ZRACLHJYVRBJFC-ULQDDVLXSA-N Met-Lys-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZRACLHJYVRBJFC-ULQDDVLXSA-N 0.000 description 1
- LCPUWQLULVXROY-RHYQMDGZSA-N Met-Lys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LCPUWQLULVXROY-RHYQMDGZSA-N 0.000 description 1
- PHKBGZKVOJCIMZ-SRVKXCTJSA-N Met-Pro-Arg Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PHKBGZKVOJCIMZ-SRVKXCTJSA-N 0.000 description 1
- PCTFVQATEGYHJU-FXQIFTODSA-N Met-Ser-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O PCTFVQATEGYHJU-FXQIFTODSA-N 0.000 description 1
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 1
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 1
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 1
- NSMXRFMGZYTFEX-KJEVXHAQSA-N Met-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCSC)N)O NSMXRFMGZYTFEX-KJEVXHAQSA-N 0.000 description 1
- VOAKKHOIAFKOQZ-JYJNAYRXSA-N Met-Tyr-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=C(O)C=C1 VOAKKHOIAFKOQZ-JYJNAYRXSA-N 0.000 description 1
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 1
- 241000187480 Mycobacterium smegmatis Species 0.000 description 1
- RKOTXQYWCBGZLP-UHFFFAOYSA-N N-[(2,4-difluorophenyl)methyl]-2-ethyl-9-hydroxy-3-methoxy-1,8-dioxospiro[3H-pyrido[1,2-a]pyrazine-4,3'-oxolane]-7-carboxamide Chemical compound CCN1C(OC)C2(CCOC2)N2C=C(C(=O)NCC3=C(F)C=C(F)C=C3)C(=O)C(O)=C2C1=O RKOTXQYWCBGZLP-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 101100068676 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) gln-1 gene Proteins 0.000 description 1
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 1
- 101710141454 Nucleoprotein Proteins 0.000 description 1
- 239000002033 PVDF binder Substances 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 208000037581 Persistent Infection Diseases 0.000 description 1
- MDHZEOMXGNBSIL-DLOVCJGASA-N Phe-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N MDHZEOMXGNBSIL-DLOVCJGASA-N 0.000 description 1
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 1
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 1
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 1
- WFHRXJOZEXUKLV-IRXDYDNUSA-N Phe-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 WFHRXJOZEXUKLV-IRXDYDNUSA-N 0.000 description 1
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 1
- BPIFSOUEUYDJRM-DCPHZVHLSA-N Phe-Trp-Ala Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C)C(O)=O)C1=CC=CC=C1 BPIFSOUEUYDJRM-DCPHZVHLSA-N 0.000 description 1
- 102000006335 Phosphate-Binding Proteins Human genes 0.000 description 1
- 108010058514 Phosphate-Binding Proteins Proteins 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- 229920001213 Polysorbate 20 Polymers 0.000 description 1
- FELJDCNGZFDUNR-WDSKDSINSA-N Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FELJDCNGZFDUNR-WDSKDSINSA-N 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 1
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 1
- JARJPEMLQAWNBR-GUBZILKMSA-N Pro-Asp-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JARJPEMLQAWNBR-GUBZILKMSA-N 0.000 description 1
- DRIJZWBRGMJCDD-DCAQKATOSA-N Pro-Gln-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O DRIJZWBRGMJCDD-DCAQKATOSA-N 0.000 description 1
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 1
- BEPSGCXDIVACBU-IUCAKERBSA-N Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1NCCC1)C1=CN=CN1 BEPSGCXDIVACBU-IUCAKERBSA-N 0.000 description 1
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 1
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 1
- AUYKOPJPKUCYHE-SRVKXCTJSA-N Pro-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 AUYKOPJPKUCYHE-SRVKXCTJSA-N 0.000 description 1
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 1
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- WWXNZNWZNZPDIF-SRVKXCTJSA-N Pro-Val-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 WWXNZNWZNZPDIF-SRVKXCTJSA-N 0.000 description 1
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 1
- 102100032442 Protein S100-A8 Human genes 0.000 description 1
- 239000012980 RPMI-1640 medium Substances 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- BUGBHKTXTAQXES-UHFFFAOYSA-N Selenium Chemical compound [Se] BUGBHKTXTAQXES-UHFFFAOYSA-N 0.000 description 1
- 206010070834 Sensitisation Diseases 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- RNFKSBPHLTZHLU-WHFBIAKZSA-N Ser-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)O RNFKSBPHLTZHLU-WHFBIAKZSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 1
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 1
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 1
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- OJFFAQFRCVPHNN-JYBASQMISA-N Ser-Thr-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OJFFAQFRCVPHNN-JYBASQMISA-N 0.000 description 1
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 230000006052 T cell proliferation Effects 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 1
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 1
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 1
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 1
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 1
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- QWMPARMKIDVBLV-VZFHVOOUSA-N Thr-Cys-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O QWMPARMKIDVBLV-VZFHVOOUSA-N 0.000 description 1
- NRUPKQSXTJNQGD-XGEHTFHBSA-N Thr-Cys-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NRUPKQSXTJNQGD-XGEHTFHBSA-N 0.000 description 1
- ODSAPYVQSLDRSR-LKXGYXEUSA-N Thr-Cys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O ODSAPYVQSLDRSR-LKXGYXEUSA-N 0.000 description 1
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 1
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- CYVQBKQYQGEELV-NKIYYHGXSA-N Thr-His-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CYVQBKQYQGEELV-NKIYYHGXSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 1
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 1
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- YZCKVEUIGOORGS-NJFSPNSNSA-N Tritium Chemical compound [3H] YZCKVEUIGOORGS-NJFSPNSNSA-N 0.000 description 1
- WFZYXGSAPWKTHR-XEGUGMAKSA-N Trp-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WFZYXGSAPWKTHR-XEGUGMAKSA-N 0.000 description 1
- IXEGQBJZDIRRIV-QEJZJMRPSA-N Trp-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IXEGQBJZDIRRIV-QEJZJMRPSA-N 0.000 description 1
- BSSJIVIFAJKLEK-XIRDDKMYSA-N Trp-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BSSJIVIFAJKLEK-XIRDDKMYSA-N 0.000 description 1
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 1
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 1
- RRVUOLRWIZXBRQ-IHPCNDPISA-N Trp-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RRVUOLRWIZXBRQ-IHPCNDPISA-N 0.000 description 1
- VDCGPCSLAJAKBB-XIRDDKMYSA-N Trp-Ser-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N VDCGPCSLAJAKBB-XIRDDKMYSA-N 0.000 description 1
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 1
- IELISNUVHBKYBX-XDTLVQLUSA-N Tyr-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IELISNUVHBKYBX-XDTLVQLUSA-N 0.000 description 1
- JWGXUKHIKXZWNG-RYUDHWBXSA-N Tyr-Gly-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JWGXUKHIKXZWNG-RYUDHWBXSA-N 0.000 description 1
- ARMNWLJYHCOSHE-KKUMJFAQSA-N Tyr-Pro-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O ARMNWLJYHCOSHE-KKUMJFAQSA-N 0.000 description 1
- AKRHKDCELJLTMD-BVSLBCMMSA-N Tyr-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N AKRHKDCELJLTMD-BVSLBCMMSA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 1
- JFAWZADYPRMRCO-UBHSHLNASA-N Val-Ala-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JFAWZADYPRMRCO-UBHSHLNASA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- VJOWWOGRNXRQMF-UVBJJODRSA-N Val-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 VJOWWOGRNXRQMF-UVBJJODRSA-N 0.000 description 1
- JOQSQZFKFYJKKJ-GUBZILKMSA-N Val-Arg-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N JOQSQZFKFYJKKJ-GUBZILKMSA-N 0.000 description 1
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 1
- GNWUWQAVVJQREM-NHCYSSNCSA-N Val-Asn-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GNWUWQAVVJQREM-NHCYSSNCSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- LMSBRIVOCYOKMU-NRPADANISA-N Val-Gln-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N LMSBRIVOCYOKMU-NRPADANISA-N 0.000 description 1
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 1
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 1
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 1
- VSCIANXXVZOYOC-AVGNSLFASA-N Val-Pro-His Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VSCIANXXVZOYOC-AVGNSLFASA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 208000038016 acute inflammation Diseases 0.000 description 1
- 230000006022 acute inflammation Effects 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 108010066829 alanyl-glutamyl-aspartylprolyine Proteins 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 125000003172 aldehyde group Chemical group 0.000 description 1
- 102000019199 alpha-Mannosidase Human genes 0.000 description 1
- 108010012864 alpha-Mannosidase Proteins 0.000 description 1
- 150000001412 amines Chemical class 0.000 description 1
- 229960003896 aminopterin Drugs 0.000 description 1
- 238000005349 anion exchange Methods 0.000 description 1
- 230000000844 anti-bacterial effect Effects 0.000 description 1
- 238000011091 antibody purification Methods 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010091818 arginyl-glycyl-aspartyl-valine Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 238000000376 autoradiography Methods 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 235000021028 berry Nutrition 0.000 description 1
- 230000001588 bifunctional effect Effects 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 108091006004 biotinylated proteins Proteins 0.000 description 1
- 230000036770 blood supply Effects 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 230000007910 cell fusion Effects 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 229940097572 chloromycetin Drugs 0.000 description 1
- 238000011097 chromatography purification Methods 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- -1 cofactor Substances 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000009833 condensation Methods 0.000 description 1
- 230000005494 condensation Effects 0.000 description 1
- 239000003431 cross linking reagent Substances 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- 230000001186 cumulative effect Effects 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 239000003398 denaturant Substances 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 229940039227 diagnostic agent Drugs 0.000 description 1
- 239000000032 diagnostic agent Substances 0.000 description 1
- 239000000385 dialysis solution Substances 0.000 description 1
- 108010079167 dihydrolipoamide succinyltransferase Proteins 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- 239000013024 dilution buffer Substances 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 230000008034 disappearance Effects 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 208000017574 dry cough Diseases 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 238000004043 dyeing Methods 0.000 description 1
- 238000013399 early diagnosis Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 239000003344 environmental pollutant Substances 0.000 description 1
- 230000006862 enzymatic digestion Effects 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 229940087586 escherichia coli antigen Drugs 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000011010 flushing procedure Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- CEAZRRDELHUEMR-UHFFFAOYSA-N gentamicin Chemical class O1C(C(C)NC)CCC(N)C1OC1C(O)C(OC2C(C(NC)C(C)(O)CO2)O)C(N)CC1N CEAZRRDELHUEMR-UHFFFAOYSA-N 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- ZBKIUFWVEIBQRT-UHFFFAOYSA-N gold(1+) Chemical compound [Au+] ZBKIUFWVEIBQRT-UHFFFAOYSA-N 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 210000004754 hybrid cell Anatomy 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 230000002458 infectious effect Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 229960003130 interferon gamma Drugs 0.000 description 1
- 230000014828 interferon-gamma production Effects 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 239000003456 ion exchange resin Substances 0.000 description 1
- 229920003303 ion-exchange polymer Polymers 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 1
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 239000007791 liquid phase Substances 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 239000006249 magnetic particle Substances 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 235000013372 meat Nutrition 0.000 description 1
- 238000005374 membrane filtration Methods 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 229910021645 metal ion Inorganic materials 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 229940126619 mouse monoclonal antibody Drugs 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 229910052759 nickel Inorganic materials 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 239000002853 nucleic acid probe Substances 0.000 description 1
- 238000012856 packing Methods 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 238000010647 peptide synthesis reaction Methods 0.000 description 1
- 230000010412 perfusion Effects 0.000 description 1
- 210000003200 peritoneal cavity Anatomy 0.000 description 1
- 102000013415 peroxidase activity proteins Human genes 0.000 description 1
- 108040007629 peroxidase activity proteins Proteins 0.000 description 1
- 235000020030 perry Nutrition 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 229930029653 phosphoenolpyruvate Natural products 0.000 description 1
- DTBNBXWJWCWCIK-UHFFFAOYSA-N phosphoenolpyruvic acid Chemical compound OC(=O)C(=C)OP(O)(O)=O DTBNBXWJWCWCIK-UHFFFAOYSA-N 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 231100000719 pollutant Toxicity 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 1
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 1
- 229920000136 polysorbate Polymers 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 230000001376 precipitating effect Effects 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 239000002987 primer (paints) Substances 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 239000001294 propane Substances 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 238000000734 protein sequencing Methods 0.000 description 1
- 230000002685 pulmonary effect Effects 0.000 description 1
- 208000008128 pulmonary tuberculosis Diseases 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 238000005096 rolling process Methods 0.000 description 1
- 238000000682 scanning probe acoustic microscopy Methods 0.000 description 1
- 238000007789 sealing Methods 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 229910052711 selenium Inorganic materials 0.000 description 1
- 239000011669 selenium Substances 0.000 description 1
- 230000008313 sensitization Effects 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 229910052709 silver Inorganic materials 0.000 description 1
- 239000004332 silver Substances 0.000 description 1
- PNGLEYLFMHGIQO-UHFFFAOYSA-M sodium;3-(n-ethyl-3-methoxyanilino)-2-hydroxypropane-1-sulfonate;dihydrate Chemical compound O.O.[Na+].[O-]S(=O)(=O)CC(O)CN(CC)C1=CC=CC(OC)=C1 PNGLEYLFMHGIQO-UHFFFAOYSA-M 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000011343 solid material Substances 0.000 description 1
- 235000014347 soups Nutrition 0.000 description 1
- 238000010183 spectrum analysis Methods 0.000 description 1
- 239000007921 spray Substances 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- HNKJADCVZUBCPG-UHFFFAOYSA-N thioanisole Chemical compound CSC1=CC=CC=C1 HNKJADCVZUBCPG-UHFFFAOYSA-N 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 238000004448 titration Methods 0.000 description 1
- 238000000954 titration curve Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 229910052722 tritium Inorganic materials 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 102000003390 tumor necrosis factor Human genes 0.000 description 1
- 238000000539 two dimensional gel electrophoresis Methods 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 238000002255 vaccination Methods 0.000 description 1
- 229960005486 vaccine Drugs 0.000 description 1
- 238000003828 vacuum filtration Methods 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 239000013598 vector Substances 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/04—Antibacterial agents
- A61P31/06—Antibacterial agents for tuberculosis
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/35—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Mycobacteriaceae (F)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/12—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from bacteria
- C07K16/1267—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from bacteria from Gram-positive bacteria
- C07K16/1289—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from bacteria from Gram-positive bacteria from Mycobacteriaceae (F)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6888—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
- C12Q1/689—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for bacteria
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/569—Immunoassay; Biospecific binding assay; Materials therefor for microorganisms, e.g. protozoa, bacteria, viruses
- G01N33/56911—Bacteria
- G01N33/5695—Mycobacteria
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
Abstract
本发明提供了一种多肽,它含有可溶性结核分枝杆菌抗原的抗原性部分,或仅仅在保守性置换和/或修饰中有所不同的所述抗原的变体。本发明还提供了编码这些多肽的DNA分子,这些多肽和DNA分子用于检测生物样品中结核分枝杆菌感染的用途,以及含有这些多肽和DNA分子的试剂盒。
Description
相关申请交叉文献
本申请是1998年2月18日提交的美国申请No.09/024,753的部分续展申请;后者是1997年10月1日提交的美国申请No.08/942,341的部分续展申请;后者是1997年3月13日提交的美国申请08/818,111的部分续展申请;后者是1996年10月11日提交的美国申请No.08/729,622的部分续展申请;后者要求了1996年8月30日提交的PCT申请No.PCT/US96/14675的优先权;是1996年7月12日提交的美国申请No.08/680,574的部分续展申请;后者是1996年6月5日提交的美国申请No.08/658,800的部分续展申请;后者是1996年3月22日提交、现在放弃的美国申请No.08/620,280的部分续展申请;后者是1995年9月22日提交、现在放弃的美国申请No.08/532,136的部分续展申请;后者是1995年9月1日提交、现在放弃的美国申请No.08/523,435的续展申请。
技术领域
本发明总地涉及结核分枝杆菌(Mycobacterium tuberculosis)感染的检测。本发明更具体地涉及包含结核分枝杆菌抗原、其一部分或其它变体的多肽,以及这些多肽用来血清诊断结核分枝杆菌感染的用途。
发明背景
结核病是一种慢性传染性疾病,它通常由结核分枝杆菌感染引起。它在发展中国家中是主要的疾病,对于世界上的发达地区也是正增加的问题,每年有大约800万新的病例和300万死者。尽管感染可能在相当长的时间内没有症状,但是该病最常见的表现为肺部急性炎症,导致发热和干咳。若不治疗,通常会导致严重的并发症和死亡。
虽然结核病通常可用长期抗生素治疗来控制,但是这些治疗却不足以防止疾病的传播。受感染的个体可能没有症状,但有时有传染性。另外,虽然遵从治疗方案很关键,但是患者的行为很难监控。一些患者没有完成疗程,这样会导致治疗没有效果并产生抗药性。
抑制结核病的传播需要有效的疫苗接种和准确地早期诊断该疾病。当前,用活细菌免疫接种是诱导保护性免疫力最有效的方法。用于此目的的最常用的分枝杆菌是Bacillus Calmette-Guerin(BCG)卡介苗,一种牛分枝杆菌的无毒菌株。然而,BCG的安全性和有效性引起了争论,一些国家(如美国)不对一般公众接种该疫苗。诊断通常用皮肤试验来实现,该试验涉及真皮内接触结核菌素PPD(纯化的蛋白衍生物)。抗原特异性T细胞应答在注射后48-72小时在注射部位导致可测定的培育,表明接触过分枝杆菌抗原。然而,该试验的问题是敏感性和特异性,且不能区别接种过BCG疫苗的个体和感染的个体。
尽管已证明巨噬细胞起结核分枝杆菌免疫力的主要效应物的作用,但T细胞却是该免疫力的主要诱导物。T细胞在抵抗结核分枝杆菌感染的保护中所起的基本作用AIDS患者中通过结核分枝杆菌的发病频繁而得以阐明,这是由于人免疫缺陷病毒(HIV)感染伴有CD4 T细胞损耗。已经证明分枝杆菌反应性CD4 T细胞是γ干扰素(IFN-γ)的强效生产者,进而证明γ干扰素引发了小鼠体内巨噬细胞的抗分枝杆菌作用。尽管IFN-γ在人体内的作用尚不清楚,但是研究已经表明,单用1,25-二羟基-维生素D3或与IFN-γ或肿瘤坏死因子α合用激活了人巨噬细胞,从而抑制了结核分枝杆菌感染。另外,已知IFN-γ刺激人巨噬细胞产生1,25-二羟基-维生素D3。类似地,业已证明IL-12在刺激对结核分枝杆菌感染的抵抗力中起作用。关于结核分枝杆菌感染的免疫学的综述参见Chan和Kaufmann在《结核病:发病机理、保护和控制》,Bloom编辑,ASM出版社,Washington,DC,1994。
因此,本领域中需要有改进的诊断方法来检测结核病。本发明实现了这一需求并且还提供了其它有关的优点。
发明概述
简言之,本发明提供了诊断结核病的组合物和方法。一方面,本发明提供了多肽,该多肽含有可溶性结核分枝杆菌抗原的抗原性部分、或仅仅在保守性置换和/或修饰中有所不同的该抗原的变体。在该方面的一个实施方案中,可溶性抗原具有下列N-端序列中的一个序列:
(a)Asp-Pro-Val-Asp-Ala-Val-Ile-Asn-Thr-Thr-Cys-Asn-Tyr-Gly-
Gln-Val-Val-Ala-Ala-Leu(SEQ ID NO:115);
(b)Ala-Val-Glu-Ser-Gly-Met-Leu-Ala-Leu-Gly-Thr-Pro-Ala-Pro-
Ser(SEQ ID NO:116);
(c)Ala-Ala-Met-Lys-Pro-Arg-Thr-Gly-Asp-Gly-Pro-Leu-Glu-Ala-
Ala-Lys-Glu-Gly-Arg(SEQ ID NO:117);
(d)Tyr-Tyr-Trp-Cys-Pro-Gly-Gln-Pro-Phe-Asp-Pro-Ala-Trp-Gly-
Pro(SEQ ID NO:118);
(e)Asp-Ile-Gly-Ser-Glu-Ser-Thr-Glu-Asp-Gln-Gln-Xaa-Ala-Val
(SEQ ID NO:119);
(f)Ala-Glu-Glu-Ser-Ile-Ser-Thr-Xaa-Glu-Xaa-Ile-Val-Pro(SEQ ID
NO:120);
(g)Asp-Pro-Glu-Pro-Ala-Pro-Pro-Val-Pro-Thr-Thr-Ala-Ala-Ser-
Pro-Pro-Ser(SEQ ID NO:121);
(h)Ala-Pro-Lys-Thr-Tyr-Xaa-Glu-Glu-Leu-Lys-Gly-Thr-Asp-Thr-
Gly(SEQ ID NO:122);
(i)Asp-Pro-Ala-Ser-Ala-Pro-Asp-Val-Pro-Thr-Ala-Ala-Gln-Leu-
Thr-Ser-Leu-Leu-Asn-Ser-Leu-Ala-Asp-Pro-Asn-Val-Ser-Phe-
Ala-Asn(SEQ ID NO:123);
(j)Xaa-Asp-Ser-Glu-Lys-Ser-Ala-Thr-Ile-Lys-Val-Thr-Asp-Ala-
Ser,(SEQ ID NO:129)
(k)Ala-Gly-Asp-Thr-Xaa-Ile-Tyr-Ile-Val-Gly-Asn-Leu-Thr-Ala-
Asp;(SEQ ID NO:130)或
(l)Ala-Pro-Glu-Ser-Gly-Ala-Gly-Leu-Gly-Gly-Thr-Val-Gln-Ala-
Gly;(SEQ ID NO:131)
其中Xaa可以是任何氨基酸。
在一个相关的方面,提供了多肽,该多肽含有结核分枝杆菌抗原的免疫原性部分、或仅仅在保守性置换和/或修饰中有所不同的该抗原的变体,该抗原具有下列N-端序列之一:
(m)Xaa-Tyr-Ile-Ala-Tyr-Xaa-Thr-Thr-Ala-Gly-Ile-Val-Pro-Gly-Lys-
Ile-Asn-Val-His-Leu-Val;(SEQ ID NO:132)或
(n)Asp-Pro-Pro-Asp-Pro-His-Gln-Xaa-Asp-Met-Thr-Lys-Gly-Tyr-
Tyr-Pro-Gly-Gly-Arg-Arg-Xaa-Phe;(SEQ ID NO:124)
其中Xaa可以是任何氨基酸。
在另一个实施方案中,可溶性结核分枝杆菌抗原包含由DNA序列编码的氨基酸序列,该DNA序列选自:SEQ ID NO:1、2、4-10、13-25、52、94和96中描述的序列、所述序列的互补序列、以及在中等严谨条件下与SEQ ID NO:1、2、4-10、13-25、52、94和96中描述的序列或其互补序列杂交的DNA序列。
在一个相关的方面,多肽包含结核分枝杆菌抗原的抗原性部分,或仅仅在保守性置换和/或修饰中有所不同的该抗原的变体,其中抗原包含的氨基酸序列由DNA序列编码,该DNA序列选自:SEQ ID NO:26-51、133、134、158-178、184-188、194-196、198、210-220、232、234、235、237-242、248-251、256-271、287、288、290-293和298-337中描述的序列、所述序列的互补序列、以及在中等严谨条件下与SEQ ID NO:26-51、133、134、158-178、184-188、194-196、198、210-220、232、234、235、237-242、248-251、256-271、287、288、290-293和298-337中描述的序列或其互补序列杂交的DNA序列。
在相关方面,还提供了编码上述多肽的DNA序列、含有这些DNA序列的重组表达载体以及转化或转染了这些表达载体的宿主细胞。
另一方面,本发明提供了融合蛋白,该蛋白含有第一种和第二种本发明的多肽,或者本发明的多肽与一种已知的结核分枝杆菌抗原。
在本发明的另一方面,提供了检测患者结核病的方法和诊断试剂盒。该方法包括:(a)使生物样品与至少一种上述多肽接触;和(b)检测样品中结合上述多肽的抗体是否存在,从而检测生物样品中结核分枝杆菌的感染。合适的生物样品包括全血、痰液、血清、血浆、唾液、脑脊液和尿液。诊断试剂盒包含一种或多种上述多肽与检测试剂联合使用。
本发明还提供检测结核分枝杆菌感染的方法,该方法包括:(a)从患者获得生物样品;(b)使样品在聚合酶链反应中与至少一个寡核苷酸引物接触,该寡核苷酸引物对编码上述多肽的DNA序列有特异性;和(c)检测样品中在第一和第二寡核苷酸引物存在下扩增的DNA序列。在一个实施方案中,寡核苷酸引物含有该DNA序列的至少大约10个连续的核苷酸。
另一方面,本发明提供了检测患者结核分枝杆菌感染的方法,该方法包括:(a)获得患者的生物样品;(b)使样品与对编码上述多肽的DNA序列有特异性的寡核苷酸探针接触;和(c)检测样品中与寡核苷酸探针杂交的DNA序列。在一个实施方案中,寡核苷酸探针包含该DNA序列的至少约15个连续的核苷酸。
另一方面,本发明提供了与上述多肽结合抗体(单克隆抗体和多克隆抗体),以及它们在检测结核分枝杆菌感染中的用途。
在参看了下列详细描述和所附附图后,本发明的这些方面和其它方面将是显而易见的。本文公开的所有文献均全部纳入本文作参考,正如每一份文献单独纳入本文一样。
附图简述和序列说明
图1A和B说明了实施例1中描述的14Kd,20Kd和26Kd抗原分别刺激第一和第二结核分枝杆菌免疫供体T细胞的增殖和干扰素γ产生情况。
图2A-D描述了针对分泌型结核分枝杆菌蛋白(已知的结核分枝杆菌抗原85b)以及本发明抗原Tb38-1和TbH-9产生的抗血清,分别与结核分枝杆菌裂解液(泳道2)、结核分枝杆菌分泌性蛋白(泳道3)、重组Tb38-1(泳道4)、重组TbH-9(泳道5)以及重组85b(泳道5)的反应性。
图3A描述了分泌型结核分枝杆菌蛋白、重组TbH-9以及对照抗原TbRa11刺激TbH-9特异性T细胞克隆增殖。
图3B描述了分泌型结核分枝杆菌蛋白、PPD和重组TbH-9刺激TbH-9-特异性T细胞克隆产生干扰素γ。
图4描述了两种典型的多肽与感染了结核分枝杆菌的个体以及未感染个体的血清的反应性与细菌裂解液反应性的比较。
图5显示了四种典型的多肽与感染及未感染结核分枝杆菌的个体的血清的反应性,与38kD抗原反应性的比较。
图6显示了重组38kD和TbRa11抗原与结核分枝杆菌患者、PPD阳性献血员以及正常献血员的血清的反应性。
图7显示了抗原TbRa2A与38kD阴性血清的反应性。
图8显示了SEQ ID NO:60的抗原与结核分枝杆菌患者以及正常献血员的血清的反应性。
图9描述了重组抗原TbH-29(SEQ ID NO:137)与结核分枝杆菌患者、PPD阳性献血员和正常献血员的血清经间接ELISA测得的反应性。
图10描述了重组抗原TbH-33(SEQ ID NO:140)与结核分枝杆菌患者以及正常献血员的血清、以及与结核分枝杆菌患者合并血清的直接和间接ELISA测得的反应性。
图11描述了浓度增加的重组抗原TbH-33(SEQ ID NO:140)与结核分枝杆菌患者以及正常献血员的血清经ELISA测得的反应性。
图12A-E分别描述了重组抗原MO-1、MO-2、MO-4、MO-28和MO-29分别与结核分枝杆菌患者以及正常献血员的血清经ELISA测得的反应性。
SEQ ID NO:1是TbRa1的DNA序列。
SEQ ID NO:2是TbRa10的DNA序列。
SEQ ID NO:3是TbRa11的DNA序列。
SEQ ID NO:4是TbRa12的DNA序列。
SEQ ID NO:5是TbRa13的DNA序列。
SEQ ID NO:6是TbRa16的DNA序列。
SEQ ID NO:7是TbRa17的DNA序列。
SEQ ID NO:8是TbRa18的DNA序列。
SEQ ID NO:9是TbRa19的DNA序列。
SEQ ID NO:10是TbRa24的DNA序列。
SEQ ID NO:11是TbRa26的DNA序列。
SEQ ID NO:12是TbRa28的DNA序列。
SEQ ID NO:13是TbRa29的DNA序列。
SEQ ID NO:14是TbRa2A的DNA序列。
SEQ ID NO:15是TbRa3的DNA序列。
SEQ ID NO:16是TbRa32的DNA序列。
SEQ ID NO:17是TbRa35的DNA序列。
SEQ ID NO:18是TbRa36的DNA序列。
SEQ ID NO:19是TbRa4的DNA序列。
SEQ ID NO:20是TbRa9的DNA序列。
SEQ ID NO:21是TbRaB的DNA序列。
SEQ ID NO:22是TbRaC的DNA序列。
SEQ ID NO:23是TbRaD的DNA序列。
SEQ ID NO:24是YYWCPG的DNA序列。
SEQ ID NO:25是AAMK的DNA序列。
SEQ ID NO:26是TbL-23的DNA序列。
SEQ ID NO:27是TbL-24的DNA序列。
SEQ ID NO:28是TbL-25的DNA序列。
SEQ ID NO:29是TbL-28的DNA序列。
SEQ ID NO:30是TbL-29的DNA序列。
SEQ ID NO:31是TbH-5的DNA序列。
SEQ ID NO:32是TbH-8的DNA序列。
SEQ ID NO:33是TbH-9的DNA序列。
SEQ ID NO:34是TbM-1的DNA序列。
SEQ ID NO:35是TbM-3的DNA序列。
SEQ ID NO:36是TbM-6的DNA序列。
SEQ ID NO:37是TbM-7的DNA序列。
SEQ ID NO:38是TbM-8的DNA序列。
SEQ ID NO:39是TbM-9的DNA序列。
SEQ ID NO:40是TbM-12的DNA序列。
SEQ ID NO:41是TbM-13的DNA序列。
SEQ ID NO:42是TbM-15的DNA序列。
SEQ ID NO:43是TbH-4的DNA序列。
SEQ ID NO:44是TbH-4-FWD的DNA序列。
SEQ ID NO:45是TbH-12的DNA序列。
SEQ ID NO:46是Tb38-1的DNA序列。
SEQ ID NO:47是Tb38-4的DNA序列。
SEQ ID NO:48是TbL-17的DNA序列。
SEQ ID NO:49是TbL-20的DNA序列。
SEQ ID NO:50是TbL-21的DNA序列。
SEQ ID NO:51是TbH-16的DNA序列。
SEQ ID NO:52是DPEP的DNA序列。
SEQ ID NO:53是DPEP的推导的氨基酸序列。
SEQ ID NO:54是DPV N-端抗原的蛋白质序列。
SEQ ID NO:55是AVGS N-端抗原的蛋白质序列。
SEQ ID NO:56是AAMK N-端抗原的蛋白质序列。
SEQ ID NO:57是YYWC N-端抗原的蛋白质序列。
SEQ ID NO:58是DIGS N-端抗原的蛋白质序列。
SEQ ID NO:59是AEES N-端抗原的蛋白质序列。
SEQ ID NO:60是DPEP N-端抗原的蛋白质序列。
SEQ ID NO:61是APKT N-端抗原的蛋白质序列。
SEQ ID NO:62是DPAS N-端抗原的蛋白质序列。
SEQ ID NO:63是TbM-1肽的推导的氨基酸序列。
SEQ ID NO:64是TbRa1的推导的氨基酸序列。
SEQ ID NO:65是TbRa10的推导的氨基酸序列。
SEQ ID NO:66是TbRa11的推导的氨基酸序列。
SEQ ID NO:67是TbRa12的推导的氨基酸序列。
SEQ ID NO:68是TbRa13的推导的氨基酸序列。
SEQ ID NO:69是TbRa16的推导的氨基酸序列。
SEQ ID NO:70是TbRa17的推导的氨基酸序列。
SEQ ID NO:71是TbRa18的推导的氨基酸序列。
SEQ ID NO:72是TbRa19的推导的氨基酸序列。
SEQ ID NO:73是TbRa24的推导的氨基酸序列。
SEQ ID NO:74是TbRa26的推导的氨基酸序列。
SEQ ID NO:75是TbRa28的推导的氨基酸序列。
SEQ ID NO:76是TbRa29的推导的氨基酸序列。
SEQ ID NO:77是TbRa2A的推导的氨基酸序列。
SEQ ID NO:78是TbRa3的推导的氨基酸序列。
SEQ ID NO:79是TbRa32的推导的氨基酸序列。
SEQ ID NO:80是TbRa35的推导的氨基酸序列。
SEQ ID NO:81是TbRa36的推导的氨基酸序列。
SEQ ID NO:82是TbRa4的推导的氨基酸序列。
SEQ ID NO:83是TbRa9的推导的氨基酸序列。
SEQ ID NO:84是TbRaB的推导的氨基酸序列。
SEQ ID NO:85是TbRaC的推导的氨基酸序列。
SEQ ID NO:86是TbRaD的推导的氨基酸序列。
SEQ ID NO:87是YYWCPG的推导的氨基酸序列。
SEQ ID NO:88是TbAAMK的推导的氨基酸序列。
SEQ ID NO:89是Tb38-1的推导的氨基酸序列。
SEQ ID NO:90是TbH-4的推导的氨基酸序列。
SEQ ID NO:91是TbH-8的推导的氨基酸序列。
SEQ ID NO:92是TbH-9的推导的氨基酸序列。
SEQ ID NO:93是TbH-12的推导的氨基酸序列。
SEQ ID NO:94是DPAS的DNA序列。
SEQ ID NO:95是DPAS的推导的氨基酸序列。
SEQ ID NO:96是DPV的DNA序列。
SEQ ID NO:97是DPV的推导的氨基酸序列。
SEQ ID NO:98是ESAT-6的DNA序列。
SEQ ID NO:99是ESAT-6的推导的氨基酸序列。
SEQ ID NO:100是TbH-8-2的DNA序列。
SEQ ID NO:101是TbH-9FL的DNA序列。
SEQ ID NO:102是TbH-9FL的推导的氨基酸序列。
SEQ ID NO:103是TbH-9-1的DNA序列。
SEQ ID NO:104是TbH-9-1的推导的氨基酸序列。
SEQ ID NO:105是TbH-9-4的DNA序列。
SEQ ID NO:106是TbH-9-4的推导的氨基酸序列。
SEQ ID NO:107是Tb38-1F2 IN的DNA序列。
SEQ ID NO:108是Tb38-1F2 RP的DNA序列。
SEQ ID NO:109是Tb37-FL的推导的氨基酸序列。
SEQ ID NO:110是Tb38-IN的推导的氨基酸序列。
SEQ ID NO:111是Tb38-1F3的DNA序列。
SEQ ID NO:112是Tb38-1F3的推导的氨基酸序列。
SEQ ID NO:113是Tb38-1F5的DNA序列。
SEQ ID NO:114是Tb38-1F6的DNA序列。
SEQ ID NO:115是DPV的推导的N-端氨基酸序列。
SEQ ID NO:116是AVGS的推导的N-端氨基酸序列。
SEQ ID NO:117是AAMK的推导的N-端氨基酸序列。
SEQ ID NO:118是YYWC的推导的N-端氨基酸序列。
SEQ ID NO:119是DIGS的推导的N-端氨基酸序列。
SEQ ID NO:120是AAES的推导的N-端氨基酸序列。
SEQ ID NO:121是DPEP的推导的N-端氨基酸序列。
SEQ ID NO:122是APKT的推导的N-端氨基酸序列。
SEQ ID NO:123是DPAS的推导的N-端氨基酸序列。
SEQ ID NO:124是DPPD N-端抗原的蛋白质序列。
SEQ ID NO:125-128是四个DPPD溴化氰片段的蛋白质序列。
SEQ ID NO:129是XDS抗原的N-端蛋白质序列。
SEQ ID NO:130是AGD抗原的N-端蛋白质序列。
SEQ ID NO:131是APE抗原的N-端蛋白质序列。
SEQ ID NO:132是XYI抗原的N-端蛋白质序列。
SEQ ID NO:133是TbH-29的DNA序列。
SEQ ID NO:134是TbH-30的DNA序列。
SEQ ID NO:135是TbH-32的DNA序列。
SEQ ID NO:136是TbH-33的DNA序列。
SEQ ID NO:137是TbH-29的预计氨基酸序列。
SEQ ID NO:138是TbH-30的预计氨基酸序列。
SEQ ID NO:139是TbH-32的预计氨基酸序列。
SEQ ID NO:140是TbH-33的预计氨基酸序列。
SEQ ID NO:141-146是用于制备含有TbRa3,38kD和Tb38-1的融合蛋白的PCR引物。
SEQ ID NO:147是含有TbRa3,38kD和Tb38-1的融合蛋白的DNA序列。
SEQ ID NO:148是含有TbRa3,38kD和Tb38-1的融合蛋白的氨基酸序列。
SEQ ID NO:149是结核分枝杆菌抗原38kD的DNA序列。
SEQ ID NO:150是结核分枝杆菌抗原38kD的氨基酸序列。
SEQ IDNO:151是XP14的DNA序列。
SEQ ID NO:152是XP24的DNA序列。
SEQ ID NO:153是XP31的DNA序列。
SEQ ID NO:154是XP32的5'DNA序列。
SEQ ID NO:155是XP32的3'DNA序列。
SEQ ID NO:156是XP14的预计氨基酸序列。
SEQ ID NO:157是XP14反向互补序列编码的预计的氨基酸序列。
SEQ ID NO:158是XP27的DNA序列。
SEQ ID NO:159是XP36的DNA序列。
SEQ ID NO:160是XP4的5'DNA序列。
SEQ ID NO:161是XP5的5'DNA序列。
SEQ ID NO:162是XP17的5'DNA序列。
SEQ ID NO:163是XP30的5'DNA序列。
SEQ ID NO:164是XP2的5'DNA序列。
SEQ ID NO:165是XP2的3'DNA序列。
SEQ ID NO:166是XP3的5'DNA序列。
SEQ ID NO:167是XP3的3'DNA序列。
SEQ ID NO:168是XP6的5'DNA序列。
SEQ ID NO:169是XP6的3'DNA序列。
SEQ ID NO:170是XP18的5'DNA序列。
SEQ ID NO:171是XP18的3'DNA序列。
SEQ ID NO:172是XP19的5'DNA序列。
SEQ ID NO:173是XP19的3'DNA序列。
SEQ ID NO:174是XP22的5'DNA 序列。
SEQ ID NO:175是XP22的3'DNA序列。
SEQ ID NO:176是XP25的5'DNA序列。
SEQ ID NO:177是XP25的3'DNA序列。
SEQ ID NO:178是TbH4-XPI的全长DNA序列。
SEQ ID NO:179是TbH4-XP1的预计氨基酸序列。
SEQ ID NO:180是TbH4-XP1的反向互补序列编码的预计的氨基酸序列。
SEQ ID NO:181是XP36编码的第一个预计的氨基酸序列。
SEQ ID NO:182是XP36编码的第二个预计的氨基酸序列。
SEQ ID NO:183是XP36的反向互补序列编码的预计的氨基酸序列。
SEQ ID NO:184是RDIF2的DNA序列。
SEQ ID NO:185是RDIF5的DNA序列。
SEQ ID NO:186是RDIF8的DNA序列。
SEQ ID NO:187是RDIF10的DNA序列。
SEQ ID NO:188是RDIF11的DNA序列。
SEQ ID NO:189是RDIF2的预计的氨基酸序列。
SEQ ID NO:190是RDIF5的预计的氨基酸序列。
SEQ ID NO:191是RDIF8的预计的氨基酸序列。
SEQ ID NO:192是RDIF10的预计的氨基酸序列。
SEQ ID NO:193是RDIF11的预计的氨基酸序列。
SEQ ID NO:194是RDIF12的5'DNA序列。
SEQ ID NO:195是RDIF12的3'DNA序列。
SEQ ID NO:196是RDIF7的DNA序列。
SEQ ID NO:197是RDIF7的预计的氨基酸序列。
SEQ ID NO:198是DIF2-1的DNA序列。
SEQ ID NO:199是DIF2-1的预计的氨基酸序列。
SEQ ID NO:200-207是用来制备含有TbRa3(38kD)、Tb38-1和DPEP的融合蛋白(后称TbF-2)的PCR引物。
SEQ ID NO:208是融合蛋白TbF-2的DNA序列。
SEQ ID NO:209是融合蛋白TbF-2的氨基酸序列。
SEQ ID NO:210是MO-1的5'DNA序列。
SEQ ID NO:211是MO-2的5'DNA序列。
SEQ ID NO:212是MO-4的5'DNA序列。
SEQ ID NO:213是MO-8的5'DNA序列。
SEQ ID NO:214是MO-9的5'DNA序列。
SEQ ID NO:215是MO-26的5'DNA序列。
SEQ ID NO:216是MO-28的5'DNA序列。
SEQ ID NO:217是MO-29的5'DNA序列。
SEQ ID NO:218是MO-30的5'DNA序列。
SEQ ID NO:219是MO-34的5'DNA序列。
SEQ ID NO:220是MO-35的5'DNA序列。
SEQ ID NO:221是MO-1的预计的氨基酸序列。
SEQ ID NO:222是MO-2的预计的氨基酸序列。
SEQ ID NO:223是MO-4的预计的氨基酸序列。
SEQ ID NO:224是MO-8的预计的氨基酸序列。
SEQ ID NO:225是MO-9的预计的氨基酸序列。
SEQ ID NO:226是MO-26的预计的氨基酸序列。
SEQ ID NO:227是MO-28的预计的氨基酸序列。
SEQ ID NO:228是MO-29的预计的氨基酸序列。
SEQ ID NO:229是MO-30的预计的氨基酸序列。
SEQ ID NO:230是MO-34的预计的氨基酸序列。
SEQ ID NO:231是MO-35的预计的氨基酸序列。
SEQ ID NO:232是MO-10的测定的DNA序列。
SEQ ID NO:233是MO-10的预计的氨基酸序列。
SEQ ID NO:234是MO-27的3'DNA序列。
SEQ ID NO:235是DPPD的全长DNA序列。
SEQ ID NO:236是DPPD的预计全长氨基酸序列。
SEQ ID NO:237是LSER-10的测得的5'cDNA序列。
SEQ ID NO:238是LSER-11的测得的5'cDNA序列。
SEQ ID NO:239是LSER-12的测得的5'cDNA序列。
SEQ ID NO:240是LSER-13的测得的5'cDNA序列。
SEQ ID NO:241是LSER-16的测得的5'cDNA序列。
SEQ ID NO:242是LSER-25的测得的5'cDNA序列。
SEQ ID NO:243是LSER-10的预计的氨基酸序列。
SEQ ID NO:244是LSER-12的预计的氨基酸序列。
SEQ ID NO:245是LSER-13的预计的氨基酸序列。
SEQ ID NO:246是LSER-16的预计的氨基酸序列。
SEQ ID NO:247是LSER-25的预计的氨基酸序列。
SEQ ID NO:248是LSER-18的测得的cDNA序列。
SEQ ID NO:249是LSER-23的测得的cDNA序列。
SEQ ID NO:250是LSER-24的测得的cDNA序列。
SEQ ID NO:251是LSER-27的测得的cDNA序列。
SEQ ID NO:252是LSER-18的预计的氨基酸序列。
SEQ ID NO:253是LSER-23的预计的氨基酸序列。
SEQ ID NO:254是LSER-24的预计的氨基酸序列。
SEQ ID NO:255是LSER-27的预计的氨基酸序列。
SEQ ID NO:256是测得的LSER-1的5'cDNA序列。
SEQ ID NO:257是测得的LSER-3的5'cDNA序列。
SEQ ID NO:258是测得的LSER-4的5'cDNA序列。
SEQ ID NO:259是测得的LSER-5的5'cDNA序列。
SEQ ID NO:260是测得的LSER-6的5'cDNA序列。
SEQ ID NO:261是测得的LSER-8的5'cDNA序列。
SEQ ID NO:262是测得的LSER-14的5'cDNA序列。
SEQ ID NO:263是测得的LSER-15的5'cDNA序列。
SEQ ID NO:264是测得的LSER-17的5'cDNA序列。
SEQ ID NO:265是测得的LSER-19的5'cDNA序列。
SEQ ID NO:266是测得的LSER-20的5'cDNA序列。
SEQ ID NO:267是测得的LSER-22的5'cDNA序列。
SEQ ID NO:268是测得的LSER-26的5'cDNA序列。
SEQ ID NO:269是测得的LSER-28的5'cDNA序列。
SEQ ID NO:270是测得的LSER-29的5'cDNA序列。
SEQ ID NO:271是测得的LSER-30的5'cDNA序列。
SEQ ID NO:272是LSER-1的预计的氨基酸序列。
SEQ ID NO:273是LSER-3的预计的氨基酸序列。
SEQ ID NO:274是LSER-5的预计的氨基酸序列。
SEQ ID NO:275是LSER-6的预计的氨基酸序列。
SEQ ID NO:276是LSER-8的预计的氨基酸序列。
SEQ ID NO:277是LSER-14的预计的氨基酸序列。
SEQ ID NO:278是LSER-15的预计的氨基酸序列。
SEQ ID NO:279是LSER-17的预计的氨基酸序列。
SEQ ID NO:280是LSER-19的预计的氨基酸序列。
SEQ ID NO:281是LSER-20的预计的氨基酸序列。
SEQ ID NO:282是LSER-22的预计的氨基酸序列。
SEQ ID NO:283是LSER-26的预计的氨基酸序列。
SEQ ID NO:284是LSER-28的预计的氨基酸序列。
SEQ ID NO:285是LSER-29的预计的氨基酸序列。
SEQ ID NO:286是LSER-30的预计的氨基酸序列。
SEQ ID NO:287是LSER-9的测得的cDNA序列。
SEQ ID NO:288是LSER-6的反向互补序列的测得的cDNA序列。
SEQ ID NO:289是LSER-6的反向互补序列的预计的氨基酸序列。
SEO ID NO:290是MO-12的测得的5'cDNA序列。
SEQ ID NO:291是MO-13的测得的5'cDNA序列。
SEQ ID NO:292是MO-19的测得的5'cDNA序列。
SEQ ID NO:293是MO-39的测得的5'cDNA序列。
SEQ ID NO:294是MO-12的预计的氨基酸序列。
SEO ID NO:295是MO-13的预计的氨基酸序列。
SEQ ID NO:296是MO-19的预计的氨基酸序列。
SEQ ID NO:297是MO-39的预计的氨基酸序列。
SEQ ID NO:298是Erdsn-1的测得的5'cDNA序列。
SEQ ID NO:299是Erdsn-2的测得的5'cDNA序列。
SEQ ID NO:300是Erdsn-4的测得的5'cDNA序列。
SEQ ID NO:301是Erdsn-5的测得的5'cDNA序列。
SEQ ID NO:302是Erdsn-6的测得的5'cDNA序列。
SEQ ID NO:303是Erdsn-7的测得的5'cDNA序列。
SEQ ID NO:304是Erdsn-8的测得的5'cDNA序列。
SEQ ID NO:305是Erdsn-9的测得的5'cDNA序列。
SEQ ID NO:306是Erdsn-10的测得的5'cDNA序列。
SEQ ID NO:307是Erdsn-12的测得的5'cDNA序列。
SEQ ID NO:308是Erdsn-13的测得的5'cDNA序列。
SEQ ID NO:309是Erdsn-14的测得的5'cDNA序列。
SEQ ID NO:310是Erdsn-15的测得的5'cDNA序列。
SEQ ID NO:311是Erdsn-16的测得的5'cDNA序列。
SEQ ID NO:312是Erdsn-17的测得的5'cDNA序列。
SEQ ID NO:313是Erdsn-18的测得的5'cDNA序列。
SEQ ID NO:314是Erdsn-21的测得的5'cDNA序列。
SEQ ID NO:315是Erdsn-22的测得的5'cDNA序列。
SEQ ID NO:316是Erdsn-23的测得的5'cDNA序列。
SEQ ID NO:317是Erdsn-25的测得的5'cDNA序列。
SEQ ID NO:318是Erdsn-1的测得的3'cDNA序列。
SEQ ID NO:319是Erdsn-2的测得的3'cDNA序列。
SEQ ID NO:320是Erdsn-4的测得的3'cDNA序列。
SEQ ID NO:321是Erdsn-5的测得的3'cDNA序列。
SEQ ID NO:322是Erdsn-7的测得的3'cDNA序列。
SEQ ID NO:323是Erdsn-8的测得的3'cDNA序列。
SEQ ID NO:324是Erdsn-9的测得的3'cDNA序列。
SEQ ID NO:325是Erdsn-10的测得的3'cDNA序列。
SEQ ID NO:326是Erdsn-12的测得的3'cDNA序列。
SEQ ID NO:327是Erdsn-13的测得的3'cDNA序列。
SEQ ID NO:328是Erdsn-14的测得的3'cDNA序列。
SEQ ID NO:329是Erdsn-15的测得的3'cDNA序列。
SEQ ID NO:330是Erdsn-16的测得的3'cDNA序列。
SEQ ID NO:331是Erdsn-17的测得的3'cDNA序列。
SEQ ID NO:332是Erdsn-18的测得的3'cDNA序列。
SEQ ID NO:333是Erdsn-21的测得的3'cDNA序列。
SEQ ID NO:334是Erdsn-22的测得的3'cDNA序列。
SEQ ID NO:335是Erdsn-23的测得的3'cDNA序列。
SEQ ID NO:336是Erdsn-25的测得的3'cDNA序列。
SEQ ID NO:337是Erdsn-24的测得的cDNA序列。
SEQ ID NO:338是结核分枝杆菌85b前体类似物的测得的氨基酸序列。
SEQ ID NO:339是spot1的测得的氨基酸序列。
SEQ ID NO:340是spot2的测得的氨基酸序列。
SEQ ID NO:341是spot2的测得的氨基酸序列。
SEQ ID NO:342是spot4的测得的氨基酸序列。
SEQ ID NO:343是引物PDM-157的序列。
SEQ ID NO:344是引物PDM-160的序列。
SEQ ID NO:345是融合蛋白TbF-6的DNA序列。
SEQ ID NO:346是融合蛋白TbF-6的氨基酸序列。
SEQ ID NO:347是引物PDM-176的序列。
SEQ ID NO:348是引物PDM-175的序列。
SEQ ID NO:349是融合蛋白TbF-8的DNA序列。
SEQ ID NO:350是融合蛋白TbF-8的氨基酸序列。
发明详述
如上所述,本发明总地涉及诊断结核病的组合物和方法。本发明的组合物包括多肽,该多肽包含结核分枝杆菌抗原的至少一个抗原性部分,或仅仅在保守性置换和/或修饰中有所不同的该抗原的变体。在本发明范围内的多肽包括,但不局限于,可溶性结核分枝杆菌抗原。“可溶性结核分枝杆菌抗原”是已知结核分枝杆菌来源的蛋白质,它存在于结核分枝杆菌培养物渗滤液中。本文所用的术语“多肽”包括具有任何长度的氨基酸链,包括全长蛋白质(即抗原),其中氨基酸残基通过共价肽键连接。因此,包含上述抗原之一的抗原性部分的多肽可以全部由抗原性部分组成,或可含有附加的序列。附加的序列可以衍生自天然的结核分枝杆菌抗原,或可以是异源的,这些序列可能(但不必)有抗原性。
一个抗原的“抗原性部分”(可能是可溶或不可溶的)是能与结核分枝杆菌感染个体获得的血清反应的部分(即在本文描述的代表性ELISA试验中,用受感染个体的血清获得的吸收值读数比未感染个体获得的血清的吸收值高出至少3个标准差)。“结核分枝杆菌感染的个体”是感染了结核分枝杆菌的人(例如,对PPD的皮内测试反应直径至少为0.5厘米)。受感染的个体可能显示出结核病症状,或可能没有疾病症状。通常可单用或合用含有本文描述的一种或多种结核分枝杆菌抗原的至少抗原部分的多肽来检测患者的结核病。
本发明的组合物和方法还包括上述多肽以及DNA分子的变体。本文所用的多肽“变体”是仅仅在保守性置换和/或修饰中与所述多肽不同的多肽,从而保留了多肽的治疗性、抗原性和/或免疫原性性能。多肽变体宜表现出与鉴定的多肽至少约70%、更佳的约90%、最佳的约95%的相同性。对于具有免疫反应性性能的多肽,可通过修饰上述多肽之一的氨基酸序列并评价修饰后的多肽的免疫反应性来鉴定这些变体。对于用来产生诊断结合试剂的多肽,可通过评价经修饰的多肽产生检测结核病存在与否的抗体的能力来鉴定变体。这些修饰的序列可用例如本文描述的典型程序来制备和测试。
本文所用的“保守性取代”是一个氨基酸被具有相似性质的另一个氨基酸取代,因而肽化学领域技术人员可以预计到该多肽的二级结构和亲水性基本不变。通常,下列氨基酸组代表了保守性变化:(1)ala,pro,gly,glu,asp,gln,asn,set,thr;(2)cys,ser,tyr,thr;(3)val,ile,leu,met,ala,phe;(4)lys,arg,his;和(5)phe,tyr,trp,his。
变体可以另外含有其它修饰,包括对多肽的抗原性、二级结构和亲水性性能影响很小的氨基酸缺失或增加。例如,多肽可以和在共同翻译时或翻译后指导蛋白质转移的蛋白质N端的信号(或前导)序列偶联。多肽还可与接头或为便于合成、纯化或鉴定多肽(例如poly-His)或与增强多肽与固相载体结合的其它序列偶联。例如,多肽可以和免疫球蛋白Fc区偶联。
核苷酸“变体”是与所述核苷酸序列不同的、有一个或多个核苷酸缺失、取代或增加的序列。这种修饰易用标准诱变技术(例如Adelman等人(DNA,2:183,1983)指出的寡核苷酸定点特异性诱变)来导入。核苷酸变体可以是天然存在的等位基因变体或非天然存在的变体。核苷酸序列变体宜表现出与所述序列的相同性至少约为70%,更佳的至少约为80%,最佳的约为90%。这些核苷酸序列变体通常会在严谨条件下与所述的核苷酸序列杂交。本文所用的“严谨条件”指在6×SSC,0.2%SDS的溶液中预洗涤;在65℃、6×SSC、0.2%SDS中杂交过夜;然后在1×SSC、0.1%SDS中65℃洗两次各30分钟,并在0.2×SSC、0.1%SDS中65℃洗涤两次各30分钟。
在一个有关的方面,公开了组合或融合多肽。“融合多肽”是一种多肽,它包含至少一个上述抗原性部分和一个或多个附加的抗原性结核分枝杆菌序列,它们通过肽键连接成一条氨基酸链。序列可以直接连接(即没有介入的氨基酸)或可通过接头序列(例如Gly-Cys-Gly)连接,而接头不会显著减少组分多肽的抗原性。
通常,结核分枝杆菌抗原以及编码这些抗原的DNA序列可以用各种不同方法的任何一种来制备。例如,可以用本领域普通技术人员已知的程序(包括阴离子交换和反相层析)从结核分枝杆菌培养渗滤液中分离出可溶性抗原。然后评价纯化抗原的所需要的性质,例如与结核分枝杆菌感染个体血清反应的能力。这些筛选可用本文描述的代表性方法来进行。然后,可用传统的Edman化学方法对抗原作部分测序。见Edman和Berg,Eur.J.Biochem.80:116-132,1967。
抗原还可用编码该抗原的DNA序列重组产生,将该序列插入表达载体内并在合适的宿主内表达。可用针对可溶性结核分枝杆菌抗原的特异性抗血清(例如家兔)筛选合适的结核分枝杆菌表达文库来分离编码可溶性抗原的DNA分子。编码可能是或不是可溶的抗原的DNA序列可通过这样的方法来鉴定:用感染过结核分枝杆菌的患者血清来筛选合适的结核分枝杆菌基因组或cDNA表达文库。这些筛选的进行通常可采用本领域熟知的技术(例如Sambrook等人《分子克隆实验指南》,Cold SpringHarbor Laboratory,Cold Spring Harbor,NY,1989中描述的那些技术)。
编码可溶性抗原的DNA序列还可这样获得:在合适的结核分枝杆菌cDNA或基因组DNA文库中,筛选能与衍生自分离的可溶性抗原的部分氨基酸序列的简并寡核苷酸杂交的DNA序列。可以设计并合成用于该筛选的简并寡核苷酸序列,筛选可以如Sambrook等人《分子克隆实验指南》,Cold Spring Harbor Laboratory,Cold SpringHarbor,NY(及其中引用的文献)中描述的那样进行。也可采用聚合酶链反应(PCR),采用本领域熟知方法的上述寡核苷酸,以从cDNA或基因组文库中分离出核酸探针。然后可以用分离的探针来进行文库筛选。
不论制备方法如何,本文描述的抗原均具有“抗原性”。更具体地说,抗原具有与结核分枝杆菌感染个体血清反应的能力。反应性例如可用本文描述的代表性ELISA试验来评价,其中用感染个体血清获得的吸收值读数比用未感染个体的血清获得的吸收值高出至少3个标准偏差方认为是阳性。
结核分枝杆菌抗原的抗原性部分可用熟知的技术(例如在Paul《基础免疫学》第3版,Raven出版社,1993,243-247及其引用的参考文献中归纳的那些技术)来制备和鉴定。这些技术包括筛选天然抗原多肽部分的抗原特性。本文描述的代表性ELISA通常可用于这些筛选。多肽的抗原性部分是在这些代表性试验中产生的信号与全长抗原所产生的信号基本上相似的部分。换句话说,结核分枝杆菌抗原的抗原性部分在本文描述的ELISA模型中产生的信号是全长抗原所诱导的信号的至少大约20%,较佳的大约100%。
结核分枝杆菌抗原的部分和其它变体可用合成或重组方法产生。可以用本领域熟知的技术来产生少于约100个氨基酸、通常少于约50个氨基酸的合成多肽。例如,这些多肽可以用任何市售的固相技术来合成,这些技术例如是Merrifield固相合成方法,其中向生长的氨基酸链依次增加氨基酸。见Merrifield,J.Am.Chem.Soc.85:2149-2146,1963。自动化合成多肽的装置购自供应商如Applied BioSystems,Inc.,Foster City,CA,并可根据生产商说明书来操作。天然抗原的变体通常可用标准的诱变技术(如寡核苷酸定点特异性诱变)来制备。还可用标准技术除去DNA序列的部分,以制备截短的多肽。
含有天然抗原的一部分和/或变体的重组多肽易用本领域普通技术人员熟知的各种技术从编码该多肽的DNA序列制得。例如,可以首先用市售的滤膜来浓缩将重组蛋白分泌入培养基的合适的宿主/载体系统的上清液。浓缩后,可将浓缩液上样于合适的纯化基质如亲和基质或离子交换树脂上。最后,可以采用一步或多步反向HPLC步骤来进一步纯化重组蛋白。
本领域普通技术人员已知的各种表达载体的任何一种可用来如本文所述的那样表达重组多肽。表达可在已经转化或转染了含有编码重组多肽之DNA分子的表达载体的任何合适的宿主细胞中进行。合适的宿主细胞包括原核生物、酵母和较高等真核细胞。较佳的,所用的宿主细胞是大肠杆菌、酵母或哺乳动物细胞系,如COS或CHO。以这种方式表达的DNA序列可编码天然存在的抗原、天然存在的抗原的一部分或它们的其它变体。
通常,不论制备方法如何,本文公开的多肽以基本纯的形式制得。较佳的,多肽至少为大约80%纯,更佳的至少大约90%纯,最佳的至少大约99%纯。然而,为了用于本文描述的方法,这些基本上纯的多肽可以组合。
在某些具体的实施方案中,本发明公开了多肽,该多肽含有可溶性结核分枝杆菌抗原(或该抗原的变体)的至少一个抗原性部分,其中抗原具有下列N-端序列中的一个:
(a)Asp-Pro-Val-Asp-Ala-Val-Ile-Asn-Thr-Thr-Cys-Asn-Tyr-Gly-
Gln-Val-Val-Ala-Ala-Leu(SEQ ID NO:115);
(b)Ala-Val-Glu-Ser-Gly-Met-Leu-Ala-Leu-Gly-Thr-Pro-Ala-Pro-
Ser(SEQ ID NO:116);
(c)Ala-Ala-Met-Lys-Pro-Arg-Thr-Gly-Asp-Gly-Pro-Leu-Glu-Ala-
Ala-Lys-Glu-Gly-Arg(SEQ ID NO:117);
(d)Tyr-Tyr-Trp-Cys-Pro-Gly-Gln-Pro-Phe-Asp-Pro-Ala-Trp-Gly-
Pro(SEQ ID NO:118);
(e)Asp-Ile-Gly-Ser-Glu-Ser-Thr-Glu-Asp-Gln-Gln-Xaa-Ala-Val
(SEQ ID NO:119);
(f)Ala-Glu-Glu-Ser-Ile-Ser-Thr-Xaa-Glu-Xaa-Ile-Val-Pro(SEQ ID
NO:120);
(g)Asp-Pro-Glu-Pro-Ala-Pro-Pro-Val-Pro-Thr-Thr-Ala-Ala-Ser-
Pro-Pro-Ser(SEQ ID NO:121);
(h)Ala-Pro-Lys-Thr-Tyr-Xaa-Glu-Glu-Leu-Lys-Gly-Thr-Asp-Thr-
Gly(SEQ ID NO:122);
(i)Asp-Pro-Ala-Ser-Ala-Pro-Asp-Val-Pro-Thr-Ala-Ala-Gln-Gln-
Thr-Ser-Leu-Leu-Asn-Ser-Leu-Ala-Asp-Pro-Asn-Val-Ser-Phe-
Ala-Asn(SEQ ID NO:123);
(j)Xaa-Asp-Ser-Glu-Lys-Ser-Ala-Thr-Ile-Lys-Val-Thr-Asp-Ala-
Ser,(SEQ ID NO:129)
(k)Ala-Gly-Asp-Thr-Xaa-Ile-Tyr-Ile-Val-Gly-Asn-Leu-Thr-Ala-
Asp;(SEQ ID NO:130)或
(l)Ala-Pro-Glu-Ser-Gly-Ala-Gly-Leu-Gly-Gly-Thr-Val-Gln-Ala-
Gly;(SEQ ID NO:131)
其中Xaa可以是任何氨基酸,较佳的是半胱氨酸残基。SEQ ID NO:52中提供了编码上述(g)确定的抗原的DNA序列,它的推导的氨基酸序列提供在SEQ ID NO:53中。编码上述(a)中确定的抗原的DNA序列提供在SEQ ID NO:96中;其推导的氨基酸序列提供在SEQID NO:97中。对应于上述(d)抗原的DNA序列提供在SEQ ID NO:24中,对应于抗原(c)的DNA序列提供在SEQ ID NO:25中,对应于抗原(Ⅰ)的DNA序列公开在SEQ ID NO:94中,其推导的氨基酸序列提供在SEQ ID NO:95中。
在另一个具体的实施方案中,本发明公开了多肽,它包含具有下列N-端序列之一的结核分枝杆菌抗原的至少一个免疫原性部分,或仅仅在保守性置换和/或修饰方面不同的它的变体:
(m)Xaa-Tyr-Ile-Ala-Tyr-Xaa-Thr-Thr-Ala-Gly-Ile-Val-Prp-Gly-Lys-
Ile-Asn-Val-His-Leu-Val;(SEQ ID NO:132)或
(n)Asp-Pro-Pro-Asp-Pro-His-Gln-Xaa-Asp-Met-Thr-Lys-Gly-Tyr-
Tyr-Pro-Gly-Gly-Arg-Arg-Xaa-Phe;(SEQ ID NO:124)
其中Xaa可以是任何氨基酸,较佳的是半胱氨酸残基。编码上述(n)抗原的DNA序列提供在SEQ ID NO:235中,其对应的预计的全长氨基酸序列提供在SEQ ID NO:236中。
在其它具体的实施方案中,本发明公开的多肽包含了可溶性结核分枝杆菌抗原(或该抗原的变体)的至少一个抗原性部分,它包含的一个或多个氨基酸序列由(a)SEQID NO:1、2、4-10、13-25、52、94和96的DNA序列、(b)这些DNA序列的互补序列、或(c)与(a)或(b)中序列基本上同源的DNA序列来编码。
在另一个具体的实施方案中,本发明公开的多肽包含结核分枝杆菌抗原(或该抗原的变体)的至少一个抗原性部分,它可以是可溶的或不可溶的,它包含的一个或多个氨基酸序列由(a)SEQ ID NO:26-51、133、134、158-178、184-188、194-196、198、210-220、232、234、235、237-242、248-251、256-271、287、288、290-293和298-337的DNA序列、(b)这些DNA序列的互补序列或(c)与(a)或(b)中的序列基本上同源的DNA序列来编码。
在一个有关的方面,本发明提供了融合蛋白,它包含第一种和第二种本发明多肽,或者,包含本发明的一个多肽以及一种已知的结核分枝杆菌抗原,如Andersen和Hansen,Infect.Immun.57:2481-2488,1989中描述的38kD抗原(Genbank登录号No.M30046)或ESAT-6(SEQ ID NO:98和99),本发明还提供了这些融合蛋白的变体。本发明的融合蛋白还可以包括在第一和第二多肽之间的接头肽。
编码本发明的融合蛋白的DNA序列可用已知的重组DNA技术来构建,将编码第一和第二多肽的分开的DNA序列装配到合适的表达载体中。编码第一多肽的DNA序列的3'端可通过或不通过接头肽与编码第二多肽的DNA序列的5'端连接,从而使序列的读框同相,使两个DNA序列的mRNA翻译成一个融合蛋白,它保留了第一和第二多肽生物学活性。
可用肽接头序列来使第一和第二多肽分开足够的距离,以确保每个多肽折叠成其二级和三级结构。这样的肽接头序列可用本领域熟知的标准技术插入融合蛋白内。合适的肽接头序列可根据下列因素来选择:(1)它们适应灵活伸展构型的能力;(2)它们不能采用与第一和第二多肽上的功能性表位相互作用的二级结构;以及(3)缺少可能会与多肽功能性表位反应的疏水性或带电残基。较佳的肽接头序列含有Gly,Asn和Ser残基。接头序列中也可采用其它接近中性的氨基酸,如Thr和Ala。可用作接头的氨基酸序列包括在Maratea等人Gene 40:39-46,1985;Murphy等人Proc.Natl.Acad.Sci.USA 83:8258-8562,1986;美国专利No.4,935,233和美国专利No.4,751,180中描述的那些。接头序列的长度可以从1到50个氨基酸。当第一和第二多肽具有可用来隔开功能性结构域并防止空间位阻的非必需的N-端氨基酸区域时,不需要肽接头序列。
另一方面,本发明提供了用上述多肽诊断结核病的方法。在该方面,提供了单用或合用上述一种或多种多肽来检测生物样品中结核分枝杆菌感染的方法。在采用多个多肽的实施方案中,可以包括除本文具体描述的那些多肽以外的多肽,例如Andersen和Hansen,Infect.Immun.57:2481-2488,1989中描述的38kD抗原。如本文所用的,“生物样品”是从患者获得的任何含抗体的样品。较佳的,样品是全血、痰液、血清、血浆、唾液、脑脊液和尿液。更佳的,样品是从患者或血液供应商处得到的血液、血清或血浆样品。将多肽用于如下所述的试验中,以确定样品中是否存在针对这些多肽的抗体(相对于预定的截断值(cut-off))。这些抗体的存在表明先前已受分枝杆菌抗原致敏,这可能是结核病的指针。
在采用多个多肽的实施方案中,所用的多肽宜是互相补充的(即,一个多肽组分检测样品中的感染,而感染不被另一多肽组分检测到)。互相补充的多肽通常可通过单独用每种多肽评价已知感染了结核分枝杆菌的一系列患者的血清样品来确定。在确定了用每种多肽使那些样品测试呈阳性(如下所述)后,可以配制两种或多种多肽的组合,该组合能检测大多数或所有测试样品中的感染。这些多肽是互相补充的。例如,约25-30%的结核病感染个体的血清对任何单个蛋白(如上述的38kD抗原)的抗体呈阴性。因此,可以将互相补充的多肽与38kD抗原组合使用,以改进诊断试验的灵敏度。
本领域普通技术人员己知有各种试验采用一种或多种多肽来检测样品中的抗体。例如参见,Harlow和Lane《抗体:实验手册》,Cold Spring Harbor Laboratory,1988,该书纳入本文作参考。在一个较佳的实施方案中,试验涉及采用固定在固相载体上的多肽来结合并取出样品中的抗体。然后可用含有报道基团的检测剂检测该结合的抗体。合适的检测剂包括与抗体/多肽复合物以及标记了报道基团的游离多肽(例如在半竞争性试验中)结合的抗体。或者,可以采用竞争性试验,其中结合多肽的抗体标记有报道基团,在与样品中的抗原培育后和固定的抗原结合。样品组分抑制标记抗体与多肽结合的程度表明了样品与固定多肽的反应性。
固相载体可以是本领域普通技术人员己知的能连接抗原的固体材料。例如,固相载体可以是微量滴定板中的测试孔或硝酸纤维素膜或其它合适的膜。另外,载体可以是珠粒或圆片,如玻璃、玻璃纤维、乳胶或塑料材料如聚苯乙烯或聚氯乙烯。载体还可以是磁性颗粒或光纤传感器,例如在美国专利No.5,359,681中公开的那些。
多肽可用本领域普通技术人员已知的各种技术与固相载体结合,这些技术在专利和科学文献中详细的描述。在本发明的内容中,术语“结合(的)”既指非共价缔合,如吸附,也指共价连接(抗原和载体上的官能团之间可以直接连接或可通过交联剂连接)。通过吸附到微量滴定板中孔内或膜上的结合是较佳的。在这样的情况下,吸附可这样进行:使合适缓冲液中的多肽与固相载体接触适当长的时间。接触时间随温度而异,但通常在大约1小时和1天之间。通常,塑料微量滴定板(如聚苯乙烯或聚氯乙烯)的孔与大约10纳克至1微克(较佳的约100ng)的多肽接触就足以结合足量的抗原。
多肽与固相载体的共价连接通常这样实现:首先使载体与能和载体以及多肽上的官能团(如羟基或氨基)反应的双功能试剂反应。例如,利用苯醌,或通过载体上的醛基团与多肽上的胺以及活性氢缩合,多肽可以与具有合适聚合物涂层的载体结合(例如参见,Pierce Immunotechnology Catalog and Handkbook,1991,A12-A13)。
在某些实施方案中,试验是酶联免疫吸附试验(ELISA)。该试验可这样进行:首先使已经固定在固相载体(通常是微量滴定板的孔)上的多肽抗原与样品接触,从而使样品中针对多肽的抗体与固定的多肽结合。然后,从固定的多肽中除去未结合的样品,加入能结合固定的抗体-多肽复合物的检测剂。然后,用适合特定检测剂的方法测定保持与固相载体结合的检测剂的量。
更具体地说,如上所述一旦多肽固定在载体上,通常要封闭载体上其余的蛋白质结合部位。本领域普通技术人员已知的任何合适的封闭剂,如牛血清白蛋白或吐温20TM(Sigma Chemical Co.,St.Louis,MO)均可采用。然后,使固定的多肽与样品培育,使抗体与抗原结合。在培育前,样品可用合适的稀释剂(如磷酸盐缓冲液)稀释。通常,合适的接触时间(即培育时间)是足以检测感染了结核分枝杆菌的样品中存在抗体的时间。较佳的,接触时间足以达到结合与未结合的抗体之间达到平衡的至少95%的结合水平。本领域普通技术人员会认识到,达到平衡所需的时间易通过测定一段时间内产生的结合水平来确定。在室温下,大约30分钟的培育时间就足够了。
然后用合适的缓冲液(如含有0.1%吐温20TM的PBS)洗涤固相载体,除去未结合的样品。然后,可将检测剂加到固相载体上。合适的检测剂是能结合固定的抗体-多肽复合物并能用本领域技术人员已知的各种方法检测的化合物。较佳的,检测剂含有与报道基团偶联的结合剂(例如蛋白质A、蛋白质G、免疫球蛋白、凝集素或游离的抗原)。较佳的报道基团包括酶(如辣根过氧化物酶)、底物、辅因子、抑制剂、染料、放射性核素、发光基团、荧光基团、生物素和胶体颗粒,如胶体金和硒。结合剂和报道基团的偶联可用本领域普通技术人员已知的标准方法来实现。偶联了各种报道基团的常见结合剂还可购自许多商业来源(例如Zymed Laboratories,SanFrancisco,CA和Pierce,Rockford,IL)。
然后使检测剂和固定的抗体-多肽复合物培育足够长的时间,以检测结合的抗体。合适的时间通常可根据生产商说明书来确定,或通过在一段时间内测定结合水平来确定。然后除去未结合的检测剂,用报道基团检测结合的检测剂。用来检测报道基团的方法取决于报道基团的性质。对于放射活性基团而言,闪烁计数或放射自显影方法通常是合适的。光谱方法可用来检测染料、发光基团和荧光基团。生物素可用偶联了不同报道基团(通常是放射活性基团或荧光基团或酶)的亲和素来检测。酶-报道基团通常可通过加入底物来检测(通常进行特定的时间),然后用光谱法或其它方法分析反应产物。
为了测定样品中是否存在抗结核分枝杆菌抗体,通常将保持结合于固相载体的报道基团检测到的信号与对应于预定截断值的信号比较。在一个较佳的实施方案中,截断值是固定抗原与未感染患者的样品培育后所得的平均信号。通常,认为产生信号比预定截断值高3个标准偏差的样品为结核病阳性。在另一个较佳的实施方案中,根据Sackett等人《临床流行病学:临床医学基础科学》Little Brown and Co.,1985,106-107页中的方法,用接受者-操作者曲线(Receiver Operator Curve)确定截断值(cut-off)。简言之,在该实施方案中,截断值可从对应于诊断测试结果每个可能的截断值的数对真阳性率(即敏感性)和假阳性率(100%特异性)的曲线来确定。曲线上最接近左上角的截断值(即围住最大面积的数值)是最准确的截断值,经该方法测得产生信号高于截断值的样品可认为是阳性。或者,截断值可以沿曲线向左侧移动,以最大程度地减小假阳性率,或向右侧移动,以最大程度地减小假阴性率。通常,经该方法测得产生信号高于截断值的样品认为是结核病阳性。
在有关的实施方案中,试验以快速流穿或试条形式进行,其中抗原被固定在膜(如硝酸纤维素膜)上。在流穿试验中,当样品通过膜时,样品内的抗体与固定的多肽结合。然后,当含有检测剂的溶液流动通过膜时,检测剂(例如蛋白质A-胶体金)与抗体-多肽复合物结合。然后如上所述检测结合的检测剂。在试条形式中,将结合了多肽的膜的一端浸在含有样品的溶液内。样品沿膜迁移通过含有检测剂的区域,并迁移至固定多肽的区域。检测剂在多肽处浓集表明样品中存在抗结核分枝杆菌抗体。通常检测剂在该部位的浓缩产生了一种可用肉眼观察的图案(如线条)。不存在该图案则表示阴性结果。通常,当生物样品含有的抗体水平足以在上述ELISA中产生阳性信号时,选择固定在膜上的多肽的量,以产生肉眼可区分的图案。较佳的,固定在膜上的多肽量范围为25ng至约1微克,更佳的约50ng至500ng。这些测试通常可用非常少量(如一滴)的患者血清或血液来进行。
当然,还有其它许多试验方案也适合与本发明的多肽一起使用。上述描述只是列举性的。
另一方面,本发明提供了针对本发明多肽的抗体。抗体可用本领域普通技术人员已知的各种技术制备,例如参见Harlow和Lane《抗体:实验手册》,Cold SpringHarbor Laboratory,1988。在一种这样的技术中,最初将包含抗原性多肽的免疫原注入各种哺乳动物(如小鼠、大鼠、家兔、绵羊和山羊)。在该步骤中,本发明的多肽可不经修饰作为免疫原。另外,特别是对于较小的多肽,如果将该多肽与载体蛋白(如牛血清白蛋白或匙孔血蓝蛋白)连接,则可能会引发超级免疫应答。将免疫原注入动物宿主,较佳的是根据预定的方案插入一次或多次强化免疫,然后定期对动物取血。然后可以通过例如亲和层析用偶联于合适的固相载体的多肽,将对多肽有特异性的多克隆抗体从这些抗血清中纯化出来。
对感兴趣的抗原性多肽的特异性单克隆抗体可以用Kohler和Milestein,Eur.J.Immunol.6:511-519,1976的技术及其改进方法制得。简言之,这些方法涉及制备能产生具有所需特异性(即与感兴趣多肽的反应性)的抗体的无限增殖细胞系。这些细胞系可以从如上所述免疫的动物的脾细胞产生。然后使脾细胞无限增殖,例如通过与骨髓瘤细胞融合伴侣(较佳的是与受免疫动物同系)融合。可以采用各种融合技术。例如,可以用非离子洗涤剂使脾细胞和骨髓瘤细胞结合数分钟,然后低密度接种在支持杂交细胞生长但不支持骨髓瘤细胞生长的选择培养基上。较佳的选择技术采用HAT(次黄嘌呤、氨基蝶呤、胸苷)选择。足够长的时间(通常约1-2周)后,观察杂交菌落。选出单菌落,测试其对多肽的结合活性。优选出具有高反应性和特异性的杂交瘤。
单克隆抗体可以从生长的杂交瘤集落的上清液分离得到。另外,可以采用各种技术来提高产量,例如将杂交瘤细胞系注射入合适脊椎动物宿主(如小鼠)的腹膜腔内。然后可以从腹水或血液中收获单克隆抗体。污染物可用常规技术(如层析、凝胶过滤、沉淀和抽提)从抗体中除去。本发明的多肽可用于纯化方法,例如亲和层析步骤中。
抗体可用于诊断试验,用类似于上文详细描述的试验以及本领域技术人员熟知的其它技术检测结核分枝杆菌抗原的存在,从而提供了检测患者结核分枝杆菌感染的方法。
本发明的诊断剂还包含编码上述一种或多种多肽或其一个或多个部分的DNA序列。例如,在以聚合酶链反应(PCR)为基础的试验中可以用至少两个寡核苷酸引物来扩增衍生自生物样品的结核分枝杆菌特异性cDNA,其中至少一个寡核苷酸引物对编码本发明多肽的DNA分子有特异性。然后,用本领域熟知的技术(如凝胶电泳)检测所扩增cDNA的存在。类似地,对编码本发明多肽的DNA分子有特异性的寡核苷酸探针可用于杂交试验,以检测生物样品中本发明多肽的存在。
本文所用的术语“对DNA分子有特异性的寡核苷酸/探针”指寡核苷酸序列与所涉及的DNA分子有至少约80%的相同性,较佳的有至少约90%的相同性,更佳的有至少约95%的相同性。可用于本发明诊断方法的寡核苷酸引物和/或探针宜具有至少大约10-40个核苷酸。在一个较佳的实施方案中,寡核苷酸引物包含编码本文公开的多肽之一的DNA分子的至少约10个连续的核苷酸。较佳的,用于本发明诊断方法的寡核苷酸探针包含可编码本文公开的多肽之一的DNA分子的至少大约15个毗连的寡核苷酸。PCR为基础的试验以及杂交试验的技术均是本领域所熟知的(例如参见,Mullis等人,Ibid;Ehrlich,Ibid)。因此,引物或探针可用来检测生物样品中的结核分枝杆菌特异性序列。包含上述寡核苷酸序列的DNA探针或引物可单独使用,或相互组合使用,或与以前鉴定的序列(如上述38kD抗原的序列)合用。
提供下列实施例是为了进行描述,而不是为了限制。
实施例
实施例1
从结核分枝杆菌培养渗滤液纯化多肽并特性分析
本实施例描述了从培养渗滤液制备结核分枝杆菌可溶性多肽。除非另有描述,下列实施例中的所有百分数均为重量/体积。
将结核分枝杆菌(H37Ra,ATCC No.25177或H37Rv,ATCC No.25618)37℃培养在无菌GAS培养基中14天。然后,将培养基通过0.45μ滤膜真空过滤到无菌的2.5升瓶内(留下细胞团块)。然后将培养基通过0.2μ滤膜过滤到无菌的4升瓶内。然后在培养渗滤液中加入NaN3至浓度为0.04%。然后将瓶置于4℃冷藏室内。
对培养渗滤液进行浓缩,将滤液置于12升经高压蒸气灭菌的贮器内,将滤液加入400毫升Amicon搅拌装置内,该装置用乙醇清洗过并含有10000kDa MWCO膜。用氮气维持压力在60psi。该步骤将12升的体积减少至大约50毫升。
然后用8000kDa MWCO纤维素酯膜将培养渗滤液透析入0.1%碳酸氢铵内,更换碳酸氢铵溶液两次。然后用市售的BCA试验试剂(Pierce,Rockford,IL)测定蛋白质浓度。
然后将透析的培养渗滤液冻干,将多肽重悬于蒸馏水中。然后用0.01mM 1,3双[三(羟甲基)-甲氨基]丙烷(pH7.5)(Bis-Tris丙烷缓冲液)(这是阴离子交换层析的初始条件)对多肽透析。在经0.01mM Bis-Tris丙烷缓冲液(pH7.5)平衡的POROS 146Ⅱ Q/M阴离子交换柱(4.6mm×100mm,Perseptive Biosystems,Framingham,MA)上,用凝胶灌流(profusion)层析进行分级。用线性0-0.5M NaCl梯度将多肽洗脱到上述缓冲液系统内。在220nm波长下监测柱洗脱液。
用蒸馏水对离子交换柱洗脱下的多肽合并物透析,并冻干。将得到的物质溶于含0.1%三氟乙酸(TFA)的水(pH1.9)中,在Detla-Pak C18柱(Waters,Milford,MA,孔径300埃,粒径5微米(3.9×150mm))上纯化多肽。用0-60%稀释缓冲液(含0.1%TFA的乙腈)线性梯度将多肽洗脱下柱。流速为0.75毫升/分钟,在214nm下用HPLC监测洗脱液。收集含有洗脱多肽的组分,以最大程度地纯化各份样品。获得约200的纯化多肽。
然后筛选纯化多肽在PBMC制备物中诱导T-细胞增殖的能力。将已知PPD皮肤试验测试呈阳性且T细胞显示对PPD以及对MTB粗制可溶性蛋白起增殖反应的献血员的PBMC培养在含有RPMI 1640的培养基中,该培养基中添加了10%合并的人血清和50微克/毫升庆大霉素。以0.5至10微克/毫升的浓度加入纯化的多肽,一式两份。在96孔圆底平板中200微升的体积内培养6天后,从各孔中取出50微升培养基如下所述测定IFN-γ水平。然后在平板每孔内脉冲加入1μCi氚化的胸苷再培育18小时,收获并用气相闪烁计数器测定氚的摄入量。在两份样品中均导致增殖比培养在单用培养基中所见的细胞增殖大3倍的级分被认为呈阳性。
用酶联免疫吸附试验(ELISA)测定IFN-γ。在室温下用含有针对人IFN-γ的小鼠单克隆抗体(Chemicon)包被ELISA测试板4小时。然后在室温下用含有5%(W/V)脱脂奶粉的PBS封闭诸孔1小时。再用PBS/0.2%吐温-20洗板6次,使ELISA板中以培养基中作1∶2稀释的样品室温培育过夜。然后再次洗涤测试板,在每个孔内加入1∶3000稀释于PBS/10%正常山羊血清中的多克隆家兔抗-人IFN-γ血清。然后,室温培育测试板两小时,洗涤,加入以PBS/5%脱脂奶粉1∶2000稀释的辣根过氧化物酶偶联的抗家兔IgG(Jackson Labs.)。再室温培育2小时后,洗涤平板并加入TMB底物。20分钟后用1N硫酸终止反应。用570nm作为参照波长,在450nm下测定光密度。在两份样品中导致OD比培养在单用培养基中的平均OD大两倍加上3个标准差的级分被认为呈阳性。
为了进行测序,将多肽分别干燥在BiobreneTM(Perkin Elmer/Applied BiosystemsDivision,Foster City,CA)处理的玻璃纤维滤膜上。将带有多肽的滤膜载于PerkinElmer/Applied BioSystems Division Procise 492蛋白质测序仪上。用传统的Edman化学试剂从氨基端测定多肽的序列。通过比较PTH氨基酸衍生物对合适的PTH衍生物标准品的滞留时间,确定每个多肽的氨基酸序列。
采用上述程序,分离出具有下列N-端序列的抗原:
(a)Asp-Pro-Val-Asp-Ala-Val-Ile-Asn-Thr-Thr-Xaa-Asn-Tyr-Gly-
Gln-Val-Val-Ala-Ala-Leu(SEQ ID NO:54);
(b)Ala-Val-Glu-Ser-Gly-Met-Leu-Ala-Leu-Gly-Thr-Pro-Ala-Pro-
Ser(SEQ ID NO:55);
(c)Ala-Ala-Met-Lys-Pro-Arg-Thr-Gly-Asp-Gly-Pro-Leu-Glu-Ala-
Ala-Lys-Glu-Gly-Arg(SEQ ID NO:56);
(d)Tyr-Tyr-Trp-Cys-Pro-Gly-Gln-Pro-Phe-Asp-Pro-Ala-Trp-Gly-
Pro(SEQ ID NO:57);
(e)Asp-Ile-Gly-Ser-Glu-Ser-Thr-Glu-Asp-Gln-Gln-Xaa-Ala-Val
(SEQ ID NO:58);
(f)Ala-Glu-Glu-Ser-Ile-Ser-Thr-Xaa-Glu-Xaa-Ile-Val-Pro(SEQ ID
NO:59);
(g)Asp-Pro-Glu-Pro-Ala-Pro-Pro-Val-Pro-Thr-Ala-Ala-Ala-Ala-
Pro-Pro-Ala(SEQ ID NO:60);和
(h)Ala-Pro-Lys-Thr-Tyr-Xaa-Glu-Glu-Leu-Lys-Gly-Thr-Asp-Thr-
Gly(SEQ ID NO:61);
其中Xaa可以是任何氨基酸。
除了上述程序外,用微孔HPLC纯化步骤分离其它的抗原。具体地说,在PerkinElmer/Applied Biosystems Division Model 172 HPLC中,在孔径7微米、柱体积1mm×100mm的Aquapore C18柱(Perkin Elmer/Applied Biosystems Division,Foster City,CA)上纯化含有前述层析纯化步骤得到的抗原混合物每级分20微升。用含1%微量乙腈(含0.05%TFA)线性梯度的水(0.05%TFA)以80微升/分钟的流速从柱上洗脱级分。在250nm处监测洗脱物。将原始的级分分离成4个主峰以及其它较小的组分,获得一种多肽,其显示出分子量为12.054Kd(通过质谱法),具有下列N-端序列:
(i)Asp-Pro-Ala-Ser-Ala-Pro-Asp-Val-Pro-Thr-Ala-Ala-Gln-Gln-
Thr-Ser-Leu-Leu-Asn-Asn-Leu-Ala-Asp-Pro-Asp-Val-Ser-Phe-
Ala-Asp(SEQ ID NO:62)。
用上述试验显示出该多肽在PBMC制备物中诱导增殖和IFN-γ产生。
如下所述从结核分枝杆菌培养渗滤液中分离出其它可溶性抗原。如上所述制备结核分枝杆菌培养渗滤液。在用Bis-Tris丙烷缓冲液(pH5.5)透析后,用阴离子交换层析在经Bis-Tris丙烷缓冲液(pH5.5)平衡的Poros QE柱4.6×100mm(PerseptiveBiosystems)上进行分级。在上述缓冲液系统中用0-1.5M NaCl线性梯度以10毫升/分钟的流速洗脱多肽。在214nm下监测柱洗脱液。
合并从离子交换柱上洗脱的级分,用Poros R2柱4.6×100mm(PerseptiveBiosystems)进行反相层析。用0-100%乙腈(0.1%TFA)的线性梯度以5毫升/分钟的流速从柱上洗脱多肽。在214nm下监测洗脱液。
将含有洗脱多肽的级分冻干,重悬于80微升0.1%TFA水溶液中,在Vydac C4柱4.6×150nm(Western Analytical,Temecula,CA)上用0-100%乙腈(0.1%TFA)的线性梯度以2毫升/分钟的流速进行进一步的反相层析。在214nm下监测洗脱液。
具有生物活性的级分被分离为一个主峰和其它较小的组分。将该峰Western印迹到PVDF膜上,结果显示分子量为14Kd,20Kd和26Kd的三条主要条带。测得这些多肽分别具有下列N-端序列:
(j)Xaa-Asp-Ser-Glu-Lys-Ser-Ala-Thr-Ile-Lys-Val-Thr-Asp-Ala-
Ser,(SEQ ID NO:129)
(k)Ala-Gly-Asp-Thr-Xaa-Ile-Tyr-Ile-Val-Gly-Asn-Leu-Thr-Ala-
Asp;(SEQ ID NO:130)和
(1)Ala-Pro-Glu-Ser-Gly-Ala-Gly-Leu-Gly-Gly-Thr-Val-Gla-Ala-
Gly;(SEQ ID NO:131)利用上述试验,这些多肽显示出能在PBMC制备物中诱导增殖和IFN-γ产生。图1A和B分别显示了用第一和第二献血员的PBMC制备物所做的这些试验结果。
用32p末端标记的对应于N-端序列并含有结核分枝杆菌密码子偏性的简并寡核苷酸,筛选结核分枝杆菌基因组文库,获得编码上述(a),(c),(d)和(g)的抗原的DNA序列。用对应于上述抗原(a)的探针进行的筛选鉴定出具有序列SEQ ID NO:96的克隆。SEQ ID NO:96编码的多肽提供在SEQ ID NO:97中。用对应于上述抗原(g)的探针进行的筛选鉴定出具有序列SEQ ID NO:52的克隆。SEQ ID NO:52编码的多肽提供在SEQ ID NO:53中。用对应于上述抗原(d)的探针进行的筛选鉴定出具有序列SEQ ID NO:24的克隆,用对应于抗原(c)的探针进行的筛选鉴定出具有序列SEQ IDNO:25的克隆。
用DNA STAR系统对上述氨基酸序列和基因库中已知的氨基酸序列进行比较。检索的数据库含有大约173000种蛋白质,它是Swiss,PIR数据库以及翻译的蛋白质序列(87版)的组合。没有检测到抗原(a)-(h)和(l)的氨基酸序列的明显同源物。
发现抗原(i)的氨基酸序列与麻风分枝杆菌(M.leprae)的一个序列同源。用从GENEBANK获得的序列从基因组DNA扩增全长麻风分枝杆菌序列。然后用该序列筛选结核分枝杆菌文库,获得结核分枝杆菌同系物的全长拷贝(SEQ ID NO:94)。
发现抗原(j)的氨基酸序列与一个DNA序列翻译的已知结核分枝杆菌蛋白同源。据发明者所知,该蛋白以前未显示具有T-细胞刺激活性。发现抗原(k)的氨基酸序列与麻风分枝杆菌的一个序列有关。
在上述增殖和IFN-γ试验中,用三个PPD阳性献血员,上文提供的代表性抗原的结果显示在表1中:
表1
PBMC增殖和IFN-γ试验的结果
序列 | 增殖 | IFN-γ |
(a) | + | + |
(c) | +++ | +++ |
(d) | ++ | ++ |
(g) | +++ | +++ |
(h) | +++ | +++ |
在表1中,产生刺激指数(SI)在2和4(与单独培养在培养基中的细胞相比)之间的反应评为+,SI为4-8或在1微克或更低浓度下SI为2-4评为++,SI大于8评为+++。发现序列(i)的抗原在增殖和IFN-γ试验中对于一个献血员有高的SI(+++),对于其它两个献血员有较低的SI(++和+)。这些结果表明这些抗原能诱导增殖和/或干扰素-γ产生。
实施例2
用患者血清来分离结核分枝杆菌抗原
本实施例描述了用结核分枝杆菌感染的个体的血清筛选来从结核分枝杆菌裂解液中分离抗原。
将干燥的结核分枝杆菌H37Ra(Difco Laboratories)加入到2%NP40溶液中,交替匀浆并超声处理3次。使所得悬浮液在微量离心管中13000rpm离心,使上清液通过0.2微米针筒式滤器。使滤液与Macro Prep DEAE珠粒(BioRad,Hercules,CA)结合。用20毫摩尔Tris pH7.5彻底洗涤珠粒,用1M NaCl洗脱结合的蛋白质。用10毫摩尔Tris,pH7.5透析NaCl洗脱液过夜。用0.05毫克/毫升DNA酶和RNA酶室温下处理透析溶液30分钟,然后用0.5U/毫克α-D-甘露糖苷酶在pH4.5下室温处理3-4小时。在回复至pH7.5后,通过FPLC在Bio Scale-Q-20柱(BioRad)上对材料分级。将级分合并成9份合并物,在Centriprep 10(Amicon,Beverley,MA)中浓缩,用感染结核分枝杆菌患者的与本发明其它抗原没有免疫反应的合并血清作Western印迹筛选血清学活性。
使最具反应性的级分进行SDS-PAGE,并转移至PVDF。切下约85Kd的条带,产生下列序列:
(m)Xaa-Tyr-Ile-Ala-Tyr-Xaa-Thr-Thr-Ala-Gly-Ile-Val-Pro-Gly-Lys-
Ile-Asn-Val-His-Leu-Val;(SEQ ID NO:132),其中Xaa可以是任何氨基酸。
将该序列与上述基因库中的那些序列比较,结果显示与已知序列没有明显同源性。
用对应于SEQ ID NO:137N-端序列的标记的简并寡核苷酸筛选基因组结核分枝杆菌Erdman菌株文库,获得编码上述(m)抗原的DNA序列。鉴定出具有DNA序列SEQ ID NO:198的一个克隆。发现该序列编码SEQ ID NO:199的氨基酸序列。将这些序列与基因库中的那些序列比较,结果发现与以前在结核分枝杆菌以及牛分枝杆菌中鉴定的序列有一些相似。
实施例3
编码结核分枝杆菌抗原的DNA序列的制备
本实施例描述了用感染了结核分枝杆菌的患者获得的血清或针对结核分枝杆菌抗原产生的抗血清筛选结核分枝杆菌表达文库,制备编码结核分枝杆菌抗原的DNA序列。
A.用针对结核分枝杆菌上清液的家兔抗血清制备结核分枝杆菌可溶性抗原
从结核分枝杆菌菌株H37Ra分离基因组DNA。随机切割该DNA,并用于以λZAP表达系统(Stratagene,La Jolla,CA)构建表达文库。用结核分枝杆菌培养物的浓缩上清液免疫家兔,产生针对结核分枝杆菌菌株H37Ra,H37Rv和Erdman的分泌性蛋白的家兔抗血清。具体地说,首先用总体积为2毫升(含有100微克胞壁酰二肽(Calbiochem,La Jolla,CA)和1毫升不完全Freund佐剂)的200微克蛋白抗原皮下免疫家兔。4周后,用含100微克抗原的不完全Freund佐剂对家兔作皮下强化免疫。最后,4周后用50微克蛋白抗原对家兔作静脉内免疫。如Sambrook等人《分子克隆实验指南》,Cold Spring Harbor Laboratory,Cold Spring Harbor,NY,1989中描述的那样,用抗血清筛选表达文库。纯化表达免疫反应性抗原的噬菌体噬斑。挽救噬斑的噬菌粒,推导出结核分枝杆菌克隆的核苷酸序列。
纯化得到32个克隆。其中有25个代表的序列以前未曾在结核分枝杆菌中得到鉴定。如Skeiky等人,J.Exp.Med.181:1527-1537,1995中描述的那样,用IPTG诱导蛋白质,并通过凝胶洗脱纯化。在该筛选中鉴定的DNA分子的代表性部分序列提供在SEQ ID NO:1-25中。对应的预计氨基酸序列显示在SEQ ID NO:64-88中。
在用上述数据库比较这些序列与基因库中已知序列后,发现后文称为TbRA2A,TbRA16,TbRA18和TbRA29的克隆(SEQ ID NO:77,69,71,76)显示出与以前鉴定的麻风分枝杆菌(但不是结核分枝杆菌)中的序列有一些同源性。发现TbRA2A是一种脂蛋白,有一个6残基脂化序列与一疏水性分泌性序列毗邻。TbRA11,TbRA26,TbRA28和TbDPEP(SEQ ID NO:66,74,75,55)在结核分枝杆菌中已有鉴定。发现与TbRA1,TbRA3,TbRA4,TbRA9,TbRA10,TbRA13,TbRA17,TbRA19,TbRA29,TbRA32,TbRA36以及重叠的克隆TbRA35和TbRA12(分别为SEQ ID NO:64,78,82,65,68,76,72,76,79,81,80,67)没有明显的同源性。克隆TbRa24与克隆TbRa29重叠。
B.用肺结核以及胸膜结核病患者的血清鉴定编码结核分枝杆菌抗原的DNA序列
用活动性结核病患者的合并血清筛选上述基因组DNA文库以及附加的H37Rv文库。为了制备H37Rv文库,分离结核分枝杆菌菌株H37Rv基因组DNA,进行部分Sau3A消化,用于以λZAP表达盒(Stratagene,La Lolla,Ca)构建表达文库。在表达筛选中,采用三种不同的合并血清,每种所含血清获自三位患有活动性肺结核或胸膜结核病的个体。合并血清命名为TbL,TbM和TbH,指在ELISA和免疫印迹中与H37Ra裂解液的相对反应性(即,TbL=低反应性,TbM=中等反应性,TbH=高反应性)。另外还采用了来自7位活动性肺结核患者的第四份合并血清。所有血清与重组38kD结核分枝杆菌H37Ra磷酸结合蛋白的反应性均不增加。
如Sambrook等人《分子克隆实验指南》,Cold Spring Harbor Laboratory,ColdSpring Harbor,NY,1989中所述的那样,用大肠杆菌裂解液预先吸附所有合并血清,并用来筛选H37Ra和H37Rv表达文库。纯化得到表达免疫反应性抗原的噬菌体噬斑。挽救噬斑的噬菌粒,推导出结核分枝杆菌克隆的核苷酸序列。
纯化获得了32个克隆。其中31个克隆显示出的序列以前未曾在人结核分枝杆菌中鉴定过。鉴定出的DNA分子的代表性序列提供在SEQ ID NO:26-51以及100中。其中,TbH-8-2(SEQ ID NO:100)是TbH-8的部分克隆,TbH-4(SEQ ID NO:43)和TbH-4-FWD(SEQ ID NO:44)是来自同一克隆的不连续的序列。后文鉴定为Tb38-1,TbH-4,TbH-8,TbH-9和TbH-12的抗原的氨基酸序列显示在SEQ ID NO:89-93中。用上述鉴定的数据库比较这些序列与基因库中已知的序列,结果显示与TbH-4,TbH-8,TbH-9和TbM-3没有明显的同源性,虽然发现与TbH-9有弱的同源性。发现TbH-12与以前在副结核分枝杆菌(M.paratuberculosis)(登录号S28515)中鉴定的34kD抗原性蛋白同源。发现Tb38-1位于以前在牛分枝杆菌(登录号U34848)和结核分枝杆菌(Sorensen等人,Infec.Immun.63:1710-1717,1995)中鉴定的抗原ESAT-6开放读框上游34碱基对处。
用衍生自Tb38-1和TbH-9(均从H37Ra文库分离出)的探针鉴定H37Ra文库中的克隆。Tb38-1与Tb38-1F2,Tb38-1F3,Tb38-1F5以及Tb38-1F6(SEQ ID NO:107,108,111,113以及114)杂交。(SEQ ID NO:107和108是克隆Tb38-1F2的不连续的序列。)推导出Tb38-1F2中的两个开放读框;一个对应于Tb37FL(SEQ ID NO:109),第二个部分序列可能与Tb38-1同源,称为Tb38-1N(SEQ ID NO:110)。Tb38-1F3的推导氨基酸序列显示在SEQ ID NO:112中。TbH-9探针在H37Rv文库中鉴定出三个克隆:TbH-9-FL(SEQ ID NO:101),它可能与TbH-9(R37Ra)同源,TbH-9-1(SEQ ID NO:103),以及TbH-8-2(SEQ ID NO:105)是TbH-8的部分克隆。这三个克隆的推导氨基酸序列显示在SEQ ID NO:102、104和106中。
如上所述,进一步筛选结核分枝杆菌基因组DNA文库,导致发现另10个反应性克隆,代表了7种不同的基因。这些基因中的一个经鉴定是上文讨论的38Kd抗原,一个经确定与以前表明存在于结核分枝杆菌中的14Kdα晶体蛋白热休克蛋白相同,第三个经确定与上述抗原TbH-8相同。其余5个克隆(后文称TbH-29,TbH-30,TbH-32和TbH-33)的确定的DNA序列分别提供在SEQ ID NO:133-136中,其对应的预计的氨基酸序列分别提供在SEQ ID NO:137-140中。将这些抗原的DNA和氨基酸序列与上述基因库中的那些序列比较。发现与TbH-29的5'端(它含有反应性开放读框)没有同源性,但是发现TbH-29的3'端与结核分枝杆菌粘粒Y227相同。发现TbH-32和TbH-33分别与以前鉴定的结核分枝杆菌插入元件IS6110以及结核分枝杆菌粘粒Y50相同。发现与TbH-30没有明显的同源性。
如Sambrook等人(同上)所述的那样,用来自该附加筛选的阳性噬菌粒感染大肠杆菌XL-1 Blue MRF'。通过加入IPTG,实现了重组蛋白的诱导。使诱导的和未经诱导的裂解液进行SDS-PAGE,一式两份,并转移到硝酸纤维素膜上。使滤膜与能与TbH反应的人结核分枝杆菌血清(1∶200稀释度)以及能与lacZ的N端4Kd部分反应的家兔血清(1∶200或1∶250稀释度)反应。室温培育血清2小时。加入125Ⅰ标记的蛋白质A,随后使膜曝光16小时至11天的不同时间,检测结合的抗体。免疫印迹的结果总结在表2中。
表2
抗原 人结核分枝杆菌血清 抗-lacZ血清
TbH-29 45Kd 45Kd
TbH-30 没有反应性 29Kd
TbH-32 12Kd 12Kd
TbH-33 16Kd 16Kd
重组人结核分枝杆菌抗原与人结核分枝杆菌血清以及抗lacZ血清的阳性反应表明,人结核分枝杆菌血清的反应性针对融合蛋白。对抗lacZ血清有反应性但对人结核分枝杆菌血清没有反应性的抗原,可能是人结核分枝杆菌血清识别构型性表位的结果,或抗原-抗体结合动力学可能这样的情况,即免疫印迹中2小时的血清接触不够充分。
进行了研究以确定抗原TbH-9和Tb38-1是否代表细胞蛋白还是分泌到结核分枝杆菌培养基内。在第一个研究中,用基本上如实施例3A所述的程序,产生的家兔血清针对下列蛋白:(A)结核分枝杆菌的分泌性蛋白,(B)已知的分泌性重组结核分枝杆菌抗原85b,(C)重组Tb38-1和(D)重组TbH-9。在变性凝胶上分辨了结核分枝杆菌总裂解液、结核分枝杆菌培养物浓缩上清液以及重组抗原85b,TbH-9和Tb38-1,固定在硝酸纤维素膜上,用上述家兔血清探测一式两份的印迹。
图2A-D中分别显示了用对照血清(面板Ⅰ)、针对分泌性蛋白、重组85b、重组Tb38-1以及重组TbH-9的抗血清(面板Ⅱ)的该分析结果,其中泳道编号如下:1)分子量蛋白标准品;2)5微克结核分枝杆菌裂解液;3)5微克分泌性蛋白;4)50ng重组Tb38-1;5)50ng重组TbH-9和6)50ng重组85b。重组抗原经基因工程加上了6个末端组氨酸残基,因此预计迁移时的移动比天然蛋白约大1kD。在图2D中,重组TbH-9缺少全长42kD抗原的大约10kD,因此裂解液泳道中免疫反应性天然TbH-9抗原的大小明显不同(用箭头表示)。这些结果证明Tb38-1和TbH-9是胞内抗原,并非由结核分枝杆菌主动分泌的。
通过测定TbH-9-特异性人T细胞克隆与重组TbH-9、分泌性结核分枝杆菌蛋白和PPD的反应性,确证了TbH-9是胞内抗原的发现。从健康PPD阳性献血员的PBMC了产生TbH-9-特异性T细胞克隆(命名为131TbH-9)。通过如实施例1所述的那样测定氚化胸苷的摄入,确定了131TbH-9对分泌性蛋白、重组TbH-9以及对照结核分枝杆菌抗原TbRa11的增殖反应。如图3A所示,克隆131TbH-9对TbH-9有特异性反应,说明TbH-9并非结核分枝杆菌分泌性蛋白的重要组分。图3B显示了从健康PPD阳性献血员的PBMC制得的第二种TbH-9特异性T细胞克隆(称为PPD 800-10)在用分泌性蛋白、PPD或重组TbH-9刺激T细胞克隆后的IFN-γ产生情况。这些结果进一步确证TbH-9不是结核分枝杆菌分泌的。
C.用肺外结核病患者的血清鉴定编码结核分枝杆菌抗原的DNA序列
从结核分枝杆菌Erdman菌株分离出基因组DNA,随机剪切并用于以λZAP表达系统(Stratagene,La Jolla,CA)来构建表达文库。如实施例3B所述的那样,所得文库用肺外结核病个体的合并血清来筛选,第二抗体是偶联了碱性磷酸酶的山羊抗人IgG+A+M(H+L)。
纯化获得18个克隆。发现其中4个克隆(后称XP14、XP24、XP31和XP32)与己知的序列有一些相同性。测得的XP14、XP24和XP31的DNA序列分别提供在SEQID NO:151-153中,XP32的5'和3'DNA序列分别在SEQ ID NO:154和155中。XP14的预计氨基酸序列提供在SEQ ID NO:156中。发现XP14的反向互补序列编码了SEQID NO:157中的氨基酸序列。
将其余14个克隆(后称XP1-XP6,XP17-19,XP22,XP25,XP27,XP30和XP36)的序列与上述基因库中的那些序列比较,结果表明,除了发现XP2和XP6的3'端与已知的结核分枝杆菌粘粒有一些同源性以外,没有同源性。XP27以及XP36的DNA序列分别显示在SEQ ID NO:158和159中,XP4,XP5,XP17和XP30的5'序列分别显示在SEQ ID NO:160-163中,XP2、XP3、XP6、XP18、XP19、XP22和XP25的5'和3'序列分别显示在SEQ ID NO:164和165;166和167;168和169;170和171;172和173;174和175;176和177中。发现XP1与上述TbH4的DNA序列重叠。TbH4-XP1的全长DNA序列提供在SEQ ID NO:178中。发现该DNA序列含有的开放读框编码了SEQ ID NO:179所示的氨基酸序列。发现TbH4-XP1的反向互补序列含有的开放读框编码SEQ ID NO:180的氨基酸序列。发现XP36的DNA序列含有的两个开放读框编码了SEQ ID NO:181和182所示的氨基酸序列,反向互补序列含有的开放读框编码了SEQ ID NO:183所示的氨基酸序列。
如上实施例3B所述的那样,制备重组XP1蛋白,用金属离子亲和层析柱来纯化。发现重组XP1在分离自结核分枝杆菌-免疫献血员的T细胞中刺激细胞增殖和IFN-γ产生。
D.用结核病患者的裂解液阳性合并血清鉴定编码结核分枝杆菌抗原的DNA序列
如下文实施例6所述,从结核分枝杆菌Erdman菌株分离得到基因组DNA,随机剪切,并用于以λ筛选表达系统(Novagen,Madison,WI)来构建表达文库。如实施例3B所述,用从结核分枝杆菌感染患者获得的且显示与结核分枝杆菌裂解液反应(但不与以前表达的蛋白质38kD、Tb38-1、TbRa3、TbH4、DPEP和TbRa11反应)的合并血清来筛选表达文库,第二抗体是偶联碱性磷酸酶的山羊抗人IgG+A+M(H+L)。
纯化得到27个克隆。比较这些克隆测得的cDNA序列,结果表明与10个克隆(后称LSER-10、LSER-11、LSER-12、LSER-13、LSER-16、LSER-18、LSER-23、LSER-24、LSER-25和LSER-27)没有显著同源性。测得的LSER-10、LSER-11、LSER-12、LSER-13、LSER-16和LSER-25的5'cDNA序列分别提供在SEQ ID NO:237-242中,对应于LSER-10、LSER-12、LSER-13、LSER-16和LSER-25的预计氨基酸序列分别提供在SEQ ID NO:243-247中。测得的LSER-18、LSER-23、LSER-24和LSER-27的全长cDNA序列分别显示在SEQ ID NO:248-251中,其对应的预计氨基酸序列提供在SEQ ID NO:252-255中。发现其余17个克隆与以前在结核分枝杆菌中鉴定的未知序列相似。这些克隆中16个克隆(后称LSER-1、LSER-3、LSER-4、LSER-5、LSER-6、LSER-8、LSER-14、LSER-15、LSER-17、LSER-19、LSER-20、LSER-22、LSER-26、LSER-28、LSER-29和LSER-30)的测得的5'cDNA序列分别提供在SEQ IDNO:256-271中,对应于LSER-1、LSER-3、LSER-5、LSER-6、LSER-8、LSER-14、LSER-15、LSER-17、LSER-19、LSER-20、LSER-22、LSER-26、LSER-28、LSER-29和LSER-30的预计氨基酸序列分别提供在SEQ ID NO:272-286中。SEQ ID NO:287中提供了克隆LSER-9测得的全长cDNA序列。发现LSER-6的反向互补序列(SEQ IDNO:288)编码了SEQ ID NO:289的预计氨基酸序列。
E.用针对结核分枝杆菌分级蛋白产生的家兔抗血清制备结核分枝杆菌可溶性抗原
如实施例2所述制备结核分枝杆菌裂解液。所得物质用HPLC分级,用结核分枝杆菌感染患者的显示出与本发明其它抗原没有或有很少免疫反应性的合并血清通过Western印迹筛选级分的血清学活性。用实施例3A所述的方法针对最具反应性的级分产生家兔抗血清。用该抗血清来筛选如上所述制备的结核分枝杆菌Erdman菌株基因组DNA表达文库。纯化表达免疫反应性抗原的噬菌体噬斑。挽救噬斑的噬菌粒,测定结核分枝杆菌克隆的核苷酸序列。
纯化得到10个不同的克隆。其中发现一个是上述的TbRa35,一个是以前鉴定的结核分枝杆菌抗原HSP60。在其余的8个克隆中,发现6个克隆(后称RDIF2、RDIF5、RDIF8、RDIF10、RDIF11和RDIF12)与以前鉴定的结核分枝杆菌序列具有一定相似性。RDIF2、RDIF5、RDIF8、RDIF10和RDIF11测得的DNA序列分别提供在SEQ ID NO:184-188中,其对应的预计氨基酸序列分别提供在SEQ ID NO:189-193中。RDIF12的5'和3'DNA序列分别提供在SEQ ID NO:194和195中。发现与抗原RDIF-7没有显著的同源性。RDIF7测得的DNA序列以及预计的氨基酸序列分别提供在SEQ ID NO:196和197中。还分离出另一个克隆,称为RDIF6,然而,发现它与RDIF5相同。
如上所述制备重组的RDIF6、RDIF8、RDIF10和RDIF11。发现这些抗原在分离自结核分枝杆菌-免疫献血员的T细胞中刺激细胞增殖和IFN-γ产生。
实施例4
从结核菌素纯化的蛋白质衍生物纯化多肽并作特性分析
如下所述,从结核菌素纯化蛋白质衍生物(PPD)中分离得到结核分枝杆菌多肽。
PPD如公开的那样稍作改动来制备(Seibert,F.等人,″结核菌素纯化的蛋白质衍生物。大量标准品的制备和分析″The American Review of Tuberculosis 44:9-25,1941)。使结核分枝杆菌Rv菌株在转瓶中合成培养基中37℃生长6周。然后在水蒸汽中加热含有细菌生长的瓶至100℃3小时。用0.22μ滤膜对培养物作无菌过滤,用3kD截断膜将液相浓缩20倍。用50%硫酸铵溶液沉淀蛋白质一次,用25%硫酸铵溶液沉淀8次。得到的蛋白质(PPD)用C18柱(7.8×300mM;Waters,Milford,MA)在BiocadHPLC系统(Preseptive Biosystems,Framingham,MA)中通过反相液相层析(RP-HPLC)分级。用0-100%缓冲液(含0.1%TFA的乙腈)的线性梯度从柱上洗脱各级分。流速为10毫升/分钟,在214nm和280nm下监测洗脱液。
收集到6个级分,干燥,分别悬于PBS中,在感染结核分枝杆菌的豚鼠中测试6个级分诱导迟发型超敏(DTH)反应的情况。发现一个级分诱导了强的DTH反应,随后将此级分在Perkin Elmer/Applied Biosystems Division Model 172 HPLC中在微孔Vydac C18柱(目录号218TP5115)上通过RP-HPLC进一步分级。用5-100%缓冲液(含0.05%TFA的乙腈)的线性梯度以80微升/分钟的流速洗脱各级分。在215nm下监测洗脱液。收集到8个级分,测试其在感染了结核分枝杆菌的豚鼠中诱导DTH的情况。发现一个级分诱导可约16毫米硬块的强DTH反应。其它级分没有诱导出可检测的DTH。对该阳性级分再进行SDS-PAGE凝胶电泳,发现有大约12kD分子量的单个蛋白条带。
如上所述,用Perkin Elmer/Applied Biosystems Division Procise 492型蛋白测序仪从氨基端起对该多肽(后称DPPD)测序,发现N端序列如SEQ ID NO:124所示。如上所述将该序列与基因库中的已知序列比较,结果表明没有已知的同源物。分离DPPD的四个溴化氰片段,发现具有如SEQ ID NO:125-128所示的序列。随后搜寻由基因组搜寻委员会(Institute for Genomic Research)发布的结核分枝杆菌基因组数据库,结果表明DPPD部分氨基酸序列与结核分枝杆菌粘粒MTY21C12中的序列匹配。鉴定出336bp的一个开放读框。DPPD的全长DNA序列提供在SEQ ID NO:235中,其对应的全长氨基酸序列提供在SEQ ID NO:236中。
实施例5
用感染结核病的猴血清鉴定编码结核分枝杆菌抗原的DNA序列
从结核分枝杆菌Erdman菌株分离得到基因组DNA,随机剪切并用于以λZAP表达系统(Stratagene,La Jolla,CA)构建表达文库。从感染结核分枝杆菌Erdman菌株后18、33、51和56天的猕猴获得血清样品。合并这些样品,用实施例3C所述的步骤筛选结核分枝杆菌基因组DNA表达文库。
纯化得到20个克隆。SEQ ID NO:210-220中分别提供了称为MO-1、MO-2、MO-4、MO-8、MO-9、MO-26、MO-28、MO-29、MO-30、MO-34和MO-35的克隆所测得的5'DNA序列,其对应的预计氨基酸序列提供在SEQ ID NO:221-231中。克隆MO-10的全长DNA序列提供在SEQ ID NO:232中,其对应的预计氨基酸序列提供在SEQ ID NO:233中。克隆MO-27的3'DNA序列提供在SEQ ID NO:234中。
发现克隆MO-1、MO-30和MO-35与以前鉴定的未明结核分枝杆菌序列以及粘粒MTCI237有高度相关并表现出某些同源性。发现MO-2与结核分枝杆菌的天冬氨酸激酶有一些同源性。发现MO-3、MO-7和MO-27是相同的,并与MO-5高度相关。所有这四种克隆表现出与结核分枝杆菌热休克蛋白70有一些同源性。发现MO-27与结核分枝杆菌粘粒MTCY339有一些同源性。发现MO-4和MO-34与粘粒SCY21B4以及耻垢分枝杆菌整合宿主因子有一些同源性,还发现两者与以前鉴定的未知的结核分枝杆菌序列有一些同源性。发现MO-6与结核分枝杆菌热休克蛋白65有一些同源性。发现MO-8、MO-9、MO-10、MO-26和MO-29彼此高度相关,且与结核分枝杆菌二氢硫辛酰胺琥珀酰转移酶有一些同源性。发现MO-28、MO-31和MO-32是相同的,并显示出与以前鉴定的结核分枝杆菌蛋白质有一些同源性。发现MO-33与以前鉴定的14kDa结核分枝杆菌热休克蛋白有一些同源性。
用上述方案作进一步研究,导致分离得到另四个克隆,后称MO-12、MO-13、MO-19和MO-39。这些克隆所测得的5'cDNA序列分别提供在SEQ ID NO:290-293中,其对应的预计蛋白质序列分别提供在SEQ ID NO:294-297中。如上所述将这些序列与基因库中的那些序列比较,结果揭示与MO-39没有显著的同源性。发现MO-12、MO-13和MO-19与以前从结核分枝杆菌分离的未知序列有一些同源性。
实施例6
通过筛选新的表达文库分离编码结核分枝杆菌抗原的DNA序列
本实施例描述了通过用结核分枝杆菌感染患者的血清筛选新的表达文库来分离编码结核分枝杆菌抗原的DNA序列,该血清显示与一组重组结核分枝杆菌抗原TbRa11,TbRa3,Tb38-1,TbH4,TbF和38kD不反应。
将来自结核分枝杆菌Erdman菌株的基因组DNA随机剪切至平均大小为2kb,用Klenow聚合酶使末端变成平头,然后加入EcoRⅠ衔接物。随后将插入物连接入筛选噬菌体载体(Novagen,Madison,WI),用PhageMaker抽提物(Novagen)体外包装。如实施例3B所述的那样,用来自几个结核分枝杆菌献血员的已显示对一组以前鉴定的结核分枝杆菌抗原呈阴性(反应)的血清筛选得到的文库。
总共分离得到22个不同的克隆。相比,用同血一清筛选上述λZAP文库没有得到一个阳性命中物。发现一个克隆代表了上述的TbRa11。其余21个克隆中的19个克隆(后称Erdsn1,Erdsn2,Erdsn4-Erdsn10,Erdsn12-18,Erdsn21-Erdsn23以及Erdsn25)所测得的5'cDNA序列分别提供在SEQ ID NO:298-317中,Erdsn1,Erdsn2,Erdsn4,Erdsn5,Erdsn7-Erdsn10,Erdsn12-Erdsn18,Erdsn21-Erdsn23以及Erdsn25所测得的3'cDNA序列分别提供在SEQ ID NO:318-336中。克隆Erdsn24的全部cDNA插入序列提供在SEQ ID NO:337中。将测得的cDNA序列与基因库中的那些序列比较,结果表明与SEQ ID NO:304、311、313-315、317、319、324、326、329、331、333、335和337提供的序列没有显著同源性。发现SEQ ID NO:298-303、305-310、312、316、318、320-321、324-326、328、330、332、334和336的序列与以前在结核分枝杆菌中鉴定的未知序列有一些同源性。
实施例7
用质谱法分离可溶性结核分枝杆菌抗原
本实施例描述用质谱法鉴定可溶性结核分枝杆菌抗原。
在第一个方法中,用结核病感染个体的血清通过Western分析筛选结核分枝杆菌培养渗滤液。从银染凝胶上切下反应条带,用质谱法测定氨基酸序列。一个分离抗原所测得的氨基酸序列提供在SEQ ID NO:338中。将该序列与基因库中的那些序列比较,结果揭示与以前在结核分枝杆菌中鉴定的85b前体抗原同源。
在第二个方法中,研究了结核分枝杆菌培养上清液的高分子量区域。该区域可能含有可用于诊断结核分枝杆菌感染的免疫优势抗原。通过Western分析,两个已知的单克隆抗体IT24和IT57(购自Center for Disease Control,Atlanta,GA)显示出与此邻近的抗原有反应性,但是该抗原的身份仍然未知。另外,未知的高分子量蛋白质已被描述成含有HIV阳性个体中结核分枝杆菌感染的替代制造者(Jnl.Infect.Dis.,176:133-143,1997)。为了确定这些抗原的身份,用抗体IT57和IT42进行了二维凝胶电泳和二维Western分析。鉴定出高分子量区域的5个蛋白质斑点,单独切下,酶促消化并进行质谱分析。
这些斑点中的三个(称为斑点1、2和4)所测得的氨基酸序列分别提供在SEQ IDNO:339,340-341和342中。将这些序列与基因库中的那些序列比较,结果表明斑点1是以前鉴定的Pck-1(一种磷酸烯醇丙酮酸激酶)。分离自斑点2的两个序列经测定来自两个DNAks,它们以前在结核分枝杆菌中鉴定为热休克蛋白。斑点4被确定是以前鉴定的结核分枝杆菌蛋白质Kat G。就发明人所知,Pck-1和两个DNAks以前均未显示可用来诊断结核分枝杆菌感染。
实施例8
合成性多肽的合成
可通过FMOC化学试剂用HPTU(O-苯并三唑-N,N,N',N'-四甲基脲六氟磷酸)激活在Millipore 9050肽合成仪上合成多肽。可将Gly-Cys-Gly序列与肽的氨基端连接,提供一种对肽的偶联或标记方法。用下列切割混合物可将肽从固相载体上切割下来:三氟乙酸∶乙二硫醇∶苯硫基甲烷∶水∶苯酚(40∶1∶2∶2∶3)。切割2小时后,可将肽沉淀在冷的甲基叔丁基醚中。然后将肽沉淀溶于含有0.1%三氟乙酸(TFA)的水中,冻干,然后用C18反相HPLC纯化。可用含0-60%梯度乙腈(含有0.1%TFA)的水(含有0.1%TFA)洗脱肽。在将纯级分冻干后,可用电喷射质谱法和氨基酸分析对肽作特性分析。
用该步骤合成了含有一个半TbM-1序列重复序列的TbM-1肽。TbM-1肽具有序列GCGDRSGGNLDQIRLRRDRSGGNL(SEQ ID NO:63)。
实施例9
代表性抗原在结核病血清学诊断中的用途
本实施例描述了几个代表性抗原的诊断性质。
在96孔板上进行试验,测试板上包被了稀释在50微升碳酸盐包被缓冲液(pH9.6)的200ng抗原。4℃下包被诸孔过夜(37℃2小时)。然后除去孔中物质,用200微升PBS/1%BSA封闭诸孔2小时。封闭步骤后,用PBS/0.1%吐温20TM洗孔5次。然后在各孔中加入50微升1∶100稀释在PBS/0.1%吐温20TM/0.1%BSA中的血清,室温培育30分钟。然后用PBS/0.1%吐温20TM再洗板5次。
然后将酶偶联物(辣根过氧化物酶-蛋白质A,Zymed,San Francisco,CA)1∶10000稀释在PBS/0.1%吐温20TM/0.1%BSA中,在各孔中加入50微升稀释的偶联物,室温培育30分钟。培育后,用PBS/0.1%吐温20TM洗涤孔5次。加入100微升四甲基联苯胺过氧化物酶(TMB)底物(Kirkegaard and Perry Laboratories,Gaithersburg,MD),不稀释,培育大约15分钟。在每个孔内加入100微升1N硫酸终止反应,在450nm平板读数。
图4显示了用结核分枝杆菌阳性和阴性患者的血清以及实施例3方法A的两个重组抗原(TbRa3和TbRa9)的ELISA反应性。将这些抗原的反应性与分离自结核分枝杆菌菌株H37Ra(Difco,Detroit,MI)的细菌裂解液比较。在两种情况下,重组抗原均能区别阳性和阴性血清。根据从接受者-操作者曲线(receiver-operator curve)获得的截断值,TbRa3检测出87份阳性血清中的56份,TbRa9检测出165份阳性血清中的111份。
图5描述了用实施例3方法B分离的代表性抗原的ELISA反应性。将重组抗原TbH4,TbH12,Tb38-1和肽TbM-1的反应性(如实施例4所述)与Andersen和Hansen,Infect.Immun.57:2481-2488,1989中描述的38kD抗原的反应性比较。同样,测试的所有多肽均能区别阳性和阴性血清。根据从接受者-操作者曲线获得的截断值,TbH4检测出126份阳性血清中的67份,TbH12检测出125份阳性血清中的50份,38-1检测出101份阳性血清中的61份,TbM-1肽检测出30份阳性血清中的25份。
还检查了四种抗原(TbRa3,TbRa9,TbH4和TbH12)与一组结核分枝杆菌感染患者的血清(此血清在痰液的抗酸细菌染色中有不同的反应性(Smithwick和David,Tubercle 52:226,1971))的反应性,并与结核分枝杆菌裂解液以及38kD抗原的反应性比较。结果显示在下表3中:
表3
抗原与结核分枝杆菌患者血清的反应性
患者 | 抗酸痰液 | ELISA值 | |||||
裂解液 38kD TbRa9 TbH12 TbH4 TbRa3 | |||||||
Tb01B93I-2 | ++++ | 1.853 | 0.634 | 0.998 | 1.022 | 1.030 | 1.314 |
Tb01B93I-19 | ++++ | 2.657 | 2.322 | 0.608 | 0.837 | 1.857 | 2.335 |
Tb01B93I-8 | +++ | 2.703 | 0.527 | 0.492 | 0.281 | 0.501 | 2.002 |
Tb01B93I-10 | +++ | 1.665 | 1.301 | 0.685 | 0.216 | 0.448 | 0.458 |
患者 | 抗酸痰液 | ELISA值 | |||||
裂解液 38kD TbRa9 TbH12 bH4 TbRa3 | |||||||
Tb01B93I-11 | +++ | 2.817 | 0.697 | 0.509 | 0.301 | 0.173 | 2.608 |
Tb01B93I-15 | +++ | 1.28 | 0.283 | 0.808 | 0.218 | 1.537 | 0.811 |
Tb01B93I-16 | +++ | 2.908 | >3 | 0.899 | 0.441 | 0.593 | 1.080 |
Tb01B93I-25 | +++ | 0.395 | 0.131 | 0.335 | 0.211 | 0.107 | 0.948 |
Tb01B93I-87 | +++ | 2.653 | 2.432 | 2.282 | 0.977 | 1.221 | 0.857 |
Tb01B93I-89 | +++ | 1.912 | 2.370 | 2.436 | 0.876 | 0.520 | 0.952 |
Tb01B94I-108 | +++ | 1.639 | 0.341 | 0.797 | 0.368 | 0.654 | 0.798 |
Tb01B94I-201 | +++ | 1.721 | 0.419 | 0.661 | 0.137 | 0.064 | 0.692 |
Tb01B93I-88 | ++ | 1.939 | 1.269 | 2.519 | 1.381 | 0.214 | 0.530 |
Tb01B93I-92 | ++ | 2.355 | 2.329 | 2.78 | 0.685 | 0.997 | 2.527 |
Tb01B94I-109 | ++ | 0.993 | 0.620 | 0.574 | 0.441 | 0.5 | 2.558 |
Tb01B94I-210 | ++ | 2.777 | >3 | 0.393 | 0.367 | 1.004 | 1.315 |
Tb01B94I-224 | ++ | 2.913 | 0.476 | 0.251 | 1.297 | 1.990 | 0.256 |
Tb01B93I-9 | + | 2.649 | 0.278 | 0.210 | 0.140 | 0.181 | 1.586 |
Tb01B93I-14 | + | >3 | 1.538 | 0.282 | 0.291 | 0.549 | 2.880 |
Tb01B93I-21 | + | 2.645 | 0.739 | 2.499 | 0.783 | 0.536 | 1.770 |
Tb01B93I-22 | + | 0.714 | 0.451 | 2.082 | 0.285 | 0.269 | 1.159 |
Tb01B93I-31 | + | 0.956 | 0.490 | 1.019 | 0.812 | 0.176 | 1.293 |
Tb01B93I-32 | - | 2.261 | 0.786 | 0.668 | 0.273 | 0.535 | 0.405 |
Tb01B93I-52 | - | 0.658 | 0.114 | 0.434 | 0.330 | 0.273 | 1.140 |
Tb01B93I-99 | - | 2.118 | 0.584 | 1.62 | 0.119 | 0.977 | 0.729 |
Tb01B94I-130 | - | 1.349 | 0.224 | 0.86 | 0.282 | 0.383 | 2.146 |
Tb01B94I-131 | - | 0.685 | 0.324 | 1.173 | 0.059 | 0.118 | 1.431 |
AT4-0070 | 正常 | 0.072 | 0.043 | 0.092 | 0.071 | 0.040 | 0.039 |
AT4-0105 | 正常 | 0.397 | 0.121 | 0.118 | 0.103 | 0.078 | 0.390 |
3/15/94-1 | 正常 | 0.227 | 0.064 | 0.098 | 0.026 | 0.001 | 0.228 |
4/15/93-2 | 正常 | 0.114 | 0.240 | 0.071 | 0.034 | 0.041 | 0.264 |
5/26/94-4 | 正常 | 0.089 | 0.259 | 0.096 | 0.046 | 0.008 | 0.053 |
5/26/94-3 | 正常 | 0.139 | 0.093 | 0.085 | 0.019 | 0.067 | 0.01 |
根据从接受者-操作者曲线获得的截断值,TbRa3检测出27份阳性血清中的23份,TbRa9检测出27份中的22份,TbH4检测出27份中的18份,TbH12检测出27份的15份。如果合用,这四种抗原理论上的敏感性是27份中检出27份,这表明在结核分枝杆菌感染的血清学检测中,这些抗原应互相补充。另外,几种重组抗原检测出用38kD抗原未能检测出的阳性血清,这表明这些抗原可能与38kD抗原互补。
如上所述,用ELISA测定重组抗原TbRa11和显示对38kD抗原呈阴性的结核分枝杆菌患者血清以及PPD阳性和正常献血员血清的反应性。结果显示在图6中。结果表明TbRa11尽管对PPD阳性和正常献血员的血清呈阴性,却检测出对38kD抗原呈阴性的血清。在测试的13份38kD阴性血清中,有9份对TbRa11呈阳性,这表明该抗原可能与38kD抗原阴性血清的一个亚组反应。相反,在一组与TbRa1l反应的38kD阳性血清中,TbRa11的平均OD450低于38kD抗原的平均OD450。此数据表明TbRa11活性和38kD阳性之间成相反的关系。
在间接ELISA试验中测试了抗原TbRa2A,试验开始时用50微升稀释度为1∶100的血清在室温下反应30分钟,然后用PBS吐温洗涤,并与稀释度为1∶10000的生物素化蛋白质A(Zymed,San Francisco,CA)培育30分钟。洗涤后,加入1∶10000稀释的链霉亲和素辣根过氧化物酶(Zymed),培育混合物30分钟。洗涤后,如上所述用TMB底物使试验显色。表4中显示了TbRa2A与结核分枝杆菌患者以及正常献血员的血清的反应性。TbRa2A与结核分枝杆菌患者血清的反应性的平均值为0.444,标准偏差为0.309。与正常献血员的血清的反应性平均值为0.109,标准偏差为0.029。38kD阴性血清的测试(图7)也表明TbRa2A抗原能检测出该范畴中的血清。
表4TBRA2A与结核分枝杆菌患者以及正常献血员的血清的反应性
血清ID | 状况 | OD450 |
Tb85 | 结核病 | 0.680 |
Tb86 | 结核病 | 0.450 |
Tb87 | 结核病 | 0.263 |
Tb88 | 结核病 | 0.275 |
Tb89 | 结核病 | 0.403 |
Tb91 | 结核病 | 0.393 |
Tb92 | 结核病 | 0.401 |
Tb93 | 结核病 | 0.232 |
Tb94 | 结核病 | 0.333 |
血清ID | 状况 | OD450 |
Tb95 | 结核病 | 0.435 |
Tb96 | 结核病 | 0.284 |
Tb97 | 结核病 | 0.320 |
Tb99 | 结核病 | 0.328 |
Tb100 | 结核病 | 0.817 |
Tb101 | 结核病 | 0.607 |
Tb102 | 结核病 | 0.191 |
Tb103 | 结核病 | 0.228 |
Tb107 | 结核病 | 0.324 |
Tb109 | 结核病 | 1.572 |
Tb112 | 结核病 | 0.338 |
DL4-0176 | 正常 | 0.036 |
AT4-0043 | 正常 | 0.126 |
AT4-0044 | 正常 | 0.130 |
AT4-0052 | 正常 | 0.135 |
AT4-0053 | 正常 | 0.133 |
AT4-0062 | 正常 | 0.128 |
AT4-0070 | 正常 | 0.088 |
AT4-0091 | 正常 | 0.108 |
AT4-0100 | 正常 | 0.106 |
AT4-0105 | 正常 | 0.108 |
AT4-0109 | 正常 | 0.105 |
如上所述,用ELISA测定重组抗原(g)(SEQ ID NO:60)与结核分枝杆菌患者以及正常献血员的血清的反应性。图8显示了全都和38kD抗原反应的四份结核分枝杆菌阳性血清以及四份供体血清滴定抗原(g)的结果。所有四份阳性血清都与抗原(g)反应。
如上所述,用间接ELISA测定了重组抗原TbH-29(SEQ ID NO:137)与结核分枝杆菌患者、PPD阳性献血员以及正常献血员的血清的反应性。结果显示在图9中。TbH检测出60份结核分枝杆菌血清中的30份,8份PPD阳性血清中的2份以及27份正常血清中的2份。
图10显示了用结核分枝杆菌患者的血清、正常献血员的血清以及结核分枝杆菌患者的合并血清对抗原TbH-33(SEQ ID NO:140)进行ELISA测试(直接和间接)的结果。证实结核分枝杆菌患者血清的OD450平均值高于正常献血员的血清,间接ELISA中的OD450平均值显著高于直接ELISA中的OD450平均值。图11是重组TbH-33与结核分枝杆菌患者血清以及正常献血员血清的反应性的滴定曲线,该曲线显示OD450随抗原浓度增加而增加。
如上所述,用ELISA测定重组抗原RDIF6、RDIF8和RDIF10(分别为SEQ ID NO:184-187)与结核分枝杆菌患者血清以及正常献血员血清的反应性。RDIF6检测出32份结核分枝杆菌血清中的6份,15份正常血清中的0份;RDIF8检测出32份结核分枝杆菌血清中的14份,15份正常血清中的0份;RDIF10检测出27份结核分枝杆菌血清中的4份,15份正常血清中的1份。另外,发现RDIF10检测出5份PPD阳性献血员血清中的0份。
在大肠杆菌中表达了上述实施例5中的抗原MO-1、MO-2、MO-4、MO-28和MO-29,并用六组氨酸尾纯化。如上所述用ELISA检测这些抗原与结核分枝杆菌阳性以及阴性血清的反应性。图12A-E中分别显示了在对四份结核分枝杆菌阳性血清和四份结核分枝杆菌阴性血清测试时在不同固相包被水平下MO-1、MO-2、MO-4、MO-28和MO-29的反应性。在HIV阳性/结核病(HIV/TB)阳性和肺外血清组中对克隆中的三个克隆MO-1、MO-2和MO-29作进一步测试。MO-1检测出20份肺外血清中的3份,38份HIV/TB血清中的2份。在同一血清组中,MO-2检测出20份中的2份,38份中的10份,MO-29检测出20份血清中的2份,38份中的8份。这三个克隆组合将检测出20份肺外血清中的4份,38份HIV/TB血清中的16份。另外,在17份已显示仅与结核分枝杆菌裂解液反应但不与本发明的38kD或其它抗原反应的血清中,MO-1检测出6份。
实施例10
结核分枝杆菌融合蛋白的制备和特性分析
如下制备含有TbRa3,38kD抗原和Tb38-1的融合蛋白。
用PCR修饰各个DNA构建物TbRa3,38kD和Tb38-1,以促进它们的融合级融合蛋白TbRa3-38kD-Tb38-1随后的表达。用TbRa3,38kD和Tb38-1的DNA进行PCR,采用的引物分别为PDM-64和PDM-65(SEQ ID NO:141和142),PDM-57和PDM-58(SEQ ID NO:143和144),PDM-69和PDM-60(SEQ ID NO:145-146)。在每一例中,用10微升10X Pfu缓冲液、2微升10mM dNTP、2微升10微摩尔的各种PCR引物、81.5微升水、1.5微升Pfu DNA聚合酶(Stratagene,La Jolla,CA)和1微升70ng/μl(对于TbRa3而言)或50ng/μl(对于38kD和Tb38-1而言)的DNA进行DNA扩增。对于TbRa3,进行94℃变性2分钟,然后进行40轮的96℃15秒和72℃1分钟,最后72℃4分钟。对于38kD,96℃变性2分钟,然后进行40轮的96℃30秒,68℃15秒和72℃3分钟,最后72℃4分钟。对于Tb38-1,94℃变性2分钟,然后进行10轮的96℃15秒、68℃15秒、72℃1.5分钟,30轮的96℃15秒、64℃15秒和72℃1.5分钟,最后72℃4分钟。
用NdeⅠ和EcoRⅠ消化TbRa3 PCR片段,用NdeⅠ和EcoRⅠ位点直接克隆到pT7^L2IL1载体中。用Sse8387I消化38kD PCR片段,用T4 DNA聚合酶处理变为平头,然后用EcoRⅠ消化,直接克隆到经StuⅠ和EcoRⅠ消化的pT7^L2Ra3-1载体中。用Eco47Ⅲ和EcoRⅠ消化38-1PCR片段,直接亚克隆到经相同酶消化的pT7^L2Ra3/38kD-17中。然后利用NdeⅠ和EcoRⅠ位点将整个融合物转移到pET28b中。通过DNA测序确认融合构建物。
将表达构建物转化到BLR plys S大肠杆菌(Novagen,Madison,WI)中在含卡那霉素(30微克/毫升)和氯霉素(34微克/毫升)的LB肉汤中生长过夜。用该培养物(12毫升)接种500毫升含相同抗生素的2XYT,在OD650为0.44时用IPTG诱导培养物至最终浓度为1.2毫摩尔。诱导4小时后,收获细菌并在20毫摩尔Tris(8.0)、100毫摩尔NaCl、0.1%DOC、20微克/毫升亮抑酶肽、20毫摩尔PMSF中超声处理,然后26000Xg离心。将得到的沉淀重悬于8M尿素、20毫摩尔Tris(8.0)、100毫摩尔NaCl中并结合到Pro-bond nickel树脂(Invitrogen,Carlsbad,CA)。用上述缓冲液洗柱数次,然后用咪唑梯度(50毫摩尔、100毫摩尔、500毫摩尔咪唑加入8M尿素、20毫摩尔Tris(8.0)、100毫摩尔氯化钠中)洗脱。然后用10毫摩尔Tris(8.0)透析含有感兴趣蛋白质的洗脱液。
SEQ ID NO:147和148分别提供了所得融合蛋白(后称TbRa3-38kD-Tb38-1)的DNA序列和氨基酸序列。
用类似于上文的方法制备含有两个抗原TbH-9和Tb38-1但无铰链序列(后称TbH9-Tb38-1)的融合蛋白。SEQ ID NO:151中提供了TbH9-Tb38-1融合蛋白的DNA序列。
如下制备含有TbRa3、抗原38kD、Tb38-1和DPEP的融合蛋白。
基本上如上所述的那样,用PCR修饰各个DNA构建物TbRa3、38kD和Tb38-1,并克隆到载体中,引物PDM-69(SEQ ID NO:145)和PDM-83(SEQ ID NO:200)用于Tb38-1A片段的扩增。Tb38-1A与Tb38-1不同之处在于编码区3'端有DraⅠ位点,该位点维持最终氨基酸完整同时产生了符合读框的平头限制性位点。然后用NdeⅠ和EcoRⅠ位点将TbRa3/38kD/Tb38-1A融合物转移到pET28b中。
用DPEP DNA进行PCR,采用引物PDM-84和PDM-85(分别为SEQ ID NO:201和202)以及1微升50ng/μl DNA。94℃变性2分钟,然后进行10轮96℃15秒、68℃15秒和72℃1.5分钟;30轮96℃15秒、64℃15秒和72℃1.5分钟;最后72℃4分钟。用EcoRⅠ和Eco72Ⅰ消化DPEP PCR片段,直接克隆到经DraⅠ和EcoRⅠ消化的pET28Ra3/38kD/38-1A构建物中。用DNA测序确认融合构建物的正确。如上所述制备重组蛋白。得到的融合蛋白(后称TbF-2)的DNA和氨基酸序列分别提供在SEQID NO:203和204中。
如下制备含有TbRa3、抗原38kD、Tb38-1和TbH4的融合蛋白。
用结核分枝杆菌基因组DNA来PCR全长TbH4(FL TbH4),采用引物PDM-157和PDM-160(分别为SEQ ID NO:343和344)和2微升100ng/μl的DNA。96℃变性2分钟,然后进行40轮的96℃30秒、61℃20秒和72℃5分钟;最后72℃退火10分钟。用EcoRⅠ和ScaⅠ(New England Biolabs)消化FL TbH4 PCR片段,直接克隆到上述经DraⅠ和EcoRⅠ消化的pET28Ra3/38kD/38-1A构建物中。用DNA测序确认融合构建物正确。如上所述制备重组蛋白。得到的融合蛋白(后称TbF-6)的DNA和氨基酸序列分别提供在SEQ ID NO:345和346中。
如下制备含有由接头隔开的抗原38kD和DPEP的融合蛋白。
用38kD DNA进行PCR,采用引物PDM-176和PDM-175(分别为SEQ ID NO:347和348)以及1微升110ng/μl的PET28Ra3/38kD/38-1/Ra2A-12 DNA。96℃变性2分钟,然后进行40轮的96℃30秒、71℃15秒和72℃5分钟40秒;最后72℃退火4分钟。两组引物PDM-171、PDM-172和PDM-173、PDM-174通过95℃加热2分钟然后以0.1℃/秒的速度降至25℃来退火。如上所述用DPEP DNA进行PCR。用EcoRⅠ(New England Biolabs)消化38kD片段,并克隆到经Eco72Ⅰ(Promega)和EcoRⅠ切割的修饰的pT7ΔL2载体中。修饰的pT7ΔL2构建物设计成在紧靠Eco72Ⅰ位点的5'有符合读框的MGHHHHHH氨基酸编码区。用Kpn2Ⅰ(Gibco,BRL)和PstⅠ(NewEngland Biolabs)消化构建物,克隆入退火的一组磷酸化的引物(PDM-171、PDM-172和PDM-173、PDM-174)。用EcoRⅠ和Eco72Ⅰ消化DPEP PCR片段,克隆到经过Eco47Ⅲ(New England Biolabs)和EcoRⅠ消化的此第二构建物中。用Panvera(Madison,WI)的连接试剂盒进行连接。得到的构建物用NdeⅠ(New England Biolabs)和EcoRⅠ消化,转移到修饰的pET28载体中。通过DNA测序确认融合构建物的正确。
基本上如上所述制备重组蛋白。所得融合蛋白(后称TbF-8)的DNA和氨基酸序列分别提供在SEQ ID NO:349和350中。
实施例11
结核分枝杆菌融合蛋白在结核病血清学诊断中的应用
用ELISA检查上述制得的融合蛋白TbRa3-38kD-Tb38-1在结核病感染的血清学诊断中的效果。
ELISA程序如上文实施例6所述,每孔包被200ng融合蛋白。从经ELISA或Western印迹分析已显示与三种抗原之一或其组合起反应的一组结核病患者中选出一组血清。该组血清能分辨融合蛋白的血清学反应性,以确定是否所有三个表位均与融合蛋白作用。如表5所示,用融合蛋白仅能检测到所有四份与TbRa3反应的血清。仅与Tb38-1反应的三份血清也是可检测的,与单单38kD反应的两份血清也是如此。根据阴性平均值+3个标准偏差的试验中的截断值,其余的15份血清均对融合蛋白呈阳性。该数据证明了融合蛋白中所有三个表位均有功能活性。
表5
三肽融合蛋白与结核分枝杆菌患者血清的反应性
血清ID | 状况 | 与各蛋白的ELISA和/或Western印迹反应性38kD Tb38-1 TbRa3 | 重组融合物OD450 | 重组融合物状况 | ||
01B93I-40 | 结核病 | - | - | + | 0.413 | + |
01B93I-41 | 结核病 | - | + | + | 0.392 | + |
01B93I-29 | 结核病 | + | - | + | 2.217 | + |
01B93I-109 | 结核病 | + | ± | + | 0.522 | + |
01B93I-132 | 结核病 | + | + | + | 0.937 | + |
5004 | 结核病 | ± | + | ± | 1.098 | + |
15004 | 结核病 | + | + | + | 2.077 | + |
39004 | 结核病 | + | + | + | 1.675 | + |
68004 | 结核病 | + | + | + | 2.388 | + |
99004 | 结核病 | - | + | ± | 0.607 | + |
107004 | 结核病 | - | + | + | 0.667 | + |
92004 | 结核病 | + | ± | ± | 1.070 | + |
97004 | 结核病 | + | - | ± | 1.152 | + |
血清ID | 状况 | 与各蛋白的ELISA和/或Western印迹反应性38kD Tb38-1 TbRa3 | 重组融合物OD450 | 重组融合物状况 | ||
118004 | 结核病 | + | - | ± | 2.694 | + |
173004 | 结核病 | + | + | + | 3.258 | + |
175004 | 结核病 | + | - | + | 2.514 | + |
274004 | 结核病 | - | - | + | 3.220 | + |
276004 | 结核病 | - | + | - | 2.991 | + |
282004 | 结核病 | + | - | - | 0.824 | + |
289004 | 结核病 | - | - | + | 0.848 | + |
308004 | 结核病 | - | + | - | 3.338 | + |
314004 | 结核病 | - | + | - | 1.362 | + |
317004 | 结核病 | + | - | - | 0.763 | + |
312004 | 结核病 | - | - | + | 1.079 | + |
D176 | PPD | - | - | - | 0.145 | - |
D162 | PPD | - | - | - | 0.073 | - |
D161 | PPD | - | - | - | 0.097 | - |
D27 | PPD | - | - | - | 0.082 | - |
A6-124 | 正常 | - | - | - | 0.053 | - |
A6-125 | 正常 | - | - | - | 0.087 | - |
A6-126 | 正常 | - | - | - | 0.346 | ± |
A6-127 | 正常 | - | - | - | 0.064 | - |
A6-128 | 正常 | - | - | - | 0.034 | - |
A6-129 | 正常 | - | - | - | 0.037 | - |
A6-130 | 正常 | - | - | - | 0.057 | - |
A6-131 | 正常 | - | - | - | 0.054 | - |
A6-132 | 正常 | - | - | - | 0.022 | - |
A6-133 | 正常 | - | - | - | 0.147 | - |
A6-134 | 正常 | - | - | - | 0.101 | - |
A6-135 | 正常 | - | - | - | 0.066 | - |
血清ID | 状况 | 与各蛋白的ELISA和/或Western印迹反应性38kD Tb38-1 TbRa3 | 重组融合物OD450 | 重组融合物状况 | ||
A6-136 | 正常 | - | - | - | 0.054 | - |
A6-137 | 正常 | - | - | - | 0.065 | - |
A6-138 | 正常 | - | - | - | 0.041 | - |
A6-139 | 正常 | - | - | - | 0.103 | - |
A6-140 | 正常 | - | - | - | 0.212 | - |
A6-141 | 正常 | - | - | - | 0.056 | - |
A6-142 | 正常 | - | - | - | 0.051 | - |
用上文描述的方法通过ELISA检查融合蛋白TbF-2与结核分枝杆菌感染患者血清的反应性。这些研究的结果(表6)证明所有四种抗原在融合蛋白中独立地起作用。
表6
TBF-2融合蛋白与TB(结核病)以及正常血清的反应性
血清ID | 状况 | TbFOD450 | 状况 | TbF-2OD450 | 状况 | ELISA反应性 | |||
38kD | TbRa3 | Tb38-1 | DPEP | ||||||
B391-40 | 结核病 | 0.57 | + | 0.321 | + | - | + | - | + |
B391-41 | 结核病 | 0.601 | + | 0.396 | + | + | + | + | - |
B391-109 | 结核病 | 0.494 | + | 0.404 | + | + | + | ± | - |
B391-132 | 结核病 | 1.502 | + | 1.292 | + | + | + | + | ±- |
5004 | 结核病 | 1.806 | + | 1.666 | + | ± | ± | + | - |
15004 | 结核病 | 2.862 | + | 2.468 | + | + | + | + | - |
39004 | 结核病 | 2.443 | + | 1.722 | + | + | + | + | - |
68004 | 结核病 | 2.871 | + | 2.575 | + | + | + | + | - |
99004 | 结核病 | 0.691 | + | 0.971 | + | - | ± | + | - |
107004 | 结核病 | 0.875 | + | 0.732 | + | - | ± | + | - |
92004 | 结核病 | 1.632 | + | 1.394 | + | + | ± | ± | + |
97004 | 结核病 | 1.491 | + | 1.979 | + | + | ± | - | - |
118004 | 结核病 | 3.182 | + | 3.045 | + | + | ± | - | - |
173004 | 结核病 | 3.644 | + | 3.578 | + | + | + | + | - |
血清ID | 状况 | TbFOD450 | 状况 | TbF-2OD450 | 状况 | ELISA反应性 | |||
175004 | 结核病 | 3.332 | + | 2.916 | + | + | + | - | + |
274004 | 结核病 | 3.696 | + | 3.716 | + | - | + | - | - |
276004 | 结核病 | 3.243 | + | 2.56 | + | - | + | + | - |
282004 | 结核病 | 1.249 | + | 1.234 | + | + | - | - | - |
289004 | 结核病 | 1.373 | + | 1.17 | + | - | - | - | - |
308004 | 结核病 | 3.708 | + | 3.355 | + | - | + | + | - |
314004 | 结核病 | 1.663 | + | 1.399 | + | - | - | + | - |
317004 | 结核病 | 1.163 | + | 0.92 | + | + | - | - | - |
312004 | 结核病 | 1.709 | + | 1.453 | + | - | - | - | - |
380004 | 结核病 | 0.238 | - | 0.461 | + | - | + | - | + |
451004 | 结核病 | 0.18 | - | 0.2 | - | - | ± | - | ± |
478004 | 结核病 | 0.188 | - | 0.469 | + | - | - | - | ± |
410004 | 结核病 | 0.384 | + | 2.392 | + | ± | - | - | + |
411004 | 结核病 | 0.306 | + | 0.874 | + | - | - | - | + |
421004 | 结核病 | 0.357 | + | 1.456 | + | - | + | - | + |
528004 | 结核病 | 0.047 | - | 0.196 | - | - | + | - | + |
A6-87 | 正常 | 0.094 | - | 0.063 | - | - | - | - | - |
A6-88 | 正常 | 0.214 | - | 0.19 | - | - | - | - | - |
A6-89 | 正常 | 0.248 | - | 0.125 | - | - | - | - | - |
A6-90 | 正常 | 0.179 | - | 0.206 | - | - | - | - | - |
A6-91 | 正常 | 0.135 | - | 0.151 | - | - | - | - | - |
A6-92 | 正常 | 0.064 | - | 0.097 | - | - | - | - | - |
A6-93 | 正常 | 0.072 | - | 0.098 | - | - | - | - | - |
A6-94 | 正常 | 0.072 | - | 0.064 | - | - | - | - | - |
A6-95 | 正常 | 0.125 | - | 0.159 | - | - | - | - | - |
A6-96 | 正常 | 0.121 | - | 0.12 | - | - | - | - | - |
截断值 | 0.284 | 0.266 |
本领域技术人员将会理解,融合蛋白中的各抗原次序可以改变,并且预计将提供相当的活性,只要每个表位仍然在功能上有效。另外,在构建融合蛋白时可以采用含有活性表位的蛋白质截短形式。
从前述内容可以理解,尽管本文出于说明的目的描述了本发明的具体实施方案,但仍可不脱离本发明精神和范围而作各种变动。
序列表(1)一般信息:
(ⅰ)申请人:Reed,Steven G.
Skeiky,Yasir A.W.
Dillon,Davin C.
Campos-Neto,Antonia
Houghton,Raymond
Vedvick,Thomas S.
Twardzik,Daniel R.
Lodes,Michael J.
Hendrickson,Ronald
(ⅱ)发明名称:诊断结核病的化合物和方法
(ⅲ)序列数目:350
(ⅳ)通信地址:
(A)地址:SEED and BERRY LLP
(B)街道:6300 Columbia Center,701 Fifth Avenue
(C)城市:Seattle
(D)州:Washington
(E)国家:USA
(F)ZIP:98104-7092
(ⅴ)计算机可读形式:
(A)记录介质类型:软盘
(B)计算机:IBM PC兼容型
(C)操作系统:PC-DOS/MS-DOS
(D)软件:PatentIn Release#1.0,Version#1.30
(ⅵ)本申请资料:
(A)申请号:
(B)申请日:1998年5月5日
(C)分类:
(ⅷ)律师/代理人信息:
(A)姓名:Maki,David J.
(B)登记号:31,392
(C)参考/案卷号:210121.417C9
(ⅸ)通讯信息:
(A)电话:(206)622-4900
(B)电传:(206)682-6031(2)SEQ ID NO:1的信息:
(ⅰ)序列特征:
(A)长度:766碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:1:CGAGGCACCG GTAGTTTGAA CCAAACGCAC AATCGACGGG CAAACGAACG GAAGAACACA 60ACCATGAAGA TGGTGAAATC GATCGCCGCA GGTCTGACCG CCGCGGCTGC AATCGGCGCC 120GCTGCGGCCG GTGTGACTTC GATCATGGCT GGCGGCCCGG TCGTATACCA GATGCAGCCG 180GTCGTCTTCG GCGCGCCACT GCCGTTGGAC CCGGCATCCG CCCCTGACGT CCCGACCGCC 240GCCCAGTTGA CCAGCCTGCT CAACAGCCTC GCCGATCCCA ACGTGTCGTT TGCGAACAAG 300GGCAGTCTGG TCGAGGGCGG CATCGGGGGC ACCGAGGCGC GCATCGCCGA CCACAAGCTG 360AAGAAGGCCG CCGAGCACGG GGATCTGCCG CTGTCGTTCA GCGTGACGAA CATCCAGCCG 420GCGGCCGCCG GTTCGGCCAC CGCCGACGTT TCCGTCTCGG GTCCGAAGCT CTCGTCGCCG 480GTCACGCAGA ACGTCACGTT CGTGAATCAA GGCGGCTGGA TGCTGTCACG CGCATCGGCG 540ATGGAGTTGC TGCAGGCCGC AGGGNAACTG ATTGGCGGGC CGGNTTCAGC CCGCTGTTCA 600GCTACGCCGC CCGCCTGGTG ACGCGTCCAT GTCGAACACT CGCGCGTGTA GCACGGTGCG 660GTNTGCGCAG GGNCGCACGC ACCGCCCGGT GCAAGCCGTC CTCGAGATAG GTGGTGNCTC 720GNCACCAGNG ANCACCCCCN NNTCGNCNNT TCTCGNTGNT GNATGA 766(2)SEQ ID NO:2的信息:
(ⅰ)序列特征:
(A)长度:752碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:2:ATGCATCACC ATCACCATCA CGATGAAGTC ACGGTAGAGA CGACCTCCGT CTTCCGCGCA 60GACTTCCTCA GCGAGCTGGA CGCTCCTGCG CAAGCGGGTA CGGAGAGCGC GGTCTCCGGG 120GTGGAAGGGC TCCCGCCGGG CTCGGCGTTG CTGGTAGTCA AACGAGGCCC CAACGCCGGG 180TCCCGGTTCC TACTCGACCA AGCCATCACG TCGGCTGGTC GGCATCCCGA CAGCGACATA 240TTTCTCGACG ACGTGACCGT GAGCCGTCGC CATGCTGAAT TCCGGTTGGA AAACAACGAA 300TTCAATGTCG TCGATGTCGG GAGTCTCAAC GGCACCTACG TCAACCGCGA GCCCGTGGAT 360TCGGCGGTGC TGGCGAACGG CGACGAGGTC CAGATCGGCA AGCTCCGGTT GGTGTTCTTG 420ACCGGACCCA AGCAAGGCGA GGATGACGGG AGTACCGGGG GCCCGTGAGC GCACCCGATA 480GCCCCGCGCT GGCCGGGATG TCGATCGGGG CGGTCCTCCG ACCTGCTACG ACCGGATTTT 540CCCTGATGTC CACCATCTCC AAGATTCGAT TCTTGGGAGG CTTGAGGGTC NGGGTGACCC 600CCCCGCGGGC CTCATTCNGG GGTNTCGGCN GGTTTCACCC CNTACCNACT GCCNCCCGGN 660TTGCNAATTC NTTCTTCNCT GCCCNNAAAG GGACCNTTAN CTTGCCGCTN GAAANGGTNA 720TCCNGGGCCC NTCCTNGAAN CCCCNTCCCC CT 752(2)SEQ ID NO:3的信息:
(ⅰ)序列特征:
(A)长度:813碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:3:CATATGCATC ACCATCACCA TCACACTTCT AACCGCCCAG CGCGTCGGGG GCGTCGAGCA 60CCACGCGACA CCGGGCCCGA TCGATCTGCT AGCTTGAGTC TGGTCAGGCA TCGTCGTCAG 120CAGCGCGATG CCCTATGTTT GTCGTCGACT CAGATATCGC GGCAATCCAA TCTCCCGCCT 180GCGGCCGGCG GTGCTGCAAA CTACTCCCGG AGGAATTTCG ACGTGCGCAT CAAGATCTTC 240ATGCTGGTCA CGGCTGTCGT TTTGCTCTGT TGTTCGGGTG TGGCCACGGC CGCGCCCAAG 300ACCTACTGCG AGGAGTTGAA AGGCACCGAT ACCGGCCAGG CGTGCCAGAT TCAAATGTCC 360GACCCGGCCT ACAACATCAA CATCAGCCTG CCCAGTTACT ACCCCGACCA GAAGTCGCTG 420GAAAATTACA TCGCCCAGAC GCGCGACAAG TTCCTCAGCG CGGCCACATC GTCCACTCCA 480CGCGAAGCCC CCTACGAATT GAATATCACC TCGGCCACAT ACCAGTCCGC GATACCGCCG 540CGTGGTACGC AGGCCGTGGT GCTCAMGGTC TACCACAACG CCGGCGGCAC GCACCCAACG 600ACCACGTACA AGGCCTTCGA TTGGGACCAG GCCTATCGCA AGCCAATCAC CTATGACACG 660CTGTGGCAGG CTGACACCGA TCCGCTGCCA GTCGTCTTCC CCATTGTTGC AAGGTGAACT 720GAGCAACGCA GACCGGGACA ACWGGTATCG ATAGCCGCCN AATGCCGGCT TGGAACCCNG 780TGAAATTATC ACAACTTCGC AGTCACNAAA NAA 813(2)SEQ ID NO:4的信息:
(ⅰ)序列特征:
(A)长度:447碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:4:CGGTATGAAC ACGGCCGCGT CCGATAACTT CCAGCTGTCC CAGGGTGGGC AGGGATTCGC 60CATTCCGATC GGGCAGGCGA TGGCGATCGC GGGCCAGATC CGATCGGGTG GGGGGTCACC 120CACCGTTCAT ATCGGGCCTA CCGCCTTCCT CGGCTTGGGT GTTGTCGACA ACAACGGCAA 180CGGCGCACGA GTCCAACGCG TGGTCGGGAG CGCTCCGGCG GCAAGTCTCG GCATCTCCAC 240CGGCGACGTG ATCACCGCGG TCGACGGCGC TCCGATCAAC TCGGCCACCG CGATGGCGGA 300CGCGCTTAAC GGGCATCATC CCGGTGACGT CATCTCGGTG AACTGGCAAA CCAAGTCGGG 360CGGCACGCGT ACAGGGAACG TGACATTGGC CGAGGGACCC CCGGCCTGAT TTCGTCGYGG 420ATACCACCCG CCGGCCGGCC AATTGGA 447(2)SEQ ID NO:5的信息:
(ⅰ)序列特征:
(A)长度:604碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:5:GTCCCACTGC GGTCGCCGAG TATGTCGCCC AGCAAATGTC TGGCAGCCGC CCAACGGAAT 60CCGGTGATCC GACGTCGCAG GTTGTCGAAC CCGCCGCCGC GGAAGTATCG GTCCATGCCT 120AGCCCGGCGA CGGCGAGCGC CGGAATGGCG CGAGTGAGGA GGCGGGCAAT TTGGCGGGGC 180CCGGCGACGG NGAGCGCCGG AATGGCGCGA GTGAGGAGGT GGNCAGTCAT GCCCAGNGTG 240ATCCAATCAA CCTGNATTCG GNCTGNGGGN CCATTTGACA ATCGAGGTAG TGAGCGCAAA 300TGAATGATGG AAAACGGGNG GNGACGTCCG NTGTTCTGGT GGTGNTAGGT GNCTGNCTGG 360NGTNGNGGNT ATCAGGATGT TCTTCGNCGA AANCTGATGN CGAGGAACAG GGTGTNCCCG 420NNANNCCNAN GGNGTCCNAN CCCNNNNTCC TCGNCGANAT CANANAGNCG NTTGATGNGA 480NAAAAGGGTG GANCAGNNNN AANTNGNGGN CCNAANAANC NNNANNGNNG NNAGNTNGNT 540NNNTNTTNNC ANNNNNNNTG NNGNNGNNCN NNNCAANCNN NTNNNNGNAA NNGGNTTNTT 600NAAT 604(2)SEQ ID NO:6的信息:
(ⅰ)序列特征:
(A)长度:633碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:6:TTGCANGTCG AACCACCTCA CTAAAGGGAA CAAAAGCTNG AGCTCCACCG CGGTGGCGGC 60CGCTCTAGAA CTAGTGKATM YYYCKGGCTG CAGSAATYCG GYACGAGCAT TAGGACAGTC 120TAACGGTCCT GTTACGGTGA TCGAATGACC GACGACATCC TGCTGATCGA CACCGACGAA 180CGGGTGCGAA CCCTCACCCT CAACCGGCCG CAGTCCCGYA ACGCGCTCTC GGCGGCGCTA 240CGGGATCGGT TTTTCGCGGY GTTGGYCGAC GCCGAGGYCG ACGACGACAT CGACGTCGTC 300ATCCTCACCG GYGCCGATCC GGTGTTCTGC GCCGGACTGG ACCTCAAGGT AGCTGGCCGG 360GCAGACCGCG CTGCCGGACA TCTCACCGCG GTGGGCGGCC ATGACCAAGC CGGTGATCGG 420CGCGATCAAC GGCGCCGCGG TCACCGGCGG GCTCGAACTG GCGCTGTACT GCGACATCCT 480GATCGCCTCC GAGCACGCCC GCTTCGNCGA CACCCACGCC CGGGTGGGGC TGCTGCCCAC 540CTGGGGACTC AGTGTGTGCT TGCCGCAAAA GGTCGGCATC GGNCTGGGCC GGTGGATGAG 600CCTGACCGGC GACTACCTGT CCGTGACCGA CGC 633(2)SEQ ID NO:7的信息:
(ⅰ)序列特征:
(A)长度:1362碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:7:CGACGACGAC GGCGCCGGAG AGCGGGCGCG AACGGCGATC GACGCGGCCC TGGCCAGAGT 60CGGCACCACC CAGGAGGGAG TCGAATCATG AAATTTGTCA ACCATATTGA GCCCGTCGCG 120CCCCGCCGAG CCGGCGGCGC GGTCGCCGAG GTCTATGCCG AGGCCCGCCG CGAGTTCGGC 180CGGCTGCCCG AGCCGCTCGC CATGCTGTCC CCGGACGAGG GACTGCTCAC CGCCGGCTGG 240GCGACGTTGC GCGAGACACT GCTGGTGGGC CAGGTGCCGC GTGGCCGCAA GGAAGCCGTC 300GCCGCCGCCG TCGCGGCCAG CCTGCGCTGC CCCTGGTGCG TCGACGCACA CACCACCATG 360CTGTACGCGG CAGGCCAAAC CGACACCGCC GCGGCGATCT TGGCCGGCAC AGCACCTGCC 420GCCGGTGACC CGAACGCGCC GTATGTGGCG TGGGCGGCAG GAACCGGGAC ACCGGCGGGA 480CCGCCGGCAC CGTTCGGCCC GGATGTCGCC GCCGAATACC TGGGCACCGC GGTGCAATTC 540CACTTCATCG CACGCCTGGT CCTGGTGCTG CTGGACGAAA CCTFCCTGCC GGGGGGCCCG 600CGCGCCCAAC AGCTCATGCG CCGCGCCGGT GGACTGGTGT TCGCCCGCAA GGTGCGCGCG 660GAGCATCGGC CGGGCCGCTC CACCCGCCGG CTCGAGCCGC GAACGCTGCC CGACGATCTG 720GCATGGGCAA CACCGTCCGA GCCCATAGCA ACCGCGTTCG CCGCGCTCAG CCACCACCTG 780GACACCGCGC CGCACCTGCC GCCACCGACT CGTCAGGTGG TCAGGCGGGT CGTGGGGTCG 840TGGCACGGCG AGCCAATGCC GATGAGCAGT CGCTGGACGA ACGAGCACAC CGCCGAGCTG 900CCCGCCGACC TGCACGCGCC CACCCGTCTT GCCCTGCTGA CCGGCCTGGC CCCGCATCAG 960GTGACCGACG ACGACGTCGC CGCGGCCCGA TCCCTGCTCG ACACCGATGC GGCGCTGGTT 1020GGCGCCCTGG CCTGGGCCGC CTTCACCGCC GCGCGGCGCA TCGGCACCTG GATCGGCGCC 1080GCCGCCGAGG GCCAGGTGTC GCGGCAAAAC CCGACTGGGT GAGTGTGCGC GCCCTGTCGG 1140TAGGGTGTCA TCGCTGGCCC GAGGGATCTC GCGGCGGCGA ACGGAGGTGG CGACACAGGT 1200GGAAGCTGCG CCCACTGGCT TGCGCCCCAA CGCCGTCGTG GGCGTTCGGT TGGCCGCACT 1260GGCCGATCAG GTCGGCGCCG GCCCTTGGCC GAAGGTCCAG CTCAACGTGC CGTCACCGAA 1320GGACCGGACG GTCACCGGGG GTCACCCTGC GCGCCCAAGG AA 1362(2)SEQ ID NO:8的信息:
(ⅰ)序列特征:
(A)长度:1458碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:8:GCGACGACCC CGATATGCCG GGCACCGTAG CGAAAGCCGT CGCCGACGCA CTCGGGCGCG 60GTATCGCTCC CGTTGAGGAC ATTCAGGACT GCGTGGAGGC CCGGCTGGGG GAAGCCGGTC 120TGGATGACGT GGCCCGTGTT TACATCATCT ACCGGCAGCG GCGCGCCGAG CTGCGGACGG 180CTAAGGCCTT GCTCGGCGTG CGGGACGAGT TAAAGCTGAG CTTGGCGGCC GTGACGGTAC 240TGCGCGAGCG CTATCTGCTG CACGACGAGC AGGGCCGGCC GGCCGAGTCG ACCGGCGAGC 300TGATGGACCG ATCGGCGCGC TGTGTCGCGG CGGCCGAGGA CCAGTATGAG CCGGGCTCGT 360CGAGGCGGTG GGCCGAGCGG TTCGCCACGC TATTACGCAA CCTGGAATTC CTGCCGAATT 420CGCCCACGTT GATGAACTCT GGCACCGACC TGGGACTGCT CGCCGGCTGT TTTGTTCTGC 480CGATTGAGGA TTCGCTGCAA TCGATCTTTG CGACGCTGGG ACAGGCCGCC GAGCTGCAGC 540GGGCTGGAGG CGGCACCGGA TATGCGTTCA GCCACCTGCG ACCCGCCGGG GATCGGGTGG 600CCTCCACGGG CGGCACGGCC AGCGGACCGG TGTCGTTTCT ACGGCTGTAT GACAGTGCCG 660CGGGTGTGGT CTCCATGGGC GGTCGCCGGC GTGGCGCCTG TATGGCTGTG CTTGATGTGT 720CGCACCCGGA TATCTGTGAT TTCGTCACCG CCAAGGCCGA ATCCCCCAGC GAGCTCCCGC 780ATTTCAACCT ATCGGTTGGT GTGACCGACG CGTTCCTGCG GGCCGTCGAA CGCAACGGCC 840TACACCGGCT GGTCAATCCG CGAACCGGCA AGATCGTCGC GCGGATGCCC GCCGCCGAGC 900TGTTCGACGC CATCTGCAAA GCCGCGCACG CCGGTGGCGA TCCCGGGCTG GTGTTTCTCG 960ACACGATCAA TAGGGCAAAC CCGGTGCCGG GGAGAGGCCG CATCGAGGCG ACCAACCCGT 1020GCGGGGAGGT CCCACTGCTG CCTTACGAGT CATGTAATCT CGGCTCGATC AACCTCGCCC 1080GGATGCTCGC CGACGGTCGC GTCGACTGGG ACCGGCTCGA GGAGGTCGCC GGTGTGGCGG 1140TGCGGTTCCT TGATGACGTC ATCGATGTCA GCCGCTACCC CTTCCCCGAA CTGGGTGAGG 1200CGGCCCGCGC CACCCGCAAG ATCGGGCTGG GAGTCATGGG TTTGGCGGAA CTGCTTGCCG 1260CACTGGGTAT TCCGTACGAC AGTGAAGAAG CCGTGCGGTT AGCCACCCGG CTCATGCGTC 1320GCATACAGCA GGCGGCGCAC ACGGCATCGC GGAGGCTGGC CGAAGAGCGG GGCGCATTCC 1380CGGCGTTCAC CGATAGCCGG TTCGCGCGGT CGGGCCCGAG GCGCAACGCA CAGGTCACCT 1440CCGTCGCTCC GACGGGCA 1458(2)SEQ ID NO:9的信息:
(ⅰ)序列特征:
(A)长度:862碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:9:ACGGTGTAAT CGTGCTGGAT CTGGAACCGC GTGGCCCGCT ACCTACCGAG ATCTACTGGC 60GGCGCAGGGG GCTGGCCCTG GGCATCGCGG TCGTCGTAGT CGGGATCGCG GTGGCCATCG 120TCATCGCCTT CGTCGACAGC AGCGCCGGTG CCAAACCGGT CAGCGCCGAC AAGCCGGCCT 180CCGCCCAGAG CCATCCGGGC TCGCCGGCAC CCCAAGCACC CCAGCCGGCC GGGCAAACCG 240AAGGTAACGC CGCCGCGGCC CCGCCGCAGG GCCAAAACCC CGAGACACCC ACGCCCACCG 300CCGCGGTGCA GCCGCCGCCG GTGCTCAAGG AAGGGGACGA TTGCCCCGAT TCGACGCTGG 360CCGTCAAAGG TTTGACCAAC GCGCCGCAGT ACTACGTCGG CGACCAGCCG AAGTTCACCA 420TGGTGGTCAC CAACATCGGC CTGGTGTCCT GTAAACGCGA CGTTGGGGCC GCGGTGTTGG 480CCGCCTACGT TTACTCGCTG GACAACAAGC GGTTGTGGTC CAACCTGGAC TGCGCGCCCT 540CGAATGAGAC GCTGGTCAAG ACGTTTTCCC CCGGTGAGCA GGTAACGACC GCGGTGACCT 600GGACCGGGAT GGGATCGGCG CCGCGCTGCC CATTGCCGCG GCCGGCGATC GGGCCGGGCA 660CCTACAATCT CGTGGTACAA CTGGGCAATC TGCGCTCGCT GCCGGTTCCG TTCATCCTGA 720ATCAGCCGCC GCCGCCGCCC GGGCCGGTAC CCGCTCCGGG TCCAGCGCAG GCGCCTCCGC 780CGGAGTCTCC CGCGCAAGGC GGATAATTAT TGATCGCTGA TGGTCGATTC CGCCAGCTGT 840GACAACCCCT CGCCTCGTGC CG 862(2)SEQ ID NO:10的信息:
(ⅰ)序列特征:
(A)长度:622碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:10:TTGATCAGCA CCGGCAAGGC GTCACATGCC TCCCTGGGTG TGCAGGTGAC CAATGACAAA 60GACACCCCGG GCGCCAAGAT CGTCGAAGTA GTGGCCGGTG GTGCTGCCGC GAACGCTGGA 120GTGCCGAAGG GCGTCGTTGT CACCAAGGTC GACGACCGCC CGATCAACAG CGCGGACGCG 180TTGGTTGCCG CCGTGCGGTC CAAAGCGCCG GGCGCCACGG TGGCGCTAAC CTTTCAGGAT 240CCCTCGGGCG GTAGCCGCAC AGTGCAAGTC ACCCTCGGCA AGGCGGAGCA GTGATGAAGG 300TCGCCGCGCA GTGTTCAAAG CTCGGATATA CGGTGGCACC CATGGAACAG CGTGCGGAGT 360TGGTGGTTGG CCGGGCACTT GTCGTCGTCG TTGACGATCG CACGGCGCAC GGCGATGAAG 420ACCACAGCGG GCCGCTTGTC ACCGAGCTGC TCACCGAGGC CGGGTTTGTT GTCGACGGCG 480TGGTGGCGGT GTCGGCCGAC GAGGTCGAGA TCCGAAATGC GCTGAACACA GCGGTGATCG 540GCGGGGTGGA CCTGGTGGTG TCGGTCGGCG GGACCGGNGT GACGNCTCGC GATGTCACCC 600CGGAAGCCAC CCGNGACATT CT 622(2)SEQ ID NO:11的信息:
(ⅰ)序列特征:
(A)长度:1200碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:11:GGCGCAGCGG TAAGCCTGTT GGCCGCCGGC ACACTGGTGT TGACAGCATG CGGCGGTGGC 60ACCAACAGCT CGTCGTCAGG CGCAGGCGGA ACGTCTGGGT CGGTGCACTG CGGCGGCAAG 120AAGGAGCTCC ACTCCAGCGG CTCGACCGCA CAAGAAAATG CCATGGAGCA GTTCGTCTAT 180GCCTACGTGC GATCGTGCCC GGGCTACACG TTGGACTACA ACGCCAACGG GTCCGGTGCC 240GGGGTGACCC AGTTTCTCAA CAACGAAACC GATTTCGCCG GCTCGGATGT CCCGTTGAAT 300CCGTCGACCG GTCAACCTGA CCGGTCGGCG GAGCGGTGCG GTTCCCCGGC ATGGGACCTG 360CCGACGGTGT TCGGCCCGAT CGCGATCACC TACAATATCA AGGGCGTGAG CACGCTGAAT 420CTTGACGGAC CCACTACCGC CAAGATTTTC AACGGCACCA TCACCGTGTG GAATGATCCA 480CAGATCCAAG CCCTCAACTC CGGCACCGAC CTGCCGCCAA CACCGATTAG CGTTATCTTC 540CGCAGCGACA AGTCCGGTAC GTCGGACAAC TTCCAGAAAT ACCTCGACGG TGTATCCAAC 600GGGGCGTGGG GCAAAGGCGC CAGCGAAACG TTCAGCGGGG GCGTCGGCGT CGGCGCCAGC 660GGGAACAACG GAACGTCGGC CCTACTGCAG ACGACCGACG GGTCGATCAC CTACAACGAG 720TGGTCGTTTG CGGTGGGTAA GCAGTTGAAC ATGGCCCAGA TCATCACGTC GGCGGGTCCG 780GATCCAGTGG CGATCACCAC CGAGTCGGTC GGTAAGACAA TCGCCGGGGC CAAGATCATG 840GGACAAGGCA ACGACCTGGT ATTGGACACG TCGTCGTTCT ACAGACCCAC CCAGCCTGGC 900TCTTACCCGA TCGTGCTGGC GACCTATGAG ATCGTCTGCT CGAAATACCC GGATGCGACG 960ACCGGTACTG CGGTAAGGGC GTTTATGCAA GCCGCGATTG GTCCAGGCCA AGAAGGCCTG 1020GACCAATACG GCTCCATTCC GTTGCCCAAA TCGTTCCAAG CAAAATTGGC GGCCGCGGTG 1080AATGCTATTT CTTGACCTAG TGAAGGGAAT TCGACGGTGA GCGATGCCGT TCCGCAGGTA 1140GGGTCGCAAT TTGGGCCGTA TCAGCTATTG CGGCTGCTGG GCCGAGGCGG GATGGGCGAG 1200(2)SEQ ID NO:12的信息:
(ⅰ)序列特征:
(A)长度:1155碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:12:GCAAGCAGCT GCAGGTCGTG CTGTTCGACG AACTGGGCAT GCCGAAGACC AAACGCACCA 60AGACCGGCTA CACCACGGAT GCCGACGCGC TGCAGTCGTT GTTCGACAAG ACCGGGCATC 120CGTTTCTGCA ACATCTGCTC GCCCACCGCG ACGTCACCCG GCTCAAGGTC ACCGTCGACG 180GGTTGCTCCA AGCGGTGGCC GCCGACGGCC GCATCCACAC CACGTTCAAC CAGACGATCG 240CCGCGACCGG CCGGCTCTCC TCGACCGAAC CCAACCTGCA GAACATCCCG ATCCGCACCG 300ACGCGGGCCG GCGGATCCGG GACGCGTTCG TGGTCGGGGA CGGTTACGCC GAGTTGATGA 360CGGCCGACTA CAGCCAGATC GAGATGCGGA TCATGGGGCA CCTGTCCGGG GACGAGGGCC 420TCATCGAGGC GTTCAACACC GGGGAGGACC TGTATTCGTT CGTCGCGTCC CGGGTGTTCG 480GTGTGCCCAT CGACGAGGTC ACCGGCGAGT TGCGGCGCCG GGTCAAGGCG ATGTCCTACG 540GGCTGGTTTA CGGGTTGAGC GCCTACGGCC TGTCGCAGCA GTTGAAAATC TCCACCGAGG 600AAGCCAACGA GCAGATGGAC GCGTATTTCG CCCGATTCGG CGGGGTGCGC GACTACCTGC 660GCGCCGTAGT CGAGCGGGCC CGCAAGGACG GCTACACCTC GACGGTGCTG GGCCGTCGCC 720GCTACCTGCC CGAGCTGGAC AGCAGCAACC GTCAAGTGCG GGAGGCCGCC GAGCGGGCGG 780CGCTGAACGC GCCGATCCAG GGCAGCGCGG CCGACATCAT CAAGGTGGCC ATGATCCAGG 840TCGACAAGGC GCTCAACGAG GCACAGCTGG CGTCGCGCAT GCTGCTGCAG GTCCACGACG 900AGCTGCTGTT CGAAATCGCC CCCGGTGAAC GCGAGCGGGT CGAGGCCCTG GTGCGCGACA 960AGATGGGCGG CGCTTACCCG CTCGACGTCC CGCTGGAGGT GTCGGTGGGC TACGGCCGCA 1020GCTGGGACGC GGCGGCGCAC TGAGTGCCGA GCGTGCATCT GGGGCGGGAA TTCGGCGATT 1080TTTCCGCCCT GAGTTCACGC TCGGCGCAAT CGGGACCGAG TTTGTCCAGC GTGTACCCGT 1140CGAGTAGCCT CGTCA 1155(2)SEQ ID NO:13的信息:
(ⅰ)序列特征:
(A)长度:1771碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:13:GAGCGCCGTC TGGTGTTTGA ACGGTTTTAC CGGTCGGCAT CGGCACGGGC GTTGCCGGGT 60TCGGGCCTCG GGTTGGCGAT CGTCAAACAG GTGGTGCTCA ACCACGGCGG ATTGCTGCGC 120ATCGAAGACA CCGACCCAGG CGGCCAGCCC CCTGGAACGT CGATTTACGT GCTGCTCCCC 180GGCCGTCGGA TGCCGATTCC GCAGCTTCCC GGTGCGACGG CTGGCGCTCG GAGCACGGAC 240ATCGAGAACT CTCGGGGTTC GGCGAACGTT ATCTCAGTGG AATCTCAGTC CACGCGCGCA 300ACCTAGTTGT GCAGTTACTG TTGAAAGCCA CACCCATGCC AGTCCACGCA TGGCCAAGTT 360GGCCCGAGTA GTGGGCCTAG TACAGGAAGA GCAACCTAGC GACATGACGA ATCACCCACG 420GTATTCGCCA CCGCCGCAGC AGCCGGGAAC CCCAGGTTAT GCTCAGGGGC AGCAGCAAAC 480GTACAGCCAG CAGTTCGACT GGCGTTACCC ACCGTCCCCG CCCCCGCAGC CAACCCAGTA 540CCGTCAACCC TACGAGGCGT TGGGTGGTAC CCGGCCGGGT CTGATACCTG GCGTGATTCC 600GACCATGACG CCCCCTCCTG GGATGGTTCG CCAACGCCCT CGTGCAGGCA TGTTGGCCAT 660CGGCGCGGTG ACGATAGCGG TGGTGTCCGC CGGCATCGGC GGCGCGGCCG CATCCCTGGT 720CGGGTTCAAC CGGGCACCCG CCGGCCCCAG CGGCGGCCCA GTGGCTGCCA GCGCGGCGCC 780AAGCATCCCC GCAGCAAACA TGCCGCCGGG GTCGGTCGAA CAGGTGGCGG CCAAGGTGGT 840GCCCAGTGTC GTCATGTTGG AAACCGATCT GGGCCGCCAG TCGGAGGAGG GCTCCGGCAT 900CATTCTGTCT GCCGAGGGGC TGATCTTGAC CAACAACCAC GTGATCGCGG CGGCCGCCAA 960GCCTCCCCTG GGCAGTCCGC CGCCGAAAAC GACGGTAACC TTCTCTGACG GGCGGACCGC 1020ACCCTTCACG GTGGTGGGGG CTGACCCCAC CAGTGATATC GCCGTCGTCC GTGTTCAGGG 1080CGTCTCCGGG CTCACCCCGA TCTCCCTGGG TTCCTCCTCG GACCTGAGGG TCGGTCAGCC 1140GGTGCTGGCG ATCGGGTCGC CGCTCGGTTT GGAGGGCACC GTGACCACGG GGATCGTCAG 1200CGCTCTCAAC CGTCCAGTGT CGACGACCGG CGAGGCCGGC AACCAGAACA CCGTGCTGGA 1260CGCCATTCAG ACCGACGCCG CGATCAACCC CGGTAACTCC GGGGGCGCGC TGGTGAACAT 1320GAACGCTCAA CTCGTCGGAG TCAACTCGGC CATTGCCACG CTGGGCGCGG ACTCAGCCGA 1380TGCGCAGAGC GGCTCGATCG GTCTCGGTTT TGCGATTCCA GTCGACCAGG CCAAGCGCAT 1440CGCCGACGAG TTGATCAGCA CCGGCAAGGC GTCACATGCC TCCCTGGGTG TGCAGGTGAC 1500CAATGACAAA GACACCCCGG GCGCCAAGAT CGTCGAAGTA GTGGCCGGTG GTGCTGCCGC 1560GAACGCTGGA GTGCCGAAGG GCGTCGTTGT CACCAAGGTC GACGACCGCC CGATCAACAG 1620CGCGGACGCG TTGGTTGCCG CCGTGCGGTC CAAAGCGCCG GGCGCCACGG TGGCGCTAAC 1680CTTTCAGGAT CCCTCGGGCG GTAGCCGCAC AGTGCAAGTC ACCCTCGGCA AGGCGGAGCA 1740GTGATGAAGG TCGCCGCGCA GTGTTCAAAG C 1771(2)SEQ ID NO:14的信息:
(ⅰ)序列特征:
(A)长度:1058碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEC)ID NO:14:CTCCACCGCG GTGGCGGCCG CTCTAGAACT AGTGGATCCC CCGGGCTGCA GGAATTCGGC 60ACGAGGATCC GACGTCGCAG GTTGTCGAAC CCGCCGCCGC GGAAGTATCG GTCCATGCCT 120AGCCCGGCGA CGGCGAGCGC CGGAATGGCG CGAGTGAGGA GGCGGGCAAT TTGGCGGGGC 180CCGGCGACGG CGAGCGCCGG AATGGCGCGA GTGAGGAGGC GGGCAGTCAT GCCCAGCGTG 240ATCCAATCAA CCTGCATTCG GCCTGCGGGC CCATTTGACA ATCGAGGTAG TGAGCGCAAA 300TGAATGATGG AAAACGGGCG GTGACGTCCG CTGTTCTGGT GGTGCTAGGT GCCTGCCTGG 360CGTTGTGGCT ATCAGGATGT TCTTCGCCGA AACCTGATGC CGAGGAACAG GGTGTTCCCG 420TGAGCCCGAC GGCGTCCGAC CCCGCGCTCC TCGCCGAGAT CAGGCAGTCG CTTGATGCGA 480CAAAAGGGTT GACCAGCGTG CACGTAGCGG TCCGAACAAC CGGGAAAGTC GACAGCTTGC 540TGGGTATTAC CAGTGCCGAT GTCGACGTCC GGGCCAATCC GCTCGCGGCA AAGGGCGTAT 600GCACCTACAA CGACGAGCAG GGTGTCCCGT TTCGGGTACA AGGCGACAAC ATCTCGGTGA 660AACTGTTCGA CGACTGGAGC AATCTCGGCT CGATTTCTGA ACTGTCAACT TCACGCGTGC 720TCGATCCTGC CGCTGGGGTG ACGCAGCTGC TGTCCGGTGT CACGAACCTC CAAGCGCAAG 780GTACCGAAGT GATAGACGGA ATTTCGACCA CCAAAATCAC CGGGACCATC CCCGCGAGCT 840CTGTCAAGAT GCTTGATCCT GGCGCCAAGA GTGCAAGGCC GGCGACCGTG TGGATTGCCC 900AGGACGGCTC GCACCACCTC GTCCGAGCGA GCATCGACCT CGGATCCGGG TCGATTCAGC 960TCACGCAGTC GAAATGGAAC GAACCCGTCA ACGTCGACTA GGCCGAAGTT GCGTCGACGC 1020GTTGNTCGAA ACGCCCTTGT GAACGGTGTC AACGGNAC 1058(2)SEQ ID NO:15的信息:
(ⅰ)序列特征:
(A)长度:542碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:15:GAATTCGGCA CGAGAGGTGA TCGACATCAT CGGGACCAGC CCCACATCCT GGGAACAGGC 60GGCGGCGGAG GCGGTCCAGC GGGCGCGGGA TAGCGTCGAT GACATCCGCG TCGCTCGGGT 120CATTGAGCAG GACATGGCCG TGGACAGCGC CGGCAAGATC ACCTACCGCA TCAAGCTCGA 180AGTGTCGTTC AAGATGAGGC CGGCGCAACC GCGCTAGCAC GGGCCGGCGA GCAAGACGCA 240AAATCGCACG GTTTGCGGTT GATTCGTGCG ATTTTGTGTC TGCTCGCCGA GGCCTACCAG 300GCGCGGCCCA GGTCCGCGTG CTGCCGTATC CAGGCGTGCA TCGCGATTCC GGCGGCCACG 360CCGGAGTTAA TGCTTCGCGT CGACCCGAAC TGGGCGATCC GCCGGNGAGC TGATCGATGA 420CCGTGGCCAG CCCGTCGATG CCCGAGTTGC CCGAGGAAAC GTGCTGCCAG GCCGGTAGGA 480AGCGTCCGTA GGCGGCGGTG CTGACCGGCT CTGCCTGCGC CCTCAGTGCG GCCAGCGAGC 540GG 542(2)SEQ ID NO:16的信息:
(ⅰ)序列特征:
(A)长度:913碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:16:CGGTGCCGCC CGCGCCTCCG TTGCCCCCAT TGCCGCCGTC GCCGATCAGC TGCGCATCGC 60CACCATCACC GCCTTTGCCG CCGGCACCGC CGGTGGCGCC GGGGCCGCCG ATGCCACCGC 120TTGACCCTGG CCGCCGGCGC CGCCATTGCC ATACAGCACC CCGCCGGGGG CACCGTTACC 180GCCGTCGCCA CCGTCGCCGC CGCTGCCGTT TCAGGCCGGG GAGGCCGAAT GAACCGCCGC 240CAAGCCCGCC GCCGGCACCG TTGCCGCCTT TTCCGCCCGC CCCGCCGGCG CCGCCAATTG 300CCGAACAGCC AMGCACCGTT GCCGCCAGCC CCGCCGCCGT TAACGGCGCT GCCGGGCGCC 360GCCGCCGGAC CCGCCATTAC CGCCGTTCCC GTTCGGTGCC CCGCCGTTAC CGGCGCCGCC 420GTTTGCCGCC AATATTCGGC GGGCACCGCC AGACCCGCCG GGGCCACCAT TGCCGCCGGG 480CACCGAAACA ACAGCCCAAC GGTGCCGCCG GCCCCGCCGT TTGCCGCCAT CACCGGCCAT 540TCACCGCCAG CACCGCCGTT AATGTTTATG AACCCGGTAC CGCCAGCGCG GCCCCTATTG 600CCGGGCGCCG GAGNGCGTGC CCGCCGGCGC CGCCAACGCC CAAAAGCCCG GGGTTGCCAC 660CGGCCCCGCC GGACCCACCG GTCCCGCCGA TCCCCCCGTT GCCGCCGGTG CCGCCGCCAT 720TGGTGCTGCT GAAGCCGTTA GCGCCGGTTC CGCSGGTTCC GGCGGTGGCG CCNTGGCCGC 780CGGCCCCGCC GTTGCCGTAC AGCCACCCCC CGGTGGCGCC GTTGCCGCCA TTGCCGCCAT 840TGCCGCCGTT GCCGCCATTG CCGCCGTTCC CGCCGCCACC GCCGGNTTGG CCGCCGGCGC 900CGCCGGCGGC CGC 913(2)SEQ ID NO:17的信息:
(ⅰ)序列特征:
(A)长度:1872碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:17:GACTACGTTG GTGTAGAAAA ATCCTGCCGC CCGGACCCTT AAGGCTGGGA CAATTTCTGA 60TAGCTACCCC GACACAGGAG GTTACGGGAT GAGCAATTCG CGCCGCCGCT CACTCAGGTG 120GTCATGGTTG CTGAGCGTGC TGGCTGCCGT CGGGCTGGGC CTGGCCACGG CGCCGGCCCA 180GGCGGCCCCG CCGGCCTTGT CGCAGGACCG GTTCGCCGAC TTCCCCGCGC TGCCCCTCGA 240CCCGTCCGCG ATGGTCGCCC AAGTGGCGCC ACAGGTGGTC AACATCAACA CCAAACTGGG 300CTACAACAAC GCCGTGGGCG CCGGGACCGG CATCGTCATC GATCCCAACG GTGTCGTGCT 360GACCAACAAC CACGTGATCG CGGGCGCCAC CGACATCAAT GCGTTCAGCG TCGGCTCCGG 420CCAAACCTAC GGCGTCGATG TGGTCGGGTA TGACCGCACC CAGGATGTCG CGGTGCTGCA 480GCTGCGCGGT GCCGGTGGCC TGCCGTCGGC GGCGATCGGT GGCGGCGTCG CGGTTGGTGA 540GCCCGTCGTC GCGATGGGCA ACAGCGGTGG GCAGGGCGGA ACGCCCCGTG CGGTGCCTGG 600CAGGGTGGTC GCGCTCGGCC AAACCGTGCA GGCGTCGGAT TCGCTGACCG GTGCCGAAGA 660GACATTGAAC GGGTTGATCC AGTTCGATGC CGCAATCCAG CCCGGTGATT CGGGCGGGCC 720CGTCGTCAAC GGCCTAGGAC AGGTGGTCGG TATGAACACG GCCGCGTCCG ATAACTTCCA 780GCTGTCCCAG GGTGGGCAGG GATTCGCCAT TCCGATCGGG CAGGCGATGG CGATCGCGGG 840CCAAATCCGA TCGGGTGGGG GGTCACCCAC CGTTCATATC GGGCCTACCG CCTTCCTCGG 900CTTGGGTGTT GTCGACAACA ACGGCAACGG CGCACGAGTC CAACGCGTGG TCGGAAGCGC 960TCCGGCGGCA AGTCTCGGCA TCTCCACCGG CGACGTGATC ACCGCGGTCG ACGGCGCTCC 1020GATCAACTCG GCCACCGCGA TGGCGGACGC GCTTAACGGG CATCATCCCG GTGACGTCAT 1080CTCGGTGAAC TGGCAAACCA AGTCGGGCGG CACGCGTACA GGGAACGTGA CATTGGCCGA 1140GGGACCCCCG GCCTGATTTG TCGCGGATAC CACCCGCCGG CCGGCCAATT GGATTGGCGC 1200CAGCCGTGAT TGCCGCGTGA GCCCCCGAGT TCCGTCTCCC GTGCGCGTGG CATTGTGGAA 1260GCAATGAACG AGGCAGAACA CAGCGTTGAG CACCCTCCCG TGCAGGGCAG TTACGTCGAA 1320GGCGGTGTGG TCGAGCATCC GGATGCCAAG GACTTCGGCA GCGCCGCCGC CCTGCCCGCC 1380GATCCGACCT GGTTTAAGCA CGCCGTCTTC TACGAGGTGC TGGTCCGGGC GTTCTTCGAC 1440GCCAGCGCGG ACGGTTCCGN CGATCTGCGT GGACTCATCG ATCGCCTCGA CTACCTGCAG 1500TGGCTTGGCA TCGACTGCAT CTGTTGCCGC CGTTCCTACG ACTCACCGCT GCGCGACGGC 1560GGTTACGACA TTCGCGACTT CTACAAGGTG CTGCCCGAAT TCGGCACCGT CGACGATTTC 1620GTCGCCCTGG TCGACACCGC TCACCGGCGA GGTATCCGCA TCATCACCGA CCTGGTGATG 1680AATCACACCT CGGAGTCGCA CCCCTGGTTT CAGGAGTCCC GCCGCGACCC AGACGGACCG 1740TACGGTGACT ATTACGTGTG GAGCGACACC AGCGAGCGCT ACACCGACGC CCGGATCATC 1800TTCGTCGACA CCGAAGAGTC GAACTGGTCA TTCGATCCTG TCCGCCGACA GTTNCTACTG 1860GCACCGATTC TT 1872(2)SEQ ID NO:18的信息:
(ⅰ)序列特征:
(A)长度:1482碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:18:CTTCGCCGAA ACCTGATGCC GAGGAACAGG GTGTTCCCGT GAGCCCGACG GCGTCCGACC 60CCGCGCTCCT CGCCGAGATC AGGCAGTCGC TTGATGCGAC AAAAGGGTTG ACCAGCGTGC 120ACGTAGCGGT CCGAACAACC GGGAAAGTCG ACAGCTTGCT GGGTATTACC AGTGCCGATG 180TCGACGTCCG GGCCAATCCG CTCGCGGCAA AGGGCGTATG CACCTACAAC GACGAGCAGG 240GTGTCCCGTT TCGGGTACAA GGCGACAACA TCTCGGTGAA ACTGTTCGAC GACTGGAGCA 300ATCTCGGCTC GATTTCTGAA CTGTCAACTT CACGCGTGCT CGATCCTGCC GCTGGGGTGA 360CGCAGCTGCT GTCCGGTGTC ACGAACCTCC AAGCGCAAGG TACCGAAGTG ATAGACGGAA 420TTTCGACCAC CAAAATCACC GGGACCATCC CCGCGAGCTC TGTCAAGATG CTTGATCCTG 480GCGCCAAGAG TGCAAGGCCG GCGACCGTGT GGATTGCCCA GGACGGCTCG CACCACCTCG 540TCCGAGCGAG CATCGACCTC GGATCCGGGT CGATTCAGCT CACGCAGTCG AAATGGAACG 600AACCCGTCAA CGTCGACTAG GCCGAAGTTG CGTCGACGCG TTGCTCGAAA CGCCCTTGTG 660AACGGTGTCA ACGGCACCCG AAAACTGACC CCCTGACGGC ATCTGAAAAT TGACCCCCTA 720GACCGGGCGG TTGGTGGTTA TTCTTCGGTG GTTCCGGCTG GTGGGACGCG GCCGAGGTCG 780CGGTCTTTGA GCCGGTAGCT GTCGCCTTTG AGGGCGACGA CTTCAGCATG GTGGACGAGG 840CGGTCGATCA TGGCGGCAGC AACGACGTCG TCGCCGCCGA AAACCTCGCC CCACCGGCCG 900AAGGCCTTAT TGGACGTGAC GATCAAGCTG GCCCGCTCAT ACCGGGAGGA CACCAGCTGG 960AAGAAGAGGT TGGCGGCCTC GGGCTCAAAC GGAATGTAAC CGACTTCGTC AACCACCAGG 1020AGCGGATAGC GGCCAAACCG GGTGAGTTCG GCGTAGATGC GCCCGGCGTG GTGAGCCTCG 1080GCGAACCGTG CTACCCATTC GGCGGCGGTG GCGAACAGCA CCCGATGACC GGCCTGACAC 1140GCGCGTATCG CCAGGCCGAC CGCAAGATGA GTCTTCCCGG TGCCAGGCGG GGCCCAAAAA 1200CACGACGTTA TCGCGGGCGG TGATGAAATC CAGGGTGCCC AGATGTGCGA TGGTGTCGCG 1260TTTGAGGCCA CGAGCATGCT CAAAGTCGAA CTCTTCCAAC GACTTCCGAA CCGGGAAGCG 1320GGCGGCGCGG ATGCGGCCCT CACCACCATG GGACTCCCGG GCTGACACTT CCCGCTGCAG 1380GCAGGCGGCC AGGTATTCTT CGTGGCTCCA GTTCTCGGCG CGGGCGCGAT CGGCCAGCCG 1440GGACACTGAC TCACGCAGGG TGGGAGCTTT CAATGCTCTT GT 1482(2)SEQ ID NO:19的信息:
(ⅰ)序列特征:
(A)长度:876碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:19:GAATTCGGCA CGAGCCGGCG ATAGCTTCTG GGCCGCGGCC GACCAGATGG CTCGAGGGTT 60CGTGCTCGGG GCCACCGCCG GGCGCACCAC CCTGACCGGT GAGGGCCTGC AACACGCCGA 120CGGTCACTCG TTGCTGCTGG ACGCCACCAA CCCGGCGGTG GTTGCCTACG ACCCGGCCTT 180CGCCTACGAA ATCGGCTACA TCGNGGAAAG CGGACTGGCC AGGATGTGCG GGGAGAACCC 240GGAGAACATC TTCTTCTACA TCACCGTCTA CAACGAGCCG TACGTGCAGC CGCCGGAGCC 300GGAGAACTTC GATCCCGAGG GCGTGCTGGG GGGTATCTAC CGNTATCACG CGGCCACCGA 360GCAACGCACC AACAAGGNGC AGATCCTGGC CTCCGGGGTA GCGATGCCCG CGGCGCTGCG 420GGCAGCACAG ATGCTGGCCG CCGAGTGGGA TGTCGCCGCC GACGTGTGGT CGGTGACCAG 480TTGGGGCGAG CTAAACCGCG ACGGGGTGGT CATCGAGACC GAGAAGCTCC GCCACCCCGA 540TCGGCCGGCG GGCGTGCCCT ACGTGACGAG AGCGCTGGAG AATGCTCGGG GCCCGGTGAT 600CGCGGTGTCG GACTGGATGC GCGCGGTCCC CGAGCAGATC CGACCGTGGG TGCCGGGCAC 660ATACCTCACG TTGGGCACCG ACGGGTTCGG TTTTTCCGAC ACTCGGCCCG CCGGTCGTCG 720TTACTTCAAC ACCGACGCCG AATCCCAGGT TGGTCGCGGT TTTGGGAGGG GTTGGCCGGG 780TCGACGGGTG AATATCGACC CATTCGGTGC CGGTCGTGGG CCGCCCGCCC AGTTACCCGG 840ATTCGACGAA GGTGGGGGGT TGCGCCCGAN TAAGTT 876(2)SEQ ID NO:20的信息:
(ⅰ)序列特征:
(A)长度:1021碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:20:ATCCCCCCGG GCTGCAGGAA TTCGGCACGA GAGACAAAAT TCCACGCGTT AATGCAGGAA 60CAGATTCATA ACGAATTCAC AGCGGCACAA CAATATGTCG CGATCGCGGT TTATTTCGAC 120AGCGAAGACC TGCCGCAGTT GGCGAAGCAT TTTTACAGCC AAGCGGTCGA GGAACGAAAC 180CATGCAATGA TGCTCGTGCA ACACCTGCTC GACCGCGACC TTCGTGTCGA AATTCCCGGC 240GTAGACACGG TGCGAAACCA GTTCGACAGA CCCCGCGAGG CACTGGCGCT GGCGCTCGAT 300CAGGAACGCA CAGTCACCGA CCAGGTCGGT CGGCTGACAG CGGTGGCCCG CGACGAGGGC 360GATTTCCTCG GCGAGCAGTT CATGCAGTGG TTCTTGCAGG AACAGATCGA AGAGGTGGCC 420TTGATGGCAA CCCTGGTGCG GGTTGCCGAT CGGGCCGGGG CCAACCTGTT CGAGCTAGAG 480AACTTCGTCG CACGTGAAGT GGATGTGGCG CCGGCCGCAT CAGGCGCCCC GCACGCTGCC 540GGGGGCCGCC TCTAGATCCC TGGGGGGGAT CAGCGAGTGG TCCCGTTCGC CCGCCCGTCT 600TCCAGCCAGG CCTTGGTGCG GCCGGGGTGG TGAGTACCAA TCCAGGCCAC CCCGACCTCC 660CGGNAAAAGT CGATGTCCTC GTACTCATCG ACGTTCCAGG AGTACACCGC CCGGCCCTGA 720GCTGCCGAGC GGTCAACGAG TTGCGGATAT TCCTTTAACG CAGGCAGTGA GGGTCCCACG 780GCGGTTGGCC CGACCGCCGT GGCCGCACTG CTGGTCAGGT ATCGGGGGGT CTTGGCGAGC 840AACAACGTCG GCAGGAGGGG TGGAGCCCGC CGGATCCGCA GACCGGGGGG GCGAAAACGA 900CATCAACACC GCACGGGATC GATCTGCGGA GGGGGGTGCG GGAATACCGA ACCGGTGTAG 960GAGCGCCAGC AGTTGTTTTT CCACCAGCGA AGCGTTTTCG GGTCATCGGN GGCNNTTAAG 1020T 1021(2)SEQ ID NO:21的信息:
(ⅰ)序列特征:
(A)长度:321碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:21:CGTGCCGACG AACGGAAGAA CACAACCATG AAGATGGTGA AATCGATCGC CGCAGGTCTG 60ACCGCCGCGG CTGCAATCGG CGCCGCTGCG GCCGGTGTGA CTTCGATCAT GGCTGGCGGN 120CCGGTCGTAT ACCAGATGCA GCCGGTCGTC TTCGGCGCGC CACTGCCGTT GGACCCGGNA 180TCCGCCCCTG ANGTCCCGAC CGCCGCCCAG TGGACCAGNC TGCTCAACAG NCTCGNCGAT 240CCCAACGTGT CGTTTGNGAA CAAGGGNAGT CTGGTCGAGG GNGGNATCGG NGGNANCGAG 300GGNGNGNATC GNCGANCACA A 321(2)SECID NO:22的信息:
(ⅰ)序列特征:
(A)长度:373碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:22:TCTTATCGGT TCCGGTTGGC GACGGGTTTT GGGNGCGGGT GGTTAACCCG CTCGGCCAGC 60CGATCGACGG GCGCGGAGAC GTCGACTCCG ATACTCGGCG CGCGCTGGAG CTCCAGGCGC 120CCTCGGTGGT GNACCGGCAA GGCGTGAAGG AGCCGTTGNA GACCGGGATC AAGGCGATTG 180ACGCGATGAC CCCGATCGGC CGCGGGCAGC GCCAGCTGAT CATCGGGGAC CGCAAGACCG 240GCAAAAACCG CCGTCTGTGT CGGACACCAT CCTCAAACCA GCGGGAAGAA CTGGGAGTCC 300GGTGGATCCC AAGAAGCAGG TGCGCTTGTG TATACGTTGG CCATCGGGCA AGAAGGGGAA 360CTTACCATCG CCG 373(2)SEQ ID NO:23的信息:
(ⅰ)序列特征:
(A)长度:352碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:23:GTGACGCCGT GATGGGATTC CTGGGCGGGG CCGGTCCGCT GGCGGTGGTG GATCAGCAAC 60TGGTTACCCG GGTGCCGCAA GGCTGGTCGT TTGCTCAGGC AGCCGCTGTG CCGGTGGTGT 120TCTTGACGGC CTGGTACGGG TTGGCCGATT TAGCCGAGAT CAAGGCGGGC GAATCGGTGC 180TGATCCATGC CGGTACCGGC GGTGTGGGCA TGGCGGCTGT GCAGCTGGCT CGCCAGTGGG 240GCGTGGAGGT TTTCGTCACC GCCAGCCGTG GNAAGTGGGA CACGCTGCGC GCCATNGNGT 300TTGACGACGA NCCATATCGG NGATTCCCNC ACATNCGAAG TTCCGANGGA GA 352(2)SEQ ID NO:24的信息:
(ⅰ)序列特征:
(A)长度:726碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:24:GAAATCCGCG TTCATTCCGT TCGACCAGCG GCTGGCGATA ATCGACGAAG TGATCAAGCC 60GCGGTTCGCG GCGCTCATGG GTCACAGCGA GTAATCAGCA AGTTCTCTGG TATATCGCAC 120CTAGCGTCCA GTTGCTTGCC AGATCGCTTT CGTACCGTCA TCGCATGTAC CGGTTCGCGT 180GCCGCACGCT CATGCTGGCG GCGTGCATCC TGGCCACGGG TGTGGCGGGT CTCGGGGTCG 240GCGCGCAGTC CGCAGCCCAA ACCGCGCCGG TGCCCGACTA CTACTGGTGC CCGGGGCAGC 300CTTTCGACCC CGCATGGGGG CCCAACTGGG ATCCCTACAC CTGCCATGAC GACTTCCACC 360GCGACAGCGA CGGCCCCGAC CACAGCCGCG ACTACCCCGG ACCCATCCTC GAAGGTCCCG 420TGCTTGACGA TCCCGGTGCT GCGCCGCCGC CCCCGGCTGC CGGTGGCGGC GCATAGCGCT 480CGTTGACCGG GCCGCATCAG CGAATACGCG TATAAACCCG GGCGTGCCCC CGGCAAGCTA 540CGACCCCCGG CGGGGCAGAT TTACGCTCCC GTGCCGATGG ATCGCGCCGT CCGATGACAG 600AAAATAGGCG ACGGTTTTGG CAACCGCTTG GAGGACGCTT GAAGGGAACC TGTCATGAAC 660GGCGACAGCG CCTCCACCAT CGACATCGAC AAGGTTGTTA CCCGCACACC CGTTCGCCGG 720ATCGTG 726(2)SEQ ID NO:25的信息:
(ⅰ)序列特征:
(A)长度:580碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:25:CGCGACGACG ACGAACGTCG GGCCCACCAC CGCCTATGCG TTGATGCAGG CGACCGGGAT 60GGTCGCCGAC CATATCCAAG CATGCTGGGT GCCCACTGAG CGACCTTTTG ACCAGCCGGG 120CTGCCCGATG GCGGCCCGGT GAAGTCATTG CGCCGGGGCT TGTGCACCTG ATGAACCCGA 180ATAGGGAACA ATAGGGGGGT GATTTGGCAG TTCAATGTCG GGTATGGCTG GAAATCCAAT 240GGCGGGGCAT GCTCGGCGCC GACCAGGCTC GCGCAGGCGG GCCAGCCCGA ATCTGGAGGG 300AGCACTCAAT GGCGGCGATG AAGCCCCGGA CCGGCGACGG TCCTTTGGAA GCAACTAAGG 360AGGGGCGCGG CATTGTGATG CGAGTACCAC TTGAGGGTGG CGGTCGCCTG GTCGTCGAGC 420TGACACCCGA CGAAGCCGCC GCACTGGGTG ACGAACTCAA AGGCGTTACT AGCTAAGACC 480AGCCCAACGG CGAATGGTCG GCGTTACGCG CACACCTTCC GGTAGATGTC CAGTGTCTGC 540TCGGCGATGT ATGCCCAGGA GAACTCTTGG ATACAGCGCT 580(2)SEQ ID NO:26的信息:
(ⅰ)序列特征:
(A)长度:160碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:26:AACGGAGGCG CCGGGGGTTT TGGCGGGGCC GGGGCGGTCG GCGGCAACGG CGGGGCCGGC 60GGTACCGCCG GGTTGTTCGG TGTCGGCGGG GCCGGTGGGG CCGGAGGCAA CGGCATCGCC 120GGTGTCACGG GTACGTCGGC CAGCACACCG GGTGGATCCG 160(2)SEQ ID NO:27的信息:
(ⅰ)序列特征:
(A)长度:272碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:27:GACACCGATA CGATGGTGAT GTACGCCAAC GTTGTCGACA CGCTCGAGGC GTTCACGATC 60CAGCGCACAC CCGACGGCGT GACCATCGGC GATGCGGCCC CGTTCGCGGA GGCGGCTGCC 120AAGGCGATGG GAATCGACAA GCTGCGGGTA ATTCATACCG GAATGGACCC CGTCGTCGCT 180GAACGCGAAC AGTGGGACGA CGGCAACAAC ACGTTGGCGT TGGCGCCCGG TGTCGTTGTC 240GCCTACGAGC GCAACGTACA GACCAACGCC CG 272(2)SEQ ID NO:28的信息:
(ⅰ)序列特征:
(A)长度:317碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:28:GCAGCCGGTG GTTCTCGGAC TATCTGCGCA CGGTGACGCA GCGCGACGTG CGCGAGCTGA 60AGCGGATCGA GCAGACGGAT CGCCTGCCGC GGTTCATGCG CTACCTGGCC GCTATCACCG 120CGCAGGAGCT GAACGTGGCC GAAGCGGCGC GGGTCATCGG GGTCGACGCG GGGACGATCC 180GTTCGGATCT GGCGTGGTTC GAGACGGTCT ATCTGGTACA TCGCCTGCCC GCCTGGTCGC 240GGAATCTGAC CGCGAAGATC AAGAAGCGGT CAAAGATCCA CGTCGTCGAC AGTGGCTTCG 300CGGCCTGGTT GCGCGGG 317(2)SEQ ID NO:29的信息:
(ⅰ)序列特征:
(A)长度:182碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:29:GATCGTGGAG CTGTCGATGA ACAGCGTTGC CGGACGCGCG GCGGCCAGCA CGTCGGTGTA 60GCAGCGCCGG ACCACCTCGC CGGTGGGCAG CATGGTGATG ACCACGTCGG CCTCGGCCAC 120CGCTTCGGGC GCGCTACGAA ACACCGCGAC ACCGTGCGCG GCGGCGCCGG ACGCCGCCGT 180GG 182(2)SEQ ID NO:30的信息:
(ⅰ)序列特征:
(A)长度:308碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:30:GATCGCGAAG TTTGGTGAGC AGGTGGTCGA CGCGAAAGTC TGGGCGCCTG CGAAGCGGGT 60CGGCGTTCAC GAGGCGAAGA CACGCCTGTC CGAGCTGCTG CGGCTCGTCT ACGGCGGGCA 120GAGGTTGAGA TTGCCCGCCG CGGCGAGCCG GTAGCAAAGC TTGTGCCGCT GCATCCTCAT 180GAGACTCGGC GGTTAGGCAT TGACCATGGC GTGTACCGCG TGCCCGACGA TTTGGACGCT 240CCGTTGTCAG ACGACGTGCT CGAACGCTTT CACCGGTGAA GCGCTACCTC ATCGACACCC 300ACGTTTGG 308(2)SEQ ID NO:31的信息:
(ⅰ)序列特征:
(A)长度:267碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:31:CCGACGACGA GCAACTCACG TGGATGATGG TCGGCAGCGG CATTGAGGAC GGAGAGAATC 60CGGCCGAAGC TGCCGCGCGG CAAGTGCTCA TAGTGACCGG CCGTAGAGGG CTCCCCCGAT 120GGCACCGGAC TATTCTGGTG TGCCGCTGGC CGGTAAGAGC GGGTAAAAGA ATGTGAGGGG 180ACACGATGAG CAATCACACC TACCGAGTGA TCGAGATCGT CGGGACCTCG CCCGACGGCG 240TCGACGCGGC AATCCAGGGC GGTCTGG 267(2)SEQ ID NO:32的信息:
(ⅰ)序列特征:
(A)长度:1539碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:32:CTCGTGCCGA AAGAATGTGA GGGGACACGA TGAGCAATCA CACCTACCGA GTGATCGAGA 60TCGTCGGGAC CTCGCCCGAC GGCGTCGACG CGGCAATCCA GGGCGGTCTG GCCCGAGCTG 120CGCAGACCAT GCGCGCGCTG GACTGGTTCG AAGTACAGTC AATTCGAGGC CACCTGGTCG 180ACGGAGCGGT CGCGCACTTC CAGGTGACTA TGAAAGTCGG CTTCCGCTGG AGGATTCCTG 240AACCTTCAAG CGCGGCCGAT AACTGAGGTG CATCATTAAG CGACTTTTCC AGAACATCCT 300GACGCGCTCG AAACGCGGTT CAGCCGACGG TGGCTCCGCC GAGGCGCTGC CTCCAAAATC 360CCTGCGACAA TTCGTCGGCG GCGCCTACAA GGAAGTCGGT GCTGAATTCG TCGGGTATCT 420GGTCGACCTG TGTGGGCTGC AGCCGGACGA AGCGGTGCTC GACGTCGGCT GCGGCTCGGG 480GCGGATGGCG TTGCCGCTCA CCGGCTATCT GAACAGCGAG GGACGCTACG CCGGCTTCGA 540TATCTCGCAG AAAGCCATCG CGTGGTGCCA GGAGCACATC ACCTCGGCGC ACCCCAACTT 600CCAGTTCGAG GTCTCCGACA TCTACAACTC GCTGTACAAC CCGAAAGGGA AATACCAGTC 660ACTAGACTTT CGCTTTCCAT ATCCGGATGC GTCGTTCGAT GTGGTGTTTC TTACCTCGGT 720GTTCACCCAC ATGTTTCCGC CGGACGTGGA GCACTATCTG GACGAGATCT CCCGCGTGCT 780GAAGCCCGGC GGACGATGCC TGTGCACGTA CTTCTTGCTC AATGACGAGT CGTTAGCCCA 840CATCGCGGAA GGAAAGAGTG CGCACAACTT CCAGCATGAG GGACCGGGTT ATCGGACAAT 900CCACAAGAAG CGGCCCGAAG AAGCAATCGG CTTGCCGGAG ACCTTCGTCA GGGATGTCTA 960TGGCAAGTTC GGCCTCGCCG TGCACGAACC ATTGCACTAC GGCTCATGGA GTGGCCGGGA 1020ACCACGCCTA AGCTTCCAGG ACATCGTCAT CGCGACCAAA ACCGCGAGCT AGGTCGGCAT 1080CCGGGAAGCA TCGCGACACC GTGGCGCCGA GCGCCGCTGC CGGCAGGCCG ATTAGGCGGG 1140CAGATTAGCC CGCCGCGGCT CCCGGCTCCG AGTACGGCGC CCCGAATGGC GTCACCGGCT 1200GGTAACCACG CTTGCGCGCC TGGGCGGCGG CCTGCCGGAT CAGGTGGTAG ATGCCGACAA 1260AGCCTGCGTG ATCGGTCATC ACCAACGGTG ACAGCAGCCG GTTGTGCACC AGCGCGAACG 1320CCACCCCGGT CTCCGGGTCT GTCCAGCCGA TCGAGCCGCC CAAGCCCACA TGACCAAACC 1380CCGGCATCAC GTTGCCGATC GGCATACCGT GATAGCCAAG ATGAAAATTT AAGGGCACCA 1440ATAGATTTCG ATCCGGCAGA ACTTGCCGTC GGTTGCGGGT CAGGCCCGTG ACCAGCTCCC 1500GCGACAAGAA CCGTATGCCG TCGATCTCGC CTCGTGCCG 1539(2)SEQ ID NO:33的信息:
(ⅰ)序列特征:
(A)长度:851碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:33:CTGCAGGGTG GCGTGGATGA GCGTCACCGC GGGGCAGGCC GAGCTGACCG CCGCCCAGGT 60CCGGGTTGCT GCGGCGGCCT ACGAGACGGC GTATGGGCTG ACGGTGCCCC CGCCGGTGAT 120CGCCGAGAAC CGTGCTGAAC TGATGATTCT GATAGCGACC AACCTCTTGG GGCAAAACAC 180CCCGGCGATC GCGGTCAACG AGGCCGAATA CGGCGAGATG TGGGCCCAAG ACGCCGCCGC 240GATGTTTGGC TACGCCGCGG CGACGGCGAC GGCGACGGCG ACGTTGCTGC CGTTCGAGGA 300GGCGCCGGAG ATGACCAGCG CGGGTGGGCT CCTCGAGCAG GCCGCCGCGG TCGAGGAGGC 360CTCCGACACC GCCGCGGCGA ACCAGTTGAT GAACAATGTG CCCCAGGCGC TGAAACAGTT 420GGCCCAGCCC ACGCAGGGCA CCACGCCTTC TTCCAAGCTG GGTGGCCTGT GGAAGACGGT 480CTCGCCGCAT CGGTCGCCGA TCAGCAACAT GGTGTCGATG GCCAACAACC ACATGTCGAT 540GACCAACTCG GGTGTGTCGA TGACCAACAC CTTGAGCTCG ATGTTGAAGG GCTTTGCTCC 600GGCGGCGGCC GCCCAGGCCG TGCAAACCGC GGCGCAAAAC GGGGTCCGGG CGATGAGCTC 660GCTGGGCAGC TCGCTGGGTT CTTCGGGTCT GGGCGGTGGG GTGGCCGCCA ACTTGGGTCG 720GGCGGCCTCG GTACGGTATG GTCACCGGGA TGGCGGAAAA TATGCANAGT CTGGTCGGCG 780GAACGGTGGT CCGGCGTAAG GTTTACCCCC GTTTTCTGGA TGCGGTGAAC TTCGTCAACG 840GAAACAGTTA C 851(2)SEQ ID NO:34的信息:
(ⅰ)序列特征:
(A)长度:254碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:34:GATCGATCGG GCGGAAATTT GGACCAGATT CGCCTCCGGC GATAACCCAA TCAATCGAAC 60CTAGATTTAT TCCGTCCAGG GGCCCGAGTA ATGGCTCGCA GGAGAGGAAC CTTACTGCTG 120CGGGCACCTG TCGTAGGTCC TCGATACGGC GGAAGGCGTC GACATTTTCC ACCGACACCC 180CCATCCAAAC GTTCGAGGGC CACTCCAGCT TGTGAGCGAG GCGACGCAGT CGCAGGCTGC 240GCTTGGTCAA GATC 254(2)SEQ ID NO:35的信息:
(ⅰ)序列特征:
(A)长度:1227碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:35:GATCCTGACC GAAGCGGCCG CCGCCAAGGC GAAGTCGCTG TTGGACCAGG AGGGACGGGA 60CGATCTGGCG CTGCGGATCG CGGTTCAGCC GGGGGGGTGC GCTGGATTGC GCTATAACCT 120TTTCTTCGAC GACCGGACGC TGGATGGTGA CCAAACCGCG GAGTTCGGTG GTGTCAGGTT 180GATCGTGGAC CGGATGAGCG CGCCGTATGT GGAAGGCGCG TCGATCGATT TCGTCGACAC 240TATTGAGAAG CAAGGTTCAC CATCGACAAT CCCAACGCCA CCGGCTCCTG CGCGTGCGGG 300GATTCGTTCA ACTGATAAAA CGCTAGTACG ACCCCGCGGT GCGCAACACG TACGAGCACA 360CCAAGACCTG ACCGCGCTGG AAAAGCAACT GAGCGATGCC TTGCACCTGA CCGCGTGGCG 420GGCCGCCGGC GGCAGGTGTC ACCTGCATGG TGAACAGCAC CTGGGCCTGA TATTGCGACC 480AGTACACGAT TTTGTCGATC GAGGTCACTT CGACCTGGGA GAACTGCTTG CGGAACGCGT 540CGCTGCTCAG CTTGGCCAAG GCCTGATCGG AGCGCTTGTC GCGCACGCCG TCGTGGATAC 600CGCACAGCGC ATTGCGAACG ATGGTGTCCA CATCGCGGTT CTCCAGCGCG TTGAGGTATC 660CCTGAATCGC GGTTTTGGCC GGTCCCTCCG AGAATGTGCC TGCCGTGTTG GCTCCGTTGG 720TGCGGACCCC GTATATGATC GCCGCCGTCA TAGCCGACAC CAGCGCGAGG GCTACCACAA 780TGCCGATCAG CAGCCGCTTG TGCCGTCGCT TCGGGTAGGA CACCTGCGGC GGCACGCCGG 840GATATGCGGC GGGCGGCAGC GCCGCGTCGT CTGCCGGTCC CGGGGCGAAG GCCGGTTCGG 900CGGCGCCGAG GTCGTGGGGG TAGTCCAGGG CTTGGGGTTC GTGGGATGAG GGCTCGGGGT 960ACGGCGCCGG TCCGTTGGTG CCGACACCGG GGTTCGGCGA GTGGGGACCG GGCATTGTGG 1020TTCTCCTAGG GTGGTGGACG GGACCAGCTG CTAGGGCGAC AACCGCCCGT CGCGTCAGCC 1080GGCAGCATCG GCAATCAGGT GAGCTCCCTA GGCAGGCTAG CGCAACAGCT GCCGTCAGCT 1140CTCAACGCGA CGGGGCGGGC CGCGGCGCCG ATAATGTTGA AAGACTAGGC AACCTTAGGA 1200ACGAAGGACG GAGATTTTGT GACGATC 1227(2)SEQ ID NO:36的信息:
(ⅰ)序列特征:
(A)长度:181碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:36:GCGGTGTCGG CGGATCCGGC GGGTGGTTGA ACGGCAACGG CGGGGCCGGC GGGGCCGGCG 60GGACCGGCGC TAACGGTGGT GCCGGCGGCA ACGCCTGGTT GTTCGGGGCC GGCGGGTCCG 120GCGGNGCCGG CACCAATGGT GGNGTCGGCG GGTCCGGCGG ATTTGTCTAC GGCAACGGCG 180G 181(2)SEQ ID NO:37的信息:
(ⅰ)序列特征:
(A)长度:290碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:37:GCGGTGTCGG CGGATCCGGC GGGTGGTTGA ACGGCAACGG CGGTGTCGGC GGCCGGGGCG 60GCGACGGCGT CTTTGCCGGT GCCGGCGGCC AGGGCGGCCT CGGTGGGCAG GGCGGCAATG 120GCGGCGGCTC CACCGGCGGC AACGGCGGTC TTGGCGGCGC GGGCGGTGGC GGAGGCAACG 180CCCCGGACGG CGGCTTCGGT GGCAACGGCG GTAAGGGTGG CCAGGGCGGN ATTGGCGGCG 240GCACTCAGAG CGCGACCGGC CTCGGNGGTG ACGGCGGTGA CGGCGGTGAC 290(2)SEQ ID NO:38的信息:
(ⅰ)序列特征:
(A)长度:34碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:38:GATCCAGTGG CATGGNGGGT GTCAGTGGAA GCAT 34(2)SEQ ID NO:39的信息:
(ⅰ)序列特征:
(A)长度:155碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:39:GATCGCTGCT CGTCCCCCCC TTGCCGCCGA CGCCACCGGT CCCACCGTTA CCGAACAAGC 60TGGCGTGGTC GCCAGCACCC CCGGCACCGC CGACGCCGGA GTCGAACAAT GGCACCGTCG 120TATCCCCACC ATTGCCGCCG GNCCCACCGG CACCG 155(2)SEQ ID NO:40的信息:
(ⅰ)序列特征:
(A)长度:53碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:40:ATGGCGTTCA CGGGGCGCCG GGGACCGGGC AGCCCGGNGG GGCCGGGGGG TGG 53(2)SEQ ID NO:41的信息:
(ⅰ)序列特征:
(A)长度:132碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:41:GATCCACCGC GGGTGCAGAC GGTGCCCGCG GCGCCACCCC GACCAGCGGC GGCAACGGCG 60GCACCGGCGG CAACGGCGCG AACGCCACCG TCGTCGGNGG GGCCGGCGGG GCCGGCGGCA 120AGGGCGGCAA CG 132(2)SEQ ID NO:42的信息:
(ⅰ)序列特征:
(A)长度:132碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:42:GATCGGCGGC CGGNACGGNC GGGGACGGCG GCAAGGGCGG NAACGGGGGC GCCGNAGCCA 60CCNGCCAAGA ATCCTCCGNG TCCNCCAATG GCGCGAATGG CGGACAGGGC GGCAACGGCG 120GCANCGGCGG CA 132(2)SEQ ID NO:43的信息:
(ⅰ)序列特征:
(A)长度:702碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:43:CGGCACGAGG ATCGGTACCC CGCGGCATCG GCAGCTGCCG ATTCGCCGGG TTTCCCCACC 60CGAGGAAAGC CGCTACCAGA TGGCGCTGCC GAAGTAGGGC GATCCGTTCG CGATGCCGGC 120ATGAACGGGC GGCATCAAAT TAGTGCAGGA ACCTTTCAGT TTAGCGACGA TAATGGCTAT 180AGCACTAAGG AGGATGATCC GATATGACGC AGTCGCAGAC CGTGACGGTG GATCAGCAAG 240AGATTTTGAA CAGGGCCAAC GAGGTGGAGG CCCCGATGGC GGACCCACCG ACTGATGTCC 300CCATCACACC GTGCGAACTC ACGGNGGNTA AAAACGCCGC CCAACAGNTG GTNTTGTCCG 360CCGACAACAT GCGGGAATAC CTGGCGGCCG GTGCCAAAGA GCGGCAGCGT CTGGCGACCT 420CGCTGCGCAA CGCGGCCAAG GNGTATGGCG AGGTTGATGA GGAGGCTGCG ACCGCGCTGG 480ACAACGACGG CGAAGGAACT GTGCAGGCAG AATCGGCCGG GGCCGTCGGA GGGGACAGTT 540CGGCCGAACT AACCGATACG CCGAGGGTGG CCACGGCCGG TGAACCCAAC TTCATGGATC 600TCAAAGAAGC GGCAAGGAAG CTCGAAACGG GCGACCAAGG CGCATCGCTC GCGCACTGNG 660GGGATGGGTG GAACACTTNC ACCCTGACGC TGCAAGGCGA CG 702(2)SEQ ID NO:44的信息:
(ⅰ)序列特征:
(A)长度:298碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:44:GAAGCCGCAG CGCTGTCGGG CGACGTGGCG GTCAAAGCGG CATCGCTCGG TGGCGGTGGA 60GGCGGCGGGG TGCCGTCGGC GCCGTTGGGA TCCGCGATCG GGGGCGCCGA ATCGGTGCGG 120CCCGCTGGCG CTGGTGACAT TGCCGGCTTA GGCCAGGGAA GGGCCGGCGG CGGCGCCGCG 180CTGGGCGGCG GTGGCATGGG AATGCCGATG GGTGCCGCGC ATCAGGGACA AGGGGGCGCC 240AAGTCCAAGG GTTCTCAGCA GGAAGACGAG GCGCTCTACA CCGAGGATCC TCGTGCCG 298(2)SEQ ID NO:45的信息:
(ⅰ)序列特征:
(A)长度:1058碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:45:CGGCACGAGG ATCGAATCGC GTCGCCGGGA GCACAGCGTC GCACTGCACC AGTGGAGGAG 60CCATGACCTA CTCGCCGGGT AACCCCGGAT ACCCGCAAGC GCAGCCCGCA GGCTCCTACG 120GAGGCGTCAC ACCCTCGTTC GCCCACGCCG ATGAGGGTGC GAGCAAGCTA CCGATGTACC 180TGAACATCGC GGTGGCAGTG CTCGGTCTGG CTGCGTACTT CGCCAGCTTC GGCCCAATGT 240TCACCCTCAG TACCGAACTC GGGGGGGGTG ATGGCGCAGT GTCCGGTGAC ACTGGGCTGC 300CGGTCGGGGT GGCTCTGCTG GCTGCGCTGC TTGCCGGGGT GGTTCTGGTG CCTAAGGCCA 360AGAGCCATGT GACGGTAGTT GCGGTGCTCG GGGTACTCGG CGTATTTCTG ATGGTCTCGG 420CGACGTTTAA CAAGCCCAGC GCCTATTCGA CCGGTTGGGC ATTGTGGGTT GTGTTGGCTT 480TCATCGTGTT CCAGGCGGTT GCGGCAGTCC TGGCGCTCTT GGTGGAGACC GGCGCTATCA 540CCGCGCCGGC GCCGCGGCCC AAGTTCGACC CGTATGGACA GTACGGGCGG TACGGGCAGT 600ACGGGCAGTA CGGGGTGCAG CCGGGTGGGT ACTACGGTCA GCAGGGTGCT CAGCAGGCCG 660CGGGACTGCA GTCGCCCGGC CCGCAGCAGT CTCCGCAGCC TCCCGGATAT GGGTCGCAGT 720ACGGCGGCTA TTCGTCCAGT CCGAGCCAAT CGGGCAGTGG ATACACTGCT CAGCCCCCGG 780CCCAGCCGCC GGCGCAGTCC GGGTCGCAAC AATCGCACCA GGGCCCATCC ACGCCACCTA 840CCGGCTTTCC GAGCTTCAGC CCACCACCAC CGGTCAGTGC CGGGACGGGG TCGCAGGCTG 900GTTCGGCTCC AGTCAACTAT TCAAACCCCA GCGGGGGCGA GCAGTCGTCG TCCCCCGGGG 960GGGCGCCGGT CTAACCGGGC GTTCCCGCGT CCGGTCGCGC GTGTGCGCGA AGAGTGAACA 1020GGGTGTCAGC AAGCGCGGAC GATCCTCGTG CCGAATTC 1058(2)SEQ ID NO:46的信息:
(ⅰ)序列特征:
(A)长度:327碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:46:CGGCACGAGA GACCGATGCC GCTACCCTCG CGCAGGAGGC AGGTAATTTC GAGCGGATCT 60CCGGCGACCT GAAAACCCAG ATCGACCAGG TGGAGTCGAC GGCAGGTTCG TTGCAGGGCC 120AGTGGCGCGG CGCGGCGGGG ACGGCCGCCC AGGCCGCGGT GGTGCGCTTC CAAGAAGCAG 180CCAATAAGCA GAAGCAGGAA CTCGACGAGA TCTCGACGAA TATTCGTCAG GCCGGCGTCC 240AATACTCGAG GGCCGACGAG GAGCAGCAGC AGGCGCTGTC CTCGCAAATG GGCTTCTGAC 300CCGCTAATAC GAAAAGAAAC GGAGCAA 327(2)SEQ ID NO:47的信息:
(ⅰ)序列特征:
(A)长度:170碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:47:CGGTCGCGAT GATGGCGTTG TCGAACGTGA CCGATTCTGT ACCGCCGTCG TTGAGATCAA 60CCAACAACGT GTTGGCGTCG GCAAATGTGC CGNACCCGTG GATCTCGGTG ATCTTGTTCT 120TCTTCATCAG GAAGTGCACA CCGGCCACCC TGCCCTCGGN TACCTTTCGG 170(2)SEQ ID NO:48的信息:
(ⅰ)序列特征:
(A)长度:127碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:48:GATCCGGCGG CACGGGGGGT GCCGGCGGCA GCACCGCTGG CGCTGGCGGC AACGGCGGGG 60CCGGGGGTGG CGGCGGAACC GGTGGGTTGC TCTTCGGCAA CGGCGGTGCC GGCGGGCACG 120GGGCCGT 127(2)SEQ ID NO:49的信息:
(ⅰ)序列特征:
(A)长度:81碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:49:CGGCGGCAAG GGCGGCACCG CCGGCAACGG GAGCGGCGCG GCCGGCGGCA ACGGCGGCAA 60CGGCGGCTCC GGCCTCAACG G 81(2)SEQ ID NO:50的信息:
(ⅰ)序列特征:
(A)长度:149碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:50:GATCAGGGCT GGCCGGCTCC GGCCAGAAGG GCGGTAACGG AGGAGCTGCC GGATTGTTTG 60GCAACGGCGG GGCCGGNGGT GCCGGCGCGT CCAACCAAGC CGGTAACGGC GGNGCCGGCG 120GAAACGGTGG TGCCGGTGGG CTGATCTGG 149(2)SEQ ID NO:51的信息:
(ⅰ)序列特征:
(A)长度:355碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:51:CGGCACGAGA TCACACCTAC CGAGTGATCG AGATCGTCGG GACCTCGCCC GACGGTGTCG 60ACGCGGNAAT CCAGGGCGGT CTGGCCCGAG CTGCGCAGAC CATGCGCGCG CTGGACTGGT 120TCGAAGTACA GTCAATTCGA GGCCACCTGG TCGACGGAGC GGTCGCGCAC TTCCAGGTGA 180CTATGAAAGT CGGCTTCCGC CTGGAGGATT CCTGAACCTT CAAGCGCGGC CGATAACTGA 240GGTGCATCAT TAAGCGACTT TTCCAGAACA TCCTGACGCG CTCGAAACGC GGTTCAGCCG 300ACGGTGGCTC CGCCGAGGCG CTGCCTCCAA AATCCCTGCG ACAATTCGTC GGCGG 355(2)SEQ ID NO:52的信息:
(ⅰ)序列特征:
(A)长度:999碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:52:ATGCATCACC ATCACCATCA CATGCATCAG GTGGACCCCA ACTTGACACG TCGCAAGGGA 60CGATTGGCGG CACTGGCTAT CGCGGCGATG GCCAGCGCCA GCCTGGTGAC CGTTGCGGTG 120CCCGCGACCG CCAACGCCGA TCCGGAGCCA GCGCCCCCGG TACCCACAAC GGCCGCCTCG 180CCGCCGTCGA CCGCTGCAGC GCCACCCGCA CCGGCGACAC CTGTTGCCCC CCCACCACCG 240GCCGCCGCCA ACACGCCGAA TGCCCAGCCG GGCGATCCCA ACGCAGCACC TCCGCCGGCC 300GACCCGAACG CACCGCCGCC ACCTGTCATT GCCCCAAACG CACCCCAACC TGTCCGGATC 360GACAACCCGG TTGGAGGATT CAGCTTCGCG CTGCCTGCTG GCTGGGTGGA GTCTGACGCC 420GCCCACTTCG ACTACGGTTC AGCACTCCTC AGCAAAACCA CCGGGGACCC GCCATTTCCC 480GGACAGCCGC CGCCGGTGGC CAATGACACC CGTATCGTGC TCGGCCGGCT AGACCAAAAG 540CTTTACGCCA GCGCCGAAGC CACCGACTCC AAGGCCGCGG CCCGGTTGGG CTCGGACATG 600GGTGAGTTCT ATATGCCCTA CCCGGGCACC CGGATCAACC AGGAAACCGT CTCGCTCGAC 660GCCAACGGGG TGTCTGGAAG CGCGTCGTAT TACGAAGTCA AGTTCAGCGA TCCGAGTAAG 720CCGAACGGCC AGATCTGGAC GGGCGTAATC GGCTCGCCCG CGGCGAACGC ACCGGACGCC 780GGGCCCCCTC AGCGCTGGTT TGTGGTATGG CTCGGGACCG CCAACAACCC GGTGGACAAG 840GGCGCGGCCA AGGCGCTGGC CGAATCGATC CGGCCTTTGG TCGCCCCGCC GCCGGCGCCG 900GCACCGGCTC CTGCAGAGCC CGCTCCGGCG CCGGCGCCGG CCGGGGAAGT CGCTCCTACC 960CCGACGACAC CGACACCGCA GCGGACCTTA CCGGCCTGA 999(2)SEQ ID NO:53的信息:
(ⅰ)序列特征:
(A)长度:332氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:53:Met His His His His His His Met His Gln Val Asp Pro Asn Leu Thr1 5 10 15Arg Arg Lys Gly Arg Leu Ala Ala Leu Ala Ile Ala Ala Met Ala Ser
20 25 30Ala Ser Leu Val Thr Val Ala Val Pro Ala Thr Ala Asn Ala Asp Pro
35 40 45Glu Pro Ala Pro Pro Val Pro Thr Thr Ala Ala Ser Pro Pro Ser Thr
50 55 60Ala Ala Ala Pro Pro Ala Pro Ala Thr Pro Val Ala Pro Pro Pro Pro65 70 75 80Ala Ala Ala Asn Thr Pro Asn Ala Gln Pro Gly Asp Pro Asn Ala Ala
85 90 95Pro Pro Pro Ala Asp Pro Asn Ala Pro Pro Pro Pro Val Ile Ala Pro
100 105 110Asn Ala Pro Gln Pro Val Arg Ile Asp Asn Pro Val Gly Gly Phe Ser
115 120 125Phe Ala Leu Pro Ala Gly Trp Val Glu Ser Asp Ala Ala His Phe Asp
130 135 140Tyr Gly Ser Ala Leu Leu Ser Lys Thr Thr Gly Asp Pro Pro Phe Pro145 150 155 160Gly Gln Pro Pro Pro Val Ala Asn Asp Thr Arg Ile Val Leu Gly Arg
165 170 175Leu Asp Gln Lys Leu Tyr Ala Ser Ala Glu Ala Thr Asp Ser Lys Ala
180 185 190Ala Ala Arg Leu Gly Ser Asp Met Gly Glu Phe Tyr Met Pro Tyr Pro
195 200 205Gly Thr Arg Ile Asn Gln Glu Thr Val Ser Leu Asp Ala Asn Gly Val
210 215 220Ser Gly Ser Ala Ser Tyr Tyr Glu Val Lys Phe Ser Asp Pro Ser Lys225 230 235 240Pro Asn Gly Gln Ile Trp Thr Gly Val Ile Gly Ser Pro Ala Ala Asn
245 250 255Ala Pro Asp Ala Gly Pro Pro Gln Arg Trp Phe Val Val Trp Leu Gly
260 265 270Thr Ala Asn Asn Pro Val Asp Lys Gly Ala Ala Lys Ala Leu Ala Glu
275 280 285Ser Ile Arg Pro Leu Val Ala Pro Pro Pro Ala Pro Ala Pro Ala Pro
290 295 300Ala Glu Pro Ala Pro Ala Pro Ala Pro Ala Gly Glu Val Ala Pro Thr305 310 315 320Pro Thr Thr Pro Thr Pro Gln Arg Thr Leu Pro Ala
325 330(2)SEQ ID NO:54的信息:
(ⅰ)序列特征:
(A)长度:20氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:54:Asp Pro Val Asp Ala Val Ile Asn Thr Thr Xaa Asn Tyr Gly Gln Val1 5 10 15Val Ala Ala Leu
20(2)SEQ ID NO:55的信息:
(ⅰ)序列特征:
(A)长度:15氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:55:
Ala Val Glu Ser Gly Met Leu Ala Leu Gly Thr Pro Ala Pro Ser
1 5 10 15(2)SEQ ID NO:56的信息:
(ⅰ)序列特征:
(A)长度:19氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:56:
Ala Ala Met Lys Pro Arg Thr Gly Asp Gly Pro Leu Glu Ala Ala Lys
1 5 10 15
Glu Gly Arg(2)SEQ ID NO:57的信息:
(ⅰ)序列特征:
(A)长度:15氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:57:
Tyr Tyr Trp Cys Pro Gly Gln Pro Phe Asp Pro Ala Trp Gly Pro
1 5 10 15(2)SEQ ID NO:58的信息:
(ⅰ)序列特征:
(A)长度:14氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:58:
Asp Ile Gly Ser Glu Ser Thr Glu Asp Gln Gln Xaa Ala Val
1 5 10(2)SEQ ID NO:59的信息:
(ⅰ)序列特征:
(A)长度:13氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:59:
Ala Glu Glu Ser Ile Ser Thr Xaa Glu Xaa Ile Val Pro
1 5 10(2)SEQ ID NO:60的信息:
(ⅰ)序列特征:
(A)长度:17氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:60:
Asp Pro Glu Pro Ala Pro Pro Val Pro Thr Ala Ala Ala Ma Pro Pro
1 5 10 15
Ala(2)SEQ ID NO:61的信息:
(ⅰ)序列特征:
(A)长度:15氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:61:
Ala Pro Lys Thr Tyr Xaa Glu Glu Leu Lys Gly Thr Asp Thr Gly
1 5 10 15(2)SEQ ID NO:62的信息:
(ⅰ)序列特征:
(A)长度:30氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:62:
Asp Pro Ala Ser Ala Pro Asp Val Pro Thr Ala Ala Gln Gin Thr Ser
1 5 10 15
Leu Leu Asn Asn Leu Ala Asp Pro Asp Val Ser Phe Ala Asp
20 25 30(2)SEQ ID NO:63的信息:
(ⅰ)序列特征:
(A)长度:24氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:63:
Gly Cys Gly Asp Arg Ser Gly Gly Asn Leu Asp Gin Ile Arg Leu Arg
1 5 10 15
Arg Asp Arg Ser Gly Gly Asn Leu
20(2)SEQ ID NO:64的信息:
(ⅰ)序列特征:
(A)长度:187氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:64:
Thr Gly Ser Leu Asn Gln Thr His Asn Arg Arg Ala Asn Glu Arg Lys
1 5 10 15
Asn Thr Thr Met Lys Met Val Lys Ser Ile Ala Ala Gly Leu Thr Ala
20 25 30
Ala Ala Ala Ile Gly Ala Ala Ala Ala Gly Val Thr Ser Ile Met Ala
35 40 45
Gly Gly Pro Val Val Tyr Gln Met Gln Pro Val Val Phe Gly Ala Pro
50 55 60
Leu Pro Leu Asp Pro Ala Ser Ala Pro Asp Val Pro Thr Ala Ala Gln
65 70 75 80
Leu Thr Ser Leu Leu Asn Ser Leu Ala Asp Pro Asn Val Ser Phe Ala
85 90 95Asn Lys Gly Ser Leu Val Glu Gly Gly Ile Gly Gly Thr Glu Alg Arg
100 105 110Ile Ala Asp His Lys Leu Lys Lys Ala Ala Glu His Gly Asp Leu Pro
115 120 125Leu Ser Phe Ser Val Thr Asn Ile Gln Pro Ala Ala Ala Gly Ser Ala
130 135 140Thr Ala Asp Val Ser Val Ser Gly Pro Lys Leu Ser Ser Pro Val Thr145 150 155 160Gln Asn Val Thr Phe Val Asn Gln Gly Gly Trp Met Leu Ser Arg Ala
165 170 175Ser Ala Met Glu Leu Leu Gln Ala Ala Gly Xaa
180 185(2)SEQ ID NO:65的信息:
(ⅰ)序列特征:
(A)长度:148氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:65:Asp Glu Val Thr Val Glu Thr Thr Ser Val Phe Arg Ala Asp Phe Leu1 5 10 15Ser Glu Leu Asp Ala Pro Ala Gln Ala Gly Thr Glu Ser Ala Val Ser
20 25 30Gly Val Glu Gly Leu Pro Pro Gly Ser Ala Leu Leu Val Val Lys Arg
35 40 45Gly Pro Asn Ala Gly Ser Arg Phe Leu Leu Asp Gln Ala Ile Thr Ser
50 55 60Ala Gly Arg His Pro Asp Ser Asp Ile Phe Leu Asp Asp Val Thr Val65 70 75 80Ser Arg Arg His Ala Glu Phe Arg Leu Glu Asn Asn Glu Phe Asn Val
85 90 95Val Asp Val Gly Ser Leu Asn Gly Thr Tyr Val Asn Arg Glu Pro Val
100 105 110Asp Ser Ala Val Leu Ala Asn Gly Asp Glu Val Gln Ile Gly Lys Leu
115 120 125Arg Leu Val Phe Leu Thr Gly Pro Lys Gln Gly Glu Asp Asp Gly Ser
130 135 140Thr Gly Gly Pro145(2)SEQ ID NO:66的信息:
(ⅰ)序列特征:
(A)长度:230氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:66:Thr Ser Asn Arg Pro Ala Arg Arg Gly Arg Arg Ala Pro Arg Asp Thr1 5 10 15Gly Pro Asp Arg Ser Ala Ser Leu Ser Leu Val Arg His Arg Arg Gln
20 25 30Gln Arg Asp Ala Leu Cys Leu Ser Ser Thr Gln Ile Ser Arg Gln Ser
35 40 45Asn Leu Pro Pro Ala Ala Gly Gly Ala Ala Asn Tyr Ser Arg Arg Asn
50 55 60Phe Asp Val Arg Ile Lys Ile Phe Met Leu Val Thr Ala Val Val Leu65 70 75 80Leu Cys Cys Ser Gly Val Ala Thr Ala Ala Pro Lys Thr Tyr Cys Glu
85 90 95Glu Leu Lys Gly Thr Asp Thr Gly Gln Ala Cys Gln Ile Gln Met Ser
100 105 110Asp Pro Ala Tyr Asn Ile Asn Ile Ser Leu Pro Ser Tyr Tyr Pro Asp
115 120 125Gln Lys Ser Leu Glu Asn Tyr Ile Ala Gln Thr Arg Asp Lys Phe Leu
130 135 140Ser Ala Ala Thr Ser Ser Thr Pro Arg Glu Ala Pro Tyr Glu Leu Asn145 150 155 160Ile Thr Ser Ala Thr Tyr Gln Ser Ala Ile Pro Pro Arg Gly Thr Gln
165 170 175Ala Val Val Leu Xaa Val Tyr His Asn Ala Gly Gly Thr His Pro Thr
180 185 190Thr Thr Tyr Lys Ala Phe Asp Trp Asp Gln Ala Tyr Arg Lys Pro Ile
195 200 205Thr Tyr Asp Thr Leu Trp Gln Ala Asp Thr Asp Pro Leu Pro Val Val
210 215 220Phe Pro Ile Val Ala Arg225 230(2)SEQ ID NO:67的信息:
(ⅰ)序列特征:
(A)长度:132氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:67:Thr Ala Ala Ser Asp Asn Phe Gln Leu Ser Gln Gly Gly Gln Gly Phe1 5 10 15Ala Ile Pro Ile Gly Gln Ala Met Ala Ile Ala Gly Gln Ile Arg Ser
20 25 30Gly Gly Gly Ser Pro Thr Val His Ile Gly Pro Thr Ala Phe Leu Gly
35 40 45Leu Gly Val Val Asp Asn Asn Gly Asn Gly Ala Arg Val Gln Arg Val
50 55 60Val Gly Ser Ala Pro Ala Ala Ser Leu Gly Ile Ser Thr Gly Asp Val65 70 75 80Ile Thr Ala Val Asp Gly Ala Pro Ile Asn Ser Ala Thr Ala Met Ala
85 90 95Asp Ala Leu Asn Gly His His Pro Gly Asp Val Ile Ser Val Asn Trp
100 105 110Gln Thr Lys Ser Gly Gly Thr Arg Thr Gly Asn Val Thr Leu Ala Glu
115 120 125Gly Pro Pro Ala
130(2)SEQ ID NO:68的信息:
(ⅰ)序列特征:
(A)长度:100氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:68:Val Pro Leu Arg Ser Pro Ser Met Ser Pro Ser Lys Cys Leu Ala Ala1 5 10 15Ala Gln Arg Asn Pro Val Ile Arg Arg Arg Arg Leu Ser Asn Pro Pro
20 25 30Pro Arg Lys Tyr Arg Ser Met Pro Ser Pro Ala Thr Ala Ser Ala Gly
35 40 45Met Ala Arg Val Arg Arg Arg Ala Ile Trp Arg Gly Pro Ala Thr Xaa
50 55 60Ser Ala Gly Met Ala Arg Val Arg Arg Trp Xaa Val Met Pro Xaa Val65 70 75 80Ile Gln Ser Thr Xaa Ile Arg Xaa Xaa Gly Pro Phe Asp Asn Arg Gly
85 90 95Ser Glu Arg Lys
100(2)SEQ ID NO:69的信息:
(ⅰ)序列特征:
(A)长度:163氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:69:Met Thr Asp Asp Ile Leu Leu Ile Asp Thr Asp Glu Arg Val Arg Thr1 5 10 15Leu Thr Leu Asn Arg Pro Gln Ser Arg Asn Ala Leu Ser Ala Ala Leu
20 25 30Arg Asp Arg Phe Phe Ala Xaa Leu Xaa Asp Ala Glu Xaa Asp Asp Asp
35 40 45Ile Asp Val Val Ile Leu Thr Gly Ala Asp Pro Val Phe Cys Ala Gly
50 55 60Leu Asp Leu Lys Val Ala Gly Arg Ala Asp Arg Ala Ala Gly His Leu65 70 75 80Thr Ala Val Gly Gly His Asp Gln Ala Gly Asp Arg Arg Asp Gln Arg
85 90 95Arg Arg Gly His Arg Arg Ala Arg Thr Gly Ala Val Leu Arg His Pro
100 105 110Asp Arg Leu Arg Ala Arg Pro Leu Arg Arg His Pro Arg Pro Gly Gly
115 120 125Ala Ala Ala His Leu Gly Thr Gln Cys Val Leu Ala Ala Lys Gly Arg
130 135 140His Arg Xaa Gly Pro Val Asp Glu Pro Asp Arg Arg Leu Pro Val Arg145 150 155 160Asp Arg Arg(2)SEQ ID NO:70的信息:
(ⅰ)序列特征:
(A)长度:344氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:70:Met Lys Phe Val Asn His Ile Glu Pro Val Ala Pro Arg Arg Ala Gly1 5 10 15Gly Ala Val Ala Glu Val Tyr Ala Glu Ala Arg Arg Glu Phe Gly Arg
20 25 30Leu Pro Glu Pro Leu Ala Met Leu Ser Pro Asp Glu Gly Leu Leu Thr
35 40 45Ala Gly Trp Ala Thr Leu Arg Glu Thr Leu Leu Val Gly Gln Val Pro
50 55 60Arg Gly Arg Lys Glu Ala Val Ala Ala Ala Val Ala Ala Ser Leu Arg65 70 75 80Cys Pro Trp Cys Val Asp Ala His Thr Thr Met Leu Tyr Ala Ala Gly
85 90 95Gln Thr Asp Thr Ala Ala Ala Ile Leu Ala Gly Thr Ala Pro Ala Ala
100 105 110Gly Asp Pro Asn Ala Pro Tyr Val Ala Trp Ala Ala Gly Thr Gly Thr
115 120 125Pro Ala Gly Pro Pro Ala Pro Phe Gly Pro Asp Val Ala Ala Glu Tyr
130 135 140Leu Gly Thr Ala Val Gln Phe His Phe Ile Ala Arg Leu Val Leu Val145 150 155 160Leu Leu Asp Glu Thr Phe Leu Pro Gly Gly Pro Arg Ala Gln Gln Leu
165 170 175Met Arg Arg Ala Gly Gly Leu Val Phe Ala Arg Lys Val Arg Ala Glu
180 185 190His Arg Pro Gly Arg Ser Thr Arg Arg Leu Glu Pro Arg Thr Leu Pro
195 200 205Asp Asp Leu Ala Trp Ala Thr Pro Ser Glu Pro Ile Ala Thr Ala Phe
210 215 220Ala Ala Leu Ser His His Leu Asp Thr Ala Pro His Leu Pro Pro Pro225 230 235 240Thr Arg Gln Val Val Arg Arg Val Val Gly Ser Trp His Gly Glu Pro
245 250 255Met Pro Met Ser Ser Arg Trp Thr Asn Glu His Thr Ala Glu Leu Pro
260 265 270Ala Asp Leu His Ala Pro Thr Arg Leu Ala Leu Leu Thr Gly Leu Ala
275 280 285Pro His Gln Val Thr Asp Asp Asp Val Ala Ala Ala Arg Ser Leu Leu
290 295 300Asp Thr Asp Ala Ala Leu Val Gly Ala Leu Ala Trp Ala Ala Phe Thr305 310 315 320Ala Ala Arg Arg Ile Gly Thr Trp Ile Gly Ala Ala Ala Glu Gly Gln
325 330 335Val Ser Arg Gln Asn Pro Thr Gly
340(2)SEQ ID NO:71的信息:
(ⅰ)序列特征:
(A)长度:485氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:71:Asp Asp Pro Asp Met Pro Gly Thr Val Ala Lys Ala Val Ala Asp Ala1 5 10 15Leu Gly Arg Gly Ile Ala Pro Val Glu Asp Ile Gln Asp Cys Val Glu
20 25 30Ala Arg Leu Gly Glu Ala Gly Leu Asp Asp Val Ala Arg Val Tyr Ile
35 40 45Ile Tyr Arg Gln Arg Arg Ala Glu Leu Arg Thr Ala Lys Ala Leu Leu
50 55 60Gly Val Arg Asp Glu Leu Lys Leu Ser Leu Ala Ala Val Thr Val Leu65 70 75 80Arg Glu Arg Tyr Leu Leu His Asp Glu Gln Gly Arg Pro Ala Glu Ser
85 90 95Thr Gly Glu Leu Met Asp Arg Ser Ala Arg Cys Val Ala Ala Ala Glu
100 105 110Asp Gln Tyr Glu Pro Gly Ser Ser Arg Arg Trp Ala Glu Arg Phe Ala
115 120 125Thr Leu Leu Arg Asn Leu Glu Phe Leu Pro Asn Ser Pro Thr Leu Met
130 135 140Asn Ser Gly Thr Asp Leu Gly Leu Leu Ala Gly Cys Phe Val Leu Pro145 150 155 160Ile Glu Asp Ser Leu Gln Ser Ile Phe Ala Thr Leu Gly Gln Ala Ala
165 170 175Glu Leu Gln Arg Ala Gly Gly Gly Thr Gly Tyr Ala Phe Ser His Leu
180 185 190Arg Pro Ala Gly Asp Arg Val Ala Ser Thr Gly Gly Thr Ala Ser Gly
195 200 205Pro Val Ser Phe Leu Arg Leu Tyr Asp Ser Ala Ala Gly Val Val Ser
210 215 220Met Gly Gly Arg Arg Arg Gly Ala Cys Met Ala Val Leu Asp Val Ser225 230 235 240His Pro Asp Ile Cys Asp Phe Val Thr Ala Lys Ala Glu Ser Pro Ser
245 250 255Glu Leu Pro His Phe Asn Leu Ser Val Gly Val Thr Asp Ala Phe Leu
260 265 270Arg Ala Val Glu Arg Asn Gly Leu His Arg Leu Val Asn Pro Arg Thr
275 280 285Gly Lys Ile Val Ala Arg Met Pro Ala Ala Glu Leu Phe Asp Ala Ile
290 295 300Cys Lys Ala Ala His Ala Gly Gly Asp Pro Gly Leu Val Phe Leu Asp305 310 315 320Thr Ile Ash Arg Ala Asn Pro Val Pro Gly Arg Gly Arg Ile Glu Ala
325 330 335Thr Asn Pro Cys Gly Glu Val Pro Leu Leu Pro Tyr Glu Ser Cys Asn
340 345 350Leu Gly Ser Ile Asn Leu Ala Arg Met Leu Ala Asp Gly Arg Val Asp
355 360 365Trp Asp Arg Leu Glu Glu Val Ala Gly Val Ala Val Arg Phe Leu Asp
370 375 380Asp Val Ile Asp Val Ser Arg Tyr Pro Phe Pro Glu Leu Gly Glu Ala385 390 395 400Ala Arg Ala Thr Arg Lys Ile Gly Leu Gly Val Met Gly Leu Ala Glu
405 410 415Leu Leu Ala Ala Leu Gly Ile Pro Tyr Asp Ser Glu Glu Ala Val Arg
420 425 430Leu Ala Thr Arg Leu Met Arg Arg Ile Gln Gln Ala Ala His Thr Ala
435 440 445Ser Arg Arg Leu Ala Glu Glu Arg Gly Ala Phe Pro Ala Phe Thr Asp
450 455 460Ser Arg Phe Ala Arg Ser Gly Pro Arg Arg Asn Ala Gln Val Thr Ser465 470 475 480Val Ala Pro Thr Gly
485(2)SEQ ID NO:72的信息:
(ⅰ)序列特征:
(A)长度:267氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:72:Gly Val Ile Val Leu Asp Leu Glu Pro Arg Gly Pro Leu Pro Thr Glu1 5 10 15Ile Tyr Trp Arg Arg Arg Gly Leu Ala Leu Gly Ile Ala Val Val Val
20 25 30Val Gly Ile Ala Val Ala Ile Val Ile Ala Phe Val Asp Ser Ser Ala
35 40 45Gly Ala Lys Pro Val Ser Ala Asp Lys Pro Ala Ser Ala Gln Ser His
50 55 60Pro Gly Ser Pro Ala Pro Gln Ala Pro Gln Pro Ala Gly Gln Thr Glu65 70 75 80Gly Asn Ala Ala Ala Ala Pro Pro Gln Gly Gln Asn Pro Glu Thr Pro
85 90 95Thr Pro Thr Ala Ala Val Gln Pro Pro Pro Val Leu Lys Glu Gly Asp
100 105 110Asp Cys Pro Asp Ser Thr Leu Ala Val Lys Gly Leu Thr Asn Ala Pro
115 120 125Gln Tyr Tyr Val Gly Asp Gln Pro Lys Phe Thr Met Val Val Thr Asn
130 135 140Ile Gly Leu Val Ser Cys Lys Arg Asp Val Gly Ala Ala Val Leu Ala145 150 155 160Ala Tyr Val Tyr Ser Leu Asp Asn Lys Arg Leu Trp Ser Asn Leu Asp
165 170 175Cys Ala Pro Ser Asn Glu Thr Leu Val Lys Thr Phe Ser Pro Gly Glu
180 185 190Gln Val Thr Thr Ala Val Thr Trp Thr Gly Met Gly Ser Ala Pro Arg
195 200 205Cys Pro Leu Pro Arg Pro Ala Ile Gly Pro Gly Thr Tyr Asn Leu Val
210 215 220Val Gln Leu Gly Asn Leu Arg Ser Leu Pro Val Pro Phe Ile Leu Asn225 230 235 240Gln Pro Pro Pro Pro Pro Gly Pro Val Pro Ala Pro Gly Pro Ala Gln
245 250 255Ala Pro Pro Pro Glu Ser Pro Ala Gln Gly Gly
260 265(2)SEQ ID NO:73的信息:
(ⅰ)序列特征:
(A)长度:97氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:73:Leu Ile Ser Thr Gly Lys Ala Ser His Ala Ser Leu Gly Val Gln Val1 5 10 15Thr Asn Asp Lys Asp Thr Pro Gly Ala Lys Ile Val Glu Val Val Ala
20 25 30Gly Gly Ala Ala Ala Asn Ala Gly Val Pro Lys Gly Val Val Val Thr
35 40 45Lys Val Asp Asp Arg Pro Ile Asn Ser Ala Asp Ala Leu Val Ala Ala
50 55 60Val Arg Ser Lys Ala Pro Gly Ala Thr Val Ala Leu Thr Phe Gln Asp65 70 75 80Pro Ser Gly Gly Ser Arg Thr Val Gln Val Thr Leu Gly Lys Ala Glu
85 90 95Gln(2)SEQ ID NO:74的信息:
(ⅰ)序列特征:
(A)长度:364氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:74:Gly Ala Ala Val Ser Leu Leu Ala Ala Gly Thr Leu Val Leu Thr Ala1 5 10 15Cys Gly Gly Gly Thr Asn Ser Ser Ser Ser Gly Ala Gly Gly Thr Ser
20 25 30Gly Ser Val His Cys Gly Gly Lys Lys Glu Leu His Ser Ser Gly Ser
35 40 45Thr Ala Gln Glu Asn Ala Met Glu Gln Phe Val Tyr Ala Tyr Val Arg
50 55 60Ser Cys Pro Gly Tyr Thr Leu Asp Tyr Asn Ala Asn Gly Ser Gly Ala65 70 75 80Gly Val Thr Gln Phe Leu Asn Asn Glu Thr Asp Phe Ala Gly Ser Asp
85 90 95Val Pro Leu Asn Pro Ser Thr Gly Gln Pro Asp Arg Ser Ala Glu Arg
100 105 110Cys Gly Ser Pro Ala Trp Asp Leu Pro Thr Val Phe Gly Pro Ile Ala
115 120 125Ile Thr Tyr Asn Ile Lys Gly Val Ser Thr Leu Asn Leu Asp Gly Pro
130 135 140Thr Thr Ala Lys Ile Phe Asn Gly Thr Ile Thr Val Trp Asn Asp Pro145 150 155 160Gln Ile Gln Ala Leu Ash Ser Gly Thr Asp Leu Pro Pro Thr Pro Ile
165 170 175Ser Val Ile Phe Arg Ser Asp Lys Ser Gly Thr Ser Asp Asn Phe Gln
180 185 190Lys Tyr Leu Asp Gly Val Ser Asn Gly Ala Trp Gly Lys Gly Ala Ser
195 200 205Glu Thr Phe Ser Gly Gly Val Gly Val Gly Ala Ser Gly Asn Asn Gly
210 215 220Thr Ser Ala Leu Leu Gln Thr Thr Asp Gly Ser Ile Thr Tyr Asn Glu225 230 235 240Trp Ser Phe Ala Val Gly Lys Gln Leu Asn Met Ala Gln Ile Ile Thr
245 250 255Ser Ala Gly Pro Asp Pro Val Ala Ile Thr Thr Glu Ser Val Gly Lys
260 265 270Thr Ile Ala Gly Ala Lys Ile Met Gly Gln Gly Asn Asp Leu Val Leu
275 280 285Asp Thr Ser Ser Phe Tyr Arg Pro Thr Gln Pro Gly Ser Tyr Pro Ile
290 295 300Val Leu Ala Thr Tyr Glu Ile Val Cys Ser Lys Tyr Pro Asp Ala Thr305 310 315 320Thr Gly Thr Ala Val Arg Ala Phe Met Gln Ala Ala Ile Gly Pro Gly
325 330 335Gln Glu Gly Leu Asp Gln Tyr Gly Ser lle Pro Leu Pro Lys Ser Phe
340 345 350Gln Ala Lys Leu Ala Ala Ala Val Asn Ala Ile Ser
355 360(2)SEQ ID NO:75的信息:
(ⅰ)序列特征:
(A)长度:309氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:75:Gln Ala Ala Ala Gly Arg Ala Val Arg Arg Thr Gly His Ala Glu Asp1 5 10 15Gln Thr His Gln Asp Arg Leu His His Gly Cys Arg Arg Ala Ala Val
20 25 30Val Val Arg Gln Asp Arg Ala Ser Val Ser Ala Thr Ser Ala Arg Pro
35 40 45Pro Arg Arg His Pro Ala Gln Gly His Arg Arg Arg Val Ala Pro Ser
50 55 60Gly Gly Arg Arg Arg Pro His Pro His His Val Gln Pro Asp Asp Arg65 70 75 80Arg Asp Arg Pro Ala Leu Leu Asp Arg Thr Gln Pro Ala Glu His Pro
85 90 95Asp Pro His Arg Arg Gly Pro Ala Asp Pro Gly Arg Val Arg Gly Arg
100 105 110Gly Arg Leu Arg Arg Val Asp Asp Gly Arg Leu Gln Pro Asp Arg Asp
115 120 125Ala Asp His Gly Ala Pro Val Arg Gly Arg Gly Pro His Arg Gly Val
130 135 140Gln His Arg Gly Gly Pro Val Phe Val Arg Arg Val Pro Gly Val Arg145 150 155 160Cys Ala His Arg Arg Gly His Arg Arg Val Ala Ala Pro Gly Gln Gly
165 170 175Asp Val Leu Arg Ala Gly Leu Arg Val Glu Arg Leu Arg Pro Val Ala
180 185 190Ala Val Glu Asn Leu His Arg Gly Ser Gln Arg Ala Asp Gly Arg Val
195 200 205Phe Arg Pro Ile Arg Arg Gly Ala Arg Leu Pro Ala Arg Arg Ser Arg
210 215 220Ala Gly Pro Gln Gly Arg Leu His Leu Asp Gly Ala Gly Pro Ser Pro225 230 235 240Leu Pro Ala Arg Ala Gly Gln Gln Gln Pro Ser Ser Ala Gly Gly Arg
245 250 255Arg Ala Gly Gly Ala Glu Arg Ala Asp Pro Gly Gln Arg Gly Arg His
260 265 270His Gln Gly Gly His Asp Pro Gly Arg Gln Gly Ala Gln Arg Gly Thr
275 280 285Ala Gly Val Ala His Ala Ala Ala Gly Pro Arg Arg Ala Ala Val Arg
290 295 300Asn Arg Pro Arg Arg305(2)SEQ ID NO:76的信息:
(ⅰ)序列特征:
(A)长度:580氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:76:Ser Ala Val Trp Cys Leu Asn Gly Phe Thr Gly Arg His Arg His Gly1 5 10 15Arg Cys Arg Val Arg Ala Ser Gly Trp Arg Ser Ser Asn Arg Trp Cys
20 25 30Ser Thr Thr Ala Asp Cys Cys Ala Ser Lys Thr Pro Thr Gln Ala Ala
35 40 45Ser Pro Leu Glu Arg Arg Phe Thr Cys Cys Ser Pro Ala Val Gly Cys
50 55 60Arg Phe Arg Ser Phe Pro Val Arg Arg Leu Ala Leu Gly Ala Arg Thr65 70 75 80Ser Arg Thr Leu Gly Val Arg Arg Thr Leu Ser Gln Trp Asn Leu Ser
85 90 95Pro Arg Ala Gln Pro Ser Cys Ala Val Thr Val Glu Ser His Thr His
100 105 110Ala Ser Pro Arg Met Ala Lys Leu Ala Arg Val Val Gly Leu Val Gln
115 120 125Glu Glu Gln Pro Ser Asp Met Thr Asn His Pro Arg Tyr Ser Pro Pro
130 135 140Pro Gln Gln Pro Gly Thr Pro Gly Tyr Ala Gln Gly Gln Gln Gln Thr145 150 155 160Tyr Ser Gln Gln Phe Asp Trp Arg Tyr Pro Pro Ser Pro Pro Pro Gln
165 170 175Pro Thr Gln Tyr Arg Gln Pro Tyr Glu Ala Leu Gly Gly Thr Arg Pro
180 185 190Gly Leu Ile Pro Gly Val Ile Pro Thr Met Thr Pro Pro Pro Gly Met
195 200 205Val Arg Gln Arg Pro Arg Ala Gly Met Leu Ala Ile Gly Ala Val Thr
210 215 220Ile Ala Val Val Ser Ala Gly Ile Gly Gly Ala Ala Ala Ser Leu Val225 230 235 240Gly Phe Asn Arg Ala Pro Ala Gly Pro Ser Gly Gly Pro Val Ala Ala
245 250 255Ser Ala Ala Pro Ser Ile Pro Ala Ala Asn Met Pro Pro Gly Ser Val
260 265 270Glu Gln Val Ala Ala Lys Val Val Pro Ser Val Val Met Leu Glu Thr
275 280 285Asp Leu Gly Arg Gln Ser Glu Glu Gly Ser Gly Ile Ile Leu Ser Ala
290 295 300Glu Gly Leu Ile Leu Thr Asn Asn His Val Ile Ala Ala Ala Ala Lys305 310 315 320Pro Pro Leu Gly Ser Pro Pro Pro Lys Thr Thr Val Thr Phe Ser Asp
325 330 335Gly Arg Thr Ala Pro Phe Thr Val Val Gly Ala Asp Pro Thr Ser Asp
340 345 350Ile Ala Val Val Arg Val Gln Gly Val Ser Gly Leu Thr Pro Ile Ser
355 360 365Leu Gly Ser Ser Ser Asp Leu Arg Val Gly Gln Pro Val Leu Ala Ile
370 375 380Gly Ser Pro Leu Gly Leu Glu Gly Thr Val Thr Thr Gly Ile Val Ser385 390 395 400Ala Leu Asn Arg Pro Val Ser Thr Thr Gly Glu Ala Gly Asn Gln Asn
405 410 415Thr Val Leu Asp Ala Ile Gln Thr Asp Ala Ala Ile Asn Pro Gly Asn
420 425 430Ser Gly Gly Ala Leu Val Asn Met Asn Ala Gln Leu Val Gly Val Asn
435 440 445Ser Ala Ile Ala Thr Leu Gly Ala Asp Ser Ala Asp Ala Gln Ser Gly
450 455 460Ser Ile Gly Leu Gly Phe Ala Ile Pro Val Asp Gln Ala Lys Arg Ile465 470 475 480Ala Asp Glu Leu Ile Ser Thr Gly Lys Ala Ser His Ala Ser Leu Gly
485 490 495Val Gln Val Thr Asn Asp Lys Asp Thr Pro Gly Ala Lys Ile Val Glu
500 505 510Val Val Ala Gly Gly Ala Ala Ala Asn Ala Gly Val Pro Lys Gly Val
515 520 525Val Val Thr Lys Val Asp Asp Arg Pro Ile Asn Ser Ala Asp Ala Leu
530 535 540Val Ala Ala Val Arg Ser Lys Ala Pro Gly Ala Thr Val Ala Leu Thr545 550 555 560Phe Gln Asp Pro Ser Gly Gly Ser Arg Thr Val Gln Val Thr Leu Gly
565 570 575Lys Ala Glu Gln
580(2)SEQ ID NO:77的信息:
(ⅰ)序列特征:
(A)长度:233氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:77:Met Asn Asp Gly Lys Arg Ala Val Thr Ser Ala Val Leu Val Val Leu1 5 10 15Gly Ala Cys Leu Ala Leu Trp Leu Ser Gly Cys Ser Ser Pro Lys Pro
20 25 30Asp Ala Glu Glu Gln Gly Val Pro Val Ser Pro Thr Ala Ser Asp Pro
35 40 45Ala Leu Leu Ala Glu Ile Arg Gln Ser Leu Asp Ala Thr Lys Gly Leu
50 55 60Thr Ser Val His Val Ala Val Arg Thr Thr Gly Lys Val Asp Ser Leu65 70 75 80Leu Gly Ile Thr Ser Ala Asp Val Asp Val Arg Ala Asn Pro Leu Ala
85 90 95Ala Lys Gly Val Cys Thr Tyr Asn Asp Glu Gln Gly Val Pro Phe Arg
100 105 110Val Gln Gly Asp Asn Ile Ser Val Lys Leu Phe Asp Asp Trp Ser Asn
115 120 125Leu Gly Ser Ile Ser Glu Leu Ser Thr Ser Arg Val Leu Asp Pro Ala
130 135 140Ala Gly Val Thr Gln Leu Leu Ser Gly Val Thr Asn Leu Gln Ala Gln145 150 155 160Gly Thr Glu Val Ile Asp Gly Ile Ser Thr Thr Lys Ile Thr Gly Thr
165 170 175Ile Pro Ala Ser Ser Val Lys Met Leu Asp Pro Gly Ala Lys Ser Ala
180 185 190Arg Pro Ala Thr Val Trp Ile Ala Gln Asp Gly Ser His His Leu Val
195 200 205Arg Ala Ser Ile Asp Leu Gly Ser Gly Ser Ile Gln Leu Thr Gln Ser
210 215 220Lys Trp Asn Glu Pro Val Asn Val Asp225 230(2)SEQ ID NO:78的信息:
(ⅰ)序列特征:
(A)长度:66氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:78:Val Ile Asp Ile Ile Gly Thr Ser Pro Thr Ser Trp Glu Gln Ala Ala1 5 10 15Ala Glu Ala Val Gln Arg Ala Arg Asp Ser Val Asp Asp Ile Arg Val
20 25 30Ala Arg Val Ile Glu Gln Asp Met Ala Val Asp Ser Ala Gly Lys Ile
35 40 45Thr Tyr Arg Ile Lys Leu Glu Val Ser Phe Lys Met Arg Pro Ala Gln
50 55 60Pro Arg65(2)SEQ ID NO:79的信息:
(ⅰ)序列特征:
(A)长度:69氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:79:Val Pro Pro Ala Pro Pro Leu Pro Pro Leu Pro Pro Ser Pro Ile Ser1 5 10 15Cys Ala Ser Pro Pro Ser Pro Pro Leu Pro Pro Ala Pro Pro Val Ala
20 25 30Pro Gly Pro Pro Met Pro Pro Leu Asp Pro Trp Pro Pro Ala Pro Pro
35 40 45Leu Pro Tyr Ser Thr Pro Pro Gly Ala Pro Leu Pro Pro Ser Pro Pro
50 55 60Ser Pro Pro Leu Pro65(2)SEQ ID NO:80的信息:
(ⅰ)序列特征:
(A)长度:355氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:80:Met Ser Asn Ser Arg Arg Arg Ser Leu Arg Trp Ser Trp Leu Leu Ser1 5 10 15Val Leu Ala Ala Val Gly Leu Gly Leu Ala Thr Ala Pro Ala Gln Ala
20 25 30Ala Pro Pro Ala Leu Ser Gln Asp Arg Phe Ala Asp Phe Pro Ala Leu
35 40 45Pro Leu Asp Pro Ser Ala Met Val Ala Gln Val Ala Pro Gln Val Val
50 55 60Asn Ile Asn Thr Lys Leu Gly Tyr Asn Asn Ala Val Gly Ala Gly Thr65 70 75 80Gly Ile Val Ile Asp Pro Asn Gly Val Val Leu Thr Asn Asn His Val
85 90 95Ile Ala Gly Ala Thr Asp Ile Asn Ala Phe Ser Val Gly Ser Gly Gln
100 105 110Thr Tyr Gly Val Asp Val Val Gly Tyr Asp Arg Thr Gln Asp Val Ala
115 120 125Val Leu Gln Leu Arg Gly Ala Gly Gly Leu Pro Ser Ala Ala Ile Gly
130 135 140Gly Gly Val Ala Val Gly Glu Pro Val Val Ala Met Gly Asn Ser Gly145 150 155 160Gly Gln Gly Gly Thr Pro Arg Ala Val Pro Gly Arg Val Val Ala Leu
165 170 175Gly Gln Thr Val Gln Ala Ser Asp Ser Leu Thr Gly Ala Glu Glu Thr
180 185 190Leu Asn Gly Leu Ile Gln Phe Asp Ala Ala Ile Gln Pro Gly Asp Ser
195 200 205Gly Gly Pro Val Val Asn Gly Leu Gly Gln Val Val Gly Met Asn Thr
210 215 220Ala Ala Ser Asp Asn Phe Gln Leu Ser Gln Gly Gly Gln Gly Phe Ala225 230 235 240Ile Pro Ile Gly Gln Ala Met Ala Ile Ala Gly Gln Ile Arg Ser Gly
245 250 255Gly Gly Ser Pro Thr Val His Ile Gly Pro Thr Ala Phe Leu Gly Leu
260 265 270Gly Val Val Asp Asn Asn Gly Asn Gly Ala Arg Val Gln Arg Val Val
275 280 285Gly Ser Ala Pro Ala Ala Ser Leu Gly Ile Ser Thr Gly Asp Val Ile
290 295 300Thr Ala Val Asp Gly Ala Pro Ile Asn Ser Ala Thr Ala Met Ala Asp305 310 315 320Ala Leu Asn Gly His His Pro Gly Asp Val Ile Ser Val Asn Trp Gln
325 330 335Thr Lys Ser Gly Gly Thr Arg Thr Gly Asn Val Thr Leu Ala Glu Gly
340 345 350Pro Pro Ala
355(2)SEQ ID NO:81的信息:
(ⅰ)序列特征:
(A)长度:205氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:81:Ser Pro Lys Pro Asp Ala Glu Glu Gln Gly Val Pro Val Ser Pro Thr1 5 10 15Ala Ser Asp Pro Ala Leu Leu Ala Glu Ile Arg Gln Ser Leu Asp Ala
20 25 30Thr Lys Gly Leu Thr Ser Val His Val Ala Val Arg Thr Thr Gly Lys
35 40 45Val Asp Ser Leu Leu Gly Ile Thr Ser Ala Asp Val Asp Val Arg Ala
50 55 60Asn Pro Leu Ala Ala Lys Gly Val Cys Thr Tyr Asn Asp Glu Gln Gly65 70 75 80Val Pro Phe Arg Val Gln Gly Asp Asn Ile Ser Val Lys Leu Phe Asp
85 90 95Asp Trp Ser Asn Leu Gly Ser Ile Ser Glu Leu Ser Thr Ser Arg Val
100 105 110Leu Asp Pro Ala Ala Gly Val Thr Gln Leu Leu Ser Gly Val Thr Asn
115 120 125Leu Gln Ala Gln Gly Thr Glu Val Ile Asp Gly Ile Ser Thr Thr Lys
130 135 140Ile Thr Gly Thr Ile Pro Ala Ser Ser Val Lys Met Leu Asp Pro Gly145 150 155 160Ala Lys Ser Ala Arg Pro Ala Thr Val Trp Ile Ala Gln Asp Gly Ser
165 170 175His His Leu Val Arg Ala Ser Ile Asp Leu Gly Ser Gly Ser Ile Gln
180 185 190Leu Thr Gln Ser Lys Trp Asn Glu Pro Val Asn Val Asp
195 200 205(2)SEQ ID NO:82的信息:
(ⅰ)序列特征:
(A)长度:286氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:82:Gly Asp Ser Phe Trp Ala Ala Ala Asp Gln Met Ala Arg Gly Phe Val1 5 10 15Leu Gly Ala Thr Ala Gly Arg Thr Thr Leu Thr Gly Glu Gly Leu Gln
20 25 30His Ala Asp Gly His Ser Leu Leu Leu Asp Ala Thr Asn Pro Ala Val
35 40 45Val Ala Tyr Asp Pro Ala Phe Ala Tyr Glu Ile Gly Tyr Ile Xaa Glu
50 55 60Ser Gly Leu Ala Arg Met Cys Gly Glu Asn Pro Glu Asn Ile Phe Phe65 70 75 80Tyr Ile Thr Val Tyr Asn Glu Pro Tyr Val Gln Pro Pro Glu Pro Glu
85 90 95Asn Phe Asp Pro Glu Gly Val Leu Gly Gly Ile Tyr Arg Tyr His Ala
100 105 110Ala Thr Glu Gln Arg Thr Asn Lys Xaa Gln Ile Leu Ala Ser Gly Val
115 120 125Ala Met Pro Ala Ala Leu Arg Ala Ala Gln Met Leu Ala Ala Glu Trp
130 135 140Asp Val Ala Ala Asp Val Trp Ser Val Thr Ser Trp Gly Glu Leu Asn145 150 155 160Arg Asp Gly Val Val Ile Glu Thr Glu Lys Leu Arg His Pro Asp Arg
165 170 175Pro Ala Gly Val Pro Tyr Val Thr Arg Ala Leu Glu Asn Ala Arg Gly
180 185 190Pro Val Ile Ala Val Ser Asp Trp Met Arg Ala Val Pro Glu Gln Ile
195 200 205Arg Pro Trp Val Pro Gly Thr Tyr Leu Thr Leu Gly Thr Asp Gly Phe
210 215 220Gly Phe Ser Asp Thr Arg Pro Ala Gly Arg Arg Tyr Phe Asn Thr Asp225 230 235 240Ala Glu Ser Gln Val Gly Arg Gly Phe Gly Arg Gly Trp Pro Gly Arg
245 250 255Arg Val Asn Ile Asp Pro Phe Gly Ala Gly Arg Gly Pro Pro Ala Gln
260 265 270Leu Pro Gly Phe Asp Glu Gly Gly Gly Leu Arg Pro Xaa Lys
275 280 285(2)SEQ ID NO:83的信息:
(ⅰ)序列特征:
(A)长度:173氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:83:Thr Lys Phe His Ala Leu Met Gln Glu Gln Ile His Asn Glu Phe Thr1 5 10 15Ala Ala Gln Gln Tyr Val Ala Ile Ala Val Tyr Phe Asp Ser Glu Asp
20 25 30Leu Pro Gln Leu Ala Lys His Phe Tyr Ser Gln Ala Val Glu Glu Arg
35 40 45Asn His Ala Met Met Leu Val Gln His Leu Leu Asp Arg Asp Leu Arg
50 55 60Val Glu Ile Pro Gly Val Asp Thr Val Arg Asn Gln Phe Asp Arg Pro65 70 75 80Arg Glu Ala Leu Ala Leu Ala Leu Asp Gln Glu Arg Thr Val Thr Asp
85 90 95Gln Val Gly Arg Leu Thr Ala Val Ala Arg Asp Glu Gly Asp Phe Leu
100 105 110Gly Glu Gln Phe Met Gln Trp Phe Leu Gln Glu Gln Ile Glu Glu Val
115 120 125Ala Leu Met Ala Thr Leu Val Arg Val Ala Asp Arg Ala Gly Ala Asn
130 135 140Leu Phe Glu Leu Glu Asn Phe Val Ala Arg Glu Val Asp Val Ala Pro145 150 155 160Ala Ala Ser Gly Ala Pro His Ala Ala Gly Gly Arg Leu
165 170(2)SEQ ID NO:84的信息:
(ⅰ)序列特征:
(A)长度:107氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:84:Arg Ala Asp Glu Arg Lys Asn Thr Thr Met Lys Met Val Lys Ser Ile1 5 10 15Ala Ala Gly Leu Thr Ala Ala Ala Ala Ile Gly Ala Ala Ala Ala Gly
20 25 30Val Thr Ser Ile Met Ala Gly Gly Pro Val Val Tyr Gln Met Gln Pro
35 40 45Val Val Phe Gly Ala Pro Leu Pro Leu Asp Pro Xaa Ser Ala Pro Xaa
50 55 60Val Pro Thr Ala Ala Gln Trp Thr Xaa Leu Leu Asn Xaa Leu Xaa Asp65 70 75 80Pro Asn Val Ser Phe Xaa Asn Lys Gly Ser Leu Val Glu Gly Gly Ile
85 90 95Gly Gly Xaa Glu Gly Xaa Xaa Arg Arg Xaa Gln
100 105(2)SEQ ID NO:85的信息:
(ⅰ)序列特征:
(A)长度:125氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:85:Val Leu Ser Val Pro Val Gly Asp Gly Phe Trp Xaa Arg Val Val Asn1 5 10 15Pro Leu Gly Gln Pro Ile Asp Gly Arg Gly Asp Val Asp Ser Asp Thr
20 25 30Arg Arg Ala Leu Glu Leu Gln Ala Pro Ser Val Val Xaa Arg Gln Gly
35 40 45Val Lys Glu Pro Leu Xaa Thr Gly Ile Lys Ala Ile Asp Ala Met Thr
50 55 60Pro Ile Gly Arg Gly Gln Arg Gln Leu Ile Ile Gly Asp Arg Lys Thr65 70 75 80Gly Lys Asn Arg Arg Leu Cys Arg Thr Pro Ser Ser Asn Gln Arg Glu
85 90 95Glu Leu Gly Val Arg Trp Ile Pro Arg Ser Arg Cys Ala Cys Val Tyr
100 105 110Val Gly His Arg Ala Arg Arg Gly Thr Tyr His Arg Arg
115 120 125(2)SEQ ID NO:86的信息:
(ⅰ)序列特征:
(A)长度:117氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:86:Cys Asp Ala Val Met Gly Phe Leu Gly Gly Ala Gly Pro Leu Ala Val1 5 10 15Val Asp Gln Gln Leu Val Thr Arg Val Pro Gln Gly Trp Ser Phe Ala
20 25 30Gln Ala Ala Ala Val Pro Val Val Phe Leu Thr Ala Trp Tyr Gly Leu
35 40 45Ala Asp Leu Ala Glu Ile Lys Ala Gly Glu Ser Val Leu Ile His Ala
50 55 60Gly Thr Gly Gly Val Gly Met Ala Ala Val Gln Leu Ala Arg Gln Trp65 70 75 80Gly Val Glu Val Phe Val Thr Ala Ser Arg Gly Lys Trp Asp Thr Leu
85 90 95Arg Ala Xaa Xaa Phe Asp Asp Xaa Pro Tyr Arg Xaa Phe Pro His Xaa
100 105 110Arg Ser Ser Xaa Gly
115(2)SEQ ID NO:87的信息:
(ⅰ)序列特征:
(A)长度:103氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:87:Met Tyr Arg Phe Ala Cys Arg Thr Leu Met Leu Ala Ala Cys Ile Leu1 5 10 15Ala Thr Gly Val Ala Gly Leu Gly Val Gly Ala Gln Ser Ala Ala Gln
20 25 30Thr Ala Pro Val Pro Asp Tyr Tyr Trp Cys Pro Gly Gln Pro Phe Asp
35 40 45Pro Ala Trp Gly Pro Asn Trp Asp Pro Tyr Thr Cys His Asp Asp Phe
50 55 60His Arg Asp Ser Asp Gly Pro Asp His Ser Arg Asp Tyr Pro Gly Pro65 70 75 80Ile Leu Glu Gly Pro Val Leu Asp Asp Pro Gly Ala Ala Pro Pro Pro
85 90 95Pro Ala Ala Gly Gly Gly Ala
100(2)SEQ ID NO:88的信息:
(ⅰ)序列特征:
(A)长度:88氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:88:Val Gln Cys Arg Val Trp Leu Glu Ile Gln Trp Arg Gly Met Leu Gly1 5 10 15Ala Asp Gln Ala Arg Ala Gly Gly Pro Ala Arg Ile Trp Arg Glu His
20 25 30Ser Met Ala Ala Met Lys Pro Arg Thr Gly Asp Gly Pro Leu Glu Ala
35 40 45Thr Lys Glu Gly Arg Gly Ile Val Met Arg Val Pro Leu Glu Gly Gly
50 55 60Gly Arg Leu Val Val Glu Leu Thr Pro Asp Glu Ala Ala Ala Leu Gly65 70 75 80Asp Glu Leu Lys Gly Val Thr Ser
85(2)SEQ ID NO:89的信息:
(ⅰ)序列特征:
(A)长度:95氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:89:Thr Asp Ala Ala Thr Leu Ala Gln Glu Ala Gly Asn Phe Glu Arg Ile1 5 10 15Ser Gly Asp Leu Lys Thr Gln Ile Asp Gln Val Glu Ser Thr Ala Gly
20 25 30Ser Leu Gln Gly Gln Trp Arg Gly Ala Ala Gly Thr Ala Ala Gln Ala
35 40 45Ala Val Val Arg Phe Gln Glu Ala Ala Asn Lys Gln Lys Gln Glu Leu
50 55 60Asp Glu Ile Ser Thr Asn Ile Arg Gln Ala Gly Val Gln Tyr Ser Arg65 70 75 80Ala Asp Glu Glu Gln Gln Gln Ala Leu Ser Ser Gln Met Gly Phe
85 90 95(2)SEQ ID NO:90的信息:
(ⅰ)序列特征:
(A)长度:166氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:90:Met Thr Gln Ser Gln Thr Val Thr Val Asp Gln Gln Glu Ile Leu Asn1 5 10 15Arg Ala Asn Glu Val Glu Ala Pro Met Ala Asp Pro Pro Thr Asp Val
20 25 30Pro Ile Thr Pro Cys Glu Leu Thr Xaa Xaa Lys Asn Ala Ala Gln Gln
35 40 45Xaa Val Leu Ser Ala Asp Asn Met Arg Glu Tyr Leu Ala Ala Gly Ala
50 55 60Lys Glu Arg Gln Arg Leu Ala Thr Ser Leu Arg Asn Ala Ala Lys Xaa65 70 75 80Tyr Gly Glu Val Asp Glu Glu Ala Ala Thr Ala Leu Asp Asn Asp Gly
85 90 95Glu Gly Thr Val Gln Ala Glu Ser Ala Gly Ala Val Gly Gly Asp Ser
100 105 110Ser Ala Glu Leu Thr Asp Thr Pro Arg Val Ala Thr Ala Gly Glu Pro
115 120 125Asn Phe Met Asp Leu Lys Glu Ala Ala Arg Lys Leu Glu Thr Gly Asp
130 135 140Gln Gly Ala Ser Leu Ala His Xaa Gly Asp Gly Trp Asn Thr Xaa Thr145 150 155 160Leu Thr Leu Gln Gly Asp
165(2)SEQ ID NO:91的信息:
(ⅰ)序列特征:
(A)长度:5氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:9l:
Arg Ala Glu Arg Met
1 5(2)SEQ ID NO:92的信息:
(ⅰ)序列特征:
(A)长度:263氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:92:Val Ala Trp Met Ser Val Thr Ala Gly Gln Ala Glu Leu Thr Ala Ala1 5 10 15Gln Val Arg Val Ala Ala Ala Ala Tyr Glu Thr Ala Tyr Gly Leu Thr
20 25 30Val Pro Pro Pro Val Ile Ala Glu Asn Arg Ala Glu Leu Met Ile Leu
35 40 45Ile Ala Thr Asn Leu Leu Gly Gln Asn Thr Pro Ala Ile Ala Val Asn
50 55 60Glu Ala Glu Tyr Gly Glu Met Trp Ala Gln Asp Ala Ala Ala Met Phe65 70 75 80Gly Tyr Ala Ala Ala Thr Ala Thr Ala Thr Ala Thr Leu Leu Pro Phe
85 90 95Glu Glu Ala Pro Glu Met Thr Ser Ala Gly Gly Leu Leu Glu Gln Ala
100 105 110Ala Ala Val Glu Glu Ala Ser Asp Thr Ala Ala Ala Asn Gln Leu Met
115 120 125Asn Asn Val Pro Gln Ala Leu Lys Gln Leu Ala Gln Pro Thr Gln Gly
130 135 140Thr Thr Pro Ser Ser Lys Leu Gly Gly Leu Trp Lys Thr Val Ser Pro145 150 155 160His Arg Ser Pro Ile Ser Asn Met Val Ser Met Ala Asn Asn His Met
165 170 175Ser Met Thr Asn Ser Gly Val Ser Met Thr Asn Thr Leu Ser Ser Met
180 185 190Leu Lys Gly Phe Ala Pro Ala Ala Ala Ala Gln Ala Val Gln Thr Ala
195 200 205Ala Gln Asn Gly Val Arg Ala Met Ser Ser Leu Gly Ser Ser Leu Gly
210 215 220Ser Ser Gly Leu Gly Gly Gly Val Ala Ala Asn Leu Gly Arg Ala Ala225 230 235 240Ser Val Arg Tyr Gly His Arg Asp Gly Gly Lys Tyr Ala Xaa Ser Gly
245 250 255
Arg Arg Asn Gly Gly Pro Ala
260(2)SEQ ID NO:93的信息:
(ⅰ)序列特征:
(A)长度:303氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:93:Met Thr Tyr Ser Pro Gly Asn Pro Gly Tyr Pro Gln Ala Gln Pro Ala1 5 10 15Gly Ser Tyr Gly Gly ValThr Pro Ser Phe Ala His Ala Asp Glu Gly
20 25 30Ala Ser Lys Leu Pro Met Tyr Leu Asn Ile Ala Val Ala Val Leu Gly
35 40 45Leu Ala Ala Tyr Phe Ala Ser Phe Gly Pro Met Phe Thr Leu Ser Thr
50 55 60Glu Leu Gly Gly Gly Asp Gly Ala Val Ser Gly Asp Thr Gly Leu Pro65 70 75 80Val Gly Val Ala Leu Leu Ala Ala Leu Leu Ala Gly Val Val Leu Val
85 90 95Pro Lys Ala Lys Ser His Val Thr Val Val Ala Val Leu Gly Val Leu
100 105 110Gly Val Phe Leu Met Val Ser Ala Thr Phe Asn Lys Pro Ser Ala Tyr
115 120 125Ser Thr Gly Trp Ala Leu Trp Val Val Leu Ala Phe Ile Val Phe Gln
130 135 140Ala Val Ala Ala Val Leu Ala Leu Leu Val Glu Thr Gly Ala Ile Thr145 150 155 160Ala Pro Ala Pro Arg Pro Lys Phe Asp Pro Tyr Gly Gln Tyr Gly Arg
165 170 175Tyr Gly Gln Tyr Gly Gln Tyr Gly Val Gln Pro Gly Gly Tyr Tyr Gly
180 185 190Gln Gln Gly Ala Gln Gln Ala Ala Gly Leu Gln Ser Pro Gly Pro Gln
195 200 205Gln Ser Pro Gln Pro Pro Gly Tyr Gly Ser Gln Tyr Gly Gly Tyr Ser
210 215 220Ser Ser Pro Ser Gln Ser Gly Ser Gly Tyr Thr Ala Gln Pro Pro Ala225 230 235 240Gln Pro Pro Ala Gln Ser Gly Ser Gln Gln Ser His Gln Gly Pro Ser
245 250 255Thr Pro Pro Thr Gly Phe Pro Ser Phe Ser Pro Pro Pro Pro Val Ser
260 265 270Ala Gly Thr Gly Ser Gln Ala Gly Ser Ala Pro Val Asn Tyr Ser Asn
275 280 285Pro Ser Gly Gly Glu Gln Ser Ser Ser Pro Gly Gly Ala Pro Val
290 295 300(2)SEQ ID NO:94的信息:
(ⅰ)序列特征:
(A)长度:507碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:94:ATGAAGATGG TGAAATCGAT CGCCGCAGGT CTGACCGCCG CGGCTGCAAT CGGCGCCGCT 60GCGGCCGGTG TGACTTCGAT CATGGCTGGC GGCCCGGTCG TATACCAGAT GCAGCCGGTC 120GTCTTCGGCG CGCCACTGCC GTTGGACCCG GCATCCGCCC CTGACGTCCC GACCGCCGCC 180CAGTTGACCA GCCTGCTCAA CAGCCTCGCC GATCCCAACG TGTCGTTTGC GAACAAGGGC 240AGTCTGGTCG AGGGCGGCAT CGGGGGCACC GAGGCGCGCA TCGCCGACCA CAAGCTGAAG 300AAGGCCGCCG AGCACGGGGA TCTGCCGCTG TCGTTCAGCG TGACGAACAT CCAGCCGGCG 360GCCGCCGGTT CGGCCACCGC CGACGTTTCC GTCTCGGGTC CGAAGCTCTC GTCGCCGGTC 420ACGCAGAACG TCACGTTCGT GAATCAAGGC GGCTGGATGC TGTCACGCGC ATCGGCGATG 480GAGTTGCTGC AGGCCGCAGG GAACTGA 507(2)SEQ ID NO:95的信息:
(ⅰ)序列特征:
(A)长度:168氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:95:Met Lys Met Val Lys Ser Ile Ala Ala Gly Leu Thr Ala Ala Ala Ala1 5 10 15Ile Gly Ala Ala Ala Ala Gly Val Thr Ser Ile Met Ala Gly Gly Pro
20 25 30Val Val Tyr Gln Met Gln Pro Val Val Phe Gly Ala Pro Leu Pro Leu
35 40 45Asp Pro Ala Ser Ala Pro Asp Val Pro Thr Ala Ala Gln Leu Thr Ser
50 55 60Leu Leu Asn Ser Leu Ala Asp Pro Asn Val Ser Phe Ala Asn Lys Gly65 70 75 80Ser Leu Val Glu Gly Gly Ile Gly Gly Thr Glu Ala Arg Ile Ala Asp
85 90 95His Lys Leu Lys Lys Ala Ala Glu His Gly Asp Leu Pro Leu Ser Phe
100 105 110Ser Val Thr Asn Ile Gln Pro Ala Ala Ala Gly Ser Ala Thr Ala Asp
115 120 125Val Ser Val Ser Gly Pro Lys Leu Ser Ser Pro Val Thr Gln Asn Val
130 135 140Thr Phe Val Asn Gln Gly Gly Trp Met Leu Ser Arg Ala Ser Ala Met145 150 155 160Glu Leu Leu Gln Ala Ala Gly Asn
165(2)SEQ ID NO:96的信息:
(ⅰ)序列特征:
(A)长度:500碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:96:CGTGGCAATG TCGTTGACCG TCGGGGCCGG GGTCGCCTCC GCAGATCCCG TGGACGCGGT 60CATTAACACC ACCTGCAATT ACGGGCAGGT AGTAGCTGCG CTCAACGCGA CGGATCCGGG 120GGCTGCCGCA CAGTTCAACG CCTCACCGGT GGCGCAGTCC TATTTGCGCA ATTTCCTCGC 180CGCACCGCCA CCTCAGCGCG CTGCCATGGC CGCGCAATTG CAAGCTGTGC CGGGGGCGGC 240ACAGTACATC GGCCTTGTCG AGTCGGTTGC CGGCTCCTGC AACAACTATT AAGCCCATGC 300GGGCCCCATC CCGCGACCCG GCATCGTCGC CGGGGCTAGG CCAGATTGCC CCGCTCCTCA 360ACGGGCCGCA TCCCGCGACC CGGCATCGTC GCCGGGGCTA GGCCAGATTG CCCCGCTCCT 420CAACGGGCCG CATCTCGTGC CGAATTCCTG CAGCCCGGGG GATCCACTAG TTCTAGAGCG 480GCCGCCACCG CGGTGGAGCT 500(2)SEQ ID NO:97的信息:
(ⅰ)序列特征:
(A)长度:96氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:97:Val Ala Met Ser Leu Thr Val Gly Ala Gly Val Ala Ser Ala Asp Pro1 5 10 15Val Asp Ala Val Ile Asn Thr Thr Cys Asn Tyr Gly Gln Val Val Ala
20 25 30Ala Leu Asn Ala Thr Asp Pro Gly Ala Ala Ala Gln Phe Asn Ala Ser
35 40 45Pro Val Ala Gln Ser Tyr Leu Arg Asn Phe Leu Ala Ala Pro Pro Pro
50 55 60Gln Arg Ala Ala Met Ala Ala Gln Leu Gln Ala Val Pro Gly Ala Ala65 70 75 80Gln Tyr Ile Gly Leu Val Glu Ser Val Ala Gly Ser Cys Asn Asn Tyr
85 90 95(2)SEQ ID NO:98的信息:
(ⅰ)序列特征:
(A)长度:154碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:98:ATGACAGAGC AGCAGTGGAA TTTCGCGGGT ATCGAGGCCG CGGCAAGCGC AATCCAGGGA 60AATGTCACGT CCATTCATTC CCTCCTTGAC GAGGGGAAGC AGTCCCTGAC CAAGCTCGCA 120GCGGCCTGGG GCGGTAGCGG TTCGGAAGCG TACC 154(2)SEQ ID NO:99的信息:
(ⅰ)序列特征:
(A)长度:51氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:99:Met Thr Glu Gln Gln Trp Asn Phe Ala Gly Ile Glu Ala Ala Ala Ser1 5 10 15Ala Ile Gln Gly Asn Val Thr Ser Ile His Ser Leu Leu Asp Glu Gly
20 25 30Lys Gln Ser Leu Thr Lys Leu Ala Ala Ala Trp Gly Gly Ser Gly Ser
35 40 45Glu Ala Tyr
50(2)SEQ ID NO:100的信息:
(ⅰ)序列特征:
(A)长度:282碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:100:CGGTCGCGCA CTTCCAGGTG ACTATGAAAG TCGGCTTCCG NCTGGAGGAT TCCTGAACCT 60TCAAGCGCGG CCGATAACTG AGGTGCATCA TTAAGCGACT TTTCCAGAAC ATCCTGACGC 120GCTCGAAACG CGGCACAGCC GACGGTGGCT CCGNCGAGGC GCTGNCTCCA AAATCCCTGA 180GACAATTCGN CGGGGGCGCC TACAAGGAAG TCGGTGCTGA ATTCGNCGNG TATCTGGTCG 240ACCTGTGTGG TCTGNAGCCG GACGAAGCGG TGCTCGACGT CG 282(2)SEQ ID NO:101的信息:
(ⅰ)序列特征:
(A)长度:3058碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:101:GATCGTACCC GTGCGAGTGC TCGGGCCGTT TGAGGATGGA GTGCACGTGT CTTTCGTGAT 60GGCATACCCA GAGATGTTGG CGGCGGCGGC TGACACCCTG CAGAGCATCG GTGCTACCAC 120TGTGGCTAGC AATGCCGCTG CGGCGGCCCC GACGACTGGG GTGGTGCCCC CCGCTGCCGA 180TGAGGTGTCG GCGCTGACTG CGGCGCACTT CGCCGCACAT GCGGCGATGT ATCAGTCCGT 240GAGCGCTCGG GCTGCTGCGA TTCATGACCA GTTCGTGGCC ACCCTTGCCA GCAGCGCCAG 300CTCGTATGCG GCCACTGAAG TCGCCAATGC GGCGGCGGCC AGCTAAGCCA GGAACAGTCG 360GCACGAGAAA CCACGAGAAA TAGGGACACG TAATGGTGGA TTTCGGGGCG TTACCACCGG 420AGATCAACTC CGCGAGGATG TACGCCGGCC CGGGTTCGGC CTCGCTGGTG GCCGCGGCTC 480AGATGTGGGA CAGCGTGGCG AGTGACCTGT TTTCGGCCGC GTCGGCGTTT CAGTCGGTGG 540TCTGGGGTCT GACGGTGGGG TCGTGGATAG GTTCGTCGGC GGGTCTGATG GTGGCGGCGG 600CCTCGCCGTA TGTGGCGTGG ATGAGCGTCA CCGCGGGGCA GGCCGAGCTG ACCGCCGCCC 660AGGTCCGGGT TGCTGCGGCG GCCTACGAGA CGGCGTATGG GCTGACGGTG CCCCCGCCGG 720TGATCGCCGA GAACCGTGCT GAACTGATGA TTCTGATAGC GACCAACCTC TTGGGGCAAA 780ACACCCCGGC GATCGCGGTC AACGAGGCCG AATACGGCGA GATGTGGGCC CAAGACGCCG 840CCGCGATGTT TGGCTACGCC GCGGCGACGG CGACGGCGAC GGCGACGTTG CTGCCGTTCG 900AGGAGGCGCC GGAGATGACC AGCGCGGGTG GGCTCCTCGA GCAGGCCGCC GCGGTCGAGG 960AGGCCTCCGA CACCGCCGCG GCGAACCAGT TGATGAACAA TGTGCCCCAG GCGCTGCAAC 1020AGCTGGCCCA GCCCACGCAG GGCACCACGC CTTCTTCCAA GCTGGGTGGC CTGTGGAAGA 1080CGGTCTCGCC GCATCGGTCG CCGATCAGCA ACATGGTGTC GATGGCCAAC AACCACATGT 1140CGATGACCAA CTCGGGTGTG TCGATGACCA ACACCTTGAG CTCGATGTTG AAGGGCTTTG 1200CTCCGGCGGC GGCCGCCCAG GCCGTGCAAA CCGCGGCGCA AAACGGGGTC CGGGCGATGA 1260GCTCGCTGGG CAGCTCGCTG GGTTCTTCGG GTCTGGGCGG TGGGGTGGCC GCCAACTTGG 1320GTCGGGCGGC CTCGGTCGGT TCGTTGTCGG TGCCGCAGGC CTGGGCCGCG GCCAACCAGG 1380CAGTCACCCC GGCGGCGCGG GCGCTGCCGC TGACCAGCCT GACCAGCGCC GCGGAAAGAG 1440GGCCCGGGCA GATGCTGGGC GGGCTGCCGG TGGGGCAGAT GGGCGCCAGG GCCGGTGGTG 1500GGCTCAGTGG TGTGCTGCGT GTTCCGCCGC GACCCTATGT GATGCCGCAT TCTCCGGCGG 1560CCGGCTAGGA GAGGGGGCGC AGACTGTCGT TATTTGACCA GTGATCGGCG GTCTCGGTGT 1620TTCCGCGGCC GGCTATGACA ACAGTCAATG TGCATGACAA GTTACAGGTA TTAGGTCCAG 1680GTTCAACAAG GAGACAGGCA ACATGGCCTC ACGTTTTATG ACGGATCCGC ACGCGATGCG 1740GGACATGGCG GGCCGTTTTG AGGTGCACGC CCAGACGGTG GAGGACGAGG CTCGCCGGAT 1800GTGGGCGTCC GCGCAAAACA TTTCCGGTGC GGGCTGGAGT GGCATGGCCG AGGCGACCTC 1860GCTAGACACC ATGGCCCAGA TGAATCAGGC GTTTCGCAAC ATCGTGAACA TGCTGCACGG 1920GGTGCGTGAC GGGCTGGTTC GCGACGCCAA CAACTACGAG CAGCAAGAGC AGGCCTCCCA 1980GCAGATCCTC AGCAGCTAAC GTCAGCCGCT GCAGCACAAT ACTTTTACAA GCGAAGGAGA 2040ACAGGTTCGA TGACCATCAA CTATCAATTC GGGGATGTCG ACGCTCACGG CGCCATGATC 2100CGCGCTCAGG CCGGGTTGCT GGAGGCCGAG CATCAGGCCA TCATTCGTGA TGTGTTGACC 2160GCGAGTGACT TTTGGGGCGG CGCCGGTTCG GCGGCCTGCC AGGGGTTCAT TACCCAGTTG 2220GGCCGTAACT TCCAGGTGAT CTACGAGCAG GCCAACGCCC ACGGGCAGAA GGTGCAGGCT 2280GCCGGCAACA ACATGGCGCA AACCGACAGC GCCGTCGGCT CCAGCTGGGC CTGACACCAG 2340GCCAAGGCCA GGGACGTGGT GTACGAGTGA AGTTCCTCGC GTGATCCTTC GGGTGGCAGT 2400CTAAGTGGTC AGTGCTGGGG TGTTGGTGGT TTGCTGCTTG GCGGGTTCTT CGGTGCTGGT 2460CAGTGCTGCT CGGGCTCGGG TGAGGACCTC GAGGCCCAGG TAGCGCCGTC CTTCGATCCA 2520TTCGTCGTGT TGTTCGGCGA GGACGGCTCC GACGAGGCGG ATGATCGAGG CGCGGTCGGG 2580GAAGATGCCC ACGACGTCGG TTCGGCGTCG TACCTCTCGG TTGAGGCGTT CCTGGGGGTT 2640GTTGGACCAG ATTTGGCGCC AGATCTGCTT GGGGAAGGCG GTGAACGCCA GCAGGTCGGT 2700GCGGGCGGTG TCGAGGTGCT CGGCCACCGC GGGGAGTTTG TCGGTCAGAG CGTCGAGTAC 2760CCGATCATAT TGGGCAACAA CTGATTCGGC GTCGGGCTGG TCGTAGATGG AGTGCAGCAG 2820GGTGCGCACC CACGGCCAGG AGGGCTTCGG GGTGGCTGCC ATCAGATTGG CTGCGTAGTG 2880GGTTCTGCAG CGCTGCCAGG CCGCTGCGGG CAGGGTGGCG CCGATCGCGG CCACCAGGCC 2940GGCGTGGGCG TCGCTGGTGA CCAGCGCGAC CCCGGACAGG CCGCGGGCGA CCAGGTCGCG 3000GAAGAACGCC AGCCAGCCGG CCCCGTCCTC GGCGGAGGTG ACCTGGATGC CCAGGATC 3058(2)SEQ ID NO:102的信息:
(ⅰ)序列特征:
(A)长度:391氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:102:Met Val Asp Phe Gly Ala Leu Pro Pro Glu Ile Asn Ser Ala Arg Met1 5 10 15Tyr Ala Gly Pro Gly Ser Ala Ser Leu Val Ala Ala Ala Gln Met Trp
20 25 30Asp Ser Val Ala Ser Asp Leu Phe Ser Ala Ala Ser Ala Phe Gln Ser
35 40 45Val Val Trp Gly Leu Thr Val Gly Ser Trp Ile Gly Ser Ser Ala Gly
50 55 60Leu Met Val Ala Ala Ala Ser Pro Tyr Val Ala Trp Met Ser Val Thr65 70 75 80Ala Gly Gln Ala Glu Leu Thr Ala Ala Gln Val Arg Val Ala Ala Ala
85 90 95Ala Tyr Glu Thr Ala Tyr Gly Leu Thr Val Pro Pro Pro Val Ile Ala
100 105 110Glu Asn Arg Ala Glu Leu Met Ile Leu Ile Ala Thr Asn Leu Leu Gly
115 120 125Gln Asn Thr Pro Ala Ile Ala Val Asn Glu Ala Glu Tyr Gly Glu Met
130 135 140Trp Ala Gln Asp Ala Ala Ala Met Phe Gly Tyr Ala Ala Ala Thr Ala145 150 155 160Thr Ala Thr Ala Thr Leu Leu Pro Phe Glu Glu Ala Pro Glu Met Thr
165 170 175Ser Ala Gly Gly Leu Leu Glu Gln Ala Ala Ala Val Glu Glu Ala Ser
180 185 190Asp Thr Ala Ala Ala Asn Gln Leu Met Asn Asn Val Pro Gln Ala Leu
195 200 205Gln Gln Leu Ala Gln Pro Thr Gln Gly Thr Thr Pro Ser Ser Lys Leu
210 215 220Gly Gly Leu Trp Lys Thr Val Ser Pro His Arg Ser Pro Ile Ser Asn225 230 235 240Met Val Ser Met Ala Asn Asn His Met Ser Met Thr Asn Ser Gly Val
245 250 255Ser Met Thr Asn Thr Leu Ser Ser Met Leu Lys Gly Phe Ala Pro Ala
260 265 270Ala Ala Ala Gln Ala Val Gln Thr Ala Ala Gln Asn Gly Val Arg Ala
275 280 285Met Ser Ser Leu Gly Ser Ser Leu Gly Ser Ser Gly Leu Gly Gly Gly
290 295 300Val Ala Ala Asn Leu Gly Arg Ala Ala Ser Val Gly Ser Leu Ser Val305 310 315 320Pro Gln Ala Trp Ala Ala Ala Asn Gln Ala Val Thr Pro Ala Ala Arg
325 330 335Ala Leu Pro Leu Thr Ser Leu Thr Ser Ala Ala Glu Arg Gly Pro Gly
340 345 350Gln Met Leu Gly Gly Leu Pro Val Gly Gln Met Gly Ala Arg Ala Gly
355 360 365Gly Gly Leu Ser Gly Val Leu Arg Val Pro Pro Arg Pro Tyr Val Met
370 375 380Pro His Ser Pro Ala Ala Gly385 390(2)SEQ ID NO:103的信息:
(ⅰ)序列特征:
(A)长度:1725碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:103:GACGTCAGCA CCCGCCGTGC AGGGCTGGAG CGTGGTCGGT TTTGATCTGC GGTCAAGGTG 60ACGTCCCTCG GCGTGTCGCC GGCGTGGATG CAGACTCGAT GCCGCTCTTT AGTGCAACTA 120ATTTCGTTGA AGTGCCTGCG AGGTATAGGA CTTCACGATT GGTTAATGTA GCGTTCACCC 180CGTGTTGGGG TCGATTTGGC CGGACCAGTC GTCACCAACG CTTGGCGTGC GCGCCAGGCG 240GGCGATCAGA TCGCTTGACT ACCAATCAAT CTTGAGCTCC CGGGCCGATG CTCGGGCTAA 300ATGAGGAGGA GCACGCGTGT CTTTCACTGC GCAACCGGAG ATGTTGGCGG CCGCGGCTGG 360CGAACTTCGT TCCCTGGGGG CAACGCTGAA GGCTAGCAAT GCCGCCGCAG CCGTGCCGAC 420GACTGGGGTG GTGCCCCCGG CTGCCGACGA GGTGTCGCTG CTGCTTGCCA CACAATTCCG 480TACGCATGCG GCGACGTATC AGACGGCCAG CGCCAAGGCC GCGGTGATCC ATGAGCAGTT 540TGTGACCACG CTGGCCACCA GCGCTAGTTC ATATGCGGAC ACCGAGGCCG CCAACGCTGT 600GGTCACCGGC TAGCTGACCT GACGGTATTC GAGCGGAAGG ATTATCGAAG TGGTGGATTT 660CGGGGCGTTA CCACCGGAGA TCAACTCCGC GAGGATGTAC GCCGGCCCGG GTTCGGCCTC 720GCTGGTGGCC GCCGCGAAGA TGTGGGACAG CGTGGCGAGT GACCTGTTTT CGGCCGCGTC 780GGCGTTTCAG TCGGTGGTCT GGGGTCTGAC GGTGGGGTCG TGGATAGGTT CGTCGGCGGG 840TCTGATGGCG GCGGCGGCCT CGCCGTATGT GGCGTGGATG AGCGTCACCG CGGGGCAGGC 900CCAGCTGACC GCCGCCCAGG TCCGGGTTGC TGCGGCGGCC TACGAGACAG CGTATAGGCT 960GACGGTGCCC CCGCCGGTGA TCGCCGAGAA CCGTACCGAA CTGATGACGC TGACCGCGAC 1020CAACCTCTTG GGGCAAAACA CGCCGGCGAT CGAGGCCAAT CAGGCCGCAT ACAGCCAGAT 1080GTGGGGCCAA GACGCGGAGG CGATGTATGG CTACGCCGCC ACGGCGGCGA CGGCGACCGA 1140GGCGTTGCTG CCGTTCGAGG ACGCCCCACT GATCACCAAC CCCGGCGGGC TCCTTGAGCA 1200GGCCGTCGCG GTCGAGGAGG CCATCGACAC CGCCGCGGCG AACCAGTTGA TGAACAATGT 1260GCCCCAAGCG CTGCAACAGC TGGCCCAGCC AGCGCAGGGC GTCGTACCTT CTTCCAAGCT 1320GGGTGGGCTG TGGACGGCGG TCTCGCCGCA TCTGTCGCCG CTCAGCAACG TCAGTTCGAT 1380AGCCAACAAC CACATGTCGA TGATGGGCAC GGGTGTGTCG ATGACCAACA CCTTGCACTC 1440GATGTTGAAG GGCTTAGCTC CGGCGGCGGC TCAGGCCGTG GAAACCGCGG CGGAAAACGG 1500GGTCTGGGCG ATGAGCTCGC TGGGCAGCCA GCTGGGTTCG TCGCTGGGTT CTTCGGGTCT 1560GGGCGCTGGG GTGGCCGCCA ACTTGGGTCG GGCGGCCTCG GTCGGTTCGT TGTCGGTGCC 1620GCCAGCATGG GCCGCGGCCA ACCAGGCGGT CACCCCGGCG GCGCGGGCGC TGCCGCTGAC 1680CAGCCTGACC AGCGCCGCCC AAACCGCCCC CGGACACATG CTGGG 1725(2)SEQ ID NO:104的信息:
(ⅰ)序列特征:
(A)长度:359氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:104:Val Val Asp Phe Gly Ala Leu Pro Pro Glu Ile Asn Ser Ala Arg Met1 5 10 15Tyr Ala Gly Pro Gly Ser Ala Ser Leu Val Ala Ala Ala Lys Met Trp
20 25 30Asp Ser Val Ala Ser Asp Leu Phe Ser Ala Ala Ser Ala Phe Gln Ser
35 40 45Val Val Trp Gly Leu Thr Val Gly Ser Trp Ile Gly Ser Ser Ala Gly
50 55 60Leu Met Ala Ala Ala Ala Ser Pro Tyr Val Ala Trp Met Ser Val Thr65 70 75 80Ala Gly Gln Ala Gln Leu Thr Ala Ala Gln Val Arg Val Ala Ala Ala
85 90 95Ala Tyr Glu Thr Ala Tyr Arg Leu Thr Val Pro Pro Pro Val Ile Ala
100 105 110Glu Asn Arg Thr Glu Leu Met Thr Leu Thr Ala Thr Asn Leu Leu Gly
115 120 125Gln Asn Thr Pro Ala Ile Glu Ala Asn Gln Ala Ala Tyr Ser Gln Met
130 135 140Trp Gly Gln Asp Ala Glu Ala Met Tyr Gly Tyr Ala Ala Thr Ala Ala145 150 155 160Thr Ala Thr Glu Ala Leu Leu Pro Phe Glu Asp Ala Pro Leu Ile Thr
165 170 175Asn Pro Gly Gly Leu Leu Glu Gln Ala Val Ala Val Clu Glu Ala Ile
180 185 190Asp Thr Ala Ala Ala Asn Gln Leu Met Asn Asn Val Pro Gln Ala Leu
195 200 205Gln Gln Leu Ala Gln Pro Ala Gln Gly Val Val Pro Ser Ser Lys Leu
210 215 220Gly Gly Leu Trp Thr Ala Val Ser Pro His Leu Ser Pro Leu Ser Asn225 230 235 240Val Ser Ser Ile Ala Asn Asn His Met Ser Met Met Gly Thr Gly Val
245 250 255Ser Met Thr Asn Thr Leu His Ser Met Leu Lys Gly Leu Ala Pro Ala
260 265 270Ala Ala Gln Ala Val Glu Thr Ala Ala Glu Asn Gly Val Trp Ala Met
275 280 285Ser Ser Leu Gly Ser Gln Leu Gly Ser Ser Leu Gly Ser Ser Gly Leu
290 295 300Gly Ala Gly Val Ala Ala Asn Leu Gly Arg Ala Ala Ser Val Gly Ser305 310 315 320Leu Ser Val Pro Pro Ala Trp Ala Ala Ala Asn Gln Ala Val Thr Pro
325 330 335Ala Ala Arg Ala Leu Pro Leu Thr Ser Leu Thr Ser Ala Ala Gln Thr
340 345 350Ala Pro Gly His Met Leu Gly
355(2)SEQ ID NO:105的信息:
(ⅰ)序列特征:
(A)长度:3027碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:105:AGTTCAGTCG AGAATGATAC TGACGGGCTG TATCCACGAT GGCTGAGACA ACCGAACCAC 60CGTCGGACGC GGGGACATCG CAAGCCGACG CGATGGCGTT GGCCGCCGAA GCCGAAGCCG 120CCGAAGCCGA AGCGCTGGCC GCCGCGGCGC GGGCCCGTGC CCGTGCCGCC CGGTTGAAGC 180GTGAGGCGCT GGCGATGGCC CCAGCCGAGG ACGAGAACGT CCCCGAGGAT ATGCAGACTG 240GGAAGACGCC GAAGACTATG ACGACTATGA CGACTATGAG GCCGCAGACC AGGAGGCCGC 300ACGGTCGGCA TCCTGGCGAC GGCGGTTGCG GGTGCGGTTA CCAAGACTGT CCACGATTGC 360CATGGCGGCC GCAGTCGTCA TCATCTGCGG CTTCACCGGG CTCAGCGGAT ACATTGTGTG 420GCAACACCAT GAGGCCACCG AACGCCAGCA GCGCGCCGCG GCGTTCGCCG CCGGAGCCAA 480GCAAGGTGTC ATCAACATGA CCTCGCTGGA CTTCAACAAG GCCAAAGAAG ACGTCGCGCG 540TGTGATCGAC AGCTCCACCG GCGAATTCAG GGATGACTTC CAGCAGCGGG CAGCCGATTT 600CACCAAGGTT GTCGAACAGT CCAAAGTGGT CACCGAAGGC ACGGTGAACG CGACAGCCGT 660CGAATCCATG AACGAGCATT CCGCCGTGGT GCTCGTCGCG GCGACTTCAC GGGTCACCAA 720TTCCGCTGGG GCGAAAGACG AACCACGTGC GTGGCGGCTC AAAGTGACCG TGACCGAAGA 780GGGGGGACAG TACAAGATGT CGAAAGTTGA GTTCGTACCG TGACCGATGA CGTACGCGAC 840GTCAACACCG AAACCACTGA CGCCACCGAA GTCGCTGAGA TCGACTCAGC CGCAGGCGAA 900GCCGGTGATT CGGCGACCGA GGCATTTGAC ACCGACTCTG CAACGGAATC TACCGCGCAG 960AAGGGTCAGC GGCACCGTGA CCTGTGGCGA ATGCAGGTTA CCTTGAAACC CGTTCCGGTG 1020ATTCTCATCC TGCTCATGTT GATCTCTGGG GGCGCGACGG GATGGCTATA CCTTGAGCAA 1080TACGACCCGA TCAGCAGACG GACTCCGGCG CCGCCCGTGC TGCCGTCGCC GCGGCGTCTG 1140ACGGGACAAT CGCGCTGTTG TGTATTCACC CGACACGTCG ACCAAGACTT CGCTACCGCC 1200AGGTCGCACC TCGCCGGCGA TTTCCTGTCC TATACGACCA GTTCACGCAG CAGATCGTGG 1260CTCCGGCGGC CAAACAGAAG TCACTGAAAA CCACCGCCAA GGTGGTGCGC GCGGCCGTGT 1320CGGAGCTACA TCCGGATTCG GCCGTCGTTC TGGTTTTTGT CGACCAGAGC ACTACCAGTA 1380AGGACAGCCC CAATCCGTCG ATGGCGGCCA GCAGCGTGAT GGTGACCCTA GCCAAGGTCG 1440ACGGCAATTG GCTGATCACC AAGTTCACCC CGGTTTAGGT TGCCGTAGGC GGTCGCCAAG 1500TCTGACGGGG GCGCGGGTGG CTGCTCGTGC GAGATACCGG CCGTTCTCCG GACAATCACG 1560GCCCGACCTC AAACAGATCT CGGCCGCTGT CTAATCGGCC GGGTTATTTA AGATTAGTTG 1620CCACTGTATT TACCTGATGT TCAGATTGTT CAGCTGGATT TAGCTTCGCG GCAGGGCGGC 1680TGGTGCACTT TGCATCTGGG GTTGTGACTA CTTGAGAGAA TTTGACCTGT TGCCGACGTT 1740GTTTGCTGTC CATCATTGGT GCTAGTTATG GCCGAGCGGA AGGATTATCG AAGTGGTGGA 1800CTTCGGGGCG TTACCACCGG AGATCAACTC CGCGAGGATG TACGCCGGCC CGGGTTCGGC 1860CTCGCTGGTG GCCGCCGCGA AGATGTGGGA CAGCGTGGCG AGTGACCTGT TTTCGGCCGC 1920GTCGGCGTTT CAGTCGGTGG TCTGGGGTCT GACGACGGGA TCGTGGATAG GTTCGTCGGC 1980GGGTCTGATG GTGGCGGCGG CCTCGCCGTA TGTGGCGTGG ATGAGCGTCA CCGCGGGGCA 2040GGCCGAGCTG ACCGCCGCCC AGGTCCGGGT TGCTGCGGCG GCCTACGAGA CGGCGTATGG 2100GCTGACGGTG CCCCCGCCGG TGATCGCCGA GAACCGTGCT GAACTGATGA TTCTGATAGC 2160GACCAACCTC TTGGGGCAAA ACACCCCGGC GATCGCGGTC AACGAGGCCG AATACGGGGA 2220GATGTGGGCC CAAGACGCCG CCGCGATGTT TGGCTACGCC GCCACGGCGG CGACGGCGAC 2280CGAGGCGTTG CTGCCGTrCG AGGACGCCCC ACTGATCACC AACCCCGGCG GGCTCCTTGA 2340GCAGGCCGTC GCGGTCGAGG AGGCCATCGA CACCGCCGCG GCGAACCAGT TGATGAACAA 2400TGTGCCCCAA GCGCTGCAAC AACTGGCCCA GCCCACGAAA AGCATCTGGC CGTTCGACCA 2460ACTGAGTGAA CTCTGGAAAG CCATCTCGCC GCATCTGTCG CCGCTCAGCA ACATCGTGTC 2520GATGCTCAAC AACCACGTGT CGATGACCAA CTCGGGTGTG TCGATGGCCA GCACCTTGCA 2580CTCAATGTTG AAGGGCTTTG CTCCGGCGGC GGCTCAGGCC GTGGAAACCG CGGCGCAAAA 2640CGGGGTCCAG GCGATGAGCT CGCTGGGCAG CCAGCTGGGT TCGTCGCTGG GTTCTTCGGG 2700TCTGGGCGCT GGGGTGGCCG CCAACTTGGG TCGGGCGGCC TCGGTCGGTT CGTTGTCGGT 2760GCCGCAGGCC TGGGCCGCGG CCAACCAGGC GGTCACCCCG GCGGCGCGGG CGCTGCCGCT 2820GACCAGCCTG ACCAGCGCCG CCCAAACCGC CCCCGGACAC ATGCTGGGCG GGCTACCGCT 2880GGGGCAACTG ACCAATAGCG GCGGCGGGTT CGGCGGGGTT AGCAATGCGT TGCGGATGCC 2940GCCGCGGGCG TACGTAATGC CCCGTGTGCC CGCCGCCGGG TAACGCCGAT CCGCACGCAA 3000TGCGGGCCCT CTATGCGGGC AGCGATC 3027(2)SEQ ID NO:106的信息:
(ⅰ)序列特征:
(A)长度:396氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:106:Val Val Asp Phe Gly Ala Leu Pro Pro Glu Ile Asn Ser Ala Arg Met1 5 10 15Tyr Ala Gly Pro Gly Ser Ala Ser Leu Val Ala Ala Ala Lys Met Trp
20 25 30Asp Ser Val Ala Ser Asp Leu Phe Ser Ala Ala Ser Ala Phe Gln Ser
35 40 45Val Val Trp Gly Leu Thr Thr Gly Ser Trp Ile Gly Ser Ser Ala Gly
50 55 60Leu Met Val Ala Ala Ala Ser Pro Tyr Val Ala Trp Met Ser Val Thr65 70 75 80Ala Gly Gln Ala Glu Leu Thr Ala Ala Gln Val Arg Val Ala Ala Ala
85 90 95Ala Tyr Glu Thr Ala Tyr Gly Leu Thr Val Pro Pro Pro Val Ile Ala
100 105 110Glu Asn Arg Ala Glu Leu Met Ile Leu Ile Ala Thr Asn Leu Leu Gly
115 120 125Gln Asn Thr Pro Ala Ile Ala Val Asn Glu Ala Glu Tyr Gly Glu Met
130 135 140Trp Ala Gln Asp Ala Ala Ala Met Phe Gly Tyr Ala Ala Thr Ala Ala145 150 155 160Thr Ala Thr Glu Ala Leu Leu Pro Phe Glu Asp Ala Pro Leu Ile Thr
165 170 175Asn Pro Gly Gly Leu Leu Glu Gln Ala Val Ala Val Glu Glu Ala Ile
180 185 190Asp Thr Ala Ala Ala Asn Gln Leu Met Asn Asn Val Pro Gln Ala Leu
195 200 205Gln Gln Leu Ala Gln Pro Thr Lys Ser Ile Trp Pro Phe Asp Gln Leu
210 215 220Ser Glu Leu Trp Lys Ala Ile Ser Pro His Leu Ser Pro Leu Ser Asn225 230 235 240Ile Val Ser Met Leu Asn Asn His Val Ser Met Thr Asn Ser Gly Val
245 250 255Ser Met Ala Ser Thr Leu His Ser Met Leu Lys Gly Phe Ala Pro Ala
260 265 270Ala Ala Gln Ala Val Glu Thr Ala Ala Gln Asn Gly Val Gln Ala Met
275 280 285Ser Ser Leu Gly Ser Gln Leu Gly Ser Ser Leu Gly Ser Ser Gly Leu
290 295 300Gly Ala Gly Val Ala Ala Asn Leu Gly Arg Ala Ala Ser Val Gly Ser305 310 315 320Leu Ser Val Pro Gln Ala Trp Ala Ala Ala Asn Gln Ala Val Thr Pro
325 330 335Ala Ala Arg Ala Leu Pro Leu Thr Ser Leu Thr Ser Ala Ala Gln Thr
340 345 350Ala Pro Gly His Met Leu Gly Gly Leu Pro Leu Gly Gln Leu Thr Asn
355 360 365Ser Gly Gly Gly Phe Gly Gly Val Ser Asn Ala Leu Arg Met Pro Pro
370 375 380Arg Ala Tyr Val Met Pro Arg Val Pro Ala Ala Gly385 390 395(2)SEQ ID NO:107的信息:
(ⅰ)序列特征:
(A)长度:1616碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:107:CATCGGAGGG AGTGATCACC ATGCTGTGGC ACGCAATGCC ACCGGAGTAA ATACCGCACG 60GCTGATGGCC GGCGCGGGTC CGGCTCCAAT GCTTGCGGCG GCCGCGGGAT GGCAGACGCT 120TTCGGCGGCT CTGGACGCTC AGGCCGTCGA GTTGACCGCG CGCCTGAACT CTCTGGGAGA 180AGCCTGGACT GGAGGTGGCA GCGACAAGGC GCTTGCGGCT GCAACGCCGA TGGTGGTCTG 240GCTACAAACC GCGTCAACAC AGGCCAAGAC CCGTGCGATG CAGGCGACGG CGCAAGCCGC 300GGCATACACC CAGGCCATGG CCACGACGCC GTCGCTGCCG GAGATCGCCG CCAACCACAT 360CACCCAGGCC GTCCTTACGG CCACCAACTT CTTCGGTATC AACACGATCC CGATCGCGTT 420GACCGAGATG GATTATTTCA TCCGTATGTG GAACCAGGCA GCCCTGGCAA TGGAGGTCTA 480CCAGGCCGAG ACCGCGGTTA ACACGCTTTT CGAGAAGCTC GAGCCGATGG CGTCGATCCT 540TGATCCCGGC GCGAGCCAGA GCACGACGAA CCCGATCTTC GGAATGCCCT CCCCTGGCAG 600CTCAACACCG GTTGGCCAGT TGCCGCCGGC GGCTACCCAG ACCCTCGGCC AACTGGGTGA 660GATGAGCGGC CCGATGCAGC AGCTGACCCA GCCGCTGCAG CAGGTGACGT CGTTGTTCAG 720CCAGGTGGGC GGCACCGGCG GCGGCAACCC AGCCGACGAG GAAGCCGCGC AGATGGGCCT 780GCTCGGCACC AGTCCGCTGT CGAACCATCC GCTGGCTGGT GGATCAGGCC CCAGCGCGGG 840CGCGGGCCTG CTGCGCGCGG AGTCGCTACC TGGCGCAGGT GGGTCGTTGA CCCGCACGCC 900GCTGATGTCT CAGCTGATCG AAAAGCCGGT TGCCCCCTCG GTGATGCCGG CGGCTGCTGC 960CGGATCGTCG GCGACGGGTG GCGCCGCTCC GGTGGGTGCG GGAGCGATGG GCCAGGGTGC 1020GCAATCCGGC GGCTCCACCA GGCCGGGTCT GGTCGCGCCG GCACCGCTCG CGCAGGAGCG 1080TGAAGAAGAC GACGAGGACG ACTGGGACGA AGAGGACGAC TGGTGAGCTC CCGTAATGAC 1140AACAGACTTC CCGGCCACCC GGGCCGGAAG ACTTGCCAAC ATTTTGGCGA GGAAGGTAAA 1200GAGAGAAAGT AGTCCAGCAT GGCAGAGATG AAGACCGATG CCGCTACCCT CGCGCAGGAG 1260GCAGGTAATT TCGAGCGGAT CTCCGGCGAC CTGAAAACCC AGATCGACCA GGTGGAGTCG 1320ACGGCAGGTT CGTTGCAGGG CCAGTGGCGC GGCGCGGCGG GGACGGCCGC CCAGGCCGCG 1380GTGGTGCGCT TCCAAGAAGC AGCCAATAAG CAGAAGCAGG AACTCGACGA GATCTCGACG 1440AATATTCGTC AGGCCGGCGT CCAATACTCG AGGGCCGACG AGGAGCAGCA GCAGGCGCTG 1500TCCTCGCAAA TGGGCTTCTG ACCCGCTAAT ACGAAAAGAA ACGGAGCAAA AACATGACAG 1560AGCAGCAGTG GAATTTCGCG GGTATCGAGG CCGCGGCAAG CGCAATCCAG GGAAAT 1616(2)SEQ ID NO:108的信息:
(ⅰ)序列特征:
(A)长度:432碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:108:CTAGTGGATG GGACCATGGC CATTTTCTGC AGTCTCACTG CCTTCTGTGT TGACATTTTG 60GCACGCCGGC GGAAACGAAG CACTGGGGTC GAAGAACGGC TGCGCTGCCA TATCGTCCGG 120AGCTTCCATA CCTTCGTGCG GCCGGAAGAG CTTGTCGTAG TCGGCCGCCA TGACAACCTC 180TCAGAGTGCG CTCAAACGTA TAAACACGAG AAAGGGCGAG ACCGACGGAA GGTCGAACTC 240GCCCGATCCC GTGTTTCGCT ATTCTACGCG AACTCGGCGT TGCCCTATGC GAACATCCCA 300GTGACGTTGC CTTCGGTCGA AGCCATTGCC TGACCGGCTT CGCTGATCGT CCGCGCCAGG 360TTCTGCAGCG CGTTGTTCAG CTCGGTAGCC GTGGCGTCCC ATTTTTGCTG GACACCCTGG 420TACGCCTCCG AA 432(2)SEQ ID NO:109的信息:
(ⅰ)序列特征:
(A)长度:368氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:109:Met Leu Trp His Ala Met Pro Pro Glu Xaa Asn Thr Ala Arg Leu Met1 5 10 15Ala Gly Ala Gly Pro Ala Pro Met Leu Ala Ala Ala Ala Gly Trp Gln
20 25 30Thr Leu Ser Ala Ala Leu Asp Ala Gln Ala Val Glu Leu Thr Ala Arg
35 40 45Leu Asn Ser Leu Gly Glu Ala Trp Thr Gly Gly Gly Ser Asp Lys Ala
50 55 60Leu Ala Ala Ala Thr Pro Met Val Val Trp Leu Gln Thr Ala Ser Thr65 70 75 80Gln Ala Lys Thr Arg Ala Met Gln Ala Thr Ala Gln Ala Ala Ala Tyr
85 90 95Thr Gln Ala Met Ala Thr Thr Pro Ser Leu Pro Glu Ile Ala Ala Asn
100 105 110His Ile Thr Gln Ala Val Leu Thr Ala Thr Asn Phe Phe Gly Ile Asn
115 120 125Thr Ile Pro Ile Ala Leu Thr Glu Met Asp Tyr Phe Ile Arg Met Trp
130 135 140Asn Gln Ala Ala Leu Ala Met Glu Val Tyr Gln Ala Glu Thr Ala Val145 150 155 160Asn Thr Leu Phe Glu Lys Leu Glu Pro Met Ala Ser Ile Leu Asp Pro
165 170 175Gly Ala Ser Gln Ser Thr Thr Asn Pro Ile Phe Gly Met Pro Ser Pro
180 185 190Gly Ser Ser Thr Pro Val Gly Gln Leu Pro Pro Ala Ala Thr Gln Thr
195 200 205Leu Gly Gln Leu Gly Glu Met Ser Gly Pro Met Gln Gln Leu Thr Gln
210 215 220Pro Leu Gln Gln Val Thr Ser Leu Phe Ser Gln Val Gly Gly Thr Gly225 230 235 240Gly Gly Asn Pro Ala Asp Glu Glu Ala Ala Gln Met Gly Leu Leu Gly
245 250 255Thr Ser Pro Leu Ser Asn His Pro Leu Ala Gly Gly Ser Gly Pro Ser
260 265 270Ala Gly Ala Gly Leu Leu Arg Ala Glu Ser Leu Pro Gly Ala Gly Gly
275 280 285Ser Leu Thr Arg Thr Pro Leu Met Ser Gln Leu Ile Glu Lys Pro Val
290 295 300Ala Pro Ser Val Met Pro Ala Ala Ala Ala Gly Ser Ser Ala Thr Gly305 310 315 320Gly Ala Ala Pro Val Gly Ala Gly Ala Met Gly Gln Gly Ala Gln Ser
325 330 335Gly Gly Ser Thr Arg Pro Gly Leu Val Ala Pro Ala Pro Leu Ala Gln
340 345 350Glu Arg Glu Glu Asp Asp Glu Asp Asp Trp Asp Glu Glu Asp Asp Trp
355 360 365(2)SEQ ID NO:110的信息:
(ⅰ)序列特征:
(A)长度:100氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:110:Met Ala Glu Met Lys Thr Asp Ala Ala Thr Leu Ala Gln Glu Ala Gly1 5 10 15Asn Phe Glu Arg Ile Ser Gly Asp Leu Lys Thr Gln Ile Asp Gln Val
20 25 30Glu Ser Thr Ala Gly Ser Leu Gln Gly Gln Trp Arg Gly Ala Ala Gly
35 40 45Thr Ala Ala Gln Ala Ala Val Val Arg Phe Gln Glu Ala Ala Asn Lys
50 55 60Gln Lys Gln Glu Leu Asp Glu Ile Ser Thr Asn Ile Arg Gln Ala Gly65 70 75 80Val Gln Tyr Ser Arg Ala Asp Glu Glu Gln Gln Gln Ala Leu Ser Ser
85 90 95Gln Met Gly Phe
100(2)SEQ ID NO:111的信息:
(ⅰ)序列特征:
(A)长度:396碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:111:GATCTCCGGC GACCTGAAAA CCCAGATCGA CCAGGTGGAG TCGACGGCAG GTTCGTTGCA 60GGGCCAGTGG CGCGGCGCGG CGGGGACGGC CGCCCAGGCC GCGGTGGTGC GCTTCCAAGA 120AGCAGCCAAT AAGCAGAAGC AGGAACTCGA CGAGATCTCG ACGAATATTC GTCAGGCCGG 180CGTCCAATAC TCGAGGGCCG ACGAGGAGCA GCAGCAGGCG CTGTCCTCGC AAATGGGCTT 240CTGACCCGCT AATACGAAAA GAAACGGAGC AAAAACATGA CAGAGCAGCA GTGGAATTTC 300GCGGGTATCG AGGCCGCGGC AAGCGCAATC CAGGGAAATG TCACGTCCAT TCATTCCCTC 360CTTGACGAGG GGAAGCAGTC CCTGACCAAG CTCGCA 396(2)SEQ ID NO:112的信息:
(ⅰ)序列特征:
(A)长度:80氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:112:Ile Ser Gly Asp Leu Lys Thr Gln Ile Asp Gln Val Glu Ser Thr Ala1 5 10 15Gly Ser Leu Gln Gly Gln Trp Arg Gly Ala Ala Gly Thr Ala Ala Gln
20 25 30Ala Ala Val Val Arg Phe Gln Glu Ala Ala Asn Lys Gln Lys Gln Glu
35 40 45Leu Asp Glu Ile Ser Thr Asn Ile Arg Gln Ala Gly Val Gln Tyr Ser
50 55 60Arg Ala Asp Glu Glu Gln Gln Gln Ala Leu Ser Ser Gln Met Gly Phe65 70 75 80(2)SEQ ID NO:113的信息:
(ⅰ)序列特征:
(A)长度:387碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:113:GTGGATCCCG ATCCCGTGTT TCGCTATTCT ACGCGAACTC GGCGTTGCCC TATGCGAACA 60TCCCAGTGAC GTTGCCTTCG GTCGAAGCCA TTGCCTGACC GGCTTCGCTG ATCGTCCGCG 120CCAGGTTCTG CAGCGCGTTG TTCAGCTCGG TAGCCGTGGC GTCCCATTTT TGCTGGACAC 180CCTGGTACGC CTCCGAACCG CTACCGCCCC AGGCCGCTGC GAGCTTGGTC AGGGACTGCT 240TCCCCTCGTC AAGGAGGGAA TGAATGGACG TGACATTTCC CTGGATTGCG CTTGCCGCGG 300CCTCGATACC CGCGAAATTC CACTGCTGCT CTGTCATGTT TTTGCTCCGT TTCTTTTCGT 360ATTAGCGGGT CAGAAGCCCA TTTGCGA 387(2)SEQ ID NO:114的信息:
(ⅰ)序列特征:
(A)长度:272碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:114:CGGCACGAGG ATCTCGGTTG GCCCAACGGC GCTGGCGAGG GCTCCGTTCC GGGGGCGAGC 60TGCGCGCCGG ATGCTTCCTC TGCCCGCAGC CGCGCCTGGA TGGATGGACC AGTTGCTACC 120TTCCCGACGT TTCGTTCGGT GTCTGTGCGA TAGCGGTGAC CCCGGCGCGC ACGTCGGGAG 180TGTTGGGGGG CAGGCCGGGT CGGTGGTTCG GCCGGGGACG CAGACGGTCT GGACGGAACG 240GGCGGGGGTT CGCCGATTGG CATCTTTGCC CA 272(2)SEQ ID NO:115的信息:
(ⅰ)序列特征:
(A)长度:20氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:115:
Asp Pro Val Asp Ala Val Ile Asn Thr Thr Cys Asn Tyr Gly Gln Val
1 5 10 15
Val Ala Ala Leu
20(2)SEQ ID NO:116的信息:
(ⅰ)序列特征:
(A)长度:15氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:116:
Ala Val Glu Ser Gly Met Leu Ala Leu Gly Thr Pro Ala Pro Ser
1 5 10 15(2)SEQ ID NO:117的信息:
(ⅰ)序列特征:
(A)长度:19氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:117:
Ala Ala Met Lys Pro Arg Thr Gly Asp Gly Pro Leu Glu Ala Ala Lys
1 5 10 15
Glu Gly Arg(2)SEQ ID NO:118的信息:
(ⅰ)序列特征:
(A)长度:15氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:118:
Tyr Tyr Trp Cys Pro Gly Gln Pro Phe Asp Pro Ala Trp Gly Pro
1 5 10 15(2)SEQ ID NO:119的信息:
(ⅰ)序列特征:
(A)长度:14氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:119:
Asp Ile Gly Ser Glu Ser Thr Glu Asp Gln Gln Xaa Ala Val
1 5 10(2)SEQ ID NO:120的信息:
(ⅰ)序列特征:
(A)长度:13氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:120:
Ala Glu Glu Ser Ile Ser Thr Xaa Glu Xaa Ile Val Pro
1 5 10(2)SEQ ID NO:121的信息:
(ⅰ)序列特征:
(A)长度:17氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:121:
Asp Pro Glu Pro Ala Pro Pro Val Pro Thr Thr Ala Ala Ser Pro Pro
1 5 10 15
Ser(2)SEQ ID NO:122的信息:
(ⅰ)序列特征:
(A)长度:15氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:122:
Ala Pro Lys Thr Tyr Xaa Glu Glu Leu Lys Gly Thr Asp Thr Gly
1 5 10 15(2)SEQ ID NO:123的信息:
(ⅰ)序列特征:
(A)长度:30氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:123:
Asp Pro Ala Ser Ala Pro Asp Val Pro Thr Ala Ala Gln Leu Thr Ser
1 5 10 15
Leu Leu Asn Ser Leu Ala Asp Pro Asn Val Ser Phe Ala Asn
20 25 30(2)SEQ ID NO:124的信息:
(ⅰ)序列特征:
(A)长度:22氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:124:
Asp Pro Pro Asp Pro His Gln Xaa Asp Met Thr Lys Gly Tyr Tyr Pro
1 5 10 15
Gly Gly Arg Arg Xaa Phe
20(2)SEQ ID NO:125的信息:
(ⅰ)序列特征:
(A)长度:7氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:125:
Asp Pro Gly Tyr Thr Pro Gly
1 5(2)SEQ ID NO:126的信息:
(ⅰ)序列特征:
(A)长度:10氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅸ)特征:
(D)其它信息:/注意=″第二个残基可以是Pro或Thr″
(ⅹⅰ)序列描述:SEQ ID NO:126:
Xaa Xaa Gly Phe Thr Gly Pro Gln Phe Tyr
1 5 10(2)SEQ ID NO:127的信息:
(ⅰ)序列特征:
(A)长度:9氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅸ)特征:
(D)其它信息:/注意=″第三个残基可以是Gln或Leu″
(ⅹⅰ)序列描述:SEQ ID NO:127:
Xaa Pro Xaa Val Thr Ala Tyr Ala Gly
1 5(2)SEQ ID NO:128的信息:
(ⅰ)序列特征:
(A)长度:9氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:128:
Xaa Xaa Xaa Glu Lys Pro Phe Leu Arg
1 5(2)SEQ ID NO:129的信息:
(ⅰ)序列特征:
(A)长度:15氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:129:
Xaa Asp Ser Glu Lys Ser Ala Thr Ile Lys Val Thr Asp Ala Ser
1 5 10 15(2)SEQ ID NO:130的信息:
(ⅰ)序列特征:
(A)长度:15氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:130:
Ala Gly Asp Thr Xaa Ile Tyr Ile Val Gly Asn Leu Thr Ala Asp
1 5 10 15(2)SEQ ID NO:131的信息:
(ⅰ)序列特征:
(A)长度:15氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:131:
Ala Pro Glu Ser Gly Ala Gly Leu Gly Gly Thr Val Gln Ala Gly
1 5 10 15(2)SEQ ID NO:132的信息:
(ⅰ)序列特征:
(A)长度:21氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:132:
Xaa Tyr Ile Ala Tyr Xaa Thr Thr Ala Gly Ile Val Pro Gly Lys Ile
1 5 10 15
Asn Val His Leu Val
20(2)SEQ ID NO:133的信息:
(ⅰ)序列特征:
(A)长度:882碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:DNA(基因组)
(ⅹⅰ)序列描述:SEQ ID NO:133:GCAACGCTGT CGTGGCCTTT GCGGTGATCG GTTTCGCCTC GCTGGCGGTG GCGGTGGCGG 60TCACCATCCG ACCGACCGCG GCCTCAAAAC CGGTAGAGGG ACACCAAAAC GCCCAGCCAG 120GGAAGTTCAT GCCGTTGTTG CCGACGCAAC AGCAGGCGCC GGTCCCGCCG CCTCCGCCCG 180ATGATCCCAC CGCTGGATTC CAGGGCGGCA CCATTCCGGC TGTACAGAAC GTGGTGCCGC 240GGCCGGGTAC CTCACCCGGG GTGGGTGGGA CGCCGGCTTC GCCTGCGCCG GAAGCGCCGG 300CCGTGCCCGG TGTTGTGCCT GCCCCGGTGC CAATCCCGGT CCCGATCATC ATTCCCCCGT 360TCCCGGGTTG GCAGCCTGGA ATGCCGACCA TCCCCACCGC ACCGCCGACG ACGCCGGTGA 420CCACGTCGGC GACGACGCCG CCGACCACGC CGCCGACCAC GCCGGTGACC ACGCCGCCAA 480CGACGCCGCC GACCACGCCG GTGACCACGC CGCCAACGAC GCCGCCGACC ACGCCGGTGA 540CCACGCCACC AACGACCGTC GCCCCGACGA CCGTCGCCCC GACGACGGTC GCTCCGACCA 600CCGTCGCCCC GACCACGGTC GCTCCAGCCA CCGCCACGCC GACGACCGTC GCTCCGCAGC 660CGACGCAGCA GCCCACGCAA CAACCAACCC AACAGATGCC AACCCAGCAG CAGACCGTGG 720CCCCGCAGAC GGTGGCGCCG GCTCCGCAGC CGCCGTCCGG TGGCCGCAAC GGCAGCGGCG 780GGGGCGACTT ATTCGGCGGG TTCTGATCAC GGTCGCGGCT TCACTACGGT CGGAGGACAT 840GGCCGGTGAT GCGGTGACGG TGGTGCTGCC CTGTCTCAAC GA 882(2) SEQ ID NO:134的信息:
(ⅰ)序列特征:
(A)长度:815碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:DNA(基因组)
(ⅹⅰ)序列描述:SEQ ID NO:134:CCATCAACCA ACCGCTCGCG CCGCCCGCGC CGCCGGATCC GCCGTCGCCG CCACGCCCGC 60CGGTGCCTCC GGTGCCCCCG TTGCCGCCGT CGCCGCCGTC GCCGCCGACC GGCTGGGTGC 120CTAGGGCGCT GTTACCGCCC TGGTTGGCGG GGACGCCGCC GGCACCACCG GTACCGCCGA 180TGGCGCCGTT GCCGCCGGCG GCACCGTTGC CACCGTTGCC ACCGTTGCCA CCGTTGCCGA 240CCAGCCACCC GCCGCGACCA CCGGCACCGC CGGCGCCGCC CGCACCGCCG GCGTGCCCGT 300TCGTGCCCGT ACCGCCGGCA CCGCCGTTGC CGCCGTCACC GCCGACGGAA CTACCGGCGG 360ACGCGGCCTG CCCGCCGGCG CCGCCCGCAC CGCCATTGGC ACCGCCGTCA CCGCCGGCTG 420GGAGTGCCGC GATTAGGGCA CTGACCGGCG CAACCAGCGC AAGTACTCTC GGTCACCGAG 480CACTTCCAGA CGACACCACA GCACGGGGTT GTCGGCGGAC TGGGTGAAAT GGCAGCCGAT 540AGCGGCTAGC TGTCGGCTGC GGTCAACCTC GATCATGATG TCGAGGTGAC CGTGACCGCG 600CCCCCCGAAG GAGGCGCTGA ACTCGGCGTT GAGCCGATCG GCGATCGGTT GGGGCAGTGC 660CCAGGCCAAT ACGGGGATAC CGGGTGTCNA AGCCGCCGCG AGCGCAGCTT CGGTTGCGCG 720ACNGTGGTCG GGGTGGCCTG TTACGCCGTT GTCNTCGAAC ACGAGTAGCA GGTCTGCTCC 780GGCGAGGGCA TCCACCACGC GTTGCGTCAG CTCGT 815(2)SEQ ID NO:135的信息:
(ⅰ)序列特征:
(A)长度:1152碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:DNA(基因组)
(ⅹⅰ)序列描述:SEQ ID NO:135:ACCAGCCGCC GGCTGAGGTC TCAGATCAGA GAGTCTCCGG ACTCACCGGG GCGGTTCAGC 60CTTCTCCCAG AACAACTGCT GAAGATCCTC GCCCGCGAAA CAGGCGCTGA TTTGACGCTC 120TATGACCGGT TGAACGACGA GATCATCCGG CAGATTGATA TGGCACCGCT GGGCTAACAG 180GTGCGCAAGA TGGTGCAGCT GTATGTCTCG GACTCCGTGT CGCGGATCAG CTTTGCCGAC 240GGCCGGGTGA TCGTGTGGAG CGAGGAGCTC GGCGAGAGCC AGTATCCGAT CGAGACGCTG 300GACGGCATCA CGCTGTTTGG GCGGCCGACG ATGACAACGC CCTTCATCGT TGAGATGCTC 360AAGCGTGAGC GCGACATCCA GCTCTTCACG ACCGACGGCC ACTACCAGGG CCGGATCTCA 420ACACCCGACG TGTCATACGC GCCGCGGCTC CGTCAGCAAG TTCACCGCAC CGACGATCCT 480GCGTTCTGCC TGTCGTTAAG CAAGCGGATC GTGTCGAGGA AGATCCTGAA TCAGCAGGCC 540TTGATTCGGG CACACACGTC GGGGCAAGAC GTTGCTGAGA GCATCCGCAC GATGAAGCAC 600TCGCTGGCCT GGGTCGATCG ATCGGGCTCC CTGGCGGAGT TGAACGGGTT CGAGGGAAAT 660GCCGCAAAGG CATACTTCAC CGCGCTGGGG CATCTCGTCC CGCAGGAGTT CGCATTCCAG 720GGCCGCTCGA CTCGGCCGCC GTTGGACGCC TTCAACTCGA TGGTCAGCCT CGGCTATTCG 780CTGCTGTACA AGAACATCAT AGGGGCGATC GAGCGTCACA GCCTGAACGC GTATATCGGT 840TTCCTACACC AGGATTCACG AGGGCACGCA ACGTCTCGTG CCGAATTCGG CACGAGCTCC 900GCTGAAACCG CTGGCCGGCT GCTCAGTGCC CGTACGTAAT CCGCTGCGCC CAGGCCGGCC 960CGCCGGCCGA ATACCAGCAG ATCGGACAGC GAATTGCCGC CCAGCCGGTT GGAGCCGTGC 1020ATACCGCCGG CACACTCACC GGCAGCGAAC AGGCCTGGCA CCGTGGCGGC GCCGGTGTCC 1080GCGTCTACTT CGACACCGCC CATCACGTAG TGACACGTCG GCCCGACTTC CATTGCCTGC 1140GTTCGGCACG AG 1152(2)SEQ ID NO:136的信息:
(ⅰ)序列特征:
(A)长度:655碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:DNA(基因组)
(ⅹⅰ)序列描述:SEQ ID NO:136:CTCGTGCCGA TTCGGCAGGG TGTACTTGCC GGTGGTGTAN GCCGCATGAG TGCCGACGAC 60CAGCAATGCG GCAACAGCAC GGATCCCGGT CAACGACGCC ACCCGGTCCA CGTGGGCGAT 120CCGCTCGAGT CCGCCCTGGG CGGCTCTTTC CTTGGGCAGG GTCATCCGAC GTGTTTCCGC 180CGTGGTTTGC CGCCATTATG CCGGCGCGCC GCGTCGGGCG GCCGGTATGG CCGAANGTCG 240ATCAGCACAC CCGAGATACG GGTCTGTGCA AGCTTTTTGA GCGTCGCGCG GGGCAGCTTC 300GCCGGCAATT CTACTAGCGA GAAGTCTGGC CCGATACGGA TCTGACCGAA GTCGCTGCGG 360TGCAGCCCAC CCTCATTGGC GATGGCGCCG ACGATGGCGC CTGGACCGAT CTTGTGCCGC 420TTGCCGACGG CGACGCGGTA GGTGGTCAAG TCCGGTCTAC GCTTGGGCCT TTGCGGACGG 480TCCCGACGCT GGTCGCGGTT GCGCCGCGAA AGCGGCGGGT CGGGTGCCAT CAGGAATGCC 540TCACCGCCGC GGCACTGCAC GGCCAGTGCC GCGGCGATGT CAGCCATCGG GACATCATGC 600TCGCGTTCAT ACTCCTCGAC CAGTCGGCGG AACAGCTCGA TTCCCGGACC GCCCA 655(2)SEQ ID NO:137的信息:
(ⅰ)序列特征:
(A)长度:267氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:肽(ⅹⅰ)序列描述:SEQ ID NO:137:Ash Ala Val Val Ala Phe Ala Val Ile Gly Phe Ala Ser Leu Ala Val1 5 10 15Ala Val Ala Val Thr Ile Arg Pro Thr Ala Ala Ser Lys Pro Val Glu
20 25 30Gly His Gln Asn Ala Gln Pro Gly Lys Phe Met Pro Leu Leu Pro Thr
35 40 45Gln Gln Gln Ala Pro Val Pro Pro Pro Pro Pro Asp Asp Pro Thr Ala
50 55 60Gly Phe Gln Gly Gly Thr Ile Pro Ala Val Gln Asn Val Val Pro Arg65 70 75 80Pro Gly Thr Ser Pro Gly Val Gly Gly Thr Pro Ala Ser Pro Ala Pro
85 90 95Glu Ala Pro Ala Val Pro Gly Val Val Pro Ala Pro Val Pro Ile Pro
100 105 110Val Pro Ile Ile Ile Pro Pro Phe Pro Gly Trp Gln Pro Gly Met Pro
115 120 125Thr Ile Pro Thr Ala Pro Pro Thr Thr Pro Val Thr Thr Ser Ala Thr
130 135 140Thr Pro Pro Thr Thr Pro Pro Thr Thr Pro Val Thr Thr Pro Pro Thr145 150 155 160Thr Pro Pro Thr Thr Pro Val Thr Thr Pro Pro Thr Thr Pro Pro Thr
165 170 175Thr Pro Val Thr Thr Pro Pro Thr Thr Val Ala Pro Thr Thr Val Ala
180 185 190Pro Thr Thr Val Ala Pro Thr Thr Val Ala Pro Thr Thr Val Ala Pro
195 200 205Ala Thr Ala Thr Pro Thr Thr Val Ala Pro Gln Pro Thr Gln Gln Pro
210 215 220Thr Gln Gln Pro Thr Gln Gln Met Pro Thr Gln Gln Gln Thr Val Ala225 230 235 240Pro Gln Thr Val Ala Pro Ala Pro Gln Pro Pro Ser Gly Gly Arg Asn
245 250 255Gly Ser Gly Gly Gly Asp Leu Phe Gly Gly Phe
260 265(2)SEQ ID NO:138的信息:
(ⅰ)序列特征:
(A)长度:174氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:肽(ⅹⅰ)序列描述:SEQ ID NO:138:Ile Asn Gln Pro Leu Ala Pro Pro Ala Pro Pro Asp Pro Pro Ser Pro1 5 10 15Pro Arg Pro Pro Val Pro Pro Val Pro Pro Leu Pro Pro Ser Pro Pro
20 25 30Ser Pro Pro Thr Gly Trp Val Pro Arg Ala Leu Leu Pro Pro Trp Leu
35 40 45Ala Gly Thr Pro Pro Ala Pro Pro Val Pro Pro Met Ala Pro Leu Pro
50 55 60Pro Ala Ala Pro Leu Pro Pro Leu Pro Pro Leu Pro Pro Leu Pro Thr65 70 75 80Ser His Pro Pro Arg Pro Pro Ala Pro Pro Ala Pro Pro Ala Pro Pro
85 90 95Ala Cys Pro Phe Val Pro Val Pro Pro Ala Pro Pro Leu Pro Pro Ser
100 105 110Pro Pro Thr Glu Leu Pro Ala Asp Ala Ala Cys Pro Pro Ala Pro Pro
115 120 125Ala Pro Pro Leu Ala Pro Pro Ser Pro Pro Ala Gly Ser Ala Ala Ile
130 135 140Arg Ala Leu Thr Gly Ala Thr Ser Ala Ser Thr Leu Gly His Arg Ala145 150 155 160Leu Pro Asp Asp Thr Thr Ala Arg Gly Cys Arg Arg Thr Gly
165 170(2)SEQ ID NO:139的信息:
(ⅰ)序列特征:
(A)长度:35氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:肽(ⅹⅰ)序列描述:SEQ ID NO:139:Gln Pro Pro Ala Glu Val Ser Asp Gln Arg Val Ser Gly Leu Thr Gly1 5 10 15Ala Val Gln Pro Ser Pro Arg Thr Thr Ala Glu Asp Pro Arg Pro Arg
20 25 30Asn Arg Arg
35(2)SEQ ID NO:140的信息:
(ⅰ)序列特征:
(A)长度:104氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:肽(ⅹⅰ)序列描述:SEQ ID NO:140:Arg Ala Asp Ser Ala Gly Cys Thr Cys Arg Trp Cys Xaa Pro His Glu1 5 10 15Cys Arg Arg Pro Ala Met Arg Gln Gln His Gly Ser Arg Ser Thr Thr
20 25 30Pro Pro Gly Pro Arg Gly Arg Ser Ala Arg Val Arg Pro Gly Arg Leu
35 40 45Phe Pro Trp Ala Gly Ser Ser Asp Val Phe Pro Pro Trp Phe Ala Ala
50 55 60Ile Met Pro Ala Arg Arg Val Gly Arg Pro Val Trp Pro Xaa Val Asp65 70 75 80Gln His Thr Arg Asp Thr Gly Leu Cys Lys Leu Phe Glu Arg Arg Ala
85 90 95Gly Gln Leu Arg Arg Gln Phe Tyr
100(2)SEQ ID NO:141的信息:
(ⅰ)序列特征:
(A)长度:53碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:其它核酸
(A)描述:/描述=″PCR引物″
(ⅵ)最初来源:
(A)生物体:结核分枝杆菌
(ⅹⅰ)序列描述:SEQ ID NO:141:GGATCCATAT GGGCCATCAT CATCATCATC ACGTGATCGA CATCATCGGG ACC 53(2)SEQ ID NO:142的信息:
(ⅰ)序列特征:
(A)长度:42碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:其它核酸
(A)描述:/描述=″PCR引物″
(ⅵ)最初来源:
(A)生物体:结核分枝杆菌
(ⅹⅰ)序列描述:SEQ ID NO:142:CCTGAATTCA GGCCTCGGTT GCGCCGGCCT CATCTTGAAC GA 42(2)SEQ ID NO:143的信息:
(ⅰ)序列特征:
(A)长度:31碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:其它核酸
(A)描述:/描述=″PCR引物″
(ⅵ)最初来源:
(A)生物体:结核分枝杆菌
(ⅹⅰ)序列描述:SEQ ID NO:143:GGATCCTGCA GGCTCGAAAC CACCGAGCGG T 31(2)SEQ ID NO:144的信息:
(ⅰ)序列特征:
(A)长度:31碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:其它核酸
(A)描述:/描述=″PCR引物″
(ⅵ)最初来源:
(A)生物体:结核分枝杆菌
(ⅹⅰ)序列描述:SEQ ID NO:144:CTCTGAATTC AGCGCTGGAA ATCGTCGCGA T 31(2)SEQ ID NO:145的信息:
(ⅰ)序列特征:
(A)长度:33碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:其它核酸
(A)描述:/描述=″PCR引物″
(ⅵ)最初来源:
(A)生物体:结核分枝杆菌
(ⅹⅰ)序列描述:SEQ ID NO:145:GGATCCAGCG CTGAGATGAA GACCGATGCC GCT 33(2)SEQ ID NO:146的信息:
(ⅰ)序列特征:
(A)长度:33碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:其它核酸
(A)描述:/描述=″PCR引物″
(ⅵ)最初来源:
(A)生物体:结核分枝杆菌
(ⅹⅰ)序列描述:SEQ ID NO:146:GAGAGAATTC TCAGAAGCCC ATTTGCGAGG ACA 33(2)SEQ ID NO:147的信息:
(ⅰ)序列特征:
(A)长度:1993碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:DNA(基因组)
(ⅵ)最初来源:
(A)生物体:结核分枝杆菌
(ⅸ)特征:
(A)名称/关键字:CDS
(B)位置:152..1273
(ⅹⅰ)序列描述:SEQ ID NO:147:TGTTCTTCGA CGGCAGGCTG GTGGAGGAAG GGCCCACCGA ACAGCTGTTC TCCTCGCCGA 60AGCATGCGGA AACCGCCCGA TACGTCGCCG GACTGTCGGG GGACGTCAAG GACGCCAAGC 120GCGGAAATTG AAGAGCACAG AAAGGTATGG C GTG AAA ATT CGT TTG CAT ACG 172
Val Lys Ile Arg Leu His Thr
1 5CTG TTG GCC GTG TTG ACC GCT GCG CCG CTG CTG CTA GCA GCG GCG GGC 220Leu Leu Ala Val Leu Thr Ala Ala Pro Leu Leu Leu Ala Ala Ala Gly
10 15 20TGT GGC TCG AAA CCA CCG AGC GGT TCG CCT GAA ACG GGC GCC GGC GCC 268Cys Gly Ser Lys Pro Pro Ser Gly Ser Pro Glu Thr Gly Ala Gly Ala
25 30 35GGT ACT GTC GCG ACT ACC CCC GCG TCG TCG CCG GTG ACG TTG GCG GAG 316Gly Thr Val Ala Thr Thr Pro Ala Ser Ser Pro Val Thr Leu Ala Glu40 45 50 55ACC GGT AGC ACG CTG CTC TAC CCG CTG TTC AAC CTG TGG GGT CCG GCC 364Thr Gly Ser Thr Leu Leu Tyr Pro Leu Phe Asn Leu Trp Gly Pro Ala
60 65 70TTT CAC GAG AGG TAT CCG AAC GTC ACG ATC ACC GCT CAG GGC ACC GGT 412Phe His Glu Arg Tyr Pro Asn Val Thr Ile Thr Ala Gln Gly Thr Gly
75 80 85TCT GGT GCC GGG ATC GCG CAG GCC GCC GCC GGG ACG GTC AAC ATT GGG 460Ser Gly Ala Gly Ile Ala Gln Ala Ala Ala Gly Thr Val Asn Ile Gly
90 95 100GCC TCC GAC GCC TAT CTG TCG GAA GGT GAT ATG GCC GCG CAC AAG GGG 508Ala Ser Asp Ala Tyr Leu Ser Glu Gly Asp Met Ala Ala His Lys Gly
105 110 115CTG ATG AAC ATC GCG CTA GCC ATC TCC GCT CAG CAG GTC AAC TAC AAC 556Leu Met Asn Ile Ala Leu Ala Ile Ser Ala Gln Gln Val Asn Tyr Asn120 125 130 135CTG CCC GGA GTG AGC GAG CAC CTC AAG CTG AAC GGA AAA GTC CTG GCG 604Leu Pro Gly Val Ser Glu His Leu Lys Leu Asn Gly Lys Val Leu Ala
140 145 150GCC ATG TAC CAG GGC ACC ATC AAA ACC TGG GAC GAC CCG CAG ATC GCT 652Ala Met Tyr Gln Gly Thr Ile Lys Thr Trp Asp Asp Pro Gln Ile Ala
155 160 165GCG CTC AAC CCC GGC GTG AAC CTG CCC GGC ACC GCG GTA GTT CCG CTG 700Ala Leu Asn Pro Gly Val Asn Leu Pro Gly Thr Ala Val Val Pro Leu
170 175 180CAC CGC TCC GAC GGG TCC GGT GAC ACC TTC TTG TTC ACC CAG TAC CTG 748His Arg Ser Asp Gly Ser Gly Asp Thr Phe Leu Phe Thr Gln Tyr Leu
185 190 195TCC AAG CAA GAT CCC GAG GGC TGG GGC AAG TCG CCC GGC TTC GGC ACC 796Ser Lys Gln Asp Pro Glu Gly Trp Gly Lys Ser Pro Gly Phe Gly Thr200 205 210 215ACC GTC GAC TTC CCG GCG GTG CCG GGT GCG CTG GGT GAG AAC GGC AAC 844Thr Val Asp Phe Pro Ala Val Pro Gly Ala Leu Gly Glu Asn Gly Asn
220 225 230GGC GGC ATG GTG ACC GGT TGC GCC GAG ACA CCG GGC TGC GTG GCC TAT 892Gly Gly Met Val Thr Gly Cys Ala Glu Thr Pro Gly Cys Val Ala Tyr
235 240 245ATC GGC ATC AGC TTC CTC GAC CAG GCC AGT CAA CGG GGA CTC GGC GAG 940Ile Gly Ile Ser Phe Leu Asp Gln Ala Ser Gln Arg Gly Leu Gly Glu
250 255 260GCC CAA CTA GGC AAT AGC TCT GGC AAT TTC TTG TTG CCC GAC GCG CAA 988Ala Gln Leu Gly Asn Ser Ser Gly Asn Phe Leu Leu Pro Asp Ala Gln
265 270 275AGC ATT CAG GCC GCG GCG GCT GGC TTC GCA TCG AAA ACC CCG GCG AAC 1036Ser Ile Gln Ala Ala Ala Ala Gly Phe Ala Ser Lys Thr Pro Ala Asn280 285 290 295CAG GCG ATT TCG ATG ATC GAC GGG CCC GCC CCG GAC GGC TAC CCG ATC 1084Gln Ala Ile Ser Met Ile Asp Gly Pro Ala Pro Asp Gly Tyr Pro Ile
300 305 310ATC AAC TAC GAG TAC GCC ATC GTC AAC AAC CGG CAA AAG GAC GCC GCC 1132Ile Asn Tyr Glu Tyr Ala Ile Val Asn Asn Arg Gln Lys Asp Ala Ala
315 320 325ACC GCG CAG ACC TTG CAG GCA TTT CTG CAC TGG GCG ATC ACC GAC GGC 1180Thr Ala Gln Thr Leu Gln Ala Phe Leu His Trp Ala Ile Thr Asp Gly
330 335 340AAC AAG GCC TCG TTC CTC GAC CAG GTT CAT TTC CAG CCG CTG CCG CCC 1228Asn Lys Ala Ser Phe Leu Asp Gln Val His Phe Gln Pro Leu Pro Pro
345 350 355GCG GTG GTG AAG TTG TCT GAC GCG TTG ATC GCG ACG ATT TCC AGC 1273Ala Val Val Lys Leu Ser Asp Ala Leu Ile Ala Thr Ile Ser Ser360 365 370TAGCCTCGTT GACCACCACG CGACAGCAAC CTCCGTCGGG CCATCGGGCT GCTTTGCGGA 1333GCATGCTGGC CCGTGCCGGT GAAGTCGGCC GCGCTGGCCC GGCCATCCGG TGGTTGGGTG 1393GGATAGGTGC GGTGATCCCG CTGCTTGCGC TGGTCTTGGT GCTGGTGGTG CTGGTCATCG 1453AGGCGATGGG TGCGATCAGG CTCAACGGGT TGCATTTCTT CACCGCCACC GAATGGAATC 1513CAGGCAACAC CTACGGCGAA ACCGTTGTCA CCGACGCGTC GCCCATCCGG TCGGCGCCTA 1573CTACGGGGCG TTGCCGCTGA TCGTCGGGAC GCTGGCGACC TCGGCAATCG CCCTGATCAT 1633CGCGGTGCCG GTCTCTGTAG GAGCGGCGCT GGTGATCGTG GAACGGCTGC CGAAACGGTT 1693GGCCGAGGCT GTGGGAATAG TCCTGGAATT GCTCGCCGGA ATCCCCAGCG TGGTCGTCGG 1753TTTGTGGGGG GCAATGACGT TCGGGCCGTT CATCGCTCAT CACATCGCTC CGGTGATCGC 1813TCACAACGCT CCCGATGTGC CGGTGCTGAA CTACTTGCGC GGCGACCCGG GCAACGGGGA 1873GGGCATGTTG GTGTCCGGTC TGGTGTTGGC GGTGATGGTC GTTCCCATTA TCGCCACCAC 1933CACTCATGAC CTGTTCCGGC AGGTGCCGGT GTTGCCCCGG GAGGGCGCGA TCGGGAATTC 1993(2)SEQ ID NO:148的信息:
(ⅰ)序列特征:
(A)长度:374氨基酸
(B)类型:氨基酸
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:148:Val Lys Ile Arg Leu His Thr Leu Leu Ala Val Leu Thr Ala Ala Pro1 5 10 15Leu Leu Leu Ala Ala Ala Gly Cys Gly Ser Lys Pro Pro Ser Gly Ser
20 25 30Pro Glu Thr Gly Ala Gly Ala Gly Thr Val Ala Thr Thr Pro Ala Ser
35 40 45Ser Pro Val Thr Leu Ala Glu Thr Gly Ser Thr Leu Leu Tyr Pro Leu
50 55 60Phe Asn Leu Trp Gly Pro Ala Phe His Glu Arg Tyr Pro Asn Val Thr65 70 75 80Ile Thr Ala Gln Gly Thr Gly Ser Gly Ala Gly Ile Ala Gln Ala Ala
85 90 95Ala Gly Thr Val Asn Ile Gly Ala Ser Asp Ala Tyr Leu Ser Glu Gly
100 105 110Asp Met Ala Ala His Lys Gly Leu Met Asn Ile Ala Leu Ala Ile Ser
115 120 125Ala Gln Gln Val Asn Tyr Asn Leu Pro Gly Val Ser Glu His Leu Lys
130 135 140Leu Asn Gly Lys Val Leu Ala Ala Met Tyr Gln Gly Thr Ile Lys Thr145 150 155 160Trp Asp Asp Pro Gln Ile Ala Ala Leu Asn Pro Gly Val Asn Leu Pro
165 170 175Gly Thr Ala Val Val Pro Leu His Arg Ser Asp Gly Ser Gly Asp Thr
180 185 190Phe Leu Phe Thr Gln Tyr Leu Ser Lys Gln Asp Pro Glu Gly Trp Gly
195 200 205Lys Ser Pro Gly Phe Gly Thr Thr Val Asp Phe Pro Ala Val Pro Gly
210 215 220Ala Leu Gly Glu Asn Gly Asn Gly Gly Met Val Thr Gly Cys Ala Glu225 230 235 240Thr Pro Gly Cys Val Ala Tyr Ile Gly Ile Ser Phe Leu Asp Gln Ala
245 250 255Ser Gln Arg Gly Leu Gly Glu Ala Gln Leu Gly Asn Ser Ser Gly Asn
260 265 270Phe Leu Leu Pro Asp Ala Gln Ser Ile Gln Ala Ala Ala Ala Gly Phe
275 280 285Ala Ser Lys Thr Pro Ala Asn Gln Ala Ile Ser Met Ile Asp Gly Pro
290 295 300Ala Pro Asp Gly Tyr Pro Ile Ile Asn Tyr Glu Tyr Ala Ile Val Asn305 310 315 320Asn Arg Gln Lys Asp Ala Ala Thr Ala Gln Thr Leu Gln Ala Phe Leu
325 330 335His Trp Ala Ile Thr Asp Gly Asn Lys Ala Ser Phe Leu Asp Gln Val
340 345 350His Phe Gln Pro Leu Pro Pro Ala Val Val Lys Leu Ser Asp Ala Leu
355 360 365Ile Ala Thr Ile Ser Ser
370(2)SEQ ID NO:149的信息:
(ⅰ)序列特征:
(A)长度:1993碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:149:TGTTCTTCGA CGGCAGGCTG GTGGAGGAAG GGCCCACCGA ACAGCTGTTC TCCTCGCCGA 60AGCATGCGGA AACCGCCCGA TACGTCGCCG GACTGTCGGG GGACGTCAAG GACGCCAAGC 120GCGGAAATTG AAGAGCACAG AAAGGTATGG CGTGAAAATT CGTTTGCATA CGCTGTTGGC 180CGTGTTGACC GCTGCGCCGC TGCTGCTAGC AGCGGCGGGC TGTGGCTCGA AACCACCGAG 240CGGTTCGCCT GAAACGGGCG CCGGCGCCGG TACTGTCGCG ACTACCCCCG CGTCGTCGCC 300GGTGACGTTG GCGGAGACCG GTAGCACGCT GCTCTACCCG CTGTTCAACC TGTGGGGTCC 360GGCCTTTCAC GAGAGGTATC CGAACGTCAC GATCACCGCT CAGGGCACCG GTTCTGGTGC 420CGGGATCGCG CAGGCCGCCG CCGGGACGGT CAACATTGGG GCCTCCGACG CCTATCTGTC 480GGAAGGTGAT ATGGCCGCGC ACAAGGGGCT GATGAACATC GCGCTAGCCA TCTCCGCTCA 540GCAGGTCAAC TACAACCTGC CCGGAGTGAG CGAGCACCTC AAGCTGAACG GAAAAGTCCT 600GGCGGCCATG TACCAGGGCA CCATCAAAAC CTGGGACGAC CCGCAGATCG CTGCGCTCAA 660CCCCGGCGTG AACCTGCCCG GCACCGCGGT AGTTCCGCTG CACCGCTCCG ACGGGTCCGG 720TGACACCTTC TTGTTCACCC AGTACCTGTC CAAGCAAGAT CCCGAGGGCT GGGGCAAGTC 780GCCCGGCTTC GGCACCACCG TCGACTTCCC GGCGGTGCCG GGTGCGCTGG GTGAGAACGG 840CAACGGCGGC ATGGTGACCG GTTGCGCCGA GACACCGGGC TGCGTGGCCT ATATCGGCAT 900CAGCTTCCTC GACCAGGCCA GTCAACGGGG ACTCGGCGAG GCCCAACTAG GCAATAGCTC 960TGGCAATTTC TTGTTGCCCG ACGCGCAAAG CATTCAGGCC GCGGCGGCTG GCTTCGCATC 1020GAAAACCCCG GCGAACCAGG CGATTTCGAT GATCGACGGG CCCGCCCCGG ACGGCTACCC 1080GATCATCAAC TACGAGTACG CCATCGTCAA CAACCGGCAA AAGGACGCCG CCACCGCGCA 1140GACCTTGCAG GCATTTCTGC ACTGGGCGAT CACCGACGGC AACAAGGCCT CGTTCCTCGA 1200CCAGGTTCAT TTCCAGCCGC TGCCGCCCGC GGTGGTGAAG TTGTCTGACG CGTTGATCGC 1260GACGATTTCC AGCTAGCCTC GTTGACCACC ACGCGACAGC AACCTCCGTC GGGCCATCGG 1320GCTGCTTTGC GGAGCATGCT GGCCCGTGCC GGTGAAGTCG GCCGCGCTGG CCCGGCCATC 1380CGGTGGTTGG GTGGGATAGG TGCGGTGATC CCGCTGCTTG CGCTGGTCTT GGTGCTGGTG 1440GTGCTGGTCA TCGAGGCGAT GGGTGCGATC AGGCTCAACG GGTTGCATTT CTTCACCGCC 1500ACCGAATGGA ATCCAGGCAA CACCTACGGC GAAACCGTTG TCACCGACGC GTCGCCCATC 1560CGGTCGGCGC CTACTACGGG GCGTTGCCGC TGATCGTCGG GACGCTGGCG ACCTCGGCAA 1620TCGCCCTGAT CATCGCGGTG CCGGTCTCTG TAGGAGCGGC GCTGGTGATC GTGGAACGGC 1680TGCCGAAACG GTTGGCCGAG GCTGTGGGAA TAGTCCTGGA ATTGCTCGCC GGAATCCCCA 1740GCGTGGTCGT CGGTTTGTGG GGGGCAATGA CGTTCGGGCC GTTCATCGCT CATCACATCG 1800CTCCGGTGAT CGCTCACAAC GCTCCCGATG TGCCGGTGCT GAACTACTTG CGCGGCGACC 1860CGGGCAACGG GGAGGGCATG TTGGTGTCCG GTCTGGTGTT GGCGGTGATG GTCGTTCCCA 1920TTATCGCCAC CACCACTCAT GACCTGTTCC GGCAGGTGCC GGTGTTGCCC CGGGAGGGCG 1980CGATCGGGAA TTC 1993(2)SEQ ID NO:150的信息:
(ⅰ)序列特征:
(A)长度:374氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:150:Met Lys Ile Arg Leu His Thr Leu Leu Ala Val Leu Thr Ala Ala Pro1 5 10 15Leu Leu Leu Ala Ala Ala Gly Cys Gly Ser Lys Pro Pro Ser Gly Ser
20 25 30Pro Glu Thr Gly Ala Gly Ala Gly Thr Val Ala Thr Thr Pro Ala Ser
35 40 45Ser Pro Val Thr Leu Ala Glu Thr Gly Ser Thr Leu Leu Tyr Pro Leu
50 55 60Phe Asn Leu Trp Gly Pro Ala Phe His Glu Arg Tyr Pro Asn Val Thr65 70 75 80Ile Thr Ala Gln Gly Thr Gly Ser Gly Ala Gly Ile Ala Gln Ala Ala
85 90 95Ala Gly Thr Val Asn Ile Gly Ala Ser Asp Ala Tyr Leu Ser Glu Gly
100 105 110Asp Met Ala Ala His Lys Gly Leu Met Asn Ile Ala Leu Ala Ile Ser
115 120 125Ala Gln Gln Val Asn Tyr Asn Leu Pro Gly Val Ser Glu His Leu Lys
130 135 140Leu Asn Gly Lys Val Leu Ala Ala Met Tyr Gln Gly Thr Ile Lys Thr145 150 155 160Trp Asp Asp Pro Gln Ile Ala Ala Leu Asn Pro Gly Val Asn Leu Pro
165 170 175Gly Thr Ala Val Val Pro Leu His Arg Ser Asp Gly Ser Gly Asp Thr
180 185 190Phe Leu Phe Thr Gln Tyr Leu Ser Lys Gln Asp Pro Glu Gly Trp Gly
195 200 205Lys Ser Pro Gly Phe Gly Thr Thr Val Asp Phe Pro Ala Val Pro Gly
210 215 220Ala Leu Gly Glu Asn Gly Asn Gly Gly Met Val Thr Gly Cys Ala Glu225 230 235 240Thr Pro Gly Cys Val Ala Tyr Ile Gly Ile Ser Phe Leu Asp Gln Ala
245 250 255Ser Gln Arg Gly Leu Gly Glu Ala Gln Leu Gly Asn Ser Ser Gly Asn
260 265 270Phe Leu Leu Pro Asp Ala Gln Ser Ile Gln Ala Ala Ala Ala Gly Phe
275 280 285Ala Ser Lys Thr Pro Ala Asn Gln Ala Ile Ser Met Ile Asp Gly Pro
290 295 300Ala Pro Asp Gly Tyr Pro Ile Ile Asn Tyr Glu Tyr Ala Ile Val Asn305 310 315 320Asn Arg Gln Lys Asp Ala Ala Thr Ala Gln Thr Leu Gln Ala Phe Leu
325 330 335His Trp Ala Ile Thr Asp Gly Asn Lys Ala Ser Phe Leu Asp Gln Val
340 345 350His Phe Gln Pro Leu Pro Pro Ala Val Val Lys Leu Ser Asp Ala Leu
355 360 365Ile Ala Thr Ile Ser Ser
370(2)SEQ ID NO:151的信息:
(ⅰ)序列特征:
(A)长度:1777碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:151:GGTCTTGACC ACCACCTGGG TGTCGAAGTC GGTGCCCGGA TTGAAGTCCA GGTACTCGTG 60GGTGGGGCGG GCGAAACAAT AGCGACAAGC ATGCGAGCAG CCGCGGTAGC CGTTGACGGT 120GTAGCGAAAC GGCAACGCGG CCGCGTTGGG CACCTTGTTC AGCGCTGATT TGCACAACAC 180CTCGTGGAAG GTGATGCCGT CGAATTGTGG CGCGCGAACG CTGCGGACCA GGCCGATCCG 240CTGCAACCCG GCAGCGCCCG TCGTCAACGG GCATCCCGTT CACCGCGACG GCTTGCCGGG 300CCCAACGCAT ACCATTATTC GAACAACCGT TCTATACTTT GTCAACGCTG GCCGCTACCG 360AGCGCCGCAC AGGATGTGAT ATGCCATCTC TGCCCGCACA GACAGGAGCC AGGCCTTATG 420ACAGCATTCG GCGTCGAGCC CTACGGGCAG CCGAAGTACC TAGAAATCGC CGGGAAGCGC 480ATGGCGTATA TCGACGAAGG CAAGGGTGAC GCCATCGTCT TTCAGCACGG CAACCCCACG 540TCGTCTTACT TGTGGCGCAA CATCATGCCG CACTTGGAAG GGCTGGGCCG GCTGGTGGCC 600TGCGATCTGA TCGGGATGGG CGCGTCGGAC AAGCTCAGCC CATCGGGACC CGACCGCTAT 660AGCTATGGCG AGCAACGAGA CTTTTTGTTC GCGCTCTGGG ATGCGCTCGA CCTCGGCGAC 720CACGTGGTAC TGGTGCTGCA CGACTGGGGC TCGGCGCTCG GCTTCGACTG GGCTAACCAG 780CATCGCGACC GAGTGCAGGG GATCGCGTTC ATGGAAGCGA TCGTCACCCC GATGACGTGG 840GCGGACTGGC CGCCGGCCGT GCGGGGTGTG TTCCAGGGTT TCCGATCGCC TCAAGGCGAG 900CCAATGGCGT TGGAGCACAA CATCTTTGTC GAACGGGTGC TGCCCGGGGC GATCCTGCGA 960CAGCTCAGCG ACGAGGAAAT GAACCACTAT CGGCGGCCAT TCGTGAACGG CGGCGAGGAC 1020CGTCGCCCCA CGTTGTCGTG GCCACGAAAC CTTCCAATCG ACGGTGAGCC CGCCGAGGTC 1080GTCGCGTTGG TCAACGAGTA CCGGAGCTGG CTCGAGGAAA CCGACATGCC GAAACTGTTC 1140ATCAACGCCG AGCCCGGCGC GATCATCACC GGCCGCATCC GTGACTATGT CAGGAGCTGG 1200CCCAACCAGA CCGAAATCAC AGTGCCCGGC GTGCATTTCG TTCAGGAGGA CAGCGATGGC 1260GTCGTATCGT GGGCGGGCGC TCGGCAGCAT CGGCGACCTG GGAGCGCTCT CATTTCACGA 1320GACCAAGAAT GTGATTTCCG GCGAAGGCGG CGCCCTGCTT GTCAACTCAT AAGACTTCCT 1380GCTCCGGGCA GAGATTCTCA GGGAAAAGGG CACCAATCGC AGCCGCTTCC TTCGCAACGA 1440GGTCGACAAA TATACGTGGC AGGACAAAGG TCTTCCTATT TGCCCAGCGA ATTAGTCGCT 1500GCCTTTCTAT GGGCTCAGTT CGAGGAAGCC GAGCGGATCA CGCGTATCCG ATTGGACCTA 1560TGGAACCGGT ATCATGAAAG CTTCGAATCA TTGGAACAGC GGGGGCTCCT GCGCCGTCCG 1620ATCATCCCAC AGGGCTGCTC TCACAACGCC CACATGTACT ACGTGTTACT AGCGCCCAGC 1680GCCGATCGGG AGGAGGTGCT GGCGCGTCTG ACGAGCGAAG GTATAGGCGC GGTCTTTCAT 1740TACGTGCCGC TTCACGATTC GCCGGCCGGG CGTCGCT 1777(2)SEQ ID NO:152的信息:
(ⅰ)序列特征:
(A)长度:324碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:152:GAGATTGAAT CGTACCGGTC TCCTTAGCGG CTCCGTCCCG TGAATGCCCA TATCACGCAC 60GGCCATGTTC TGGCTGTCGA CCTTCGCCCC ATGCCCGGAC GTTGGTAAAC CCAGGGTTTG 120ATCAGTAATT CCGGGGGACG GTTGCGGGAA GGCGGCCAGG ATGTGCGTGA GCCGCGGCGC 180CGCCGTCGCC CAGGCGACCG CTGGATGCTC AGCCCCGGTG CGGCGACGTA GCCAGCGTTT 240GGCGCGTGTC GTCCACAGTG GTACTCCGGT GACGACGCGG CGCGGTGCCT GGGTGAAGAC 300CGTGACCGAC GCCGCCGATT CAGA 324(2)SEQ ID NO:153的信息:
(ⅰ)序列特征:
(A)长度:1338碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:153:GCGGTACCGC CGCGTTGCGC TGGCACGGGA CCTGTACGAC CTGAACCACT TCGCCTCGCG 60AACGATTGAC GAACCGCTCG TGCGGCGGCT GTGGGTGCTC AAGGTGTGGG GTGATGTCGT 120CGATGACCGG CGCGGCACCC GGCCACTACG CGTCGAAGAC GTCCTCGCCG CCCGCAGCGA 180GCACGACTTC CAGCCCGACT CGATCGGCGT GCTGACCCGT CCTGTCGCTA TGGCTGCCTG 240GGAAGCTCGC GTTCGGAAGC GATTTGCGTT CCTCACTGAC CTCGACGCCG ACGAGCAGCG 300GTGGGCCGCC TGCGACGAAC GGCACCGCCG CGAAGTGGAG AACGCGCTGG CGGTGCTGCG 360GTCCTGATCA ACCTGCCGGC GATCGTGCCG TTCCGCTGGC ACGGTTGCGG CTGGACGCGG 420CTGAATCGAC TAGATGAGAG CAGTTGGGCA CGAATCCGGC TGTGGTGGTG AGCAAGACAC 480GAGTACTGTC ATCACTATTG GATGCACTGG ATGACCGGCC TGATTCAGCA GGACCAATGG 540AACTGCCCGG GGCAAAACGT CTCGGAGATG ATCGGCGTCC CCTCGGAACC CTGCGGTGCT 600GGCGTCATTC GGACATCGGT CCGGCTCGCG GGATCGTGGT GACGCCAGCG CTGAAGGAGT 660GGAGCGCGGC GGTGCACGCG CTGCTGGACG GCCGGCAGAC GGTGCTGCTG CGTAAGGGCG 720GGATCGGCGA GAAGCGCTTC GAGGTGGCGG CCCACGAGTT CTTGTTGTTC CCGACGGTCG 780CGCACAGCCA CGCCGAGCGG GTTCGCCCCG AGCACCGCGA CCTGCTGGGC CCGGCGGCCG 840CCGACAGCAC CGACGAGTGT GTGCTACTGC GGGCCGCAGC GAAAGTTGTT GCCGCACTGC 900CGGTTAACCG GCCAGAGGGT CTGGACGCCA TCGAGGATCT GCACATCTGG ACCGCCGAGT 960CGGTGCGCGC CGACCGGCTC GACTTTCGGC CCAAGCACAA ACTGGCCGTC TTGGTGGTCT 1020CGGCGATCCC GCTGGCCGAG CCGGTCCGGC TGGCGCGTAG GCCCGAGTAC GGCGGTTGCA 1080CCAGCTGGGT GCAGCTGCCG GTGACGCCGA CGTTGGCGGC GCCGGTGCAC GACGAGGCCG 1140CGCTGGCCGA GGTCGCCGCC CGGGTCCGCG AGGCCGTGGG TTGACTGGGC GGCATCGCTT 1200GGGTCTGAGC TGTACGCCCA GTCGGCGCTG CGAGTGATCT GCTGTCGGTT CGGTCCCTGC 1260TGGCGTCAAT TGACGGCGCG GGCAACAGCA GCATTGGCGG CGCCATCCTC CGCGCGGCCG 1320GCGCCCACCG CTACAACC 1338(2)SEQ ID NO:154的信息:
(ⅰ)序列特征:
(A)长度:321碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:154:CCGGCGGCAC CGGCGGCACC GGCGGTACCG GCGGCAACGG CGCTGACGCC GCTGCTGTGG 60TGGGCTTCGG CGCGAACGGC GACCCTGGCT TCGCTGGCGG CAAAGGCGGT AACGGCGGAA 120TAGGTGGGGC CGCGGTGACA GGCGGGGTCG CCGGCGACGG CGGCACCGGC GGCAAAGGTG 180GCACCGGCGG TGCCGGCGGC GCCGGCAACG ACGCCGGCAG CACCGGCAAT CCCGGCGGTA 240AGGGCGGCGA CGGCGGGATC GGCGGTGCCG GCGGGGCCGG CGGCGCGGCC GGCACCGGCA 300ACGGCGGCCA TGCCGGCAAC C 321(2)SEQ ID NO:155的信息:
(ⅰ)序列特征:
(A)长度:492碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:155:GAAGACCCGG CCCCGCCATA TCGATCGGCT CGCCGACTAC TTTCGCCGAA CGTGCACGCG 60GCGGCGTCGG GCTGATCATC ACCGGTGGCT ACGCGCCCAA CCGCACCGGA TGGCTGCTGC 120CGTTCGCCTC CGAACTCGTC ACTTCGGCGC AAGCCCGACG GCACCGCCGA ATCACCAGGG 180CGGTCCACGA TTCGGGTGCA AAGATCCTGC TGCAAATCCT GCACGCCGGA CGCTACGCCT 240ACCACCCACT TGCGGTCAGC GCCTCGCCGA TCAAGGCGCC GATCACCCCG TTTCGTCCGC 300GAGCACTATC GGCTCGCGGG GTCGAAGCGA CCATCGCGGA TTTCGCCCGC TGCGCGCAGT 360TGGCCCGCGA TGCCGGCTAC GACGGCGTCG AAATCATGGG CAGCGAAGGG TATCTGCTCA 420ATCAGTTCCT GGCGCCGCGC ACCAACAAGC GCACCGACTC GTGGGGCGGC ACACCGGCCA 480ACCGTCGCCG GT 492(2)SEQ ID NO:156的信息:
(ⅰ)序列特征:
(A)长度:536氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:156:Phe Ala Gln His Leu Val Glu Gly Asp Ala Val Glu Leu Trp Arg Ala1 5 10 15Asn Ala Ala Asp Gln Ala Asp Pro Leu Gln Pro Gly Ser Ala Arg Arg
20 25 30Gln Arg Ala Ser Arg Ser Pro Arg Arg Leu Ala Gly Pro Asn Ala Tyr
35 40 45His Tyr Ser Asn Asn Arg Ser Ile Leu Cys Gln Arg Trp Pro Leu Pro
50 55 60Ser Ala Ala Gln Asp Val Ile Cys His Leu Cys Pro His Arg Gln Glu65 70 75 80Pro Gly Leu Met Thr Ala Phe Gly Val Glu Pro Tyr Gly Gln Pro Lys
85 90 95Tyr Leu Glu Ile Ala Gly Lys Arg Met Ala Tyr Ile Asp Glu Gly Lys
100 105 110Gly Asp Ala Ile Val Phe Gln His Gly Asn Pro Thr Ser Ser Tyr Leu
115 120 125Trp Arg Asn Ile Met Pro His Leu Glu Gly Leu Gly Arg Leu Val Ala
130 135 140Cys Asp Leu Ile Gly Met Gly Ala Ser Asp Lys Leu Ser Pro Ser Gly145 150 155 160Pro Asp Arg Tyr Ser Tyr Gly Glu Gln Arg Asp Phe Leu Phe Ala Leu
165 170 175Trp Asp Ala Leu Asp Leu Gly Asp His Val Val Leu Val Leu His Asp
180 185 190Trp Gly Ser Ala Leu Gly Phe Asp Trp Ala Asn Gln His Arg Asp Arg
195 200 205Val Gln Gly Ile Ala Phe Met Glu Ala Ile Val Thr Pro Met Thr Trp
210 215 220Ala Asp Trp Pro Pro Ala Val Arg Gly Val Phe Gln Gly Phe Arg Ser225 230 235 240Pro Gln Gly Glu Pro Met Ala Leu Glu His Asn Ile Phe Val Glu Arg
245 250 255Val Leu Pro Gly Ala Ile Leu Arg Gln Leu Ser Asp Glu Glu Met Asn
260 265 270His Tyr Arg Arg Pro Phe Val Asn Gly Gly Glu Asp Arg Arg Pro Thr
275 280 285Leu Ser Trp Pro Arg Asn Leu Pro Ile Asp Gly Glu Pro Ala Glu Val
290 295 300Val Ala Leu Val Asn Glu Tyr Arg Ser Trp Leu Glu Glu Thr Asp Met305 310 315 320Pro Lys Leu Phe Ile Asn Ala Glu Pro Gly Ala Ile Ile Thr Gly Arg
325 330 335Ile Arg Asp Tyr Val Arg Ser Trp Pro Asn Gln Thr Glu Ile Thr Val
340 345 350Pro Gly Val His Phe Val Gln Glu Asp Ser Asp Gly Val Val Ser Trp
355 360 365Ala Gly Ala Arg Gln His Arg Arg Pro Gly Ser Ala Leu Ile Ser Arg
370 375 380Asp Gln Glu Cys Asp Phe Arg Arg Arg Arg Arg Pro Ala Cys Gln Leu385 390 395 400Ile Arg Leu Pro Ala Pro Gly Arg Asp Ser Gln Gly Lys Gly His Gln
405 410 415Ser Gln Pro Leu Pro Ser Gln Arg Gly Arg Gln Ile Tyr Val Ala Gly
420 425 430Gln Arg Ser Ser Tyr Leu Pro Ser Glu Leu Val Ala Ala Phe Leu Trp
435 440 445Ala Gln Phe Glu Glu Ala Glu Arg Ile Thr Arg Ile Arg Leu Asp Leu
450 455 460Trp Asn Arg Tyr His Glu Ser Phe Glu Ser Leu Glu Gln Arg Gly Leu465 470 475 480Leu Arg Arg Pro Ile Ile Pro Gln Gly Cys Ser His Asn Ala His Met
485 490 495Tyr Tyr Val Leu Leu Ala Pro Ser Ala Asp Arg Glu Glu Val Leu Ala
500 505 510Arg Leu Thr Ser Glu Gly Ile Gly Ala Val Phe His Tyr Val Pro Leu
515 520 525His Asp Ser Pro Ala Gly Arg Arg
530 535(2)SEQ ID NO:157的信息:
(ⅰ)序列特征:
(A)长度:284氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:157:
Asn Glu Ser Ala Pro Arg Ser Pro Met Leu Pro Ser Ala Arg Pro Arg
1 5 10 15Tyr Asp Ala Ile Ala Val Leu Leu Asn Glu Met His Ala Gly His Cys
20 25 30Asp Phe Gly Leu Val Gly Pro Ala Pro Asp Ile Val Thr Asp Ala Ala
35 40 45Gly Asp Asp Arg Ala Gly Leu Gly Val Asp Glu Gln Phe Arg His Val
50 55 60Gly Phe Leu Glu Pro Ala Pro Val Leu Val Asp Gln Arg Asp Asp Leu65 70 75 80Gly Gly Leu Thr Val Asp Trp Lys Val Ser Trp Pro Arg Gln Arg Gly
85 90 95Ala Thr Val Leu Ala Ala Val His Glu Trp Pro Pro Ile Val Val His
100 105 110Phe Leu Val Ala Glu Leu Ser Gln Asp Arg Pro Gly Gln His Pro Phe
115 120 125Asp Lys Asp Val Val Leu Gln Arg His Trp Leu Ala Leu Arg Arg Ser
130 135 140Glu Thr Leu Glu His Thr Pro His Gly Arg Arg Pro Val Arg Pro Arg145 150 155 160His Arg Gly Asp Asp Arg Phe His Glu Arg Asp Pro Leu His Ser Val
165 170 175Ala Met Leu Val Ser Pro Val Glu Ala Glu Arg Arg Ala Pro Val Val
180 185 190Gln His Gln Tyr His Val Val Ala Glu Val Glu Arg Ile Pro Glu Arg
195 200 205Glu Gln Lys Val Ser Leu Leu Ala Ile Ala Ile Ala Val Gly Ser Arg
210 215 220Trp Ala Glu Leu Val Arg Arg Ala His Pro Asp Gln Ile Ala Gly His225 230 235 240Gln Pro Ala Gln Pro Phe Gln Val Arg His Asp Val Ala Pro Gln Val
245 250 255Arg Arg Arg Gly Val Ala Val Leu Lys Asp Asp Gly Val Thr Leu Ala
260 265 270Phe Val Asp Ile Arg His Ala Leu Pro Gly Asp Phe
275 280(2)SEQ ID NO:158的信息:
(ⅰ)序列特征:
(A)长度:264碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:158:ATGAACATGT CGTCGGTGGT GGGTCGCAAG GCCTTTGCGC GATTCGCCGG CTACTCCTCC 60GCCATGCACG CGATCGCCGG TTTCTCCGAT GCGTTGCGCC AAGAGCTGCG GGGTAGCGGA 120ATCGCCGTCT CGGTGATCCA CCCGGCGCTG ACCCAGACAC CGCTGTTGGC CAACGTCGAC 180CCCGCCGACA TGCCGCCGCC GTTTCGCAGC CTCACGCCCA TTCCCGTTCA CTGGGTCGCG 240GCAGCGGTGC TTGACGGTGT GGCG 264(2)SEQ ID NO:159的信息:
(ⅰ)序列特征:
(A)长度:1171碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:159:TAGTCGGCGA CGATGACGTC GCGGTCCAGG CCGACCGCTT CAAGCACCAG CGCGACCACG 60AAGCCGGTGC GATCCTTACC CGCGAAGCAG TGGGTGAGCA CCGGGCGTCC GGCGGCAAGC 120AGTGTGACGA CACGATGTAG CGCGCGCTGT GCTCCATTGC GCGTTGGGAA TTGGCGATAC 180TCGTCGGTCA TGTAGCGGGT GGCCGCGTCA TTTATCGACT GGCTGGATTC GCCGGACTCG 240CCGTTGGACC CGTCATTGGT TAGCAGCCTC TTGAATGCGG TTTCGTGCGG CGCTGAGTCG 300TCGGCGTCAT CATCGGCGAG GTCGGGGAAC GGCAGCAGGT GGACGTCGAT GCCGTCCGGA 360ACCCGTCCTG GACCGCGGCG GGCAACCTCC CGGGACGACC GCAGGTCGGC AACGTCGGTG 420ATCCCCAGCC GGCGCAGCGT TGCCCCTCGT GCCGAATTCG GCACGAGGCT GGCGAGCCAC 480CGGGCATCAC CAAGCAACGC TTGCCCAGTA CGGATCGTCA CTTCCGCATC CGGCAGACCA 540ATCTCCTCGC CGCCCATCGT CAGATCCCGC TCGTGCGTTG ACAAGAACGG CCGCAGATGT 600GCCAGCGGGT ATCGGAGATT GAACCGCGCA CGCAGTTCTT CAATCGCTGC GCGCTGCCGC 660ACTATTGGCA CTTTCCGGCG GTCGCGGTAT TCAGCAAGCA TGCGAGTCTC GACGAACTCG 720CCCCACGTAA CCCACGGCGT AGCTCCCGGC GTGACGCGGA GGATCGGCGG GTGATCTTTG 780CCGCCACGCT CGTAGCCGTT GATCCACCGC TTCGCGGTGC CGGCGGGGAG GCCGATCAGC 840TTATCGACCT CGGCGTATGC CGACGGCAAG CTGGGCGCGT TCGTCGAGGT CAAGAACTCC 900ACCATCGGCA CCGGCACCAA GGTGCCGCAC CTGACCTACG TCGGCGACGC CGACATCGGC 960GAGTACAGCA ACATCGGCGC CTCCAGCGTG TTCGTCAACT ACGACGGTAC GTCCAAACGG 1020CGCACCACCG TCGGTTCGCA CGTACGGACC GGGTCCGACA CCATGTTCGT GGCCCCAGTA 1080ACCATCGGCG ACGGCGCGTA TACCGGGGCC GGCACAGTGG TGCGGGAGGA TGTCCCGCCG 1140GGGGCGCTGG CAGTGTCGGC GGGTCCGCAA C 1171(2)SEQ ID NO:160的信息:
(ⅰ)序列特征:
(A)长度:227碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:160:GCAAAGGCGG CACCGGCGGG GCCGGCATGA ACAGCCTCGA CCCGCTGCTA GCCGCCCAAG 60ACGGCGGCCA AGGCGGCACC GGCGGCACCG GCGGCAACGC CGGCGCCGGC GGCACCAGCT 120TCACCCAAGG CGCCGACGGC AACGCCGGCA ACGGCGGTGA CGGCGGGGTC GGCGGCAACG 180GCGGAAACGG CGGAAACGGC GCAGACAACA CCACCACCGC CGCCGCC 227(2)SEQ ID NO:161的信息:
(ⅰ)序列特征:
(A)长度:304碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:161:CCTCGCCACC ATGGGCGGGC AGGGCGGTAG CGGTGGCGCC GGCTCTACCC CAGGCGCCAA 60GGGCGCCCAC GGCTTCACTC CAACCAGCGG CGGCGACGGC GGCGACGGCG GCAACGGCGG 120CAACTCCCAA GTGGTCGGCG GCAACGGCGG CGACGGCGGC AATGGCGGCA ACGGCGGCAG 180CGCCGGCACG GGCGGCAACG GCGGCCGCGG CGGCGACGGC GCGTTTGGTG GCATGAGTGC 240CAACGCCACC AACCCTGGTG AAAACGGGCC AAACGGTAAC CCCGGCGGCA ACGGTGGCGC 300CGGC 304(2)SEQ ID NO:162的信息:
(ⅰ)序列特征:
(A)长度:1439碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:162:GTGGGACGCT GCCGAGGCTG TATAACAAGG ACAACATCGA CCAGCGCCGG CTCGGTGAGC 60TGATCGACCT ATTTAACAGT GCGCGCTTCA GCCGGCAGGG CGAGCACCGC GCCCGGGATC 120TGATGGGTGA GGTCTACGAA TACTTCCTCG GCAATTTCGC TCGCGCGGAA GGGAAGCGGG 180GTGGCGAGTT CTTTACCCCG CCCAGCGTGG TCAAGGTGAT CGTGGAGGTG CTGGAGCCGT 240CGAGTGGGCG GGTGTATGAC CCGTGCTGCG GTTCCGGAGG CATGTTTGTG CAGACCGAGA 300AGTTCATCTA CGAACACGAC GGCGATCCGA AGGATGTCTC GATCTATGGC CAGGAAAGCA 360TTGAGGAGAC CTGGCGGATG GCGAAGATGA ACCTCGCCAT CCACGGCATC GACAACAAGG 420GGCTCGGCGC CCGATGGAGT GATACCTTCG CCCGCGACCA GCACCCGGAC GTGCAGATGG 480ACTACGTGAT GGCCAATCCG CCGTTCAACA TCAAAGACTG GGCCCGCAAC GAGGAAGACC 540CACGCTGGCG CTTCGGTGTT CCGCCCGCCA ATAACGCCAA CTACGCATGG ATTCAGCACA 600TCCTGTACAA CTTGGCGCCG GGAGGTCGGG CGGGCGTGGT GATGGCCAAC GGGTCGATGT 660CGTCGAACTC CAACGGCAAG GGGGATATTC GCGCGCAAAT CGTGGAGGCG GATTTGGTTT 720CCTGCATGGT CGCGTTACCC ACCCAGCTGT TCCGCAGCAC CGGAATCCCG GTGTGCCTGT 780GGTTTTTCGC CAAAAACAAG GCGGCAGGTA AGCAAGGGTC TATCAACCGG TGCGGGCAGG 840TGCTGTTCAT CGACGCTCGT GAACTGGGCG ACCTAGTGGA CCGGGCCGAG CGGGCGCTGA 900CCAACGAGGA GATCGTCCGC ATCGGGGATA CCTTCCACGC GAGCACGACC ACCGGCAACG 960CCGGCTCCGG TGGTGCCGGC GGTAATGGGG GCACTGGCCT CAACGGCGCG GGCGGTGCTG 1020GCGGGGCCGG CGGCAACGCG GGTGTCGCCG GCGTGTCCTT CGGCAACGCT GTGGGCGGCG 1080ACGGCGGCAA CGGCGGCAAC GGCGGCCACG GCGGCGACGG CACGACGGGC GGCGCCGGCG 1140GCAAGGGCGG CAACGGCAGC AGCGGTGCCG CCAGCGGCTC AGGCGTCGTC AACGTCACCG 1200CCGGCCACGG CGGCAACGGC GGCAATGGCG GCAACGGCGG CAACGGCTCC GCGGGCGCCG 1260GCGGCCAGGG CGGTGCCGGC GGCAGCGCCG GCAACGGCGG CCACGGCGGC GGTGCCACCG 1320GCGGCGCCAG CGGCAAGGGC GGCAACGGCA CCAGCGGTGC CGCCAGCGGC TCAGGCGTCA 1380TCAACGTCAC CGCCGGCCAC GGCGGCAACG GCGGCAATGG CCGCAACGGC GGCAACGGC 1439(2)SEQ ID NO:163的信息:
(ⅰ)序列特征:
(A)长度:329碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:163:GGGCCGGCGG GGCCGGATTT TCTCGTGCCT TGATTGTCGC TGGGGATAAC GGCGGTGATG 60GTGGTAACGG CGGGATGGGC GGGGCTGGCG GGGCTGGCGG CCCCGGCGGG GCCGGCGGCC 120TGATCAGCCT GCTGGGCGGC CAAGGCGCCG GCGGGGCCGG CGGGACCGGC GGGGCCGGCG 180GTGTTGGCGG TGACGGCGGG GCCGGCGGCC CCGGCAACCA GGCCTTCAAC GCAGGTGCCG 240GCGGGGCCGG CGGCCTGATC AGCCTGCTGG GCGGCCAAGG CGCCGGCGGG GCCGGCGGGA 300CCGGCGGGGC CGGCGGTGTT GGCGGTGAC 329(2)SEQ ID NO:164的信息:
(ⅰ)序列特征:
(A)长度:80碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:164:GCAACGGTGG CAACGGCGGC ACCAGCACGA CCGTGGGGAT GGCCGGAGGT AACTGTGGTG 60CCGCCGGGCT GATCGGCAAC 80(2)SEQ ID NO:165的信息:
(ⅰ)序列特征:
(A)长度:392碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:165:GGGCTGTGTC GCACTCACAC CGCCGCATTC GGCGACGTTG GCCGCCCAAT ATCCAGCTCA 60AGGCCTACTA CTTACCGTCG GAGGACCGCC GCATCAAGGT GCGGGTCAGC GCCCAAGGAA 120TCAAGGTCAT CGACCGCGAC GGGCATCGAG GCCGTCGTCG CGCGGCTCGG GCAGGATCCG 180CCCCGGCGCA CTTCGCGCGC CAAGCGGGCT CATCGCTCCG AACGGCGGCG ATCCTGTGAG 240CACAACTGAT GGCGCGCAAC GAGATTCGTC CAATTGTCAA GCCGTGTTCG ACCGCAGGGA 300CCGGTTATAC GTATGTCAAC CTATGTCACT CGCAAGAACC GGCATAACGA TCCCGTGATC 360CGCCGACAGC CCACGAGTGC AAGACCGTTA CA 392(2)SEQ ID NO:166的信息:
(ⅰ)序列特征:
(A)长度:535碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:166:ACCGGCGCCA CCGGCGGCAC CGGGTTCGCC GGTGGCGCCG GCGGGGCCGG CGGGCAGGGC 60GGTATCAGCG GTGCCGGCGG CACCAACGGC TCTGGTGGCG CTGGCGGCAC CGGCGGACAA 120GGCGGCGCCG GGGGCGCTGG CGGGGCCGGC GCCGATAACC CCACCGGCAT CGGCGGCGCC 180GGCGGCACCG GCGGCACCGG CGGAGCGGCC GGAGCCGGCG GGGCCGGTGG CGCCATCGGT 240ACCGGCGGCA CCGGCGGCGC GGTGGGCAGC GTCGGTAACG CCGGGATCGG CGGTACCGGC 300GGTACGGGTG GTGTCGGTGG TGCTGGTGGT GCAGGTGCGG CTGCGGCCGC TGGCAGCAGC 360GCTACCGGTG GCGCCGGGTT CGCCGGCGGC GCCGGCGGAG AAGGCGGACC GGGCGGCAAC 420AGCGGTGTGG GCGGCACCAA CGGCTCCGGC GGCGCCGGCG GTGCAGGCGG CAAGGGCGGC 480ACCGGAGGTG CCGGCGGGTC CGGCGCGGAC AACCCCACCG GTGCTGGTTT CGCCG 535(2)SEQ ID NO:167的信息:
(ⅰ)序列特征:
(A)长度:690碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:167:CCGACGTCGC CGGGGCGATA CGGGGGTCAC CGACTACTAC ATCATCCGCA CCGAGAATCG 60GCCGCTGCTG CAACCGCTGC GGGCGGTGCC GGTCATCGGA GATCCGCTGG CCGACCTGAT 120CCAGCCGAAC CTGAAGGTGA TCGTCAACCT GGGCTACGGC GACCCGAACT ACGGCTACTC 180GACGAGCTAC GCCGATGTGC GAACGCCGTT CGGGCTGTGG CCGAACGTGC CGCCTCAGGT 240CATCGCCGAT GCCCTGGCCG CCGGAACACA AGAAGGCATC CTTGACTTCA CGGCCGACCT 300GCAGGCGCTG TCCGCGCAAC CGCTCACGCT CCCGCAGATC CAGCTGCCGC AACCCGCCGA 360TCTGGTGGCC GCGGTGGCCG CCGCACCGAC GCCGGCCGAG GTGGTGAACA CGCTCGCCAG 420GATCATCTCA ACCAACTACG CCGTCCTGCT GCCCACCGTG GACATCGCCC TCGCCTGGTC 480ACCACCCTGC CGCTGTACAC CACCCAACTG TTCGTCAGGC AACTCGCTGC GGGCAATCTG 540ATCAACGCGA TCGGCTATCC CCTGGCGGCC ACCGTAGGTT TAGGCACGAT CGATAGCGGG 600CGGCGTGGAA TTGCTCACCC TCCTCGCGGC GGCCTCGGAC ACCGTTCGAA ACATCGAGGG 660CCTCGTCACC TAACGGATTC CCGACGGCAT 690(2)SEQ ID NO:168的信息:
(ⅰ)序列特征:
(A)长度:407碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:168:ACGGTGACGG CGGTACTGGC GGCGGCCACG GCGGCAACGG CGGGAATCCC GGGTGGCTCT 60TGGGCACAGC CGGGGGTGGC GGCAACGGTG GCGCCGGCAG CACCGGTACT GCAGGTGGCG 120GCTCTGGGGG CACCGGCGGC GACGGCGGGA CCGGCGGGCG TGGCGGCCTG TTAATGGGCG 180CCGGCGCCGG CGGGCACGGT GGCACTGGCG GCGCGGGCGG TGCCGGTGTC GACGGTGGCG 240GCGCCGGCGG GGCCGGCGGG GCCGGCGGCA ACGGCGGCGC CGGGGGTCAA GCCGCCCTGC 300TGTTCGGGCG CGGCGGCACC GGCGGAGCCG GCGGCTACGG CGGCGATGGC GGTGGCGGCG 360GTGACGGCTT CGACGGCACG ATGGCCGGCC TGGGTGGTAC CGGTGGC 407(2)SEQ ID NO:169的信息:
(ⅰ)序列特征:
(A)长度:468碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:169:GATCGGTCAG CGCATCGCCC TCGGCGGCAA GCGATTCCGC GGTCTCACCG AAGAACATCG 60TGCACGCGGC GGCGCGGACC AGCCCGCTGC GCTGCGGCGC GTCGAACGCC TCCAGCAGGC 120ACAGCCAGTC CTTGGCGGCC TGCGAGGCGA ACACGTCGGT GTCACCGGTG TAGATCGCCG 180GGATGCCCGC CTCCGCCAAC GCATTCCGGC ACGCCCGCGC GTCTTTGTGA TGCTCGACGA 240TCACCGCGAT GTCTGCGGCC ACCACGGGCC GCCCGGCGAA GGTGGCCCCG CTGGCCAGTA 300GCGCCGCGAC GTCGGCGGCC AGGTCGTCGG GGATGTGCCG GCGCAGCGCT CCGGCGCGAC 360GCCCGAAAAA CGACCCCTCA CCCAGCTGGG TCCCGCTGGC ATATCCCTTG CCGTCCTGGG 420CGATATTGGA CGCGCATGCC CCGACCGCGT ACAGGCCGGC CACCACCG 468(2)SEQ ID NO:170的信息:
(ⅰ)序列特征:
(A)长度:219碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:170:GGTGGTAACG GCGGCCAGGG TGGCATCGGC GGCGCCGGCG AGAGAGGCGC CGACGGCGCC 60GGCCCCAATG CTAACGGCGC AAACGGCGAG AACGGCGGTA GCGGTGGTAA CGGTGGCGAC 120GGCGGCGCCG GCGGCAATGG CGGCGCGGGC GGCAACGCGC AGGCGGCCGG GTACACCGAC 180GGCGCCACGG GCACCGGCGG CGACGGCGGC AACGGCGGC 219(2)SEQ ID NO:171的信息:
(ⅰ)序列特征:
(A)长度:494碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:171:TAGCTCCGGC GAGGGCGGCA AGGGCGGCGA CGGTGGCCAC GGCGGTGACG GCGTCGGCGG 60CAACAGTTCC GTCACCCAAG GCGGCAGCGG CGGTGGCGGC GGCGCCGGCG GCGCCGGCGG 120CAGCGGCTTT TTCGGCGGCA AGGGCGGCTT CGGCGGCGAC GGCGGTCAGG GCGGCCCCAA 180CGGCGGCGGT ACCGTCGGCA CCGTGGCCGG TGGCGGCGGC AACGGCGGTG TCGGCGGCCG 240GGGCGGCGAC GGCGTCTTTG CCGGTGCCGG CGGCCAGGGC GGCCTCGGTG GGCAGGGCGG 300CAATGGCGGC GGCTCCACCG GCGGCAACGG CGGCCTTGGC GGCGCGGGCG GTGGCGGAGG 360CAACGCCCCG GCTCGTGCCG AATCCGGGCT GACCATGGAC AGCGCGGCCA AGTTCGCTGC 420CATCGCATCA GGCGCGTACT GCCCCGAACA CCTGGAACAT CACCCGAGTT AGCGGGGCGC 480ATTTCCTGAT CACC 494(2)SEQ ID NO:172的信息:
(ⅰ)序列特征:
(A)长度:220碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:172:GGGCCGGTGG TGCCGCGGGC CAGCTCTTCA GCGCCGGAGG CGCGGCGGGT GCCGTTGGGG 60TTGGCGGCAC CGGCGGCCAG GGTGGGGCTG GCGGTGCCGG AGCGGCCGGC GCCGACGCCC 120CCGCCAGCAC AGGTCTAACC GGTGGTACCG GGTTCGCTGG CGGGGCCGGC GGCGTCGGCG 180GCCAGAGCGG CAACGCCATT GCCGGCGGCA TCAACGGCTC 220(2)SEQ ID NO:173的信息:
(ⅰ)序列特征:
(A)长度:388碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:173:ATGGCGGCAA CGGGGGCCCC GGCGGTGCTG GCGGGGCCGG CGACTACAAT TTCCAACGGC 60GGGCAGGGTG GTGCCGGCGG CCAAGGCGGC CAAGGCGGCC TGGGCGGGGC AAGCACCACC 120TGATCGGCCT AGCCGCACCC GGGAAAGCCG ATCCAACAGG CGACGATGCC GCCTTCCTTG 180CCGCGTTGGA CCAGGCCGGC ATCACCTACG CTGACCCAGG CCACGCCATA ACGGCCGCCA 240AGGCGATGTG TGGGCTGTGT GCTAACGGCG TAACAGGTCT ACAGCTGGTC GCGGACCTGC 300GGGACTACAA TCCCGGGCTG ACCATGGACA GCGCGGCCAA GTTCGCTGCC ATCGCATCAG 360GCGCGTACTG CCCCGAACAC CTGGAACA 388(2)SEQ ID NO:174的信息:
(ⅰ)序列特征:
(A)长度:400碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:174:GCAAAGGCGG CACCGGCGGG GCCGGCATGA ACAGCCTCGA CCCGCTGCTA GCCGCCCAAG 60ACGGCGGCCA AGGCGGCACC GGCGGCACCG GCGGCAACGC CGGCGCCGGC GGCACCAGCT 120TCACCCAAGG CGCCGACGGC AACGCCGGCA ACGGCGGTGA CGGCGGGGTC GGCGGCAACG 180GCGGAAACGG CGGAAACGGC GCAGACAACA CCACCACCGC CGCCGCCGGC ACCACAGGCG 240GCGACGGCGG GGCCGGCGGG GCCGGCGGAA CCGGCGGAAC CGGCGGAGCC GCCGGCACCG 300GCACCGGCGG CCAACAAGGC AACGGCGGCA ACGGCGGCAC CGGCGGCAAA GGCGGCACCG 360GCGGCGACGG TGCACTCTCA GGCAGCACCG GTGGTGCCGG 400(2)SEQ ID NO:175的信息:
(ⅰ)序列特征:
(A)长度:538碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:175:GGCAACGGCG GCAACGGCGG CATCGCCGGC ATTGGGCGGC AACGGCGTTC CGGGACGGGC 60AGCGGCAACG GCGGCCAACG GCGGCAGCGG CGGCAACGGC GGCAACGCCG GCATGGGCGG 120CAACAGCGGC ACCGGCAGCG GCGACGGCGG TGCCGGCGGG AACGGCGGCG CGGCGGGCAC 180GGGCGGCACC GGCGGCGACG GCGGCCTCAC CGGTACTGGC GGCACCGGCG GCAGCGGTGG 240CACCGGCGGT GACGGCGGTA ACGGCGGCAA CGGAGCAGAT AACACCGCAA ACATGACTGC 300GCAGGCGGGC GGTGACGGTG GCAACGGCGG CGACGGTGGC TTCGGCGGCG GGGCCGGGGC 360CGGCGGCGGT GGCTTGACCG CTGGCGCCAA CGGCACCGGC GGGCAAGGCG GCGCCGGCGG 420CGATGGCGGC AACGGGGCCA TCGGCGGCCA CGGCCCACTC ACTGACGACC CCGGCGGCAA 480CGGGGGCACC GGCGGCAACG GCGGCACCGG CGGCACCGGC GGCGCGGGCA TCGGCAGC 538(2)SEQ ID NO:176的信息:
(ⅰ)序列特征:
(A)长度:239碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:176:GGGCCGGTGG TGCCGCGGGC CAGCTCTTCA GCGCCGGAGG CGCGGCGGGT GCCGTTGGGG 60TTGGCGGCAC CGGCGGCCAG GGTGGGGCTG GCGGTGCCGG AGCGGCCGGC GCCGACGCCC 120CCGCCAGCAC AGGTCTAACC GGTGGTACCG GGTTCGCTGG CGGGGCCGGC GGCGTCGGCG 180GCCACGGCGG CAACGCCATT GCCGGCGGCA TCAACGGCTC CGGTGGTGCC GGCGGCACC 239(2)SEQ ID NO:177的信息:
(ⅰ)序列特征:
(A)长度:985碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:177:AGCAGCGCTA CCGGTGGCGC CGGGTTCGCC GGCGGCGCCG GCGGAGAAGG CGGAGCGGGC 60GGCAACAGCG GTGTGGGCGG CACCAACGGC TCCGGCGGCG CCGGCGGTGC AGGCGGCAAG 120GGCGGCACCG GAGGTGCCGG CGGGTCCGGC GCGGACAACC CCACCGGTGC TGGTTTCGCC 180GGTGGCGCCG GCGGCACAGG TGGCGCGGCC GGCGCCGGCG GGGCCGGCGG GGCGACCGGT 240ACCGGCGGCA CCGGCGGCGT TGTCGGCGCC ACCGGTAGTG CAGGCATCGG CGGGGCCGGC 300GGCCGCGGCG GTGACGGCGG CGATGGGGCC AGCGGTCTCG GCCTGGGCCT CTCCGGCTTT 360GACGGCGGCC AAGGCGGCCA AGGCGGGGCC GGCGGCAGCG CCGGCGCCGG CGGCATCAAC 420GGGGCCGGCG GGGCCGGCGG CAACGGCGGC GACGGCGGGG ACGGCGCAAC CGGTGCCGCA 480GGTCTCGGCG ACAACGGCGG GGTCGGCGGT GACGGTGGGG CCGGTGGCGC CGCCGGCAAC 540GGCGGCAACG CGGGCGTCGG CCTGACAGCC AAGGCCGGCG ACGGCGGCGC CGCGGGCAAT 600GGCGGCAACG GGGGCGCCGG CGGTGCTGGC GGGGCCGGCG ACAACAATTT CAACGGCGGC 660CAGGGTGGTG CCGGCGGCCA AGGCGGCCAA GGCGGCTTGG GCGGGGCAAG CACCACCTGA 720TCGGCCTAGC CGCACCCGGG AAAGCCGATC CAACAGGCGA CGATGCCGCC TTCCTTGCCG 780CGTTGGACCA GGCCGGCATC ACCTACGCTG ACCCAGGCCA CGCCATAACG GCCGCCAAGG 840CGATGTGTGG GCTGTGTGCT AACGGCGTAA CAGGTCTACA GCTGGTCGCG GACCTGCGGG 900AATACAATCC CGGGCTGACC ATGGACAGCG CGGCCAAGTT CGCTGCCATC GCATCAGGCG 960CGTACTGCCC CGAACACCTG GAACA 985(2)SEQ ID NO:178的信息:
(ⅰ)序列特征:
(A)长度:2138碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:178:CGGCACGAGG ATCGGTACCC CGCGGCATCG GCAGCTGCCG ATTCGCCGGG TTTCCCCACC 60CGAGGAAAGC CGCTACCAGA TGGCGCTGCC GAAGTAGGGC GATCCGTTCG CGATGCCGGC 120ATGAACGGGC GGCATCAAAT TAGTGCAGGA ACCTTTCAGT TTAGCGACGA TAATGGCTAT 180AGCACTAAGG AGGATGATCC GATATGACGC AGTCGCAGAC CGTGACGGTG GATCAGCAAG 240AGATTTTGAA CAGGGCCAAC GAGGTGGAGG CCCCGATGGC GGACCCACCG ACTGATGTCC 300CCATCACACC GTGCGAACTC ACGGCGGCTA AAAACGCCGC CCAACAGCTG GTATTGTCCG 360CCGACAACAT GCGGGAATAC CTGGCGGCCG GTGCCAAAGA GCGGCAGCGT CTGGCGACCT 420CGCTGCGCAA CGCGGCCAAG GCGTATGGCG AGGTTGATGA GGAGGCTGCG ACCGCGCTGG 480ACAACGACGG CGAAGGAACT GTGCAGGCAG AATCGGCCGG GGCCGTCGGA GGGGACAGTT 540CGGCCGAACT AACCGATACG CCGAGGGTGG CCACGGCCGG TGAACCCAAC TTCATGGATC 600TCAAAGAAGC GGCAAGGAAG CTCGAAACGG GCGACCAAGG CGCATCGCTC GCGCACTTTG 660CGGATGGGTG GAACACTTTC AACCTGACGC TGCAAGGCGA CGTCAAGCGG TTCCGGGGGT 720TTGACAACTG GGAAGGCGAT GCGGCTACCG CTTGCGAGGC TTCGCTCGAT CAACAACGGC 780AATGGATACT CCACATGGCC AAATTGAGCG CTGCGATGGC CAAGCAGGCT CAATATGTCG 840CGCAGCTGCA CGTGTGGGCT AGGCGGGAAC ATCCGACTTA TGAAGACATA GTCGGGCTCG 900AACGGCTTTA CGCGGAAAAC CCTTCGGCCC GCGACCAAAT TCTCCCGGTG TACGCGGAGT 960ATCAGCAGAG GTCGGAGAAG GTGCTGACCG AATACAACAA CAAGGCAGCC CTGGAACCGG 1020TAAACCCGCC GAAGCCTCCC CCCGCCATCA AGATCGACCC GCCCCCGCCT CCGCAAGAGC 1080AGGGATTGAT CCCTGGCTTC CTGATGCCGC CGTCTGACGG CTCCGGTGTG ACTCCCGGTA 1140CCGGGATGCC AGCCGCACCG ATGGTTCCGC CTACCGGATC GCCGGGTGGT GGCCTCCCGG 1200CTGACACGGC GGCGCAGCTG ACGTCGGCTG GGCGGGAAGC CGCAGCGCTG TCGGGCGACG 1260TGGCGGTCAA AGCGGCATCG CTCGGTGGCG GTGGAGGCGG CGGGGTGCCG TCGGCGCCGT 1320TGGGATCCGC GATCGGGGGC GCCGAATCGG TGCGGCCCGC TGGCGCTGGT GACATTGCCG 1380GCTTAGGCCA GGGAAGGGCC GGCGGCGGCG CCGCGCTGGG CGGCGGTGGC ATGGGAATGC 1440CGATGGGTGC CGCGCATCAG GGACAAGGGG GCGCCAAGTC CAAGGGTTCT CAGCAGGAAG 1500ACGAGGCGCT CTACACCGAG GATCGGGCAT GGACCGAGGC CGTCATTGGT AACCGTCGGC 1560GCCAGGACAG TAAGGAGTCG AAGTGAGCAT GGACGAATTG GACCCGCATG TCGCCCGGGC 1620GTTGACGCTG GCGGCGCGGT TTCAGTCGGC CCTAGACGGG ACGCTCAATC AGATGAACAA 1680CGGATCCTTC CGCGCCACCG ACGAAGCCGA GACCGTCGAA GTGACGATCA ATGGGCACCA 1740GTGGCTCACC GGCCTGCGCA TCGAAGATGG TTTGCTGAAG AAGCTGGGTG CCGAGGCGGT 1800GGCTCAGCGG GTCAACGAGG CGCTGCACAA TGCGCAGGCC GCGGCGTCCG CGTATAACGA 1860CGCGGCGGGC GAGCAGCTGA CCGCTGCGTT ATCGGCCATG TCCCGCGCGA TGAACGAAGG 1920AATGGCCTAA GCCCATTGTT GCGGTGGTAG CGACTACGCA CCGAATGAGC GCCGCAATGC 1980GGTCATTCAG CGCGCCCGAC ACGGCGTGAG TACGCATTGT CAATGTTTTG ACATGGATCG 2040GCCGGGTTCG GAGGGCGCCA TAGTCCTGGT CGCCAATATT GCCGCAGCTA GCTGGTCTTA 2100GGTTCGGTTA CGCTGGTTAA TTATGACGTC CGTTACCA 2138(2)SEQ ID NO:179的信息:
(ⅰ)序列特征:
(A)长度:460氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:179:Met Thr Gln Ser Gln Thr Val Thr Val Asp Gln Gln Glu Ile Leu Asn1 5 10 15Arg Ala Asn Glu Val Glu Ala Pro Met Ala Asp Pro Pro Thr Asp Val
20 25 30Pro Ile Thr Pro Cys Glu Leu Thr Ala Ala Lys Asn Ala Ala Gln Gln
35 40 45Leu Val Leu Ser Ala Asp Asn Met Arg Glu Tyr Leu Ala Ala Gly Ala
50 55 60Lys Glu Arg Gln Arg Leu Ala Thr Ser Leu Arg Asn Ala Ala Lys Ala65 70 75 80Tyr Gly Glu Val Asp Glu Glu Ala Ala Thr Ala Leu Asp Asn Asp Gly
85 90 95Glu Gly Thr Val Gln Ala Glu Ser Ala Gly Ala Val Gly Gly Asp Ser
100 105 110Ser Ala Glu Leu Thr Asp Thr Pro Arg Val Ala Thr Ala Gly Glu Pro
115 120 125Asn Phe Met Asp Leu Lys Glu Ala Ala Arg Lys Leu Glu Thr Gly Asp
130 135 140Gln Gly Ala Ser Leu Ala His Phe Ala Asp Gly Trp Asn Thr Phe Asn145 150 155 160Leu Thr Leu Gln Gly Asp Val Lys Arg Phe Arg Gly Phe Asp Asn Trp
165 170 175Glu Gly Asp Ala Ala Thr Ala Cys Glu Ala Ser Leu Asp Gln Gln Arg
180 185 190Gln Trp Ile Leu His Met Ala Lys Leu Ser Ala Ala Met Ala Lys Gln
195 200 205Ala Gln Tyr Val Ala Gln Leu His Val Trp Ala Arg Arg Glu His Pro
210 215 220Thr Tyr Glu Asp Ile Val Gly Leu Glu Arg Leu Tyr Ala Glu Asn Pro225 230 235 240Ser Ala Arg Asp Gln Ile Leu Pro Val Tyr Ala Glu Tyr Gln Gln Arg
245 250 255Ser Glu Lys Val Leu Thr Glu Tyr Asn Asn Lys Ala Ala Leu Glu Pro
260 265 270Val Asn Pro Pro Lys Pro Pro Pro Ala Ile Lys Ile Asp Pro Pro Pro
275 280 285Pro Pro Gln Glu Gln Gly Leu Ile Pro Gly Phe Leu Met Pro Pro Ser
290 295 300Asp Gly Ser Gly Val Thr Pro Gly Thr Gly Met Pro Ala Ala Pro Met305 310 315 320Val Pro Pro Thr Gly Ser Pro Gly Gly Gly Leu Pro Ala Asp Thr Ala
325 330 335Ala Gln Leu Thr Ser Ala Gly Arg Glu Ala Ala Ala Leu Ser Gly Asp
340 345 350Val Ala Val Lys Ala Ala Ser Leu Gly Gly Gly Gly Gly Gly Gly Val
355 360 365Pro Ser Ala Pro Leu Gly Ser Ala Ile Gly Gly Ala Glu Ser Val Arg
370 375 380Pro Ala Gly Ala Gly Asp Ile Ala Gly Leu Gly Gln Gly Arg Ala Gly385 390 395 400Gly Gly Ala Ala Leu Gly Gly Gly Gly Met Gly Met Pro Met Gly Ala
405 410 415Ala His Gln Gly Gln Gly Gly Ala Lys Ser Lys Gly Ser Gln Gln Glu
420 425 430Asp Glu Ala Leu Tyr Thr Glu Asp Arg Ala Trp Thr Glu Ala Val Ile
435 440 445Gly Asn Arg Arg Arg Gln Asp Ser Lys Glu Ser Lys
450 455 460(2)SEQ ID NO:180的信息:
(ⅰ)序列特征:
(A)长度:277氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:180:Ala Gly Asn Val Thr Ser Ala Ser Gly Pro His Arg Phe Gly Ala Pro1 5 10 15Asp Arg Gly Ser Gln Arg Arg Arg Arg His Pro Ala Ala Ser Thr Ala
20 25 30Thr Glu Arg Cys Arg Phe Asp Arg His Val Ala Arg Gln Arg Cys Gly
35 40 45Phe Pro Pro Ser Arg Arg Gln Leu Arg Arg Arg Val Ser Arg Glu Ala
50 55 60Thr Thr Arg Arg Ser Gly Arg Arg Asn His Arg Cys Gly Trp His Pro65 70 75 80Gly Thr Gly Ser His Thr Gly Ala Val Arg Arg Arg His Gln Glu Ala
85 90 95Arg Asp Cln Ser Leu Leu Leu Arg Arg Arg Gly Arg Val Asp Leu Asp
100 105 110Gly Gly Gly Arg Leu Arg Arg Val Tyr Arg Phe Gln Gly Cys Leu Val
115 120 125Val Val Phe Gly Gln His Leu Leu Arg Pro Leu Leu Ile Leu Arg Val
130 135 140His Arg Glu Asn Leu Val Ala Gly Arg Arg Val Phe Arg Val Lys Pro145 150 155 160Phe Glu Pro Asp Tyr Val Phe Ile Ser Arg Met Phe Pro Pro Ser Pro
165 170 175His Val Gln Leu Arg Asp Ile Leu Ser Leu Leu Gly His Arg Ser Ala
180 185 190Gln Phe Gly His Val Glu Tyr Pro Leu Pro Leu Leu Ile Glu Arg Ser
195 200 205Leu Ala Ser Gly Ser Arg Ile Ala Phe Pro Val Val Lys Pro Pro Glu
210 215 220Pro Leu Asp Val Ala Leu Gln Arg Gln Val Glu Ser Val Pro Pro Ile225 230 235 240Arg Lys Val Arg Glu Arg Cys Ala Leu Val Ala Arg Phe Glu Leu Pro
245 250 255Cys Arg Phe Phe Glu Ile His Glu Val Gly Phe Thr Gly Arg Gly His
260 265 270
Pro Arg Arg Ile Gly
275(2)SEQ ID NO:181的信息:
(ⅰ)序列特征:
(A)长度:192氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:181:Arg Val Ala Ala Ser Phe Ile Asp Trp Leu Asp Ser Pro Asp Ser Pro1 5 10 15Leu Asp Pro Ser Leu Val Ser Ser Leu Leu Asn Ala Val Ser Cys Gly
20 25 30Ala Glu Ser Ser Ala Ser Ser Ser Ala Arg Ser Gly Asn Gly Ser Arg
35 40 45Trp Thr Ser Met Pro Ser Gly Thr Arg Pro Gly Pro Arg Arg Ala Thr
50 55 60Ser Arg Asp Asp Arg Arg Ser Ala Thr Ser Val Ile Pro Ser Arg Arg65 70 75 80Ser Val Ala Pro Arg Ala Glu Phe Gly Thr Arg Leu Ala Ser His Arg
85 90 95Ala Ser Pro Ser Asn Ala Cys Pro Val Arg Ile Val Thr Ser Ala Ser
100 105 110Gly Arg Pro Ile Ser Ser Pro Pro Ile Val Arg Ser Arg Ser Cys Val
115 120 125Asp Lys Asn Gly Arg Arg Cys Ala Ser Gly Tyr Arg Arg Leu Asn Arg
130 135 140Ala Arg Ser Ser Ser Ile Ala Ala Arg Cys Arg Thr Ile Gly Thr Phe145 150 155 160Arg Arg Ser Arg Tyr Ser Ala Ser Met Arg Val Ser Thr Asn Ser Pro
165 170 175His Val Thr His Gly Val Ala Pro Gly Val Thr Arg Arg Ile Gly Gly
180 185 190(2)SEQ ID NO:182的信息:
(ⅰ)序列特征:
(A)长度:196氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:182:Gln Glu Arg Pro Gln Met Cys Gln Arg Val Ser Glu Ile Glu Pro Arg1 5 10 15Thr Gln Phe Phe Asn Arg Cys Ala Leu Pro His Tyr Trp His Phe Pro
20 25 30Ala Val Ala Val Phe Ser Lys His Ala Ser Leu Asp Glu Leu Ala Pro
35 40 45Arg Asn Pro Arg Arg Ser Ser Arg Arg Asp Ala Glu Asp Arg Arg Val
50 55 60Ile Phe Ala Ala Thr Leu Val Ala Val Asp Pro Pro Leu Arg Gly Ala65 70 75 80Gly Gly Glu Ala Asp Gln Leu Ile Asp Leu Gly Val Cys Arg Arg Gln
85 90 95Ala Gly Arg Val Arg Arg Gly Gln Glu Leu His His Arg His Arg His
100 105 110Gln Gly Ala Ala Pro Asp Leu Arg Arg Arg Arg Arg His Arg Arg Val
115 120 125Gln Gln His Arg Arg Leu Gln Arg Val Arg Gln Leu Arg Arg Tyr Val
130 135 140Gln Thr Ala His His Arg Arg Phe Ala Arg Thr Asp Arg Val Arg His145 150 155 160His Val Arg Gly Pro Ser Asn His Arg Arg Arg Arg Val Tyr Arg Gly
165 170 175Arg His Ser Gly Ala Gly Gly Cys Pro Ala Gly Gly Ala Gly Ser Val
180 185 190Gly Gly Ser Ala
195(2)SEQ ID NO:183的信息:
(ⅰ)序列特征:
(A)长度:311氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:183:Val Arg Cys Gly Thr Leu Val Pro Val Pro Met Val Glu Phe Leu Thr1 5 10 15Ser Thr Asn Ala Pro Ser Leu Pro Ser Ala Tyr Ala Glu Val Asp Lys
20 25 30Leu Ile Gly Leu Pro Ala Gly Thr Ala Lys Arg Trp Ile Asn Gly Tyr
35 40 45Glu Arg Gly Gly Lys Asp His Pro Pro Ile Leu Arg Val Thr Pro Gly
50 55 60Ala Thr Pro Trp Val Thr Trp Gly Glu Phe Val Glu Thr Arg Met Leu65 70 75 80Ala Glu Tyr Arg Asp Arg Arg Lys Val Pro Ile Val Arg Gln Arg Ala
85 90 95Ala Ile Glu Glu Leu Arg Ala Arg Phe Asn Leu Arg Tyr Pro Leu Ala
100 105 110His Leu Arg Pro Phe Leu Ser Thr His Glu Arg Asp Leu Thr Met Gly
115 120 125Gly Glu Glu Ile Gly Leu Pro Asp Ala Glu Val Thr Ile Arg Thr Gly
130 135 140Gln Ala Leu Leu Gly Asp Ala Arg Trp Leu Ala Ser Leu Val Pro Asn145 150 155 160Ser Ala Arg Gly Ala Thr Leu Arg Arg Leu Gly Ile Thr Asp Val Ala
165 170 175Asp Leu Arg Ser Ser Arg Glu Val Ala Arg Arg Gly Pro Gly Arg Val
180 185 190Pro Asp Gly Ile Asp Val His Leu Leu Pro Phe Pro Asp Leu Ala Asp
195 200 205Asp Asp Ala Asp Asp Ser Ala Pro His Glu Thr Ala Phe Lys Arg Leu
210 215 220Leu Thr Asn Asp Gly Ser Asn Gly Glu Ser Gly Glu Ser Ser Gln Ser225 230 235 240Ile Asn Asp Ala Ala Thr Arg Tyr Met Thr Asp Glu Tyr Arg Gln Phe
245 250 255Pro Thr Arg Asn Gly Ala Gln Arg Ala Leu His Arg Val Val Thr Leu
260 265 270Leu Ala Ala Gly Arg Pro Val Leu Thr His Cys Phe Ala Gly Lys Asp
275 280 285Arg Thr Gly Phe Val Val Ala Leu Val Leu Glu Ala Val Gly Leu Asp
290 295 300Arg Asp Val Ile Val Ala Asp305 310(2)SEQ ID NO:184的信息:
(ⅰ)序列特征:
(A)长度:2072碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:184:CTCGTGCCGA TTCGGCACGA GCTGAGCAGC CCAAGGGGCC GTTCGGCGAA GTCATCGAGG 60CATTCGCCGA CGGGCTGGCC GGCAAGGGTA AGCAAATCAA CACCACGCTG AACAGCCTGT 120CGCAGGCGTT GAACGCCTTG AATGAGGGCC GCGGCGACTT CTTCGCGGTG GTACGCAGCC 180TGGCGCTATT CGTCAACGCG CTACATCAGG ACGACCAACA GTTCGTCGCG TTGAACAAGA 240ACCTTGCGGA GTTCACCGAC AGGTTGACCC ACTCCGATGC GGACCTGTCG AACGCCATCC 300AGCAATTCGA CAGCTTGCTC GCCGTCGCGC GCCCGTTCTT CGCCAAGAAC CGCGAGGTGC 360TGACGCATGA CGTCAATAAT CTCGCGACCG TGACCACCAC GTTGCTGCAG CCCGATCCGT 420TGGATGGGTT GGAGACCGTC CTGCACATCT TCCCGACGCT GGCGGCGAAC ATTAACCAGC 480TTTACCATCC GACACACGGT GGCGTGGTGT CGCTTTCCGC GTTCACGAAT TTCGCCAACC 540CGATGGAGTT CATCTGCAGC TCGATTCAGG CGGGTAGCCG GCTCGGTTAT CAAGAGTCGG 600CCGAACTCTG TGCGCAGTAT CTGGCGCCAG TCCTCGATGC GATCAAGTTC AACTACTTTC 660CGTTCGGCCT GAACGTGGCC AGCACCGCCT CGACACTGCC TAAAGAGATC GCGTACTCCG 720AGCCCCGCTT GCAGCCGCCC AACGGGTACA AGGACACCAC GGTGCCCGGC ATCTGGGTGC 780CGGATACGCC GTTGTCACAC CGCAACACGC AGCCCGGTTG GGTGGTGGCA CCCGGGATGC 840AAGGGGTTCA GGTGGGACCG ATCACGCAGG GTTTGCTGAC GCCGGAGTCC CTGGCCGAAC 900TCATGGGTGG TCCCGATATC GCCCCTCCGT CGTCAGGGCT GCAAACCCCG CCCGGACCCC 960CGAATGCGTA CGACGAGTAC CCCGTGCTGC CGCCGATCGG TTTACAGGCC CCACAGGTGC 1020CGATACCACC GCCGCCTCCT GGGCCCGACG TAATCCCGGG TCCGGTGCCA CCGGTCTTGG 1080CGGCGATCGT GTTCCCAAGA GATCGCCCGG CAGCGTCGGA AAACTTCGAC TACATGGGCC 1140TCTTGTTGCT GTCGCCGGGC CTGGCGACCT TCCTGTTCGG GGTGTCATCT AGCCCCGCCC 1200GTGGAACGAT GGCCGATCGG CACGTGTTGA TACCGGCGAT CACCGGCCTG GCGTTGATCG 1260CGGCATTCGT CGCACATTCG TGGTACCGCA CAGAACATCC GCTCATAGAC ATGCGCTTGT 1320TCCAGAACCG AGCGGTCGCG CAGGCCAACA TGACGATGAC GGTGCTCTCC CTCGGGCTGT 1380TTGGCTCCTT CTTGCTGCTC CCGAGCTACC TCCAGCAAGT GTTGCACCAA TCACCGATGC 1440AATCGGGGGT GCATATCATC CCACAGGGCC TCGGTGCCAT GCTGGCGATG CCGATCGCCG 1500GAGCGATGAT GGACCGACGG GGACCGGCCA AGATCGTGCT GGTTGGGATC ATGCTGATCG 1560CTGCGGGGTT GGGCACCTTC GCCTTTGGTG TCGCGCGGCA AGCGGACTAC TTACCCATTC 1620TGCCGACCGG GCTGGCAATC ATGGGCATGG GCATGGGCTG CTCCATGATG CCACTGTCCG 1680GGGCGGCAGT GCAGACCCTG GCCCCACATC AGATCGCTCG CGGTTCGACG CTGATCAGCG 1740TCAACCAGCA GGTGGGCGGT TCGATAGGGA CCGCACTGAT GTCGGTGCTG CTCACCTACC 1800AGTTCAATCA CAGCGAAATC ATCGCTACTG CAAAGAAAGT CGCACTGACC CCAGAGAGTG 1860GCGCCGGGCG GGGGGCGGCG GTTGACCCTT CCTCGCTACC GCGCCAAACC AACTTCGCGG 1920CCCAACTGCT GCATGACCTT TCGCACGCCT ACGCGGTGGT ATTCGTGATA GCGACCGCGC 1980TAGTGGTCTC GACGCTGATC CCCGCGGCAT TCCTGCCGAA ACAGCAGGCT AGTCATCGAA 2040GAGCACCGTT GCTATCCGCA TGACGTCTGC TT 2072(2)SEQ ID NO:185的信息:
(ⅰ)序列特征:
(A)长度:1923碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:185:TCACCCCGGA GAAGTCGTTC GTCGACGACC TGGACATCGA CTCGCTGTCG ATGGTCGAGA 60TCGCCGTGCA GACCGAGGAC AAGTACGGCG TCAAGATCCC CGACGAGGAC CTCGCCGGTC 120TGCGTACCGT CGGTGACGTT GTCGCCTACA TCCAGAAGCT CGAGGAAGAA AACCCGGAGG 180CGGCTCAGGC GTTGCGCGCG AAGATTGAGT CGGAGAACCC CGATGCGGCA CGAGCAGATC 240GGTGCGTTTC ACCCACATCG CAAGCTCGAG ACGCCCGTCG TCCTCTTGCA CGCTCAGCCA 300GGTTGGCGTG TCGCCGCCTT CCAGCAAGTG TTCCCACCAC ACGAAGGGAC CCTCGCGAAA 360GGTGACTGAT CCGCGGACCA CATAGTCGAT GCCACCGTGG CTGACAATTG CGCCGGGTCC 420GAGTTGGCGG GGGCCGAATT GCGGCATTGC GTCGAAGGCC AGCGGATCCC GGCGCCCGCC 480CGGCGTGGCT GGTGTTTTGG GCCGCCGGAT GGCCACGACG AGAACGACGA TGGCGGCGAT 540GAACAGCGCC ACGGCAATCA CGACCAGCAG ATTTCCCACG CATACCCTCT CGTACCGCTG 600CGCCGCGGTT GGTCGATCGG TCGCATATCG ATGGCGCCGT TTAACGTAAC AGCTTTCGCG 660GGACCGGGGG TCACAACGGG CGAGTTGTCC GGCCGGGAAC CCGGCAGGTC TCGGCCGCGG 720TCACCCCAGC TCACTGGTGC ACCATCCGGG TGTCGGTGAG CGTGCAACTC AAACACACTC 780AACGGCAACG GTTTCTCAGG TCACCAGCTC AACCTCGACC CGCAATCGCT CGTACGTTTC 840GACCGCGCGC AGGTCGCGAG TCAGCAGCTT TGCGCCGGCA GCTTTCGCCG TGAAGCCGAC 900CAGGGCATCG TAGGTTGCGC CACCGGTGAC ATCGTGCTCG GCGAGGTGGT CGGTCAAGCC 960GCGATATGAG CAGGCATCCA GTGCCAGGTA GTTGCTGGAG GTGATGTCCG CCAAGTAGGC 1020GTGGACGGCA ACAGGGGCAA TACGATGCGG CGGTGGTAGC CGGGTCAAGA CCGAATAGGT 1080TTCCACAGCC GCGTGCGCGA TCAGATGGAC GCCACGGTTG AGCGCGCGCA CGGCGGCCTC 1140GTGCCCTTCG TGCCAGGTCG CGAATCCGGC AACCAGCACG CTGGTGTCTG GTGCGATCAC 1200CGCCGTGTGC GATCGAGCGT TTCCCGAACG ATTTCGTCGG TCAACGGGGG CAGGGGACGT 1260TCTGGCCGTG CGACGAGAAC CGAGCCTTCC CGAACGAGTT CGACACCGGT CGGGGCCGGC 1320TCAATCTCGA TGCGCCCATC GCGCTCGGTG ATCTCCACCT GGTCGTTCCC GCGCAAGCCA 1380AGGCGCTCGC GAATCCGCTT GGGAATCACC AGACGTCCTG CGACATCGAT GGTTGTTCGC 1440ATGGTAGGAA ATTTACCATC GCACGTTCCA TAGGCGTGTC CTGCGCGGGA TGTCGGGACG 1500ATCCGCTAGC GTATCGAACG ATTGTTTCGG AAATGGCTGA GGGAGCGTGC GGTGCGGGTG 1560ATGGGTGTCG ATCCCGGGTT GACCCGATGC GGGCTGTCGC TCATCGAGAG TGGGCGTGGT 1620CGGCAGCTCA CCGCGCTGGA TGTCGACGTG GTGCGCACAC CGTCGGATGC GGCCTTGGCG 1680CAGCGCCTGT TGGCCATCAG CGATGCCGTC GAGCACTGGC TGGACACCCA TCATCCGGAG 1740GTGGTGGCTA TCGAACGGGT GTTCTCTCAG CTCAACGTGA CCACGGTGAT GGGCACCGCG 1800CAGGCCGGCG GCGTGATCGC CCTGGCGGCG GCCAAACGTG GTGTCGACGT GCATTTCCAT 1860ACCCCCAGCG AGGTCAAGGC GGCGGTCACT GGCAACGGTT CCGCAGACAA GGCTCAGGTC 1920ACC 1923(2)SEQ ID NO:186的信息:
(ⅰ)序列特征:
(A)长度:1055碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:186:CTGGCGTGCC AGTGTCACCG GCGATATGAC GTCGGCATTC AATTTCGCGG CCCCGCCGGA 60CCCGTCGCCA CCCAATCTGG ACCACCCGGT CCGTCAATTG CCGAAGGTCG CCAAGTGCGT 120GCCCAATGTG GTGCTGGGTT TCTTGAACGA AGGCCTGCCG TATCGGGTGC CCTACCCCCA 180AACAACGCCA GTCCAGGAAT CCGGTCCCGC GCGGCCGATT CCCAGCGGCA TCTGCTAGCC 240GGGGATGGTT CAGACGTAAC GGTTGGCTAG GTCGAAACCC GCGCCAGGGC CGCTGGACGG 300GCTCATGGCA GCGAAATTAG AAAACCCGGG ATATTGTCCG CGGATTGTCA TACGATGCTG 360AGTGCTTGGT GGTTCGTGTT TAGCCATTGA GTGTGGATGT GTTGAGACCC TGGCCTGGAA 420GGGGACAACG TGCTTTTGCC TCTTGGTCCG CCTTTGCCGC CCGACGCGGT GGTGGCGAAA 480CGGGCTGAGT CGGGAATGCT CGGCGGGTTG TCGGTTCCGC TCAGCTGGGG AGTGGCTGTG 540CCACCCGATG ATTATGACCA CTGGGCGCCT GCGCCGGAGG ACGGCGCCGA TGTCGATGTC 600CAGGCGGCCG AAGGGGCGGA CGCAGAGGCC GCGGCCATGG ACGAGTGGGA TGAGTGGCAG 660GCGTGGAACG AGTGGGTGGC GGAGAACGCT GAACCCCGCT TTGAGGTGCC ACGGAGTAGC 720AGCAGCGTGA TTCCGCATTC TCCGGCGGCC GGCTAGGAGA GGGGGCGCAG ACTGTCGTTA 780TTTGACCAGT GATCGGCGGT CTCGGTGTTC CCGCGGCCGG CTATGACAAC AGTCAATGTG 840CATGACAAGT TACAGGTATT AGGTCCAGGT TCAACAAGGA GACAGGCAAC ATGGCAACAC 900GTTTTATGAC GGATCCGCAC GCGATGCGGG ACATGGCGGG CCGTTTTGAG GTGCACGCCC 960AGACGGTGGA GGACGAGGCT CGCCGGATGT GGGCGTCCGC GCAAAACATC TCGGGNGCGG 1020GCTGGAGTGG CATGGCCGAG GCGACCTCGC TAGAC 1055(2)SEQ ID NO:187的信息:
(ⅰ)序列特征:
(A)长度:359碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:187:CCGCCTCGTT GTTGGCATAC TCCGCCGCGG CCGCCTCGAC CGCACTGGCC GTGGCGTGTG 60TCCGGGCTGA CCACCGGGAT CGCCGAACCA TCCGAGATCA CCTCGCAATG ATCCACCTCG 120CGCAGCTGGT CACCCAGCCA CCGGGCGGTG TGCGACAGCG CCTGCATCAC CTTGGTATAG 180CCGTCGCGCC CCAGCCGCAG GAAGTTGTAG TACTGGCCCA CCACCTGGTT ACCGGGACGG 240GAGAAGTTCA GGGTGAAGGT CGGCATGTCG CCGCCGAGGT AGTTGACCCG GAAAACCAGA 300TCCTCCGGCA GGTGCTCGGG CCCGCGCCAC ACGACAAACC CGACGCCGGG ATAGGTCAG 359(2)SEQ ID NO:188的信息:
(ⅰ)序列特征:
(A)长度:350碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:188:AACGGGCCCG TGGGCACCGC TCCTCTAAGG GCTCTCGTTG GTCGCATGAA GTGCTGGAAG 60GATGCATCTT GGCAGATTCC CGCCAGAGCA AAACAGCCGC TAGTCCTAGT CCGAGTCGCC 120CGCAAAGTTC CTCGAATAAC TCCGTACCCG GAGCGCCAAA CCGGGTCTCC TTCGCTAAGC 180TGCGCGAACC ACTTGAGGTT CCGGGACTCC TTGACGTCCA GACCGATTCG TTCGAGTGGC 240TGATCGGTTC GCCGCGCTGG CGCGAATCCG CCGCCGAGCG GGGTGATGTC AACCCAGTGG 300GTGGCCTGGA AGAGGTGCTC TACGAGCTGT CTCCGATCGA GGACTTCTCC 350(2)SEQ ID NO:189的信息:
(ⅰ)序列特征:
(A)长度:679氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:189:Glu Gln Pro Lys Gly Pro Phe Gly Glu Val Ile Glu Ala Phe Ala Asp1 5 10 15Gly Leu Ala Gly Lys Gly Lys Gln Ile Asn Thr Thr Leu Asn Ser Leu
20 25 30Ser Gln Ala Leu Asn Ala Leu Asn Glu Gly Arg Gly Asp Phe Phe Ala
35 40 45Val Val Arg Ser Leu Ala Leu Phe Val Asn Ala Leu His Gln Asp Asp
50 55 60Gln Gln Phe Val Ala Leu Asn Lys Asn Leu Ala Glu Phe Thr Asp Arg65 70 75 80Leu Thr His Ser Asp Ala Asp Leu Ser Asn Ala Ile Gln Gln Phe Asp
85 90 95Ser Leu Leu Ala Val Ala Arg Pro Phe Phe Ala Lys Asn Arg Glu Val
100 105 110Leu Thr His Asp Val Asn Asn Leu Ala Thr Val Thr Thr Thr Leu Leu
115 120 125Gln Pro Asp Pro Leu Asp Gly Leu Glu Thr Val Leu His Ile Phe Pro
130 135 140Thr Leu Ala Ala Asn Ile Asn Gln Leu Tyr His Pro Thr His Gly Gly145 150 155 160Val Val Ser Leu Ser Ala Phe Thr Asn Phe Ala Asn Pro Met Glu Phe
165 170 175Ile Cys Ser Ser Ile Gln Ala Gly Ser Arg Leu Gly Tyr Gln Glu Ser
180 185 190Ala Glu Leu Cys Ala Gln Tyr Leu Ala Pro Val Leu Asp Ala Ile Lys
195 200 205Phe Asn Tyr Phe Pro Phe Gly Leu Asn Val Ala Ser Thr Ala Ser Thr
210 215 220Leu Pro Lys Glu Ile Ala Tyr Ser Glu Pro Arg Leu Gln Pro Pro Asn225 230 235 240Gly Tyr Lys Asp Thr Thr Val Pro Gly Ile Trp Val Pro Asp Thr Pro
245 250 255Leu Ser His Arg Asn Thr Gln Pro Gly Trp Val Val Ala Pro Gly Met
260 265 270Gln Gly Val Gln Val Gly Pro Ile Thr Gln Gly Leu Leu Thr Pro Glu
275 280 285Ser Leu Ala Glu Leu Met Gly Gly Pro Asp Ile Ala Pro Pro Ser Ser
290 295 300Gly Leu Gln Thr Pro Pro Gly Pro Pro Asn Ala Tyr Asp Glu Tyr Pro305 310 315 320Val Leu Pro Pro Ile Gly Leu Gln Ala Pro Gln Val Pro Ile Pro Pro
325 330 335Pro Pro Pro Gly Pro Asp Val Ile Pro Gly Pro Val Pro Pro Val Leu
340 345 350Ala Ala Ile Val Phe Pro Arg Asp Arg Pro Ala Ala Ser Glu Asn Phe
355 360 365Asp Tyr Met Gly Leu Leu Leu Leu Ser Pro Gly Leu Ala Thr Phe Leu
370 375 380Phe Gly Val Ser Ser Ser Pro Ala Arg Gly Thr Met Ala Asp Arg His385 390 395 400Val Leu Ile Pro Ala Ile Thr Gly Leu Ala Leu Ile Ala Ala Phe Val
405 410 415Ala His Ser Trp Tyr Arg Thr Glu His Pro Leu Ile Asp Met Arg Leu
420 425 430Phe Gln Asn Arg Ala Val Ala Gln Ala Asn Met Thr Met Thr Val Leu
435 440 445Ser Leu Gly Leu Phe Gly Ser Phe Leu Leu Leu Pro Ser Tyr Leu Gln
450 455 460Gln Val Leu His Gln Ser Pro Met Gln Ser Gly Val His Ile Ile Pro465 470 475 480Gln Gly Leu Gly Ala Met Leu Ala Met Pro Ile Ala Gly Ala Met Met
485 490 495Asp Arg Arg Gly Pro Ala Lys Ile Val Leu Val Gly Ile Met Leu Ile
500 505 510Ala Ala Gly Leu Gly Thr Phe Ala Phe Gly Val Ala Arg Gln Ala Asp
515 520 525Tyr Leu Pro Ile Leu Pro Thr Gly Leu Ala Ile Met Gly Met Gly Met
530 535 540Gly Cys Ser Met Met Pro Leu Ser Gly Ala Ala Val Gln Thr Leu Ala545 550 555 560Pro His Gln Ile Ala Arg Gly Ser Thr Leu Ile Ser Val Asn Gln Gln
565 570 575Val Gly Gly Ser Ile Gly Thr Ala Leu Met Ser Val Leu Leu Thr Tyr
580 585 590Gln Phe Asn His Ser Glu Ile Ile Ala Thr Ala Lys Lys Val Ala Leu
595 600 605Thr Pro Glu Ser Gly Ala Gly Arg Gly Ala Ala Val Asp Pro Ser Ser
610 615 620Leu Pro Arg Gln Thr Asn Phe Ala Ala Gln Leu Leu His Asp Leu Ser625 630 635 640His Ala Tyr Ala Val Val Phe Val Ile Ala Thr Ala Leu Val Val Ser
645 650 655Thr Leu Ile Pro Ala Ala Phe Leu Pro Lys Gln Gln Ala Ser His Arg
660 665 670Arg Ala Pro Leu Leu Ser Ala
675(2)SEQ ID NO:190的信息:
(ⅰ)序列特征:
(A)长度:120氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:190:Thr Pro Glu Lys Ser Phe Val Asp Asp Leu Asp Ile Asp Ser Leu Ser1 5 10 15Met Val Glu Ile Ala Val Gln Thr Glu Asp Lys Tyr Gly Val Lys Ile
20 25 30Pro Asp Glu Asp Leu Ala Gly Leu Arg Thr Val Gly Asp Val Val Ala
35 40 45Tyr Ile Gln Lys Leu Glu Glu Glu Asn Pro Glu Ala Ala Gln Ala Leu
50 55 60Arg Ala Lys Ile Glu Ser Glu Asn Pro Asp Ala Ala Arg Ala Asp Arg65 70 75 80Cys Val Ser Pro Thr Ser Gln Ala Arg Asp Ala Arg Arg Pro Leu Ala
85 90 95Arg Ser Ala Arg Leu Ala Cys Arg Arg Leu Pro Ala Ser Val Pro Thr
100 105 110Thr Arg Arg Asp Pro Arg Glu Arg
115 120(2)SEQ ID NO:191的信息:
(ⅰ)序列特征:
(A)长度:89氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:191:Leu Ala Cys Gln Cys His Arg Arg Tyr Asp Val Gly Ile Gln Phe Arg1 5 10 15Gly Pro Ala Gly Pro Val Ala Thr Gln Ser Gly Pro Pro Gly Pro Ser
20 25 30Ile Ala Glu Gly Arg Gln Val Arg Ala Gln Cys Gly Ala Gly Phe Leu
35 40 45Glu Arg Arg Pro Ala Val Ser Gly Ala Leu Pro Pro Asn Asn Ala Ser
50 55 60Pro Gly Ile Arg Ser Arg Ala Ala Asp Ser Gln Arg His Leu Leu Ala65 70 75 80Gly Asp Gly Ser Asp Val Thr Val Gly
85(2)SEQ ID NO:192的信息:
(ⅰ)序列特征:
(A)长度:119氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:192:Ala Ser Leu Leu Ala Tyr Ser Ala Ala Ala Ala Ser Thr Ala Leu Ala1 5 10 15Val Ala Cys Val Arg Ala Asp His Arg Asp Arg Arg Thr Ile Arg Asp
20 25 30His Leu Ala Met Ile His Leu Ala Gln Leu Val Thr Gln Pro Pro Gly
35 40 45Gly Val Arg Gln Arg Leu His His Leu Gly Ile Ala Val Ala Pro Gln
50 55 60Pro Gln Glu Val Val Val Leu Ala His His Leu Val Thr Gly Thr Gly65 70 75 80Glu Val Gln Gly Glu Gly Arg His Val Ala Ala Glu Val Val Asp Pro
85 90 95Glu Asn Gln Ile Leu Arg Gln Val Leu Gly Pro Ala Pro His Asp Lys
100 105 110Pro Asp Ala Gly Ile Gly Gln
115(2)SEQ ID NO:193的信息:
(ⅰ)序列特征:
(A)长度:116氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:193:Arg Ala Arg Gly His Arg Ser Ser Lys Gly Ser Arg Trp Ser His Glu1 5 10 15Val Leu Glu Gly Cys Ile Leu Ala Asp Ser Arg Gln Ser Lys Thr Ala
20 25 30Ala Ser Pro Ser Pro Ser Arg Pro Gln Ser Ser Ser Asn Asn Ser Val
35 40 45Pro Gly Ala Pro Asn Arg Val Ser Phe Ala Lys Leu Arg Glu Pro Leu
50 55 60Glu Val Pro Gly Leu Leu Asp Val Gln Thr Asp Ser Phe Glu Trp Leu65 70 75 80Ile Gly Ser Pro Arg Trp Arg Glu Ser Ala Ala Glu Arg Gly Asp Val
85 90 95Asn Pro Val Gly Gly Leu Glu Glu Val Leu Tyr Glu Leu Ser Pro lle
100 105 110Glu Asp Phe Ser
115(2)SEQ ID NO:194的信息:
(ⅰ)序列特征:
(A)长度:811碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:194:TGCTACGCAG CAATCGCTTT GGTGACAGAT GTGGATGCCG GCGTCGCTGC TGGCGATGGC 60GTGAAAGCCG CCGACGTGTT CGCCGCATTC GGGGAGAACA TCGAACTGCT CAAAAGGCTG 120GTGCGGGCCG CCATCGATCG GGTCGCCGAC GAGCGCACGT GCACGCACTG TCAACACCAC 180GCCGGTGTTC CGTTGCCGTT CGAGCTGCCA TGAGGGTGCT GCTGACCGGC GCGGCCGGCT 240TCATCGGGTC GCGCGTGGAT GCGGCGTTAC GGGCTGCGGG TCACGACGTG GTGGGCGTCG 300ACGCGCTGCT GCCCGCCGCG CACGGGCCAA ACCCGGTGCT GCCACCGGGC TGCCAGCGGG 360TCGACGTGCG CGACGCCAGC GCGCTGGCCC CGTTGTTGGC CGGTGTCGAT CTGGTGTGTC 420ACCAGGCCGC CATGGTGGGT GCCGGCGTCA ACGCCGCCGA CGCACCCGCC TATGGCGGCC 480ACAACGATTT CGCCACCACG GTGCTGCTGG CGCAGATGTT CGCCGCCGGG GTCCGCCGTT 540TGGTGCTGGC GTCGTCGATG GTGGTTTACG GGCAGGGGCG CTATGACTGT CCCCAGCATG 600GACCGGTCGA CCCGCTGCCG CGGCGGCGAG CCGACCTGGA CAATGGGGTC TTCGAGCACC 660GTTGCCCGGG GTGCGGCGAG CCAGTCATCT GGCAATTGGT CGACGAAGAT GCCCCGTTGC 720GCCCGCGCAG CCTGTACGCG GCAGCAAGAC CGCGCAGGAG CACTACGCGC TGGCGTGGTC 780GGAAACGAAT GGCGGTTCCG TGGTGGCGTT G 811(2)SEQ ID NO:195的信息:
(ⅰ)序列特征:
(A)长度:966碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅱ)序列描述:SEQ ID NO:195:GTCCCGCGAT GTGGCCGAGC ATGACTTTCG GCAACACCGG CGTAGTAGTC GAAGATATCG 60GACTTTGTGG TCCCGGTGGC GGGATAGAGC ACCTGTCGGC GTTGGTCAGC GTCACCCGTT 120GCTCGGACGC CGAACCCATG CTTTCAACGT AGCCTGTCGG TCACACAAGT CGCGAGCGTA 180ACGTCACGGT CAAATATCGC GTGGAATTTC GCCGTGACGT TCCGCTCGCG GACAATCAAG 240GCATACTCAC TTACATGCGA GCCATTTGGA CGGGTTCGAT CGCCTTCGGG CTGGTGAACG 300TGCCGGTCAA GGTGTACAGC GCTACCGCAG ACCACGACAT CAGGTTCCAC CAGGTGCACG 360CCAAGGACAA CGGACGCATC CGGTACAAGC GCGTCTGCGA GGCGTGTGGC GAGGTGGTCG 420ACTACCGCGA TCTTGCCCGG GCCTACGAGT CCGGCGACGG CCAAATGGTG GCGATCACCG 480ACGACGACAT CGCCAGCTTG CCTGAAGAAC GCAGCCGGGA GATCGAGGTG TTGGAGTTCG 540TCCCCGCCGC CGACGTGGAC CCGATGATGT TCGACCGCAG CTACTTTTTG GAGCCTGATT 600CGAAGTCGTC GAAATCGTAT GTGCTGCTGG CTAAGACACT CGCCGAGACC GACCGGATGG 660CGATCGTGGA TCGCCCCACC GGCCGTGAAT GCAGGAAAAA TAAGAGCCGC TATCCACAAT 720TCGGCGTCGA GCTCGGCTAC CACAAACGGT AGAACGATCG AGACATTCCC GAGCTGAAGT 780GCGGCGCTAT AGAAGCCGCT CTGCGCGATT ATCAAACGCA AAATACGCTT ACTCATGCCA 840TCGGCGCTGC TCACCCGATG CGACGTTTTT GCCACGCTCC ACCGCCTGCC GCGCGACCTC 900AAGTGGGCAT GCATCCCACC CGTTCCCGGA AACCGGTTCC GGCGGGTCGG CTCATCGCTT 960CATCCT 966(2)SEQ ID NO:196的信息:
(ⅰ)序列特征:
(A)长度:2367碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:196:CCGCACCGCC GGCAATACCG CCAGCGCCAC CGTTACCGCC GTTTGCGCCG TTGCCCCCGT 60TGCCGCCCGT CCCGCCGGCC CCGCCGATGG AGTTCTCATC GCCAAAAGTA CTGGCGTTGC 120CACCGGAGCC GCCGTTGCCG CCGTCACCGC CAGCCCCGCC GACTCCACCG GCCCCACCGA 180CTCCGCCGCT GCCACCGTTG CCGCCGTTGC CGATCAACAT GCCGCTGGCG CCACCCTTGC 240CACCCACGCC ACCGGCTCCG CCCACCCCGC CGACACCAAG CGAGCTGCCG CCGGAGCCAC 300CATCACCACC TACGCCACCG ACCGCCCAGA CACCAGCGAC CGGGTCTTCG TGAAACGTCG 360CGGTGCCACC ACCGCCGCCG TTACCGCCAA CCCCACCGGC AACGCCGGCG CCGCCATCCC 420CGCCGGCCCC GGCGTTGCCG CCGTTGCCGC CGTTGCCGAA CAACAACCCG CCGGCGCCGC 480CGTTGCCGCC CGCGCCGCCG GTCCCGCCGG CGCCGCCGAC GCCAAGGCCG CTGCCGCCCT 540TGCCGCCATC ACCACCCTTG CCGCCGACCA CATCGGGTTC TGCCTCGGGG TCTGGGCTGT 600CAAACCTCGC GATGCCAGCG TTGCCGCCGC TTCCCCCGGG CCCCCCCGTG GCGCCGTCAC 660CACCGATACC ACCCGCGCCA CCGGCGCCAC CGTTGCCGCC ATCACCGAAT AGCAACCCGC 720CGGCGCCACC ATTGCCGCCA GCTCCCCCTG CGCCACCGTC GGCGCCGGAG GCGGCACTGG 780CAGCCCCGTT ACCACCGAAA CCGCCGCTAC CACCGGTAGA GGTGGCAGTG GCGATGTGTA 840CGAAAGCGCC GCCTCCGGCG CCGCCGCTAC CACCCCCACT GCCGGCGGCT ACACCGTCGG 900ACCCGTTGCC ACCATCACCG CCAAAGGCGC TCGCAATGTC GCCCTGCGCG ACTCCGCCGT 960CGCCGCCGTT GCCGCCGCCG CCACCGGCAG CGGCGGTACC GCCGTCACCA CCGGCACCGC 1020CGGTGGCCTT GCCCGAGCCT GCCGTCGCGG TGGCACCGTC GCCGCCGGTG CCACCGGTCG 1080GCGTGCCGGC AGTGCCATGG CCGCCCGTGC CGCCGTCGCC GCCGGTTTGA TCACCGATGC 1140CGGACACATC TGCCGGGCTG TCCCCGGTGC TGGCCGCGGG GCCGGGCGTG GGATTGACCC 1200CGTTTGCCCC GGCGAGGCCG GCGCCGCCGG TACCACCGGC GCCGCCATGG CCGAACAGCC 1260CGGCGTTGCC GCCGTTACCG CCCGCACCCC CGATGCCTGC GGCCACGCTG GTGCCGCCGA 1320CACCGCCGTT GCCGCCGTTG CCCCACAACC ACCCCCCGTT CCCACCGGCA CCGCCGGCCG 1380CGCCGGTACC ACCGGCCCCG CCGTTGCCGC CGTTGCCGAT CAACCCGGCC GCGCCTCCGC 1440TGCCGCCGGT TTGACCGAAC CCGCCAGCCG CGCCGTTGCC ACCGTTGCCA AACAGCAACC 1500CGCCGGCCGC GCCAGGCTGC CCGGGTGCCG TCCCGTCGGC GCCGTTTCCG ATCAACGGGC 1560GCCCCAAAAG CGCCTCGGTG GGCGCATTCA CCGCACCCAG CAGACTCCGC TCAACAGCGG 1620CTTCAGTGCT GGCATACCGA CCCGCGGCCG CAGTCAACGC CTGCACAAAC TGCTCGTGAA 1680ACGCTGCCAC CTGTACGCTG AGCGCCTGAT ACTGCCGAGC ATGGGCCCCG AACAACCCCG 1740CAATCGCCGC CGACACTTCA TCGGCAGCCG CAGCCACCAC TTCCGTCGTC GGGATCGCCG 1800CGGCCGCATT AGCCGCGCTC ACCTGCGAAC CAATAGTCGA TAAATCCAAA GCCGCAGTTG 1860CCAGCAGCTG CGGCGTCGCG ATCACCAAGG ACACCTCGCA CCTCCGGATA CCCCATATCG 1920CCGCACCGTG TCCCCAGCGG CCACGTGACC TTTGGTCGCT GGCTGGCGGC CCTGACTATG 1980GCCGCGACGG CCCTCGTTCT GATTCGCCCC GGCGCGCAGC TTGTTGCGCG AGTTGAAGAC 2040GGGAGGACAG GCCGAGCTTG GTGTAGACGT GGGTCAAGTG GGAATGCACG GTCCGCGGCG 2100AGATGAATAG GCGGACGCCG ATCTCCTTGT TGCTGAGTCC CTCACCGACC AGTAGAGCCA 2160CCTCAAGCTC TGTCGGTGTC AACGCGCCCC AGCCACTTGT CGGGCGTTTC CGTGCACCGC 2220GGCCTCGTTG CGCGTACGCG ATCGCCTCAT CGATCGATAA CGCAGTTCCT TCGGCCCAGG 2280CATCGTCGAA CTCGCTGTCA CCCATGGATT TTCGAAGGGT GGCTAGCGAC GAGTTACAGC 2340CCGCCTGGTA GATCCCGAAG CGGACCG 2367(2)SEQ ID NO:197的信息:
(ⅰ)序列特征:
(A)长度:376氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:197:Gln Pro Ala Gly Ala Thr Ile Ala Ala Ser Ser Pro Cys Ala Thr Val1 5 10 15Gly Ala Gly Gly Gly Thr Gly Ser Pro Val Thr Thr Glu Thr Ala Ala
20 25 30Thr Thr Gly Arg Gly Gly Ser Gly Asp Val Tyr Glu Ser Ala Ala Ser
35 40 45Gly Ala Ala Ala Thr Thr Pro Thr Ala Gly Gly Tyr Thr Val Gly Pro
50 55 60Val Ala Thr Ile Thr Ala Lys Gly Ala Arg Asn Val Ala Leu Arg Asp65 70 75 80Ser Ala Val Ala Ala Val Ala Ala Ala Ala Thr Gly Ser Gly Gly Thr
85 90 95Ala Val Thr Thr Gly Thr Ala Gly Gly Leu Ala Arg Ala Cys Arg Arg
100 105 110Gly Gly Thr Val Ala Ala Gly Ala Thr Gly Arg Arg Ala Gly Ser Ala
115 120 125Met Ala Ala Arg Ala Ala Val Ala Ala Gly Leu Ile Thr Asp Ala Gly
130 135 140His Ile Cys Arg Ala Val Pro Gly Ala Gly Arg Gly Ala Gly Arg Gly145 150 155 160Ile Asp Pro Val Cys Pro Gly Glu Ala Gly Ala Ala Gly Thr Thr Gly
165 170 175Ala Ala Met Ala Glu Gln Pro Gly Val Ala Ala Val Thr Ala Arg Thr
180 185 190Pro Asp Ala Cys Gly His Ala Gly Ala Ala Asp Thr Ala Val Ala Ala
195 200 205Val Ala Pro Gln Pro Pro Pro Val Pro Thr Gly Thr Ala Gly Arg Ala210 215 220Gly Thr Thr Gly Pro Ala Val Ala Ala Val Ala Asp Gln Pro Gly Arg225 230 235 240Ala Ser Ala Ala Ala Gly Leu Thr Glu Pro Ala Ser Arg Ala Val Ala
245 250 255Thr Val Ala Lys Gln Gln Pro Ala Gly Arg Ala Arg Leu Pro Gly Cys
260 265 270Arg Pro Val Gly Ala Val Ser Asp Gln Arg Ala Pro Gln Lys Arg Leu
275 280 285Gly Gly Arg Ile His Arg Thr Gln Gln Thr Pro Leu Asn Ser Gly Phe
290 295 300Ser Ala Gly Ile Pro Thr Arg Gly Arg Ser Gln Arg Leu His Lys Leu305 310 315 320Leu Val Lys Arg Cys His Leu Tyr Ala Glu Arg Leu Ile Leu Pro Ser
325 330 335Met Gly Pro Glu Gln Pro Arg Asn Arg Arg Arg His Phe Ile Gly Ser
340 345 350Arg Ser His His Phe Arg Arg Arg Asp Arg Arg Gly Arg lle Ser Arg
355 360 365
Ala His Leu Arg Thr Asn Ser Arg
370 375(2)SEQ ID NO:198的信息:
(ⅰ)序列特征:
(A)长度:2852碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:198:GGCCAAAACG CCCCGGCGAT CGCGGCCACC GAGGCCGCCT ACGACCAGAT GTGGGCCCAG 60GACGTGGCGG CGATGTTTGG CTACCATGCC GGGGCTTCGG CGGCCGTCTC GGCGTTGACA 120CCGTTCGGCC AGGCGCTGCC GACCGTGGCG GGCGGCGGTG CGCTGGTCAG CGCGGCCGCG 180GCTCAGGTGA CCACGCGGGT CTTCCGCAAC CTGGGCTTGG CGAACGTCCG CGAGGGCAAC 240GTCCGCAACG GTAATGTCCG GAACTTCAAT CTCGGCTCGG CCAACATCGG CAACGGCAAC 300ATCGGCAGCG GCAACATCGG CAGCTCCAAC ATCGGGTTTG GCAACGTGGG TCCTGGGTTG 360ACCGCAGCGC TGAACAACAT CGGTTTCGGC AACACCGGCA GCAACAACAT CGGGTTTGGC 420AACACCGGCA GCAACAACAT CGGGTTCGGC AATACCGGAG ACGGCAACCG AGGTATCGGG 480CTCACGGGTA GCGGTTTGTT GGGGTTCGGC GGCCTGAACT CGGGCACCGG CAACATCGGT 540CTGTTCAACT CGGGCACCGG AAACGTCGGC ATCGGCAACT CGGGTACCGG GAACTGGGGC 600ATTGGCAACT CGGGCAACAG CTACAACACC GGTTTTGGCA ACTCCGGCGA CGCCAACACG 660GGCTTCTTCA ACTCCGGAAT AGCCAACACC GGCGTCGGCA ACGCCGGCAA CTACAACACC 720GGTAGCTACA ACCCGGGCAA CAGCAATACC GGCGGCTTCA ACATGGGCCA GTACAACACG 780GGCTACCTGA ACAGCGGCAA CTACAACACC GGCTTGGCAA ACTCCGGCAA TGTCAACACC 840GGCGCCTTCA TTACTGGCAA CTTCAACAAC GGCTTCTTGT GGCGCGGCGA CCACCAAGGC 900CTGATTTTCG GGAGCCCCGG CTTCTTCAAC TCGACCAGTG CGCCGTCGTC GGGATTCTTC 960AACAGCGGTG CCGGTAGCGC GTCCGGCTTC CTGAACTCCG GTGCCAACAA TTCTGGCTTC 1020TTCAACTCTT CGTCGGGGGC CATCGGTAAC TCCGGCCTGG CAAACGCGGG CGTGCTGGTA 1080TCGGGCGTGA TCAACTCGGG CAACACCGTA TCGGGTTTGT TCAACATGAG CCTGGTGGCC 1140ATCACAACGC CGGCCTTGAT CTCGGGCTTC TTCAACACCG GAAGCAACAT GTCGGGATTT 1200TTCGGTGGCC CACCGGTCTT CAATCTCGGC CTGGCAAACC GGGGCGTCGT GAACATTCTC 1260GGCAACGCCA ACATCGGCAA TTACAACATT CTCGGCAGCG GAAACGTCGG TGACTTCAAC 1320ATCCTTGGCA GCGGCAACCT CGGCAGCCAA AACATCTTGG GCAGCGGCAA CGTCGGCAGC 1380TTCAATATCG GCAGTGGAAA CATCGGAGTA TTCAATGTCG GTTCCGGAAG CCTGGGAAAC 1440TACAACATCG GATCCGGAAA CCTCGGGATC TACAACATCG GTTTTGGAAA CGTCGGCGAC 1500TACAACGTCG GCTTCGGGAA CGCGGGCGAC TTCAACCAAG GCTTTGCCAA CACCGGCAAC 1560AACAACATCG GGTTCGCCAA CACCGGCAAC AACAACATCG GCATCGGGCT GTCCGGCGAC 1620AACCAGCAGG GCTTCAATAT TGCTAGCGGC TGGAACTCGG GCACCGGCAA CAGCGGCCTG 1680TTCAATTCGG GCACCAATAA CGTTGGCATC TTCAACGCGG GCACCGGAAA CGTCGGCATC 1740GCAAACTCGG GCACCGGGAA CTGGGGTATC GGGAACCCGG GTACCGACAA TACCGGCATC 1800CTCAATGCTG GCAGCTACAA CACGGGCATC CTCAACGCCG GCGACTTCAA CACGGGCTTC 1860TACAACACGG GCAGCTACAA CACCGGCGGC TTCAACGTCG GTAACACCAA CACCGGCAAC 1920TTCAACGTGG GTGACACCAA TACCGGCAGC TATAACCCGG GTGACACCAA CACCGGCTTC 1980TTCAATCCCG GCAACGTCAA TACCGGCGCT TTCGACACGG GCGACTTCAA CAATGGCTTC 2040TTGGTGGCGG GCGATAACCA GGGCCAGATT GCCATCGATC TCTCGGTCAC CACTCCATTC 2100ATCCCCATAA ACGAGCAGAT GGTCATTGAC GTACACAACG TAATGACCTT CGGCGGCAAC 2160ATGATCACGG TCACCGAGGC CTCGACCGTT TTCCCCCAAA CCTTCTATCT GAGCGGTTTG 2220TTCTTCTTCG GCCCGGTCAA TCTCAGCGCA TCCACGCTGA CCGTTCCGAC GATCACCCTC 2280ACCATCGGCG GACCGACGGT GACCGTCCCC ATCAGCATTG TCGGTGCTCT GGAGAGCCGC 2340ACGATTACCT TCCTCAAGAT CGATCCGGCG CCGGGCATCG GAAATTCGAC CACCAACCCC 2400TCGTCCGGCT TCTTCAACTC GGGCACCGGT GGCACATCTG GCTTCCAAAA CGTCGGCGGC 2460GGCAGTTCAG GCGTCTGGAA CAGTGGTTTG AGCAGCGCGA TAGGGAATTC GGGTTTCCAG 2520AACCTCGGCT CGCTGCAGTC AGGCTGGGCG AACCTGGGCA ACTCCGTATC GGGCTTTTTC 2580AACACCAGTA CGGTGAACCT CTCCACGCCG GCCAATGTCT CGGGCCTGAA CAACATCGGC 2640ACCAACCTGT CCGGCGTGTT CCGCGGTCCG ACCGGGACGA TTTTCAACGC GGGCCTTGCC 2700AACCTGGGCC AGTTGAACAT CGGCAGCGCC TCGTGCCGAA TTCGGCACGA GTTAGATACG 2760GTTTCAACAA TCATATCCGC GTTTTGCGGC AGTGCATCAG ACGAATCGAA CCCGGGAAGC 2820GTAAGCGAAT AAACCGAATG GCGGCCTGTC AT 2852(2)SEQ ID NO:199的信息:
(ⅰ)序列特征:
(A)长度:943氨基酸
(B)类型:氨基酸
(C)链型:
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:199:Gly Gln Asn Ala Pro Ala Ile Ala Ala Thr Glu Ala Ala Tyr Asp Gln1 5 10 15Met Trp Ala Gln Asp Val Ala Ala Met Phe Gly Tyr His Ala Gly Ala
20 25 30Ser Ala Ala Val Ser Ala Leu Thr Pro Phe Gly Gln Ala Leu Pro Thr
35 40 45Val Ala Gly Gly Gly Ala Leu Val Ser Ala Ala Ala Ala Gln Val Thr
50 55 60Thr Arg Val Phe Arg Asn Leu Gly Leu Ala Asn Val Arg Glu Gly Asn65 70 75 80Val Arg Asn Gly Asn Val Arg Asn Phe Asn Leu Gly Ser Ala Asn Ile
85 90 95Gly Asn Gly Asn Ile Gly Ser Gly Asn Ile Gly Ser Ser Asn Ile Gly
100 105 110Phe Gly Asn Val Gly Pro Gly Leu Thr Ala Ala Leu Asn Asn Ile Gly
115 120 125Phe Gly Asn Thr Gly Ser Asn Asn Ile Gly Phe Gly Asn Thr Gly Ser
130 135 140Asn Asn Ile Gly Phe Gly Asn Thr Gly Asp Gly Asn Arg Gly Ile Gly145 150 155 160Leu Thr Gly Ser Gly Leu Leu Gly Phe Gly Gly Leu Asn Ser Gly Thr
165 170 175Gly Asn Ile Gly Leu Phe Asn Ser Gly Thr Gly Asn Val Gly Ile Gly
180 185 190Asn Ser Gly Thr Gly Asn Trp Gly Ile Gly Asn Ser Gly Asn Ser Tyr
195 200 205Asn Thr Gly Phe Gly Asn Ser Gly Asp Ala Asn Thr Gly Phe Phe Asn
210 215 220Ser Gly Ile Ala Asn Thr Gly Val Gly Asn Ala Gly Asn Tyr Asn Thr225 230 235 240Gly Ser Tyr Asn Pro Gly Asn Ser Asn Thr Gly Gly Phe Asn Met Gly
245 250 255Gln Tyr Asn Thr Gly Tyr Leu Asn Ser Gly Asn Tyr Asn Thr Gly Leu
260 265 270Ala Asn Ser Gly Asn Val Asn Thr Gly Ala Phe Ile Thr Gly Asn Phe
275 280 285Asn Asn Gly Phe Leu Trp Arg Gly Asp His Gln Gly Leu Ile Phe Gly
290 295 300Ser Pro Gly Phe Phe Asn Ser Thr Ser Ala Pro Ser Ser Gly Phe Phe305 310 315 320Asn Ser Gly Ala Gly Ser Ala Ser Gly Phe Leu Asn Ser Gly Ala Asn
325 330 335Asn Ser Gly Phe Phe Asn Ser Ser Ser Gly Ala Ile Gly Asn Ser Gly
340 345 350Leu Ala Asn Ala Gly Val Leu Val Ser Gly Val Ile Asn Ser Gly Asn
355 360 365Thr Val Ser Gly Leu Phe Asn Met Ser Leu Val Ala Ile Thr Thr Pro
370 375 380Ala Leu Ile Ser Gly Phe Phe Asn Thr Gly Ser Asn Met Ser Gly Phe385 390 395 400Phe Gly Gly Pro Pro Val Phe Asn Leu Gly Leu Ala Asn Arg Gly Val
405 410 415Val Asn Ile Leu Gly Asn Ala Asn Ile Gly Asn Tyr Asn Ile Leu Gly
420 425 430Ser Gly Asn Val Gly Asp Phe Asn Ile Leu Gly Ser Gly Asn Leu Gly
435 440 445Ser Gln Asn Ile Leu Gly Ser Gly Asn Val Gly Ser Phe Asn Ile Gly
450 455 460Ser Gly Asn Ile Gly Val Phe Asn Val Gly Ser Gly Ser Leu Gly Asn465 470 475 480Tyr Asn Ile Gly Ser Gly Asn Leu Gly Ile Tyr Asn Ile Gly Phe Gly
485 490 495Asn Val Gly Asp Tyr Asn Val Gly Phe Gly Asn Ala Gly Asp Phe Asn
500 505 510Gln Gly Phe Ala Asn Thr Gly Asn Asn Asn Ile Gly Phe Ala Asn Thr
515 520 525Gly Asn Asn Asn Ile Gly Ile Gly Leu Ser Gly Asp Asn Gln Gln Gly
530 535 540Phe Ash Ile Ala Ser Gly Trp Asn Ser Gly Thr Gly Asn Ser Gly Leu545 550 555 560Phe Asn Ser Gly Thr Asn Asn Val Gly Ile Phe Asn Ala Gly Thr Gly
565 570 575Asn Val Gly Ile Ala Asn Ser Gly Thr Gly Asn Trp Gly Ile Gly Asn
580 585 590Pro Gly Thr Asp Asn Thr Gly Ile Leu Asn Ala Gly Ser Tyr Asn Thr
595 600 605Gly Ile Leu Asn Ala Gly Asp Phe Asn Thr Gly Phe Tyr Asn Thr Gly
610 615 620Ser Tyr Asn Thr Gly Gly Phe Asn Val Gly Asn Thr Asn Thr Gly Asn625 630 635 640Phe Asn Val Gly Asp Thr Asn Thr Gly Ser Tyr Asn Pro Gly Asp Thr
645 650 655Asn Thr Gly Phe Phe Asn Pro Gly Asn Val Asn Thr Gly Ala Phe Asp
660 665 670Thr Gly Asp Phe Asn Asn Gly Phe Leu Val Ala Gly Asp Asn Gln Gly
675 680 685Gln Ile Ala Ile Asp Leu Ser Val Thr Thr Pro Phe Ile Pro Ile Asn
690 695 700Glu Gln Met Val Ile Asp Val His Asn Val Met Thr Phe Gly Gly Asn705 710 715 720Met Ile Thr Val Thr Glu Ala Ser Thr Val Phe Pro Gln Thr Phe Tyr
725 730 735Leu Ser Gly Leu Phe Phe Phe Gly Pro Val Asn Leu Ser Ala Ser Thr
740 745 750Leu Thr Val Pro Thr Ile Thr Leu Thr Ile Gly Gly Pro Thr Val Thr
755 760 765Val Pro Ile Ser Ile Val Gly Ala Leu Glu Ser Arg Thr Ile Thr Phe
770 775 780Leu Lys Ile Asp Pro Ala Pro Gly Ile Gly Asn Ser Thr Thr Asn Pro785 790 795 800Ser Ser Gly Phe Phe Asn Ser Gly Thr Gly Gly Thr Ser Gly Phe Gln
805 810 815Asn Val Gly Gly Gly Ser Ser Gly Val Trp Asn Ser Gly Leu Ser Ser
820 825 830Ala Ile Gly Asn Ser Gly Phe Gln Asn Leu Gly Ser Leu Gln Ser Gly
835 840 845Trp Ala Asn Leu Gly Asn Ser Val Ser Gly Phe Phe Asn Thr Ser Thr
850 855 860Val Asn Leu Ser Thr Pro Ala Asn Val Ser Gly Leu Asn Asn Ile Gly865 870 875 880Thr Asn Leu Ser Gly Val Phe Arg Gly Pro Thr Gly Thr Ile Phe Asn
885 890 895Ala Gly Leu Ala Asn Leu Gly Gln Leu Asn Ile Gly Ser Ala Ser Cys
900 905 910Arg Ile Arg His Glu Leu Asp Thr Val Ser Thr Ile Ile Ser Ala Phe
915 920 925Cys Gly Ser Ala Ser Asp Glu Ser Asn Pro Gly Ser Val Ser Glu
930 935 940(2)SEQ ID NO:200的信息:
(ⅰ)序列特征:
(A)长度:53碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:200:GGATCCATAT GGGCCATCAT CATCATCATC ACGTGATCGA CATCATCGGG ACC 53(2)SEQ ID NO:201的信息:
(ⅰ)序列特征:
(A)长度:42碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:201:CCTGAATTCA GGCCTCGGTT GCGCCGGCCT CATCTTGAAC GA 42(2)SEQ ID NO:202的信息:
(ⅰ)序列特征:
(A)长度:31碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:202:GGATCCTGCA GGCTCGAAAC CACCGAGCGG T 31(2)SEQ ID NO:203的信息:
(ⅰ)序列特征:
(A)长度:31碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:203:CTCTGAATTC AGCGCTGGAA ATCGTCGCGA T 31(2)SEQ ID NO:204的信息:
(ⅰ)序列特征:
(A)长度:33碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:204:GGATCCAGCG CTGAGATGAA GACCGATGCC GCT 33(2)SEQ ID NO:205的信息:
(ⅰ)序列特征:
(A)长度:38碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:205:GGATATCTGC AGAATTCAGG TTTAAAGCCC ATTTGCGA 38(2)SEQ ID NO:206的信息:
(ⅰ)序列特征:
(A)长度:30碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:206:CCGCATGCGA GCCACGTGCC CACAACGGCC 30(2)SEQ ID NO:207的信息:
(ⅰ)序列特征:
(A)长度:37碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:207:CTTCATGGAA TTCTCAGGCC GGTAAGGTCC GCTGCGG 37(2)SEQ ID NO:208的信息:
(ⅰ)序列特征:
(A)长度:7676碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅹⅰ)序列描述:SEQ ID NO:208:TGGCGAATGG GACGCGCCCT GTAGCGGCGC ATTAAGCGCG GCGGGTGTGG TGGTTACGCG 60CAGCGTGACC GCTACACTTG CCAGCGCCCT AGCGCCCGCT CCTTTCGCTT TCTTCCCTTC 120CTTTCTCGCC ACGTTCGCCG GCTTTCCCCG TCAAGCTCTA AATCGGGGGC TCCCTTTAGG 180GTTCCGATTT AGTGCTTTAC GGCACCTCGA CCCCAAAAAA CTTGATTAGG GTGATGGTTC 240ACGTAGTGGG CCATCGCCCT GATAGACGGT TTTTCGCCCT TTGACGTTGG AGTCCACGTT 300CTTTAATAGT GGACTCTTGT TCCAAACTGG AACAACACTC AACCCTATCT CGGTCTATTC 360TTTTGATTTA TAAGGGATTT TGCCGATTTC GGCCTATTGG TTAAAAAATG AGCTGATTTA 420ACAAAAATTT AACGCGAATT TTAACAAAAT ATTAACGTTT ACAATTTCAG GTGGCACTTT 480TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA 540TCCGCTCATG AATTAATTCT TAGAAAAACT CATCGAGCAT CAAATGAAAC TGCAATTTAT 600TCATATCAGG ATTATCAATA CCATATTTTT GAAAAAGCCG TTTCTGTAAT GAAGGAGAAA 660ACTCACCGAG GCAGTTCCAT AGGATGGCAA GATCCTGGTA TCGGTCTGCG ATTCCGACTC 720GTCCAACATC AATACAACCT ATTAATTTCC CCTCGTCAAA AATAAGGTTA TCAAGTGAGA 780AATCACCATG AGTGACGACT GAATCCGGTG AGAATGGCAA AAGTTTATGC ATTTCTTTCC 840AGACTTGTTC AACAGGCCAG CCATTACGCT CGTCATCAAA ATCACTCGCA TCAACCAAAC 900CGTTATTCAT TCGTGATTGC GCCTGAGCGA GACGAAATAC GCGATCGCTG TTAAAAGGAC 960AATTACAAAC AGGAATCGAA TGCAACCGGC GCAGGAACAC TGCCAGCGCA TCAACAATAT 1020TTTCACCTGA ATCAGGATAT TCTTCTAATA CCTGGAATGC TGTTTTCCCG GGGATCGCAG 1080TGGTGAGTAA CCATGCATCA TCAGGAGTAC GGATAAAATG CTTGATGGTC GGAAGAGGCA 1140TAAATTCCGT CAGCCAGTTT AGTCTGACCA TCTCATCTGT AACATCATTG GCAACGCTAC 1200CTTTGCCATG TTTCAGAAAC AACTCTGGCG CATCGGGCTT CCCATACAAT CGATAGATTG 1260TCGCACCTGA TTGCCCGACA TTATCGCGAG CCCATTTATA CCCATATAAA TCAGCATCCA 1320TGTTGGAATT TAATCGCGGC CTAGAGCAAG ACGTTTCCCG TTGAATATGG CTCATAACAC 1380CCCTTGTATT ACTGTTTATG TAAGCAGACA GTTTTATTGT TCATGACCAA AATCCCTTAA 1440CGTGAGTTTT CGTTCCACTG AGCGTCAGAC CCCGTAGAAA AGATCAAAGG ATCTTCTTGA 1500GATCCTTTTT TTCTGCGCGT AATCTGCTGC TTGCAAACAA AAAAACCACC GCTACCAGCG 1560GTGGTTTGTT TGCCGGATCA AGAGCTACCA ACTCTTTTTC CGAAGGTAAC TGGCTTCAGC 1620AGAGCGCAGA TACCAAATAC TGTCCTTCTA GTGTAGCCGT AGTTAGGCCA CCACTTCAAG 1680AACTCTGTAG CACCGCCTAC ATACCTCGCT CTGCTAATCC TGTTACCAGT GGCTGCTGCC 1740AGTGGCGATA AGTCGTGTCT TACCGGGTTG GACTCAAGAC GATAGTTACC GGATAAGGCG 1800CAGCGGTCGG GCTGAACGGG GGGTTCGTGC ACACAGCCCA GCTTGGAGCG AACGACCTAC 1860ACCGAACTGA GATACCTACA GCGTGAGCTA TGAGAAAGCG CCACGCTTCC CGAAGGGAGA 1920AAGGCGGACA GGTATCCGGT AAGCGGCAGG GTCGGAACAG GAGAGCGCAC GAGGGAGCTT 1980CCAGGGGGAA ACGCCTGGTA TCTTTATAGT CCTGTCGGGT TTCGCCACCT CTGACTTGAG 2040CGTCGATTTT TGTGATGCTC GTCAGGGGGG CGGAGCCTAT GGAAAAACGC CAGCAACGCG 2100GCCTTTTTAC GGTTCCTGGC CTTTTGCTGG CCTTTTGCTC ACATGTTCTT TCCTGCGTTA 2160TCCCCTGATT CTGTGGATAA CCGTATTACC GCCTTTGAGT GAGCTGATAC CGCTCGCCGC 2220AGCCGAACGA CCGAGCGCAG CGAGTCAGTG AGCGAGGAAG CGGAAGAGCG CCTGATGCGG 2280TATTTTCTCC TTACGCATCT GTGCGGTATT TCACACCGCA TATATGGTGC ACTCTCAGTA 2340CAATCTGCTC TGATGCCGCA TAGTTAAGCC AGTATACACT CCGCTATCGC TACGTGACTG 2400GGTCATGGCT GCGCCCCGAC ACCCGCCAAC ACCCGCTGAC GCGCCCTGAC GGGCTTGTCT 2460GCTCCCGGCA TCCGCTTACA GACAAGCTGT GACCGTCTCC GGGAGCTGCA TGTGTCAGAG 2520GTTTTCACCG TCATCACCGA AACGCGCGAG GCAGCTGCGG TAAAGCTCAT CAGCGTGGTC 2580GTGAAGCGAT TCACAGATGT CTGCCTGTTC ATCCGCGTCC AGCTCGTTGA GTTTCTCCAG 2640AAGCGTTAAT GTCTGGCTTC TGATAAAGCG GGCCATGTTA AGGGCGGTTT TTTCCTGTTT 2700GGTCACTGAT GCCTCCGTGT AAGGGGGATT TCTGTTCATG GGGGTAATGA TACCGATGAA 2760ACGAGAGAGG ATGCTCACGA TACGGGTTAC TGATGATGAA CATGCCCGGT TACTGGAACG 2820TTGTGAGGGT AAACAACTGG CGGTATGGAT GCGGCGGGAC CAGAGAAAAA TCACTCAGGG 2880TCAATGCCAG CGCTTCGTTA ATACAGATGT AGGTGTTCCA CAGGGTAGCC AGCAGCATCC 2940TGCGATGCAG ATCCGGAACA TAATGGTGCA GGGCGCTGAC TTCCGCGTTT CCAGACTTTA 3000CGAAACACGG AAACCGAAGA CCATTCATGT TGTTGCTCAG GTCGCAGACG TTTTGCAGCA 3060GCAGTCGCTT CACGTTCGCT CGCGTATCGG TGATTCATTC TGCTAACCAG TAAGGCAACC 3120CCGCCAGCCT AGCCGGGTCC TCAACGACAG GAGCACGATC ATGCGCACCC GTGGGGCCGC 3180CATGCCGGCG ATAATGGCCT GCTTCTCGCC GAAACGTTTG GTGGCGGGAC CAGTGACGAA 3240GGCTTGAGCG AGGGCGTGCA AGATTCCGAA TACCGCAAGC GACAGGCCGA TCATCGTCGC 3300GCTCCAGCGA AAGCGGTCCT CGCCGAAAAT GACCCAGAGC GCTGCCGGCA CCTGTCCTAC 3360GAGTTGCATG ATAAAGAAGA CAGTCATAAG TGCGGCGACG ATAGTCATGC CCCGCGCCCA 3420CCGGAAGGAG CTGACTGGGT TGAAGGCTCT CAAGGGCATC GGTCGAGATC CCGGTGCCTA 3480ATGAGTGAGC TAACTTACAT TAATTGCGTT GCGCTCACTG CCCGCTTTCC AGTCGGGAAA 3540CCTGTCGTGC CAGCTGCATT AATGAATCGG CCAACGCGCG GGGAGAGGCG GTTTGCGTAT 3600TGGGCGCCAG GGTGGTTTTT CTTTTCACCA GTGAGACGGG CAACAGCTGA TTGCCCTTCA 3660CCGCCTGGCC CTGAGAGAGT TGCAGCAAGC GGTCCACGCT GGTTTGCCCC AGCAGGCGAA 3720AATCCTGTTT GATGGTGGTT AACGGCGGGA TATAACATGA GCTGTCTTCG GTATCGTCGT 3780ATCCCACTAC CGAGATATCC GCACCAACGC GCAGCCCGGA CTCGGTAATG GCGCGCATTG 3840CGCCCAGCGC CATCTGATCG TTGGCAACCA GCATCGCAGT GGGAACGATG CCCTCATTCA 3900GCATTTGCAT GGTTTGTTGA AAACCGGACA TGGCACTCCA GTCGCCTTCC CGTTCCGCTA 3960TCGGCTGAAT TTGATTGCGA GTGAGATATT TATGCCAGCC AGCCAGACGC AGACGCGCCG 4020AGACAGAACT TAATGGGCCC GCTAACAGCG CGATTTGCTG GTGACCCAAT GCGACCAGAT 4080GCTCCACGCC CAGTCGCGTA CCGTCTTCAT GGGAGAAAAT AATACTGTTG ATGGGTGTCT 4140GGTCAGAGAC ATCAAGAAAT AACGCCGGAA CATTAGTGCA GGCAGCTTCC ACAGCAATGG 4200CATCCTGGTC ATCCAGCGGA TAGTTAATGA TCAGCCCACT GACGCGTTGC GCGAGAAGAT 4260TGTGCACCGC CGCTTTACAG GCTTCGACGC CGCTTCGTTC TACCATCGAC ACCACCACGC 4320TGGCACCCAG TTGATCGGCG CGAGATTTAA TCGCCGCGAC AATTTGCGAC GGCGCGTGCA 4380GGGCCAGACT GGAGGTGGCA ACGCCAATCA GCAACGACTG TTTGCCCGCC AGTTGTTGTG 4440CCACGCGGTT GGGAATGTAA TTCAGCTCCG CCATCGCCGC TTCCACTTTT TCCCGCGTTT 4500TCGCAGAAAC GTGGCTGGCC TGGTTCACCA CGCGGGAAAC GGTCTGATAA GAGACACCGG 4560CATACTCTGC GACATCGTAT AACGTTACTG GTTTCACATT CACCACCCTG AATTGACTCT 4620CTTCCGGGCG CTATCATGCC ATACCGCGAA AGGTTTTGCG CCATTCGATG GTGTCCGGGA 4680TCTCGACGCT CTCCCTTATG CGACTCCTGC ATTAGGAAGC AGCCCAGTAG TAGGTTGAGG 4740CCGTTGAGCA CCGCCGCCGC AAGGAATGGT GCATGCAAGG AGATGGCGCC CAACAGTCCC 4800CCGGCCACGG GGCCTGCCAC CATACCCACG CCGAAACAAG CGCTCATGAG CCCGAAGTGG 4860CGAGCCCGAT CTTCCCCATC GGTGATGTCG GCGATATAGG CGCCAGCAAC CGCACCTGTG 4920GCGCCGGTGA TGCCGGCCAC GATGCGTCCG GCGTAGAGGA TCGAGATCTC GATCCCGCGA 4980AATTAATACG ACTCACTATA GGGGAATTGT GAGCGGATAA CAATTCCCCT CTAGAAATAA 5040TTTTGTTTAA CTTTAAGAAG GAGATATACA TATGGGCCAT CATCATCATC ATCACGTGAT 5100CGACATCATC GGGACCAGCC CCACATCCTG GGAACAGGCG GCGGCGGAGG CGGTCCAGCG 5160GGCGCGGGAT AGCGTCGATG ACATCCGCGT CGCTCGGGTC ATTGAGCAGG ACATGGCCGT 5220GGACAGCGCC GGCAAGATCA CCTACCGCAT CAAGCTCGAA GTGTCGTTCA AGATGAGGCC 5280GGCGCAACCG AGGGGCTCGA AACCACCGAG CGGTTCGCCT GAAACGGGCG CCGGCGCCGG 5340TACTGTCGCG ACTACCCCCG CGTCGTCGCC GGTGACGTTG GCGGAGACCG GTAGCACGCT 5400GCTCTACCCG CTGTTCAACC TGTGGGGTCC GGCCTTTCAC GAGAGGTATC CGAACGTCAC 5460GATCACCGCT CAGGGCACCG GTTCTGGTGC CGGGATCGCG CAGGCCGCCG CCGGGACGGT 5520CAACATTGGG GCCTCCGACG CCTATCTGTC GGAAGGTGAT ATGGCCGCGC ACAAGGGGCT 5580GATGAACATC GCGCTAGCCA TCTCCGCTCA GCAGGTCAAC TACAACCTGC CCGGAGTGAG 5640CGAGCACCTC AAGCTGAACG GAAAAGTCCT GGCGGCCATG TACCAGGGCA CCATCAAAAC 5700CTGGGACGAC CCGCAGATCG CTGCGCTCAA CCCCGGCGTG AACCTGCCCG GCACCGCGGT 5760AGTTCCGCTG CACCGCTCCG ACGGGTCCGG TGACACCTTC TTGTTCACCC AGTACCTGTC 5820CAAGCAAGAT CCCGAGGGCT GGGGCAAGTC GCCCGGCTTC GGCACCACCG TCGACTTCCC 8880GGCGGTGCCG GGTGCGCTGG GTGAGAACGG CAACGGCGGC ATGGTGACCG GTTGCGCCGA 5940GACACCGGGC TGCGTGGCCT ATATCGGCAT CAGCTTCCTC GACCAGGCCA GTCAACGGGG 6000ACTCGGCGAG GCCCAACTAG GCAATAGCTC TGGCAATTTC TTGTTGCCCG ACGCGCAAAG 6060CATTCAGGCC GCGGCGGCTG GCTTCGCATC GAAAACCCCG GCGAACCAGG CGATTTCGAT 6120GATCGACGGG CCCGCCCCGG ACGGCTACCC GATCATCAAC TACGAGTACG CCATCGTCAA 6180CAACCGGCAA AAGGACGCCG CCACCGCGCA GACCTTGCAG GCATTTCTGC ACTGGGCGAT 6240CACCGACGGC AACAAGGCCT CGTTCCTCGA CCAGGTTCAT TTCCAGCCGC TGCCGCCCGC 6300GGTGGTGAAG TTGTCTGACG CGTTGATCGC GACGATTTCC AGCGCTGAGA TGAAGACCGA 6360TGCCGCTACC CTCGCGCAGG AGGCAGGTAA TTTCGAGCGG ATCTCCGGCG ACCTGAAAAC 6420CCAGATCGAC CAGGTGGAGT CGACGGCAGG TTCGTTGCAG GGCCAGTGGC GCGGCGCGGC 6480GGGGACGGCC GCCCAGGCCG CGGTGGTGCG CTTCCAAGAA GCAGCCAATA AGCAGAAGCA 6540GGAACTCGAC GAGATCTCGA CGAATATTCG TCAGGCCGGC GTCCAATACT CGAGGGCCGA 6600CGAGGAGCAG CAGCAGGCGC TGTCCTCGCA AATGGGCTTT GTGCCCACAA CGGCCGCCTC 6660GCCGCCGTCG ACCGCTGCAG CGCCACCCGC ACCGGCGACA CCTGTTGCCC CCCCACCACC 6720GGCCGCCGCC AACACGCCGA ATGCCCAGCC GGGCGATCCC AACGCAGCAC CTCCGCCGGC 6780CGACCCGAAC GCACCGCCGC CACCTGTCAT TGCCCCAAAC GCACCCCAAC CTGTCCGGAT 6840CGACAACCCG GTTGGAGGAT TCAGCTTCGC GCTGCCTGCT GGCTGGGTGG AGTCTGACGC 6900CGCCCACTTC GACTACGGTT CAGCACTCCT CAGCAAAACC ACCGGGGACC CGCCATTTCC 6960CGGACAGCCG CCGCCGGTGG CCAATGACAC CCGTATCGTG CTCGGCCGGC TAGACCAAAA 7020GCTTTACGCC AGCGCCGAAG CCACCGACTC CAAGGCCGCG GCCCGGTTGG GCTCGGACAT 7080GGGTGAGTTC TATATGCCCT ACCCGGGCAC CCGGATCAAC CAGGAAACCG TCTCGCTTGA 7140CGCCAACGGG GTGTCTGGAA GCGCGTCGTA TTACGAAGTC AAGTTCAGCG ATCCGAGTAA 7200GCCGAACGGC CAGATCTGGA CGGGCGTAAT CGGCTCGCCC GCGGCGAACG CACCGGACGC 7260CGGGCCCCCT CAGCGCTGGT TTGTGGTATG GCTCGGGACC GCCAACAACC CGGTGGACAA 7320GGGCGCGGCC AAGGCGCTGG CCGAATCGAT CCGGCCTTTG GTCGCCCCGC CGCCGGCGCC 7380GGCACCGGCT CCTGCAGAGC CCGCTCCGGC GCCGGCGCCG GCCGGGGAAG TCGCTCCTAC 7440CCCGACGACA CCGACACCGC AGCGGACCTT ACCGGCCTGA GAATTCTGCA GATATCCATC 7500ACACTGGCGG CCGCTCGAGC ACCACCACCA CCACCACTGA GATCCGGCTG CTAACAAAGC 7560CCGAAAGGAA GCTGAGTTGG CTGCTGCCAC CGCTGAGCAA TAACTAGCAT AACCCCTTGG 7620GGCCTCTAAA CGGGTCTTGA GGGGTTTTTT GCTGAAAGGA GGAACTATAT CCGGAT 7676(2)SEQ ID NO:209的信息:
(ⅰ)序列特征:
(A)长度:802氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性(ⅹⅰ)序列描述:SEQ ID NO:209:Met Gly His His His His His His Val Ile Asp Ile Ile Gly Thr Ser1 5 10 15Pro Thr Ser Trp Glu Gln Ala Ala Ala Glu Ala Val Gln Arg Ala Arg
20 25 30Asp Ser Val Asp Asp Ile Arg Val Ala Arg Val Ile Glu Gln Asp Met
35 40 45Ala Val Asp Ser Ala Gly Lys Ile Thr Tyr Arg Ile Lys Leu Glu Val
50 55 60Ser Phe Lys Met Arg Pro Ala Gin Pro Arg Gly Ser Lys Pro Pro Ser65 70 75 80Gly Ser Pro Glu Thr Gly Ala Gly Ala Gly Thr Val Ala Thr Thr Pro
85 90 95Ala Ser Ser Pro Val Thr Leu Ala Glu Thr Gly Ser Thr Leu Leu Tyr
100 105 110Pro Leu Phe Asn Leu Trp Gly Pro Ala Phe His Glu Arg Tyr Pro Asn
115 120 125Val Thr Ile Thr Ala Gln Gly Thr Gly Ser Gly Ala Gly Ile Ala Gln
130 135 140Ala Ala Ala Gly Thr Val Asn Ile Gly Ala Ser Asp Ala Tyr Leu Ser145 150 155 160Glu Gly Asp Met Ala Ala His Lys Gly Leu Met Asn Ile Ala Leu Ala
165 170 175Ile Ser Ala Gln Gln Val Asn Tyr Asn Leu Pro Gly Val Ser Glu His
180 185 190Leu Lys Leu Ash Gly Lys Val Leu Ala Ala Met Tyr Gln Gly Thr Ile
195 200 205Lys Thr Trp Asp Asp Pro Gln Ile Ala Ala Leu Asn Pro Gly Val Asn
210 215 220Leu Pro Gly Thr Ala Val Val Pro Leu His Arg Ser Asp Gly Ser Gly225 230 235 240Asp Thr Phe Leu Phe Thr Gln Tyr Leu Ser Lys Gln Asp Pro Glu Gly
245 250 255Trp Gly Lys Ser Pro Gly Phe Gly Thr Thr Val Asp Phe Pro Ala Val
260 265 270Pro Gly Ala Leu Gly Glu Asn Gly Asn Gly Gly Met Val Thr Gly Cys
275 280 285Ala Glu Thr Pro Gly Cys Val Ala Tyr Ile Gly Ile Ser Phe Leu Asp
290 295 300Gln Ala Ser Gln Arg Gly Leu Gly Glu Ala Gln Leu Gly Asn Ser Ser305 310 315 320Gly Asn Phe Leu Leu Pro Asp Ala Gln Ser Ile Gln Ala Ala Ala Ala
325 330 335Gly Phe Ala Ser Lys Thr Pro Ala Asn G1n Ala Ile Ser Met Ile Asp
340 345 350Gly Pro Ala Pro Asp Gly Tyr Pro Ile Ile Asn Tyr Glu Tyr Ala Ile
355 360 365Val Asn Asn Arg Gln Lys Asp Ala Ala Thr Ala Gln Thr Leu Gln Ala
370 375 380Phe Leu His Trp Ala Ile Thr Asp Gly Asn Lys Ala Ser Phe Leu Asp385 390 395 400Gln Val His Phe Gln Pro Leu Pro Pro Ala Val Val Lys Leu Ser Asp
405 410 415Ala Leu Ile Ala Thr Ile Ser Ser Ala Glu Met Lys Thr Asp Ala Ala
420 425 430Thr Leu Ala Gln Glu Ala Gly Asn Phe Glu Arg Ile Ser Gly Asp Leu
435 440 445Lys Thr Gln Ile Asp Gln Val Glu Ser Thr Ala Gly Ser Leu Gln Gly
450 455 460Gln Trp Arg Gly Ala Ala Gly Thr Ala Ala Gln Ala Ala Val Val Arg465 470 475 480Phe Gln Glu Ala Ala Asn Lys Gln Lys Gln Glu Leu Asp Glu Ile Ser
485 490 495Thr Asn Ile Arg Gln Ala Gly Val Gln Tyr Ser Arg Ala Asp Glu Glu
500 505 510Gln Gln Gln Ala Leu Ser Ser Gln Met Gly Phe Val Pro Thr Thr Ala
515 520 525Ala Ser Pro Pro Ser Thr Ala Ala Ala Pro Pro Ala Pro Ala Thr Pro
530 535 540Val Ala Pro Pro Pro Pro Ala Ala Ala Asn Thr Pro Asn Ala Gln Pro545 550 555 560Gly Asp Pro Asn Ala Ala Pro Pro Pro Ala Asp Pro Ash Ala Pro Pro
565 570 575Pro Pro Val Ile Ala Pro Ash Ala Pro Gln Pro Val Arg Ile Asp Asn
580 585 590
Pro Val Gly Gly Phe Ser Phe Ala Leu Pro Ala Gly Trp Val Glu Ser
595 600 605
Asp Ala Ala His Phe Asp Tyr Gly Ser Ala Leu Leu Ser Lys Thr Thr
610 615 620
Gly Asp Pro Pro Phe Pro Gly Gln Pro Pro Pro Val Ala Asn Asp Thr
625 630 635 640
Arg Ile Val Leu Gly Arg Leu Asp Gln Lys Leu Tyr Ala Ser Ala Glu
645 650 655
Ala Thr Asp Ser Lys Ala Ala Ala Arg Leu Gly Ser Asp Met Gly Glu
660 665 670
Phe Tyr Met Pro Tyr Pro Gly Thr Arg Ile Asn Gln Glu Thr Val Ser
675 680 685
Leu Asp Ala Asn Gly Val Ser Gly Ser Ala Ser Tyr Tyr Glu Val Lys
690 695 700
Phe Ser Asp Pro Ser Lys Pro Asn Gly Gln Ile Trp Thr Gly Val Ile
705 710 715 720
Gly Ser Pro Ala Ala Asn Ala Pro Asp Ala Gly Pro Pro Gln Arg Trp
725 730 735
Phe Val Val Trp Leu Gly Thr Ala Asn Asn Pro Val Asp Lys Gly Ala
740 745 750
Ala Lys Ala Leu Ala Glu Ser Ile Arg Pro Leu Val Ala Pro Pro Pro
755 760 765
Ala Pro Ala Pro Ala Pro Ala Glu Pro Ala Pro Ala Pro Ala Pro Ala
770 775 780
Gly Glu Val Ala Pro Thr Pro Thr Thr Pro Thr Pro Gln Arg Thr Leu
785 790 795 800
Pro Ala(2)SEQ ID NO:210的信息:
(ⅰ)序列特征:
(A)长度:454碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:210:GTGGCGGCGC TGCGGCCGGC CAGCAGAGCG ATGTGCATCC GTTCGCGAAC CTGATCGCGG 60TCGACGATGA GCGCGCCGAA CGCCGCGACG ACGAAGAACG TCAGGAAGCC GTCCAGCAGC 120GCGGTCCGCG CGGTGACGAA GCTGACCCCG TCGCAGATCA GCAGCACCCC GGCGATGGCG 180CCGACCAATG TCGACCGGCT GATCCGCCGC ACGATCCGCA CCACCAGCGC CACCAGGACC 240ACACCCAGCA GGGCGCCGGT GAACCGCCAG CCGAATCCGT TGTGACCGAA GATGGCCTCC 300CCGATCGCGA TCAGCTGCTT ACCGACCGGC GGGTGAACCA CCAGGCCGTA CCCGGGGTTG 360TCTTCCACCC CATGGTTGTT CAGCACCTGC CAGGCCTGGC GGTGCGTAAT GCTTCTCGTC 420GAAGATGGGG GTGCCGGCAT CCGTCACCGA GCCC 454(2)SEQ ID NO:211的信息:
(ⅰ)序列特征:
(A)长度:470碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:211:TGCAGAAGTA CGGCGGATCC TCGGTGGCCG ACGCCGAACG GATTCGCCGC GTCGCCGAAC 60GCATCGTCGC CACCAAGAAG CAAGGCAATG ACGTCGTCGT CGTCGTCTCT GCCATGGGGG 120ATACCACCGA CGACCTGCTG GATCTGGCTC AGCAGGTGTG CCCGGCGCCG CCGCCTCGGG 180AGCTGGACAT GCTGCTTACC GCCGGTGAAC GCATCTCGAA TGCGTTGGTG GCCATGGCCA 240TCGAGTCGCT CGGCGCGCAT GCCCGGTCGT TCACCGGTTC GCAGGCCGGG GTGATCACCA 300CCGGCACCCA CGGCAACGCC AAGATCATCG ACGTCACGCC GGGGCGGCTG CAAACCGCCC 360TTGAGGAAGG GCGGGTCGTC TTGGTGGCCG GATTCCAAGG GGTCAGCCAG GACACCAAGG 420ATGTCACGAC GTTGGGCCGC GGCGGCTCGG ACACCACCGC CGTCGCCATG 470(2)SEQ ID NO:212的信息:
(ⅰ)序列特征:
(A)长度:279碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:212:GGCCGGCGTA CCCGGCCGGG ACAAACAACG ATCGATTGAT ATCGATGAGA GACGGAGGAA 60TCGTGGCCCT TCCCCAGTTG ACCGACGAGC AGCGCGCGGC CGCGTTGGAG AAGGCTGCTG 120CCGCACGTCG AGCGCGAGCA GAGCTCAAGG ATCGGCTCAA GCGTGGCGGC ACCAACCTCA 180CCCAGGTCCT CAAGGACGCG GAGAGCGATG AAGTCTTGGG CAAAATGAAG GTGTCTGCGC 240TGCTTGAGGC CTTGCCAAAG GTGGGCAAGG TCCAGGCGC 279
(2)SEQ ID NO:213的信息:
(ⅰ)序列特征:
(A)长度:219碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:213:ACACGGTCGA ACTCGACGAG CCCCTCGTGG AGGTGTCGAC CGACAAGGTC GACACCGAAA 60TCCCTCGCCG GCCGCGGGTG TGCTGACCAA GATCATCGCC CAAGAAGATG ACACGGTCGA 120GGTCGGCGGC GAGCTCTCTG TCATTGGCGA CGCCCATGAT GCCGGCGAGG CCGCGGTCCC 180GGCACCCCAG AAAGTCTCTG CCGGCCCAAC CCGAATCCA 219
(2)SEQ ID NO:214的信息:
(ⅰ)序列特征:
(A)长度:342碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:214:TCGCTGCCGA CATCGGCGCC GCGCCCGCCC CCAAGCCCGC ACCCAAGCCC GTCCCCGAGC 60CAGCGCCGAC GCCGAAGGCC GAACCCGCAC CATCGCCGCC GGCGGCCCAG CCAGCCGGTG 120CGGCCGAGGG CGCACCGTAC GTGACGCCGC TGGTGCGAAA GCTGGCGTCG GAAAACAACA 180TCGACCTCGC CGGGGTGACC GGCACCGGAG TGGGTGGTCG CATCCGCAAA CAGGATGTGC 240TGGCCGCGGC TGAACAAAAG AAGCGGGCGA AAGCACCGGC GCCGGCCGCC CAGGCCGCCG 300CCGCGCCGGC CCCGAAAGCG CCGCCTGAAG ATCCGATGCC GC 342
(2)SEQ ID NO:215的信息:
(ⅰ)序列特征:
(A)长度:515碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:215:GGGTCTTGGT CAGTATCAGC GCCGACGAGG ACGCCACGGT GCCCGTCGGC GGCGAGTTGG 60CCCGGATCGG TGTCGCTGCC GACATCGGCG CCGCGCCCGC CCCCAAGCCC GCACCCAAGC 120CCGTCCCCGA GCCAGCGCCG ACGCCGAAGG CCGAACCCGC ACCATCGCCG CCGGCGGCCC 180AGCCAGCCGG TGCGGCCGAG GGCGCACCGT ACGTGACGCC GCTGGTGCGA AAGCTGGCGT 240CGGAAAACAA CATCGACCTC GCCGGGGTGA CCGGCACCGG AGTGGGTGGT CGCATCCGCA 300AACAGGATGT GCTGGCCGCG GCTGAACAAA AGAAGCGGGC GAAAGCACCG GCGCCCTGAG 360CGCTTCATCA CCCGGTTAAC CAGCTTGCCC CAGAAGCCGG CTTCGACCTC TTCGCGGGTC 420TTGGTCCGCT GCAGGCGGTC GGCGAGCCAG TTCAGGTTAG GCGGCCGAAA TCTTCCAGTT 480CGCCAGGAAG GGCACCCGGA ACAGGGTCCG CACCC 515
(2)SEQ ID NO:216的信息:
(ⅰ)序列特征:
(A)长度:557碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:216:CCGACCCCAA GGTGCAGATT CAACAGGCCA TTGAGGAAGC ACAGCGCACC CACCAAGCGC 60TGACTCAACA GGCGGCGCAA GTGATCGGTA ACCAGCGTCA ATTGGAGATG CGACTCAACC 120GACAGCTGGC GGACATCGAA AAGCTTCAGG TCAATGTGCG CCAAGCCCTG ACGCTGGCCG 180ACCAGGCCAC CGCCGCCGGA GACGCTGCCA AGGCCACCGA ATACAACAAC GCCGCCGAGG 240CGTTCGCAGC CCAGCTGGTG ACCGCCGAGC AGAGCGTCGA AGACCTCAAG ACGCTGCATG 300ACCAGGCGCT TAGCGCCGCA GCTCAGGCCA AGAAGGCCGT CGAACGAAAT GCGATGGTGC 360TGCAGCAGAA GATCGCCGAG CGAACCAAGC TGCTCAGCCA GCTCGAGCAG GCGAAGATGC 420AGGAGCAGGT CAGCGCATCG TTGCGGTCGA TGAGTGAGCT CGCCGCGCCA GGCAACACGC 480CGAGCCTCGA CGAGGTGCGC GACAAGATCG AGCGTCGCTA CGCCAACGCG ATCGGTTCGG 540CTGAACTTGC CGAGAGT 557
(2)SEQ ID NO:217的信息:
(ⅰ)序列特征:
(A)长度:223碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:217:CAGGATAGGT TTCGACATCC ACCTGGGTTC CGCACCCGGT GCGCGACCGT GTGATAGGCC 60AGAGGTGGAC CTGCGCCGAC CGACGATCGA TCGAGGAGTC AACAGAAATG GCCTTCTCCG 120TCCAGATGCC GGCACTCGGT GAGAGCGTCA CCGAGGGGAC GGTTACCCGC TGGCTCAAAC 180AGGAAGGCGA CACGGTCGAA CTCGACGAGC CCCTCGTGGA GGT 223
(2)SEQ ID NO:218的信息:
(ⅰ)序列特征:
(A)长度:578碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:218:AAGAAGTACA TCTGCCGGTC GATGTCGGCG AACCACGGCA GCCAACCGGC GCAGTAGCCG 60ACCAGGACCA CCGCATAACG CCAGTCCCGG CGCACAAACA TACGCCACCC CGCGTATGCC 120AGGACTGGCA CCGCCAGCCA CCACATCGCG GGCGTGCCGA CCAGCATCTC GGCCTTGACG 180CACGACTGTG CGCCGCAGCC TGCAACGTCT TGCTGGTCGA TGGCGTACAG CACCGGCCGC 240AACGACATGG GCCAGGTCCA CGGTTTGGAT TCCCAAGGGT GGTAGTTGCC TGCGGAATTC 300GTCAGGCCCG CGTGGAAGTG GAACGCTTTG GCGGTGTATT GCCAGAGCGA GCGCACGGCG 360TCGGGCAGCG GAACAACCGA GTTGCGACCG ACCGCTTGAC CGACCGCATG CCGATCGATC 420GCGGTCTCGG ACGCGAACCA CGGAGCGTAG GTGGCCAGAT AGACCGCGAA CGGGATCAAC 480CCCAGCGCAT ACCCGCTGGG AAGCACGTCA CGCCGCACTG TTCCCAGCCA CGGTCTTTGC 540ACTTGGTATG AACGTCGCGC CGCCACGTCA ACGCCAGC 578
(2)SEQ ID NO:219的信息:
(ⅰ)序列特征:
(A)长度:484碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:219:ACAACGATCG ATTGATATCG ATGAGAGACG GAGGAATCGT GGCCCTTCCC CAGTTGACCG 60ACGAGCAGCG CGCGGCCGCG TTGGAGAAGG CTGCTGCCGC ACGTCGAGCG CGAGCAGAGC 120TCAAGGATCG GCTCAAGCGT GGCGGCACCA ACCTCACCCA GGTCCTCAAG GACGCGGAGA 180GCGATGAAGT CTTGGGCAAA ATGAAGGTGT CTGCGCTGCT TGAGGCCTTG CCAAAGGTGG 240GCAAGGTCAA GGCGCAGGAG ATCATGACCG AGCTGGAAAT TGCGCCCCAC CCCGCCGCCT 300TCGTGGCCTC GGTGACCGTC AGCGCAAGGC CCTGCTGGAA AAGTTCGGCT CCGCCTAACC 360CCGCCGGCCG ACGATGCGGG CCGGAAGGCC TGTGGTGGGC GTACCCCCGC ATACGGGGGA 420GAAGCGGCCT GACAGGGCCA GCTCACAATT CAGGCCGAAC GCCCCGGTGG GGGGGAACCC 480GCCC 484
(2)SEQ ID NO:220的信息:
(ⅰ)序列特征:
(A)长度:537碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:220:AGGACTGGCA CCGCCAGCCA CCACATCGCG GGCGTGCCGA CCAGCATCTC GGCCTTGACG 60CACGACTGTG CGCCGCAGCC TGCAACGTCT TGCTGGTCGA TGGCGTACAG CACCGGCCGC 120AACGACATGG GCCAGGTCCA CGGTTTGGAT TCCCAAGGGT GGTAGTTGCC TGCGGAATTC 180GTCAGGCCCG CGTGGAAGTG GAACGCTTTG GCGGTGTAGT GCCAGAGCGA GCGCACGGCG 240TCGGGCAGCG GAACAACCGA GTTGCGACCG ACCGCTTGAC CGACCGCATG CCGATCGATC 300GCGGTCTCGG ACGCGAACCA CGGAGCGTAG GTGGCCAGAT AGACCGCGAA CGGGATCAAC 360CCCAGCGCAT ACCCGCTGGG AAGCACGTCA CGCCGCACTG TCCCCAGCCA CGGTCTTTGC 420ACTTGGTACT GACGTCGCGC CGCCACGTCG AACGCCAGCG CCATCGCGCC GAAGAACAGC 480ACGAAGTACA CGCCGGACCA CTTGGTGGCG CAAGCCAATC CCAAGCAGCA CCCCGGC 537
(2)SEQ ID NO:221的信息:
(ⅰ)序列特征:
(A)长度:135氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:221:Gly Gly Ala Ala Ala Gly Gln Gln Ser Asp Val His Pro Phe Ala Asn1 5 10 15Leu lle Ala Val Asp Asp Glu Arg Ala Glu Arg Arg Asp Asp Glu Glu
20 25 30Arg Gln Glu Ala Val Gln Gln Arg Gly Pro Arg Gly Asp Glu Ala Asp
35 40 45Pro Val Ala Asp Gln Gln His Pro Gly Asp Gly Ala Asp Gln Cys Arg
50 55 60Pro Ala Asp Pro Pro His Asp Pro His His Gln Arg His Gln Asp His65 70 75 80Thr Gln Gln Gly Ala Gly Glu Pro Pro Ala Glu Ser Val Val Thr Glu
85 90 95Asp Gly Leu Pro Asp Arg Asp Gln Leu Leu Thr Asp Arg Arg Val Asn
100 105 110His Gln Ala Val Pro Gly Val Val Phe His Pro Met Val Val Gln His
115 120 125Leu Pro Gly Leu Ala Val Arg
130 135(2)SEQ ID NO:222的信息:
(ⅰ)序列特征:
(A)长度:156氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:222:Gln Lys Tyr Gly Gly Ser Ser Val Ala Asp Ala Glu Arg Ile Arg Arg1 5 10 15Val Ala Glu Arg Ile Val Ala Thr Lys Lys Gln Gly Asn Asp Val Val
20 25 30Val Val Val Ser Ala Met Gly Asp Thr Thr Asp Asp Leu Leu Asp Leu
35 40 45Ala Gln Gln Val Cys Pro Ala Pro Pro Pro Arg Glu Leu Asp Met Leu
50 55 60Leu Thr Ala Gly Glu Arg Ile Ser Asn Ala Leu Val Ala Met Ala Ile65 70 75 80Glu Ser Leu Gly Ala His Ala Arg Ser Phe Thr Gly Ser Gln Ala Gly
85 90 95Val Ile Thr Thr Gly Thr His Gly Ash Ala Lys Ile Ile Asp Val Thr
100 105 110Pro Gly Arg Leu Gln Thr Ala Leu Glu Glu Gly Arg Val Val Leu Val
115 120 125Ala Gly Phe Gln Gly Val Ser Gln Asp Thr Lys Asp Val Thr Thr Leu
130 135 140Gly Arg Gly Gly Ser Asp Thr Thr Ala Val Ala Met145 150 155
(2)SEQ ID NO:223的信息:
(ⅰ)序列特征:
(A)长度:92氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:223:Pro Ala Tyr Pro Ala Gly Thr Asn Asn Asp Arg Leu Ile Ser Met Arg1 5 10 15Asp Gly Gly Ile Val Ala Leu Pro Gln Leu Thr Asp Glu Gln Arg Ala
20 25 30Ala Ala Leu Glu Lys Ala Ala Ala Ala Arg Arg Ala Arg Ala Glu Leu
35 40 45Lys Asp Arg Leu Lys Arg Gly Gly Thr Asn Leu Thr Gln Val Leu Lys
50 55 60Asp Ala Glu Ser Asp Glu Val Leu Gly Lys Met Lys Val Ser Ala Leu65 70 75 80Leu Glu Ala Leu Pro Lys Val Gly Lys Val Gln Ala
85 90(2)SEQ ID NO:224的信息:
(ⅰ)序列特征:
(A)长度:72氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:224:Thr Val Glu Leu Asp Glu Pro Leu Val Glu Val Ser Thr Asp Lys Val1 5 10 15Asp Thr Glu Ile Pro Ser Pro Ala Ala Gly Val Leu Thr Lys Ile Ile
20 25 30Ala Gln Glu Asp Asp Thr Val Glu Val Gly Gly Glu Leu Ser Val Ile
35 40 45Gly Asp Ala His Asp Ala Gly Glu Ala Ala Val Pro Ala Pro Gln Lys
50 55 60Val Ser Ala Gly Pro Thr Arg Ile65 70
(2)SEQ ID NO:225的信息:
(ⅰ)序列特征:
(A)长度:113氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:225:Ala Ala Asp Ile Gly Ala Ala Pro Ala Pro Lys Pro Ala Pro Lys Pro1 5 10 15Val Pro Glu Pro Ala Pro Thr Pro Lys Ala Glu Pro Ala Pro Ser Pro
20 25 30Pro Ala Ala Gln Pro Ala Gly Ala Ala Glu Gly Ala Pro Tyr Val Thr
35 40 45Pro Leu Val Arg Lys Leu Ala Ser Glu Asn Asn Ile Asp Leu Ala Gly
50 55 60Val Thr Gly Thr Gly Val Gly Gly Arg Ile Arg Lys Gln Asp Val Leu65 70 75 80Ala Ala Ala Glu Gln Lys Lys Arg Ala Lys Ala Pro Ala Pro Ala Ala
85 90 95Gln Ala Ala Ala Ala Pro Ala Pro Lys Ala Pro Pro Glu Asp Pro Met
100 105 110Pro
(2)SEQ ID NO:226的信息:
(ⅰ)序列特征:
(A)长度:118氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:226:Val Leu Val Ser Ile Ser Ala Asp Glu Asp Ala Thr Val Pro Val Gly1 5 10 15Gly Glu Leu Ala Arg Ile Gly Val Ala Ala Asp Ile Gly Ala Ala Pro
20 25 30Ala Pro Lys Pro Ala Pro Lys Pro Val Pro Glu Pro Ala Pro Thr Pro
35 40 45Lys Ala Glu Pro Ala Pro Ser Pro Pro Ala Ala Gln Pro Ala Gly Ala
50 55 60Ala Glu Gly Ala Pro Tyr Val Thr Pro Leu Val Arg Lys Leu Ala Ser65 70 75 80Glu Asn Asn Ile Asp Leu Ala Gly Val Thr Gly Thr Gly Val Gly Gly
85 90 95Arg Ile Arg Lys Gln Asp Val Leu Ala Ala Ala Glu Gln Lys Lys Arg
100 105 110Ala Lys Ala Pro Ala Pro
115
(2)SEQ ID NO:227的信息:
(ⅰ)序列特征:
(A)长度:185氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:227:Asp Pro Lys Val Gln Ile Gln Gln Ala Ile Glu Glu Ala Gln Arg Thr1 5 10 15His Gln Ala Leu Thr Gln Gln Ala Ala Gln Val Ile Gly Asn Gln Arg
20 25 30Gln Leu Glu Met Arg Leu Asn Arg Gln Leu Ala Asp Ile Glu Lys Leu
35 40 45Gln Val Asn Val Arg Gln Ala Leu Thr Leu Ala Asp Gln Ala Thr Ala
50 55 60Ala Gly Asp Ala Ala Lys Ala Thr Glu Tyr Asn Asn Ala Ala Glu Ala65 70 75 80Phe AIa Ala Gln Leu Val Thr Ala Glu Gln Ser Val Glu Asp Leu Lys
85 90 95Thr Leu His Asp Gln Ala Leu Ser Ala Ala Ala Gln Ala Lys Lys Ala
100 105 110Val Glu Arg Asn Ala Met Val Leu Gln Gln Lys Ile Ala Glu Arg Thr
115 120 125Lys Leu Leu Ser Gln Leu Glu Gln Ala Lys Met Gln Glu Gln Val Ser
130 135 140Ala Ser Leu Arg Ser Met Ser Glu Leu Ala Ala Pro Gly Asn Thr Pro145 150 155 160Ser Leu Asp Glu Val Arg Asp Lys Ile Glu Arg Arg Tyr Ala Asn Ala
165 170 175Ile Gly Ser Ala Glu Leu Ala Glu Ser
180 185
(2)SEQ ID NO:228的信息:
(ⅰ)序列特征:
(A)长度:71氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:228:Val Ser Thr Ser Thr Trp Val Pro His Pro Val Arg Asp Arg Val Ile1 5 10 15Gly Gln Arg Trp Thr Cys Ala Asp Arg Arg Ser Ile Glu Glu Ser Thr
20 25 30Glu Met Ala Phe Ser Val Gln Met Pro Ala Leu Gly Glu Ser Val Thr
35 40 45Glu Gly Thr Val Thr Arg Trp Leu Lys Gln Glu Gly Asp Thr Val Glu
50 55 60Leu Asp Glu Pro Leu Val Glu65 70
(2)SEQ ID NO:229的信息:
(ⅰ)序列特征:
(A)长度:182氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:229:Glu Val His Leu Pro Val Asp Val Gly Glu Pro Arg Gln Pro Thr Gly1 5 10 15Ala Val Ala Asp Gln Asp His Arg Ile Thr Pro Val Pro Ala His Lys
20 25 30His Thr Pro Pro Arg Val Cys Gln Asp Trp His Arg Gln Pro Pro His
35 40 45Arg Gly Arg Ala Asp Gln His Leu Gly Leu Asp Ala Arg Leu Cys Ala
50 55 60Ala Ala Cys Asn Val Leu Leu Val Asp Gly Val Gln His Arg Pro Gln65 70 75 80Arg His Gly Pro Gly Pro Arg Phe Gly Phe Pro Arg Val Val Val Ala
85 90 95Cys Gly Ile Arg Gln Ala Arg Val Glu Val Glu Arg Phe Gly Gly Val
100 105 110Leu Pro Glu Arg Ala His Gly Val Gly Gln Arg Asn Asn Arg Val Ala
115 120 125Thr Asp Arg Leu Thr Asp Arg Met Pro Ile Asp Arg Gly Leu Gly Arg
130 135 140Glu Pro Arg Ser Val Gly Gly Gln Ile Asp Arg Glu Arg Asp Gln Pro145 150 155 160Gln Arg Ile Pro Ala Gly Lys His Val Thr Pro His Cys Ser Gln Pro
165 170 175Arg Ser Leu His Leu Val
180
(2)SEC)ID N0:230的信息:
(ⅰ)序列特征:
(A)长度:160氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:230:Asn Asp Arg Leu Ile Ser Met Arg Asp Gly Gly Ile Val Ala Leu Pro1 5 10 15Gln Leu Thr Asp Glu Gln Arg Ala Ala Ala Leu Glu Lys Ala Ala Ala
20 25 30Ala Arg Arg Ala Arg Ala Glu Leu Lys Asp Arg Leu Lys Arg Gly Gly
35 40 45Thr Asn Leu Thr Gln Val Leu Lys Asp Ala Glu Ser Asp Glu Val Leu
50 55 60Gly Lys Met Lys Val Ser Ala Leu Leu Glu Ala Leu Pro Lys Val Gly65 70 75 80Lys Val Lys Ala Gln Glu Ile Met Thr Glu Leu Glu Ile Ala Pro His
85 90 95Pro Ala Ala Phe Val Ala Ser Val Thr Val Ser Ala Arg Pro Cys Trp
100 105 110Lys Ser Ser Ala Pro Pro Asn Pro Ala Gly Arg Arg Cys Gly Pro Glu
115 120 125Gly Leu Trp Trp Ala Tyr Pro Arg Ile Arg Gly Arg Ser Gly Leu Thr
130 135 140Gly Pro Ala His Asn Ser Gly Arg Thr Pro Arg Trp Gly Gly Thr Arg145 150 155 160
(2)SEQ ID NO:231的信息:
(ⅰ)序列特征:
(A)长度:178氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:231:Asp Trp His Arg Gln Pro Pro His Arg Gly Arg Ala Asp Gln His Leu1 5 10 15Gly Leu Asp Ala Arg Leu Cys Ala Ala Ala Cys Asn Val Leu Leu Val
20 25 30Asp Gly Val Gln His Arg Pro Gin Arg His Gly Pro Gly Pro Arg Phe
35 40 45Gly Phe Pro Arg Val Val Val Ala Cys Gly Ile Arg Gln Ala Arg Val
50 55 60Glu Val Glu Arg Phe Gly Gly Val Val Pro Glu Arg Ala His Gly Val65 70 75 80Gly Gln Arg Asn Asn Arg Val Ala Thr Asp Arg Leu Thr Asp Arg Met
85 90 95Pro Ile Asp Arg Gly Leu Gly Arg Glu Pro Arg Ser Val Gly Gly Gln
100 105 110Ile Asp Arg Glu Arg Asp Gln Pro Gln Arg Ile Pro Ala Gly Lys His
115 120 125Val Thr Pro His Cys Pro Gln Pro Arg Ser Leu His Leu Val Leu Thr
130 135 140Ser Arg Arg His Val Glu Arg Gln Arg His Arg Ala Glu Glu Gln His145 150 155 160Glu Val His Ala Gly Pro Leu Gly Gly Ala Ser Gln Ser Gln Ala Ala
165 170 175Pro Arg
(2)SEQ ID NO:232的信息:
(ⅰ)序列特征:
(A)长度:271碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:232:ATGCCAAGCC GGTGCTGATG CCCGAGCTCG GCGAATCGGT GACCGAGGGG ACCGTCATTC 60GTTGGCTGAA GAAGATCGGG GATTCGGTTC AGGTTGACGA GCCACTCGTG GAGGTGTCCA 120CCGACAAGGT GGACACCGAG ATCCCGTCCC CGGTGGCTGG GGTCTTGGTC AGTATCAGCG 180CCGACGAGGA CGCCACGGTG CCCGTCGGCG GCGAGTTGGC CCGGATCGGT GTCGCTGCCG 240AGATCGGCGC CGCGCCCGCC CCCAAGCCCC C 271
(2)SEQ ID NO:233的信息:
(ⅰ)序列特征:
(A)长度:89氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:233:Ala Lys Pro Val Leu Met Pro Glu Leu Gly Glu Ser Val Thr Glu Gly1 5 10 15Thr Val Ile Arg Trp Leu Lys Lys Ile Gly Asp Ser Val Gln Val Asp
20 25 30Glu Pro Leu Val Glu Val Ser Thr Asp Lys Val Asp Thr Glu Ile Pro
35 40 45Ser Pro Val Ala Gly Val Leu Val Ser Ile Ser Ala Asp Glu Asp Ala
50 55 60Thr Val Pro Val Gly Gly Glu Leu Ala Arg Ile Gly Val Ala Ala Glu65 70 75 80Ile Gly Ala Ala Pro Ala Pro Lys Pro
85
(2)SEQ ID NO:234的信息:
(ⅰ)序列特征:
(A)长度:107碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:234:GAGGTAGCGG ATGGCCGGAG GAGCACCCCA GGACCGCGCC CGAACCGCGG GTGCCGGTCA 60TCGATATGTG GGCACCGTTC GTTCCGTCCG CCGAGGTCAT TGACGAT 107
(2)SEQ ID NO:235的信息:
(ⅰ)序列特征:
(A)长度:339碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:235:ATGAAGTTGA AGTTTGCTCG CCTGAGTACT GCGATACTGG GTTGTGCAGC GGCGCTTGTG 60TTTCCTGCCT CGGTTGCCAG CGCAGATCCA CCTGACCCGC ATCAGCCGGA CATGACGAAA 120GGCTATTGCC CGGGTGGCCG ATGGGGTTTT GGCGACTTGG CCGTGTGCGA CGGCGAGAAG 180TACCCCGACG GCTCGTTTTG GCACCAGTGG ATGCAAACGT GGTTTACCGG CCCACAGTTT 240TACTTCGATT GTGTCAGCGG CGGTGAGCCC CTCCCCGGCC CGCCGCCACC GGGTGGTTGC 300GGTGGGGCAA TTCCGTCCGA GCAGCCCAAC GCTCCCTGA 339
(2)SEQ ID NO:236的信息:
(ⅰ)序列特征:
(A)长度:112氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:236:Met Lys Leu Lys Phe Ala Arg Leu Ser Thr Ala Ile Leu Gly Cys Ala1 5 10 15Ala Ala Leu Val Phe Pro Ala Ser Val Ala Ser Ala Asp Pro Pro Asp
20 25 30Pro His Gln Pro Asp Met Thr Lys Gly Tyr Cys Pro Gly Gly Arg Trp
35 40 45Gly Phe Gly Asp Leu Ala Val Cys Asp Gly Glu Lys Tyr Pro Asp Gly
50 55 60Ser Phe Trp His Gln Trp Met Gln Thr Trp Phe Thr Gly Pro Gln Phe65 70 75 80Tyr Phe Asp Cys Val Ser Gly Gly Glu Pro Leu Pro Gly Pro Pro Pro
85 90 95Pro Gly Gly Cys Gly Gly Ala Ile Pro Ser Glu Gln Pro Asn Ala Pro
100 105 110(2)SEQ ID NO:237的信息:
(ⅰ)序列特征:
(A)长度:371碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:237:GTGACCACGG TGGGCCTGCC ACCAACCCGG GCAGCGGCAG CCGCGGCGGC GCCGGCGGCT 60CCGGCGGCAA CGGTGGCGCC GGGGGTAACG CCACCGGCTC AGGCGGCAAG GGCGGCGCCG 120GTGGCAATGG CGGTGATGGG AGCTTCGGCG CTACCAGCGG CCCCGCCTCC ATCGGGGTCA 180CGGGCGCCCC CGGCGGCAAC GGCGGCAAGG GCGGCGCCGG TGGCAGCAAC CCCAACGGCT 240CAGGTGGCGA CGGCGGCAAA GGCGGCAACG GCGGTGCCGG CGGCAACGGG GGCTCGATCG 300GCGCCAACAG CGGCATCGTC GGCGGTTCCG GTGGGGCCGG TGGCGCTGGC GGCGCCGGCG 360GAAACGGCAG C 371
(2)SEQ ID NO:238的信息:
(ⅰ)序列特征:
(A)长度:424碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:238:GTCCGGGTCC CACCACCGCG CCGGCGCGCC CCTAGCGGCC GGGCGCACCA GCCCCTTTTC 60TTGACTCGTT CAAGAAAAGG GCCTTCTGTT TGGTCGGCCA TGTTGGCATG ATCGTGACCC 120ATGGGCAACA TCGACGTCGA CATCTCGGCC AAGGTCTAGC TCCATGCGAA TCGCCGCCGC 180GGTGGTGAGC ATCGGTCTAG CCGTCATAGC AGGGTTCGCG GTACCTGTTG CCGACGCACA 240CCCGTCGGAG CCCGGGGTTG TGTCCTACGC GGTGCTCGGA AAGGGGTCGG TCGGCAACAT 300CGTCGGCGCC CCAATGGGGT GGGAGGCGGT GTTCACCAAG CCGTTCCAGG CGTTTTGGGT 360CGAACTACCG GCGTGCAACA ACTGGGTGGA CATCGGGCTG CCCGAGGTGT ACGACGATCC 420CGAC 424
(2)SEQ ID NO:239的信息:
(ⅰ)序列特征:
(A)长度:317碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:239:GCGATGGCGG CCGCGGGTAC CACCGCCAAT GTGGAACGGT TTCCCAACCC CAACGATCCT 60TTGCATCTGG CGTCAATTGA CTTCAGCCCG GCCGATTTCG TCACCGAGGG CCACCGTCTA 120AGGGCGGATG CGATCCTACT GCGCCGTACC GACCGGCTGC CTTTCGCCGA GCCGCCGGAT 180TGGGACTTGG TGGAGTCGCA GTTGCGCACG ACCGTCACCG CCGACACGGT GCGCATCGAC 240GTCATCGCCG ACGATATGCG TCCCGAACTG GCGGCGGCGT CCAAACTCAC CGAATCGCTG 300CGGCTCTACG ATTCGTC 317
(2)SEQ ID NO:240的信息:
(ⅰ)序列特征:
(A)长度:422碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:240:TGGCGTATGC GCTTCGCAGC CGGTGCCGCG TCAACGCGCC GGAGGCAATC GCTTCGCTGC 60CGAGGAATGG TTCGATCACG ATCGCAGTGT GCCGTCGTGC ACCGACACCG CCGTCCAACG 120TGAACTGAGG GCGGAAAATC GGCCGAAATC TCGCCCTCAG TTCACGCTCG GCGCCTAACG 180GTTCTGGAAG TTGGGTGCGC GCTTCTCGGC GAACGCGCGC GGGCCTTCCT TGGCGTCGTC 240GGACAGGAAG ACCTTGATGC CGATCTGGGT GTCGATCTTG AACGCCTCGT TTTCGGGCAT 300GCACTCGGTC TCGCGGATGG ACCGCAAGAT GGCCTGCACG GCCAGGGGTC CGTTAGCCGA 360GATGGCGTCG GCAAGTTCTA GAACCTTGGT CAACGCCTGG CCGTCGGGCA CACGTGGCCG 420AT 422
(2)SEQ ID NO:241的信息:
(ⅰ)序列特征:
(A)长度:426碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:241:GCGTGCCGCT GAACACCAGC CCGCGGCTGC CAGATCTCCC GGACTCGGTA GTGCCGCCGG 60TGGCGTCGTT GCTCTCCTGA CGGGGCGCGG CGACCATAAG GTCGCTAATG CCCAGGTAGC 120GGCCCAGGTG CATGGAGTCG ATGATGATGC GACTCTCCAG CTCGCCGACC GGGAGCTTGG 180CATCGGGCCT GATCAGCCAG GACGCGTAGG ACAAGTCGAT CGAATGCATA GTGGCCTCCA 240GAGTGGCCGT GCCACTTCCG GCGTGCTCCA CGGCAAATGC CTTGATTTCT AGCTCCGCGT 300AGTGTTCCCG CATCGCCTGC GGGATGAATG GGAACCGCAG GATGGCGACA AACGGGTCTG 360ACCTCAGGTT TGCCGCTTTG CGCACAGTGG TCGACAGCCG GTACTCGGCA TAAATGCTGG 420CCCCGA 426
(2)SEQ ID NO:242的信息:
(ⅰ)序列特征:
(A)长度:327碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:242:AGACCGGCGA GGGTGTGGTC GCTGCCCGCG GCATTGTCGA TAATCTGCGC TGGGTCGACG 60CGCCGATCAA CTAGTGAGGC GCAACGCTAG GCTTTGGGAT ACCCACAGCT AAAAAGTTTA 120TCAAAGAAAC GAAGAAGGTT GCCATGAGCA CTGTTGCCGC CTACGCCGCC ATGTCGGCGA 180CCGAACCCCT GACCAAGACC ACGATCACCC GTCGCGACCC GGGCCCGCAC GACATGGCGA 240TCGACATCAA ATTCGCCGGA ATCTGTCGCT CGGACATCCA TACCGTCCAA ACCGAATGGG 300GGCAACCGAA TTTACCTGTG GTCCCTG 327
(2)SEQ ID NO:243的信息:
(ⅰ)序列特征:
(A)长度:123氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:243:Asp His Gly Gly Pro Ala Thr Asn Pro Gly Ser Gly Ser Arg Gly Gly1 5 10 15Ala Gly Gly Ser Gly Gly Asn Gly Gly Ala Gly Gly Asn Ala Thr Gly
20 25 30Ser Gly Gly Lys Gly Gly Ala Gly Gly Asn Gly Gly Asp Gly Ser Phe
35 40 45Gly Ala Thr Ser Gly Pro Ala Ser Ile Gly Val Thr Gly Ala Pro Gly
50 55 60Gly Asn Gly Gly Lys Gly Gly Ala Gly Gly Ser Asn Pro Asn Gly Ser65 70 75 80Gly Gly Asp Gly Gly Lys Gly Gly Asn Gly Gly Ala Gly Gly Asn Gly
85 90 95Gly Ser Ile Gly Ala Asn Ser Gly Ile Val Gly Gly Ser Gly Gly Ala
100 105 110Gly Gly Ala Gly Gly Ala Gly Gly Asn Gly Ser
115 120
(2)SEQ ID NO:244的信息:
(ⅰ)序列特征:
(A)长度:104氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:244:Met Ala Ala Ala Gly Thr Thr Ala Asn Val Glu Arg Phe Pro Asn Pro1 5 10 15Asn Asp Pro Leu His Leu Ala Ser Ile Asp Phe Ser Pro Ala Asp Phe
20 25 30Val Thr Glu Gly His Arg Leu Arg Ala Asp Ala Ile Leu Leu Arg Arg
35 40 45Thr Asp Arg Leu Pro Phe Ala Glu Pro Pro Asp Trp Asp Leu Val Glu
50 55 60Ser Gln Leu Arg Thr Thr Val Thr Ala Asp Thr Val Arg Ile Asp Val65 70 75 80Ile Ala Asp Asp Met Arg Pro Glu Leu Ala Ala Ala Ser Lys Leu Thr
85 90 95Glu Ser Leu Arg Leu Tyr Asp Ser
100
(2)SEQ ID NO:245的信息:
(ⅰ)序列特征:
(A)长度:41氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:245:Ala Tyr Ala Leu Arg Ser Arg Cys Arg Val Asn Ala Pro Glu Ala Ile1 5 10 15Ala Ser Leu Pro Arg Asn Gly Ser Ile Thr Ile Ala Val Cys Arg Arg
20 25 30Ala Pro Thr Pro Pro Ser Asn Val Ash
35 40
(2)SEQ ID NO:246的信息:
(ⅰ)序列特征:
(A)长度:25氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:246:Val Pro Leu Asn Thr Ser Pro Arg Leu Pro Asp Leu Pro Asp Ser Val1 5 10 15Val Pro Pro Val Ala Ser Leu Leu Ser
20 25
(2)SEQ ID NO:247的信息:
(ⅰ)序列特征:
(A)长度:61氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白
(ⅹⅰ)序列描述:SEQ ID NO:247:Met Ser Thr Val Ala Ala Tyr Ala Ala Met Ser Ala Thr Glu Pro Leu1 5 10 15Thr Lys Thr Thr Ile Thr Arg Arg Asp Pro Gly Pro His Asp Met Ala
20 25 30Ile Asp Ile Lys Phe Ala Gly Ile Cys Arg Ser Asp Ile His Thr Val
35 40 45Gln Thr Glu Trp Gly Gln Pro Ash Leu Pro Val Val Pro
50 55 60
(2)SEQ ID NO:248的信息:
(ⅰ)序列特征:
(A)长度:213碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:248:GCTTGGAGCC CTGGAGCGAC GGTGTGGGTC TGGGGGTCGA TTCGTTCTCG GCGAAAGTCA 60ACTAAAGACC ACGTTGACAC CCAACCGGCG GCCCGGCATG GGCCGTCGCG GCGTAGAAGC 120TTTGACCGCG GCGCGAAACG TTCGCTGCTG CGGCCCATGC AGATCGCACA CGCTTGCTTG 180AACATCGGGT GGAGCCGGTG GTAACGCCAG GCT 213
(2)SEQ ID NO:249的信息:
(ⅰ)序列特征:
(A)长度:367碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:249:CCGAGCTGCT GTTCGGCGCC GGCGGTGCGG GCGGCGCGGG TGGGGCGGGC ACCGACGGCG 60GGCCCGGTGC TACCGGCGGG ACCGGCGGAC ACGGCGGAGT CGGCGGCGAC GGCGGATGGC 120TGGCACCCGG CGGGGCCGGC GGGGCCGGCG GGCAAGGCGG GGCAGGTGGT GCCCGCAGCG 180ATGGTGGCGC GTTGGGTGGT ACCGGCGGGA CGGGCGGTAC CGGCGGCGCC GGTGGCGCCG 240GCGGTCGCGG CACACTGCTG CTGGGCGCTG GCGGACAGGG CGGCCTCGGC GGCGCCGGCG 300GACAAGGCGG CACCGGCGGG GGCCGGCGGA GATGGCGTTC TGGGGGGTGT CAGTGGCACT 360GGTGGTA 367
(2)SEQ ID NO:250的信息:
(ⅰ)序列特征:
(A)长度:420碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:250:AAGGCGTGAT TGGCAAGGCG ACCGCGCAGC GGCCCGTAGC CGCGGGACGG CCCAGGCCCC 60GACCGCAGCG GCCGGTGTCT GACCGGGTCA GCGACCAGCG GCGCTGACCG TGCCGCTCGT 120CTACTTCGAC GCCAGCGCCT TCGTCAAACT TCTCACCACC GAGACAGGGA GCTCGCTGGC 180GTCCGCTCTA TGGGACGGCT GCGACGCCGC ATTGTCCAAC CGCCTGGCCT ACCCCGAAGT 240CCGCGCCGCA CTCGCTGCAA CGGGCCGCAA TCACGACCTA ACCGAATCCG AGCTCGCCGA 300CGCCGAGCGT GACTGGGAGG ACTTCTGGGC CGCACCCGCC CAGTCGAACT CACCGCGACG 360GTTGAACAGC ACGCCGGGCA CCTCGCCCGA ACACATGCCT TACGCGGAGC CGACACCGTT 420
(2)SEQ ID NO:251的信息:
(ⅰ)序列特征:
(A)长度:299碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:251:CTCTTGTCGG TGGCATCGGC GGTACCGGCG GAACCGGCGG CAACGCCGGT ATGCTCGCCG 60GCGCCGCCGG GGCCGGCGGT GCCGGCGGGT TCAGCTTCAG CACTGCCGGT GGGGCTGGCG 120GCGCCGGCGG GGCCGGTGGG CTGTTCACCA CCGGCGGTGT CGGCGGCGCC GGTGGGCAGG 180GTCACACGGG CGGGGCGGGC GGCGCCGGCG GGGCCGGCGG GTTGTTTGGT GCCGGCGGCA 240TGGGCGGGGC GGGCGGATTC GGGGATCACG GAACGCTCGG CACCGGCGGG GCCGGCGGG 299
(2)SEQ ID NO:252的信息:
(ⅰ)序列特征:
(A)长度:20氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:252:Leu Glu Pro Trp Ser Asp Gly Val Gly Leu Gly Val Asp Ser Phe Ser1 5 10 15Ala Lys Val Asn
20
(2)SEQ ID NO:253的信息:
(ⅰ)序列特征:
(A)长度:121氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:253:Glu Leu Leu Phe Gly Ala Gly Gly Ala Gly Gly Ala Gly Gly Ala Gly1 5 10 15Thr Asp Gly Gly Pro Gly Ala Thr Gly Gly Thr Gly Gly His Gly Gly
20 25 30Val Gly Gly Asp Gly Gly Trp Leu Ala Pro Gly Gly Ala Gly Gly Ala
35 40 45Gly Gly Gln Gly Gly Ala Gly Gly Ala Arg Ser Asp Gly Gly Ala Leu
50 55 60Gly Gly Thr Gly Gly Thr Gly Gly Thr Gly Gly Ala Gly Gly Ala Gly65 70 75 80Gly Arg Gly Thr Leu Leu Leu Gly Ala Gly Gly Gln Gly Gly Leu Gly
85 90 95Gly Ala Gly Gly Gln Gly Gly Thr Gly Gly Gly Arg Arg Arg Trp Arg
100 105 110Ser Gly Gly Cys Gln Trp His Trp Trp
115 120
(2)SEQ ID NO:254的信息:
(ⅰ)序列特征:
(A)长度:34氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:254:Gly Val Ile Gly Lys Ala Thr Ala Gln Arg Pro Val Ala Ala Gly Arg1 5 10 15Pro Arg Pro Arg Pro Gln Arg Pro Val Ser Asp Arg Val Ser Asp Gln
20 25 30Arg Arg
(2)SEQ ID NO:255的信息:
(ⅰ)序列特征:
(A)长度:99氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:255:Leu Val Gly Gly Ile Gly Gly Thr Gly Gly Thr Gly Gly Asn Ala Gly1 5 10 15Met Leu Ala Gly Ala Ala Gly Ala Gly Gly Ala Gly Gly Phe Ser Phe
20 25 30Ser Thr Ala Gly Gly Ala Gly Gly Ala Gly Gly Ala Gly Gly Leu Phe
35 40 45Thr Thr Gly Gly Val Gly Gly Ala Gly Gly Gln Gly His Thr Gly Gly
50 55 60Ala Gly Gly Ala Gly Gly Ala Gly Gly Leu Phe Gly Ala Gly Gly Met65 70 75 80Gly Gly Ala Gly Gly Phe Gly Asp His Gly Thr Leu Gly Thr Gly Gly
85 90 95Ala Gly Gly
(2)SEQ ID NO:256的信息:
(ⅰ)序列特征:
(A)长度:282碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:256:TCCTGTTCGG CGCCGGCGGG GTGGGCGGTG TTGGCGGTGA CGGTGTGGCA TTCCTGGGCA 60CCGCCCCCGG CGGGCCCGGT GGTGCCGGCG GGGCCGGTGG GCTGTTCAGC GTCGGTGGGG 120CCGGCGGCGC CGGCGGAATC GGATTGGTCG GGAACAGCGG TGCCGGGGGG TCCGGCGGGT 180CCGCCCTGCT CTGGGGCGAC GGCGGTGCCG GCGGCGCGGG TGGGGTCGGG TCCACTACCG 240GCGGTGCCGG CGGGGCGGGC GGCAACGCCA GCCTGCTGGT AA 282
(2)SEQ ID NO:257的信息:
(ⅰ)序列特征:
(A)长度:415碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:257:CGGCACGAGC CGTGCTACTG GTCAACTGAT GCCCTGATTG TGACCTTCCC GGCGCCGGAT 60CAGTGCTTCT CAGGACCGAC GTAATATTCG AAAACCAATC CGGCCGCCGA GGCGAGGATG 120AATGCCACAC CGGCGGCGAT CAGCCACGGG AGCCACAACG CGATGCCGAC CGCTGCCACC 180GAGCCGGACA ACGCGACCAT GATCGGCCAC CAGCTATGCG GACTGAAGAA TCCAAGTTCT 240CCTGCGCCGT CGCTGATTTC AGCGCCTTCG TAGTCCTCGG GCCGGGAATC TAACCGGCGG 300GCCACAAACC GGAAGAAGGT GGCGACGATC AACGCCATGC CGCCGGTGAG CGCCAACGCA 360ATGGTGCCAG CCCACTCGAC ACCACCGGTG GCGAACATCG AGGTCAACAC GCCGT 415
(2)SEQ ID NO:258的信息:
(ⅰ)序列特征:
(A)长度:373碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:258:TCACCGCGTG AACGGTTCGT AACACTGATA CGTATGCTTG TCAGCGAGCA GATCAAGTCC 60AGTCCGACCA ATGCCAGGAG ATCATCGGCT AGGCTCACGG TTTCGCCTGG GACGAGACGG 120TATTGAGTTC TGGCGTTGGA CGGTCCGTGG CGTGGTGGGA AGTCTGACGC GGCATCAGAA 180CGGTTGTCAA TACCAGTCTT TGGGGGATAT GGCCTATTTG GTGTCGTCGG GCCGCTCCAC 240CGGATCCCTT TTCGAACGTT GCGCAAGCGC GGTCCAGTTA CGGCCTGTTC ACTGCGCGCT 300GGCGTAGCTG CGCGGCCTCG ATCGGTTTGA ACGTCATCGC AATTCCCGCA ATGGGTGAGT 360ACCTGACGCT CCT 373
(2)SEQ ID NO:259的信息:
(ⅰ)序列特征:
(A)长度:423碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SECQID NO:259:CCAAACCGGA CAGGCCGGCA GCGACGGTCG GAAGTTGCAC CACGGTGCGC GCTCCATGTA 60GCCAACCGGT GACCACGGCG TAGACAGCAG ATCCGTGGAT CGCGCGTTCG GTGTCGTCCG 120GGCCGAGTAC CCGCGGGCCG AACCGCAGCG ACCAAAGCAA CGCGATCGAT ACGGGGATCG 180CCACTCGTGC CGAATTCGAG CTCCGTCGAC AAGCTTGCGG CCGCACTCGA ACCCGGGTGA 240ATGATTGAGT TTAAACCGCT TAGCAATAAC TAGCATAACC CCTTGGGGCC TCTAAACGGG 300TCTTGAGGGG TTTTTTGCTG AAAGGAGGAA CTATATCCGG ATAACCTGGC GTAGTAGCGA 360AGAGGCCCGC ACCGATCGCC CTTCCCAACA GTTGCGCAGC CTGAATGGCG AATGGACGCG 420CCC 423
(2)SEQ ID NO:260的信息:
(ⅰ)序列特征:
(A)长度:404碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:260:AGTGGCCAGC CGGTCGGCCA ATGCATCCAG CTCCCGGTAC GTCAGCTGAC CATCCGCCCA 60ACTGACCGCC ACCGAGTCAG GCTGTGCCGC AGCGATTTCG GCGAACCGGG TATGCACCGC 120GGGTGCCGAC GTCGTCACAT CCGGCAGGCC GGGTGCGGTC GGATCGTGCT CGCCGTCCAG 180CAGAATGTCG ACGTCGCGCA GCGGCCGATC CCACCGGCTG ACCAAGCGCT GTAACACAGC 240CAGCACCCGC CTGCCGAGGC TTTCGGGCGC CATCGTGCCC AGCGCACCGT CGAGCACCTC 300CACTAGCAGC GTGAGCTCAC CGGTGCTGCG GTGCGCGGCG ACGGTCACCG GAAAGTGCGA 360CAAACTCTCT AGCGCCACCG GACGGAACGT CACCCCGTTT GCGA 404
(2)SEQ ID NO:261的信息:
(ⅰ)序列特征:
(A)长度:421碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:261:GTCCTGGTCG CAGGCTGTTC TTCGAACCCG CTGGCTAACT TCGCACCCGG GTATCCGCCC 60ACCATCGAAC CCGCCCAACC GGCGGTGTCA CCGCCTACTT CGCAAGACCC GGCCGGTGCA 120GTGCGACCAC TGAGCGGCCA CCCCCGGGCG GCACTATTCG ACAACGGCAC CCGCCAATTG 180GTGGCTCTGC GCCCGGGCGC CGATTCGGCG GCACCCGCCA GCATCATGGT CTTCGATGAC 240ATGCACGTTG CACCGCGCGT CATTTTTCTG CCGGGCCCGG CAGCCGCGTT GACCAGCGAC 300GACCACGGCA CGGCCTTCCT TGCCGCCCGC GGCGGCTACT TCGTGGCCGA CCTGTCCTCC 360GGTCACACCG CACGAGTGAA TGTCGCTGAC GCAGCGCACA CCGATTTCAC CGCGATCGCC 420C 421
(2)SEQ ID NO:262的信息:
(ⅰ)序列特征:
(A)长度:426碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:262:ATGCATATCA CGCTCAACGC CATCCTGCGT GCGATCTTCG GGGCCGGCGG CAGTGAACTA 60GACGAGCTGC GCCGCCTCAT TCCGCCGTGG GTCACGCTGG GCTCGCGCCT GGCGGCGCTA 120CCGAAACCCA AACGCGACTA TGGCCGCCTT AGCCCGTGGG GCCGGCTGGC CGAGTGGCGG 180CGCCAGTACG ACACTGTCAT CGACGAGCTC ATCGAAGCCG AGCGGGCCGA CCCGAACTTC 240GCCGATCGGA CCGACGTTTT GGCGTTGATG CTGCGCAGCA CTTACGACGA CGGTTCCATC 300ATGTCGCGCA AGGACATTGG CGACGAACTG CTCACGCTGC TTGCCGCCGG GCACGAAACC 360ACGGCGGCGA CATGGGCTGG GCGTTCGAAC GGCTCAACCG GCACCCCGAC GTGCTCGCGG 420CTCTGG 426
(2)SEQ ID NO:263的信息:
(ⅰ)序列特征:
(A)长度:522碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:263:GTCCTGGTCG CAGGCTGTTC TTCGAACCCG CTGGCTAACT TCGCACCCGG GTATCCGCCC 60ACCATCGAAC CCGCCCAACC GGCGGTGTCA CCGCCTACTT CGCAAGACCC GGCCGGTGCA 120GTGCGACCAC TGAGCGGCCA CCCCCGGGCG GCACTATTCG ACAACGGCAC CCGCCAATTG 180GTGGCTCTGC GCCCGGGCGC CGATTCGGCG GCACCCGCCA GCATCATGGT CTTCGATGAC 240GTGCACGTTG CACCGCGCGT CATTTTTCTG CCGGGCCCGG CAGCCGCGTT GACCAGCGAC 300GACCACGGCA CGGCCTTCCT TGCCGCCCGC GGCGGCTACT TCGTGGCCGA CCTGTCCTCC 360GGTCACACCG CACGAGTGAA TGTCGCTGAC GCAGCGCACA CCGATTTCAC CGCGATCGCC 420CGCCGCTCCG ACGGCAAGCT GGTGCTGGGC AGCGCAGATG GCGCCGTCTA CACGCTTGCC 480AAGAACCCGC AGTTGACCGG CGTCGGCGCC GCCACCGTAG CC 522
(2)SEQ ID NO:264的信息:
(ⅰ)序列特征:
(A)长度:739碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:264:GCTGGGGCGC ACCGCCGTCC GGCGGCCCCA GCCCCTGGGC CCAGACCCCG CGCAAAACCA 60ACCCGTGGCC CTTAGTGGCC GGCGCCGCCG CCGTCGTGCT CGTCCTCGTG TTGGGCGCCA 120TCGGCATCTG GATCGCCATC CGGCCCAAGC CGGTACAGCC GCCTCAGCCG GTTGCGGAGG 180AGCGCCTTAG CGCCCTACTG CTGAACTCCT CAGAAGTCAA CGCCGTGATG GGCTCGTCGT 240CCATGCAGCC GGGCAAACCG ATCACATCGA TGGACTCTTC GCCGGTGACG GTGTCCCTGC 300CGGACTGCCA GGGCGCGCTG TATACCAGCC AGGATCCGGT GTATGCCGGC ACCGGCTACA 360CCGCCATCAA CGGCTTGATT TCATCCGAGC CGGGCGACAA CTACGAACAT TGGGTGAACC 420AAGCCGTCGT CGCCTTTCCG ACCGCCGACA AAGCCCGCGC GTTCGTGCAG ACTTCGGCCG 480ACAAATGGAA GAACTGCGCA GGCAAGACGG TCACCGTCAC GAATAAGGCC AAGACCTACC 540GGTGGACGTT TGCCGACGTC AAAGGCAGCC CGCCGACGAT CACGGTGATA GACACCCAAG 600AAGGCGCTGA GGGCTGGGAA TGCCAACGCG CGATGAGCGT GGCCAACAAT GTGGTTGTCG 660ACGTCAACGC ATGCGGGTAC CAGATCACCA ATCAAGCAGG CCAGATCGCC GCCAAGATCT 720GTTGACAAAG TCAACAAGG 739
(2)SEQ ID NO:265的信息:
(ⅰ)序列特征:
(A)长度:69碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:265:AGACGTCGTC GAGGCCGCCA TCGCCCGCGC CGAAGCCGTT AACCCGGCAC TGAACGCGTT 60GGCGTATGC 69
(2)SEQ ID NO:266的信息:
(ⅰ)序列特征:
(A)长度:523碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:266:ACTGCACCCG GCAGGCGCGA CCAACGGATC GGGTCAACTA GCACTGCCGG TGGAGGCGCC 60CCCGCGGTCT GTGCCTTCCC ACGGGGAACC CTTGGGCAGC GCGGCTCCAG AAGGGTTGGA 120GGGAGAGTTC GACGACCGTA TCGACGAGCG GTTCCCGGTC TTCAGCTCGG CCAGTCTCGC 180CGAAGCGCTG CCGGGTCCGC TGACCCCGAT GACGCTGGAT GTCCAGTTGA GTGGACTGCG 240CGCGGCCGGT CGGGCGATGG GTCGGGTACT GGCGCTTGGC GGTGTCGTTG CCGATGAGTG 300GGAGAGAAGA GCCATCGCGG TGTTCGGTCA CCGCCCGTAT ATCGGAGTGT CGGCCAATAT 360TGTGGCCGCC GCCCAACTGC CGGGGTGGGA CGCGCAGGCC GTAACCCGGC GGGCACTGGG 420CGAGCAACCG CAGGTCACTG AGCTGCTTCC GTTTGGTCGA CCGCAACTTG CGGGCGGACC 480GCTCGGCTCG GTCGCGAAGG TGGTCGTGAC GGCACGGTCG CTG 523
(2)SEQ ID NO:267的信息:
(ⅰ)序列特征:
(A)长度:224碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:267:GTGTCGGTGT CGTCGGGGTA GGAGCGACTT CCCCGGCCGG CGCCGGCGCC GGAGCGGGCT 60CTGCAGGAAC CGGTGCCGGC GCCGGCGGCG GGGCGACCAA AGGCCGGATC GATTCGGCCA 120GCGCCTTGGC CGCGCCCTTG TCCACCGGGT TGTTGGCGGT CCCGAGCCAT ACCACAAACC 180AACGCTGAAG GGGCCCGGCG TCCGGTGCGT TCGCCGCGGG CGAC 224
(2)SEQ ID NO:268的信息:
(ⅰ)序列特征:
(A)长度:521碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:268:TGAACTGACT GCCCCGCTCG ATCGGCGGCG GCGGCGTGTC ATAGCTGCGC CGCCAGGCCA 60TGAACTGCTC TTCGCCATAG CGGGCCTTGG TCTCGGCCTT GTCCAAACCC TGCAGCGCGC 120CGTAGTGGCG TTCGTTGAGC CGCCAGCTAC GCCGCACGGG AATCCAGAGC CGATCGGCGC 180TGTCCAACGC CAGATGCGCG GTGGTGATCG CGCGCCGCAG CAACGAGGTG TAGAGCACGT 240CGGGCAATAG GTCGTGTTCC GCGATCAGCT CGCCGCTTCG AACCGCCTCT GCCTGGCCCT 300TGTCCGTCAG GCCGACATCG ACCCAGCCGG TGAACAGGTT GAGGGCATTC CAGTCGCTCT 360CGCCGTGGCG CAGCAACACC AGGCTGCCAG TGTTTGCCAT ACCGGCAAGT CTCTCACGCA 420CTCCCGCACT CCTCATCGTG GACCAAAATG CCCGAATTCT CCTCGGTCCG CTGCGCAGCG 480CGTTCATACC GCCGAGGTGG TCGGCACCGT AACGGCCGGT T 521
(2)SEQ ID NO:269的信息:
(ⅰ)序列特征:
(A)长度:426碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:269:CTCCAGGCTC ATTCGCTCGA ACAAAGCCAC CCGGCCGTAC AGCGGACGCC CCCATTCGTT 60GTCGTGATAG TCGCGGTACA GCTGGGCATC GGGCCCTGGA CGAACCTCCG CCCAGGGGCA 120GCGAACCAGC CCGTCGCCGC TCACGCGGGG TCAGAACGGT AGTGCACGAC AGTCTCGCCG 180CGCGAAGGGT TTGACGCGTC AGACTCGGCC TCGGCGTCTT CCGACGAGGC GTGGATCGCC 240CCGAGCTGAG AGCGTAGCGC CTCGAGCTCA CGGCCGAGCC GTTCCAGCAC CCAGTCCACC 300TCGCTGGTCT TGTTCCCGCG CAGCACCTGC GTGAACTTGA CCGCGTCGAC ATCGGCGCGG 360GTGACCCCGA ACGCCGGCAG CGTCGTCGCC GTCGTCGCCC GCGGCAGGGG CGGCAACTGC 420TCGCCA 426
(2)SEQ ID NO:270的信息:
(ⅰ)序列特征:
(A)长度:219碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:270:GCGGACACGG CGGACAAAGC GCAATCGGCC TCGGCGGCGG CGCCGGCGGC GACGGGGGCC 60AGGGCGGCGC CGGCCGCGGA CTGTGGGGTA CTGGCGGCGC CGGCGGACAC GGCGGGGCAA 120GGCGGTGGTA CCGGGGGCCC ACCGCTGCCC GGTCAGGCAG GCATGGGCGC CGCGGGTGGC 180GCCGGTGGGC TGATCGGCAA CGGCGGGGCC GGCGGCGAC 219
(2)SEQ ID NO:271的信息:
(ⅰ)序列特征:
(A)长度:571碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:271:AAGATCATCG GCGCCGCTCC TTAGCATCGC TGCGCTCTGC ATCGTCGCCG GCGCGGATCA 60CGGAGGTCCG GCCTTGTACC CCACTCCTCG AACGGTCAGC ACCACAGTCG GGTTCTCGGG 120ATCCTTTTCG ACCTTGGCCC GCAGACGCTG GACATGCACG TTCACCAGCC TGGTATCGGC 180TGGGTGCCGG TAACCCCATA CCTGTTCGAG CAGCACATCA CGAGTAAACA CCTGGCGCGG 240CTTGCGCGCC AATGCGACCA ACAGGTCGAA TTCCAGCGGT GTCAACGAGA TCTGCTCACC 300GTTGCGAGTG ACCTTGTGCG CCGGTACGTC GATTTCTACG TCGGCGATGG ACAGCATCTC 360GGCGGGTTCG TCGTCGTTGC GGCGCAGCCG CGCCCGCACC CGCGCAACCA GCTCCTTGGG 420CTTGAACGGC TTCATGATGT AGTCGTCGGC GCCCGACTCC AGACCCAGCA CCACATCCAC 480GGTGTCGGTC TTTGCGGTGA GCATCACGAT CGGAACACCG GAATCGGCGC GCAACACCCG 540GCACACGTCG ATGCCGTTCA TACCGGGGCA A 571
(2)SEQ ID NO:272的信息:
(ⅰ)序列特征:
(A)长度:93氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:272:Leu Phe Gly Ala Gly Gly Val Gly Gly Val Gly Gly Asp Gly Val Ala1 5 10 15Phe Leu Gly Thr Ala Pro Gly Gly Pro Gly Gly Ala Gly Gly Ala Gly
20 25 30Gly Leu Phe Ser Val Gly Gly Ala Gly Gly Ala Gly Gly Ile Gly Leu
35 40 45Val Gly Asn Ser Gly Ala Gly Gly Ser Gly Gly Ser Ala Leu Leu Trp
50 55 60Gly Asp Gly Gly Ala Gly Gly Ala Gly Gly Val Gly Ser Thr Thr Gly65 70 75 80Gly Ala Gly Gly Ala Gly Gly Asn Ala Ser Leu Leu Val
85 90
(2)SEQ ID NO:273的信息:
(ⅰ)序列特征:
(A)长度:26氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:273:Met Pro Pro Val Ser Ala Asn Ala Met Val Pro Ala His Ser Thr Pro1 5 10 15Pro Val Ala Asn Ile Glu Val Asn Thr Pro
20 25
(2)SEQ ID NO:274的信息:
(ⅰ)序列特征:
(A)长度:26氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:274:Lys Pro Asp Arg Pro Ala Ala Thr Val Gly Ser Cys Thr Thr Val Arg1 5 10 15Ala Pro Cys Ser Gln Pro Val Thr Thr Ala
20 25
(2)SEQ ID NO:275的信息:
(ⅰ)序列特征:
(A)长度:20氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:275:Trp Pro Ala Gly Arg Pro Met His Pro Ala Pro Gly Thr Ser Ala Asp1 5 10 15His Pro Pro Asn
20
(2)SEQ ID NO:276的信息:
(ⅰ)序列特征:
(A)长度:140氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:276:Val Leu Val Ala Gly Cys Ser Ser Asn Pro Leu Ala Asn Phe Ala Pro1 5 10 15Gly Tyr Pro Pro Thr Ile Glu Pro Ala Gln Pro Ala Val Ser Pro Pro
20 25 30Thr Ser Gln Asp Pro Ala Gly Ala Val Arg Pro Leu Ser Gly His Pro
35 40 45Arg Ala Ala Leu Phe Asp Asn Gly Thr Arg Gln Leu Val Ala Leu Arg
50 55 60Pro Gly Ala Asp Ser Ala Ala Pro Ala Ser Ile Met Val Phe Asp Asp65 70 75 80Met His Val Ala Pro Arg Val Ile Phe Leu Pro Gly Pro Ala Ala Ala
85 90 95Leu Thr Ser Asp Asp His Gly Thr Ala Phe Leu Ala Ala Arg Gly Gly
100 105 110Tyr Phe Val Ala Asp Leu Ser Ser Gly His Thr Ala Arg Val Asn Val
115 120 125Ala Asp Ala Ala His Thr Asp Phe Thr Ala Ile Ala
130 135 140
(2)SEQ ID NO:277的信息:
(ⅰ)序列特征:
(A)长度:142氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:277:Met His Ile Thr Leu Asn Ala Ile Leu Arg Ala Ile Phe Gly Ala Gly1 5 10 15Gly Ser Glu Leu Asp Glu Leu Arg Arg Leu Ile Pro Pro Trp Val Thr
20 25 30Leu Gly Ser Arg Leu Ala Ala Leu Pro Lys Pro Lys Arg Asp Tyr Gly
35 40 45Arg Leu Ser Pro Trp Gly Arg Leu Ala Glu Trp Arg Arg Gln Tyr Asp
50 55 60Thr Val Ile Asp Glu Leu Ile Glu Ala Glu Arg Ala Asp Pro Ash Phe65 70 75 80Ala Asp Arg Thr Asp Val Leu Ala Leu Met Leu Arg Ser Thr Tyr Asp
85 90 95Asp Gly Ser Ile Met Ser Arg Lys Asp Ile Gly Asp Glu Leu Leu Thr
100 105 110Leu Leu Ala Ala Gly His Glu Thr Thr Ala Ala Thr Trp Ala Gly Arg
115 120 125Ser Asn Gly Ser Thr Gly Thr Pro Thr Cys Ser Arg Leu Trp
130 135 140
(2)SEQ ID NO:278的信息:
(ⅰ)序列特征:
(A)长度:163氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:278:Val Leu Val Ala Gly Cys Ser Ser Asn Pro Leu Ala Asn Phe Ala Pro1 5 10 15Gly Tyr Pro Pro Thr Ile Glu Pro Ala Gln Pro Ala Val Ser Pro Pro
20 25 30Thr Ser Gln Asp Pro Ala Gly Ala Val Arg Pro Leu Ser Gly His Pro
35 40 45Arg Ala Ala Leu Phe Asp Asn Gly Thr Arg Gln Leu Val Ala Leu Arg
50 55 60Pro Gly Ala Asp Ser Ala Ala Pro Ala Ser Ile Met Val Phe Asp Asp65 70 75 80Val His Val Ala Pro Arg Val Ile Phe Leu Pro Gly Pro Ala Ala Ala
85 90 95Leu Thr Ser Asp Asp His Gly Thr Ala Phe Leu Ala Ala Arg Gly Gly
100 105 110Tyr Phe Val Ala Asp Leu Ser Ser Gly His Thr Ala Arg Val ASn Val
115 120 125Ala Asp Ala Ala His Thr Asp Phe Thr Ala Ile Ala Arg Arg Ser Asp
130 135 140Gly Lys Leu Val Leu Gly Ser Ala Asp Gly Ala Val Tyr Thr Leu Ala145 150 155 160Lys Asn Pro
(2)SEQ ID NO:279的信息:
(ⅰ)序列特征:
(A)长度:240氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:279:Trp Gly Ala Pro Pro Ser Gly Gly Pro Ser Pro Trp Ala Gln Thr Pro1 5 10 15Arg Lys Thr Asn Pro Trp Pro Leu Val Ala Gly Ala Ala Ala Val Val
20 25 30Leu Val Leu Val Leu Gly Ala Ile Gly Ile Trp Ile Ala Ile Arg Pro
35 40 45Lys Pro Val Gln Pro Pro Gln Pro Val Ala Glu Glu Arg Leu Ser Ala
50 55 60Leu Leu Leu Asn Ser Ser Glu Val Asn Ala Val Met Gly Ser Ser Ser65 70 75 80Met Gln Pro Gly Lys Pro Ile Thr Ser Met Asp Ser Ser Pro Val Thr
85 90 95Val Ser Leu Pro Asp Cys Gln Gly Ala Leu Tyr Thr Ser Gln Asp Pro
100 105 110Val Tyr Ala Gly Thr Gly Tyr Thr Ala Ile Asn Gly Leu Ile Ser Ser
115 120 125Glu Pro Gly Asp Asn Tyr Glu His Trp Val Asn Gln Ala Val Val Ala
130 135 140Phe Pro Thr Ala Asp Lys Ala Arg Ala Phe Val Gln Thr Ser Ala Asp145 150 155 160Lys Trp Lys Asn Cys Ala Gly Lys Thr Val Thr Val Thr Asn Lys Ala
165 170 175Lys Thr Tyr Arg Trp Thr Phe Ala Asp Val Lys Gly Ser Pro Pro Thr
180 185 190Ile Thr Val Ile Asp Thr Gln Glu Gly Ala Glu Gly Trp Glu Cys Gln
195 200 205Arg Ala Met Ser Val Ala Asn Asn Val Val Val Asp Val Asn Ala Cys
210 215 220Gly Tyr Gln Ile Thr Asn Gln Ala Gly Gln Ile Ala Ala Lys Ile Cys225 230 235 240
(2)SEQ ID NO:280的信息:
(ⅰ)序列特征:
(A)长度:22氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:280:Asp Val Val Glu Ala Ala Ile Ala Arg Ala Glu Ala Val Asn Pro Ala1 5 10 15Leu Asn Ala Leu Ala Tyr
20
(2)SEQ ID NO:281的信息:
(ⅰ)序列特征:
(A)长度:174氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:281:Leu His Pro Ala Gly Ala Thr Asn Gly Ser Gly Gln Leu Ala Leu Pro1 5 10 15Val Glu Ala Pro Pro Arg Ser Val Pro Ser His Gly Glu Pro Leu Gly
20 25 30Ser Ala Ala Pro Glu Gly Leu Glu Gly Glu Phe Asp Asp Arg Ile Asp
35 40 45Glu Arg Phe Pro Val Phe Ser Ser Ala Ser Leu Ala Glu Ala Leu Pro
50 55 60Gly Pro Leu Thr Pro Met Thr Leu Asp Val Gln Leu Ser Gly Leu Arg65 70 75 80Ala Ala Gly Arg Ala Met Gly Arg Val Leu Ala Leu Gly Gly Val Val
85 90 95Ala Asp Glu Trp Glu Arg Arg Ala Ile Ala Val Phe Gly His Arg Pro
100 105 110Tyr Ile Gly Val Ser Ala Asn Ile Val Ala Ala Ala Gln Leu Pro Gly
115 120 125Trp Asp Ala Gln Ala Val Thr Arg Arg Ala Leu Gly Glu Gln Pro Gln
130 135 140Val Thr Glu Leu Leu Pro Phe Gly Arg Pro Gln Leu Ala Gly Gly Pro145 150 155 160Leu Gly Ser Val Ala Lys Val Val Val Thr Ala Arg Ser Leu
165 170
(2)SEQ ID NO:282的信息:
(ⅰ)序列特征:
(A)长度:61氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:282:Val Gly Val Val Gly Val Gly Ala Thr Ser Pro Ala Gly Ala Gly Ala1 5 10 15Gly Ala Gly Ser Ala Gly Thr Gly Ala Gly Ala Gly Gly Gly Ala Thr
20 25 30Lys Gly Arg Ile Asp Ser Ala Ser Ala Leu Ala Ala Pro Leu Ser Thr
35 40 45Gly Leu Leu Ala Val Pro Ser His Thr Thr Asn Gln Arg
50 55 60
(2)SEQ ID NO:283的信息:
(ⅰ)序列特征:
(A)长度:133氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:283:Met Ala Asn Thr Gly Ser Leu Val Leu Leu Arg His Gly Glu Ser Asp1 5 10 15Trp Asn Ala Leu Ash Leu Phe Thr Gly Trp Val Asp Val Gly Leu Thr
20 25 30Asp Lys Gly Gln Ala Glu Ala Val Arg Ser Gly Glu Leu Ile Ala Glu
35 40 45His Asp Leu Leu Pro Asp Val Leu Tyr Thr Ser Leu Leu Arg Arg Ala
50 55 60Ile Thr Thr Ala His Leu Ala Leu Asp Ser Ala Asp Arg Leu Trp Ile65 70 75 80Pro Val Arg Arg Ser Trp Arg Leu Asn Glu Arg His Tyr Gly Ala Leu
85 90 95Gln Gly Leu Asp Lys Ala Glu Thr Lys Ala Arg Tyr Gly Glu Glu Gln
100 105 110Phe Met Ala Trp Arg Arg Ser Tyr Asp Thr Pro Pro Pro Pro Ile Glu
115 120 125Arg Gly Ser Gln Phe
130
(2)SEQ ID NO:284的信息:
(ⅰ)序列特征:
(A)长度:63氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:284:Pro Gly Ser Phe Ala Arg Thr Lys Pro Pro Gly Arg Thr Ala Asp Ala1 5 10 15Pro Ile Arg Cys Arg Asp Ser Arg Gly Thr Ala Gly His Arg Ala Leu
20 25 30Asp Glu Pro Pro Pro Arg Gly Ser Glu Pro Ala Arg Arg Arg Ser Arg
35 40 45Gly Val Arg Thr Val Val His Asp Ser Leu Ala Ala Arg Arg Val
50 55 60
(2)SEQ ID NO:285的信息:
(ⅰ)序列特征:
(A)长度:72氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:285:Gly His Gly Gly Gln Ser Ala Ile Gly Leu Gly Gly Gly Ala Gly Gly1 5 10 15Asp Gly Gly Gln Gly Gly Ala Gly Arg Gly Leu Trp Gly Thr Gly Gly
20 25 30Ala Gly Gly His Gly Gly Ala Arg Arg Trp Tyr Arg Gly Pro Thr Ala
35 40 45Ala Arg Ser Gly Arg His Gly Arg Arg Gly Trp Arg Arg Trp Ala Asp
50 55 60Arg Gln Arg Arg Gly Arg Arg Arg65 70
(2)SEQ ID NO:286的信息:
(ⅰ)序列特征:
(A)长度:74氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:286:Asp His Arg Arg Arg Ser Leu Ala Ser Leu Arg Ser Ala Ser Ser Pro1 5 10 15Ala Arg Ile Thr Glu Val Arg Pro Cys Thr Pro Leu Leu Glu Arg Ser
20 25 30Ala Pro Gln Ser Gly Ser Arg Asp Pro Phe Arg Pro Trp Pro Ala Asp
35 40 45Ala Gly His Ala Arg Ser Pro Ala Trp Tyr Arg Leu Gly Ala Gly Asn
50 55 60Pro Ile Pro Val Arg Ala Ala His His Glu65 70
(2)SEQ ID NO:287的信息:
(ⅰ)序列特征:
(A)长度:174碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:287:CCGCACGTAA CACCGTGAAT TGAAGGGAGC CGCTGGTCAT GGGCCGATTC TATCCGTGGG 60CGAACGGTTA TTGACGGCCC GGAGGCCACT CCGCTGCCAC CAAGTGGTGA CTCAGCGCGT 120TTTCACGGCA ACGAACGGCG GACACACCAC TTGACATTCG ACAGCACGGC CGCG 174
(2)SEQ ID NO:288的信息:
(ⅰ)序列特征:
(A)长度:404碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:288:TCGCAAACGG GGTGACGTTC CGTCCGGTGG CGCTAGAGAG TTTGTCGCAC TTTCCGGTGA 60CCGTCGCCGC GCACCGCAGC ACCGGTGAGC TCACGCTGCT AGTGGAGGTG CTCGACGGTG 120CGCTGGGCAC GATGGCGCCC GAAAGCCTCG GCAGGCGGGT GCTGGCTGTG TTACAGCGCT 180TGGTCAGCCG GTGGGATCGG CCGCTGCGCG ACGTCGACAT TCTGCTGGAC GGCGAGCACG 240ATCCGACCGC ACCCGGCCTG CCGGATGTGA CGACGTCGGC ACCCGCGGTG CATACCCGGT 300TCGCCGAAAT CGCTGCGGCA CAGCCTGACT CGGTGGCGGT CAGTTGGGCG GATGGTCAGC 360TGACGTACCG GGAGCTGGAT GCATTGGCCG ACCGGCTGGC CACT 404
(2)SEQ ID NO:289的信息:
(ⅰ)序列特征:
(A)长度:134氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:289:Ala Asn Gly Val Thr Phe Arg Pro Val Ala Leu Glu Ser Leu Ser His1 5 10 15Phe Pro Val Thr Val Ala Ala His Arg Ser Thr Gly Glu Leu Thr Leu
20 25 30Leu Val Glu Val Leu Asp Gly Ala Leu Gly Thr Met Ala Pro Glu Ser
35 40 45Leu Gly Arg Arg Val Leu Ala Val Leu G1n Arg Leu Val Ser Arg Trp
50 55 60Asp Arg Pro Leu Arg Asp Val Asp Ile Leu Leu Asp Gly Glu His Asp65 70 75 80Pro Thr Ala Pro Gly Leu Pro Asp Val Thr Thr Ser Ala Pro Ala Val
85 90 95His Thr Arg Phe Ala Glu Ile Ala Ala Ala Gln Pro Asp Ser Val Ala
100 105 110Val Ser Trp Ala Asp Gly Gln Leu Thr Tyr Arg Glu Leu Asp Ala Leu
115 120 125Ala Asp Arg Leu Ala Thr
130
(2)SEQ ID NO:290的信息:
(ⅰ)序列特征:
(A)长度:526碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:290:GCTTCGACGG CTACGAGTAC CTGTTCTGGG TGGGTTGTGC GGGCGCCTAC GACGACAAGG 60CCAAGAAGAC CACCAAGGCC GTCGCCGAGC TGTTCGCCGT CGCCGGGGTG AAATACTTGG 120TGCTGGGCGC TGGGGAAACC TGCAACGGCG ACTCGGCGCG CCGCTCCGGC AACGAGTTCC 180TCTTCCAGCA GCTGGCACAA CAGGCCGTCG AGACCCTGGA CGGTTTGTTC GAGGGTGTGG 240AGACCGTCGA CCGCAAGATC GTTGTCACCT GCCCGCACTG CTTCAACACC ATCGGCAAGG 300AATATCGGCA GCTGGGCGCC AACTACACCG TGCTGCACCA CACCCAGCTG CTCAATCGGT 360TGGTGCGCGA CAAGAGGCTG GTCCCTGTCA CTCCGGTTTC TCAGGACATC ACCTACCACG 420ACCCGTGCTA CCTGGGTCGG CACAACAAGG TCTACGAGGC ACCACGGGAG CTGATCGGTG 480CCGCGGGGGC CACCTGAGCC GAGATGCCGC GCCATGCCGA CCGCAG 526
(2)SEQ ID NO:291的信息:
(ⅰ)序列特征:
(A)长度:487碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:291:CTCGCCGCCG TGATCTGGCC GGCGAACTTC GTCAGTGCAT CCAGACCCCA ACGATCATCG 60ATCAGGCCGA TGCCCATGAT CACCGCACCG GCCACCAGCA CCGCGGGCAT GCCGGTGGAA 120TAGACGAACC CCCGGGTGAG TGCCGGAAGC TGGGAGGCAA GAAAGACGGC GCCGACAATG 180CCCAGGAACA TCGCCAACCC ACCCATCCGA GGGGTAGGCG TGACGTGCAC ATCTCGCTCC 240CGCGGGTAGG CGACGGCTCC CAGGCGACTG GCCAGCATCC GCACCGGACC GGTCGCAAAA 300TAGGTGATGA TCGCCGCGGT CAGCCCGACC AGCGCAAGCT CACGCAGCGG GACACCGGCG 360CCGCGATAGG ACAGGGCGAG CAAGCCACCG GCAACGCCGG CCACATCGCT GGACACCTCG 420AGACCGTACT GCACCAACCT GAAGAGCTGA ACACTCGCCG AACGTGCAAC AGCTGCGAAC 480AATTGGG 487
(2)SEQ ID NO:292的信息:
(ⅰ)序列特征:
(A)长度:528碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:eDNA
(ⅹⅰ)序列描述:SEQ ID NO:292:ACGAAGCGCG AGAATATGAG CCGGGGCAAC CCGGCATGTA CGAGCTTGAG TTCCCGGCGC 60CTCAGCTGTC GTCGTCCGAC GGCCGTGGTC CGGTGTTGGT GCACGCTTTG GAAGGTTTCT 120CCGACGCCGG CCATGCGATC CGGCTGGCCG CCGCCCACCT CAAGGCGGCC CTGGACACAG 180AGCTGGTCGC GTCCTTCGCG ATCGATGAAC TACTGGACTA CCGCTCGCGG CGGCCATTAA 240TGACTTTCAA GACCGATCAT TTCACCCACT CCGATGATCC TGAGCTAAGC CTGTATGCGC 300TGCGCGACAG CATCGGCACC CCATTTCTGC TGCTGGCGGG TTTGGAGCCG GACCTGAAGT 360GGGAGCGGTT CATCACCGCC GTCCGATTGC TGGCCGAGCG CCTGGGTGTA CGGCAGAACC 420ATCGGCCTGG GCACCGTCCC GATGGCCGTT CCGCACACAC GACCGATCAC GATGACCGCT 480CATTCCAACA ACCGGGAGCT ATCTCCGATT TTCAACCGTT CGATCTCC 528
(2)SEQ ID NO:293的信息:
(ⅰ)序列特征:
(A)长度:610碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:293:CCAAGCCCGT CAAGGAGCCG GTGCCGGCCT TGCCTCCGGT GCCGCCGACG CCGGCGTTGC 60CGCCGTTGCC GCCGTTGCCG CCGGTACCGG GGTTTCCTAC GGTGCCGCCG CCCGGCAGCA 120TGGCCCCGCT GTTTAGGCCG TTTTCGCCGG CCCCGCCGTC ACCGGCTTTG CCGCCATCGC 180CGCCGTTGCC GCCGCTGGTG GGGGTGGCGG CCTGGTTGAC GTATTGTTCC ACCGGCCCGG 240CCCTTGACCC TTTGGCGGTG TCGATCGCGG CGTCGATGGA TCCGCCGACC ACGACGTGCG 300AAGCCTCGCC TGCCGCCGCA GCCGCCCAAC TGTGTCGCGG CTCCTGCGAT TTGGCCCCGG 360CCGACGAGAT GATGGGCACC ACCGGAGCCT GCGGCCGTCT GGGGGAGGCC AGCGCGGGTT 420CGCGGTCACG CCATACGCGA CGGTGCGCCG CCGCTTCGGA GATTTGCAGG CTGCGTTGCA 480CCAGATCGAG CAGCGGTGTG CCCAGGGACT GGGTTAGCCC GTTGGCGCCG CCGTTGTAGC 540GGCGAGCGCA ATATCGGTGC CCACTCGACC CAACCGCGAC TCCATAAGCG ACACCATTCG 600CGGTTGATGC 610
(2)SEQ ID NO:294的信息:
(ⅰ)序列特征:
(A)长度:164氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:294:Phe Asp Gly Tyr Glu Tyr Leu Phe Trp Val Gly Cys Ala Gly Ala Tyr1 5 10 15Asp Asp Lys Ala Lys Lys Thr Thr Lys Ala Val Ala Glu Leu Phe Ala
20 25 30Val Ala Gly Val Lys Tyr Leu Val Leu Gly Ala Gly Glu Thr Cys Asn
35 40 45Gly Asp Ser Ala Arg Arg Ser Gly Asn Glu Phe Leu Phe Gln Gln Leu
50 55 60Ala Gln Gln Ala Val Glu Thr Leu Asp Gly Leu Phe Glu Gly Val Glu65 70 75 80Thr Val Asp Arg Lys Ile Val Val Thr Cys Pro His Cys Phe Asn Thr
85 90 95Ile Gly Lys Glu Tyr Arg Gln Leu Gly Ala Asn Tyr Thr Val Leu His
100 105 110His Thr Gln Leu Leu Asn Arg Leu Val Arg Asp Lys Arg Leu Val Pro
115 120 125Val Thr Pro Val Ser Gln Asp lle Thr Tyr His Asp Pro Cys Tyr Leu
130 135 140Gly Arg His Asn Lys Val Tyr Glu Ala Pro Arg Glu Leu Ile Gly Ala145 150 155 160Ala Gly Ala Thr
(2)SEQ ID NO:295的信息:
(ⅰ)序列特征:
(A)长度:161氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:295:Arg Arg Arg Asp Leu Ala Gly Glu Leu Arg Gln Cys Ile Gln Thr Pro1 5 10 15Thr Ile Ile Asp Gln Ala Asp Ala His Asp His Arg Thr Gly His Gln
20 25 30His Arg Gly His Ala Gly Gly Ile Asp Glu Pro Pro Gly Glu Cys Arg
35 40 45Lys Leu Gly Gly Lys Lys Asp Gly Ala Asp Asn Ala Gln Glu His Arg
50 55 60Gln Pro Thr His Pro Arg Gly Arg Arg Asp Val His Ile Ser Leu Pro65 70 75 80Arg Val Gly Asp Gly Ser Gln Ala Thr Gly Gln His Pro His Arg Thr
85 90 95Gly Arg Lys Ile Gly Asp Asp Arg Arg Gly Gln Pro Asp Gln Arg Lys
100 105 110Leu Thr Gln Arg Asp Thr Gly Ala Ala Ile Gly Gln Gly Glu Gln Ala
115 120 125Thr Gly Asn Ala Gly His Ile Ala Gly His Leu Glu Thr Val Leu His
130 135 140Gln Pro Glu Glu Leu Asn Thr Arg Arg Thr Cys Asn Ser Cys Glu Gln145 150 155 160Leu
(2)SEQ ID NO:296的信息:
(ⅰ)序列特征:
(A)长度:175氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:296:Glu Ala Arg Glu Tyr Glu Pro Gly Gln Pro Gly Met Tyr Glu Leu Glu1 5 10 15Phe Pro Ala Pro Gln Leu Ser Ser Ser Asp Gly Arg Gly Pro Val Leu
20 25 30Val His Ala Leu Glu Gly Phe Ser Asp Ala Gly His Ala Ile Arg Leu
35 40 45Ala Ala Ala His Leu Lys Ala Ala Leu Asp Thr Glu Leu Val Ala Ser
50 55 60Phe Ala Ile Asp Glu Leu Leu Asp Tyr Arg Ser Arg Arg Pro Leu Met65 70 75 80Thr Phe Lys Thr Asp His Phe Thr His Ser Asp Asp Pro Glu Leu Ser
85 90 95Leu Tyr Ala Leu Arg Asp Ser Ile Gly Thr Pro Phe Leu Leu Leu Ala
100 105 110Gly Leu Glu Pro Asp Leu Lys Trp Glu Arg Phe Ile Thr Ala Val Arg
115 120 125Leu Leu Ala Glu Arg Leu Gly Val Arg Gln Ash His Arg Pro Gly His
130 135 140Arg Pro Asp Gly Arg Ser Ala His Thr Thr Asp His Asp Asp Arg Ser145 150 155 160Phe Gln Gln Pro Gly Ala Ile Ser Asu Phe Gln Pro Phe Asp Leu
165 170 175
(2)SEQ ID NO:297的信息:
(ⅰ)序列特征:
(A)长度:178氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:297:Lys Pro Val Lys Glu Pro Val Pro Ala Leu Pro Pro Val Pro Pro Thr1 5 10 15Pro Ala Leu Pro Pro Leu Pro Pro Leu Pro Pro Val Pro Gly Phe Pro
20 25 30Thr Val Pro Pro Pro Gly Ser Met Ala Pro Leu Phe Arg Pro Phe Ser
35 40 45Pro Ala Pro Pro Ser Pro Ala Leu Pro Pro Ser Pro Pro Leu Pro Pro
50 55 60Leu Val Gly Val Ala Ala Trp Leu Thr Tyr Cys Ser Thr Gly Pro Ala65 70 75 80Leu Asp Pro Leu Ala Val Ser Ile Ala Ala Ser Met Asp Pro Pro Thr
85 90 95Thr Thr Cys Glu Ala Ser Pro Ala Ala Ala Ala Ala Gln Leu Cys Arg
100 105 110Gly Ser Cys Asp Leu Ala Pro Ala Asp Glu Met Met Gly Thr Thr Gly
115 120 125Ala Cys Gly Arg Leu Gly Glu Ala Ser Ala Gly Ser Arg Ser Arg His
130 135 140Thr Arg Arg Cys Ala Ala Ala Ser Glu Ile Cys Arg Leu Arg Cys Thr145 150 155 160Arg Ser Ser Ser Gly Val Pro Arg Asp Trp Val Ser Pro Leu Ala Pro
165 170 175Pro Leu
(2)SEQ ID NO:298的信息:
(ⅰ)序列特征:
(A)长度:921碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:298:AATTCGGCAC GARCAGCACC AACACCGGCT TCTTCAACTC CGGCGACGTC AATACCGGTA 60TCGGCAACAC CGGCAGCTTC AACACCGGCA GCTTCAATCC GGGCGATTCC AACACCGGGG 120ATTTCAACCC ANGCAGCTAC CACACGGGGA CTCGGAAACA CCGGCGATTT TACACCGGCS 180CCTTCATCTC CGGCAGCTAC AGCAACGGGT CTTGTGGAGT GGAAATTATC AGGGCTCATT 240GGNTGCACCC GGSCTTRCGA ATCCCTCGKG CCAATTCAAC TCCTCNACAA GCTTGCGGCC 300GCACTCSAGC CCGGGTGAAT GATTGAGTTT AACCGCTNAN CAATAACTAG CATAACCCCT 360TKGGGCCTCT AAACGGGTCT TGAAGGGTTT TTTGCTGAAA GGANGAACTA TATCCGGATA 420ACTGGCGTAN TACGAAAAGC CGCACCGATC GCCTTCCCAA CAGTTGCGCA CCKGAATGGC 480AATGGACCNC CCTKTTACCG GSCATTAACN CGGGGGTGTN GGKGTTACCC CCACGTNACC 540GCTACCTTGC CANNSSCCTN RSGCCGTCTT TCSTTTCTTC CTTCCTTCTC CCMCTTCGCC 600GGTTCCCNTC AGCTCTAAAT CGGGGNNCCC TTTMGGGTTC CAATTATTGC TTACNGSCCC 660CCACCCCAAA AAYTNATTNG GGTTAATGTC CCTTMTTGGG CNTCCCCCTA WTNANNGTTT 720TCCCCCTTNA CTTTGRSTCC CTTCYTTATW NTGAMNCTNT TTCCACYGGA AAAMNCTCCA 780CCNTTYSSGS TTTCCTTTGA WTTATMRGGR AATTSCAATY CCGCYTTKGG TTMAANTTAA 840CYTATTTCNA ATTTTCCCGM TTTTMMNATR TTNSNCKCGM KNCTCCNRKA SSGNTTTCCT 900CCCCCYTTSS GKTYCCCCRN G 921
(2)SEQ ID NO:299的信息:
(ⅰ)序列特征:
(A)长度:1082碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:299:AATTCGGCAC GAGATANGGG CGCACCGGGG TCCGCAGCCG GCGGGACCGT CGCCAGCACC 60ACCGGGGTCA ACAGCACCAC GGTGGCGTCC ANGCAGAGCG CCGCGGTGAT GGCGGCCGAG 120ACGGCRAACA CCTGCCGTAG CAGTCGGTGC GACTCCGCGC TCGCTCGANC CATGGCCGCG 180CCGGCTGCCT CGAACANGCC TTCGTCGTCC ACAGCTTAGC CAGCANCCAA ACCGCACCCA 240GAAACCCACA CGCCCGCCGC CCCGGANACC TGCGCCATCG KCTGCTGGGG CGANATCCCC 300CGATCGCTNA CANGATGACC GCTGCCGGAA CGCCGCCGCT GCCTCCGGGC AGCCGCGTGG 360GCSGGGCAAC CGCGAACCCA NGAACACGGC AAGCAGTATC ANCGCAACAG CAATTGTCAA 420GGGCTAAACG CTTCACATCC AGGGATCTCG CGGCGCCACA CCGTCGGMTC TGCAGSGCGA 480CCCCNTCCTN GGGCGGNCAC TCNTCAAAGA TGCNGATCNA CAGKCTAGGT CTTCGGCCGA 540TATGSAAGGN CCCAACGGNT TTAAAGCGGC SAAAAAASTC TCCCANTGGA TAAAATCAGC 600CGGGGANCCC CCCGTGSCMM NGTCYCGGKC ATTNTTCAAC MGGTTTNACG GCGGKTGCNG 660GCCAACTKGC CAAAMTTAAG KTNGGGGNTY CGGGGCGGTA ACCGGCNNTK NGCCCCTTAA 720AAAACCGGNC YTTTCTKGAT TAMMACCGGN CCCCCAWTGG CGGKTGKTCC CANGNTYAAC 780AMCCYCCCSS MNGGGKTGGS SAACCCTTCC CGNGGGGTTC NTKGTTSCYT AWMCCCCCGG 840AAACCSGKYG GGKTGGCRTN WASSAMNCCC CMNGYYTCTT TAAAGGCCAN KNRAAWGKYT 900CCTTGGGAAW CCTNCAATYC GAAAAYYCTC CTYMMGSSCN CTTKCWRTYN NRNGGGAACS 960AMWTNYCCNC GWTTCAWTCG GGTCCGASMN AAACKCTTTY TTTTYCGSSC STCCMGGSNC 1020SGGTKNANAN AAASATTTMC YYCNNNANKK YYYCSSGCTT CYKMGRRNRR GMGAACCCGR 1080GS 1082
(2)SEQ ID NO:300的信息:
(ⅰ)序列特征:
(A)长度:990碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:300:AATTGGCACG AGTGATCGCG CTGAAGCCGG TAGCGCGGGT GGCTCGGGTG GTTTGCGAAC 60RAAATCCGCT CGANGTGGTC TCGGTAGGCG GTGTCCANAA CGGTGGCGCG GTGCCGGCGG 120ATCTGATCGG CGCGGCCGTA GTGCACGTCG GCGGGCGTGT GCAGTCCGAT GCCGGAATGC 180TTGTGTTCGT GGTTGTACCA GCCGAAGAAC CGGTCGCAGT GCACCCGGGC CGCCTCGATC 240GACTCGAACC GTTTCGGGAA ATCGGGCCGG TACTTGAAGG TCTYGAACTG GGCCTCAGAC 300AACGGGTTGT CTTGCTGGTG TGCGGGCGTG AGTGCGACTT GGTGACACCG AAGTCGGCCA 360NCANCAATGC CACCGGTTTG GAACTCATCC ACAACCCCCG TCCGCGTCMA GGTCACTTGT 420NCGGCGCTAA TTTNYTGGGC GGCAAGGGTT TGCCGAYCAN KCCGCTCGGC CAAAACTTCG 480ANTCNCSCCA AGGCCNCCAT CCNCCCAAAC AMGTTACGGG ANAAAANATY CAAAGAYCAC 540CYTCCGGKTN TTATANCTYC CCYTTTGSTY GGGCCCCCCN CYYTGKKNAT ACCCCTNCCA 600AWTCCCAACN CCCKCCAANA RCYKGGGGCC CCCNCCAACC CGGGKGAAKA WTAATTTAAA 660CCCYAACMAW ACTWMMNACC CNNGGGSCCY AAMCGTYYNR AGGTTTTSCT NAAAGAAASA 720ANTCGGAAMC CGGNTSTACC AAAAASCCCK CCNWTCCCTC CRASATTGSC NCCSAAWKSA 780AKGCCCCCNY TCSGCNWNNC CSGCGGKKKT KKGTTNCCCT WMRCWMWYTS GGCCNASCCN 840CKYYSSMYCC CCCCTCCCCM CTCCGNKTCC CCAMCCYANC MGGCCCCYTM GKKCCCWKNT 900YKGCCCCCCC AMMNNNGGGG WGACCCTNGG CCCCMKRRGM TCCCNANTGA MCCTCWGNRA 960MKCYCCNRAR ANMCCSCNCC NGCNCRCKNN 990
(2)SEQ ID NO:301的信息:
(ⅰ)序列特征:
(A)长度:223碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:301:AATTCGGGTG GCAACGCGGG CCTGTTCGGC AACGGCGGCG CCGGTGGTGC CGGTGGGGCT 60GGTGGTGGCG CCGGCGGCGC GGGCGGTAAC GCGGGGTGGT TTGGTCATGG GGGCGCTGGC 120GGCGTGGGTG GTGTANGTGC GGCCGGGGCC AACGGTGCTA CGCCCGGTCA GGATGGGGCG 180GCTGGTGTTG CCGGGTCGGA CRACRCTCGT GCCGCTCGTG CCG 223
(2)SEQ ID NO:302的信息:
(ⅰ)序列特征:
(A)长度:418碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:302:AATTCGGCAC GANGCGGCAA CGGTGGCAGC GGCGGCACGT CNGTTGCCAC CGGGGGGGCC 60GGGAACGGCG GTGCCGGCGG CGCCGGCGGC GGGGCCGGGC TGATCGGCAA CGGCSGCAAC 120GGCGGCAGTG GCGGAATGGG CGATGCCCCG GGCGGCACCG GCGTCNGCGG CATCRGTGGG 180CTGTTGTTGG GTTTGGACRG CGCCAACGCC CCGGCCAGCA CCAACCCGCT GCACACCGCG 240CAGCACAGGC GTTGGCCGCA GTCAACGCGC CCATCCAGGC CGTGACCGGG CGCCCCTGAT 300CGGCAACGCG CCAACGGCGC CCCGGGCAAC GGGGCCCCCG GCRGGCACGG CGGGTGGTTG 360TTCGGCGGCG GAAGGAACGG CGGGTCCGGC GTCANCRGCG GGGCGGGCGG AAATGCCG 418
(2)SEQ ID NO:303的信息:
(ⅰ)序列特征:
(A)长度:1049碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:303:AATTCGGCAC GAGGGGCACG ATCGCATACA GCGCTCGCGG CAGACCCGCC CGATACAGCA 60GCTCGGCACA CGCGAGCGCA CAATACGGCG TCTGGCTGTC CGGCTTGARC ACCACCGCGT 120TACCGGCCAC CAGCGCGGGC ACCGAGTCCG ACACCGTAAG CGTCATGGGG TAGTTCCACG 180GCGAGATCAC CCCCACCACG CCCTTCGGTT GATAGCACAC CGTGGTCTTG CCTATCCCGG 240GCAGCAGCGG CTGTGCCTTA CGGGGCTTCA GCAGGTCCAC ACAGACTCGT GCSTTATAAT 300TNCGCSTTCC GCGATCAGAT CGACAATTTC CTCTTGCGCC GCCCATCGGG CCTTGCCCGC 360CTCGGCTTGC AGGAAGTCCA TGAAGAACTC GCGGTTCTCG ATNAACAGGT CGCGATAGCG 420GCSGATGACT GCAGCTCGCT CGATNACGGG ACCTTCGCCA GTCGGTCTGC GCCGCGCGAN 480CTTCCGCGAA TGCCGCTTCG ACTTCCGCGG NCGTGCCAAC GGAATCNTAT CACGGGTTGC 540CGGTTAAAAC TCCTCAATST NCYGGTCGAA ATTCGGCAAC TTCTTATCCC GGCAGGTRCC 600AACSANNCAA ACCTCGGCAA GGTTAGGMTT TCCCCCNCTT YCAAAAATNC GGKTTTTGGN 660CMAATTTCGC CKCNATGKTG MCAAGGMTCT CKAANAAKCS GGGTCYTCTN NTCNGKGGAK 720CCAAAMGGKT TTGGGGMAGC GKNMNCCAAN CCTWACCCTG KTKAANGGNW TTCCCCCCGG 780GGGAKKGNGA ATYCYCCSNA NCCCRGGGGG GNMCARATTC TYCCGGMCTC CTCKGGAWTC 840WGMGSTTTCC CAAAAAACSC CCCAAATTMM TTTTTCCRCN TRTTGANACW CTTTTKARCA 900MMCSSAARNS ANMCNCTCYC CKCTKTGKTK AAAAAGNAYW CCCCMAAATT TYTAWTTSSC 960CCSCGCGGGN CCCNCTNTTT TSCNMTWCTM WNYTNCRMCC MMMSNCKSNG KKGGNRCCNN 1020CRCCSNCCCM AAWYNTKGYN KNTATMAGC 1049
(2)SEQ ID NO:304的信息:
(ⅰ)序列特征:
(A)长度:1036碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:304:AATTCGGCAC GAGGGAATCG AGAATCCCGG AATGGTGAAG CCTCGGTGCC TGCCGTTACG 60CCAAGAKTCA GGGTGAGCGG CCCCCCGGTG GGAATGCTGA SGCCAACCGG GAAAAGGGTG 120AGGGCTGGGG TGGAATAACT GAANGTTACT GGGATGGAAA ACCCGGTATT GATATGTATT 180GGGCCGATCA ANGTTGTGGG AATGGGGGAA GGCTGAGGGC GACCTGTTGG ATTTGGGGAA 240TTGTYRTGGA CRAKACWGGC CAGCCMGCGT GATGGTTTGG TTSAANTTTT GTGCCGSCCA 300CANGGTGATG GGATTGATTT TGATGGGGCC SATCGAAATA TTGGGTATGC CNACGCCSAA 360CGAGATYGCC GGGACGTTCA TGGGCGGGAC AACCMASGGT CCSANGTAAK GGTTTCCTTN 420ATNTTGATCG GGATTCCGGA ACTMTSTCGA TGSGCTCSAY MTSATSGCCC NACNCCWCCG 480YTTATTTCMS GCTNAYGGGA ATBAMRGGAA CAAYNTCCCT CCCMGGAAAA ACCAACMSGC 540CCTGGTNSYC CNCCCRCCNC AKAACCCRTT KCTGTRSTMC CCSMAAATNA CSCCCSCTTS 600NACTCCNCSG AANTNSCCCC CCCSCKNNTT ATSTYCCCGK GTTCCCCCMC CCCTTNAAMC 660TCCCCGGTTA ACCCCCWTNT SNCNCCCCCS YTAAKMNCRG GCTTSTTNCT CCCCCYTRMK 720CNCCCCCTCK SAMCWNCCNC CTCKAACNAC CCCKCYKGSM TNCCCAATNT WCMWCKCCNS 780KTTNTMCTKC CCAAYTNCRC CCNCRCTCCC CCKSTSTCAM WTATAAAACC WCWYAWYNNK 840KCNCWMAWTA MGACWCTCNY NCCCCNCNCK NTTKTAMWCC CKMCCCKCSW TWCYCKCSCC 900CCMTCTMNAC YCCCCCKKTY NKWMCCCTTC CCCCCCTCCC MCNMBMKTCT YCSGKTWCWC 960NCYNTTMTCN CYNANMCKCK KTCTCTTCCN CRNTCTCCCC CCWCCCCCCV KKCTCTSKCC 1020CNCNCTCCSC MMKGSC 1036
(2)SEQ ID NO:305的信息:
(ⅰ)序列特征:
(A)长度:1036碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:305:AATTCGGCAC GAGATCATGA ATAGCGGGCT GGTCAGCACC GAAGTGGTCG GCGATCTCGC 60GAGCAAGTCT CGTCTGCTCG CCCAGCAGGA GGTCGGCATC GATGCGGACA CCTGCGATGT 120CTTGGATGGT GTTCAGTTGC AGGTAAGGCC GACGCCGCAG CTTTGCTAGC AGGGTGTCTT 180GGCTCTTCGC ACGTGAGGTA ACCAATAACT CCGACGCAGA CCAACTCCGG CCCTCGATCC 240GGGTACCAGG CTCCGCCGGA GCCAGCCGTT GTGCCCCCTG GGCCGAAGGT CAGCTGCTGT 300GCGATCGAAG TAAGAAACCG CGCCATGCCC GTCGCCAAGT ACGACTGACC GAGCAAACGA 360ACGATCGTCG TCCTTTCCGT GGGGGTAATC GANCCCAGCA ACCGCACGAG CCACCAATCA 420TTGGGATTCG GCCACTGACC GACCAACCGC CTGTGCGACA CCCCAGCGGA ATTGGTGGTC 480TTCCGCGGGG CCGCNAACGG AATCANCGSG ACGCGCTCGC CGAASCANCC GCATANCCNT 540ACATANCAAC GGNNTCTGCG CCCACATTTC GGGSTTMTGC CCCTCNGCAA CSSNAAYNCC 600CCCAATTCYG AACNAAAAAA TTGGYCCATY ARNGTYCTCM CCAAAAACCN AWTCCCCKTA 660TCCCCCGGGG GGGRCCCCYY NMNAAAACGG CCCWWAANCC CCSGGGCSCC CGGGTTRWTN 720CCCCTTGTCG GCCCNCCSGG TTTGGTCMCM GGSCMMTNWN GGGNTGCSCC CCCNCNAAAA 780AAAAAYCKNG NCAAATYAAA CCCKYCMAAA ASKTGGGSSC CCCMARCCGG GGKAAKKWWA 840ANTTAANCCN KAAAAAAAWW NCANNMCCCC NGGGNCCTAA GGKYTTAGGG GTTSTTNANG 900ARAAAATMTC CANATMNSSK TTNNAAAAAA ASCCSWAKCC CCCNNNKKNN CCAAWKAARR 960SRCCTTCGGG TNWNSGGGGG KKKKKTNCMS KMNMMTTWGR CCCNCCGCCN NNTWKCCTTN 1020TCCNYGGNGC RNCAGN 1036
(2)SEQ ID NO:306的信息:
(ⅰ)序列特征:
(A)长度:1060碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:306:AATTCGGCAC GAGTCGATTC GATCGAACAC GCCCGCACCT GGCCAGGCCA CATGGGCGCG 60GCCATGGCCA ACGCCTACTC GGCCAACCCG AATCCATTCG GCGTCTCACC GCAACCCCCG 120AAACCGGCGA CCGCGGCATG GATCAACCCG CCCACCCCAG ATCCGAAATA GCGTCCACAT 180AATGAGACAC TGGCGCAAAG AGCTTGACAG GCGCCGCACC ACGCAAGCTG TTAGACGTGT 240CGGTCTTGCA AGAAGCGGGT TGGCCACCCA AGATCACGCC GCCCAAGGGC ATCGAGTCAA 300CGTTGCGGTG GTATCGCGCT AACGTCGGCG CCGCCAAGAA ATGACGGTGC GCATTACCAT 360GGCCCTGCTG ATCACCTTTG GCCACCTGCG CACCANAACT ATGANCAGCC TTATGCCGAG 420TCTCGTGGAC ATCGGCAGCC GCTTCAAAAA CTCCTTGTCG ACAATSGTAT TGCTGANCCG 480CCGAATTCTT NTRCTTGCAA SAACACTNCA TGTTNCSGGT NAACAACCYT GGTTNGAAAA 540ACANCCAATA TTGAANTCCC ANTCGGGCAM GAACCNGTTM CGGAAGKTGK TGGGAACGAA 600TGKTGCCCAA AAATCCCGGG NGGTRAAAWW CCCNSNATGG MSAATTTTSC CTNGAACAAM 660AAAAGGTCCA AGKYCAAAGG NGCCCCCCCC SGNAAATTGG TGAACSCAKA WYANRTTCCC 720WWWTNCAAAT MTTNGGGTCC KNNTCCCCWT AAANGGGSCN CCCCNCCRGG GMGTYTCCCC 780NWNMGGGMGN CYYCSCCCCA AAAAAAAMMM MTTTCSGKGG SMGGKKCCCC CCSGGTYWGG 840GKKYTTAAAC CCGGKGGGTN CAAAAAANAN ACCCCCCAMS NGGGGGGAAA ATTTGNAAWT 900AAGGKKKTKC SCMACCCCAA AAANMMNNCN AWNCCCGMGK SARGGGGRNY TTMKAGGGMG 960GNYCCCCCCW YCGGGGGGNA NAAYAAAAGK NGSNGRGAAT NTTNTTTTGK RSSSRNKTTT 1020TYNTCCTYCN CCNMGNRWWG SRAMNTGKTS NSSGGGSGGC 1060
(2)SEQ ID NO:307的信息:
(ⅰ)序列特征:
(A)长度:1040碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:307:AATTCGGCAC GAGCTTCACC AAAGAGCTGA CATGCCGGGT GATGCGACAT CGCATCGAGG 60GCAATACGGG CATGGATGAN CCGAANGGAN TCTGGCGTTC GCTCAACTGG ATTACGGTTC 120CCAAGGTGAA ACGCTTTGCG GCGAAAGATG CGACGCTTAA CTTGCGCTTC CACCGTGCAA 180TGTTNGTATG GATGCTGGAA CCGCGCTGAC NGATAANGAA TTCGCTGGTC GCCGGGCACN 240ATGGATGGTC CKSTTTTCNC TCCGCSGTTA AATTGCSTGT GCATCATCTG GCAGGCTATG 300TTCCCGCTAC RCTGCAGCCC ATCATGGATG TGCGGCTAAC GAANAAGTTA TGACATGGCG 360CAAGCGAMTC GGGCATSCNC GCGGCAMTTT CGCAACCTGC TGTGTNTGAA GCGTMTCAAC 420CGAATGCGGC GCTYAAAAGC NGGCTTGCGT TGATTMMAAC CNAACCCNTN CNATYCTTTG 480CCGNGNMNTG CGTTCTCTCC AACTCCGKKG SYTGCCNCCG TGAAACCCMA CTNCCCCCCC 540GTTGGACTTA MRTNTTCAAA AAMCGGMTNA ACCSGAATNN SAACCTNCCR TCAAANTAMM 600SAANTCGGGC TTYGGGNRCC CCCCNGAAYW TTCKNCNGGG GMNNTYCTCN GGTTYNGGCG 660SAAACNTTTG CCRTNCYMNN TTTACAMGGC NCMTNMTTGM GGGSCSNNAS GWCCCGGGKK 720TNTTTNCAAW TCNCNSKTTT TTKGGGGGGG GGCYGRTRMC NCGGGCCCCC GGCCCKKMAA 780AAAAAMCMSA RRCCNCYGGG KKCCCCCCCM NNATNGGGCG YKCRAAACAA ACCCCAANRA 840TNGNGMGGGC SMACCSGNGN GYNAAAKGGT TSNSCTMANM MKGMANNNCT SGMSCCMNSN 900NCTGMGGGKT TTKGNNGARN AANAMKMGGM RCGGNCGCNN GAAAGGGSMS GSCKSCNNGN 960NGASNGWMGN CRNNGANRCC NCNGYGNMRN NNGNNNGNNN GGGRKNNACN NMKMCAWSMC 1020NSNMMGNNNS CGYMTNKCGC 1040
(2)SEQ ID NO:308的信息:
(ⅰ)序列特征:
(A)长度:348碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:308:AATTCGGCAC GAGACAANGG CGTGAAATGG GATCCGGCCG AGCTGGGGCC CGTCGTCAGC 60GACCTGTTGG CCAAGTCGCG GCCGCCGGTT CCGGTCTATG GGGCCTAGTT ATCTGCGCCG 120AGCGTGAACT CAGGGCGAGA TTTCGGCCGT TTTCTCGCCC TGGCTTCACG TTCGGCGAAG 180TKGGGAACGG TCAGGGTTCG CAAACCACGA TCGGGATCGT GCGGTCGGTC CAGGACTGGT 240ANTCCTGATA CTTKGGTACA TCGTGACCAA CTGTGGNCAA TATTCGGCGC GCTCCTCGTC 300NGTCGCGTCC CGCGCGGTAA GGTCCANCAC TTCCTTTTTC TCGTGCCG 348
(2)SEQ ID NO:309的信息:
(ⅰ)序列特征:
(A)长度:332碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:309:AATTCGGCAC GAGAGACCGG GTCGTTGACC AACGGACGCT TGGGCGCGGG CCCCTTGCGT 60GGCATCAGCC CTTCTCCTTC TTAGCGCCGT AACGGCTGCG TGCCTGTTTG CGGTTCTTGA 120CACCCTGCGT ATCCAGCGAA CCGCGGATGA TCTTGTAGCG CACACCAGGC AGGTCCTTCA 180CCCGGCCGCC GCGCACCAGC ACCATCGAGT GCTCCTGCAG GTTGTGGCCC TCGCCGGGAA 240TGTACGCCGT GACCTCGAAC TGACTCGTCA CTTCACGCGG GCAACCTTCC GAAGCGCCGA 300GTTCGGCTTC TTCGGAGTGG TGGCTCGTGC CG 332
(2)SEQ ID NO:310的信息:
(ⅰ)序列特征:
(A)长度:962碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:310:AATTCGGCAC RAGTCGGTCT AGACGGATTC AATGCTCCCG CGAGCACCTC GCCACTGCAC 60ACCCTGCAGC AAAATGTGCT CAATGTGGTG AACGAGCCCT TCCAGACGCT CACCGGCCGC 120CCGCTGATCG GCAACGGCGC CAACGGGACT CCTGGAACCG GGGCTGACGC GGGGCCGGCG 180GGTGGCTGTT CGGCAACGGC GGCAACGGCG GGTCCGGGGC GAACGGAACC AACGGCGGGG 240ACGTGGGGAC GCGCCCGGCG GGATTTCTTC GCACCGGSGC ACCGGCGGGG CCGGCGGCGT 300CGCACAACGG CACCGGCGGG GACGCNGCGC CCGTNGGGCG GCTTCTKGAT GGGCTCCGGC 360GGTNACGCGG CACGGCGGCG CCCGGCTCAC CGCCNGTTGG GACGCGGGGA CGCGTNACCC 420CGATCTTCTT CCGCNCCCCG GAAACCGCGG GGCCGGCCCC ACATTAKACC CGGCGGNACC 480GCGGMCCCGG CGGAACGGNG GGYNTTTTCC AACGGCGGGG CCGCGGAACC GNMGGSTGTT 540CCTTNGGSGA AGGNCCAAKT CCCGKCTANC YYAATCCCCG ANGGKTGAMC CTSATGSNCA 600MYTTMAGGAA CYTNCCCANT KTTSGRACCW CRCCNGGAAA ASRAWNKNGT KGGCAAACNA 660NNTNCYTTKN NATTKGGNNA AAAANCCCTY CCWCSGRACT NCCCCCCNGM GRGMCNNTNN 720NTTTYGNCNN CCCGGSNAAM RNTTKATTTC NGGGGGNTCN GGGTKMNNNA AACCCCAAAM 780MNRNNKCSCA ANGGGKSNGC NKNNMMNSGT TTTYCKNMRA MRNWTYKNKN NTCNGARSRN 840NAAMCNNSNK NGKKKNNKAA ARNNTTWKTN KNSCNNNCNN GRRNGVRGGC CKMKGSNMNG 900MCWHNAWRNG NNGSNCNCKC NNKMNAAAAA AASGGVNCKS NSMKNKKKKG NRGGGGGGGG 960GG 962
(2)SEQ ID NO:311的信息:
(ⅰ)序列特征:
(A)长度:323碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:311:AATTCGGCAC RAGAAGACGC CCGAANGTTT GCGCTGGCTC TACAACTTCA TCAARGCGCA 60GGGGGAACGC AACTTCGGCA AGATCTACGT TCGCTTCCCC GAAGCGGTCT CGATGCGCCA 120GTACCTCGGC GCACCGCACG GCGAGCTGAC CCAGGATCCG GCCGCGAAAC GGCTTGCGTT 180GCAGAAGATG TCGTTCGAGG TGGCCTGGAG GATTTTGCAN GCGACGCCNG TGACCGCGAC 240GGGTTTKGTG TCCGCACTGC TGCTCACCAC CCGCGGCACC GCGTTGACCT CGACCAGCTG 300CACCACTCGT GCCGCTCGTG CCG 323
(2)SEQ ID NO:312的信息:
(ⅰ)序列特征:
(A)长度:1034碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:312:AATTCGCAGT GTGTGTGGCG GCGTCCAGAA GAAGATGATC GCGAACATCG CCAGCGCCGG 60CCAGGCTATG GTGCCGGTGA TGGCCGACCA GCCGATCATC ACCGGCATAC AGCCGGCCGC 120CCCACCCCAC ACCACGTTCT GTGACGTGCG TCGCTTGAGC CAAAGCGTGT AGACRAACAC 180ATAAAACGCG ACGGTGACCA GGGCCAGCAC CCCCGCCAGC AGGTTCGTGG CGCACCATAG 240CCAGAAGAAC GAGATCACCG TCNACGTCAC CCGAGTGCCA ACGCGTTTCG GGTCGGCACC 300GCTTCCCGCG CCAAGGGCCG GCGCGCGGTT CGCTTCATCA CCTTGTCGAT ATCGGCGTCG 360GCNACCAGTT GAGCGTGTTG GCGCCGGCGG CSGCCATCAT CCCGCCGACN ANCGTGTTGA 420GCATGANCAG CGGATGAATG GCGCCGCGGC TCGTGCCGCT CGTGCCGAAT TCAACTCCGT 480CNACAACTTG CGGNCGCACT CGAACCCGGG TGAATGAWTG AATTTAAACC GSTSAACANT 540AACTACATAA CCCTTGGGGG CTCTTAACCG GTYYTGAANG GGTTTTTTGC TTAAAGGAAG 600AACYATTTCC GGATANCTGG CSTTNWTARC GAAAAGGCCC CRCCCATNGC CCTCCACAGT 660TTSCCCCTGA ATGGSAATGG MNCNCCYKNR CNGGGNCTTT AACRCSGGCG GGNTTTTGKT 720MCCCNNCTKA CNTTMMMTGC ARNNCNGGCC SKCCCTTCCK TNTYCCCTCC NTCCCCCNST 780TNCNGKTCCC CNNAMNYTNW ACGGGGGGCC YTNGGGKCRM TWTKKTTTGG GCCCCMCCCC 840MAAANASAAN GGGGKRNGTY CSTTTGGCNC CCCAMAARGG NYCCCCCCAM YTNRRKMCSY 900CNNTNKGGNN CTGTNCKNCG GAARAMAMCC KCCCCGNSTS STTNGTYWAG GNRWKGNSRG 960CCSCCCCGGY MNNNAAYAWN WMNATNCNNS STNANMAKKN NNNNNNNSCN WNGNGNNTCN 1020SCNSNGGKBC CSCC 1034
(2)SEQ ID NO:313的信息:
(ⅰ)序列特征:
(A)长度:331碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:313:AATTCGGCAC GAGCCCACAT CCGGGGCCGC TCGTTGCATG ACTCGTTCGT CATCGTCGAC 60RAGGCACAGT CGCTGGAGCG CAATGTGTTG CTGACCGTGC TGTCCCGGTT GGGGACCGGT 120TCCCGGGTGG TGTTGACCCA CGACATCGCC CAGCGCGACA ACCTGCGGGT CGGCCGCCAC 180GACGGGTCGC CGCGGTGATC GAGAAGCTCA AAGGTCATCC GTTGTTCGCC CACATCACCT 240TGCTGCGCAG TGAGCGCTCG CCGATCGCCG CGCTGGTCAC GAGATGCTCG ANGAGATCAC 300CGGGCCGCGC TGAGTGCGCC TCCCGCGAGC A 331
(2)SEQ ID NO:314的信息:
(ⅰ)序列特征:
(A)长度:1026碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:314:AATTCGGCAC GAGATCGTCA CCCTGGCGAC CAGTGCACCC AGGCCACGCC ACCAGTTACG 60GCTGATGGGC CAGAAGATGG ACCAGGTGCT GCCCATCCCG CCCACCGCAC TGCAGCTGAG 120CACCGGGATC GCGGTCCTCA GCTACGGCGA TRAGCTGGTG TTCGGCATCA CCGCTGACTA 180TGACGCCGCG TCCGAAATGC AGCAGCTGGT CAACGGTATC GAACTGGGTG TGGCGCGTCT 240GGTGGCGCTC ANCGACAATT CCGTGCTGCT GTTTACAAGG ATCGGCSTAA GCGTTCATCC 300CGCGCACTCC CCANCGCCGC GCGGCSGGGG CGGCCCTCTG TGCCGACCGC CCGAGCGCGT 360CACTGACGCC ATCTCCGTCG GCGTTAACCC CGTGAGAAGG TGGGTCGTGC GCAAGTTGGG 420CCCGGTCACC ATCNATCCGC GCCGCCATGA CGCNGTGCTG TTCCACACCA CNTSNGACNC 480CCCCCAGGAA CTGGTCCGGC AMTNCAGGAA NTYCGTGTGG GCACCNGCTT CTTCCGKTRT 540GGCYTAAACT TCCNATSTTN CSGCSGGCCT CTGGCGTTNC GNCCGGGCCG NTCTTNCCAA 600ATCGGSMMAA ATCCCCANMC AAACCCCCCG GGTCTTGSGG GCSGGGNGGC GGCCNAWNCC 660AAACCCCCCC NTTAAANTCT TTGKTNCCNN CNCSGGCNCC NCNAANSCAN CCCTTTKGGC 720NCTTCCCCCC CCCAWTTTAA CCGAKCGSCN AAYCCCAAGY TMMGKCCYCY KNAAAAAAAA 780AATTTGSCSG CCCCAANTAA ATTCCCNGGC CCYTTGGGGG CGRANCNYNT TTTMCCSNSS 840TKGNNNAAMC NGGANCCSGG KAAYTMMTKG NAAYCGCCSN AAMBNTTTTC TAANNCCCCN 900YNCCCSGAAA ATTNNAMAAM CMNNKTGSNG GGGGKTTSNC SGKKGRAGGM AAAAAANRSN 960SKTTNMCNNN SANMNCNSNN SGGNSNNNNN NNNCNCGYKC CSNAANMCCC CGCGGGGGGG 1020CCMMCC 1026
(2)SEQ ID NO:315的信息:
(ⅰ)序列特征:
(A)长度:324碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:315:AATTCGGCAC GAGAAGACGC CCGARNGTST GCGCTGGCTC TACAACTTCA TCAARGCGCA 60NGGGGAACGC AACTTCGGCA AGATCTACGT TCGCTTCCCC GAAGCGGTCT CGATGCGCCA 120GTACCTCGGC GCACCGCACG GCGAGCTGAC CCAGGATCCG GCCGCGAAAC GGCTTGCGTT 180GCAGAAGATG TCGTTCGAGG TGGCCTGGAN GATTTTGCAN GCGACGCCNG TNACCGCGAC 240GGGTTTKGTG TCCGCACTGC TGCTCACCAC CCGCSGCACC GCGTTGACGC TCGACCAGCT 300GCACCACTCG TGCCGCTCGT GCCG 324
(2)SEQ ID NO:316的信息:
(ⅰ)序列特征:
(A)长度:1010碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:316:AATTCGGCAC GANGCGTGCC GCTNAACACC AGCCCGCGGC TGCCAGATAT CCCGGACTCG 60GTAGTGCCGC CGGTGGCGTC GTTGCTCTCC TGACGGGGCG CGGCGACCAT AAGGTCGCTM 120ATGCCCAGGT AGCGGCCCAG GTGCATGGAG TCGATGATGA TGCGACTCTC CAGCTCGCCG 180ACCGGGAGCT TGGCATCGGG CCTGATCAGC CAGGACGCGT AGGACAAGTC GATCGAATGC 240ATAGTGGCCT CCAGAGTGGC CGTGCAMTTC CNGCGTGCTC CACGGCAAAT GCCTTGATTT 300CTACTCCGCG TANTGTTCCC GCATCGCCTG CGGGATGAAT GGGAACCGCA SGATGGCGAC 360GAACGGGTCT GANCTCAGGT TTGCCGCTTT GCGCACAGTG GTCNACANCC GGTACTCGGC 420ATANATCTGG CCCNAAATCG GCGCCGACGG CGCCCACNAT AANAACGGGC ACNACAATCG 480CCGCCCCGGT CACCCNAACA ACANCTTGSC ATCGGATTTT GTCCCCANCG CTCAANCCGT 540CCCGAACGCC TCNTCCGGCG NACTTTTCTT NNAWTAACTG CCGCTTCCGK CCCTGGNGCA 600WTAAATGGGA AACCCTTNCC CCACCTTGAA GGGGTTGTTG NATTTTTACT GSTAACCCCG 660AATTNTTCCG GANTCGGTCN KCCGGGSTTT YSTNTTCCCC ACCTTNGNAN GGGCCGGCCA 720AGSTTTTCTT SYTGAAGGGG GAAACCCAAC TTTNTYTYYN AACCSCMNAA MYMTTTYCSG 780MNAASCCNKT CCCCTTTAAC CAMGGSGGTN AACCGKTMNG NGGKTAAAAA GGGSKNNKTG 840NCCCCYMANG GGGGGRAAAA TSTKTCNNCG GGGCCKAAAW ACCMMMMYGN GTGKKKNKSS 900GCSAAATTTT NMMRAACTKN GGGGCCSSGA NNTTTNAAAG MSCCCCCSNN GSTGKCCCNN 960NTTTCCNNAA WMKKGKNWNM SNMNSCSNGG GKYNSGGSNN NNAAGMGGGG 1010
(2)SEQ ID NO:317的信息:
(ⅰ)序列特征:
(A)长度:1010碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:317:AATTCGGCAC GANGCGTGCC GCTNAACACC AGCCCGCGGC TGCCAGATAT CCCGGACTCG 60GTAGTGCCGC CGGTGGCGTC GTTGCTCTCC TGACGGGGCG CGGCGACCAT AAGGTCGCTM 120ATGCCCAGGT AGCGGCCCAG GTGCATGGAG TCGATGATGA TGCGACTCTC CAGCTCGCCG 180ACCGGGAGCT TGGCATCGGG CCTGATCAGC CAGGACGCGT AGGACAAGTC GATCGAATGC 240ATAGTGGCCT CCAGAGTGGC CGTGCAMTTC CNGCGTGCTC CACGGCAAAT GCCTTGATTT 300CTACTCCGCG TANTGTTCCC GCATCGCCTG CGGGATGAAT GGGAACCGCA SGATGGCGAC 360GAACGGGTCT GANCTCAGGT TTGCCGCTTT GCGCACAGTG GTCNACANCC GGTACTCGGC 420ATANATCTGG CCCNAAATCG GCGCCGACGG CGCCCACNAT AANAACGGGC ACNACAATCG 480CCGCCCCGGT CACCCNAACA ACANCTTGSC ATCGGATTTT GTCCCCANCG CTCAANCCGT 540CCCGAACGCC TCNTCCGGCG NACTTTTCTT NNAWTAACTG CCGCTTCCGK CCCTGGNGCA 600WTAAATGGGA AACCCTTNCC CCACCTTGAA GGGGTTGTTG NATTTTTACT GSTAACCCCG 660AATTNTTCCG GANTCGGTCN KCCGGGSTTT YSTNTTCCCC ACCTTNGNAN GGGCCGGCCA 720AGSTTTTCTT SYTGAAGGGG GAAACCCAAC TTTNTYTYYN AACCSCMNAA MYMTTTYCSG 780MNAASCCNKT CCCCTTTAAC CAMGGSGGTN AACCGKTMNG NGGKTAAAAA GGGSKNNKTG 840NCCCCYMANG GGGGGRAAAA TSTKTCNNCG GGGCCKAAAW ACCMMMMYGN GTGKKKNKSS 900GCSAAATTTT NMMRAACTKN GGGGCCSSGA NNTTTNAAAG MSCCCCCSNN GSTGKCCCNN 960NTTTCCNNAA WMKKGKNWNM SNMNSCSNGG GKYNSGGSNN NNAAGMGGGG 1010
(2)SEQ ID NO:318的信息:
(ⅰ)序列特征:
(A)长度:1092碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:318:NGNGGGGWNS NTCAYCAYCA YCACSGGGYW CWATTGCGGC CGCAWCTTGT MAASAGATCT 60CGAAYTCGGC AMGAGGGAMT CKCTMGCNCC GCTGTGCAAN CCAATRAGGC CTRATAATTY 120CCACTCCACA AAAAACCGTT GTGTGTAYYT SCCGRAAATR AAGGCGCCGG TNTCAACWYC 180GCCGGTKTTY CCRATYCCCG TKTTGTAMCT GCCKGGGTSR AAAYCCCCGG TGTTGGAYCC 240CCGGATTGAA ACTGCCGGKT TGAAACTGCC GKTTTSGCSA TCCGGKWATT GAMSTCRCGG 300ATTAAAAAAC CGGKKTTGGN GCTGSNCGTG CCAAATNCGR AYCCRATAYC CCATGGCCTG 360KYCTYCTCCK YCGGTACCCA AAYCTGGGTA TCCTATACTG GYCCCTAAAK GCAAWYCKGG 420GCTGYCMMTK TTGCKGGSGT CCNAATTTAS CACCASCGGT TCCTTCCATA CCNAAACNCG 480CKTGGGCWCC AGMCCGRAAA AAAKAATAAT RAKAAKGGTG CATNYCCAAA ACCNCCGCCN 540CCCNANTNCN ATCCGNTNCC MSCNCCCCCA GCGGTNAAGK TKSGGAAYTT CTMMAACCCC 600CAAANCCCCA TAACNTNCGR GAASAAACCC CTYCNCGGGG GYCNWNCAAA ACASCNTTAT 660TTGCTKSTTT CGGGMWCCGT GCCGCCNAAA YCCCAAASTA CTTTYTGGGT CCNAGAKAAA 720ACCNCGGGCN CCMCCCSNAA NWTATYTCTT KGGCAANCCC CSAAACCTTR TCMNACCNCK 780ATRMTCCCTT CCCCVSCAAT TGGYCGGRAT NCGSNCCYTY TCAAAKKKSC CAKWWNNGNG 840GRRNNACCMA ACCCCAAGTY CCMNAAAATN GKCCCCGCTC CNAACACGNK TYYTCCSAAA 900ASCCCWCCCC CCCCCCCRAA AACCCCCCNA RKANTNCCCA AAAACNYNGK GGCCCCCCCC 960CAAACMAAAA AMCCCCCSGM RMACSGGGGN NMCCCCGKKK KKTTTTCTTT TKCCMRSCCC 1020AAMGCAMWSY KSKTNMAAAA GGAAGRANCN TYCCSANANM TCCCNYWRSW CCGSWGMGNA 1080GAASMCCCCC CS 1092
(2)SEQ ID NO:319的信息:
(ⅰ)序列特征:
(A)长度:1251碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:319:GGGGGGGNNN NATACATCWT CYGTGYACCG GGGMTCTAKT GGCGGGCCGC AATCTNGTCA 60ASAGATCTCT NAMTTCGGGC ACAAAAACTW GACAAASYMT CGNGCNMTCC GTGTCCTNKA 120TCGCAAAACG NGTRACASAC ASACACRTAT GTGTGCCCAC CASCAAYTCK TTGGGACCTC 180GCTRACCGGY TGCCCRNACG CCACGYTGCS CWTCTATCCC RACGCCGGCC ACGGGYGGGG 240ATATTCCAGG CACCACGCCC AGTTTGGTGG ACAATGCCCT GGCAKTTTCC TCRAANTTCG 300TGAAACCGAA TTCNSMTTGA ACCNCCAARG CCCCSNCCNR AACARTTGGG WTCCGCGGTT 360CTCCCCACCG KTTTCCGGGG GTNTCGGCAN AANCGCACCC WTGGWTTCTM TCNCCGCACC 420GGGCGGACAA NTCGGGTTGC AATTTTGCRA AYCGGGGCCG GGATTCCSCA AACGGGTGCC 480GAAACTGTTY YCRAAMACCG GGAKCCGCAA TTTCCGGGCR ANAAATTTCN YCNCACCACT 540GCTTRTACTT CCCCGACCGT AACMANTTTC ATCGTCNTNN CCTCTGCCCT TGGGGCAGGG 600CKAAAYACCG CMTTKGGTTT CGCAACCTGC GGCCCAANTC CCNAMCCRCA CTTTCNATTT 660GGNTCGAATT SCCCCCCGGT RANAACCSCC NTGGCCNNYT CGGASSAAAA NGGGCCCTNT 720KGGCNSCCCC AGTAANACCC TACCNNAYTS CAWTCTTTGC CAAASTTKGG ACGAANSKTG 780GGNTTCCGGK ATTTYYTTGS GGNCNCCCTN TATNGGSNTN GGGCCKCYNC NCSTKTGKCA 840NASSKAYCCS NGNKGGGGGT ACCCCCCTMG GGGGGTTTTT NSSGCCCCCC AWAYGNKSTG 900GCCCCCNNGG GGAAKAATWT MWWMCNSGGG GGGAAWTTTT NTSTGGAMCS SGGACYCCCR 960GGGGGKTTTT TCCCCCNCSA NNAWANGGGG GGGGGANAYT NTGNSGNGGG KWNTTTATTT 1020YTYYCYCCTM TKACMSGGGG GTTTKKAKNG GGGGGAGAAA ANAAAAAAAA RAKGGYKNTT 1080TSKNCACNCT GKWNWNWANR NAGAGKTCCT CKCKCCNCSG SNTTTCTTTT MGNSGSYGGG 1140GNNGNNNAAA ACNKSRMMAC KCSYTYCCCG CGYCTCCTCC NCNGGGGYGS NGSCGNSTYN 1200GNNKGRKWTA TNTMGNCGTN SCCTCCNCCC GCKNKNTGTC TMTCNMYGSG C 1251
(2)SEQ ID NO:320的信息:
(ⅰ)序列特征:
(A)长度:1099碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:320:AAYTCGGCAC MGAGTATCAC CAAKCTGYGT GGCCCAGCAA AGTGGAGCTA TTACTACCTG 60TATGTGATCC TCRACATCTY CTCCCGCTAC KTGGTCGGGT GGATGGTGGC CTCGCKTGAK 120TCRAAGGTCT TGGCCRAACG GCTGATCGCG CAAACCCTTG CGCCCAGCAC ATCAKCGCCG 180AACAGCTGAC CTGCMCGCCG ACCGGGGGYC GNCAATAACT CCAAACCGGT GGCMCTGCTG 240CTGGCCNACY CCGTGTCCCA ANTCGAACTC ASCCSGCNMA CCAKMAACKA NAACCGTTGT 300CTGAAGCCCA GTTCAAAAAC CTCAAGTWCC GGCCCRACTT CCCGAAACGG TNCGAGTCKA 360TCRSAGGSGG CCGGGTGCMC TGCAACCGGT TCTTCGGNTG GTRCAMCCCN AAAMCAAGCA 420TTCCGGGMTC CGMMTGCCCA CGCCGCCAAS TTTMCTACGG GCSGSCCNAT CAAATTCGCC 480GGGAACSGSN CCMCCKTCNK GGAMACGCCC TWCCAAAACC CYCGAACGGK ATCCTTCKGY 540NAACNCCCGA RCNCCCKSKT TCCGGGCTTC NMSGCGAATA CCCKNSCMNT CCGAATCCAA 600TTCCCMKYGG CTTTTYYYCC CCCCGGCCCC AAAYNGGGYC CCTASSNMKC KNCCAMNANT 660CCNWATCTGG NGGTCCCNAN KYYGGCGTTC NMAATSAMNA NMNRGGGTYT TSCYACCMMN 720AACCGKNNKG KCCCCMKCTK MANAAAKATT RATCAMKWNG GGNKCKCNCN NAAMACCSCN 780CNCYNCWYTC TMYCSSKWGC GCSMYNANCA SNGGGGAGGW GGSGRMKMCT CTMTCTCNCT 840MGCGCCKNTN TYCKSGAKAT ACASMNKTCC GCGCNGCGCN MAAMANRAKA CTAKCCGYGN 900CCSNSTMTYN CTSNNMKMNN TCCWMWNATC NTYYGKKCNN KCTMKATNWC CSCTSKCNCK 960MRAMTCKTYG SNMTCCTCCA TCNCTCKKSC SNMSKNTCKC KSCNCCNCWN CNKCNMKCWN 1020GGNSTCRCCY TCTMNNNTCS AGCKCGSKNC WACNCACACK NGWCTYTTCC WKNNMKCNKM 1080TCKCKCACRG MTMTCWCCS 1099
(2)SEQ ID NO:321的信息:
(ⅰ)序列特征:
(A)长度:296碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:321:GNGNTATACA TCWCTGTGYA CCSAGGATCW ANTGCGGCCG MAAKCTWSTM CASAGATCTC 60AAAYTCTGCA MGAGCGGCAC AKAKYSTCGT CCMRACCCGG CAYACWCCWG CNCGCCCCWT 120CTTRGACCGG GGCKATASMC ACCGTTGGCC CCGGCNCGCA CCTACACCAC CCACGCCGCC 180AGCGCCCCCW TRAMCAAACC ACCCCGCKTT TACCGCCCGC GCCGCCGGGG CCACCACCAG 240CCCCACCGGC ACCACCGGCG CCGCCGTTGC CAAAACAGGC CCGCKTTTGC CACCRA 296
(2)SEQ ID NO:322的信息:
(ⅰ)序列特征:
(A)长度:1073碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:322:NGNGSGNKMY ATCATCWTTC TGCACCSNGG MTCWATTGCG GCCGCAATCT TSTMNASAGA 60TCTCGAAYTC GGCAMGARCA TCTGCGCGGN GAATGTCCAA AWGTCWKTAA CGGCMATCGG 120TTTGCCGYCA ACCACKCTRT SCAKATGCGG GCCAMWTYCA AACCRATTAT TTGGGYCGAG 180AAAATTTMCG CKTGTRASCA ACCTGCAGCG GGTCAASCAA CAGCCTCTRA ACCGTAAATY 240CKTAGGTNKT YCCGGCAACA ASCYCRATAA TSCGGCCCGC AMCCACAAAA CCTGANTNGT 300TNTTCNCRAA NCCGGTYCCC GRAGGGGTSA ACTGCSGTAR GCTTNTCWYC NCCTTRACAT 360TAAACCCCCC CGGNTCWTCG CCGCGCCCAA ATYCYTGCCC WTKGCNACCA YCCCANCCTG 420CSGTATGGTS RAANCASTSG GCRAACGGTM MCCSTACCKC TGGCTGATYC KTCGGNTCCS 480SNAATTCGGG GATTTACGGS CAMGGTTAAY CCAGGYCCCC TNTGCYTCKY CNACAACCSG 540ATCMWCNCCG TACCTKTTAA AATTCTTTGT GGTGGAACCC AWYCKAAAAA NMTNTYCCCN 600TCCAMMGGGG CYCGGAAKKT CNACNTGGKT NACCCCTNCC YTTGAASTTT TCYTGNCCCC 660GGCCCKAAAS ANACCSGAKC CCCGGAAYCS WTAGGCYTCN TGCCCCSTTA AATTKGNCYC 720AATCCKCCAA CGCTCCCCGG GGTCSSCCMT TAAAMTTCCC CCCKSCASNG GAATYCYKSG 780GCWGTMATTW CCNCCCNTTT CYYGKNAAAC SCCCCCWKGN GSCTYCCCCN SNTTSSGCCS 840GGTTSGAMYC AAAAWTNGGG MMCNRAGNCG SGNAMCCSCN GKKGGGSATW TKAAYYCYGG 900GGGGGTCNYC CCCCRCSNAA AAGYGTKGGC KCCSSSCCYC CCMARTTTYT CNGGMRCMAM 960ACCANGGGNG CTCCCGTNCW WGGCTCCCSN SNSMAMAAAN NKCKCCKGGS CKGARRNMNA 1020MCTCSNGNGG WTCCCKNKTC NSCNSGNCGS YGGNSASWCC YNYCNCCACA ANC 1073
(2)SEQ ID NO:323的信息:
(ⅰ)序列特征:
(A)长度:1166碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:323:CGCCCCGTTC TTMMMTTCAY TCATTCACCG GGMTCTAGTG CGGCCGCAAK CTTGTCKACA 60GATCTCGAAY TCGGCAMGAS ACAATSTCGG GTKGGGCAAT GTCNGGTGGG GCAACTTTGG 120GCTCGGRAAT YCGGGGTTAA CGCCGGGTCT RATGGGTSTG GGTAATATCG GGTTTGGTAA 180TGCCGGCAGC TACAATTTCG GTTTGGCAAA ATATGGGTGT GGGCAATATN GGGTYCGCTA 240ACACCGSCAS TGGRAATTYC GGTATTSGGT NACCGGTRAY AAYCTGACCG GGTNCGGTGG 300TTYCAATACC GGTAACGGGA ATGTSGGTTS YYYACYCCGS GSAACGGNWW YTTNGKTCCT 360TMMCNCTSSM CCKSAAMTSM KMGGTSTYCT MTYCNNGGAS TAMTYNMCCC CCGWAYCKSC 420WAYCCCTCGT CATYCCMCMC SGSGYCCTCA MNCCACCYTG NGYYCCCTCC MKMTCYCAYT 480CMNTCCGGTW CCTNTMMNCC CSCNCRYCTC AMCNCTKSGK CACCNATMYC CSACKCHTCT 540MCYMCSCAKN MTTCCCCTCN CCTYTNNCCA MCMCSCTCTM TCMAACTCKC CCGGYCKCNC 600MYCTCTCKCC AYNMAACCKK TYCYWCNWYC YMYCKCKCAG WYKNMCTCCW ACTCTMYNTT 660TCTCTCNKCC CMKACCKNTT CTCWCSCCCC CCACAKAYMC YAWCMTMTCC MCTCKACSCC 720CYYCNNYCCM NMCWCMTCWC TWNAKCANCN TTCTTCTCTC MMYMTMACKC WCNNTCNCCK 780SGACCYTCTC ACTKMKCCKM TCTCCTTMCK CCYMWCNTCC MKYNCCCTCC NMTCMTCKYT 840CCTCNCNMRY CYYYAKCAKC NMCTCCCCAN KMCAKCTKCT CCCCCAKMKS ACNCKCCCWC 900CCTCCTATCC WCTCTCWCTY ATCTCKCTCW CNYCMYMKMC ACNCKCYAYT CNACTMNMWN 960CCANCNCTCT CTNYCTCWCK ACGTYCKCCK CTMCKCNYMC NRWCTYRCCT CKKCCNCCRN 1020CKNMCMKCTM CTCTCCWMKM TCCCWCCCAT CTMMKSTCTC WCNCMTCCCT CNKCCYNYNT 1080KCYTYCCMYG CTTCKNTCMT MCCWCCYATC TCTMKCCTCT CWCACYMCAC WMTTACWNCC 1140ACTCTCTRCW CKCCKCMCCR MTCTCB 1166
(2)SEQ ID NO:324的信息:
(ⅰ)序列特征:
(A)长度:1230碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:324:NGNGGNNNNT CWTACATCWN TCTNCACCSG NGMTCWATTG CGCGCCGCAW NCTTGTMNAS 60AGAATCTCNN AAYTCGGCAC ANATGTCTTT TSTMTAKTGT GGCGGGGNGC CACGCCKTAT 120GTGYGCCTGG GYTRACCCAA CCCCGCGGCS CGGGCCRACC AGGCGGGGRA TSCAGGCCGC 180GGCGGCCGCG GCGGYTATAT RAAGCGCCGY TTTTKTRATA ACGGTSCCGC CGCCGGGTRA 240TTACGGGCAA AAYCGGKKTT TTGGGTRTAT AACGCTAATT GCAACCAWTT TTTYCGGGTC 300AAAAACYCGG CGWGCANATC NCGGGYCNCT RAGGCGCATT YMCGCCAAAA WTNTGGGCGC 360AAAACCCCKT TSYTATTTTN TGGGCTATSC GGYTGCTTCG GCAAACGCTY CCCGGGTTAA 420TCCCKTCCGC GGCGCCGCCN AAAAACCACC AATYCCGYTG GGGGTGKYCC CMCAGGCSGT 480TGCTYCGNGY CACCTGGCCA AAYYCCCAWT AKATTGGGTG SCYCKTSCGG TTSYTGGGCY 540CAATTACCCC CNCGGGNAAA GRRAAAANAA ATCNTCCNTT TGCTCGGYCA YCTTTMTTGG 600SAAAAGGGGC ATGGCSCGGT TYYTTTACCT CAAYCCCCNA NCANTWACCT YTCCSCCCGG 660GGGGNCANAA CGSTTNGCTC CGSGGNAKCC TKGTMCCCGN ATCNAAAGGC CNGAATTTGG 720TYYSSTYCNA ATTWTWKKKY CCCCWCNTTG YAAAAAKCCA AAASAKCCCK YCNCAMMYKT 780NGGGGTYSSG GCCKNYCTTK SNMTTAAACC CYCCCCAAAA YYNSGGGKKT TCCGCYNSAT 840KCCACCNCCK GNGGGGGGNA SAAAAAAAAY TTTYCCSAAA ATCCCACCYY TCYKTKSTRY 900AMACCCCCTT TYYMKKAYTC CKYSCNATTC SGMTTCWAAA TYCCGYGGCT TNTTCCCCCK 960CSGGNGCCCC AAWTTTGKTT YNCNANTTYC CCCNAAMNCM AWTMGGGGKS KCCATTCTGG 1020SCYTMAANTA AAANAANGGG NKTTTYYCTY MANAAACACN GTGKCNCNCN CNAAMAAASN 1080AKMAAAKAGN KKKMTKNNSA AANCCNCCCC CTSTYTNYTT NKTNMNCKCC CYGGKKNKGM 1140SWSWYNTTCT NCCCRCCCCC YNYNKTGANA AAMMNCYCCS GGSTMCRNAN ASNMNTTTCK 1200STSTNGMGCC KMBASNANAN MCAMWKWYCC 1230
(2)SEQ ID NO:325的信息:
(ⅰ)序列特征:
(A)长度:1022碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:325:NGNGGGKNNA TMAYCWTCTC ACSSGGTCTA TGCGGCGCAW CTMGTMAASA GATCTCNAAY 60TCGGCAMNAN GCATMTCMMC CATATATAAC CATTGCGTCS GYWTGCAWCT CRAAWCTGTC 120CTTCSKGCCG TTKTACRAAG GTGGMWTGYT CWTYCCTRAA SCCCTCRATC TCKTKTATYC 180CTKGGGCTYC ACTTTAACSG RATKSCTGCC TTKTAYCATT RATGCAAWTA WTGGYCRAWT 240KTTGCAGGCC RACGGCWYCT TTTYCCGCRA GRACAATNGA TTGGAWYCGC TYCGCRAGGC 300CCGGCACCAR ACCGGGCNCC AAAGGYCCGC GCAAWTSCCT GGKTCAAAAA TGGTGCAAAC 360AAAMCNATCC CCGGYTTRAC CGCAGYTAMC ACAAKAAAAT TCCCWTGGCC GCACCAWNNT 420TTYCRATCWY CWYCCCCACC TTRAACTTGK YTGCSGTATT GCCTKCCTGC CTCRACAGCM 480YCNCCCKTCA AACCTGCGGT GACTCCAACT GGTCTGGYCG AASGGGGGYT CAMCGGACAA 540AACCCCRANN TCGCCAAATT TTCNCCCCCC CYCGGGAAAN GKTGATMTTC TCSNAACCSA 600CMGGGNNYTW NAACCCTGAA CSSSGSNKGA MYNSCCSGGA ANTTTTCCCT TYNGGGCGRN 660AAANCCTTTT AAGGTACCCC KGGNGGGGKG CCCYYTTGGG AAAACAACCC CKATTGGKTT 720TGGAAATNTT TKCNCCCCCA TTCNSGGGGG GGGCCCCAMC CCMMCTTTTN TCMSCNMTYY 780YCYYGGGAAT TNYTCGCCSG GAAYYCGGSM CCKGYCCTAA NCCCCMNWGG GKYSTGSNAR 840GGRATMAWWT TYSTTTYYMC CCGGCNNCCC CCCKAKMCNT KGNTGAACMA AAAKCSGGGG 900GSCNMYMWYY YCNNNGNRTT TNRGGSSNMT TYMAAAMMAN GGGGKYWTYY CKCCNGSCNN 960GKTYSGGGST TTTCCNTTTS GGGSSATYKG MACCCCKTMT AYCCGGGGGT NTKTKYCCCC 1020SC 1022
(2)SEQ ID NO:326的信息:
(ⅰ)序列特征:
(A)长度:1083碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:326:NNCGNNKNTA TAMAYCWYCT NCACCSGGGA TCWATTGCGG CCGCAATCTT STMAASAGAT 60CTCKAAYTCG GCAMGANCCG CAWCTATTTG KGTGRASCGC ACCAGCGRGA CCTCGCSGKT 120CKTTYCTTGC AGRGAGGCCK TGGGTGGCRC CGGTGGCAAT GCCAACCGCC CCCCAAAACN 180CCGCAAATMY CRAAAAACAA CCCSGGGGTA GKTCCSGGCC GCCAAATMAA TAACCGTKTT 240AACKCAGGCN ACGGCCAACC GGYCCCGCCC AACCAAGCNA CCTCCCCSCC NATAGGYCCG 300GTGGGGGCTG CCKTATYKCC AASTCGTCAY CTCNACGGGM CGGYCCMCWT TCCGCCTCAT 360CCGTCTCTCC TTMMATTTTC CRTCCACYKG GCGGGGAACY TTTTTNYCNC CCTTGSCMAN 420CACCNAAGGY CNAAAATTNC CCMTGCCKYG SNNCAAAYGR GATTGGGGTY CGKKTTTTNT 480TCNMCCMAAC CCCCNTTTNA CGCCCCMATC CCYTWATACC CCCWWMCMNS ANGKTTGNSA 540AAKTNNCCCC AAATRCCAAA MTTCTTCGCC NTTTMTWMCY YYCCTTTCCC CMCCCWNAAA 600GGSCCRCCYY TCGGGAANTY TCCCCNCAAA AWTCAMWCCM TTTCCCNCCA AGAAWTTCSG 660SACTCCTTTN TTCNGGGNAM ATANATYYTT YCKTNGGGSK TTCCGMTCNC AMMAATNTCC 720RGGGKAAMCC AGKNTNNTCC YYYYCCCCAA NNTYCCYKGG RMCYNNYYCY TTAAANRASR 780SAACCCKSGG GKCYNCNCSS TARCCCCCAM KAAAATTTCC CCCSSKTTTC TYYNNKKMRW 840GCCCCCSAAM ACTMTWAYTT TCCCKCGNNN TTTSYCCKCS KCAMWMWMTG KKNCTTTTTT 900YCSCMATAMA CTTNGGKCCT NTCNYGSGCG CMAAANAAGG CGCGSTTCTN TTCWMAMACA 960YNTSGNMMMA SAAKAKWATA AWNNTRKKYK TKNNCCCNCC CKCKCTTSNN TNKCCMCSKS 1020GGGKNWNKKR GWCTCCWCNC CKCCCNCKNK CCKWATMCCC CCCCSKCCGM NCMMNTTTKT 1080CCC 1083
(2)SEQ ID NO:327的信息:
(ⅰ)序列特征:
(A)长度:1069碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:327:GGGGNNKYAT MCAYCWTCTS YACSGGGMNC TATTGCGGCC GCAWYTNGTM GASAGATCTC 60GAAYTCGGCA MGAAAAAAGW GATGTGCTGG ACCTTMCCGC GCGGGACGCR ACCRACAAAG 120RAASCGCGCC ANAATATTGG CCACAKTTGG TCACATATTT ACCCAATTMT AYCAGGGAYT 180MCCATTCCKG GGACCRACCG CACAATCCCR ATSKTGGTTT GCRAACCCTR ACCGTCCCCA 240MYTYCGCCRA STTGAACCAG GGCRAAAAAA CGGCCRAAWY CTCGCCCTGA NTCCCGCTCS 300GCGCNAATAA CTAGGCCCAT TKAACGGAAC CGGNGGCCSC NANTTGGCCA ACAGGTCCTR 360ACAAAGGGGC CCCASYYCGG CCGGWTCCCW TTYCACNCCC TNKTCTCKTG CCGAATYCGG 420WTCCRATNYC CCWTGGGCCT TKTCKYCKYC KYCGGTNCCA AWTCTNGGTA TNCTATRGKG 480TCCCCTAAAT SCANATCTGG GCKYCCATTT NCTGGSNTTC NATTTAMMAN SRRCGGTTCT 540TTCWTTCCRA AACCGSNTGG GCCCNNMCCA AAAAATGATN ATAATAATGK YGSCTTTCAA 600ACCCCGCCCC CCCATTCRWT CSGTTCCANC CCCCNGNGGT TAAGKTGGGA ATTTYTNAMC 660YCNARGCCCT NATTTSGGNA AAAACCYCYC GGGYCTCAAA CMNYTTTTTT GSKSSNTCGG 720GCTCRTTCSC CAAAACCCAA ATTNTYNYGG GGYCCKTNAA ACMCGGYCRC RCCGGAAATT 780TTTYTGGTTC AACCCCAACC TTTTCAASCC NTTTTYTYYT TRCCSSCSMN TNGSSGGGNT 840KSSCCNTTCY RARKKCCNMN GGGGGWYCYN CCCCRMNTTT CTTTTTTTTT CCGTNNMAAM 900NGKTTCTTCA AASMCCCCCC SCCCCCNSAA ACCCCCTNAR GTTTTYCMMA AANNWYNNGN 960KNCCCCCCCC MMNAAAAAAY YCSCCCGNRN ACSMSNGGGA MCCCCCGGSN NTTRKTTTTT 1020TNCMSGYCCC CSRMASYYTT TKAMAMANRR GAMNSMTTTY TNNRGNWNK 1069
(2)SEQ ID NO:328的信息:
(ⅰ)序列特征:
(A)长度:1210碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:328:NGNGGGGKWK MATACATCWT TCTTCACGSG GGATCWATTG CGGGCCGCAW TCTNGTMCAA 60SAGATCTCGA TYTCGGGCAM NACCCACCWC TCCRAAAAAA ACCCRAAWCT CGGGSKCTYC 120GARAAGTGTT GCCCGCKTTR AATTTAACAA ATTCAGTGTC ANAGTGTCAC GGCKTTACWT 180YCCCGGCAAA GGGGCCACAA CCTGCAGRGA SCACYCRATG GKTGYTGKTS CNCGGGCGGG 240CCGGKTNAAG GGACCTGCCT GGGTKTGCSC TMCAAANATC WYCCGCGGGT YCGCTGGRAT 300MCNCAGGGGT GTCAAAAAAC CGCAAACAGG CACSCCANCC NTTTACGGGS CTTAAAANGA 360AAAAGGGCTG ATGCCCCCAA GGGGGCCCGC NCCCAACCTT CCGTTGGTCA ACAACCCGGT 420CTCTCKTGCC RAATCCGRWT CCRATNYCNC CWTGGCCTTK TCKYCTYCTY CGGTACCCAA 480ATCTGGGTAT CCTATASTGT CCCCTAAWTT CCAAATCTGG GCTGTCCATT TSCTTGGCNT 540TCCAAATTTA CCANCAACGG TTTCTTNCAT NCCAAAAACC GNTKGGCKCC NRACCCRAAA 600AAATGAATAA TAATAANNGG KCNNTTYCNA ACCNCCCCCC CCCNATTCCA TYSNGTTCCA 660NMNCCCCCAG NGGKTAGGTK GGGAAANYYC TCMACCYYCA ANCCCTWARS TTTTNGRAAT 720KAAACCCTYC YCNGGGTCWW TYMAAAAAMA NTTATTTGGN NGNTTTCGGG MWNCKRKNST 780SCCAAAATCC MAAATANTTT YYTGGTYCNA TWAAAAAMCG YGNCCMNCCC GGAAAAWTTT 840TTNTGKTTSA ACCCCAAAAC YTTTTCMNAA NCSSKTTTTY CYTTCCCCCC AMNWTGGGYS 900GGGNATKGYG SCYTNTCTTA TKTKYTYMTW CMGGGGGGNN MKMTCMMCCC CCMTTTYYCY 960NYWRTTTTTN KCCCCKTNMR NNRAANNGGN YTCSYNANAA AAGCNCCCCC SCCKNCCCNA 1020AAAAWCCCCN NNNARAKTNT TTMKANNRMN SCKCNKNGKY YCCCCCCCWC YNMNNAAAAA 1080AATMYCCNCC RASANMCASM NMGGRGNRSC CCCCCCCSTT NNNNTMTTNT TTTTTTCSRA 1140GAGCKCCSCG MNNANMKNCK CTTTTTKCNC NNGNNGNGNN GGNGMNCKCC CCNAGAAMWK 1200CTKSTCCCKS 1210
(2)SEQ ID NO:329的信息:
(ⅰ)序列特征:
(A)长度:1105碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:329:NGSSSNGNNA TMCATCWYCT GYACSGGGMT CWATTGCGGC CGCAACTNGT MAASAGATCT 60CGAAYTCGGC AAKANACACC ACCGCCGTGT MTATACACCG CAAATGTTCT GTKTGCCAAA 120ACCGAGACGC GCCGGCCGCG GGGYTCCAAC GCKTTACYTR ACCCGCCAGY TCAGTGTTRA 180AACCGGTGYT RAGGGCCGCA CCCAACWTAA ACGCTTTAKC CAAGRAWYTG GKTGGCCCGC 240AGCCACCTGY TGTGGYTGCC CTCWYCGGTG GTAGCGCCGG TTANCGCCGG TTGCGCGYTC 300AMCASCSCGC CGGTRATCCC AKCNWTCCCC CGGCCMRACC CACCGGGCAC TTTGRACGGT 360GCCGCCAATT CAAAYCKYCT GRWTCCTTCM AAACACCACR AAGGCCACCM CCMSCACCNA 420ATMGGGRACT TTAAGGCCCA GGCAAAACCT NTRAKCNCCT CCCGGGCRAA GGTCCSGCAA 480SCRATCCMAA AAAAKCKNAT TTCCCCCAGC AKCAACCCAA MMCGSTTTGC TGCTTCCGGA 540TTCGAAMCCA ATTMCWGGKT NCNWGGGAAA AACASCNNCC NWTAKCCMGG CCCMCGGGCA 600ATTTCSGRAA SAACCCCTNY CCCGGGTTTT YCCTGCTCMG GCCCAANACC CCCGGGAATC 660AAAAASGGTC GGNCAAANGG GCMAAACCCS SACCCMACTT WTTCCRCTTN GGGGGGSCWN 720CCKNGTTTAA AWKSCCTCYY CTSCCCAAAY TCGGKCMAAA NNGRKTTGGK TTNGGCNACC 780NTTTCCGGKC CCGGGKGKGK WGKYCTMNMA CSTTTNTTTT SCCCCYKAAA NYSCCCCCCC 840CGGSSCCCCG CCCGGGGGGA NNTTTTTAMA GKKTYCCCCT CCCCAMAAAA ANACCCCNYC 900CCSGGSCCCT TTKRWAAAMN KCTSCCCCNG GNNGGGGKCM GGKTTATTMT NNNCCSCCCC 960TCCGCGSAAA AAATAKMTTT SYCCCCCCNC CTCCKNCKNR GKAMSMSCGC TCCCYCTCNC 1020GCNKNTWAAN ARSNCCKKNN CCNCYKCCGS NSNGKCNWCD NCCSTSSNCT NKGCNCKNCN 1080KAAANAAYNC NGSMSTSSMN CNKCC 1105
(2)SEQ ID NO:330的信息:
(ⅰ)序列特征:
(A)长度:936碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:330:NGSNSNKNNN TAMAYCWYYC TSCACSNGGA ACWANTGCGG CCRMAWCTNS TMKASAGATC 60TMGAAYTCGG CAAGAGCGGC AAGAGTGTGT GCATCTGGTC ANAGTSTMMA CRCGGTGCCG 120CSGGTGKGTR GASCACMCAT NTGCGRACAC CAAACCCKTC GCGGGYCACC GGCKTCGCCT 180GCAAAWYCCT CCAGGCCACC TCRAACAAYW YCTYCTGCAA CGCARGCCGT TYCGCGGCCG 240RATCCTGGKT CASYYCGCCK TGCGGTGCCC AAGKTACTGG CSCAYCAAAA CCGCTCCGGG 300RAACRAACKT AAWTYTGCCG AATTTCNTTC CCCTGCGCCT TGATAAATTT NTNAAGCCAC 360CGCAAMCCTY CGGGCKTCTC CTCKTGCCRA ATYCGRWTCC RATAYCGCCA TGGCCTNKTC 420KYCTYCKYCS GTACCCAAAT CTTGGGTATC CTATANTKYC CCWAAANRCA AWTCTGGGCK 480KTCCATKTSC TGGSKTCCRA ATTTAMMACA NCGGTTTCTT TCWTACCAAA AACCSNTGGG 540CCCCRACCRA AAAAKGATAA TAATAAKGTG CWWWCAAAAC CCCGCCCCCC RRTTCAAYCG 600GTCCARCACC CCANGNGGTN AGGTNGGAAT TYTMAACCCC CAGCCCATAA SNTTNSGNAA 660AAACCCCCCN GGGYMYCAAA AMMCTTTTTG GGGMTTCSGS CCATKGYKCC AAAACCAAAA 720TMTTTCYGGT CRWAAAAACC GGCCCNCCCG NAAATTTTTT GKCAACCCCA AACCTTTMAM 780CCNNNTTCYY YCCCNSACAA TNGGSGGNKN NGSSCNTTYT TWTTTYYNNA GGGGGGRRWC 840SNCCCCNAAN YYCCNAANKG NKCCCGSNMA AAAGAGANTT YCMKAAAAAC CCCCNCNCCC 900NAAAYACCCC MAAAKWTTCM AAASMSCNNG YCCCCC 936
(2)SEQ ID NO:331的信息:
(ⅰ)序列特征:
(A)长度:1042碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:331:NNNGNKNNNY ATMMAYTCWY YCTSCACCSG GGNNWCWATT GCGGCCRMAW KCTTGTMAAS 60AGATCTMNAA YTCGGCACAG ASSSGCACAG ASCCGCGGCG CTATYCMYCC GYTGCTCATG 120CTCAACACGC TCKTCGGCGW GRATAATGGC NCGCCGCCGG CGCCAACACG YTCAAYTGCT 180TCGCCAACGC CATATNTCAA CAAGGTRATA AAASCAAAAC CGCSCGCCGY GCCCTTGGGC 240SCGGRAASCG GTGCCAACCC RAAACNCKTT GGGCACYCGG KTSRACTTTA AASGGTAATC 300TCKTCCTCCT GGGCTATGGT GCGCCACAAA CCTSYTGGCG WGGGTCTGGC CCTGGGYCAC 360CGYCRCNTTT TATNTNTCCK YCTACACNCT TKGGTYCAAC CAACCCACTT CACMAAATTG 420TTTTGGGKTG GGGSSGCCGG YTGTNNCCGK TAATAATCSG NTGKTCSGCC MYCACCGGWA 480CCATANCCTG GCCGGCSCTG GCAAATTTCC SAAATCATYT CCTTCTGRAC CCCCACAMRC 540CTNSAAATCC GRATCAATNC CCCNKGGCTT NTCYCTCTCN GTRCCCAATY TGGTTTCTAT 600RKTNCCCYAA TSCAATTGGS TTYCCRTTSC YGSTTCCAAN TTNACAAMAS GGTTTYTCMT 660ACCAAAACCC NTGGSCCNNA CMNAAAAKNA RAAAANAKGG KCTTTYAAAC CCCCCCCTAT 720TCAWYCGGTN CMRNWCCCCG NGKAAGGKGN GAAAYTTHRA CCCAANCCMT ARSTTSGNAK 780AAACCCYYCG GGGTSMCAAA MKNTWTTSSC CTTCGGMCTT YCCAAATMSA AAATYYTCKK 840KRMNAAAAMC YGNCCCCSAA ANATTTTTGT NAAMCCCKMA YYTRTTWMCC WTTTTCCYCC 900CCMCNNSNSG GNTNCCCTTY TYATTTCYMM MCRNNSGACN CCCCMNTYTT TWTTCKCWCN 960MMARGSNNYT RGRMMNMNCC CCNCCCCNAK MTCCNCAAAK NTTTNAACNN NNKYCKCCCC 1020CCCMWMNKNC CCCCMNCMTT TM 1042
(2)SEQ ID NO:332的信息:
(ⅰ)序列特征:
(A)长度:1073碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:332:NNSGSGMKKK ATAMATCWCT CTSYACCSNG GMTCWATTGC GGCCGMAWTC TNGTMAASAG 60ATCTCGAAYT CGGCAAANAK ACGCMAYGTC AAGTGTRAYY CGGTCACATA TCMTCGCGNG 120TCAACMCCAA AGCCGNGTCA CCGYCTCCCT GGGGCGCCAC CCCCATCGGT RATGCAACYT 180CGCGCGCCAC CGYCAAAAGG KTCWTTRAGG CGCTAAAGGT CAMCAATTCC TRAGGTYMCN 240CACCGTTNTT TGGCCCGCCC RAWTYCTRAC CCGCAATWTC GGTAATCGGR AATTTGGGCW 300YCGGCTTGGG CAATAAGKTN TTGGGCAACG GCGGRWTCYC NCTGGCCGRA ATTCCCNCAT 360TCCKTTAACG GKTGRACCGT TTYCCCGGYT GCCGTAAYTG YTYCNTGGGC GCCYTCGGCC 420CRNAGCASYY CRCTAACGGY CMCCAGGCAA TACCKTTGGC TTTRAACCAC CGGRATNAAY 480TGKTACCCAC YTCAASSGTS CTGRANTTRK TNTCNTGRAA AANMCCACCN AACCCGGNTT 540RATCTGCTTC MTCANCWTTT SCCGGGTTCT GCCGTTTTGR AAYCTTNATC CMTYCAAAAG 600GTTTAMTTTC CCAANRAATT CGGYTTGCCA CCTTGGCCGS GGCTGGTTTM CGMWCCTTRR 660AMATCCNCCS GCGGGSAAAN AMTTSGGNTT SGSCCGGTCC CCCGNAATAT YCNTGGNCCT 720GNAAATTGSS GGGATCCCCN GSGNAYCCGG CCWTKGGGGK TNCCCAGTTG GWACAATTYC 780WKCCGTTCCA AACCCGGGNC CGGGGGGTGG GSCCCNTTTT CCTMYNNAAA AAGKGTTTGN 840NYYTTTTCCG CNRAANTTCA CCSKCNKTNT GGNCCNAACY YYYCAANTTC CANACCTTTA 900AASAAANCYK YGKTYYCCCC TTTTMCCSGS SANCCCCCCM NMSSKNCGGG AAAAAAAGNK 960TYNGCCTTAN CNSNKTKTTT TNKTYCCCCC NMWNNSNMCY NCBKKCNKRY NGNSNMNCCT 1020MKYSKCNNNN SNNNNNKCGN GSNCSGMKYM CMNNCNGMYK NGNKSNNCCC MSC 1073
(2)SEQ ID NO:333的信息:
(ⅰ)序列特征:
(A)长度:1061碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:333:GNSNGNKNTN TMCAYCWYCT SCACSGGGTC TATTGCGGCC GCAATYTNGT CKASAGATCT 60CGATYTCGGC AMNANAARTG TCGTCGTCAA TTTCAGKKTG GTCKTCAAAY GGGCCAGGCC 120GNGACCRACA CCCTGNGTCA CCCAAAANAC CAACAGCWTC AAATWTCAAG GCCRAGGCSC 180TRTCAATYCC CRASCAKTTA ACCGTKTCCW TCRAAGGTGC CRAACCAGGC ACCCAGYTCA 240CCGCCSGGCA AWTCGCGCTG CCGGCCGGTN TCAGCCTGAT TYCTGACCCT RWTCTGTSGG 300TGGYCAMCNT GGTGAAGGCC CWWCCGCCNA AGAACTGGAG GGCRAATTCC CAGGANCCNA 360GRAACCCNAG GAACCCGCGG TAKAANCCGG CRAAACCRAG GCCGYTGGCN ATTCCNATTA 420NAMSGGTTTG CRACNTGGCC RAACCGTTTY CTTGGTCGGC CTCGGCAACC CTGGACCANT 480TACCCCKTNC CCGGNMCMAC CYCGGGTNCT TGKYCCCAAT NTGCYCCCGC GNRANTNGGC 540CNAATTCCAG GGCNCCANCT TTCCGGCCCN AATTCCCYTG GTTAATCACC GGGCNCNCCT 600GGTTTTGGGC AACCCCNCYS CTTMTTTAAA CATTCCGSCC CAAATGGGNC STTGGSAAAT 660TCTNTYCGGT GGGGCSGGCR ANMYTTCTCT YCCCNAASAN CTTAMYCCAN TTCGSSNTCC 720CGGKCAAAWS NGGGGGGGNA AAGGGCCCCC CGGNTSCKCC GGGGKKGCCC CYGGKTTCAA 780AANTTTCSGG GKTSTMSCGG NVTCSCCCCC CSGCCAAGRA CCGNGGTTTT TTTTTGAACC 840KCMANTCSSA AMCCGCCSSC CCCMAAAGGS GCCTNAAWGR RAYTTNKSCC CNNAAACSGG 900CCCCCAKYTY SGGKTTCNNC CNCCSGKKGT CCMTSTTTMM MRCCCTTTGN GNKTTTTTAN 960MGSCCTTNNC CACCCCCYCK GGGKCSMNNA GAAKTMYWKC CNGGGGNNAN RSCCCCCCNN 1020GSGKGGGGKG MGAGYSCCKT CTKGCGNCNN YKNTTTCCCC C 1061
(2)SEQ ID NO:334的信息:
(ⅰ)序列特征:
(A)长度:986碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:334:GNNGNNNKWN ATMCAYCWYY CTSCACCSGG GMTCWATTGC GGCCGCAWKY TNGTMAASAG 60ATCTMGAAYT CGGCACANAG CGGCACAGAG TGTGTGCATC TGTGTCANAG CTGTCAACGC 120GGTGCCGCSG GTGGTRASCA CMCATTGCGR AACACCAAAC CCGTCCGCGG GYCACCGGCK 180TCGCCTGCAA AAYCCTCCAG GCCACCYCRA AACAAYWYCT CCTGCAACSC ARSCCGTTYC 240GCGGCCGRAT CCTGGKYCAS YTCGCCKTGC GGTGCGCCAA GGTACTGGCS CWYCRANACC 300GCTYCGGGRA ACCNAACGTA AATCTTGCCN AATTTGCNTT CCCCCTSCCC TTRATNAATT 360TGTTAAACCA CGCAAACCTY CGGGCKTCTC CTCKTGCCRA WTCCGRWTCC RATNYCGCCA 420TGGCCTNKTC KYCTYCKYCS GTMCCCAAAT CTTGGTATCC TATATTGTCC CTAAATGCAA 480ATCTKGGCTG TCCATNTGCT GGCGTTCAAA TTWAMANCAG NGGTTTCTTY CTTCCNAAAC 540CCSTTGGCCC CAAACCNAAA AATGATNATA ATAATGGTGC TNTCAAACCC CGCNCCCATY 600CNATCSGKCC AMMCCCCRGN GGKTANKKGG GNAATTCTMM AACCCCAAGC CATAASNTTG 660SGANAAACCY NCNCMGGYCA CCAAAACANY NTTNTTGGNY SSNTTCGGMN YCATGGCTNN 720CMAAAACCCA AATACTNYYG GGYCCAATAA AAMMMSGGYC SAMCCGGAAA WTTTTYTTGN 780KYNAAACCNA AAKCCTTTTT CNAACCCDAN WNTYCCTNCC RCRCMANTGG CNSGGARTKT 840SSSCTTNCCA ATGKYCCMAA AGNGGGRANA CCARCCCCAA TTCCTNNNTN KNKNCCCNST 900TRNAAAAGGG GKNTYNCMAA AASCNCCNCC NCNCTCCCAA AAKAMCCCCN AAAGAKNTCN 960NAANASKYSN NNNSCCCCCC CCMMMN 986
(2)SEQ ID NO:335的信息:
(ⅰ)序列特征:
(A)长度:1074碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:335:NGNGGGNKRN ATMMAYCWCT SATYYACCSN GGMNMWATTG CGGCCRMAWT CTNGTMKASA 60GATCTMGAAA YTCGGCAAAG AGYATKCTCG GGGGCCAGAT TTNTGGCCCG CAACCGCCGC 120ACTTTGCAYW TCAACAKTCC SGGTGCCCCA AAAAAWTCWT ACCCCCATMC TYCKTGCASM 180ASYTGCGCCC RATTRAACAC CCGGCCGGCW TGCTGCGCCA GGTATTYCAS CAGYTCAAAY 240YCTTTKTAGK TAAAATCCAG CSGGCGGCCA CNCAGCCGGG CGGTKTAGGT GCCTYCRTCA 300ATMACCAGCY CGCCCAGGGY CACCTTGCCC AAAAYCTCCT GGGTCAGCCA AATTYCCGCS 360CCGGCCAACM ACCANCCGCA TYCTGGCNTC AATCYCACCG GGCCCGGTGY TAAAMMANMA 420GRATCTCKTC MANCCCCCAN TCAGCSYTNA CNGCMACAGC CCGCCTTCTT CAMACCGCCA 480RTACCGGGWT CAACCGGCCS GTCAAACTCA ACAGGCGGNC AGGCCTCCCC CGGANSAAAG 540GTCTTACSCC NNYAANAAAA MAAGNTCTGT TTTCCCCCTC CASAASNAAA AANCCCCSGC 600CGGGCCTTCN NMMGGGTTTG GGGMANANAA AARCNCCGGN GGAACGNATC CGAAAMCTCC 660CAAGTCNCMT TWAWAACYCN NNAACCCCCC ANTTTTGGGA AAGGNTCCCC NTTMYCCCCC 720TTTTASGKTS GGGMMYYCTY TAAAAAAATT CCCCAAAAAG CCCCGGGAAG GGTCMAMCTG 780GGNAAATTTC CAAMCCNWGK TTNTTYNGGT TMCGGGGGRA AATTYCNCTC CCYYNNNGGG 840CSSGSNNNAT TAYGGMSNMT TTTNNAAWTM NSGKKTSAMM YNNKCCMNNN SNNMSMANNK 900TNAMCKCCCN CCTCNGNGKY CSCYNCCCSG GNAGNGGRAS MKCCNANMAA AYASGNTTNK 960CGGAAMMCNN AATKGNNNSC CCGGASMCMN NNNMAAATMT CNCNKCNSNN AANRGMRACN 1020CCCNSNSGMN RRGAARMTNY YCCCCCGSKM GKGNKAAAAW GKYCCCCCCM AAAG 1074
(2)SEQ ID NO:336的信息:
(ⅰ)序列特征:
(A)长度:1195碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:336:NGNGNCNKNT MTACATCWTT CTGCACCSGG GNTCWANTGC GGCCGCAWKY TTGTCGASAG 60ATCTCGAAYT CGGCAMGAGG ACWCTCGCRA CGCCCCCACA NACTCTGGCG TGTGTACCCC 120ATTGNGCGCK TCACGCGCCC AYTGANCCAK TNCACTGGGG TGCCGTYCGC CKTGCGCGGC 180GGCCTCACGG CKCTSCWTCT RAAGGCWTGG CGCACCGCAT TCGGTTTTCT RAACGCTGGG 240AAAWTGGCCA GCCGTCTGGC TCATGGGNTC TACGCAACGC CNGCCCCCAA CRCTTTCTTA 300AATCCGGYCC NTCCTGANCS CTTTGAAYCC CGGGGSAAGA ACTGGTTGCS CNCGAYCTGC 360TCGAACTTRK TCNAAATCCC GCANAKTGTT TCNTAMGYCC CNCCGGAAGG NGAACCTACT 420TTCNGGWANG TCGGCNKCCG GCGCTTATCA STCCTGATCA ACGGGGAACT GGYKNNSTTG 480KGGGAAAAAG RRCCTCAATG MTYGGTCCKC GCTGCGKANC CGCSCCCTGK GYCGCNAATG 540GAAGGCSMAG GGTTAANGCC MTTYCNYCCR RSCCGTSTGA SGKWTTYCGG MGGANKAMNN 600NNKMAMWTTK TCRGNGGCCW ATSTSCCGGG CKSTTAKAGA ANACTYCCKW WCCGTNTYSC 660SAAAGNTKCS GCGMGTTTTS SCCKMGANGN YCTGATTTSA GGGGGKYKCC CCCGGGGTYC 720CGAAWKWRKY CCYAGGGGGM GNYCSAGCSC CGMNNATNAG AGNAAGGKTT RYGSTSKNCC 780TYTNKGGACC WSCNNCWSAK ANAACNNKKT TGCSCCNTMS AGNKTNKGRT YCCNKTSTTC 840TAAGAGGAGC TATKMKCGCC CKTGGANGMM GAGWGMGCGC KYCCCSNKRT TCNTNGWAAA 900TATKSAGMGG TKCCGMAGMK CCSCGTTTKT TKTGANAAMN MSMRKNKKTG CGMGYTCTSC 960GGGNTTTGTA GAGTAKTCGS CSCSSMWGAC WCSGMCMGNG AGKNKTNNTS YANTGARCGY 1020MNNSKTMKMT MSCSCGCGNA GGAGNGCCCC CSANGMSTGY NKGGNMSSNG ARAKGATGGS 1080GGCCNCGMNN MGMGGANMGA SANNGMGGMR GGGGGKTGKC TCKCSCCGNS CSANGRAGAA 1140GKTCNGSCGC CGMGGKYGKT KTKTKNKTGG YSTCMSSMMM NAGAAAAGAG AGGGC 1195
(2)SEQ ID NO:337的信息:
(ⅰ)序列特征:
(A)长度:3572碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:基因组DNA
(ⅹⅰ)序列描述:SEQ ID NO:337:CCATCTGATC GTTGGCAACC AGCATCGCAG TGGGAACGAT GCCCTCATTC AGCATTTGCA 60TGGTTTGTTG AAAACCGGAC ATGGCACTCC AGTCGCCTTC CCGTTCCGCT ATCGGCTGAA 120TTTGATTGCG AGTGAGATAT TTATGCCAGC CAGCCAGACG CAGACGCGCC GAGACAGAAC 180TTAATGGGCC CGCTAACAGC GCGATTTGCT GGTGACCCAA TGCGACCAGA TGCTCCACGC 240CCAGTCGCGT ACCGTCTTCA TGGGAGAAAA TAATACTGTT GATGGGTGTC TGGTCAGAGA 300CATCAAGAAA TAACGCCGGA ACATTAGTGC AGGCAGCTTC CACAGCAATG GCATCCTGGT 360CATCCAGCGG ATAGTTAATG ATCAGCCCAC TGACGCGTTG CGCGAGAAGA TTGTGCACCG 420CCGCTTTACA GGCTTCGACG CCGCTTCGTT CTACCATCGA CACCACCACG CTGGCACCCA 480GTTGATCGGC GCGAGATTTA ATCGCCGCGA CAATTTGCGA CGGCGCGTGC AGGGCCAGAC 540TGGAGGTGGC AACGCCAATC AGCAACGACT GTTTGCCCGC CAGTTGTTGT GCCACGCGGT 600TGGGAATGTA ATTCAGCTCC GCCATCGCCG CTTCCACTTT TTCCCGCGTT TTCGCAGAAA 660CGTGGCTGGC CTGGTTCACC ACGCGGGAAA CGGTCTGATA AGAGACACCG GCATACTCTG 720CGACATCGTA TAACGTTACT GGTTTCACAT TCACCACCCT GAATTGACTC TCTTCCGGGC 780GGTATCATGC CATACCGCGA AAGGTTTTGC GCCATTCGAT GGTGTCCGGG ATCTCGACGC 840TCTCCCTTAT GCGACTCCTG CATTAGGAAG CAGCCCAGTA GTAGGTTGAG GCCGTTGAGC 900ACCGCCGCCG CAAGGAATGG TGCATGCAAG GAGATGGCGC CCAACAGTCC CCCGGCCACG 960GGGCCTGCCA CCATACCCAC GCCGAAACAA GCGCTCATGA GCCCGAAGTG GCGAGCCCGA 1020TCTTCCCCAT CGGTGATGTC GGCGATATAG GCGCCAGCAA CCGCACCTGT GGCGCCGGTG 1080ATGCCGGCCA CGATGCGTCC GGCGTAGAGG ATCGAGATCT CGATCCCGCG AAATTAATAC 1140GACTCACTAT AGGGGAATTG TGAGCGGATA ACAATTCCCC TCTAGAAATA ATTTTGTTTA 1200ACTTTAAGAA GGAGATATAC ATATGGGCCA TCATCATCAT CATCACGTGA TCGACATCAT 1260CGGGACCAGC CCCACATCCT GGGAACAGGC GGCGGCGGAG GCGGTCCAGC GGGCGCGGGA 1320TAGCGTCGAT GACATCCGCG TCGCTCGGGT CATTGAGCAG GACATGGCCG TGGACAGCGC 1380CGGCAAGATC ACCTACCGCA TCAAGCTCGA AGTGTCGTTC AAGATGAGGC CGGCGCAACC 1440GAGGGGCTCG AAACCACCGA GCGGTTCGCC TGAAACGGGC GCCGGCGCCG GTACTGTCGC 1500GACTACCCCC GCGTCGTCGC CGGTGACGTT GGCGGAGACC GGTAGCACGC TGCTCTACCC 1560GCTGTTCAAC CTGTGGGGTC CGGCCTTTCA CGAGAGGTAT CCGAACGTCA CGATCACCGC 1620TCAGGGCACC GGTTCTGGTG CCGGGATCGC GCAGGCCGCC GCCGGGACGG TCAACATTGG 1680GGCCTCCGAC GCCTATCTGT CGGAAGGTGA TATGGCCGCG CACAAGGGGC TGATGAACAT 1740CGCGCTAGCC ATCTCCGCTC AGCAGGTCAA CTACAACCTG CCCGGAGTGA GCGAGCACCT 1800CAAGCTGAAC GGAAAAGTCC TGGCGGCCAT GTACCAGGGC ACCATCAAAA CCTGGGACGA 1860CCCGCAGATC GCTGCGCTCA ACCCCGGCGT GAACCTGCCC GGCACCGCGG TAGTTCCGCT 1920GCACCGCTCC GACGGGTCCG GTGACACCTT CTTGTTCACC CAGTACCTGT CCAAGCAAGA 1980TCCCGAGGGC TGGGGCAAGT CGCCCGGCTT CGGCACCACC GTCGACTTCC CGGCGGTGCC 2040GGGTGCGCTG GGTGAGAACG GCAACGGCGG CATGGTGACC GGTTGCGCCG AGACACCGGG 2100CTGCGTGGCC TATATCGGCA TCAGCTTCCT CGACCAGGCC AGTCAACGGG GACTCGGCGA 2160GGCCCAACTA GGCAATAGCT CTGGCAATTT CTTGTTGCCC GACGCGCAAA GCATTCAGGC 2220CGCGGCGGCT GGCTTCGCAT CGAAAACCCC GGCGAACCAG GCGATTTCGA TGATCGACGG 2280GCCCGCCCCG GACGGCTACC CGATCATCAA CTACGAGTAC GCCATCGTCA ACAACCGGCA 2340AAAGGACGCC GCCACCGCGC AGACCTTGCA GGCATTTCTG CACTGGGCGA TCACCGACGG 2400CAACAAGGCC TCGTTCCTCG ACCAGGTTCA TTTCCAGCCG CTGCCGCCCG CGGTGGTGAA 2460GTTGTCTGAC GCGTTGATCG CGACGATTTC CAGCGCTGAG ATGAAGACCG ATGCCGCTAC 2520CCTCGCGCAG GAGGCAGGTA ATTTCGAGCG GATCTCCGGC GACCTGAAAA CCCAGATCGA 2580CCAGGTGGAG TCGACGGCAG GTTCGTTGCA GGGCCAGTGG CGCGGCGCGG CGGGGACGGC 2640CGCCCAGGCC GCGGTGGTGC GCTTCCAAGA AGCAGCCAAT AAGCAGAAGC AGGAACTCGA 2700CGAGATCTCG ACGAATATTC GTCAGGCCGG CGTCCAATAC TCGAGGGCCG ACGAGGAGCA 2760GCAGCAGGCG CTGTCCTCGC AAATGGGCTT TGGATTCAGC TTCGCGCTGC CTGCTGGCTG 2820GGTGGAGTCT GACGCCGCCC ACTTCGACTA CGGTTCAGCA CTCCTCAGCA AAACCACCGG 2880GGACCCGCCA TTTCCCGGAC AGCCGCCGCC GGTGGCCAAT GACACCCGTA TCGTGCTCGG 2940CCGGCTAGAC CAAAAGCTTT ACGCCAGCGC CGAAGCCACC GACTCCAAGG CCGCGGCCCG 3000GTTGGGCTCG GACATGGGTG AGTTCTATAT GCCCTACCCG GGCACCCGGA TCAACCAGGA 3060AACCGTCTCG CTYGACGCCA ACGGGGTGTC TGGAAGCGCG TCGTATTACG AAGTCAAGTT 3120CAGCGATCCG AGTAAGCCGA ACGGCCAGAT CTGGACGGGC GTAATCGGCT CGCCCGCGGC 3180GAACGCACCG GACGCCGGGC CCCCTCAGCG CTGGTTTGTG GTATGGCTCG GGACCGCCAA 3240CAACCCGGTG GACAAGGGCG CGGCCAAGGC GCTGGCCGAA TCGATCCGGC CTTTGGTCGC 3300CCCGCCGCCG GCGCCGGCCG GGGAAGTCGC TCCTACCCCG ACGACACCGA CACCGCAGCG 3360GACCTTACCG GCCTGAGAAT TCTGCAGATA TCCATCACAC TGGCGGCCGC TCGAGCACCA 3420CCACCACCAC CACTGAGATC CGGCTGCTAA CAAAGCCCGA AAGGAAGCTG AGTTGGCTGC 3480TGCCACCGCT GAGCAATAAC TAGCATAACC CCTTGGGGCC TCTAAACGGG TCTTGAGGGG 3540TTTTTTGCTG AAAGGAGGAA CTATATCCGG AT 3572
(2)SEQ ID NO:338的信息:
(ⅰ)序列特征:
(A)长度:20氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:肽
(ⅹⅰ)序列描述:SEQ ID NO:338:Val Gln Phe Gln Ser Gly Gly Asp Asn Ser Pro Ala Val Tyr Xaa Xaa1 5 10 15Asp Gly Xaa Arg
20
(2)SEQ ID NO:339的信息:
(ⅰ)序列特征:
(A)长度:10氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:肽
(ⅹⅰ)序列描述:SEQ ID NO:339:Thr Thr Val Pro Xaa Val Thr Glu Ala Arg1 5 10
(2)SEQ ID NO:340的信息:
(ⅰ)序列特征:
(A)长度:10氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:肽
(ⅹⅰ)序列描述:SEQ ID NO:340:Thr Thr Pro Ser Xaa Val Ala Phe Ala Arg1 5 10
(2)SEQ ID NO:341的信息:
(ⅰ)序列特征:
(A)长度:12氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:肽
(ⅹⅰ)序列描述:SEQ ID NO:341:Asp Ala Gly Lys Xaa Ala Gly Xaa Asp Val Xaa Arg1 5 10
(2)SEQ ID NO:342的信息:
(ⅰ)序列特征:
(A)长度:18氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:肽
(ⅹⅰ)序列描述:SEQ ID NO:342:Thr Xaa Glu Glu Xaa Gln Glu Ser Phe Asn Ser Ala Ala Pro Gly Asn1 5 10 15Xaa Lys
(2)SEQ ID NO:343的信息:
(ⅰ)序列特征:
(A)长度:27碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:其它
(ⅹⅰ)序列描述:SEQ ID NO:343:CTAGTTAGTA CTCAGTCGCA GACCGTG 27
(2)SEQ ID NO:344的信息:
(ⅰ)序列特征:
(A)长度:25碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:其它
(ⅹⅰ)序列描述:SEQ ID N0:344:GCAGTGACGA ATTCACTTCG ACTCC 25
(2)SEQ ID NO:345的信息:
(ⅰ)序列特征:
(A)长度:2412碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:345:CATATGGGCC ATCATCATCA TCATCACGTG ATCGACATCA TCGGGACCAG CCCCACATCC 60TGGGAACAGG CGGCGGCGGA GGCGGTCCAG CGGGCGCGGG ATAGCGTCGA TGACATCCGC 120GTCGCTCGGG TCATTGAGCA GGACATGGCC GTGGACAGCG CCGGCAAGAT CACCTACCGC 180ATCAAGCTCG AAGTGTCGTT CAAGATGAGG CCGGCGCAAC CGAGGGGCTC GAAACCACCG 240AGCGGTTCGC CTGAAACGGG CGCCGGCGCC GGTACTGTCG CGACTACCCC CGCGTCGTCG 300CCGGTGACGT TGGCGGAGAC CGGTAGCACG CTGCTCTACC CGCTGTTCAA CCTGTGGGGT 360CCGGCCTTTC ACGAGAGGTA TCCGAACGTC ACGATCACCG CTCAGGGCAC CGGTTCTGGT 420GCCGGGATCG CGCAGGCCGC CGCCGGGACG GTCAACATTG GGGCCTCCGA CGCCTATCTG 480TCGGAAGGTG ATATGGCCGC GCACAAGGGG CTGATGAACA TCGCGCTAGC CATCTCCGCT 540CAGCAGGTCA ACTACAACCT GCCCGGAGTG AGCGAGCACC TCAAGCTGAA CGGAAAAGTC 600CTGGCGGCCA TGTACCAGGG CACCATCAAA ACCTGGGACG ACCCGCAGAT CGCTGCGCTC 660AACCCCGGCG TGAACCTGCC CGGCACCGCG GTAGTTCCGC TGCACCGCTC CGACGGGTCC 720GGTGACACCT TCTTGTTCAC CCAGTACCTG TCCAAGCAAG ATCCCGAGGG CTGGGGCAAG 780TCGCCCGGCT TCGGCACCAC CGTCGACTTC CCGGCGGTGC CGGGTGCGCT GGGTGAGAAC 840GGCAACGGCG GCATGGTGAC CGGTTGCGCC GAGACACCGG GCTGCGTGGC CTATATCGGC 900ATCAGCTTCC TCGACCAGGC CAGTCAACGG GGACTCGGCG AGGCCCAACT AGGCAATAGC 960TCTGGCAATT TCTTGTTGCC CGACGCGCAA AGCATTCAGG CCGCGGCGGC TGGCTTCGCA 1020TCGAAAACCC CGGCGAACCA GGCGATTTCG ATGATCGACG GGCCCGCCCC GGACGGCTAC 1080CCGATCATCA ACTACGAGTA CGCCATCGTC AACAACCGGC AAAAGGACGC CGCCACCGCG 1140CAGACCTTGC AGGCATTTCT GCACTGGGCG ATCACCGACG GCAACAAGGC CTCGTTCCTC 1200GACCAGGTTC ATTTCCAGCC GCTGCCGCCC GCGGTGGTGA AGTTGTCTGA CGCGTTGATC 1260GCGACGATTT CCAGCGCTGA GATGAAGACC GATGCCGCTA CCCTCGCGCA GGAGGCAGGT 1320AATTTCGAGC GGATCTCCGG CGACCTGAAA ACCCAGATCG ACCAGGTGGA GTCGACGGCA 1380GGTTCGTTGC AGGGCCAGTG GCGCGGCGCG GCGGGGACGG CCGCCCAGGC CGCGGTGGTG 1440CGCTTCCAAG AAGCAGCCAA TAAGCAGAAG CAGGAACTCG ACGAGATCTC GACGAATATT 1500CGTCAGGCCG GCGTCCAATA CTCGAGGGCC GACGAGGAGC AGCAGCAGGC GCTGTCCTCG 1560CAAATGGGCT TTGTGCCCAC AACGGCCGCC TCGCCGCCGT CGACCGCTGC AGCGCCACCC 1620GCACCGGCGA CACCTGTTGC CCCCCCACCA CCGGCCGCCG CCAACACGCC GAATGCCCAG 1680CCGGGCGATC CCAACGCAGC ACCTCCGCCG GCCGACCCGA ACGCACCGCC GCCACCTGTC 1740ATTGCCCCAA ACGCACCCCA ACCTGTCCGG ATCGACAACC CGGTTGGAGG ATTCAGCTTC 1800GCGCTGCCTG CTGGCTGGGT GGAGTCTGAC GCCGCCCACT TCGACTACGG TTCAGCACTC 1860CTCAGCAAAA CCACCGGGGA CCCGCCATTT CCCGGACAGC CGCCGCCGGT GGCCAATGAC 1920ACCCGTATCG TGCTCGGCCG GCTAGACCAA AAGCTTTACG CCAGCGCCGA AGCCACCGAC 1980TCCAAGGCCG CGGCCCGGTT GGGCTCGGAC ATGGGTGAGT TCTATATGCC CTACCCGGGC 2040ACCCGGATCA ACCAGGAAAC CGTCTCGCTC GACGCCAACG GGGTGTCTGG AAGCGCGTCG 2100TATTACGAAG TCAAGTTCAG CGATCCGAGT AAGCCGAACG GCCAGATCTG GACGGGCGTA 2160ATCGGCTCGC CCGCGGCGAA CGCACCGGAC GCCGGGCCCC CTCAGCGCTG GTTTGTGGTA 2220TGGCTCGGGA CCGCCAACAA CCCGGTGGAC AAGGGCGCGG CCAAGGCGCT GGCCGAATCG 2280ATCCGGCCTT TGGTCGCCCC GCCGCCGGCG CCGGCACCGG CTCCTGCAGA GCCCGCTCCG 2340GCGCCGGCGC CGGCCGGGGA AGTCGCTCCT ACCCCGACGA CACCGACACC GCAGCGGACC 2400TTACCGGCCT GA 2412
(2)SEQ ID NO:346的信息:
(ⅰ)序列特征:
(A)长度:802氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:346:Met Gly His His His His His His Val Ile Asp Ile Ile Gly Thr Ser1 5 10 15Pro Thr Ser Trp Glu Gln Ala Ala Ala Glu Ala Val Gln Arg Ala Arg
20 25 30Asp Ser Val Asp Asp Ile Arg Val Ala Arg Val Ile Glu Gln Asp Met
35 40 45Ala Val Asp Ser Ala Gly Lys Ile Thr Tyr Arg Ile Lys Leu Glu Val
50 55 60Ser Phe Lys Met Arg Pro Ala Gln Pro Arg Gly Ser Lys Pro Pro Ser65 70 75 80Gly Ser Pro Glu Thr Gly Ala Gly Ala Gly Thr Val Ala Thr Thr Pro
85 90 95Ala Ser Ser Pro Val Thr Leu Ala Glu Thr Gly Ser Thr Leu Leu Tyr
100 105 110Pro Leu Phe Asn Leu Trp Gly Pro Ala Phe His Glu Arg Tyr Pro Asn
115 120 125Val Thr Ile Thr Ala Gln Gly Thr Gly Ser Gly Ala Gly Ile Ala Gln
130 135 140Ala Ala Ala Gly Thr Val Asn Ile Gly Ala Ser Asp Ala Tyr Leu Ser145 150 155 160Glu Gly Asp Met Ala Ala His Lys Gly Leu Met Asn Ile Ala Leu Ala
165 170 175Ile Ser Ala Gln Gln Val Asn Tyr Asn Leu Pro Gly Val Ser Glu His
180 185 190Leu Lys Leu Asn Gly Lys Val Leu Ala Ala Met Tyr Gln Gly Thr Ile
195 200 205Lys Thr Trp Asp Asp Pro Gln Ile Ala Ala Leu Asn Pro Gly Val Asn
210 215 220Leu Pro Gly Thr Ala Val Val Pro Leu His Arg Ser Asp Gly Ser Gly225 230 235 240Asp Thr Phe Leu Phe Thr Gln Tyr Leu Ser Lys Gln Asp Pro Glu Gly
245 250 255Trp Gly Lys Ser Pro Gly Phe Gly Thr Thr Val Asp Phe Pro Ala Val
260 265 270Pro Gly Ala Leu Gly Glu Asn Gly Asn Gly Gly Met Val Thr Gly Cys
275 280 285Ala Glu Thr Pro Gly Cys Val Ala Tyr Ile Gly Ile Ser Phe Leu Asp
290 295 300Gln Ala Ser Gln Arg Gly Leu Gly Glu Ala Gln Leu Gly Asn Ser Ser305 310 315 320Gly Asn Phe Leu Leu Pro Asp Ala Gln Ser Ile Gln Ala Ala Ala Ala
325 330 335Gly Phe Ala Ser Lys Thr Pro Ala Asn Gln Ala Ile Ser Met Ile Asp
340 345 350Gly Pro Ala Pro Asp Gly Tyr Pro Ile Ile Asn Tyr Glu Tyr Ala Ile
355 360 365Val Asn Asn Arg Gln Lys Asp Ala Ala Thr Ala Gln Thr Leu Gln Ala
370 375 380Phe Leu His Trp Ala Ile Thr Asp Gly Asn Lys Ala Ser Phe Leu Asp385 390 395 400Gln Val His Phe Gln Pro Leu Pro Pro Ala Val Val Lys Leu Ser Asp
405 410 415Ala Leu Ile Ala Thr Ile Ser Ser Ala Glu Met Lys Thr Asp Ala Ala
420 425 430Thr Leu Ala Gln Glu Ala Gly Asn Phe Glu Arg Ile Ser Gly Asp Leu
435 440 445Lys Thr Gln Ile Asp Gln Val Glu Ser Thr Ala Gly Ser Leu Gln Gly
450 455 460Gln Trp Arg Gly Ala Ala Gly Thr Ala Ala Gln Ala Ala Val Val Arg465 470 475 480Phe Gln Glu Ala Ala Asn Lys Gln Lys Gln Glu Leu Asp Glu Ile Ser
485 490 495Thr Asn Ile Arg Gln Ala Gly Val Gln Tyr Ser Arg Ala Asp Glu Glu
500 505 510Gln Gln Gln Ala Leu Ser Ser Gln Met Gly Phe Val Pro Thr Thr Ala
515 520 525Ala Ser Pro Pro Ser Thr Ala Ala Ala Pro Pro Ala Pro Ala Thr Pro
530 535 540Val Ala Pro Pro Pro Pro Ala Ala Ala Asn Thr Pro Asn Ala Gln Pro545 550 555 560Gly Asp Pro Asn Ala Ala Pro Pro Pro Ala Asp Pro Asn Ala Pro Pro
565 570 575Pro Pro Val Ile Ala Pro Asn Ala Pro Gln Pro Val Arg Ile Asp Asn
580 585 590Pro Val Gly Gly Phe Ser Phe Ala Leu Pro Ala Gly Trp Val Glu Ser
595 600 605Asp Ala Ala His Phe Asp Tyr Gly Ser Ala Leu Leu Ser Lys Thr Thr
610 615 620Gly Asp Pro Pro Phe Pro Gly Gln Pro Pro Pro Val Ala Asn Asp Thr625 630 635 640Arg Ile Val Leu Gly Arg Leu Asp Gln Lys Leu Tyr Ala Ser Ala Glu
645 650 655Ala Thr Asp Ser Lys Ala Ala Ala Arg Leu Gly Ser Asp Met Gly Glu
660 665 670Phe Tyr Met Pro Tyr Pro Gly Thr Arg Ile Asn Gln Glu Thr Val Ser
675 680 685Leu Asp Ala Asn Gly Val Ser Gly Ser Ala Ser Tyr Tyr Glu Val Lys
690 695 700Phe Ser Asp Pro Ser Lys Pro Asn Gly Gln Ile Trp Thr Gly Val Ile705 710 715 720Gly Ser Pro Ala Ala Asn Ala Pro Asp Ala Gly Pro Pro Gln Arg Trp
725 730 735Phe Val Val Trp Leu Gly Thr Ala Asn Asn Pro Val Asp Lys Gly Ala
740 745 750Ala Lys Ala Leu Ala Glu Ser Ile Arg Pro Leu Val Ala Pro Pro Pro
755 760 765Ala Pro Ala Pro Ala Pro Ala Glu Pro Ala Pro Ala Pro Ala Pro Ala
770 775 780Gly Glu Val Ala Pro Thr Pro Thr Thr Pro Thr Pro Gln Arg Thr Leu785 790 795 800Pro Ala
(2)SEQ ID NO:347的信息:
(ⅰ)序列特征:
(A)长度:34碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:其它
(ⅹⅰ)序列描述:SEQ ID NO:347:GGATCCAAAC CACCGAGCGG TTCGCCTGAA ACGG 34
(2)SEQ ID NO:348的信息:
(ⅰ)序列特征:
(A)长度:37碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:其它
(ⅹⅰ)序列描述:SEQ ID NO:348:CGCTGCGAAT TCACCTCCGG AGGAAATCGT CGCGATC 37
(2)SEQ ID NO:349的信息:
(ⅰ)序列特征:
(A)长度:1962碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:cDNA
(ⅹⅰ)序列描述:SEQ ID NO:349:CATATGGGCC ATCATCATCA TCATCACGGA TCCAAACCAC CGAGCGGTTC GCCTGAAACG 60GGCGCCGGCG CCGGTACTGT CGCGACTACC CCCGCGTCGT CGCCGGTGAC GTTGGCGGAG 120ACCGGTAGCA CGCTGCTCTA CCCGCTGTTC AACCTGTGGG GTCCGGCCTT TCACGAGAGG 180TATCCGAACG TCACGATCAC CGCTCAGGGC ACCGGTTCTG GTGCCGGGAT CGCGCAGGCC 240GCCGCCGGGA CGGTCAACAT TGGGGCCTCC GACGCCTATC TGTCGGAAGG TGATATGGCC 300GCGCACAAGG GGCTGATGAA CATCGCGCTA GCCATCTCCG CTCAGCAGGT CAACTACAAC 360CTGCCCGGAG TGAGCGAGCA CCTCAAGCTG AACGGAAAAG TCCTGGCGGC CATGTACCAG 420GGCACCATCA AAACCTGGGA CGACCCGCAG ATCGCTGCGC TCAACCCCGG CGTGAACCTG 480CCCGGCACCG CGGTAGTTCC GCTGCACCGC TCCGACGGGT CCGGTGACAC CTTCTTGTTC 540ACCCAGTACC TGTCCAAGCA AGATCCCGAG GGCTGGGGCA AGTCGCCCGG CTTCGGCACC 600ACCGTCGACT TCCCGGCGGT GCCGGGTGCG CTGGGTGAGA ACGGCAACGG CGGCATGGTG 660ACCGGTTGCG CCGAGACACC GGGCTGCGTG GCCTATATCG GCATCAGCTT CCTCGACCAG 720GCCAGTCAAC GGGGACTCGG CGAGGCCCAA CTAGGCAATA GCTCTGGCAA TTTCTTGTTG 780CCCGACGCGC AAAGCATTCA GGCCGCGGCG GCTGGCTTCG CATCGAAAAC CCCGGCGAAC 840CAGGCGATTT CGATGATCGA CGGGCCCGCC CCGGACGGCT ACCCGATCAT CAACTACGAG 900TACGCCATCG TCAACAACCG GCAAAAGGAC GCCGCCACCG CGCAGACCTT GCAGGCATTT 960CTGCACTGGG CGATCACCGA CGGCAACAAG GCCTCGTTCC TCGACCAGGT TCATTTCCAG 1020CCGCTGCCGC CCGCGGTGGT GAAGTTGTCT GACGCGTTGA TCGCGACGAT TTCCTCCGGA 1080GGTGGCAGTG GGGGAGGCTC AGGTGGAGGT TCTGGCGGGA GCGTGCCCAC AACGGCCGCC 1140TCGCCGCCGT CGACCGCTGC AGCGCCACCC GCACCGGCGA CACCTGTTGC CCCCCCACCA 1200CCGGCCGCCG CCAACACGCC GAATGCCCAG CCGGGCGATC CCAACGCAGC ACCTCCGCCG 1260GCCGACCCGA ACGCACCGCC GCCACCTGTC ATTGCCCCAA ACGCACCCCA ACCTGTCCGG 1320ATCGACAACC CGGTTGGAGG ATTCAGCTTC GCGCTGCCTG CTGGCTGGGT GGAGTCTGAC 1380GCCGCCCACT TCGACTACGG TTCAGCACTC CTCAGCAAAA CCACCGGGGA CCCGCCATTT 1440CCCGGACAGC CGCCGCCGGT GGCCAATGAC ACCCGTATCG TGCTCGGCCG GCTAGACCAA 1500AAGCTTTACG CCAGCGCCGA AGCCACCGAC TCCAAGGCCG CGGCCCGGTT GGGCTCGGAC 1560ATGGGTGAGT TCTATATGCC CTACCCGGGC ACCCGGATCA ACCAGGAAAC CGTCTCGCTC 1620GACGCCAACG GGGTGTCTGG AAGCGCGTCG TATTACGAAG TCAAGTTCAG CGATCCGAGT 1680AAGCCGAACG GCCAGATCTG GACGGGCGTA ATCGGCTCGC CCGCGGCGAA CGCACCGGAC 1740GCCGGGCCCC CTCAGCGCTG GTTTGTGGTA TGGCTCGGGA CCGCCAACAA CCCGGTGGAC 1800AAGGGCGCGG CCAAGGCGCT GGCCGAATCG ATCCGGCCTT TGGTCGCCCC GCCGCCGGCG 1860CCGGCACCGG CTCCTGCAGA GCCCGCTCCG GCGCCGGCGC CGGCCGGGGA AGTCGCTCCT 1920ACCCCGACGA CACCGACACC GCAGCGGACC TTACCGGCCT GA 1962
(2)SEQ ID NO:350的信息:
(ⅰ)序列特征:
(A)长度:652氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线性
(ⅱ)分子类型:蛋白质
(ⅹⅰ)序列描述:SEQ ID NO:350:Met Gly His His His His His His Gly Ser Lys Pro Pro Ser Gly Ser1 5 10 15Pro Glu Thr Gly Ala Gly Ala Gly Thr Val Ala Thr Thr Pro Ala Ser
20 25 30Ser Pro Val Thr Leu Ala Glu Thr Gly Ser Thr Leu Leu Tyr Pro Leu
35 40 45Phe Asn Leu Trp Gly Pro Ala Phe His Glu Arg Tyr Pro Asn Val Thr
50 55 60Ile Thr Ala Gln Gly Thr Gly Ser Gly Ala Gly Ile Ala Gln Ala Ala65 70 75 80Ala Gly Thr Val Asn Ile Gly Ala Ser Asp Ala Tyr Leu Ser Glu Gly
85 90 95Asp Mer Ala Ala His Lys Gly Leu Met Asn Ile Ala Leu Ala Ile Ser
100 105 110Ala Gln Gln Val Asn Tyr Asn Leu Pro Gly Val Ser Glu His Leu Lys
115 120 125Leu Asn Gly Lys Val Leu Ala Ala Met Tyr Gln Gly Thr Ile Lys Thr
130 135 140Trp Asp Asp Pro Gln Ile Ala Ala Leu Asn Pro Gly Val Asn Leu Pro145 150 155 160Gly Thr Ala Val Val Pro Leu His Arg Ser Asp Gly Ser Gly Asp Thr
165 170 175Phe Leu Phe Thr Gln Tyr Leu Ser Lys Gln Asp Pro Glu Gly Trp G1y
180 185 190Lys Ser Pro GIy Phe Gly Thr Thr Val Asp Phe Pro Ala Val Pro Gly
195 200 205Ala Leu Gly Glu Asn Gly Asn Gly Gly Met Val Thr Gly Cys Ala Glu
210 215 220Thr Pro Gly Cys Val Ala Tyr Ile Gly Ile Ser Phe Leu Asp Gln Ala225 230 235 240Ser Gln Arg Gly Leu Gly Glu Ala Gln Leu Gly Asn Ser Ser Gly Asn
245 250 255Phe Leu Leu Pro Asp Ala Gln Ser Ile Gln Ala Ala Ala Ala Gly Phe
260 265 270Ala Ser Lys Thr Pro Ala Asn Gln Ala Ile Ser Met Ile Asp Gly Pro
275 280 285Ala Pro Asp Gly Tyr Pro Ile Ile Asn Tyr Glu Tyr Ala Ile Val Asn
290 295 300Asn Arg Gln Lys Asp Ala Ala Thr Ala Gln Thr Leu Gln Ala Phe Leu305 310 315 320His Trp Ala Ile Thr Asp Gly Asn Lys Ala Ser Phe Leu Asp Gln Val
325 330 335His Phe Gln Pro Leu Pro Pro Ala Val Val Lys Leu Ser Asp Ala Leu
340 345 350Ile Ala Thr Ile Ser Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly
355 360 365Gly Ser Gly Gly Ser Val Pro Thr Thr Ala Ala Ser Pro Pro Ser Thr
370 375 380Ala Ala Ala Pro Pro Ala Pro Ala Thr Pro Val Ala Pro Pro Pro Pro385 390 395 400Ala Ala Ala Asn Thr Pro Asn Ala Gln Pro Gly Asp Pro Asn Ala Ala
405 410 415Pro Pro Pro Ala Asp Pro Asn Ala Pro Pro Pro Pro Val Ile Ala Pro
420 425 430Ash Ala Pro Gln Pro Val Arg Ile Asp Asn Pro Val Gly Gly Phe Ser
435 440 445Phe Ala Leu Pro Ala Gly Trp Val Glu Ser Asp Ala Ala His Phe Asp
450 455 460Tyr Gly Ser Ala Leu Leu Ser Lys Thr Thr Gly Asp Pro Pro Phe Pro465 470 475 480Gly Gln Pro Pro Pro Val Ala Asn Asp Thr Arg Ile Val Leu Gly Arg
485 490 495Leu Asp Gln Lys Leu Tyr Ala Ser Ala Glu Ala Thr Asp Ser Lys Ala
500 505 510Ala Ala Arg Leu Gly Ser Asp Met Gly Glu Phe Tyr Met Pro Tyr Pro
515 520 525Gly Thr Arg Ile Asn Gln Glu Thr Val Ser Leu Asp Ala Asn Gly Val
530 535 540Ser Gly Ser Ala Ser Tyr Tyr Glu Val Lys Phe Ser Asp Pro Ser Lys545 550 555 560Pro Asn Gly Gln Ile Trp Thr Gly Val Ile Gly Ser Pro Ala Ala Asn
565 570 575Ala Pro Asp Ala Gly Pro Pro Gln Arg Trp Phe Val Val Trp Leu Gly
580 585 590Thr Ala Asn Ash Pro Val Asp Lys Gly Ala Ala Lys Ala Leu Ala Glu
595 600 605Ser Ile Arg Pro Leu Val Ala Pro Pro Pro Ala Pro Ala Pro Ala Pro
610 615 620Ala Glu Pro Ala Pro Ala Pro Ala Pro Ala Gly Glu Val Ala Pro Thr625 630 635 640Pro Thr Thr Pro Thr Pro Gln Arg Thr Leu Pro Ala
645 650
Claims (54)
1.一种多肽,它含有可溶性结核分枝杆菌抗原的抗原性部分,或仅仅在保守性置换和/或修饰中有所不同的所述抗原的变体,其中所述抗原具有选自下列的N-端序列:
(a)Asp-Pro-Val-Asp-Ala-Val-Ile-Asn-Thr-Thr-Cys-Asn-Tyr-Gly-Gln-Val-Val-Ala-Ala-Leu(SEQ ID NO:115);
(b)Ala-Val-Glu-Ser-Gly-Met-Leu-Ala-Leu-Gly-Thr-Pro-Ala-Pro-Ser(SEQ ID NO:116);
(c)Ala-Ala-Met-Lys-Pro-Arg-Thr-Gly-Asp-Gly-Pro-Leu-Glu-Ala-Ala-Lys-Glu-Gly-Arg(SEQ ID NO:17);
(d)Tyr-Tyr-Trp-Cys-Pro-Gly-Gln-Pro-Phe-Asp-Pro-Ala-Trp-Gly-Pro(SEQ ID NO:118);
(e)Asp-Ile-Gly-Ser-Glu-Ser-Thr-Glu-Asp-Glu-Glu-Xaa-Ala-Val(SEQ IDNO:119);
(f)Ala-Glu-Glu-Ser-Ile-Ser-Thr-Xaa-Glu-Xaa-Ile-Val-Pro(SEQ IDNO:120);
(g)Asp-Pro-Glu-Pro-Ala-Pro-Pro-Val-Pro-Thr-Thr-Ala-Ala-Ser-Pro-Pro-Ser(SEQ ID NO:121);
(h)Ala-Pro-Lys-Thr-Tyr-Xaa-Glu-Glu-Leu-Lys-Gly-Thr-Asp-Thr-Gly(SEQ ID NO:122);
(i)Asp-Pro-Ala-Ser-Ala-Pro-Asp-Val-Pro-Thr-Ala-Ala-Gln-Leu-Thr-Ser-Leu-Leu-Asn-Ser-Leu-Ala-Asp-Pro-Asn-Val-Ser-Phe-Ala-Asn(SEQID NO:123);和
(j)Ala-Pro-Glu-Ser-Gly-Ala-Gly-Leu-Gly-Gly-Thr-Val-Gln-Ala-Gly;(SEQ ID NO:131)
其中Xaa可以是任何氨基酸。
2.一种多肽,它包含结核分枝杆菌抗原的免疫原性部分,或仅仅在保守性置换和/或修饰中有所不同的所述抗原的变体,其中所述抗原具有选自下列的N-端序列:
(a)Asp-Pro-Pro-Asp-Pro-His-Gln-Xaa-Asp-Met-Thr-Lys-Gly-Tyr-Tyr-Pro-Gly-Gly-Arg-Arg-Xaa-Phe;(SEQ ID NO:124)和
(b)Xaa-Tyr-Ile-Ala-Tyr-Xaa-Thr-Thr-Ala-Gly-Ile-Val-Pro-Gly-Lys-Ile-Asn-Val-His-Leu-Val;(SEQ ID NO:132)
其中Xaa可以是任何氨基酸。
3.一种多肽,它含有可溶性结核分枝杆菌抗原的抗原性部分,或仅仅在保守性置换和/或修饰中有所不同的所述抗原的变体,其中所述抗原包含由DNA序列编码的氨基酸序列,该DNA序列选自:SEQ ID NO:1、2、4-10、13-25、52、94和96中描述的序列、所述序列的互补序列、以及在中等严谨条件下与SEQ ID NO:1、2、4-10、13-25、52、94和96中描述的序列或其互补序列杂交的DNA序列。
4.一种多肽,它包含结核分枝杆菌抗原的抗原性部分,或仅仅在保守性置换和/或修饰中有所不同的所述抗原的变体,其中所述抗原包含由DNA序列编码的氨基酸序列,该DNA序列选自:SEQ ID NO:26-51、133、134、158-178、196、235、237-242、248-251、290-293、304、311、313-315、317、319、323、324、328、330、332、334和336中描述的序列、所述序列的互补序列、以及在中等严谨条件下与SEQ ID NO:26-51、133、134、158-178、196、235、237-242、248-251、290-293、304、311、313-315、317、319、323、324、328、330、332、334和336中描述的序列或其互补序列杂交的DNA序列。
5.一种DNA分子,它包含编码权利要求1-4任一项所述的多肽的核苷酸序列。
6.一种重组表达载体,它含有权利要求5所述的DNA分子。
7.一种被权利要求6所述的表达载体转化的宿主细胞。
8.根据权利要求7所述的宿主细胞,其中宿主细胞选自大肠杆菌、酵母和哺乳动物细胞。
9.一种检测生物样品中结核分枝杆菌感染的方法,该方法包括:
(a)使权利要求1-4任一项所述的一种或多种多肽与生物样品接触;和
(b)检测样品中结合至少一种多肽的抗体的存在,从而检测生物样品中的结核分枝杆菌感染。
10.一种检测生物样品中结核分枝杆菌感染的方法,该方法包括:
(a)使生物样品与具有选自序列SEQ ID NO:129和130的N-端序列的多肽接触;和
(b)检测样品中结合至少一种多肽的抗体的存在,从而检测生物样品中的结核分枝杆菌感染。
11.一种检测生物样品中结核分枝杆菌感染的方法,该方法包括:
(a)使生物样品与一种或多种由DNA序列编码的多肽接触,所述DNA序列选自:SEQ ID NO:3、11、12、135、136、151-155、184-188、194-195、198、210-220、232、234、256-271、287、288、298-303、305-310、312、316、318、320-322、325-327、329、331、333、335和337的序列、所述序列的互补序列、以及与SEQ ID NO:3、11、12、135、136、151-155、184-188、194-195、198、210-220、232、234、256-271、287、288、298-303、305-310、312、316、318、320-322、325-327、329、331、333、335和337的序列杂交的DNA序列。
(b)检测样品中结合至少一种多肽的抗体的存在,从而检测生物样品中结核分枝杆菌的感染。
12.根据权利要求9-11任一项所述的方法,其中步骤(a)还包括使生物样品与38kD结核分枝杆菌抗原接触,步骤(b)还包括检测样品中结合38kD结核分枝杆菌抗原的抗体的存在。
13.根据权利要求9-11任一项所述的方法,其中多肽与固相载体结合。
14.根据权利要求13所述的方法,其中固相载体包括硝酸纤维素、胶乳或塑料材料。
15.根据权利要求9-11任一项所述的方法,其中生物样品选自全血、血清、血浆、唾液、脑脊液和尿液。
16.根据权利要求15所述的方法,其中生物样品是全血或血清。
17.一种检测生物样品中结核分枝杆菌感染的方法,该方法包括:
(a)使样品在聚合酶链反应中与至少两种寡核苷酸引物接触,其中至少一种寡核苷酸引物对权利要求5所述的DNA分子有特异性;和
(b)检测样品中的在寡核苷酸引物存在下扩增的DNA序列,从而检测结核分枝杆菌感染。
18.根据权利要求17所述的方法,其中至少一种寡核苷酸引物包含权利要求5所述的DNA分子的至少10个连续的核苷酸。
19.一种检测生物样品中结核分枝杆菌感染的方法,该方法包括:
(a)使样品在聚合酶链反应中与至少两种寡核苷酸引物接触,其中至少一种寡核苷酸引物对选自SEQ ID NO:3、11、12、135、136、151-155、184-188、194-195、198、210-220、232、234、256-271、287、288、298-303、305-310、312、316、318、320-322、325-327、329、331、333、335和337的DNA序列有特异性;和
(b)检测样品中的在第一和第二寡核苷酸引物存在下扩增的DNA序列,从而检测结核分枝杆菌感染。
20.根据权利要求19所述的方法,其中至少一种寡核苷酸引物包含选自SEQ IDNO:3、11、12、135、136、151-155、184-188、194-195、198、210-220、232、234、256-271、287、288、298-303、305-310、312、316、318、320-322、325-327、329、331、333、335和337的DNA序列的至少10个连续的核苷酸。
21.根据权利要求17至19所述的方法,其中生物样品选自全血、痰液、血清、血浆、唾液、脑脊液和尿液。
22.一种检测生物样品中结核分枝杆菌感染的方法,该方法包括:
(a)使样品与对权利要求5所述的DNA分子有特异性的一种或多种寡核苷酸探针接触;和
(b)检测样品中与寡核苷酸探针杂交的DNA序列,从而检测结核分枝杆菌感染。
23.根据权利要求22所述的方法,其中探针包含权利要求5所述的DNA分子的至少15个连续的核苷酸。
24.一种检测生物样品中结核分枝杆菌感染的方法,该方法包括:
(a)使样品与对选自SEQ ID NO:3、11、12、135、136、151-155、184-188、194-195、198、210-220、232、234、256-271、287、288、298-303、305-310、312、316、318、320-322、325-327、329、331、333、335和337的DNA序列有特异性的一种或多种寡核苷酸探针接触;和
(b)检测样品中与寡核苷酸探针杂交的DNA序列,从而检测结核分枝杆菌感染。
25.根据权利要求24所述的方法,其中寡核苷酸探针包含选自SEQ ID NO:3、11、12、135、136、151-155、184-188、194-195、198、210-220、232、234、256-271、287、288、298-303、305-310、312、316、318、320-322、325-327、329、331、333、335和337的DNA序列的至少15个连续的核苷酸。
26.根据权利要求22或24所述的方法,其中生物样品选自全血、痰液、血清、血浆、唾液、脑脊液和尿液。
27.一种检测生物样品中结核分枝杆菌感染的方法,该方法包括:
(a)使生物样品与能结合权利要求1-4任一项所述多肽的结合剂接触;和
(b)检测样品中与结合剂结合的蛋白质或多肽,从而检测生物样品中的结核分枝杆菌感染。
28.一种检测生物样品中结核分枝杆菌感染的方法,该方法包括:
(a)使生物样品与能结合多肽的结合剂接触,所述多肽具有选自SEQ ID NO:129和130所提供序列的N-端序列;和
(b)检测样品中与结合剂结合的蛋白质或多肽,从而检测生物样品中的结核分枝杆菌感染。
29.一种检测生物样品中结核分枝杆菌感染的方法,该方法包括:
(a)使生物样品与能结合多肽的结合剂接触,编码该多肽的DNA序列选自:SEQID NO:3、11、12、135、136、151-155、184-188、194-195、198、210-220、232、234、256-271、287、288、298-303、305-310、312、316、318、320-322、325-327、329、331、333、335和337的DNA序列,所述序列的互补序列,以及与SEQ ID NO:3、11、12、135、136、151-155、184-188、194-195、198、210-220、232、234、256-271、287、288、298-303、305-310、312、316、318、320-322、325-327、329、331、333、335和337中描述的序列杂交的DNA序列;和
(b)检测样品中与结合剂结合的蛋白或多肽,从而检测生物样品中的结核分枝杆菌感染。
30.根据权利要求27至29任一项所述的方法,其中结合剂是单克隆抗体。
31.根据权利要求27至29任一项所述的方法,其中结合剂是多克隆抗体。
32.一种诊断试剂盒,它包含:
(a)一种或多种权利要求1-4任一项所述的多肽;和
(b)检测剂。
33.一种诊断试剂盒,它包含:
(a)具有选自SEQ ID NO:129和130所提供序列的N-端序列的一种或多种多肽;和
(b)检测剂。
34.一种诊断试剂盒,它包含:
(a)一种或多种由DNA序列编码的多肽,该DNA序列选自:SEQ ID NO:3、11、12、135、136、151-155、184-188、194-195、198、210-220、232、234、256-271、287、288、298-303、305-310、312、316、318、320-322、325-327、329、331、333、335和337,所述序列的互补序列,以及与SEQ ID NO:3、11、12、135、136、151-155、184-188、194-195、198、210-220、232、234、256-271、287、288、298-303、305-310、312、316、318、320-322、325-327、329、331、333、335和337中描述的序列杂交的DNA序列;和
(b)检测剂。
35.根据权利要求32-34任一项所述的试剂盒,其中多肽固定在固相载体上。
36.根据权利要求35所述的试剂盒,其中固相载体包括硝酸纤维素、胶乳或塑料材料。
37.根据权利要求32-34任一项所述的试剂盒,其中检测剂包含与结合剂偶联的报道基团。
38.根据权利要求37所述的试剂盒,其中结合剂选自抗免疫球蛋白、蛋白G、蛋白A和凝集素。
39.根据权利要求37所述的试剂盒,其中报道基团选自放射性同位素、荧光基团、发光基团、酶、生物素、染料颗粒和胶粒。
40.一种诊断试剂盒,它包含至少两种寡核苷酸引物,至少一种寡核苷酸引物对权利要求5所述的DNA分子有特异性。
41.根据权利要求40所述的诊断试剂盒,其中至少一种寡核苷酸引物包含权利要求5所述的DNA分子的至少10个连续的核苷酸。
42.一种诊断试剂盒,它包含至少两种寡核苷酸引物,至少一种引物对选自SEQID NO:3、11、12、135、136、151-155、184-188、194-195、198、210-220、232、234、256-271、287、288、298-303、305-310、312、316、318、320-322、325-327、329、331、333、335和337的DNA序列有特异性。
43.根据权利要求42所述的诊断试剂盒,其中至少一种寡核苷酸引物包含选自SEQ ID NO:3、11、12、135、136、151-155、184-188、194-195、198、210-220、232、234、256-271、287、288、298-303、305-310、312、316、318、320-322、325-327、329、331、333、335和337的DNA序列的至少10个连续的核苷酸。
44.一种诊断试剂盒,它包含至少一种寡核苷酸探针,所述寡核苷酸探针对权利要求5所述的DNA分子有特异性。
45.根据权利要求44所述的试剂盒,其中寡核苷酸探针包含权利要求5所述的DNA分子的至少15个连续的核苷酸。
46.一种诊断试剂盒,它包含至少一种寡核苷酸探针,所述寡核苷酸探针对选自SEQ ID NO:3、11、12、135、136、151-155、184-188、194-195、198、210-220、232、234、256-271、287、288、298-303、305-310、312、316、318、320-322、325-327、329、331、333、335和337的DNA序列有特异性。
47.根据权利要求46所述的试剂盒,其中寡核苷酸探针包含选自SEQ ID NO:3、11、12、135、136、151-155、184-188、194-195、198、210-220、232、234、256-271、287、288、298-303、305-310、312、316、318、320-322、325-327、329、331、333、335和337的DNA序列的至少15个连续的核苷酸。
48.一种单克隆抗体,它与权利要求1-4任一项所述的多肽结合。
49.一种多克隆抗体,它与权利要求1-4任一项所述的多肽结合。
50.一种融合蛋白,它包含两种或多种权利要求1-4任一项所述的多肽。
51.一种融合蛋白,它包含一种或多种权利要求1-4任一项所述的多肽以及ESAT-6即SEQ ID NO:99。
52.一种融合蛋白,它包含具有选自SEQ ID NO:129和130所提供序列的N-端序列的多肽。
53.一种融合蛋白,它包含一种或多种权利要求1-4任一项所述的多肽以及结核分枝杆菌抗原38kD即SEQ ID NO:150。
54.一种诊断试剂盒,它包含:
(a)权利要求50-53任一项所述的一种或多种融合蛋白;和
(b)检测剂。
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US2475398A | 1998-02-18 | 1998-02-18 | |
US09/024,753 | 1998-02-18 | ||
US09/072,596 US6458366B1 (en) | 1995-09-01 | 1998-05-05 | Compounds and methods for diagnosis of tuberculosis |
US09/072,596 | 1998-05-05 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN1312723A true CN1312723A (zh) | 2001-09-12 |
Family
ID=26698832
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN99809541A Pending CN1312723A (zh) | 1998-02-18 | 1999-02-17 | 诊断结核病的化合物和方法 |
Country Status (8)
Country | Link |
---|---|
US (4) | US6458366B1 (zh) |
EP (1) | EP1091749A1 (zh) |
JP (1) | JP2002530050A (zh) |
CN (1) | CN1312723A (zh) |
AU (1) | AU2681999A (zh) |
BR (1) | BR9908083A (zh) |
CA (1) | CA2322617A1 (zh) |
WO (1) | WO1999042118A2 (zh) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101925819A (zh) * | 2007-12-28 | 2010-12-22 | 株式会社比尔生命 | 结核分歧杆菌复合群的免疫检出方法 |
CN103814139A (zh) * | 2011-04-01 | 2014-05-21 | 澳康姆生物实验室公司 | 用于检测无细胞的病原体特异性核酸的方法和试剂盒 |
CN106011296A (zh) * | 2016-08-01 | 2016-10-12 | 武汉大学 | 一种结核分枝杆菌Rv1768基因微滴数字PCR绝对定量检测试剂盒 |
CN113004362A (zh) * | 2021-02-26 | 2021-06-22 | 南方科技大学 | 一种订书钉核酸、dna纳米机器人及其制备方法和应用 |
CN113388623A (zh) * | 2020-03-11 | 2021-09-14 | 北京蛋白质组研究中心 | 结核分枝杆菌H37Rv新基因Rv2203c及其编码蛋白和应用 |
CN114034860A (zh) * | 2021-11-01 | 2022-02-11 | 宁夏大学 | 一种结核分枝杆菌mtb39a蛋白抗体间接elisa检测方法及其试剂盒 |
CN114989312A (zh) * | 2022-06-27 | 2022-09-02 | 安徽理工大学 | 一种结合分枝杆菌融合蛋白dr2及其应用 |
Families Citing this family (42)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6458366B1 (en) | 1995-09-01 | 2002-10-01 | Corixa Corporation | Compounds and methods for diagnosis of tuberculosis |
US6290969B1 (en) * | 1995-09-01 | 2001-09-18 | Corixa Corporation | Compounds and methods for immunotherapy and diagnosis of tuberculosis |
US6592877B1 (en) | 1995-09-01 | 2003-07-15 | Corixa Corporation | Compounds and methods for immunotherapy and diagnosis of tuberculosis |
US7087713B2 (en) | 2000-02-25 | 2006-08-08 | Corixa Corporation | Compounds and methods for diagnosis and immunotherapy of tuberculosis |
EP1144447B1 (en) | 1998-11-04 | 2009-10-14 | Isis Innovation Limited | Tuberculosis diagnostic test |
US6196917B1 (en) * | 1998-11-20 | 2001-03-06 | Philips Electronics North America Corp. | Goal directed user interface |
US8143386B2 (en) * | 1999-04-07 | 2012-03-27 | Corixa Corporation | Fusion proteins of mycobacterium tuberculosis antigens and their uses |
CA2386841A1 (en) * | 1999-10-07 | 2001-04-12 | Corixa Corporation | Fusion proteins of mycobacterium tuberculosis |
US7009042B1 (en) | 1999-10-07 | 2006-03-07 | Corixa Corporation | Methods of using a Mycobacterium tuberculosis coding sequence to facilitate stable and high yield expression of the heterologous proteins |
DE60033583T2 (de) * | 1999-10-07 | 2007-09-13 | Corixa Corp. CSC, Wilmington | Eine mycobacterium tuberculosis kodierende sequenz zur expression von heterologen proteinen |
US6316205B1 (en) | 2000-01-28 | 2001-11-13 | Genelabs Diagnostics Pte Ltd. | Assay devices and methods of analyte detection |
DK2133100T3 (da) * | 2000-06-20 | 2012-01-23 | Corixa Corp | MTB32A-Antigen af Mycobacterium tuberculosis med inaktiveret aktivt sted og fusionsproteiner deraf |
AU2003213118A1 (en) * | 2002-02-15 | 2003-09-09 | Corixa Corporation | Fusion proteins of mycobacterium tuberculosis |
US8475735B2 (en) * | 2004-11-01 | 2013-07-02 | Uma Mahesh Babu | Disposable immunodiagnostic test system |
SG159554A1 (en) | 2004-11-16 | 2010-03-30 | Crucell Holland Bv | Multivalent vaccines comprising recombinant viral vectors |
US7608277B2 (en) * | 2004-12-01 | 2009-10-27 | Gene Therapy Systems, Inc. | Tuberculosis nucleic acids, polypeptides and immunogenic compositions |
WO2007005627A2 (en) | 2005-07-01 | 2007-01-11 | Forsyth Dental Infirmary For Children | Tuberculosis antigen detection assays and vaccines |
US20070081944A1 (en) * | 2005-09-30 | 2007-04-12 | Reed Steven G | Iontophoresis apparatus and method for the diagnosis of tuberculosis |
US20100008922A1 (en) * | 2006-03-13 | 2010-01-14 | Oragenics, Inc. | In Vivo Induced Genes of Mycobacterium Tuberculosis |
US20090148870A1 (en) * | 2006-05-18 | 2009-06-11 | Ericson Daniel G | Rapid Detection of Mycobacterium Tuberculosis and Antimicrobial Drug Resistance |
GB0618127D0 (en) * | 2006-09-14 | 2006-10-25 | Isis Innovation | Biomarker |
US20090181078A1 (en) | 2006-09-26 | 2009-07-16 | Infectious Disease Research Institute | Vaccine composition containing synthetic adjuvant |
PL2486938T3 (pl) | 2006-09-26 | 2018-08-31 | Infectious Disease Research Institute | Kompozycja szczepionki zawierająca syntetyczny adiuwant |
EP2368568A1 (en) | 2006-11-01 | 2011-09-28 | Immport Therapeutics, INC. | Compositions and methods for immunodominant antigens |
ES2650066T3 (es) * | 2008-05-28 | 2018-01-16 | Wako Pure Chemical Industries, Ltd. | Cebador y sonda para la detección de Mycobacterium intracellulare, y método para la detección de Mycobacterium intracellulare empleando el cebador o la sonda |
EA022203B1 (ru) | 2008-07-25 | 2015-11-30 | Глаксосмитклайн Байолоджикалс С.А. | Способы и композиции для лечения или предупреждения туберкулёза |
EP2315834B1 (en) | 2008-07-25 | 2018-06-13 | GlaxoSmithKline Biologicals S.A. | The tuberculosis rv2386c protein, compositions and uses thereof |
US20100159485A1 (en) * | 2008-12-19 | 2010-06-24 | Centre For Dna Fingerprinting And Diagnostics | Detection of mycobacterium tuberculosis |
WO2010089098A1 (de) | 2009-02-05 | 2010-08-12 | Deklatec Gmbh | Verfahren und mittel für die tuberkulose diagnostik |
ES2606563T3 (es) | 2009-06-05 | 2017-03-24 | Infectious Disease Research Institute | Adyuvantes lipídicos de glucopiranosilo sintéticos y composiciones de vacuna que contienen los mismos |
JP6010463B2 (ja) | 2010-01-27 | 2016-10-19 | グラクソスミスクライン バイオロジカルズ ソシエテ アノニム | 改変された結核抗原 |
EP2694099B1 (en) | 2011-04-08 | 2019-10-16 | Immune Design Corp. | Immunogenic compositions and methods of using the compositions for inducing humoral and cellular immune responses |
EP2573107A1 (en) * | 2011-09-21 | 2013-03-27 | Norwegian Institute of Public Health | A recombinant fusion protein |
LT2811981T (lt) | 2012-02-07 | 2019-06-10 | Infectious Disease Research Institute | Pagerintos adjuvanto kompozicijos, apimančios tlr4 agonistus, ir jų panaudojimo būdai |
PL2850431T3 (pl) | 2012-05-16 | 2018-09-28 | Immune Design Corp. | Szczepionki przeciwko HSV-2 |
SG11201508092YA (en) | 2013-04-18 | 2015-10-29 | Immune Design Corp | Gla monotherapy for use in cancer treatment |
US9463198B2 (en) | 2013-06-04 | 2016-10-11 | Infectious Disease Research Institute | Compositions and methods for reducing or preventing metastasis |
WO2015083056A2 (en) | 2013-12-03 | 2015-06-11 | Kusuma School Of Biological Sciences | Genetic markers for diagnosis of tuberculosis caused by mycobacterium tuberculosis |
EA201691348A1 (ru) | 2013-12-31 | 2016-11-30 | Инфекшес Дизиз Рисерч Инститьют | Однофлаконные вакцинные составы |
MX2018014399A (es) | 2016-06-01 | 2019-06-06 | Infectious Disease Res Inst | Particulas de nanoalumbre que contienen un agente de dimensionamiento. |
CA3059408A1 (en) * | 2016-09-22 | 2018-04-12 | Pace Diagnostics, Inc | Mycobacterium tuberculosis proteins in diagnostic assays and devices for tuberculosis detection and diagnosis |
CA3141577A1 (en) | 2019-05-25 | 2020-12-03 | Infectious Disease Research Institute | Composition and method for spray drying an adjuvant vaccine emulsion |
Family Cites Families (86)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU5856073A (en) | 1973-06-28 | 1975-01-30 | Toru Tsumits | Tuberculin active proteins and peptides |
FR2244539A1 (en) | 1973-07-13 | 1975-04-18 | Mitsui Pharmaceuticals | Tuberculin active proteins and peptides prepn - from tubercle bacillus |
FR2265402A1 (en) | 1974-03-29 | 1975-10-24 | Mitsui Pharmaceuticals | Tuberculin active proteins and peptides prepn - from tubercle bacillus |
US4235877A (en) | 1979-06-27 | 1980-11-25 | Merck & Co., Inc. | Liposome particle containing viral or bacterial antigenic subunit |
US4769330A (en) | 1981-12-24 | 1988-09-06 | Health Research, Incorporated | Modified vaccinia virus and methods for making and using the same |
US4603112A (en) | 1981-12-24 | 1986-07-29 | Health Research, Incorporated | Modified vaccinia virus |
US4436727A (en) | 1982-05-26 | 1984-03-13 | Ribi Immunochem Research, Inc. | Refined detoxified endotoxin product |
US4866034A (en) | 1982-05-26 | 1989-09-12 | Ribi Immunochem Research Inc. | Refined detoxified endotoxin |
US4876089A (en) | 1984-09-06 | 1989-10-24 | Chiron Corporation | Feline leukemia virus protein vaccines |
US4751180A (en) | 1985-03-28 | 1988-06-14 | Chiron Corporation | Expression using fused genes providing for protein product |
US4689397A (en) | 1985-08-12 | 1987-08-25 | Scripps Clinic And Research Foundation | Synthetic polypeptides for detecting mycobacterial infections |
US4777127A (en) | 1985-09-30 | 1988-10-11 | Labsystems Oy | Human retrovirus-related products and methods of diagnosing and treating conditions associated with said retrovirus |
US4935233A (en) | 1985-12-02 | 1990-06-19 | G. D. Searle And Company | Covalently linked polypeptide cell modulators |
US4877611A (en) | 1986-04-15 | 1989-10-31 | Ribi Immunochem Research Inc. | Vaccine containing tumor antigens and adjuvants |
US4946778A (en) | 1987-09-21 | 1990-08-07 | Genex Corporation | Single polypeptide chain binding molecules |
US5811128A (en) | 1986-10-24 | 1998-09-22 | Southern Research Institute | Method for oral or rectal delivery of microencapsulated vaccines and compositions therefor |
US4879213A (en) | 1986-12-05 | 1989-11-07 | Scripps Clinic And Research Foundation | Synthetic polypeptides and antibodies related to Epstein-Barr virus early antigen-diffuse |
AU1548388A (en) | 1987-02-02 | 1988-08-24 | Whitehead Institute For Biomedical Research | Mycobacterium tuberculosis genes and encoding protein antigens |
GB8702816D0 (en) | 1987-02-07 | 1987-03-11 | Al Sumidaie A M K | Obtaining retrovirus-containing fraction |
US4952395A (en) | 1987-02-26 | 1990-08-28 | Scripps Clinic And Research Foundation | Mycobacterial recombinants and peptides |
US4976958A (en) | 1987-02-26 | 1990-12-11 | Scripps Clinic And Research Foundation | Mycobacterial recombinants and peptides |
US5504005A (en) | 1987-03-02 | 1996-04-02 | Albert Einstein College Of Medicine Of Yeshiva University | Recombinant mycobacterial vaccine |
US5583112A (en) | 1987-05-29 | 1996-12-10 | Cambridge Biotech Corporation | Saponin-antigen conjugates and the use thereof |
US4897268A (en) | 1987-08-03 | 1990-01-30 | Southern Research Institute | Drug delivery system and method of making the same |
AP129A (en) | 1988-06-03 | 1991-04-17 | Smithkline Biologicals S A | Expression of retrovirus gag protein eukaryotic cells |
US4912094B1 (en) | 1988-06-29 | 1994-02-15 | Ribi Immunochem Research Inc. | Modified lipopolysaccharides and process of preparation |
US5108745B1 (en) | 1988-08-16 | 1998-06-30 | Univ California | Tuberculosis and legionellosis vaccines and methods for their production |
US5549910A (en) | 1989-03-31 | 1996-08-27 | The Regents Of The University Of California | Preparation of liposome and lipid complex compositions |
JPH0775324B2 (ja) | 1989-08-18 | 1995-08-09 | 日本電気株式会社 | 携帯無線機 |
IL95742A (en) | 1989-09-19 | 2000-12-06 | Innogenetics Nv | Recombinant polypeptides and peptides nucleic acids coding for the same and use of these polypeptides and peptides in the diagnosis of tuberculosis |
EP0419355B1 (en) | 1989-09-19 | 2000-02-09 | N.V. Innogenetics S.A. | Recombinant polypeptides and peptides, nucleic acids coding for the same and use of these polypeptides and peptides in the diagnostic of tuberculosis |
US5707644A (en) | 1989-11-04 | 1998-01-13 | Danbiosyst Uk Limited | Small particle compositions for intranasal drug delivery |
SE9001105D0 (sv) | 1990-03-27 | 1990-03-27 | Astra Ab | New methods foer diagnosis of tuberculosis |
US5466468A (en) | 1990-04-03 | 1995-11-14 | Ciba-Geigy Corporation | Parenterally administrable liposome formulation comprising synthetic lipids |
JP3218637B2 (ja) | 1990-07-26 | 2001-10-15 | 大正製薬株式会社 | 安定なリポソーム水懸濁液 |
WO1992004049A1 (en) | 1990-09-06 | 1992-03-19 | Rijksuniversiteit Te Utrecht | Inhibitor of lymphocyte response and immune-related disease |
AU660430B2 (en) | 1990-11-08 | 1995-06-29 | Stanford Rook Limited | Mycobacterium as adjuvant for antigens |
US5145684A (en) | 1991-01-25 | 1992-09-08 | Sterling Drug Inc. | Surface modified drug nanoparticles |
EP0499003A1 (en) | 1991-02-14 | 1992-08-19 | N.V. Innogenetics S.A. | Polypeptides and peptides, particularly recombinant polypeptides and peptides, nucleic acids coding for the same and use of these polypeptides and peptides in the diagnosis of tuberculosis |
RU2024021C1 (ru) | 1991-05-05 | 1994-11-30 | Научно-исследовательский институт хирургии Восточно-Сибирского филиала СО РАМН | Способ диагностики туберкулеза |
DE4116249A1 (de) | 1991-05-17 | 1992-11-19 | Biotechnolog Forschung Gmbh | Hybrid-plasmid fuer m.-tuberculosis-antigen, e. coli als wirt und antigen |
FR2677365B1 (fr) | 1991-06-07 | 1995-08-04 | Pasteur Institut | Proteines de mycobacterium et applications. |
US5240856A (en) | 1991-10-23 | 1993-08-31 | Cellpro Incorporated | Apparatus for cell separation |
US6037135A (en) | 1992-08-07 | 2000-03-14 | Epimmune Inc. | Methods for making HLA binding peptides and their uses |
WO1993023011A1 (en) | 1992-05-18 | 1993-11-25 | Minnesota Mining And Manufacturing Company | Transmucosal drug delivery device |
US5330754A (en) | 1992-06-29 | 1994-07-19 | Archana Kapoor | Membrane-associated immunogens of mycobacteria |
NL9202197A (nl) | 1992-12-17 | 1994-07-18 | Kreatech Biotech Bv | Werkwijze en inrichting voor het identificeren van een voor een mycobacteriële infectie verantwoordelijk mycobacterium species. |
WO1994015635A1 (en) | 1993-01-11 | 1994-07-21 | Dana-Farber Cancer Institute | Inducing cytotoxic t lymphocyte responses |
US5616500A (en) | 1993-04-30 | 1997-04-01 | The United States Of America As Represented By The Department Of Health & Human Services | Trichohyalin and transglutaminase-3 and methods of using same |
DK79793D0 (da) | 1993-07-02 | 1993-07-02 | Statens Seruminstitut | Diagnostic test |
DK79893D0 (da) | 1993-07-02 | 1993-07-02 | Statens Seruminstitut | New vaccine |
US5955077A (en) | 1993-07-02 | 1999-09-21 | Statens Seruminstitut | Tuberculosis vaccine |
US5639653A (en) | 1993-07-19 | 1997-06-17 | Albert Einstein College Of Medicine Of Yeshiva University, A Division Of Yeshiva Universtiy | Method for proliferating Vγ2Vδ2 T cells |
US5543158A (en) | 1993-07-23 | 1996-08-06 | Massachusetts Institute Of Technology | Biodegradable injectable nanoparticles |
DE69435138D1 (de) | 1993-11-23 | 2008-10-23 | Univ California | Reichlich vorhandene extrazelluläre produkte und methoden zu ihrer herstellung sowie ihre verwendung |
GB9409985D0 (en) | 1994-05-18 | 1994-07-06 | Medical Res Council | Vaccine against mycobacterial infections |
US5736524A (en) | 1994-11-14 | 1998-04-07 | Merck & Co.,. Inc. | Polynucleotide tuberculosis vaccine |
DE4446509A1 (de) | 1994-12-24 | 1996-06-27 | Sel Alcatel Ag | Verfahren zur Herstellung von Leiterbahnen auf einem Vertiefungen aufweisenden Substrat |
US5795587A (en) | 1995-01-23 | 1998-08-18 | University Of Pittsburgh | Stable lipid-comprising drug delivery complexes and methods for their production |
US5714593A (en) * | 1995-02-01 | 1998-02-03 | Institut Pasteur | DNA from mycobacterium tuberculosis which codes for a 45/47 kilodalton protein |
US5580579A (en) | 1995-02-15 | 1996-12-03 | Nano Systems L.L.C. | Site-specific adhesion within the GI tract using nanoparticles stabilized by high molecular weight, linear poly (ethylene oxide) polymers |
IE960132A1 (en) | 1995-03-13 | 1996-09-18 | Astra Ab | New DNA molecules |
US6290969B1 (en) | 1995-09-01 | 2001-09-18 | Corixa Corporation | Compounds and methods for immunotherapy and diagnosis of tuberculosis |
US6592877B1 (en) | 1995-09-01 | 2003-07-15 | Corixa Corporation | Compounds and methods for immunotherapy and diagnosis of tuberculosis |
EP0851927B1 (en) | 1995-09-01 | 2003-10-22 | Corixa Corporation | Compounds for immunotherapy and diagnosis of tuberculosis |
US6338852B1 (en) | 1995-09-01 | 2002-01-15 | Corixa Corporation | Compounds and methods for diagnosis of tuberculosis |
MX9801687A (es) | 1995-09-01 | 1998-11-29 | Corixa Corp | Compuestos y metodos para el diagnostico de tuberculosis. |
US6458366B1 (en) | 1995-09-01 | 2002-10-01 | Corixa Corporation | Compounds and methods for diagnosis of tuberculosis |
WO1997033909A2 (en) | 1996-03-15 | 1997-09-18 | Corixa Corporation | Compounds and methods for immunotherapy and immunodiagnosis of prostate cancer |
US5985287A (en) | 1996-08-29 | 1999-11-16 | Genesis Research And Development Corporation Limited | Compounds and methods for treatment and diagnosis of mycobacterial infections |
US6284255B1 (en) | 1996-08-29 | 2001-09-04 | Genesis Research & Development Corporation Limited | Compounds and methods for treatment and diagnosis of mycobacterial infections |
US5856462A (en) | 1996-09-10 | 1999-01-05 | Hybridon Incorporated | Oligonucleotides having modified CpG dinucleosides |
US6350456B1 (en) | 1997-03-13 | 2002-02-26 | Corixa Corporation | Compositions and methods for the prevention and treatment of M. tuberculosis infection |
US6544522B1 (en) | 1998-12-30 | 2003-04-08 | Corixa Corporation | Fusion proteins of mycobacterium tuberculosis antigens and their uses |
US6627198B2 (en) | 1997-03-13 | 2003-09-30 | Corixa Corporation | Fusion proteins of Mycobacterium tuberculosis antigens and their uses |
US6113918A (en) | 1997-05-08 | 2000-09-05 | Ribi Immunochem Research, Inc. | Aminoalkyl glucosamine phosphate compounds and their use as adjuvants and immunoeffectors |
US6355257B1 (en) | 1997-05-08 | 2002-03-12 | Corixa Corporation | Aminoalkyl glucosamine phosphate compounds and their use as adjuvants and immunoeffectors |
US6613881B1 (en) | 1997-05-20 | 2003-09-02 | Corixa Corporation | Compounds for immunotherapy and diagnosis of tuberculosis and methods of their use |
US7087713B2 (en) | 2000-02-25 | 2006-08-08 | Corixa Corporation | Compounds and methods for diagnosis and immunotherapy of tuberculosis |
US6555653B2 (en) | 1997-05-20 | 2003-04-29 | Corixa Corporation | Compounds for diagnosis of tuberculosis and methods for their use |
BR9909472A (pt) | 1998-04-07 | 2001-09-11 | Corixa Corp | Polipeptìdeo purificado, processo para prevenir tuberculose, e, composição farmacêutica |
US6465633B1 (en) | 1998-12-24 | 2002-10-15 | Corixa Corporation | Compositions and methods of their use in the treatment, prevention and diagnosis of tuberculosis |
US8143386B2 (en) | 1999-04-07 | 2012-03-27 | Corixa Corporation | Fusion proteins of mycobacterium tuberculosis antigens and their uses |
CA2386841A1 (en) | 1999-10-07 | 2001-04-12 | Corixa Corporation | Fusion proteins of mycobacterium tuberculosis |
DK2133100T3 (da) | 2000-06-20 | 2012-01-23 | Corixa Corp | MTB32A-Antigen af Mycobacterium tuberculosis med inaktiveret aktivt sted og fusionsproteiner deraf |
AU2003213118A1 (en) | 2002-02-15 | 2003-09-09 | Corixa Corporation | Fusion proteins of mycobacterium tuberculosis |
-
1998
- 1998-05-05 US US09/072,596 patent/US6458366B1/en not_active Expired - Fee Related
-
1999
- 1999-02-17 WO PCT/US1999/003265 patent/WO1999042118A2/en not_active Application Discontinuation
- 1999-02-17 JP JP2000532132A patent/JP2002530050A/ja not_active Withdrawn
- 1999-02-17 CN CN99809541A patent/CN1312723A/zh active Pending
- 1999-02-17 AU AU26819/99A patent/AU2681999A/en not_active Abandoned
- 1999-02-17 EP EP99907065A patent/EP1091749A1/en not_active Withdrawn
- 1999-02-17 BR BRPI9908083-4A patent/BR9908083A/pt not_active IP Right Cessation
- 1999-02-17 CA CA002322617A patent/CA2322617A1/en not_active Abandoned
-
2002
- 2002-07-10 US US10/193,002 patent/US6949246B2/en not_active Expired - Fee Related
-
2005
- 2005-03-15 US US11/082,005 patent/US7122196B2/en not_active Expired - Fee Related
-
2006
- 2006-08-16 US US11/505,569 patent/US7906277B2/en not_active Expired - Fee Related
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101925819A (zh) * | 2007-12-28 | 2010-12-22 | 株式会社比尔生命 | 结核分歧杆菌复合群的免疫检出方法 |
CN101925819B (zh) * | 2007-12-28 | 2013-06-12 | 株式会社比尔生命 | 结核分歧杆菌复合群的免疫检出方法 |
CN103123353B (zh) * | 2007-12-28 | 2014-11-12 | 株式会社比尔生命 | 结核分歧杆菌复合群的免疫检出方法 |
CN103814139A (zh) * | 2011-04-01 | 2014-05-21 | 澳康姆生物实验室公司 | 用于检测无细胞的病原体特异性核酸的方法和试剂盒 |
CN103814139B (zh) * | 2011-04-01 | 2018-11-13 | 澳康姆生物实验室公司 | 用于检测无细胞的病原体特异性核酸的方法和试剂盒 |
CN106011296A (zh) * | 2016-08-01 | 2016-10-12 | 武汉大学 | 一种结核分枝杆菌Rv1768基因微滴数字PCR绝对定量检测试剂盒 |
CN113388623A (zh) * | 2020-03-11 | 2021-09-14 | 北京蛋白质组研究中心 | 结核分枝杆菌H37Rv新基因Rv2203c及其编码蛋白和应用 |
CN113004362A (zh) * | 2021-02-26 | 2021-06-22 | 南方科技大学 | 一种订书钉核酸、dna纳米机器人及其制备方法和应用 |
CN113004362B (zh) * | 2021-02-26 | 2023-08-15 | 南方科技大学 | 一种订书钉核酸、dna纳米机器人及其制备方法和应用 |
CN114034860A (zh) * | 2021-11-01 | 2022-02-11 | 宁夏大学 | 一种结核分枝杆菌mtb39a蛋白抗体间接elisa检测方法及其试剂盒 |
CN114989312A (zh) * | 2022-06-27 | 2022-09-02 | 安徽理工大学 | 一种结合分枝杆菌融合蛋白dr2及其应用 |
CN114989312B (zh) * | 2022-06-27 | 2023-07-21 | 安徽理工大学 | 一种结核分枝杆菌融合蛋白dr2及其应用 |
Also Published As
Publication number | Publication date |
---|---|
US7122196B2 (en) | 2006-10-17 |
US6458366B1 (en) | 2002-10-01 |
EP1091749A1 (en) | 2001-04-18 |
US20030135026A1 (en) | 2003-07-17 |
AU2681999A (en) | 1999-09-06 |
CA2322617A1 (en) | 1999-08-26 |
US6949246B2 (en) | 2005-09-27 |
US20050181419A1 (en) | 2005-08-18 |
JP2002530050A (ja) | 2002-09-17 |
US7906277B2 (en) | 2011-03-15 |
WO1999042118A2 (en) | 1999-08-26 |
US20070141087A1 (en) | 2007-06-21 |
WO1999042118A8 (en) | 1999-10-21 |
BR9908083A (pt) | 2007-07-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1312723A (zh) | 诊断结核病的化合物和方法 | |
CN1117149C (zh) | 用于免疫治疗和诊断结核病的化合物和方法 | |
US6592877B1 (en) | Compounds and methods for immunotherapy and diagnosis of tuberculosis | |
CN1154730C (zh) | 用于结核病诊断的化合物和方法 | |
CZ126599A3 (cs) | Polypeptid pro imunoterapii a diagnosu tuberkulosy | |
US6350456B1 (en) | Compositions and methods for the prevention and treatment of M. tuberculosis infection | |
WO1998016646A9 (en) | Compounds and methods for immunotherapy and diagnosis of tuberculosis | |
SA99200488B1 (ar) | تركيبات وطرق لاج والوقاية من الاصابة بعدوي بكتيريا العصيات الفطرية للدرنM.tuberculosis | |
CZ126699A3 (cs) | Polypeptid obsahující antigenní část rozpustného antigenu M. tuberculosis nebo variantu uvedeného antigenu, molekula DNA kódující tento polypeptid, expresivní vektor, hostitelská buňka, způsoby diagnostiky infekce M. tuberculosis a diagnostické kity | |
US6338852B1 (en) | Compounds and methods for diagnosis of tuberculosis | |
CN1268745C (zh) | B组链球菌抗原 | |
CN1437653A (zh) | 抗原多肽 | |
CN1235555A (zh) | 治疗和诊断分枝杆菌感染的化合物和方法 | |
CN1336957A (zh) | 来自脑膜炎奈瑟氏球菌的basb006多核苷酸和多肽 | |
CN1294632A (zh) | 得自母牛分枝杆菌的组合物及其使用方法 | |
CN1235513A (zh) | 与幽门螺杆菌及其疫苗组合物有关的核酸和氨基酸序列 | |
CN1261403A (zh) | 新的肺炎衣原体表面蛋白 | |
CN1241212A (zh) | 免疫治疗和诊断结核病的化合物和方法 | |
CN1242047A (zh) | 诊断结核病的化合物和方法 | |
CN1210401C (zh) | 源自粘膜炎莫拉氏菌的化合物 | |
MXPA99003393A (en) | Compounds and methods for diagnosis of tuberculosis | |
MXPA99003392A (en) | Compounds and methods for immunotherapy and diagnosis of tuberculosis | |
CN1326509A (zh) | 粘膜炎莫拉氏菌basb034多肽及应用 | |
CN1371389A (zh) | 疫苗 | |
CN1420930A (zh) | 新颖的化合物 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: WD Ref document number: 1040366 Country of ref document: HK |