CN110891600B - Recombinant measles virus expressing Zika virus protein and application thereof - Google Patents

Recombinant measles virus expressing Zika virus protein and application thereof Download PDF

Info

Publication number
CN110891600B
CN110891600B CN201880047044.6A CN201880047044A CN110891600B CN 110891600 B CN110891600 B CN 110891600B CN 201880047044 A CN201880047044 A CN 201880047044A CN 110891600 B CN110891600 B CN 110891600B
Authority
CN
China
Prior art keywords
zikv
protein
seq
gly
leu
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201880047044.6A
Other languages
Chinese (zh)
Other versions
CN110891600A (en
Inventor
F·坦吉
E·西蒙-洛列里
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Centre National de la Recherche Scientifique CNRS
Institut Pasteur de Lille
Original Assignee
Centre National de la Recherche Scientifique CNRS
Institut Pasteur de Lille
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Centre National de la Recherche Scientifique CNRS, Institut Pasteur de Lille filed Critical Centre National de la Recherche Scientifique CNRS
Publication of CN110891600A publication Critical patent/CN110891600A/en
Application granted granted Critical
Publication of CN110891600B publication Critical patent/CN110891600B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies
    • A61K39/12Viral antigens
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/005Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N7/00Viruses; Bacteriophages; Compositions thereof; Preparation or purification thereof
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies
    • A61K2039/51Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
    • A61K2039/525Virus
    • A61K2039/5254Virus avirulent or attenuated
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies
    • A61K2039/51Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
    • A61K2039/53DNA (RNA) vaccination
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2710/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
    • C12N2710/00011Details
    • C12N2710/14011Baculoviridae
    • C12N2710/14034Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2710/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
    • C12N2710/00011Details
    • C12N2710/14011Baculoviridae
    • C12N2710/14041Use of virus, viral particle or viral elements as a vector
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2760/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
    • C12N2760/00011Details
    • C12N2760/18011Paramyxoviridae
    • C12N2760/18411Morbillivirus, e.g. Measles virus, canine distemper
    • C12N2760/18421Viruses as such, e.g. new isolates, mutants or their genomic sequences
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2760/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
    • C12N2760/00011Details
    • C12N2760/18011Paramyxoviridae
    • C12N2760/18411Morbillivirus, e.g. Measles virus, canine distemper
    • C12N2760/18423Virus like particles [VLP]
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2770/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
    • C12N2770/00011Details
    • C12N2770/24011Flaviviridae
    • C12N2770/24111Flavivirus, e.g. yellow fever virus, dengue, JEV
    • C12N2770/24122New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2770/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
    • C12N2770/00011Details
    • C12N2770/24011Flaviviridae
    • C12N2770/24111Flavivirus, e.g. yellow fever virus, dengue, JEV
    • C12N2770/24123Virus like particles [VLP]
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2770/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
    • C12N2770/00011Details
    • C12N2770/24011Flaviviridae
    • C12N2770/24111Flavivirus, e.g. yellow fever virus, dengue, JEV
    • C12N2770/24134Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N2333/00Assays involving biological materials from specific organisms or of a specific nature
    • G01N2333/005Assays involving biological materials from specific organisms or of a specific nature from viruses
    • G01N2333/08RNA viruses
    • G01N2333/115Paramyxoviridae, e.g. parainfluenza virus
    • G01N2333/12Mumps virus; Measles virus
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A50/00TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
    • Y02A50/30Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Medicinal Chemistry (AREA)
  • Virology (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Immunology (AREA)
  • Microbiology (AREA)
  • Biochemistry (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Engineering & Computer Science (AREA)
  • Mycology (AREA)
  • Veterinary Medicine (AREA)
  • Public Health (AREA)
  • Animal Behavior & Ethology (AREA)
  • Epidemiology (AREA)
  • Pharmacology & Pharmacy (AREA)
  • General Engineering & Computer Science (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Biotechnology (AREA)
  • Biomedical Technology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)

Abstract

The present invention relates to recombinant measles virus expressing the Zika virus protein and its use, in particular in inducing prophylactic protection against Zika virus. The present invention relates to recombinant Measles Virus (MV) expressing at least (i) the membrane precursor (prM) protein of ZIKV and the envelope (E) protein of ZIKV or a truncated version thereof or (ii) the E protein of ZIKV or a truncated version thereof, and to recombinant infectious particles of said MV-ZIKV capable of replication in a host after administration, and also to virus-like particles (VLPs) comprising these ZIKV proteins on their surface. The present invention provides means for producing these recombinant infectious particles and VLPs, in particular nucleic acids, vectors, cells and rescue systems. The invention also relates to the use of these recombinant infectious particles and/or VLPs, in particular in the form of a composition, more particularly in vaccine formulations, for the prevention of ZIKV infection or for the prophylactic protection against clinical consequences of ZIKV infection.

Description

Recombinant measles virus expressing Zika virus protein and application thereof
The present invention relates to recombinant measles virus expressing the Zika virus protein and its use, in particular in inducing prophylactic protection against Zika virus. The present invention relates to recombinant Measles Virus (MV) expressing at least (i) the membrane precursor (prM) protein of ZIKV and the envelope (E) protein of ZIKV or a truncated version thereof or (ii) the E protein of ZIKV or a truncated version thereof, and to recombinant infectious particles of said MV-ZIKV capable of replication in a host after administration, and also to virus-like particles (VLPs) comprising these ZIKV proteins on their surface. The present invention provides means for producing these recombinant infectious particles and VLPs, in particular nucleic acids, vectors, cells and rescue systems. The invention also relates to the use of these recombinant infectious particles and/or VLPs, in particular in the form of a composition, more particularly in vaccine formulations, for the prevention of ZIKV infection or for the prophylactic protection against clinical consequences of ZIKV infection.
ZIKV is a new occurrence of mosquito-borne flaviviruses. Although it was originally isolated in 1947, to date, there is no specific therapeutic approach or any vaccine available against ZIKV disease, making it a truly neglected and emerging disease. ZIKV has recently spread rapidly in previously unaffected areas (e.g., the south pacific islands and latin america), providing strong epidemiological evidence that infection with this virus may be associated with neurological complications in adults and with severe congenital brain deformities in newborns. Therefore, the World Health Organization (WHO) has declared the most recently exploded ZIKV as a public health incident.
ZIKV was originally isolated in 1947 from rhesus monkeys in the forest of Uganda village cards (Gubler DJ, et al, eds. Fields Virology,5th edn.Philadelphia,PA:Lippincott Williams&Wilkins Publishers,2007:1155-227;Dick GWA,et al.Trans R Soc Trop Med Hyg 1952;46:509-20). The first human infection was reported in Nigeria in 1954 (Macnamara FN. Trans R Soc Trop Med Hyg 1954; 48:139-45). Like dengue and chikungunya viruses, ZIKV is a cycle of ancestral transmission involving non-human primates and mediated by a broad spectrum of forest mosquito species to urban circulation that is adapted to be mediated by Aedes (Aedes) mosquitoes involving humans as reservoirs and a broad distribution (Musso D, et al Lancet 2015; 386:243-44). Since the 50 s of the 20 th century, only ZIKV was reported to spread sporadically in africa and southeast asia. ZIKV was first isolated in the Pacific on the Cronenia island Yapu (Duffy MR, et al N Engl J Med 2009; 360:2536-43). Between 10 in 2013 and 4 in 2014, the largest burst of card by Fabry occurred since the current history (Cao-Lormeau VM, et al Emerg information Dis 2013; 20:1085-86). More than 32000 patients were suspected of being infected with ZIKV. ZIKV has spread to other pacific islands, especially kuke islands and easter islands (chile), during 2014 to 2015. 3 months of 2015, brazil reported autogenous transmission of ZIKV (Zanluca C, et al Mem Inst Oswaldo Cruz 2015; 110:569-72) and announced an unprecedented outbreak after 6 months (Dyer O.BMJ 2015; 351:h6983), which was 12 months of 2015, with a preliminary estimate of 44 to 130 thousands of cases of infection (European disease prevention and control center, 12 months of 2015, 10). By month 3 of 2016, 43 countries and regions worldwide reported ZIKV infection.
The current Zika epidemic is the most epidemic of the virus since its record (Abushouk et al an updated review of Zika virus, J.Clin. Virol.2016,84,53-58). Although ZIKV infection is often associated with mild disease, its occurrence in america occurs simultaneously with a dramatic increase in patients who develop guillain-barre syndrome. In addition, ZIKV infection is associated with birth of infants with neurological complications, particularly congenital microcephaly (WHO. Guillain-Barre syndrome-El Salvador.2016, 21. 1/21. ECDC. Quick rim estimate. Zika virus epidemic in the Americas: potential association with microcephaly and Guillain-Barre syndrome.2015, 10. 12. 10. Soares de Ara, jo J, et al Microcephaly in northeast Brazil: a review of 16 208 births between 2012 and 2015), and it has been shown that the risk of neonatal microcephaly increases from 2/10 000 to 1/100 when pregnant women are exposed to ZIKV during early pregnancy (Cauchemez S, et al Association between Zika virus and microcephaly in French Polynesia,2013-15:a retrospective study.The Lancet 2016). The WHO announced the suspected link between ZIKV and neurological disorders and neonatal deformities as an international public health incident of interest for month 2.
In this context, the specialists WHO aggregate, month 3 of 2016, agree that developing a prophylactic vaccine is a priority in the future to address the card epidemic. A realistic strategy is required to rapidly perform the development of safe and effective vaccines. Since there is an established link between ZIKV infection and congenital microcephaly in infants born by the infected mother, it is likely that the ZIKV vaccine will suggest that it must be suitable for use in pregnant women. However, currently no licensed vaccine is recommended for use during pregnancy. Furthermore, with the demonstration of the correlation of the Zika infection with Guillain-Barre syndrome, the observation of possible sexually transmitted and the occurrence of developmental defects that may occur in early gestation, it is likely that the vaccine should be directed against the general population. In any case, the zika vaccine will have to demonstrate excellent safety properties, in particular with respect to the risk of neurotropic.
To allow rapid execution of the development of the zhai card vaccine, the inventors used one of the safest and most effective available vaccines, the measles live attenuated vaccine, as a delivery vehicle for the ZIKV protective antigen, to ensure timely acquisition of a prophylactic vaccine whenever a new epidemic occurs. This delivery platform technology has demonstrated principle verification in humans, as well as rapid adaptation and preclinical follow-up records of the effectiveness of a variety of pathogens. In addition, the manufacturing process of these measles vector-based vaccines was optimized to give higher yields and purities than standard measles vaccine manufacturing processes. It uses standard equipment and thus helps to further scale up and transfer technology to lower and medium income countries.
Measles vaccination has been used for more than 40 years in over 10 billion children and is about 93% effective after one administration and about 97% effective after two administrations. Attenuated measles vaccine strains have been shown to be genetically stable. Restoration of pathogenicity or integration into the host cell genome is almost impossible and never observed. Using these features, the inventors have previously cloned an attenuated measles Schwarz vaccine virus and developed a method for genetic manipulation of this negative strand RNA virus into a universal chimeric or recombinant vector (Combredet, C.et al, 2003, J Virol,77 (21): 11546-11554).
As with any other target, a prophylactic vaccine against ZIKV must be safe and effective. In addition, the special epidemiology of rapidly emerging viruses affecting industrialized and developing countries, as well as the threat of infection during pregnancy that poses serious birth defects, require several additional features of the ideal ZIKV vaccine.
ZIKV infection during pregnancy is strongly suspected to lead to birth defects. Although live vaccines are usually disabled during pregnancy, measles infection is not associated with birth defects (Rasmussen SA, et al, obstet Gynecol 2015 Jul;126 (1): 163-70), and accidental use of MMR vaccine during pregnancy is not associated with congenital birth defects (Swamy GK, et al, obstet Gynecol 2015Jan;125 (1): 212-26). In contrast to measles-based vaccines according to the invention, if the method of live attenuated card vaccine is applied accidentally during pregnancy, it would cause very significant safety problems. Serious questions must be raised if a vaccine against the card, intended for use during pregnancy, can be developed and licensed within any acceptable time frame to prevent current epidemics. In contrast, a vaccine for adolescents with minimal safety concerns for accidental use during pregnancy appears to be the most practical and realistic intervention to eliminate the zika-induced disease. A vaccine based on measles would just meet this target property.
The measles-based method of the invention may meet all relevant criteria for future ZIKV vaccines, at least as good as or better than the alternative methods. In particular, the adjuvant-free measles-based ZIKV vaccine for children, adolescents and travelers represents one of the most likely candidates to be developed in a short time frame, which has excellent safety and efficacy properties, and whose production and cost characteristics are compatible with its use in countries where economical viability is limited.
To this end, the inventors of the present application defined a sequential development path (sequential development path). The first stage is the construction and characterization of recombinant MVs that express at least the ZIKV prM-E protein or E protein as a soluble secreted antigen. Characterization included confirmation of expression of the Zika antigen, establishment of growth characteristics in the production cell line, and genetic stability analysis. The pre-clinical immunogenicity and protective efficacy of the selected recombinant MV-zika vaccine was evaluated in MV-infected CD46-IFNAR mice. The immunogenicity and protective efficacy of the current best candidate vaccine was evaluated in a non-human primate model of ZIKV infection.
The inventors have achieved the generation of a vaccine based on recombinant infectious replicative MVs, recombinant with polynucleotides encoding at least the ZIKV prM-E antigen or E antigen, which is recovered upon replication of the recombinant virus, in particular upon replication in a host after administration. The present invention therefore relates to a ZIKV live vaccine active ingredient based on widely used pediatric vaccines for measles, in particular from the Schwarz strain. In a preferred embodiment, the recombinant live MV-ZIKV vaccine produces ZIKV VLPs by replication in infected cells.
MV is a segment-free single-stranded negative-sense enveloped RNA virus of the genus measles virus (Morbilivirus) in the Paramyxoviridae family. The virus was isolated in 1954 (Enders, J.F., and T.C.Peebles.1954. Propanation in tissue cultures of cytopathogenic agents from patients with measures. Proc.Soc.exp.biol.Med.86:277-286), and attenuated live vaccines have since been derived from the virus, in particular from Schwarz strain, to provide vaccine strains. Over the last 30 years, measles vaccine has been administered to hundreds of millions of children and demonstrated its effectiveness and safety. It is mass produced in many countries and distributed at low cost. For all these reasons, the inventors used attenuated MVs to generate recombinant MV particles that stably expressed prM-E antigen or E antigen of ZIKV and possibly also were able to express VLPs.
Accordingly, the present invention relates to a nucleic acid construct comprising:
(1) A polynucleotide encoding at least (i) a membrane precursor (prM) protein of ZIKV or a truncated version thereof, or (ii) an E protein of ZIKV or a truncated version thereof; and
(2) A cDNA molecule encoding the full-length infectious antigenomic (+) RNA strand of Measles Virus (MV);
Wherein said polynucleotide encoding at least (i) a prM protein of ZIKV and an E protein of ZIKV or a truncated version thereof or (ii) an E protein of ZIKV or a truncated version thereof is operably linked to, in particular cloned into, said cDNA molecule.
The nucleic acid construct according to the invention is in particular a purified DNA molecule which is obtained or obtainable by recombination of a plurality of polynucleotides of different origin which are operably linked together.
The expression "operably linked" refers to a functional linkage between different polynucleotides present in the nucleic acid construct of the invention such that the different polynucleotides and nucleic acid construct are transcribed and, if appropriate, translated efficiently, in particular in a cell or cell line that is part of a rescue system for producing the chimeric infectious MV particles of the invention, or in a host cell, in particular in a human cell.
In a particular embodiment of the invention, the construct is prepared by cloning a polynucleotide encoding at least (i) the prM protein of ZIKV and the E protein of ZIKV or a truncated version thereof or (ii) the E protein of ZIKV or a truncated version thereof in a cDNA encoding the full-length infectious antigenomic (+) RNA strand of MV. Alternatively, the nucleic acid construct of the invention may be prepared using synthetic nucleic acid fragments or steps from template polymerization (including by PCR).
In a specific embodiment of the invention, the polynucleotide encoding at least (i) the prM protein of ZIKV and the E protein of ZIKV or a truncated version thereof or (ii) the E protein of ZIKV or a truncated version thereof is cloned into an ATU (additional transcription unit) inserted in a cDNA molecule of MV. ATU sequences are known to the person skilled in the art and comprise cis-acting sequences necessary for MV-dependent transgene expression (e.g. the promoter of the gene at the front end in MV cDNA), inserts represented by polynucleotides encoding at least (i) the prM protein of ZIKV and a truncated version thereof or (ii) the E protein of ZIKV or a truncated version thereof) and a multiple cloning site cassette for insertion of said polynucleotides.
When used in the practice of the present invention, the ATU is preferably located in the N-terminal sequence of a cDNA molecule encoding the full-length (+) RNA strand of the MV antigenome, and in particular between the P gene and M gene or between the H gene and L gene of the virus. Transcription of viral RNA of MV has been observed to follow a gradient from the 5 'end to the 3' end. This means that when inserted 5' of the cDNA coding sequence, the ATU will be able to more efficiently express the heterologous DNA sequence it contains (e.g., a polynucleotide encoding at least (i) the prM protein of ZIKV and either the E protein of ZIKV or a truncated version thereof or (ii) the E protein of ZIKV or a truncated version thereof).
Thus, a polynucleotide encoding at least (i) the prM protein of ZIKV and the E protein of ZIKV or a truncated version thereof or (ii) the E protein of ZIKV or a truncated version thereof may be inserted into any intergenic region in the cDNA molecule of MV, in particular ATU. The specific construct of the invention is the construct shown in the examples.
In a preferred embodiment of the invention, the polynucleotide encoding at least (i) the prM protein of ZIKV and the E protein of ZIKV or a truncated version thereof or (ii) the E protein of ZIKV or a truncated version thereof is inserted in the intergenic region of the P gene and the M gene of the MV cDNA molecule, in particular in the ATU.
As used herein, the expression "encoding" defines the ability of a nucleic acid molecule to be transcribed and, where appropriate, translated so that the product is expressed to a selected cell or cell line. Thus, the nucleic acid construct may comprise regulatory elements which control transcription of the coding sequence, in particular promoter and termination sequences for transcription and possibly enhancers and other cis-acting elements. These regulatory elements may be heterologous with respect to the ZIKV polynucleotide sequence.
The term "protein" is used interchangeably with the terms "antigen" or "polypeptide" and defines a molecule produced by a cascade of amino acid residues. In particular, the proteins disclosed in the present application are derived from ZIKV and are structural proteins that may be identical to the native protein, or alternatively structural proteins from which they are derived by: the reduction of the fragment relative to the size of the native protein to be referred to is brought about by mutation, including by substitution, in particular by substitution of conserved amino acid residues, or by addition of amino acid residues, or by post-translational secondary modification, or by deletion of a part of the native protein. Fragments encompassed by the present invention in this sense carry epitopes of the native protein suitable for eliciting an immune response in a host, in particular in a human host, which immune response is preferably capable of a protective response against ZIKV infection or against ZIKV-associated diseases. Epitopes are in particular B-type epitopes which are involved in the eliciting of a humoral immune response by activating antibody production in a host which has been administered the protein or which has been expressed after administration of the infectious replicative particles of the invention. The epitope may alternatively be an epitope of a T-type epitope that is involved in the priming of a cell-mediated immune response (CMI response). The fragment size may represent more than 50%, preferably at least 90% or 95% of the amino acid sequence size of the ZIKV native protein. Alternatively, the fragment may be a short polypeptide having at least 10 amino acid residues, which carries an epitope of the native protein. In this regard, fragments also include polyepitopes as defined herein.
In a specific embodiment of the invention, the nucleic acid construct complies with the six (6) rule of the MV genome, i.e. the polynucleotide encoding at least (i) the prM protein of ZIKV and the E protein of ZIKV or a truncated version thereof or (ii) the E protein of ZIKV or a truncated version thereof, together with the cDNA molecule encoding the MV full-length infectious antigenomic (+) RNA strand is composed of a number of nucleotides that is a multiple of six.
The organization of the genome of MV and its replication and transcription processes have been well established in the prior art, and are disclosed in, inter alia, horikami SM and Moyer S.A. (Curr. Top. Microbiol. Immunol. (1995) 191,35-50) or Combredet C.et al (Journal of Virology, nov 2003, p 11546-11554) for Schwarz vaccine strains of the virus, or Neumann G.et al (Journal of General Virology (2002) 83, 2635-2662) for the widely recognized negative sense RNA viruses.
"six-digit rule" in fact means that the total number of nucleotides present in a nucleic acid representing the MV (+) strand RNA genome or in a nucleic acid construct comprising the nucleic acid is a multiple of six. In the prior art, the "six-digit rule" has been considered as a requirement for the total number of nucleotides in the genome of the MV, which enables efficient or optimized replication of MV genomic RNA. In embodiments of the invention that define a nucleic acid construct that satisfies the six-digit rule, the rule applies to the nucleic acid construct that specifies the cDNA encoding the full length MV (+) strand RNA genome and all inserted sequences, alone or together. In this regard, the six-digit rule applies to cdnas encoding full-length infectious antigenomic (+) RNA strands of MVs, and to polynucleotides cloned into the cdnas and encoding at least (i) the prM protein of ZIKV and the E protein of ZIKV or a truncated version thereof, or (ii) the E protein of ZIKV or a truncated version thereof.
In a specific embodiment of the invention, the nucleic acid construct comprises from 5 'to 3' the following polynucleotides:
(a) A polynucleotide encoding an N protein of MV;
(b) A polynucleotide encoding a P protein of MV;
(c) A polynucleotide encoding at least (i) a prM protein of ZIKV or a truncated version thereof or (ii) an E protein of ZIKV or a truncated version thereof;
(d) A polynucleotide encoding an M protein of MV;
(e) A polynucleotide encoding an F protein of MV;
(f) A polynucleotide encoding an H protein of MV; and
(g) A polynucleotide encoding an L protein of MV;
wherein the polynucleotide is operably linked in a nucleic acid construct and under the control of viral replication and transcription regulatory sequences such as MV leader and trailer sequences.
The expressions "N protein", "P protein", "M protein", "F protein", "H protein" and "L protein" refer to the nucleoprotein (N), phosphoprotein (P), matrix protein (M), fusion protein (F), hemagglutinin protein (H) and RNA polymerase macroprotein (L) of MV, respectively. These components have been identified in the prior art and are disclosed in particular in Fields, virology (Knipe & Howley, 2001).
In a preferred embodiment of the invention, the measles virus is an attenuated strain.
An "attenuated strain" of measles virus is defined as a strain that is non-toxic or less toxic than the parent strain in the same host, while retaining immunogenicity and possibly adjuvanticity, i.e. retaining immunodominant T and B cell epitopes and possibly adjuvanticity, e.g. inducing the T cell costimulatory protein or cytokine IL-12, when administered in the host.
Thus, an attenuated strain of MV refers to a strain that has been serially passaged on selected cells and possibly adapted to other cells to produce a seed virus strain suitable for preparing vaccine strains that has a stable genome that neither allows for recovery of pathogenicity nor integration into the host chromosome. As a specific "attenuated strain", a strain approved for use in a vaccine is one that meets FDA (U.S. food and drug administration) regulatory standards, i.e., meets safety, efficacy, quality and reproducibility standards, following stringent laboratory and clinical data reviews (www.fda.gov/cber/vaccinee/vacapp.htm) suitable for the present invention.
Specific attenuated strains of MV cDNA which can be used in the practice of the invention and in particular for deriving nucleic acid constructs are the Schwarz strain, the Zagreb strain, the AIK-C strain and the Moraten strain. All of these strains have been described in the prior art and their acquisition is provided, in particular, in commercial vaccines.
In a particular embodiment of the invention, the cDNA molecules are placed under the control of heterologous expression control sequences. Insertion of such controls for cDNA expression is advantageous when attempting to express the cDNA in cell types that are not fully transcribed with their native control sequences.
In a particular embodiment of the invention, the heterologous expression control sequences include a T7 promoter and a T7 terminator sequence. These sequences are located 5 'and 3' respectively to the coding sequence of the MV full-length antigenomic (+) RNA strand and are derived from adjacent sequences surrounding the coding sequence.
In a particular embodiment of the invention, the cDNA molecules defined above are modified, i.e.comprise additional nucleotide sequences or motifs.
In a preferred embodiment, the cDNA molecule of the invention further comprises a GGG motif at its 5 'end, immediately adjacent to the first nucleotide of the nucleotide sequence encoding the full-length antigenomic (+) RNA strand of the approved MV vaccine strain, followed by a hammerhead ribozyme sequence, and at its 3' end, immediately adjacent to the last nucleotide of the nucleotide sequence encoding the full-length antigenomic (+) RNA strand, comprising a ribozyme sequence. Hepatitis delta virus ribozyme (delta) (hepatitis delta virus ribozyme) is suitable for practicing the present invention.
The 5' -terminal GGG motif adjacent to the first nucleotide of the above coding sequence increases the transcription efficiency of the cDNA coding sequence. Since the requirement for correct assembly of measles virus particles is that the cDNA encoding the antigenomic (+) RNA of the nucleic acid construct of the invention complies with the six rules, when the GGG motif is added, a ribozyme is also added at the 5 'end of the cDNA coding sequence, at the 3' end of the GGG motif, so as to be able to cleave the transcript at the first coding nucleotide of the full length antigenomic (+) RNA strand of MV.
In a specific embodiment of the invention, the preparation of cDNA molecules encoding the MV full-length antigenomic (+) RNA disclosed in the prior art is accomplished by known methods for the preparation of the nucleic acid constructs of the invention. The cDNA provides, inter alia, a genomic vector when inserted into a vector, such as a plasmid.
Specific cDNA molecules suitable for use in preparing the nucleic acid constructs of the invention are those obtained using the Schwarz strain of MV. Thus, the cDNA used in the present invention may be obtained as disclosed in WO2004/000876, or may be obtained from the plasmid pTM-MVSchw deposited as No I-2889 on month 12 2002 at Institut Pasteur at the Collection Nationale de Culture de Microorganismes (CNCM), 28rue du Dr Roux,75724Paris Cedex 15,France, the sequence of which is disclosed in WO2004/000876, WO2004/000876 being incorporated herein by reference. Plasmid pTM-MVSchw has been obtained from the Bluescript plasmid and comprises a polynucleotide encoding the full length measles virus (+) RNA strand of the Schwarz strain, said polynucleotide being placed under the control of the T7 RNA polymerase promoter. It has 18967 nucleotides and the sequence shown in SEQ ID No. 1. cDNA molecules derived from other MV strains (also referred to as cDNA of measles virus or MV cDNA for convenience) can similarly be obtained starting from nucleic acids purified from viral particles of attenuated MVs (such as those disclosed herein).
The cDNA used in the present invention may also be obtained from the plasmid pTM2-MVSchw-gfp deposited at Institut Pasteur at the Collection Nationale de Culture de Microorganismes (CNCM), 28 rue du Dr Roux,75724 Paris Cedex 15,France as No. I-2890 on month 12 of 2002. It has 19795 nucleotides and the sequence shown in SEQ ID NO. 2. The plasmid contains a sequence encoding an eGFP marker, which may be deleted.
The nucleic acid construct of the invention is suitable and intended for use in the preparation of a recombinant infectious replicating measles-ZIKV (MV-ZIKV), and thus the nucleic acid construct is intended for insertion into a transfer genomic vector, which thus comprises a cDNA molecule of a measles virus, in particular of the Schwarz strain, for the production of said MV-ZIKV and the production of at least (i) the prM protein of ZIKV and the E protein of ZIKV or a truncated version thereof, or (ii) the E protein of ZIKV or a truncated version thereof, in particular ZIKV VLPs. The pTM-MVSchw plasmid or pTM2-MVSchw plasmid is suitable for the preparation of the transfer vector by insertion of ZIKV polynucleotides necessary for the expression of at least (i) the prM protein of ZIKV and also of the E protein or of a truncated version thereof or (ii) the E protein of ZIKV or of a truncated version thereof. Recombinant infectious replicative MV-ZIKV particles can be recovered from rescue helper cells or in production cells, and optionally with VLPs expressing ZIKV antigens according to the disclosure.
Thus, the present invention relates to transfer vectors for the preparation of recombinant MV-ZIKV particles upon rescue from helper cells. Advantageously, the transfer vector of the invention is a transfer vector plasmid suitable for transfecting said helper or producer cell, comprising the nucleic acid construct of the invention, in particular a plasmid obtained from a Bluescript plasmid, such as pMV-ZIKV.
In a specific embodiment of the invention, the transfer vector plasmid has the sequence of SEQ ID NO. 165, SEQ ID NO. 166 or SEQ ID NO. 167, preferably has the sequence of SEQ ID NO. 165.
The invention also relates to the use of said transfer vectors for transforming cells suitable for rescuing viral MV-ZIKV particles, in particular for transfecting or transducing such cells with plasmids or viral vectors carrying the nucleic acid constructs of the invention, respectively, said cells being selected for their ability to express MV proteins required for proper replication, transcription and encapsidation of the recombinant viral genome corresponding to the nucleic acid construct of the invention in the recombinant infectious replicative MV-ZIKV particles.
In a preferred embodiment, the invention relates to a transformed cell comprising in its genome the nucleic acid construct according to the invention or comprising the transfer vector plasmid according to the invention, wherein the cell is in particular a eukaryotic cell, such as an avian cell, in particular a CEF cell, a mammalian cell (e.g. HEK293 cell) or a yeast cell.
Thus, a polynucleotide encoding a protein comprising in particular N, P and L proteins of MV (i.e. the native MV protein or a functional variant thereof capable of forming Ribonucleoprotein (RNP) complexes), is present in said cells, preferably at least for N and P proteins, which play a role in replication and transcription of recombinant viral MV-ZIKV particles, as stably expressed proteins. The N and P proteins may be expressed in the cell by a plasmid containing their coding sequences, or may be expressed by a DNA molecule inserted into the genome of the cell. The L protein may be expressed from different plasmids. It may be transiently expressed. The helper cell is also capable of expressing an RNA polymerase, which is suitable for being able to synthesize recombinant RNA derived from the nucleic acid construct of the invention, possibly as a stably expressed RNA polymerase. The RNA polymerase may be a T7 phage polymerase or a nuclear form thereof (nlsT 7).
In an embodiment of the invention, the cDNA clone of MV is from the same MV strain as the N protein and/or P protein and/or L protein. In another embodiment of the invention, the cDNA clone of MV is from a different strain than the N protein and/or P protein and/or L protein.
The invention also relates to a process for the preparation of recombinant infectious Measles Virus (MV) particles, comprising:
1) Transferring, in particular transfecting, the nucleic acid construct of the invention or a transfer vector comprising the nucleic acid construct in a helper cell line which also expresses the proteins necessary for transcription, replication and encapsidation of the antigenomic (+) RNA sequence of MV from cDNA of MV and under conditions allowing viral particle assembly; and
2) Recovering recombinant infectious MV-ZIKV particles expressing at least (i) a prM protein of ZIKV or a truncated version thereof or (ii) an E protein of ZIKV or a truncated version thereof.
In a particular embodiment of the invention, the method comprises:
1) Transfecting a helper cell with the nucleic acid construct according to the invention and with a transfer vector, wherein the helper cell is capable of expressing a helper function to express RNA polymerase and to express N, P and L proteins of MV virus;
2) Co-culturing the transfected helper cells of step 1) with passaging cells suitable for passaging of the MV attenuated strain from which the cDNA originates;
3) Recovering the recombinant infectious MV-ZIKV particles that express at least (i) a prM protein of ZIKV and an E protein of ZIKV or a truncated version thereof or (ii) an E protein of ZIKV or a truncated version thereof.
In another particular embodiment of the invention, a method for producing recombinant infectious MV-ZIKV particles comprises:
1) Cell or cell culture stably producing RNA polymerase, N protein of MV and P protein of MV is recombined with the nucleic acid construct of the present invention and with a vector comprising nucleic acid encoding L protein of MV, and
2) Recovering the recombinant infectious MV-ZIKV particles from the recombinant cells or recombinant cell cultures.
In a particular embodiment of the method, a recombinant MV is produced that expresses at least (i) a prM protein of ZIKV and an E protein of ZIKV or a truncated version thereof or (ii) an E protein of ZIKV or a truncated version thereof, in particular a ZIKV VLP expressing the same ZIKV protein.
Preferably, the present invention relates to a method of rescuing recombinant infectious measles virus-ZIKV particles expressing at least (i) a membrane precursor (prM) protein of ZIKV and an envelope (E) protein of ZIKV or a truncated version thereof or (ii) an E protein of ZIKV or a truncated version thereof, and ZIKV VLPs expressing the same ZIKV protein, comprising:
1) Co-transfecting helper cells, in particular HEK293 helper cells, stably expressing T7 RNA polymerase and measles N and P proteins with (i) a transfer vector plasmid according to the invention and (ii) a vector encoding MV L polymerase, in particular a plasmid;
2) Culturing the co-transfected helper cells under conditions capable of producing recombinant MV-ZIKV particles;
3) Propagating the recombinant MV-ZIKV thus produced by co-culturing the helper cells of step 2) with cells allowing said propagation, such as Vero cells;
4) Recovering replicative infectious replicative MV-ZIKV particles expressing at least (i) a prM protein of ZIKV and an E protein of ZIKV or a truncated version thereof or (ii) an E protein of ZIKV or a truncated version thereof, and ZIKV VLPs expressing the same ZIKV proteins.
According to a particular embodiment of the method, the transfer vector plasmid has the sequence of SEQ ID NO. 165, SEQ ID NO. 166 or SEQ ID NO. 167, preferably the sequence of SEQ ID NO. 165.
As used herein, the term "recombinant" refers to the introduction of at least one polynucleotide into a cell, e.g., in the form of a vector, either integrated (in whole or in part) or not integrated into the cell genome (e.g., as defined above).
According to a particular embodiment, recombination may be obtained with a first polynucleotide, which is a nucleic acid construct of the invention. Recombination may also or alternatively encompass the introduction of a polynucleotide, which is a vector encoding the RNA polymerase large protein (L) of MV, the definition, nature and stability of expression of which have been described herein.
According to the invention, a cell or cell line or cell culture which stably produces RNA polymerase, the nucleoprotein (N) of measles virus and the polymerase cofactor phosphoprotein (P) of measles virus is a cell or cell line as defined in the present specification or a cell culture as defined in the present specification, i.e. also a recombinant cell in this sense which has been modified by the introduction of one or more polynucleotides as described above. In certain embodiments of the invention, the cells or cell lines or cell cultures that stably produce the RNA polymerase, N and P proteins do not produce or do not stably produce the L protein of measles virus, e.g., allow for its transient expression or production.
The production of recombinant infectious replicative MV-ZIKV particles of the invention may involve the transfer of cells transformed as described herein. The term "transfer" as used herein refers to the laying of recombinant cells on different types of cells, in particular on monolayers of different types of cells. These latter cells are sufficient to maintain replication and production of infectious MV-ZIKV particles, i.e. the formation of infectious viruses inside the cell and the possible release of these infectious viruses outside the cell, respectively. This transfer results in co-culture of the recombinant cells of the invention with competent cells as defined in the previous sentence. This transfer may be an additional (i.e., optional) step when the recombinant cells are not an efficient virus-producing culture, i.e., when infectious MV-ZIKV particles cannot be efficiently recovered from these recombinant cells. This step is introduced after further recombining the recombinant cell of the invention with the nucleic acid construct of the invention and optionally a vector comprising a nucleic acid encoding the RNA polymerase large protein (L) of measles virus.
In certain embodiments of the invention, the transfer step is desirable because recombinant cells, which are generally selected for their ability to readily recombine, are not sufficiently efficient at maintaining and producing recombinant infectious MV-ZIKV particles. In said embodiment, the cell or cell line or cell culture of step 1) of the above method is a recombinant cell or cell line or culture of recombinant cells according to the invention.
Cells suitable for preparing the recombinant cells of the invention are prokaryotic or eukaryotic cells, in particular animal or plant cells, and more particularly mammalian cells, such as human cells or non-human mammalian cells or avian cells or yeast cells. In certain embodiments, the cells are isolated from a primary culture or cell line prior to recombining their genomes. The cells of the invention may be dividing or non-dividing cells.
According to a preferred embodiment, the helper cells are derived from a human embryonic kidney cell line 293, which cell line 293 is deposited with the ATCC as No. CRL-1573. A particular cell line 293 is the cell line disclosed in international application WO2008/078198 and mentioned in the examples below.
According to another aspect of the method, the cells suitable for passaging are CEF cells. CEF cells may be prepared from fertilized eggs obtained from EARL Morizeau,8rue Moulin,28190Dangers,France or from any other fertilized egg manufacturer.
The methods disclosed according to the present invention are advantageously used to produce infectious replicative MV-ZIKV particles and optionally VLPs expressing ZIKV antigens suitable for use as an immune composition.
The invention thus relates to an immunogenic composition, the active elements of which comprise infectious replicative MV-ZIKV particles rescued by the nucleic acid construct of the invention and in particular obtained by the disclosed method.
The nucleic acid construct of the invention and the MV-CHIKV of the invention encode or express at least (i) a prM protein of ZIKV, and an E protein of ZIKV or a truncated version thereof, or (ii) an E protein of ZIKV or a truncated version thereof.
"protein of ZIKV" refers to a "protein" as defined herein, which has the same sequence as the counterpart in a strain of ZIKV, including polypeptides that are natural maturation or precursors of ZIKV proteins or fragments or mutants thereof as defined herein. In the present invention, the "ZIKV protein" is specifically an antigen (prM or E or derivatives thereof disclosed herein) designed for the ZIKV consensus sequence. In particular, the antigen was designed using the consensus amino acid sequence of the Zika virus that was observed to spread in 2015 and later, especially designed to include the S139N change in prM and V763M in E, the S139N change generated new potential N glycosylation sites in prM, which were not present in the African lineage. Thus, the inventors included this S139N mutation in all asian lineage sequences, but did not include a single mutation in a particular isolate. The inventors observed that the amino acid sequence of Asian strain BeH818995 (GenBank: KU 365777) corresponds to the consensus amino acid sequence of Zika virus observed in 2015 and later.
In particular fragments or mutants having at least 50%, at least 80%, particularly advantageously at least 90% or preferably at least 95% amino acid sequence identity with the naturally occurring ZIKV capsid protein or envelope protein. Amino acid sequence identity can be determined by one skilled in the art using manual alignment or alignment using many available alignment programs. Fragments or mutants of the ZIKV proteins of the invention may be defined relative to the specific amino acid sequences shown herein.
According to a preferred embodiment, the invention also relates to modification and optimization of polynucleotides to allow efficient expression of at least (i) prM of ZIKV and E protein of ZIKV or a truncated version thereof or (ii) E protein of ZIKV or a truncated version thereof in a host, in particular a human host, on the surface of chimeric infectious particles of MV-ZIKV.
According to this embodiment, optimization of the polynucleotide sequence may be operated so as to circumvent the cis-acting domain of the following nucleic acid molecules: internal TATA box, chi site and ribosome entry site; AT-rich or GC-rich sequence segments; ARE, INS, CRS sequence elements; repeat sequences and RNA secondary structures; masking splice donor and acceptor sites, branching points.
The optimized polynucleotides may also be codon optimized for expression in a particular cell type. This optimization allows for improved efficiency of chimeric infectious particle production in cells without affecting the expressed protein.
In certain embodiments of the invention, the polynucleotide encoding at least (i) the prM protein of ZIKV and the E protein of ZIKV or a truncated version thereof or (ii) the E protein of ZIKV or a truncated version thereof has been optimized for cynomolgus codon usage or has been optimized for human codon usage.
Optimization of the polynucleotides encoding at least (i) the prM protein of ZIKV and the E protein of ZIKV or a truncated version thereof or (ii) the E protein of ZIKV or a truncated version thereof may be performed by modifying the wobble (Wobble) position in the codon without affecting the identity of the amino acid residues translated from the codon relative to the original codon.
Optimization was also performed to avoid editing-like sequences (editing-like sequences) from measles virus. Editing of measles virus transcripts is a process that occurs in particular in transcripts encoded by the P gene of measles virus. This editing is performed by inserting additional G residues at specific sites in the P transcript, resulting in a novel protein that is truncated compared to the P protein. Only the addition of a single G residue results in the expression of a V protein that contains a unique carboxyl terminus (Cattaneo R et al, cell.1989Mar 10;56 (5): 759-64).
In a specific embodiment of the invention, the polynucleotide encoding at least (i) the prM protein of ZIKV and the E protein of ZIKV or a truncated version thereof or (ii) the E protein of ZIKV or a truncated version thereof has deleted measles edit-like sequences. The following measles edit-like sequences may be mutated: AAAGGG, AAAAGG, GGGAAA, GGGGAA, TTAAA, AAAA and their complement TTCCCC, TTTCCC, CCTTTT, CCCCTT, TTTAA, TTTT. For example, AAAGGG may be mutated in AAAGGC, AAAAGG may be mutated in AGAAGG or TAAAGG or GAAAGG, and GGGAAA may be mutated in GCGAAA.
In a particular embodiment of the invention, the natural and codon-optimized nucleotide sequences of the polynucleotides encoding the particular peptides/proteins/antigens of the invention and the amino acid sequences of these peptides/proteins/antigens are the sequences disclosed in SEQ ID NO 3-164 and mentioned in Table 1 below. These sequences are also shown in figures 3A-3D.
In a particular embodiment of the invention, the transfer vector plasmid pTM2-MVSchw_A1_Zikasp_ZikaprME has the optimized sequence of SEQ ID NO. 165, the transfer vector plasmid pTM 2-MVSchw_Insert4 has the native sequence of SEQ ID NO. 166 and the transfer vector plasmid pTM 2-MVSchw_Insert5 has the native sequence of SEQ ID NO. 167, as mentioned in Table 1 below.
In another particular embodiment of the invention, the natural nucleotide sequence of the polynucleotide encoding insert 4 or insert 5 and the amino acid sequence of said insert 4 or insert 5 of the invention are the sequences as disclosed in SEQ ID Nos. 168-171 and mentioned in Table 1 below. Insert 4 (SEQ ID NO: 169) is similar to Zikasp_Zika_prMEd404 (SEQ ID NO: 54) except that it has a shorter sp at 5'. Insert 5 (SEQ ID NO: 171) was similar to Zikasp '_ZikaEd445 (SEQ ID NO: 75) with a nuance of 2 at 5'.
Table 1. The natural and codon optimized nucleotide sequences of polynucleotides encoding specific peptides/proteins and the amino acid sequences of these peptides/proteins used in the present invention.
/>
/>
In a particular embodiment of the invention, the ZIKV is derived from the african lineage, in particular from african strain ArB1362 (GenBank: KF 383115) or from african isolate IbH _30656 (GenBank: HQ 234500), or from the asian lineage, in particular from asian strain BeH818995 (GenBank: KU 365777), preferably from the ZIKV strain propagated in pacific and america since 2013.
In another particular embodiment of the invention, the ZIKV corresponds to multiple lineages of ZIK virus, including strains transmitted in the pacific and america since 2013.
In a preferred embodiment of the invention, the prM protein of ZIKV has an amino acid sequence that is a consensus amino acid sequence representing prM sequences pooled across multiple ZIKV strains, including those from asian lineages, particularly from the ZIKV strain BeH 818995. The E protein of ZIKV or truncated versions thereof has an amino acid sequence that is a consensus amino acid sequence representing the E sequence of a variety of ZIKV strains (particularly from ZIKV strain BeH 818995) pooled together in asian lineages.
In a specific embodiment of the invention, the polynucleotide encoding at least (i) the prM protein of ZIKV and the E protein of ZIKV or a truncated version thereof further encodes (iii) a signal peptide (sp) from the capsid protein of ZIKV or a signal peptide (JEVsp) from the capsid protein of JEV or a signal peptide (MVsp) from the fusion protein of MV or a modified signal peptide (MVsp ') from the fusion protein of MV and a signal peptide (sp') from the membrane protein of ZIKV
The polynucleotide encoding at least (ii) the E protein of ZIKV or a truncated version thereof further encodes (iii) a signal peptide (sp) from a capsid protein of ZIKV or a signal peptide (sp ') from a membrane protein of ZIKV or a signal peptide (JEVsp) from a capsid protein of JEV or a signal peptide (MVsp) from a fusion protein of MV or a modified signal peptide (MVsp') of a fusion protein of MV.
In a preferred embodiment of the invention, the polynucleotide encoding at least (i) a prM protein of ZIKV and an E protein of ZIKV or a truncated version thereof further encodes (iii) a signal peptide from a capsid protein of ZIKV and a signal peptide from a membrane protein of ZIKV, or
The polynucleotide encoding at least (ii) an E protein of ZIKV or a truncated version thereof further encodes (iii) a signal peptide from a capsid protein of ZIKV or a signal peptide from a membrane protein of ZIKV.
In a particular embodiment of the invention, the polynucleotide encoding the E protein encodes the full-length E protein or a soluble form thereof lacking both C-terminal transmembrane domains of the full-length E protein.
In a preferred embodiment of the invention, the polynucleotide encoding a truncated version of the E protein is selected from the group consisting of: (i) a polynucleotide encoding an E protein truncated at amino acid 456 of the full-length E protein of ZIKV (i.e., an E protein lacking the anchor region and the intermediate domain between the stem region and the anchor region), (ii) a polynucleotide encoding an E protein truncated at position 445 of the full-length E protein of ZIKV (i.e., an E protein lacking the anchor region, the intermediate domain between the stem region and the anchor region, and a fragment of the second helix constituting the stem region), and (iii) a polynucleotide encoding an E protein truncated at position 404 of the full-length E protein of ZIKV (i.e., an E protein lacking the stem region, the intermediate domain between the stem region and the anchor region, and the anchor region).
In a preferred embodiment of the invention, the polynucleotide encodes a prM protein of ZIKV having the sequence of SEQ ID No. 20 and the polynucleotide encodes an E protein of ZIKV or a truncated version thereof having a sequence selected from the group consisting of SEQ ID NO: SEQ ID NO. 23, SEQ ID NO. 26, SEQ ID NO. 29 and SEQ ID NO. 32.
In a preferred embodiment of the invention, the polynucleotide encoding the prM protein of ZIKV has the sequence of SEQ ID NO:19, and the polynucleotide encoding the E protein of ZIKV or a truncated version thereof has a sequence selected from the group consisting of: SEQ ID NO. 22, SEQ ID NO. 25, SEQ ID NO. 28 and SEQ ID NO. 31.
In a specific embodiment of the invention, the nucleic acid construct comprises a sequence selected from the group consisting of: SEQ ID NO 45-164 and SEQ ID NO 168-171.
In a preferred embodiment of the invention, the nucleic acid construct comprises a sequence selected from the group consisting of: SEQ ID NO. 46, SEQ ID NO. 52, SEQ ID NO. 55, SEQ ID NO. 70, SEQ ID NO. 76, SEQ ID NO. 79, SEQ ID NO. 168 and SEQ ID NO. 170, preferably having the sequence of SEQ ID NO. 46, SEQ ID NO. 55 or SEQ ID NO. 76, more preferably having the sequence of SEQ ID NO. 46.
In a preferred embodiment of the invention, the nucleic acid construct comprises the sequence from nucleotide 83 to nucleotide 18404 in the sequence of SEQ ID NO:165, or the sequence from nucleotide 83 to nucleotide 18074 in the sequence of SEQ ID NO:166, or the sequence from nucleotide 83 to nucleotide 17702 in the sequence of SEQ ID NO: 167.
The invention also relates to recombinant infectious replication competent measles virus-ZIKV virus (MV-ZIKV) particles comprising as their genome the nucleic acid construct according to the invention.
In a particular embodiment of the invention, the recombinant infectious replicative MV-ZIKV particles are rescued from helper cell lines that express the RNA polymerase recognized by the cell line (e.g., T7 RNA polymerase), the nucleoprotein (N) of MV, the phosphoprotein (P) of MV, and optionally the RNA polymerase macroprotein (L) of MV, and are further transfected with a transfer vector plasmid according to the invention.
The recombinant infectious replicative MV-ZIKV particles are thus produced by a method comprising expressing a nucleic acid construct according to the invention in a host cell comprising an RNA polymerase (e.g., T7 RNA polymerase), a nucleoprotein (N) of MV, a phosphoprotein (P) of MV, and optionally an RNA polymerase macroprotein (L) of MV that is recognized by the host cell.
According to a particular embodiment of the invention, the particle comprises in its genome a polynucleotide sequence comprising a sequence selected from the group consisting of: SEQ ID NO. 46, SEQ ID NO. 52, SEQ ID NO. 55, SEQ ID NO. 70, SEQ ID NO. 76, SEQ ID NO. 79, SEQ ID NO. 168 and SEQ ID NO. 170, preferably having the sequence of SEQ ID NO. 46, SEQ ID NO. 55 or SEQ ID NO. 76, more preferably having the sequence of SEQ ID NO. 46.
The obtained at least (i) prM protein of ZIKV and E protein of ZIKV or truncated versions thereof or (ii) E protein of ZIKV or truncated versions thereof are also capable of self-assembly into ZIK virus-like particles (VLPs), together with MV-ZIKV particles.
As used herein, the term "virus-like particle" (VLP) refers to a structure that is similar in at least one attribute to a virus but has not been demonstrated to be equally infectious. VLPs according to the invention do not carry genetic information for the proteins encoding the VLPs, typically, the VLPs lack the viral genome and are therefore non-infectious and non-replicating. According to the invention, VLPs may be produced in large quantities and expressed together with recombinant infectious MV-ZIKV particles. The VLP is a ZIKV VLP.
According to a further aspect, the present invention relates to recombinant infectious MV-ZIKV particles expressing at least (i) a prM protein of ZIKV and an E protein of ZIKV or a truncated version thereof or (ii) an E protein of ZIKV or a truncated version thereof, in particular by reference to their nucleic acid and polypeptide sequences. The recombinant infectious MV-ZIKV particles advantageously express at least (i) the prM protein of ZIKV and the E protein of ZIKV or a truncated version thereof or (ii) the E protein of ZIKV or a truncated version thereof as VLPs.
The invention also relates to a composition or a kit of active ingredients comprising recombinant infectious replicative MV-ZIKV particles according to the invention and a pharmaceutically acceptable carrier.
The invention also relates to the combination (association) of VLPs comprising at least (i) a prM protein of ZIKV and an E protein of ZIKV or a truncated version thereof or (ii) an E protein of ZIKV or a truncated version thereof with recombinant infectious replicative MV-ZIKV-MV particles in a composition.
According to a preferred embodiment of the invention, the recombinant MV vector is designed in such a way, and the production process involves cells, that the viral particles produced in helper cells transfected or transformed with the vector are capable of producing recombinant infectious replicative MVs and producing ZIKV VLPs, for use in immunogenic compositions, preferably in protective or even vaccine compositions, the vector being derived from a MV strain suitable for vaccination.
Advantageously, the genome of the recombinant infectious MV-ZIKV particles of the invention has replication capacity. "replicative capacity" refers to the ability of a nucleic acid to be transcribed and expressed when transduced into helper cell lines expressing the N, P and L proteins of MV to produce new viral particles.
Replication of the recombinant viruses of the invention obtained using MV cDNA for preparing a recombinant MV-ZIKV genome can also be achieved in vivo in a host, in particular a human host, to which the recombinant MV-ZIKV is administered.
The invention also relates to a composition or a kit of active ingredients comprising recombinant infectious replicative MV-ZIKV particles according to the invention in combination with ZIKV-VLPs, said ZIKV-VLPs expressing the same ZIKV proteins as said MV-ZIKV particles.
According to a preferred embodiment of the invention, the composition or formulation of the active ingredient is used to elicit an immune response, in particular a protective immune response, against ZIKV by eliciting antibodies against ZIKV proteins and/or eliciting a cellular immune response in a host in need thereof, in particular a human host.
Thus, the composition or formulation of the active ingredient may comprise a suitable carrier (e.g., a pharmaceutically acceptable carrier) for administration to a host (particularly a human host), and may further comprise, but need not comprise, an adjuvant to enhance the immune response in the host. The inventors have in fact shown that the administration of the active ingredients of the present invention can elicit an immune response without the need for an adjuvant.
According to a particular embodiment of the invention, the composition or formulation of the active ingredient comprises a pharmaceutically acceptable carrier.
The invention relates in particular to compositions, in particular immunogenic compositions, preferably vaccine compositions for administration to children, adolescents or travellers.
In certain embodiments, the composition or vaccine is used for prophylactic protection against ZIKV african and asian strains.
The use of the composition or vaccine is for protection against ZIKV infection or against clinical consequences of ZIKV infection (protection against ZIKV disease) in prophylactic treatment. Such vaccine compositions have an advantageous active principle (active principle) (active ingredient)) comprising recombinant infectious replicative MV-ZIKV particles rescued by the vectors defined herein, optionally in combination with VLPs comprising the same ZIKV protein.
In the context of the present invention, the term "combined" or "combination" refers to the presence of recombinant infectious replicative MV-ZIKV particles and the aforementioned ZIKV proteins (in particular as VLPs) in separate compositions, typically as physically separate entities.
The invention also relates to recombinant infectious replicative MV-ZIKV particles according to the invention in combination with the above mentioned ZIKV proteins, in particular with ZIKV-VLPs expressing the same ZIKV proteins, or to compositions or formulations of the active ingredients according to the invention for use in preventing a ZIKV infection in a subject or preventing the clinical consequences of a ZIKV infection in a subject, in particular in a human.
The invention also relates to recombinant infectious replicative MV-ZIKV particles according to the invention in combination with the above mentioned ZIKV proteins, in particular with ZIKV-VLPs expressing the same ZIKV proteins, for use in an administration schedule and according to a dosing regimen that elicits an immune response (advantageously a protective immune response) against a ZIKV infection or induced disease, in particular in a human host.
The administration schedule and dosing regimen may entail the separate administration of selected doses of recombinant infectious replicative MV-ZIKV particles according to the invention in combination with the aforementioned ZIKV proteins, in particular in combination with ZIKV-VLPs expressing the same ZIKV proteins.
Alternatively, it may require multiple administrations in a prime-boost regimen. Priming and boosting can be accomplished with the same active ingredient consisting of the recombinant infectious replicative MV-ZIKV particles in combination with the aforementioned ZIKV proteins, in particular in combination with ZIKV-VLPs expressing the same ZIKV proteins.
Alternatively, priming and boosting administration may be accomplished with different active ingredients, which involve recombinant infectious replicative MV-ZIKV particles in combination with the above-mentioned ZIKV proteins (in particular in combination with ZIKV-VLPs expressing the same ZIKV proteins) in at least one administration step, as well as other ZIKV active immunogens in other administration steps, such as the above-mentioned ZIKV proteins or ZIKV-VLPs expressing the same ZIKV proteins.
Administration of recombinant infectious replicative MV-ZIKV particles according to the invention in combination with ZIKV-VLPs expressing the same ZIKV protein elicits an immune response, and in particular, antibodies cross-reactive to multiple ZIKV strains. Thus, it has been shown that the administration of an active ingredient according to the invention can elicit an immune response against a group of ZIKV strains when prepared with the coding sequences of a particular ZIKV strain.
In view of the knowledge available in terms of vaccine dosages suitable for other pathogens (e.g. HBV or HPV) involving administration of VLPs and also in terms of vaccine dosages suitable for known human MV vaccines, the inventors have determined that recovery of ZIKV-VLPs with recombinant MV-ZIKV enables the proposed administration of effective low doses of active ingredients. Indeed, it is contemplated that recombinant MV-ZIKV is capable of producing about 10 per recombinant infectious replicative MV-ZIKV particle 4 ZIKV-VLPs and taking into account the currently known doses of human MV vaccine at 10 3 To 10 5 Suitable doses of recombinant MV-ZIKV to be administered in the range of TCID50 may be in the range of 0.1-10ng, especially 0.2-6ng, and possibly as low as in the range of 0.2-2 ng. For comparison purposes, the dose of VLPs administered in the case of HBV or HPV vaccines is in the range of 10 μg, meaning that the dose of recombinant MV-ZIKV vaccine may comprise about 2000-fold less or up to 5000 to 10000-fold less VLPs.
According to a particular embodiment of the invention, the immunogenic or vaccine composition defined herein may also be used for the prevention of infection with measles virus.
Other features and advantages of the present invention will become apparent from the following examples, and also be illustrated in the accompanying drawings.
Brief Description of Drawings
FIG. 1A schematic representation of the genome of Zika virus.
FIG. 2 phylogenetic tree of major human pathogenic flaviviruses based on the amino acid sequences of the E protein (left panel) and the polymerase NS5 protein (right panel). JEV, japanese encephalitis virus (Japanese encephalitis virus); MVEV, murray Valley encephalitis Virus (Murray Valley encephalitis virus); POWV, polio virus (Powassan virus); SLEV, st.louis encephalitis virus (Saint Louis encephalitis virus); TBEV, tick-borne encephalitis virus (tick-borne encephalitis virus); YFV, yellow fever virus (yellow fever virus); WNV, west Nile virus (West Nile virus).
FIG. 3 is a schematic representation of the Zika virus antigen. Protein domains are drawn to scale. Zika, zika virus; JEV, japanese encephalitis virus; MV, measles virus. A. 12 variants of the Zika antigen, in which natural signal peptides from the Zika viral capsid protein (sp) or membrane protein (sp') are used. B. 8 variants of chimeric JEV-Zika antigen in which the signal peptide of the JEV capsid protein was used. 10 variants of MV-Zika antigen, wherein the signal peptide (MVsp) of the MV fusion protein is used. 10 variants of MV Zika antigen in which the signal peptide (MVsp') of the fusion protein of modified MV was used.
FIG. 4. Schematic representation of MV vector. The MV gene is expressed as: n (nucleoprotein), PVC (phosphoprotein and V/C protein), M (matrix), F (fusion), H (hemagglutinin), L (polymerase), T7 (T7 RNA polymerase promoter), hh (hammerhead ribozyme), T7T (T7 RNA polymerase terminator),(hepatitis delta virus ribozyme), red arrow (additional transcription unit).
Figure 5. Single immunization in mice. A) One month after a single immunization, the Zika antibody response in the mouse serum was measured by ELISA. MV-prMEd404 (natural sequence, insert 4); MV-ssEd445 (native sequence, insert 5). B) Survival of immunized mice after Zika virus challenge. C) Zika viremia (determined by RT-qPCR) in serum of mice immunized on different days post challenge. D) One week after immunization with MV-Zika or control MVSchw virus, IFN-. Gamma.Elispot.assays were performed in mouse spleen cells. Elispot assays were performed against MV (Schwarz), zika virus (Zika) and concanavalin a as controls.
Fig. 6. Prime-boost immunization in mice. A) The Zika antibody responses in mouse serum were measured by ELISA at days 30, 45 and 55 after two immunizations. B) The Zika virus neutralizing antibodies were detected in serum from mice immunized with MV-prMEd404 (natural sequence, insert 4), MV-ssEd445 (natural sequence, insert 5) by two injections. C) Survival of mice immunized after challenge with low dose of Zika virus. D) Zika viremia (determined by RT-qPCR) in serum of mice immunized on different days after challenge.
FIG. 7 recombinant MV expressing full length prME Zika antigen (construct A1) produces Zika VLPs. Vero cells were infected with three different rMV-zika_a1 clones (1, 2, 3) for 48 hours. Cell lysates and media were collected. The supernatant medium was clarified by low-speed centrifugation (1500 rpm) and then concentrated by ultracentrifugation on a 20% sucrose pad for 3 hours (36000 rpm). All materials were analyzed by western blot with 4g2 panflavi monoclonal antibody to detect the zika E protein (50 kD). (A) cell lysate, (B) concentrated medium, (C) unconcentrated medium, and positive and negative controls. The positive control was a lysate of Vero cells transfected with pcDNA5 plasmid expressing the zika A1 antigen for 48 hours. The recovered positive E proteins in group B after ultracentrifugation indicated that high density of VLPs were produced in the supernatant of infected Vero cells.
FIG. 8 Zika virus antigen expression assay. HEK293T cells were transfected with each codon optimized construct and cell lysates and culture media were collected after 48 hours. The supernatant medium was clarified by low-speed centrifugation (1500 rpm) and then concentrated by ultracentrifugation on a 20% sucrose pad for 3 hours (36000 rpm). All materials were analyzed by Western blot analysis using 4G2 pan-flavivirus antibody to detect the Zika E protein (. About.50 kD). (L) cell lysate, (S) unconcentrated medium, and (U) ultracentrifuged medium.
FIG. 9 shows the growth curve of expression of Zika virus antigen A1 and recombinant MV-Zika-A1 from measles vectors. (A) Immunofluorescence analysis showed large syncytia in Vero cells infected with MV-zhai-A1 for 24 hours (detection of zhai-ca virus E protein with 4G2 pan-flavivirus antibody). (B) Replication of recombinant MV-Zika-A1 virus on Vero cells at 32℃after infection was performed with a multiplicity of infection (titer determined by limiting dilution and Karber method) of 0.01.
FIG. 10 immunized CD46-IFNAR -/- Mice respond to ZIKV antibodies. Antibody titers against ZIKV EDIII were determined using indirect ELISA in mouse sera collected after primary and booster immunizations with MV-ZIKV-A1, MV-prMEd404 (native sequence, insert 4), MV-ssEd445 (native sequence, insert 5), MV-ZIKV-a12 or control empty MV-Schwarz. The reading of the wells coated with the mimetic antigen was subtracted from the wells with ZIKV-EDIII and the ZIKV-specific IgG titer was calculated as the reciprocal of the highest dilution of serum from individuals with absorbance of 0.5. A strong antibody response to ZIKV was induced in immunized mice, with slightly higher values for A1 (high replicability) and a12 (higher variability).
FIG. 11 immunized CD46-IFNAR -/- ZIKV neutralizing antibody titer in mice. In mouse serum collected after the last boost with MV-ZIKV-A1, MV-prMEd404 (native sequence, insert 4), MV-ssEd445 (native sequence, insert 5), MV-ZIKV-a12 or control empty MV-Schwarz and prior to challenge by using the plaque reduction neutralization assay (PRNT) 50 ) Neutralizing antibody titers against ZIKV were determined. The strongest neutralization titers were observed with the MV-ZIKV-A1 construct.
FIG. 12 CD46-IFNAR for protection against immunization -/- Mice were protected from ZIKV non-lethal challenge. One month after the last immunization, use 10 3 ZIKV of ffu (asia-south america pedigree, isolated 12 months of 2015) challenged mice immunized twice with MV-ZIKV-A1, MV-ZIKV-a12 or control null MV-Schwarz. Viral load was determined by RT-qPCR. LOD indicates the limit of detection for RT-qPCR. Mice immunized with construct MV-ZIKV-A1 were protected from viremia, while mice immunized with MV-ZIKV-A12 or empty MV Schwarz controls were infected.
FIG. 13 CD46-IFNAR for protection against immunization -/- Mice were protected from ZIKV lethal challenge. One month after the last immunization, 10 3 Mice immunized twice with MV-ZIKV-A1 or control null MV-Schwarz were challenged with ZIKV (African lineage mouse-adapted strain) of ffu. Animals were monitored for morbidity and mortality for 15 days. All animals immunized with MV-ZIKV-A1 survived without signs of diseaseWhile all control mice died on day 8.
Examples
Vaccine candidate generation
Previous experience with different flaviviruses (dengue, west nile, japanese encephalitis, tick-borne encephalitis) has widely demonstrated that the flavivirus surface envelope (E) protein is capable of eliciting protective neutralizing antibodies that allow for reduced viral infectivity. The ZIKV genome consists of a single stranded sense RNA molecule of-10800 kb length with 2 flanking non-coding regions (5 'and 3' ncr) and one long open reading frame encoding a polyprotein that is cleaved into three structural proteins (capsid protein (C), membrane precursor (prM), envelope (E)) and seven non-structural proteins (NS) (fig. 1). The E protein (53 kDa) is the major virion surface protein involved in various aspects of the viral cycle, mediating binding to target cells and membrane fusion.
Thus, the inventors chose to express the Zika virus E protein. Several forms of E protein were selected to express soluble secreted or anchored proteins to the surface of VLPs. The following Zika virus antigens were cloned and expressed in human cells from mammalian expression plasmids: prM-E, different forms of E with or without a stem or anchor region. These proteins contain the original signal peptide sequence of the Zika virus E or a heterologous signal peptide sequence from a JEV or MV fusion protein. These proteins contain a signal enzyme cleavage site located between prM and E sequences (fig. 3A, 3B, 3C, 3D).
Antigen selection and design
The Zika antigen was selected based on previous work that simultaneously indicated that the envelope antigen of the flavivirus may be capable of eliciting neutralizing antibodies and T cell responses. However, selection of the appropriate antigen should take into account the evolution of the virus over time as well as the diversity of existing strains. For this purpose, the inventors reconstructed phylogenetic events including representative members of the flaviviridae family of the Zika virus using only the amino acid region of the flaviviridae polyprotein corresponding to the envelope protein (E) gene. Unlike whole genome or polymerase (NS 5) based phylogenetic analyses of flaviviruses, the next to the zaka virus is a neurotropic virus such as san lewisite encephalitis virus, and the inventors noted that zaka E appears to be closer to DENV E (fig. 2) (barbe-Spaeth, et al nature 2016,536,48-53). The inventors then identified the different domains of the Zika membrane (M) protein, its precursor (prM) protein, and the E protein by structural homology modeling based on available data on DENV (Ekins et al Illustering and homology modeling the proteins of the Zika virus, F1000Research 2016, 5:275). The inventors also identified a signal peptide at the end of the capsid protein (C) gene just upstream of prM (http:// sigpep. Service. Name. Sbg. Ac. At/signalb last. Html; http:// www.cbs.dtu.dk/service/signalP/; http:// www.predisi.de /) using homology modeling again with dengue virus as a reference and publicly available algorithms to predict signal peptide sequences. The inventors selected to incorporate a signal peptide sequence to induce export and secretion of the candidate antigen, either full-length prM-E or E alone, outside the cell. For the E antigen, the inventors also predicted a signal peptide at the M-terminus, just upstream of E, and used this natural signal to design variants of the antigen (fig. 3A). Furthermore, the inventors have designed chimeric antigens in which the natural signal peptide of the zika virus is replaced by a signal peptide present at the C-terminus of JEV (fig. 3B) or the N-terminus of fusion protein of MV (F) (fig. 3C), provided that these sequences will provide enhanced export of candidate antigens. The inventors devised an additional version of the chimeric antigen comprising the signal peptide from F of MV, in which two amino acids corresponding to the junction between the end of the signal peptide of F and the initiation of F itself were removed (fig. 3D).
Second, the inventors also designed shorter antigen variants by removing the C-terminal fragment of the E protein, which corresponds to predicted stem and/or anchor domains (predicted by comparison to DENV) including the intermediate region between the stem and anchor domains. The purpose of these modifications to reduce the size of the antigen is to generate antigens capable of forming VLPs. For the third variant, the inventors removed the anchoring domain, the intermediate domain between the anchoring domain and the stem, and the fragment of the second helix that constitutes the stem, this time in homology modeling with WNV (variant Ed 445).
Finally, the inventors designed chimeric prM-E and E antigens using signal peptides from MV F protein, and replaced the zaka E anchor domain with the transmembrane region (TM) and cytoplasmic tail of MV F protein (fig. 3C and 3D).
For selected sequences of the antigen itself, the inventors analyzed all publicly available sequences of the zika virus (asia and african lineages), as well as the unpublished sequences generated by the inventors from southern america and pacific epidemics. Based on epidemiological data reporting that adult congenital syndrome and neurological diseases are associated with only the asian lineage, the inventors designed antigens using the consensus amino acid sequence of the zika virus observed to spread in 2015 and later, in particular designed to include the S139N change in prM, which resulted in new potential N glycosylation sites in prM that were not present in the african lineage, and V763M in E.
The sequence codon was optimised for human expression and the sequence was adapted to measles vector clones and the "six-digit rule" (total number of nucleotides divisible by 6). The regions that are very GC-rich (> 80%) or very GC-poor (< 30%) were avoided to improve RNA stability, high CAI values (0.97) were obtained to improve translation efficiency, avoiding the following CIS-active sequences: internal TATA-boxes, chi sites, ribosome entry sites, AT-or GC-rich sequence segments, ARE, INS, CRS elements, repeat sequences, RNA secondary structures, cryptic splice donor and acceptor sites, branching points. The following measles virus editing sequences were avoided as much as possible: AAAGGG, AAAAGG, GGGAAA, GGGGAA, TTAAA, AAAA, and also their complements on the same strand: TTCCCC, TTTCCC, CCTTTT, CCCTT, TTTAA, TTTT. Restriction sites BssHII, bsiWI were avoided internally and inserted at both ends for cloning purposes.
Antigen expression in mammalian cells
The optimized antigen sequences were cloned into pcDNA5 mammalian expression plasmids and transfected into HEK293 cells. After western blotting using appropriate antibodies for detection, the size and expression level of each antigen was characterized.
Antigen expression in measles vectors
The optimized Zika antigen sequence was inserted into the MV vector as a different additional transcriptional unit depending on the desired expression level. After sequencing measles vector plasmids expressing different Zika antigens, replicative recombinant vectors were generated by reverse genetics using a previously developed cell-based system (combretdet, C.et al, 2003, J Virol,77 (21): 11546-11554), and the rescued viruses were amplified and titrated on Vero cells. Recombinant viruses were grown on Vero cells to record expression of the zikaprotein detected in the supernatant and cells by western blot and indirect immunofluorescent staining with appropriate antibodies. After ultracentrifugation of the medium and western blotting, the presence of the Zika virus VLPs (in prM/E expressing vectors) was identified (FIG. 7). The correct processing of antigen in infected cells was checked by western blotting. Prior to amplification on Vero cells, vectors with optimal expression of the Zika antigen were isolated by serial dilution and single plaque cloning.
Growth ability of recombinant vaccine viruses
The growth capacity of the selected vaccine virus was compared to standard MV Schwarz. Growth curve analysis was performed by using different multiplicity of infection followed by titration in Vero cell culture.
Stability of recombinant vaccine viruses
The selected optimal vaccine vector was tested for its genetic stability by serial passage over Vero cell culture for more than 10 cell culture passages, followed by western blot and full sequence analysis for antigen expression.
Preclinical evaluation of the first MV-Zika recombinants in mice
Single immunization
Two recombinant vectors MV-prMEd404 (native sequence, insert 4) and MV-ssEd445 (native sequence, insert 5) were evaluated in CD46/IFNAR mice susceptible to measles infection. Mice were immunized with the established infectious units of vaccine virus in one or two intraperitoneal injections and analyzed for functional antibodies and cell-mediated immune responses using standard assays and specially developed assays. The binding antibodies to Zika virus were determined by ELISA and the neutralizing antibodies were specific plaquesReduction Neutralization Test (PRNT). T cell responses were analyzed by Elispot assay using the zika virus specific peptide for ex vivo stimulation of spleen cells. Vaccine vectors were then tested for protective efficacy: immunized mice were challenged with a lethal dose of zika virus. Dose-response challenge was previously established in CD46/IFNAR mice, showing a dose of 10 2 To 10 6 The zika virus african strain HD78788 (adaptation to mice) between the individual lesion formation units (ffu) effectively killed these mice.
In the first experiment, the preparation was administered by intraperitoneal injection 10 6 MV-prMEd404 of TCID50 (native sequence, insert 4), MV-ssEd445 (native sequence, insert 5) or empty MVSchw as a control were immunized with 6 mice per group. Blood was collected before and at day 30 post immunization and the ELISA titer of the zika virus was determined (fig. 5A).
Then by intraperitoneal injection 10 6 ffu the Zika virus African strain HD78788 (adapted to mice) challenged immunized mice on day 30. Morbidity and mortality were controlled within 12 days (fig. 5B), and the status of the card virus viremia was determined in serum by qRT-PCR (fig. 5C).
To determine the T cell response to the vaccine, another group of CD46/IFNAR mice was immunized with MV-prMEd404 (insert 4) or empty MVSchw and spleens were harvested 8 days after immunization. Freshly extracted splenocytes were subjected to Elispot analysis using MVSchw or zika virus to re-stimulate T cells or concanavalin a as a control (fig. 5D).
Prime-boost immunity
In the second set of experiments, the preparation was administered by two sequential intraperitoneal injections 10 6 MV-prMEd404 of TCID50 (native sequence, insert 4), MV-ssEd445 (native sequence, insert 5) or empty MVSchw immunized CD46/IFNAR mouse group as control. Blood was collected at 30, 45 and 55 days before and after immunization, and the Zika virus ELISA titers were determined (FIG. 6A). Neutralizing antibodies were determined in serum collected on day 50 using a specific neutralization test of the zika virus (fig. 6B). Then by intraperitoneal injection 10 6 ffu the Zika virus African strain HD78788 (adapted to mice) challenged immunized mice on day 60. Morbidity and mortalityThe rate was controlled within 12 days (fig. 6C), and the zika viremia was determined in serum by qRT-PCR at days 2, 4 and 6 post-infection (fig. 6D).
Preclinical evaluation in non-human primate (NHP)
Verification of ZIKV strains used in NHP challenge studies
Since little is known about the pathophysiology of ZIKV in cynomolgus macaques (Macaca fascicularis), three doses of wild-type zika virus (10 4 、10 5 And 10 6 pfu) were vaccinated with two animals to assess relevant clinical conditions in viral stocks and cynomolgus monkeys. Both animals were subjected to the same follow-up as vaccinated and challenged animals for only 6 months. The following points are demonstrated: virology (qRT-PCR; clinical conditions (rash, fever), blood cell count (lymphocytes, monocytes, granulocytes, platelets), biochemistry (ASAT, ALAT, CRP), nonspecific (congenital and inflammatory) and specific immune responses by cytokine/chemokine of luminex, NK, B and T cell profile (14 color flow cytometry), antibodies on consecutive serum samples (neutralization, binding), T cell functional response and memory cells (ELISpot, ICS). Viral shedding in biological fluids (saliva, tears, reproductive fluids) were assessed at different time points by qRT-PCR and/or isolation methods.
Vaccine immunogenicity studies in NHP
Macaques were immunized with a defined vaccine virus infection unit in one or two subcutaneous injections with 3 month intervals. Humoral and cell-mediated immune responses were determined at different times after immunization. The macaque was then challenged with ZIKV at the infectious dose. Infectious viremia and clinical signs were determined. For this task, 21 adult cynomolgus macaques negative for anti-flavivirus and anti-measles antibodies were selected; two groups of 7 animals were vaccinated with the selected optimal MV-ZIKV recombinant virus (native MV-prMEd 404) using either a single dose or a prime boost regimen. Immunity (humoral and cell-related) was studied and virology was followed up to 1 month after vaccination. Clinical and biological parameters were assessed in parallel with 7 animals of the third group, which were vaccinated with the control empty MVSchw strain according to the prime boost regimen. Antibody neutralization titers were determined.
Vaccine efficacy study in NHP
Two months after immunization, the immunized NHP was challenged with ZIKV. ZIKV viremia levels (qRT-PCR) were analyzed in blood, saliva and tears. Inflammatory and immune responses (neutralizing abs, cytokines) were evaluated in plasma.
Expression assay
The expression assays performed for all the resulting constructs (fig. 8) showed strong expression of several of them. Different amounts of signal were detected in the ultracentrifuge fractions for some of the candidate vaccines (especially A1 and a 12), which matches the generation of virus-like particles. Thus, both antigens were further cloned into measles vectors and showed high level expression as shown by immunofluorescence (fig. 9A). Replication of the recombinant MV-ZIKV-A1 vector was similar to that of the standard MV Schwarz virus, although the final titer was lower (FIG. 9B).
Their immunogenicity was tested in CD46/IFNAR mice, as detected by ELISA, MV-ZIKV-A1 and MV-ZIKV-a12 vectors elicited strong immune responses following priming and boosting protocols with 1 month intervals, comparable to MV-prMEd404 and MV-ssEd445 vectors (fig. 10). However, different amounts of neutralizing antibodies were induced (fig. 11). Only candidate MV-ZIKV-A1 induced a strong neutralization response (2 log stronger). This is related to the complete protection conferred to mice by immunization with MV-ZIKV-A1 against viremia (fig. 12) and protection from lethal challenge (fig. 13).
Taken together, this study shows that the A1 full-length zircard antigen expressed in MV vectors is able to provide detoxication protection against infectious and lethal challenge to immunized animals, associated with strong neutralizing antibody induction.
Sequence listing
<110> Pasteur institute
National Research Center
<120> recombinant measles virus expressing Zika virus protein and use thereof
<130> B12291A/AD/DP/KN
<140> PCT/EP
<141> 2018-06-06
<150> EP17305676.3
<151> 2017-06-07
<160> 171
<170> PatentIn version 3.5
<210> 1
<211> 6
<212> DNA
<213> Artificial sequence
<220>
<223> pTM-MVSchw
<400> 1
gcggccgcta atacgactca ctatagggcc aactttgttt ggtctgatga gtccgtgagg 60
acgaaacccg gagtcccggg tcaccaaaca aagttgggta aggatagttc aatcaatgat 120
catcttctag tgcacttagg attcaagatc ctattatcag ggacaagagc aggattaggg 180
atatccgaga tggccacact tttaaggagc ttagcattgt tcaaaagaaa caaggacaaa 240
ccacccatta catcaggatc cggtggagcc atcagaggaa tcaaacacat tattatagta 300
ccaatccctg gagattcctc aattaccact cgatccagac ttctggaccg gttggtgagg 360
ttaattggaa acccggatgt gagcgggccc aaactaacag gggcactaat aggtatatta 420
tccttatttg tggagtctcc aggtcaattg attcagagga tcaccgatga ccctgacgtt 480
agcataaggc tgttagaggt tgtccagagt gaccagtcac aatctggcct taccttcgca 540
tcaagaggta ccaacatgga ggatgaggcg gaccaatact tttcacatga tgatccaatt 600
agtagtgatc aatccaggtt cggatggttc gggaacaagg aaatctcaga tattgaagtg 660
caagaccctg agggattcaa catgattctg ggtaccatcc tagcccaaat ttgggtcttg 720
ctcgcaaagg cggttacggc cccagacacg gcagctgatt cggagctaag aaggtggata 780
aagtacaccc aacaaagaag ggtagttggt gaatttagat tggagagaaa atggttggat 840
gtggtgagga acaggattgc cgaggacctc tccttacgcc gattcatggt cgctctaatc 900
ctggatatca agagaacacc cggaaacaaa cccaggattg ctgaaatgat atgtgacatt 960
gatacatata tcgtagaggc aggattagcc agttttatcc tgactattaa gtttgggata 1020
gaaactatgt atcctgctct tggactgcat gaatttgctg gtgagttatc cacacttgag 1080
tccttgatga acctttacca gcaaatgggg gaaactgcac cctacatggt aatcctggag 1140
aactcaattc agaacaagtt cagtgcagga tcataccctc tgctctggag ctatgccatg 1200
ggagtaggag tggaacttga aaactccatg ggaggtttga actttggccg atcttacttt 1260
gatccagcat attttagatt agggcaagag atggtaagga ggtcagctgg aaaggtcagt 1320
tccacattgg catctgaact cggtatcact gccgaggatg caaggcttgt ttcagagatt 1380
gcaatgcata ctactgagga caagatcagt agagcggttg gacccagaca agcccaagta 1440
tcatttctac acggtgatca aagtgagaat gagctaccga gattgggggg caaggaagat 1500
aggagggtca aacagagtcg aggagaagcc agggagagct acagagaaac cgggcccagc 1560
agagcaagtg atgcgagagc tgcccatctt ccaaccggca cacccctaga cattgacact 1620
gcaacggagt ccagccaaga tccgcaggac agtcgaaggt cagctgacgc cctgcttagg 1680
ctgcaagcca tggcaggaat ctcggaagaa caaggctcag acacggacac ccctatagtg 1740
tacaatgaca gaaatcttct agactaggtg cgagaggccg agggccagaa caacatccgc 1800
ctaccatcca tcattgttat aaaaaactta ggaaccaggt ccacacagcc gccagcccat 1860
caaccatcca ctcccacgat tggagccaat ggcagaagag caggcacgcc atgtcaaaaa 1920
cggactggaa tgcatccggg ctctcaaggc cgagcccatc ggctcactgg ccatcgagga 1980
agctatggca gcatggtcag aaatatcaga caacccagga caggagcgag ccacctgcag 2040
ggaagagaag gcaggcagtt cgggtctcag caaaccatgc ctctcagcaa ttggatcaac 2100
tgaaggcggt gcacctcgca tccgcggtca gggacctgga gagagcgatg acgacgctga 2160
aactttggga atccccccaa gaaatctcca ggcatcaagc actgggttac agtgttatta 2220
cgtttatgat cacagcggtg aagcggttaa gggaatccaa gatgctgact ctatcatggt 2280
tcaatcaggc cttgatggtg atagcaccct ctcaggagga gacaatgaat ctgaaaacag 2340
cgatgtggat attggcgaac ctgataccga gggatatgct atcactgacc ggggatctgc 2400
tcccatctct atggggttca gggcttctga tgttgaaact gcagaaggag gggagatcca 2460
cgagctcctg agactccaat ccagaggcaa caactttccg aagcttggga aaactctcaa 2520
tgttcctccg cccccggacc ccggtagggc cagcacttcc gggacaccca ttaaaaaggg 2580
cacagacgcg agattagcct catttggaac ggagatcgcg tctttattga caggtggtgc 2640
aacccaatgt gctcgaaagt caccctcgga accatcaggg ccaggtgcac ctgcggggaa 2700
tgtccccgag tgtgtgagca atgccgcact gatacaggag tggacacccg aatctggtac 2760
cacaatctcc ccgagatccc agaataatga agaaggggga gactattatg atgatgagct 2820
gttctctgat gtccaagata ttaaaacagc cttggccaaa atacacgagg ataatcagaa 2880
gataatctcc aagctagaat cactgctgtt attgaaggga gaagttgagt caattaagaa 2940
gcagatcaac aggcaaaata tcagcatatc caccctggaa ggacacctct caagcatcat 3000
gatcgccatt cctggacttg ggaaggatcc caacgacccc actgcagatg tcgaaatcaa 3060
tcccgacttg aaacccatca taggcagaga ttcaggccga gcactggccg aagttctcaa 3120
gaaacccgtt gccagccgac aactccaagg aatgacaaat ggacggacca gttccagagg 3180
acagctgctg aaggaatttc agctaaagcc gatcgggaaa aagatgagct cagccgtcgg 3240
gtttgttcct gacaccggcc ctgcatcacg cagtgtaatc cgctccatta taaaatccag 3300
ccggctagag gaggatcgga agcgttacct gatgactctc cttgatgata tcaaaggagc 3360
caatgatctt gccaagttcc accagatgct gatgaagata ataatgaagt agctacagct 3420
caacttacct gccaacccca tgccagtcga cccaactagt acaacctaaa tccattataa 3480
aaaacttagg agcaaagtga ttgcctccca aggtccacaa tgacagagac ctacgacttc 3540
gacaagtcgg catgggacat caaagggtcg atcgctccga tacaacccac cacctacagt 3600
gatggcaggc tggtgcccca ggtcagagtc atagatcctg gtctaggcga caggaaggat 3660
gaatgcttta tgtacatgtt tctgctgggg gttgttgagg acagcgattc cctagggcct 3720
ccaatcgggc gagcatttgg gttcctgccc ttaggtgttg gcagatccac agcaaagccc 3780
gaaaaactcc tcaaagaggc cactgagctt gacatagttg ttagacgtac agcagggctc 3840
aatgaaaaac tggtgttcta caacaacacc ccactaactc tcctcacacc ttggagaaag 3900
gtcctaacaa cagggagtgt cttcaacgca aaccaagtgt gcaatgcggt taatctgata 3960
ccgctcgata ccccgcagag gttccgtgtt gtttatatga gcatcacccg tctttcggat 4020
aacgggtatt acaccgttcc tagaagaatg ctggaattca gatcggtcaa tgcagtggcc 4080
ttcaacctgc tggtgaccct taggattgac aaggcgatag gccctgggaa gatcatcgac 4140
aatacagagc aacttcctga ggcaacattt atggtccaca tcgggaactt caggagaaag 4200
aagagtgaag tctactctgc cgattattgc aaaatgaaaa tcgaaaagat gggcctggtt 4260
tttgcacttg gtgggatagg gggcaccagt cttcacatta gaagcacagg caaaatgagc 4320
aagactctcc atgcacaact cgggttcaag aagaccttat gttacccgct gatggatatc 4380
aatgaagacc ttaatcgatt actctggagg agcagatgca agatagtaag aatccaggca 4440
gttttgcagc catcagttcc tcaagaattc cgcatttacg acgacgtgat cataaatgat 4500
gaccaaggac tattcaaagt tctgtagacc gtagtgccca gcaatgcccg aaaacgaccc 4560
ccctcacaat gacagccaga aggcccggac aaaaaagccc cctccgaaag actccacgga 4620
ccaagcgaga ggccagccag cagccgacgg caagcgcgaa caccaggcgg ccccagcaca 4680
gaacagccct gacacaaggc caccaccagc caccccaatc tgcatcctcc tcgtgggacc 4740
cccgaggacc aacccccaag gctgcccccg atccaaacca ccaaccgcat ccccaccacc 4800
cccgggaaag aaacccccag caattggaag gcccctcccc ctcttcctca acacaagaac 4860
tccacaaccg aaccgcacaa gcgaccgagg tgacccaacc gcaggcatcc gactccctag 4920
acagatcctc tctccccggc aaactaaaca aaacttaggg ccaaggaaca tacacaccca 4980
acagaaccca gaccccggcc cacggcgccg cgcccccaac ccccgacaac cagagggagc 5040
ccccaaccaa tcccgccggc tcccccggtg cccacaggca gggacaccaa cccccgaaca 5100
gacccagcac ccaaccatcg acaatccaag acgggggggc ccccccaaaa aaaggccccc 5160
aggggccgac agccagcacc gcgaggaagc ccacccaccc cacacacgac cacggcaacc 5220
aaaccagaac ccagaccacc ctgggccacc agctcccaga ctcggccatc accccgcaga 5280
aaggaaaggc cacaacccgc gcaccccagc cccgatccgg cggggagcca cccaacccga 5340
accagcaccc aagagcgatc cccgaaggac ccccgaaccg caaaggacat cagtatccca 5400
cagcctctcc aagtcccccg gtctcctcct cttctcgaag ggaccaaaag atcaatccac 5460
cacacccgac gacactcaac tccccacccc taaaggagac accgggaatc ccagaatcaa 5520
gactcatcca atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt 5580
actgttaact ctccaaacac ccaccggtca aatccattgg ggcaatctct ctaagatagg 5640
ggtggtagga ataggaagtg caagctacaa agttatgact cgttccagcc atcaatcatt 5700
agtcataaaa ttaatgccca atataactct cctcaataac tgcacgaggg tagagattgc 5760
agaatacagg agactactga gaacagtttt ggaaccaatt agagatgcac ttaatgcaat 5820
gacccagaat ataagaccgg ttcagagtgt agcttcaagt aggagacaca agagatttgc 5880
gggagtagtc ctggcaggtg cggccctagg cgttgccaca gctgctcaga taacagccgg 5940
cattgcactt caccagtcca tgctgaactc tcaagccatc gacaatctga gagcgagcct 6000
ggaaactact aatcaggcaa ttgagacaat cagacaagca gggcaggaga tgatattggc 6060
tgttcagggt gtccaagact acatcaataa tgagctgata ccgtctatga accaactatc 6120
ttgtgattta atcggccaga agctcgggct caaattgctc agatactata cagaaatcct 6180
gtcattattt ggccccagtt tacgggaccc catatctgcg gagatatcta tccaggcttt 6240
gagctatgcg cttggaggag acatcaataa ggtgttagaa aagctcggat acagtggagg 6300
tgatttactg ggcatcttag agagcggagg aataaaggcc cggataactc acgtcgacac 6360
agagtcctac ttcattgtcc tcagtatagc ctatccgacg ctgtccgaga ttaagggggt 6420
gattgtccac cggctagagg gggtctcgta caacataggc tctcaagagt ggtataccac 6480
tgtgcccaag tatgttgcaa cccaagggta ccttatctcg aattttgatg agtcatcgtg 6540
tactttcatg ccagagggga ctgtgtgcag ccaaaatgcc ttgtacccga tgagtcctct 6600
gctccaagaa tgcctccggg ggtacaccaa gtcctgtgct cgtacactcg tatccgggtc 6660
ttttgggaac cggttcattt tatcacaagg gaacctaata gccaattgtg catcaatcct 6720
ttgcaagtgt tacacaacag gaacgatcat taatcaagac cctgacaaga tcctaacata 6780
cattgctgcc gatcactgcc cggtagtcga ggtgaacggc gtgaccatcc aagtcgggag 6840
caggaggtat ccagacgctg tgtacttgca cagaattgac ctcggtcctc ccatatcatt 6900
ggagaggttg gacgtaggga caaatctggg gaatgcaatt gctaagttgg aggatgccaa 6960
ggaattgttg gagtcatcgg accagatatt gaggagtatg aaaggtttat cgagcactag 7020
catagtctac atcctgattg cagtgtgtct tggagggttg atagggatcc ccgctttaat 7080
atgttgctgc agggggcgtt gtaacaaaaa gggagaacaa gttggtatgt caagaccagg 7140
cctaaagcct gatcttacgg gaacatcaaa atcctatgta aggtcgctct gatcctctac 7200
aactcttgaa acacaaatgt cccacaagtc tcctcttcgt catcaagcaa ccaccgcacc 7260
cagcatcaag cccacctgaa attatctccg gcttccctct ggccgaacaa tatcggtagt 7320
taatcaaaac ttagggtgca agatcatcca caatgtcacc acaacgagac cggataaatg 7380
ccttctacaa agataacccc catcccaagg gaagtaggat agtcattaac agagaacatc 7440
ttatgattga tagaccttat gttttgctgg ctgttctgtt tgtcatgttt ctgagcttga 7500
tcgggttgct agccattgca ggcattagac ttcatcgggc agccatctac accgcagaga 7560
tccataaaag cctcagcacc aatctagatg taactaactc aatcgagcat caggtcaagg 7620
acgtgctgac accactcttc aaaatcatcg gtgatgaagt gggcctgagg acacctcaga 7680
gattcactga cctagtgaaa ttaatctctg acaagattaa attccttaat ccggataggg 7740
agtacgactt cagagatctc acttggtgta tcaacccgcc agagagaatc aaattggatt 7800
atgatcaata ctgtgcagat gtggctgctg aagagctcat gaatgcattg gtgaactcaa 7860
ctctactgga gaccagaaca accaatcagt tcctagctgt ctcaaaggga aactgctcag 7920
ggcccactac aatcagaggt caattctcaa acatgtcgct gtccctgtta gacttgtatt 7980
taggtcgagg ttacaatgtg tcatctatag tcactatgac atcccaggga atgtatgggg 8040
gaacttacct agtggaaaag cctaatctga gcagcaaaag gtcagagttg tcacaactga 8100
gcatgtaccg agtgtttgaa gtaggtgtta tcagaaatcc gggtttgggg gctccggtgt 8160
tccatatgac aaactatctt gagcaaccag tcagtaatga tctcagcaac tgtatggtgg 8220
ctttggggga gctcaaactc gcagcccttt gtcacgggga agattctatc acaattccct 8280
atcagggatc agggaaaggt gtcagcttcc agctcgtcaa gctaggtgtc tggaaatccc 8340
caaccgacat gcaatcctgg gtccccttat caacggatga tccagtgata gacaggcttt 8400
acctctcatc tcacagaggt gttatcgctg acaatcaagc aaaatgggct gtcccgacaa 8460
cacgaacaga tgacaagttg cgaatggaga catgcttcca acaggcgtgt aagggtaaaa 8520
tccaagcact ctgcgagaat cccgagtggg caccattgaa ggataacagg attccttcat 8580
acggggtctt gtctgttgat ctgagtctga cagttgagct taaaatcaaa attgcttcgg 8640
gattcgggcc attgatcaca cacggttcag ggatggacct atacaaatcc aaccacaaca 8700
atgtgtattg gctgactatc ccgccaatga agaacctagc cttaggtgta atcaacacat 8760
tggagtggat accgagattc aaggttagtc cctacctctt cactgtccca attaaggaag 8820
caggcgaaga ctgccatgcc ccaacatacc tacctgcgga ggtggatggt gatgtcaaac 8880
tcagttccaa tctggtgatt ctacctggtc aagatctcca atatgttttg gcaacctacg 8940
atacttccag ggttgaacat gctgtggttt attacgttta cagcccaagc cgctcatttt 9000
cttactttta tccttttagg ttgcctataa agggggtccc catcgaatta caagtggaat 9060
gcttcacatg ggaccaaaaa ctctggtgcc gtcacttctg tgtgcttgcg gactcagaat 9120
ctggtggaca tatcactcac tctgggatgg tgggcatggg agtcagctgc acagtcaccc 9180
gggaagatgg aaccaatcgc agatagggct gctagtgaac caatcacatg atgtcaccca 9240
gacatcaggc atacccacta gtgtgaaata gacatcagaa ttaagaaaaa cgtagggtcc 9300
aagtggttcc ccgttatgga ctcgctatct gtcaaccaga tcttataccc tgaagttcac 9360
ctagatagcc cgatagttac caataagata gtagccatcc tggagtatgc tcgagtccct 9420
cacgcttaca gcctggagga ccctacactg tgtcagaaca tcaagcaccg cctaaaaaac 9480
ggattttcca accaaatgat tataaacaat gtggaagttg ggaatgtcat caagtccaag 9540
cttaggagtt atccggccca ctctcatatt ccatatccaa attgtaatca ggatttattt 9600
aacatagaag acaaagagtc aacgaggaag atccgtgaac tcctcaaaaa ggggaattcg 9660
ctgtactcca aagtcagtga taaggttttc caatgcttaa gggacactaa ctcacggctt 9720
ggcctaggct ccgaattgag ggaggacatc aaggagaaag ttattaactt gggagtttac 9780
atgcacagct cccagtggtt tgagcccttt ctgttttggt ttacagtcaa gactgagatg 9840
aggtcagtga ttaaatcaca aacccatact tgccatagga ggagacacac acctgtattc 9900
ttcactggta gttcagttga gttgctaatc tctcgtgacc ttgttgctat aatcagtaaa 9960
gagtctcaac atgtatatta cctgacattt gaactggttt tgatgtattg tgatgtcata 10020
gaggggaggt taatgacaga gaccgctatg actattgatg ctaggtatac agagcttcta 10080
ggaagagtca gatacatgtg gaaactgata gatggtttct tccctgcact cgggaatcca 10140
acttatcaaa ttgtagccat gctggagcct ctttcacttg cttacctgca gctgagggat 10200
ataacagtag aactcagagg tgctttcctt aaccactgct ttactgaaat acatgatgtt 10260
cttgaccaaa acgggttttc tgatgaaggt acttatcatg agttaactga agctctagat 10320
tacattttca taactgatga catacatctg acaggggaga ttttctcatt tttcagaagt 10380
ttcggccacc ccagacttga agcagtaacg gctgctgaaa atgttaggaa atacatgaat 10440
cagcctaaag tcattgtgta tgagactctg atgaaaggtc atgccatatt ttgtggaatc 10500
ataatcaacg gctatcgtga caggcacgga ggcagttggc caccgctgac cctccccctg 10560
catgctgcag acacaatccg gaatgctcaa gcttcaggtg aagggttaac acatgagcag 10620
tgcgttgata actggaaatc ttttgctgga gtgaaatttg gctgctttat gcctcttagc 10680
ctggatagtg atctgacaat gtacctaaag gacaaggcac ttgctgctct ccaaagggaa 10740
tgggattcag tttacccgaa agagttcctg cgttacgacc ctcccaaggg aaccgggtca 10800
cggaggcttg tagatgtttt ccttaatgat tcgagctttg acccatatga tgtgataatg 10860
tatgttgtaa gtggagctta cctccatgac cctgagttca acctgtctta cagcctgaaa 10920
gaaaaggaga tcaaggaaac aggtagactt tttgctaaaa tgacttacaa aatgagggca 10980
tgccaagtga ttgctgaaaa tctaatctca aacgggattg gcaaatattt taaggacaat 11040
gggatggcca aggatgagca cgatttgact aaggcactcc acactctagc tgtctcagga 11100
gtccccaaag atctcaaaga aagtcacagg ggggggccag tcttaaaaac ctactcccga 11160
agcccagtcc acacaagtac caggaacgtg agagcagcaa aagggtttat agggttccct 11220
caagtaattc ggcaggacca agacactgat catccggaga atatggaagc ttacgagaca 11280
gtcagtgcat ttatcacgac tgatctcaag aagtactgcc ttaattggag atatgagacc 11340
atcagcttgt ttgcacagag gctaaatgag atttacggat tgccctcatt tttccagtgg 11400
ctgcataaga ggcttgagac ctctgtcctg tatgtaagtg accctcattg cccccccgac 11460
cttgacgccc atatcccgtt atataaagtc cccaatgatc aaatcttcat taagtaccct 11520
atgggaggta tagaagggta ttgtcagaag ctgtggacca tcagcaccat tccctatcta 11580
tacctggctg cttatgagag cggagtaagg attgcttcgt tagtgcaagg ggacaatcag 11640
accatagccg taacaaaaag ggtacccagc acatggccct acaaccttaa gaaacgggaa 11700
gctgctagag taactagaga ttactttgta attcttaggc aaaggctaca tgatattggc 11760
catcacctca aggcaaatga gacaattgtt tcatcacatt tttttgtcta ttcaaaagga 11820
atatattatg atgggctact tgtgtcccaa tcactcaaga gcatcgcaag atgtgtattc 11880
tggtcagaga ctatagttga tgaaacaagg gcagcatgca gtaatattgc tacaacaatg 11940
gctaaaagca tcgagagagg ttatgaccgt taccttgcat attccctgaa cgtcctaaaa 12000
gtgatacagc aaattctgat ctctcttggc ttcacaatca attcaaccat gacccgggat 12060
gtagtcatac ccctcctcac aaacaacgac ctcttaataa ggatggcact gttgcccgct 12120
cctattgggg ggatgaatta tctgaatatg agcaggctgt ttgtcagaaa catcggtgat 12180
ccagtaacat catcaattgc tgatctcaag agaatgattc tcgcctcact aatgcctgaa 12240
gagaccctcc atcaagtaat gacacaacaa ccgggggact cttcattcct agactgggct 12300
agcgaccctt actcagcaaa tcttgtatgt gtccagagca tcactagact cctcaagaac 12360
ataactgcaa ggtttgtcct gatccatagt ccaaacccaa tgttaaaagg attattccat 12420
gatgacagta aagaagagga cgagggactg gcggcattcc tcatggacag gcatattata 12480
gtacctaggg cagctcatga aatcctggat catagtgtca caggggcaag agagtctatt 12540
gcaggcatgc tggataccac aaaaggcttg attcgagcca gcatgaggaa gggggggtta 12600
acctctcgag tgataaccag attgtccaat tatgactatg aacaattcag agcagggatg 12660
gtgctattga caggaagaaa gagaaatgtc ctcattgaca aagagtcatg ttcagtgcag 12720
ctggcgagag ctctaagaag ccatatgtgg gcgaggctag ctcgaggacg gcctatttac 12780
ggccttgagg tccctgatgt actagaatct atgcgaggcc accttattcg gcgtcatgag 12840
acatgtgtca tctgcgagtg tggatcagtc aactacggat ggttttttgt cccctcgggt 12900
tgccaactgg atgatattga caaggaaaca tcatccttga gagtcccata tattggttct 12960
accactgatg agagaacaga catgaagctt gccttcgtaa gagccccaag tcgatccttg 13020
cgatctgctg ttagaatagc aacagtgtac tcatgggctt acggtgatga tgatagctct 13080
tggaacgaag cctggttgtt ggctaggcaa agggccaatg tgagcctgga ggagctaagg 13140
gtgatcactc ccatctcaac ttcgactaat ttagcgcata ggttgaggga tcgtagcact 13200
caagtgaaat actcaggtac atcccttgtc cgagtggcga ggtataccac aatctccaac 13260
gacaatctct catttgtcat atcagataag aaggttgata ctaactttat ataccaacaa 13320
ggaatgcttc tagggttggg tgttttagaa acattgtttc gactcgagaa agataccgga 13380
tcatctaaca cggtattaca tcttcacgtc gaaacagatt gttgcgtgat cccgatgata 13440
gatcatccca ggatacccag ctcccgcaag ctagagctga gggcagagct atgtaccaac 13500
ccattgatat atgataatgc acctttaatt gacagagatg caacaaggct atacacccag 13560
agccatagga ggcaccttgt ggaatttgtt acatggtcca caccccaact atatcacatt 13620
ttagctaagt ccacagcact atctatgatt gacctggtaa caaaatttga gaaggaccat 13680
atgaatgaaa tttcagctct cataggggat gacgatatca atagtttcat aactgagttt 13740
ctgctcatag agccaagatt attcactatc tacttgggcc agtgtgcggc catcaattgg 13800
gcatttgatg tacattatca tagaccatca gggaaatatc agatgggtga gctgttgtca 13860
tcgttccttt ctagaatgag caaaggagtg tttaaggtgc ttgtcaatgc tctaagccac 13920
ccaaagatct acaagaaatt ctggcattgt ggtattatag agcctatcca tggtccttca 13980
cttgatgctc aaaacttgca cacaactgtg tgcaacatgg tttacacatg ctatatgacc 14040
tacctcgacc tgttgttgaa tgaagagtta gaagagttca catttctctt gtgtgaaagc 14100
gacgaggatg tagtaccgga cagattcgac aacatccagg caaaacactt atgtgttctg 14160
gcagatttgt actgtcaacc agggacctgc ccaccaattc gaggtctaag accggtagag 14220
aaatgtgcag ttctaaccga ccatatcaag gcagaggcta tgttatctcc agcaggatct 14280
tcgtggaaca taaatccaat tattgtagac cattactcat gctctctgac ttatctccgg 14340
cgaggatcga tcaaacagat aagattgaga gttgatccag gattcatttt cgacgccctc 14400
gctgaggtaa atgtcagtca gccaaagatc ggcagcaaca acatctcaaa tatgagcatc 14460
aaggctttca gacccccaca cgatgatgtt gcaaaattgc tcaaagatat caacacaagc 14520
aagcacaatc ttcccatttc agggggcaat ctcgccaatt atgaaatcca tgctttccgc 14580
agaatcgggt tgaactcatc tgcttgctac aaagctgttg agatatcaac attaattagg 14640
agatgccttg agccagggga ggacggcttg ttcttgggtg agggatcggg ttctatgttg 14700
atcacttata aagagatact taaactaaac aagtgcttct ataatagtgg ggtttccgcc 14760
aattctagat ctggtcaaag ggaattagca ccctatccct ccgaagttgg ccttgtcgaa 14820
cacagaatgg gagtaggtaa tattgtcaaa gtgctcttta acgggaggcc cgaagtcacg 14880
tgggtaggca gtgtagattg cttcaatttc atagttagta atatccctac ctctagtgtg 14940
gggtttatcc attcagatat agagaccttg cctgacaaag atactataga gaagctagag 15000
gaattggcag ccatcttatc gatggctctg ctcctgggca aaataggatc aatactggtg 15060
attaagctta tgcctttcag cggggatttt gttcagggat ttataagtta tgtagggtct 15120
cattatagag aagtgaacct tgtataccct agatacagca acttcatctc tactgaatct 15180
tatttggtta tgacagatct caaggctaac cggctaatga atcctgaaaa gattaagcag 15240
cagataattg aatcatctgt gaggacttca cctggactta taggtcacat cctatccatt 15300
aagcaactaa gctgcataca agcaattgtg ggagacgcag ttagtagagg tgatatcaat 15360
cctactctga aaaaacttac acctatagag caggtgctga tcaattgcgg gttggcaatt 15420
aacggaccta agctgtgcaa agaattgatc caccatgatg ttgcctcagg gcaagatgga 15480
ttgcttaatt ctatactcat cctctacagg gagttggcaa gattcaaaga caaccaaaga 15540
agtcaacaag ggatgttcca cgcttacccc gtattggtaa gtagcaggca acgagaactt 15600
atatctagga tcacccgcaa attctggggg cacattcttc tttactccgg gaacaaaaag 15660
ttgataaata agtttatcca gaatctcaag tccggctatc tgatactaga cttacaccag 15720
aatatcttcg ttaagaatct atccaagtca gagaaacaga ttattatgac ggggggtttg 15780
aaacgtgagt gggtttttaa ggtaacagtc aaggagacca aagaatggta taagttagtc 15840
ggatacagtg ccctgattaa ggactaattg gttgaactcc ggaaccctaa tcctgcccta 15900
ggtggttagg cattatttgc aatatattaa agaaaacttt gaaaatacga agtttctatt 15960
cccagctttg tctggtggcc ggcatggtcc cagcctcctc gctggcgccg gctgggcaac 16020
attccgaggg gaccgtcccc tcggtaatgg cgaatgggac gcggccgatc cggctgctaa 16080
caaagcccga aaggaagctg agttggctgc tgccaccgct gagcaataac tagcataacc 16140
ccttggggcc tctaaacggg tcttgagggg ttttttgctg aaaggaggaa ctatatccgg 16200
atgcggccgc gggccctatg gtacccagct tttgttccct ttagtgaggg ttaattccga 16260
gcttggcgta atcatggtca tagctgtttc ctgtgtgaaa ttgttatccg ctcacaattc 16320
cacacaacat aggagccgga agcataaagt gtaaagcctg gggtgcctaa tgagtgaggt 16380
aactcacatt aattgcgttg cgctcactgc ccgctttcca gtcgggaaac ctgtcgtgcc 16440
agctgcatta atgaatcggc caacgcgcgg ggagaggcgg tttgcgtatt gggcgctctt 16500
ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag 16560
ctcactcaaa ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca 16620
tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt 16680
tccataggct cggcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc 16740
gaaacccgac aggactataa agataccagg cgttcccccc tggaagctcc ctcgtgcgct 16800
ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg 16860
tggcgctttc tcaatgctca cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca 16920
agctgggctg tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta tccggtaact 16980
atcgtcttga gtccaacccg gtaagacacg acttatcgcc actggcagca gccactggta 17040
acaggattag cagagcgagg tatgtaggcg gtgctacaga gttcttgaag tggtggccta 17100
actacggcta cactagaagg acagtatttg gtatctgcgc tctgctgaag ccagttacct 17160
tcggaaaaag agttggtagc tcttgatccg gcaaacaaac caccgctggt agcggtggtt 17220
tttttgtttg caagcagcag attacgcgca gaaaaaaagg atctcaagaa gatcctttga 17280
tcttttctac ggggtctgac gctcagtgga acgaaaactc acgttaaggg attttggtca 17340
tgagattatc aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga agttttaaat 17400
caatctaaag tatatatgag taaacttggt ctgacagtta ccaatgctta atcagtgagg 17460
cacctatctc agcgatctgt ctatttcgtt catccatagt tgcctgactg cccgtcgtgt 17520
agataactac gatacgggag ggcttaccat ctggccccag tgctgcaatg ataccgcgag 17580
acccacgctc accggctcca gatttatcag caataaacca gccagccgga agggccgagc 17640
gcagaagtgg tcctgcaact ttatccgcct ccatccagtc tattaattgt tgccgggaag 17700
ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt tgttgccatt gctacaggca 17760
tcgtggtgtc acgctcgtcg tttggtatgg cttcattcag ctccggttcc caacgatcaa 17820
ggcgagttac atgatccccc atgttgtgaa aaaaagcggt tagctccttc ggtcctccga 17880
tcgttgtcag aagtaagttg gccgcagtgt tatcactcat gcttatggca gcactgcata 17940
attctcttac tgtcatgcca tccgtaagat gcttttctgt gactggtgag tactcaacca 18000
agtcattctg agaatagtgt atgcggcgac cgagttgctc ttgcccggcg tcaatacggg 18060
ataataccgc gccacatagc agaactttaa aagtgctcat cattggaaaa cgttcttcgg 18120
ggcgaaaact ctcaaggatc ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg 18180
cacccaactg atcttcagca tcttttactt tcaccagcgt ttctgggtga gcaaaaacag 18240
gaaggcaaaa tgccgcaaaa aagggaataa gggcgacacg gaaatgttga atactcatac 18300
tcttcctttt tcaatattat tgaagcattt atcagggtta ttgtctcatg agcggataca 18360
tatttgaatg tatttagaaa aataaacaaa taggggttcc gcgcacattt ccccgaaaag 18420
tgccacctga aattgtaaac gttaatattt tgttaaaatt cgcgttaaat ttttgttaaa 18480
tcagctcatt ttttaaccaa taggccgaaa tcggcaaaat cccttataaa tcaaaagaat 18540
agaccgagat agggttgagt gttgttccag tttggaacaa gagtccacta ttaaagaacg 18600
tggactccaa cgtcaaaggg cgaaaaaccg tctatcaggg cgatggccca ctacgtgaac 18660
catcacccta atcaagtttt ttggggtcga ggtgccgtaa agcactaaat cggaacccta 18720
aagggagccc ccgatttaga gcttgacggg gaaagccggc gaacgtggcg agaaaggaag 18780
ggaagaaagc gaaaggagcg ggcgctaggg cgctggcaag tgtagcggtc acgctgcgcg 18840
taaccaccac acccgccgcg cttaatgcgc cgctacaggg cgcgtcccat tcgccattca 18900
ggctgcgcaa ctgttgggaa gggcgatcgg tgcgggcctc ttcgctatta cgccagccac 18960
cgcggtg 18967
<210> 2
<211> 19795
<212> DNA
<213> Artificial sequence
<220>
<223> pTM2-MVSchw-gfp
<400> 2
gcggccgcta atacgactca ctatagggcc aactttgttt ggtctgatga gtccgtgagg 60
acgaaacccg gagtcccggg tcaccaaaca aagttgggta aggatagttc aatcaatgat 120
catcttctag tgcacttagg attcaagatc ctattatcag ggacaagagc aggattaggg 180
atatccgaga tggccacact tttaaggagc ttagcattgt tcaaaagaaa caaggacaaa 240
ccacccatta catcaggatc cggtggagcc atcagaggaa tcaaacacat tattatagta 300
ccaatccctg gagattcctc aattaccact cgatccagac ttctggaccg gttggtgagg 360
ttaattggaa acccggatgt gagcgggccc aaactaacag gggcactaat aggtatatta 420
tccttatttg tggagtctcc aggtcaattg attcagagga tcaccgatga ccctgacgtt 480
agcataaggc tgttagaggt tgtccagagt gaccagtcac aatctggcct taccttcgca 540
tcaagaggta ccaacatgga ggatgaggcg gaccaatact tttcacatga tgatccaatt 600
agtagtgatc aatccaggtt cggatggttc gggaacaagg aaatctcaga tattgaagtg 660
caagaccctg agggattcaa catgattctg ggtaccatcc tagcccaaat ttgggtcttg 720
ctcgcaaagg cggttacggc cccagacacg gcagctgatt cggagctaag aaggtggata 780
aagtacaccc aacaaagaag ggtagttggt gaatttagat tggagagaaa atggttggat 840
gtggtgagga acaggattgc cgaggacctc tccttacgcc gattcatggt cgctctaatc 900
ctggatatca agagaacacc cggaaacaaa cccaggattg ctgaaatgat atgtgacatt 960
gatacatata tcgtagaggc aggattagcc agttttatcc tgactattaa gtttgggata 1020
gaaactatgt atcctgctct tggactgcat gaatttgctg gtgagttatc cacacttgag 1080
tccttgatga acctttacca gcaaatgggg gaaactgcac cctacatggt aatcctggag 1140
aactcaattc agaacaagtt cagtgcagga tcataccctc tgctctggag ctatgccatg 1200
ggagtaggag tggaacttga aaactccatg ggaggtttga actttggccg atcttacttt 1260
gatccagcat attttagatt agggcaagag atggtaagga ggtcagctgg aaaggtcagt 1320
tccacattgg catctgaact cggtatcact gccgaggatg caaggcttgt ttcagagatt 1380
gcaatgcata ctactgagga caagatcagt agagcggttg gacccagaca agcccaagta 1440
tcatttctac acggtgatca aagtgagaat gagctaccga gattgggggg caaggaagat 1500
aggagggtca aacagagtcg aggagaagcc agggagagct acagagaaac cgggcccagc 1560
agagcaagtg atgcgagagc tgcccatctt ccaaccggca cacccctaga cattgacact 1620
gcaacggagt ccagccaaga tccgcaggac agtcgaaggt cagctgacgc cctgcttagg 1680
ctgcaagcca tggcaggaat ctcggaagaa caaggctcag acacggacac ccctatagtg 1740
tacaatgaca gaaatcttct agactaggtg cgagaggccg agggccagaa caacatccgc 1800
ctaccatcca tcattgttat aaaaaactta ggaaccaggt ccacacagcc gccagcccat 1860
caaccatcca ctcccacgat tggagccaat ggcagaagag caggcacgcc atgtcaaaaa 1920
cggactggaa tgcatccggg ctctcaaggc cgagcccatc ggctcactgg ccatcgagga 1980
agctatggca gcatggtcag aaatatcaga caacccagga caggagcgag ccacctgcag 2040
ggaagagaag gcaggcagtt cgggtctcag caaaccatgc ctctcagcaa ttggatcaac 2100
tgaaggcggt gcacctcgca tccgcggtca gggacctgga gagagcgatg acgacgctga 2160
aactttggga atccccccaa gaaatctcca ggcatcaagc actgggttac agtgttatta 2220
cgtttatgat cacagcggtg aagcggttaa gggaatccaa gatgctgact ctatcatggt 2280
tcaatcaggc cttgatggtg atagcaccct ctcaggagga gacaatgaat ctgaaaacag 2340
cgatgtggat attggcgaac ctgataccga gggatatgct atcactgacc ggggatctgc 2400
tcccatctct atggggttca gggcttctga tgttgaaact gcagaaggag gggagatcca 2460
cgagctcctg agactccaat ccagaggcaa caactttccg aagcttggga aaactctcaa 2520
tgttcctccg cccccggacc ccggtagggc cagcacttcc gggacaccca ttaaaaaggg 2580
cacagacgcg agattagcct catttggaac ggagatcgcg tctttattga caggtggtgc 2640
aacccaatgt gctcgaaagt caccctcgga accatcaggg ccaggtgcac ctgcggggaa 2700
tgtccccgag tgtgtgagca atgccgcact gatacaggag tggacacccg aatctggtac 2760
cacaatctcc ccgagatccc agaataatga agaaggggga gactattatg atgatgagct 2820
gttctctgat gtccaagata ttaaaacagc cttggccaaa atacacgagg ataatcagaa 2880
gataatctcc aagctagaat cactgctgtt attgaaggga gaagttgagt caattaagaa 2940
gcagatcaac aggcaaaata tcagcatatc caccctggaa ggacacctct caagcatcat 3000
gatcgccatt cctggacttg ggaaggatcc caacgacccc actgcagatg tcgaaatcaa 3060
tcccgacttg aaacccatca taggcagaga ttcaggccga gcactggccg aagttctcaa 3120
gaaacccgtt gccagccgac aactccaagg aatgacaaat ggacggacca gttccagagg 3180
acagctgctg aaggaatttc agctaaagcc gatcgggaaa aagatgagct cagccgtcgg 3240
gtttgttcct gacaccggcc ctgcatcacg cagtgtaatc cgctccatta taaaatccag 3300
ccggctagag gaggatcgga agcgttacct gatgactctc cttgatgata tcaaaggagc 3360
caatgatctt gccaagttcc accagatgct gatgaagata ataatgaagt agctacagct 3420
caacttacct gccaacccca tgccagtcga cccaactagc ctaccctcca tcattgttat 3480
aaaaaactta ggaaccaggt ccacacagcc gccagcccat caacgcgtac gatggtgagc 3540
aagggcgagg agctgttcac cggggtggtg cccatcctgg tcgagctgga cggcgacgta 3600
aacggccaca agttcagcgt gtccggcgag ggcgagggcg atgccaccta cggcaagctg 3660
accctgaagt tcatctgcac caccggcaag ctgcccgtgc cctggcccac cctcgtgacc 3720
accctgacct acggcgtgca gtgcttcagc cgctaccccg accacatgaa gcagcacgac 3780
ttcttcaagt ccgccatgcc cgaaggctac gtccaggagc gcaccatctt cttcaaggac 3840
gacggcaact acaagacccg cgccgaggtg aagttcgagg gcgacaccct ggtgaaccgc 3900
atcgagctga agggcatcga cttcaaggag gacggcaaca tcctggggca caagctggag 3960
tacaactaca acagccacaa cgtctatatc atggccgaca agcagaagaa cggcatcaag 4020
gtgaacttca agatccgcca caacatcgag gacggcagcg tgcagctcgc cgaccactac 4080
cagcagaaca cccccatcgg cgacggcccc gtgctgctgc ccgacaacca ctacctgagc 4140
acccagtccg ccctgagcaa agaccccaac gagaagcgcg atcacatggt cctgctggag 4200
ttcgtgaccg ccgccgggat cactctcggc atggacgagc tgtacaagta ggcgcgcagc 4260
gcttagacgt ctcgcgatcg atactagtac aacctaaatc cattataaaa aacttaggag 4320
caaagtgatt gcctcccaag gtccacaatg acagagacct acgacttcga caagtcggca 4380
tgggacatca aagggtcgat cgctccgata caacccacca cctacagtga tggcaggctg 4440
gtgccccagg tcagagtcat agatcctggt ctaggcgaca ggaaggatga atgctttatg 4500
tacatgtttc tgctgggggt tgttgaggac agcgattccc tagggcctcc aatcgggcga 4560
gcatttgggt tcctgccctt aggtgttggc agatccacag caaagcccga aaaactcctc 4620
aaagaggcca ctgagcttga catagttgtt agacgtacag cagggctcaa tgaaaaactg 4680
gtgttctaca acaacacccc actaactctc ctcacacctt ggagaaaggt cctaacaaca 4740
gggagtgtct tcaacgcaaa ccaagtgtgc aatgcggtta atctgatacc gctcgatacc 4800
ccgcagaggt tccgtgttgt ttatatgagc atcacccgtc tttcggataa cgggtattac 4860
accgttccta gaagaatgct ggaattcaga tcggtcaatg cagtggcctt caacctgctg 4920
gtgaccctta ggattgacaa ggcgataggc cctgggaaga tcatcgacaa tacagagcaa 4980
cttcctgagg caacatttat ggtccacatc gggaacttca ggagaaagaa gagtgaagtc 5040
tactctgccg attattgcaa aatgaaaatc gaaaagatgg gcctggtttt tgcacttggt 5100
gggatagggg gcaccagtct tcacattaga agcacaggca aaatgagcaa gactctccat 5160
gcacaactcg ggttcaagaa gaccttatgt tacccgctga tggatatcaa tgaagacctt 5220
aatcgattac tctggaggag cagatgcaag atagtaagaa tccaggcagt tttgcagcca 5280
tcagttcctc aagaattccg catttacgac gacgtgatca taaatgatga ccaaggacta 5340
ttcaaagttc tgtagaccgt agtgcccagc aatgcccgaa aacgaccccc ctcacaatga 5400
cagccagaag gcccggacaa aaaagccccc tccgaaagac tccacggacc aagcgagagg 5460
ccagccagca gccgacggca agcgcgaaca ccaggcggcc ccagcacaga acagccctga 5520
cacaaggcca ccaccagcca ccccaatctg catcctcctc gtgggacccc cgaggaccaa 5580
cccccaaggc tgcccccgat ccaaaccacc aaccgcatcc ccaccacccc cgggaaagaa 5640
acccccagca attggaaggc ccctccccct cttcctcaac acaagaactc cacaaccgaa 5700
ccgcacaagc gaccgaggtg acccaaccgc aggcatccga ctccctagac agatcctctc 5760
tccccggcaa actaaacaaa acttagggcc aaggaacata cacacccaac agaacccaga 5820
ccccggccca cggcgccgcg cccccaaccc ccgacaacca gagggagccc ccaaccaatc 5880
ccgccggctc ccccggtgcc cacaggcagg gacaccaacc cccgaacaga cccagcaccc 5940
aaccatcgac aatccaagac gggggggccc ccccaaaaaa aggcccccag gggccgacag 6000
ccagcaccgc gaggaagccc acccacccca cacacgacca cggcaaccaa accagaaccc 6060
agaccaccct gggccaccag ctcccagact cggccatcac cccgcagaaa ggaaaggcca 6120
caacccgcgc accccagccc cgatccggcg gggagccacc caacccgaac cagcacccaa 6180
gagcgatccc cgaaggaccc ccgaaccgca aaggacatca gtatcccaca gcctctccaa 6240
gtcccccggt ctcctcctct tctcgaaggg accaaaagat caatccacca cacccgacga 6300
cactcaactc cccaccccta aaggagacac cgggaatccc agaatcaaga ctcatccaat 6360
gtccatcatg ggtctcaagg tgaacgtctc tgccatattc atggcagtac tgttaactct 6420
ccaaacaccc accggtcaaa tccattgggg caatctctct aagatagggg tggtaggaat 6480
aggaagtgca agctacaaag ttatgactcg ttccagccat caatcattag tcataaaatt 6540
aatgcccaat ataactctcc tcaataactg cacgagggta gagattgcag aatacaggag 6600
actactgaga acagttttgg aaccaattag agatgcactt aatgcaatga cccagaatat 6660
aagaccggtt cagagtgtag cttcaagtag gagacacaag agatttgcgg gagtagtcct 6720
ggcaggtgcg gccctaggcg ttgccacagc tgctcagata acagccggca ttgcacttca 6780
ccagtccatg ctgaactctc aagccatcga caatctgaga gcgagcctgg aaactactaa 6840
tcaggcaatt gagacaatca gacaagcagg gcaggagatg atattggctg ttcagggtgt 6900
ccaagactac atcaataatg agctgatacc gtctatgaac caactatctt gtgatttaat 6960
cggccagaag ctcgggctca aattgctcag atactataca gaaatcctgt cattatttgg 7020
ccccagttta cgggacccca tatctgcgga gatatctatc caggctttga gctatgcgct 7080
tggaggagac atcaataagg tgttagaaaa gctcggatac agtggaggtg atttactggg 7140
catcttagag agcggaggaa taaaggcccg gataactcac gtcgacacag agtcctactt 7200
cattgtcctc agtatagcct atccgacgct gtccgagatt aagggggtga ttgtccaccg 7260
gctagagggg gtctcgtaca acataggctc tcaagagtgg tataccactg tgcccaagta 7320
tgttgcaacc caagggtacc ttatctcgaa ttttgatgag tcatcgtgta ctttcatgcc 7380
agaggggact gtgtgcagcc aaaatgcctt gtacccgatg agtcctctgc tccaagaatg 7440
cctccggggg tacaccaagt cctgtgctcg tacactcgta tccgggtctt ttgggaaccg 7500
gttcatttta tcacaaggga acctaatagc caattgtgca tcaatccttt gcaagtgtta 7560
cacaacagga acgatcatta atcaagaccc tgacaagatc ctaacataca ttgctgccga 7620
tcactgcccg gtagtcgagg tgaacggcgt gaccatccaa gtcgggagca ggaggtatcc 7680
agacgctgtg tacttgcaca gaattgacct cggtcctccc atatcattgg agaggttgga 7740
cgtagggaca aatctgggga atgcaattgc taagttggag gatgccaagg aattgttgga 7800
gtcatcggac cagatattga ggagtatgaa aggtttatcg agcactagca tagtctacat 7860
cctgattgca gtgtgtcttg gagggttgat agggatcccc gctttaatat gttgctgcag 7920
ggggcgttgt aacaaaaagg gagaacaagt tggtatgtca agaccaggcc taaagcctga 7980
tcttacggga acatcaaaat cctatgtaag gtcgctctga tcctctacaa ctcttgaaac 8040
acaaatgtcc cacaagtctc ctcttcgtca tcaagcaacc accgcaccca gcatcaagcc 8100
cacctgaaat tatctccggc ttccctctgg ccgaacaata tcggtagtta atcaaaactt 8160
agggtgcaag atcatccaca atgtcaccac aacgagaccg gataaatgcc ttctacaaag 8220
ataaccccca tcccaaggga agtaggatag tcattaacag agaacatctt atgattgata 8280
gaccttatgt tttgctggct gttctgtttg tcatgtttct gagcttgatc gggttgctag 8340
ccattgcagg cattagactt catcgggcag ccatctacac cgcagagatc cataaaagcc 8400
tcagcaccaa tctagatgta actaactcaa tcgagcatca ggtcaaggac gtgctgacac 8460
cactcttcaa aatcatcggt gatgaagtgg gcctgaggac acctcagaga ttcactgacc 8520
tagtgaaatt aatctctgac aagattaaat tccttaatcc ggatagggag tacgacttca 8580
gagatctcac ttggtgtatc aacccgccag agagaatcaa attggattat gatcaatact 8640
gtgcagatgt ggctgctgaa gagctcatga atgcattggt gaactcaact ctactggaga 8700
ccagaacaac caatcagttc ctagctgtct caaagggaaa ctgctcaggg cccactacaa 8760
tcagaggtca attctcaaac atgtcgctgt ccctgttaga cttgtattta ggtcgaggtt 8820
acaatgtgtc atctatagtc actatgacat cccagggaat gtatggggga acttacctag 8880
tggaaaagcc taatctgagc agcaaaaggt cagagttgtc acaactgagc atgtaccgag 8940
tgtttgaagt aggtgttatc agaaatccgg gtttgggggc tccggtgttc catatgacaa 9000
actatcttga gcaaccagtc agtaatgatc tcagcaactg tatggtggct ttgggggagc 9060
tcaaactcgc agccctttgt cacggggaag attctatcac aattccctat cagggatcag 9120
ggaaaggtgt cagcttccag ctcgtcaagc taggtgtctg gaaatcccca accgacatgc 9180
aatcctgggt ccccttatca acggatgatc cagtgataga caggctttac ctctcatctc 9240
acagaggtgt tatcgctgac aatcaagcaa aatgggctgt cccgacaaca cgaacagatg 9300
acaagttgcg aatggagaca tgcttccaac aggcgtgtaa gggtaaaatc caagcactct 9360
gcgagaatcc cgagtgggca ccattgaagg ataacaggat tccttcatac ggggtcttgt 9420
ctgttgatct gagtctgaca gttgagctta aaatcaaaat tgcttcggga ttcgggccat 9480
tgatcacaca cggttcaggg atggacctat acaaatccaa ccacaacaat gtgtattggc 9540
tgactatccc gccaatgaag aacctagcct taggtgtaat caacacattg gagtggatac 9600
cgagattcaa ggttagtccc tacctcttca ctgtcccaat taaggaagca ggcgaagact 9660
gccatgcccc aacataccta cctgcggagg tggatggtga tgtcaaactc agttccaatc 9720
tggtgattct acctggtcaa gatctccaat atgttttggc aacctacgat acttccaggg 9780
ttgaacatgc tgtggtttat tacgtttaca gcccaagccg ctcattttct tacttttatc 9840
cttttaggtt gcctataaag ggggtcccca tcgaattaca agtggaatgc ttcacatggg 9900
accaaaaact ctggtgccgt cacttctgtg tgcttgcgga ctcagaatct ggtggacata 9960
tcactcactc tgggatggtg ggcatgggag tcagctgcac agtcacccgg gaagatggaa 10020
ccaatcgcag atagggctgc tagtgaacca atcacatgat gtcacccaga catcaggcat 10080
acccactagt gtgaaataga catcagaatt aagaaaaacg tagggtccaa gtggttcccc 10140
gttatggact cgctatctgt caaccagatc ttataccctg aagttcacct agatagcccg 10200
atagttacca ataagatagt agccatcctg gagtatgctc gagtccctca cgcttacagc 10260
ctggaggacc ctacactgtg tcagaacatc aagcaccgcc taaaaaacgg attttccaac 10320
caaatgatta taaacaatgt ggaagttggg aatgtcatca agtccaagct taggagttat 10380
ccggcccact ctcatattcc atatccaaat tgtaatcagg atttatttaa catagaagac 10440
aaagagtcaa cgaggaagat ccgtgaactc ctcaaaaagg ggaattcgct gtactccaaa 10500
gtcagtgata aggttttcca atgcttaagg gacactaact cacggcttgg cctaggctcc 10560
gaattgaggg aggacatcaa ggagaaagtt attaacttgg gagtttacat gcacagctcc 10620
cagtggtttg agccctttct gttttggttt acagtcaaga ctgagatgag gtcagtgatt 10680
aaatcacaaa cccatacttg ccataggagg agacacacac ctgtattctt cactggtagt 10740
tcagttgagt tgctaatctc tcgtgacctt gttgctataa tcagtaaaga gtctcaacat 10800
gtatattacc tgacatttga actggttttg atgtattgtg atgtcataga ggggaggtta 10860
atgacagaga ccgctatgac tattgatgct aggtatacag agcttctagg aagagtcaga 10920
tacatgtgga aactgataga tggtttcttc cctgcactcg ggaatccaac ttatcaaatt 10980
gtagccatgc tggagcctct ttcacttgct tacctgcagc tgagggatat aacagtagaa 11040
ctcagaggtg ctttccttaa ccactgcttt actgaaatac atgatgttct tgaccaaaac 11100
gggttttctg atgaaggtac ttatcatgag ttaactgaag ctctagatta cattttcata 11160
actgatgaca tacatctgac aggggagatt ttctcatttt tcagaagttt cggccacccc 11220
agacttgaag cagtaacggc tgctgaaaat gttaggaaat acatgaatca gcctaaagtc 11280
attgtgtatg agactctgat gaaaggtcat gccatatttt gtggaatcat aatcaacggc 11340
tatcgtgaca ggcacggagg cagttggcca ccgctgaccc tccccctgca tgctgcagac 11400
acaatccgga atgctcaagc ttcaggtgaa gggttaacac atgagcagtg cgttgataac 11460
tggaaatctt ttgctggagt gaaatttggc tgctttatgc ctcttagcct ggatagtgat 11520
ctgacaatgt acctaaagga caaggcactt gctgctctcc aaagggaatg ggattcagtt 11580
tacccgaaag agttcctgcg ttacgaccct cccaagggaa ccgggtcacg gaggcttgta 11640
gatgttttcc ttaatgattc gagctttgac ccatatgatg tgataatgta tgttgtaagt 11700
ggagcttacc tccatgaccc tgagttcaac ctgtcttaca gcctgaaaga aaaggagatc 11760
aaggaaacag gtagactttt tgctaaaatg acttacaaaa tgagggcatg ccaagtgatt 11820
gctgaaaatc taatctcaaa cgggattggc aaatatttta aggacaatgg gatggccaag 11880
gatgagcacg atttgactaa ggcactccac actctagctg tctcaggagt ccccaaagat 11940
ctcaaagaaa gtcacagggg ggggccagtc ttaaaaacct actcccgaag cccagtccac 12000
acaagtacca ggaacgtgag agcagcaaaa gggtttatag ggttccctca agtaattcgg 12060
caggaccaag acactgatca tccggagaat atggaagctt acgagacagt cagtgcattt 12120
atcacgactg atctcaagaa gtactgcctt aattggagat atgagaccat cagcttgttt 12180
gcacagaggc taaatgagat ttacggattg ccctcatttt tccagtggct gcataagagg 12240
cttgagacct ctgtcctgta tgtaagtgac cctcattgcc cccccgacct tgacgcccat 12300
atcccgttat ataaagtccc caatgatcaa atcttcatta agtaccctat gggaggtata 12360
gaagggtatt gtcagaagct gtggaccatc agcaccattc cctatctata cctggctgct 12420
tatgagagcg gagtaaggat tgcttcgtta gtgcaagggg acaatcagac catagccgta 12480
acaaaaaggg tacccagcac atggccctac aaccttaaga aacgggaagc tgctagagta 12540
actagagatt actttgtaat tcttaggcaa aggctacatg atattggcca tcacctcaag 12600
gcaaatgaga caattgtttc atcacatttt tttgtctatt caaaaggaat atattatgat 12660
gggctacttg tgtcccaatc actcaagagc atcgcaagat gtgtattctg gtcagagact 12720
atagttgatg aaacaagggc agcatgcagt aatattgcta caacaatggc taaaagcatc 12780
gagagaggtt atgaccgtta ccttgcatat tccctgaacg tcctaaaagt gatacagcaa 12840
attctgatct ctcttggctt cacaatcaat tcaaccatga cccgggatgt agtcataccc 12900
ctcctcacaa acaacgacct cttaataagg atggcactgt tgcccgctcc tattgggggg 12960
atgaattatc tgaatatgag caggctgttt gtcagaaaca tcggtgatcc agtaacatca 13020
tcaattgctg atctcaagag aatgattctc gcctcactaa tgcctgaaga gaccctccat 13080
caagtaatga cacaacaacc gggggactct tcattcctag actgggctag cgacccttac 13140
tcagcaaatc ttgtatgtgt ccagagcatc actagactcc tcaagaacat aactgcaagg 13200
tttgtcctga tccatagtcc aaacccaatg ttaaaaggat tattccatga tgacagtaaa 13260
gaagaggacg agggactggc ggcattcctc atggacaggc atattatagt acctagggca 13320
gctcatgaaa tcctggatca tagtgtcaca ggggcaagag agtctattgc aggcatgctg 13380
gataccacaa aaggcttgat tcgagccagc atgaggaagg gggggttaac ctctcgagtg 13440
ataaccagat tgtccaatta tgactatgaa caattcagag cagggatggt gctattgaca 13500
ggaagaaaga gaaatgtcct cattgacaaa gagtcatgtt cagtgcagct ggcgagagct 13560
ctaagaagcc atatgtgggc gaggctagct cgaggacggc ctatttacgg ccttgaggtc 13620
cctgatgtac tagaatctat gcgaggccac cttattcggc gtcatgagac atgtgtcatc 13680
tgcgagtgtg gatcagtcaa ctacggatgg ttttttgtcc cctcgggttg ccaactggat 13740
gatattgaca aggaaacatc atccttgaga gtcccatata ttggttctac cactgatgag 13800
agaacagaca tgaagcttgc cttcgtaaga gccccaagtc gatccttgcg atctgctgtt 13860
agaatagcaa cagtgtactc atgggcttac ggtgatgatg atagctcttg gaacgaagcc 13920
tggttgttgg ctaggcaaag ggccaatgtg agcctggagg agctaagggt gatcactccc 13980
atctcaactt cgactaattt agcgcatagg ttgagggatc gtagcactca agtgaaatac 14040
tcaggtacat cccttgtccg agtggcgagg tataccacaa tctccaacga caatctctca 14100
tttgtcatat cagataagaa ggttgatact aactttatat accaacaagg aatgcttcta 14160
gggttgggtg ttttagaaac attgtttcga ctcgagaaag ataccggatc atctaacacg 14220
gtattacatc ttcacgtcga aacagattgt tgcgtgatcc cgatgataga tcatcccagg 14280
atacccagct cccgcaagct agagctgagg gcagagctat gtaccaaccc attgatatat 14340
gataatgcac ctttaattga cagagatgca acaaggctat acacccagag ccataggagg 14400
caccttgtgg aatttgttac atggtccaca ccccaactat atcacatttt agctaagtcc 14460
acagcactat ctatgattga cctggtaaca aaatttgaga aggaccatat gaatgaaatt 14520
tcagctctca taggggatga cgatatcaat agtttcataa ctgagtttct gctcatagag 14580
ccaagattat tcactatcta cttgggccag tgtgcggcca tcaattgggc atttgatgta 14640
cattatcata gaccatcagg gaaatatcag atgggtgagc tgttgtcatc gttcctttct 14700
agaatgagca aaggagtgtt taaggtgctt gtcaatgctc taagccaccc aaagatctac 14760
aagaaattct ggcattgtgg tattatagag cctatccatg gtccttcact tgatgctcaa 14820
aacttgcaca caactgtgtg caacatggtt tacacatgct atatgaccta cctcgacctg 14880
ttgttgaatg aagagttaga agagttcaca tttctcttgt gtgaaagcga cgaggatgta 14940
gtaccggaca gattcgacaa catccaggca aaacacttat gtgttctggc agatttgtac 15000
tgtcaaccag ggacctgccc accaattcga ggtctaagac cggtagagaa atgtgcagtt 15060
ctaaccgacc atatcaaggc agaggctatg ttatctccag caggatcttc gtggaacata 15120
aatccaatta ttgtagacca ttactcatgc tctctgactt atctccggcg aggatcgatc 15180
aaacagataa gattgagagt tgatccagga ttcattttcg acgccctcgc tgaggtaaat 15240
gtcagtcagc caaagatcgg cagcaacaac atctcaaata tgagcatcaa ggctttcaga 15300
cccccacacg atgatgttgc aaaattgctc aaagatatca acacaagcaa gcacaatctt 15360
cccatttcag ggggcaatct cgccaattat gaaatccatg ctttccgcag aatcgggttg 15420
aactcatctg cttgctacaa agctgttgag atatcaacat taattaggag atgccttgag 15480
ccaggggagg acggcttgtt cttgggtgag ggatcgggtt ctatgttgat cacttataaa 15540
gagatactta aactaaacaa gtgcttctat aatagtgggg tttccgccaa ttctagatct 15600
ggtcaaaggg aattagcacc ctatccctcc gaagttggcc ttgtcgaaca cagaatggga 15660
gtaggtaata ttgtcaaagt gctctttaac gggaggcccg aagtcacgtg ggtaggcagt 15720
gtagattgct tcaatttcat agttagtaat atccctacct ctagtgtggg gtttatccat 15780
tcagatatag agaccttgcc tgacaaagat actatagaga agctagagga attggcagcc 15840
atcttatcga tggctctgct cctgggcaaa ataggatcaa tactggtgat taagcttatg 15900
cctttcagcg gggattttgt tcagggattt ataagttatg tagggtctca ttatagagaa 15960
gtgaaccttg tataccctag atacagcaac ttcatctcta ctgaatctta tttggttatg 16020
acagatctca aggctaaccg gctaatgaat cctgaaaaga ttaagcagca gataattgaa 16080
tcatctgtga ggacttcacc tggacttata ggtcacatcc tatccattaa gcaactaagc 16140
tgcatacaag caattgtggg agacgcagtt agtagaggtg atatcaatcc tactctgaaa 16200
aaacttacac ctatagagca ggtgctgatc aattgcgggt tggcaattaa cggacctaag 16260
ctgtgcaaag aattgatcca ccatgatgtt gcctcagggc aagatggatt gcttaattct 16320
atactcatcc tctacaggga gttggcaaga ttcaaagaca accaaagaag tcaacaaggg 16380
atgttccacg cttaccccgt attggtaagt agcaggcaac gagaacttat atctaggatc 16440
acccgcaaat tctgggggca cattcttctt tactccggga acaaaaagtt gataaataag 16500
tttatccaga atctcaagtc cggctatctg atactagact tacaccagaa tatcttcgtt 16560
aagaatctat ccaagtcaga gaaacagatt attatgacgg ggggtttgaa acgtgagtgg 16620
gtttttaagg taacagtcaa ggagaccaaa gaatggtata agttagtcgg atacagtgcc 16680
ctgattaagg actaattggt tgaactccgg aaccctaatc ctgccctagg tggttaggca 16740
ttatttgcaa tatattaaag aaaactttga aaatacgaag tttctattcc cagctttgtc 16800
tggtggccgg catggtccca gcctcctcgc tggcgccggc tgggcaacat tccgagggga 16860
ccgtcccctc ggtaatggcg aatgggacgc ggccgatccg gctgctaaca aagcccgaaa 16920
ggaagctgag ttggctgctg ccaccgctga gcaataacta gcataacccc ttggggcctc 16980
taaacgggtc ttgaggggtt ttttgctgaa aggaggaact atatccggat gcggccgcgg 17040
gccctatggt acccagcttt tgttcccttt agtgagggtt aattccgagc ttggcgtaat 17100
catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca cacaacatag 17160
gagccggaag cataaagtgt aaagcctggg gtgcctaatg agtgaggtaa ctcacattaa 17220
ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag ctgcattaat 17280
gaatcggcca acgcgcgggg agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc 17340
tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 17400
cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 17460
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcg 17520
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 17580
gactataaag ataccaggcg ttcccccctg gaagctccct cgtgcgctct cctgttccga 17640
ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc 17700
aatgctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 17760
tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt 17820
ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca 17880
gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca 17940
ctagaaggac agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag 18000
ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca 18060
agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 18120
ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa 18180
aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta 18240
tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag 18300
cgatctgtct atttcgttca tccatagttg cctgactgcc cgtcgtgtag ataactacga 18360
tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac 18420
cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc 18480
ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta 18540
gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac 18600
gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat 18660
gatcccccat gttgtgaaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa 18720
gtaagttggc cgcagtgtta tcactcatgc ttatggcagc actgcataat tctcttactg 18780
tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag 18840
aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc 18900
cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct 18960
caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat 19020
cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg 19080
ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc 19140
aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta 19200
tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgaaa 19260
ttgtaaacgt taatattttg ttaaaattcg cgttaaattt ttgttaaatc agctcatttt 19320
ttaaccaata ggccgaaatc ggcaaaatcc cttataaatc aaaagaatag accgagatag 19380
ggttgagtgt tgttccagtt tggaacaaga gtccactatt aaagaacgtg gactccaacg 19440
tcaaagggcg aaaaaccgtc tatcagggcg atggcccact acgtgaacca tcaccctaat 19500
caagtttttt ggggtcgagg tgccgtaaag cactaaatcg gaaccctaaa gggagccccc 19560
gatttagagc ttgacgggga aagccggcga acgtggcgag aaaggaaggg aagaaagcga 19620
aaggagcggg cgctagggcg ctggcaagtg tagcggtcac gctgcgcgta accaccacac 19680
ccgccgcgct taatgcgccg ctacagggcg cgtcccattc gccattcagg ctgcgcaact 19740
gttgggaagg gcgatcggtg cgggcctctt cgctattacg ccagccaccg cggtg 19795
<210> 3
<211> 72
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of signal peptide from the capsid of ZIKV (sp)
<400> 3
atggagaaga agagacgagg cgcagatact agtgtcggaa ttgttggcct cctgctgacc 60
acagctatgg ca 72
<210> 4
<211> 72
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of signal peptide from the capsid of ZIKV (sp)
<400> 4
atggagaaga agcggagagg agcagacaca agcgtgggaa tcgtgggcct gctgctgacc 60
acagcaatgg ca 72
<210> 5
<211> 24
<212> PRT
<213> Artificial sequence
<220>
<223> Signal peptide from the capsid of ZIKV (sp)
<400> 5
Met Glu Lys Lys Arg Arg Gly Ala Asp Thr Ser Val Gly Ile Val Gly
1 5 10 15
Leu Leu Leu Thr Thr Ala Met Ala
20
<210> 6
<211> 51
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of signal peptide from the membrane protein of ZIKV (sp?
<400> 6
caaaaagtca tatacttggt catgatactg ctgattgccc cggcatacag c 51
<210> 7
<211> 52
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of signal peptide from the membrane protein of ZIKV (sp?
<400> 7
cagaaagtga tctacctggt catgatcctg ctgatcgctc ctgcctattc ta 52
<210> 8
<211> 17
<212> PRT
<213> Artificial sequence
<220>
<223> Signal peptide from the membrane protein of ZIKV (sp?
<400> 8
Gln Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile Ala Pro Ala Tyr
1 5 10 15
Ser
<210> 9
<211> 72
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of signal peptide from the capsid of JEV (JEVsp)
<400> 9
atgggcaaac gatcagccgg ctcaatcatg tggctcgcga gcttggcagt tgtcatagct 60
tgtgcaggag cc 72
<210> 10
<211> 72
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of signal peptide from the capsid of JEV (JEVsp)
<400> 10
atgggcaaga ggtccgcagg gagcattatg tggctggcat ctctggcagt cgtcatcgct 60
tgtgcaggag ca 72
<210> 11
<211> 24
<212> PRT
<213> Artificial sequence
<220>
<223> Signal peptide from the capsid of JEV (JEVsp)
<400> 11
Met Gly Lys Arg Ser Ala Gly Ser Ile Met Trp Leu Ala Ser Leu Ala
1 5 10 15
Val Val Ile Ala Cys Ala Gly Ala
20
<210> 12
<211> 87
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of signal peptide from the fusion protein of MV (MVsp)
<400> 12
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca aatccat 87
<210> 13
<211> 87
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of signal peptide from the fusion protein of MV (MVsp)
<400> 13
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca gatccac 87
<210> 14
<211> 29
<212> PRT
<213> Artificial sequence
<220>
<223> Signal peptide from the fusion protein of MV (MVsp)
<400> 14
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln Ile His
20 25
<210> 15
<211> 81
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of modified signal peptide from the fusion protein of MV (MVsp?
<400> 15
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca a 81
<210> 16
<211> 81
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of modified signal peptide from the fusion protein of MV (MVsp?
<400> 16
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca g 81
<210> 17
<211> 27
<212> PRT
<213> Artificial sequence
<220>
<223> Modified signal peptide from the fusion protein of MV (MVsp?
<400> 17
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln
20 25
<210> 18
<211> 504
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of precursor of membrane (prM) protein of ZIKV
<400> 18
gcggaggtca ctagacgtgg gagtgcatac tatatgtact tggacagaaa cgatgctggg 60
gaggccatat cttttccaac cacattgggg atgaataagt gttatataca gatcatggat 120
cttggacaca tgtgtgatgc caccatgagc tatgaatgcc ctatgctgga tgagggggtg 180
gaaccagatg acgtcgattg ttggtgcaac acgacgtcaa cttgggttgt gtacggaacc 240
tgccatcaca aaaaaggtga agcacggaga tctagaagag ctgtgacgct cccctcccat 300
tccactagga agctgcaaac gcggtcgcaa acctggttgg aatcaagaga atacacaaag 360
cacttgatta gagtcgaaaa ttggatattc aggaaccctg gcttcgcgtt agcagcagct 420
gccatcgctt ggcttttggg aagctcaacg agccaaaaag tcatatactt ggtcatgata 480
ctgctgattg ccccggcata cagc 504
<210> 19
<211> 505
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of precursor of membrane (prM) protein of ZIKV
<400> 19
gcagaggtga ccaggagagg aagcgcctac tatatgtacc tggacaggaa tgatgccggc 60
gaggccatct ccttcccaac cacactgggc atgaacaagt gctacatcca gatcatggac 120
ctgggccaca tgtgcgatgc caccatgtcc tatgagtgtc caatgctgga cgagggcgtg 180
gagcccgacg atgtggattg ctggtgtaat accacatcta catgggtggt gtacggcacc 240
tgtcaccaca agaagggaga ggcccggcgg agccggcggg ccgtgacact gccttcccac 300
tctaccagga agctgcagac acgcagccag acctggctgg agtccagaga gtataccaag 360
cacctgatca gggtggagaa ctggatcttt cgcaatccag gattcgcact ggcagcagca 420
gcaatcgcat ggctgctggg aagctccacc agccagaaag tgatctacct ggtcatgatc 480
ctgctgatcg ctcctgccta ttcta 505
<210> 20
<211> 168
<212> PRT
<213> Artificial sequence
<220>
<223> Precursor of membrane (prM) protein of ZIKV
<400> 20
Ala Glu Val Thr Arg Arg Gly Ser Ala Tyr Tyr Met Tyr Leu Asp Arg
1 5 10 15
Asn Asp Ala Gly Glu Ala Ile Ser Phe Pro Thr Thr Leu Gly Met Asn
20 25 30
Lys Cys Tyr Ile Gln Ile Met Asp Leu Gly His Met Cys Asp Ala Thr
35 40 45
Met Ser Tyr Glu Cys Pro Met Leu Asp Glu Gly Val Glu Pro Asp Asp
50 55 60
Val Asp Cys Trp Cys Asn Thr Thr Ser Thr Trp Val Val Tyr Gly Thr
65 70 75 80
Cys His His Lys Lys Gly Glu Ala Arg Arg Ser Arg Arg Ala Val Thr
85 90 95
Leu Pro Ser His Ser Thr Arg Lys Leu Gln Thr Arg Ser Gln Thr Trp
100 105 110
Leu Glu Ser Arg Glu Tyr Thr Lys His Leu Ile Arg Val Glu Asn Trp
115 120 125
Ile Phe Arg Asn Pro Gly Phe Ala Leu Ala Ala Ala Ala Ile Ala Trp
130 135 140
Leu Leu Gly Ser Ser Thr Ser Gln Lys Val Ile Tyr Leu Val Met Ile
145 150 155 160
Leu Leu Ile Ala Pro Ala Tyr Ser
165
<210> 21
<211> 1512
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of full-length E protein of ZIKV
<400> 21
atcaggtgca taggagtcag caatagggac tttgtggaag gtatgtcagg tgggacttgg 60
gttgatgttg tcttggaaca tggaggttgt gtcaccgtaa tggcacagga caaaccgact 120
gtcgacatag agctggttac aacaacagtc agcaacatgg cggaggtaag atcctactgc 180
tatgaggcat caatatcaga catggcttcg gacagccgct gcccaacaca aggtgaagcc 240
taccttgaca agcaatcaga cactcaatat gtctgcaaaa gaacgttagt ggacagaggc 300
tggggaaatg gatgtggact ttttggcaaa gggagcctgg tgacatgcgc taagtttgca 360
tgctccaaga aaatgaccgg gaagagcatc cagccagaga atctggagta ccggataatg 420
ctgtcagttc atggctccca gcacagtggg atgatcgtta atgacacagg acatgaaact 480
gatgagaata gagcgaaggt tgagataacg cccaattcac caagagccga agccaccctg 540
gggggttttg gaagcctagg acttgattgt gaaccgagga caggccttga cttttcagat 600
ttgtattact tgactatgaa taacaagcac tggttggttc acaaggagtg gttccacgac 660
attccattac cttggcacgc tggggcagac accggaactc cacactggaa caacaaagaa 720
gcactggtag agttcaagga cgcacatgcc aaaaggcaaa ctgtcgtggt tctagggagt 780
caagaaggag cagttcacac ggcccttgct ggagctctgg aggctgagat ggatggtgca 840
aagggaaggc tgtcctctgg ccacttgaaa tgtcgcctga aaatggataa acttagattg 900
aagggcgtgt catactcctt gtgtaccgca gcgttcacat tcaccaagat cccggctgaa 960
acactgcacg ggacagtcac agtggaggta cagtacgcag ggacagatgg accttgcaag 1020
gttccagctc agatggcggt ggacatgcaa actctgaccc cagttgggag gttgataacc 1080
gctaaccccg taatcactga aagcactgag aactctaaga tgatgctgga acttgatcca 1140
ccatttgggg actcttacat tgtcatagga gtcggggaga agaagatcac ccaccactgg 1200
cacaggagtg gcagcaccat tggaaaagca tttgaagcca ctgtgagagg tgccaagaga 1260
atggcagtct tgggagacac agcctgggac tttggatcag ttggaggcgc tctcaactca 1320
ttgggcaagg gcatccatca aatttttgga gcagctttca aatcattgtt tggaggaatg 1380
tcctggttct cacaaattct cattggaacg ttgctgatgt ggttgggtct gaacacaaag 1440
aatggatcta tttcccttat gtgcttggcc ttagggggag tgttgatctt cttatccaca 1500
gccgtctctg ct 1512
<210> 22
<211> 1511
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of full-length E protein of ZIKV
<400> 22
tccggtgcat cggcgtgagc aatagagact tcgtggaggg aatgtccgga ggaacctggg 60
tggatgtggt gctggagcac ggcggctgcg tgacagtgat ggcccaggac aagccaaccg 120
tggatatcga gctggtgacc acaaccgtgt ccaacatggc cgaggtgagg tcttactgct 180
atgaggccag catctccgac atggcctctg atagcaggtg tccaacccag ggagaggcat 240
acctggacaa gcagtccgat acacagtacg tgtgcaagcg gaccctggtg gacagaggct 300
ggggcaatgg ctgtggcctg tttggcaagg gctctctggt gacatgcgcc aagttcgcct 360
gtagcaagaa gatgaccggc aagtccatcc agccagagaa cctggagtac cggatcatgc 420
tgtctgtgca cggctcccag cactctggca tgatcgtgaa cgacacaggc cacgagacag 480
atgagaatcg ggccaaggtg gagatcacac ctaactctcc aagagccgag gccaccctgg 540
gaggatttgg ctctctgggc ctggactgcg agcctagaac aggcctggac ttctccgatc 600
tgtactatct gaccatgaac aataagcact ggctggtgca caaggagtgg tttcacgaca 660
tcccactgcc atggcacgca ggagcagata caggaacacc acactggaac aataaggagg 720
ccctggtgga gttcaaggat gcccacgcca agcggcagac agtggtggtg ctgggcagcc 780
aggagggagc agtgcacacc gccctggcag gcgccctgga ggcagagatg gacggagcta 840
agggcagact gtctagcggc cacctgaagt gcaggctgaa gatggataag ctgcgcctga 900
agggcgtgtc ctactctctg tgcacagccg ccttcacctt caccaagatc cctgccgaga 960
cactgcacgg cacagtgacc gtggaggtgc agtatgccgg cacagacgga ccctgtaagg 1020
tgcctgccca gatggccgtg gatatgcaga cactgacacc tgtgggcagg ctgatcaccg 1080
ccaatccagt gatcacagag tctaccgaga acagcaagat gatgctggag ctggacccac 1140
catttggcga tagctatatc gtgatcggcg tgggcgagaa gaagatcaca caccactggc 1200
accgcagcgg ctccacaatc ggcaaggcct ttgaggcaac cgtgcgcgga gcaaagagaa 1260
tggccgtgct gggcgacacc gcatgggatt tcggatctgt gggaggcgcc ctgaacagcc 1320
tgggcaaggg catccaccag atcttcggcg ccgcctttaa gtccctgttc ggcggcatga 1380
gctggttctc acagatcctg atcggcacac tgctgatgtg gctgggcctg aacaccaaga 1440
atggctctat cagcctgatg tgcctggccc tgggaggcgt gctgatcttc ctgtccaccg 1500
ccgtgtctgc c 1511
<210> 23
<211> 504
<212> PRT
<213> Artificial sequence
<220>
<223> Full-length E protein of ZIKV
<400> 23
Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser
1 5 10 15
Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val Thr
20 25 30
Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr
35 40 45
Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser
50 55 60
Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala
65 70 75 80
Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
85 90 95
Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser
100 105 110
Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys
115 120 125
Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140
Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr
145 150 155 160
Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
165 170 175
Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro
180 185 190
Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn
195 200 205
Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
210 215 220
Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu
225 230 235 240
Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
245 250 255
Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala
260 265 270
Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His
275 280 285
Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
290 295 300
Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu
305 310 315 320
Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
325 330 335
Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu
340 345 350
Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser
355 360 365
Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
370 375 380
Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp
385 390 395 400
His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
405 410 415
Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly
420 425 430
Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile
435 440 445
Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser
450 455 460
Gln Ile Leu Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn Thr Lys
465 470 475 480
Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile
485 490 495
Phe Leu Ser Thr Ala Val Ser Ala
500
<210> 24
<211> 1368
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of E protein of ZIKV truncated at amino acid position x (Edx)
<400> 24
atcaggtgca taggagtcag caatagggac tttgtggaag gtatgtcagg tgggacttgg 60
gttgatgttg tcttggaaca tggaggttgt gtcaccgtaa tggcacagga caaaccgact 120
gtcgacatag agctggttac aacaacagtc agcaacatgg cggaggtaag atcctactgc 180
tatgaggcat caatatcaga catggcttcg gacagccgct gcccaacaca aggtgaagcc 240
taccttgaca agcaatcaga cactcaatat gtctgcaaaa gaacgttagt ggacagaggc 300
tggggaaatg gatgtggact ttttggcaaa gggagcctgg tgacatgcgc taagtttgca 360
tgctccaaga aaatgaccgg gaagagcatc cagccagaga atctggagta ccggataatg 420
ctgtcagttc atggctccca gcacagtggg atgatcgtta atgacacagg acatgaaact 480
gatgagaata gagcgaaggt tgagataacg cccaattcac caagagccga agccaccctg 540
gggggttttg gaagcctagg acttgattgt gaaccgagga caggccttga cttttcagat 600
ttgtattact tgactatgaa taacaagcac tggttggttc acaaggagtg gttccacgac 660
attccattac cttggcacgc tggggcagac accggaactc cacactggaa caacaaagaa 720
gcactggtag agttcaagga cgcacatgcc aaaaggcaaa ctgtcgtggt tctagggagt 780
caagaaggag cagttcacac ggcccttgct ggagctctgg aggctgagat ggatggtgca 840
aagggaaggc tgtcctctgg ccacttgaaa tgtcgcctga aaatggataa acttagattg 900
aagggcgtgt catactcctt gtgtaccgca gcgttcacat tcaccaagat cccggctgaa 960
acactgcacg ggacagtcac agtggaggta cagtacgcag ggacagatgg accttgcaag 1020
gttccagctc agatggcggt ggacatgcaa actctgaccc cagttgggag gttgataacc 1080
gctaaccccg taatcactga aagcactgag aactctaaga tgatgctgga acttgatcca 1140
ccatttgggg actcttacat tgtcatagga gtcggggaga agaagatcac ccaccactgg 1200
cacaggagtg gcagcaccat tggaaaagca tttgaagcca ctgtgagagg tgccaagaga 1260
atggcagtct tgggagacac agcctgggac tttggatcag ttggaggcgc tctcaactca 1320
ttgggcaagg gcatccatca aatttttgga gcagctttca aatcattg 1368
<210> 25
<211> 1367
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of E protein of ZIKV truncated at amino acid position x (Edx)
<400> 25
tccggtgcat cggcgtgagc aatagagact tcgtggaggg aatgtccgga ggaacctggg 60
tggatgtggt gctggagcac ggcggctgcg tgacagtgat ggcccaggac aagccaaccg 120
tggatatcga gctggtgacc acaaccgtgt ccaacatggc cgaggtgagg tcttactgct 180
atgaggccag catctccgac atggcctctg atagcaggtg tccaacccag ggagaggcat 240
acctggacaa gcagtccgat acacagtacg tgtgcaagcg gaccctggtg gacagaggct 300
ggggcaatgg ctgtggcctg tttggcaagg gctctctggt gacatgcgcc aagttcgcct 360
gtagcaagaa gatgaccggc aagtccatcc agccagagaa cctggagtac cggatcatgc 420
tgtctgtgca cggctcccag cactctggca tgatcgtgaa cgacacaggc cacgagacag 480
atgagaatcg ggccaaggtg gagatcacac ctaactctcc aagagccgag gccaccctgg 540
gaggatttgg ctctctgggc ctggactgcg agcctagaac aggcctggac ttctccgatc 600
tgtactatct gaccatgaac aataagcact ggctggtgca caaggagtgg tttcacgaca 660
tcccactgcc atggcacgca ggagcagata caggaacacc acactggaac aataaggagg 720
ccctggtgga gttcaaggat gcccacgcca agcggcagac agtggtggtg ctgggcagcc 780
aggagggagc agtgcacacc gccctggcag gcgccctgga ggcagagatg gacggagcta 840
agggcagact gtctagcggc cacctgaagt gcaggctgaa gatggataag ctgcgcctga 900
agggcgtgtc ctactctctg tgcacagccg ccttcacctt caccaagatc cctgccgaga 960
cactgcacgg cacagtgacc gtggaggtgc agtatgccgg cacagacgga ccctgtaagg 1020
tgcctgccca gatggccgtg gatatgcaga cactgacacc tgtgggcagg ctgatcaccg 1080
ccaatccagt gatcacagag tctaccgaga acagcaagat gatgctggag ctggacccac 1140
catttggcga tagctatatc gtgatcggcg tgggcgagaa gaagatcaca caccactggc 1200
accgcagcgg ctccacaatc ggcaaggcct ttgaggcaac cgtgcgcgga gcaaagagaa 1260
tggccgtgct gggcgacacc gcatgggatt tcggatctgt gggaggcgcc ctgaacagcc 1320
tgggcaaggg catccaccag atcttcggcg ccgcctttaa gtccctg 1367
<210> 26
<211> 456
<212> PRT
<213> Artificial sequence
<220>
<223> E protein of ZIKV truncated at amino acid position x (Edx)
<400> 26
Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser
1 5 10 15
Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val Thr
20 25 30
Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr
35 40 45
Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser
50 55 60
Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala
65 70 75 80
Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
85 90 95
Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser
100 105 110
Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys
115 120 125
Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140
Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr
145 150 155 160
Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
165 170 175
Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro
180 185 190
Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn
195 200 205
Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
210 215 220
Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu
225 230 235 240
Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
245 250 255
Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala
260 265 270
Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His
275 280 285
Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
290 295 300
Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu
305 310 315 320
Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
325 330 335
Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu
340 345 350
Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser
355 360 365
Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
370 375 380
Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp
385 390 395 400
His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
405 410 415
Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly
420 425 430
Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile
435 440 445
Phe Gly Ala Ala Phe Lys Ser Leu
450 455
<210> 27
<211> 1335
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of E protein of ZIKV truncated at amino acid position 411 (Ed411)
<400> 27
atcaggtgca taggagtcag caatagggac tttgtggaag gtatgtcagg tgggacttgg 60
gttgatgttg tcttggaaca tggaggttgt gtcaccgtaa tggcacagga caaaccgact 120
gtcgacatag agctggttac aacaacagtc agcaacatgg cggaggtaag atcctactgc 180
tatgaggcat caatatcaga catggcttcg gacagccgct gcccaacaca aggtgaagcc 240
taccttgaca agcaatcaga cactcaatat gtctgcaaaa gaacgttagt ggacagaggc 300
tggggaaatg gatgtggact ttttggcaaa gggagcctgg tgacatgcgc taagtttgca 360
tgctccaaga aaatgaccgg gaagagcatc cagccagaga atctggagta ccggataatg 420
ctgtcagttc atggctccca gcacagtggg atgatcgtta atgacacagg acatgaaact 480
gatgagaata gagcgaaggt tgagataacg cccaattcac caagagccga agccaccctg 540
gggggttttg gaagcctagg acttgattgt gaaccgagga caggccttga cttttcagat 600
ttgtattact tgactatgaa taacaagcac tggttggttc acaaggagtg gttccacgac 660
attccattac cttggcacgc tggggcagac accggaactc cacactggaa caacaaagaa 720
gcactggtag agttcaagga cgcacatgcc aaaaggcaaa ctgtcgtggt tctagggagt 780
caagaaggag cagttcacac ggcccttgct ggagctctgg aggctgagat ggatggtgca 840
aagggaaggc tgtcctctgg ccacttgaaa tgtcgcctga aaatggataa acttagattg 900
aagggcgtgt catactcctt gtgtaccgca gcgttcacat tcaccaagat cccggctgaa 960
acactgcacg ggacagtcac agtggaggta cagtacgcag ggacagatgg accttgcaag 1020
gttccagctc agatggcggt ggacatgcaa actctgaccc cagttgggag gttgataacc 1080
gctaaccccg taatcactga aagcactgag aactctaaga tgatgctgga acttgatcca 1140
ccatttgggg actcttacat tgtcatagga gtcggggaga agaagatcac ccaccactgg 1200
cacaggagtg gcagcaccat tggaaaagca tttgaagcca ctgtgagagg tgccaagaga 1260
atggcagtct tgggagacac agcctgggac tttggatcag ttggaggcgc tctcaactca 1320
ttgggcaagg gcatc 1335
<210> 28
<211> 1334
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of E protein of ZIKV truncated at amino acid position 411 (Ed411)
<400> 28
tccggtgcat cggcgtgagc aatagagact tcgtggaggg aatgtccgga ggaacctggg 60
tggatgtggt gctggagcac ggcggctgcg tgacagtgat ggcccaggac aagccaaccg 120
tggatatcga gctggtgacc acaaccgtgt ccaacatggc cgaggtgagg tcttactgct 180
atgaggccag catctccgac atggcctctg atagcaggtg tccaacccag ggagaggcat 240
acctggacaa gcagtccgat acacagtacg tgtgcaagcg gaccctggtg gacagaggct 300
ggggcaatgg ctgtggcctg tttggcaagg gctctctggt gacatgcgcc aagttcgcct 360
gtagcaagaa gatgaccggc aagtccatcc agccagagaa cctggagtac cggatcatgc 420
tgtctgtgca cggctcccag cactctggca tgatcgtgaa cgacacaggc cacgagacag 480
atgagaatcg ggccaaggtg gagatcacac ctaactctcc aagagccgag gccaccctgg 540
gaggatttgg ctctctgggc ctggactgcg agcctagaac aggcctggac ttctccgatc 600
tgtactatct gaccatgaac aataagcact ggctggtgca caaggagtgg tttcacgaca 660
tcccactgcc atggcacgca ggagcagata caggaacacc acactggaac aataaggagg 720
ccctggtgga gttcaaggat gcccacgcca agcggcagac agtggtggtg ctgggcagcc 780
aggagggagc agtgcacacc gccctggcag gcgccctgga ggcagagatg gacggagcta 840
agggcagact gtctagcggc cacctgaagt gcaggctgaa gatggataag ctgcgcctga 900
agggcgtgtc ctactctctg tgcacagccg ccttcacctt caccaagatc cctgccgaga 960
cactgcacgg cacagtgacc gtggaggtgc agtatgccgg cacagacgga ccctgtaagg 1020
tgcctgccca gatggccgtg gatatgcaga cactgacacc tgtgggcagg ctgatcaccg 1080
ccaatccagt gatcacagag tctaccgaga acagcaagat gatgctggag ctggacccac 1140
catttggcga tagctatatc gtgatcggcg tgggcgagaa gaagatcaca caccactggc 1200
accgcagcgg ctccacaatc ggcaaggcct ttgaggcaac cgtgcgcgga gcaaagagaa 1260
tggccgtgct gggcgacacc gcatgggatt tcggatctgt gggaggcgcc ctgaacagcc 1320
tgggcaaggg catc 1334
<210> 29
<211> 445
<212> PRT
<213> Artificial sequence
<220>
<223> E protein of ZIKV truncated at amino acid position 411 (Ed411)
<400> 29
Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser
1 5 10 15
Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val Thr
20 25 30
Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr
35 40 45
Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser
50 55 60
Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala
65 70 75 80
Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
85 90 95
Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser
100 105 110
Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys
115 120 125
Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140
Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr
145 150 155 160
Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
165 170 175
Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro
180 185 190
Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn
195 200 205
Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
210 215 220
Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu
225 230 235 240
Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
245 250 255
Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala
260 265 270
Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His
275 280 285
Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
290 295 300
Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu
305 310 315 320
Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
325 330 335
Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu
340 345 350
Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser
355 360 365
Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
370 375 380
Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp
385 390 395 400
His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
405 410 415
Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly
420 425 430
Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile
435 440 445
<210> 30
<211> 1212
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of E protein of ZIKV truncated at amino acid position 395 (Ed395)
<400> 30
atcaggtgca taggagtcag caatagggac tttgtggaag gtatgtcagg tgggacttgg 60
gttgatgttg tcttggaaca tggaggttgt gtcaccgtaa tggcacagga caaaccgact 120
gtcgacatag agctggttac aacaacagtc agcaacatgg cggaggtaag atcctactgc 180
tatgaggcat caatatcaga catggcttcg gacagccgct gcccaacaca aggtgaagcc 240
taccttgaca agcaatcaga cactcaatat gtctgcaaaa gaacgttagt ggacagaggc 300
tggggaaatg gatgtggact ttttggcaaa gggagcctgg tgacatgcgc taagtttgca 360
tgctccaaga aaatgaccgg gaagagcatc cagccagaga atctggagta ccggataatg 420
ctgtcagttc atggctccca gcacagtggg atgatcgtta atgacacagg acatgaaact 480
gatgagaata gagcgaaggt tgagataacg cccaattcac caagagccga agccaccctg 540
gggggttttg gaagcctagg acttgattgt gaaccgagga caggccttga cttttcagat 600
ttgtattact tgactatgaa taacaagcac tggttggttc acaaggagtg gttccacgac 660
attccattac cttggcacgc tggggcagac accggaactc cacactggaa caacaaagaa 720
gcactggtag agttcaagga cgcacatgcc aaaaggcaaa ctgtcgtggt tctagggagt 780
caagaaggag cagttcacac ggcccttgct ggagctctgg aggctgagat ggatggtgca 840
aagggaaggc tgtcctctgg ccacttgaaa tgtcgcctga aaatggataa acttagattg 900
aagggcgtgt catactcctt gtgtaccgca gcgttcacat tcaccaagat cccggctgaa 960
acactgcacg ggacagtcac agtggaggta cagtacgcag ggacagatgg accttgcaag 1020
gttccagctc agatggcggt ggacatgcaa actctgaccc cagttgggag gttgataacc 1080
gctaaccccg taatcactga aagcactgag aactctaaga tgatgctgga acttgatcca 1140
ccatttgggg actcttacat tgtcatagga gtcggggaga agaagatcac ccaccactgg 1200
cacaggagtg gc 1212
<210> 31
<211> 1211
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of E protein of ZIKV truncated at amino acid position 395 (Ed395)
<400> 31
tccggtgcat cggcgtgagc aatagagact tcgtggaggg aatgtccgga ggaacctggg 60
tggatgtggt gctggagcac ggcggctgcg tgacagtgat ggcccaggac aagccaaccg 120
tggatatcga gctggtgacc acaaccgtgt ccaacatggc cgaggtgagg tcttactgct 180
atgaggccag catctccgac atggcctctg atagcaggtg tccaacccag ggagaggcat 240
acctggacaa gcagtccgat acacagtacg tgtgcaagcg gaccctggtg gacagaggct 300
ggggcaatgg ctgtggcctg tttggcaagg gctctctggt gacatgcgcc aagttcgcct 360
gtagcaagaa gatgaccggc aagtccatcc agccagagaa cctggagtac cggatcatgc 420
tgtctgtgca cggctcccag cactctggca tgatcgtgaa cgacacaggc cacgagacag 480
atgagaatcg ggccaaggtg gagatcacac ctaactctcc aagagccgag gccaccctgg 540
gaggatttgg ctctctgggc ctggactgcg agcctagaac aggcctggac ttctccgatc 600
tgtactatct gaccatgaac aataagcact ggctggtgca caaggagtgg tttcacgaca 660
tcccactgcc atggcacgca ggagcagata caggaacacc acactggaac aataaggagg 720
ccctggtgga gttcaaggat gcccacgcca agcggcagac agtggtggtg ctgggcagcc 780
aggagggagc agtgcacacc gccctggcag gcgccctgga ggcagagatg gacggagcta 840
agggcagact gtctagcggc cacctgaagt gcaggctgaa gatggataag ctgcgcctga 900
agggcgtgtc ctactctctg tgcacagccg ccttcacctt caccaagatc cctgccgaga 960
cactgcacgg cacagtgacc gtggaggtgc agtatgccgg cacagacgga ccctgtaagg 1020
tgcctgccca gatggccgtg gatatgcaga cactgacacc tgtgggcagg ctgatcaccg 1080
ccaatccagt gatcacagag tctaccgaga acagcaagat gatgctggag ctggacccac 1140
catttggcga tagctatatc gtgatcggcg tgggcgagaa gaagatcaca caccactggc 1200
accgcagcgg c 1211
<210> 32
<211> 404
<212> PRT
<213> Artificial sequence
<220>
<223> E protein of ZIKV truncated at amino acid position 395 (Ed395)
<400> 32
Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser
1 5 10 15
Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val Thr
20 25 30
Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr
35 40 45
Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser
50 55 60
Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala
65 70 75 80
Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
85 90 95
Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser
100 105 110
Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys
115 120 125
Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140
Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr
145 150 155 160
Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
165 170 175
Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro
180 185 190
Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn
195 200 205
Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
210 215 220
Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu
225 230 235 240
Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
245 250 255
Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala
260 265 270
Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His
275 280 285
Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
290 295 300
Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu
305 310 315 320
Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
325 330 335
Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu
340 345 350
Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser
355 360 365
Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
370 375 380
Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp
385 390 395 400
His Arg Ser Gly
<210> 33
<211> 156
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of E stem region of ZIKV
<400> 33
agcaccattg gaaaagcatt tgaagccact gtgagaggtg ccaagagaat ggcagtcttg 60
ggagacacag cctgggactt tggatcagtt ggaggcgctc tcaactcatt gggcaagggc 120
atccatcaaa tttttggagc agctttcaaa tcattg 156
<210> 34
<211> 156
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of E stem region of ZIKV
<400> 34
tccacaatcg gcaaggcctt tgaggcaacc gtgcgcggag caaagagaat ggccgtgctg 60
ggcgacaccg catgggattt cggatctgtg ggaggcgccc tgaacagcct gggcaagggc 120
atccaccaga tcttcggcgc cgcctttaag tccctg 156
<210> 35
<211> 52
<212> PRT
<213> Artificial sequence
<220>
<223> E stem region of ZIKV
<400> 35
Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg Gly Ala Lys Arg
1 5 10 15
Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly Ser Val Gly Gly
20 25 30
Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile Phe Gly Ala Ala
35 40 45
Phe Lys Ser Leu
50
<210> 36
<211> 24
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of intermediate domain between E stem and E anchor regions of ZIKV
<400> 36
tttggaggaa tgtcctggtt ctca 24
<210> 37
<211> 34
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of intermediate domain between E stem and E anchor regions of ZIKV
<400> 37
ttcggcggca tgagctggtt ctcacagatc ctga 34
<210> 38
<211> 8
<212> PRT
<213> Artificial sequence
<220>
<223> Intermediate domain between E stem and E anchor regions of ZIKV
<400> 38
Phe Gly Gly Met Ser Trp Phe Ser
1 5
<210> 39
<211> 120
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of E anchor region of ZIKV
<400> 39
caaattctca ttggaacgtt gctgatgtgg ttgggtctga acacaaagaa tggatctatt 60
tcccttatgt gcttggcctt agggggagtg ttgatcttct tatccacagc cgtctctgct 120
<210> 40
<211> 110
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of E anchor region of ZIKV
<400> 40
tcggcacact gctgatgtgg ctgggcctga acaccaagaa tggctctatc agcctgatgt 60
gcctggccct gggaggcgtg ctgatcttcc tgtccaccgc cgtgtctgcc 110
<210> 41
<211> 40
<212> PRT
<213> Artificial sequence
<220>
<223> E anchor region of ZIKV
<400> 41
Gln Ile Leu Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn Thr Lys
1 5 10 15
Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile
20 25 30
Phe Leu Ser Thr Ala Val Ser Ala
35 40
<210> 42
<211> 192
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of transmembrane (TM) and intracytoplasmic tail of MV F protein
<400> 42
atgaaaggtt tatcgagcac tagcatagtc tacatcctga ttgcagtgtg tcttggaggg 60
ttgataggga tccccgcttt aatatgttgc tgcagggggc gttgtaacaa aaagggagaa 120
caagttggta tgtcaagacc aggcctaaag cctgatctta cgggaacatc aaaatcctat 180
gtaaggtcgc tc 192
<210> 43
<211> 192
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of transmembrane (TM) and intracytoplasmic tail of MV F protein
<400> 43
atgaagggcc tgtcctctac ctctatcgtg tacatcctga tcgccgtgtg cctgggaggc 60
ctgatcggaa tcccagccct gatctgctgt tgcagaggcc gctgcaacaa gaagggagag 120
caagtgggaa tgtctcggcc aggcctgaag ccagacctga caggcacctc caagtcttat 180
gtgagaagcc tg 192
<210> 44
<211> 64
<212> PRT
<213> Artificial sequence
<220>
<223> Transmembrane (TM) and intracytoplasmic tail of MV F protein
<400> 44
Met Lys Gly Leu Ser Ser Thr Ser Ile Val Tyr Ile Leu Ile Ala Val
1 5 10 15
Cys Leu Gly Gly Leu Ile Gly Ile Pro Ala Leu Ile Cys Cys Cys Arg
20 25 30
Gly Arg Cys Asn Lys Lys Gly Glu Gln Val Gly Met Ser Arg Pro Gly
35 40 45
Leu Lys Pro Asp Leu Thr Gly Thr Ser Lys Ser Tyr Val Arg Ser Leu
50 55 60
<210> 45
<211> 2094
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of Zikasp_ZikaprME protein (A1)
<400> 45
atggagaaga agagacgagg cgcagatact agtgtcggaa ttgttggcct cctgctgacc 60
acagctatgg cagcggaggt cactagacgt gggagtgcat actatatgta cttggacaga 120
aacgatgctg gggaggccat atcttttcca accacattgg ggatgaataa gtgttatata 180
cagatcatgg atcttggaca catgtgtgat gccaccatga gctatgaatg ccctatgctg 240
gatgaggggg tggaaccaga tgacgtcgat tgttggtgca acacgacgtc aacttgggtt 300
gtgtacggaa cctgccatca caaaaaaggt gaagcacgga gatctagaag agctgtgacg 360
ctcccctccc attccactag gaagctgcaa acgcggtcgc aaacctggtt ggaatcaaga 420
gaatacacaa agcacttgat tagagtcgaa aattggatat tcaggaaccc tggcttcgcg 480
ttagcagcag ctgccatcgc ttggcttttg ggaagctcaa cgagccaaaa agtcatatac 540
ttggtcatga tactgctgat tgccccggca tacagcatca ggtgcatagg agtcagcaat 600
agggactttg tggaaggtat gtcaggtggg acttgggttg atgttgtctt ggaacatgga 660
ggttgtgtca ccgtaatggc acaggacaaa ccgactgtcg acatagagct ggttacaaca 720
acagtcagca acatggcgga ggtaagatcc tactgctatg aggcatcaat atcagacatg 780
gcttcggaca gccgctgccc aacacaaggt gaagcctacc ttgacaagca atcagacact 840
caatatgtct gcaaaagaac gttagtggac agaggctggg gaaatggatg tggacttttt 900
ggcaaaggga gcctggtgac atgcgctaag tttgcatgct ccaagaaaat gaccgggaag 960
agcatccagc cagagaatct ggagtaccgg ataatgctgt cagttcatgg ctcccagcac 1020
agtgggatga tcgttaatga cacaggacat gaaactgatg agaatagagc gaaggttgag 1080
ataacgccca attcaccaag agccgaagcc accctggggg gttttggaag cctaggactt 1140
gattgtgaac cgaggacagg ccttgacttt tcagatttgt attacttgac tatgaataac 1200
aagcactggt tggttcacaa ggagtggttc cacgacattc cattaccttg gcacgctggg 1260
gcagacaccg gaactccaca ctggaacaac aaagaagcac tggtagagtt caaggacgca 1320
catgccaaaa ggcaaactgt cgtggttcta gggagtcaag aaggagcagt tcacacggcc 1380
cttgctggag ctctggaggc tgagatggat ggtgcaaagg gaaggctgtc ctctggccac 1440
ttgaaatgtc gcctgaaaat ggataaactt agattgaagg gcgtgtcata ctccttgtgt 1500
accgcagcgt tcacattcac caagatcccg gctgaaacac tgcacgggac agtcacagtg 1560
gaggtacagt acgcagggac agatggacct tgcaaggttc cagctcagat ggcggtggac 1620
atgcaaactc tgaccccagt tgggaggttg ataaccgcta accccgtaat cactgaaagc 1680
actgagaact ctaagatgat gctggaactt gatccaccat ttggggactc ttacattgtc 1740
ataggagtcg gggagaagaa gatcacccac cactggcaca ggagtggcag caccattgga 1800
aaagcatttg aagccactgt gagaggtgcc aagagaatgg cagtcttggg agacacagcc 1860
tgggactttg gatcagttgg aggcgctctc aactcattgg gcaagggcat ccatcaaatt 1920
tttggagcag ctttcaaatc attgtttgga ggaatgtcct ggttctcaca aattctcatt 1980
ggaacgttgc tgatgtggtt gggtctgaac acaaagaatg gatctatttc ccttatgtgc 2040
ttggccttag ggggagtgtt gatcttctta tccacagccg tctctgcttg atga 2094
<210> 46
<211> 2094
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of Zikasp_ZikaprME protein (A1)
<400> 46
atggagaaga agcggagagg agcagacaca agcgtgggaa tcgtgggcct gctgctgacc 60
acagcaatgg cagcagaggt gaccaggaga ggaagcgcct actatatgta cctggacagg 120
aatgatgccg gcgaggccat ctccttccca accacactgg gcatgaacaa gtgctacatc 180
cagatcatgg acctgggcca catgtgcgat gccaccatgt cctatgagtg tccaatgctg 240
gacgagggcg tggagcccga cgatgtggat tgctggtgta ataccacatc tacatgggtg 300
gtgtacggca cctgtcacca caagaaggga gaggcccggc ggagccggcg ggccgtgaca 360
ctgccttccc actctaccag gaagctgcag acacgcagcc agacctggct ggagtccaga 420
gagtatacca agcacctgat cagggtggag aactggatct ttcgcaatcc aggattcgca 480
ctggcagcag cagcaatcgc atggctgctg ggaagctcca ccagccagaa agtgatctac 540
ctggtcatga tcctgctgat cgctcctgcc tattctatcc ggtgcatcgg cgtgagcaat 600
agagacttcg tggagggaat gtccggagga acctgggtgg atgtggtgct ggagcacggc 660
ggctgcgtga cagtgatggc ccaggacaag ccaaccgtgg atatcgagct ggtgaccaca 720
accgtgtcca acatggccga ggtgaggtct tactgctatg aggccagcat ctccgacatg 780
gcctctgata gcaggtgtcc aacccaggga gaggcatacc tggacaagca gtccgataca 840
cagtacgtgt gcaagcggac cctggtggac agaggctggg gcaatggctg tggcctgttt 900
ggcaagggct ctctggtgac atgcgccaag ttcgcctgta gcaagaagat gaccggcaag 960
tccatccagc cagagaacct ggagtaccgg atcatgctgt ctgtgcacgg ctcccagcac 1020
tctggcatga tcgtgaacga cacaggccac gagacagatg agaatcgggc caaggtggag 1080
atcacaccta actctccaag agccgaggcc accctgggag gatttggctc tctgggcctg 1140
gactgcgagc ctagaacagg cctggacttc tccgatctgt actatctgac catgaacaat 1200
aagcactggc tggtgcacaa ggagtggttt cacgacatcc cactgccatg gcacgcagga 1260
gcagatacag gaacaccaca ctggaacaat aaggaggccc tggtggagtt caaggatgcc 1320
cacgccaagc ggcagacagt ggtggtgctg ggcagccagg agggagcagt gcacaccgcc 1380
ctggcaggcg ccctggaggc agagatggac ggagctaagg gcagactgtc tagcggccac 1440
ctgaagtgca ggctgaagat ggataagctg cgcctgaagg gcgtgtccta ctctctgtgc 1500
acagccgcct tcaccttcac caagatccct gccgagacac tgcacggcac agtgaccgtg 1560
gaggtgcagt atgccggcac agacggaccc tgtaaggtgc ctgcccagat ggccgtggat 1620
atgcagacac tgacacctgt gggcaggctg atcaccgcca atccagtgat cacagagtct 1680
accgagaaca gcaagatgat gctggagctg gacccaccat ttggcgatag ctatatcgtg 1740
atcggcgtgg gcgagaagaa gatcacacac cactggcacc gcagcggctc cacaatcggc 1800
aaggcctttg aggcaaccgt gcgcggagca aagagaatgg ccgtgctggg cgacaccgca 1860
tgggatttcg gatctgtggg aggcgccctg aacagcctgg gcaagggcat ccaccagatc 1920
ttcggcgccg cctttaagtc cctgttcggc ggcatgagct ggttctcaca gatcctgatc 1980
ggcacactgc tgatgtggct gggcctgaac accaagaatg gctctatcag cctgatgtgc 2040
ctggccctgg gaggcgtgct gatcttcctg tccaccgccg tgtctgcctg atga 2094
<210> 47
<211> 696
<212> PRT
<213> Artificial sequence
<220>
<223> Zikasp_ZikaprME protein (A1)
<400> 47
Met Glu Lys Lys Arg Arg Gly Ala Asp Thr Ser Val Gly Ile Val Gly
1 5 10 15
Leu Leu Leu Thr Thr Ala Met Ala Ala Glu Val Thr Arg Arg Gly Ser
20 25 30
Ala Tyr Tyr Met Tyr Leu Asp Arg Asn Asp Ala Gly Glu Ala Ile Ser
35 40 45
Phe Pro Thr Thr Leu Gly Met Asn Lys Cys Tyr Ile Gln Ile Met Asp
50 55 60
Leu Gly His Met Cys Asp Ala Thr Met Ser Tyr Glu Cys Pro Met Leu
65 70 75 80
Asp Glu Gly Val Glu Pro Asp Asp Val Asp Cys Trp Cys Asn Thr Thr
85 90 95
Ser Thr Trp Val Val Tyr Gly Thr Cys His His Lys Lys Gly Glu Ala
100 105 110
Arg Arg Ser Arg Arg Ala Val Thr Leu Pro Ser His Ser Thr Arg Lys
115 120 125
Leu Gln Thr Arg Ser Gln Thr Trp Leu Glu Ser Arg Glu Tyr Thr Lys
130 135 140
His Leu Ile Arg Val Glu Asn Trp Ile Phe Arg Asn Pro Gly Phe Ala
145 150 155 160
Leu Ala Ala Ala Ala Ile Ala Trp Leu Leu Gly Ser Ser Thr Ser Gln
165 170 175
Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile Ala Pro Ala Tyr Ser
180 185 190
Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser
195 200 205
Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val Thr
210 215 220
Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr
225 230 235 240
Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser
245 250 255
Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala
260 265 270
Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
275 280 285
Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser
290 295 300
Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys
305 310 315 320
Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
325 330 335
Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr
340 345 350
Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
355 360 365
Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro
370 375 380
Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn
385 390 395 400
Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
405 410 415
Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu
420 425 430
Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
435 440 445
Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala
450 455 460
Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His
465 470 475 480
Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
485 490 495
Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu
500 505 510
Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
515 520 525
Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu
530 535 540
Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser
545 550 555 560
Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
565 570 575
Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp
580 585 590
His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
595 600 605
Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly
610 615 620
Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile
625 630 635 640
Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser
645 650 655
Gln Ile Leu Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn Thr Lys
660 665 670
Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile
675 680 685
Phe Leu Ser Thr Ala Val Ser Ala
690 695
<210> 48
<211> 1950
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of Zikasp_Zika_prME_no_Anchor protein (A2)
<400> 48
atggagaaga agagacgagg cgcagatact agtgtcggaa ttgttggcct cctgctgacc 60
acagctatgg cagcggaggt cactagacgt gggagtgcat actatatgta cttggacaga 120
aacgatgctg gggaggccat atcttttcca accacattgg ggatgaataa gtgttatata 180
cagatcatgg atcttggaca catgtgtgat gccaccatga gctatgaatg ccctatgctg 240
gatgaggggg tggaaccaga tgacgtcgat tgttggtgca acacgacgtc aacttgggtt 300
gtgtacggaa cctgccatca caaaaaaggt gaagcacgga gatctagaag agctgtgacg 360
ctcccctccc attccactag gaagctgcaa acgcggtcgc aaacctggtt ggaatcaaga 420
gaatacacaa agcacttgat tagagtcgaa aattggatat tcaggaaccc tggcttcgcg 480
ttagcagcag ctgccatcgc ttggcttttg ggaagctcaa cgagccaaaa agtcatatac 540
ttggtcatga tactgctgat tgccccggca tacagcatca ggtgcatagg agtcagcaat 600
agggactttg tggaaggtat gtcaggtggg acttgggttg atgttgtctt ggaacatgga 660
ggttgtgtca ccgtaatggc acaggacaaa ccgactgtcg acatagagct ggttacaaca 720
acagtcagca acatggcgga ggtaagatcc tactgctatg aggcatcaat atcagacatg 780
gcttcggaca gccgctgccc aacacaaggt gaagcctacc ttgacaagca atcagacact 840
caatatgtct gcaaaagaac gttagtggac agaggctggg gaaatggatg tggacttttt 900
ggcaaaggga gcctggtgac atgcgctaag tttgcatgct ccaagaaaat gaccgggaag 960
agcatccagc cagagaatct ggagtaccgg ataatgctgt cagttcatgg ctcccagcac 1020
agtgggatga tcgttaatga cacaggacat gaaactgatg agaatagagc gaaggttgag 1080
ataacgccca attcaccaag agccgaagcc accctggggg gttttggaag cctaggactt 1140
gattgtgaac cgaggacagg ccttgacttt tcagatttgt attacttgac tatgaataac 1200
aagcactggt tggttcacaa ggagtggttc cacgacattc cattaccttg gcacgctggg 1260
gcagacaccg gaactccaca ctggaacaac aaagaagcac tggtagagtt caaggacgca 1320
catgccaaaa ggcaaactgt cgtggttcta gggagtcaag aaggagcagt tcacacggcc 1380
cttgctggag ctctggaggc tgagatggat ggtgcaaagg gaaggctgtc ctctggccac 1440
ttgaaatgtc gcctgaaaat ggataaactt agattgaagg gcgtgtcata ctccttgtgt 1500
accgcagcgt tcacattcac caagatcccg gctgaaacac tgcacgggac agtcacagtg 1560
gaggtacagt acgcagggac agatggacct tgcaaggttc cagctcagat ggcggtggac 1620
atgcaaactc tgaccccagt tgggaggttg ataaccgcta accccgtaat cactgaaagc 1680
actgagaact ctaagatgat gctggaactt gatccaccat ttggggactc ttacattgtc 1740
ataggagtcg gggagaagaa gatcacccac cactggcaca ggagtggcag caccattgga 1800
aaagcatttg aagccactgt gagaggtgcc aagagaatgg cagtcttggg agacacagcc 1860
tgggactttg gatcagttgg aggcgctctc aactcattgg gcaagggcat ccatcaaatt 1920
tttggagcag ctttcaaatc attgtgatga 1950
<210> 49
<211> 1950
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of Zikasp_Zika_prME_no_Anchor protein (A2)
<400> 49
atggagaaga agcggagagg agcagacaca agcgtgggaa tcgtgggcct gctgctgacc 60
acagcaatgg cagcagaggt gaccaggaga ggaagcgcct actatatgta cctggacagg 120
aatgatgccg gcgaggccat ctccttccca accacactgg gcatgaacaa gtgctacatc 180
cagatcatgg acctgggcca catgtgcgat gccaccatgt cctatgagtg tccaatgctg 240
gacgagggcg tggagcccga cgatgtggat tgctggtgta ataccacatc tacatgggtg 300
gtgtacggca cctgtcacca caagaaggga gaggcccggc ggagccggcg ggccgtgaca 360
ctgccttccc actctaccag gaagctgcag acacgcagcc agacctggct ggagtccaga 420
gagtatacca agcacctgat cagggtggag aactggatct ttcgcaatcc aggattcgca 480
ctggcagcag cagcaatcgc atggctgctg ggaagctcca ccagccagaa agtgatctac 540
ctggtcatga tcctgctgat cgctcctgcc tattctatcc ggtgcatcgg cgtgagcaat 600
agagacttcg tggagggaat gtccggagga acctgggtgg atgtggtgct ggagcacggc 660
ggctgcgtga cagtgatggc ccaggacaag ccaaccgtgg atatcgagct ggtgaccaca 720
accgtgtcca acatggccga ggtgaggtct tactgctatg aggccagcat ctccgacatg 780
gcctctgata gcaggtgtcc aacccaggga gaggcatacc tggacaagca gtccgataca 840
cagtacgtgt gcaagcggac cctggtggac agaggctggg gcaatggctg tggcctgttt 900
ggcaagggct ctctggtgac atgcgccaag ttcgcctgta gcaagaagat gaccggcaag 960
tccatccagc cagagaacct ggagtaccgg atcatgctgt ctgtgcacgg ctcccagcac 1020
tctggcatga tcgtgaacga cacaggccac gagacagatg agaatcgggc caaggtggag 1080
atcacaccta actctccaag agccgaggcc accctgggag gatttggctc tctgggcctg 1140
gactgcgagc ctagaacagg cctggacttc tccgatctgt actatctgac catgaacaat 1200
aagcactggc tggtgcacaa ggagtggttt cacgacatcc cactgccatg gcacgcagga 1260
gcagatacag gaacaccaca ctggaacaat aaggaggccc tggtggagtt caaggatgcc 1320
cacgccaagc ggcagacagt ggtggtgctg ggcagccagg agggagcagt gcacaccgcc 1380
ctggcaggcg ccctggaggc agagatggac ggagctaagg gcagactgtc tagcggccac 1440
ctgaagtgca ggctgaagat ggataagctg cgcctgaagg gcgtgtccta ctctctgtgc 1500
acagccgcct tcaccttcac caagatccct gccgagacac tgcacggcac agtgaccgtg 1560
gaggtgcagt atgccggcac agacggaccc tgtaaggtgc ctgcccagat ggccgtggat 1620
atgcagacac tgacacctgt gggcaggctg atcaccgcca atccagtgat cacagagtct 1680
accgagaaca gcaagatgat gctggagctg gacccaccat ttggcgatag ctatatcgtg 1740
atcggcgtgg gcgagaagaa gatcacacac cactggcacc gcagcggctc cacaatcggc 1800
aaggcctttg aggcaaccgt gcgcggagca aagagaatgg ccgtgctggg cgacaccgca 1860
tgggatttcg gatctgtggg aggcgccctg aacagcctgg gcaagggcat ccaccagatc 1920
ttcggcgccg cctttaagtc cctgtgatga 1950
<210> 50
<211> 648
<212> PRT
<213> Artificial sequence
<220>
<223> Zikasp_Zika_prME_no_Anchor protein (A2)
<400> 50
Met Glu Lys Lys Arg Arg Gly Ala Asp Thr Ser Val Gly Ile Val Gly
1 5 10 15
Leu Leu Leu Thr Thr Ala Met Ala Ala Glu Val Thr Arg Arg Gly Ser
20 25 30
Ala Tyr Tyr Met Tyr Leu Asp Arg Asn Asp Ala Gly Glu Ala Ile Ser
35 40 45
Phe Pro Thr Thr Leu Gly Met Asn Lys Cys Tyr Ile Gln Ile Met Asp
50 55 60
Leu Gly His Met Cys Asp Ala Thr Met Ser Tyr Glu Cys Pro Met Leu
65 70 75 80
Asp Glu Gly Val Glu Pro Asp Asp Val Asp Cys Trp Cys Asn Thr Thr
85 90 95
Ser Thr Trp Val Val Tyr Gly Thr Cys His His Lys Lys Gly Glu Ala
100 105 110
Arg Arg Ser Arg Arg Ala Val Thr Leu Pro Ser His Ser Thr Arg Lys
115 120 125
Leu Gln Thr Arg Ser Gln Thr Trp Leu Glu Ser Arg Glu Tyr Thr Lys
130 135 140
His Leu Ile Arg Val Glu Asn Trp Ile Phe Arg Asn Pro Gly Phe Ala
145 150 155 160
Leu Ala Ala Ala Ala Ile Ala Trp Leu Leu Gly Ser Ser Thr Ser Gln
165 170 175
Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile Ala Pro Ala Tyr Ser
180 185 190
Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser
195 200 205
Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val Thr
210 215 220
Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr
225 230 235 240
Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser
245 250 255
Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala
260 265 270
Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
275 280 285
Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser
290 295 300
Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys
305 310 315 320
Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
325 330 335
Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr
340 345 350
Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
355 360 365
Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro
370 375 380
Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn
385 390 395 400
Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
405 410 415
Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu
420 425 430
Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
435 440 445
Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala
450 455 460
Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His
465 470 475 480
Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
485 490 495
Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu
500 505 510
Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
515 520 525
Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu
530 535 540
Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser
545 550 555 560
Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
565 570 575
Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp
580 585 590
His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
595 600 605
Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly
610 615 620
Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile
625 630 635 640
Phe Gly Ala Ala Phe Lys Ser Leu
645
<210> 51
<211> 1914
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of Zikasp_Zika_prME411 protein (A3)
<400> 51
atggagaaga agagacgagg cgcagatact agtgtcggaa ttgttggcct cctgctgacc 60
acagctatgg cagcggaggt cactagacgt gggagtgcat actatatgta cttggacaga 120
aacgatgctg gggaggccat atcttttcca accacattgg ggatgaataa gtgttatata 180
cagatcatgg atcttggaca catgtgtgat gccaccatga gctatgaatg ccctatgctg 240
gatgaggggg tggaaccaga tgacgtcgat tgttggtgca acacgacgtc aacttgggtt 300
gtgtacggaa cctgccatca caaaaaaggt gaagcacgga gatctagaag agctgtgacg 360
ctcccctccc attccactag gaagctgcaa acgcggtcgc aaacctggtt ggaatcaaga 420
gaatacacaa agcacttgat tagagtcgaa aattggatat tcaggaaccc tggcttcgcg 480
ttagcagcag ctgccatcgc ttggcttttg ggaagctcaa cgagccaaaa agtcatatac 540
ttggtcatga tactgctgat tgccccggca tacagcatca ggtgcatagg agtcagcaat 600
agggactttg tggaaggtat gtcaggtggg acttgggttg atgttgtctt ggaacatgga 660
ggttgtgtca ccgtaatggc acaggacaaa ccgactgtcg acatagagct ggttacaaca 720
acagtcagca acatggcgga ggtaagatcc tactgctatg aggcatcaat atcagacatg 780
gcttcggaca gccgctgccc aacacaaggt gaagcctacc ttgacaagca atcagacact 840
caatatgtct gcaaaagaac gttagtggac agaggctggg gaaatggatg tggacttttt 900
ggcaaaggga gcctggtgac atgcgctaag tttgcatgct ccaagaaaat gaccgggaag 960
agcatccagc cagagaatct ggagtaccgg ataatgctgt cagttcatgg ctcccagcac 1020
agtgggatga tcgttaatga cacaggacat gaaactgatg agaatagagc gaaggttgag 1080
ataacgccca attcaccaag agccgaagcc accctggggg gttttggaag cctaggactt 1140
gattgtgaac cgaggacagg ccttgacttt tcagatttgt attacttgac tatgaataac 1200
aagcactggt tggttcacaa ggagtggttc cacgacattc cattaccttg gcacgctggg 1260
gcagacaccg gaactccaca ctggaacaac aaagaagcac tggtagagtt caaggacgca 1320
catgccaaaa ggcaaactgt cgtggttcta gggagtcaag aaggagcagt tcacacggcc 1380
cttgctggag ctctggaggc tgagatggat ggtgcaaagg gaaggctgtc ctctggccac 1440
ttgaaatgtc gcctgaaaat ggataaactt agattgaagg gcgtgtcata ctccttgtgt 1500
accgcagcgt tcacattcac caagatcccg gctgaaacac tgcacgggac agtcacagtg 1560
gaggtacagt acgcagggac agatggacct tgcaaggttc cagctcagat ggcggtggac 1620
atgcaaactc tgaccccagt tgggaggttg ataaccgcta accccgtaat cactgaaagc 1680
actgagaact ctaagatgat gctggaactt gatccaccat ttggggactc ttacattgtc 1740
ataggagtcg gggagaagaa gatcacccac cactggcaca ggagtggcag caccattgga 1800
aaagcatttg aagccactgt gagaggtgcc aagagaatgg cagtcttggg agacacagcc 1860
tgggactttg gatcagttgg aggcgctctc aactcattgg gcaagggcat ctga 1914
<210> 52
<211> 1914
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of Zikasp_Zika_prME411 protein (A3)
<400> 52
atggagaaga agcggagagg agcagacaca agcgtgggaa tcgtgggcct gctgctgacc 60
acagcaatgg cagcagaggt gaccaggaga ggaagcgcct actatatgta cctggacagg 120
aatgatgccg gcgaggccat ctccttccca accacactgg gcatgaacaa gtgctacatc 180
cagatcatgg acctgggcca catgtgcgat gccaccatgt cctatgagtg tccaatgctg 240
gacgagggcg tggagcccga cgatgtggat tgctggtgta ataccacatc tacatgggtg 300
gtgtacggca cctgtcacca caagaaggga gaggcccggc ggagccggcg ggccgtgaca 360
ctgccttccc actctaccag gaagctgcag acacgcagcc agacctggct ggagtccaga 420
gagtatacca agcacctgat cagggtggag aactggatct ttcgcaatcc aggattcgca 480
ctggcagcag cagcaatcgc atggctgctg ggaagctcca ccagccagaa agtgatctac 540
ctggtcatga tcctgctgat cgctcctgcc tattctatcc ggtgcatcgg cgtgagcaat 600
agagacttcg tggagggaat gtccggagga acctgggtgg atgtggtgct ggagcacggc 660
ggctgcgtga cagtgatggc ccaggacaag ccaaccgtgg atatcgagct ggtgaccaca 720
accgtgtcca acatggccga ggtgaggtct tactgctatg aggccagcat ctccgacatg 780
gcctctgata gcaggtgtcc aacccaggga gaggcatacc tggacaagca gtccgataca 840
cagtacgtgt gcaagcggac cctggtggac agaggctggg gcaatggctg tggcctgttt 900
ggcaagggct ctctggtgac atgcgccaag ttcgcctgta gcaagaagat gaccggcaag 960
tccatccagc cagagaacct ggagtaccgg atcatgctgt ctgtgcacgg ctcccagcac 1020
tctggcatga tcgtgaacga cacaggccac gagacagatg agaatcgggc caaggtggag 1080
atcacaccta actctccaag agccgaggcc accctgggag gatttggctc tctgggcctg 1140
gactgcgagc ctagaacagg cctggacttc tccgatctgt actatctgac catgaacaat 1200
aagcactggc tggtgcacaa ggagtggttt cacgacatcc cactgccatg gcacgcagga 1260
gcagatacag gaacaccaca ctggaacaat aaggaggccc tggtggagtt caaggatgcc 1320
cacgccaagc ggcagacagt ggtggtgctg ggcagccagg agggagcagt gcacaccgcc 1380
ctggcaggcg ccctggaggc agagatggac ggagctaagg gcagactgtc tagcggccac 1440
ctgaagtgca ggctgaagat ggataagctg cgcctgaagg gcgtgtccta ctctctgtgc 1500
acagccgcct tcaccttcac caagatccct gccgagacac tgcacggcac agtgaccgtg 1560
gaggtgcagt atgccggcac agacggaccc tgtaaggtgc ctgcccagat ggccgtggat 1620
atgcagacac tgacacctgt gggcaggctg atcaccgcca atccagtgat cacagagtct 1680
accgagaaca gcaagatgat gctggagctg gacccaccat ttggcgatag ctatatcgtg 1740
atcggcgtgg gcgagaagaa gatcacacac cactggcacc gcagcggctc cacaatcggc 1800
aaggcctttg aggcaaccgt gcgcggagca aagagaatgg ccgtgctggg cgacaccgca 1860
tgggatttcg gatctgtggg aggcgccctg aacagcctgg gcaagggcat ctga 1914
<210> 53
<211> 637
<212> PRT
<213> Artificial sequence
<220>
<223> Zikasp_Zika_prME411 protein (A3)
<400> 53
Met Glu Lys Lys Arg Arg Gly Ala Asp Thr Ser Val Gly Ile Val Gly
1 5 10 15
Leu Leu Leu Thr Thr Ala Met Ala Ala Glu Val Thr Arg Arg Gly Ser
20 25 30
Ala Tyr Tyr Met Tyr Leu Asp Arg Asn Asp Ala Gly Glu Ala Ile Ser
35 40 45
Phe Pro Thr Thr Leu Gly Met Asn Lys Cys Tyr Ile Gln Ile Met Asp
50 55 60
Leu Gly His Met Cys Asp Ala Thr Met Ser Tyr Glu Cys Pro Met Leu
65 70 75 80
Asp Glu Gly Val Glu Pro Asp Asp Val Asp Cys Trp Cys Asn Thr Thr
85 90 95
Ser Thr Trp Val Val Tyr Gly Thr Cys His His Lys Lys Gly Glu Ala
100 105 110
Arg Arg Ser Arg Arg Ala Val Thr Leu Pro Ser His Ser Thr Arg Lys
115 120 125
Leu Gln Thr Arg Ser Gln Thr Trp Leu Glu Ser Arg Glu Tyr Thr Lys
130 135 140
His Leu Ile Arg Val Glu Asn Trp Ile Phe Arg Asn Pro Gly Phe Ala
145 150 155 160
Leu Ala Ala Ala Ala Ile Ala Trp Leu Leu Gly Ser Ser Thr Ser Gln
165 170 175
Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile Ala Pro Ala Tyr Ser
180 185 190
Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser
195 200 205
Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val Thr
210 215 220
Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr
225 230 235 240
Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser
245 250 255
Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala
260 265 270
Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
275 280 285
Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser
290 295 300
Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys
305 310 315 320
Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
325 330 335
Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr
340 345 350
Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
355 360 365
Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro
370 375 380
Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn
385 390 395 400
Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
405 410 415
Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu
420 425 430
Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
435 440 445
Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala
450 455 460
Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His
465 470 475 480
Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
485 490 495
Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu
500 505 510
Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
515 520 525
Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu
530 535 540
Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser
545 550 555 560
Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
565 570 575
Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp
580 585 590
His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
595 600 605
Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly
610 615 620
Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile
625 630 635
<210> 54
<211> 1794
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of Zikasp_Zika_prME395 protein (A4)
<400> 54
atggagaaga agagacgagg cgcagatact agtgtcggaa ttgttggcct cctgctgacc 60
acagctatgg cagcggaggt cactagacgt gggagtgcat actatatgta cttggacaga 120
aacgatgctg gggaggccat atcttttcca accacattgg ggatgaataa gtgttatata 180
cagatcatgg atcttggaca catgtgtgat gccaccatga gctatgaatg ccctatgctg 240
gatgaggggg tggaaccaga tgacgtcgat tgttggtgca acacgacgtc aacttgggtt 300
gtgtacggaa cctgccatca caaaaaaggt gaagcacgga gatctagaag agctgtgacg 360
ctcccctccc attccactag gaagctgcaa acgcggtcgc aaacctggtt ggaatcaaga 420
gaatacacaa agcacttgat tagagtcgaa aattggatat tcaggaaccc tggcttcgcg 480
ttagcagcag ctgccatcgc ttggcttttg ggaagctcaa cgagccaaaa agtcatatac 540
ttggtcatga tactgctgat tgccccggca tacagcatca ggtgcatagg agtcagcaat 600
agggactttg tggaaggtat gtcaggtggg acttgggttg atgttgtctt ggaacatgga 660
ggttgtgtca ccgtaatggc acaggacaaa ccgactgtcg acatagagct ggttacaaca 720
acagtcagca acatggcgga ggtaagatcc tactgctatg aggcatcaat atcagacatg 780
gcttcggaca gccgctgccc aacacaaggt gaagcctacc ttgacaagca atcagacact 840
caatatgtct gcaaaagaac gttagtggac agaggctggg gaaatggatg tggacttttt 900
ggcaaaggga gcctggtgac atgcgctaag tttgcatgct ccaagaaaat gaccgggaag 960
agcatccagc cagagaatct ggagtaccgg ataatgctgt cagttcatgg ctcccagcac 1020
agtgggatga tcgttaatga cacaggacat gaaactgatg agaatagagc gaaggttgag 1080
ataacgccca attcaccaag agccgaagcc accctggggg gttttggaag cctaggactt 1140
gattgtgaac cgaggacagg ccttgacttt tcagatttgt attacttgac tatgaataac 1200
aagcactggt tggttcacaa ggagtggttc cacgacattc cattaccttg gcacgctggg 1260
gcagacaccg gaactccaca ctggaacaac aaagaagcac tggtagagtt caaggacgca 1320
catgccaaaa ggcaaactgt cgtggttcta gggagtcaag aaggagcagt tcacacggcc 1380
cttgctggag ctctggaggc tgagatggat ggtgcaaagg gaaggctgtc ctctggccac 1440
ttgaaatgtc gcctgaaaat ggataaactt agattgaagg gcgtgtcata ctccttgtgt 1500
accgcagcgt tcacattcac caagatcccg gctgaaacac tgcacgggac agtcacagtg 1560
gaggtacagt acgcagggac agatggacct tgcaaggttc cagctcagat ggcggtggac 1620
atgcaaactc tgaccccagt tgggaggttg ataaccgcta accccgtaat cactgaaagc 1680
actgagaact ctaagatgat gctggaactt gatccaccat ttggggactc ttacattgtc 1740
ataggagtcg gggagaagaa gatcacccac cactggcaca ggagtggctg atga 1794
<210> 55
<211> 1794
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of Zikasp_Zika_prME395 protein (A4)
<400> 55
atggagaaga agcggagagg agcagacaca agcgtgggaa tcgtgggcct gctgctgacc 60
acagcaatgg cagcagaggt gaccaggaga ggaagcgcct actatatgta cctggacagg 120
aatgatgccg gcgaggccat ctccttccca accacactgg gcatgaacaa gtgctacatc 180
cagatcatgg acctgggcca catgtgcgat gccaccatgt cctatgagtg tccaatgctg 240
gacgagggcg tggagcccga cgatgtggat tgctggtgta ataccacatc tacatgggtg 300
gtgtacggca cctgtcacca caagaaggga gaggcccggc ggagccggcg ggccgtgaca 360
ctgccttccc actctaccag gaagctgcag acacgcagcc agacctggct ggagtccaga 420
gagtatacca agcacctgat cagggtggag aactggatct ttcgcaatcc aggattcgca 480
ctggcagcag cagcaatcgc atggctgctg ggaagctcca ccagccagaa agtgatctac 540
ctggtcatga tcctgctgat cgctcctgcc tattctatcc ggtgcatcgg cgtgagcaat 600
agagacttcg tggagggaat gtccggagga acctgggtgg atgtggtgct ggagcacggc 660
ggctgcgtga cagtgatggc ccaggacaag ccaaccgtgg atatcgagct ggtgaccaca 720
accgtgtcca acatggccga ggtgaggtct tactgctatg aggccagcat ctccgacatg 780
gcctctgata gcaggtgtcc aacccaggga gaggcatacc tggacaagca gtccgataca 840
cagtacgtgt gcaagcggac cctggtggac agaggctggg gcaatggctg tggcctgttt 900
ggcaagggct ctctggtgac atgcgccaag ttcgcctgta gcaagaagat gaccggcaag 960
tccatccagc cagagaacct ggagtaccgg atcatgctgt ctgtgcacgg ctcccagcac 1020
tctggcatga tcgtgaacga cacaggccac gagacagatg agaatcgggc caaggtggag 1080
atcacaccta actctccaag agccgaggcc accctgggag gatttggctc tctgggcctg 1140
gactgcgagc ctagaacagg cctggacttc tccgatctgt actatctgac catgaacaat 1200
aagcactggc tggtgcacaa ggagtggttt cacgacatcc cactgccatg gcacgcagga 1260
gcagatacag gaacaccaca ctggaacaat aaggaggccc tggtggagtt caaggatgcc 1320
cacgccaagc ggcagacagt ggtggtgctg ggcagccagg agggagcagt gcacaccgcc 1380
ctggcaggcg ccctggaggc agagatggac ggagctaagg gcagactgtc tagcggccac 1440
ctgaagtgca ggctgaagat ggataagctg cgcctgaagg gcgtgtccta ctctctgtgc 1500
acagccgcct tcaccttcac caagatccct gccgagacac tgcacggcac agtgaccgtg 1560
gaggtgcagt atgccggcac agacggaccc tgtaaggtgc ctgcccagat ggccgtggat 1620
atgcagacac tgacacctgt gggcaggctg atcaccgcca atccagtgat cacagagtct 1680
accgagaaca gcaagatgat gctggagctg gacccaccat ttggcgatag ctatatcgtg 1740
atcggcgtgg gcgagaagaa gatcacacac cactggcacc gcagcggctg atga 1794
<210> 56
<211> 596
<212> PRT
<213> Artificial sequence
<220>
<223> Zikasp_Zika_prME395 protein (A4)
<400> 56
Met Glu Lys Lys Arg Arg Gly Ala Asp Thr Ser Val Gly Ile Val Gly
1 5 10 15
Leu Leu Leu Thr Thr Ala Met Ala Ala Glu Val Thr Arg Arg Gly Ser
20 25 30
Ala Tyr Tyr Met Tyr Leu Asp Arg Asn Asp Ala Gly Glu Ala Ile Ser
35 40 45
Phe Pro Thr Thr Leu Gly Met Asn Lys Cys Tyr Ile Gln Ile Met Asp
50 55 60
Leu Gly His Met Cys Asp Ala Thr Met Ser Tyr Glu Cys Pro Met Leu
65 70 75 80
Asp Glu Gly Val Glu Pro Asp Asp Val Asp Cys Trp Cys Asn Thr Thr
85 90 95
Ser Thr Trp Val Val Tyr Gly Thr Cys His His Lys Lys Gly Glu Ala
100 105 110
Arg Arg Ser Arg Arg Ala Val Thr Leu Pro Ser His Ser Thr Arg Lys
115 120 125
Leu Gln Thr Arg Ser Gln Thr Trp Leu Glu Ser Arg Glu Tyr Thr Lys
130 135 140
His Leu Ile Arg Val Glu Asn Trp Ile Phe Arg Asn Pro Gly Phe Ala
145 150 155 160
Leu Ala Ala Ala Ala Ile Ala Trp Leu Leu Gly Ser Ser Thr Ser Gln
165 170 175
Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile Ala Pro Ala Tyr Ser
180 185 190
Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser
195 200 205
Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val Thr
210 215 220
Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr
225 230 235 240
Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser
245 250 255
Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala
260 265 270
Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
275 280 285
Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser
290 295 300
Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys
305 310 315 320
Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
325 330 335
Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr
340 345 350
Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
355 360 365
Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro
370 375 380
Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn
385 390 395 400
Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
405 410 415
Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu
420 425 430
Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
435 440 445
Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala
450 455 460
Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His
465 470 475 480
Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
485 490 495
Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu
500 505 510
Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
515 520 525
Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu
530 535 540
Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser
545 550 555 560
Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
565 570 575
Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp
580 585 590
His Arg Ser Gly
595
<210> 57
<211> 1590
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of Zikasp_ZikaE protein (A5)
<400> 57
atggagaaga agagacgagg cgcagatact agtgtcggaa ttgttggcct cctgctgacc 60
acagctatgg caatcaggtg cataggagtc agcaataggg actttgtgga aggtatgtca 120
ggtgggactt gggttgatgt tgtcttggaa catggaggtt gtgtcaccgt aatggcacag 180
gacaaaccga ctgtcgacat agagctggtt acaacaacag tcagcaacat ggcggaggta 240
agatcctact gctatgaggc atcaatatca gacatggctt cggacagccg ctgcccaaca 300
caaggtgaag cctaccttga caagcaatca gacactcaat atgtctgcaa aagaacgtta 360
gtggacagag gctggggaaa tggatgtgga ctttttggca aagggagcct ggtgacatgc 420
gctaagtttg catgctccaa gaaaatgacc gggaagagca tccagccaga gaatctggag 480
taccggataa tgctgtcagt tcatggctcc cagcacagtg ggatgatcgt taatgacaca 540
ggacatgaaa ctgatgagaa tagagcgaag gttgagataa cgcccaattc accaagagcc 600
gaagccaccc tggggggttt tggaagccta ggacttgatt gtgaaccgag gacaggcctt 660
gacttttcag atttgtatta cttgactatg aataacaagc actggttggt tcacaaggag 720
tggttccacg acattccatt accttggcac gctggggcag acaccggaac tccacactgg 780
aacaacaaag aagcactggt agagttcaag gacgcacatg ccaaaaggca aactgtcgtg 840
gttctaggga gtcaagaagg agcagttcac acggcccttg ctggagctct ggaggctgag 900
atggatggtg caaagggaag gctgtcctct ggccacttga aatgtcgcct gaaaatggat 960
aaacttagat tgaagggcgt gtcatactcc ttgtgtaccg cagcgttcac attcaccaag 1020
atcccggctg aaacactgca cgggacagtc acagtggagg tacagtacgc agggacagat 1080
ggaccttgca aggttccagc tcagatggcg gtggacatgc aaactctgac cccagttggg 1140
aggttgataa ccgctaaccc cgtaatcact gaaagcactg agaactctaa gatgatgctg 1200
gaacttgatc caccatttgg ggactcttac attgtcatag gagtcgggga gaagaagatc 1260
acccaccact ggcacaggag tggcagcacc attggaaaag catttgaagc cactgtgaga 1320
ggtgccaaga gaatggcagt cttgggagac acagcctggg actttggatc agttggaggc 1380
gctctcaact cattgggcaa gggcatccat caaatttttg gagcagcttt caaatcattg 1440
tttggaggaa tgtcctggtt ctcacaaatt ctcattggaa cgttgctgat gtggttgggt 1500
ctgaacacaa agaatggatc tatttccctt atgtgcttgg ccttaggggg agtgttgatc 1560
ttcttatcca cagccgtctc tgcttgatga 1590
<210> 58
<211> 1590
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of Zikasp_ZikaE protein (A5)
<400> 58
atggagaaga agcggagagg agcagacaca agcgtgggaa tcgtgggcct gctgctgacc 60
acagcaatgg caatccggtg catcggcgtg agcaatagag acttcgtgga gggaatgtcc 120
ggaggaacct gggtggatgt ggtgctggag cacggcggct gcgtgacagt gatggcccag 180
gacaagccaa ccgtggatat cgagctggtg accacaaccg tgtccaacat ggccgaggtg 240
aggtcttact gctatgaggc cagcatctcc gacatggcct ctgatagcag gtgtccaacc 300
cagggagagg catacctgga caagcagtcc gatacacagt acgtgtgcaa gcggaccctg 360
gtggacagag gctggggcaa tggctgtggc ctgtttggca agggctctct ggtgacatgc 420
gccaagttcg cctgtagcaa gaagatgacc ggcaagtcca tccagccaga gaacctggag 480
taccggatca tgctgtctgt gcacggctcc cagcactctg gcatgatcgt gaacgacaca 540
ggccacgaga cagatgagaa tcgggccaag gtggagatca cacctaactc tccaagagcc 600
gaggccaccc tgggaggatt tggctctctg ggcctggact gcgagcctag aacaggcctg 660
gacttctccg atctgtacta tctgaccatg aacaataagc actggctggt gcacaaggag 720
tggtttcacg acatcccact gccatggcac gcaggagcag atacaggaac accacactgg 780
aacaataagg aggccctggt ggagttcaag gatgcccacg ccaagcggca gacagtggtg 840
gtgctgggca gccaggaggg agcagtgcac accgccctgg caggcgccct ggaggcagag 900
atggacggag ctaagggcag actgtctagc ggccacctga agtgcaggct gaagatggat 960
aagctgcgcc tgaagggcgt gtcctactct ctgtgcacag ccgccttcac cttcaccaag 1020
atccctgccg agacactgca cggcacagtg accgtggagg tgcagtatgc cggcacagac 1080
ggaccctgta aggtgcctgc ccagatggcc gtggatatgc agacactgac acctgtgggc 1140
aggctgatca ccgccaatcc agtgatcaca gagtctaccg agaacagcaa gatgatgctg 1200
gagctggacc caccatttgg cgatagctat atcgtgatcg gcgtgggcga gaagaagatc 1260
acacaccact ggcaccgcag cggctccaca atcggcaagg cctttgaggc aaccgtgcgc 1320
ggagcaaaga gaatggccgt gctgggcgac accgcatggg atttcggatc tgtgggaggc 1380
gccctgaaca gcctgggcaa gggcatccac cagatcttcg gcgccgcctt taagtccctg 1440
ttcggcggca tgagctggtt ctcacagatc ctgatcggca cactgctgat gtggctgggc 1500
ctgaacacca agaatggctc tatcagcctg atgtgcctgg ccctgggagg cgtgctgatc 1560
ttcctgtcca ccgccgtgtc tgcctgatga 1590
<210> 59
<211> 528
<212> PRT
<213> Artificial sequence
<220>
<223> Zikasp_ZikaE protein (A5)
<400> 59
Met Glu Lys Lys Arg Arg Gly Ala Asp Thr Ser Val Gly Ile Val Gly
1 5 10 15
Leu Leu Leu Thr Thr Ala Met Ala Ile Arg Cys Ile Gly Val Ser Asn
20 25 30
Arg Asp Phe Val Glu Gly Met Ser Gly Gly Thr Trp Val Asp Val Val
35 40 45
Leu Glu His Gly Gly Cys Val Thr Val Met Ala Gln Asp Lys Pro Thr
50 55 60
Val Asp Ile Glu Leu Val Thr Thr Thr Val Ser Asn Met Ala Glu Val
65 70 75 80
Arg Ser Tyr Cys Tyr Glu Ala Ser Ile Ser Asp Met Ala Ser Asp Ser
85 90 95
Arg Cys Pro Thr Gln Gly Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr
100 105 110
Gln Tyr Val Cys Lys Arg Thr Leu Val Asp Arg Gly Trp Gly Asn Gly
115 120 125
Cys Gly Leu Phe Gly Lys Gly Ser Leu Val Thr Cys Ala Lys Phe Ala
130 135 140
Cys Ser Lys Lys Met Thr Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu
145 150 155 160
Tyr Arg Ile Met Leu Ser Val His Gly Ser Gln His Ser Gly Met Ile
165 170 175
Val Asn Asp Thr Gly His Glu Thr Asp Glu Asn Arg Ala Lys Val Glu
180 185 190
Ile Thr Pro Asn Ser Pro Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly
195 200 205
Ser Leu Gly Leu Asp Cys Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp
210 215 220
Leu Tyr Tyr Leu Thr Met Asn Asn Lys His Trp Leu Val His Lys Glu
225 230 235 240
Trp Phe His Asp Ile Pro Leu Pro Trp His Ala Gly Ala Asp Thr Gly
245 250 255
Thr Pro His Trp Asn Asn Lys Glu Ala Leu Val Glu Phe Lys Asp Ala
260 265 270
His Ala Lys Arg Gln Thr Val Val Val Leu Gly Ser Gln Glu Gly Ala
275 280 285
Val His Thr Ala Leu Ala Gly Ala Leu Glu Ala Glu Met Asp Gly Ala
290 295 300
Lys Gly Arg Leu Ser Ser Gly His Leu Lys Cys Arg Leu Lys Met Asp
305 310 315 320
Lys Leu Arg Leu Lys Gly Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe
325 330 335
Thr Phe Thr Lys Ile Pro Ala Glu Thr Leu His Gly Thr Val Thr Val
340 345 350
Glu Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys Lys Val Pro Ala Gln
355 360 365
Met Ala Val Asp Met Gln Thr Leu Thr Pro Val Gly Arg Leu Ile Thr
370 375 380
Ala Asn Pro Val Ile Thr Glu Ser Thr Glu Asn Ser Lys Met Met Leu
385 390 395 400
Glu Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile Val Ile Gly Val Gly
405 410 415
Glu Lys Lys Ile Thr His His Trp His Arg Ser Gly Ser Thr Ile Gly
420 425 430
Lys Ala Phe Glu Ala Thr Val Arg Gly Ala Lys Arg Met Ala Val Leu
435 440 445
Gly Asp Thr Ala Trp Asp Phe Gly Ser Val Gly Gly Ala Leu Asn Ser
450 455 460
Leu Gly Lys Gly Ile His Gln Ile Phe Gly Ala Ala Phe Lys Ser Leu
465 470 475 480
Phe Gly Gly Met Ser Trp Phe Ser Gln Ile Leu Ile Gly Thr Leu Leu
485 490 495
Met Trp Leu Gly Leu Asn Thr Lys Asn Gly Ser Ile Ser Leu Met Cys
500 505 510
Leu Ala Leu Gly Gly Val Leu Ile Phe Leu Ser Thr Ala Val Ser Ala
515 520 525
<210> 60
<211> 1446
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of Zikasp_ZikaE_no_Anchor protein (A6)
<400> 60
atggagaaga agagacgagg cgcagatact agtgtcggaa ttgttggcct cctgctgacc 60
acagctatgg caatcaggtg cataggagtc agcaataggg actttgtgga aggtatgtca 120
ggtgggactt gggttgatgt tgtcttggaa catggaggtt gtgtcaccgt aatggcacag 180
gacaaaccga ctgtcgacat agagctggtt acaacaacag tcagcaacat ggcggaggta 240
agatcctact gctatgaggc atcaatatca gacatggctt cggacagccg ctgcccaaca 300
caaggtgaag cctaccttga caagcaatca gacactcaat atgtctgcaa aagaacgtta 360
gtggacagag gctggggaaa tggatgtgga ctttttggca aagggagcct ggtgacatgc 420
gctaagtttg catgctccaa gaaaatgacc gggaagagca tccagccaga gaatctggag 480
taccggataa tgctgtcagt tcatggctcc cagcacagtg ggatgatcgt taatgacaca 540
ggacatgaaa ctgatgagaa tagagcgaag gttgagataa cgcccaattc accaagagcc 600
gaagccaccc tggggggttt tggaagccta ggacttgatt gtgaaccgag gacaggcctt 660
gacttttcag atttgtatta cttgactatg aataacaagc actggttggt tcacaaggag 720
tggttccacg acattccatt accttggcac gctggggcag acaccggaac tccacactgg 780
aacaacaaag aagcactggt agagttcaag gacgcacatg ccaaaaggca aactgtcgtg 840
gttctaggga gtcaagaagg agcagttcac acggcccttg ctggagctct ggaggctgag 900
atggatggtg caaagggaag gctgtcctct ggccacttga aatgtcgcct gaaaatggat 960
aaacttagat tgaagggcgt gtcatactcc ttgtgtaccg cagcgttcac attcaccaag 1020
atcccggctg aaacactgca cgggacagtc acagtggagg tacagtacgc agggacagat 1080
ggaccttgca aggttccagc tcagatggcg gtggacatgc aaactctgac cccagttggg 1140
aggttgataa ccgctaaccc cgtaatcact gaaagcactg agaactctaa gatgatgctg 1200
gaacttgatc caccatttgg ggactcttac attgtcatag gagtcgggga gaagaagatc 1260
acccaccact ggcacaggag tggcagcacc attggaaaag catttgaagc cactgtgaga 1320
ggtgccaaga gaatggcagt cttgggagac acagcctggg actttggatc agttggaggc 1380
gctctcaact cattgggcaa gggcatccat caaatttttg gagcagcttt caaatcattg 1440
tgatga 1446
<210> 61
<211> 1446
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of Zikasp_ZikaE_no_Anchor protein (A6)
<400> 61
atggagaaga agcggagagg agcagacaca agcgtgggaa tcgtgggcct gctgctgacc 60
acagcaatgg caatccggtg catcggcgtg agcaatagag acttcgtgga gggaatgtcc 120
ggaggaacct gggtggatgt ggtgctggag cacggcggct gcgtgacagt gatggcccag 180
gacaagccaa ccgtggatat cgagctggtg accacaaccg tgtccaacat ggccgaggtg 240
aggtcttact gctatgaggc cagcatctcc gacatggcct ctgatagcag gtgtccaacc 300
cagggagagg catacctgga caagcagtcc gatacacagt acgtgtgcaa gcggaccctg 360
gtggacagag gctggggcaa tggctgtggc ctgtttggca agggctctct ggtgacatgc 420
gccaagttcg cctgtagcaa gaagatgacc ggcaagtcca tccagccaga gaacctggag 480
taccggatca tgctgtctgt gcacggctcc cagcactctg gcatgatcgt gaacgacaca 540
ggccacgaga cagatgagaa tcgggccaag gtggagatca cacctaactc tccaagagcc 600
gaggccaccc tgggaggatt tggctctctg ggcctggact gcgagcctag aacaggcctg 660
gacttctccg atctgtacta tctgaccatg aacaataagc actggctggt gcacaaggag 720
tggtttcacg acatcccact gccatggcac gcaggagcag atacaggaac accacactgg 780
aacaataagg aggccctggt ggagttcaag gatgcccacg ccaagcggca gacagtggtg 840
gtgctgggca gccaggaggg agcagtgcac accgccctgg caggcgccct ggaggcagag 900
atggacggag ctaagggcag actgtctagc ggccacctga agtgcaggct gaagatggat 960
aagctgcgcc tgaagggcgt gtcctactct ctgtgcacag ccgccttcac cttcaccaag 1020
atccctgccg agacactgca cggcacagtg accgtggagg tgcagtatgc cggcacagac 1080
ggaccctgta aggtgcctgc ccagatggcc gtggatatgc agacactgac acctgtgggc 1140
aggctgatca ccgccaatcc agtgatcaca gagtctaccg agaacagcaa gatgatgctg 1200
gagctggacc caccatttgg cgatagctat atcgtgatcg gcgtgggcga gaagaagatc 1260
acacaccact ggcaccgcag cggctccaca atcggcaagg cctttgaggc aaccgtgcgc 1320
ggagcaaaga gaatggccgt gctgggcgac accgcatggg atttcggatc tgtgggaggc 1380
gccctgaaca gcctgggcaa gggcatccac cagatcttcg gcgccgcctt taagtccctg 1440
tgatga 1446
<210> 62
<211> 480
<212> PRT
<213> Artificial sequence
<220>
<223> Zikasp_ZikaE_no_Anchor protein (A6)
<400> 62
Met Glu Lys Lys Arg Arg Gly Ala Asp Thr Ser Val Gly Ile Val Gly
1 5 10 15
Leu Leu Leu Thr Thr Ala Met Ala Ile Arg Cys Ile Gly Val Ser Asn
20 25 30
Arg Asp Phe Val Glu Gly Met Ser Gly Gly Thr Trp Val Asp Val Val
35 40 45
Leu Glu His Gly Gly Cys Val Thr Val Met Ala Gln Asp Lys Pro Thr
50 55 60
Val Asp Ile Glu Leu Val Thr Thr Thr Val Ser Asn Met Ala Glu Val
65 70 75 80
Arg Ser Tyr Cys Tyr Glu Ala Ser Ile Ser Asp Met Ala Ser Asp Ser
85 90 95
Arg Cys Pro Thr Gln Gly Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr
100 105 110
Gln Tyr Val Cys Lys Arg Thr Leu Val Asp Arg Gly Trp Gly Asn Gly
115 120 125
Cys Gly Leu Phe Gly Lys Gly Ser Leu Val Thr Cys Ala Lys Phe Ala
130 135 140
Cys Ser Lys Lys Met Thr Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu
145 150 155 160
Tyr Arg Ile Met Leu Ser Val His Gly Ser Gln His Ser Gly Met Ile
165 170 175
Val Asn Asp Thr Gly His Glu Thr Asp Glu Asn Arg Ala Lys Val Glu
180 185 190
Ile Thr Pro Asn Ser Pro Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly
195 200 205
Ser Leu Gly Leu Asp Cys Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp
210 215 220
Leu Tyr Tyr Leu Thr Met Asn Asn Lys His Trp Leu Val His Lys Glu
225 230 235 240
Trp Phe His Asp Ile Pro Leu Pro Trp His Ala Gly Ala Asp Thr Gly
245 250 255
Thr Pro His Trp Asn Asn Lys Glu Ala Leu Val Glu Phe Lys Asp Ala
260 265 270
His Ala Lys Arg Gln Thr Val Val Val Leu Gly Ser Gln Glu Gly Ala
275 280 285
Val His Thr Ala Leu Ala Gly Ala Leu Glu Ala Glu Met Asp Gly Ala
290 295 300
Lys Gly Arg Leu Ser Ser Gly His Leu Lys Cys Arg Leu Lys Met Asp
305 310 315 320
Lys Leu Arg Leu Lys Gly Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe
325 330 335
Thr Phe Thr Lys Ile Pro Ala Glu Thr Leu His Gly Thr Val Thr Val
340 345 350
Glu Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys Lys Val Pro Ala Gln
355 360 365
Met Ala Val Asp Met Gln Thr Leu Thr Pro Val Gly Arg Leu Ile Thr
370 375 380
Ala Asn Pro Val Ile Thr Glu Ser Thr Glu Asn Ser Lys Met Met Leu
385 390 395 400
Glu Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile Val Ile Gly Val Gly
405 410 415
Glu Lys Lys Ile Thr His His Trp His Arg Ser Gly Ser Thr Ile Gly
420 425 430
Lys Ala Phe Glu Ala Thr Val Arg Gly Ala Lys Arg Met Ala Val Leu
435 440 445
Gly Asp Thr Ala Trp Asp Phe Gly Ser Val Gly Gly Ala Leu Asn Ser
450 455 460
Leu Gly Lys Gly Ile His Gln Ile Phe Gly Ala Ala Phe Lys Ser Leu
465 470 475 480
<210> 63
<211> 1410
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of Zikasp_ZikaE411 protein (A7)
<400> 63
atggagaaga agagacgagg cgcagatact agtgtcggaa ttgttggcct cctgctgacc 60
acagctatgg caatcaggtg cataggagtc agcaataggg actttgtgga aggtatgtca 120
ggtgggactt gggttgatgt tgtcttggaa catggaggtt gtgtcaccgt aatggcacag 180
gacaaaccga ctgtcgacat agagctggtt acaacaacag tcagcaacat ggcggaggta 240
agatcctact gctatgaggc atcaatatca gacatggctt cggacagccg ctgcccaaca 300
caaggtgaag cctaccttga caagcaatca gacactcaat atgtctgcaa aagaacgtta 360
gtggacagag gctggggaaa tggatgtgga ctttttggca aagggagcct ggtgacatgc 420
gctaagtttg catgctccaa gaaaatgacc gggaagagca tccagccaga gaatctggag 480
taccggataa tgctgtcagt tcatggctcc cagcacagtg ggatgatcgt taatgacaca 540
ggacatgaaa ctgatgagaa tagagcgaag gttgagataa cgcccaattc accaagagcc 600
gaagccaccc tggggggttt tggaagccta ggacttgatt gtgaaccgag gacaggcctt 660
gacttttcag atttgtatta cttgactatg aataacaagc actggttggt tcacaaggag 720
tggttccacg acattccatt accttggcac gctggggcag acaccggaac tccacactgg 780
aacaacaaag aagcactggt agagttcaag gacgcacatg ccaaaaggca aactgtcgtg 840
gttctaggga gtcaagaagg agcagttcac acggcccttg ctggagctct ggaggctgag 900
atggatggtg caaagggaag gctgtcctct ggccacttga aatgtcgcct gaaaatggat 960
aaacttagat tgaagggcgt gtcatactcc ttgtgtaccg cagcgttcac attcaccaag 1020
atcccggctg aaacactgca cgggacagtc acagtggagg tacagtacgc agggacagat 1080
ggaccttgca aggttccagc tcagatggcg gtggacatgc aaactctgac cccagttggg 1140
aggttgataa ccgctaaccc cgtaatcact gaaagcactg agaactctaa gatgatgctg 1200
gaacttgatc caccatttgg ggactcttac attgtcatag gagtcgggga gaagaagatc 1260
acccaccact ggcacaggag tggcagcacc attggaaaag catttgaagc cactgtgaga 1320
ggtgccaaga gaatggcagt cttgggagac acagcctggg actttggatc agttggaggc 1380
gctctcaact cattgggcaa gggcatctga 1410
<210> 64
<211> 1410
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of Zikasp_ZikaE411 protein (A7)
<400> 64
atggagaaga agcggagagg agcagacaca agcgtgggaa tcgtgggcct gctgctgacc 60
acagcaatgg caatccggtg catcggcgtg agcaatagag acttcgtgga gggaatgtcc 120
ggaggaacct gggtggatgt ggtgctggag cacggcggct gcgtgacagt gatggcccag 180
gacaagccaa ccgtggatat cgagctggtg accacaaccg tgtccaacat ggccgaggtg 240
aggtcttact gctatgaggc cagcatctcc gacatggcct ctgatagcag gtgtccaacc 300
cagggagagg catacctgga caagcagtcc gatacacagt acgtgtgcaa gcggaccctg 360
gtggacagag gctggggcaa tggctgtggc ctgtttggca agggctctct ggtgacatgc 420
gccaagttcg cctgtagcaa gaagatgacc ggcaagtcca tccagccaga gaacctggag 480
taccggatca tgctgtctgt gcacggctcc cagcactctg gcatgatcgt gaacgacaca 540
ggccacgaga cagatgagaa tcgggccaag gtggagatca cacctaactc tccaagagcc 600
gaggccaccc tgggaggatt tggctctctg ggcctggact gcgagcctag aacaggcctg 660
gacttctccg atctgtacta tctgaccatg aacaataagc actggctggt gcacaaggag 720
tggtttcacg acatcccact gccatggcac gcaggagcag atacaggaac accacactgg 780
aacaataagg aggccctggt ggagttcaag gatgcccacg ccaagcggca gacagtggtg 840
gtgctgggca gccaggaggg agcagtgcac accgccctgg caggcgccct ggaggcagag 900
atggacggag ctaagggcag actgtctagc ggccacctga agtgcaggct gaagatggat 960
aagctgcgcc tgaagggcgt gtcctactct ctgtgcacag ccgccttcac cttcaccaag 1020
atccctgccg agacactgca cggcacagtg accgtggagg tgcagtatgc cggcacagac 1080
ggaccctgta aggtgcctgc ccagatggcc gtggatatgc agacactgac acctgtgggc 1140
aggctgatca ccgccaatcc agtgatcaca gagtctaccg agaacagcaa gatgatgctg 1200
gagctggacc caccatttgg cgatagctat atcgtgatcg gcgtgggcga gaagaagatc 1260
acacaccact ggcaccgcag cggctccaca atcggcaagg cctttgaggc aaccgtgcgc 1320
ggagcaaaga gaatggccgt gctgggcgac accgcatggg atttcggatc tgtgggaggc 1380
gccctgaaca gcctgggcaa gggcatctga 1410
<210> 65
<211> 469
<212> PRT
<213> Artificial sequence
<220>
<223> Zikasp_ZikaE411 protein (A7)
<400> 65
Met Glu Lys Lys Arg Arg Gly Ala Asp Thr Ser Val Gly Ile Val Gly
1 5 10 15
Leu Leu Leu Thr Thr Ala Met Ala Ile Arg Cys Ile Gly Val Ser Asn
20 25 30
Arg Asp Phe Val Glu Gly Met Ser Gly Gly Thr Trp Val Asp Val Val
35 40 45
Leu Glu His Gly Gly Cys Val Thr Val Met Ala Gln Asp Lys Pro Thr
50 55 60
Val Asp Ile Glu Leu Val Thr Thr Thr Val Ser Asn Met Ala Glu Val
65 70 75 80
Arg Ser Tyr Cys Tyr Glu Ala Ser Ile Ser Asp Met Ala Ser Asp Ser
85 90 95
Arg Cys Pro Thr Gln Gly Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr
100 105 110
Gln Tyr Val Cys Lys Arg Thr Leu Val Asp Arg Gly Trp Gly Asn Gly
115 120 125
Cys Gly Leu Phe Gly Lys Gly Ser Leu Val Thr Cys Ala Lys Phe Ala
130 135 140
Cys Ser Lys Lys Met Thr Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu
145 150 155 160
Tyr Arg Ile Met Leu Ser Val His Gly Ser Gln His Ser Gly Met Ile
165 170 175
Val Asn Asp Thr Gly His Glu Thr Asp Glu Asn Arg Ala Lys Val Glu
180 185 190
Ile Thr Pro Asn Ser Pro Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly
195 200 205
Ser Leu Gly Leu Asp Cys Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp
210 215 220
Leu Tyr Tyr Leu Thr Met Asn Asn Lys His Trp Leu Val His Lys Glu
225 230 235 240
Trp Phe His Asp Ile Pro Leu Pro Trp His Ala Gly Ala Asp Thr Gly
245 250 255
Thr Pro His Trp Asn Asn Lys Glu Ala Leu Val Glu Phe Lys Asp Ala
260 265 270
His Ala Lys Arg Gln Thr Val Val Val Leu Gly Ser Gln Glu Gly Ala
275 280 285
Val His Thr Ala Leu Ala Gly Ala Leu Glu Ala Glu Met Asp Gly Ala
290 295 300
Lys Gly Arg Leu Ser Ser Gly His Leu Lys Cys Arg Leu Lys Met Asp
305 310 315 320
Lys Leu Arg Leu Lys Gly Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe
325 330 335
Thr Phe Thr Lys Ile Pro Ala Glu Thr Leu His Gly Thr Val Thr Val
340 345 350
Glu Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys Lys Val Pro Ala Gln
355 360 365
Met Ala Val Asp Met Gln Thr Leu Thr Pro Val Gly Arg Leu Ile Thr
370 375 380
Ala Asn Pro Val Ile Thr Glu Ser Thr Glu Asn Ser Lys Met Met Leu
385 390 395 400
Glu Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile Val Ile Gly Val Gly
405 410 415
Glu Lys Lys Ile Thr His His Trp His Arg Ser Gly Ser Thr Ile Gly
420 425 430
Lys Ala Phe Glu Ala Thr Val Arg Gly Ala Lys Arg Met Ala Val Leu
435 440 445
Gly Asp Thr Ala Trp Asp Phe Gly Ser Val Gly Gly Ala Leu Asn Ser
450 455 460
Leu Gly Lys Gly Ile
465
<210> 66
<211> 1290
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of Zikasp_ZikaE395 protein (A8)
<400> 66
atggagaaga agagacgagg cgcagatact agtgtcggaa ttgttggcct cctgctgacc 60
acagctatgg caatcaggtg cataggagtc agcaataggg actttgtgga aggtatgtca 120
ggtgggactt gggttgatgt tgtcttggaa catggaggtt gtgtcaccgt aatggcacag 180
gacaaaccga ctgtcgacat agagctggtt acaacaacag tcagcaacat ggcggaggta 240
agatcctact gctatgaggc atcaatatca gacatggctt cggacagccg ctgcccaaca 300
caaggtgaag cctaccttga caagcaatca gacactcaat atgtctgcaa aagaacgtta 360
gtggacagag gctggggaaa tggatgtgga ctttttggca aagggagcct ggtgacatgc 420
gctaagtttg catgctccaa gaaaatgacc gggaagagca tccagccaga gaatctggag 480
taccggataa tgctgtcagt tcatggctcc cagcacagtg ggatgatcgt taatgacaca 540
ggacatgaaa ctgatgagaa tagagcgaag gttgagataa cgcccaattc accaagagcc 600
gaagccaccc tggggggttt tggaagccta ggacttgatt gtgaaccgag gacaggcctt 660
gacttttcag atttgtatta cttgactatg aataacaagc actggttggt tcacaaggag 720
tggttccacg acattccatt accttggcac gctggggcag acaccggaac tccacactgg 780
aacaacaaag aagcactggt agagttcaag gacgcacatg ccaaaaggca aactgtcgtg 840
gttctaggga gtcaagaagg agcagttcac acggcccttg ctggagctct ggaggctgag 900
atggatggtg caaagggaag gctgtcctct ggccacttga aatgtcgcct gaaaatggat 960
aaacttagat tgaagggcgt gtcatactcc ttgtgtaccg cagcgttcac attcaccaag 1020
atcccggctg aaacactgca cgggacagtc acagtggagg tacagtacgc agggacagat 1080
ggaccttgca aggttccagc tcagatggcg gtggacatgc aaactctgac cccagttggg 1140
aggttgataa ccgctaaccc cgtaatcact gaaagcactg agaactctaa gatgatgctg 1200
gaacttgatc caccatttgg ggactcttac attgtcatag gagtcgggga gaagaagatc 1260
acccaccact ggcacaggag tggctgatga 1290
<210> 67
<211> 1290
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of Zikasp_ZikaE395 protein (A8)
<400> 67
atggagaaga agcggagagg agcagacaca agcgtgggaa tcgtgggcct gctgctgacc 60
acagcaatgg caatccggtg catcggcgtg agcaatagag acttcgtgga gggaatgtcc 120
ggaggaacct gggtggatgt ggtgctggag cacggcggct gcgtgacagt gatggcccag 180
gacaagccaa ccgtggatat cgagctggtg accacaaccg tgtccaacat ggccgaggtg 240
aggtcttact gctatgaggc cagcatctcc gacatggcct ctgatagcag gtgtccaacc 300
cagggagagg catacctgga caagcagtcc gatacacagt acgtgtgcaa gcggaccctg 360
gtggacagag gctggggcaa tggctgtggc ctgtttggca agggctctct ggtgacatgc 420
gccaagttcg cctgtagcaa gaagatgacc ggcaagtcca tccagccaga gaacctggag 480
taccggatca tgctgtctgt gcacggctcc cagcactctg gcatgatcgt gaacgacaca 540
ggccacgaga cagatgagaa tcgggccaag gtggagatca cacctaactc tccaagagcc 600
gaggccaccc tgggaggatt tggctctctg ggcctggact gcgagcctag aacaggcctg 660
gacttctccg atctgtacta tctgaccatg aacaataagc actggctggt gcacaaggag 720
tggtttcacg acatcccact gccatggcac gcaggagcag atacaggaac accacactgg 780
aacaataagg aggccctggt ggagttcaag gatgcccacg ccaagcggca gacagtggtg 840
gtgctgggca gccaggaggg agcagtgcac accgccctgg caggcgccct ggaggcagag 900
atggacggag ctaagggcag actgtctagc ggccacctga agtgcaggct gaagatggat 960
aagctgcgcc tgaagggcgt gtcctactct ctgtgcacag ccgccttcac cttcaccaag 1020
atccctgccg agacactgca cggcacagtg accgtggagg tgcagtatgc cggcacagac 1080
ggaccctgta aggtgcctgc ccagatggcc gtggatatgc agacactgac acctgtgggc 1140
aggctgatca ccgccaatcc agtgatcaca gagtctaccg agaacagcaa gatgatgctg 1200
gagctggacc caccatttgg cgatagctat atcgtgatcg gcgtgggcga gaagaagatc 1260
acacaccact ggcaccgcag cggctgatga 1290
<210> 68
<211> 428
<212> PRT
<213> Artificial sequence
<220>
<223> Zikasp_ZikaE395 protein (A8)
<400> 68
Met Glu Lys Lys Arg Arg Gly Ala Asp Thr Ser Val Gly Ile Val Gly
1 5 10 15
Leu Leu Leu Thr Thr Ala Met Ala Ile Arg Cys Ile Gly Val Ser Asn
20 25 30
Arg Asp Phe Val Glu Gly Met Ser Gly Gly Thr Trp Val Asp Val Val
35 40 45
Leu Glu His Gly Gly Cys Val Thr Val Met Ala Gln Asp Lys Pro Thr
50 55 60
Val Asp Ile Glu Leu Val Thr Thr Thr Val Ser Asn Met Ala Glu Val
65 70 75 80
Arg Ser Tyr Cys Tyr Glu Ala Ser Ile Ser Asp Met Ala Ser Asp Ser
85 90 95
Arg Cys Pro Thr Gln Gly Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr
100 105 110
Gln Tyr Val Cys Lys Arg Thr Leu Val Asp Arg Gly Trp Gly Asn Gly
115 120 125
Cys Gly Leu Phe Gly Lys Gly Ser Leu Val Thr Cys Ala Lys Phe Ala
130 135 140
Cys Ser Lys Lys Met Thr Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu
145 150 155 160
Tyr Arg Ile Met Leu Ser Val His Gly Ser Gln His Ser Gly Met Ile
165 170 175
Val Asn Asp Thr Gly His Glu Thr Asp Glu Asn Arg Ala Lys Val Glu
180 185 190
Ile Thr Pro Asn Ser Pro Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly
195 200 205
Ser Leu Gly Leu Asp Cys Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp
210 215 220
Leu Tyr Tyr Leu Thr Met Asn Asn Lys His Trp Leu Val His Lys Glu
225 230 235 240
Trp Phe His Asp Ile Pro Leu Pro Trp His Ala Gly Ala Asp Thr Gly
245 250 255
Thr Pro His Trp Asn Asn Lys Glu Ala Leu Val Glu Phe Lys Asp Ala
260 265 270
His Ala Lys Arg Gln Thr Val Val Val Leu Gly Ser Gln Glu Gly Ala
275 280 285
Val His Thr Ala Leu Ala Gly Ala Leu Glu Ala Glu Met Asp Gly Ala
290 295 300
Lys Gly Arg Leu Ser Ser Gly His Leu Lys Cys Arg Leu Lys Met Asp
305 310 315 320
Lys Leu Arg Leu Lys Gly Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe
325 330 335
Thr Phe Thr Lys Ile Pro Ala Glu Thr Leu His Gly Thr Val Thr Val
340 345 350
Glu Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys Lys Val Pro Ala Gln
355 360 365
Met Ala Val Asp Met Gln Thr Leu Thr Pro Val Gly Arg Leu Ile Thr
370 375 380
Ala Asn Pro Val Ile Thr Glu Ser Thr Glu Asn Ser Lys Met Met Leu
385 390 395 400
Glu Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile Val Ile Gly Val Gly
405 410 415
Glu Lys Lys Ile Thr His His Trp His Arg Ser Gly
420 425
<210> 69
<211> 1572
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of Zikasp ZikaE protein (A9)
<400> 69
atgcaaaaag tcatatactt ggtcatgata ctgctgattg ccccggcata cagcatcagg 60
tgcataggag tcagcaatag ggactttgtg gaaggtatgt caggtgggac ttgggttgat 120
gttgtcttgg aacatggagg ttgtgtcacc gtaatggcac aggacaaacc gactgtcgac 180
atagagctgg ttacaacaac agtcagcaac atggcggagg taagatccta ctgctatgag 240
gcatcaatat cagacatggc ttcggacagc cgctgcccaa cacaaggtga agcctacctt 300
gacaagcaat cagacactca atatgtctgc aaaagaacgt tagtggacag aggctgggga 360
aatggatgtg gactttttgg caaagggagc ctggtgacat gcgctaagtt tgcatgctcc 420
aagaaaatga ccgggaagag catccagcca gagaatctgg agtaccggat aatgctgtca 480
gttcatggct cccagcacag tgggatgatc gttaatgaca caggacatga aactgatgag 540
aatagagcga aggttgagat aacgcccaat tcaccaagag ccgaagccac cctggggggt 600
tttggaagcc taggacttga ttgtgaaccg aggacaggcc ttgacttttc agatttgtat 660
tacttgacta tgaataacaa gcactggttg gttcacaagg agtggttcca cgacattcca 720
ttaccttggc acgctggggc agacaccgga actccacact ggaacaacaa agaagcactg 780
gtagagttca aggacgcaca tgccaaaagg caaactgtcg tggttctagg gagtcaagaa 840
ggagcagttc acacggccct tgctggagct ctggaggctg agatggatgg tgcaaaggga 900
aggctgtcct ctggccactt gaaatgtcgc ctgaaaatgg ataaacttag attgaagggc 960
gtgtcatact ccttgtgtac cgcagcgttc acattcacca agatcccggc tgaaacactg 1020
cacgggacag tcacagtgga ggtacagtac gcagggacag atggaccttg caaggttcca 1080
gctcagatgg cggtggacat gcaaactctg accccagttg ggaggttgat aaccgctaac 1140
cccgtaatca ctgaaagcac tgagaactct aagatgatgc tggaacttga tccaccattt 1200
ggggactctt acattgtcat aggagtcggg gagaagaaga tcacccacca ctggcacagg 1260
agtggcagca ccattggaaa agcatttgaa gccactgtga gaggtgccaa gagaatggca 1320
gtcttgggag acacagcctg ggactttgga tcagttggag gcgctctcaa ctcattgggc 1380
aagggcatcc atcaaatttt tggagcagct ttcaaatcat tgtttggagg aatgtcctgg 1440
ttctcacaaa ttctcattgg aacgttgctg atgtggttgg gtctgaacac aaagaatgga 1500
tctatttccc ttatgtgctt ggccttaggg ggagtgttga tcttcttatc cacagccgtc 1560
tctgcttgat ga 1572
<210> 70
<211> 1572
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of Zikasp ZikaE protein (A9)
<400> 70
atgcagaaag tgatctacct ggtcatgatc ctgctgatcg ctcctgccta ttctatccgg 60
tgcatcggcg tgagcaatag agacttcgtg gagggaatgt ccggaggaac ctgggtggat 120
gtggtgctgg agcacggcgg ctgcgtgaca gtgatggccc aggacaagcc aaccgtggat 180
atcgagctgg tgaccacaac cgtgtccaac atggccgagg tgaggtctta ctgctatgag 240
gccagcatct ccgacatggc ctctgatagc aggtgtccaa cccagggaga ggcatacctg 300
gacaagcagt ccgatacaca gtacgtgtgc aagcggaccc tggtggacag aggctggggc 360
aatggctgtg gcctgtttgg caagggctct ctggtgacat gcgccaagtt cgcctgtagc 420
aagaagatga ccggcaagtc catccagcca gagaacctgg agtaccggat catgctgtct 480
gtgcacggct cccagcactc tggcatgatc gtgaacgaca caggccacga gacagatgag 540
aatcgggcca aggtggagat cacacctaac tctccaagag ccgaggccac cctgggagga 600
tttggctctc tgggcctgga ctgcgagcct agaacaggcc tggacttctc cgatctgtac 660
tatctgacca tgaacaataa gcactggctg gtgcacaagg agtggtttca cgacatccca 720
ctgccatggc acgcaggagc agatacagga acaccacact ggaacaataa ggaggccctg 780
gtggagttca aggatgccca cgccaagcgg cagacagtgg tggtgctggg cagccaggag 840
ggagcagtgc acaccgccct ggcaggcgcc ctggaggcag agatggacgg agctaagggc 900
agactgtcta gcggccacct gaagtgcagg ctgaagatgg ataagctgcg cctgaagggc 960
gtgtcctact ctctgtgcac agccgccttc accttcacca agatccctgc cgagacactg 1020
cacggcacag tgaccgtgga ggtgcagtat gccggcacag acggaccctg taaggtgcct 1080
gcccagatgg ccgtggatat gcagacactg acacctgtgg gcaggctgat caccgccaat 1140
ccagtgatca cagagtctac cgagaacagc aagatgatgc tggagctgga cccaccattt 1200
ggcgatagct atatcgtgat cggcgtgggc gagaagaaga tcacacacca ctggcaccgc 1260
agcggctcca caatcggcaa ggcctttgag gcaaccgtgc gcggagcaaa gagaatggcc 1320
gtgctgggcg acaccgcatg ggatttcgga tctgtgggag gcgccctgaa cagcctgggc 1380
aagggcatcc accagatctt cggcgccgcc tttaagtccc tgttcggcgg catgagctgg 1440
ttctcacaga tcctgatcgg cacactgctg atgtggctgg gcctgaacac caagaatggc 1500
tctatcagcc tgatgtgcct ggccctggga ggcgtgctga tcttcctgtc caccgccgtg 1560
tctgcctgat ga 1572
<210> 71
<211> 522
<212> PRT
<213> Artificial sequence
<220>
<223> Zikasp ZikaE protein (A9)
<400> 71
Met Gln Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile Ala Pro Ala
1 5 10 15
Tyr Ser Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly
20 25 30
Met Ser Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys
35 40 45
Val Thr Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val
50 55 60
Thr Thr Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu
65 70 75 80
Ala Ser Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly
85 90 95
Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg
100 105 110
Thr Leu Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys
115 120 125
Gly Ser Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr
130 135 140
Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser
145 150 155 160
Val His Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His
165 170 175
Glu Thr Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro
180 185 190
Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
195 200 205
Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met
210 215 220
Asn Asn Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro
225 230 235 240
Leu Pro Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn
245 250 255
Lys Glu Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr
260 265 270
Val Val Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala
275 280 285
Gly Ala Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser
290 295 300
Gly His Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly
305 310 315 320
Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro
325 330 335
Ala Glu Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly
340 345 350
Thr Asp Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln
355 360 365
Thr Leu Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr
370 375 380
Glu Ser Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe
385 390 395 400
Gly Asp Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His
405 410 415
His Trp His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr
420 425 430
Val Arg Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp
435 440 445
Phe Gly Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His
450 455 460
Gln Ile Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp
465 470 475 480
Phe Ser Gln Ile Leu Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn
485 490 495
Thr Lys Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu Gly Gly Val
500 505 510
Leu Ile Phe Leu Ser Thr Ala Val Ser Ala
515 520
<210> 72
<211> 1428
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of Zikasp ZikaE_no_Anchor protein (A10)
<400> 72
atgcaaaaag tcatatactt ggtcatgata ctgctgattg ccccggcata cagcatcagg 60
tgcataggag tcagcaatag ggactttgtg gaaggtatgt caggtgggac ttgggttgat 120
gttgtcttgg aacatggagg ttgtgtcacc gtaatggcac aggacaaacc gactgtcgac 180
atagagctgg ttacaacaac agtcagcaac atggcggagg taagatccta ctgctatgag 240
gcatcaatat cagacatggc ttcggacagc cgctgcccaa cacaaggtga agcctacctt 300
gacaagcaat cagacactca atatgtctgc aaaagaacgt tagtggacag aggctgggga 360
aatggatgtg gactttttgg caaagggagc ctggtgacat gcgctaagtt tgcatgctcc 420
aagaaaatga ccgggaagag catccagcca gagaatctgg agtaccggat aatgctgtca 480
gttcatggct cccagcacag tgggatgatc gttaatgaca caggacatga aactgatgag 540
aatagagcga aggttgagat aacgcccaat tcaccaagag ccgaagccac cctggggggt 600
tttggaagcc taggacttga ttgtgaaccg aggacaggcc ttgacttttc agatttgtat 660
tacttgacta tgaataacaa gcactggttg gttcacaagg agtggttcca cgacattcca 720
ttaccttggc acgctggggc agacaccgga actccacact ggaacaacaa agaagcactg 780
gtagagttca aggacgcaca tgccaaaagg caaactgtcg tggttctagg gagtcaagaa 840
ggagcagttc acacggccct tgctggagct ctggaggctg agatggatgg tgcaaaggga 900
aggctgtcct ctggccactt gaaatgtcgc ctgaaaatgg ataaacttag attgaagggc 960
gtgtcatact ccttgtgtac cgcagcgttc acattcacca agatcccggc tgaaacactg 1020
cacgggacag tcacagtgga ggtacagtac gcagggacag atggaccttg caaggttcca 1080
gctcagatgg cggtggacat gcaaactctg accccagttg ggaggttgat aaccgctaac 1140
cccgtaatca ctgaaagcac tgagaactct aagatgatgc tggaacttga tccaccattt 1200
ggggactctt acattgtcat aggagtcggg gagaagaaga tcacccacca ctggcacagg 1260
agtggcagca ccattggaaa agcatttgaa gccactgtga gaggtgccaa gagaatggca 1320
gtcttgggag acacagcctg ggactttgga tcagttggag gcgctctcaa ctcattgggc 1380
aagggcatcc atcaaatttt tggagcagct ttcaaatcat tgtgatga 1428
<210> 73
<211> 1428
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of Zikasp ZikaE_no_Anchor protein (A10)
<400> 73
atgcagaaag tgatctacct ggtcatgatc ctgctgatcg ctcctgccta ttctatccgg 60
tgcatcggcg tgagcaatag agacttcgtg gagggaatgt ccggaggaac ctgggtggat 120
gtggtgctgg agcacggcgg ctgcgtgaca gtgatggccc aggacaagcc aaccgtggat 180
atcgagctgg tgaccacaac cgtgtccaac atggccgagg tgaggtctta ctgctatgag 240
gccagcatct ccgacatggc ctctgatagc aggtgtccaa cccagggaga ggcatacctg 300
gacaagcagt ccgatacaca gtacgtgtgc aagcggaccc tggtggacag aggctggggc 360
aatggctgtg gcctgtttgg caagggctct ctggtgacat gcgccaagtt cgcctgtagc 420
aagaagatga ccggcaagtc catccagcca gagaacctgg agtaccggat catgctgtct 480
gtgcacggct cccagcactc tggcatgatc gtgaacgaca caggccacga gacagatgag 540
aatcgggcca aggtggagat cacacctaac tctccaagag ccgaggccac cctgggagga 600
tttggctctc tgggcctgga ctgcgagcct agaacaggcc tggacttctc cgatctgtac 660
tatctgacca tgaacaataa gcactggctg gtgcacaagg agtggtttca cgacatccca 720
ctgccatggc acgcaggagc agatacagga acaccacact ggaacaataa ggaggccctg 780
gtggagttca aggatgccca cgccaagcgg cagacagtgg tggtgctggg cagccaggag 840
ggagcagtgc acaccgccct ggcaggcgcc ctggaggcag agatggacgg agctaagggc 900
agactgtcta gcggccacct gaagtgcagg ctgaagatgg ataagctgcg cctgaagggc 960
gtgtcctact ctctgtgcac agccgccttc accttcacca agatccctgc cgagacactg 1020
cacggcacag tgaccgtgga ggtgcagtat gccggcacag acggaccctg taaggtgcct 1080
gcccagatgg ccgtggatat gcagacactg acacctgtgg gcaggctgat caccgccaat 1140
ccagtgatca cagagtctac cgagaacagc aagatgatgc tggagctgga cccaccattt 1200
ggcgatagct atatcgtgat cggcgtgggc gagaagaaga tcacacacca ctggcaccgc 1260
agcggctcca caatcggcaa ggcctttgag gcaaccgtgc gcggagcaaa gagaatggcc 1320
gtgctgggcg acaccgcatg ggatttcgga tctgtgggag gcgccctgaa cagcctgggc 1380
aagggcatcc accagatctt cggcgccgcc tttaagtccc tgtgatga 1428
<210> 74
<211> 474
<212> PRT
<213> Artificial sequence
<220>
<223> Zikasp ZikaE_no_Anchor protein (A10)
<400> 74
Met Gln Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile Ala Pro Ala
1 5 10 15
Tyr Ser Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly
20 25 30
Met Ser Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys
35 40 45
Val Thr Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val
50 55 60
Thr Thr Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu
65 70 75 80
Ala Ser Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly
85 90 95
Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg
100 105 110
Thr Leu Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys
115 120 125
Gly Ser Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr
130 135 140
Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser
145 150 155 160
Val His Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His
165 170 175
Glu Thr Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro
180 185 190
Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
195 200 205
Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met
210 215 220
Asn Asn Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro
225 230 235 240
Leu Pro Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn
245 250 255
Lys Glu Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr
260 265 270
Val Val Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala
275 280 285
Gly Ala Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser
290 295 300
Gly His Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly
305 310 315 320
Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro
325 330 335
Ala Glu Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly
340 345 350
Thr Asp Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln
355 360 365
Thr Leu Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr
370 375 380
Glu Ser Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe
385 390 395 400
Gly Asp Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His
405 410 415
His Trp His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr
420 425 430
Val Arg Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp
435 440 445
Phe Gly Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His
450 455 460
Gln Ile Phe Gly Ala Ala Phe Lys Ser Leu
465 470
<210> 75
<211> 1392
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of Zikasp ZikaE411 protein (A11)
<400> 75
atgcaaaaag tcatatactt ggtcatgata ctgctgattg ccccggcata cagcatcagg 60
tgcataggag tcagcaatag ggactttgtg gaaggtatgt caggtgggac ttgggttgat 120
gttgtcttgg aacatggagg ttgtgtcacc gtaatggcac aggacaaacc gactgtcgac 180
atagagctgg ttacaacaac agtcagcaac atggcggagg taagatccta ctgctatgag 240
gcatcaatat cagacatggc ttcggacagc cgctgcccaa cacaaggtga agcctacctt 300
gacaagcaat cagacactca atatgtctgc aaaagaacgt tagtggacag aggctgggga 360
aatggatgtg gactttttgg caaagggagc ctggtgacat gcgctaagtt tgcatgctcc 420
aagaaaatga ccgggaagag catccagcca gagaatctgg agtaccggat aatgctgtca 480
gttcatggct cccagcacag tgggatgatc gttaatgaca caggacatga aactgatgag 540
aatagagcga aggttgagat aacgcccaat tcaccaagag ccgaagccac cctggggggt 600
tttggaagcc taggacttga ttgtgaaccg aggacaggcc ttgacttttc agatttgtat 660
tacttgacta tgaataacaa gcactggttg gttcacaagg agtggttcca cgacattcca 720
ttaccttggc acgctggggc agacaccgga actccacact ggaacaacaa agaagcactg 780
gtagagttca aggacgcaca tgccaaaagg caaactgtcg tggttctagg gagtcaagaa 840
ggagcagttc acacggccct tgctggagct ctggaggctg agatggatgg tgcaaaggga 900
aggctgtcct ctggccactt gaaatgtcgc ctgaaaatgg ataaacttag attgaagggc 960
gtgtcatact ccttgtgtac cgcagcgttc acattcacca agatcccggc tgaaacactg 1020
cacgggacag tcacagtgga ggtacagtac gcagggacag atggaccttg caaggttcca 1080
gctcagatgg cggtggacat gcaaactctg accccagttg ggaggttgat aaccgctaac 1140
cccgtaatca ctgaaagcac tgagaactct aagatgatgc tggaacttga tccaccattt 1200
ggggactctt acattgtcat aggagtcggg gagaagaaga tcacccacca ctggcacagg 1260
agtggcagca ccattggaaa agcatttgaa gccactgtga gaggtgccaa gagaatggca 1320
gtcttgggag acacagcctg ggactttgga tcagttggag gcgctctcaa ctcattgggc 1380
aagggcatct ga 1392
<210> 76
<211> 1392
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of Zikasp ZikaE411 protein (A11)
<400> 76
atgcagaaag tgatctacct ggtcatgatc ctgctgatcg ctcctgccta ttctatccgg 60
tgcatcggcg tgagcaatag agacttcgtg gagggaatgt ccggaggaac ctgggtggat 120
gtggtgctgg agcacggcgg ctgcgtgaca gtgatggccc aggacaagcc aaccgtggat 180
atcgagctgg tgaccacaac cgtgtccaac atggccgagg tgaggtctta ctgctatgag 240
gccagcatct ccgacatggc ctctgatagc aggtgtccaa cccagggaga ggcatacctg 300
gacaagcagt ccgatacaca gtacgtgtgc aagcggaccc tggtggacag aggctggggc 360
aatggctgtg gcctgtttgg caagggctct ctggtgacat gcgccaagtt cgcctgtagc 420
aagaagatga ccggcaagtc catccagcca gagaacctgg agtaccggat catgctgtct 480
gtgcacggct cccagcactc tggcatgatc gtgaacgaca caggccacga gacagatgag 540
aatcgggcca aggtggagat cacacctaac tctccaagag ccgaggccac cctgggagga 600
tttggctctc tgggcctgga ctgcgagcct agaacaggcc tggacttctc cgatctgtac 660
tatctgacca tgaacaataa gcactggctg gtgcacaagg agtggtttca cgacatccca 720
ctgccatggc acgcaggagc agatacagga acaccacact ggaacaataa ggaggccctg 780
gtggagttca aggatgccca cgccaagcgg cagacagtgg tggtgctggg cagccaggag 840
ggagcagtgc acaccgccct ggcaggcgcc ctggaggcag agatggacgg agctaagggc 900
agactgtcta gcggccacct gaagtgcagg ctgaagatgg ataagctgcg cctgaagggc 960
gtgtcctact ctctgtgcac agccgccttc accttcacca agatccctgc cgagacactg 1020
cacggcacag tgaccgtgga ggtgcagtat gccggcacag acggaccctg taaggtgcct 1080
gcccagatgg ccgtggatat gcagacactg acacctgtgg gcaggctgat caccgccaat 1140
ccagtgatca cagagtctac cgagaacagc aagatgatgc tggagctgga cccaccattt 1200
ggcgatagct atatcgtgat cggcgtgggc gagaagaaga tcacacacca ctggcaccgc 1260
agcggctcca caatcggcaa ggcctttgag gcaaccgtgc gcggagcaaa gagaatggcc 1320
gtgctgggcg acaccgcatg ggatttcgga tctgtgggag gcgccctgaa cagcctgggc 1380
aagggcatct ga 1392
<210> 77
<211> 463
<212> PRT
<213> Artificial sequence
<220>
<223> Zikasp ZikaE411 protein (A11)
<400> 77
Met Gln Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile Ala Pro Ala
1 5 10 15
Tyr Ser Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly
20 25 30
Met Ser Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys
35 40 45
Val Thr Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val
50 55 60
Thr Thr Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu
65 70 75 80
Ala Ser Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly
85 90 95
Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg
100 105 110
Thr Leu Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys
115 120 125
Gly Ser Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr
130 135 140
Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser
145 150 155 160
Val His Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His
165 170 175
Glu Thr Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro
180 185 190
Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
195 200 205
Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met
210 215 220
Asn Asn Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro
225 230 235 240
Leu Pro Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn
245 250 255
Lys Glu Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr
260 265 270
Val Val Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala
275 280 285
Gly Ala Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser
290 295 300
Gly His Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly
305 310 315 320
Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro
325 330 335
Ala Glu Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly
340 345 350
Thr Asp Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln
355 360 365
Thr Leu Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr
370 375 380
Glu Ser Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe
385 390 395 400
Gly Asp Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His
405 410 415
His Trp His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr
420 425 430
Val Arg Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp
435 440 445
Phe Gly Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile
450 455 460
<210> 78
<211> 1266
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of Zikasp ZikaE395 protein (A12)
<400> 78
atgcaaaaag tcatatactt ggtcatgata ctgctgattg ccccggcata cagcatcagg 60
tgcataggag tcagcaatag ggactttgtg gaaggtatgt caggtgggac ttgggttgat 120
gttgtcttgg aacatggagg ttgtgtcacc gtaatggcac aggacaaacc gactgtcgac 180
atagagctgg ttacaacaac agtcagcaac atggcggagg taagatccta ctgctatgag 240
gcatcaatat cagacatggc ttcggacagc cgctgcccaa cacaaggtga agcctacctt 300
gacaagcaat cagacactca atatgtctgc aaaagaacgt tagtggacag aggctgggga 360
aatggatgtg gactttttgg caaagggagc ctggtgacat gcgctaagtt tgcatgctcc 420
aagaaaatga ccgggaagag catccagcca gagaatctgg agtaccggat aatgctgtca 480
gttcatggct cccagcacag tgggatgatc gttaatgaca caggacatga aactgatgag 540
aatagagcga aggttgagat aacgcccaat tcaccaagag ccgaagccac cctggggggt 600
tttggaagcc taggacttga ttgtgaaccg aggacaggcc ttgacttttc agatttgtat 660
tacttgacta tgaataacaa gcactggttg gttcacaagg agtggttcca cgacattcca 720
ttaccttggc acgctggggc agacaccgga actccacact ggaacaacaa agaagcactg 780
gtagagttca aggacgcaca tgccaaaagg caaactgtcg tggttctagg gagtcaagaa 840
ggagcagttc acacggccct tgctggagct ctggaggctg agatggatgg tgcaaaggga 900
aggctgtcct ctggccactt gaaatgtcgc ctgaaaatgg ataaacttag attgaagggc 960
gtgtcatact ccttgtgtac cgcagcgttc acattcacca agatcccggc tgaaacactg 1020
cacgggacag tcacagtgga ggtacagtac gcagggacag atggaccttg caaggttcca 1080
gctcagatgg cggtggacat gcaaactctg accccagttg ggaggttgat aaccgctaac 1140
cccgtaatca ctgaaagcac tgagaactct aagatgatgc tggaacttga tccaccattt 1200
ggggactctt acattgtcat aggagtcggg gagaagaaga tcacccacca ctggcacagg 1260
agtggc 1266
<210> 79
<211> 1272
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of Zikasp ZikaE395 protein (A12)
<400> 79
atgcagaaag tgatctacct ggtcatgatc ctgctgatcg ctcctgccta ttctatccgg 60
tgcatcggcg tgagcaatag agacttcgtg gagggaatgt ccggaggaac ctgggtggat 120
gtggtgctgg agcacggcgg ctgcgtgaca gtgatggccc aggacaagcc aaccgtggat 180
atcgagctgg tgaccacaac cgtgtccaac atggccgagg tgaggtctta ctgctatgag 240
gccagcatct ccgacatggc ctctgatagc aggtgtccaa cccagggaga ggcatacctg 300
gacaagcagt ccgatacaca gtacgtgtgc aagcggaccc tggtggacag aggctggggc 360
aatggctgtg gcctgtttgg caagggctct ctggtgacat gcgccaagtt cgcctgtagc 420
aagaagatga ccggcaagtc catccagcca gagaacctgg agtaccggat catgctgtct 480
gtgcacggct cccagcactc tggcatgatc gtgaacgaca caggccacga gacagatgag 540
aatcgggcca aggtggagat cacacctaac tctccaagag ccgaggccac cctgggagga 600
tttggctctc tgggcctgga ctgcgagcct agaacaggcc tggacttctc cgatctgtac 660
tatctgacca tgaacaataa gcactggctg gtgcacaagg agtggtttca cgacatccca 720
ctgccatggc acgcaggagc agatacagga acaccacact ggaacaataa ggaggccctg 780
gtggagttca aggatgccca cgccaagcgg cagacagtgg tggtgctggg cagccaggag 840
ggagcagtgc acaccgccct ggcaggcgcc ctggaggcag agatggacgg agctaagggc 900
agactgtcta gcggccacct gaagtgcagg ctgaagatgg ataagctgcg cctgaagggc 960
gtgtcctact ctctgtgcac agccgccttc accttcacca agatccctgc cgagacactg 1020
cacggcacag tgaccgtgga ggtgcagtat gccggcacag acggaccctg taaggtgcct 1080
gcccagatgg ccgtggatat gcagacactg acacctgtgg gcaggctgat caccgccaat 1140
ccagtgatca cagagtctac cgagaacagc aagatgatgc tggagctgga cccaccattt 1200
ggcgatagct atatcgtgat cggcgtgggc gagaagaaga tcacacacca ctggcaccgc 1260
agcggctgat ga 1272
<210> 80
<211> 422
<212> PRT
<213> Artificial sequence
<220>
<223> Zikasp Zikas 395 protein (A12)
<400> 80
Met Gln Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile Ala Pro Ala
1 5 10 15
Tyr Ser Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly
20 25 30
Met Ser Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys
35 40 45
Val Thr Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val
50 55 60
Thr Thr Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu
65 70 75 80
Ala Ser Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly
85 90 95
Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg
100 105 110
Thr Leu Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys
115 120 125
Gly Ser Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr
130 135 140
Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser
145 150 155 160
Val His Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His
165 170 175
Glu Thr Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro
180 185 190
Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
195 200 205
Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met
210 215 220
Asn Asn Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro
225 230 235 240
Leu Pro Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn
245 250 255
Lys Glu Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr
260 265 270
Val Val Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala
275 280 285
Gly Ala Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser
290 295 300
Gly His Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly
305 310 315 320
Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro
325 330 335
Ala Glu Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly
340 345 350
Thr Asp Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln
355 360 365
Thr Leu Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr
370 375 380
Glu Ser Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe
385 390 395 400
Gly Asp Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His
405 410 415
His Trp His Arg Ser Gly
420
<210> 81
<211> 2094
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of JEVsp_ZikaprME protein (B1)
<400> 81
atgggcaaac gatcagccgg ctcaatcatg tggctcgcga gcttggcagt tgtcatagct 60
tgtgcaggag ccgcggaggt cactagacgt gggagtgcat actatatgta cttggacaga 120
aacgatgctg gggaggccat atcttttcca accacattgg ggatgaataa gtgttatata 180
cagatcatgg atcttggaca catgtgtgat gccaccatga gctatgaatg ccctatgctg 240
gatgaggggg tggaaccaga tgacgtcgat tgttggtgca acacgacgtc aacttgggtt 300
gtgtacggaa cctgccatca caaaaaaggt gaagcacgga gatctagaag agctgtgacg 360
ctcccctccc attccactag gaagctgcaa acgcggtcgc aaacctggtt ggaatcaaga 420
gaatacacaa agcacttgat tagagtcgaa aattggatat tcaggaaccc tggcttcgcg 480
ttagcagcag ctgccatcgc ttggcttttg ggaagctcaa cgagccaaaa agtcatatac 540
ttggtcatga tactgctgat tgccccggca tacagcatca ggtgcatagg agtcagcaat 600
agggactttg tggaaggtat gtcaggtggg acttgggttg atgttgtctt ggaacatgga 660
ggttgtgtca ccgtaatggc acaggacaaa ccgactgtcg acatagagct ggttacaaca 720
acagtcagca acatggcgga ggtaagatcc tactgctatg aggcatcaat atcagacatg 780
gcttcggaca gccgctgccc aacacaaggt gaagcctacc ttgacaagca atcagacact 840
caatatgtct gcaaaagaac gttagtggac agaggctggg gaaatggatg tggacttttt 900
ggcaaaggga gcctggtgac atgcgctaag tttgcatgct ccaagaaaat gaccgggaag 960
agcatccagc cagagaatct ggagtaccgg ataatgctgt cagttcatgg ctcccagcac 1020
agtgggatga tcgttaatga cacaggacat gaaactgatg agaatagagc gaaggttgag 1080
ataacgccca attcaccaag agccgaagcc accctggggg gttttggaag cctaggactt 1140
gattgtgaac cgaggacagg ccttgacttt tcagatttgt attacttgac tatgaataac 1200
aagcactggt tggttcacaa ggagtggttc cacgacattc cattaccttg gcacgctggg 1260
gcagacaccg gaactccaca ctggaacaac aaagaagcac tggtagagtt caaggacgca 1320
catgccaaaa ggcaaactgt cgtggttcta gggagtcaag aaggagcagt tcacacggcc 1380
cttgctggag ctctggaggc tgagatggat ggtgcaaagg gaaggctgtc ctctggccac 1440
ttgaaatgtc gcctgaaaat ggataaactt agattgaagg gcgtgtcata ctccttgtgt 1500
accgcagcgt tcacattcac caagatcccg gctgaaacac tgcacgggac agtcacagtg 1560
gaggtacagt acgcagggac agatggacct tgcaaggttc cagctcagat ggcggtggac 1620
atgcaaactc tgaccccagt tgggaggttg ataaccgcta accccgtaat cactgaaagc 1680
actgagaact ctaagatgat gctggaactt gatccaccat ttggggactc ttacattgtc 1740
ataggagtcg gggagaagaa gatcacccac cactggcaca ggagtggcag caccattgga 1800
aaagcatttg aagccactgt gagaggtgcc aagagaatgg cagtcttggg agacacagcc 1860
tgggactttg gatcagttgg aggcgctctc aactcattgg gcaagggcat ccatcaaatt 1920
tttggagcag ctttcaaatc attgtttgga ggaatgtcct ggttctcaca aattctcatt 1980
ggaacgttgc tgatgtggtt gggtctgaac acaaagaatg gatctatttc ccttatgtgc 2040
ttggccttag ggggagtgtt gatcttctta tccacagccg tctctgcttg atga 2094
<210> 82
<211> 2094
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of JEVsp_ZikaprME protein (B1)
<400> 82
atgggcaaga ggtccgcagg gagcattatg tggctggcat ctctggcagt cgtcatcgct 60
tgtgcaggag cagcagaggt gaccaggaga ggaagcgcct actatatgta cctggacagg 120
aatgatgccg gcgaggccat ctccttccca accacactgg gcatgaacaa gtgctacatc 180
cagatcatgg acctgggcca catgtgcgat gccaccatgt cctatgagtg tccaatgctg 240
gacgagggcg tggagcccga cgatgtggat tgctggtgta ataccacatc tacatgggtg 300
gtgtacggca cctgtcacca caagaaggga gaggcccggc ggagccggcg ggccgtgaca 360
ctgccttccc actctaccag gaagctgcag acacgcagcc agacctggct ggagtccaga 420
gagtatacca agcacctgat cagggtggag aactggatct ttcgcaatcc aggattcgca 480
ctggcagcag cagcaatcgc atggctgctg ggaagctcca ccagccagaa agtgatctac 540
ctggtcatga tcctgctgat cgctcctgcc tattctatcc ggtgcatcgg cgtgagcaat 600
agagacttcg tggagggaat gtccggagga acctgggtgg atgtggtgct ggagcacggc 660
ggctgcgtga cagtgatggc ccaggacaag ccaaccgtgg atatcgagct ggtgaccaca 720
accgtgtcca acatggccga ggtgaggtct tactgctatg aggccagcat ctccgacatg 780
gcctctgata gcaggtgtcc aacccaggga gaggcatacc tggacaagca gtccgataca 840
cagtacgtgt gcaagcggac cctggtggac agaggctggg gcaatggctg tggcctgttt 900
ggcaagggct ctctggtgac atgcgccaag ttcgcctgta gcaagaagat gaccggcaag 960
tccatccagc cagagaacct ggagtaccgg atcatgctgt ctgtgcacgg ctcccagcac 1020
tctggcatga tcgtgaacga cacaggccac gagacagatg agaatcgggc caaggtggag 1080
atcacaccta actctccaag agccgaggcc accctgggag gatttggctc tctgggcctg 1140
gactgcgagc ctagaacagg cctggacttc tccgatctgt actatctgac catgaacaat 1200
aagcactggc tggtgcacaa ggagtggttt cacgacatcc cactgccatg gcacgcagga 1260
gcagatacag gaacaccaca ctggaacaat aaggaggccc tggtggagtt caaggatgcc 1320
cacgccaagc ggcagacagt ggtggtgctg ggcagccagg agggagcagt gcacaccgcc 1380
ctggcaggcg ccctggaggc agagatggac ggagctaagg gcagactgtc tagcggccac 1440
ctgaagtgca ggctgaagat ggataagctg cgcctgaagg gcgtgtccta ctctctgtgc 1500
acagccgcct tcaccttcac caagatccct gccgagacac tgcacggcac agtgaccgtg 1560
gaggtgcagt atgccggcac agacggaccc tgtaaggtgc ctgcccagat ggccgtggat 1620
atgcagacac tgacacctgt gggcaggctg atcaccgcca atccagtgat cacagagtct 1680
accgagaaca gcaagatgat gctggagctg gacccaccat ttggcgatag ctatatcgtg 1740
atcggcgtgg gcgagaagaa gatcacacac cactggcacc gcagcggctc cacaatcggc 1800
aaggcctttg aggcaaccgt gcgcggagca aagagaatgg ccgtgctggg cgacaccgca 1860
tgggatttcg gatctgtggg aggcgccctg aacagcctgg gcaagggcat ccaccagatc 1920
ttcggcgccg cctttaagtc cctgttcggc ggcatgagct ggttctcaca gatcctgatc 1980
ggcacactgc tgatgtggct gggcctgaac accaagaatg gctctatcag cctgatgtgc 2040
ctggccctgg gaggcgtgct gatcttcctg tccaccgccg tgtctgcctg atga 2094
<210> 83
<211> 696
<212> PRT
<213> Artificial sequence
<220>
<223> JEVsp_ZikaprME protein (B1)
<400> 83
Met Gly Lys Arg Ser Ala Gly Ser Ile Met Trp Leu Ala Ser Leu Ala
1 5 10 15
Val Val Ile Ala Cys Ala Gly Ala Ala Glu Val Thr Arg Arg Gly Ser
20 25 30
Ala Tyr Tyr Met Tyr Leu Asp Arg Asn Asp Ala Gly Glu Ala Ile Ser
35 40 45
Phe Pro Thr Thr Leu Gly Met Asn Lys Cys Tyr Ile Gln Ile Met Asp
50 55 60
Leu Gly His Met Cys Asp Ala Thr Met Ser Tyr Glu Cys Pro Met Leu
65 70 75 80
Asp Glu Gly Val Glu Pro Asp Asp Val Asp Cys Trp Cys Asn Thr Thr
85 90 95
Ser Thr Trp Val Val Tyr Gly Thr Cys His His Lys Lys Gly Glu Ala
100 105 110
Arg Arg Ser Arg Arg Ala Val Thr Leu Pro Ser His Ser Thr Arg Lys
115 120 125
Leu Gln Thr Arg Ser Gln Thr Trp Leu Glu Ser Arg Glu Tyr Thr Lys
130 135 140
His Leu Ile Arg Val Glu Asn Trp Ile Phe Arg Asn Pro Gly Phe Ala
145 150 155 160
Leu Ala Ala Ala Ala Ile Ala Trp Leu Leu Gly Ser Ser Thr Ser Gln
165 170 175
Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile Ala Pro Ala Tyr Ser
180 185 190
Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser
195 200 205
Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val Thr
210 215 220
Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr
225 230 235 240
Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser
245 250 255
Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala
260 265 270
Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
275 280 285
Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser
290 295 300
Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys
305 310 315 320
Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
325 330 335
Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr
340 345 350
Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
355 360 365
Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro
370 375 380
Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn
385 390 395 400
Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
405 410 415
Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu
420 425 430
Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
435 440 445
Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala
450 455 460
Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His
465 470 475 480
Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
485 490 495
Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu
500 505 510
Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
515 520 525
Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu
530 535 540
Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser
545 550 555 560
Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
565 570 575
Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp
580 585 590
His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
595 600 605
Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly
610 615 620
Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile
625 630 635 640
Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser
645 650 655
Gln Ile Leu Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn Thr Lys
660 665 670
Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile
675 680 685
Phe Leu Ser Thr Ala Val Ser Ala
690 695
<210> 84
<211> 1950
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of JEVsp_Zika_prME_no_Anchor protein (B2)
<400> 84
atgggcaaac gatcagccgg ctcaatcatg tggctcgcga gcttggcagt tgtcatagct 60
tgtgcaggag ccgcggaggt cactagacgt gggagtgcat actatatgta cttggacaga 120
aacgatgctg gggaggccat atcttttcca accacattgg ggatgaataa gtgttatata 180
cagatcatgg atcttggaca catgtgtgat gccaccatga gctatgaatg ccctatgctg 240
gatgaggggg tggaaccaga tgacgtcgat tgttggtgca acacgacgtc aacttgggtt 300
gtgtacggaa cctgccatca caaaaaaggt gaagcacgga gatctagaag agctgtgacg 360
ctcccctccc attccactag gaagctgcaa acgcggtcgc aaacctggtt ggaatcaaga 420
gaatacacaa agcacttgat tagagtcgaa aattggatat tcaggaaccc tggcttcgcg 480
ttagcagcag ctgccatcgc ttggcttttg ggaagctcaa cgagccaaaa agtcatatac 540
ttggtcatga tactgctgat tgccccggca tacagcatca ggtgcatagg agtcagcaat 600
agggactttg tggaaggtat gtcaggtggg acttgggttg atgttgtctt ggaacatgga 660
ggttgtgtca ccgtaatggc acaggacaaa ccgactgtcg acatagagct ggttacaaca 720
acagtcagca acatggcgga ggtaagatcc tactgctatg aggcatcaat atcagacatg 780
gcttcggaca gccgctgccc aacacaaggt gaagcctacc ttgacaagca atcagacact 840
caatatgtct gcaaaagaac gttagtggac agaggctggg gaaatggatg tggacttttt 900
ggcaaaggga gcctggtgac atgcgctaag tttgcatgct ccaagaaaat gaccgggaag 960
agcatccagc cagagaatct ggagtaccgg ataatgctgt cagttcatgg ctcccagcac 1020
agtgggatga tcgttaatga cacaggacat gaaactgatg agaatagagc gaaggttgag 1080
ataacgccca attcaccaag agccgaagcc accctggggg gttttggaag cctaggactt 1140
gattgtgaac cgaggacagg ccttgacttt tcagatttgt attacttgac tatgaataac 1200
aagcactggt tggttcacaa ggagtggttc cacgacattc cattaccttg gcacgctggg 1260
gcagacaccg gaactccaca ctggaacaac aaagaagcac tggtagagtt caaggacgca 1320
catgccaaaa ggcaaactgt cgtggttcta gggagtcaag aaggagcagt tcacacggcc 1380
cttgctggag ctctggaggc tgagatggat ggtgcaaagg gaaggctgtc ctctggccac 1440
ttgaaatgtc gcctgaaaat ggataaactt agattgaagg gcgtgtcata ctccttgtgt 1500
accgcagcgt tcacattcac caagatcccg gctgaaacac tgcacgggac agtcacagtg 1560
gaggtacagt acgcagggac agatggacct tgcaaggttc cagctcagat ggcggtggac 1620
atgcaaactc tgaccccagt tgggaggttg ataaccgcta accccgtaat cactgaaagc 1680
actgagaact ctaagatgat gctggaactt gatccaccat ttggggactc ttacattgtc 1740
ataggagtcg gggagaagaa gatcacccac cactggcaca ggagtggcag caccattgga 1800
aaagcatttg aagccactgt gagaggtgcc aagagaatgg cagtcttggg agacacagcc 1860
tgggactttg gatcagttgg aggcgctctc aactcattgg gcaagggcat ccatcaaatt 1920
tttggagcag ctttcaaatc attgtgatga 1950
<210> 85
<211> 1950
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of JEVsp_Zika_prME_no_Anchor protein (B2)
<400> 85
atgggcaaga ggtccgcagg gagcattatg tggctggcat ctctggcagt cgtcatcgct 60
tgtgcaggag cagcagaggt gaccaggaga ggaagcgcct actatatgta cctggacagg 120
aatgatgccg gcgaggccat ctccttccca accacactgg gcatgaacaa gtgctacatc 180
cagatcatgg acctgggcca catgtgcgat gccaccatgt cctatgagtg tccaatgctg 240
gacgagggcg tggagcccga cgatgtggat tgctggtgta ataccacatc tacatgggtg 300
gtgtacggca cctgtcacca caagaaggga gaggcccggc ggagccggcg ggccgtgaca 360
ctgccttccc actctaccag gaagctgcag acacgcagcc agacctggct ggagtccaga 420
gagtatacca agcacctgat cagggtggag aactggatct ttcgcaatcc aggattcgca 480
ctggcagcag cagcaatcgc atggctgctg ggaagctcca ccagccagaa agtgatctac 540
ctggtcatga tcctgctgat cgctcctgcc tattctatcc ggtgcatcgg cgtgagcaat 600
agagacttcg tggagggaat gtccggagga acctgggtgg atgtggtgct ggagcacggc 660
ggctgcgtga cagtgatggc ccaggacaag ccaaccgtgg atatcgagct ggtgaccaca 720
accgtgtcca acatggccga ggtgaggtct tactgctatg aggccagcat ctccgacatg 780
gcctctgata gcaggtgtcc aacccaggga gaggcatacc tggacaagca gtccgataca 840
cagtacgtgt gcaagcggac cctggtggac agaggctggg gcaatggctg tggcctgttt 900
ggcaagggct ctctggtgac atgcgccaag ttcgcctgta gcaagaagat gaccggcaag 960
tccatccagc cagagaacct ggagtaccgg atcatgctgt ctgtgcacgg ctcccagcac 1020
tctggcatga tcgtgaacga cacaggccac gagacagatg agaatcgggc caaggtggag 1080
atcacaccta actctccaag agccgaggcc accctgggag gatttggctc tctgggcctg 1140
gactgcgagc ctagaacagg cctggacttc tccgatctgt actatctgac catgaacaat 1200
aagcactggc tggtgcacaa ggagtggttt cacgacatcc cactgccatg gcacgcagga 1260
gcagatacag gaacaccaca ctggaacaat aaggaggccc tggtggagtt caaggatgcc 1320
cacgccaagc ggcagacagt ggtggtgctg ggcagccagg agggagcagt gcacaccgcc 1380
ctggcaggcg ccctggaggc agagatggac ggagctaagg gcagactgtc tagcggccac 1440
ctgaagtgca ggctgaagat ggataagctg cgcctgaagg gcgtgtccta ctctctgtgc 1500
acagccgcct tcaccttcac caagatccct gccgagacac tgcacggcac agtgaccgtg 1560
gaggtgcagt atgccggcac agacggaccc tgtaaggtgc ctgcccagat ggccgtggat 1620
atgcagacac tgacacctgt gggcaggctg atcaccgcca atccagtgat cacagagtct 1680
accgagaaca gcaagatgat gctggagctg gacccaccat ttggcgatag ctatatcgtg 1740
atcggcgtgg gcgagaagaa gatcacacac cactggcacc gcagcggctc cacaatcggc 1800
aaggcctttg aggcaaccgt gcgcggagca aagagaatgg ccgtgctggg cgacaccgca 1860
tgggatttcg gatctgtggg aggcgccctg aacagcctgg gcaagggcat ccaccagatc 1920
ttcggcgccg cctttaagtc cctgtgatga 1950
<210> 86
<211> 648
<212> PRT
<213> Artificial sequence
<220>
<223> JEVsp_Zika_prME_no_Anchor protein (B2)
<400> 86
Met Gly Lys Arg Ser Ala Gly Ser Ile Met Trp Leu Ala Ser Leu Ala
1 5 10 15
Val Val Ile Ala Cys Ala Gly Ala Ala Glu Val Thr Arg Arg Gly Ser
20 25 30
Ala Tyr Tyr Met Tyr Leu Asp Arg Asn Asp Ala Gly Glu Ala Ile Ser
35 40 45
Phe Pro Thr Thr Leu Gly Met Asn Lys Cys Tyr Ile Gln Ile Met Asp
50 55 60
Leu Gly His Met Cys Asp Ala Thr Met Ser Tyr Glu Cys Pro Met Leu
65 70 75 80
Asp Glu Gly Val Glu Pro Asp Asp Val Asp Cys Trp Cys Asn Thr Thr
85 90 95
Ser Thr Trp Val Val Tyr Gly Thr Cys His His Lys Lys Gly Glu Ala
100 105 110
Arg Arg Ser Arg Arg Ala Val Thr Leu Pro Ser His Ser Thr Arg Lys
115 120 125
Leu Gln Thr Arg Ser Gln Thr Trp Leu Glu Ser Arg Glu Tyr Thr Lys
130 135 140
His Leu Ile Arg Val Glu Asn Trp Ile Phe Arg Asn Pro Gly Phe Ala
145 150 155 160
Leu Ala Ala Ala Ala Ile Ala Trp Leu Leu Gly Ser Ser Thr Ser Gln
165 170 175
Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile Ala Pro Ala Tyr Ser
180 185 190
Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser
195 200 205
Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val Thr
210 215 220
Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr
225 230 235 240
Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser
245 250 255
Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala
260 265 270
Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
275 280 285
Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser
290 295 300
Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys
305 310 315 320
Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
325 330 335
Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr
340 345 350
Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
355 360 365
Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro
370 375 380
Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn
385 390 395 400
Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
405 410 415
Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu
420 425 430
Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
435 440 445
Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala
450 455 460
Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His
465 470 475 480
Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
485 490 495
Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu
500 505 510
Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
515 520 525
Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu
530 535 540
Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser
545 550 555 560
Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
565 570 575
Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp
580 585 590
His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
595 600 605
Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly
610 615 620
Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile
625 630 635 640
Phe Gly Ala Ala Phe Lys Ser Leu
645
<210> 87
<211> 1914
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of JEVsp_Zika_prME411 protein (B3)
<400> 87
atgggcaaac gatcagccgg ctcaatcatg tggctcgcga gcttggcagt tgtcatagct 60
tgtgcaggag ccgcggaggt cactagacgt gggagtgcat actatatgta cttggacaga 120
aacgatgctg gggaggccat atcttttcca accacattgg ggatgaataa gtgttatata 180
cagatcatgg atcttggaca catgtgtgat gccaccatga gctatgaatg ccctatgctg 240
gatgaggggg tggaaccaga tgacgtcgat tgttggtgca acacgacgtc aacttgggtt 300
gtgtacggaa cctgccatca caaaaaaggt gaagcacgga gatctagaag agctgtgacg 360
ctcccctccc attccactag gaagctgcaa acgcggtcgc aaacctggtt ggaatcaaga 420
gaatacacaa agcacttgat tagagtcgaa aattggatat tcaggaaccc tggcttcgcg 480
ttagcagcag ctgccatcgc ttggcttttg ggaagctcaa cgagccaaaa agtcatatac 540
ttggtcatga tactgctgat tgccccggca tacagcatca ggtgcatagg agtcagcaat 600
agggactttg tggaaggtat gtcaggtggg acttgggttg atgttgtctt ggaacatgga 660
ggttgtgtca ccgtaatggc acaggacaaa ccgactgtcg acatagagct ggttacaaca 720
acagtcagca acatggcgga ggtaagatcc tactgctatg aggcatcaat atcagacatg 780
gcttcggaca gccgctgccc aacacaaggt gaagcctacc ttgacaagca atcagacact 840
caatatgtct gcaaaagaac gttagtggac agaggctggg gaaatggatg tggacttttt 900
ggcaaaggga gcctggtgac atgcgctaag tttgcatgct ccaagaaaat gaccgggaag 960
agcatccagc cagagaatct ggagtaccgg ataatgctgt cagttcatgg ctcccagcac 1020
agtgggatga tcgttaatga cacaggacat gaaactgatg agaatagagc gaaggttgag 1080
ataacgccca attcaccaag agccgaagcc accctggggg gttttggaag cctaggactt 1140
gattgtgaac cgaggacagg ccttgacttt tcagatttgt attacttgac tatgaataac 1200
aagcactggt tggttcacaa ggagtggttc cacgacattc cattaccttg gcacgctggg 1260
gcagacaccg gaactccaca ctggaacaac aaagaagcac tggtagagtt caaggacgca 1320
catgccaaaa ggcaaactgt cgtggttcta gggagtcaag aaggagcagt tcacacggcc 1380
cttgctggag ctctggaggc tgagatggat ggtgcaaagg gaaggctgtc ctctggccac 1440
ttgaaatgtc gcctgaaaat ggataaactt agattgaagg gcgtgtcata ctccttgtgt 1500
accgcagcgt tcacattcac caagatcccg gctgaaacac tgcacgggac agtcacagtg 1560
gaggtacagt acgcagggac agatggacct tgcaaggttc cagctcagat ggcggtggac 1620
atgcaaactc tgaccccagt tgggaggttg ataaccgcta accccgtaat cactgaaagc 1680
actgagaact ctaagatgat gctggaactt gatccaccat ttggggactc ttacattgtc 1740
ataggagtcg gggagaagaa gatcacccac cactggcaca ggagtggcag caccattgga 1800
aaagcatttg aagccactgt gagaggtgcc aagagaatgg cagtcttggg agacacagcc 1860
tgggactttg gatcagttgg aggcgctctc aactcattgg gcaagggcat ctga 1914
<210> 88
<211> 1914
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of JEVsp_Zika_prME411 protein (B3)
<400> 88
atgggcaaga ggtccgcagg gagcattatg tggctggcat ctctggcagt cgtcatcgct 60
tgtgcaggag cagcagaggt gaccaggaga ggaagcgcct actatatgta cctggacagg 120
aatgatgccg gcgaggccat ctccttccca accacactgg gcatgaacaa gtgctacatc 180
cagatcatgg acctgggcca catgtgcgat gccaccatgt cctatgagtg tccaatgctg 240
gacgagggcg tggagcccga cgatgtggat tgctggtgta ataccacatc tacatgggtg 300
gtgtacggca cctgtcacca caagaaggga gaggcccggc ggagccggcg ggccgtgaca 360
ctgccttccc actctaccag gaagctgcag acacgcagcc agacctggct ggagtccaga 420
gagtatacca agcacctgat cagggtggag aactggatct ttcgcaatcc aggattcgca 480
ctggcagcag cagcaatcgc atggctgctg ggaagctcca ccagccagaa agtgatctac 540
ctggtcatga tcctgctgat cgctcctgcc tattctatcc ggtgcatcgg cgtgagcaat 600
agagacttcg tggagggaat gtccggagga acctgggtgg atgtggtgct ggagcacggc 660
ggctgcgtga cagtgatggc ccaggacaag ccaaccgtgg atatcgagct ggtgaccaca 720
accgtgtcca acatggccga ggtgaggtct tactgctatg aggccagcat ctccgacatg 780
gcctctgata gcaggtgtcc aacccaggga gaggcatacc tggacaagca gtccgataca 840
cagtacgtgt gcaagcggac cctggtggac agaggctggg gcaatggctg tggcctgttt 900
ggcaagggct ctctggtgac atgcgccaag ttcgcctgta gcaagaagat gaccggcaag 960
tccatccagc cagagaacct ggagtaccgg atcatgctgt ctgtgcacgg ctcccagcac 1020
tctggcatga tcgtgaacga cacaggccac gagacagatg agaatcgggc caaggtggag 1080
atcacaccta actctccaag agccgaggcc accctgggag gatttggctc tctgggcctg 1140
gactgcgagc ctagaacagg cctggacttc tccgatctgt actatctgac catgaacaat 1200
aagcactggc tggtgcacaa ggagtggttt cacgacatcc cactgccatg gcacgcagga 1260
gcagatacag gaacaccaca ctggaacaat aaggaggccc tggtggagtt caaggatgcc 1320
cacgccaagc ggcagacagt ggtggtgctg ggcagccagg agggagcagt gcacaccgcc 1380
ctggcaggcg ccctggaggc agagatggac ggagctaagg gcagactgtc tagcggccac 1440
ctgaagtgca ggctgaagat ggataagctg cgcctgaagg gcgtgtccta ctctctgtgc 1500
acagccgcct tcaccttcac caagatccct gccgagacac tgcacggcac agtgaccgtg 1560
gaggtgcagt atgccggcac agacggaccc tgtaaggtgc ctgcccagat ggccgtggat 1620
atgcagacac tgacacctgt gggcaggctg atcaccgcca atccagtgat cacagagtct 1680
accgagaaca gcaagatgat gctggagctg gacccaccat ttggcgatag ctatatcgtg 1740
atcggcgtgg gcgagaagaa gatcacacac cactggcacc gcagcggctc cacaatcggc 1800
aaggcctttg aggcaaccgt gcgcggagca aagagaatgg ccgtgctggg cgacaccgca 1860
tgggatttcg gatctgtggg aggcgccctg aacagcctgg gcaagggcat ctga 1914
<210> 89
<211> 637
<212> PRT
<213> Artificial sequence
<220>
<223> JEVsp_Zika_prME411 protein (B3)
<400> 89
Met Gly Lys Arg Ser Ala Gly Ser Ile Met Trp Leu Ala Ser Leu Ala
1 5 10 15
Val Val Ile Ala Cys Ala Gly Ala Ala Glu Val Thr Arg Arg Gly Ser
20 25 30
Ala Tyr Tyr Met Tyr Leu Asp Arg Asn Asp Ala Gly Glu Ala Ile Ser
35 40 45
Phe Pro Thr Thr Leu Gly Met Asn Lys Cys Tyr Ile Gln Ile Met Asp
50 55 60
Leu Gly His Met Cys Asp Ala Thr Met Ser Tyr Glu Cys Pro Met Leu
65 70 75 80
Asp Glu Gly Val Glu Pro Asp Asp Val Asp Cys Trp Cys Asn Thr Thr
85 90 95
Ser Thr Trp Val Val Tyr Gly Thr Cys His His Lys Lys Gly Glu Ala
100 105 110
Arg Arg Ser Arg Arg Ala Val Thr Leu Pro Ser His Ser Thr Arg Lys
115 120 125
Leu Gln Thr Arg Ser Gln Thr Trp Leu Glu Ser Arg Glu Tyr Thr Lys
130 135 140
His Leu Ile Arg Val Glu Asn Trp Ile Phe Arg Asn Pro Gly Phe Ala
145 150 155 160
Leu Ala Ala Ala Ala Ile Ala Trp Leu Leu Gly Ser Ser Thr Ser Gln
165 170 175
Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile Ala Pro Ala Tyr Ser
180 185 190
Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser
195 200 205
Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val Thr
210 215 220
Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr
225 230 235 240
Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser
245 250 255
Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala
260 265 270
Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
275 280 285
Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser
290 295 300
Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys
305 310 315 320
Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
325 330 335
Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr
340 345 350
Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
355 360 365
Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro
370 375 380
Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn
385 390 395 400
Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
405 410 415
Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu
420 425 430
Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
435 440 445
Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala
450 455 460
Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His
465 470 475 480
Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
485 490 495
Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu
500 505 510
Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
515 520 525
Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu
530 535 540
Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser
545 550 555 560
Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
565 570 575
Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp
580 585 590
His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
595 600 605
Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly
610 615 620
Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile
625 630 635
<210> 90
<211> 1794
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of JEVsp_Zika_prME395 protein (B4)
<400> 90
atgggcaaac gatcagccgg ctcaatcatg tggctcgcga gcttggcagt tgtcatagct 60
tgtgcaggag ccgcggaggt cactagacgt gggagtgcat actatatgta cttggacaga 120
aacgatgctg gggaggccat atcttttcca accacattgg ggatgaataa gtgttatata 180
cagatcatgg atcttggaca catgtgtgat gccaccatga gctatgaatg ccctatgctg 240
gatgaggggg tggaaccaga tgacgtcgat tgttggtgca acacgacgtc aacttgggtt 300
gtgtacggaa cctgccatca caaaaaaggt gaagcacgga gatctagaag agctgtgacg 360
ctcccctccc attccactag gaagctgcaa acgcggtcgc aaacctggtt ggaatcaaga 420
gaatacacaa agcacttgat tagagtcgaa aattggatat tcaggaaccc tggcttcgcg 480
ttagcagcag ctgccatcgc ttggcttttg ggaagctcaa cgagccaaaa agtcatatac 540
ttggtcatga tactgctgat tgccccggca tacagcatca ggtgcatagg agtcagcaat 600
agggactttg tggaaggtat gtcaggtggg acttgggttg atgttgtctt ggaacatgga 660
ggttgtgtca ccgtaatggc acaggacaaa ccgactgtcg acatagagct ggttacaaca 720
acagtcagca acatggcgga ggtaagatcc tactgctatg aggcatcaat atcagacatg 780
gcttcggaca gccgctgccc aacacaaggt gaagcctacc ttgacaagca atcagacact 840
caatatgtct gcaaaagaac gttagtggac agaggctggg gaaatggatg tggacttttt 900
ggcaaaggga gcctggtgac atgcgctaag tttgcatgct ccaagaaaat gaccgggaag 960
agcatccagc cagagaatct ggagtaccgg ataatgctgt cagttcatgg ctcccagcac 1020
agtgggatga tcgttaatga cacaggacat gaaactgatg agaatagagc gaaggttgag 1080
ataacgccca attcaccaag agccgaagcc accctggggg gttttggaag cctaggactt 1140
gattgtgaac cgaggacagg ccttgacttt tcagatttgt attacttgac tatgaataac 1200
aagcactggt tggttcacaa ggagtggttc cacgacattc cattaccttg gcacgctggg 1260
gcagacaccg gaactccaca ctggaacaac aaagaagcac tggtagagtt caaggacgca 1320
catgccaaaa ggcaaactgt cgtggttcta gggagtcaag aaggagcagt tcacacggcc 1380
cttgctggag ctctggaggc tgagatggat ggtgcaaagg gaaggctgtc ctctggccac 1440
ttgaaatgtc gcctgaaaat ggataaactt agattgaagg gcgtgtcata ctccttgtgt 1500
accgcagcgt tcacattcac caagatcccg gctgaaacac tgcacgggac agtcacagtg 1560
gaggtacagt acgcagggac agatggacct tgcaaggttc cagctcagat ggcggtggac 1620
atgcaaactc tgaccccagt tgggaggttg ataaccgcta accccgtaat cactgaaagc 1680
actgagaact ctaagatgat gctggaactt gatccaccat ttggggactc ttacattgtc 1740
ataggagtcg gggagaagaa gatcacccac cactggcaca ggagtggctg atga 1794
<210> 91
<211> 1794
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of JEVsp_Zika_prME395 protein (B4)
<400> 91
atgggcaaga ggtccgcagg gagcattatg tggctggcat ctctggcagt cgtcatcgct 60
tgtgcaggag cagcagaggt gaccaggaga ggaagcgcct actatatgta cctggacagg 120
aatgatgccg gcgaggccat ctccttccca accacactgg gcatgaacaa gtgctacatc 180
cagatcatgg acctgggcca catgtgcgat gccaccatgt cctatgagtg tccaatgctg 240
gacgagggcg tggagcccga cgatgtggat tgctggtgta ataccacatc tacatgggtg 300
gtgtacggca cctgtcacca caagaaggga gaggcccggc ggagccggcg ggccgtgaca 360
ctgccttccc actctaccag gaagctgcag acacgcagcc agacctggct ggagtccaga 420
gagtatacca agcacctgat cagggtggag aactggatct ttcgcaatcc aggattcgca 480
ctggcagcag cagcaatcgc atggctgctg ggaagctcca ccagccagaa agtgatctac 540
ctggtcatga tcctgctgat cgctcctgcc tattctatcc ggtgcatcgg cgtgagcaat 600
agagacttcg tggagggaat gtccggagga acctgggtgg atgtggtgct ggagcacggc 660
ggctgcgtga cagtgatggc ccaggacaag ccaaccgtgg atatcgagct ggtgaccaca 720
accgtgtcca acatggccga ggtgaggtct tactgctatg aggccagcat ctccgacatg 780
gcctctgata gcaggtgtcc aacccaggga gaggcatacc tggacaagca gtccgataca 840
cagtacgtgt gcaagcggac cctggtggac agaggctggg gcaatggctg tggcctgttt 900
ggcaagggct ctctggtgac atgcgccaag ttcgcctgta gcaagaagat gaccggcaag 960
tccatccagc cagagaacct ggagtaccgg atcatgctgt ctgtgcacgg ctcccagcac 1020
tctggcatga tcgtgaacga cacaggccac gagacagatg agaatcgggc caaggtggag 1080
atcacaccta actctccaag agccgaggcc accctgggag gatttggctc tctgggcctg 1140
gactgcgagc ctagaacagg cctggacttc tccgatctgt actatctgac catgaacaat 1200
aagcactggc tggtgcacaa ggagtggttt cacgacatcc cactgccatg gcacgcagga 1260
gcagatacag gaacaccaca ctggaacaat aaggaggccc tggtggagtt caaggatgcc 1320
cacgccaagc ggcagacagt ggtggtgctg ggcagccagg agggagcagt gcacaccgcc 1380
ctggcaggcg ccctggaggc agagatggac ggagctaagg gcagactgtc tagcggccac 1440
ctgaagtgca ggctgaagat ggataagctg cgcctgaagg gcgtgtccta ctctctgtgc 1500
acagccgcct tcaccttcac caagatccct gccgagacac tgcacggcac agtgaccgtg 1560
gaggtgcagt atgccggcac agacggaccc tgtaaggtgc ctgcccagat ggccgtggat 1620
atgcagacac tgacacctgt gggcaggctg atcaccgcca atccagtgat cacagagtct 1680
accgagaaca gcaagatgat gctggagctg gacccaccat ttggcgatag ctatatcgtg 1740
atcggcgtgg gcgagaagaa gatcacacac cactggcacc gcagcggctg atga 1794
<210> 92
<211> 596
<212> PRT
<213> Artificial sequence
<220>
<223> JEVsp_Zika_prME395 protein (B4)
<400> 92
Met Gly Lys Arg Ser Ala Gly Ser Ile Met Trp Leu Ala Ser Leu Ala
1 5 10 15
Val Val Ile Ala Cys Ala Gly Ala Ala Glu Val Thr Arg Arg Gly Ser
20 25 30
Ala Tyr Tyr Met Tyr Leu Asp Arg Asn Asp Ala Gly Glu Ala Ile Ser
35 40 45
Phe Pro Thr Thr Leu Gly Met Asn Lys Cys Tyr Ile Gln Ile Met Asp
50 55 60
Leu Gly His Met Cys Asp Ala Thr Met Ser Tyr Glu Cys Pro Met Leu
65 70 75 80
Asp Glu Gly Val Glu Pro Asp Asp Val Asp Cys Trp Cys Asn Thr Thr
85 90 95
Ser Thr Trp Val Val Tyr Gly Thr Cys His His Lys Lys Gly Glu Ala
100 105 110
Arg Arg Ser Arg Arg Ala Val Thr Leu Pro Ser His Ser Thr Arg Lys
115 120 125
Leu Gln Thr Arg Ser Gln Thr Trp Leu Glu Ser Arg Glu Tyr Thr Lys
130 135 140
His Leu Ile Arg Val Glu Asn Trp Ile Phe Arg Asn Pro Gly Phe Ala
145 150 155 160
Leu Ala Ala Ala Ala Ile Ala Trp Leu Leu Gly Ser Ser Thr Ser Gln
165 170 175
Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile Ala Pro Ala Tyr Ser
180 185 190
Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser
195 200 205
Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val Thr
210 215 220
Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr
225 230 235 240
Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser
245 250 255
Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala
260 265 270
Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
275 280 285
Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser
290 295 300
Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys
305 310 315 320
Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
325 330 335
Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr
340 345 350
Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
355 360 365
Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro
370 375 380
Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn
385 390 395 400
Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
405 410 415
Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu
420 425 430
Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
435 440 445
Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala
450 455 460
Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His
465 470 475 480
Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
485 490 495
Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu
500 505 510
Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
515 520 525
Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu
530 535 540
Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser
545 550 555 560
Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
565 570 575
Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp
580 585 590
His Arg Ser Gly
595
<210> 93
<211> 1590
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of JEVsp_ZikaE protein (B5)
<400> 93
atgggcaaac gatcagccgg ctcaatcatg tggctcgcga gcttggcagt tgtcatagct 60
tgtgcaggag ccatcaggtg cataggagtc agcaataggg actttgtgga aggtatgtca 120
ggtgggactt gggttgatgt tgtcttggaa catggaggtt gtgtcaccgt aatggcacag 180
gacaaaccga ctgtcgacat agagctggtt acaacaacag tcagcaacat ggcggaggta 240
agatcctact gctatgaggc atcaatatca gacatggctt cggacagccg ctgcccaaca 300
caaggtgaag cctaccttga caagcaatca gacactcaat atgtctgcaa aagaacgtta 360
gtggacagag gctggggaaa tggatgtgga ctttttggca aagggagcct ggtgacatgc 420
gctaagtttg catgctccaa gaaaatgacc gggaagagca tccagccaga gaatctggag 480
taccggataa tgctgtcagt tcatggctcc cagcacagtg ggatgatcgt taatgacaca 540
ggacatgaaa ctgatgagaa tagagcgaag gttgagataa cgcccaattc accaagagcc 600
gaagccaccc tggggggttt tggaagccta ggacttgatt gtgaaccgag gacaggcctt 660
gacttttcag atttgtatta cttgactatg aataacaagc actggttggt tcacaaggag 720
tggttccacg acattccatt accttggcac gctggggcag acaccggaac tccacactgg 780
aacaacaaag aagcactggt agagttcaag gacgcacatg ccaaaaggca aactgtcgtg 840
gttctaggga gtcaagaagg agcagttcac acggcccttg ctggagctct ggaggctgag 900
atggatggtg caaagggaag gctgtcctct ggccacttga aatgtcgcct gaaaatggat 960
aaacttagat tgaagggcgt gtcatactcc ttgtgtaccg cagcgttcac attcaccaag 1020
atcccggctg aaacactgca cgggacagtc acagtggagg tacagtacgc agggacagat 1080
ggaccttgca aggttccagc tcagatggcg gtggacatgc aaactctgac cccagttggg 1140
aggttgataa ccgctaaccc cgtaatcact gaaagcactg agaactctaa gatgatgctg 1200
gaacttgatc caccatttgg ggactcttac attgtcatag gagtcgggga gaagaagatc 1260
acccaccact ggcacaggag tggcagcacc attggaaaag catttgaagc cactgtgaga 1320
ggtgccaaga gaatggcagt cttgggagac acagcctggg actttggatc agttggaggc 1380
gctctcaact cattgggcaa gggcatccat caaatttttg gagcagcttt caaatcattg 1440
tttggaggaa tgtcctggtt ctcacaaatt ctcattggaa cgttgctgat gtggttgggt 1500
ctgaacacaa agaatggatc tatttccctt atgtgcttgg ccttaggggg agtgttgatc 1560
ttcttatcca cagccgtctc tgcttgatga 1590
<210> 94
<211> 1590
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of JEVsp_ZikaE protein (B5)
<400> 94
atgggcaaga ggtccgcagg gagcattatg tggctggcat ctctggcagt cgtcatcgct 60
tgtgcaggag caatccggtg catcggcgtg agcaatagag acttcgtgga gggaatgtcc 120
ggaggaacct gggtggatgt ggtgctggag cacggcggct gcgtgacagt gatggcccag 180
gacaagccaa ccgtggatat cgagctggtg accacaaccg tgtccaacat ggccgaggtg 240
aggtcttact gctatgaggc cagcatctcc gacatggcct ctgatagcag gtgtccaacc 300
cagggagagg catacctgga caagcagtcc gatacacagt acgtgtgcaa gcggaccctg 360
gtggacagag gctggggcaa tggctgtggc ctgtttggca agggctctct ggtgacatgc 420
gccaagttcg cctgtagcaa gaagatgacc ggcaagtcca tccagccaga gaacctggag 480
taccggatca tgctgtctgt gcacggctcc cagcactctg gcatgatcgt gaacgacaca 540
ggccacgaga cagatgagaa tcgggccaag gtggagatca cacctaactc tccaagagcc 600
gaggccaccc tgggaggatt tggctctctg ggcctggact gcgagcctag aacaggcctg 660
gacttctccg atctgtacta tctgaccatg aacaataagc actggctggt gcacaaggag 720
tggtttcacg acatcccact gccatggcac gcaggagcag atacaggaac accacactgg 780
aacaataagg aggccctggt ggagttcaag gatgcccacg ccaagcggca gacagtggtg 840
gtgctgggca gccaggaggg agcagtgcac accgccctgg caggcgccct ggaggcagag 900
atggacggag ctaagggcag actgtctagc ggccacctga agtgcaggct gaagatggat 960
aagctgcgcc tgaagggcgt gtcctactct ctgtgcacag ccgccttcac cttcaccaag 1020
atccctgccg agacactgca cggcacagtg accgtggagg tgcagtatgc cggcacagac 1080
ggaccctgta aggtgcctgc ccagatggcc gtggatatgc agacactgac acctgtgggc 1140
aggctgatca ccgccaatcc agtgatcaca gagtctaccg agaacagcaa gatgatgctg 1200
gagctggacc caccatttgg cgatagctat atcgtgatcg gcgtgggcga gaagaagatc 1260
acacaccact ggcaccgcag cggctccaca atcggcaagg cctttgaggc aaccgtgcgc 1320
ggagcaaaga gaatggccgt gctgggcgac accgcatggg atttcggatc tgtgggaggc 1380
gccctgaaca gcctgggcaa gggcatccac cagatcttcg gcgccgcctt taagtccctg 1440
ttcggcggca tgagctggtt ctcacagatc ctgatcggca cactgctgat gtggctgggc 1500
ctgaacacca agaatggctc tatcagcctg atgtgcctgg ccctgggagg cgtgctgatc 1560
ttcctgtcca ccgccgtgtc tgcctgatga 1590
<210> 95
<211> 528
<212> PRT
<213> Artificial sequence
<220>
<223> JEVsp_ZikaE protein (B5)
<400> 95
Met Gly Lys Arg Ser Ala Gly Ser Ile Met Trp Leu Ala Ser Leu Ala
1 5 10 15
Val Val Ile Ala Cys Ala Gly Ala Ile Arg Cys Ile Gly Val Ser Asn
20 25 30
Arg Asp Phe Val Glu Gly Met Ser Gly Gly Thr Trp Val Asp Val Val
35 40 45
Leu Glu His Gly Gly Cys Val Thr Val Met Ala Gln Asp Lys Pro Thr
50 55 60
Val Asp Ile Glu Leu Val Thr Thr Thr Val Ser Asn Met Ala Glu Val
65 70 75 80
Arg Ser Tyr Cys Tyr Glu Ala Ser Ile Ser Asp Met Ala Ser Asp Ser
85 90 95
Arg Cys Pro Thr Gln Gly Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr
100 105 110
Gln Tyr Val Cys Lys Arg Thr Leu Val Asp Arg Gly Trp Gly Asn Gly
115 120 125
Cys Gly Leu Phe Gly Lys Gly Ser Leu Val Thr Cys Ala Lys Phe Ala
130 135 140
Cys Ser Lys Lys Met Thr Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu
145 150 155 160
Tyr Arg Ile Met Leu Ser Val His Gly Ser Gln His Ser Gly Met Ile
165 170 175
Val Asn Asp Thr Gly His Glu Thr Asp Glu Asn Arg Ala Lys Val Glu
180 185 190
Ile Thr Pro Asn Ser Pro Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly
195 200 205
Ser Leu Gly Leu Asp Cys Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp
210 215 220
Leu Tyr Tyr Leu Thr Met Asn Asn Lys His Trp Leu Val His Lys Glu
225 230 235 240
Trp Phe His Asp Ile Pro Leu Pro Trp His Ala Gly Ala Asp Thr Gly
245 250 255
Thr Pro His Trp Asn Asn Lys Glu Ala Leu Val Glu Phe Lys Asp Ala
260 265 270
His Ala Lys Arg Gln Thr Val Val Val Leu Gly Ser Gln Glu Gly Ala
275 280 285
Val His Thr Ala Leu Ala Gly Ala Leu Glu Ala Glu Met Asp Gly Ala
290 295 300
Lys Gly Arg Leu Ser Ser Gly His Leu Lys Cys Arg Leu Lys Met Asp
305 310 315 320
Lys Leu Arg Leu Lys Gly Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe
325 330 335
Thr Phe Thr Lys Ile Pro Ala Glu Thr Leu His Gly Thr Val Thr Val
340 345 350
Glu Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys Lys Val Pro Ala Gln
355 360 365
Met Ala Val Asp Met Gln Thr Leu Thr Pro Val Gly Arg Leu Ile Thr
370 375 380
Ala Asn Pro Val Ile Thr Glu Ser Thr Glu Asn Ser Lys Met Met Leu
385 390 395 400
Glu Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile Val Ile Gly Val Gly
405 410 415
Glu Lys Lys Ile Thr His His Trp His Arg Ser Gly Ser Thr Ile Gly
420 425 430
Lys Ala Phe Glu Ala Thr Val Arg Gly Ala Lys Arg Met Ala Val Leu
435 440 445
Gly Asp Thr Ala Trp Asp Phe Gly Ser Val Gly Gly Ala Leu Asn Ser
450 455 460
Leu Gly Lys Gly Ile His Gln Ile Phe Gly Ala Ala Phe Lys Ser Leu
465 470 475 480
Phe Gly Gly Met Ser Trp Phe Ser Gln Ile Leu Ile Gly Thr Leu Leu
485 490 495
Met Trp Leu Gly Leu Asn Thr Lys Asn Gly Ser Ile Ser Leu Met Cys
500 505 510
Leu Ala Leu Gly Gly Val Leu Ile Phe Leu Ser Thr Ala Val Ser Ala
515 520 525
<210> 96
<211> 1446
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of JEVsp_ZikaE_no_Anchor protein (B6)
<400> 96
atgggcaaac gatcagccgg ctcaatcatg tggctcgcga gcttggcagt tgtcatagct 60
tgtgcaggag ccatcaggtg cataggagtc agcaataggg actttgtgga aggtatgtca 120
ggtgggactt gggttgatgt tgtcttggaa catggaggtt gtgtcaccgt aatggcacag 180
gacaaaccga ctgtcgacat agagctggtt acaacaacag tcagcaacat ggcggaggta 240
agatcctact gctatgaggc atcaatatca gacatggctt cggacagccg ctgcccaaca 300
caaggtgaag cctaccttga caagcaatca gacactcaat atgtctgcaa aagaacgtta 360
gtggacagag gctggggaaa tggatgtgga ctttttggca aagggagcct ggtgacatgc 420
gctaagtttg catgctccaa gaaaatgacc gggaagagca tccagccaga gaatctggag 480
taccggataa tgctgtcagt tcatggctcc cagcacagtg ggatgatcgt taatgacaca 540
ggacatgaaa ctgatgagaa tagagcgaag gttgagataa cgcccaattc accaagagcc 600
gaagccaccc tggggggttt tggaagccta ggacttgatt gtgaaccgag gacaggcctt 660
gacttttcag atttgtatta cttgactatg aataacaagc actggttggt tcacaaggag 720
tggttccacg acattccatt accttggcac gctggggcag acaccggaac tccacactgg 780
aacaacaaag aagcactggt agagttcaag gacgcacatg ccaaaaggca aactgtcgtg 840
gttctaggga gtcaagaagg agcagttcac acggcccttg ctggagctct ggaggctgag 900
atggatggtg caaagggaag gctgtcctct ggccacttga aatgtcgcct gaaaatggat 960
aaacttagat tgaagggcgt gtcatactcc ttgtgtaccg cagcgttcac attcaccaag 1020
atcccggctg aaacactgca cgggacagtc acagtggagg tacagtacgc agggacagat 1080
ggaccttgca aggttccagc tcagatggcg gtggacatgc aaactctgac cccagttggg 1140
aggttgataa ccgctaaccc cgtaatcact gaaagcactg agaactctaa gatgatgctg 1200
gaacttgatc caccatttgg ggactcttac attgtcatag gagtcgggga gaagaagatc 1260
acccaccact ggcacaggag tggcagcacc attggaaaag catttgaagc cactgtgaga 1320
ggtgccaaga gaatggcagt cttgggagac acagcctggg actttggatc agttggaggc 1380
gctctcaact cattgggcaa gggcatccat caaatttttg gagcagcttt caaatcattg 1440
tgatga 1446
<210> 97
<211> 1446
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of JEVsp_ZikaE_no_Anchor protein (B6)
<400> 97
atgggcaaga ggtccgcagg gagcattatg tggctggcat ctctggcagt cgtcatcgct 60
tgtgcaggag caatccggtg catcggcgtg agcaatagag acttcgtgga gggaatgtcc 120
ggaggaacct gggtggatgt ggtgctggag cacggcggct gcgtgacagt gatggcccag 180
gacaagccaa ccgtggatat cgagctggtg accacaaccg tgtccaacat ggccgaggtg 240
aggtcttact gctatgaggc cagcatctcc gacatggcct ctgatagcag gtgtccaacc 300
cagggagagg catacctgga caagcagtcc gatacacagt acgtgtgcaa gcggaccctg 360
gtggacagag gctggggcaa tggctgtggc ctgtttggca agggctctct ggtgacatgc 420
gccaagttcg cctgtagcaa gaagatgacc ggcaagtcca tccagccaga gaacctggag 480
taccggatca tgctgtctgt gcacggctcc cagcactctg gcatgatcgt gaacgacaca 540
ggccacgaga cagatgagaa tcgggccaag gtggagatca cacctaactc tccaagagcc 600
gaggccaccc tgggaggatt tggctctctg ggcctggact gcgagcctag aacaggcctg 660
gacttctccg atctgtacta tctgaccatg aacaataagc actggctggt gcacaaggag 720
tggtttcacg acatcccact gccatggcac gcaggagcag atacaggaac accacactgg 780
aacaataagg aggccctggt ggagttcaag gatgcccacg ccaagcggca gacagtggtg 840
gtgctgggca gccaggaggg agcagtgcac accgccctgg caggcgccct ggaggcagag 900
atggacggag ctaagggcag actgtctagc ggccacctga agtgcaggct gaagatggat 960
aagctgcgcc tgaagggcgt gtcctactct ctgtgcacag ccgccttcac cttcaccaag 1020
atccctgccg agacactgca cggcacagtg accgtggagg tgcagtatgc cggcacagac 1080
ggaccctgta aggtgcctgc ccagatggcc gtggatatgc agacactgac acctgtgggc 1140
aggctgatca ccgccaatcc agtgatcaca gagtctaccg agaacagcaa gatgatgctg 1200
gagctggacc caccatttgg cgatagctat atcgtgatcg gcgtgggcga gaagaagatc 1260
acacaccact ggcaccgcag cggctccaca atcggcaagg cctttgaggc aaccgtgcgc 1320
ggagcaaaga gaatggccgt gctgggcgac accgcatggg atttcggatc tgtgggaggc 1380
gccctgaaca gcctgggcaa gggcatccac cagatcttcg gcgccgcctt taagtccctg 1440
tgatga 1446
<210> 98
<211> 480
<212> PRT
<213> Artificial sequence
<220>
<223> JEVsp_ZikaE_no_Anchor protein (B6)
<400> 98
Met Gly Lys Arg Ser Ala Gly Ser Ile Met Trp Leu Ala Ser Leu Ala
1 5 10 15
Val Val Ile Ala Cys Ala Gly Ala Ile Arg Cys Ile Gly Val Ser Asn
20 25 30
Arg Asp Phe Val Glu Gly Met Ser Gly Gly Thr Trp Val Asp Val Val
35 40 45
Leu Glu His Gly Gly Cys Val Thr Val Met Ala Gln Asp Lys Pro Thr
50 55 60
Val Asp Ile Glu Leu Val Thr Thr Thr Val Ser Asn Met Ala Glu Val
65 70 75 80
Arg Ser Tyr Cys Tyr Glu Ala Ser Ile Ser Asp Met Ala Ser Asp Ser
85 90 95
Arg Cys Pro Thr Gln Gly Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr
100 105 110
Gln Tyr Val Cys Lys Arg Thr Leu Val Asp Arg Gly Trp Gly Asn Gly
115 120 125
Cys Gly Leu Phe Gly Lys Gly Ser Leu Val Thr Cys Ala Lys Phe Ala
130 135 140
Cys Ser Lys Lys Met Thr Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu
145 150 155 160
Tyr Arg Ile Met Leu Ser Val His Gly Ser Gln His Ser Gly Met Ile
165 170 175
Val Asn Asp Thr Gly His Glu Thr Asp Glu Asn Arg Ala Lys Val Glu
180 185 190
Ile Thr Pro Asn Ser Pro Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly
195 200 205
Ser Leu Gly Leu Asp Cys Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp
210 215 220
Leu Tyr Tyr Leu Thr Met Asn Asn Lys His Trp Leu Val His Lys Glu
225 230 235 240
Trp Phe His Asp Ile Pro Leu Pro Trp His Ala Gly Ala Asp Thr Gly
245 250 255
Thr Pro His Trp Asn Asn Lys Glu Ala Leu Val Glu Phe Lys Asp Ala
260 265 270
His Ala Lys Arg Gln Thr Val Val Val Leu Gly Ser Gln Glu Gly Ala
275 280 285
Val His Thr Ala Leu Ala Gly Ala Leu Glu Ala Glu Met Asp Gly Ala
290 295 300
Lys Gly Arg Leu Ser Ser Gly His Leu Lys Cys Arg Leu Lys Met Asp
305 310 315 320
Lys Leu Arg Leu Lys Gly Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe
325 330 335
Thr Phe Thr Lys Ile Pro Ala Glu Thr Leu His Gly Thr Val Thr Val
340 345 350
Glu Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys Lys Val Pro Ala Gln
355 360 365
Met Ala Val Asp Met Gln Thr Leu Thr Pro Val Gly Arg Leu Ile Thr
370 375 380
Ala Asn Pro Val Ile Thr Glu Ser Thr Glu Asn Ser Lys Met Met Leu
385 390 395 400
Glu Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile Val Ile Gly Val Gly
405 410 415
Glu Lys Lys Ile Thr His His Trp His Arg Ser Gly Ser Thr Ile Gly
420 425 430
Lys Ala Phe Glu Ala Thr Val Arg Gly Ala Lys Arg Met Ala Val Leu
435 440 445
Gly Asp Thr Ala Trp Asp Phe Gly Ser Val Gly Gly Ala Leu Asn Ser
450 455 460
Leu Gly Lys Gly Ile His Gln Ile Phe Gly Ala Ala Phe Lys Ser Leu
465 470 475 480
<210> 99
<211> 1410
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of JEVsp_ZikaE411 protein (B7)
<400> 99
atgggcaaac gatcagccgg ctcaatcatg tggctcgcga gcttggcagt tgtcatagct 60
tgtgcaggag ccatcaggtg cataggagtc agcaataggg actttgtgga aggtatgtca 120
ggtgggactt gggttgatgt tgtcttggaa catggaggtt gtgtcaccgt aatggcacag 180
gacaaaccga ctgtcgacat agagctggtt acaacaacag tcagcaacat ggcggaggta 240
agatcctact gctatgaggc atcaatatca gacatggctt cggacagccg ctgcccaaca 300
caaggtgaag cctaccttga caagcaatca gacactcaat atgtctgcaa aagaacgtta 360
gtggacagag gctggggaaa tggatgtgga ctttttggca aagggagcct ggtgacatgc 420
gctaagtttg catgctccaa gaaaatgacc gggaagagca tccagccaga gaatctggag 480
taccggataa tgctgtcagt tcatggctcc cagcacagtg ggatgatcgt taatgacaca 540
ggacatgaaa ctgatgagaa tagagcgaag gttgagataa cgcccaattc accaagagcc 600
gaagccaccc tggggggttt tggaagccta ggacttgatt gtgaaccgag gacaggcctt 660
gacttttcag atttgtatta cttgactatg aataacaagc actggttggt tcacaaggag 720
tggttccacg acattccatt accttggcac gctggggcag acaccggaac tccacactgg 780
aacaacaaag aagcactggt agagttcaag gacgcacatg ccaaaaggca aactgtcgtg 840
gttctaggga gtcaagaagg agcagttcac acggcccttg ctggagctct ggaggctgag 900
atggatggtg caaagggaag gctgtcctct ggccacttga aatgtcgcct gaaaatggat 960
aaacttagat tgaagggcgt gtcatactcc ttgtgtaccg cagcgttcac attcaccaag 1020
atcccggctg aaacactgca cgggacagtc acagtggagg tacagtacgc agggacagat 1080
ggaccttgca aggttccagc tcagatggcg gtggacatgc aaactctgac cccagttggg 1140
aggttgataa ccgctaaccc cgtaatcact gaaagcactg agaactctaa gatgatgctg 1200
gaacttgatc caccatttgg ggactcttac attgtcatag gagtcgggga gaagaagatc 1260
acccaccact ggcacaggag tggcagcacc attggaaaag catttgaagc cactgtgaga 1320
ggtgccaaga gaatggcagt cttgggagac acagcctggg actttggatc agttggaggc 1380
gctctcaact cattgggcaa gggcatctga 1410
<210> 100
<211> 1410
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of JEVsp_ZikaE411 protein (B7)
<400> 100
atgggcaaga ggtccgcagg gagcattatg tggctggcat ctctggcagt cgtcatcgct 60
tgtgcaggag caatccggtg catcggcgtg agcaatagag acttcgtgga gggaatgtcc 120
ggaggaacct gggtggatgt ggtgctggag cacggcggct gcgtgacagt gatggcccag 180
gacaagccaa ccgtggatat cgagctggtg accacaaccg tgtccaacat ggccgaggtg 240
aggtcttact gctatgaggc cagcatctcc gacatggcct ctgatagcag gtgtccaacc 300
cagggagagg catacctgga caagcagtcc gatacacagt acgtgtgcaa gcggaccctg 360
gtggacagag gctggggcaa tggctgtggc ctgtttggca agggctctct ggtgacatgc 420
gccaagttcg cctgtagcaa gaagatgacc ggcaagtcca tccagccaga gaacctggag 480
taccggatca tgctgtctgt gcacggctcc cagcactctg gcatgatcgt gaacgacaca 540
ggccacgaga cagatgagaa tcgggccaag gtggagatca cacctaactc tccaagagcc 600
gaggccaccc tgggaggatt tggctctctg ggcctggact gcgagcctag aacaggcctg 660
gacttctccg atctgtacta tctgaccatg aacaataagc actggctggt gcacaaggag 720
tggtttcacg acatcccact gccatggcac gcaggagcag atacaggaac accacactgg 780
aacaataagg aggccctggt ggagttcaag gatgcccacg ccaagcggca gacagtggtg 840
gtgctgggca gccaggaggg agcagtgcac accgccctgg caggcgccct ggaggcagag 900
atggacggag ctaagggcag actgtctagc ggccacctga agtgcaggct gaagatggat 960
aagctgcgcc tgaagggcgt gtcctactct ctgtgcacag ccgccttcac cttcaccaag 1020
atccctgccg agacactgca cggcacagtg accgtggagg tgcagtatgc cggcacagac 1080
ggaccctgta aggtgcctgc ccagatggcc gtggatatgc agacactgac acctgtgggc 1140
aggctgatca ccgccaatcc agtgatcaca gagtctaccg agaacagcaa gatgatgctg 1200
gagctggacc caccatttgg cgatagctat atcgtgatcg gcgtgggcga gaagaagatc 1260
acacaccact ggcaccgcag cggctccaca atcggcaagg cctttgaggc aaccgtgcgc 1320
ggagcaaaga gaatggccgt gctgggcgac accgcatggg atttcggatc tgtgggaggc 1380
gccctgaaca gcctgggcaa gggcatctga 1410
<210> 101
<211> 469
<212> PRT
<213> Artificial sequence
<220>
<223> JEVsp_ZikaE411 protein (B7)
<400> 101
Met Gly Lys Arg Ser Ala Gly Ser Ile Met Trp Leu Ala Ser Leu Ala
1 5 10 15
Val Val Ile Ala Cys Ala Gly Ala Ile Arg Cys Ile Gly Val Ser Asn
20 25 30
Arg Asp Phe Val Glu Gly Met Ser Gly Gly Thr Trp Val Asp Val Val
35 40 45
Leu Glu His Gly Gly Cys Val Thr Val Met Ala Gln Asp Lys Pro Thr
50 55 60
Val Asp Ile Glu Leu Val Thr Thr Thr Val Ser Asn Met Ala Glu Val
65 70 75 80
Arg Ser Tyr Cys Tyr Glu Ala Ser Ile Ser Asp Met Ala Ser Asp Ser
85 90 95
Arg Cys Pro Thr Gln Gly Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr
100 105 110
Gln Tyr Val Cys Lys Arg Thr Leu Val Asp Arg Gly Trp Gly Asn Gly
115 120 125
Cys Gly Leu Phe Gly Lys Gly Ser Leu Val Thr Cys Ala Lys Phe Ala
130 135 140
Cys Ser Lys Lys Met Thr Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu
145 150 155 160
Tyr Arg Ile Met Leu Ser Val His Gly Ser Gln His Ser Gly Met Ile
165 170 175
Val Asn Asp Thr Gly His Glu Thr Asp Glu Asn Arg Ala Lys Val Glu
180 185 190
Ile Thr Pro Asn Ser Pro Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly
195 200 205
Ser Leu Gly Leu Asp Cys Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp
210 215 220
Leu Tyr Tyr Leu Thr Met Asn Asn Lys His Trp Leu Val His Lys Glu
225 230 235 240
Trp Phe His Asp Ile Pro Leu Pro Trp His Ala Gly Ala Asp Thr Gly
245 250 255
Thr Pro His Trp Asn Asn Lys Glu Ala Leu Val Glu Phe Lys Asp Ala
260 265 270
His Ala Lys Arg Gln Thr Val Val Val Leu Gly Ser Gln Glu Gly Ala
275 280 285
Val His Thr Ala Leu Ala Gly Ala Leu Glu Ala Glu Met Asp Gly Ala
290 295 300
Lys Gly Arg Leu Ser Ser Gly His Leu Lys Cys Arg Leu Lys Met Asp
305 310 315 320
Lys Leu Arg Leu Lys Gly Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe
325 330 335
Thr Phe Thr Lys Ile Pro Ala Glu Thr Leu His Gly Thr Val Thr Val
340 345 350
Glu Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys Lys Val Pro Ala Gln
355 360 365
Met Ala Val Asp Met Gln Thr Leu Thr Pro Val Gly Arg Leu Ile Thr
370 375 380
Ala Asn Pro Val Ile Thr Glu Ser Thr Glu Asn Ser Lys Met Met Leu
385 390 395 400
Glu Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile Val Ile Gly Val Gly
405 410 415
Glu Lys Lys Ile Thr His His Trp His Arg Ser Gly Ser Thr Ile Gly
420 425 430
Lys Ala Phe Glu Ala Thr Val Arg Gly Ala Lys Arg Met Ala Val Leu
435 440 445
Gly Asp Thr Ala Trp Asp Phe Gly Ser Val Gly Gly Ala Leu Asn Ser
450 455 460
Leu Gly Lys Gly Ile
465
<210> 102
<211> 1290
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of JEVsp_ZikaE395 protein (B8)
<400> 102
atgggcaaac gatcagccgg ctcaatcatg tggctcgcga gcttggcagt tgtcatagct 60
tgtgcaggag ccatcaggtg cataggagtc agcaataggg actttgtgga aggtatgtca 120
ggtgggactt gggttgatgt tgtcttggaa catggaggtt gtgtcaccgt aatggcacag 180
gacaaaccga ctgtcgacat agagctggtt acaacaacag tcagcaacat ggcggaggta 240
agatcctact gctatgaggc atcaatatca gacatggctt cggacagccg ctgcccaaca 300
caaggtgaag cctaccttga caagcaatca gacactcaat atgtctgcaa aagaacgtta 360
gtggacagag gctggggaaa tggatgtgga ctttttggca aagggagcct ggtgacatgc 420
gctaagtttg catgctccaa gaaaatgacc gggaagagca tccagccaga gaatctggag 480
taccggataa tgctgtcagt tcatggctcc cagcacagtg ggatgatcgt taatgacaca 540
ggacatgaaa ctgatgagaa tagagcgaag gttgagataa cgcccaattc accaagagcc 600
gaagccaccc tggggggttt tggaagccta ggacttgatt gtgaaccgag gacaggcctt 660
gacttttcag atttgtatta cttgactatg aataacaagc actggttggt tcacaaggag 720
tggttccacg acattccatt accttggcac gctggggcag acaccggaac tccacactgg 780
aacaacaaag aagcactggt agagttcaag gacgcacatg ccaaaaggca aactgtcgtg 840
gttctaggga gtcaagaagg agcagttcac acggcccttg ctggagctct ggaggctgag 900
atggatggtg caaagggaag gctgtcctct ggccacttga aatgtcgcct gaaaatggat 960
aaacttagat tgaagggcgt gtcatactcc ttgtgtaccg cagcgttcac attcaccaag 1020
atcccggctg aaacactgca cgggacagtc acagtggagg tacagtacgc agggacagat 1080
ggaccttgca aggttccagc tcagatggcg gtggacatgc aaactctgac cccagttggg 1140
aggttgataa ccgctaaccc cgtaatcact gaaagcactg agaactctaa gatgatgctg 1200
gaacttgatc caccatttgg ggactcttac attgtcatag gagtcgggga gaagaagatc 1260
acccaccact ggcacaggag tggctgatga 1290
<210> 103
<211> 1290
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of JEVsp_ZikaE395 protein (B8)
<400> 103
atgggcaaga ggtccgcagg gagcattatg tggctggcat ctctggcagt cgtcatcgct 60
tgtgcaggag caatccggtg catcggcgtg agcaatagag acttcgtgga gggaatgtcc 120
ggaggaacct gggtggatgt ggtgctggag cacggcggct gcgtgacagt gatggcccag 180
gacaagccaa ccgtggatat cgagctggtg accacaaccg tgtccaacat ggccgaggtg 240
aggtcttact gctatgaggc cagcatctcc gacatggcct ctgatagcag gtgtccaacc 300
cagggagagg catacctgga caagcagtcc gatacacagt acgtgtgcaa gcggaccctg 360
gtggacagag gctggggcaa tggctgtggc ctgtttggca agggctctct ggtgacatgc 420
gccaagttcg cctgtagcaa gaagatgacc ggcaagtcca tccagccaga gaacctggag 480
taccggatca tgctgtctgt gcacggctcc cagcactctg gcatgatcgt gaacgacaca 540
ggccacgaga cagatgagaa tcgggccaag gtggagatca cacctaactc tccaagagcc 600
gaggccaccc tgggaggatt tggctctctg ggcctggact gcgagcctag aacaggcctg 660
gacttctccg atctgtacta tctgaccatg aacaataagc actggctggt gcacaaggag 720
tggtttcacg acatcccact gccatggcac gcaggagcag atacaggaac accacactgg 780
aacaataagg aggccctggt ggagttcaag gatgcccacg ccaagcggca gacagtggtg 840
gtgctgggca gccaggaggg agcagtgcac accgccctgg caggcgccct ggaggcagag 900
atggacggag ctaagggcag actgtctagc ggccacctga agtgcaggct gaagatggat 960
aagctgcgcc tgaagggcgt gtcctactct ctgtgcacag ccgccttcac cttcaccaag 1020
atccctgccg agacactgca cggcacagtg accgtggagg tgcagtatgc cggcacagac 1080
ggaccctgta aggtgcctgc ccagatggcc gtggatatgc agacactgac acctgtgggc 1140
aggctgatca ccgccaatcc agtgatcaca gagtctaccg agaacagcaa gatgatgctg 1200
gagctggacc caccatttgg cgatagctat atcgtgatcg gcgtgggcga gaagaagatc 1260
acacaccact ggcaccgcag cggctgatga 1290
<210> 104
<211> 428
<212> PRT
<213> Artificial sequence
<220>
<223> JEVsp_ZikaE395 protein (B8)
<400> 104
Met Gly Lys Arg Ser Ala Gly Ser Ile Met Trp Leu Ala Ser Leu Ala
1 5 10 15
Val Val Ile Ala Cys Ala Gly Ala Ile Arg Cys Ile Gly Val Ser Asn
20 25 30
Arg Asp Phe Val Glu Gly Met Ser Gly Gly Thr Trp Val Asp Val Val
35 40 45
Leu Glu His Gly Gly Cys Val Thr Val Met Ala Gln Asp Lys Pro Thr
50 55 60
Val Asp Ile Glu Leu Val Thr Thr Thr Val Ser Asn Met Ala Glu Val
65 70 75 80
Arg Ser Tyr Cys Tyr Glu Ala Ser Ile Ser Asp Met Ala Ser Asp Ser
85 90 95
Arg Cys Pro Thr Gln Gly Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr
100 105 110
Gln Tyr Val Cys Lys Arg Thr Leu Val Asp Arg Gly Trp Gly Asn Gly
115 120 125
Cys Gly Leu Phe Gly Lys Gly Ser Leu Val Thr Cys Ala Lys Phe Ala
130 135 140
Cys Ser Lys Lys Met Thr Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu
145 150 155 160
Tyr Arg Ile Met Leu Ser Val His Gly Ser Gln His Ser Gly Met Ile
165 170 175
Val Asn Asp Thr Gly His Glu Thr Asp Glu Asn Arg Ala Lys Val Glu
180 185 190
Ile Thr Pro Asn Ser Pro Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly
195 200 205
Ser Leu Gly Leu Asp Cys Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp
210 215 220
Leu Tyr Tyr Leu Thr Met Asn Asn Lys His Trp Leu Val His Lys Glu
225 230 235 240
Trp Phe His Asp Ile Pro Leu Pro Trp His Ala Gly Ala Asp Thr Gly
245 250 255
Thr Pro His Trp Asn Asn Lys Glu Ala Leu Val Glu Phe Lys Asp Ala
260 265 270
His Ala Lys Arg Gln Thr Val Val Val Leu Gly Ser Gln Glu Gly Ala
275 280 285
Val His Thr Ala Leu Ala Gly Ala Leu Glu Ala Glu Met Asp Gly Ala
290 295 300
Lys Gly Arg Leu Ser Ser Gly His Leu Lys Cys Arg Leu Lys Met Asp
305 310 315 320
Lys Leu Arg Leu Lys Gly Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe
325 330 335
Thr Phe Thr Lys Ile Pro Ala Glu Thr Leu His Gly Thr Val Thr Val
340 345 350
Glu Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys Lys Val Pro Ala Gln
355 360 365
Met Ala Val Asp Met Gln Thr Leu Thr Pro Val Gly Arg Leu Ile Thr
370 375 380
Ala Asn Pro Val Ile Thr Glu Ser Thr Glu Asn Ser Lys Met Met Leu
385 390 395 400
Glu Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile Val Ile Gly Val Gly
405 410 415
Glu Lys Lys Ile Thr His His Trp His Arg Ser Gly
420 425
<210> 105
<211> 2106
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of MVsp_ZikaprME (C1)
<400> 105
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca aatccatgcg gaggtcacta gacgtgggag tgcatactat 120
atgtacttgg acagaaacga tgctggggag gccatatctt ttccaaccac attggggatg 180
aataagtgtt atatacagat catggatctt ggacacatgt gtgatgccac catgagctat 240
gaatgcccta tgctggatga gggggtggaa ccagatgacg tcgattgttg gtgcaacacg 300
acgtcaactt gggttgtgta cggaacctgc catcacaaaa aaggtgaagc acggagatct 360
agaagagctg tgacgctccc ctcccattcc actaggaagc tgcaaacgcg gtcgcaaacc 420
tggttggaat caagagaata cacaaagcac ttgattagag tcgaaaattg gatattcagg 480
aaccctggct tcgcgttagc agcagctgcc atcgcttggc ttttgggaag ctcaacgagc 540
caaaaagtca tatacttggt catgatactg ctgattgccc cggcatacag catcaggtgc 600
ataggagtca gcaataggga ctttgtggaa ggtatgtcag gtgggacttg ggttgatgtt 660
gtcttggaac atggaggttg tgtcaccgta atggcacagg acaaaccgac tgtcgacata 720
gagctggtta caacaacagt cagcaacatg gcggaggtaa gatcctactg ctatgaggca 780
tcaatatcag acatggcttc ggacagccgc tgcccaacac aaggtgaagc ctaccttgac 840
aagcaatcag acactcaata tgtctgcaaa agaacgttag tggacagagg ctggggaaat 900
ggatgtggac tttttggcaa agggagcctg gtgacatgcg ctaagtttgc atgctccaag 960
aaaatgaccg ggaagagcat ccagccagag aatctggagt accggataat gctgtcagtt 1020
catggctccc agcacagtgg gatgatcgtt aatgacacag gacatgaaac tgatgagaat 1080
agagcgaagg ttgagataac gcccaattca ccaagagccg aagccaccct ggggggtttt 1140
ggaagcctag gacttgattg tgaaccgagg acaggccttg acttttcaga tttgtattac 1200
ttgactatga ataacaagca ctggttggtt cacaaggagt ggttccacga cattccatta 1260
ccttggcacg ctggggcaga caccggaact ccacactgga acaacaaaga agcactggta 1320
gagttcaagg acgcacatgc caaaaggcaa actgtcgtgg ttctagggag tcaagaagga 1380
gcagttcaca cggcccttgc tggagctctg gaggctgaga tggatggtgc aaagggaagg 1440
ctgtcctctg gccacttgaa atgtcgcctg aaaatggata aacttagatt gaagggcgtg 1500
tcatactcct tgtgtaccgc agcgttcaca ttcaccaaga tcccggctga aacactgcac 1560
gggacagtca cagtggaggt acagtacgca gggacagatg gaccttgcaa ggttccagct 1620
cagatggcgg tggacatgca aactctgacc ccagttggga ggttgataac cgctaacccc 1680
gtaatcactg aaagcactga gaactctaag atgatgctgg aacttgatcc accatttggg 1740
gactcttaca ttgtcatagg agtcggggag aagaagatca cccaccactg gcacaggagt 1800
ggcagcacca ttggaaaagc atttgaagcc actgtgagag gtgccaagag aatggcagtc 1860
ttgggagaca cagcctggga ctttggatca gttggaggcg ctctcaactc attgggcaag 1920
ggcatccatc aaatttttgg agcagctttc aaatcattgt ttggaggaat gtcctggttc 1980
tcacaaattc tcattggaac gttgctgatg tggttgggtc tgaacacaaa gaatggatct 2040
atttccctta tgtgcttggc cttaggggga gtgttgatct tcttatccac agccgtctct 2100
gcttga 2106
<210> 106
<211> 2106
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of MVsp_ZikaprME (C1)
<400> 106
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca gatccacgca gaggtgacca ggagaggaag cgcctactat 120
atgtacctgg acaggaatga tgccggcgag gccatctcct tcccaaccac actgggcatg 180
aacaagtgct acatccagat catggacctg ggccacatgt gcgatgccac catgtcctat 240
gagtgtccaa tgctggacga gggcgtggag cccgacgatg tggattgctg gtgtaatacc 300
acatctacat gggtggtgta cggcacctgt caccacaaga agggagaggc ccggcggagc 360
cggcgggccg tgacactgcc ttcccactct accaggaagc tgcagacacg cagccagacc 420
tggctggagt ccagagagta taccaagcac ctgatcaggg tggagaactg gatctttcgc 480
aatccaggat tcgcactggc agcagcagca atcgcatggc tgctgggaag ctccaccagc 540
cagaaagtga tctacctggt catgatcctg ctgatcgctc ctgcctattc tatccggtgc 600
atcggcgtga gcaatagaga cttcgtggag ggaatgtccg gaggaacctg ggtggatgtg 660
gtgctggagc acggcggctg cgtgacagtg atggcccagg acaagccaac cgtggatatc 720
gagctggtga ccacaaccgt gtccaacatg gccgaggtga ggtcttactg ctatgaggcc 780
agcatctccg acatggcctc tgatagcagg tgtccaaccc agggagaggc atacctggac 840
aagcagtccg atacacagta cgtgtgcaag cggaccctgg tggacagagg ctggggcaat 900
ggctgtggcc tgtttggcaa gggctctctg gtgacatgcg ccaagttcgc ctgtagcaag 960
aagatgaccg gcaagtccat ccagccagag aacctggagt accggatcat gctgtctgtg 1020
cacggctccc agcactctgg catgatcgtg aacgacacag gccacgagac agatgagaat 1080
cgggccaagg tggagatcac acctaactct ccaagagccg aggccaccct gggaggattt 1140
ggctctctgg gcctggactg cgagcctaga acaggcctgg acttctccga tctgtactat 1200
ctgaccatga acaataagca ctggctggtg cacaaggagt ggtttcacga catcccactg 1260
ccatggcacg caggagcaga tacaggaaca ccacactgga acaataagga ggccctggtg 1320
gagttcaagg atgcccacgc caagcggcag acagtggtgg tgctgggcag ccaggaggga 1380
gcagtgcaca ccgccctggc aggcgccctg gaggcagaga tggacggagc taagggcaga 1440
ctgtctagcg gccacctgaa gtgcaggctg aagatggata agctgcgcct gaagggcgtg 1500
tcctactctc tgtgcacagc cgccttcacc ttcaccaaga tccctgccga gacactgcac 1560
ggcacagtga ccgtggaggt gcagtatgcc ggcacagacg gaccctgtaa ggtgcctgcc 1620
cagatggccg tggatatgca gacactgaca cctgtgggca ggctgatcac cgccaatcca 1680
gtgatcacag agtctaccga gaacagcaag atgatgctgg agctggaccc accatttggc 1740
gatagctata tcgtgatcgg cgtgggcgag aagaagatca cacaccactg gcaccgcagc 1800
ggctccacaa tcggcaaggc ctttgaggca accgtgcgcg gagcaaagag aatggccgtg 1860
ctgggcgaca ccgcatggga tttcggatct gtgggaggcg ccctgaacag cctgggcaag 1920
ggcatccacc agatcttcgg cgccgccttt aagtccctgt tcggcggcat gagctggttc 1980
tcacagatcc tgatcggcac actgctgatg tggctgggcc tgaacaccaa gaatggctct 2040
atcagcctga tgtgcctggc cctgggaggc gtgctgatct tcctgtccac cgccgtgtct 2100
gcctga 2106
<210> 107
<211> 701
<212> PRT
<213> Artificial sequence
<220>
<223> MVsp_ZikaprME (C1)
<400> 107
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln Ile His Ala Glu Val
20 25 30
Thr Arg Arg Gly Ser Ala Tyr Tyr Met Tyr Leu Asp Arg Asn Asp Ala
35 40 45
Gly Glu Ala Ile Ser Phe Pro Thr Thr Leu Gly Met Asn Lys Cys Tyr
50 55 60
Ile Gln Ile Met Asp Leu Gly His Met Cys Asp Ala Thr Met Ser Tyr
65 70 75 80
Glu Cys Pro Met Leu Asp Glu Gly Val Glu Pro Asp Asp Val Asp Cys
85 90 95
Trp Cys Asn Thr Thr Ser Thr Trp Val Val Tyr Gly Thr Cys His His
100 105 110
Lys Lys Gly Glu Ala Arg Arg Ser Arg Arg Ala Val Thr Leu Pro Ser
115 120 125
His Ser Thr Arg Lys Leu Gln Thr Arg Ser Gln Thr Trp Leu Glu Ser
130 135 140
Arg Glu Tyr Thr Lys His Leu Ile Arg Val Glu Asn Trp Ile Phe Arg
145 150 155 160
Asn Pro Gly Phe Ala Leu Ala Ala Ala Ala Ile Ala Trp Leu Leu Gly
165 170 175
Ser Ser Thr Ser Gln Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile
180 185 190
Ala Pro Ala Tyr Ser Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe
195 200 205
Val Glu Gly Met Ser Gly Gly Thr Trp Val Asp Val Val Leu Glu His
210 215 220
Gly Gly Cys Val Thr Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile
225 230 235 240
Glu Leu Val Thr Thr Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr
245 250 255
Cys Tyr Glu Ala Ser Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro
260 265 270
Thr Gln Gly Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val
275 280 285
Cys Lys Arg Thr Leu Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu
290 295 300
Phe Gly Lys Gly Ser Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys
305 310 315 320
Lys Met Thr Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile
325 330 335
Met Leu Ser Val His Gly Ser Gln His Ser Gly Met Ile Val Asn Asp
340 345 350
Thr Gly His Glu Thr Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro
355 360 365
Asn Ser Pro Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly
370 375 380
Leu Asp Cys Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr
385 390 395 400
Leu Thr Met Asn Asn Lys His Trp Leu Val His Lys Glu Trp Phe His
405 410 415
Asp Ile Pro Leu Pro Trp His Ala Gly Ala Asp Thr Gly Thr Pro His
420 425 430
Trp Asn Asn Lys Glu Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys
435 440 445
Arg Gln Thr Val Val Val Leu Gly Ser Gln Glu Gly Ala Val His Thr
450 455 460
Ala Leu Ala Gly Ala Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg
465 470 475 480
Leu Ser Ser Gly His Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg
485 490 495
Leu Lys Gly Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr
500 505 510
Lys Ile Pro Ala Glu Thr Leu His Gly Thr Val Thr Val Glu Val Gln
515 520 525
Tyr Ala Gly Thr Asp Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val
530 535 540
Asp Met Gln Thr Leu Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro
545 550 555 560
Val Ile Thr Glu Ser Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp
565 570 575
Pro Pro Phe Gly Asp Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys
580 585 590
Ile Thr His His Trp His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe
595 600 605
Glu Ala Thr Val Arg Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr
610 615 620
Ala Trp Asp Phe Gly Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys
625 630 635 640
Gly Ile His Gln Ile Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly
645 650 655
Met Ser Trp Phe Ser Gln Ile Leu Ile Gly Thr Leu Leu Met Trp Leu
660 665 670
Gly Leu Asn Thr Lys Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu
675 680 685
Gly Gly Val Leu Ile Phe Leu Ser Thr Ala Val Ser Ala
690 695 700
<210> 108
<211> 1962
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of MVsp_Zika_prME_no_anchor (C2)
<400> 108
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca aatccatgcg gaggtcacta gacgtgggag tgcatactat 120
atgtacttgg acagaaacga tgctggggag gccatatctt ttccaaccac attggggatg 180
aataagtgtt atatacagat catggatctt ggacacatgt gtgatgccac catgagctat 240
gaatgcccta tgctggatga gggggtggaa ccagatgacg tcgattgttg gtgcaacacg 300
acgtcaactt gggttgtgta cggaacctgc catcacaaaa aaggtgaagc acggagatct 360
agaagagctg tgacgctccc ctcccattcc actaggaagc tgcaaacgcg gtcgcaaacc 420
tggttggaat caagagaata cacaaagcac ttgattagag tcgaaaattg gatattcagg 480
aaccctggct tcgcgttagc agcagctgcc atcgcttggc ttttgggaag ctcaacgagc 540
caaaaagtca tatacttggt catgatactg ctgattgccc cggcatacag catcaggtgc 600
ataggagtca gcaataggga ctttgtggaa ggtatgtcag gtgggacttg ggttgatgtt 660
gtcttggaac atggaggttg tgtcaccgta atggcacagg acaaaccgac tgtcgacata 720
gagctggtta caacaacagt cagcaacatg gcggaggtaa gatcctactg ctatgaggca 780
tcaatatcag acatggcttc ggacagccgc tgcccaacac aaggtgaagc ctaccttgac 840
aagcaatcag acactcaata tgtctgcaaa agaacgttag tggacagagg ctggggaaat 900
ggatgtggac tttttggcaa agggagcctg gtgacatgcg ctaagtttgc atgctccaag 960
aaaatgaccg ggaagagcat ccagccagag aatctggagt accggataat gctgtcagtt 1020
catggctccc agcacagtgg gatgatcgtt aatgacacag gacatgaaac tgatgagaat 1080
agagcgaagg ttgagataac gcccaattca ccaagagccg aagccaccct ggggggtttt 1140
ggaagcctag gacttgattg tgaaccgagg acaggccttg acttttcaga tttgtattac 1200
ttgactatga ataacaagca ctggttggtt cacaaggagt ggttccacga cattccatta 1260
ccttggcacg ctggggcaga caccggaact ccacactgga acaacaaaga agcactggta 1320
gagttcaagg acgcacatgc caaaaggcaa actgtcgtgg ttctagggag tcaagaagga 1380
gcagttcaca cggcccttgc tggagctctg gaggctgaga tggatggtgc aaagggaagg 1440
ctgtcctctg gccacttgaa atgtcgcctg aaaatggata aacttagatt gaagggcgtg 1500
tcatactcct tgtgtaccgc agcgttcaca ttcaccaaga tcccggctga aacactgcac 1560
gggacagtca cagtggaggt acagtacgca gggacagatg gaccttgcaa ggttccagct 1620
cagatggcgg tggacatgca aactctgacc ccagttggga ggttgataac cgctaacccc 1680
gtaatcactg aaagcactga gaactctaag atgatgctgg aacttgatcc accatttggg 1740
gactcttaca ttgtcatagg agtcggggag aagaagatca cccaccactg gcacaggagt 1800
ggcagcacca ttggaaaagc atttgaagcc actgtgagag gtgccaagag aatggcagtc 1860
ttgggagaca cagcctggga ctttggatca gttggaggcg ctctcaactc attgggcaag 1920
ggcatccatc aaatttttgg agcagctttc aaatcattgt ga 1962
<210> 109
<211> 1962
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of MVsp_Zika_prME_no_anchor (C2)
<400> 109
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca gatccacgca gaggtgacca ggagaggaag cgcctactat 120
atgtacctgg acaggaatga tgccggcgag gccatctcct tcccaaccac actgggcatg 180
aacaagtgct acatccagat catggacctg ggccacatgt gcgatgccac catgtcctat 240
gagtgtccaa tgctggacga gggcgtggag cccgacgatg tggattgctg gtgtaatacc 300
acatctacat gggtggtgta cggcacctgt caccacaaga agggagaggc ccggcggagc 360
cggcgggccg tgacactgcc ttcccactct accaggaagc tgcagacacg cagccagacc 420
tggctggagt ccagagagta taccaagcac ctgatcaggg tggagaactg gatctttcgc 480
aatccaggat tcgcactggc agcagcagca atcgcatggc tgctgggaag ctccaccagc 540
cagaaagtga tctacctggt catgatcctg ctgatcgctc ctgcctattc tatccggtgc 600
atcggcgtga gcaatagaga cttcgtggag ggaatgtccg gaggaacctg ggtggatgtg 660
gtgctggagc acggcggctg cgtgacagtg atggcccagg acaagccaac cgtggatatc 720
gagctggtga ccacaaccgt gtccaacatg gccgaggtga ggtcttactg ctatgaggcc 780
agcatctccg acatggcctc tgatagcagg tgtccaaccc agggagaggc atacctggac 840
aagcagtccg atacacagta cgtgtgcaag cggaccctgg tggacagagg ctggggcaat 900
ggctgtggcc tgtttggcaa gggctctctg gtgacatgcg ccaagttcgc ctgtagcaag 960
aagatgaccg gcaagtccat ccagccagag aacctggagt accggatcat gctgtctgtg 1020
cacggctccc agcactctgg catgatcgtg aacgacacag gccacgagac agatgagaat 1080
cgggccaagg tggagatcac acctaactct ccaagagccg aggccaccct gggaggattt 1140
ggctctctgg gcctggactg cgagcctaga acaggcctgg acttctccga tctgtactat 1200
ctgaccatga acaataagca ctggctggtg cacaaggagt ggtttcacga catcccactg 1260
ccatggcacg caggagcaga tacaggaaca ccacactgga acaataagga ggccctggtg 1320
gagttcaagg atgcccacgc caagcggcag acagtggtgg tgctgggcag ccaggaggga 1380
gcagtgcaca ccgccctggc aggcgccctg gaggcagaga tggacggagc taagggcaga 1440
ctgtctagcg gccacctgaa gtgcaggctg aagatggata agctgcgcct gaagggcgtg 1500
tcctactctc tgtgcacagc cgccttcacc ttcaccaaga tccctgccga gacactgcac 1560
ggcacagtga ccgtggaggt gcagtatgcc ggcacagacg gaccctgtaa ggtgcctgcc 1620
cagatggccg tggatatgca gacactgaca cctgtgggca ggctgatcac cgccaatcca 1680
gtgatcacag agtctaccga gaacagcaag atgatgctgg agctggaccc accatttggc 1740
gatagctata tcgtgatcgg cgtgggcgag aagaagatca cacaccactg gcaccgcagc 1800
ggctccacaa tcggcaaggc ctttgaggca accgtgcgcg gagcaaagag aatggccgtg 1860
ctgggcgaca ccgcatggga tttcggatct gtgggaggcg ccctgaacag cctgggcaag 1920
ggcatccacc agatcttcgg cgccgccttt aagtccctgt ga 1962
<210> 110
<211> 653
<212> PRT
<213> Artificial sequence
<220>
<223> MVsp_Zika_prME_no_anchor (C2)
<400> 110
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln Ile His Ala Glu Val
20 25 30
Thr Arg Arg Gly Ser Ala Tyr Tyr Met Tyr Leu Asp Arg Asn Asp Ala
35 40 45
Gly Glu Ala Ile Ser Phe Pro Thr Thr Leu Gly Met Asn Lys Cys Tyr
50 55 60
Ile Gln Ile Met Asp Leu Gly His Met Cys Asp Ala Thr Met Ser Tyr
65 70 75 80
Glu Cys Pro Met Leu Asp Glu Gly Val Glu Pro Asp Asp Val Asp Cys
85 90 95
Trp Cys Asn Thr Thr Ser Thr Trp Val Val Tyr Gly Thr Cys His His
100 105 110
Lys Lys Gly Glu Ala Arg Arg Ser Arg Arg Ala Val Thr Leu Pro Ser
115 120 125
His Ser Thr Arg Lys Leu Gln Thr Arg Ser Gln Thr Trp Leu Glu Ser
130 135 140
Arg Glu Tyr Thr Lys His Leu Ile Arg Val Glu Asn Trp Ile Phe Arg
145 150 155 160
Asn Pro Gly Phe Ala Leu Ala Ala Ala Ala Ile Ala Trp Leu Leu Gly
165 170 175
Ser Ser Thr Ser Gln Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile
180 185 190
Ala Pro Ala Tyr Ser Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe
195 200 205
Val Glu Gly Met Ser Gly Gly Thr Trp Val Asp Val Val Leu Glu His
210 215 220
Gly Gly Cys Val Thr Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile
225 230 235 240
Glu Leu Val Thr Thr Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr
245 250 255
Cys Tyr Glu Ala Ser Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro
260 265 270
Thr Gln Gly Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val
275 280 285
Cys Lys Arg Thr Leu Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu
290 295 300
Phe Gly Lys Gly Ser Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys
305 310 315 320
Lys Met Thr Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile
325 330 335
Met Leu Ser Val His Gly Ser Gln His Ser Gly Met Ile Val Asn Asp
340 345 350
Thr Gly His Glu Thr Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro
355 360 365
Asn Ser Pro Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly
370 375 380
Leu Asp Cys Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr
385 390 395 400
Leu Thr Met Asn Asn Lys His Trp Leu Val His Lys Glu Trp Phe His
405 410 415
Asp Ile Pro Leu Pro Trp His Ala Gly Ala Asp Thr Gly Thr Pro His
420 425 430
Trp Asn Asn Lys Glu Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys
435 440 445
Arg Gln Thr Val Val Val Leu Gly Ser Gln Glu Gly Ala Val His Thr
450 455 460
Ala Leu Ala Gly Ala Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg
465 470 475 480
Leu Ser Ser Gly His Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg
485 490 495
Leu Lys Gly Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr
500 505 510
Lys Ile Pro Ala Glu Thr Leu His Gly Thr Val Thr Val Glu Val Gln
515 520 525
Tyr Ala Gly Thr Asp Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val
530 535 540
Asp Met Gln Thr Leu Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro
545 550 555 560
Val Ile Thr Glu Ser Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp
565 570 575
Pro Pro Phe Gly Asp Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys
580 585 590
Ile Thr His His Trp His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe
595 600 605
Glu Ala Thr Val Arg Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr
610 615 620
Ala Trp Asp Phe Gly Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys
625 630 635 640
Gly Ile His Gln Ile Phe Gly Ala Ala Phe Lys Ser Leu
645 650
<210> 111
<211> 1932
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of MVsp_Zika_prME411 (C3)
<400> 111
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca aatccatgcg gaggtcacta gacgtgggag tgcatactat 120
atgtacttgg acagaaacga tgctggggag gccatatctt ttccaaccac attggggatg 180
aataagtgtt atatacagat catggatctt ggacacatgt gtgatgccac catgagctat 240
gaatgcccta tgctggatga gggggtggaa ccagatgacg tcgattgttg gtgcaacacg 300
acgtcaactt gggttgtgta cggaacctgc catcacaaaa aaggtgaagc acggagatct 360
agaagagctg tgacgctccc ctcccattcc actaggaagc tgcaaacgcg gtcgcaaacc 420
tggttggaat caagagaata cacaaagcac ttgattagag tcgaaaattg gatattcagg 480
aaccctggct tcgcgttagc agcagctgcc atcgcttggc ttttgggaag ctcaacgagc 540
caaaaagtca tatacttggt catgatactg ctgattgccc cggcatacag catcaggtgc 600
ataggagtca gcaataggga ctttgtggaa ggtatgtcag gtgggacttg ggttgatgtt 660
gtcttggaac atggaggttg tgtcaccgta atggcacagg acaaaccgac tgtcgacata 720
gagctggtta caacaacagt cagcaacatg gcggaggtaa gatcctactg ctatgaggca 780
tcaatatcag acatggcttc ggacagccgc tgcccaacac aaggtgaagc ctaccttgac 840
aagcaatcag acactcaata tgtctgcaaa agaacgttag tggacagagg ctggggaaat 900
ggatgtggac tttttggcaa agggagcctg gtgacatgcg ctaagtttgc atgctccaag 960
aaaatgaccg ggaagagcat ccagccagag aatctggagt accggataat gctgtcagtt 1020
catggctccc agcacagtgg gatgatcgtt aatgacacag gacatgaaac tgatgagaat 1080
agagcgaagg ttgagataac gcccaattca ccaagagccg aagccaccct ggggggtttt 1140
ggaagcctag gacttgattg tgaaccgagg acaggccttg acttttcaga tttgtattac 1200
ttgactatga ataacaagca ctggttggtt cacaaggagt ggttccacga cattccatta 1260
ccttggcacg ctggggcaga caccggaact ccacactgga acaacaaaga agcactggta 1320
gagttcaagg acgcacatgc caaaaggcaa actgtcgtgg ttctagggag tcaagaagga 1380
gcagttcaca cggcccttgc tggagctctg gaggctgaga tggatggtgc aaagggaagg 1440
ctgtcctctg gccacttgaa atgtcgcctg aaaatggata aacttagatt gaagggcgtg 1500
tcatactcct tgtgtaccgc agcgttcaca ttcaccaaga tcccggctga aacactgcac 1560
gggacagtca cagtggaggt acagtacgca gggacagatg gaccttgcaa ggttccagct 1620
cagatggcgg tggacatgca aactctgacc ccagttggga ggttgataac cgctaacccc 1680
gtaatcactg aaagcactga gaactctaag atgatgctgg aacttgatcc accatttggg 1740
gactcttaca ttgtcatagg agtcggggag aagaagatca cccaccactg gcacaggagt 1800
ggcagcacca ttggaaaagc atttgaagcc actgtgagag gtgccaagag aatggcagtc 1860
ttgggagaca cagcctggga ctttggatca gttggaggcg ctctcaactc attgggcaag 1920
ggcatctgat ga 1932
<210> 112
<211> 1932
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of MVsp_Zika_prME411 (C3)
<400> 112
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca gatccacgca gaggtgacca ggagaggaag cgcctactat 120
atgtacctgg acaggaatga tgccggcgag gccatctcct tcccaaccac actgggcatg 180
aacaagtgct acatccagat catggacctg ggccacatgt gcgatgccac catgtcctat 240
gagtgtccaa tgctggacga gggcgtggag cccgacgatg tggattgctg gtgtaatacc 300
acatctacat gggtggtgta cggcacctgt caccacaaga agggagaggc ccggcggagc 360
cggcgggccg tgacactgcc ttcccactct accaggaagc tgcagacacg cagccagacc 420
tggctggagt ccagagagta taccaagcac ctgatcaggg tggagaactg gatctttcgc 480
aatccaggat tcgcactggc agcagcagca atcgcatggc tgctgggaag ctccaccagc 540
cagaaagtga tctacctggt catgatcctg ctgatcgctc ctgcctattc tatccggtgc 600
atcggcgtga gcaatagaga cttcgtggag ggaatgtccg gaggaacctg ggtggatgtg 660
gtgctggagc acggcggctg cgtgacagtg atggcccagg acaagccaac cgtggatatc 720
gagctggtga ccacaaccgt gtccaacatg gccgaggtga ggtcttactg ctatgaggcc 780
agcatctccg acatggcctc tgatagcagg tgtccaaccc agggagaggc atacctggac 840
aagcagtccg atacacagta cgtgtgcaag cggaccctgg tggacagagg ctggggcaat 900
ggctgtggcc tgtttggcaa gggctctctg gtgacatgcg ccaagttcgc ctgtagcaag 960
aagatgaccg gcaagtccat ccagccagag aacctggagt accggatcat gctgtctgtg 1020
cacggctccc agcactctgg catgatcgtg aacgacacag gccacgagac agatgagaat 1080
cgggccaagg tggagatcac acctaactct ccaagagccg aggccaccct gggaggattt 1140
ggctctctgg gcctggactg cgagcctaga acaggcctgg acttctccga tctgtactat 1200
ctgaccatga acaataagca ctggctggtg cacaaggagt ggtttcacga catcccactg 1260
ccatggcacg caggagcaga tacaggaaca ccacactgga acaataagga ggccctggtg 1320
gagttcaagg atgcccacgc caagcggcag acagtggtgg tgctgggcag ccaggaggga 1380
gcagtgcaca ccgccctggc aggcgccctg gaggcagaga tggacggagc taagggcaga 1440
ctgtctagcg gccacctgaa gtgcaggctg aagatggata agctgcgcct gaagggcgtg 1500
tcctactctc tgtgcacagc cgccttcacc ttcaccaaga tccctgccga gacactgcac 1560
ggcacagtga ccgtggaggt gcagtatgcc ggcacagacg gaccctgtaa ggtgcctgcc 1620
cagatggccg tggatatgca gacactgaca cctgtgggca ggctgatcac cgccaatcca 1680
gtgatcacag agtctaccga gaacagcaag atgatgctgg agctggaccc accatttggc 1740
gatagctata tcgtgatcgg cgtgggcgag aagaagatca cacaccactg gcaccgcagc 1800
ggctccacaa tcggcaaggc ctttgaggca accgtgcgcg gagcaaagag aatggccgtg 1860
ctgggcgaca ccgcatggga tttcggatct gtgggaggcg ccctgaacag cctgggcaag 1920
ggcatctgat ga 1932
<210> 113
<211> 642
<212> PRT
<213> Artificial sequence
<220>
<223> MVsp_Zika_prME411 (C3)
<400> 113
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln Ile His Ala Glu Val
20 25 30
Thr Arg Arg Gly Ser Ala Tyr Tyr Met Tyr Leu Asp Arg Asn Asp Ala
35 40 45
Gly Glu Ala Ile Ser Phe Pro Thr Thr Leu Gly Met Asn Lys Cys Tyr
50 55 60
Ile Gln Ile Met Asp Leu Gly His Met Cys Asp Ala Thr Met Ser Tyr
65 70 75 80
Glu Cys Pro Met Leu Asp Glu Gly Val Glu Pro Asp Asp Val Asp Cys
85 90 95
Trp Cys Asn Thr Thr Ser Thr Trp Val Val Tyr Gly Thr Cys His His
100 105 110
Lys Lys Gly Glu Ala Arg Arg Ser Arg Arg Ala Val Thr Leu Pro Ser
115 120 125
His Ser Thr Arg Lys Leu Gln Thr Arg Ser Gln Thr Trp Leu Glu Ser
130 135 140
Arg Glu Tyr Thr Lys His Leu Ile Arg Val Glu Asn Trp Ile Phe Arg
145 150 155 160
Asn Pro Gly Phe Ala Leu Ala Ala Ala Ala Ile Ala Trp Leu Leu Gly
165 170 175
Ser Ser Thr Ser Gln Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile
180 185 190
Ala Pro Ala Tyr Ser Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe
195 200 205
Val Glu Gly Met Ser Gly Gly Thr Trp Val Asp Val Val Leu Glu His
210 215 220
Gly Gly Cys Val Thr Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile
225 230 235 240
Glu Leu Val Thr Thr Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr
245 250 255
Cys Tyr Glu Ala Ser Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro
260 265 270
Thr Gln Gly Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val
275 280 285
Cys Lys Arg Thr Leu Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu
290 295 300
Phe Gly Lys Gly Ser Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys
305 310 315 320
Lys Met Thr Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile
325 330 335
Met Leu Ser Val His Gly Ser Gln His Ser Gly Met Ile Val Asn Asp
340 345 350
Thr Gly His Glu Thr Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro
355 360 365
Asn Ser Pro Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly
370 375 380
Leu Asp Cys Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr
385 390 395 400
Leu Thr Met Asn Asn Lys His Trp Leu Val His Lys Glu Trp Phe His
405 410 415
Asp Ile Pro Leu Pro Trp His Ala Gly Ala Asp Thr Gly Thr Pro His
420 425 430
Trp Asn Asn Lys Glu Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys
435 440 445
Arg Gln Thr Val Val Val Leu Gly Ser Gln Glu Gly Ala Val His Thr
450 455 460
Ala Leu Ala Gly Ala Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg
465 470 475 480
Leu Ser Ser Gly His Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg
485 490 495
Leu Lys Gly Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr
500 505 510
Lys Ile Pro Ala Glu Thr Leu His Gly Thr Val Thr Val Glu Val Gln
515 520 525
Tyr Ala Gly Thr Asp Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val
530 535 540
Asp Met Gln Thr Leu Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro
545 550 555 560
Val Ile Thr Glu Ser Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp
565 570 575
Pro Pro Phe Gly Asp Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys
580 585 590
Ile Thr His His Trp His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe
595 600 605
Glu Ala Thr Val Arg Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr
610 615 620
Ala Trp Asp Phe Gly Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys
625 630 635 640
Gly Ile
<210> 114
<211> 1806
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of MVsp_Zika_prME395 (C4)
<400> 114
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca aatccatgcg gaggtcacta gacgtgggag tgcatactat 120
atgtacttgg acagaaacga tgctggggag gccatatctt ttccaaccac attggggatg 180
aataagtgtt atatacagat catggatctt ggacacatgt gtgatgccac catgagctat 240
gaatgcccta tgctggatga gggggtggaa ccagatgacg tcgattgttg gtgcaacacg 300
acgtcaactt gggttgtgta cggaacctgc catcacaaaa aaggtgaagc acggagatct 360
agaagagctg tgacgctccc ctcccattcc actaggaagc tgcaaacgcg gtcgcaaacc 420
tggttggaat caagagaata cacaaagcac ttgattagag tcgaaaattg gatattcagg 480
aaccctggct tcgcgttagc agcagctgcc atcgcttggc ttttgggaag ctcaacgagc 540
caaaaagtca tatacttggt catgatactg ctgattgccc cggcatacag catcaggtgc 600
ataggagtca gcaataggga ctttgtggaa ggtatgtcag gtgggacttg ggttgatgtt 660
gtcttggaac atggaggttg tgtcaccgta atggcacagg acaaaccgac tgtcgacata 720
gagctggtta caacaacagt cagcaacatg gcggaggtaa gatcctactg ctatgaggca 780
tcaatatcag acatggcttc ggacagccgc tgcccaacac aaggtgaagc ctaccttgac 840
aagcaatcag acactcaata tgtctgcaaa agaacgttag tggacagagg ctggggaaat 900
ggatgtggac tttttggcaa agggagcctg gtgacatgcg ctaagtttgc atgctccaag 960
aaaatgaccg ggaagagcat ccagccagag aatctggagt accggataat gctgtcagtt 1020
catggctccc agcacagtgg gatgatcgtt aatgacacag gacatgaaac tgatgagaat 1080
agagcgaagg ttgagataac gcccaattca ccaagagccg aagccaccct ggggggtttt 1140
ggaagcctag gacttgattg tgaaccgagg acaggccttg acttttcaga tttgtattac 1200
ttgactatga ataacaagca ctggttggtt cacaaggagt ggttccacga cattccatta 1260
ccttggcacg ctggggcaga caccggaact ccacactgga acaacaaaga agcactggta 1320
gagttcaagg acgcacatgc caaaaggcaa actgtcgtgg ttctagggag tcaagaagga 1380
gcagttcaca cggcccttgc tggagctctg gaggctgaga tggatggtgc aaagggaagg 1440
ctgtcctctg gccacttgaa atgtcgcctg aaaatggata aacttagatt gaagggcgtg 1500
tcatactcct tgtgtaccgc agcgttcaca ttcaccaaga tcccggctga aacactgcac 1560
gggacagtca cagtggaggt acagtacgca gggacagatg gaccttgcaa ggttccagct 1620
cagatggcgg tggacatgca aactctgacc ccagttggga ggttgataac cgctaacccc 1680
gtaatcactg aaagcactga gaactctaag atgatgctgg aacttgatcc accatttggg 1740
gactcttaca ttgtcatagg agtcggggag aagaagatca cccaccactg gcacaggagt 1800
ggctga 1806
<210> 115
<211> 1806
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of MVsp_Zika_prME395 (C4)
<400> 115
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca gatccacgca gaggtgacca ggagaggaag cgcctactat 120
atgtacctgg acaggaatga tgccggcgag gccatctcct tcccaaccac actgggcatg 180
aacaagtgct acatccagat catggacctg ggccacatgt gcgatgccac catgtcctat 240
gagtgtccaa tgctggacga gggcgtggag cccgacgatg tggattgctg gtgtaatacc 300
acatctacat gggtggtgta cggcacctgt caccacaaga agggagaggc ccggcggagc 360
cggcgggccg tgacactgcc ttcccactct accaggaagc tgcagacacg cagccagacc 420
tggctggagt ccagagagta taccaagcac ctgatcaggg tggagaactg gatctttcgc 480
aatccaggat tcgcactggc agcagcagca atcgcatggc tgctgggaag ctccaccagc 540
cagaaagtga tctacctggt catgatcctg ctgatcgctc ctgcctattc tatccggtgc 600
atcggcgtga gcaatagaga cttcgtggag ggaatgtccg gaggaacctg ggtggatgtg 660
gtgctggagc acggcggctg cgtgacagtg atggcccagg acaagccaac cgtggatatc 720
gagctggtga ccacaaccgt gtccaacatg gccgaggtga ggtcttactg ctatgaggcc 780
agcatctccg acatggcctc tgatagcagg tgtccaaccc agggagaggc atacctggac 840
aagcagtccg atacacagta cgtgtgcaag cggaccctgg tggacagagg ctggggcaat 900
ggctgtggcc tgtttggcaa gggctctctg gtgacatgcg ccaagttcgc ctgtagcaag 960
aagatgaccg gcaagtccat ccagccagag aacctggagt accggatcat gctgtctgtg 1020
cacggctccc agcactctgg catgatcgtg aacgacacag gccacgagac agatgagaat 1080
cgggccaagg tggagatcac acctaactct ccaagagccg aggccaccct gggaggattt 1140
ggctctctgg gcctggactg cgagcctaga acaggcctgg acttctccga tctgtactat 1200
ctgaccatga acaataagca ctggctggtg cacaaggagt ggtttcacga catcccactg 1260
ccatggcacg caggagcaga tacaggaaca ccacactgga acaataagga ggccctggtg 1320
gagttcaagg atgcccacgc caagcggcag acagtggtgg tgctgggcag ccaggaggga 1380
gcagtgcaca ccgccctggc aggcgccctg gaggcagaga tggacggagc taagggcaga 1440
ctgtctagcg gccacctgaa gtgcaggctg aagatggata agctgcgcct gaagggcgtg 1500
tcctactctc tgtgcacagc cgccttcacc ttcaccaaga tccctgccga gacactgcac 1560
ggcacagtga ccgtggaggt gcagtatgcc ggcacagacg gaccctgtaa ggtgcctgcc 1620
cagatggccg tggatatgca gacactgaca cctgtgggca ggctgatcac cgccaatcca 1680
gtgatcacag agtctaccga gaacagcaag atgatgctgg agctggaccc accatttggc 1740
gatagctata tcgtgatcgg cgtgggcgag aagaagatca cacaccactg gcaccgcagc 1800
ggctga 1806
<210> 116
<211> 601
<212> PRT
<213> Artificial sequence
<220>
<223> MVsp_Zika_prME395 (C4)
<400> 116
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln Ile His Ala Glu Val
20 25 30
Thr Arg Arg Gly Ser Ala Tyr Tyr Met Tyr Leu Asp Arg Asn Asp Ala
35 40 45
Gly Glu Ala Ile Ser Phe Pro Thr Thr Leu Gly Met Asn Lys Cys Tyr
50 55 60
Ile Gln Ile Met Asp Leu Gly His Met Cys Asp Ala Thr Met Ser Tyr
65 70 75 80
Glu Cys Pro Met Leu Asp Glu Gly Val Glu Pro Asp Asp Val Asp Cys
85 90 95
Trp Cys Asn Thr Thr Ser Thr Trp Val Val Tyr Gly Thr Cys His His
100 105 110
Lys Lys Gly Glu Ala Arg Arg Ser Arg Arg Ala Val Thr Leu Pro Ser
115 120 125
His Ser Thr Arg Lys Leu Gln Thr Arg Ser Gln Thr Trp Leu Glu Ser
130 135 140
Arg Glu Tyr Thr Lys His Leu Ile Arg Val Glu Asn Trp Ile Phe Arg
145 150 155 160
Asn Pro Gly Phe Ala Leu Ala Ala Ala Ala Ile Ala Trp Leu Leu Gly
165 170 175
Ser Ser Thr Ser Gln Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile
180 185 190
Ala Pro Ala Tyr Ser Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe
195 200 205
Val Glu Gly Met Ser Gly Gly Thr Trp Val Asp Val Val Leu Glu His
210 215 220
Gly Gly Cys Val Thr Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile
225 230 235 240
Glu Leu Val Thr Thr Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr
245 250 255
Cys Tyr Glu Ala Ser Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro
260 265 270
Thr Gln Gly Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val
275 280 285
Cys Lys Arg Thr Leu Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu
290 295 300
Phe Gly Lys Gly Ser Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys
305 310 315 320
Lys Met Thr Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile
325 330 335
Met Leu Ser Val His Gly Ser Gln His Ser Gly Met Ile Val Asn Asp
340 345 350
Thr Gly His Glu Thr Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro
355 360 365
Asn Ser Pro Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly
370 375 380
Leu Asp Cys Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr
385 390 395 400
Leu Thr Met Asn Asn Lys His Trp Leu Val His Lys Glu Trp Phe His
405 410 415
Asp Ile Pro Leu Pro Trp His Ala Gly Ala Asp Thr Gly Thr Pro His
420 425 430
Trp Asn Asn Lys Glu Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys
435 440 445
Arg Gln Thr Val Val Val Leu Gly Ser Gln Glu Gly Ala Val His Thr
450 455 460
Ala Leu Ala Gly Ala Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg
465 470 475 480
Leu Ser Ser Gly His Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg
485 490 495
Leu Lys Gly Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr
500 505 510
Lys Ile Pro Ala Glu Thr Leu His Gly Thr Val Thr Val Glu Val Gln
515 520 525
Tyr Ala Gly Thr Asp Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val
530 535 540
Asp Met Gln Thr Leu Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro
545 550 555 560
Val Ile Thr Glu Ser Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp
565 570 575
Pro Pro Phe Gly Asp Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys
580 585 590
Ile Thr His His Trp His Arg Ser Gly
595 600
<210> 117
<211> 1602
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of MVsp_ZikaE (C5)
<400> 117
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca aatccatatc aggtgcatag gagtcagcaa tagggacttt 120
gtggaaggta tgtcaggtgg gacttgggtt gatgttgtct tggaacatgg aggttgtgtc 180
accgtaatgg cacaggacaa accgactgtc gacatagagc tggttacaac aacagtcagc 240
aacatggcgg aggtaagatc ctactgctat gaggcatcaa tatcagacat ggcttcggac 300
agccgctgcc caacacaagg tgaagcctac cttgacaagc aatcagacac tcaatatgtc 360
tgcaaaagaa cgttagtgga cagaggctgg ggaaatggat gtggactttt tggcaaaggg 420
agcctggtga catgcgctaa gtttgcatgc tccaagaaaa tgaccgggaa gagcatccag 480
ccagagaatc tggagtaccg gataatgctg tcagttcatg gctcccagca cagtgggatg 540
atcgttaatg acacaggaca tgaaactgat gagaatagag cgaaggttga gataacgccc 600
aattcaccaa gagccgaagc caccctgggg ggttttggaa gcctaggact tgattgtgaa 660
ccgaggacag gccttgactt ttcagatttg tattacttga ctatgaataa caagcactgg 720
ttggttcaca aggagtggtt ccacgacatt ccattacctt ggcacgctgg ggcagacacc 780
ggaactccac actggaacaa caaagaagca ctggtagagt tcaaggacgc acatgccaaa 840
aggcaaactg tcgtggttct agggagtcaa gaaggagcag ttcacacggc ccttgctgga 900
gctctggagg ctgagatgga tggtgcaaag ggaaggctgt cctctggcca cttgaaatgt 960
cgcctgaaaa tggataaact tagattgaag ggcgtgtcat actccttgtg taccgcagcg 1020
ttcacattca ccaagatccc ggctgaaaca ctgcacggga cagtcacagt ggaggtacag 1080
tacgcaggga cagatggacc ttgcaaggtt ccagctcaga tggcggtgga catgcaaact 1140
ctgaccccag ttgggaggtt gataaccgct aaccccgtaa tcactgaaag cactgagaac 1200
tctaagatga tgctggaact tgatccacca tttggggact cttacattgt cataggagtc 1260
ggggagaaga agatcaccca ccactggcac aggagtggca gcaccattgg aaaagcattt 1320
gaagccactg tgagaggtgc caagagaatg gcagtcttgg gagacacagc ctgggacttt 1380
ggatcagttg gaggcgctct caactcattg ggcaagggca tccatcaaat ttttggagca 1440
gctttcaaat cattgtttgg aggaatgtcc tggttctcac aaattctcat tggaacgttg 1500
ctgatgtggt tgggtctgaa cacaaagaat ggatctattt cccttatgtg cttggcctta 1560
gggggagtgt tgatcttctt atccacagcc gtctctgctt ga 1602
<210> 118
<211> 1602
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of MVsp_ZikaE (C5)
<400> 118
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca gatccacatc cggtgcatcg gcgtgagcaa tagagacttc 120
gtggagggaa tgtccggagg aacctgggtg gatgtggtgc tggagcacgg cggctgcgtg 180
acagtgatgg cccaggacaa gccaaccgtg gatatcgagc tggtgaccac aaccgtgtcc 240
aacatggccg aggtgaggtc ttactgctat gaggccagca tctccgacat ggcctctgat 300
agcaggtgtc caacccaggg agaggcatac ctggacaagc agtccgatac acagtacgtg 360
tgcaagcgga ccctggtgga cagaggctgg ggcaatggct gtggcctgtt tggcaagggc 420
tctctggtga catgcgccaa gttcgcctgt agcaagaaga tgaccggcaa gtccatccag 480
ccagagaacc tggagtaccg gatcatgctg tctgtgcacg gctcccagca ctctggcatg 540
atcgtgaacg acacaggcca cgagacagat gagaatcggg ccaaggtgga gatcacacct 600
aactctccaa gagccgaggc caccctggga ggatttggct ctctgggcct ggactgcgag 660
cctagaacag gcctggactt ctccgatctg tactatctga ccatgaacaa taagcactgg 720
ctggtgcaca aggagtggtt tcacgacatc ccactgccat ggcacgcagg agcagataca 780
ggaacaccac actggaacaa taaggaggcc ctggtggagt tcaaggatgc ccacgccaag 840
cggcagacag tggtggtgct gggcagccag gagggagcag tgcacaccgc cctggcaggc 900
gccctggagg cagagatgga cggagctaag ggcagactgt ctagcggcca cctgaagtgc 960
aggctgaaga tggataagct gcgcctgaag ggcgtgtcct actctctgtg cacagccgcc 1020
ttcaccttca ccaagatccc tgccgagaca ctgcacggca cagtgaccgt ggaggtgcag 1080
tatgccggca cagacggacc ctgtaaggtg cctgcccaga tggccgtgga tatgcagaca 1140
ctgacacctg tgggcaggct gatcaccgcc aatccagtga tcacagagtc taccgagaac 1200
agcaagatga tgctggagct ggacccacca tttggcgata gctatatcgt gatcggcgtg 1260
ggcgagaaga agatcacaca ccactggcac cgcagcggct ccacaatcgg caaggccttt 1320
gaggcaaccg tgcgcggagc aaagagaatg gccgtgctgg gcgacaccgc atgggatttc 1380
ggatctgtgg gaggcgccct gaacagcctg ggcaagggca tccaccagat cttcggcgcc 1440
gcctttaagt ccctgttcgg cggcatgagc tggttctcac agatcctgat cggcacactg 1500
ctgatgtggc tgggcctgaa caccaagaat ggctctatca gcctgatgtg cctggccctg 1560
ggaggcgtgc tgatcttcct gtccaccgcc gtgtctgcct ga 1602
<210> 119
<211> 533
<212> PRT
<213> Artificial sequence
<220>
<223> MVsp_ZikaE (C5)
<400> 119
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln Ile His Ile Arg Cys
20 25 30
Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser Gly Gly Thr
35 40 45
Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val Thr Val Met Ala
50 55 60
Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr Thr Val Ser
65 70 75 80
Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser Ile Ser Asp
85 90 95
Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala Tyr Leu Asp
100 105 110
Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu Val Asp Arg
115 120 125
Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser Leu Val Thr
130 135 140
Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys Ser Ile Gln
145 150 155 160
Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His Gly Ser Gln
165 170 175
His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr Asp Glu Asn
180 185 190
Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala Glu Ala Thr
195 200 205
Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro Arg Thr Gly
210 215 220
Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn Lys His Trp
225 230 235 240
Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro Trp His Ala
245 250 255
Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu Ala Leu Val
260 265 270
Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val Val Leu Gly
275 280 285
Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala Leu Glu Ala
290 295 300
Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His Leu Lys Cys
305 310 315 320
Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser Tyr Ser Leu
325 330 335
Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu Thr Leu His
340 345 350
Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys
355 360 365
Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu Thr Pro Val
370 375 380
Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser Thr Glu Asn
385 390 395 400
Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile
405 410 415
Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp His Arg Ser
420 425 430
Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg Gly Ala Lys
435 440 445
Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly Ser Val Gly
450 455 460
Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile Phe Gly Ala
465 470 475 480
Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser Gln Ile Leu
485 490 495
Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn Thr Lys Asn Gly Ser
500 505 510
Ile Ser Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile Phe Leu Ser
515 520 525
Thr Ala Val Ser Ala
530
<210> 120
<211> 1458
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of MVsp_ZikaE_no_anchor (C6)
<400> 120
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca aatccatatc aggtgcatag gagtcagcaa tagggacttt 120
gtggaaggta tgtcaggtgg gacttgggtt gatgttgtct tggaacatgg aggttgtgtc 180
accgtaatgg cacaggacaa accgactgtc gacatagagc tggttacaac aacagtcagc 240
aacatggcgg aggtaagatc ctactgctat gaggcatcaa tatcagacat ggcttcggac 300
agccgctgcc caacacaagg tgaagcctac cttgacaagc aatcagacac tcaatatgtc 360
tgcaaaagaa cgttagtgga cagaggctgg ggaaatggat gtggactttt tggcaaaggg 420
agcctggtga catgcgctaa gtttgcatgc tccaagaaaa tgaccgggaa gagcatccag 480
ccagagaatc tggagtaccg gataatgctg tcagttcatg gctcccagca cagtgggatg 540
atcgttaatg acacaggaca tgaaactgat gagaatagag cgaaggttga gataacgccc 600
aattcaccaa gagccgaagc caccctgggg ggttttggaa gcctaggact tgattgtgaa 660
ccgaggacag gccttgactt ttcagatttg tattacttga ctatgaataa caagcactgg 720
ttggttcaca aggagtggtt ccacgacatt ccattacctt ggcacgctgg ggcagacacc 780
ggaactccac actggaacaa caaagaagca ctggtagagt tcaaggacgc acatgccaaa 840
aggcaaactg tcgtggttct agggagtcaa gaaggagcag ttcacacggc ccttgctgga 900
gctctggagg ctgagatgga tggtgcaaag ggaaggctgt cctctggcca cttgaaatgt 960
cgcctgaaaa tggataaact tagattgaag ggcgtgtcat actccttgtg taccgcagcg 1020
ttcacattca ccaagatccc ggctgaaaca ctgcacggga cagtcacagt ggaggtacag 1080
tacgcaggga cagatggacc ttgcaaggtt ccagctcaga tggcggtgga catgcaaact 1140
ctgaccccag ttgggaggtt gataaccgct aaccccgtaa tcactgaaag cactgagaac 1200
tctaagatga tgctggaact tgatccacca tttggggact cttacattgt cataggagtc 1260
ggggagaaga agatcaccca ccactggcac aggagtggca gcaccattgg aaaagcattt 1320
gaagccactg tgagaggtgc caagagaatg gcagtcttgg gagacacagc ctgggacttt 1380
ggatcagttg gaggcgctct caactcattg ggcaagggca tccatcaaat ttttggagca 1440
gctttcaaat cattgtga 1458
<210> 121
<211> 1458
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of MVsp_ZikaE_no_anchor (C6)
<400> 121
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca gatccacatc cggtgcatcg gcgtgagcaa tagagacttc 120
gtggagggaa tgtccggagg aacctgggtg gatgtggtgc tggagcacgg cggctgcgtg 180
acagtgatgg cccaggacaa gccaaccgtg gatatcgagc tggtgaccac aaccgtgtcc 240
aacatggccg aggtgaggtc ttactgctat gaggccagca tctccgacat ggcctctgat 300
agcaggtgtc caacccaggg agaggcatac ctggacaagc agtccgatac acagtacgtg 360
tgcaagcgga ccctggtgga cagaggctgg ggcaatggct gtggcctgtt tggcaagggc 420
tctctggtga catgcgccaa gttcgcctgt agcaagaaga tgaccggcaa gtccatccag 480
ccagagaacc tggagtaccg gatcatgctg tctgtgcacg gctcccagca ctctggcatg 540
atcgtgaacg acacaggcca cgagacagat gagaatcggg ccaaggtgga gatcacacct 600
aactctccaa gagccgaggc caccctggga ggatttggct ctctgggcct ggactgcgag 660
cctagaacag gcctggactt ctccgatctg tactatctga ccatgaacaa taagcactgg 720
ctggtgcaca aggagtggtt tcacgacatc ccactgccat ggcacgcagg agcagataca 780
ggaacaccac actggaacaa taaggaggcc ctggtggagt tcaaggatgc ccacgccaag 840
cggcagacag tggtggtgct gggcagccag gagggagcag tgcacaccgc cctggcaggc 900
gccctggagg cagagatgga cggagctaag ggcagactgt ctagcggcca cctgaagtgc 960
aggctgaaga tggataagct gcgcctgaag ggcgtgtcct actctctgtg cacagccgcc 1020
ttcaccttca ccaagatccc tgccgagaca ctgcacggca cagtgaccgt ggaggtgcag 1080
tatgccggca cagacggacc ctgtaaggtg cctgcccaga tggccgtgga tatgcagaca 1140
ctgacacctg tgggcaggct gatcaccgcc aatccagtga tcacagagtc taccgagaac 1200
agcaagatga tgctggagct ggacccacca tttggcgata gctatatcgt gatcggcgtg 1260
ggcgagaaga agatcacaca ccactggcac cgcagcggct ccacaatcgg caaggccttt 1320
gaggcaaccg tgcgcggagc aaagagaatg gccgtgctgg gcgacaccgc atgggatttc 1380
ggatctgtgg gaggcgccct gaacagcctg ggcaagggca tccaccagat cttcggcgcc 1440
gcctttaagt ccctgtga 1458
<210> 122
<211> 485
<212> PRT
<213> Artificial sequence
<220>
<223> MVsp_ZikaE_no_anchor (C6)
<400> 122
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln Ile His Ile Arg Cys
20 25 30
Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser Gly Gly Thr
35 40 45
Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val Thr Val Met Ala
50 55 60
Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr Thr Val Ser
65 70 75 80
Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser Ile Ser Asp
85 90 95
Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala Tyr Leu Asp
100 105 110
Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu Val Asp Arg
115 120 125
Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser Leu Val Thr
130 135 140
Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys Ser Ile Gln
145 150 155 160
Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His Gly Ser Gln
165 170 175
His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr Asp Glu Asn
180 185 190
Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala Glu Ala Thr
195 200 205
Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro Arg Thr Gly
210 215 220
Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn Lys His Trp
225 230 235 240
Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro Trp His Ala
245 250 255
Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu Ala Leu Val
260 265 270
Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val Val Leu Gly
275 280 285
Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala Leu Glu Ala
290 295 300
Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His Leu Lys Cys
305 310 315 320
Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser Tyr Ser Leu
325 330 335
Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu Thr Leu His
340 345 350
Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys
355 360 365
Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu Thr Pro Val
370 375 380
Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser Thr Glu Asn
385 390 395 400
Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile
405 410 415
Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp His Arg Ser
420 425 430
Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg Gly Ala Lys
435 440 445
Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly Ser Val Gly
450 455 460
Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile Phe Gly Ala
465 470 475 480
Ala Phe Lys Ser Leu
485
<210> 123
<211> 1428
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of MVsp_ZikaE411 (C7)
<400> 123
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca aatccatatc aggtgcatag gagtcagcaa tagggacttt 120
gtggaaggta tgtcaggtgg gacttgggtt gatgttgtct tggaacatgg aggttgtgtc 180
accgtaatgg cacaggacaa accgactgtc gacatagagc tggttacaac aacagtcagc 240
aacatggcgg aggtaagatc ctactgctat gaggcatcaa tatcagacat ggcttcggac 300
agccgctgcc caacacaagg tgaagcctac cttgacaagc aatcagacac tcaatatgtc 360
tgcaaaagaa cgttagtgga cagaggctgg ggaaatggat gtggactttt tggcaaaggg 420
agcctggtga catgcgctaa gtttgcatgc tccaagaaaa tgaccgggaa gagcatccag 480
ccagagaatc tggagtaccg gataatgctg tcagttcatg gctcccagca cagtgggatg 540
atcgttaatg acacaggaca tgaaactgat gagaatagag cgaaggttga gataacgccc 600
aattcaccaa gagccgaagc caccctgggg ggttttggaa gcctaggact tgattgtgaa 660
ccgaggacag gccttgactt ttcagatttg tattacttga ctatgaataa caagcactgg 720
ttggttcaca aggagtggtt ccacgacatt ccattacctt ggcacgctgg ggcagacacc 780
ggaactccac actggaacaa caaagaagca ctggtagagt tcaaggacgc acatgccaaa 840
aggcaaactg tcgtggttct agggagtcaa gaaggagcag ttcacacggc ccttgctgga 900
gctctggagg ctgagatgga tggtgcaaag ggaaggctgt cctctggcca cttgaaatgt 960
cgcctgaaaa tggataaact tagattgaag ggcgtgtcat actccttgtg taccgcagcg 1020
ttcacattca ccaagatccc ggctgaaaca ctgcacggga cagtcacagt ggaggtacag 1080
tacgcaggga cagatggacc ttgcaaggtt ccagctcaga tggcggtgga catgcaaact 1140
ctgaccccag ttgggaggtt gataaccgct aaccccgtaa tcactgaaag cactgagaac 1200
tctaagatga tgctggaact tgatccacca tttggggact cttacattgt cataggagtc 1260
ggggagaaga agatcaccca ccactggcac aggagtggca gcaccattgg aaaagcattt 1320
gaagccactg tgagaggtgc caagagaatg gcagtcttgg gagacacagc ctgggacttt 1380
ggatcagttg gaggcgctct caactcattg ggcaagggca tctgatga 1428
<210> 124
<211> 1428
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of MVsp_ZikaE411 (C7)
<400> 124
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca gatccacatc cggtgcatcg gcgtgagcaa tagagacttc 120
gtggagggaa tgtccggagg aacctgggtg gatgtggtgc tggagcacgg cggctgcgtg 180
acagtgatgg cccaggacaa gccaaccgtg gatatcgagc tggtgaccac aaccgtgtcc 240
aacatggccg aggtgaggtc ttactgctat gaggccagca tctccgacat ggcctctgat 300
agcaggtgtc caacccaggg agaggcatac ctggacaagc agtccgatac acagtacgtg 360
tgcaagcgga ccctggtgga cagaggctgg ggcaatggct gtggcctgtt tggcaagggc 420
tctctggtga catgcgccaa gttcgcctgt agcaagaaga tgaccggcaa gtccatccag 480
ccagagaacc tggagtaccg gatcatgctg tctgtgcacg gctcccagca ctctggcatg 540
atcgtgaacg acacaggcca cgagacagat gagaatcggg ccaaggtgga gatcacacct 600
aactctccaa gagccgaggc caccctggga ggatttggct ctctgggcct ggactgcgag 660
cctagaacag gcctggactt ctccgatctg tactatctga ccatgaacaa taagcactgg 720
ctggtgcaca aggagtggtt tcacgacatc ccactgccat ggcacgcagg agcagataca 780
ggaacaccac actggaacaa taaggaggcc ctggtggagt tcaaggatgc ccacgccaag 840
cggcagacag tggtggtgct gggcagccag gagggagcag tgcacaccgc cctggcaggc 900
gccctggagg cagagatgga cggagctaag ggcagactgt ctagcggcca cctgaagtgc 960
aggctgaaga tggataagct gcgcctgaag ggcgtgtcct actctctgtg cacagccgcc 1020
ttcaccttca ccaagatccc tgccgagaca ctgcacggca cagtgaccgt ggaggtgcag 1080
tatgccggca cagacggacc ctgtaaggtg cctgcccaga tggccgtgga tatgcagaca 1140
ctgacacctg tgggcaggct gatcaccgcc aatccagtga tcacagagtc taccgagaac 1200
agcaagatga tgctggagct ggacccacca tttggcgata gctatatcgt gatcggcgtg 1260
ggcgagaaga agatcacaca ccactggcac cgcagcggct ccacaatcgg caaggccttt 1320
gaggcaaccg tgcgcggagc aaagagaatg gccgtgctgg gcgacaccgc atgggatttc 1380
ggatctgtgg gaggcgccct gaacagcctg ggcaagggca tctgatga 1428
<210> 125
<211> 474
<212> PRT
<213> Artificial sequence
<220>
<223> MVsp_ZikaE411 (C7)
<400> 125
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln Ile His Ile Arg Cys
20 25 30
Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser Gly Gly Thr
35 40 45
Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val Thr Val Met Ala
50 55 60
Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr Thr Val Ser
65 70 75 80
Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser Ile Ser Asp
85 90 95
Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala Tyr Leu Asp
100 105 110
Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu Val Asp Arg
115 120 125
Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser Leu Val Thr
130 135 140
Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys Ser Ile Gln
145 150 155 160
Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His Gly Ser Gln
165 170 175
His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr Asp Glu Asn
180 185 190
Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala Glu Ala Thr
195 200 205
Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro Arg Thr Gly
210 215 220
Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn Lys His Trp
225 230 235 240
Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro Trp His Ala
245 250 255
Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu Ala Leu Val
260 265 270
Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val Val Leu Gly
275 280 285
Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala Leu Glu Ala
290 295 300
Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His Leu Lys Cys
305 310 315 320
Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser Tyr Ser Leu
325 330 335
Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu Thr Leu His
340 345 350
Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys
355 360 365
Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu Thr Pro Val
370 375 380
Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser Thr Glu Asn
385 390 395 400
Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile
405 410 415
Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp His Arg Ser
420 425 430
Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg Gly Ala Lys
435 440 445
Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly Ser Val Gly
450 455 460
Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile
465 470
<210> 126
<211> 1302
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of MVsp_ZikaE395 (C8)
<400> 126
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca aatccatatc aggtgcatag gagtcagcaa tagggacttt 120
gtggaaggta tgtcaggtgg gacttgggtt gatgttgtct tggaacatgg aggttgtgtc 180
accgtaatgg cacaggacaa accgactgtc gacatagagc tggttacaac aacagtcagc 240
aacatggcgg aggtaagatc ctactgctat gaggcatcaa tatcagacat ggcttcggac 300
agccgctgcc caacacaagg tgaagcctac cttgacaagc aatcagacac tcaatatgtc 360
tgcaaaagaa cgttagtgga cagaggctgg ggaaatggat gtggactttt tggcaaaggg 420
agcctggtga catgcgctaa gtttgcatgc tccaagaaaa tgaccgggaa gagcatccag 480
ccagagaatc tggagtaccg gataatgctg tcagttcatg gctcccagca cagtgggatg 540
atcgttaatg acacaggaca tgaaactgat gagaatagag cgaaggttga gataacgccc 600
aattcaccaa gagccgaagc caccctgggg ggttttggaa gcctaggact tgattgtgaa 660
ccgaggacag gccttgactt ttcagatttg tattacttga ctatgaataa caagcactgg 720
ttggttcaca aggagtggtt ccacgacatt ccattacctt ggcacgctgg ggcagacacc 780
ggaactccac actggaacaa caaagaagca ctggtagagt tcaaggacgc acatgccaaa 840
aggcaaactg tcgtggttct agggagtcaa gaaggagcag ttcacacggc ccttgctgga 900
gctctggagg ctgagatgga tggtgcaaag ggaaggctgt cctctggcca cttgaaatgt 960
cgcctgaaaa tggataaact tagattgaag ggcgtgtcat actccttgtg taccgcagcg 1020
ttcacattca ccaagatccc ggctgaaaca ctgcacggga cagtcacagt ggaggtacag 1080
tacgcaggga cagatggacc ttgcaaggtt ccagctcaga tggcggtgga catgcaaact 1140
ctgaccccag ttgggaggtt gataaccgct aaccccgtaa tcactgaaag cactgagaac 1200
tctaagatga tgctggaact tgatccacca tttggggact cttacattgt cataggagtc 1260
ggggagaaga agatcaccca ccactggcac aggagtggct ga 1302
<210> 127
<211> 1302
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of MVsp_ZikaE395 (C8)
<400> 127
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca gatccacatc cggtgcatcg gcgtgagcaa tagagacttc 120
gtggagggaa tgtccggagg aacctgggtg gatgtggtgc tggagcacgg cggctgcgtg 180
acagtgatgg cccaggacaa gccaaccgtg gatatcgagc tggtgaccac aaccgtgtcc 240
aacatggccg aggtgaggtc ttactgctat gaggccagca tctccgacat ggcctctgat 300
agcaggtgtc caacccaggg agaggcatac ctggacaagc agtccgatac acagtacgtg 360
tgcaagcgga ccctggtgga cagaggctgg ggcaatggct gtggcctgtt tggcaagggc 420
tctctggtga catgcgccaa gttcgcctgt agcaagaaga tgaccggcaa gtccatccag 480
ccagagaacc tggagtaccg gatcatgctg tctgtgcacg gctcccagca ctctggcatg 540
atcgtgaacg acacaggcca cgagacagat gagaatcggg ccaaggtgga gatcacacct 600
aactctccaa gagccgaggc caccctggga ggatttggct ctctgggcct ggactgcgag 660
cctagaacag gcctggactt ctccgatctg tactatctga ccatgaacaa taagcactgg 720
ctggtgcaca aggagtggtt tcacgacatc ccactgccat ggcacgcagg agcagataca 780
ggaacaccac actggaacaa taaggaggcc ctggtggagt tcaaggatgc ccacgccaag 840
cggcagacag tggtggtgct gggcagccag gagggagcag tgcacaccgc cctggcaggc 900
gccctggagg cagagatgga cggagctaag ggcagactgt ctagcggcca cctgaagtgc 960
aggctgaaga tggataagct gcgcctgaag ggcgtgtcct actctctgtg cacagccgcc 1020
ttcaccttca ccaagatccc tgccgagaca ctgcacggca cagtgaccgt ggaggtgcag 1080
tatgccggca cagacggacc ctgtaaggtg cctgcccaga tggccgtgga tatgcagaca 1140
ctgacacctg tgggcaggct gatcaccgcc aatccagtga tcacagagtc taccgagaac 1200
agcaagatga tgctggagct ggacccacca tttggcgata gctatatcgt gatcggcgtg 1260
ggcgagaaga agatcacaca ccactggcac cgcagcggct ga 1302
<210> 128
<211> 433
<212> PRT
<213> Artificial sequence
<220>
<223> MVsp_ZikaE395 (C8)
<400> 128
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln Ile His Ile Arg Cys
20 25 30
Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser Gly Gly Thr
35 40 45
Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val Thr Val Met Ala
50 55 60
Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr Thr Val Ser
65 70 75 80
Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser Ile Ser Asp
85 90 95
Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala Tyr Leu Asp
100 105 110
Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu Val Asp Arg
115 120 125
Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser Leu Val Thr
130 135 140
Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys Ser Ile Gln
145 150 155 160
Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His Gly Ser Gln
165 170 175
His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr Asp Glu Asn
180 185 190
Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala Glu Ala Thr
195 200 205
Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro Arg Thr Gly
210 215 220
Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn Lys His Trp
225 230 235 240
Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro Trp His Ala
245 250 255
Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu Ala Leu Val
260 265 270
Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val Val Leu Gly
275 280 285
Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala Leu Glu Ala
290 295 300
Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His Leu Lys Cys
305 310 315 320
Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser Tyr Ser Leu
325 330 335
Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu Thr Leu His
340 345 350
Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys
355 360 365
Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu Thr Pro Val
370 375 380
Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser Thr Glu Asn
385 390 395 400
Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile
405 410 415
Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp His Arg Ser
420 425 430
Gly
<210> 129
<211> 2178
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of MVsp_ZikaprME_MVTMintracyto (C9)
<400> 129
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca aatccatgcg gaggtcacta gacgtgggag tgcatactat 120
atgtacttgg acagaaacga tgctggggag gccatatctt ttccaaccac attggggatg 180
aataagtgtt atatacagat catggatctt ggacacatgt gtgatgccac catgagctat 240
gaatgcccta tgctggatga gggggtggaa ccagatgacg tcgattgttg gtgcaacacg 300
acgtcaactt gggttgtgta cggaacctgc catcacaaaa aaggtgaagc acggagatct 360
agaagagctg tgacgctccc ctcccattcc actaggaagc tgcaaacgcg gtcgcaaacc 420
tggttggaat caagagaata cacaaagcac ttgattagag tcgaaaattg gatattcagg 480
aaccctggct tcgcgttagc agcagctgcc atcgcttggc ttttgggaag ctcaacgagc 540
caaaaagtca tatacttggt catgatactg ctgattgccc cggcatacag catcaggtgc 600
ataggagtca gcaataggga ctttgtggaa ggtatgtcag gtgggacttg ggttgatgtt 660
gtcttggaac atggaggttg tgtcaccgta atggcacagg acaaaccgac tgtcgacata 720
gagctggtta caacaacagt cagcaacatg gcggaggtaa gatcctactg ctatgaggca 780
tcaatatcag acatggcttc ggacagccgc tgcccaacac aaggtgaagc ctaccttgac 840
aagcaatcag acactcaata tgtctgcaaa agaacgttag tggacagagg ctggggaaat 900
ggatgtggac tttttggcaa agggagcctg gtgacatgcg ctaagtttgc atgctccaag 960
aaaatgaccg ggaagagcat ccagccagag aatctggagt accggataat gctgtcagtt 1020
catggctccc agcacagtgg gatgatcgtt aatgacacag gacatgaaac tgatgagaat 1080
agagcgaagg ttgagataac gcccaattca ccaagagccg aagccaccct ggggggtttt 1140
ggaagcctag gacttgattg tgaaccgagg acaggccttg acttttcaga tttgtattac 1200
ttgactatga ataacaagca ctggttggtt cacaaggagt ggttccacga cattccatta 1260
ccttggcacg ctggggcaga caccggaact ccacactgga acaacaaaga agcactggta 1320
gagttcaagg acgcacatgc caaaaggcaa actgtcgtgg ttctagggag tcaagaagga 1380
gcagttcaca cggcccttgc tggagctctg gaggctgaga tggatggtgc aaagggaagg 1440
ctgtcctctg gccacttgaa atgtcgcctg aaaatggata aacttagatt gaagggcgtg 1500
tcatactcct tgtgtaccgc agcgttcaca ttcaccaaga tcccggctga aacactgcac 1560
gggacagtca cagtggaggt acagtacgca gggacagatg gaccttgcaa ggttccagct 1620
cagatggcgg tggacatgca aactctgacc ccagttggga ggttgataac cgctaacccc 1680
gtaatcactg aaagcactga gaactctaag atgatgctgg aacttgatcc accatttggg 1740
gactcttaca ttgtcatagg agtcggggag aagaagatca cccaccactg gcacaggagt 1800
ggcagcacca ttggaaaagc atttgaagcc actgtgagag gtgccaagag aatggcagtc 1860
ttgggagaca cagcctggga ctttggatca gttggaggcg ctctcaactc attgggcaag 1920
ggcatccatc aaatttttgg agcagctttc aaatcattgt ttggaggaat gtcctggttc 1980
tcaatgaaag gtttatcgag cactagcata gtctacatcc tgattgcagt gtgtcttgga 2040
gggttgatag ggatccccgc tttaatatgt tgctgcaggg ggcgttgtaa caaaaaggga 2100
gaacaagttg gtatgtcaag accaggccta aagcctgatc ttacgggaac atcaaaatcc 2160
tatgtaaggt cgctctga 2178
<210> 130
<211> 2178
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of MVsp_ZikaprME_MVTMintracyto (C9)
<400> 130
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca gatccacgca gaggtgacca ggagaggaag cgcctactat 120
atgtacctgg acaggaatga tgccggcgag gccatctcct tcccaaccac actgggcatg 180
aacaagtgct acatccagat catggacctg ggccacatgt gcgatgccac catgtcctat 240
gagtgtccaa tgctggacga gggcgtggag cccgacgatg tggattgctg gtgtaatacc 300
acatctacat gggtggtgta cggcacctgt caccacaaga agggagaggc ccggcggagc 360
cggcgggccg tgacactgcc ttcccactct accaggaagc tgcagacacg cagccagacc 420
tggctggagt ccagagagta taccaagcac ctgatcaggg tggagaactg gatctttcgc 480
aatccaggat tcgcactggc agcagcagca atcgcatggc tgctgggaag ctccaccagc 540
cagaaagtga tctacctggt catgatcctg ctgatcgctc ctgcctattc tatccggtgc 600
atcggcgtga gcaatagaga cttcgtggag ggaatgtccg gaggaacctg ggtggatgtg 660
gtgctggagc acggcggctg cgtgacagtg atggcccagg acaagccaac cgtggatatc 720
gagctggtga ccacaaccgt gtccaacatg gccgaggtga ggtcttactg ctatgaggcc 780
agcatctccg acatggcctc tgatagcagg tgtccaaccc agggagaggc atacctggac 840
aagcagtccg atacacagta cgtgtgcaag cggaccctgg tggacagagg ctggggcaat 900
ggctgtggcc tgtttggcaa gggctctctg gtgacatgcg ccaagttcgc ctgtagcaag 960
aagatgaccg gcaagtccat ccagccagag aacctggagt accggatcat gctgtctgtg 1020
cacggctccc agcactctgg catgatcgtg aacgacacag gccacgagac agatgagaat 1080
cgggccaagg tggagatcac acctaactct ccaagagccg aggccaccct gggaggattt 1140
ggctctctgg gcctggactg cgagcctaga acaggcctgg acttctccga tctgtactat 1200
ctgaccatga acaataagca ctggctggtg cacaaggagt ggtttcacga catcccactg 1260
ccatggcacg caggagcaga tacaggaaca ccacactgga acaataagga ggccctggtg 1320
gagttcaagg atgcccacgc caagcggcag acagtggtgg tgctgggcag ccaggaggga 1380
gcagtgcaca ccgccctggc aggcgccctg gaggcagaga tggacggagc taagggcaga 1440
ctgtctagcg gccacctgaa gtgcaggctg aagatggata agctgcgcct gaagggcgtg 1500
tcctactctc tgtgcacagc cgccttcacc ttcaccaaga tccctgccga gacactgcac 1560
ggcacagtga ccgtggaggt gcagtatgcc ggcacagacg gaccctgtaa ggtgcctgcc 1620
cagatggccg tggatatgca gacactgaca cctgtgggca ggctgatcac cgccaatcca 1680
gtgatcacag agtctaccga gaacagcaag atgatgctgg agctggaccc accatttggc 1740
gatagctata tcgtgatcgg cgtgggcgag aagaagatca cacaccactg gcaccgcagc 1800
ggctccacaa tcggcaaggc ctttgaggca accgtgcgcg gagcaaagag aatggccgtg 1860
ctgggcgaca ccgcatggga tttcggatct gtgggaggcg ccctgaacag cctgggcaag 1920
ggcatccacc agatcttcgg cgccgccttt aagtccctgt tcggcggcat gagctggttc 1980
tcaatgaagg gcctgtcctc tacctctatc gtgtacatcc tgatcgccgt gtgcctggga 2040
ggcctgatcg gaatcccagc cctgatctgc tgttgcagag gccgctgcaa caagaaggga 2100
gagcaagtgg gaatgtctcg gccaggcctg aagccagacc tgacaggcac ctccaagtct 2160
tatgtgagaa gcctgtga 2178
<210> 131
<211> 725
<212> PRT
<213> Artificial sequence
<220>
<223> MVsp_ZikaprME_MVTMintracyto (C9)
<400> 131
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln Ile His Ala Glu Val
20 25 30
Thr Arg Arg Gly Ser Ala Tyr Tyr Met Tyr Leu Asp Arg Asn Asp Ala
35 40 45
Gly Glu Ala Ile Ser Phe Pro Thr Thr Leu Gly Met Asn Lys Cys Tyr
50 55 60
Ile Gln Ile Met Asp Leu Gly His Met Cys Asp Ala Thr Met Ser Tyr
65 70 75 80
Glu Cys Pro Met Leu Asp Glu Gly Val Glu Pro Asp Asp Val Asp Cys
85 90 95
Trp Cys Asn Thr Thr Ser Thr Trp Val Val Tyr Gly Thr Cys His His
100 105 110
Lys Lys Gly Glu Ala Arg Arg Ser Arg Arg Ala Val Thr Leu Pro Ser
115 120 125
His Ser Thr Arg Lys Leu Gln Thr Arg Ser Gln Thr Trp Leu Glu Ser
130 135 140
Arg Glu Tyr Thr Lys His Leu Ile Arg Val Glu Asn Trp Ile Phe Arg
145 150 155 160
Asn Pro Gly Phe Ala Leu Ala Ala Ala Ala Ile Ala Trp Leu Leu Gly
165 170 175
Ser Ser Thr Ser Gln Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile
180 185 190
Ala Pro Ala Tyr Ser Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe
195 200 205
Val Glu Gly Met Ser Gly Gly Thr Trp Val Asp Val Val Leu Glu His
210 215 220
Gly Gly Cys Val Thr Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile
225 230 235 240
Glu Leu Val Thr Thr Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr
245 250 255
Cys Tyr Glu Ala Ser Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro
260 265 270
Thr Gln Gly Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val
275 280 285
Cys Lys Arg Thr Leu Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu
290 295 300
Phe Gly Lys Gly Ser Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys
305 310 315 320
Lys Met Thr Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile
325 330 335
Met Leu Ser Val His Gly Ser Gln His Ser Gly Met Ile Val Asn Asp
340 345 350
Thr Gly His Glu Thr Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro
355 360 365
Asn Ser Pro Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly
370 375 380
Leu Asp Cys Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr
385 390 395 400
Leu Thr Met Asn Asn Lys His Trp Leu Val His Lys Glu Trp Phe His
405 410 415
Asp Ile Pro Leu Pro Trp His Ala Gly Ala Asp Thr Gly Thr Pro His
420 425 430
Trp Asn Asn Lys Glu Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys
435 440 445
Arg Gln Thr Val Val Val Leu Gly Ser Gln Glu Gly Ala Val His Thr
450 455 460
Ala Leu Ala Gly Ala Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg
465 470 475 480
Leu Ser Ser Gly His Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg
485 490 495
Leu Lys Gly Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr
500 505 510
Lys Ile Pro Ala Glu Thr Leu His Gly Thr Val Thr Val Glu Val Gln
515 520 525
Tyr Ala Gly Thr Asp Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val
530 535 540
Asp Met Gln Thr Leu Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro
545 550 555 560
Val Ile Thr Glu Ser Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp
565 570 575
Pro Pro Phe Gly Asp Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys
580 585 590
Ile Thr His His Trp His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe
595 600 605
Glu Ala Thr Val Arg Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr
610 615 620
Ala Trp Asp Phe Gly Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys
625 630 635 640
Gly Ile His Gln Ile Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly
645 650 655
Met Ser Trp Phe Ser Met Lys Gly Leu Ser Ser Thr Ser Ile Val Tyr
660 665 670
Ile Leu Ile Ala Val Cys Leu Gly Gly Leu Ile Gly Ile Pro Ala Leu
675 680 685
Ile Cys Cys Cys Arg Gly Arg Cys Asn Lys Lys Gly Glu Gln Val Gly
690 695 700
Met Ser Arg Pro Gly Leu Lys Pro Asp Leu Thr Gly Thr Ser Lys Ser
705 710 715 720
Tyr Val Arg Ser Leu
725
<210> 132
<211> 1674
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of MVsp_Zika_MVTMintracytoE (C10)
<400> 132
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca aatccatatc aggtgcatag gagtcagcaa tagggacttt 120
gtggaaggta tgtcaggtgg gacttgggtt gatgttgtct tggaacatgg aggttgtgtc 180
accgtaatgg cacaggacaa accgactgtc gacatagagc tggttacaac aacagtcagc 240
aacatggcgg aggtaagatc ctactgctat gaggcatcaa tatcagacat ggcttcggac 300
agccgctgcc caacacaagg tgaagcctac cttgacaagc aatcagacac tcaatatgtc 360
tgcaaaagaa cgttagtgga cagaggctgg ggaaatggat gtggactttt tggcaaaggg 420
agcctggtga catgcgctaa gtttgcatgc tccaagaaaa tgaccgggaa gagcatccag 480
ccagagaatc tggagtaccg gataatgctg tcagttcatg gctcccagca cagtgggatg 540
atcgttaatg acacaggaca tgaaactgat gagaatagag cgaaggttga gataacgccc 600
aattcaccaa gagccgaagc caccctgggg ggttttggaa gcctaggact tgattgtgaa 660
ccgaggacag gccttgactt ttcagatttg tattacttga ctatgaataa caagcactgg 720
ttggttcaca aggagtggtt ccacgacatt ccattacctt ggcacgctgg ggcagacacc 780
ggaactccac actggaacaa caaagaagca ctggtagagt tcaaggacgc acatgccaaa 840
aggcaaactg tcgtggttct agggagtcaa gaaggagcag ttcacacggc ccttgctgga 900
gctctggagg ctgagatgga tggtgcaaag ggaaggctgt cctctggcca cttgaaatgt 960
cgcctgaaaa tggataaact tagattgaag ggcgtgtcat actccttgtg taccgcagcg 1020
ttcacattca ccaagatccc ggctgaaaca ctgcacggga cagtcacagt ggaggtacag 1080
tacgcaggga cagatggacc ttgcaaggtt ccagctcaga tggcggtgga catgcaaact 1140
ctgaccccag ttgggaggtt gataaccgct aaccccgtaa tcactgaaag cactgagaac 1200
tctaagatga tgctggaact tgatccacca tttggggact cttacattgt cataggagtc 1260
ggggagaaga agatcaccca ccactggcac aggagtggca gcaccattgg aaaagcattt 1320
gaagccactg tgagaggtgc caagagaatg gcagtcttgg gagacacagc ctgggacttt 1380
ggatcagttg gaggcgctct caactcattg ggcaagggca tccatcaaat ttttggagca 1440
gctttcaaat cattgtttgg aggaatgtcc tggttctcaa tgaaaggttt atcgagcact 1500
agcatagtct acatcctgat tgcagtgtgt cttggagggt tgatagggat ccccgcttta 1560
atatgttgct gcagggggcg ttgtaacaaa aagggagaac aagttggtat gtcaagacca 1620
ggcctaaagc ctgatcttac gggaacatca aaatcctatg taaggtcgct ctga 1674
<210> 133
<211> 1674
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of MVsp_Zika_MVTMintracytoE (C10)
<400> 133
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca gatccacatc cggtgcatcg gcgtgagcaa tagagacttc 120
gtggagggaa tgtccggagg aacctgggtg gatgtggtgc tggagcacgg cggctgcgtg 180
acagtgatgg cccaggacaa gccaaccgtg gatatcgagc tggtgaccac aaccgtgtcc 240
aacatggccg aggtgaggtc ttactgctat gaggccagca tctccgacat ggcctctgat 300
agcaggtgtc caacccaggg agaggcatac ctggacaagc agtccgatac acagtacgtg 360
tgcaagcgga ccctggtgga cagaggctgg ggcaatggct gtggcctgtt tggcaagggc 420
tctctggtga catgcgccaa gttcgcctgt agcaagaaga tgaccggcaa gtccatccag 480
ccagagaacc tggagtaccg gatcatgctg tctgtgcacg gctcccagca ctctggcatg 540
atcgtgaacg acacaggcca cgagacagat gagaatcggg ccaaggtgga gatcacacct 600
aactctccaa gagccgaggc caccctggga ggatttggct ctctgggcct ggactgcgag 660
cctagaacag gcctggactt ctccgatctg tactatctga ccatgaacaa taagcactgg 720
ctggtgcaca aggagtggtt tcacgacatc ccactgccat ggcacgcagg agcagataca 780
ggaacaccac actggaacaa taaggaggcc ctggtggagt tcaaggatgc ccacgccaag 840
cggcagacag tggtggtgct gggcagccag gagggagcag tgcacaccgc cctggcaggc 900
gccctggagg cagagatgga cggagctaag ggcagactgt ctagcggcca cctgaagtgc 960
aggctgaaga tggataagct gcgcctgaag ggcgtgtcct actctctgtg cacagccgcc 1020
ttcaccttca ccaagatccc tgccgagaca ctgcacggca cagtgaccgt ggaggtgcag 1080
tatgccggca cagacggacc ctgtaaggtg cctgcccaga tggccgtgga tatgcagaca 1140
ctgacacctg tgggcaggct gatcaccgcc aatccagtga tcacagagtc taccgagaac 1200
agcaagatga tgctggagct ggacccacca tttggcgata gctatatcgt gatcggcgtg 1260
ggcgagaaga agatcacaca ccactggcac cgcagcggct ccacaatcgg caaggccttt 1320
gaggcaaccg tgcgcggagc aaagagaatg gccgtgctgg gcgacaccgc atgggatttc 1380
ggatctgtgg gaggcgccct gaacagcctg ggcaagggca tccaccagat cttcggcgcc 1440
gcctttaagt ccctgttcgg cggcatgagc tggttctcaa tgaagggcct gtcctctacc 1500
tctatcgtgt acatcctgat cgccgtgtgc ctgggaggcc tgatcggaat cccagccctg 1560
atctgctgtt gcagaggccg ctgcaacaag aagggagagc aagtgggaat gtctcggcca 1620
ggcctgaagc cagacctgac aggcacctcc aagtcttatg tgagaagcct gtga 1674
<210> 134
<211> 557
<212> PRT
<213> Artificial sequence
<220>
<223> MVsp_Zika_MVTMintracytoE (C10)
<400> 134
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln Ile His Ile Arg Cys
20 25 30
Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser Gly Gly Thr
35 40 45
Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val Thr Val Met Ala
50 55 60
Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr Thr Val Ser
65 70 75 80
Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser Ile Ser Asp
85 90 95
Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala Tyr Leu Asp
100 105 110
Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu Val Asp Arg
115 120 125
Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser Leu Val Thr
130 135 140
Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys Ser Ile Gln
145 150 155 160
Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His Gly Ser Gln
165 170 175
His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr Asp Glu Asn
180 185 190
Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala Glu Ala Thr
195 200 205
Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro Arg Thr Gly
210 215 220
Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn Lys His Trp
225 230 235 240
Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro Trp His Ala
245 250 255
Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu Ala Leu Val
260 265 270
Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val Val Leu Gly
275 280 285
Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala Leu Glu Ala
290 295 300
Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His Leu Lys Cys
305 310 315 320
Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser Tyr Ser Leu
325 330 335
Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu Thr Leu His
340 345 350
Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys
355 360 365
Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu Thr Pro Val
370 375 380
Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser Thr Glu Asn
385 390 395 400
Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile
405 410 415
Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp His Arg Ser
420 425 430
Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg Gly Ala Lys
435 440 445
Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly Ser Val Gly
450 455 460
Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile Phe Gly Ala
465 470 475 480
Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser Met Lys Gly
485 490 495
Leu Ser Ser Thr Ser Ile Val Tyr Ile Leu Ile Ala Val Cys Leu Gly
500 505 510
Gly Leu Ile Gly Ile Pro Ala Leu Ile Cys Cys Cys Arg Gly Arg Cys
515 520 525
Asn Lys Lys Gly Glu Gln Val Gly Met Ser Arg Pro Gly Leu Lys Pro
530 535 540
Asp Leu Thr Gly Thr Ser Lys Ser Tyr Val Arg Ser Leu
545 550 555
<210> 135
<211> 2100
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of MVsp ZikaprME (D1)
<400> 135
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca agcggaggtc actagacgtg ggagtgcata ctatatgtac 120
ttggacagaa acgatgctgg ggaggccata tcttttccaa ccacattggg gatgaataag 180
tgttatatac agatcatgga tcttggacac atgtgtgatg ccaccatgag ctatgaatgc 240
cctatgctgg atgagggggt ggaaccagat gacgtcgatt gttggtgcaa cacgacgtca 300
acttgggttg tgtacggaac ctgccatcac aaaaaaggtg aagcacggag atctagaaga 360
gctgtgacgc tcccctccca ttccactagg aagctgcaaa cgcggtcgca aacctggttg 420
gaatcaagag aatacacaaa gcacttgatt agagtcgaaa attggatatt caggaaccct 480
ggcttcgcgt tagcagcagc tgccatcgct tggcttttgg gaagctcaac gagccaaaaa 540
gtcatatact tggtcatgat actgctgatt gccccggcat acagcatcag gtgcatagga 600
gtcagcaata gggactttgt ggaaggtatg tcaggtggga cttgggttga tgttgtcttg 660
gaacatggag gttgtgtcac cgtaatggca caggacaaac cgactgtcga catagagctg 720
gttacaacaa cagtcagcaa catggcggag gtaagatcct actgctatga ggcatcaata 780
tcagacatgg cttcggacag ccgctgccca acacaaggtg aagcctacct tgacaagcaa 840
tcagacactc aatatgtctg caaaagaacg ttagtggaca gaggctgggg aaatggatgt 900
ggactttttg gcaaagggag cctggtgaca tgcgctaagt ttgcatgctc caagaaaatg 960
accgggaaga gcatccagcc agagaatctg gagtaccgga taatgctgtc agttcatggc 1020
tcccagcaca gtgggatgat cgttaatgac acaggacatg aaactgatga gaatagagcg 1080
aaggttgaga taacgcccaa ttcaccaaga gccgaagcca ccctgggggg ttttggaagc 1140
ctaggacttg attgtgaacc gaggacaggc cttgactttt cagatttgta ttacttgact 1200
atgaataaca agcactggtt ggttcacaag gagtggttcc acgacattcc attaccttgg 1260
cacgctgggg cagacaccgg aactccacac tggaacaaca aagaagcact ggtagagttc 1320
aaggacgcac atgccaaaag gcaaactgtc gtggttctag ggagtcaaga aggagcagtt 1380
cacacggccc ttgctggagc tctggaggct gagatggatg gtgcaaaggg aaggctgtcc 1440
tctggccact tgaaatgtcg cctgaaaatg gataaactta gattgaaggg cgtgtcatac 1500
tccttgtgta ccgcagcgtt cacattcacc aagatcccgg ctgaaacact gcacgggaca 1560
gtcacagtgg aggtacagta cgcagggaca gatggacctt gcaaggttcc agctcagatg 1620
gcggtggaca tgcaaactct gaccccagtt gggaggttga taaccgctaa ccccgtaatc 1680
actgaaagca ctgagaactc taagatgatg ctggaacttg atccaccatt tggggactct 1740
tacattgtca taggagtcgg ggagaagaag atcacccacc actggcacag gagtggcagc 1800
accattggaa aagcatttga agccactgtg agaggtgcca agagaatggc agtcttggga 1860
gacacagcct gggactttgg atcagttgga ggcgctctca actcattggg caagggcatc 1920
catcaaattt ttggagcagc tttcaaatca ttgtttggag gaatgtcctg gttctcacaa 1980
attctcattg gaacgttgct gatgtggttg ggtctgaaca caaagaatgg atctatttcc 2040
cttatgtgct tggccttagg gggagtgttg atcttcttat ccacagccgt ctctgcttga 2100
<210> 136
<211> 2100
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of MVsp ZikaprME (D1)
<400> 136
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca ggcagaggtg accaggagag gaagcgccta ctatatgtac 120
ctggacagga atgatgccgg cgaggccatc tccttcccaa ccacactggg catgaacaag 180
tgctacatcc agatcatgga cctgggccac atgtgcgatg ccaccatgtc ctatgagtgt 240
ccaatgctgg acgagggcgt ggagcccgac gatgtggatt gctggtgtaa taccacatct 300
acatgggtgg tgtacggcac ctgtcaccac aagaagggag aggcccggcg gagccggcgg 360
gccgtgacac tgccttccca ctctaccagg aagctgcaga cacgcagcca gacctggctg 420
gagtccagag agtataccaa gcacctgatc agggtggaga actggatctt tcgcaatcca 480
ggattcgcac tggcagcagc agcaatcgca tggctgctgg gaagctccac cagccagaaa 540
gtgatctacc tggtcatgat cctgctgatc gctcctgcct attctatccg gtgcatcggc 600
gtgagcaata gagacttcgt ggagggaatg tccggaggaa cctgggtgga tgtggtgctg 660
gagcacggcg gctgcgtgac agtgatggcc caggacaagc caaccgtgga tatcgagctg 720
gtgaccacaa ccgtgtccaa catggccgag gtgaggtctt actgctatga ggccagcatc 780
tccgacatgg cctctgatag caggtgtcca acccagggag aggcatacct ggacaagcag 840
tccgatacac agtacgtgtg caagcggacc ctggtggaca gaggctgggg caatggctgt 900
ggcctgtttg gcaagggctc tctggtgaca tgcgccaagt tcgcctgtag caagaagatg 960
accggcaagt ccatccagcc agagaacctg gagtaccgga tcatgctgtc tgtgcacggc 1020
tcccagcact ctggcatgat cgtgaacgac acaggccacg agacagatga gaatcgggcc 1080
aaggtggaga tcacacctaa ctctccaaga gccgaggcca ccctgggagg atttggctct 1140
ctgggcctgg actgcgagcc tagaacaggc ctggacttct ccgatctgta ctatctgacc 1200
atgaacaata agcactggct ggtgcacaag gagtggtttc acgacatccc actgccatgg 1260
cacgcaggag cagatacagg aacaccacac tggaacaata aggaggccct ggtggagttc 1320
aaggatgccc acgccaagcg gcagacagtg gtggtgctgg gcagccagga gggagcagtg 1380
cacaccgccc tggcaggcgc cctggaggca gagatggacg gagctaaggg cagactgtct 1440
agcggccacc tgaagtgcag gctgaagatg gataagctgc gcctgaaggg cgtgtcctac 1500
tctctgtgca cagccgcctt caccttcacc aagatccctg ccgagacact gcacggcaca 1560
gtgaccgtgg aggtgcagta tgccggcaca gacggaccct gtaaggtgcc tgcccagatg 1620
gccgtggata tgcagacact gacacctgtg ggcaggctga tcaccgccaa tccagtgatc 1680
acagagtcta ccgagaacag caagatgatg ctggagctgg acccaccatt tggcgatagc 1740
tatatcgtga tcggcgtggg cgagaagaag atcacacacc actggcaccg cagcggctcc 1800
acaatcggca aggcctttga ggcaaccgtg cgcggagcaa agagaatggc cgtgctgggc 1860
gacaccgcat gggatttcgg atctgtggga ggcgccctga acagcctggg caagggcatc 1920
caccagatct tcggcgccgc ctttaagtcc ctgttcggcg gcatgagctg gttctcacag 1980
atcctgatcg gcacactgct gatgtggctg ggcctgaaca ccaagaatgg ctctatcagc 2040
ctgatgtgcc tggccctggg aggcgtgctg atcttcctgt ccaccgccgt gtctgcctga 2100
<210> 137
<211> 699
<212> PRT
<213> Artificial sequence
<220>
<223> MVsp ZikaprME (D1)
<400> 137
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln Ala Glu Val Thr Arg
20 25 30
Arg Gly Ser Ala Tyr Tyr Met Tyr Leu Asp Arg Asn Asp Ala Gly Glu
35 40 45
Ala Ile Ser Phe Pro Thr Thr Leu Gly Met Asn Lys Cys Tyr Ile Gln
50 55 60
Ile Met Asp Leu Gly His Met Cys Asp Ala Thr Met Ser Tyr Glu Cys
65 70 75 80
Pro Met Leu Asp Glu Gly Val Glu Pro Asp Asp Val Asp Cys Trp Cys
85 90 95
Asn Thr Thr Ser Thr Trp Val Val Tyr Gly Thr Cys His His Lys Lys
100 105 110
Gly Glu Ala Arg Arg Ser Arg Arg Ala Val Thr Leu Pro Ser His Ser
115 120 125
Thr Arg Lys Leu Gln Thr Arg Ser Gln Thr Trp Leu Glu Ser Arg Glu
130 135 140
Tyr Thr Lys His Leu Ile Arg Val Glu Asn Trp Ile Phe Arg Asn Pro
145 150 155 160
Gly Phe Ala Leu Ala Ala Ala Ala Ile Ala Trp Leu Leu Gly Ser Ser
165 170 175
Thr Ser Gln Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile Ala Pro
180 185 190
Ala Tyr Ser Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu
195 200 205
Gly Met Ser Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly
210 215 220
Cys Val Thr Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu
225 230 235 240
Val Thr Thr Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr
245 250 255
Glu Ala Ser Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln
260 265 270
Gly Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys
275 280 285
Arg Thr Leu Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly
290 295 300
Lys Gly Ser Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met
305 310 315 320
Thr Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu
325 330 335
Ser Val His Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly
340 345 350
His Glu Thr Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser
355 360 365
Pro Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp
370 375 380
Cys Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr
385 390 395 400
Met Asn Asn Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile
405 410 415
Pro Leu Pro Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn
420 425 430
Asn Lys Glu Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln
435 440 445
Thr Val Val Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu
450 455 460
Ala Gly Ala Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser
465 470 475 480
Ser Gly His Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys
485 490 495
Gly Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile
500 505 510
Pro Ala Glu Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala
515 520 525
Gly Thr Asp Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met
530 535 540
Gln Thr Leu Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile
545 550 555 560
Thr Glu Ser Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro
565 570 575
Phe Gly Asp Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr
580 585 590
His His Trp His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala
595 600 605
Thr Val Arg Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp
610 615 620
Asp Phe Gly Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile
625 630 635 640
His Gln Ile Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser
645 650 655
Trp Phe Ser Gln Ile Leu Ile Gly Thr Leu Leu Met Trp Leu Gly Leu
660 665 670
Asn Thr Lys Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu Gly Gly
675 680 685
Val Leu Ile Phe Leu Ser Thr Ala Val Ser Ala
690 695
<210> 138
<211> 1956
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of MVsp Zika_prME_no_Anchor (D2)
<400> 138
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca agcggaggtc actagacgtg ggagtgcata ctatatgtac 120
ttggacagaa acgatgctgg ggaggccata tcttttccaa ccacattggg gatgaataag 180
tgttatatac agatcatgga tcttggacac atgtgtgatg ccaccatgag ctatgaatgc 240
cctatgctgg atgagggggt ggaaccagat gacgtcgatt gttggtgcaa cacgacgtca 300
acttgggttg tgtacggaac ctgccatcac aaaaaaggtg aagcacggag atctagaaga 360
gctgtgacgc tcccctccca ttccactagg aagctgcaaa cgcggtcgca aacctggttg 420
gaatcaagag aatacacaaa gcacttgatt agagtcgaaa attggatatt caggaaccct 480
ggcttcgcgt tagcagcagc tgccatcgct tggcttttgg gaagctcaac gagccaaaaa 540
gtcatatact tggtcatgat actgctgatt gccccggcat acagcatcag gtgcatagga 600
gtcagcaata gggactttgt ggaaggtatg tcaggtggga cttgggttga tgttgtcttg 660
gaacatggag gttgtgtcac cgtaatggca caggacaaac cgactgtcga catagagctg 720
gttacaacaa cagtcagcaa catggcggag gtaagatcct actgctatga ggcatcaata 780
tcagacatgg cttcggacag ccgctgccca acacaaggtg aagcctacct tgacaagcaa 840
tcagacactc aatatgtctg caaaagaacg ttagtggaca gaggctgggg aaatggatgt 900
ggactttttg gcaaagggag cctggtgaca tgcgctaagt ttgcatgctc caagaaaatg 960
accgggaaga gcatccagcc agagaatctg gagtaccgga taatgctgtc agttcatggc 1020
tcccagcaca gtgggatgat cgttaatgac acaggacatg aaactgatga gaatagagcg 1080
aaggttgaga taacgcccaa ttcaccaaga gccgaagcca ccctgggggg ttttggaagc 1140
ctaggacttg attgtgaacc gaggacaggc cttgactttt cagatttgta ttacttgact 1200
atgaataaca agcactggtt ggttcacaag gagtggttcc acgacattcc attaccttgg 1260
cacgctgggg cagacaccgg aactccacac tggaacaaca aagaagcact ggtagagttc 1320
aaggacgcac atgccaaaag gcaaactgtc gtggttctag ggagtcaaga aggagcagtt 1380
cacacggccc ttgctggagc tctggaggct gagatggatg gtgcaaaggg aaggctgtcc 1440
tctggccact tgaaatgtcg cctgaaaatg gataaactta gattgaaggg cgtgtcatac 1500
tccttgtgta ccgcagcgtt cacattcacc aagatcccgg ctgaaacact gcacgggaca 1560
gtcacagtgg aggtacagta cgcagggaca gatggacctt gcaaggttcc agctcagatg 1620
gcggtggaca tgcaaactct gaccccagtt gggaggttga taaccgctaa ccccgtaatc 1680
actgaaagca ctgagaactc taagatgatg ctggaacttg atccaccatt tggggactct 1740
tacattgtca taggagtcgg ggagaagaag atcacccacc actggcacag gagtggcagc 1800
accattggaa aagcatttga agccactgtg agaggtgcca agagaatggc agtcttggga 1860
gacacagcct gggactttgg atcagttgga ggcgctctca actcattggg caagggcatc 1920
catcaaattt ttggagcagc tttcaaatca ttgtga 1956
<210> 139
<211> 1956
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of MVsp Zika_prME_no_Anchor (D2)
<400> 139
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca ggcagaggtg accaggagag gaagcgccta ctatatgtac 120
ctggacagga atgatgccgg cgaggccatc tccttcccaa ccacactggg catgaacaag 180
tgctacatcc agatcatgga cctgggccac atgtgcgatg ccaccatgtc ctatgagtgt 240
ccaatgctgg acgagggcgt ggagcccgac gatgtggatt gctggtgtaa taccacatct 300
acatgggtgg tgtacggcac ctgtcaccac aagaagggag aggcccggcg gagccggcgg 360
gccgtgacac tgccttccca ctctaccagg aagctgcaga cacgcagcca gacctggctg 420
gagtccagag agtataccaa gcacctgatc agggtggaga actggatctt tcgcaatcca 480
ggattcgcac tggcagcagc agcaatcgca tggctgctgg gaagctccac cagccagaaa 540
gtgatctacc tggtcatgat cctgctgatc gctcctgcct attctatccg gtgcatcggc 600
gtgagcaata gagacttcgt ggagggaatg tccggaggaa cctgggtgga tgtggtgctg 660
gagcacggcg gctgcgtgac agtgatggcc caggacaagc caaccgtgga tatcgagctg 720
gtgaccacaa ccgtgtccaa catggccgag gtgaggtctt actgctatga ggccagcatc 780
tccgacatgg cctctgatag caggtgtcca acccagggag aggcatacct ggacaagcag 840
tccgatacac agtacgtgtg caagcggacc ctggtggaca gaggctgggg caatggctgt 900
ggcctgtttg gcaagggctc tctggtgaca tgcgccaagt tcgcctgtag caagaagatg 960
accggcaagt ccatccagcc agagaacctg gagtaccgga tcatgctgtc tgtgcacggc 1020
tcccagcact ctggcatgat cgtgaacgac acaggccacg agacagatga gaatcgggcc 1080
aaggtggaga tcacacctaa ctctccaaga gccgaggcca ccctgggagg atttggctct 1140
ctgggcctgg actgcgagcc tagaacaggc ctggacttct ccgatctgta ctatctgacc 1200
atgaacaata agcactggct ggtgcacaag gagtggtttc acgacatccc actgccatgg 1260
cacgcaggag cagatacagg aacaccacac tggaacaata aggaggccct ggtggagttc 1320
aaggatgccc acgccaagcg gcagacagtg gtggtgctgg gcagccagga gggagcagtg 1380
cacaccgccc tggcaggcgc cctggaggca gagatggacg gagctaaggg cagactgtct 1440
agcggccacc tgaagtgcag gctgaagatg gataagctgc gcctgaaggg cgtgtcctac 1500
tctctgtgca cagccgcctt caccttcacc aagatccctg ccgagacact gcacggcaca 1560
gtgaccgtgg aggtgcagta tgccggcaca gacggaccct gtaaggtgcc tgcccagatg 1620
gccgtggata tgcagacact gacacctgtg ggcaggctga tcaccgccaa tccagtgatc 1680
acagagtcta ccgagaacag caagatgatg ctggagctgg acccaccatt tggcgatagc 1740
tatatcgtga tcggcgtggg cgagaagaag atcacacacc actggcaccg cagcggctcc 1800
acaatcggca aggcctttga ggcaaccgtg cgcggagcaa agagaatggc cgtgctgggc 1860
gacaccgcat gggatttcgg atctgtggga ggcgccctga acagcctggg caagggcatc 1920
caccagatct tcggcgccgc ctttaagtcc ctgtga 1956
<210> 140
<211> 651
<212> PRT
<213> Artificial sequence
<220>
<223> MVsp Zika_prME_no_Anchor (D2)
<400> 140
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln Ala Glu Val Thr Arg
20 25 30
Arg Gly Ser Ala Tyr Tyr Met Tyr Leu Asp Arg Asn Asp Ala Gly Glu
35 40 45
Ala Ile Ser Phe Pro Thr Thr Leu Gly Met Asn Lys Cys Tyr Ile Gln
50 55 60
Ile Met Asp Leu Gly His Met Cys Asp Ala Thr Met Ser Tyr Glu Cys
65 70 75 80
Pro Met Leu Asp Glu Gly Val Glu Pro Asp Asp Val Asp Cys Trp Cys
85 90 95
Asn Thr Thr Ser Thr Trp Val Val Tyr Gly Thr Cys His His Lys Lys
100 105 110
Gly Glu Ala Arg Arg Ser Arg Arg Ala Val Thr Leu Pro Ser His Ser
115 120 125
Thr Arg Lys Leu Gln Thr Arg Ser Gln Thr Trp Leu Glu Ser Arg Glu
130 135 140
Tyr Thr Lys His Leu Ile Arg Val Glu Asn Trp Ile Phe Arg Asn Pro
145 150 155 160
Gly Phe Ala Leu Ala Ala Ala Ala Ile Ala Trp Leu Leu Gly Ser Ser
165 170 175
Thr Ser Gln Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile Ala Pro
180 185 190
Ala Tyr Ser Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu
195 200 205
Gly Met Ser Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly
210 215 220
Cys Val Thr Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu
225 230 235 240
Val Thr Thr Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr
245 250 255
Glu Ala Ser Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln
260 265 270
Gly Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys
275 280 285
Arg Thr Leu Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly
290 295 300
Lys Gly Ser Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met
305 310 315 320
Thr Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu
325 330 335
Ser Val His Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly
340 345 350
His Glu Thr Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser
355 360 365
Pro Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp
370 375 380
Cys Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr
385 390 395 400
Met Asn Asn Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile
405 410 415
Pro Leu Pro Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn
420 425 430
Asn Lys Glu Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln
435 440 445
Thr Val Val Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu
450 455 460
Ala Gly Ala Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser
465 470 475 480
Ser Gly His Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys
485 490 495
Gly Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile
500 505 510
Pro Ala Glu Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala
515 520 525
Gly Thr Asp Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met
530 535 540
Gln Thr Leu Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile
545 550 555 560
Thr Glu Ser Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro
565 570 575
Phe Gly Asp Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr
580 585 590
His His Trp His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala
595 600 605
Thr Val Arg Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp
610 615 620
Asp Phe Gly Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile
625 630 635 640
His Gln Ile Phe Gly Ala Ala Phe Lys Ser Leu
645 650
<210> 141
<211> 1926
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of MVsp Zika_prME411 (D3)
<400> 141
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca agcggaggtc actagacgtg ggagtgcata ctatatgtac 120
ttggacagaa acgatgctgg ggaggccata tcttttccaa ccacattggg gatgaataag 180
tgttatatac agatcatgga tcttggacac atgtgtgatg ccaccatgag ctatgaatgc 240
cctatgctgg atgagggggt ggaaccagat gacgtcgatt gttggtgcaa cacgacgtca 300
acttgggttg tgtacggaac ctgccatcac aaaaaaggtg aagcacggag atctagaaga 360
gctgtgacgc tcccctccca ttccactagg aagctgcaaa cgcggtcgca aacctggttg 420
gaatcaagag aatacacaaa gcacttgatt agagtcgaaa attggatatt caggaaccct 480
ggcttcgcgt tagcagcagc tgccatcgct tggcttttgg gaagctcaac gagccaaaaa 540
gtcatatact tggtcatgat actgctgatt gccccggcat acagcatcag gtgcatagga 600
gtcagcaata gggactttgt ggaaggtatg tcaggtggga cttgggttga tgttgtcttg 660
gaacatggag gttgtgtcac cgtaatggca caggacaaac cgactgtcga catagagctg 720
gttacaacaa cagtcagcaa catggcggag gtaagatcct actgctatga ggcatcaata 780
tcagacatgg cttcggacag ccgctgccca acacaaggtg aagcctacct tgacaagcaa 840
tcagacactc aatatgtctg caaaagaacg ttagtggaca gaggctgggg aaatggatgt 900
ggactttttg gcaaagggag cctggtgaca tgcgctaagt ttgcatgctc caagaaaatg 960
accgggaaga gcatccagcc agagaatctg gagtaccgga taatgctgtc agttcatggc 1020
tcccagcaca gtgggatgat cgttaatgac acaggacatg aaactgatga gaatagagcg 1080
aaggttgaga taacgcccaa ttcaccaaga gccgaagcca ccctgggggg ttttggaagc 1140
ctaggacttg attgtgaacc gaggacaggc cttgactttt cagatttgta ttacttgact 1200
atgaataaca agcactggtt ggttcacaag gagtggttcc acgacattcc attaccttgg 1260
cacgctgggg cagacaccgg aactccacac tggaacaaca aagaagcact ggtagagttc 1320
aaggacgcac atgccaaaag gcaaactgtc gtggttctag ggagtcaaga aggagcagtt 1380
cacacggccc ttgctggagc tctggaggct gagatggatg gtgcaaaggg aaggctgtcc 1440
tctggccact tgaaatgtcg cctgaaaatg gataaactta gattgaaggg cgtgtcatac 1500
tccttgtgta ccgcagcgtt cacattcacc aagatcccgg ctgaaacact gcacgggaca 1560
gtcacagtgg aggtacagta cgcagggaca gatggacctt gcaaggttcc agctcagatg 1620
gcggtggaca tgcaaactct gaccccagtt gggaggttga taaccgctaa ccccgtaatc 1680
actgaaagca ctgagaactc taagatgatg ctggaacttg atccaccatt tggggactct 1740
tacattgtca taggagtcgg ggagaagaag atcacccacc actggcacag gagtggcagc 1800
accattggaa aagcatttga agccactgtg agaggtgcca agagaatggc agtcttggga 1860
gacacagcct gggactttgg atcagttgga ggcgctctca actcattggg caagggcatc 1920
tgatga 1926
<210> 142
<211> 1926
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of MVsp Zika_prME411 (D3)
<400> 142
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca ggcagaggtg accaggagag gaagcgccta ctatatgtac 120
ctggacagga atgatgccgg cgaggccatc tccttcccaa ccacactggg catgaacaag 180
tgctacatcc agatcatgga cctgggccac atgtgcgatg ccaccatgtc ctatgagtgt 240
ccaatgctgg acgagggcgt ggagcccgac gatgtggatt gctggtgtaa taccacatct 300
acatgggtgg tgtacggcac ctgtcaccac aagaagggag aggcccggcg gagccggcgg 360
gccgtgacac tgccttccca ctctaccagg aagctgcaga cacgcagcca gacctggctg 420
gagtccagag agtataccaa gcacctgatc agggtggaga actggatctt tcgcaatcca 480
ggattcgcac tggcagcagc agcaatcgca tggctgctgg gaagctccac cagccagaaa 540
gtgatctacc tggtcatgat cctgctgatc gctcctgcct attctatccg gtgcatcggc 600
gtgagcaata gagacttcgt ggagggaatg tccggaggaa cctgggtgga tgtggtgctg 660
gagcacggcg gctgcgtgac agtgatggcc caggacaagc caaccgtgga tatcgagctg 720
gtgaccacaa ccgtgtccaa catggccgag gtgaggtctt actgctatga ggccagcatc 780
tccgacatgg cctctgatag caggtgtcca acccagggag aggcatacct ggacaagcag 840
tccgatacac agtacgtgtg caagcggacc ctggtggaca gaggctgggg caatggctgt 900
ggcctgtttg gcaagggctc tctggtgaca tgcgccaagt tcgcctgtag caagaagatg 960
accggcaagt ccatccagcc agagaacctg gagtaccgga tcatgctgtc tgtgcacggc 1020
tcccagcact ctggcatgat cgtgaacgac acaggccacg agacagatga gaatcgggcc 1080
aaggtggaga tcacacctaa ctctccaaga gccgaggcca ccctgggagg atttggctct 1140
ctgggcctgg actgcgagcc tagaacaggc ctggacttct ccgatctgta ctatctgacc 1200
atgaacaata agcactggct ggtgcacaag gagtggtttc acgacatccc actgccatgg 1260
cacgcaggag cagatacagg aacaccacac tggaacaata aggaggccct ggtggagttc 1320
aaggatgccc acgccaagcg gcagacagtg gtggtgctgg gcagccagga gggagcagtg 1380
cacaccgccc tggcaggcgc cctggaggca gagatggacg gagctaaggg cagactgtct 1440
agcggccacc tgaagtgcag gctgaagatg gataagctgc gcctgaaggg cgtgtcctac 1500
tctctgtgca cagccgcctt caccttcacc aagatccctg ccgagacact gcacggcaca 1560
gtgaccgtgg aggtgcagta tgccggcaca gacggaccct gtaaggtgcc tgcccagatg 1620
gccgtggata tgcagacact gacacctgtg ggcaggctga tcaccgccaa tccagtgatc 1680
acagagtcta ccgagaacag caagatgatg ctggagctgg acccaccatt tggcgatagc 1740
tatatcgtga tcggcgtggg cgagaagaag atcacacacc actggcaccg cagcggctcc 1800
acaatcggca aggcctttga ggcaaccgtg cgcggagcaa agagaatggc cgtgctgggc 1860
gacaccgcat gggatttcgg atctgtggga ggcgccctga acagcctggg caagggcatc 1920
tgatga 1926
<210> 143
<211> 640
<212> PRT
<213> Artificial sequence
<220>
<223> MVsp Zika_prME411 (D3)
<400> 143
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln Ala Glu Val Thr Arg
20 25 30
Arg Gly Ser Ala Tyr Tyr Met Tyr Leu Asp Arg Asn Asp Ala Gly Glu
35 40 45
Ala Ile Ser Phe Pro Thr Thr Leu Gly Met Asn Lys Cys Tyr Ile Gln
50 55 60
Ile Met Asp Leu Gly His Met Cys Asp Ala Thr Met Ser Tyr Glu Cys
65 70 75 80
Pro Met Leu Asp Glu Gly Val Glu Pro Asp Asp Val Asp Cys Trp Cys
85 90 95
Asn Thr Thr Ser Thr Trp Val Val Tyr Gly Thr Cys His His Lys Lys
100 105 110
Gly Glu Ala Arg Arg Ser Arg Arg Ala Val Thr Leu Pro Ser His Ser
115 120 125
Thr Arg Lys Leu Gln Thr Arg Ser Gln Thr Trp Leu Glu Ser Arg Glu
130 135 140
Tyr Thr Lys His Leu Ile Arg Val Glu Asn Trp Ile Phe Arg Asn Pro
145 150 155 160
Gly Phe Ala Leu Ala Ala Ala Ala Ile Ala Trp Leu Leu Gly Ser Ser
165 170 175
Thr Ser Gln Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile Ala Pro
180 185 190
Ala Tyr Ser Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu
195 200 205
Gly Met Ser Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly
210 215 220
Cys Val Thr Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu
225 230 235 240
Val Thr Thr Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr
245 250 255
Glu Ala Ser Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln
260 265 270
Gly Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys
275 280 285
Arg Thr Leu Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly
290 295 300
Lys Gly Ser Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met
305 310 315 320
Thr Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu
325 330 335
Ser Val His Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly
340 345 350
His Glu Thr Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser
355 360 365
Pro Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp
370 375 380
Cys Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr
385 390 395 400
Met Asn Asn Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile
405 410 415
Pro Leu Pro Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn
420 425 430
Asn Lys Glu Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln
435 440 445
Thr Val Val Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu
450 455 460
Ala Gly Ala Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser
465 470 475 480
Ser Gly His Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys
485 490 495
Gly Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile
500 505 510
Pro Ala Glu Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala
515 520 525
Gly Thr Asp Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met
530 535 540
Gln Thr Leu Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile
545 550 555 560
Thr Glu Ser Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro
565 570 575
Phe Gly Asp Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr
580 585 590
His His Trp His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala
595 600 605
Thr Val Arg Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp
610 615 620
Asp Phe Gly Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile
625 630 635 640
<210> 144
<211> 1800
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of MVsp Zika_prME395 (D4)
<400> 144
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca agcggaggtc actagacgtg ggagtgcata ctatatgtac 120
ttggacagaa acgatgctgg ggaggccata tcttttccaa ccacattggg gatgaataag 180
tgttatatac agatcatgga tcttggacac atgtgtgatg ccaccatgag ctatgaatgc 240
cctatgctgg atgagggggt ggaaccagat gacgtcgatt gttggtgcaa cacgacgtca 300
acttgggttg tgtacggaac ctgccatcac aaaaaaggtg aagcacggag atctagaaga 360
gctgtgacgc tcccctccca ttccactagg aagctgcaaa cgcggtcgca aacctggttg 420
gaatcaagag aatacacaaa gcacttgatt agagtcgaaa attggatatt caggaaccct 480
ggcttcgcgt tagcagcagc tgccatcgct tggcttttgg gaagctcaac gagccaaaaa 540
gtcatatact tggtcatgat actgctgatt gccccggcat acagcatcag gtgcatagga 600
gtcagcaata gggactttgt ggaaggtatg tcaggtggga cttgggttga tgttgtcttg 660
gaacatggag gttgtgtcac cgtaatggca caggacaaac cgactgtcga catagagctg 720
gttacaacaa cagtcagcaa catggcggag gtaagatcct actgctatga ggcatcaata 780
tcagacatgg cttcggacag ccgctgccca acacaaggtg aagcctacct tgacaagcaa 840
tcagacactc aatatgtctg caaaagaacg ttagtggaca gaggctgggg aaatggatgt 900
ggactttttg gcaaagggag cctggtgaca tgcgctaagt ttgcatgctc caagaaaatg 960
accgggaaga gcatccagcc agagaatctg gagtaccgga taatgctgtc agttcatggc 1020
tcccagcaca gtgggatgat cgttaatgac acaggacatg aaactgatga gaatagagcg 1080
aaggttgaga taacgcccaa ttcaccaaga gccgaagcca ccctgggggg ttttggaagc 1140
ctaggacttg attgtgaacc gaggacaggc cttgactttt cagatttgta ttacttgact 1200
atgaataaca agcactggtt ggttcacaag gagtggttcc acgacattcc attaccttgg 1260
cacgctgggg cagacaccgg aactccacac tggaacaaca aagaagcact ggtagagttc 1320
aaggacgcac atgccaaaag gcaaactgtc gtggttctag ggagtcaaga aggagcagtt 1380
cacacggccc ttgctggagc tctggaggct gagatggatg gtgcaaaggg aaggctgtcc 1440
tctggccact tgaaatgtcg cctgaaaatg gataaactta gattgaaggg cgtgtcatac 1500
tccttgtgta ccgcagcgtt cacattcacc aagatcccgg ctgaaacact gcacgggaca 1560
gtcacagtgg aggtacagta cgcagggaca gatggacctt gcaaggttcc agctcagatg 1620
gcggtggaca tgcaaactct gaccccagtt gggaggttga taaccgctaa ccccgtaatc 1680
actgaaagca ctgagaactc taagatgatg ctggaacttg atccaccatt tggggactct 1740
tacattgtca taggagtcgg ggagaagaag atcacccacc actggcacag gagtggctga 1800
<210> 145
<211> 1800
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of MVsp Zika_prME395 (D4)
<400> 145
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca ggcagaggtg accaggagag gaagcgccta ctatatgtac 120
ctggacagga atgatgccgg cgaggccatc tccttcccaa ccacactggg catgaacaag 180
tgctacatcc agatcatgga cctgggccac atgtgcgatg ccaccatgtc ctatgagtgt 240
ccaatgctgg acgagggcgt ggagcccgac gatgtggatt gctggtgtaa taccacatct 300
acatgggtgg tgtacggcac ctgtcaccac aagaagggag aggcccggcg gagccggcgg 360
gccgtgacac tgccttccca ctctaccagg aagctgcaga cacgcagcca gacctggctg 420
gagtccagag agtataccaa gcacctgatc agggtggaga actggatctt tcgcaatcca 480
ggattcgcac tggcagcagc agcaatcgca tggctgctgg gaagctccac cagccagaaa 540
gtgatctacc tggtcatgat cctgctgatc gctcctgcct attctatccg gtgcatcggc 600
gtgagcaata gagacttcgt ggagggaatg tccggaggaa cctgggtgga tgtggtgctg 660
gagcacggcg gctgcgtgac agtgatggcc caggacaagc caaccgtgga tatcgagctg 720
gtgaccacaa ccgtgtccaa catggccgag gtgaggtctt actgctatga ggccagcatc 780
tccgacatgg cctctgatag caggtgtcca acccagggag aggcatacct ggacaagcag 840
tccgatacac agtacgtgtg caagcggacc ctggtggaca gaggctgggg caatggctgt 900
ggcctgtttg gcaagggctc tctggtgaca tgcgccaagt tcgcctgtag caagaagatg 960
accggcaagt ccatccagcc agagaacctg gagtaccgga tcatgctgtc tgtgcacggc 1020
tcccagcact ctggcatgat cgtgaacgac acaggccacg agacagatga gaatcgggcc 1080
aaggtggaga tcacacctaa ctctccaaga gccgaggcca ccctgggagg atttggctct 1140
ctgggcctgg actgcgagcc tagaacaggc ctggacttct ccgatctgta ctatctgacc 1200
atgaacaata agcactggct ggtgcacaag gagtggtttc acgacatccc actgccatgg 1260
cacgcaggag cagatacagg aacaccacac tggaacaata aggaggccct ggtggagttc 1320
aaggatgccc acgccaagcg gcagacagtg gtggtgctgg gcagccagga gggagcagtg 1380
cacaccgccc tggcaggcgc cctggaggca gagatggacg gagctaaggg cagactgtct 1440
agcggccacc tgaagtgcag gctgaagatg gataagctgc gcctgaaggg cgtgtcctac 1500
tctctgtgca cagccgcctt caccttcacc aagatccctg ccgagacact gcacggcaca 1560
gtgaccgtgg aggtgcagta tgccggcaca gacggaccct gtaaggtgcc tgcccagatg 1620
gccgtggata tgcagacact gacacctgtg ggcaggctga tcaccgccaa tccagtgatc 1680
acagagtcta ccgagaacag caagatgatg ctggagctgg acccaccatt tggcgatagc 1740
tatatcgtga tcggcgtggg cgagaagaag atcacacacc actggcaccg cagcggctga 1800
<210> 146
<211> 599
<212> PRT
<213> Artificial sequence
<220>
<223> MVsp Zika_prME395 (D4)
<400> 146
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln Ala Glu Val Thr Arg
20 25 30
Arg Gly Ser Ala Tyr Tyr Met Tyr Leu Asp Arg Asn Asp Ala Gly Glu
35 40 45
Ala Ile Ser Phe Pro Thr Thr Leu Gly Met Asn Lys Cys Tyr Ile Gln
50 55 60
Ile Met Asp Leu Gly His Met Cys Asp Ala Thr Met Ser Tyr Glu Cys
65 70 75 80
Pro Met Leu Asp Glu Gly Val Glu Pro Asp Asp Val Asp Cys Trp Cys
85 90 95
Asn Thr Thr Ser Thr Trp Val Val Tyr Gly Thr Cys His His Lys Lys
100 105 110
Gly Glu Ala Arg Arg Ser Arg Arg Ala Val Thr Leu Pro Ser His Ser
115 120 125
Thr Arg Lys Leu Gln Thr Arg Ser Gln Thr Trp Leu Glu Ser Arg Glu
130 135 140
Tyr Thr Lys His Leu Ile Arg Val Glu Asn Trp Ile Phe Arg Asn Pro
145 150 155 160
Gly Phe Ala Leu Ala Ala Ala Ala Ile Ala Trp Leu Leu Gly Ser Ser
165 170 175
Thr Ser Gln Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile Ala Pro
180 185 190
Ala Tyr Ser Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu
195 200 205
Gly Met Ser Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly
210 215 220
Cys Val Thr Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu
225 230 235 240
Val Thr Thr Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr
245 250 255
Glu Ala Ser Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln
260 265 270
Gly Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys
275 280 285
Arg Thr Leu Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly
290 295 300
Lys Gly Ser Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met
305 310 315 320
Thr Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu
325 330 335
Ser Val His Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly
340 345 350
His Glu Thr Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser
355 360 365
Pro Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp
370 375 380
Cys Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr
385 390 395 400
Met Asn Asn Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile
405 410 415
Pro Leu Pro Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn
420 425 430
Asn Lys Glu Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln
435 440 445
Thr Val Val Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu
450 455 460
Ala Gly Ala Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser
465 470 475 480
Ser Gly His Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys
485 490 495
Gly Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile
500 505 510
Pro Ala Glu Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala
515 520 525
Gly Thr Asp Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met
530 535 540
Gln Thr Leu Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile
545 550 555 560
Thr Glu Ser Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro
565 570 575
Phe Gly Asp Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr
580 585 590
His His Trp His Arg Ser Gly
595
<210> 147
<211> 1596
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of MVsp ZikaE (D5)
<400> 147
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca aatcaggtgc ataggagtca gcaataggga ctttgtggaa 120
ggtatgtcag gtgggacttg ggttgatgtt gtcttggaac atggaggttg tgtcaccgta 180
atggcacagg acaaaccgac tgtcgacata gagctggtta caacaacagt cagcaacatg 240
gcggaggtaa gatcctactg ctatgaggca tcaatatcag acatggcttc ggacagccgc 300
tgcccaacac aaggtgaagc ctaccttgac aagcaatcag acactcaata tgtctgcaaa 360
agaacgttag tggacagagg ctggggaaat ggatgtggac tttttggcaa agggagcctg 420
gtgacatgcg ctaagtttgc atgctccaag aaaatgaccg ggaagagcat ccagccagag 480
aatctggagt accggataat gctgtcagtt catggctccc agcacagtgg gatgatcgtt 540
aatgacacag gacatgaaac tgatgagaat agagcgaagg ttgagataac gcccaattca 600
ccaagagccg aagccaccct ggggggtttt ggaagcctag gacttgattg tgaaccgagg 660
acaggccttg acttttcaga tttgtattac ttgactatga ataacaagca ctggttggtt 720
cacaaggagt ggttccacga cattccatta ccttggcacg ctggggcaga caccggaact 780
ccacactgga acaacaaaga agcactggta gagttcaagg acgcacatgc caaaaggcaa 840
actgtcgtgg ttctagggag tcaagaagga gcagttcaca cggcccttgc tggagctctg 900
gaggctgaga tggatggtgc aaagggaagg ctgtcctctg gccacttgaa atgtcgcctg 960
aaaatggata aacttagatt gaagggcgtg tcatactcct tgtgtaccgc agcgttcaca 1020
ttcaccaaga tcccggctga aacactgcac gggacagtca cagtggaggt acagtacgca 1080
gggacagatg gaccttgcaa ggttccagct cagatggcgg tggacatgca aactctgacc 1140
ccagttggga ggttgataac cgctaacccc gtaatcactg aaagcactga gaactctaag 1200
atgatgctgg aacttgatcc accatttggg gactcttaca ttgtcatagg agtcggggag 1260
aagaagatca cccaccactg gcacaggagt ggcagcacca ttggaaaagc atttgaagcc 1320
actgtgagag gtgccaagag aatggcagtc ttgggagaca cagcctggga ctttggatca 1380
gttggaggcg ctctcaactc attgggcaag ggcatccatc aaatttttgg agcagctttc 1440
aaatcattgt ttggaggaat gtcctggttc tcacaaattc tcattggaac gttgctgatg 1500
tggttgggtc tgaacacaaa gaatggatct atttccctta tgtgcttggc cttaggggga 1560
gtgttgatct tcttatccac agccgtctct gcttga 1596
<210> 148
<211> 1596
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of MVsp ZikaE (D5)
<400> 148
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca gatccggtgc atcggcgtga gcaatagaga cttcgtggag 120
ggaatgtccg gaggaacctg ggtggatgtg gtgctggagc acggcggctg cgtgacagtg 180
atggcccagg acaagccaac cgtggatatc gagctggtga ccacaaccgt gtccaacatg 240
gccgaggtga ggtcttactg ctatgaggcc agcatctccg acatggcctc tgatagcagg 300
tgtccaaccc agggagaggc atacctggac aagcagtccg atacacagta cgtgtgcaag 360
cggaccctgg tggacagagg ctggggcaat ggctgtggcc tgtttggcaa gggctctctg 420
gtgacatgcg ccaagttcgc ctgtagcaag aagatgaccg gcaagtccat ccagccagag 480
aacctggagt accggatcat gctgtctgtg cacggctccc agcactctgg catgatcgtg 540
aacgacacag gccacgagac agatgagaat cgggccaagg tggagatcac acctaactct 600
ccaagagccg aggccaccct gggaggattt ggctctctgg gcctggactg cgagcctaga 660
acaggcctgg acttctccga tctgtactat ctgaccatga acaataagca ctggctggtg 720
cacaaggagt ggtttcacga catcccactg ccatggcacg caggagcaga tacaggaaca 780
ccacactgga acaataagga ggccctggtg gagttcaagg atgcccacgc caagcggcag 840
acagtggtgg tgctgggcag ccaggaggga gcagtgcaca ccgccctggc aggcgccctg 900
gaggcagaga tggacggagc taagggcaga ctgtctagcg gccacctgaa gtgcaggctg 960
aagatggata agctgcgcct gaagggcgtg tcctactctc tgtgcacagc cgccttcacc 1020
ttcaccaaga tccctgccga gacactgcac ggcacagtga ccgtggaggt gcagtatgcc 1080
ggcacagacg gaccctgtaa ggtgcctgcc cagatggccg tggatatgca gacactgaca 1140
cctgtgggca ggctgatcac cgccaatcca gtgatcacag agtctaccga gaacagcaag 1200
atgatgctgg agctggaccc accatttggc gatagctata tcgtgatcgg cgtgggcgag 1260
aagaagatca cacaccactg gcaccgcagc ggctccacaa tcggcaaggc ctttgaggca 1320
accgtgcgcg gagcaaagag aatggccgtg ctgggcgaca ccgcatggga tttcggatct 1380
gtgggaggcg ccctgaacag cctgggcaag ggcatccacc agatcttcgg cgccgccttt 1440
aagtccctgt tcggcggcat gagctggttc tcacagatcc tgatcggcac actgctgatg 1500
tggctgggcc tgaacaccaa gaatggctct atcagcctga tgtgcctggc cctgggaggc 1560
gtgctgatct tcctgtccac cgccgtgtct gcctga 1596
<210> 149
<211> 531
<212> PRT
<213> Artificial sequence
<220>
<223> MVsp ZikaE (D5)
<400> 149
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln Ile Arg Cys Ile Gly
20 25 30
Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser Gly Gly Thr Trp Val
35 40 45
Asp Val Val Leu Glu His Gly Gly Cys Val Thr Val Met Ala Gln Asp
50 55 60
Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr Thr Val Ser Asn Met
65 70 75 80
Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser Ile Ser Asp Met Ala
85 90 95
Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala Tyr Leu Asp Lys Gln
100 105 110
Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu Val Asp Arg Gly Trp
115 120 125
Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser Leu Val Thr Cys Ala
130 135 140
Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys Ser Ile Gln Pro Glu
145 150 155 160
Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His Gly Ser Gln His Ser
165 170 175
Gly Met Ile Val Asn Asp Thr Gly His Glu Thr Asp Glu Asn Arg Ala
180 185 190
Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala Glu Ala Thr Leu Gly
195 200 205
Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro Arg Thr Gly Leu Asp
210 215 220
Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn Lys His Trp Leu Val
225 230 235 240
His Lys Glu Trp Phe His Asp Ile Pro Leu Pro Trp His Ala Gly Ala
245 250 255
Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu Ala Leu Val Glu Phe
260 265 270
Lys Asp Ala His Ala Lys Arg Gln Thr Val Val Val Leu Gly Ser Gln
275 280 285
Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala Leu Glu Ala Glu Met
290 295 300
Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His Leu Lys Cys Arg Leu
305 310 315 320
Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser Tyr Ser Leu Cys Thr
325 330 335
Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu Thr Leu His Gly Thr
340 345 350
Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys Lys Val
355 360 365
Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu Thr Pro Val Gly Arg
370 375 380
Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser Thr Glu Asn Ser Lys
385 390 395 400
Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile Val Ile
405 410 415
Gly Val Gly Glu Lys Lys Ile Thr His His Trp His Arg Ser Gly Ser
420 425 430
Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg Gly Ala Lys Arg Met
435 440 445
Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly Ser Val Gly Gly Ala
450 455 460
Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile Phe Gly Ala Ala Phe
465 470 475 480
Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser Gln Ile Leu Ile Gly
485 490 495
Thr Leu Leu Met Trp Leu Gly Leu Asn Thr Lys Asn Gly Ser Ile Ser
500 505 510
Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile Phe Leu Ser Thr Ala
515 520 525
Val Ser Ala
530
<210> 150
<211> 1452
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of MVsp ZikaE_no_Anchor (D6)
<400> 150
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca aatcaggtgc ataggagtca gcaataggga ctttgtggaa 120
ggtatgtcag gtgggacttg ggttgatgtt gtcttggaac atggaggttg tgtcaccgta 180
atggcacagg acaaaccgac tgtcgacata gagctggtta caacaacagt cagcaacatg 240
gcggaggtaa gatcctactg ctatgaggca tcaatatcag acatggcttc ggacagccgc 300
tgcccaacac aaggtgaagc ctaccttgac aagcaatcag acactcaata tgtctgcaaa 360
agaacgttag tggacagagg ctggggaaat ggatgtggac tttttggcaa agggagcctg 420
gtgacatgcg ctaagtttgc atgctccaag aaaatgaccg ggaagagcat ccagccagag 480
aatctggagt accggataat gctgtcagtt catggctccc agcacagtgg gatgatcgtt 540
aatgacacag gacatgaaac tgatgagaat agagcgaagg ttgagataac gcccaattca 600
ccaagagccg aagccaccct ggggggtttt ggaagcctag gacttgattg tgaaccgagg 660
acaggccttg acttttcaga tttgtattac ttgactatga ataacaagca ctggttggtt 720
cacaaggagt ggttccacga cattccatta ccttggcacg ctggggcaga caccggaact 780
ccacactgga acaacaaaga agcactggta gagttcaagg acgcacatgc caaaaggcaa 840
actgtcgtgg ttctagggag tcaagaagga gcagttcaca cggcccttgc tggagctctg 900
gaggctgaga tggatggtgc aaagggaagg ctgtcctctg gccacttgaa atgtcgcctg 960
aaaatggata aacttagatt gaagggcgtg tcatactcct tgtgtaccgc agcgttcaca 1020
ttcaccaaga tcccggctga aacactgcac gggacagtca cagtggaggt acagtacgca 1080
gggacagatg gaccttgcaa ggttccagct cagatggcgg tggacatgca aactctgacc 1140
ccagttggga ggttgataac cgctaacccc gtaatcactg aaagcactga gaactctaag 1200
atgatgctgg aacttgatcc accatttggg gactcttaca ttgtcatagg agtcggggag 1260
aagaagatca cccaccactg gcacaggagt ggcagcacca ttggaaaagc atttgaagcc 1320
actgtgagag gtgccaagag aatggcagtc ttgggagaca cagcctggga ctttggatca 1380
gttggaggcg ctctcaactc attgggcaag ggcatccatc aaatttttgg agcagctttc 1440
aaatcattgt ga 1452
<210> 151
<211> 1452
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of MVsp ZikaE_no_Anchor (D6)
<400> 151
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca gatccggtgc atcggcgtga gcaatagaga cttcgtggag 120
ggaatgtccg gaggaacctg ggtggatgtg gtgctggagc acggcggctg cgtgacagtg 180
atggcccagg acaagccaac cgtggatatc gagctggtga ccacaaccgt gtccaacatg 240
gccgaggtga ggtcttactg ctatgaggcc agcatctccg acatggcctc tgatagcagg 300
tgtccaaccc agggagaggc atacctggac aagcagtccg atacacagta cgtgtgcaag 360
cggaccctgg tggacagagg ctggggcaat ggctgtggcc tgtttggcaa gggctctctg 420
gtgacatgcg ccaagttcgc ctgtagcaag aagatgaccg gcaagtccat ccagccagag 480
aacctggagt accggatcat gctgtctgtg cacggctccc agcactctgg catgatcgtg 540
aacgacacag gccacgagac agatgagaat cgggccaagg tggagatcac acctaactct 600
ccaagagccg aggccaccct gggaggattt ggctctctgg gcctggactg cgagcctaga 660
acaggcctgg acttctccga tctgtactat ctgaccatga acaataagca ctggctggtg 720
cacaaggagt ggtttcacga catcccactg ccatggcacg caggagcaga tacaggaaca 780
ccacactgga acaataagga ggccctggtg gagttcaagg atgcccacgc caagcggcag 840
acagtggtgg tgctgggcag ccaggaggga gcagtgcaca ccgccctggc aggcgccctg 900
gaggcagaga tggacggagc taagggcaga ctgtctagcg gccacctgaa gtgcaggctg 960
aagatggata agctgcgcct gaagggcgtg tcctactctc tgtgcacagc cgccttcacc 1020
ttcaccaaga tccctgccga gacactgcac ggcacagtga ccgtggaggt gcagtatgcc 1080
ggcacagacg gaccctgtaa ggtgcctgcc cagatggccg tggatatgca gacactgaca 1140
cctgtgggca ggctgatcac cgccaatcca gtgatcacag agtctaccga gaacagcaag 1200
atgatgctgg agctggaccc accatttggc gatagctata tcgtgatcgg cgtgggcgag 1260
aagaagatca cacaccactg gcaccgcagc ggctccacaa tcggcaaggc ctttgaggca 1320
accgtgcgcg gagcaaagag aatggccgtg ctgggcgaca ccgcatggga tttcggatct 1380
gtgggaggcg ccctgaacag cctgggcaag ggcatccacc agatcttcgg cgccgccttt 1440
aagtccctgt ga 1452
<210> 152
<211> 483
<212> PRT
<213> Artificial sequence
<220>
<223> MVsp ZikaE_no_Anchor (D6)
<400> 152
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln Ile Arg Cys Ile Gly
20 25 30
Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser Gly Gly Thr Trp Val
35 40 45
Asp Val Val Leu Glu His Gly Gly Cys Val Thr Val Met Ala Gln Asp
50 55 60
Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr Thr Val Ser Asn Met
65 70 75 80
Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser Ile Ser Asp Met Ala
85 90 95
Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala Tyr Leu Asp Lys Gln
100 105 110
Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu Val Asp Arg Gly Trp
115 120 125
Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser Leu Val Thr Cys Ala
130 135 140
Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys Ser Ile Gln Pro Glu
145 150 155 160
Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His Gly Ser Gln His Ser
165 170 175
Gly Met Ile Val Asn Asp Thr Gly His Glu Thr Asp Glu Asn Arg Ala
180 185 190
Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala Glu Ala Thr Leu Gly
195 200 205
Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro Arg Thr Gly Leu Asp
210 215 220
Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn Lys His Trp Leu Val
225 230 235 240
His Lys Glu Trp Phe His Asp Ile Pro Leu Pro Trp His Ala Gly Ala
245 250 255
Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu Ala Leu Val Glu Phe
260 265 270
Lys Asp Ala His Ala Lys Arg Gln Thr Val Val Val Leu Gly Ser Gln
275 280 285
Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala Leu Glu Ala Glu Met
290 295 300
Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His Leu Lys Cys Arg Leu
305 310 315 320
Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser Tyr Ser Leu Cys Thr
325 330 335
Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu Thr Leu His Gly Thr
340 345 350
Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys Lys Val
355 360 365
Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu Thr Pro Val Gly Arg
370 375 380
Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser Thr Glu Asn Ser Lys
385 390 395 400
Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile Val Ile
405 410 415
Gly Val Gly Glu Lys Lys Ile Thr His His Trp His Arg Ser Gly Ser
420 425 430
Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg Gly Ala Lys Arg Met
435 440 445
Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly Ser Val Gly Gly Ala
450 455 460
Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile Phe Gly Ala Ala Phe
465 470 475 480
Lys Ser Leu
<210> 153
<211> 1422
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of MVsp ZikaE411 (D7)
<400> 153
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca aatcaggtgc ataggagtca gcaataggga ctttgtggaa 120
ggtatgtcag gtgggacttg ggttgatgtt gtcttggaac atggaggttg tgtcaccgta 180
atggcacagg acaaaccgac tgtcgacata gagctggtta caacaacagt cagcaacatg 240
gcggaggtaa gatcctactg ctatgaggca tcaatatcag acatggcttc ggacagccgc 300
tgcccaacac aaggtgaagc ctaccttgac aagcaatcag acactcaata tgtctgcaaa 360
agaacgttag tggacagagg ctggggaaat ggatgtggac tttttggcaa agggagcctg 420
gtgacatgcg ctaagtttgc atgctccaag aaaatgaccg ggaagagcat ccagccagag 480
aatctggagt accggataat gctgtcagtt catggctccc agcacagtgg gatgatcgtt 540
aatgacacag gacatgaaac tgatgagaat agagcgaagg ttgagataac gcccaattca 600
ccaagagccg aagccaccct ggggggtttt ggaagcctag gacttgattg tgaaccgagg 660
acaggccttg acttttcaga tttgtattac ttgactatga ataacaagca ctggttggtt 720
cacaaggagt ggttccacga cattccatta ccttggcacg ctggggcaga caccggaact 780
ccacactgga acaacaaaga agcactggta gagttcaagg acgcacatgc caaaaggcaa 840
actgtcgtgg ttctagggag tcaagaagga gcagttcaca cggcccttgc tggagctctg 900
gaggctgaga tggatggtgc aaagggaagg ctgtcctctg gccacttgaa atgtcgcctg 960
aaaatggata aacttagatt gaagggcgtg tcatactcct tgtgtaccgc agcgttcaca 1020
ttcaccaaga tcccggctga aacactgcac gggacagtca cagtggaggt acagtacgca 1080
gggacagatg gaccttgcaa ggttccagct cagatggcgg tggacatgca aactctgacc 1140
ccagttggga ggttgataac cgctaacccc gtaatcactg aaagcactga gaactctaag 1200
atgatgctgg aacttgatcc accatttggg gactcttaca ttgtcatagg agtcggggag 1260
aagaagatca cccaccactg gcacaggagt ggcagcacca ttggaaaagc atttgaagcc 1320
actgtgagag gtgccaagag aatggcagtc ttgggagaca cagcctggga ctttggatca 1380
gttggaggcg ctctcaactc attgggcaag ggcatctgat ga 1422
<210> 154
<211> 1422
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of MVsp ZikaE411 (D7)
<400> 154
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca gatccggtgc atcggcgtga gcaatagaga cttcgtggag 120
ggaatgtccg gaggaacctg ggtggatgtg gtgctggagc acggcggctg cgtgacagtg 180
atggcccagg acaagccaac cgtggatatc gagctggtga ccacaaccgt gtccaacatg 240
gccgaggtga ggtcttactg ctatgaggcc agcatctccg acatggcctc tgatagcagg 300
tgtccaaccc agggagaggc atacctggac aagcagtccg atacacagta cgtgtgcaag 360
cggaccctgg tggacagagg ctggggcaat ggctgtggcc tgtttggcaa gggctctctg 420
gtgacatgcg ccaagttcgc ctgtagcaag aagatgaccg gcaagtccat ccagccagag 480
aacctggagt accggatcat gctgtctgtg cacggctccc agcactctgg catgatcgtg 540
aacgacacag gccacgagac agatgagaat cgggccaagg tggagatcac acctaactct 600
ccaagagccg aggccaccct gggaggattt ggctctctgg gcctggactg cgagcctaga 660
acaggcctgg acttctccga tctgtactat ctgaccatga acaataagca ctggctggtg 720
cacaaggagt ggtttcacga catcccactg ccatggcacg caggagcaga tacaggaaca 780
ccacactgga acaataagga ggccctggtg gagttcaagg atgcccacgc caagcggcag 840
acagtggtgg tgctgggcag ccaggaggga gcagtgcaca ccgccctggc aggcgccctg 900
gaggcagaga tggacggagc taagggcaga ctgtctagcg gccacctgaa gtgcaggctg 960
aagatggata agctgcgcct gaagggcgtg tcctactctc tgtgcacagc cgccttcacc 1020
ttcaccaaga tccctgccga gacactgcac ggcacagtga ccgtggaggt gcagtatgcc 1080
ggcacagacg gaccctgtaa ggtgcctgcc cagatggccg tggatatgca gacactgaca 1140
cctgtgggca ggctgatcac cgccaatcca gtgatcacag agtctaccga gaacagcaag 1200
atgatgctgg agctggaccc accatttggc gatagctata tcgtgatcgg cgtgggcgag 1260
aagaagatca cacaccactg gcaccgcagc ggctccacaa tcggcaaggc ctttgaggca 1320
accgtgcgcg gagcaaagag aatggccgtg ctgggcgaca ccgcatggga tttcggatct 1380
gtgggaggcg ccctgaacag cctgggcaag ggcatctgat ga 1422
<210> 155
<211> 472
<212> PRT
<213> Artificial sequence
<220>
<223> MVsp ZikaE411 (D7)
<400> 155
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln Ile Arg Cys Ile Gly
20 25 30
Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser Gly Gly Thr Trp Val
35 40 45
Asp Val Val Leu Glu His Gly Gly Cys Val Thr Val Met Ala Gln Asp
50 55 60
Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr Thr Val Ser Asn Met
65 70 75 80
Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser Ile Ser Asp Met Ala
85 90 95
Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala Tyr Leu Asp Lys Gln
100 105 110
Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu Val Asp Arg Gly Trp
115 120 125
Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser Leu Val Thr Cys Ala
130 135 140
Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys Ser Ile Gln Pro Glu
145 150 155 160
Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His Gly Ser Gln His Ser
165 170 175
Gly Met Ile Val Asn Asp Thr Gly His Glu Thr Asp Glu Asn Arg Ala
180 185 190
Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala Glu Ala Thr Leu Gly
195 200 205
Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro Arg Thr Gly Leu Asp
210 215 220
Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn Lys His Trp Leu Val
225 230 235 240
His Lys Glu Trp Phe His Asp Ile Pro Leu Pro Trp His Ala Gly Ala
245 250 255
Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu Ala Leu Val Glu Phe
260 265 270
Lys Asp Ala His Ala Lys Arg Gln Thr Val Val Val Leu Gly Ser Gln
275 280 285
Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala Leu Glu Ala Glu Met
290 295 300
Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His Leu Lys Cys Arg Leu
305 310 315 320
Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser Tyr Ser Leu Cys Thr
325 330 335
Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu Thr Leu His Gly Thr
340 345 350
Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys Lys Val
355 360 365
Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu Thr Pro Val Gly Arg
370 375 380
Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser Thr Glu Asn Ser Lys
385 390 395 400
Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile Val Ile
405 410 415
Gly Val Gly Glu Lys Lys Ile Thr His His Trp His Arg Ser Gly Ser
420 425 430
Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg Gly Ala Lys Arg Met
435 440 445
Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly Ser Val Gly Gly Ala
450 455 460
Leu Asn Ser Leu Gly Lys Gly Ile
465 470
<210> 156
<211> 1296
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of MVsp ZikaE395 (D8)
<400> 156
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca aatcaggtgc ataggagtca gcaataggga ctttgtggaa 120
ggtatgtcag gtgggacttg ggttgatgtt gtcttggaac atggaggttg tgtcaccgta 180
atggcacagg acaaaccgac tgtcgacata gagctggtta caacaacagt cagcaacatg 240
gcggaggtaa gatcctactg ctatgaggca tcaatatcag acatggcttc ggacagccgc 300
tgcccaacac aaggtgaagc ctaccttgac aagcaatcag acactcaata tgtctgcaaa 360
agaacgttag tggacagagg ctggggaaat ggatgtggac tttttggcaa agggagcctg 420
gtgacatgcg ctaagtttgc atgctccaag aaaatgaccg ggaagagcat ccagccagag 480
aatctggagt accggataat gctgtcagtt catggctccc agcacagtgg gatgatcgtt 540
aatgacacag gacatgaaac tgatgagaat agagcgaagg ttgagataac gcccaattca 600
ccaagagccg aagccaccct ggggggtttt ggaagcctag gacttgattg tgaaccgagg 660
acaggccttg acttttcaga tttgtattac ttgactatga ataacaagca ctggttggtt 720
cacaaggagt ggttccacga cattccatta ccttggcacg ctggggcaga caccggaact 780
ccacactgga acaacaaaga agcactggta gagttcaagg acgcacatgc caaaaggcaa 840
actgtcgtgg ttctagggag tcaagaagga gcagttcaca cggcccttgc tggagctctg 900
gaggctgaga tggatggtgc aaagggaagg ctgtcctctg gccacttgaa atgtcgcctg 960
aaaatggata aacttagatt gaagggcgtg tcatactcct tgtgtaccgc agcgttcaca 1020
ttcaccaaga tcccggctga aacactgcac gggacagtca cagtggaggt acagtacgca 1080
gggacagatg gaccttgcaa ggttccagct cagatggcgg tggacatgca aactctgacc 1140
ccagttggga ggttgataac cgctaacccc gtaatcactg aaagcactga gaactctaag 1200
atgatgctgg aacttgatcc accatttggg gactcttaca ttgtcatagg agtcggggag 1260
aagaagatca cccaccactg gcacaggagt ggctga 1296
<210> 157
<211> 1296
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of MVsp ZikaE395 (D8)
<400> 157
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca gatccggtgc atcggcgtga gcaatagaga cttcgtggag 120
ggaatgtccg gaggaacctg ggtggatgtg gtgctggagc acggcggctg cgtgacagtg 180
atggcccagg acaagccaac cgtggatatc gagctggtga ccacaaccgt gtccaacatg 240
gccgaggtga ggtcttactg ctatgaggcc agcatctccg acatggcctc tgatagcagg 300
tgtccaaccc agggagaggc atacctggac aagcagtccg atacacagta cgtgtgcaag 360
cggaccctgg tggacagagg ctggggcaat ggctgtggcc tgtttggcaa gggctctctg 420
gtgacatgcg ccaagttcgc ctgtagcaag aagatgaccg gcaagtccat ccagccagag 480
aacctggagt accggatcat gctgtctgtg cacggctccc agcactctgg catgatcgtg 540
aacgacacag gccacgagac agatgagaat cgggccaagg tggagatcac acctaactct 600
ccaagagccg aggccaccct gggaggattt ggctctctgg gcctggactg cgagcctaga 660
acaggcctgg acttctccga tctgtactat ctgaccatga acaataagca ctggctggtg 720
cacaaggagt ggtttcacga catcccactg ccatggcacg caggagcaga tacaggaaca 780
ccacactgga acaataagga ggccctggtg gagttcaagg atgcccacgc caagcggcag 840
acagtggtgg tgctgggcag ccaggaggga gcagtgcaca ccgccctggc aggcgccctg 900
gaggcagaga tggacggagc taagggcaga ctgtctagcg gccacctgaa gtgcaggctg 960
aagatggata agctgcgcct gaagggcgtg tcctactctc tgtgcacagc cgccttcacc 1020
ttcaccaaga tccctgccga gacactgcac ggcacagtga ccgtggaggt gcagtatgcc 1080
ggcacagacg gaccctgtaa ggtgcctgcc cagatggccg tggatatgca gacactgaca 1140
cctgtgggca ggctgatcac cgccaatcca gtgatcacag agtctaccga gaacagcaag 1200
atgatgctgg agctggaccc accatttggc gatagctata tcgtgatcgg cgtgggcgag 1260
aagaagatca cacaccactg gcaccgcagc ggctga 1296
<210> 158
<211> 431
<212> PRT
<213> Artificial sequence
<220>
<223> MVsp ZikaE395 (D8)
<400> 158
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln Ile Arg Cys Ile Gly
20 25 30
Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser Gly Gly Thr Trp Val
35 40 45
Asp Val Val Leu Glu His Gly Gly Cys Val Thr Val Met Ala Gln Asp
50 55 60
Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr Thr Val Ser Asn Met
65 70 75 80
Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser Ile Ser Asp Met Ala
85 90 95
Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala Tyr Leu Asp Lys Gln
100 105 110
Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu Val Asp Arg Gly Trp
115 120 125
Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser Leu Val Thr Cys Ala
130 135 140
Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys Ser Ile Gln Pro Glu
145 150 155 160
Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His Gly Ser Gln His Ser
165 170 175
Gly Met Ile Val Asn Asp Thr Gly His Glu Thr Asp Glu Asn Arg Ala
180 185 190
Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala Glu Ala Thr Leu Gly
195 200 205
Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro Arg Thr Gly Leu Asp
210 215 220
Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn Lys His Trp Leu Val
225 230 235 240
His Lys Glu Trp Phe His Asp Ile Pro Leu Pro Trp His Ala Gly Ala
245 250 255
Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu Ala Leu Val Glu Phe
260 265 270
Lys Asp Ala His Ala Lys Arg Gln Thr Val Val Val Leu Gly Ser Gln
275 280 285
Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala Leu Glu Ala Glu Met
290 295 300
Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His Leu Lys Cys Arg Leu
305 310 315 320
Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser Tyr Ser Leu Cys Thr
325 330 335
Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu Thr Leu His Gly Thr
340 345 350
Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys Lys Val
355 360 365
Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu Thr Pro Val Gly Arg
370 375 380
Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser Thr Glu Asn Ser Lys
385 390 395 400
Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile Val Ile
405 410 415
Gly Val Gly Glu Lys Lys Ile Thr His His Trp His Arg Ser Gly
420 425 430
<210> 159
<211> 2172
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of MVsp ZikaprME_MVTMintercyto (D9)
<400> 159
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca agcggaggtc actagacgtg ggagtgcata ctatatgtac 120
ttggacagaa acgatgctgg ggaggccata tcttttccaa ccacattggg gatgaataag 180
tgttatatac agatcatgga tcttggacac atgtgtgatg ccaccatgag ctatgaatgc 240
cctatgctgg atgagggggt ggaaccagat gacgtcgatt gttggtgcaa cacgacgtca 300
acttgggttg tgtacggaac ctgccatcac aaaaaaggtg aagcacggag atctagaaga 360
gctgtgacgc tcccctccca ttccactagg aagctgcaaa cgcggtcgca aacctggttg 420
gaatcaagag aatacacaaa gcacttgatt agagtcgaaa attggatatt caggaaccct 480
ggcttcgcgt tagcagcagc tgccatcgct tggcttttgg gaagctcaac gagccaaaaa 540
gtcatatact tggtcatgat actgctgatt gccccggcat acagcatcag gtgcatagga 600
gtcagcaata gggactttgt ggaaggtatg tcaggtggga cttgggttga tgttgtcttg 660
gaacatggag gttgtgtcac cgtaatggca caggacaaac cgactgtcga catagagctg 720
gttacaacaa cagtcagcaa catggcggag gtaagatcct actgctatga ggcatcaata 780
tcagacatgg cttcggacag ccgctgccca acacaaggtg aagcctacct tgacaagcaa 840
tcagacactc aatatgtctg caaaagaacg ttagtggaca gaggctgggg aaatggatgt 900
ggactttttg gcaaagggag cctggtgaca tgcgctaagt ttgcatgctc caagaaaatg 960
accgggaaga gcatccagcc agagaatctg gagtaccgga taatgctgtc agttcatggc 1020
tcccagcaca gtgggatgat cgttaatgac acaggacatg aaactgatga gaatagagcg 1080
aaggttgaga taacgcccaa ttcaccaaga gccgaagcca ccctgggggg ttttggaagc 1140
ctaggacttg attgtgaacc gaggacaggc cttgactttt cagatttgta ttacttgact 1200
atgaataaca agcactggtt ggttcacaag gagtggttcc acgacattcc attaccttgg 1260
cacgctgggg cagacaccgg aactccacac tggaacaaca aagaagcact ggtagagttc 1320
aaggacgcac atgccaaaag gcaaactgtc gtggttctag ggagtcaaga aggagcagtt 1380
cacacggccc ttgctggagc tctggaggct gagatggatg gtgcaaaggg aaggctgtcc 1440
tctggccact tgaaatgtcg cctgaaaatg gataaactta gattgaaggg cgtgtcatac 1500
tccttgtgta ccgcagcgtt cacattcacc aagatcccgg ctgaaacact gcacgggaca 1560
gtcacagtgg aggtacagta cgcagggaca gatggacctt gcaaggttcc agctcagatg 1620
gcggtggaca tgcaaactct gaccccagtt gggaggttga taaccgctaa ccccgtaatc 1680
actgaaagca ctgagaactc taagatgatg ctggaacttg atccaccatt tggggactct 1740
tacattgtca taggagtcgg ggagaagaag atcacccacc actggcacag gagtggcagc 1800
accattggaa aagcatttga agccactgtg agaggtgcca agagaatggc agtcttggga 1860
gacacagcct gggactttgg atcagttgga ggcgctctca actcattggg caagggcatc 1920
catcaaattt ttggagcagc tttcaaatca ttgtttggag gaatgtcctg gttctcaatg 1980
aaaggtttat cgagcactag catagtctac atcctgattg cagtgtgtct tggagggttg 2040
atagggatcc ccgctttaat atgttgctgc agggggcgtt gtaacaaaaa gggagaacaa 2100
gttggtatgt caagaccagg cctaaagcct gatcttacgg gaacatcaaa atcctatgta 2160
aggtcgctct ga 2172
<210> 160
<211> 2172
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of MVsp ZikaprME_MVTMintracyto (D9)
<400> 160
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca ggcagaggtg accaggagag gaagcgccta ctatatgtac 120
ctggacagga atgatgccgg cgaggccatc tccttcccaa ccacactggg catgaacaag 180
tgctacatcc agatcatgga cctgggccac atgtgcgatg ccaccatgtc ctatgagtgt 240
ccaatgctgg acgagggcgt ggagcccgac gatgtggatt gctggtgtaa taccacatct 300
acatgggtgg tgtacggcac ctgtcaccac aagaagggag aggcccggcg gagccggcgg 360
gccgtgacac tgccttccca ctctaccagg aagctgcaga cacgcagcca gacctggctg 420
gagtccagag agtataccaa gcacctgatc agggtggaga actggatctt tcgcaatcca 480
ggattcgcac tggcagcagc agcaatcgca tggctgctgg gaagctccac cagccagaaa 540
gtgatctacc tggtcatgat cctgctgatc gctcctgcct attctatccg gtgcatcggc 600
gtgagcaata gagacttcgt ggagggaatg tccggaggaa cctgggtgga tgtggtgctg 660
gagcacggcg gctgcgtgac agtgatggcc caggacaagc caaccgtgga tatcgagctg 720
gtgaccacaa ccgtgtccaa catggccgag gtgaggtctt actgctatga ggccagcatc 780
tccgacatgg cctctgatag caggtgtcca acccagggag aggcatacct ggacaagcag 840
tccgatacac agtacgtgtg caagcggacc ctggtggaca gaggctgggg caatggctgt 900
ggcctgtttg gcaagggctc tctggtgaca tgcgccaagt tcgcctgtag caagaagatg 960
accggcaagt ccatccagcc agagaacctg gagtaccgga tcatgctgtc tgtgcacggc 1020
tcccagcact ctggcatgat cgtgaacgac acaggccacg agacagatga gaatcgggcc 1080
aaggtggaga tcacacctaa ctctccaaga gccgaggcca ccctgggagg atttggctct 1140
ctgggcctgg actgcgagcc tagaacaggc ctggacttct ccgatctgta ctatctgacc 1200
atgaacaata agcactggct ggtgcacaag gagtggtttc acgacatccc actgccatgg 1260
cacgcaggag cagatacagg aacaccacac tggaacaata aggaggccct ggtggagttc 1320
aaggatgccc acgccaagcg gcagacagtg gtggtgctgg gcagccagga gggagcagtg 1380
cacaccgccc tggcaggcgc cctggaggca gagatggacg gagctaaggg cagactgtct 1440
agcggccacc tgaagtgcag gctgaagatg gataagctgc gcctgaaggg cgtgtcctac 1500
tctctgtgca cagccgcctt caccttcacc aagatccctg ccgagacact gcacggcaca 1560
gtgaccgtgg aggtgcagta tgccggcaca gacggaccct gtaaggtgcc tgcccagatg 1620
gccgtggata tgcagacact gacacctgtg ggcaggctga tcaccgccaa tccagtgatc 1680
acagagtcta ccgagaacag caagatgatg ctggagctgg acccaccatt tggcgatagc 1740
tatatcgtga tcggcgtggg cgagaagaag atcacacacc actggcaccg cagcggctcc 1800
acaatcggca aggcctttga ggcaaccgtg cgcggagcaa agagaatggc cgtgctgggc 1860
gacaccgcat gggatttcgg atctgtggga ggcgccctga acagcctggg caagggcatc 1920
caccagatct tcggcgccgc ctttaagtcc ctgttcggcg gcatgagctg gttctcaatg 1980
aagggcctgt cctctacctc tatcgtgtac atcctgatcg ccgtgtgcct gggaggcctg 2040
atcggaatcc cagccctgat ctgctgttgc agaggccgct gcaacaagaa gggagagcaa 2100
gtgggaatgt ctcggccagg cctgaagcca gacctgacag gcacctccaa gtcttatgtg 2160
agaagcctgt ga 2172
<210> 161
<211> 723
<212> PRT
<213> Artificial sequence
<220>
<223> MVsp ZikaprME_MVTMintracyto (D9)
<400> 161
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln Ala Glu Val Thr Arg
20 25 30
Arg Gly Ser Ala Tyr Tyr Met Tyr Leu Asp Arg Asn Asp Ala Gly Glu
35 40 45
Ala Ile Ser Phe Pro Thr Thr Leu Gly Met Asn Lys Cys Tyr Ile Gln
50 55 60
Ile Met Asp Leu Gly His Met Cys Asp Ala Thr Met Ser Tyr Glu Cys
65 70 75 80
Pro Met Leu Asp Glu Gly Val Glu Pro Asp Asp Val Asp Cys Trp Cys
85 90 95
Asn Thr Thr Ser Thr Trp Val Val Tyr Gly Thr Cys His His Lys Lys
100 105 110
Gly Glu Ala Arg Arg Ser Arg Arg Ala Val Thr Leu Pro Ser His Ser
115 120 125
Thr Arg Lys Leu Gln Thr Arg Ser Gln Thr Trp Leu Glu Ser Arg Glu
130 135 140
Tyr Thr Lys His Leu Ile Arg Val Glu Asn Trp Ile Phe Arg Asn Pro
145 150 155 160
Gly Phe Ala Leu Ala Ala Ala Ala Ile Ala Trp Leu Leu Gly Ser Ser
165 170 175
Thr Ser Gln Lys Val Ile Tyr Leu Val Met Ile Leu Leu Ile Ala Pro
180 185 190
Ala Tyr Ser Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu
195 200 205
Gly Met Ser Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly
210 215 220
Cys Val Thr Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu
225 230 235 240
Val Thr Thr Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr
245 250 255
Glu Ala Ser Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln
260 265 270
Gly Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys
275 280 285
Arg Thr Leu Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly
290 295 300
Lys Gly Ser Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met
305 310 315 320
Thr Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu
325 330 335
Ser Val His Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly
340 345 350
His Glu Thr Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser
355 360 365
Pro Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp
370 375 380
Cys Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr
385 390 395 400
Met Asn Asn Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile
405 410 415
Pro Leu Pro Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn
420 425 430
Asn Lys Glu Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln
435 440 445
Thr Val Val Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu
450 455 460
Ala Gly Ala Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser
465 470 475 480
Ser Gly His Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys
485 490 495
Gly Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile
500 505 510
Pro Ala Glu Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala
515 520 525
Gly Thr Asp Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met
530 535 540
Gln Thr Leu Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile
545 550 555 560
Thr Glu Ser Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro
565 570 575
Phe Gly Asp Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr
580 585 590
His His Trp His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala
595 600 605
Thr Val Arg Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp
610 615 620
Asp Phe Gly Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile
625 630 635 640
His Gln Ile Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser
645 650 655
Trp Phe Ser Met Lys Gly Leu Ser Ser Thr Ser Ile Val Tyr Ile Leu
660 665 670
Ile Ala Val Cys Leu Gly Gly Leu Ile Gly Ile Pro Ala Leu Ile Cys
675 680 685
Cys Cys Arg Gly Arg Cys Asn Lys Lys Gly Glu Gln Val Gly Met Ser
690 695 700
Arg Pro Gly Leu Lys Pro Asp Leu Thr Gly Thr Ser Lys Ser Tyr Val
705 710 715 720
Arg Ser Leu
<210> 162
<211> 1668
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of MVsp Zika_MVTMintracytoE (D10)
<400> 162
atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt actgttaact 60
ctccaaacac ccaccggtca aatcaggtgc ataggagtca gcaataggga ctttgtggaa 120
ggtatgtcag gtgggacttg ggttgatgtt gtcttggaac atggaggttg tgtcaccgta 180
atggcacagg acaaaccgac tgtcgacata gagctggtta caacaacagt cagcaacatg 240
gcggaggtaa gatcctactg ctatgaggca tcaatatcag acatggcttc ggacagccgc 300
tgcccaacac aaggtgaagc ctaccttgac aagcaatcag acactcaata tgtctgcaaa 360
agaacgttag tggacagagg ctggggaaat ggatgtggac tttttggcaa agggagcctg 420
gtgacatgcg ctaagtttgc atgctccaag aaaatgaccg ggaagagcat ccagccagag 480
aatctggagt accggataat gctgtcagtt catggctccc agcacagtgg gatgatcgtt 540
aatgacacag gacatgaaac tgatgagaat agagcgaagg ttgagataac gcccaattca 600
ccaagagccg aagccaccct ggggggtttt ggaagcctag gacttgattg tgaaccgagg 660
acaggccttg acttttcaga tttgtattac ttgactatga ataacaagca ctggttggtt 720
cacaaggagt ggttccacga cattccatta ccttggcacg ctggggcaga caccggaact 780
ccacactgga acaacaaaga agcactggta gagttcaagg acgcacatgc caaaaggcaa 840
actgtcgtgg ttctagggag tcaagaagga gcagttcaca cggcccttgc tggagctctg 900
gaggctgaga tggatggtgc aaagggaagg ctgtcctctg gccacttgaa atgtcgcctg 960
aaaatggata aacttagatt gaagggcgtg tcatactcct tgtgtaccgc agcgttcaca 1020
ttcaccaaga tcccggctga aacactgcac gggacagtca cagtggaggt acagtacgca 1080
gggacagatg gaccttgcaa ggttccagct cagatggcgg tggacatgca aactctgacc 1140
ccagttggga ggttgataac cgctaacccc gtaatcactg aaagcactga gaactctaag 1200
atgatgctgg aacttgatcc accatttggg gactcttaca ttgtcatagg agtcggggag 1260
aagaagatca cccaccactg gcacaggagt ggcagcacca ttggaaaagc atttgaagcc 1320
actgtgagag gtgccaagag aatggcagtc ttgggagaca cagcctggga ctttggatca 1380
gttggaggcg ctctcaactc attgggcaag ggcatccatc aaatttttgg agcagctttc 1440
aaatcattgt ttggaggaat gtcctggttc tcaatgaaag gtttatcgag cactagcata 1500
gtctacatcc tgattgcagt gtgtcttgga gggttgatag ggatccccgc tttaatatgt 1560
tgctgcaggg ggcgttgtaa caaaaaggga gaacaagttg gtatgtcaag accaggccta 1620
aagcctgatc ttacgggaac atcaaaatcc tatgtaaggt cgctctga 1668
<210> 163
<211> 1668
<212> DNA
<213> Artificial sequence
<220>
<223> Codon-optimized nucleotide sequence of MVsp Zika_MVTMintracytoE (D10)
<400> 163
atgagcatca tgggcctgaa ggtgaacgtg tccgccatct tcatggccgt gctgctgacc 60
ctgcagacac caacaggcca gatccggtgc atcggcgtga gcaatagaga cttcgtggag 120
ggaatgtccg gaggaacctg ggtggatgtg gtgctggagc acggcggctg cgtgacagtg 180
atggcccagg acaagccaac cgtggatatc gagctggtga ccacaaccgt gtccaacatg 240
gccgaggtga ggtcttactg ctatgaggcc agcatctccg acatggcctc tgatagcagg 300
tgtccaaccc agggagaggc atacctggac aagcagtccg atacacagta cgtgtgcaag 360
cggaccctgg tggacagagg ctggggcaat ggctgtggcc tgtttggcaa gggctctctg 420
gtgacatgcg ccaagttcgc ctgtagcaag aagatgaccg gcaagtccat ccagccagag 480
aacctggagt accggatcat gctgtctgtg cacggctccc agcactctgg catgatcgtg 540
aacgacacag gccacgagac agatgagaat cgggccaagg tggagatcac acctaactct 600
ccaagagccg aggccaccct gggaggattt ggctctctgg gcctggactg cgagcctaga 660
acaggcctgg acttctccga tctgtactat ctgaccatga acaataagca ctggctggtg 720
cacaaggagt ggtttcacga catcccactg ccatggcacg caggagcaga tacaggaaca 780
ccacactgga acaataagga ggccctggtg gagttcaagg atgcccacgc caagcggcag 840
acagtggtgg tgctgggcag ccaggaggga gcagtgcaca ccgccctggc aggcgccctg 900
gaggcagaga tggacggagc taagggcaga ctgtctagcg gccacctgaa gtgcaggctg 960
aagatggata agctgcgcct gaagggcgtg tcctactctc tgtgcacagc cgccttcacc 1020
ttcaccaaga tccctgccga gacactgcac ggcacagtga ccgtggaggt gcagtatgcc 1080
ggcacagacg gaccctgtaa ggtgcctgcc cagatggccg tggatatgca gacactgaca 1140
cctgtgggca ggctgatcac cgccaatcca gtgatcacag agtctaccga gaacagcaag 1200
atgatgctgg agctggaccc accatttggc gatagctata tcgtgatcgg cgtgggcgag 1260
aagaagatca cacaccactg gcaccgcagc ggctccacaa tcggcaaggc ctttgaggca 1320
accgtgcgcg gagcaaagag aatggccgtg ctgggcgaca ccgcatggga tttcggatct 1380
gtgggaggcg ccctgaacag cctgggcaag ggcatccacc agatcttcgg cgccgccttt 1440
aagtccctgt tcggcggcat gagctggttc tcaatgaagg gcctgtcctc tacctctatc 1500
gtgtacatcc tgatcgccgt gtgcctggga ggcctgatcg gaatcccagc cctgatctgc 1560
tgttgcagag gccgctgcaa caagaaggga gagcaagtgg gaatgtctcg gccaggcctg 1620
aagccagacc tgacaggcac ctccaagtct tatgtgagaa gcctgtga 1668
<210> 164
<211> 555
<212> PRT
<213> Artificial sequence
<220>
<223> MVsp Zika_MVTMintracytoE (D10)
<400> 164
Met Ser Ile Met Gly Leu Lys Val Asn Val Ser Ala Ile Phe Met Ala
1 5 10 15
Val Leu Leu Thr Leu Gln Thr Pro Thr Gly Gln Ile Arg Cys Ile Gly
20 25 30
Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser Gly Gly Thr Trp Val
35 40 45
Asp Val Val Leu Glu His Gly Gly Cys Val Thr Val Met Ala Gln Asp
50 55 60
Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr Thr Val Ser Asn Met
65 70 75 80
Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser Ile Ser Asp Met Ala
85 90 95
Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala Tyr Leu Asp Lys Gln
100 105 110
Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu Val Asp Arg Gly Trp
115 120 125
Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser Leu Val Thr Cys Ala
130 135 140
Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys Ser Ile Gln Pro Glu
145 150 155 160
Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His Gly Ser Gln His Ser
165 170 175
Gly Met Ile Val Asn Asp Thr Gly His Glu Thr Asp Glu Asn Arg Ala
180 185 190
Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala Glu Ala Thr Leu Gly
195 200 205
Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro Arg Thr Gly Leu Asp
210 215 220
Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn Lys His Trp Leu Val
225 230 235 240
His Lys Glu Trp Phe His Asp Ile Pro Leu Pro Trp His Ala Gly Ala
245 250 255
Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu Ala Leu Val Glu Phe
260 265 270
Lys Asp Ala His Ala Lys Arg Gln Thr Val Val Val Leu Gly Ser Gln
275 280 285
Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala Leu Glu Ala Glu Met
290 295 300
Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His Leu Lys Cys Arg Leu
305 310 315 320
Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser Tyr Ser Leu Cys Thr
325 330 335
Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu Thr Leu His Gly Thr
340 345 350
Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys Lys Val
355 360 365
Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu Thr Pro Val Gly Arg
370 375 380
Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser Thr Glu Asn Ser Lys
385 390 395 400
Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile Val Ile
405 410 415
Gly Val Gly Glu Lys Lys Ile Thr His His Trp His Arg Ser Gly Ser
420 425 430
Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg Gly Ala Lys Arg Met
435 440 445
Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly Ser Val Gly Gly Ala
450 455 460
Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile Phe Gly Ala Ala Phe
465 470 475 480
Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser Met Lys Gly Leu Ser
485 490 495
Ser Thr Ser Ile Val Tyr Ile Leu Ile Ala Val Cys Leu Gly Gly Leu
500 505 510
Ile Gly Ile Pro Ala Leu Ile Cys Cys Cys Arg Gly Arg Cys Asn Lys
515 520 525
Lys Gly Glu Gln Val Gly Met Ser Arg Pro Gly Leu Lys Pro Asp Leu
530 535 540
Thr Gly Thr Ser Lys Ser Tyr Val Arg Ser Leu
545 550 555
<210> 165
<211> 21169
<212> DNA
<213> Artificial sequence
<220>
<223> pTM2-MVSchw_A1_Zikasp_ZikaprME
<220>
<221> misc_feature
<222> (9)..(28)
<223> T7 promoter
<220>
<221> misc_feature
<222> (83)..(189)
<223> MV Leader and N promoter
<220>
<221> misc_feature
<222> (190)..(1767)
<223> MV N ORF
<220>
<221> misc_feature
<222> (1889)..(3414)
<223> MV P ORF
<220>
<221> misc_feature
<222> (3532)..(5625)
<223> A1. Zikasp_ZikaprME
<220>
<221> misc_feature
<222> (5722)..(6729)
<223> MV M ORF
<220>
<221> misc_feature
<222> (7733)..(9394)
<223> MV F ORF
<220>
<221> misc_feature
<222> (9555)..(11408)
<223> MV H ORF
<220>
<221> misc_feature
<222> (11518)..(18069)
<223> MV L ORF
<220>
<221> misc_feature
<222> (18179)..(18262)
<223> HDV ribozyme
<220>
<221> misc_feature
<222> (18263)..(18404)
<223> T7 terminator
<220>
<221> misc_feature
<222> (18438)..(18457)
<223> T3
<220>
<221> misc_feature
<222> (18475)..(18495)
<223> M13-rev
<220>
<221> misc_feature
<222> (18863)..(19491)
<223> ColE1 origin
<220>
<221> misc_feature
<222> (20642)..(21082)
<223> F1 ori
<220>
<221> misc_feature
<222> (20632)..(21087)
<223> M13 origin
<220>
<221> misc_feature
<222> (21089)..(21157)
<223> LacZ alpha
<220>
<221> misc_feature
<222> (18501)..(18523)
<223> LacO
<220>
<221> misc_feature
<222> (19643)..(20302)
<223> AmpR
<220>
<221> misc_feature
<222> (20542)..(20570)
<223> Amp prom
<400> 165
gcggccgcta atacgactca ctatagggcc aactttgttt ggtctgatga gtccgtgagg 60
acgaaacccg gagtcccggg tcaccaaaca aagttgggta aggatagttc aatcaatgat 120
catcttctag tgcacttagg attcaagatc ctattatcag ggacaagagc aggattaggg 180
atatccgaga tggccacact tttaaggagc ttagcattgt tcaaaagaaa caaggacaaa 240
ccacccatta catcaggatc cggtggagcc atcagaggaa tcaaacacat tattatagta 300
ccaatccctg gagattcctc aattaccact cgatccagac ttctggaccg gttggtgagg 360
ttaattggaa acccggatgt gagcgggccc aaactaacag gggcactaat aggtatatta 420
tccttatttg tggagtctcc aggtcaattg attcagagga tcaccgatga ccctgacgtt 480
agcataaggc tgttagaggt tgtccagagt gaccagtcac aatctggcct taccttcgca 540
tcaagaggta ccaacatgga ggatgaggcg gaccaatact tttcacatga tgatccaatt 600
agtagtgatc aatccaggtt cggatggttc gggaacaagg aaatctcaga tattgaagtg 660
caagaccctg agggattcaa catgattctg ggtaccatcc tagcccaaat ttgggtcttg 720
ctcgcaaagg cggttacggc cccagacacg gcagctgatt cggagctaag aaggtggata 780
aagtacaccc aacaaagaag ggtagttggt gaatttagat tggagagaaa atggttggat 840
gtggtgagga acaggattgc cgaggacctc tccttacgcc gattcatggt cgctctaatc 900
ctggatatca agagaacacc cggaaacaaa cccaggattg ctgaaatgat atgtgacatt 960
gatacatata tcgtagaggc aggattagcc agttttatcc tgactattaa gtttgggata 1020
gaaactatgt atcctgctct tggactgcat gaatttgctg gtgagttatc cacacttgag 1080
tccttgatga acctttacca gcaaatgggg gaaactgcac cctacatggt aatcctggag 1140
aactcaattc agaacaagtt cagtgcagga tcataccctc tgctctggag ctatgccatg 1200
ggagtaggag tggaacttga aaactccatg ggaggtttga actttggccg atcttacttt 1260
gatccagcat attttagatt agggcaagag atggtaagga ggtcagctgg aaaggtcagt 1320
tccacattgg catctgaact cggtatcact gccgaggatg caaggcttgt ttcagagatt 1380
gcaatgcata ctactgagga caagatcagt agagcggttg gacccagaca agcccaagta 1440
tcatttctac acggtgatca aagtgagaat gagctaccga gattgggggg caaggaagat 1500
aggagggtca aacagagtcg aggagaagcc agggagagct acagagaaac cgggcccagc 1560
agagcaagtg atgcgagagc tgcccatctt ccaaccggca cacccctaga cattgacact 1620
gcaacggagt ccagccaaga tccgcaggac agtcgaaggt cagctgacgc cctgcttagg 1680
ctgcaagcca tggcaggaat ctcggaagaa caaggctcag acacggacac ccctatagtg 1740
tacaatgaca gaaatcttct agactaggtg cgagaggccg agggccagaa caacatccgc 1800
ctaccatcca tcattgttat aaaaaactta ggaaccaggt ccacacagcc gccagcccat 1860
caaccatcca ctcccacgat tggagccaat ggcagaagag caggcacgcc atgtcaaaaa 1920
cggactggaa tgcatccggg ctctcaaggc cgagcccatc ggctcactgg ccatcgagga 1980
agctatggca gcatggtcag aaatatcaga caacccagga caggagcgag ccacctgcag 2040
ggaagagaag gcaggcagtt cgggtctcag caaaccatgc ctctcagcaa ttggatcaac 2100
tgaaggcggt gcacctcgca tccgcggtca gggacctgga gagagcgatg acgacgctga 2160
aactttggga atccccccaa gaaatctcca ggcatcaagc actgggttac agtgttatta 2220
cgtttatgat cacagcggtg aagcggttaa gggaatccaa gatgctgact ctatcatggt 2280
tcaatcaggc cttgatggtg atagcaccct ctcaggagga gacaatgaat ctgaaaacag 2340
cgatgtggat attggcgaac ctgataccga gggatatgct atcactgacc ggggatctgc 2400
tcccatctct atggggttca gggcttctga tgttgaaact gcagaaggag gggagatcca 2460
cgagctcctg agactccaat ccagaggcaa caactttccg aagcttggga aaactctcaa 2520
tgttcctccg cccccggacc ccggtagggc cagcacttcc gggacaccca ttaaaaaggg 2580
cacagacgcg agattagcct catttggaac ggagatcgcg tctttattga caggtggtgc 2640
aacccaatgt gctcgaaagt caccctcgga accatcaggg ccaggtgcac ctgcggggaa 2700
tgtccccgag tgtgtgagca atgccgcact gatacaggag tggacacccg aatctggtac 2760
cacaatctcc ccgagatccc agaataatga agaaggggga gactattatg atgatgagct 2820
gttctctgat gtccaagata ttaaaacagc cttggccaaa atacacgagg ataatcagaa 2880
gataatctcc aagctagaat cactgctgtt attgaaggga gaagttgagt caattaagaa 2940
gcagatcaac aggcaaaata tcagcatatc caccctggaa ggacacctct caagcatcat 3000
gatcgccatt cctggacttg ggaaggatcc caacgacccc actgcagatg tcgaaatcaa 3060
tcccgacttg aaacccatca taggcagaga ttcaggccga gcactggccg aagttctcaa 3120
gaaacccgtt gccagccgac aactccaagg aatgacaaat ggacggacca gttccagagg 3180
acagctgctg aaggaatttc agctaaagcc gatcgggaaa aagatgagct cagccgtcgg 3240
gtttgttcct gacaccggcc ctgcatcacg cagtgtaatc cgctccatta taaaatccag 3300
ccggctagag gaggatcgga agcgttacct gatgactctc cttgatgata tcaaaggagc 3360
caatgatctt gccaagttcc accagatgct gatgaagata ataatgaagt agctacagct 3420
caacttacct gccaacccca tgccagtcga cccaactagc ctaccctcca tcattgttat 3480
aaaaaactta ggaaccaggt ccacacagcc gccagcccat caacgcgtac gatggagaag 3540
aagcggagag gagcagacac aagcgtggga atcgtgggcc tgctgctgac cacagcaatg 3600
gcagcagagg tgaccaggag aggaagcgcc tactatatgt acctggacag gaatgatgcc 3660
ggcgaggcca tctccttccc aaccacactg ggcatgaaca agtgctacat ccagatcatg 3720
gacctgggcc acatgtgcga tgccaccatg tcctatgagt gtccaatgct ggacgagggc 3780
gtggagcccg acgatgtgga ttgctggtgt aataccacat ctacatgggt ggtgtacggc 3840
acctgtcacc acaagaaggg agaggcccgg cggagccggc gggccgtgac actgccttcc 3900
cactctacca ggaagctgca gacacgcagc cagacctggc tggagtccag agagtatacc 3960
aagcacctga tcagggtgga gaactggatc tttcgcaatc caggattcgc actggcagca 4020
gcagcaatcg catggctgct gggaagctcc accagccaga aagtgatcta cctggtcatg 4080
atcctgctga tcgctcctgc ctattctatc cggtgcatcg gcgtgagcaa tagagacttc 4140
gtggagggaa tgtccggagg aacctgggtg gatgtggtgc tggagcacgg cggctgcgtg 4200
acagtgatgg cccaggacaa gccaaccgtg gatatcgagc tggtgaccac aaccgtgtcc 4260
aacatggccg aggtgaggtc ttactgctat gaggccagca tctccgacat ggcctctgat 4320
agcaggtgtc caacccaggg agaggcatac ctggacaagc agtccgatac acagtacgtg 4380
tgcaagcgga ccctggtgga cagaggctgg ggcaatggct gtggcctgtt tggcaagggc 4440
tctctggtga catgcgccaa gttcgcctgt agcaagaaga tgaccggcaa gtccatccag 4500
ccagagaacc tggagtaccg gatcatgctg tctgtgcacg gctcccagca ctctggcatg 4560
atcgtgaacg acacaggcca cgagacagat gagaatcggg ccaaggtgga gatcacacct 4620
aactctccaa gagccgaggc caccctggga ggatttggct ctctgggcct ggactgcgag 4680
cctagaacag gcctggactt ctccgatctg tactatctga ccatgaacaa taagcactgg 4740
ctggtgcaca aggagtggtt tcacgacatc ccactgccat ggcacgcagg agcagataca 4800
ggaacaccac actggaacaa taaggaggcc ctggtggagt tcaaggatgc ccacgccaag 4860
cggcagacag tggtggtgct gggcagccag gagggagcag tgcacaccgc cctggcaggc 4920
gccctggagg cagagatgga cggagctaag ggcagactgt ctagcggcca cctgaagtgc 4980
aggctgaaga tggataagct gcgcctgaag ggcgtgtcct actctctgtg cacagccgcc 5040
ttcaccttca ccaagatccc tgccgagaca ctgcacggca cagtgaccgt ggaggtgcag 5100
tatgccggca cagacggacc ctgtaaggtg cctgcccaga tggccgtgga tatgcagaca 5160
ctgacacctg tgggcaggct gatcaccgcc aatccagtga tcacagagtc taccgagaac 5220
agcaagatga tgctggagct ggacccacca tttggcgata gctatatcgt gatcggcgtg 5280
ggcgagaaga agatcacaca ccactggcac cgcagcggct ccacaatcgg caaggccttt 5340
gaggcaaccg tgcgcggagc aaagagaatg gccgtgctgg gcgacaccgc atgggatttc 5400
ggatctgtgg gaggcgccct gaacagcctg ggcaagggca tccaccagat cttcggcgcc 5460
gcctttaagt ccctgttcgg cggcatgagc tggttctcac agatcctgat cggcacactg 5520
ctgatgtggc tgggcctgaa caccaagaat ggctctatca gcctgatgtg cctggccctg 5580
ggaggcgtgc tgatcttcct gtccaccgcc gtgtctgcct gatgagcgcg cagcgcttag 5640
acgtctcgcg atcgatacta gtacaaccta aatccattat aaaaaactta ggagcaaagt 5700
gattgcctcc caaggtccac aatgacagag acctacgact tcgacaagtc ggcatgggac 5760
atcaaagggt cgatcgctcc gatacaaccc accacctaca gtgatggcag gctggtgccc 5820
caggtcagag tcatagatcc tggtctaggc gacaggaagg atgaatgctt tatgtacatg 5880
tttctgctgg gggttgttga ggacagcgat tccctagggc ctccaatcgg gcgagcattt 5940
gggttcctgc ccttaggtgt tggcagatcc acagcaaagc ccgaaaaact cctcaaagag 6000
gccactgagc ttgacatagt tgttagacgt acagcagggc tcaatgaaaa actggtgttc 6060
tacaacaaca ccccactaac tctcctcaca ccttggagaa aggtcctaac aacagggagt 6120
gtcttcaacg caaaccaagt gtgcaatgcg gttaatctga taccgctcga taccccgcag 6180
aggttccgtg ttgtttatat gagcatcacc cgtctttcgg ataacgggta ttacaccgtt 6240
cctagaagaa tgctggaatt cagatcggtc aatgcagtgg ccttcaacct gctggtgacc 6300
cttaggattg acaaggcgat aggccctggg aagatcatcg acaatacaga gcaacttcct 6360
gaggcaacat ttatggtcca catcgggaac ttcaggagaa agaagagtga agtctactct 6420
gccgattatt gcaaaatgaa aatcgaaaag atgggcctgg tttttgcact tggtgggata 6480
gggggcacca gtcttcacat tagaagcaca ggcaaaatga gcaagactct ccatgcacaa 6540
ctcgggttca agaagacctt atgttacccg ctgatggata tcaatgaaga ccttaatcga 6600
ttactctgga ggagcagatg caagatagta agaatccagg cagttttgca gccatcagtt 6660
cctcaagaat tccgcattta cgacgacgtg atcataaatg atgaccaagg actattcaaa 6720
gttctgtaga ccgtagtgcc cagcaatgcc cgaaaacgac ccccctcaca atgacagcca 6780
gaaggcccgg acaaaaaagc cccctccgaa agactccacg gaccaagcga gaggccagcc 6840
agcagccgac ggcaagcgcg aacaccaggc ggccccagca cagaacagcc ctgacacaag 6900
gccaccacca gccaccccaa tctgcatcct cctcgtggga cccccgagga ccaaccccca 6960
aggctgcccc cgatccaaac caccaaccgc atccccacca cccccgggaa agaaaccccc 7020
agcaattgga aggcccctcc ccctcttcct caacacaaga actccacaac cgaaccgcac 7080
aagcgaccga ggtgacccaa ccgcaggcat ccgactccct agacagatcc tctctccccg 7140
gcaaactaaa caaaacttag ggccaaggaa catacacacc caacagaacc cagaccccgg 7200
cccacggcgc cgcgccccca acccccgaca accagaggga gcccccaacc aatcccgccg 7260
gctcccccgg tgcccacagg cagggacacc aacccccgaa cagacccagc acccaaccat 7320
cgacaatcca agacgggggg gcccccccaa aaaaaggccc ccaggggccg acagccagca 7380
ccgcgaggaa gcccacccac cccacacacg accacggcaa ccaaaccaga acccagacca 7440
ccctgggcca ccagctccca gactcggcca tcaccccgca gaaaggaaag gccacaaccc 7500
gcgcacccca gccccgatcc ggcggggagc cacccaaccc gaaccagcac ccaagagcga 7560
tccccgaagg acccccgaac cgcaaaggac atcagtatcc cacagcctct ccaagtcccc 7620
cggtctcctc ctcttctcga agggaccaaa agatcaatcc accacacccg acgacactca 7680
actccccacc cctaaaggag acaccgggaa tcccagaatc aagactcatc caatgtccat 7740
catgggtctc aaggtgaacg tctctgccat attcatggca gtactgttaa ctctccaaac 7800
acccaccggt caaatccatt ggggcaatct ctctaagata ggggtggtag gaataggaag 7860
tgcaagctac aaagttatga ctcgttccag ccatcaatca ttagtcataa aattaatgcc 7920
caatataact ctcctcaata actgcacgag ggtagagatt gcagaataca ggagactact 7980
gagaacagtt ttggaaccaa ttagagatgc acttaatgca atgacccaga atataagacc 8040
ggttcagagt gtagcttcaa gtaggagaca caagagattt gcgggagtag tcctggcagg 8100
tgcggcccta ggcgttgcca cagctgctca gataacagcc ggcattgcac ttcaccagtc 8160
catgctgaac tctcaagcca tcgacaatct gagagcgagc ctggaaacta ctaatcaggc 8220
aattgagaca atcagacaag cagggcagga gatgatattg gctgttcagg gtgtccaaga 8280
ctacatcaat aatgagctga taccgtctat gaaccaacta tcttgtgatt taatcggcca 8340
gaagctcggg ctcaaattgc tcagatacta tacagaaatc ctgtcattat ttggccccag 8400
tttacgggac cccatatctg cggagatatc tatccaggct ttgagctatg cgcttggagg 8460
agacatcaat aaggtgttag aaaagctcgg atacagtgga ggtgatttac tgggcatctt 8520
agagagcgga ggaataaagg cccggataac tcacgtcgac acagagtcct acttcattgt 8580
cctcagtata gcctatccga cgctgtccga gattaagggg gtgattgtcc accggctaga 8640
gggggtctcg tacaacatag gctctcaaga gtggtatacc actgtgccca agtatgttgc 8700
aacccaaggg taccttatct cgaattttga tgagtcatcg tgtactttca tgccagaggg 8760
gactgtgtgc agccaaaatg ccttgtaccc gatgagtcct ctgctccaag aatgcctccg 8820
ggggtacacc aagtcctgtg ctcgtacact cgtatccggg tcttttggga accggttcat 8880
tttatcacaa gggaacctaa tagccaattg tgcatcaatc ctttgcaagt gttacacaac 8940
aggaacgatc attaatcaag accctgacaa gatcctaaca tacattgctg ccgatcactg 9000
cccggtagtc gaggtgaacg gcgtgaccat ccaagtcggg agcaggaggt atccagacgc 9060
tgtgtacttg cacagaattg acctcggtcc tcccatatca ttggagaggt tggacgtagg 9120
gacaaatctg gggaatgcaa ttgctaagtt ggaggatgcc aaggaattgt tggagtcatc 9180
ggaccagata ttgaggagta tgaaaggttt atcgagcact agcatagtct acatcctgat 9240
tgcagtgtgt cttggagggt tgatagggat ccccgcttta atatgttgct gcagggggcg 9300
ttgtaacaaa aagggagaac aagttggtat gtcaagacca ggcctaaagc ctgatcttac 9360
gggaacatca aaatcctatg taaggtcgct ctgatcctct acaactcttg aaacacaaat 9420
gtcccacaag tctcctcttc gtcatcaagc aaccaccgca cccagcatca agcccacctg 9480
aaattatctc cggcttccct ctggccgaac aatatcggta gttaatcaaa acttagggtg 9540
caagatcatc cacaatgtca ccacaacgag accggataaa tgccttctac aaagataacc 9600
cccatcccaa gggaagtagg atagtcatta acagagaaca tcttatgatt gatagacctt 9660
atgttttgct ggctgttctg tttgtcatgt ttctgagctt gatcgggttg ctagccattg 9720
caggcattag acttcatcgg gcagccatct acaccgcaga gatccataaa agcctcagca 9780
ccaatctaga tgtaactaac tcaatcgagc atcaggtcaa ggacgtgctg acaccactct 9840
tcaaaatcat cggtgatgaa gtgggcctga ggacacctca gagattcact gacctagtga 9900
aattaatctc tgacaagatt aaattcctta atccggatag ggagtacgac ttcagagatc 9960
tcacttggtg tatcaacccg ccagagagaa tcaaattgga ttatgatcaa tactgtgcag 10020
atgtggctgc tgaagagctc atgaatgcat tggtgaactc aactctactg gagaccagaa 10080
caaccaatca gttcctagct gtctcaaagg gaaactgctc agggcccact acaatcagag 10140
gtcaattctc aaacatgtcg ctgtccctgt tagacttgta tttaggtcga ggttacaatg 10200
tgtcatctat agtcactatg acatcccagg gaatgtatgg gggaacttac ctagtggaaa 10260
agcctaatct gagcagcaaa aggtcagagt tgtcacaact gagcatgtac cgagtgtttg 10320
aagtaggtgt tatcagaaat ccgggtttgg gggctccggt gttccatatg acaaactatc 10380
ttgagcaacc agtcagtaat gatctcagca actgtatggt ggctttgggg gagctcaaac 10440
tcgcagccct ttgtcacggg gaagattcta tcacaattcc ctatcaggga tcagggaaag 10500
gtgtcagctt ccagctcgtc aagctaggtg tctggaaatc cccaaccgac atgcaatcct 10560
gggtcccctt atcaacggat gatccagtga tagacaggct ttacctctca tctcacagag 10620
gtgttatcgc tgacaatcaa gcaaaatggg ctgtcccgac aacacgaaca gatgacaagt 10680
tgcgaatgga gacatgcttc caacaggcgt gtaagggtaa aatccaagca ctctgcgaga 10740
atcccgagtg ggcaccattg aaggataaca ggattccttc atacggggtc ttgtctgttg 10800
atctgagtct gacagttgag cttaaaatca aaattgcttc gggattcggg ccattgatca 10860
cacacggttc agggatggac ctatacaaat ccaaccacaa caatgtgtat tggctgacta 10920
tcccgccaat gaagaaccta gccttaggtg taatcaacac attggagtgg ataccgagat 10980
tcaaggttag tccctacctc ttcactgtcc caattaagga agcaggcgaa gactgccatg 11040
ccccaacata cctacctgcg gaggtggatg gtgatgtcaa actcagttcc aatctggtga 11100
ttctacctgg tcaagatctc caatatgttt tggcaaccta cgatacttcc agggttgaac 11160
atgctgtggt ttattacgtt tacagcccaa gccgctcatt ttcttacttt tatcctttta 11220
ggttgcctat aaagggggtc cccatcgaat tacaagtgga atgcttcaca tgggaccaaa 11280
aactctggtg ccgtcacttc tgtgtgcttg cggactcaga atctggtgga catatcactc 11340
actctgggat ggtgggcatg ggagtcagct gcacagtcac ccgggaagat ggaaccaatc 11400
gcagataggg ctgctagtga accaatcaca tgatgtcacc cagacatcag gcatacccac 11460
tagtgtgaaa tagacatcag aattaagaaa aacgtagggt ccaagtggtt ccccgttatg 11520
gactcgctat ctgtcaacca gatcttatac cctgaagttc acctagatag cccgatagtt 11580
accaataaga tagtagccat cctggagtat gctcgagtcc ctcacgctta cagcctggag 11640
gaccctacac tgtgtcagaa catcaagcac cgcctaaaaa acggattttc caaccaaatg 11700
attataaaca atgtggaagt tgggaatgtc atcaagtcca agcttaggag ttatccggcc 11760
cactctcata ttccatatcc aaattgtaat caggatttat ttaacataga agacaaagag 11820
tcaacgagga agatccgtga actcctcaaa aaggggaatt cgctgtactc caaagtcagt 11880
gataaggttt tccaatgctt aagggacact aactcacggc ttggcctagg ctccgaattg 11940
agggaggaca tcaaggagaa agttattaac ttgggagttt acatgcacag ctcccagtgg 12000
tttgagccct ttctgttttg gtttacagtc aagactgaga tgaggtcagt gattaaatca 12060
caaacccata cttgccatag gaggagacac acacctgtat tcttcactgg tagttcagtt 12120
gagttgctaa tctctcgtga ccttgttgct ataatcagta aagagtctca acatgtatat 12180
tacctgacat ttgaactggt tttgatgtat tgtgatgtca tagaggggag gttaatgaca 12240
gagaccgcta tgactattga tgctaggtat acagagcttc taggaagagt cagatacatg 12300
tggaaactga tagatggttt cttccctgca ctcgggaatc caacttatca aattgtagcc 12360
atgctggagc ctctttcact tgcttacctg cagctgaggg atataacagt agaactcaga 12420
ggtgctttcc ttaaccactg ctttactgaa atacatgatg ttcttgacca aaacgggttt 12480
tctgatgaag gtacttatca tgagttaact gaagctctag attacatttt cataactgat 12540
gacatacatc tgacagggga gattttctca tttttcagaa gtttcggcca ccccagactt 12600
gaagcagtaa cggctgctga aaatgttagg aaatacatga atcagcctaa agtcattgtg 12660
tatgagactc tgatgaaagg tcatgccata ttttgtggaa tcataatcaa cggctatcgt 12720
gacaggcacg gaggcagttg gccaccgctg accctccccc tgcatgctgc agacacaatc 12780
cggaatgctc aagcttcagg tgaagggtta acacatgagc agtgcgttga taactggaaa 12840
tcttttgctg gagtgaaatt tggctgcttt atgcctctta gcctggatag tgatctgaca 12900
atgtacctaa aggacaaggc acttgctgct ctccaaaggg aatgggattc agtttacccg 12960
aaagagttcc tgcgttacga ccctcccaag ggaaccgggt cacggaggct tgtagatgtt 13020
ttccttaatg attcgagctt tgacccatat gatgtgataa tgtatgttgt aagtggagct 13080
tacctccatg accctgagtt caacctgtct tacagcctga aagaaaagga gatcaaggaa 13140
acaggtagac tttttgctaa aatgacttac aaaatgaggg catgccaagt gattgctgaa 13200
aatctaatct caaacgggat tggcaaatat tttaaggaca atgggatggc caaggatgag 13260
cacgatttga ctaaggcact ccacactcta gctgtctcag gagtccccaa agatctcaaa 13320
gaaagtcaca ggggggggcc agtcttaaaa acctactccc gaagcccagt ccacacaagt 13380
accaggaacg tgagagcagc aaaagggttt atagggttcc ctcaagtaat tcggcaggac 13440
caagacactg atcatccgga gaatatggaa gcttacgaga cagtcagtgc atttatcacg 13500
actgatctca agaagtactg ccttaattgg agatatgaga ccatcagctt gtttgcacag 13560
aggctaaatg agatttacgg attgccctca tttttccagt ggctgcataa gaggcttgag 13620
acctctgtcc tgtatgtaag tgaccctcat tgcccccccg accttgacgc ccatatcccg 13680
ttatataaag tccccaatga tcaaatcttc attaagtacc ctatgggagg tatagaaggg 13740
tattgtcaga agctgtggac catcagcacc attccctatc tatacctggc tgcttatgag 13800
agcggagtaa ggattgcttc gttagtgcaa ggggacaatc agaccatagc cgtaacaaaa 13860
agggtaccca gcacatggcc ctacaacctt aagaaacggg aagctgctag agtaactaga 13920
gattactttg taattcttag gcaaaggcta catgatattg gccatcacct caaggcaaat 13980
gagacaattg tttcatcaca tttttttgtc tattcaaaag gaatatatta tgatgggcta 14040
cttgtgtccc aatcactcaa gagcatcgca agatgtgtat tctggtcaga gactatagtt 14100
gatgaaacaa gggcagcatg cagtaatatt gctacaacaa tggctaaaag catcgagaga 14160
ggttatgacc gttaccttgc atattccctg aacgtcctaa aagtgataca gcaaattctg 14220
atctctcttg gcttcacaat caattcaacc atgacccggg atgtagtcat acccctcctc 14280
acaaacaacg acctcttaat aaggatggca ctgttgcccg ctcctattgg ggggatgaat 14340
tatctgaata tgagcaggct gtttgtcaga aacatcggtg atccagtaac atcatcaatt 14400
gctgatctca agagaatgat tctcgcctca ctaatgcctg aagagaccct ccatcaagta 14460
atgacacaac aaccggggga ctcttcattc ctagactggg ctagcgaccc ttactcagca 14520
aatcttgtat gtgtccagag catcactaga ctcctcaaga acataactgc aaggtttgtc 14580
ctgatccata gtccaaaccc aatgttaaaa ggattattcc atgatgacag taaagaagag 14640
gacgagggac tggcggcatt cctcatggac aggcatatta tagtacctag ggcagctcat 14700
gaaatcctgg atcatagtgt cacaggggca agagagtcta ttgcaggcat gctggatacc 14760
acaaaaggct tgattcgagc cagcatgagg aagggggggt taacctctcg agtgataacc 14820
agattgtcca attatgacta tgaacaattc agagcaggga tggtgctatt gacaggaaga 14880
aagagaaatg tcctcattga caaagagtca tgttcagtgc agctggcgag agctctaaga 14940
agccatatgt gggcgaggct agctcgagga cggcctattt acggccttga ggtccctgat 15000
gtactagaat ctatgcgagg ccaccttatt cggcgtcatg agacatgtgt catctgcgag 15060
tgtggatcag tcaactacgg atggtttttt gtcccctcgg gttgccaact ggatgatatt 15120
gacaaggaaa catcatcctt gagagtccca tatattggtt ctaccactga tgagagaaca 15180
gacatgaagc ttgccttcgt aagagcccca agtcgatcct tgcgatctgc tgttagaata 15240
gcaacagtgt actcatgggc ttacggtgat gatgatagct cttggaacga agcctggttg 15300
ttggctaggc aaagggccaa tgtgagcctg gaggagctaa gggtgatcac tcccatctca 15360
acttcgacta atttagcgca taggttgagg gatcgtagca ctcaagtgaa atactcaggt 15420
acatcccttg tccgagtggc gaggtatacc acaatctcca acgacaatct ctcatttgtc 15480
atatcagata agaaggttga tactaacttt atataccaac aaggaatgct tctagggttg 15540
ggtgttttag aaacattgtt tcgactcgag aaagataccg gatcatctaa cacggtatta 15600
catcttcacg tcgaaacaga ttgttgcgtg atcccgatga tagatcatcc caggataccc 15660
agctcccgca agctagagct gagggcagag ctatgtacca acccattgat atatgataat 15720
gcacctttaa ttgacagaga tgcaacaagg ctatacaccc agagccatag gaggcacctt 15780
gtggaatttg ttacatggtc cacaccccaa ctatatcaca ttttagctaa gtccacagca 15840
ctatctatga ttgacctggt aacaaaattt gagaaggacc atatgaatga aatttcagct 15900
ctcatagggg atgacgatat caatagtttc ataactgagt ttctgctcat agagccaaga 15960
ttattcacta tctacttggg ccagtgtgcg gccatcaatt gggcatttga tgtacattat 16020
catagaccat cagggaaata tcagatgggt gagctgttgt catcgttcct ttctagaatg 16080
agcaaaggag tgtttaaggt gcttgtcaat gctctaagcc acccaaagat ctacaagaaa 16140
ttctggcatt gtggtattat agagcctatc catggtcctt cacttgatgc tcaaaacttg 16200
cacacaactg tgtgcaacat ggtttacaca tgctatatga cctacctcga cctgttgttg 16260
aatgaagagt tagaagagtt cacatttctc ttgtgtgaaa gcgacgagga tgtagtaccg 16320
gacagattcg acaacatcca ggcaaaacac ttatgtgttc tggcagattt gtactgtcaa 16380
ccagggacct gcccaccaat tcgaggtcta agaccggtag agaaatgtgc agttctaacc 16440
gaccatatca aggcagaggc tatgttatct ccagcaggat cttcgtggaa cataaatcca 16500
attattgtag accattactc atgctctctg acttatctcc ggcgaggatc gatcaaacag 16560
ataagattga gagttgatcc aggattcatt ttcgacgccc tcgctgaggt aaatgtcagt 16620
cagccaaaga tcggcagcaa caacatctca aatatgagca tcaaggcttt cagaccccca 16680
cacgatgatg ttgcaaaatt gctcaaagat atcaacacaa gcaagcacaa tcttcccatt 16740
tcagggggca atctcgccaa ttatgaaatc catgctttcc gcagaatcgg gttgaactca 16800
tctgcttgct acaaagctgt tgagatatca acattaatta ggagatgcct tgagccaggg 16860
gaggacggct tgttcttggg tgagggatcg ggttctatgt tgatcactta taaagagata 16920
cttaaactaa acaagtgctt ctataatagt ggggtttccg ccaattctag atctggtcaa 16980
agggaattag caccctatcc ctccgaagtt ggccttgtcg aacacagaat gggagtaggt 17040
aatattgtca aagtgctctt taacgggagg cccgaagtca cgtgggtagg cagtgtagat 17100
tgcttcaatt tcatagttag taatatccct acctctagtg tggggtttat ccattcagat 17160
atagagacct tgcctgacaa agatactata gagaagctag aggaattggc agccatctta 17220
tcgatggctc tgctcctggg caaaatagga tcaatactgg tgattaagct tatgcctttc 17280
agcggggatt ttgttcaggg atttataagt tatgtagggt ctcattatag agaagtgaac 17340
cttgtatacc ctagatacag caacttcatc tctactgaat cttatttggt tatgacagat 17400
ctcaaggcta accggctaat gaatcctgaa aagattaagc agcagataat tgaatcatct 17460
gtgaggactt cacctggact tataggtcac atcctatcca ttaagcaact aagctgcata 17520
caagcaattg tgggagacgc agttagtaga ggtgatatca atcctactct gaaaaaactt 17580
acacctatag agcaggtgct gatcaattgc gggttggcaa ttaacggacc taagctgtgc 17640
aaagaattga tccaccatga tgttgcctca gggcaagatg gattgcttaa ttctatactc 17700
atcctctaca gggagttggc aagattcaaa gacaaccaaa gaagtcaaca agggatgttc 17760
cacgcttacc ccgtattggt aagtagcagg caacgagaac ttatatctag gatcacccgc 17820
aaattctggg ggcacattct tctttactcc gggaacaaaa agttgataaa taagtttatc 17880
cagaatctca agtccggcta tctgatacta gacttacacc agaatatctt cgttaagaat 17940
ctatccaagt cagagaaaca gattattatg acggggggtt tgaaacgtga gtgggttttt 18000
aaggtaacag tcaaggagac caaagaatgg tataagttag tcggatacag tgccctgatt 18060
aaggactaat tggttgaact ccggaaccct aatcctgccc taggtggtta ggcattattt 18120
gcaatatatt aaagaaaact ttgaaaatac gaagtttcta ttcccagctt tgtctggtgg 18180
ccggcatggt cccagcctcc tcgctggcgc cggctgggca acattccgag gggaccgtcc 18240
cctcggtaat ggcgaatggg acgcggccga tccggctgct aacaaagccc gaaaggaagc 18300
tgagttggct gctgccaccg ctgagcaata actagcataa ccccttgggg cctctaaacg 18360
ggtcttgagg ggttttttgc tgaaaggagg aactatatcc ggatgcggcc gcgggcccta 18420
tggtacccag cttttgttcc ctttagtgag ggttaattcc gagcttggcg taatcatggt 18480
catagctgtt tcctgtgtga aattgttatc cgctcacaat tccacacaac ataggagccg 18540
gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag gtaactcaca ttaattgcgt 18600
tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat taatgaatcg 18660
gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg 18720
actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca aaggcggtaa 18780
tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca aaaggccagc 18840
aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg ctcggccccc 18900
ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg acaggactat 18960
aaagatacca ggcgttcccc cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc 19020
cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt tctcaatgct 19080
cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg 19140
aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc 19200
cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt agcagagcga 19260
ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc tacactagaa 19320
ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa agagttggta 19380
gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt tgcaagcagc 19440
agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct acggggtctg 19500
acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta tcaaaaagga 19560
tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa agtatatatg 19620
agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc tcagcgatct 19680
gtctatttcg ttcatccata gttgcctgac tgcccgtcgt gtagataact acgatacggg 19740
agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc tcaccggctc 19800
cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt ggtcctgcaa 19860
ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta agtagttcgc 19920
cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg tcacgctcgt 19980
cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc 20040
ccatgttgtg aaaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt 20100
tggccgcagt gttatcactc atgcttatgg cagcactgca taattctctt actgtcatgc 20160
catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt 20220
gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc gcgccacata 20280
gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga 20340
tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag 20400
catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa 20460
aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt 20520
attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga 20580
aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct gaaattgtaa 20640
acgttaatat tttgttaaaa ttcgcgttaa atttttgtta aatcagctca ttttttaacc 20700
aataggccga aatcggcaaa atcccttata aatcaaaaga atagaccgag atagggttga 20760
gtgttgttcc agtttggaac aagagtccac tattaaagaa cgtggactcc aacgtcaaag 20820
ggcgaaaaac cgtctatcag ggcgatggcc cactacgtga accatcaccc taatcaagtt 20880
ttttggggtc gaggtgccgt aaagcactaa atcggaaccc taaagggagc ccccgattta 20940
gagcttgacg gggaaagccg gcgaacgtgg cgagaaagga agggaagaaa gcgaaaggag 21000
cgggcgctag ggcgctggca agtgtagcgg tcacgctgcg cgtaaccacc acacccgccg 21060
cgcttaatgc gccgctacag ggcgcgtccc attcgccatt caggctgcgc aactgttggg 21120
aagggcgatc ggtgcgggcc tcttcgctat tacgccagcc accgcggtg 21169
<210> 166
<211> 20839
<212> DNA
<213> Artificial sequence
<220>
<223> pTM2-MVSchw_insert 4 (native sequence)
<400> 2
gcggccgcta atacgactca ctatagggcc aactttgttt ggtctgatga gtccgtgagg 60
acgaaacccg gagtcccggg tcaccaaaca aagttgggta aggatagttc aatcaatgat 120
catcttctag tgcacttagg attcaagatc ctattatcag ggacaagagc aggattaggg 180
atatccgaga tggccacact tttaaggagc ttagcattgt tcaaaagaaa caaggacaaa 240
ccacccatta catcaggatc cggtggagcc atcagaggaa tcaaacacat tattatagta 300
ccaatccctg gagattcctc aattaccact cgatccagac ttctggaccg gttggtgagg 360
ttaattggaa acccggatgt gagcgggccc aaactaacag gggcactaat aggtatatta 420
tccttatttg tggagtctcc aggtcaattg attcagagga tcaccgatga ccctgacgtt 480
agcataaggc tgttagaggt tgtccagagt gaccagtcac aatctggcct taccttcgca 540
tcaagaggta ccaacatgga ggatgaggcg gaccaatact tttcacatga tgatccaatt 600
agtagtgatc aatccaggtt cggatggttc gggaacaagg aaatctcaga tattgaagtg 660
caagaccctg agggattcaa catgattctg ggtaccatcc tagcccaaat ttgggtcttg 720
ctcgcaaagg cggttacggc cccagacacg gcagctgatt cggagctaag aaggtggata 780
aagtacaccc aacaaagaag ggtagttggt gaatttagat tggagagaaa atggttggat 840
gtggtgagga acaggattgc cgaggacctc tccttacgcc gattcatggt cgctctaatc 900
ctggatatca agagaacacc cggaaacaaa cccaggattg ctgaaatgat atgtgacatt 960
gatacatata tcgtagaggc aggattagcc agttttatcc tgactattaa gtttgggata 1020
gaaactatgt atcctgctct tggactgcat gaatttgctg gtgagttatc cacacttgag 1080
tccttgatga acctttacca gcaaatgggg gaaactgcac cctacatggt aatcctggag 1140
aactcaattc agaacaagtt cagtgcagga tcataccctc tgctctggag ctatgccatg 1200
ggagtaggag tggaacttga aaactccatg ggaggtttga actttggccg atcttacttt 1260
gatccagcat attttagatt agggcaagag atggtaagga ggtcagctgg aaaggtcagt 1320
tccacattgg catctgaact cggtatcact gccgaggatg caaggcttgt ttcagagatt 1380
gcaatgcata ctactgagga caagatcagt agagcggttg gacccagaca agcccaagta 1440
tcatttctac acggtgatca aagtgagaat gagctaccga gattgggggg caaggaagat 1500
aggagggtca aacagagtcg aggagaagcc agggagagct acagagaaac cgggcccagc 1560
agagcaagtg atgcgagagc tgcccatctt ccaaccggca cacccctaga cattgacact 1620
gcaacggagt ccagccaaga tccgcaggac agtcgaaggt cagctgacgc cctgcttagg 1680
ctgcaagcca tggcaggaat ctcggaagaa caaggctcag acacggacac ccctatagtg 1740
tacaatgaca gaaatcttct agactaggtg cgagaggccg agggccagaa caacatccgc 1800
ctaccatcca tcattgttat aaaaaactta ggaaccaggt ccacacagcc gccagcccat 1860
caaccatcca ctcccacgat tggagccaat ggcagaagag caggcacgcc atgtcaaaaa 1920
cggactggaa tgcatccggg ctctcaaggc cgagcccatc ggctcactgg ccatcgagga 1980
agctatggca gcatggtcag aaatatcaga caacccagga caggagcgag ccacctgcag 2040
ggaagagaag gcaggcagtt cgggtctcag caaaccatgc ctctcagcaa ttggatcaac 2100
tgaaggcggt gcacctcgca tccgcggtca gggacctgga gagagcgatg acgacgctga 2160
aactttggga atccccccaa gaaatctcca ggcatcaagc actgggttac agtgttatta 2220
cgtttatgat cacagcggtg aagcggttaa gggaatccaa gatgctgact ctatcatggt 2280
tcaatcaggc cttgatggtg atagcaccct ctcaggagga gacaatgaat ctgaaaacag 2340
cgatgtggat attggcgaac ctgataccga gggatatgct atcactgacc ggggatctgc 2400
tcccatctct atggggttca gggcttctga tgttgaaact gcagaaggag gggagatcca 2460
cgagctcctg agactccaat ccagaggcaa caactttccg aagcttggga aaactctcaa 2520
tgttcctccg cccccggacc ccggtagggc cagcacttcc gggacaccca ttaaaaaggg 2580
cacagacgcg agattagcct catttggaac ggagatcgcg tctttattga caggtggtgc 2640
aacccaatgt gctcgaaagt caccctcgga accatcaggg ccaggtgcac ctgcggggaa 2700
tgtccccgag tgtgtgagca atgccgcact gatacaggag tggacacccg aatctggtac 2760
cacaatctcc ccgagatccc agaataatga agaaggggga gactattatg atgatgagct 2820
gttctctgat gtccaagata ttaaaacagc cttggccaaa atacacgagg ataatcagaa 2880
gataatctcc aagctagaat cactgctgtt attgaaggga gaagttgagt caattaagaa 2940
gcagatcaac aggcaaaata tcagcatatc caccctggaa ggacacctct caagcatcat 3000
gatcgccatt cctggacttg ggaaggatcc caacgacccc actgcagatg tcgaaatcaa 3060
tcccgacttg aaacccatca taggcagaga ttcaggccga gcactggccg aagttctcaa 3120
gaaacccgtt gccagccgac aactccaagg aatgacaaat ggacggacca gttccagagg 3180
acagctgctg aaggaatttc agctaaagcc gatcgggaaa aagatgagct cagccgtcgg 3240
gtttgttcct gacaccggcc ctgcatcacg cagtgtaatc cgctccatta taaaatccag 3300
ccggctagag gaggatcgga agcgttacct gatgactctc cttgatgata tcaaaggagc 3360
caatgatctt gccaagttcc accagatgct gatgaagata ataatgaagt agctacagct 3420
caacttacct gccaacccca tgccagtcga cccaactagc ctaccctcca tcattgttat 3480
aaaaaactta ggaaccaggt ccacacagcc gccagcccat caacgcgtac gatgggtgtc 3540
ggaattgttg gcctcctgct gaccacagct atggcagcgg aggtcactag acgtgggagt 3600
gcatactata tgtacttgga cagaaacgac gctggggagg ccatatcttt tccaaccaca 3660
ttggggatga ataagtgtta tatacagatc atggatcttg gacacatgtg tgatgccacc 3720
atgagctatg aatgccctat gctggatgag ggggtggaac cagatgacgt cgattgttgg 3780
tgcaacacga cgtcaacttg ggttgtgtac ggaacctgcc atcacaaaaa aggtgaagca 3840
cggagatcta gaagagctgt gacgctcccc tcccattcca ctaggaagct gcaaacgcgg 3900
tcgcaaacct ggttggaatc aagagaatac acaaagcact tgattagagt cgaaaattgg 3960
atattcagga accctggctt cgcgttagca gcagctgcca tcgcttggct tttgggaagc 4020
tcaacgagcc aaaaagtcat atacttggtc atgatactgc tgattgcccc ggcatacagc 4080
atcaggtgca taggagtcag caatagggac tttgtggaag gtatgtcagg tgggacttgg 4140
gttgatgttg tcttggaaca tggaggttgt gtcaccgtaa tggcacagga caaaccgact 4200
gtcgacatag agctggttac aacaacagtc agcaacatgg cggaggtaag atcctactgc 4260
tatgaggcat caatatcgga catggcttcg gacagccgct gcccaacaca aggtgaagcc 4320
taccttgaca agcaatcaga cactcaatat gtctgcaaaa gaacgttagt ggacagaggc 4380
tggggaaatg gatgtggact ttttggcaaa gggagcctgg tgacatgcgc taagtttgca 4440
tgctccaaga aaatgaccgg gaagagcatc cagccagaga atctggagta ccggataatg 4500
ctgtcagttc atggctccca gcacagtggg atgatcgtta atgacacagg acatgaaact 4560
gatgagaata gagcgaaggt tgagataacg cccaattcac caagagccga agccaccctg 4620
gggggttttg gaagcctagg acttgattgt gaaccgagga caggccttga cttttcagat 4680
ttgtattact tgactatgaa taacaagcac tggttggttc acaaggagtg gttccacgac 4740
attccattac cttggcacgc tggggcagac accggaactc cacactggaa caacaaagaa 4800
gcactggtag agttcaagga cgcacatgcc aaaaggcaaa ctgtcgtggt tctagggagt 4860
caagaaggag cagttcacac ggcccttgct ggagctctgg aggctgagat ggatggtgca 4920
aagggaaggc tgtcctctgg ccacttgaaa tgtcgcctga aaatggataa acttagattg 4980
aagggcgtgt catactcctt gtgtaccgca gcgttcacat tcaccaagat cccggctgaa 5040
acactgcacg ggacagtcac agtggaggta cagtacgcag ggacagatgg accttgcaag 5100
gttccagctc agatggcggt ggacatgcaa actctgaccc cagttgggag gttgataacc 5160
gctaaccccg taatcactga aagcactgag aactctaaga tgatgctgga acttgatcca 5220
ccatttgggg actcttacat tgtcatagga gtcggggaga agaagatcac ccaccactgg 5280
cacaggagtg gctaagcgcg cagcgcttag acgtctcgcg atcgatacta gtacaaccta 5340
aatccattat aaaaaactta ggagcaaagt gattgcctcc caaggtccac aatgacagag 5400
acctacgact tcgacaagtc ggcatgggac atcaaagggt cgatcgctcc gatacaaccc 5460
accacctaca gtgatggcag gctggtgccc caggtcagag tcatagatcc tggtctaggc 5520
gacaggaagg atgaatgctt tatgtacatg tttctgctgg gggttgttga ggacagcgat 5580
tccctagggc ctccaatcgg gcgagcattt gggttcctgc ccttaggtgt tggcagatcc 5640
acagcaaagc ccgaaaaact cctcaaagag gccactgagc ttgacatagt tgttagacgt 5700
acagcagggc tcaatgaaaa actggtgttc tacaacaaca ccccactaac tctcctcaca 5760
ccttggagaa aggtcctaac aacagggagt gtcttcaacg caaaccaagt gtgcaatgcg 5820
gttaatctga taccgctcga taccccgcag aggttccgtg ttgtttatat gagcatcacc 5880
cgtctttcgg ataacgggta ttacaccgtt cctagaagaa tgctggaatt cagatcggtc 5940
aatgcagtgg ccttcaacct gctggtgacc cttaggattg acaaggcgat aggccctggg 6000
aagatcatcg acaatacaga gcaacttcct gaggcaacat ttatggtcca catcgggaac 6060
ttcaggagaa agaagagtga agtctactct gccgattatt gcaaaatgaa aatcgaaaag 6120
atgggcctgg tttttgcact tggtgggata gggggcacca gtcttcacat tagaagcaca 6180
ggcaaaatga gcaagactct ccatgcacaa ctcgggttca agaagacctt atgttacccg 6240
ctgatggata tcaatgaaga ccttaatcga ttactctgga ggagcagatg caagatagta 6300
agaatccagg cagttttgca gccatcagtt cctcaagaat tccgcattta cgacgacgtg 6360
atcataaatg atgaccaagg actattcaaa gttctgtaga ccgtagtgcc cagcaatgcc 6420
cgaaaacgac ccccctcaca atgacagcca gaaggcccgg acaaaaaagc cccctccgaa 6480
agactccacg gaccaagcga gaggccagcc agcagccgac ggcaagcgcg aacaccaggc 6540
ggccccagca cagaacagcc ctgacacaag gccaccacca gccaccccaa tctgcatcct 6600
cctcgtggga cccccgagga ccaaccccca aggctgcccc cgatccaaac caccaaccgc 6660
atccccacca cccccgggaa agaaaccccc agcaattgga aggcccctcc ccctcttcct 6720
caacacaaga actccacaac cgaaccgcac aagcgaccga ggtgacccaa ccgcaggcat 6780
ccgactccct agacagatcc tctctccccg gcaaactaaa caaaacttag ggccaaggaa 6840
catacacacc caacagaacc cagaccccgg cccacggcgc cgcgccccca acccccgaca 6900
accagaggga gcccccaacc aatcccgccg gctcccccgg tgcccacagg cagggacacc 6960
aacccccgaa cagacccagc acccaaccat cgacaatcca agacgggggg gcccccccaa 7020
aaaaaggccc ccaggggccg acagccagca ccgcgaggaa gcccacccac cccacacacg 7080
accacggcaa ccaaaccaga acccagacca ccctgggcca ccagctccca gactcggcca 7140
tcaccccgca gaaaggaaag gccacaaccc gcgcacccca gccccgatcc ggcggggagc 7200
cacccaaccc gaaccagcac ccaagagcga tccccgaagg acccccgaac cgcaaaggac 7260
atcagtatcc cacagcctct ccaagtcccc cggtctcctc ctcttctcga agggaccaaa 7320
agatcaatcc accacacccg acgacactca actccccacc cctaaaggag acaccgggaa 7380
tcccagaatc aagactcatc caatgtccat catgggtctc aaggtgaacg tctctgccat 7440
attcatggca gtactgttaa ctctccaaac acccaccggt caaatccatt ggggcaatct 7500
ctctaagata ggggtggtag gaataggaag tgcaagctac aaagttatga ctcgttccag 7560
ccatcaatca ttagtcataa aattaatgcc caatataact ctcctcaata actgcacgag 7620
ggtagagatt gcagaataca ggagactact gagaacagtt ttggaaccaa ttagagatgc 7680
acttaatgca atgacccaga atataagacc ggttcagagt gtagcttcaa gtaggagaca 7740
caagagattt gcgggagtag tcctggcagg tgcggcccta ggcgttgcca cagctgctca 7800
gataacagcc ggcattgcac ttcaccagtc catgctgaac tctcaagcca tcgacaatct 7860
gagagcgagc ctggaaacta ctaatcaggc aattgagaca atcagacaag cagggcagga 7920
gatgatattg gctgttcagg gtgtccaaga ctacatcaat aatgagctga taccgtctat 7980
gaaccaacta tcttgtgatt taatcggcca gaagctcggg ctcaaattgc tcagatacta 8040
tacagaaatc ctgtcattat ttggccccag tttacgggac cccatatctg cggagatatc 8100
tatccaggct ttgagctatg cgcttggagg agacatcaat aaggtgttag aaaagctcgg 8160
atacagtgga ggtgatttac tgggcatctt agagagcgga ggaataaagg cccggataac 8220
tcacgtcgac acagagtcct acttcattgt cctcagtata gcctatccga cgctgtccga 8280
gattaagggg gtgattgtcc accggctaga gggggtctcg tacaacatag gctctcaaga 8340
gtggtatacc actgtgccca agtatgttgc aacccaaggg taccttatct cgaattttga 8400
tgagtcatcg tgtactttca tgccagaggg gactgtgtgc agccaaaatg ccttgtaccc 8460
gatgagtcct ctgctccaag aatgcctccg ggggtacacc aagtcctgtg ctcgtacact 8520
cgtatccggg tcttttggga accggttcat tttatcacaa gggaacctaa tagccaattg 8580
tgcatcaatc ctttgcaagt gttacacaac aggaacgatc attaatcaag accctgacaa 8640
gatcctaaca tacattgctg ccgatcactg cccggtagtc gaggtgaacg gcgtgaccat 8700
ccaagtcggg agcaggaggt atccagacgc tgtgtacttg cacagaattg acctcggtcc 8760
tcccatatca ttggagaggt tggacgtagg gacaaatctg gggaatgcaa ttgctaagtt 8820
ggaggatgcc aaggaattgt tggagtcatc ggaccagata ttgaggagta tgaaaggttt 8880
atcgagcact agcatagtct acatcctgat tgcagtgtgt cttggagggt tgatagggat 8940
ccccgcttta atatgttgct gcagggggcg ttgtaacaaa aagggagaac aagttggtat 9000
gtcaagacca ggcctaaagc ctgatcttac gggaacatca aaatcctatg taaggtcgct 9060
ctgatcctct acaactcttg aaacacaaat gtcccacaag tctcctcttc gtcatcaagc 9120
aaccaccgca cccagcatca agcccacctg aaattatctc cggcttccct ctggccgaac 9180
aatatcggta gttaatcaaa acttagggtg caagatcatc cacaatgtca ccacaacgag 9240
accggataaa tgccttctac aaagataacc cccatcccaa gggaagtagg atagtcatta 9300
acagagaaca tcttatgatt gatagacctt atgttttgct ggctgttctg tttgtcatgt 9360
ttctgagctt gatcgggttg ctagccattg caggcattag acttcatcgg gcagccatct 9420
acaccgcaga gatccataaa agcctcagca ccaatctaga tgtaactaac tcaatcgagc 9480
atcaggtcaa ggacgtgctg acaccactct tcaaaatcat cggtgatgaa gtgggcctga 9540
ggacacctca gagattcact gacctagtga aattaatctc tgacaagatt aaattcctta 9600
atccggatag ggagtacgac ttcagagatc tcacttggtg tatcaacccg ccagagagaa 9660
tcaaattgga ttatgatcaa tactgtgcag atgtggctgc tgaagagctc atgaatgcat 9720
tggtgaactc aactctactg gagaccagaa caaccaatca gttcctagct gtctcaaagg 9780
gaaactgctc agggcccact acaatcagag gtcaattctc aaacatgtcg ctgtccctgt 9840
tagacttgta tttaggtcga ggttacaatg tgtcatctat agtcactatg acatcccagg 9900
gaatgtatgg gggaacttac ctagtggaaa agcctaatct gagcagcaaa aggtcagagt 9960
tgtcacaact gagcatgtac cgagtgtttg aagtaggtgt tatcagaaat ccgggtttgg 10020
gggctccggt gttccatatg acaaactatc ttgagcaacc agtcagtaat gatctcagca 10080
actgtatggt ggctttgggg gagctcaaac tcgcagccct ttgtcacggg gaagattcta 10140
tcacaattcc ctatcaggga tcagggaaag gtgtcagctt ccagctcgtc aagctaggtg 10200
tctggaaatc cccaaccgac atgcaatcct gggtcccctt atcaacggat gatccagtga 10260
tagacaggct ttacctctca tctcacagag gtgttatcgc tgacaatcaa gcaaaatggg 10320
ctgtcccgac aacacgaaca gatgacaagt tgcgaatgga gacatgcttc caacaggcgt 10380
gtaagggtaa aatccaagca ctctgcgaga atcccgagtg ggcaccattg aaggataaca 10440
ggattccttc atacggggtc ttgtctgttg atctgagtct gacagttgag cttaaaatca 10500
aaattgcttc gggattcggg ccattgatca cacacggttc agggatggac ctatacaaat 10560
ccaaccacaa caatgtgtat tggctgacta tcccgccaat gaagaaccta gccttaggtg 10620
taatcaacac attggagtgg ataccgagat tcaaggttag tccctacctc ttcactgtcc 10680
caattaagga agcaggcgaa gactgccatg ccccaacata cctacctgcg gaggtggatg 10740
gtgatgtcaa actcagttcc aatctggtga ttctacctgg tcaagatctc caatatgttt 10800
tggcaaccta cgatacttcc agggttgaac atgctgtggt ttattacgtt tacagcccaa 10860
gccgctcatt ttcttacttt tatcctttta ggttgcctat aaagggggtc cccatcgaat 10920
tacaagtgga atgcttcaca tgggaccaaa aactctggtg ccgtcacttc tgtgtgcttg 10980
cggactcaga atctggtgga catatcactc actctgggat ggtgggcatg ggagtcagct 11040
gcacagtcac ccgggaagat ggaaccaatc gcagataggg ctgctagtga accaatcaca 11100
tgatgtcacc cagacatcag gcatacccac tagtgtgaaa tagacatcag aattaagaaa 11160
aacgtagggt ccaagtggtt ccccgttatg gactcgctat ctgtcaacca gatcttatac 11220
cctgaagttc acctagatag cccgatagtt accaataaga tagtagccat cctggagtat 11280
gctcgagtcc ctcacgctta cagcctggag gaccctacac tgtgtcagaa catcaagcac 11340
cgcctaaaaa acggattttc caaccaaatg attataaaca atgtggaagt tgggaatgtc 11400
atcaagtcca agcttaggag ttatccggcc cactctcata ttccatatcc aaattgtaat 11460
caggatttat ttaacataga agacaaagag tcaacgagga agatccgtga actcctcaaa 11520
aaggggaatt cgctgtactc caaagtcagt gataaggttt tccaatgctt aagggacact 11580
aactcacggc ttggcctagg ctccgaattg agggaggaca tcaaggagaa agttattaac 11640
ttgggagttt acatgcacag ctcccagtgg tttgagccct ttctgttttg gtttacagtc 11700
aagactgaga tgaggtcagt gattaaatca caaacccata cttgccatag gaggagacac 11760
acacctgtat tcttcactgg tagttcagtt gagttgctaa tctctcgtga ccttgttgct 11820
ataatcagta aagagtctca acatgtatat tacctgacat ttgaactggt tttgatgtat 11880
tgtgatgtca tagaggggag gttaatgaca gagaccgcta tgactattga tgctaggtat 11940
acagagcttc taggaagagt cagatacatg tggaaactga tagatggttt cttccctgca 12000
ctcgggaatc caacttatca aattgtagcc atgctggagc ctctttcact tgcttacctg 12060
cagctgaggg atataacagt agaactcaga ggtgctttcc ttaaccactg ctttactgaa 12120
atacatgatg ttcttgacca aaacgggttt tctgatgaag gtacttatca tgagttaact 12180
gaagctctag attacatttt cataactgat gacatacatc tgacagggga gattttctca 12240
tttttcagaa gtttcggcca ccccagactt gaagcagtaa cggctgctga aaatgttagg 12300
aaatacatga atcagcctaa agtcattgtg tatgagactc tgatgaaagg tcatgccata 12360
ttttgtggaa tcataatcaa cggctatcgt gacaggcacg gaggcagttg gccaccgctg 12420
accctccccc tgcatgctgc agacacaatc cggaatgctc aagcttcagg tgaagggtta 12480
acacatgagc agtgcgttga taactggaaa tcttttgctg gagtgaaatt tggctgcttt 12540
atgcctctta gcctggatag tgatctgaca atgtacctaa aggacaaggc acttgctgct 12600
ctccaaaggg aatgggattc agtttacccg aaagagttcc tgcgttacga ccctcccaag 12660
ggaaccgggt cacggaggct tgtagatgtt ttccttaatg attcgagctt tgacccatat 12720
gatgtgataa tgtatgttgt aagtggagct tacctccatg accctgagtt caacctgtct 12780
tacagcctga aagaaaagga gatcaaggaa acaggtagac tttttgctaa aatgacttac 12840
aaaatgaggg catgccaagt gattgctgaa aatctaatct caaacgggat tggcaaatat 12900
tttaaggaca atgggatggc caaggatgag cacgatttga ctaaggcact ccacactcta 12960
gctgtctcag gagtccccaa agatctcaaa gaaagtcaca ggggggggcc agtcttaaaa 13020
acctactccc gaagcccagt ccacacaagt accaggaacg tgagagcagc aaaagggttt 13080
atagggttcc ctcaagtaat tcggcaggac caagacactg atcatccgga gaatatggaa 13140
gcttacgaga cagtcagtgc atttatcacg actgatctca agaagtactg ccttaattgg 13200
agatatgaga ccatcagctt gtttgcacag aggctaaatg agatttacgg attgccctca 13260
tttttccagt ggctgcataa gaggcttgag acctctgtcc tgtatgtaag tgaccctcat 13320
tgcccccccg accttgacgc ccatatcccg ttatataaag tccccaatga tcaaatcttc 13380
attaagtacc ctatgggagg tatagaaggg tattgtcaga agctgtggac catcagcacc 13440
attccctatc tatacctggc tgcttatgag agcggagtaa ggattgcttc gttagtgcaa 13500
ggggacaatc agaccatagc cgtaacaaaa agggtaccca gcacatggcc ctacaacctt 13560
aagaaacggg aagctgctag agtaactaga gattactttg taattcttag gcaaaggcta 13620
catgatattg gccatcacct caaggcaaat gagacaattg tttcatcaca tttttttgtc 13680
tattcaaaag gaatatatta tgatgggcta cttgtgtccc aatcactcaa gagcatcgca 13740
agatgtgtat tctggtcaga gactatagtt gatgaaacaa gggcagcatg cagtaatatt 13800
gctacaacaa tggctaaaag catcgagaga ggttatgacc gttaccttgc atattccctg 13860
aacgtcctaa aagtgataca gcaaattctg atctctcttg gcttcacaat caattcaacc 13920
atgacccggg atgtagtcat acccctcctc acaaacaacg acctcttaat aaggatggca 13980
ctgttgcccg ctcctattgg ggggatgaat tatctgaata tgagcaggct gtttgtcaga 14040
aacatcggtg atccagtaac atcatcaatt gctgatctca agagaatgat tctcgcctca 14100
ctaatgcctg aagagaccct ccatcaagta atgacacaac aaccggggga ctcttcattc 14160
ctagactggg ctagcgaccc ttactcagca aatcttgtat gtgtccagag catcactaga 14220
ctcctcaaga acataactgc aaggtttgtc ctgatccata gtccaaaccc aatgttaaaa 14280
ggattattcc atgatgacag taaagaagag gacgagggac tggcggcatt cctcatggac 14340
aggcatatta tagtacctag ggcagctcat gaaatcctgg atcatagtgt cacaggggca 14400
agagagtcta ttgcaggcat gctggatacc acaaaaggct tgattcgagc cagcatgagg 14460
aagggggggt taacctctcg agtgataacc agattgtcca attatgacta tgaacaattc 14520
agagcaggga tggtgctatt gacaggaaga aagagaaatg tcctcattga caaagagtca 14580
tgttcagtgc agctggcgag agctctaaga agccatatgt gggcgaggct agctcgagga 14640
cggcctattt acggccttga ggtccctgat gtactagaat ctatgcgagg ccaccttatt 14700
cggcgtcatg agacatgtgt catctgcgag tgtggatcag tcaactacgg atggtttttt 14760
gtcccctcgg gttgccaact ggatgatatt gacaaggaaa catcatcctt gagagtccca 14820
tatattggtt ctaccactga tgagagaaca gacatgaagc ttgccttcgt aagagcccca 14880
agtcgatcct tgcgatctgc tgttagaata gcaacagtgt actcatgggc ttacggtgat 14940
gatgatagct cttggaacga agcctggttg ttggctaggc aaagggccaa tgtgagcctg 15000
gaggagctaa gggtgatcac tcccatctca acttcgacta atttagcgca taggttgagg 15060
gatcgtagca ctcaagtgaa atactcaggt acatcccttg tccgagtggc gaggtatacc 15120
acaatctcca acgacaatct ctcatttgtc atatcagata agaaggttga tactaacttt 15180
atataccaac aaggaatgct tctagggttg ggtgttttag aaacattgtt tcgactcgag 15240
aaagataccg gatcatctaa cacggtatta catcttcacg tcgaaacaga ttgttgcgtg 15300
atcccgatga tagatcatcc caggataccc agctcccgca agctagagct gagggcagag 15360
ctatgtacca acccattgat atatgataat gcacctttaa ttgacagaga tgcaacaagg 15420
ctatacaccc agagccatag gaggcacctt gtggaatttg ttacatggtc cacaccccaa 15480
ctatatcaca ttttagctaa gtccacagca ctatctatga ttgacctggt aacaaaattt 15540
gagaaggacc atatgaatga aatttcagct ctcatagggg atgacgatat caatagtttc 15600
ataactgagt ttctgctcat agagccaaga ttattcacta tctacttggg ccagtgtgcg 15660
gccatcaatt gggcatttga tgtacattat catagaccat cagggaaata tcagatgggt 15720
gagctgttgt catcgttcct ttctagaatg agcaaaggag tgtttaaggt gcttgtcaat 15780
gctctaagcc acccaaagat ctacaagaaa ttctggcatt gtggtattat agagcctatc 15840
catggtcctt cacttgatgc tcaaaacttg cacacaactg tgtgcaacat ggtttacaca 15900
tgctatatga cctacctcga cctgttgttg aatgaagagt tagaagagtt cacatttctc 15960
ttgtgtgaaa gcgacgagga tgtagtaccg gacagattcg acaacatcca ggcaaaacac 16020
ttatgtgttc tggcagattt gtactgtcaa ccagggacct gcccaccaat tcgaggtcta 16080
agaccggtag agaaatgtgc agttctaacc gaccatatca aggcagaggc tatgttatct 16140
ccagcaggat cttcgtggaa cataaatcca attattgtag accattactc atgctctctg 16200
acttatctcc ggcgaggatc gatcaaacag ataagattga gagttgatcc aggattcatt 16260
ttcgacgccc tcgctgaggt aaatgtcagt cagccaaaga tcggcagcaa caacatctca 16320
aatatgagca tcaaggcttt cagaccccca cacgatgatg ttgcaaaatt gctcaaagat 16380
atcaacacaa gcaagcacaa tcttcccatt tcagggggca atctcgccaa ttatgaaatc 16440
catgctttcc gcagaatcgg gttgaactca tctgcttgct acaaagctgt tgagatatca 16500
acattaatta ggagatgcct tgagccaggg gaggacggct tgttcttggg tgagggatcg 16560
ggttctatgt tgatcactta taaagagata cttaaactaa acaagtgctt ctataatagt 16620
ggggtttccg ccaattctag atctggtcaa agggaattag caccctatcc ctccgaagtt 16680
ggccttgtcg aacacagaat gggagtaggt aatattgtca aagtgctctt taacgggagg 16740
cccgaagtca cgtgggtagg cagtgtagat tgcttcaatt tcatagttag taatatccct 16800
acctctagtg tggggtttat ccattcagat atagagacct tgcctgacaa agatactata 16860
gagaagctag aggaattggc agccatctta tcgatggctc tgctcctggg caaaatagga 16920
tcaatactgg tgattaagct tatgcctttc agcggggatt ttgttcaggg atttataagt 16980
tatgtagggt ctcattatag agaagtgaac cttgtatacc ctagatacag caacttcatc 17040
tctactgaat cttatttggt tatgacagat ctcaaggcta accggctaat gaatcctgaa 17100
aagattaagc agcagataat tgaatcatct gtgaggactt cacctggact tataggtcac 17160
atcctatcca ttaagcaact aagctgcata caagcaattg tgggagacgc agttagtaga 17220
ggtgatatca atcctactct gaaaaaactt acacctatag agcaggtgct gatcaattgc 17280
gggttggcaa ttaacggacc taagctgtgc aaagaattga tccaccatga tgttgcctca 17340
gggcaagatg gattgcttaa ttctatactc atcctctaca gggagttggc aagattcaaa 17400
gacaaccaaa gaagtcaaca agggatgttc cacgcttacc ccgtattggt aagtagcagg 17460
caacgagaac ttatatctag gatcacccgc aaattctggg ggcacattct tctttactcc 17520
gggaacaaaa agttgataaa taagtttatc cagaatctca agtccggcta tctgatacta 17580
gacttacacc agaatatctt cgttaagaat ctatccaagt cagagaaaca gattattatg 17640
acggggggtt tgaaacgtga gtgggttttt aaggtaacag tcaaggagac caaagaatgg 17700
tataagttag tcggatacag tgccctgatt aaggactaat tggttgaact ccggaaccct 17760
aatcctgccc taggtggtta ggcattattt gcaatatatt aaagaaaact ttgaaaatac 17820
gaagtttcta ttcccagctt tgtctggtgg ccggcatggt cccagcctcc tcgctggcgc 17880
cggctgggca acattccgag gggaccgtcc cctcggtaat ggcgaatggg acgcggccga 17940
tccggctgct aacaaagccc gaaaggaagc tgagttggct gctgccaccg ctgagcaata 18000
actagcataa ccccttgggg cctctaaacg ggtcttgagg ggttttttgc tgaaaggagg 18060
aactatatcc ggatgcggcc gcgggcccta tggtacccag cttttgttcc ctttagtgag 18120
ggttaattcc gagcttggcg taatcatggt catagctgtt tcctgtgtga aattgttatc 18180
cgctcacaat tccacacaac ataggagccg gaagcataaa gtgtaaagcc tggggtgcct 18240
aatgagtgag gtaactcaca ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa 18300
acctgtcgtg ccagctgcat taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta 18360
ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc 18420
gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca ggggataacg 18480
caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt 18540
tgctggcgtt tttccatagg ctcggccccc ctgacgagca tcacaaaaat cgacgctcaa 18600
gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgttcccc cctggaagct 18660
ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc 18720
cttcgggaag cgtggcgctt tctcaatgct cacgctgtag gtatctcagt tcggtgtagg 18780
tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct 18840
tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg ccactggcag 18900
cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca gagttcttga 18960
agtggtggcc taactacggc tacactagaa ggacagtatt tggtatctgc gctctgctga 19020
agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa accaccgctg 19080
gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag 19140
aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac tcacgttaag 19200
ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta aattaaaaat 19260
gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt taccaatgct 19320
taatcagtga ggcacctatc tcagcgatct gtctatttcg ttcatccata gttgcctgac 19380
tgcccgtcgt gtagataact acgatacggg agggcttacc atctggcccc agtgctgcaa 19440
tgataccgcg agacccacgc tcaccggctc cagatttatc agcaataaac cagccagccg 19500
gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc ctccatccag tctattaatt 19560
gttgccggga agctagagta agtagttcgc cagttaatag tttgcgcaac gttgttgcca 19620
ttgctacagg catcgtggtg tcacgctcgt cgtttggtat ggcttcattc agctccggtt 19680
cccaacgatc aaggcgagtt acatgatccc ccatgttgtg aaaaaaagcg gttagctcct 19740
tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt gttatcactc atgcttatgg 19800
cagcactgca taattctctt actgtcatgc catccgtaag atgcttttct gtgactggtg 19860
agtactcaac caagtcattc tgagaatagt gtatgcggcg accgagttgc tcttgcccgg 19920
cgtcaatacg ggataatacc gcgccacata gcagaacttt aaaagtgctc atcattggaa 19980
aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct gttgagatcc agttcgatgt 20040
aacccactcg tgcacccaac tgatcttcag catcttttac tttcaccagc gtttctgggt 20100
gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt 20160
gaatactcat actcttcctt tttcaatatt attgaagcat ttatcagggt tattgtctca 20220
tgagcggata catatttgaa tgtatttaga aaaataaaca aataggggtt ccgcgcacat 20280
ttccccgaaa agtgccacct gaaattgtaa acgttaatat tttgttaaaa ttcgcgttaa 20340
atttttgtta aatcagctca ttttttaacc aataggccga aatcggcaaa atcccttata 20400
aatcaaaaga atagaccgag atagggttga gtgttgttcc agtttggaac aagagtccac 20460
tattaaagaa cgtggactcc aacgtcaaag ggcgaaaaac cgtctatcag ggcgatggcc 20520
cactacgtga accatcaccc taatcaagtt ttttggggtc gaggtgccgt aaagcactaa 20580
atcggaaccc taaagggagc ccccgattta gagcttgacg gggaaagccg gcgaacgtgg 20640
cgagaaagga agggaagaaa gcgaaaggag cgggcgctag ggcgctggca agtgtagcgg 20700
tcacgctgcg cgtaaccacc acacccgccg cgcttaatgc gccgctacag ggcgcgtccc 20760
attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 20820
tacgccagcc accgcggtg 20839
<210> 167
<211> 20467
<212> DNA
<213> Artificial sequence
<220>
<223> pTM2-MVSchw_ insert 5 (native sequence)
<400> 167
gcggccgcta atacgactca ctatagggcc aactttgttt ggtctgatga gtccgtgagg 60
acgaaacccg gagtcccggg tcaccaaaca aagttgggta aggatagttc aatcaatgat 120
catcttctag tgcacttagg attcaagatc ctattatcag ggacaagagc aggattaggg 180
atatccgaga tggccacact tttaaggagc ttagcattgt tcaaaagaaa caaggacaaa 240
ccacccatta catcaggatc cggtggagcc atcagaggaa tcaaacacat tattatagta 300
ccaatccctg gagattcctc aattaccact cgatccagac ttctggaccg gttggtgagg 360
ttaattggaa acccggatgt gagcgggccc aaactaacag gggcactaat aggtatatta 420
tccttatttg tggagtctcc aggtcaattg attcagagga tcaccgatga ccctgacgtt 480
agcataaggc tgttagaggt tgtccagagt gaccagtcac aatctggcct taccttcgca 540
tcaagaggta ccaacatgga ggatgaggcg gaccaatact tttcacatga tgatccaatt 600
agtagtgatc aatccaggtt cggatggttc gggaacaagg aaatctcaga tattgaagtg 660
caagaccctg agggattcaa catgattctg ggtaccatcc tagcccaaat ttgggtcttg 720
ctcgcaaagg cggttacggc cccagacacg gcagctgatt cggagctaag aaggtggata 780
aagtacaccc aacaaagaag ggtagttggt gaatttagat tggagagaaa atggttggat 840
gtggtgagga acaggattgc cgaggacctc tccttacgcc gattcatggt cgctctaatc 900
ctggatatca agagaacacc cggaaacaaa cccaggattg ctgaaatgat atgtgacatt 960
gatacatata tcgtagaggc aggattagcc agttttatcc tgactattaa gtttgggata 1020
gaaactatgt atcctgctct tggactgcat gaatttgctg gtgagttatc cacacttgag 1080
tccttgatga acctttacca gcaaatgggg gaaactgcac cctacatggt aatcctggag 1140
aactcaattc agaacaagtt cagtgcagga tcataccctc tgctctggag ctatgccatg 1200
ggagtaggag tggaacttga aaactccatg ggaggtttga actttggccg atcttacttt 1260
gatccagcat attttagatt agggcaagag atggtaagga ggtcagctgg aaaggtcagt 1320
tccacattgg catctgaact cggtatcact gccgaggatg caaggcttgt ttcagagatt 1380
gcaatgcata ctactgagga caagatcagt agagcggttg gacccagaca agcccaagta 1440
tcatttctac acggtgatca aagtgagaat gagctaccga gattgggggg caaggaagat 1500
aggagggtca aacagagtcg aggagaagcc agggagagct acagagaaac cgggcccagc 1560
agagcaagtg atgcgagagc tgcccatctt ccaaccggca cacccctaga cattgacact 1620
gcaacggagt ccagccaaga tccgcaggac agtcgaaggt cagctgacgc cctgcttagg 1680
ctgcaagcca tggcaggaat ctcggaagaa caaggctcag acacggacac ccctatagtg 1740
tacaatgaca gaaatcttct agactaggtg cgagaggccg agggccagaa caacatccgc 1800
ctaccatcca tcattgttat aaaaaactta ggaaccaggt ccacacagcc gccagcccat 1860
caaccatcca ctcccacgat tggagccaat ggcagaagag caggcacgcc atgtcaaaaa 1920
cggactggaa tgcatccggg ctctcaaggc cgagcccatc ggctcactgg ccatcgagga 1980
agctatggca gcatggtcag aaatatcaga caacccagga caggagcgag ccacctgcag 2040
ggaagagaag gcaggcagtt cgggtctcag caaaccatgc ctctcagcaa ttggatcaac 2100
tgaaggcggt gcacctcgca tccgcggtca gggacctgga gagagcgatg acgacgctga 2160
aactttggga atccccccaa gaaatctcca ggcatcaagc actgggttac agtgttatta 2220
cgtttatgat cacagcggtg aagcggttaa gggaatccaa gatgctgact ctatcatggt 2280
tcaatcaggc cttgatggtg atagcaccct ctcaggagga gacaatgaat ctgaaaacag 2340
cgatgtggat attggcgaac ctgataccga gggatatgct atcactgacc ggggatctgc 2400
tcccatctct atggggttca gggcttctga tgttgaaact gcagaaggag gggagatcca 2460
cgagctcctg agactccaat ccagaggcaa caactttccg aagcttggga aaactctcaa 2520
tgttcctccg cccccggacc ccggtagggc cagcacttcc gggacaccca ttaaaaaggg 2580
cacagacgcg agattagcct catttggaac ggagatcgcg tctttattga caggtggtgc 2640
aacccaatgt gctcgaaagt caccctcgga accatcaggg ccaggtgcac ctgcggggaa 2700
tgtccccgag tgtgtgagca atgccgcact gatacaggag tggacacccg aatctggtac 2760
cacaatctcc ccgagatccc agaataatga agaaggggga gactattatg atgatgagct 2820
gttctctgat gtccaagata ttaaaacagc cttggccaaa atacacgagg ataatcagaa 2880
gataatctcc aagctagaat cactgctgtt attgaaggga gaagttgagt caattaagaa 2940
gcagatcaac aggcaaaata tcagcatatc caccctggaa ggacacctct caagcatcat 3000
gatcgccatt cctggacttg ggaaggatcc caacgacccc actgcagatg tcgaaatcaa 3060
tcccgacttg aaacccatca taggcagaga ttcaggccga gcactggccg aagttctcaa 3120
gaaacccgtt gccagccgac aactccaagg aatgacaaat ggacggacca gttccagagg 3180
acagctgctg aaggaatttc agctaaagcc gatcgggaaa aagatgagct cagccgtcgg 3240
gtttgttcct gacaccggcc ctgcatcacg cagtgtaatc cgctccatta taaaatccag 3300
ccggctagag gaggatcgga agcgttacct gatgactctc cttgatgata tcaaaggagc 3360
caatgatctt gccaagttcc accagatgct gatgaagata ataatgaagt agctacagct 3420
caacttacct gccaacccca tgccagtcga cccaactagc ctaccctcca tcattgttat 3480
aaaaaactta ggaaccaggt ccacacagcc gccagcccat caacgcgtac gatggaagtc 3540
atatacttgg tcatgatact gctgattgcc ccggcataca gcatcaggtg cataggagtc 3600
agcaataggg actttgtgga aggtatgtca ggtgggactt gggttgatgt tgtcttggaa 3660
catggaggtt gtgtcaccgt aatggcacag gacaaaccga ctgtcgacat agagctggtt 3720
acaacaacag tcagcaacat ggcggaggta agatcctact gctatgaggc atcaatatcg 3780
gacatggctt cggacagccg ctgcccaaca caaggtgaag cctaccttga caagcaatca 3840
gacactcaat atgtctgcaa aagaacgtta gtggacagag gctggggaaa tggatgtgga 3900
ctttttggca aagggagcct ggtgacatgc gctaagtttg catgctccaa gaaaatgacc 3960
gggaagagca tccagccaga gaatctggag taccggataa tgctgtcagt tcatggctcc 4020
cagcacagtg ggatgatcgt taatgacaca ggacatgaaa ctgatgagaa tagagcgaag 4080
gttgagataa cgcccaattc accaagagcc gaagccaccc tggggggttt tggaagccta 4140
ggacttgatt gtgaaccgag gacaggcctt gacttttcag atttgtatta cttgactatg 4200
aataacaagc actggttggt tcacaaggag tggttccacg acattccatt accttggcac 4260
gctggggcag acaccggaac tccacactgg aacaacaaag aagcactggt agagttcaag 4320
gacgcacatg ccaaaaggca aactgtcgtg gttctaggga gtcaagaagg agcagttcac 4380
acggcccttg ctggagctct ggaggctgag atggatggtg caaagggaag gctgtcctct 4440
ggccacttga aatgtcgcct gaaaatggat aaacttagat tgaagggcgt gtcatactcc 4500
ttgtgtaccg cagcgttcac attcaccaag atcccggctg aaacactgca cgggacagtc 4560
acagtggagg tacagtacgc agggacagat ggaccttgca aggttccagc tcagatggcg 4620
gtggacatgc aaactctgac cccagttggg aggttgataa ccgctaaccc cgtaatcact 4680
gaaagcactg agaactctaa gatgatgctg gaacttgatc caccatttgg ggactcttac 4740
attgtcatag gagtcgggga gaagaagatc acccaccact ggcacaggag tggcagcacc 4800
attggaaaag catttgaagc cactgtgaga ggtgccaaga gaatggcagt cttgggagac 4860
acagcctggg actttggatc agttggaggc gctctcaact cattgggcaa gggcatctaa 4920
taagcgcgca gcgcttagac gtctcgcgat cgatactagt acaacctaaa tccattataa 4980
aaaacttagg agcaaagtga ttgcctccca aggtccacaa tgacagagac ctacgacttc 5040
gacaagtcgg catgggacat caaagggtcg atcgctccga tacaacccac cacctacagt 5100
gatggcaggc tggtgcccca ggtcagagtc atagatcctg gtctaggcga caggaaggat 5160
gaatgcttta tgtacatgtt tctgctgggg gttgttgagg acagcgattc cctagggcct 5220
ccaatcgggc gagcatttgg gttcctgccc ttaggtgttg gcagatccac agcaaagccc 5280
gaaaaactcc tcaaagaggc cactgagctt gacatagttg ttagacgtac agcagggctc 5340
aatgaaaaac tggtgttcta caacaacacc ccactaactc tcctcacacc ttggagaaag 5400
gtcctaacaa cagggagtgt cttcaacgca aaccaagtgt gcaatgcggt taatctgata 5460
ccgctcgata ccccgcagag gttccgtgtt gtttatatga gcatcacccg tctttcggat 5520
aacgggtatt acaccgttcc tagaagaatg ctggaattca gatcggtcaa tgcagtggcc 5580
ttcaacctgc tggtgaccct taggattgac aaggcgatag gccctgggaa gatcatcgac 5640
aatacagagc aacttcctga ggcaacattt atggtccaca tcgggaactt caggagaaag 5700
aagagtgaag tctactctgc cgattattgc aaaatgaaaa tcgaaaagat gggcctggtt 5760
tttgcacttg gtgggatagg gggcaccagt cttcacatta gaagcacagg caaaatgagc 5820
aagactctcc atgcacaact cgggttcaag aagaccttat gttacccgct gatggatatc 5880
aatgaagacc ttaatcgatt actctggagg agcagatgca agatagtaag aatccaggca 5940
gttttgcagc catcagttcc tcaagaattc cgcatttacg acgacgtgat cataaatgat 6000
gaccaaggac tattcaaagt tctgtagacc gtagtgccca gcaatgcccg aaaacgaccc 6060
ccctcacaat gacagccaga aggcccggac aaaaaagccc cctccgaaag actccacgga 6120
ccaagcgaga ggccagccag cagccgacgg caagcgcgaa caccaggcgg ccccagcaca 6180
gaacagccct gacacaaggc caccaccagc caccccaatc tgcatcctcc tcgtgggacc 6240
cccgaggacc aacccccaag gctgcccccg atccaaacca ccaaccgcat ccccaccacc 6300
cccgggaaag aaacccccag caattggaag gcccctcccc ctcttcctca acacaagaac 6360
tccacaaccg aaccgcacaa gcgaccgagg tgacccaacc gcaggcatcc gactccctag 6420
acagatcctc tctccccggc aaactaaaca aaacttaggg ccaaggaaca tacacaccca 6480
acagaaccca gaccccggcc cacggcgccg cgcccccaac ccccgacaac cagagggagc 6540
ccccaaccaa tcccgccggc tcccccggtg cccacaggca gggacaccaa cccccgaaca 6600
gacccagcac ccaaccatcg acaatccaag acgggggggc ccccccaaaa aaaggccccc 6660
aggggccgac agccagcacc gcgaggaagc ccacccaccc cacacacgac cacggcaacc 6720
aaaccagaac ccagaccacc ctgggccacc agctcccaga ctcggccatc accccgcaga 6780
aaggaaaggc cacaacccgc gcaccccagc cccgatccgg cggggagcca cccaacccga 6840
accagcaccc aagagcgatc cccgaaggac ccccgaaccg caaaggacat cagtatccca 6900
cagcctctcc aagtcccccg gtctcctcct cttctcgaag ggaccaaaag atcaatccac 6960
cacacccgac gacactcaac tccccacccc taaaggagac accgggaatc ccagaatcaa 7020
gactcatcca atgtccatca tgggtctcaa ggtgaacgtc tctgccatat tcatggcagt 7080
actgttaact ctccaaacac ccaccggtca aatccattgg ggcaatctct ctaagatagg 7140
ggtggtagga ataggaagtg caagctacaa agttatgact cgttccagcc atcaatcatt 7200
agtcataaaa ttaatgccca atataactct cctcaataac tgcacgaggg tagagattgc 7260
agaatacagg agactactga gaacagtttt ggaaccaatt agagatgcac ttaatgcaat 7320
gacccagaat ataagaccgg ttcagagtgt agcttcaagt aggagacaca agagatttgc 7380
gggagtagtc ctggcaggtg cggccctagg cgttgccaca gctgctcaga taacagccgg 7440
cattgcactt caccagtcca tgctgaactc tcaagccatc gacaatctga gagcgagcct 7500
ggaaactact aatcaggcaa ttgagacaat cagacaagca gggcaggaga tgatattggc 7560
tgttcagggt gtccaagact acatcaataa tgagctgata ccgtctatga accaactatc 7620
ttgtgattta atcggccaga agctcgggct caaattgctc agatactata cagaaatcct 7680
gtcattattt ggccccagtt tacgggaccc catatctgcg gagatatcta tccaggcttt 7740
gagctatgcg cttggaggag acatcaataa ggtgttagaa aagctcggat acagtggagg 7800
tgatttactg ggcatcttag agagcggagg aataaaggcc cggataactc acgtcgacac 7860
agagtcctac ttcattgtcc tcagtatagc ctatccgacg ctgtccgaga ttaagggggt 7920
gattgtccac cggctagagg gggtctcgta caacataggc tctcaagagt ggtataccac 7980
tgtgcccaag tatgttgcaa cccaagggta ccttatctcg aattttgatg agtcatcgtg 8040
tactttcatg ccagagggga ctgtgtgcag ccaaaatgcc ttgtacccga tgagtcctct 8100
gctccaagaa tgcctccggg ggtacaccaa gtcctgtgct cgtacactcg tatccgggtc 8160
ttttgggaac cggttcattt tatcacaagg gaacctaata gccaattgtg catcaatcct 8220
ttgcaagtgt tacacaacag gaacgatcat taatcaagac cctgacaaga tcctaacata 8280
cattgctgcc gatcactgcc cggtagtcga ggtgaacggc gtgaccatcc aagtcgggag 8340
caggaggtat ccagacgctg tgtacttgca cagaattgac ctcggtcctc ccatatcatt 8400
ggagaggttg gacgtaggga caaatctggg gaatgcaatt gctaagttgg aggatgccaa 8460
ggaattgttg gagtcatcgg accagatatt gaggagtatg aaaggtttat cgagcactag 8520
catagtctac atcctgattg cagtgtgtct tggagggttg atagggatcc ccgctttaat 8580
atgttgctgc agggggcgtt gtaacaaaaa gggagaacaa gttggtatgt caagaccagg 8640
cctaaagcct gatcttacgg gaacatcaaa atcctatgta aggtcgctct gatcctctac 8700
aactcttgaa acacaaatgt cccacaagtc tcctcttcgt catcaagcaa ccaccgcacc 8760
cagcatcaag cccacctgaa attatctccg gcttccctct ggccgaacaa tatcggtagt 8820
taatcaaaac ttagggtgca agatcatcca caatgtcacc acaacgagac cggataaatg 8880
ccttctacaa agataacccc catcccaagg gaagtaggat agtcattaac agagaacatc 8940
ttatgattga tagaccttat gttttgctgg ctgttctgtt tgtcatgttt ctgagcttga 9000
tcgggttgct agccattgca ggcattagac ttcatcgggc agccatctac accgcagaga 9060
tccataaaag cctcagcacc aatctagatg taactaactc aatcgagcat caggtcaagg 9120
acgtgctgac accactcttc aaaatcatcg gtgatgaagt gggcctgagg acacctcaga 9180
gattcactga cctagtgaaa ttaatctctg acaagattaa attccttaat ccggataggg 9240
agtacgactt cagagatctc acttggtgta tcaacccgcc agagagaatc aaattggatt 9300
atgatcaata ctgtgcagat gtggctgctg aagagctcat gaatgcattg gtgaactcaa 9360
ctctactgga gaccagaaca accaatcagt tcctagctgt ctcaaaggga aactgctcag 9420
ggcccactac aatcagaggt caattctcaa acatgtcgct gtccctgtta gacttgtatt 9480
taggtcgagg ttacaatgtg tcatctatag tcactatgac atcccaggga atgtatgggg 9540
gaacttacct agtggaaaag cctaatctga gcagcaaaag gtcagagttg tcacaactga 9600
gcatgtaccg agtgtttgaa gtaggtgtta tcagaaatcc gggtttgggg gctccggtgt 9660
tccatatgac aaactatctt gagcaaccag tcagtaatga tctcagcaac tgtatggtgg 9720
ctttggggga gctcaaactc gcagcccttt gtcacgggga agattctatc acaattccct 9780
atcagggatc agggaaaggt gtcagcttcc agctcgtcaa gctaggtgtc tggaaatccc 9840
caaccgacat gcaatcctgg gtccccttat caacggatga tccagtgata gacaggcttt 9900
acctctcatc tcacagaggt gttatcgctg acaatcaagc aaaatgggct gtcccgacaa 9960
cacgaacaga tgacaagttg cgaatggaga catgcttcca acaggcgtgt aagggtaaaa 10020
tccaagcact ctgcgagaat cccgagtggg caccattgaa ggataacagg attccttcat 10080
acggggtctt gtctgttgat ctgagtctga cagttgagct taaaatcaaa attgcttcgg 10140
gattcgggcc attgatcaca cacggttcag ggatggacct atacaaatcc aaccacaaca 10200
atgtgtattg gctgactatc ccgccaatga agaacctagc cttaggtgta atcaacacat 10260
tggagtggat accgagattc aaggttagtc cctacctctt cactgtccca attaaggaag 10320
caggcgaaga ctgccatgcc ccaacatacc tacctgcgga ggtggatggt gatgtcaaac 10380
tcagttccaa tctggtgatt ctacctggtc aagatctcca atatgttttg gcaacctacg 10440
atacttccag ggttgaacat gctgtggttt attacgttta cagcccaagc cgctcatttt 10500
cttactttta tccttttagg ttgcctataa agggggtccc catcgaatta caagtggaat 10560
gcttcacatg ggaccaaaaa ctctggtgcc gtcacttctg tgtgcttgcg gactcagaat 10620
ctggtggaca tatcactcac tctgggatgg tgggcatggg agtcagctgc acagtcaccc 10680
gggaagatgg aaccaatcgc agatagggct gctagtgaac caatcacatg atgtcaccca 10740
gacatcaggc atacccacta gtgtgaaata gacatcagaa ttaagaaaaa cgtagggtcc 10800
aagtggttcc ccgttatgga ctcgctatct gtcaaccaga tcttataccc tgaagttcac 10860
ctagatagcc cgatagttac caataagata gtagccatcc tggagtatgc tcgagtccct 10920
cacgcttaca gcctggagga ccctacactg tgtcagaaca tcaagcaccg cctaaaaaac 10980
ggattttcca accaaatgat tataaacaat gtggaagttg ggaatgtcat caagtccaag 11040
cttaggagtt atccggccca ctctcatatt ccatatccaa attgtaatca ggatttattt 11100
aacatagaag acaaagagtc aacgaggaag atccgtgaac tcctcaaaaa ggggaattcg 11160
ctgtactcca aagtcagtga taaggttttc caatgcttaa gggacactaa ctcacggctt 11220
ggcctaggct ccgaattgag ggaggacatc aaggagaaag ttattaactt gggagtttac 11280
atgcacagct cccagtggtt tgagcccttt ctgttttggt ttacagtcaa gactgagatg 11340
aggtcagtga ttaaatcaca aacccatact tgccatagga ggagacacac acctgtattc 11400
ttcactggta gttcagttga gttgctaatc tctcgtgacc ttgttgctat aatcagtaaa 11460
gagtctcaac atgtatatta cctgacattt gaactggttt tgatgtattg tgatgtcata 11520
gaggggaggt taatgacaga gaccgctatg actattgatg ctaggtatac agagcttcta 11580
ggaagagtca gatacatgtg gaaactgata gatggtttct tccctgcact cgggaatcca 11640
acttatcaaa ttgtagccat gctggagcct ctttcacttg cttacctgca gctgagggat 11700
ataacagtag aactcagagg tgctttcctt aaccactgct ttactgaaat acatgatgtt 11760
cttgaccaaa acgggttttc tgatgaaggt acttatcatg agttaactga agctctagat 11820
tacattttca taactgatga catacatctg acaggggaga ttttctcatt tttcagaagt 11880
ttcggccacc ccagacttga agcagtaacg gctgctgaaa atgttaggaa atacatgaat 11940
cagcctaaag tcattgtgta tgagactctg atgaaaggtc atgccatatt ttgtggaatc 12000
ataatcaacg gctatcgtga caggcacgga ggcagttggc caccgctgac cctccccctg 12060
catgctgcag acacaatccg gaatgctcaa gcttcaggtg aagggttaac acatgagcag 12120
tgcgttgata actggaaatc ttttgctgga gtgaaatttg gctgctttat gcctcttagc 12180
ctggatagtg atctgacaat gtacctaaag gacaaggcac ttgctgctct ccaaagggaa 12240
tgggattcag tttacccgaa agagttcctg cgttacgacc ctcccaaggg aaccgggtca 12300
cggaggcttg tagatgtttt ccttaatgat tcgagctttg acccatatga tgtgataatg 12360
tatgttgtaa gtggagctta cctccatgac cctgagttca acctgtctta cagcctgaaa 12420
gaaaaggaga tcaaggaaac aggtagactt tttgctaaaa tgacttacaa aatgagggca 12480
tgccaagtga ttgctgaaaa tctaatctca aacgggattg gcaaatattt taaggacaat 12540
gggatggcca aggatgagca cgatttgact aaggcactcc acactctagc tgtctcagga 12600
gtccccaaag atctcaaaga aagtcacagg ggggggccag tcttaaaaac ctactcccga 12660
agcccagtcc acacaagtac caggaacgtg agagcagcaa aagggtttat agggttccct 12720
caagtaattc ggcaggacca agacactgat catccggaga atatggaagc ttacgagaca 12780
gtcagtgcat ttatcacgac tgatctcaag aagtactgcc ttaattggag atatgagacc 12840
atcagcttgt ttgcacagag gctaaatgag atttacggat tgccctcatt tttccagtgg 12900
ctgcataaga ggcttgagac ctctgtcctg tatgtaagtg accctcattg cccccccgac 12960
cttgacgccc atatcccgtt atataaagtc cccaatgatc aaatcttcat taagtaccct 13020
atgggaggta tagaagggta ttgtcagaag ctgtggacca tcagcaccat tccctatcta 13080
tacctggctg cttatgagag cggagtaagg attgcttcgt tagtgcaagg ggacaatcag 13140
accatagccg taacaaaaag ggtacccagc acatggccct acaaccttaa gaaacgggaa 13200
gctgctagag taactagaga ttactttgta attcttaggc aaaggctaca tgatattggc 13260
catcacctca aggcaaatga gacaattgtt tcatcacatt tttttgtcta ttcaaaagga 13320
atatattatg atgggctact tgtgtcccaa tcactcaaga gcatcgcaag atgtgtattc 13380
tggtcagaga ctatagttga tgaaacaagg gcagcatgca gtaatattgc tacaacaatg 13440
gctaaaagca tcgagagagg ttatgaccgt taccttgcat attccctgaa cgtcctaaaa 13500
gtgatacagc aaattctgat ctctcttggc ttcacaatca attcaaccat gacccgggat 13560
gtagtcatac ccctcctcac aaacaacgac ctcttaataa ggatggcact gttgcccgct 13620
cctattgggg ggatgaatta tctgaatatg agcaggctgt ttgtcagaaa catcggtgat 13680
ccagtaacat catcaattgc tgatctcaag agaatgattc tcgcctcact aatgcctgaa 13740
gagaccctcc atcaagtaat gacacaacaa ccgggggact cttcattcct agactgggct 13800
agcgaccctt actcagcaaa tcttgtatgt gtccagagca tcactagact cctcaagaac 13860
ataactgcaa ggtttgtcct gatccatagt ccaaacccaa tgttaaaagg attattccat 13920
gatgacagta aagaagagga cgagggactg gcggcattcc tcatggacag gcatattata 13980
gtacctaggg cagctcatga aatcctggat catagtgtca caggggcaag agagtctatt 14040
gcaggcatgc tggataccac aaaaggcttg attcgagcca gcatgaggaa gggggggtta 14100
acctctcgag tgataaccag attgtccaat tatgactatg aacaattcag agcagggatg 14160
gtgctattga caggaagaaa gagaaatgtc ctcattgaca aagagtcatg ttcagtgcag 14220
ctggcgagag ctctaagaag ccatatgtgg gcgaggctag ctcgaggacg gcctatttac 14280
ggccttgagg tccctgatgt actagaatct atgcgaggcc accttattcg gcgtcatgag 14340
acatgtgtca tctgcgagtg tggatcagtc aactacggat ggttttttgt cccctcgggt 14400
tgccaactgg atgatattga caaggaaaca tcatccttga gagtcccata tattggttct 14460
accactgatg agagaacaga catgaagctt gccttcgtaa gagccccaag tcgatccttg 14520
cgatctgctg ttagaatagc aacagtgtac tcatgggctt acggtgatga tgatagctct 14580
tggaacgaag cctggttgtt ggctaggcaa agggccaatg tgagcctgga ggagctaagg 14640
gtgatcactc ccatctcaac ttcgactaat ttagcgcata ggttgaggga tcgtagcact 14700
caagtgaaat actcaggtac atcccttgtc cgagtggcga ggtataccac aatctccaac 14760
gacaatctct catttgtcat atcagataag aaggttgata ctaactttat ataccaacaa 14820
ggaatgcttc tagggttggg tgttttagaa acattgtttc gactcgagaa agataccgga 14880
tcatctaaca cggtattaca tcttcacgtc gaaacagatt gttgcgtgat cccgatgata 14940
gatcatccca ggatacccag ctcccgcaag ctagagctga gggcagagct atgtaccaac 15000
ccattgatat atgataatgc acctttaatt gacagagatg caacaaggct atacacccag 15060
agccatagga ggcaccttgt ggaatttgtt acatggtcca caccccaact atatcacatt 15120
ttagctaagt ccacagcact atctatgatt gacctggtaa caaaatttga gaaggaccat 15180
atgaatgaaa tttcagctct cataggggat gacgatatca atagtttcat aactgagttt 15240
ctgctcatag agccaagatt attcactatc tacttgggcc agtgtgcggc catcaattgg 15300
gcatttgatg tacattatca tagaccatca gggaaatatc agatgggtga gctgttgtca 15360
tcgttccttt ctagaatgag caaaggagtg tttaaggtgc ttgtcaatgc tctaagccac 15420
ccaaagatct acaagaaatt ctggcattgt ggtattatag agcctatcca tggtccttca 15480
cttgatgctc aaaacttgca cacaactgtg tgcaacatgg tttacacatg ctatatgacc 15540
tacctcgacc tgttgttgaa tgaagagtta gaagagttca catttctctt gtgtgaaagc 15600
gacgaggatg tagtaccgga cagattcgac aacatccagg caaaacactt atgtgttctg 15660
gcagatttgt actgtcaacc agggacctgc ccaccaattc gaggtctaag accggtagag 15720
aaatgtgcag ttctaaccga ccatatcaag gcagaggcta tgttatctcc agcaggatct 15780
tcgtggaaca taaatccaat tattgtagac cattactcat gctctctgac ttatctccgg 15840
cgaggatcga tcaaacagat aagattgaga gttgatccag gattcatttt cgacgccctc 15900
gctgaggtaa atgtcagtca gccaaagatc ggcagcaaca acatctcaaa tatgagcatc 15960
aaggctttca gacccccaca cgatgatgtt gcaaaattgc tcaaagatat caacacaagc 16020
aagcacaatc ttcccatttc agggggcaat ctcgccaatt atgaaatcca tgctttccgc 16080
agaatcgggt tgaactcatc tgcttgctac aaagctgttg agatatcaac attaattagg 16140
agatgccttg agccagggga ggacggcttg ttcttgggtg agggatcggg ttctatgttg 16200
atcacttata aagagatact taaactaaac aagtgcttct ataatagtgg ggtttccgcc 16260
aattctagat ctggtcaaag ggaattagca ccctatccct ccgaagttgg ccttgtcgaa 16320
cacagaatgg gagtaggtaa tattgtcaaa gtgctcttta acgggaggcc cgaagtcacg 16380
tgggtaggca gtgtagattg cttcaatttc atagttagta atatccctac ctctagtgtg 16440
gggtttatcc attcagatat agagaccttg cctgacaaag atactataga gaagctagag 16500
gaattggcag ccatcttatc gatggctctg ctcctgggca aaataggatc aatactggtg 16560
attaagctta tgcctttcag cggggatttt gttcagggat ttataagtta tgtagggtct 16620
cattatagag aagtgaacct tgtataccct agatacagca acttcatctc tactgaatct 16680
tatttggtta tgacagatct caaggctaac cggctaatga atcctgaaaa gattaagcag 16740
cagataattg aatcatctgt gaggacttca cctggactta taggtcacat cctatccatt 16800
aagcaactaa gctgcataca agcaattgtg ggagacgcag ttagtagagg tgatatcaat 16860
cctactctga aaaaacttac acctatagag caggtgctga tcaattgcgg gttggcaatt 16920
aacggaccta agctgtgcaa agaattgatc caccatgatg ttgcctcagg gcaagatgga 16980
ttgcttaatt ctatactcat cctctacagg gagttggcaa gattcaaaga caaccaaaga 17040
agtcaacaag ggatgttcca cgcttacccc gtattggtaa gtagcaggca acgagaactt 17100
atatctagga tcacccgcaa attctggggg cacattcttc tttactccgg gaacaaaaag 17160
ttgataaata agtttatcca gaatctcaag tccggctatc tgatactaga cttacaccag 17220
aatatcttcg ttaagaatct atccaagtca gagaaacaga ttattatgac ggggggtttg 17280
aaacgtgagt gggtttttaa ggtaacagtc aaggagacca aagaatggta taagttagtc 17340
ggatacagtg ccctgattaa ggactaattg gttgaactcc ggaaccctaa tcctgcccta 17400
ggtggttagg cattatttgc aatatattaa agaaaacttt gaaaatacga agtttctatt 17460
cccagctttg tctggtggcc ggcatggtcc cagcctcctc gctggcgccg gctgggcaac 17520
attccgaggg gaccgtcccc tcggtaatgg cgaatgggac gcggccgatc cggctgctaa 17580
caaagcccga aaggaagctg agttggctgc tgccaccgct gagcaataac tagcataacc 17640
ccttggggcc tctaaacggg tcttgagggg ttttttgctg aaaggaggaa ctatatccgg 17700
atgcggccgc gggccctatg gtacccagct tttgttccct ttagtgaggg ttaattccga 17760
gcttggcgta atcatggtca tagctgtttc ctgtgtgaaa ttgttatccg ctcacaattc 17820
cacacaacat aggagccgga agcataaagt gtaaagcctg gggtgcctaa tgagtgaggt 17880
aactcacatt aattgcgttg cgctcactgc ccgctttcca gtcgggaaac ctgtcgtgcc 17940
agctgcatta atgaatcggc caacgcgcgg ggagaggcgg tttgcgtatt gggcgctctt 18000
ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag 18060
ctcactcaaa ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca 18120
tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt 18180
tccataggct cggcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc 18240
gaaacccgac aggactataa agataccagg cgttcccccc tggaagctcc ctcgtgcgct 18300
ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg 18360
tggcgctttc tcaatgctca cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca 18420
agctgggctg tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta tccggtaact 18480
atcgtcttga gtccaacccg gtaagacacg acttatcgcc actggcagca gccactggta 18540
acaggattag cagagcgagg tatgtaggcg gtgctacaga gttcttgaag tggtggccta 18600
actacggcta cactagaagg acagtatttg gtatctgcgc tctgctgaag ccagttacct 18660
tcggaaaaag agttggtagc tcttgatccg gcaaacaaac caccgctggt agcggtggtt 18720
tttttgtttg caagcagcag attacgcgca gaaaaaaagg atctcaagaa gatcctttga 18780
tcttttctac ggggtctgac gctcagtgga acgaaaactc acgttaaggg attttggtca 18840
tgagattatc aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga agttttaaat 18900
caatctaaag tatatatgag taaacttggt ctgacagtta ccaatgctta atcagtgagg 18960
cacctatctc agcgatctgt ctatttcgtt catccatagt tgcctgactg cccgtcgtgt 19020
agataactac gatacgggag ggcttaccat ctggccccag tgctgcaatg ataccgcgag 19080
acccacgctc accggctcca gatttatcag caataaacca gccagccgga agggccgagc 19140
gcagaagtgg tcctgcaact ttatccgcct ccatccagtc tattaattgt tgccgggaag 19200
ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt tgttgccatt gctacaggca 19260
tcgtggtgtc acgctcgtcg tttggtatgg cttcattcag ctccggttcc caacgatcaa 19320
ggcgagttac atgatccccc atgttgtgaa aaaaagcggt tagctccttc ggtcctccga 19380
tcgttgtcag aagtaagttg gccgcagtgt tatcactcat gcttatggca gcactgcata 19440
attctcttac tgtcatgcca tccgtaagat gcttttctgt gactggtgag tactcaacca 19500
agtcattctg agaatagtgt atgcggcgac cgagttgctc ttgcccggcg tcaatacggg 19560
ataataccgc gccacatagc agaactttaa aagtgctcat cattggaaaa cgttcttcgg 19620
ggcgaaaact ctcaaggatc ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg 19680
cacccaactg atcttcagca tcttttactt tcaccagcgt ttctgggtga gcaaaaacag 19740
gaaggcaaaa tgccgcaaaa aagggaataa gggcgacacg gaaatgttga atactcatac 19800
tcttcctttt tcaatattat tgaagcattt atcagggtta ttgtctcatg agcggataca 19860
tatttgaatg tatttagaaa aataaacaaa taggggttcc gcgcacattt ccccgaaaag 19920
tgccacctga aattgtaaac gttaatattt tgttaaaatt cgcgttaaat ttttgttaaa 19980
tcagctcatt ttttaaccaa taggccgaaa tcggcaaaat cccttataaa tcaaaagaat 20040
agaccgagat agggttgagt gttgttccag tttggaacaa gagtccacta ttaaagaacg 20100
tggactccaa cgtcaaaggg cgaaaaaccg tctatcaggg cgatggccca ctacgtgaac 20160
catcacccta atcaagtttt ttggggtcga ggtgccgtaa agcactaaat cggaacccta 20220
aagggagccc ccgatttaga gcttgacggg gaaagccggc gaacgtggcg agaaaggaag 20280
ggaagaaagc gaaaggagcg ggcgctaggg cgctggcaag tgtagcggtc acgctgcgcg 20340
taaccaccac acccgccgcg cttaatgcgc cgctacaggg cgcgtcccat tcgccattca 20400
ggctgcgcaa ctgttgggaa gggcgatcgg tgcgggcctc ttcgctatta cgccagccac 20460
cgcggtg 20467
<210> 168
<211> 1764
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of insert 4
<400> 168
atgggtgtcg gaattgttgg cctcctgctg accacagcta tggcagcgga ggtcactaga 60
cgtgggagtg catactatat gtacttggac agaaacgacg ctggggaggc catatctttt 120
ccaaccacat tggggatgaa taagtgttat atacagatca tggatcttgg acacatgtgt 180
gatgccacca tgagctatga atgccctatg ctggatgagg gggtggaacc agatgacgtc 240
gattgttggt gcaacacgac gtcaacttgg gttgtgtacg gaacctgcca tcacaaaaaa 300
ggtgaagcac ggagatctag aagagctgtg acgctcccct cccattccac taggaagctg 360
caaacgcggt cgcaaacctg gttggaatca agagaataca caaagcactt gattagagtc 420
gaaaattgga tattcaggaa ccctggcttc gcgttagcag cagctgccat cgcttggctt 480
ttgggaagct caacgagcca aaaagtcata tacttggtca tgatactgct gattgccccg 540
gcatacagca tcaggtgcat aggagtcagc aatagggact ttgtggaagg tatgtcaggt 600
gggacttggg ttgatgttgt cttggaacat ggaggttgtg tcaccgtaat ggcacaggac 660
aaaccgactg tcgacataga gctggttaca acaacagtca gcaacatggc ggaggtaaga 720
tcctactgct atgaggcatc aatatcggac atggcttcgg acagccgctg cccaacacaa 780
ggtgaagcct accttgacaa gcaatcagac actcaatatg tctgcaaaag aacgttagtg 840
gacagaggct ggggaaatgg atgtggactt tttggcaaag ggagcctggt gacatgcgct 900
aagtttgcat gctccaagaa aatgaccggg aagagcatcc agccagagaa tctggagtac 960
cggataatgc tgtcagttca tggctcccag cacagtggga tgatcgttaa tgacacagga 1020
catgaaactg atgagaatag agcgaaggtt gagataacgc ccaattcacc aagagccgaa 1080
gccaccctgg ggggttttgg aagcctagga cttgattgtg aaccgaggac aggccttgac 1140
ttttcagatt tgtattactt gactatgaat aacaagcact ggttggttca caaggagtgg 1200
ttccacgaca ttccattacc ttggcacgct ggggcagaca ccggaactcc acactggaac 1260
aacaaagaag cactggtaga gttcaaggac gcacatgcca aaaggcaaac tgtcgtggtt 1320
ctagggagtc aagaaggagc agttcacacg gcccttgctg gagctctgga ggctgagatg 1380
gatggtgcaa agggaaggct gtcctctggc cacttgaaat gtcgcctgaa aatggataaa 1440
cttagattga agggcgtgtc atactccttg tgtaccgcag cgttcacatt caccaagatc 1500
ccggctgaaa cactgcacgg gacagtcaca gtggaggtac agtacgcagg gacagatgga 1560
ccttgcaagg ttccagctca gatggcggtg gacatgcaaa ctctgacccc agttgggagg 1620
ttgataaccg ctaaccccgt aatcactgaa agcactgaga actctaagat gatgctggaa 1680
cttgatccac catttgggga ctcttacatt gtcataggag tcggggagaa gaagatcacc 1740
caccactggc acaggagtgg ctaa 1764
<210> 169
<211> 587
<212> PRT
<213> Artificial sequence
<220>
<223> Insert 4
<400> 169
Met Gly Val Gly Ile Val Gly Leu Leu Leu Thr Thr Ala Met Ala Ala
1 5 10 15
Glu Val Thr Arg Arg Gly Ser Ala Tyr Tyr Met Tyr Leu Asp Arg Asn
20 25 30
Asp Ala Gly Glu Ala Ile Ser Phe Pro Thr Thr Leu Gly Met Asn Lys
35 40 45
Cys Tyr Ile Gln Ile Met Asp Leu Gly His Met Cys Asp Ala Thr Met
50 55 60
Ser Tyr Glu Cys Pro Met Leu Asp Glu Gly Val Glu Pro Asp Asp Val
65 70 75 80
Asp Cys Trp Cys Asn Thr Thr Ser Thr Trp Val Val Tyr Gly Thr Cys
85 90 95
His His Lys Lys Gly Glu Ala Arg Arg Ser Arg Arg Ala Val Thr Leu
100 105 110
Pro Ser His Ser Thr Arg Lys Leu Gln Thr Arg Ser Gln Thr Trp Leu
115 120 125
Glu Ser Arg Glu Tyr Thr Lys His Leu Ile Arg Val Glu Asn Trp Ile
130 135 140
Phe Arg Asn Pro Gly Phe Ala Leu Ala Ala Ala Ala Ile Ala Trp Leu
145 150 155 160
Leu Gly Ser Ser Thr Ser Gln Lys Val Ile Tyr Leu Val Met Ile Leu
165 170 175
Leu Ile Ala Pro Ala Tyr Ser Ile Arg Cys Ile Gly Val Ser Asn Arg
180 185 190
Asp Phe Val Glu Gly Met Ser Gly Gly Thr Trp Val Asp Val Val Leu
195 200 205
Glu His Gly Gly Cys Val Thr Val Met Ala Gln Asp Lys Pro Thr Val
210 215 220
Asp Ile Glu Leu Val Thr Thr Thr Val Ser Asn Met Ala Glu Val Arg
225 230 235 240
Ser Tyr Cys Tyr Glu Ala Ser Ile Ser Asp Met Ala Ser Asp Ser Arg
245 250 255
Cys Pro Thr Gln Gly Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr Gln
260 265 270
Tyr Val Cys Lys Arg Thr Leu Val Asp Arg Gly Trp Gly Asn Gly Cys
275 280 285
Gly Leu Phe Gly Lys Gly Ser Leu Val Thr Cys Ala Lys Phe Ala Cys
290 295 300
Ser Lys Lys Met Thr Gly Lys Ser Ile Gln Pro Glu Asn Leu Glu Tyr
305 310 315 320
Arg Ile Met Leu Ser Val His Gly Ser Gln His Ser Gly Met Ile Val
325 330 335
Asn Asp Thr Gly His Glu Thr Asp Glu Asn Arg Ala Lys Val Glu Ile
340 345 350
Thr Pro Asn Ser Pro Arg Ala Glu Ala Thr Leu Gly Gly Phe Gly Ser
355 360 365
Leu Gly Leu Asp Cys Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp Leu
370 375 380
Tyr Tyr Leu Thr Met Asn Asn Lys His Trp Leu Val His Lys Glu Trp
385 390 395 400
Phe His Asp Ile Pro Leu Pro Trp His Ala Gly Ala Asp Thr Gly Thr
405 410 415
Pro His Trp Asn Asn Lys Glu Ala Leu Val Glu Phe Lys Asp Ala His
420 425 430
Ala Lys Arg Gln Thr Val Val Val Leu Gly Ser Gln Glu Gly Ala Val
435 440 445
His Thr Ala Leu Ala Gly Ala Leu Glu Ala Glu Met Asp Gly Ala Lys
450 455 460
Gly Arg Leu Ser Ser Gly His Leu Lys Cys Arg Leu Lys Met Asp Lys
465 470 475 480
Leu Arg Leu Lys Gly Val Ser Tyr Ser Leu Cys Thr Ala Ala Phe Thr
485 490 495
Phe Thr Lys Ile Pro Ala Glu Thr Leu His Gly Thr Val Thr Val Glu
500 505 510
Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys Lys Val Pro Ala Gln Met
515 520 525
Ala Val Asp Met Gln Thr Leu Thr Pro Val Gly Arg Leu Ile Thr Ala
530 535 540
Asn Pro Val Ile Thr Glu Ser Thr Glu Asn Ser Lys Met Met Leu Glu
545 550 555 560
Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile Val Ile Gly Val Gly Glu
565 570 575
Lys Lys Ile Thr His His Trp His Arg Ser Gly
580 585
<210> 170
<211> 1392
<212> DNA
<213> Artificial sequence
<220>
<223> Native nucleotide sequence of insert 5
<400> 170
atggaagtca tatacttggt catgatactg ctgattgccc cggcatacag catcaggtgc 60
ataggagtca gcaataggga ctttgtggaa ggtatgtcag gtgggacttg ggttgatgtt 120
gtcttggaac atggaggttg tgtcaccgta atggcacagg acaaaccgac tgtcgacata 180
gagctggtta caacaacagt cagcaacatg gcggaggtaa gatcctactg ctatgaggca 240
tcaatatcgg acatggcttc ggacagccgc tgcccaacac aaggtgaagc ctaccttgac 300
aagcaatcag acactcaata tgtctgcaaa agaacgttag tggacagagg ctggggaaat 360
ggatgtggac tttttggcaa agggagcctg gtgacatgcg ctaagtttgc atgctccaag 420
aaaatgaccg ggaagagcat ccagccagag aatctggagt accggataat gctgtcagtt 480
catggctccc agcacagtgg gatgatcgtt aatgacacag gacatgaaac tgatgagaat 540
agagcgaagg ttgagataac gcccaattca ccaagagccg aagccaccct ggggggtttt 600
ggaagcctag gacttgattg tgaaccgagg acaggccttg acttttcaga tttgtattac 660
ttgactatga ataacaagca ctggttggtt cacaaggagt ggttccacga cattccatta 720
ccttggcacg ctggggcaga caccggaact ccacactgga acaacaaaga agcactggta 780
gagttcaagg acgcacatgc caaaaggcaa actgtcgtgg ttctagggag tcaagaagga 840
gcagttcaca cggcccttgc tggagctctg gaggctgaga tggatggtgc aaagggaagg 900
ctgtcctctg gccacttgaa atgtcgcctg aaaatggata aacttagatt gaagggcgtg 960
tcatactcct tgtgtaccgc agcgttcaca ttcaccaaga tcccggctga aacactgcac 1020
gggacagtca cagtggaggt acagtacgca gggacagatg gaccttgcaa ggttccagct 1080
cagatggcgg tggacatgca aactctgacc ccagttggga ggttgataac cgctaacccc 1140
gtaatcactg aaagcactga gaactctaag atgatgctgg aacttgatcc accatttggg 1200
gactcttaca ttgtcatagg agtcggggag aagaagatca cccaccactg gcacaggagt 1260
ggcagcacca ttggaaaagc atttgaagcc actgtgagag gtgccaagag aatggcagtc 1320
ttgggagaca cagcctggga ctttggatca gttggaggcg ctctcaactc attgggcaag 1380
ggcatctaat aa 1392
<210> 171
<211> 462
<212> PRT
<213> Artificial sequence
<220>
<223> Insert 5
<400> 171
Met Glu Val Ile Tyr Leu Val Met Ile Leu Leu Ile Ala Pro Ala Tyr
1 5 10 15
Ser Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met
20 25 30
Ser Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val
35 40 45
Thr Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr
50 55 60
Thr Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala
65 70 75 80
Ser Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu
85 90 95
Ala Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr
100 105 110
Leu Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly
115 120 125
Ser Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly
130 135 140
Lys Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val
145 150 155 160
His Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu
165 170 175
Thr Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg
180 185 190
Ala Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu
195 200 205
Pro Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn
210 215 220
Asn Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu
225 230 235 240
Pro Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys
245 250 255
Glu Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val
260 265 270
Val Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly
275 280 285
Ala Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly
290 295 300
His Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val
305 310 315 320
Ser Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala
325 330 335
Glu Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr
340 345 350
Asp Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr
355 360 365
Leu Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu
370 375 380
Ser Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly
385 390 395 400
Asp Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His
405 410 415
Trp His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val
420 425 430
Arg Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe
435 440 445
Gly Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile
450 455 460

Claims (40)

1. A nucleic acid construct comprising:
(1) A polynucleotide encoding at least (i) a membrane precursor (prM) protein of ZIKV consisting of SEQ ID No. 20, and an envelope (E) protein of ZIKV consisting of SEQ ID No. 23, or (ii) a prM protein of ZIKV consisting of SEQ ID No. 20, and a truncated version of E protein of ZIKV consisting of SEQ ID No. 32, or (iii) a truncated version of E protein of ZIKV consisting of SEQ ID No. 29; and
(2) A cDNA molecule encoding the full-length infectious antigenomic (+) RNA strand of Measles Virus (MV);
wherein the polynucleotide encoding at least (i) a prM protein of ZIKV consisting of SEQ ID No. 20 and an E protein of ZIKV consisting of SEQ ID No. 23 or (ii) a prM protein of ZIKV consisting of SEQ ID No. 20 and a truncated version of an E protein of ZIKV consisting of SEQ ID No. 32 or (iii) a truncated version of an E protein of ZIKV consisting of SEQ ID No. 29 is operably linked to the cDNA molecule;
The nucleic acid construct comprises, from 5 'to 3', the following polynucleotides:
(a) A polynucleotide encoding an N protein of MV;
(b) A polynucleotide encoding a P protein of MV;
(c) The polynucleotide encoding at least (i) a prM protein of ZIKV consisting of SEQ ID No. 20 and an E protein of ZIKV consisting of SEQ ID No. 23 or (ii) a prM protein of ZIKV consisting of SEQ ID No. 20 and a truncated version of an E protein of ZIKV consisting of SEQ ID No. 32 or (iii) a truncated version of an E protein of ZIKV consisting of SEQ ID No. 29;
(d) A polynucleotide encoding an M protein of MV;
(e) A polynucleotide encoding an F protein of MV;
(f) A polynucleotide encoding an H protein of MV; and
(g) A polynucleotide encoding an L protein of MV;
wherein the polynucleotide is operably linked in the nucleic acid construct and under the control of viral replication and transcription regulatory sequences.
2. The nucleic acid construct according to claim 1, characterized in that the polynucleotide of (1) and the cDNA molecule of (2) together consist of a number of nucleotides which is a multiple of six.
3. The nucleic acid construct of claim 2, wherein the viral replication and transcription control sequences are MV lead sequences and tail sequences.
4. The nucleic acid construct of claim 1, wherein the measles virus is an attenuated strain selected from the group consisting of: schwarz strain, zagreb strain, AIK-C strain and Moraten strain.
5. The nucleic acid construct of claim 1, wherein the polynucleotide encoding at least (i) the prM protein of ZIKV consisting of SEQ ID NO:20 and the E protein of ZIKV consisting of SEQ ID NO:23 or (ii) the prM protein of ZIKV consisting of SEQ ID NO:20 and the truncated version of the E protein of ZIKV consisting of SEQ ID NO:32 or (iii) the truncated version of the E protein of ZIKV consisting of SEQ ID NO:29 has been optimized for macaque codon usage or for human codon usage.
6. The nucleic acid construct of claim 1, wherein measles edit-like sequence has been deleted from the polynucleotide encoding at least (i) a prM protein of ZIKV consisting of SEQ ID No. 20 and a prM protein of ZIKV consisting of SEQ ID No. 23 or (ii) a prM protein of ZIKV consisting of SEQ ID No. 20 and a truncated version of an E protein of ZIKV consisting of SEQ ID No. 32 or (iii) a truncated version of an E protein of ZIKV consisting of SEQ ID No. 29.
7. The nucleic acid construct of claim 1, wherein the ZIKV is from african lineages or from asian strains.
8. The nucleic acid construct of claim 7, wherein the african lineage is african strain ArB1362 as shown in GenBank: KF383115 or african isolate IbH _30656 as shown in GenBank: HQ 234500.
9. The nucleic acid construct of claim 7, wherein the asian strain is asian strain BeH818995 as shown in GenBank, KU 365777.
10. The nucleic acid construct of claim 1, wherein the polynucleotide encoding at least (i) a prM protein of ZIKV consisting of SEQ ID No. 20 and an E protein of ZIKV consisting of SEQ ID No. 23 or (ii) a prM protein of ZIKV consisting of SEQ ID No. 20 and a truncated version of an E protein of ZIKV consisting of SEQ ID No. 32 further encodes (iii) a signal peptide from a capsid protein of ZIKV and a signal peptide from a membrane protein of ZIKV, or
Wherein said polynucleotide encoding at least (iii) a truncated version of the E protein of ZIKV consisting of SEQ ID NO. 29 further encodes (iii) a signal peptide from the capsid protein of ZIKV or a signal peptide from the membrane protein of ZIKV.
11. The nucleic acid construct of claim 1, wherein the polynucleotide encoding the prM protein of ZIKV has a sequence consisting of SEQ ID No. 19, the polynucleotide encoding the E protein of ZIKV has a sequence consisting of SEQ ID No. 22, and the polynucleotide encoding a truncated version of the E protein of ZIKV has a sequence consisting of SEQ ID No. 28 or SEQ ID No. 31.
12. The nucleic acid construct of claim 1, wherein the nucleic acid construct comprises a sequence selected from the group consisting of: SEQ ID NO. 46, SEQ ID NO. 55, SEQ ID NO. 76, SEQ ID NO. 168 and SEQ ID NO. 170.
13. The nucleic acid construct of claim 12, wherein the nucleic acid construct comprises a sequence consisting of SEQ ID No. 46, SEQ ID No. 55 or SEQ ID No. 76.
14. The nucleic acid construct of claim 13, wherein the nucleic acid construct comprises a sequence consisting of SEQ ID No. 46.
15. The nucleic acid construct of claim 1, comprising the sequence from nucleotide 83 to nucleotide 18404 in the sequence consisting of SEQ ID No. 165, or the sequence from nucleotide 83 to nucleotide 18074 in the sequence consisting of SEQ ID No. 166, or the sequence from nucleotide 83 to nucleotide 17702 in the sequence consisting of SEQ ID No. 167.
16. A transfer vector plasmid comprising the nucleic acid construct of any one of claims 1-15.
17. The transfer vector plasmid of claim 16, the sequence of which consists of SEQ ID No. 165, SEQ ID No. 166 or SEQ ID No. 167.
18. The transfer vector plasmid of claim 17, the sequence of which consists of SEQ ID No. 165.
19. A transformed cell comprising the nucleic acid construct according to any one of claims 1-15 inserted in its genome or comprising the transfer vector plasmid according to any one of claims 16-18.
20. The transformed cell of claim 19, which is a eukaryotic cell.
21. The transformed cell of claim 20, which is an avian cell, a mammalian cell, or a yeast cell.
22. The transformed cell of claim 20, which is a CEF cell.
23. Recombinant infectious replication competent measles virus-ZIKV virus (MV-ZIKV) particles comprising as their genome the nucleic acid construct according to any of claims 1-15.
24. The recombinant infectious replicative MV-ZIKV particle of claim 23, which is rescued from a helper cell line expressing an RNA polymerase recognized by the cell line, a nucleoprotein of MV, a phosphoprotein of MV, and the helper cell line is further transfected with a transfer vector plasmid according to any of claims 16-18.
25. The recombinant infectious replicative MV-ZIKV particle of claim 24, wherein the RNA polymerase recognized by the cell line is a T7 RNA polymerase.
26. The recombinant infectious replicative MV-ZIKV particle of claim 24, wherein the helper cell line further expresses an RNA polymerase macroprotein of MV.
27. The recombinant infectious replicative MV-ZIKV particle of claim 23, wherein the particle comprises in its genome a polynucleotide sequence comprising a sequence selected from the group consisting of: SEQ ID NO. 46, SEQ ID NO. 55, SEQ ID NO. 76, SEQ ID NO. 168 and SEQ ID NO. 170.
28. The recombinant infectious replicative MV-ZIKV particle of claim 27, wherein the polynucleotide sequence comprises a sequence consisting of SEQ ID No. 46, SEQ ID No. 55, or SEQ ID No. 76.
29. The recombinant infectious replicative MV-ZIKV particle of claim 28, wherein the polynucleotide sequence comprises a sequence consisting of SEQ ID No. 46.
30. A composition or kit of active ingredients comprising the recombinant infectious replicative MV-ZIKV particle of any one of claims 23-29, a ZIKV-Virus Like Particle (VLP) expressing the same ZIKV protein as the MV-ZIKV particle in combination with the recombinant infectious replicative MV-ZIKV particle, and a pharmaceutically acceptable carrier.
31. Use of a composition or formulation of an active ingredient according to claim 30 in the manufacture of a vaccine for eliciting a protective immune response against ZIKV in a host by eliciting antibodies against the ZIKV protein and/or eliciting a cellular immune response.
32. The use of claim 31, wherein the host is a human host in need thereof.
33. Use of a recombinant infectious replicative MV-ZIKV particle according to any one of claims 23-29, or a composition or formulation of an active ingredient according to claim 30, in combination with a ZIKV-VLP expressing the same ZIKV protein, in the manufacture of a vaccine for preventing infection by ZIKV in a subject, or for preventing clinical consequences of infection by ZIKV in a subject.
34. The use of claim 33, wherein the subject is a human.
35. A method of rescuing recombinant infectious measles virus-ZIKV particles and ZIKV virus-like particles (VLPs), the recombinant infectious measles virus-ZIKV particles expressing at least (i) a prM protein of ZIKV consisting of SEQ ID NO:20 and a prM protein of ZIKV consisting of SEQ ID NO:23 or (ii) a prM protein of ZIKV consisting of SEQ ID NO:20 and a truncated version of an E protein of ZIKV consisting of SEQ ID NO:32 or (iii) a truncated version of an E protein of ZIKV consisting of SEQ ID NO:29, the ZIKV virus-like particles (VLPs) expressing the same ZIKV protein, the method comprising:
1) Co-transfecting a helper cell stably expressing T7 RNA polymerase and measles N and P proteins with (i) the transfer vector plasmid of any one of claims 16-18 and (ii) a vector encoding MV L polymerase;
2) Culturing the co-transfected helper cells under conditions capable of producing recombinant MV-ZIKV particles;
3) Propagating the recombinant MV-ZIKV particles thus produced by co-culturing the helper cells of step 2) with cells capable of said propagating;
4) Recovering recombinant infectious replicative MV-ZIKV particles and ZIKV VLPs, said recombinant infectious replicative MV-ZIKV particles expressing at least (i) a prM protein of ZIKV consisting of SEQ ID NO:20 and a prM protein of ZIKV consisting of SEQ ID NO:23 or (ii) a prM protein of ZIKV consisting of SEQ ID NO:20 and a truncated version of an E protein of ZIKV consisting of SEQ ID NO:32 or (iii) a truncated version of an E protein of ZIKV consisting of SEQ ID NO:29, said ZIKV VLPs expressing the same ZIKV protein.
36. The method of claim 35, wherein the cell capable of undergoing said proliferation is a Vero cell.
37. The method of claim 35, wherein the helper cell is a HEK293 helper cell.
38. The method of claim 35, wherein the vector is a plasmid.
39. The method of claim 35, wherein the transfer vector plasmid has a sequence consisting of SEQ ID No. 165, SEQ ID No. 166, or SEQ ID No. 167.
40. The method of claim 39, wherein the transfer vector plasmid has a sequence consisting of SEQ ID NO. 165.
CN201880047044.6A 2017-06-07 2018-06-06 Recombinant measles virus expressing Zika virus protein and application thereof Active CN110891600B (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP17305676.3 2017-06-07
EP17305676.3A EP3412307A1 (en) 2017-06-07 2017-06-07 Recombinant measles virus expressing zika virus proteins and their applications
PCT/EP2018/064943 WO2018224573A1 (en) 2017-06-07 2018-06-06 Recombinant measles virus expressing zika virus proteins and their applications

Publications (2)

Publication Number Publication Date
CN110891600A CN110891600A (en) 2020-03-17
CN110891600B true CN110891600B (en) 2024-04-02

Family

ID=59276634

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201880047044.6A Active CN110891600B (en) 2017-06-07 2018-06-06 Recombinant measles virus expressing Zika virus protein and application thereof

Country Status (9)

Country Link
US (1) US11857616B2 (en)
EP (2) EP3412307A1 (en)
JP (2) JP2020524496A (en)
CN (1) CN110891600B (en)
AU (1) AU2018281889B2 (en)
BR (1) BR112019025310A2 (en)
CA (1) CA3064322A1 (en)
MX (1) MX2019014674A (en)
WO (1) WO2018224573A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3031923A1 (en) * 2014-12-11 2016-06-15 Institut Pasteur Lentiviral vector-based japanese encephalitis immunogenic composition
KR101966841B1 (en) * 2018-12-12 2019-04-08 대한민국 Recombinant antigen derived from zika virus e protein and use thereof
CN114072168A (en) * 2019-04-10 2022-02-18 勒芬天主教大学 Chimeric Zika-Japanese encephalitis virus
CN113999823B (en) * 2021-12-30 2022-04-26 北京赛尔富森生物科技有限公司 D8 gene type chimeric measles virus attenuated strain and its preparation method and use
WO2023164441A1 (en) * 2022-02-22 2023-08-31 Vyro Bio Inc. Nucleic acid compositions for delivering exogenous polynucleotides and methods of use

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104918952A (en) * 2012-09-27 2015-09-16 巴斯德研究院 Recombinant measles virus expressing chikungunya virus polypeptides and their applications
WO2016199936A1 (en) * 2015-06-12 2016-12-15 国立大学法人三重大学 Human parainfluenza type 2 virus vector and vaccine
WO2016210127A1 (en) * 2015-06-25 2016-12-29 Technovax, Inc. Flavivirus and alphavirus virus-like particles (vlps)

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1375512B1 (en) 2002-06-20 2009-07-22 Institut Pasteur Infectious cDNA of an approved vaccine strain of measles virus. Use for immunogenic compositions
DK1939214T3 (en) 2006-12-22 2013-10-14 Pasteur Institut Cells and methodology for generating non-segmented negative-stranded RNA viruses
EP3184119A1 (en) * 2015-12-23 2017-06-28 Themis Bioscience GmbH Chromatography based purification strategies for measles scaffold based viruses

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104918952A (en) * 2012-09-27 2015-09-16 巴斯德研究院 Recombinant measles virus expressing chikungunya virus polypeptides and their applications
WO2016199936A1 (en) * 2015-06-12 2016-12-15 国立大学法人三重大学 Human parainfluenza type 2 virus vector and vaccine
WO2016210127A1 (en) * 2015-06-25 2016-12-29 Technovax, Inc. Flavivirus and alphavirus virus-like particles (vlps)

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Role of a ZIKV CHIM in vaccine evaluation;Anna Durbin;《Johns Hopkins Bloomberg School of Public Health WHO Zika Workshop》;20170602;第22页表格 *
寨卡病毒研究进展;谌章舟等;《中国病毒病杂志》;20160920(第05期);第394页第1栏倒数第1段 *

Also Published As

Publication number Publication date
MX2019014674A (en) 2020-09-07
JP2020524496A (en) 2020-08-20
CA3064322A1 (en) 2018-12-13
EP3634482A1 (en) 2020-04-15
AU2018281889A1 (en) 2019-12-19
CN110891600A (en) 2020-03-17
JP2023107780A (en) 2023-08-03
AU2018281889B2 (en) 2022-03-03
BR112019025310A2 (en) 2020-06-23
WO2018224573A8 (en) 2019-07-25
WO2018224573A1 (en) 2018-12-13
US11857616B2 (en) 2024-01-02
EP3412307A1 (en) 2018-12-12
US20200237893A1 (en) 2020-07-30

Similar Documents

Publication Publication Date Title
CN110891600B (en) Recombinant measles virus expressing Zika virus protein and application thereof
AU2019203955C1 (en) Multipartite signaling proteins and uses thereof
JP2022130714A (en) Methods for assessing presence or absence of replication competent virus
EP0712442A1 (en) Vaccine compositions
US20040101514A1 (en) High transgene expression of a pseudotyped adeno-associated virus type
CN111918972A (en) Methods and reagents for assessing the presence or absence of replication competent viruses
US6488926B1 (en) Vaccine compositions
KR20080094910A (en) Novel protein expression system
KR20220007155A (en) Modified S1 subunit of coronavirus spike protein
CN111440801A (en) sgRNA for targeted knockout of human NKG2A/K L RC1 gene, expression vector, kit and application thereof
KR20230010231A (en) Vectors and methods for in vivo transduction
CN114164114B (en) Toxoplasma ribulose-5-phosphate isomerase TgRPI gene editing insect strain and application thereof
CN113046255B (en) Large-scale gene rearranged saccharomycete and construction method thereof
KR20200104343A (en) Lhasa vaccine
KR102093495B1 (en) Chimeric vaccine antigens against hepatitis c virus
CN110225765B (en) Attenuated swine influenza vaccines and methods of making and using the same
KR20150100606A (en) Arterivirus protein and expression mechanisms
CN117083389A (en) Plant chloroplast cytosine base editor and mitochondrial cytosine base editor
CN117222664A (en) Vaccine composition for disruption of self tolerance
CN101516199A (en) Targeted gene delivery for dendritic cell vaccination
CN113025718A (en) Application of regulating EIF4A3 expression to regulating liver cancer cell proliferation capacity
CN117500931A (en) measles-HIV or measles-HTLV vaccine
DK2921048T3 (en) SUS SCROFA V2G: SAFE HARBOR PLACE FOR LONG-TERM EXPRESSION AND HIGH INTEGRATION OF TRANSGENERS IN A PIG
CN114774469B (en) Method for preparing NK (Natural killer) cells with enhanced ADCC (advanced cellular cytotoxicity) function, NK cells and composition thereof
KR100927098B1 (en) Recombinant vectors for switching of epitope tags

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 40025052

Country of ref document: HK

GR01 Patent grant
GR01 Patent grant