US20230142867A1 - Aav native-neuro platform and use for neuronal disease gene therapy - Google Patents

Aav native-neuro platform and use for neuronal disease gene therapy Download PDF

Info

Publication number
US20230142867A1
US20230142867A1 US17/920,483 US202117920483A US2023142867A1 US 20230142867 A1 US20230142867 A1 US 20230142867A1 US 202117920483 A US202117920483 A US 202117920483A US 2023142867 A1 US2023142867 A1 US 2023142867A1
Authority
US
United States
Prior art keywords
disease
neuronal
raav vector
raav
goi
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
US17/920,483
Inventor
Senthil Ramu
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Aav Gene Therapeutics Inc
Original Assignee
Aav Gene Therapeutics Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Aav Gene Therapeutics Inc filed Critical Aav Gene Therapeutics Inc
Priority claimed from PCT/US2021/028803 external-priority patent/WO2021216975A1/en
Publication of US20230142867A1 publication Critical patent/US20230142867A1/en
Assigned to AAV GENE THERAPEUTICS, INC. reassignment AAV GENE THERAPEUTICS, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: RAMU, SENTHIL
Pending legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • C12N15/86Viral vectors
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K48/00Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
    • A61K48/005Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'active' part of the composition delivered, i.e. the nucleic acid delivered
    • A61K48/0066Manipulation of the nucleic acid to modify its expression pattern, e.g. enhance its duration of expression, achieved by the presence of particular introns in the delivered nucleic acid
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P25/00Drugs for disorders of the nervous system
    • HELECTRICITY
    • H05ELECTRIC TECHNIQUES NOT OTHERWISE PROVIDED FOR
    • H05KPRINTED CIRCUITS; CASINGS OR CONSTRUCTIONAL DETAILS OF ELECTRIC APPARATUS; MANUFACTURE OF ASSEMBLAGES OF ELECTRICAL COMPONENTS
    • H05K1/00Printed circuits
    • H05K1/02Details
    • H05K1/0296Conductive pattern lay-out details not covered by sub groups H05K1/02 - H05K1/0295
    • HELECTRICITY
    • H05ELECTRIC TECHNIQUES NOT OTHERWISE PROVIDED FOR
    • H05KPRINTED CIRCUITS; CASINGS OR CONSTRUCTIONAL DETAILS OF ELECTRIC APPARATUS; MANUFACTURE OF ASSEMBLAGES OF ELECTRICAL COMPONENTS
    • H05K1/00Printed circuits
    • H05K1/02Details
    • H05K1/03Use of materials for the substrate
    • HELECTRICITY
    • H05ELECTRIC TECHNIQUES NOT OTHERWISE PROVIDED FOR
    • H05KPRINTED CIRCUITS; CASINGS OR CONSTRUCTIONAL DETAILS OF ELECTRIC APPARATUS; MANUFACTURE OF ASSEMBLAGES OF ELECTRICAL COMPONENTS
    • H05K1/00Printed circuits
    • H05K1/02Details
    • H05K1/11Printed elements for providing electric connections to or between printed circuits
    • HELECTRICITY
    • H05ELECTRIC TECHNIQUES NOT OTHERWISE PROVIDED FOR
    • H05KPRINTED CIRCUITS; CASINGS OR CONSTRUCTIONAL DETAILS OF ELECTRIC APPARATUS; MANUFACTURE OF ASSEMBLAGES OF ELECTRICAL COMPONENTS
    • H05K1/00Printed circuits
    • H05K1/02Details
    • H05K1/11Printed elements for providing electric connections to or between printed circuits
    • H05K1/111Pads for surface mounting, e.g. lay-out
    • HELECTRICITY
    • H05ELECTRIC TECHNIQUES NOT OTHERWISE PROVIDED FOR
    • H05KPRINTED CIRCUITS; CASINGS OR CONSTRUCTIONAL DETAILS OF ELECTRIC APPARATUS; MANUFACTURE OF ASSEMBLAGES OF ELECTRICAL COMPONENTS
    • H05K3/00Apparatus or processes for manufacturing printed circuits
    • H05K3/22Secondary treatment of printed circuits
    • H05K3/28Applying non-metallic protective coatings
    • HELECTRICITY
    • H05ELECTRIC TECHNIQUES NOT OTHERWISE PROVIDED FOR
    • H05KPRINTED CIRCUITS; CASINGS OR CONSTRUCTIONAL DETAILS OF ELECTRIC APPARATUS; MANUFACTURE OF ASSEMBLAGES OF ELECTRICAL COMPONENTS
    • H05K3/00Apparatus or processes for manufacturing printed circuits
    • H05K3/30Assembling printed circuits with electric components, e.g. with resistor
    • H05K3/32Assembling printed circuits with electric components, e.g. with resistor electrically connecting electric components or wires to printed circuits
    • H05K3/34Assembling printed circuits with electric components, e.g. with resistor electrically connecting electric components or wires to printed circuits by soldering
    • HELECTRICITY
    • H05ELECTRIC TECHNIQUES NOT OTHERWISE PROVIDED FOR
    • H05KPRINTED CIRCUITS; CASINGS OR CONSTRUCTIONAL DETAILS OF ELECTRIC APPARATUS; MANUFACTURE OF ASSEMBLAGES OF ELECTRICAL COMPONENTS
    • H05K3/00Apparatus or processes for manufacturing printed circuits
    • H05K3/30Assembling printed circuits with electric components, e.g. with resistor
    • H05K3/32Assembling printed circuits with electric components, e.g. with resistor electrically connecting electric components or wires to printed circuits
    • H05K3/34Assembling printed circuits with electric components, e.g. with resistor electrically connecting electric components or wires to printed circuits by soldering
    • H05K3/3452Solder masks
    • HELECTRICITY
    • H05ELECTRIC TECHNIQUES NOT OTHERWISE PROVIDED FOR
    • H05KPRINTED CIRCUITS; CASINGS OR CONSTRUCTIONAL DETAILS OF ELECTRIC APPARATUS; MANUFACTURE OF ASSEMBLAGES OF ELECTRICAL COMPONENTS
    • H05K3/00Apparatus or processes for manufacturing printed circuits
    • H05K3/46Manufacturing multilayer circuits
    • HELECTRICITY
    • H05ELECTRIC TECHNIQUES NOT OTHERWISE PROVIDED FOR
    • H05KPRINTED CIRCUITS; CASINGS OR CONSTRUCTIONAL DETAILS OF ELECTRIC APPARATUS; MANUFACTURE OF ASSEMBLAGES OF ELECTRICAL COMPONENTS
    • H05K3/00Apparatus or processes for manufacturing printed circuits
    • H05K3/46Manufacturing multilayer circuits
    • H05K3/4644Manufacturing multilayer circuits by building the multilayer layer by layer, i.e. build-up multilayer circuits
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K48/00Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
    • A61K48/005Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'active' part of the composition delivered, i.e. the nucleic acid delivered
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K48/00Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
    • A61K48/005Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'active' part of the composition delivered, i.e. the nucleic acid delivered
    • A61K48/0058Nucleic acids adapted for tissue specific expression, e.g. having tissue specific promoters as part of a contruct
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2750/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
    • C12N2750/00011Details
    • C12N2750/14011Parvoviridae
    • C12N2750/14111Dependovirus, e.g. adenoassociated viruses
    • C12N2750/14133Use of viral protein as therapeutic agent other than vaccine, e.g. apoptosis inducing or anti-inflammatory
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2750/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
    • C12N2750/00011Details
    • C12N2750/14011Parvoviridae
    • C12N2750/14111Dependovirus, e.g. adenoassociated viruses
    • C12N2750/14141Use of virus, viral particle or viral elements as a vector
    • C12N2750/14143Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2750/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
    • C12N2750/00011Details
    • C12N2750/14011Parvoviridae
    • C12N2750/14111Dependovirus, e.g. adenoassociated viruses
    • C12N2750/14141Use of virus, viral particle or viral elements as a vector
    • C12N2750/14145Special targeting system for viral vectors
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2830/00Vector systems having a special element relevant for transcription
    • C12N2830/008Vector systems having a special element relevant for transcription cell type or tissue specific enhancer/promoter combination
    • HELECTRICITY
    • H05ELECTRIC TECHNIQUES NOT OTHERWISE PROVIDED FOR
    • H05KPRINTED CIRCUITS; CASINGS OR CONSTRUCTIONAL DETAILS OF ELECTRIC APPARATUS; MANUFACTURE OF ASSEMBLAGES OF ELECTRICAL COMPONENTS
    • H05K2201/00Indexing scheme relating to printed circuits covered by H05K1/00
    • H05K2201/20Details of printed circuits not provided for in H05K2201/01 - H05K2201/10
    • H05K2201/2081Compound repelling a metal, e.g. solder

Definitions

  • AAV Adeno Associated Virus
  • Efficient and specific delivery to a target organ, such as the brain, is a rate limiting step in many protein-based therapeutics. There are multiple barriers to efficient direct protein delivery to the CNS.
  • BBB blood-brain barrier
  • TJs intercellular tight junctions
  • gene transfer vehicles attempting entry into the CNS are forced to go through the transcellular route.
  • specialized lumen facing efflux transporters which include both the ABC family of transporters and multidrug resistance-related proteins, contribute to the formation of the “transport barrier” that prevents xenobiotic substances from gaining access to the neuronal environment.
  • trancytotic/endosomal sorting Another obstacle that partially prevents endothelial crossing of gene-therapy vehicles is the trancytotic/endosomal sorting, which can lead to degradation.
  • endothelium is the greatest obstacle in restricting access to the CNS, the presence of a complex extracellular space, molecular composition and geometry also significantly impede diffusion of viral-based gene delivery systems. That is, even if a therapy can be designed to target one of these receptors and traverse the BMVEC cytoplasm to be released at the basal cell membrane, only the first of many guardian mechanisms at the BBB has been overcome.
  • the CAG/CBA/CB promoter is generally viewed as a ubiquitous promoter as it works in numerous tissues and cell types. However, the assumption that it is highly active in every cell type may not be entirely warranted in the CNS. Studies have been conducted using AAV vectors with the same capsid but carrying different CNS cell specific promoters or the CBA promoter. The apparent discrepancy in CNS transduction profiles of these different AAV vectors suggests that the CBA promoter may not mediate detectable transgene expression in all cell types in the brain.
  • AAV vectors with numerous tropism carrying the cytomegalovirus (CMV), CBA, Rous sarcoma virus (RSV), or human GUSB promoters to drive transgene expression, in the context of intra-parenchymal delivery to different brain structures.
  • CMV cytomegalovirus
  • RSV Rous sarcoma virus
  • human GUSB promoters to drive transgene expression, in the context of intra-parenchymal delivery to different brain structures.
  • AAV vectors for treating neurological diseases have used the CMV immediate early promoter followed by a chimeric intron composed of a CMV splice donor and a human globin splice acceptor, or a version of the CAG/CBA promoter (supra).
  • AAV vectors carrying mammalian promoters such as the mouse phosphoglycerate kinase (PGK) promoter, or the rat neuron-specific enolase (NSE) promoter.
  • PGK mouse phosphoglycerate kinase
  • NSE rat neuron-specific enolase
  • AAV2 vector encoding aromatic amino acid decarboxylase AADC
  • AAV-CMV or AAV-CAG/CBA vectors are their ability to drive strong gene expression in many neurons in the CNS of multiple species.
  • the disparate CNS transduction results between cellular promoters and strong non-mammalian promoters raises the possibility that the current generation of AAV vectors express transgenes at exceptionally high levels in a small percentage of all cells that harbor transcriptionally ready vector genomes in the nucleus.
  • TPP-I human tripeptidyl peptidase
  • the potential for toxicity due to excessive transgene expression is particularly acute in neurological diseases where the expressed proteins are not secreted, and thus the need for high level expression is even less.
  • the development of an AAV gene therapy approach for Rett syndrome represents a significant challenge to the prevalent approach of simply using strong promoters. This disease is caused by mutations in the MeCP2 gene on the X-chromosome. Interestingly, duplication of this gene also leads to neurodevelopmental delays, suggesting that expression of this gene in the CNS need to be tightly controlled, which is a challenge to current AAV gene therapy.
  • the invention described herein provides a gene replacement therapy platform to replace mutated genes by AAV Gene Therapy.
  • exemplary central nervous system (CNS) disease drug protypes are packaged in recombinant AAV vectors for intravenous administration to a patient, in order to express functional genes in native neuronal cells.
  • the technology is designed for intravenous dosing of AAV to express genes precisely in intended cell types by using endogenous/native promoter fused with viral/heterologous enhancer.
  • the gene cassette in the AAV vector of the invention begins with AAV inverted terminal repeat (ITR), followed by proprietary heterologous enhancer, followed by a native promoter of a gene of interest (GOI), 5′UTR, proprietary codon optimized gene of interest (GOI), 3′UTR, and ends in another AAV inverted terminal repeat (ITR).
  • ITR AAV inverted terminal repeat
  • GOI gene of interest
  • 5′UTR native promoter of a gene of interest
  • GOI proprietary codon optimized gene of interest
  • 3′UTR proprietary codon optimized gene of interest
  • a recombinant adeno-associated viral (rAAV) vector comprising: (1) a polynucleotide comprising a gene of interest (GOI), under the transcriptional control of the native/endogenous promoter of said GOI; (2) a heterologous enhancer that enhances transcription of the GOI from the native/endogenous promoter; and, (3) a pair of inverted terminal repeat (ITR) flanking the polynucleotide.
  • GOI gene of interest
  • ITR inverted terminal repeat
  • the GOI is a neuronal gene, or is a gene expressed in neuronal tissues (CNS and/or PNS).
  • the GOI is defective (e.g., loss-of-function) in a neuronal disease or condition.
  • the neuronal disease or condition is a CNS disease or condition.
  • the CNS disease or condition is: Ceroid-Lipofuscinosis, Fucosidosis, Giant Axonal Neuropathy, Globoid cell leukodystrophy, Mucolipidiosis Type IV, Neuronal Ceroid Lipofusinoses, Niemann-Pick Disease, Sandhoff Disease, Sanfilippo syndrome, Tay-Sachs Disease, Hunter Syndrome, Sialidosis, Multiple Sulfatase Deficiency, Childhood Epilepsy, CMT Peripheral Neuropathy, Neuronal Ceroid Lipofusinoses, Aspartylglycosaminuria, Mucopolysaccharidoses, Parkinsons Disease/Symptoms, GM1-Gangliosidosis, Charcot-Marie-Tooth Type 1A, Retts Syndrome, Dannon Disease, Mucopolysaccharidoses (due to GNS, HGSNAT, NAGLU, or ARSB), Sly Syndrome, Biotin basal ganglia disease, Pelizaeus-M
  • the GOI is: CLN3, FUCA1, GAN, GALC, MCOLN1, PPT1, SMPD1, HEXB, SGSH, HEXA, IDS, NEU1, SUMF1, GAT1/SLCA1, FIG. 4 , CLN5, AGA, IDUA, GDNF, GLB1, PMP22/MFN2, MECP2, LAMP2, NAGLU, ARSB, GUSB, SLC19A3, PLP1, TPP1/CLN2, ACY2/ASPA, MANBA, LIPA, CTNS, GNS, HGSNAT, NEUROD1, SLC17A5, CLN6, CLN8, or GM2A.
  • the rAAV has a tropism for neuronal cells, or for a cell in neuronal tissue.
  • the polynucleotide further comprises a 5′-UTR coding region, and a 3′-UTR coding region.
  • the heterologous enhancer is a viral enhancer.
  • the viral enhancer is CMV enhancer.
  • the viral enhancer is SV-40 enhancer.
  • Another aspect of the invention provides a pharmaceutical composition comprising any one of the rAAV vector of the invention.
  • the pharmaceutical composition is formulated for intravenous administration (e.g., to the blood) or intrathecal administration (e.g., to spinal fluid).
  • Another aspect of the invention provides a cell infected with any one of the rAAV vector of the invention.
  • Another aspect of the invention provides a recombinant AAV virus comprising any one of the rAAV vector of the invention, wherein the serotype or pseudotype of the recombinant AAV virus is for preferential infection of a neuronal cell or a cell in neuronal tissue.
  • Another aspect of the invention provides a method of treating a (genetic) disease or disorder in a subject, the method comprising administering the recombinant AAV virus of the invention into the subject.
  • the genetic disease or disorder is a neuronal disease or disorder, and/or wherein the recombinant AAV virus preferentially infects a neuronal cell or a cell in neuronal tissue of the subject, thereby treating the neuronal disease or disorder.
  • the rAAV virus is administered to the subject intravenously or intrathecally.
  • Another aspect of the invention provides a method of producing any one of the rAAV vector of the invention, the method comprising introducing any one of the rAAV vector of the invention into a packaging cell line that constitutively or inducibly provides rep/cap proteins in trans.
  • the rep/cap proteins comprise one or more AAV9 capsid proteins.
  • FIG. 1 is a schematic drawing (not to scale) showing a generic structure of the subject rAAV vector, as well as an intended mode of action for the vector.
  • the Gene of Interest (GOI) is put under the control of its native promoter. Transcription from the native promoter is further enhanced by the presence of a powerful enhancer, such as a viral enhancer like CMV enhancer, to facilitate the expression of the GOI in the native cells such GOI is normally expressed.
  • the entire recombinant sequence is flanked by two AAV ITR sequences, such as AAV9 ITR sequences for the preferred neuronal cell tropism.
  • FIG. 2 is a gel image confirming the construction of rAAV viral vectors comprising a number of representative GOI's that are defective in neuronal diseases/disorders. Also shown is a rAAV vector encoding a GFP reporter gene, which can be switched out by any of the GOI's.
  • FIG. 3 is a gel image confirming the construction of additional rAAV viral vectors comprising a number of representative GOI' s that are defective in neuronal diseases/disorders.
  • the invention described herein provides methods and compositions for using recombinant viral vectors, such as AAV-based viral vectors or lentiviral vectors to express any gene-of-interest (GOI) under the control of its native promoter so that the GOI is preferentially and exclusively expressed in the cells the GOI is intended to be expressed.
  • the GOI is further under the transcriptional control of a powerful heterologous enhancer, such as a viral enhancer like CMV enhancer, which further boosts the transcription and expression of the GOI. Therefore, after a single intravenous treatment, the protein product of interest is expressed at high levels only in the endogenous/native cell types, thereby improving therapeutic safety and efficacy.
  • the invention described herein provides a recombinant adeno-associated viral (rAAV) vector, comprising: (1) a polynucleotide comprising a gene of interest (GOI), under the control of the native/endogenous promoter of said GOI; (2) a heterologous enhancer that enhances transcription of the GOI from the native/endogenous promoter; and, (3) a pair of inverted terminal repeat (ITR) flanking the polynucleotide.
  • a adeno-associated viral vector comprising: (1) a polynucleotide comprising a gene of interest (GOI), under the control of the native/endogenous promoter of said GOI; (2) a heterologous enhancer that enhances transcription of the GOI from the native/endogenous promoter; and, (3) a pair of inverted terminal repeat (ITR) flanking the polynucleotide.
  • a adeno-associated viral vector comprising: (1) a polynucleotide
  • the GOI is a neuronal gene.
  • the neuronal gene is defective (e.g., loss-of-function) in a neuronal disease or condition.
  • the neuronal disease or condition is a CNS disease or condition.
  • the CNS disease or condition is: Ceroid-Lipofuscinosis, Fucosidosis, Giant Axonal Neuropathy, Globoid cell leukodystrophy, Mucolipidiosis Type IV, Neuronal Ceroid Lipofusinoses, Niemann-Pick Disease, Sandhoff Disease, Sanfilippo syndrome, Tay-Sachs Disease, Hunter Syndrome, Sialidosis, Multiple Sulfatase Deficiency, Childhood Epilepsy, CMT Peripheral Neuropathy, Neuronal Ceroid Lipofusinoses, Aspartylglycosaminuria, Mucopolysaccharidoses, Parkinsons Disease/Symptoms, GM1-Gangliosidosis, Charcot-Marie-Tooth Type 1A, Retts Syndrome, Dannon Disease, Mucopolysaccharidoses (due to GNS, HGSNAT, NAGLU, or ARSB), Sly Syndrome, Biotin basal ganglia disease, Pelizaeus-M
  • the GOI is: CLN3, FUCA1, GAN, GALC, MCOLN1, PPT1, SMPD1, HEXB, SGSH, HEXA, IDS, NEU1, SUMF1, GAT1/SLCA1, FIG. 4 , CLN5, AGA, IDUA, GDNF, GLB1, PMP22/MFN2, MECP2, LAMP2, NAGLU, ARSB, GUSB, SLC19A3, PLP1, TPP1/CLN2, ACY2/ASPA, MANBA, LIPA, CTNS, GNS, HGSNAT, NEUROD1, SLC17A5, CLN6, CLN8, or GM2A.
  • the rAAV has a tropism for neuronal cells.
  • the rAAV serotype is, AAV2.5, rAAVrh.39, rAAVrh.43, rAAV7, rAAVrh.8 or rAAVrh.10.
  • the rAAV serotype is AAV9. In certain embodiments, the rAAV has a pseudotype of xxx. In certain embodiments, the polynucleotide further comprises a 5′-UTR coding region, and a 3′-UTR coding region.
  • the heterologous enhancer is a viral enhancer.
  • the viral enhancer is CMV enhancer.
  • the viral enhancer is SV40 enhancer.
  • Another aspect of the invention provides a pharmaceutical composition comprising the rAAV vector of the invention.
  • the pharmaceutical composition is formulated for intravenous administration.
  • Another aspect of the invention provides a cell infected with the rAAV vector of the invention.
  • Another aspect of the invention provides a recombinant AAV virus comprising the rAAV vector of the invention, wherein the serotype or pseudotype of the recombinant AAV virus is for preferential infection of a neuronal cell.
  • Another aspect of the invention provides a method of treating a (genetic) disease or disorder in a subject, the method comprising introducing the recombinant AAV virus of the invention into the subject.
  • the genetic disease or disorder is a neuronal disease or disorder
  • the recombinant AAV virus preferentially infects a neuronal cell of the subject, and treats the neuronal disease or disorder.
  • Another aspect of the invention provides a method of producing the rAAV vector of the invention, comprising introducing the rAAV vector of the invention into a packaging cell line that constitutively or inducibly provides rep/cap proteins in trans.
  • the recombinant AAV vector and recombinant AAV virus of the invention, together with the pharmaceutical composition comprising the same, can be used to treat a number of diseases and conditions, particularly neuronal diseases or indications treatable by gene therapy.
  • Diseases or conditions treatable by the AAV vector/virus/method of the invention are typically characterized by lacking a functional neuronal protein or enzyme, which neuronal diseases or indications can be alleviated (if not cured), treated, or prevented (e.g., at least the onset is delayed), by providing functional versions of the proteins or enzymes.
  • the externally provided functional versions of the proteins or enzymes may be identical in sequence to the wild-type proteins or enzymes that are lacking in a host having such diseases or conditions, or may be a variant of the wild-type proteins or enzymes.
  • the neuronal diseases or conditions can be treated by introducing into the host the subject AAV virus that will infect target host (neuronal) tissues, and stably express the encoded GOI that is defective in the diseases or conditions throughout the life span of the host.
  • the subject AAV virus may only need to be introduced into the host once, or can be introduced into the host multiple times as needed.
  • polynucleotide sequences of the invention encoding several genes-of-interest (GOI) are listed below. These polynucleotide sequences can be part of the subject AAV vector sequences. These sequences are for illustrative purpose only and are non-limiting.
  • GAN Giant Axonal Neuropathy
  • Giant axonal neuropathy is a severe, slowly progressive neurodegenerative disorder characterized by progressive motor and sensory peripheral neuropathy, central nervous system involvement (including pyramidal and cerebellar signs), and characteristic kinky hair in most cases.
  • GAN is inherited as an autosomal recessive trait, and is associated with more than 50 different causative mutations in GAN gene (16q24.1), encoding the ubiquitously expressed cytoskeletal protein gigaxonin.
  • the GOI is GAN that encodes gigaxonin
  • the disease or condition treatable by the subject AAV vector/virus is Giant Axonal Neuropathy (GAN).
  • GAN Giant Axonal Neuropathy
  • the composition and methods of the invention can be used to produce gigaxonin expressed permanently from AAV vectors stably maintained with the host's genome as extra chromosomal material, in native neuronal cells that normally express GAN, thus effectively treating giant axonal neuropathy.
  • a representative polynucleotide of the invention that can be encoded by the AAV vector of the invention is described below.
  • double underlined sequences represent proprietary viral enhancer sequences (SEQ ID NO: 1, see below); bold sequences are native promoter sequences for GAN; italic sequences are coding sequence for the GOI (GAN in this case); the subscripted sequences immediately 5′ and 3′ to the GOI coding sequence are 5′-UTR and 3′-UTR sequences, respectively.
  • the proprietary enhancer sequences (SEQ ID NO: 1) is listed below.
  • the GAN protein sequence is:
  • Conservative changes to the protein sequence above e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 conservative residue changes, or up to 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, or 10% conservative residue changes
  • a functional equivalent version of the protein e.g., that retains at least 70%, 75%, 80%, 85%, 90%, 95%, 97%, 99% of the wild-type enzyme activity
  • nucleotide sequence encoding the protein and variants thereof such as in SEQ ID NO: 2
  • nucleotide sequence changes that do not result in sequence change of the encoded protein in SEQ ID NO: 3 due to, for example, codon degeneracy are within the scope of the invention.
  • codon optimization of any of the polynucleotide sequence for optimal expression in a specific type of host cell is also within the scope of the invention (see further details below).
  • Neuronal ceroid lipofuscinosis refers to a family of at least eight genetically separate neurodegenerative lysosomal storage diseases that result from excessive accumulation of lipopigments (lipofuscin) made up of fats and proteins in the body's tissues.
  • lipopigments lipopigments
  • the most commonly identified mutations are in the CLN3 gene, located on the short arm of chromosome 16 (16p12.1), found in childhood or juvenile NCLs (JNCLs).
  • JNCL is an autosomal recessive disorder.
  • the normal function of CLN3 gene is not presently known, but it is known to be a transmembrane protein.
  • Studies of the yeast CLN3 ortholog which encodes Battenin have suggested that the protein may play a role in lysosomal pH homeostasis.
  • recent studies have also implied the protein's role in cathepsin D deficiency; the overexpression of the defective protein appears to have significant effects on cathepsin D processing, with implications suggesting that accumulation of ATP synthase subunit C would result.
  • the GOI is CLN3, and the disease or condition treatable by the subject AAV vector/virus is JNCL.
  • the composition and methods of the invention can be used to produce CLN3 gene product expressed permanently from AAV vectors stably maintained with the host's genome as extra chromosomal material, in native neuronal cells that normally express CLN3, thus effectively treating JNCL.
  • a representative polynucleotide of the invention that can be encoded by the AAV vector of the invention is described below.
  • the proprietary enhancer sequence is in SEQ ID NO: 1.
  • Native CLN3 promoter is known in the art.
  • the coding sequence for CLN3 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • the CLN3 protein sequence is:
  • nucleotide sequence encoding the protein and variants thereof such as in SEQ ID NO: 4
  • nucleotide sequence changes that do not result in sequence change of the encoded protein in SEQ ID NO: 5 due to, for example, codon degeneracy are within the scope of the invention.
  • codon optimization of any of the polynucleotide sequence for optimal expression in a specific type of host cell is also within the scope of the invention.
  • a representative (non-limiting) list of the treatable diseases and disorders, along with the corresponding GOI's, are provides in the table below.
  • the FUCA1 protein sequence is:
  • the GALC protein sequence is:
  • the MCOLN1 protein sequence is:
  • the PPT1 protein sequence is:
  • the SMPD1 protein sequence is:
  • the HEXB protein sequence is:
  • the SGSH protein sequence is:
  • the HEXA protein sequence is:
  • the proprietary viral enhancer sequences, native promoter sequences, coding sequence for HEXA and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • the IDS protein sequence is:
  • the NEU1 protein sequence is:
  • the SUMF1 protein sequence is:
  • the GAT1 protein sequence is:
  • the FIG. 4 protein sequence is:
  • FIG. 4 DNA Sequence
  • the CLN5 protein sequence is:
  • the AGA protein sequence is:
  • the IDUA protein sequence is:
  • the GDNF protein sequence is:
  • the GLB 1 protein sequence is:
  • the PMP22/MFN2 protein sequence is:
  • the MECP2 protein sequence is:
  • the LAMP2 protein sequence is:
  • the NAGLU protein sequence is:
  • the ARSB protein sequence is:
  • the GUSB protein sequence is:
  • the SLC19A3 protein sequence is:
  • the PLP1 protein sequence is:
  • the TPP1/CLN2 protein sequence is:
  • the ACY2/ASPA protein sequence is:
  • the MANBA protein sequence is:
  • the LIPA protein sequence is:
  • CTNS protein sequence is:
  • the GNS protein sequence is:
  • the HGSNAT protein sequence is:
  • the NeuroD1 protein sequence is:
  • the SLC17A5 protein sequence is:
  • the CLN6 protein sequence is:
  • the CLN8 protein sequence is:
  • the GM2A protein sequence is:
  • the HYAL1 protein sequence is:
  • any of the GOI described herein are codon optimized for manufacturing, optimal expression in a host cell, such as in a human cell. Further Codon optimization of GOI can be done based on codon optimization table publically available, such as the Codon and Codon Pair Usage Tables (CoCoPUTs) maintained at the George Washington University (GWU).
  • the CoCoPUTs is derived from all available GenBank and RefSeq data source.
  • the GOI's are codon optimized for expression in neuronal tissues or specific cell types in neuronal tissues, such as neurons, depending on the target cell type the GOI's are to be expressed.
  • TissueCoCoPUTs which are human tissue-specific codon and codon pair usage tables derived from human genomic codon usage information and tissue-specific transcriptomic data.
  • the TissueCoCoPUTs maintained by the GWU website represent 52 human tissues.
  • Transcriptomic data are derived from the Genotype Tissue Expression (GTEx) Portal.
  • the viral vectors (such as AAV) of the invention are delivered to neuronal target cells by intravenous injection or direct intraparenchymal injection or direct intracranial injection. Such delivery means effectively bypass the BBB, and are common means for the administration of gene therapeutics to the brain.
  • the neuronal target cells intended to receive gene therapy through intraparenchymal injection are clustered in a focal location, and are this particularly suitable for focal delivery of the viral vector.
  • the viral vector (such as AAV) of the invention is delivered locally or compartmentally, such as direct intraocular injection, or injection in the cochlea.
  • the viral vectors of the invention are delivered to neuronal target cells by intravenous injection.
  • the neuronal target cells intended to receive gene therapy through intravenous injection are globally located in the CNS, or in multiple CNS regions.
  • Such global delivery across the BBB is achieved by using selected AAV serotypes (such as AAV9) that have been shown to deliver genes across the BBB after intravenous injection.
  • the viral vector (such as AAV) of the invention is delivered to achieve widespread delivery for diseases of extensive pathology including glioblastoma multiforme (GBM), lysosomal storage diseases (LSDs), Alzheimer's disease, Parkinson's disease and Canavan's disease.
  • the viral vectors are delivered intracerebroventricularly, intrathecally, intranasally, or systemicaly (e.g., intravenous and intra-arterial injections).
  • the systemic delivery route for gene delivery vehicles is very promising, and has the potential to be a conduit to the entire CNS (every cell in the brain is a maximum distance of 40 ⁇ m from an endothelial cell), provided that the issue of BBB blockage and the filtration systems of the spleen and liver are adequately dealt with.
  • the viral vectors (such as AAV) of the invention are delivered to neuronal target cells by intravascular, intrathecal, and/or cerebrospinal fluid (CSF) administration.
  • CSF cerebrospinal fluid
  • Many AAV capsids such as AAV9, AAV2.5, rAAVrh.39, rAAVrh.43, rAAV7, rAAVrh.8 and rAAVrh.10, have remarkable CNS tropism after vascular (Foust et al., Nat Biotechnol. 27:59-65, 2009; Gray et al., Gene Ther. 20:450-459, 2013; Zhang et al., Mol Ther.
  • the viral vectors (such as AAV) of the invention are delivered to neuronal target cells by direct intracranial injection.
  • a large range of AAV capsids including pseudotypes rAAV2/1, rAAV2/2, and rAAV2/5, hu.32, hu.37, pi.2, hu.11, rh.8, hu.48R3, bb2, cy5, rh20, rh39, rh43, AAV7, AAV8, and AAV9, have been demonstrated for their CNS gene transfer properties by direct intracranial injection (Burger et al., Mol Ther. 10:302-317, 2004; Cearley et al., Mol Ther.
  • Vectors based on distinct AAV serotypes or pseudotypes can be chosen for specific applications in the nervous system, depending on their preferential or global tropism within the different regions of the brain and target neuronal tissues.
  • the viral vector is AAV9.
  • AAV9 The vast majority (>90%) of cells in the CNS transduced by intravenously injected AAV9 using strong promoters like CBA are astrocytes and neurons. The distribution of vector is widespread with transduced cells detected in all areas of the brain from the olfactory bulb to cerebellum.
  • delivery of the viral vectors of the invention across the BBB is further enhanced by a variety of techniques including: intra-nasal administration and/or intracranial NSC transplantation that bypasses the BBB; transiently permeating the BBB using osmotic manipulation; magnetic resonance imaging guided focused ultrasound (MRIgFUS) to selectively open the BBB and allow therapeutics to pass into the parenchyma; convection enhanced delivery (Debinski and Tatter, Curr Opin Mol Ther. 12:647-653, 2010); introduction into a fluid space, e.g., vasculature, brain ventricles or intrathecal space in the spinal cord, such that cells lining the space produce the deficient protein (Chen et al., Nat Med.
  • MRIgFUS magnetic resonance imaging guided focused ultrasound
  • the viral vector (such as AAV) of the invention comprises a targeting peptide that bind selectively to the brain endothelial cells or a brain endothelial cell receptor (such as the low-density lipoprotein (LDL) and other related receptors that can be targeted by Apolipoprotein E-derived peptides) to target the viral vector to the brain vasculature.
  • a targeting peptide that bind selectively to the brain endothelial cells or a brain endothelial cell receptor (such as the low-density lipoprotein (LDL) and other related receptors that can be targeted by Apolipoprotein E-derived peptides) to target the viral vector to the brain vasculature.
  • LDL low-density lipoprotein
  • Another aspect of the invention provides a method of producing the rAAV vector of the invention, comprising introducing the rAAV vector of the invention into a packaging cell line that constitutively or inducibly provides rep/cap proteins in trans.
  • the packaging cell line is a HEK293 cell line, a HeLa cell, or an A549 cell.
  • the subject rAAV vector can be produced using any of many art-recognized approach.
  • the rAAV vector is produced based on the helper-virus-free transient transfection method with all cis and trans components (e.g., vector plasmid and packaging plasmids, along with helper genes isolated from adenovirus) in host cells such as 293 cells.
  • the rAAV vector is produced using recombinant herpes simplex virus (rHSV)-based AAV production system, which utilizes rHSV vectors to bring the AAV vector and the Rep and Cap genes into the cells.
  • rHSV herpes simplex virus
  • the rAAV vector is produced based on baculovirus system which requires simultaneous infection of insect cells with several baculovirus vectors to deliver the rAAV vector cassette and the Rep and Cap genes.
  • the rAAV vector is produced based on the AAV producer cell lines derived from HeLa or A549, which stably harbored AAV Rep/cap genes.
  • the AAV vector cassette can either be stably integrated in the host genome or be introduced by an adenovirus that contained the cassette.

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Microelectronics & Electronic Packaging (AREA)
  • Chemical & Material Sciences (AREA)
  • Biotechnology (AREA)
  • Manufacturing & Machinery (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Organic Chemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Public Health (AREA)
  • Biochemistry (AREA)
  • Medicinal Chemistry (AREA)
  • Veterinary Medicine (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Animal Behavior & Ethology (AREA)
  • Epidemiology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Microbiology (AREA)
  • Plant Pathology (AREA)
  • Virology (AREA)
  • Neurosurgery (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • General Chemical & Material Sciences (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Neurology (AREA)
  • Production Of Multi-Layered Print Wiring Board (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)

Abstract

The invention described herein provides methods and compositions for using recombinant AAV-based viral vectors to express any gene-of-interest (GOI) under its native promoter and a proprietary heterologous enhancer, preferably expressing a GOI defective in a neuronal disease or disorder in order to treat the neuronal disease or disorder.

Description

    RELATED APPLICATIONS
  • This application claims priority to U.S. Provisional Application No. 63/014,283, filed on Apr. 23, 2020, the entire contents of which is expressly incorporated herein by reference.
  • BACKGROUND OF THE INVENTION
  • Gene-based neurotherapeutics holds great therapeutic potential. Clinical efficacy has already been obtained for central nervous system (CNS) diseases and eye disease through Adeno Associated Virus (AAV) gene therapy. The general lack of current effective neurotherapies for many devastating neurologic diseases portends great future demands for AAV gene-based neurotherapy.
  • Efficient and specific delivery to a target organ, such as the brain, is a rate limiting step in many protein-based therapeutics. There are multiple barriers to efficient direct protein delivery to the CNS.
  • One of the great challenges lies in the blood-brain barrier (BBB). Unlike the vasculature in other organ systems, the brain endothelium forms intercellular tight junctions (TJs) that limit greatly the paracellular access of blood-borne components to the tissue parenchyma. As a consequence of this “physical barrier” generated by TJs, gene transfer vehicles attempting entry into the CNS are forced to go through the transcellular route. There, however, specialized lumen facing efflux transporters, which include both the ABC family of transporters and multidrug resistance-related proteins, contribute to the formation of the “transport barrier” that prevents xenobiotic substances from gaining access to the neuronal environment. Another obstacle that partially prevents endothelial crossing of gene-therapy vehicles is the trancytotic/endosomal sorting, which can lead to degradation. Although the endothelium is the greatest obstacle in restricting access to the CNS, the presence of a complex extracellular space, molecular composition and geometry also significantly impede diffusion of viral-based gene delivery systems. That is, even if a therapy can be designed to target one of these receptors and traverse the BMVEC cytoplasm to be released at the basal cell membrane, only the first of many guardian mechanisms at the BBB has been overcome. If neurons are the target cell type, once the BBB endothelium has been passed, the basal lamina, pericytes, microglial ramifications, and the glial limitans comprised of astrocytic endfeet still have to be navigated before surpassing all of the checkpoints at the BBB and encountering a neuron.
  • As a result, intravascular administration of most viral vectors with the exception of few AAV serotypes has proven unsuccessful in the treatment of neurological diseases, potentially because these gene delivery platforms cannot negotiate their way across the BBB to access their target cells.
  • Another challenge lies in the promoters used in the gene therapy vectors. With the discovery of new viral vectors, mainly AAV capsids, with remarkable CNS tropism after vascular, CSF administration, or direct intracranial injection, the field of AAV mediated CNS gene therapy is accelerating rapidly towards clinical trials in a number of neurological diseases, and the expedient approach has been to use specific promoters with a proven track record of mediating long-term gene expression given the serious nature of many of these diseases, many of which are pediatric.
  • The majority of the pre-clinical studies have used strong ubiquitous promoters to provide a broad overview of transgene expression in different tissues and cell types throughout the body after intravascular infusions, or CNS cell types after intracranial delivery.
  • The CAG/CBA/CB promoter is generally viewed as a ubiquitous promoter as it works in numerous tissues and cell types. However, the assumption that it is highly active in every cell type may not be entirely warranted in the CNS. Studies have been conducted using AAV vectors with the same capsid but carrying different CNS cell specific promoters or the CBA promoter. The apparent discrepancy in CNS transduction profiles of these different AAV vectors suggests that the CBA promoter may not mediate detectable transgene expression in all cell types in the brain. Additional experiments have been conducted using AAV vectors with numerous tropism carrying the cytomegalovirus (CMV), CBA, Rous sarcoma virus (RSV), or human GUSB promoters to drive transgene expression, in the context of intra-parenchymal delivery to different brain structures.
  • In clinical studies, the majority of AAV vectors for treating neurological diseases have used the CMV immediate early promoter followed by a chimeric intron composed of a CMV splice donor and a human globin splice acceptor, or a version of the CAG/CBA promoter (supra). Two other clinical trials have tested AAV vectors carrying mammalian promoters such as the mouse phosphoglycerate kinase (PGK) promoter, or the rat neuron-specific enolase (NSE) promoter. The choice of promoters is based on the availability of extensive data from pre-clinical studies in different animal models showing that AAV-mediated transgene expression under these promoters is stable and long lasting in the mammalian brain. The stability of AAV gene expression in the human brain has been evaluated in clinical trials using an AAV2 vector encoding aromatic amino acid decarboxylase (AADC) injected into the putamen of Parkinson's disease patients, or children afflicted with AADC deficiency.
  • An indisputable fact about these AAV-CMV or AAV-CAG/CBA vectors is their ability to drive strong gene expression in many neurons in the CNS of multiple species. The disparate CNS transduction results between cellular promoters and strong non-mammalian promoters raises the possibility that the current generation of AAV vectors express transgenes at exceptionally high levels in a small percentage of all cells that harbor transcriptionally ready vector genomes in the nucleus. A recent report documented considerable toxicity in non-human primates (NHPs) receiving intraparenchymal injections of an AAV2 vector encoding human acid sphingomyelinase. Another safety study on intracranial delivery of an AAVrh10 vector encoding human tripeptidyl peptidase (TPP-I) in NHPs reported white matter edema and inflammation. The interpretation of these results is complicated, but a possible interpretation of the results is that these inflammatory responses are secondary to protein expression overload and non-specific expression in AAV transduced neurons.
  • The potential for toxicity due to excessive transgene expression is particularly acute in neurological diseases where the expressed proteins are not secreted, and thus the need for high level expression is even less. In one particular case, the development of an AAV gene therapy approach for Rett syndrome represents a significant challenge to the prevalent approach of simply using strong promoters. This disease is caused by mutations in the MeCP2 gene on the X-chromosome. Interestingly, duplication of this gene also leads to neurodevelopmental delays, suggesting that expression of this gene in the CNS need to be tightly controlled, which is a challenge to current AAV gene therapy.
  • SUMMARY OF THE INVENTION
  • The invention described herein provides a gene replacement therapy platform to replace mutated genes by AAV Gene Therapy.
  • According to the invention described herein, about 40 exemplary central nervous system (CNS) disease drug protypes are packaged in recombinant AAV vectors for intravenous administration to a patient, in order to express functional genes in native neuronal cells. The technology is designed for intravenous dosing of AAV to express genes precisely in intended cell types by using endogenous/native promoter fused with viral/heterologous enhancer.
  • In an exemplary embodiment, the gene cassette in the AAV vector of the invention begins with AAV inverted terminal repeat (ITR), followed by proprietary heterologous enhancer, followed by a native promoter of a gene of interest (GOI), 5′UTR, proprietary codon optimized gene of interest (GOI), 3′UTR, and ends in another AAV inverted terminal repeat (ITR).
  • Thus one aspect of the invention provides a recombinant adeno-associated viral (rAAV) vector, comprising: (1) a polynucleotide comprising a gene of interest (GOI), under the transcriptional control of the native/endogenous promoter of said GOI; (2) a heterologous enhancer that enhances transcription of the GOI from the native/endogenous promoter; and, (3) a pair of inverted terminal repeat (ITR) flanking the polynucleotide.
  • In certain embodiments, the GOI is a neuronal gene, or is a gene expressed in neuronal tissues (CNS and/or PNS).
  • In certain embodiments, the GOI is defective (e.g., loss-of-function) in a neuronal disease or condition.
  • In certain embodiments, the neuronal disease or condition is a CNS disease or condition.
  • In certain embodiments, the CNS disease or condition is: Ceroid-Lipofuscinosis, Fucosidosis, Giant Axonal Neuropathy, Globoid cell leukodystrophy, Mucolipidiosis Type IV, Neuronal Ceroid Lipofusinoses, Niemann-Pick Disease, Sandhoff Disease, Sanfilippo syndrome, Tay-Sachs Disease, Hunter Syndrome, Sialidosis, Multiple Sulfatase Deficiency, Childhood Epilepsy, CMT Peripheral Neuropathy, Neuronal Ceroid Lipofusinoses, Aspartylglycosaminuria, Mucopolysaccharidoses, Parkinsons Disease/Symptoms, GM1-Gangliosidosis, Charcot-Marie-Tooth Type 1A, Retts Syndrome, Dannon Disease, Mucopolysaccharidoses (due to GNS, HGSNAT, NAGLU, or ARSB), Sly Syndrome, Biotin basal ganglia disease, Pelizaeus-Merzbacher disease, Neuronal Ceroid Lipofusinoses, Canavan Disease, Beta-Mannosidosis, Wolman Disease, Cystinosis, Alzheimer's Disease/Symptom, Salla Disease, Jansky-Bielschowsky disease, Neuronal ceroid lipofuscinoses, or GM2 gangliosidosis.
  • In certain embodiments, the GOI is: CLN3, FUCA1, GAN, GALC, MCOLN1, PPT1, SMPD1, HEXB, SGSH, HEXA, IDS, NEU1, SUMF1, GAT1/SLCA1, FIG. 4 , CLN5, AGA, IDUA, GDNF, GLB1, PMP22/MFN2, MECP2, LAMP2, NAGLU, ARSB, GUSB, SLC19A3, PLP1, TPP1/CLN2, ACY2/ASPA, MANBA, LIPA, CTNS, GNS, HGSNAT, NEUROD1, SLC17A5, CLN6, CLN8, or GM2A.
  • In certain embodiments, the rAAV has a tropism for neuronal cells, or for a cell in neuronal tissue.
  • In certain embodiments, the rAAV is AAV9, AAV2.5, rAAVrh.39, rAAVrh.43, rAAV7, rAAVrh.8 or rAAVrh.10, preferably AAV9.
  • In certain embodiments, the rAAV comprises AAV9, AAV2.5, rAAVrh.39, rAAVrh.43, rAAV7, rAAVrh.8 and rAAVrh.10 capsids, preferably AAV9 capsids.
  • In certain embodiments, the polynucleotide further comprises a 5′-UTR coding region, and a 3′-UTR coding region.
  • In certain embodiments, the heterologous enhancer is a viral enhancer.
  • In certain embodiments, the viral enhancer is CMV enhancer.
  • In certain embodiments, the viral enhancer is SV-40 enhancer.
  • In certain embodiments, the heterologous enhancer comprises a polynucleotide of SEQ ID NO: 1.
  • Another aspect of the invention provides a pharmaceutical composition comprising any one of the rAAV vector of the invention.
  • In certain embodiments, the pharmaceutical composition is formulated for intravenous administration (e.g., to the blood) or intrathecal administration (e.g., to spinal fluid).
  • Another aspect of the invention provides a cell infected with any one of the rAAV vector of the invention.
  • Another aspect of the invention provides a recombinant AAV virus comprising any one of the rAAV vector of the invention, wherein the serotype or pseudotype of the recombinant AAV virus is for preferential infection of a neuronal cell or a cell in neuronal tissue.
  • Another aspect of the invention provides a method of treating a (genetic) disease or disorder in a subject, the method comprising administering the recombinant AAV virus of the invention into the subject.
  • In certain embodiments, the genetic disease or disorder is a neuronal disease or disorder, and/or wherein the recombinant AAV virus preferentially infects a neuronal cell or a cell in neuronal tissue of the subject, thereby treating the neuronal disease or disorder.
  • In certain embodiments, the rAAV virus is administered to the subject intravenously or intrathecally.
  • Another aspect of the invention provides a method of producing any one of the rAAV vector of the invention, the method comprising introducing any one of the rAAV vector of the invention into a packaging cell line that constitutively or inducibly provides rep/cap proteins in trans.
  • In certain embodiments, the rep/cap proteins comprise one or more AAV9 capsid proteins.
  • It should be understood that any one embodiment of any one of the aspects of the invention above, including specific embodiments described only in one section of the application, can be combined with one or more additional embodiments.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a schematic drawing (not to scale) showing a generic structure of the subject rAAV vector, as well as an intended mode of action for the vector. The Gene of Interest (GOI) is put under the control of its native promoter. Transcription from the native promoter is further enhanced by the presence of a powerful enhancer, such as a viral enhancer like CMV enhancer, to facilitate the expression of the GOI in the native cells such GOI is normally expressed. The entire recombinant sequence is flanked by two AAV ITR sequences, such as AAV9 ITR sequences for the preferred neuronal cell tropism.
  • FIG. 2 is a gel image confirming the construction of rAAV viral vectors comprising a number of representative GOI's that are defective in neuronal diseases/disorders. Also shown is a rAAV vector encoding a GFP reporter gene, which can be switched out by any of the GOI's.
  • FIG. 3 is a gel image confirming the construction of additional rAAV viral vectors comprising a number of representative GOI' s that are defective in neuronal diseases/disorders.
  • DETAILED DESCRIPTION OF THE INVENTION 1. Overview
  • The invention described herein provides methods and compositions for using recombinant viral vectors, such as AAV-based viral vectors or lentiviral vectors to express any gene-of-interest (GOI) under the control of its native promoter so that the GOI is preferentially and exclusively expressed in the cells the GOI is intended to be expressed. The GOI is further under the transcriptional control of a powerful heterologous enhancer, such as a viral enhancer like CMV enhancer, which further boosts the transcription and expression of the GOI. Therefore, after a single intravenous treatment, the protein product of interest is expressed at high levels only in the endogenous/native cell types, thereby improving therapeutic safety and efficacy.
  • Thus in one aspect, the invention described herein provides a recombinant adeno-associated viral (rAAV) vector, comprising: (1) a polynucleotide comprising a gene of interest (GOI), under the control of the native/endogenous promoter of said GOI; (2) a heterologous enhancer that enhances transcription of the GOI from the native/endogenous promoter; and, (3) a pair of inverted terminal repeat (ITR) flanking the polynucleotide.
  • In certain embodiments, the GOI is a neuronal gene.
  • In certain embodiments, the neuronal gene is defective (e.g., loss-of-function) in a neuronal disease or condition.
  • In certain embodiments, the neuronal disease or condition is a CNS disease or condition.
  • In certain embodiments, the CNS disease or condition is: Ceroid-Lipofuscinosis, Fucosidosis, Giant Axonal Neuropathy, Globoid cell leukodystrophy, Mucolipidiosis Type IV, Neuronal Ceroid Lipofusinoses, Niemann-Pick Disease, Sandhoff Disease, Sanfilippo syndrome, Tay-Sachs Disease, Hunter Syndrome, Sialidosis, Multiple Sulfatase Deficiency, Childhood Epilepsy, CMT Peripheral Neuropathy, Neuronal Ceroid Lipofusinoses, Aspartylglycosaminuria, Mucopolysaccharidoses, Parkinsons Disease/Symptoms, GM1-Gangliosidosis, Charcot-Marie-Tooth Type 1A, Retts Syndrome, Dannon Disease, Mucopolysaccharidoses (due to GNS, HGSNAT, NAGLU, or ARSB), Sly Syndrome, Biotin basal ganglia disease, Pelizaeus-Merzbacher disease, Neuronal Ceroid Lipofusinoses, Canavan Disease, Beta-Mannosidosis, Wolman Disease, Cystinosis, Alzheimer's Disease/Symptom, Salla Disease, Jansky-Bielschowsky disease, Neuronal ceroid lipofuscinoses, or GM2 gangliosidosis.
  • In certain embodiments, the GOI is: CLN3, FUCA1, GAN, GALC, MCOLN1, PPT1, SMPD1, HEXB, SGSH, HEXA, IDS, NEU1, SUMF1, GAT1/SLCA1, FIG. 4 , CLN5, AGA, IDUA, GDNF, GLB1, PMP22/MFN2, MECP2, LAMP2, NAGLU, ARSB, GUSB, SLC19A3, PLP1, TPP1/CLN2, ACY2/ASPA, MANBA, LIPA, CTNS, GNS, HGSNAT, NEUROD1, SLC17A5, CLN6, CLN8, or GM2A.
  • In certain embodiments, the rAAV has a tropism for neuronal cells. In certain embodiment, the rAAV serotype is, AAV2.5, rAAVrh.39, rAAVrh.43, rAAV7, rAAVrh.8 or rAAVrh.10.
  • In certain embodiments, the rAAV serotype is AAV9. In certain embodiments, the rAAV has a pseudotype of xxx. In certain embodiments, the polynucleotide further comprises a 5′-UTR coding region, and a 3′-UTR coding region.
  • In certain embodiments, the heterologous enhancer is a viral enhancer.
  • In certain embodiments, the viral enhancer is CMV enhancer.
  • In certain embodiments, the viral enhancer is SV40 enhancer.
  • Another aspect of the invention provides a pharmaceutical composition comprising the rAAV vector of the invention.
  • In certain embodiments, the pharmaceutical composition is formulated for intravenous administration.
  • Another aspect of the invention provides a cell infected with the rAAV vector of the invention.
  • Another aspect of the invention provides a recombinant AAV virus comprising the rAAV vector of the invention, wherein the serotype or pseudotype of the recombinant AAV virus is for preferential infection of a neuronal cell.
  • Another aspect of the invention provides a method of treating a (genetic) disease or disorder in a subject, the method comprising introducing the recombinant AAV virus of the invention into the subject.
  • In certain embodiments, the genetic disease or disorder is a neuronal disease or disorder, the recombinant AAV virus preferentially infects a neuronal cell of the subject, and treats the neuronal disease or disorder.
  • Another aspect of the invention provides a method of producing the rAAV vector of the invention, comprising introducing the rAAV vector of the invention into a packaging cell line that constitutively or inducibly provides rep/cap proteins in trans.
  • With the general aspects of the invention described, further detailed embodiments of the invention are provided below.
  • 2. Treatable Disease or Conditions
  • The recombinant AAV vector and recombinant AAV virus of the invention, together with the pharmaceutical composition comprising the same, can be used to treat a number of diseases and conditions, particularly neuronal diseases or indications treatable by gene therapy.
  • Diseases or conditions treatable by the AAV vector/virus/method of the invention are typically characterized by lacking a functional neuronal protein or enzyme, which neuronal diseases or indications can be alleviated (if not cured), treated, or prevented (e.g., at least the onset is delayed), by providing functional versions of the proteins or enzymes. The externally provided functional versions of the proteins or enzymes may be identical in sequence to the wild-type proteins or enzymes that are lacking in a host having such diseases or conditions, or may be a variant of the wild-type proteins or enzymes.
  • In certain embodiments, the neuronal diseases or conditions can be treated by introducing into the host the subject AAV virus that will infect target host (neuronal) tissues, and stably express the encoded GOI that is defective in the diseases or conditions throughout the life span of the host. The subject AAV virus may only need to be introduced into the host once, or can be introduced into the host multiple times as needed.
  • A few representative polynucleotide sequences of the invention encoding several genes-of-interest (GOI) are listed below. These polynucleotide sequences can be part of the subject AAV vector sequences. These sequences are for illustrative purpose only and are non-limiting.
  • Giant Axonal Neuropathy (GAN)
  • Giant axonal neuropathy (GAN) is a severe, slowly progressive neurodegenerative disorder characterized by progressive motor and sensory peripheral neuropathy, central nervous system involvement (including pyramidal and cerebellar signs), and characteristic kinky hair in most cases. GAN is inherited as an autosomal recessive trait, and is associated with more than 50 different causative mutations in GAN gene (16q24.1), encoding the ubiquitously expressed cytoskeletal protein gigaxonin.
  • Disease prognosis varies but is usually poor. Most patients become wheelchair-dependent in the second decade of life, and eventually progress to a bedridden state in early adulthood. Secondary complications, such as respiratory failure, may occur. Life expectancy does not exceed the third decade.
  • Current GAN treatment is symptomatic, focusing on stimulating intellectual and physical development, as well as communication skills, through physical, occupational and speech therapy, starting from the initiation of symptoms. Skeletal deformities and ophthalmoplegia may need surgery.
  • Thus in one embodiment, the GOI is GAN that encodes gigaxonin, and the disease or condition treatable by the subject AAV vector/virus is Giant Axonal Neuropathy (GAN). The composition and methods of the invention can be used to produce gigaxonin expressed permanently from AAV vectors stably maintained with the host's genome as extra chromosomal material, in native neuronal cells that normally express GAN, thus effectively treating giant axonal neuropathy.
  • A representative polynucleotide of the invention that can be encoded by the AAV vector of the invention is described below.
  • (SEQ ID NO: 2)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGA
    GGAACTTGGTTAGGTACCTTCTGAGGCGGAAAGAA
    CCAGCGTTACATAACTTACGGTAAATGGCCCGCCT
    GGCTGACCGCCCAACGACCCCCGCCCATTGACGTC
    AATAATGACGTATGTTCCCATAGTAACGCCAATAG
    GGACTTTCCATTGACGTCAATGGGTGGAGTATTTA
    CGGTAAACTGCCCACTTGGCAGTACATCAAGTGTA
    TCATATGCCAAGTACGCCCCCTATTGACGTCAATG
    ACGGTAAATGGCCCGCCTGGCATTATGCCCAGTAC
    ATGACCTTATGGGACTTTCCTACTTGGCAGTACAT
    CTACGTATTAGTCATCGCTATTACCATG
    CTCACAGGGATACTGGGTAGGTATCATGGAGAAGA
    TAAGTCCAAGCTGCTTGTCTTGTGTGTTTCTTACC
    TTTTTAAATATATTTTTCGGCCGGGTGTGGTGGCT
    TATGCCTGTAATTCCAGTACTTTGGGAAGCCAAGG
    TGGGCGGATCACAAGGTCAGGAGTTCGATACCAGC
    CTGGCCAACATGGTAAAACCCCATCTCTACTAAAG
    ATGCAAAAGTTAGCCAGGCATGGTGGCACATGCCT
    GTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGA
    ACCACTTGAACCCGGGAGGCGGAGGTTGCAGTGAA
    CAGAGATTGCACAACTGCACTCCAGCCTGGGCAAT
    AGAGGGAGACTCCATCTCAAAAAAAAAAAAGAAAG
    AAGAAAATATTTTTCAACATCCACAAAAATATAGA
    AACAGTACAATGAATCTCCACATCCCCATTAAACA
    TCCACAACAATATAGAGACAGTACAACGAATCTCC
    ACATCCCCATTACGTGGCTTTAACAATTAGTAACT
    TGCGGCCACTTTTGACTCATCTACACTCCCATCTG
    CTTCCCCATCCCTTTGTTTTGTTTTGAAGCAAATC
    CTAGACATCTTTATCATTGCATCCGGGAACACTGA
    GCTGCATTTACAGAAGAAAGTGGAAATGCATGTGT
    CTTCCACAGCCGTACCCTCCCCATCTATTCTGCCC
    TGAAAACAGTTAGATACTCGGATTCCTGCCAGTGA
    CTCCAAAGCCTTGGCTGGTTCAGGGTAAATACCCA
    AGACTTTATTGGAGCCTTTCAGTTGGTTGAAGGAC
    AGTGAAAATTCCATCCTGTCTTGCCTGGAATGCTA
    GCCACATTCTTCTGGTCCGGAGTCCTCATACAATT
    GCTAATTTTTTAGAGGCTTATGGAGTCCCTAACGT
    TTGCACAAAATGGTTGCTACATTCCACACTCCCAT
    GTGCCCTGAGCAACCTCCAGGGAAACTTCAGATAC
    CGTGGGGCTTGGCCTGCCAGCTGAGCTGACCCAGC
    GATCCTTTCTATCACAGCTTTGATCTTCACTGGGT
    TGGGAGCCTGTCTGAATCTAAACATCTGCCTCTCT
    ACTGCCCACCCCTTCAGCATTTCCTAGCTCTCCCT
    GAAAGGTGCAAACATGCATGCAGTTACTCAAACAG
    AATATTTATCCTTAACTCCCTCTCCCTCACAATTT
    CTTTTAAAACAATCCACTATTGTCTACCCCCGCCC
    CCGCATCACCCTTTCATCCAGGTGCCCATTCCCTT
    CTCTCTAGTCCTGCAATACCTAAGCATCGATCTGC
    CTTTAGCACATCCATCCCCGCCTTATTCTCCACGC
    AGGAGCTGAGTGACCTGCACAAAATCCCCTGCTAA
    ATACGGTACTTCCATGGTTCTTCACCACTGCAGAA
    ACCAAACTCCTTCCCACACCAAGGAGGCCCTGCAA
    AATTTAAGGCCTCCTTCGGACTTCTCCAGCTTCAT
    CTGCTACATCTCCTTCCTAGCCCGCAACCAAAACC
    ATCTTTTCCGGGTCTGTTTTCTCGGGTTTTTGTCC
    TCTCACCTGGGCAGCTCCTTTTCTCTCATAAATAA
    AGAAAGCTACAGCGACTTCGCCCTTGCTTCCATGC
    ATCATCTCATGAGTCCACAACAACACTAAGAATTC
    GCTTTCATTTTGCAGACGGAGAAACTGAGGATGGG
    CACCTTACCCATGGCCACAGAGCTAGTAAGTGGAA
    CAGCTGCAACCAGGATTCAAACTCGCGGCTCCAGA
    GGCCACAGGCTTAACCACGGAATCCTAGCCCGCCC
    GCACATCCCTCCATCGCGGGAGCGACCTATCTCAT
    CCGTAGACACCTTCCGCTGCACCCAGCCGCAGCTG
    GTCAGGCGGGCGGAGGAAGGAGGCTTCTGATTGGC
    CATCTCCAGCAGCGGAGGCGAGAAGGCGGGGAGCG
    ACGGTGCGGGGGCGTCGCGGAGAAATGACTTCATG
    ACGCGGCTTCTTAAAGGCGCCGGCCCAGCGCGCCG
    CGGA
    Figure US20230142867A1-20230511-P00001
    Figure US20230142867A1-20230511-P00002
    ATGGCTGAGGGAAGTGCGGTCTCAGACCCCCAACA
    CGCTGCAAGACTTCTGCGAGCTCTCTCATCTTTCA
    GAGAGGAGTCACGCTTCTGTGACGCACACTTGGTC
    CTTGACGGAGAAGAGATACCTGTCCAGAAAAATAT
    ACTTGCCGCTGCATCCCCATACATTCGCACCAAGC
    TGAACTACAATCCCCCTAAAGACGACGGGAGCACC
    TATAAAATCGAATTGGAGGGCATCTCCGTGATGGT
    AATGCGAGAGATACTGGATTACATATTTAGCGGCC
    AGATCAGGCTGAATGAGGATACGATCCAAGATGTT
    GTACAGGCAGCGGACTTGCTGCTTCTGACGGACCT
    TAAGACCCTTTGCTGCGAGTTCCTCGAAGGGTGTA
    TTGCTGCCGAGAATTGTATCGGAATACGAGACTTC
    GCACTCCATTACTGCCTTCACCACGTTCATTACCT
    TGCGACAGAGTATTTGGAAACACATTTCCGCGACG
    TTAGCTCTACAGAGGAATTTCTCGAACTTAGTCCT
    CAAAAGCTCAAGGAGGTTATTAGTCTTGAAAAGCT
    CAACGTCGGAAATGAGCGGTACGTGTTTGAGGCCG
    TGATACGGTGGATCGCCCACGACACCGAGATCCGC
    AAGGTCCACATGAAGGATGTAATGTCAGCGCTGTG
    GGTATCCGGTCTTGACTCTTCATACTTGAGAGAGC
    AGATGCTCAACGAACCTTTGGTGAGGGAAATAGTG
    AAGGAGTGTAGTAACATCCCCCTCAGCCAACCCCA
    GCAAGGCGAAGCGATGCTCGCGAACTTCAAGCCCC
    GAGGCTACAGCGAATGCATAGTAACAGTGGGAGGT
    GAAGAACGGGTCTCCCGGAAACCCACTGCGGCGAT
    GCGGTGTATGTGTCCCCTGTATGATCCCAATCGCC
    AACTTTGGATCGAGCTGGCGCCGCTGTCTATGCCA
    CGAATAAATCACGGGGTTCTTTCAGCCGAGGGGTT
    TCTGTTCGTCTTTGGAGGGCAGGATGAGAACAAAC
    AAACACTCTCCAGTGGGGAGAAATACGACCCCGAT
    GCAAACACTTGGACAGCTCTGCCACCGATGAACGA
    AGCCCGACACAACTTTGGTATAGTCGAAATCGACG
    GTATGCTGTATATCCTTGGAGGAGAGGACGGAGAA
    AAAGAACTCATCTCCATGGAATGTTACGACATTTA
    CAGCAAAACTTGGACAAAGCAACCCGACCTTACTA
    TGGTGAGGAAGATAGGATGTTATGCGGCAATGAAG
    AAAAAGATATACGCCATGGGAGGCGGTTCTTATGG
    TAAGCTCTTCGAGAGTGTGGAGTGTTATGACCCTC
    GAACGCAGCAATGGACAGCCATATGTCCCCTGAAG
    GAACGGCGATTTGGGGCGGTAGCGTGTGGAGTTGC
    AATGGAGTTGTACGTTTTCGGAGGTGTTCGAAGTC
    GGGAGGACGCGCAAGGGTCAGAGATGGTCACATGT
    AAAAGCGAATTTTATCATGACGAATTTAAGCGGTG
    GATTTATCTGAACGACCAAAACCTCTGTATTCCAG
    CTTCCAGTTCCTTCGTATACGGCGCTGTGCCAATA
    GGTGCGTCAATTTACGTCATAGGCGACTTGGACAC
    TGGTACTAATTATGATTACGTACGCGAGTTTAAGA
    GGAGTACAGGCACCTGGCACCACACAAAACCGCTG
    CTCCCGTCCGATCTGCGAAGGACAGGTTGCGCTGC
    ATTGAGGATTGCTAACTGCAAACTTTTCCGCCTGC
    AACTTCAGCAAGGGCTTTTCCGCATACGAGTCCAC
    AGTCCTGACTATAAAGATGACGACGACAAATA
    Figure US20230142867A1-20230511-P00003
    Figure US20230142867A1-20230511-P00004
    Figure US20230142867A1-20230511-P00005
    Figure US20230142867A1-20230511-P00006
    Figure US20230142867A1-20230511-P00007
  • In the sequence herein, double underlined sequences represent proprietary viral enhancer sequences (SEQ ID NO: 1, see below); bold sequences are native promoter sequences for GAN; italic sequences are coding sequence for the GOI (GAN in this case); the subscripted sequences immediately 5′ and 3′ to the GOI coding sequence are 5′-UTR and 3′-UTR sequences, respectively.
  • The proprietary enhancer sequences (SEQ ID NO: 1) is listed below.
  • (SEQ ID NO: 1)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGT
    ACCTTCTGAGGCGGAAAGAACCAGCGTTACATAACTTACGGTAAATGGCC
    CGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACG
    TATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGT
    GGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATA
    TGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGG
    CATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACAT
    CTACGTATTAGTCATCGCTATTACCATG 
  • The GAN protein sequence is:
  • (SEQ ID NO: 3)
    MAEGSAVSDPQHAARLLRALSSFREESRFCDAHLVLDGEEIPVQKNILAA
    ASPYIRTKLNYNPPKDDGSTYKIELEGISVMVMREILDYIFSGQIRLNED
    TIQDVVQAADLLLLTDLKTLCCEFLEGCIAAENCIGIRDFALHYCLHHVH
    YLATEYLETHFRDVSSTEEFLELSPQKLKEVISLEKLNVGNERYVFEAVI
    RWIAHDTEIRKVHMKDVMSALWVSGLDSSYLREQMLNEPLVREIVKECSN
    IPLSQPQQGEAMLANFKPRGYSECIVTVGGEERVSRKPTAAMRCMCPLYD
    PNRQLWIELAPLSMPRINHGVLSAEGFLFVFGGQDENKQTLSSGEKYDPD
    ANTWTALPPMNEARHNFGIVEIDGMLYILGGEDGEKELISMECYDIYSKT
    WTKQPDLTMVRKIGCYAAMKKKIYAMGGGSYGKLFESVECYDPRTQQWTA
    ICPLKERRFGAVACGVAMELYVFGGVRSREDAQGSEMVTCKSEFYHDEFK
    RWIYLNDQNLCIPASSSFVYGAVPIGASIYVIGDLDTGTNYDYVREFKRS
    TGTWHHTKPLLPSDLRRTGCAALRIANCKLFRLQLQQGLFRIRVHSPDYK 
    DDDDK
  • Conservative changes to the protein sequence above (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 conservative residue changes, or up to 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, or 10% conservative residue changes) that result in a functional equivalent version of the protein (e.g., that retains at least 70%, 75%, 80%, 85%, 90%, 95%, 97%, 99% of the wild-type enzyme activity) are also within the scope of the invention.
  • Likewise, in any of the nucleotide sequence encoding the protein and variants thereof, such as in SEQ ID NO: 2, nucleotide sequence changes that do not result in sequence change of the encoded protein in SEQ ID NO: 3 due to, for example, codon degeneracy, are within the scope of the invention. Further, codon optimization of any of the polynucleotide sequence for optimal expression in a specific type of host cell is also within the scope of the invention (see further details below).
  • Juvenile Neuronal Ceroid Lipofuscinosis (JNCL or Batten Disease)
  • Neuronal ceroid lipofuscinosis refers to a family of at least eight genetically separate neurodegenerative lysosomal storage diseases that result from excessive accumulation of lipopigments (lipofuscin) made up of fats and proteins in the body's tissues. The most commonly identified mutations are in the CLN3 gene, located on the short arm of chromosome 16 (16p12.1), found in childhood or juvenile NCLs (JNCLs).
  • JNCL is an autosomal recessive disorder. The normal function of CLN3 gene is not presently known, but it is known to be a transmembrane protein. Studies of the yeast CLN3 ortholog which encodes Battenin have suggested that the protein may play a role in lysosomal pH homeostasis. Furthermore, recent studies have also implied the protein's role in cathepsin D deficiency; the overexpression of the defective protein appears to have significant effects on cathepsin D processing, with implications suggesting that accumulation of ATP synthase subunit C would result.
  • All mutations resulting in JNCL have been shown to occur at the CLN3 gene. Of the mutations known to cause JNCL, 85% result from a 1.02 kb deletion, with a loss of amino acids 154-438, while the remaining 15% appear to result from either point or frameshift mutations.
  • Thus in one embodiment, the GOI is CLN3, and the disease or condition treatable by the subject AAV vector/virus is JNCL. The composition and methods of the invention can be used to produce CLN3 gene product expressed permanently from AAV vectors stably maintained with the host's genome as extra chromosomal material, in native neuronal cells that normally express CLN3, thus effectively treating JNCL.
  • A representative polynucleotide of the invention that can be encoded by the AAV vector of the invention is described below. Specifically, the proprietary enhancer sequence is in SEQ ID NO: 1. Native CLN3 promoter is known in the art. The coding sequence for CLN3 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • CLN3-DNA
  • (SEQ ID NO: 4)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGA
    GGAACTTGGTTAGGTACCTTCTGAGGCGGAAAGAA
    CCAGCGTTACATAACTTACGGTAAATGGCCCGCCT
    GGCTGACCGCCCAACGACCCCCGCCCATTGACGTC
    AATAATGACGTATGTTCCCATAGTAACGCCAATAG
    GGACTTTCCATTGACGTCAATGGGTGGAGTATTTA
    CGGTAAACTGCCCACTTGGCAGTACATCAAGTGTA
    TCATATGCCAAGTACGCCCCCTATTGACGTCAATG
    ACGGTAAATGGCCCGCCTGGCATTATGCCCAGTAC
    ATGACCTTATGGGACTTTCCTACTTGGCAGTACAT
    CTACGTATTAGTCATCGCTATTACCATG
    TTCCCCCAGGGGCGGAAGGGGCCGGATTGGTGGCT
    CTGGGGTGCATCCCCTCACCTATGATTAGGCTTGC
    AAAAAAGAACTTGGCTAAACTTTATATTGTGGATA
    TCTGTGTCTGCCTCCCACCTTCCTGTGACTTCCTT
    GATGGCAGATGCCATGTATGAAACGCTGCTGAATT
    TCTAGAGCCCTGCACGTTAGTAGGATTCCATTTTC
    CATCAGCTGAGTGACCTGGGAGGCACACTAGAGAG
    GGCACATTTCTTTTCTTTTCTTTTCTTTTTTTTTT
    TGAGACAGTCTCACTCTGTCACCCAGGCTGGAGTG
    TAGTGGCGCAGTCTCAGCTCACTGCAACCTCTGCC
    TCCTGGGTTCAAGTGATTCTCCTGCCTCAGCCTCC
    CGAGTAGCTGGGATTACAGGTGCGTGCCACCACAC
    CTGGTTAATTTTTGTATCTTTAGTAGAGACAGGGT
    TTCACCATGTTAGCCAGGCTGGTTTCGAACTCCTG
    ACCTCAGGTGATCTGCCCACCTTGGCCTCCCAAAG
    TGCTGGGATGACAGGCATGAGCCACCACACCCAGA
    CGAGAGGGCACATTTCCAACTGCATTCTGAAGGCC
    TGGTGGTGAGAAAGAGCATATAGAGGTGGCACAGC
    CCATAGCAGAGATGAGGCCAAGGACAAGAAGAGAA
    TGCAGGATGCCGCTTTCCACCTAGAAGAAGGTAAG
    GGTATGATTCCTTATTTGGAACTACATAAAGTGAC
    CTTGCTCTACTTCTCTTCCCCATACCCTCCTTCTC
    CCCTCCCCAGCCCCACTTTCCTATTCCAATTAGGG
    CAATTCTATGAAATCACAGCATCAGAATCTTAGGC
    CTTAAAATCATTTAGTCTGGCCAGGTGCAGTGGCT
    CACGCCTGTAGTCCTAGCACTTTGGGAAGCCAAGG
    CAGGTGGATTGCTTGAGCTCAGGAGTTTGAGGCCA
    CCCTGGGCAACGTGGCAAAATCCAGTCTCTACAAA
    AAACACAAAAATTAGCTGGGCACAGTGGTGCGCAC
    CTGCTGTCCCAGCTACTCGGGAGGCTGAGGTGGGA
    GGATCACCTGAGCCTGGGAGGTCAAGGTTGCAGCA
    AACCAAGATCCTGCCACTGCACTCCAGCCTGGGCA
    ACAGAGCAAAACCCTGTCAAAAAAAAAAAAAAAAT
    CTGGTCTAAGCACCACCTTTGATGGAGGCAGGAAT
    CCTGCAGCAGCCTGGAGCCAACCCAATGGCTCCCT
    GCCTCTGGTGCTCTATTTCTTATCACCATACAGGT
    CTCTAAGGTCACCTTTAGGTCTAACATTGTATGAA
    TGATTGTCAGCAACTTTCCAATTTTTTTCATATAC
    ATATATTTTTTTTATTTTATTTTTTACTTTTTCTT
    TTTTTGAGACAGGATCTCACTCTGTCACCCAGGCT
    GGAGTGCAGTGGCGTGATCATAGCTCACTGTAGCT
    TCAACCTCCTGGGCACAAGTGATCCTCCCACCTCA
    GCTTCCAAGGTACCTGTGACTACAGGAATGCGCCA
    CCATGCCCGGCTAATTTTGTTATATATATATATAT
    ATATATTTGTACCTTGGTTTTCTCATCTGTAACAC
    CAGGGGTTGAAGTGCTGTGATATTTTTTGGTTCTA
    GGATAATTTGGAAATAGGGATCTCTTAACATCTTG
    GATACTGCCAATAGATCATTCACGAAATCCCTTTA
    GATGAGTGTTATGTTTACTTAGGTCTCTAAAACAA
    TTTGTAAGGTCAACAGATGGGGATAAGAATGACCT
    GAAGCTGGTCGTCTTTGTTCGCAACCAGAAACAGT
    CCTGCCTCAAAAAAAAAAAAAAAAAAAAGGGAGTA
    GGCAGGTGGCCATATTTGTTTCTGGCAAGTGAGTG
    CTGAAGGAAAGGAGCTGAAGCCTCCCACAGTCATA
    ACTGGTGCTGGCAGGCTACTGTCTCGGTCTTGGGC
    GCCACTGATCTAAGGTCACGGCTCTGCTTGCTGCT
    CCCACCCGCTCCAGTTTAAAACCTGCGGTTCCAGG
    GTTCTCCAGCCCCTCCCTTTTTCACGCTCCGAAGC
    CGAGAAGGCCCAAAGCGAAGACAGAGAGGACCCGG
    AAGTAGGGAAAACCTCTGAGCACGTGATGGGGGAA
    CACGCGGGTGCTGT
    Figure US20230142867A1-20230511-P00008
    Figure US20230142867A1-20230511-P00009
    ATGGGAGGTTGTGCTGGTTCTCGACGCCGGTTCTC
    AGATAGTGAGGGGGAGGAAACGGTGCCCGAGCCTC
    GGCTTCCACTTCTGGACCATCAAGGCGCTCATTGG
    AAGAATGCAGTCGGGTTTTGGCTCCTCGGTCTCTG
    CAACAATTTCTCTTACGTGGTCATGTTGAGCGCTG
    CCCATGACATTCTTAGTCATAAAAGAACCTCAGGC
    AATCAGTCCCACGTTGATCCAGGTCCAACACCTAT
    TCCTCATAACTCTAGCAGCAGATTTGACTGTAATA
    GTGTTAGCACAGCTGCAGTGCTGCTCGCTGATATA
    CTTCCGACGCTTGTTATAAAGTTGTTGGCTCCTCT
    GGGATTGCACTTGCTGCCCTATAGTCCCCGAGTCT
    TGGTTTCTGGCATTTGTGCTGCCGGCTCATTTGTC
    TTGGTGGCTTTCAGTCACAGTGTGGGGACCAGTCT
    GTGTGGAGTCGTGTTCGCATCAATTAGCTCTGGAC
    TGGGCGAAGTAACCTTCCTGTCCCTCACAGCCTTT
    TATCCGCGCGCCGTAATTAGTTGGTGGTCATCTGG
    AACCGGCGGCGCGGGGCTGCTCGGCGCCCTTAGCT
    ACCTGGGATTGACTCAGGCGGGGTTGTCACCACAA
    CAGACTCTCCTTAGCATGCTTGGCATTCCAGCCTT
    GCTTCTGGCTTCTTACTTTCTGCTCCTTACGTCCC
    CGGAAGCGCAGGACCCTGGCGGTGAAGAGGAGGCG
    GAGTCCGCCGCTCGACAGCCACTCATTCGAACTGA
    AGCTCCCGAGTCCAAACCAGGAAGCTCTTCCTCCC
    TCTCCCTGCGGGAACGGTGGACAGTCTTTAAGGGA
    CTGCTGTGGTACATAGTGCCTCTTGTGGTAGTGTA
    TTTCGCGGAATACTTCATAAACCAGGGTCTTTTTG
    AGCTTCTTTTTTTTTGGAATACAAGCCTGTCACAC
    GCCCAGCAGTACCGATGGTATCAGATGCTGTACCA
    GGCGGGGGTTTTCGCTAGCCGATCCTCCCTGCGGT
    GCTGCAGGATTCGATTCACTTGGGCTCTCGCGTTG
    CTGCAATGCTTGAACCTCGTTTTCCTTCTCGCAGA
    CGTCTGGTTCGGTTTTCTCCCTAGTATCTATCTGG
    TCTTTTTGATTATATTGTATGAAGGTCTTCTGGGA
    GGAGCGGCATACGTGAATACTTTTCACAATATCGC
    CCTTGAGACTTCCGACGAGCATCGGGAATTTGCCA
    TGGCGGCGACGTGTATCTCCGATACCCTGGGTATA
    AGCCTGAGTGGACTGCTCGCGCTTCCGCTCCACGA
    TTTTCTGTGTCAATTGAGCGATTATAAAGATGATG
    ATGACAAATA
    Figure US20230142867A1-20230511-P00010
    Figure US20230142867A1-20230511-P00011
    Figure US20230142867A1-20230511-P00012
    Figure US20230142867A1-20230511-P00013
    Figure US20230142867A1-20230511-P00014
    Figure US20230142867A1-20230511-P00015
  • The CLN3 protein sequence is:
  • (SEQ ID NO: 5)
    MGGCAGSRRRFSDSEGEETVPEPRLPLLDHQGAHWKNAVGFWLLGLCNNF
    SYVVMLSAAHDILSHKRTSGNQSHVDPGPTPIPHNSSSRFDCNSVSTAAV
    LLADILPTLVIKLLAPLGLHLLPYSPRVLVSGICAAGSFVLVAFSHSVGT
    SLCGVVFASISSGLGEVTFLSLTAFYPRAVISWWSSGTGGAGLLGALSYL
    GLTQAGLSPQQTLLSMLGIPALLLASYFLLLTSPEAQDPGGEEEAESAAR
    QPLIRTEAPESKPGSSSSLSLRERWTVFKGLLWYIVPLVVVYFAEYFINQ
    GLFELLFFWNTSLSHAQQYRWYQMLYQAGVFASRSSLRCCRIRFTWALAL
    LQCLNLVFLLADVWFGFLPSIYLVFLIILYEGLLGGAAYVNTFHNIALET
    SDEHREFAMAATCISDTLGISLSGLLALPLHDFLCQLSDYKDDDDK 
  • Conservative changes to the CLN3 protein sequence above (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 conservative residue changes, or up to 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, or 10% conservative residue changes) that result in a functional equivalent version of the protein (e.g., that retains at least 70%, 75%, 80%, 85%, 90%, 95%, 97%, 99% of the wild-type enzyme activity) are also within the scope of the invention.
  • Likewise, in any of the nucleotide sequence encoding the protein and variants thereof, such as in SEQ ID NO: 4, nucleotide sequence changes that do not result in sequence change of the encoded protein in SEQ ID NO: 5 due to, for example, codon degeneracy, are within the scope of the invention. Further, codon optimization of any of the polynucleotide sequence for optimal expression in a specific type of host cell is also within the scope of the invention.
  • Numerous other neuronal diseases/disorders can be similarly treated by the constructs, AAV vectors, and methods of the invention. A representative (non-limiting) list of the treatable diseases and disorders, along with the corresponding GOI's, are provides in the table below.
  • Target No. Diseases Target Gene (GOI) GOI Size
    1 Ceroid-Lipofuscinosis CLN3 1403
    2 Fucosidosis FUCA1 1487
    3 Giant Axonal Neuropathy GAN 1880
    4 Globoid cell leukodystrophy GALC 2144
    5 Mucolipidiosis Type IV MCOLN1 1829
    6 Neuronal Ceroid Lipofusinoses PPT1 1007
    7 Niemann-Pick Disease SMPD1 1982
    8 Sandhoff Disease HEXB 1757
    9 Sanfilippo syndrome SGSH 1595
    10 Tay-Sachs Disease HEXA 1676
    11 Hunter Syndrome IDS 1739
    12 Sialidosis NEU1 1334
    13 Multiple Sulfatase Deficiency SUMF1 941
    14 Childhood Epilepsy GAT1/SLCA1 1886
    15 CMT Peripheral Neuropathy FIG4 2810
    16 Neuronal Ceroid Lipofusinoses CLN5 1163
    17 Aspartylglycosaminuria AGA 1127
    18 Mucopolysaccharidoses IDUA 2048
    19 Parkinsons Disease/Symptoms GDNF 722
    20 GM1-Gangliosidosis GLB1 2120
    21 Charcot-Marie-Tooth Type 1A PMP22/MFN2 569
    22 Retts Syndrome MECP2 1583
    23 Dannon Disease LAMP2 1322
    24 Mucopolysaccharidoses NAGLU 2318
    25 Mucopolysaccharidoses ARSB 1688
    26 Sly Syndrome GUSB 2042
    27 Biotin basal ganglia disease SLC19A3 1577
    28 Pelizaeus-Merzbacher disease PLP1 920
    29 Neuronal Ceroid Lipofusinoses TPP1/CLN2 1778
    30 Canavan Disease ACY2/ASPA 1028
    31 Beta-Mannosidosis MANBA 2726
    32 Wolman Disease LIPA 1286
    33 Cystinosis CTNS 1289
    34 Mucopolysaccharidoses GNS 1745
    35 Mucopolysaccharidoses HGSNAT 1994
    36 Alzheimer's Disease/Symptom NeuroD1 1157
    37 Salla Disease SLC17A5 1574
    38 Jansky-Bielschowsky disease CLN6 1022
    39 Neuronal ceroid lipofuscinoses CLN8 947
    40 GM2 gangliosidosis GM2A 668
  • The sequences of the above genes that can be included in the viral vectors as GOI's are provided below.
  • The FUCA1 protein sequence is:
  • (SEQ ID NO: 6)
    MRAPGMRSRPAGPALLLLLLFLGAAESVRRAQPPRRYTPDWPSLDSRPLP
    AWFDEAKFGVFIHWGVFSVPAWGSEWFWWHWQGEGRPQYQRFMRDNYPPG
    FSYADFGPQFTARFFHPEEWADLFQAAGAKYVVLTTKHHEGFTNWPSPVS
    WNWNSKDVGPHRDLVGELGTALRKRNIRYGLYHSLLEWFHPLYLLDKKNG
    FKTQHFVSAKTMPELYDLVNSYKPDLIWSDGEWECPDTYWNSTNFLSWLY
    NDSPVKDEVVVNDRWGQNCSCHHGGYYNCEDKFKPQSLPDHKWEMCTSID
    KFSWGYRRDMALSDVTEESEIISELVQTVSLGGNYLLNIGPTKDGLIVPI
    FQERLLAVGKWLSINGEAIYASKPWRVQWEKNTTSVWYTSKGSAVYAIFL
    HWPENGVLNLESPITTSTTKITMLGIQGDLKWSTDPDKGLFISLPQLPPS
    AVPAEFAWTIKLTGVKDYKDDDDK
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for FUCA1 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • FUCA1-DNA
  • (SEQ ID NO: 7)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGA
    GGAACTTGGTTAGGTACCTTCTGAGGCGGAAAGAA
    CCAGCGTTACATAACTTACGGTAAATGGCCCGCCT
    GGCTGACCGCCCAACGACCCCCGCCCATTGACGTC
    AATAATGACGTATGTTCCCATAGTAACGCCAATAG
    GGACTTTCCATTGACGTCAATGGGTGGAGTATTTA
    CGGTAAACTGCCCACTTGGCAGTACATCAAGTGTA
    TCATATGCCAAGTACGCCCCCTATTGACGTCAATG
    ACGGTAAATGGCCCGCCTGGCATTATGCCCAGTAC
    ATGACCTTATGGGACTTTCCTACTTGGCAGTACAT
    CTACGTATTAGTCATCGCTATTACCATG
    CCCATCTGTATTAAAAATACAAAAATTAGCCAGGC
    GTGGTGGCGTACCTGTAAACCCAGCTACTCGGGAG
    GCTGAGGCAGGAGAATGGCTTGAACCCAGGAGGTG
    GAGGTTGCAGTGAGCTGAGATTGCAGCACTGCATT
    CCAGCCTGGGCGACAGAGCAAGACTGCATCTCAAA
    AGAAAAGAAAAGAAAAAAGAAAAGAAAAGAAAAGA
    AAACAGCTGTGATTCAGCAATAGTCATTTTCTCTC
    CTTGTCACTGCCAGAGTGAGGAAGTGCTGGGTTCT
    GTCAACATAGGCTCTGACTTCTCATGAGGTGCAAG
    AAAGAAGCAGCATGAACAAAGGCATAGATTTTGCA
    CCTGGAGACACGTATTCTAGTCCCACCTCTGACTC
    AAACTAGCTGCATGATCTTGGGCAGCCCCCCAGCC
    CTCTCTGGGCTTCAGTTCCCTCCTCGGTACAATGG
    GAAGGGTAGGGAAGAGGATGGTGCCTAGGATCTTC
    AAAGTCCCAACTCTGATGTGCTGCCATTCTCTCAC
    TGGCATGTGGGAACTGAGTGTGCTACGTGGAGTCA
    TGAGACTTCTTCTTCCTTTTCTTTTTCTTTTTTTT
    TTTTTTTTTTTTTGAGACCGAGTCTCACTCTGTCG
    CCCAGGCTGGAGTGCAGTGGTGCGATCTCGGCTCA
    CTGTAACCTCCACCTCCCGAGTTCAAGCAATTCTC
    CTGCCTCAGCCTCCTGAGTAGCTGGGATTACAGGC
    GCACCCCATGCCCGGCTAATTTTTGTATTTTTAGT
    AGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTC
    TCAAACTCCCGACCTCAGGTGATCTGCCCACCTCG
    GCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCA
    CCTTGTCTGTTCAGAGACTTCTGACTTCCGTATTT
    CCCTCGCTAATGGCCTCACCTCCCGTTTGAGTATG
    TGCAGCATGTGCAAATTGCAGAGGAACTGGGGAAC
    TTCCCTGCCAGGCAATCCTGGGTCCTCTTGGCTCT
    CCACTTCCTCCTTCCTTCCTTCCTTCCTCCTTCCT
    TCCTTCCTTCCTTCCTTTCTTTCTTCCTTGGTCTG
    GCTCTGTCGCCCAGGGGCTGGAGTGCAGTGGCGGT
    ATCTCGGTACAAACTAAGCTATTTGAAATGCCTCG
    GAATTCCCAAAGAAAGCTCAGTAAATACCAGACAG
    ATGAAACGGTCCTGCCCGCTCTCCCTCTTTCAGTA
    CCCAGTTTTCTCTTTCCTCTCCTTTCATAACGTTT
    CTTCCCATCATCCCTCTACCTCCTTATTTCTGTCA
    GGGCTGTTAGATACCCTTCCGGTCTGTTATCTCCT
    CCTCCAGCCTTGTGCTACAGGACAACTCAAAAGCC
    CCCTATGGCTCCCAAGTGATCACTAAATAAAACTC
    GTGATCTCACTTCAGCCTACCCCTGCAGGATTGCT
    GCGGCAAAATGGACTACTCCCAGAGGCGTGATCAC
    GGTTCGGTTCACTGCAGCCCAGACTTCCCGGGCTC
    AAGCGATCCCGAACCTCCCACCTCAGCCTCTCGAG
    TAGCTGGGACTCCAGACGTGCACCACTACGCCAGG
    CTCATTTTGTTTATTTTTTATAGAGACAGGCTAGT
    CTCCAACTTGTGGGCTCAGAGCGATCCTCCCGCCT
    CGACTTCCGAAAGTGTTAGGATTACAGGCGTGAGC
    CACTGCGTCCGGCTCAATGGACTCCTTCTTAAATG
    TGAGTTCTCCGCACACACTGGGCCTTGCTCACCTC
    CACGATTTAACCTCGCAGTTAAGTCCCTTTTCAAG
    GTTTTTATTCGTCCTTCAGCGAAGCGCTGTCAACG
    CCTCCCTGCGTCTTTCCCTTCTTGAGCACCAGAGG
    GCGCTCCTCCGACCCGAGGAGGAAGCGACTCGGGC
    CAACCTGTCCAGGTTGTCCCACCCCTTTTTCCTTC
    CGGGCCAATCGTTAGTCAGAGTGGGCGGAGCCGCC
    CGCGGGCACCTGCGCGTTAAGAGTGGGCCGCGTCG
    CTGAGGGG
    Figure US20230142867A1-20230511-P00016
    ATGAGAGCACCCGGCATGCGATCAAGGCCCGCAGG
    GCCAGCTCTCCTCCTCCTGCTCCTTTTTTTGGGGG
    CTGCAGAATCTGTAAGAAGAGCTCAACCCCCAAGG
    AGGTACACTCCTGATTGGCCCTCATTGGACTCACG
    CCCACTCCCGGCGTGGTTTGATGAAGCTAAGTTCG
    GAGTGTTTATCCATTGGGGTGTATTCAGCGTGCCC
    GCGTGGGGGTCTGAGTGGTTTTGGTGGCACTGGCA
    AGGAGAGGGCCGCCCGCAATACCAGCGATTCATGA
    GGGACAATTATCCTCCAGGCTTCTCATACGCTGAC
    TTCGGTCCTCAGTTTACAGCTAGATTCTTCCATCC
    AGAGGAGTGGGCCGATCTTTTCCAGGCGGCTGGAG
    CTAAGTACGTCGTTCTGACGACAAAGCACCATGAA
    GGGTTCACCAACTGGCCAAGTCCAGTGTCCTGGAA
    CTGGAACAGTAAGGATGTCGGTCCTCATCGGGATC
    TGGTAGGCGAACTTGGGACTGCTCTCCGGAAGCGC
    AATATACGCTATGGTCTTTATCACAGTCTGCTGGA
    ATGGTTCCACCCACTGTACTTGCTGGACAAGAAGA
    ACGGATTTAAAACCCAACATTTTGTTAGCGCAAAG
    CATATGCCAGAGTTGTACGACCTGGTAAATTCCTA
    TAAACCCGACCTCATCTGGAGCGACGGAGAATGGG
    AGTGCCCGGATACTTATTGGAACTCAACTAACTTT
    CTCTCCTGGCTGTATAATGACTCCCCTGTGAAGGA
    CGAAGTTGTTGTAAATGACAGATGGGGTCAAAACT
    GCTCATGTCATCACGGGGGATACTATAATTGCGAA
    GACAAGTTTAAGCCTCAATCTCTTCCCGATCATAA
    ATGGGAGATGTGTACTTCTATTGACAAGTTTAGTT
    GGGGATACAGGCGCGACATGGCCTTGTCTGATGTT
    ACGGAGGAATCAGAAATCATCTCCGAGCTGGTACA
    AACGGTGTCATTGGGAGGCAACTACCTGCTGAACA
    TAGGCCCAACGAAAGACGGGCTTATCGTGCCGATT
    TTTCAAGAACGCCTTCTCGCAGTTGGCAAATGGCT
    CTCTATTAATGGGGAAGCTATTTACGCGTCAAAAC
    CTTGGAGGGTCCAATGGGAGAAAAACACCACTAGT
    GTATGGTACACAAGCAAAGGAAGCGCCGTCTACGC
    TATCTTCCTGCACTGGCCAGAGAATGGCGTACTCA
    ATTTGGAATCACCTATAACCACATCCACGACTAAG
    ATCACCATGCTCGGGATTCAAGGTGACCTTAAATG
    GTCTACAGACCCAGATAAAGGTCTTTTTATTAGTC
    TTCCCCAGCTGCCTCCCAGCGCGGTTCCGGCCGAG
    TTCGCCTGGACGATCAAGCTGACCGGGGTTAAAGA
    CTATAAAGACGATGATGATAAATA
    Figure US20230142867A1-20230511-P00017
    Figure US20230142867A1-20230511-P00018
    Figure US20230142867A1-20230511-P00019
    Figure US20230142867A1-20230511-P00020
    Figure US20230142867A1-20230511-P00021
    Figure US20230142867A1-20230511-P00022
  • The GALC protein sequence is:
  • (SEQ ID NO: 8)
    MAEWLLSASWQRRAKAMTAAAGSAGRAAVPLLLCALLAPGGAYVLDDSDG
    LGREFDGIGAVSGGGATSRLLVNYPEPYRSQILDYLFKPNFGASLHILKV
    EIGGDGQTTDGTEPSHMHYALDENYFRGYEWWLMKEAKKRNPNITLIGLP
    WSFPGWLGKGFDWPYVNLQLTAYYVVTWIVGAKRYHDLDIDYIGIWNERS
    YNANYIKILRKMLNYQGLQRVKIIASDNLWESISASMLLDAELFKVVDVI
    GAHYPGTHSAKDAKLTGKKLWSSEDFSTLNSDMGAGCWGRILNQNYINGY
    MTSTIAWNLVASYYEQLPYGRCGLMTAQEPWSGHYVVESPVWVSAHTTQF
    TQPGWYYLKTVGHLEKGGSYVALTDGLGNLTIIIETMSHKHSKCIRPFLP
    YFNVSQQFATFVLKGSFSEIPELQVWYTKLGKTSERFLFKQLDSLWLLDS
    DGSFTLSLHEDELFTLTTLTTGRKGSYPLPPKSQPFPSTYKDDFNVDYPF
    FSEAPNFADQTGVFEYFTNIEDPGEHHFTLRQVLNQRPITWAADASNTIS
    IIGDYNWTNLTIKCDVYIETPDTGGVFIAGRVNKGGILIRSARGIFFWIF
    ANGSYRVTGDLAGWIIYALGRVEVTAKKWYTLTLTIKGHFTSGMLNDKSL
    WTDIPVNFPKNGWAAIGTHSFEFAQFDNFLVEATRDYKDDDDK 
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for GALC and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • GALC-DNA
  • (SEQ ID NO: 9)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG TCCTCTGTCATCACCTTGGTATTTCCCATC
    TAGTCACGGTATTAAGTGGGTCTTATGCGCTTGTTAAGAGCTGGTAAGAGAAGCTTAACATTCACTTG
    ATGGAATGGTTACTGTGGCTTGTATCCACAGGAGCTCCAGCTCATGCTTAAAATACTGTGTTAGAACA
    AATGGATCAGAGTGAATTCTGTAACAGTAAATCCACAGAACTGAAAATTACTGACTTGCATCATAAAC
    TGGTTGCAGTGGCAAGGCAGAGAGTTCAGATTCGTGTTGTACTGGGGAAAGAAGAAACCAAAGAAAAT
    AAGTGGTTCATACACAGGGGACTATTTAACTGATCCAAGCCAGAAGACTACTGAGGGTGAAAGGAGCA
    TTATCAGTAATTATGTTTAGACAAAAGTTGTGACTGTCCTTAGTAAACCAGGATAGACGTAGAGTCAA
    TTCAAAAAGAAAGAAAGAACAAACATATCTAATGTACATTAAGAACCCAGGGATATCGGGTTCTTAAT
    GTGCATATTTATTGGATTGCCCAGAGCTTTGACCTTCAAGTATAATTGAGTTCTTGAAACTTAGATTT
    AGGTTTGTGGTTAAGGGGCAAAATAATCAGATTCCAAATAGGGTTCTGCAGACAAATCCCACAGGTCC
    TAAAGTCTCTGGGTCTTTTCCTTGCCATTGTGGTATCTCAGGGGCTTGTGTAGGCAATGATAACCAGT
    TCCATACTTCTGCTAAGAGAAATTTGGGGATCAGAAAATCTTTCTCTGCAGCTCTTCAGAGTCTCTTT
    AATTTACTTCAATAACAAGGAAATCAGAAATCTCAGGACAGTGCTGGATGTGGCAGAAATTCAGGGCA
    ATTCACAACTTTCCTACCAATTTAGAGTTACCTGTCTGGAATCACTTCAACTGACAGTGGAAGATAGT
    TTATAAAATTGTAGGGGAGGCAAAATTTCACCTTCGTCCCTGTAGGGTCCCAGCATGGCCCTAGTATT
    AAATTGATTTAAGATATTAATAGACTGACAGGAGACAACCACACAAATGTTTATTTTACATAGCACGG
    GGGCCCTCATTAAAAAAAAAAAAGACCTAAAGCAGCAGTTAAGAGCCAATTGCTTATAGAGTGAAATG
    GACAAAAAATAGTAGTTGTGAAACATGATAAGGCGAAGGGGCTTGGGGTAAAGAAGCGACTAGAAAGA
    TAGGGATTAACAAATTTTGTTTGGACAGATTTCCCTCGACTTCAACTTTCTGTCCTTGAAGATAACAA
    TGTTATTTTATAGGGAAGACATCTTTCACATGGGAATTTCATCTCCTCCTTTTAAGACACATAATATC
    CATGACAGTGATCTTGCACCTGCTGTTTTTAAGTGCCTTTAATTTAAAATAGTCAATATGCCACAGCA
    GCATATTTTGAGGTGGTATATTATTAACTCCTTCAAGGCTAAAAGCTTTGAATATGCCGGGGTAGTTA
    CCAGACTGGAGAACCTTCTCAGCTATTTGCCACTTGTGGAATGCATTCAAATGTACTGAGTGCTTATT
    AACTTTTCTCTCAGTCCCTCACTCTGCTGCTCTTCTCGCCTCATCTCACATAAGGGAAAACTCAGGCT
    CCAAAAGCAAAGTGATTCCACCAAGATGACAATGGAATCGGGACCTGGTGTCCAAGCCTGGTCCTCGC
    ACGAGGTAGATACTCCCTGAGATGCTGGGGAAGAGCCATGGTAGGGCCACTCATGGCCCTCTTCCTTT
    GGCGGACCTTGGAATCCACCTTCCCTGCGTTAAAGTGCTCCACCAGGTGACGCCGGAAGAGAAGCCAG
    CAGTAAGTACATGTGAAATCCAGTAGCTGACCACAGACACGCGAGCGATAAGAAAAAGCTCGCTGCTT
    CTTTTCTCAACACTCCGCCAGCTGCGTCAGAGGCCCTTGACCCGCACTCAAAATGGCGGCGGCGGCGT
    CAGCATCAGCGGCCTCCTGCCCGTATCTATCGTGGCGGCGACGGGACCCGCCTCCCTGGGCGCCGG
    Figure US20230142867A1-20230511-P00023
    Figure US20230142867A1-20230511-P00024
    ATGGCCGAATGGCTCCTTTCAGGATCATGGCAGCGCCGA
    GCTAAAGCGATGACGGCTGCCGCGGGCTCTGCTGGGAGAGCAGCTGTTCCTCTTCTGCTTTGTGCCTT
    GTTGGCACCTGGCGGAGCCTACGTTTTGGATGACTCTGATGGATTGGGTCGCGAGTTCGATGGTATCG
    GTGCTGTGTCTGGTGGCGGAGCCACGTCCAGGCTGCTGGTAAATTACCCGGAACCATATCGAAGCCAA
    ATCCTTGACTATCTTTTCAAACCTAACTTTGGAGCGAGCCTTCATATTCTCAAGGTCGAAATAGGCGG
    AGACGGGCAGACAACTGATGGCACCGAGCCGTCTCATATGCATTACGCTTTGGACGAAAATTACTTTA
    GAGGATACGAATGGTGGCTGATGAAGGAAGCCAAGAAAAGAAATCCCAATATTACCTTGATCGGCTTG
    CCGTGGTCATTTCCCGGGTGGCTGGGCAAGGGATTTGATTGGCCGTATGTAAATCTTCAATTGACGGC
    CTATTACGTAGTTACATGGATAGTAGGTGCCAAACGCTACCATGACCTGGATATTGATTACATTGGAA
    TTTGGAACGAACGATCCTATAACGCAAATTACATCAAAATTCTCCGAAAGATGCTCAACTATCAAGGC
    CTCCAGAGAGTTAAAATCATCGCGTCAGATAACCTTTGGGAGTCTATTTCTGCCTCAATGCTGCTCGA
    CGCGGAACTTTTTAAGGTGGTTGACGTTATAGGGGCGCATTATCCAGGTACCCACAGCGCTAAAGACG
    CAAAGCTCACGGGCAAAAAGCTGTGGAGCAGTGAGGACTTTAGTACCTTGAATAGTGACATGGGTGCA
    GGGTGTTGGGGTAGGATACTGAACCAGAATTACATCAATGGATATATGACGAGCACTATAGCATGGAA
    CTTGGTAGCGAGTTATTACGAGCAGTTGCCGTACGGTCGATGTGGTCTTATGACGGCGCAAGAACCGT
    GGAGCGGGCACTATGTGGTCGAGAGTCCCGTTTGGGTAAGCGCTCATACAACTCAATTCACGCAGCCT
    GGGTGGTATTATCTCAAGACCGTTGGACACCTTGAGAAAGGCGGGAGTTACGTAGCACTTACCGACGG
    CTTGGGGAACTTGACTATTATCATTGAAACAATGAGTCACAAACACAGCAAATGTATTAGGCCTTTCC
    TGCCATATTTTAATGTCTCTCAGCAGTTCGCTACTTTTGTGCTTAAAGGAAGCTTCAGTGAAATACCC
    GAGCTCCAAGTGTGGTATACCAAACTTGGTAAAACTAGTGAACGATTTCTTTTCAAACAACTCGACTC
    ACTTTGGCTGCTTGACTCTGACGGAAGCTTCACTCTCTCTCTTCACGAAGACGAGTTGTTTACTCTCA
    CTACGCTTACTACTGGACGCAAGGGTTCATATCCTCTTCCTCCAAAATCTCAGCCATTCCCTTCCACA
    TATAAGGATGATTTTAACGTCGATTATCCCTTCTTCTCCGAAGCGCCGAATTTTGCGGACCAAACAGG
    TGTATTTGAATATTTTACTAACATTGAGGACCCAGGAGAGCATCACTTCACCTTGCGACAGGTCTTGA
    ACCAGAGACCAATCACTTGGGCAGCCGACGCCTCTAATACCATATCAATTATCGGTGATTACAACTGG
    ACTAATCTCACAATTAAATGTGATGTCTATATTGAAACTCCTGACACTGGAGGTGTTTTCATCGCCGG
    GCGCGTCAACAAAGGAGGTATCCTTATCCGATCAGCTCGCGGTATCTTTTTTTGGATTTTTGCGAATG
    GAAGCTACCGAGTTACAGGTGATCTTGCTGGGTGGATTATCTACGCCCTCGGCAGAGTAGAGGTTACT
    GCCAAAAAATGGTATACTTTGACGCTGACGATCAAGGGCCACTTCACAAGTGGTATGCTTAACGATAA
    AAGCCTGTGGACGGACATTCCAGTGAACTTTCCTAAGAACGGTTGGGCCGCCATTGGGACACACTCAT
    TCGAATTTGCCCAATTCGATAATTTTCTCGTGGAGGCAACCAGAGATTACAAGGATGATGATGACAAA
    TA
    Figure US20230142867A1-20230511-P00025
    Figure US20230142867A1-20230511-P00026
    Figure US20230142867A1-20230511-P00027
    Figure US20230142867A1-20230511-P00028
    Figure US20230142867A1-20230511-P00029
  • The MCOLN1 protein sequence is:
  • (SEQ ID NO: 10)
    MTAPAGPRGSETERLLTPNPGYGTQAGPSPAPPT
    PPEEEDLRRRLKYFFMSPCDKFRAKGRKPCKLML
    QVVKILVVTVQLILFGLSNQLAVTFREENTIAFR
    HLFLLGYSDGADDTFAAYTREQLYQAIFHAVDQY
    LALPDVSLGRYAYVRGGGDPWTNGSGLALCQRYY
    HRGHVDPANDTFDIDPMVVTDCIQVDPPERPPPP
    PSDDLTLLESSSSYKNLTLKFHKLVNVTIHFRLK
    TINLQSLINNEIPDCYTFSVLITFDNKAHSGRIP
    ISLETQAHIQECKHPSVFQHGDNSFRLLFDVVVI
    LTCSLSFLLCARSLLRGFLLQNEFVGFMWRQRGR
    VISLWERLEFVNGWYILLVTSDVLTISGTIMKIG
    IEAKNLASYDVCSILLGTSTLLVWVGVIRYLTFF
    HNYNILIATLRVALPSVMRFCCCVAVIYLGYCFC
    GWIVLGPYHVKFRSLSMVSECLFSLINGDDMFVT
    FAAMQAQQGRSSLVWLFSQLYLYSFISLFIYMVL
    SLFIALITGAYDTIKHPGGAGAEESELQAYIAQC
    ODSPTSGKFRRGSGSACSLLCCCGRDPSEEHSLL
    VNDYKDDDDK
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for MCOLN1 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • MCOLN1-DNA
  • (SEQ ID NO: 11)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG ACCCTGACCCTGGCTCTGGGCCGGGCACTC
    TGCCGGATCCCAGCTCCAAACCCCTCCCCGGCTCCAGATCCACCCCCAGCCCTACTCCTGTGGAATCT
    TCTGACCCAAAGGCTGGGCACGACGCTGGTCCCGACCTTGTGCCCAGCCCAGACCTTGATCCTGTGCC
    CAGCCCAGACCCTGATCCTGTGCCCAGCCCTGATCCCAACCCTGTGTCCTGCCCTGACCCCTGTTCTC
    CCACTCGTGGCACTGTCAGCCCAGCCCTCCCTACCGGCGAGAGTCCAGAGTGGGTACAGGAGCAAGGG
    GCACTGCTGGGGCCTGATGGCTGAAGGAGACGCCGGCATCCTCGGGGGCCTGGGGAAGTTGTGTGTTG
    TGCAGTCAGTAAAATCCTCCCACTGCCTCCGGGCTCTGTGTCGTGGCTTGCCTTTTGTGCTTTCTTTC
    TGAGCACAGCCCCTTGCTCCGGAGATGGGACATTGTGGCCAGGGCCAGCCACAATCTGGAGCTCCAGT
    GTCCTGGCTTCCTTCTTGGGCCCACCAGCTAGACTTTCCAGCTTAACCCACAGTGTCTCATCCTGGTC
    CATCCCACTAGTCACCACTCTCTGTTCCTCTCACTTACCCCTTGCTCTTCAAAGCCCATACAGTAGGT
    ATACAAGTGGACAAAAAAAGTTGCTCATTTATGCAATCAACAAACATCTCTGGATTGCTGGGGTCTCA
    GCAGGGAACAAGATAAATATGGCCTCGACCTGCATGGAGCTCATAGATACTAAATTCAGAATACTTAA
    AAAATAATTACGGGGTATAGTACATTCTAGGAGAAGCATAACAAGACTTCTGATATAAATGGCAGGCA
    GCTTTCTCAATGAAGGATTTTGTAATCCCAATAATCACTAATTTAATAATCAGTACTGTTTGCCCAGC
    CTTATGCGATAGTTTTTGCATTCTCTCATTTAATCCTCTCAACAGCCCCAGTAGGTAGATGACTTTGA
    ATATCCCCATTTTGCAAATGAGAAAATTGAGGCACATTTTTTTTTTTTTTTTTAGACAGTCTTGCTCT
    GTTGCCCAGGCTGGAGTGCAGTGGTGTGATCATAGCTCACTGCAGCCTCGACCTCCTGGGCTCAAGCG
    ATCCTCCCACCTTAGCCTCCCGAGTAGCTGGGATTGCCGGTGCATGCCACCGCCCACTGCGCTCAGCT
    TGGAGTTGAAGGGACTCTGGAAGATGTAGAAGTGGCATTGTCAGTGCCTAGATTTAAATCCCAATTGC
    CCTCCAGGGTCCAAATTCTTAACCATTACGCTCCAGGGCAAAAGTATGCAAAGGCTCTGGGGCTATAG
    AAAGATGAGCTTTGGATGGAGGTAGGAGCCAGATCAGAGGGCCCTGATAGACGAGAGTGGGGACTCTG
    CCTGTCATTACAGAGCAATGGGAAGCCGAGGGCAGGTTCTCGCAGGAAGGATAGGAATTATTCTTTGA
    AGATGCTTGTGGCTGCTGGGTAGAGAGTGGAGTGGAGGGAGGCTGAGATCGGGGAGGAGGTTGCTGCA
    AAGATCCAGGCCAGGAATGTTGGAAGACTCTGGGCTGGGGGCCATGGGGTGGGGATAAGTGGTTCTAT
    TTGATACATAATTAGGAAATCGTGTTTGCTGAAGATGCGCAGGAGAAGGGTAAAAGGAGTTTCTGGGA
    GAAAGAGGAAGACAGCGTTGAGATAGTAGGCAGGGTCATCACCAGGCACCAAGGAGGATAAGGGGTCA
    AGCTCTGGACATGGAAGTCACAAGCCTGGCACCGGATTCGGGGCATGGCCGGGAGCCAGGGCAGAGCT
    CGTCGTTGCCAAACTCAGAGTCAGCCCATCCCCCGCCACCCAGAGCGCGTCGGCGCTAGGACCTAGCG
    ACTGCCTTCGACCCAGAGGGCGCCGGCAGAGGCACGCATGCGCGCTGTTCCGGCAGGGGTTGTCGTGG
    CGCAGGGGGCGGGACCAGAGGCGGTCACGTGAGGGGCTCTGGGCTACCGGGTCACGTGACCGAGGC
    Figure US20230142867A1-20230511-P00030
    Figure US20230142867A1-20230511-P00031
    ATGACTGCGCCGGCAGGTCCACGAGGGTCTGAAACAGAA
    CGGCTCCTTACTCCCAATCCCGGCTATGGGACACAGGCCGGTCCTAGCCCTGCCCCACCCACACCCCC
    TGAGGAGGAAGATCTGAGGAGACGGCTCAAGTATTTCTTCATGAGCCCGTGCGATAAATTTCGAGCTA
    AGGGACGAAAGCCGTGCAAGCTGATGCTCCAGGTGGTAAAGATTCTCGTAGTTACTGTGCAACTCATC
    CTTTTCGGTCTGTCCAATCAGTTGGCAGTCACTTTCAGAGAAGAAAACACTATCGCTTTTCGGCATCT
    GTTCCTGCTGGGTTATAGTGACGGTGCAGATGACACCTTCGCCGCCTACACTCGCGAACAACTTTACC
    AGGCTATTTTCCACGCGGTAGATCAATATCTTGCGCTCCCCGACGTTTCATTGGGGCGGTACGCCTAT
    GTTCGCGGAGGAGGCGATCCCTGGACCAACGGCAGTGGTCTCGCATTGTGCCAACGATACTACCACCG
    GGGGCACGTAGATCCCGCCAACGACACCTTCGACATAGACCCGATGGTAGTAACAGACTGTATTCAGG
    TGGACCCCCCCGAGAGGCCACCTCCACCACCTTCCGATGACCTGACGCTGCTCGAGAGCAGCAGTTCT
    TATAAGAATCTTACCCTTAAATTCCATAAATTGGTAAATGTAACCATCCATTTTCGCCTCAAAACTAT
    CAACCTGCAAAGCCTCATAAATAACGAAATTCCGGATTGTTACACTTTCTCTGTTTTGATAACATTCG
    ACAATAAGGCCCACAGCGGAAGGATTCCGATATCCCTTGAAACACAAGCACACATACAAGAATGCAAA
    CACCCTAGTGTCTTCCAGCATGGCGATAATAGTTTTAGACTGCTGTTCGATGTAGTTGTGATCCTTAC
    ATGCTCACTGTCATTCCTCCTTTGTGCACGATCACTTCTTCGGGGTTTCCTGTTGCAAAATGAGTTTG
    TGGGATTTATGTGGAGGCAACGAGGACGCGTCATATCTTTGTGGGAACGGCTTGAGTTCGTCAATGGA
    TGGTACATACTTTTGGTTACATCCGATGTGTTGACTATATCCGGTACAATTATGAAAATTGGCATTGA
    AGCGAAGAACCTCGCCAGTTATGACGTTTGCAGTATCCTTCTGGGAACGTCCACCCTCCTCGTATGGG
    TAGGAGTCATACGGTACCTGACTTTCTTCCATAACTATAATATACTGATCGCAACCCTTCGAGTCGCG
    CTTCCGAGCGTGATGAGGTTCTGTTGCTGCGTAGCTGTTATCTACCTGGGATACTGTTTCTGTGGATG
    GATCGTACTCGGCCCGTATCACGTAAAGTTCCGGTCCCTTTCTATGGTCTCCGAATGTCTTTTCAGTC
    TCATCAACGGTGACGATATGTTCGTCACCTTTGCTGCGATGCAAGCTCAACAAGGGAGGAGTAGTCTG
    GTTTGGCTTTTTAGTCAACTGTACTTGTACTCCTTTATCAGTCTTTTTATTTACATGGTGCTCTCATT
    GTTCATCGCACTTATCACGGGCGCCTACGACACGATTAAACATCCGGGTGGAGCCGGCGCCGAGGAAA
    GTGAGCTCCAAGCCTACATAGCGCAATGTCAAGATTCCCCGACCTCAGGCAAGTTCAGGAGAGGCTCC
    GGAAGCGCCTGTTCTTTGCTTTGCTGTTGCGGGAGAGACCCATCCGAAGAGCACTCCTTGCTGGTCAA
    TGACTATAAAGATGACGACGATAAATA
    Figure US20230142867A1-20230511-P00032
    Figure US20230142867A1-20230511-P00033
    Figure US20230142867A1-20230511-P00034
    Figure US20230142867A1-20230511-P00035
    Figure US20230142867A1-20230511-P00036
    Figure US20230142867A1-20230511-P00037
  • The PPT1 protein sequence is:
  • (SEQ ID NO: 12)
    MASPGCLWLLAVALLPWTCASRALQHLDPPAPLP
    LVIWHGMGDSCCNPLSMGAIKKMVEKKIPGIYVL
    SLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKD
    PKLQQGYNAMGFSQGGQFLRAVAQRCPSPPMINL
    ISVGGQHQGVFGLPRCPGESSHICDFIRKTLNAG
    AYSKVVQERLVQAEYWHDPIKEDVYRNHSIFLAD
    INQERGINESYKKNLMALKKFVMVKFLNDSIVDP
    VDSEWFGFYRSGQAKETIPLQETSLYTQDRLGLK
    EMDNAGQLVFLATEGDHLOLSEEWFYAHIIPFLG
    DYKDDDDK
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for PPT1 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • PPT1-DNA
  • (SEQ ID NO: 13)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG TGGTCTGGCACAATGGCTCACAGCTGTGAT
    CCTAGAACTTTGGGAGGCTGAGGGGGGCGGATTGCCTGAGCTCAGGAGTTCGAGACCAGCCTGGGCAA
    CATGGCGAAACCCTGTCTCTACAAAAAACACAAAAAATTAGCGGGGCATGTTGGTGGTGTGTCTGTAA
    TCCCAGCTACTCGGGAGGCAGAGGCACGAGAATCGCTTGAACCCGGGAGGTGGAGGTTACAGTGAGCC
    AAGATCACGCCACTGCACTCCAGTCTGGGGGACAGAACAAGACCCTGTCTCAAAAAAAAAAAAAAATT
    GCATTCAGATTGCTTCACAAGTGTCTTTCAGTTATTGCTCCAGTTTAAAGAAACCTTTAAAGAAATTA
    TAAATAGTTCATTAAAGGCAGGAAACTTCATTGAGCGGGCTATATTCACAGAAAAATCCTGAAACTAC
    TTCAATTTGTAAGTCTTAAGTTAAAATAAAATGCTTATACAATAAGCTTATTTTTCTTAAAACAAAAC
    TGTATACTGTACTGTATAATGCTTTATGATTCTGCATTTTTTTTTCTTTGAGACAGAGTTTCACTCTT
    GTTGCCCAGGCTGGAGTGCAGTAGCTCAATCTTGGCTCACTGCACCCTCTGCCTCCTGGGTTCAAGTG
    ATTCTCCTGTCTCAGCCTCCTGAGTAGCTGGGATTACAGGCACCCACCACCATGCCCGGCTAATTTTT
    TGTATTTTTAGTAGAGATGGGGTTTCATCATGTTGGCCAGGCTGGTCTCAAACTCCCAACCTCAGGTG
    ATCCACCTGCCTCAGCCTCCCAAAGTGCGCCCAGTGGATTCTGCATTTTAATCGACTTTTCAACTAAC
    AACTTACCTCTAGTCCCAATGGCATGGAATAGAAGAGATTACAGACCGTGGGTTTGTAATCGACTTTT
    CTTTTACAACATATTAAGGGCATTTTTCATGTCACTTTATGTAAATTTACCTCATTTTTCTTTTTTGA
    GAGAGACAGGGTCTCACTGTTGCCCAGGGCAACAGCGAGATCATAGCTCAGTGCAATGGCGAGATCAT
    AGCTCAATGCAGCCTTGAACTCCTGGGCTCAAGTGATCCTCCCACCTTAGCCTCCCCAGTAGCTAGGA
    TTACAGGCACATACCACCACCCCTGGCTAATTAAAAAAAAATTTTTTTTTTGTAGAGATGGGGTCTTG
    CTATGCTGCCTGGGCTGTACTCTCTTTTTATTTATTTTTTATTCTTATTTTTTGAGACAGAGTTTTGC
    TCTTGTTGCCCAGGCTGGAGTACAATGGTATGATCTCAGCTTACCGCAACCTCTGCCTCCCGGGTTCA
    AGAGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAAGTATGCGCCACCACACCCGGCTAAT
    TTTGTAATTTTAGTAGAGACGGGATTTCTCCATGTTGGTCAGGCTGGTCTCGAACTCGCTACCTTAGA
    TGATCTGCCCACCTCAGCCTCCCAAAGTGCTGGGATTACAGGAGTGAGCCACCGCGCCCGGCCCTATA
    CTCATTTTTTATAAAACTGGCCTCATATTATTCCATTGTATAGTGGATACACTCTAATTTCTTTATAA
    ATTATTTAAACTATTTTTTGATGTTTTGTTGAAAAAAAACTTGAGCAGGGAACTGATATTATGAAGGT
    GGATTGAGGGATGAATGTGGCAATAGAGCATAAAATGAATTAAAGAGGAGAGAATGGAAAAGGAAATG
    GTTGATGAGAGACCCTTGAAATAATCCAGCTGAGATTAGGCCAGTTAGTGGCCATGGGTTTCCCCCTC
    ACTCTTATGCATCCAACATATGCTTTCGTCCGTCTTGAAGTCCCGACATTTCTAGAAAACACAGTTGT
    CTTTCTCCTCTCTACCTTAGCTCATCTCAGTCCTTCTGAATTTCATCTTTGCTTGTTGAAATCCTACC
    TCTCCTTCAATATCCAGAACAAACCCCACACCCTGCCATGATTGGTCCTGGCTGGAAACCGTCCCTCC
    TTCTTTCTTCCTTCCTCTGTGCATGTTTCTTGCAAAGGCAGTACAGGGCAGTGGTGAAGAGCAACGAC
    TTTGGAACCACCAATTTGCTATCTCTGTGACCTTGGGCACCTTACTCAGTGCCAATTCGCAGTGCCTC
    AGTGTCCTCACCTGTCAGGGGGAGAAGTGCATACAGCATTGTTATGAAGAGTGAATTAGTTAAAGGTT
    GTAAGAATAGTGCCTGATAGCATCTGATCAGCACTGAATATATGTTAGCAATTATTTTGATTCACCGC
    AGAGGGCGGTCTACGAGAGCGCAGAGCCCCACTCGGCCAGCGGGGTCTGGCGGGGGACCTGTCGCGCT
    GAAAGCTCCAGGGTAGGGCCGACGCCCATCAGGCTGGGCATCCGTTCGGGATGCGCAGGTTGCGATCT
    GCAACCGGCGGCGCCACGCCCAGGCGGGCGGAGCGCGGTTCCCGGAGTCTCGCGCCCGCGGTCA
    Figure US20230142867A1-20230511-P00038
    Figure US20230142867A1-20230511-P00039
    ATGGCGTCTCCTGGTTGTCTTTGGTTGCTTGCCGTCGCGTT
    GTTGCCTTGGACATGTGCTTCTCGGGCTTTGCAGCACCTCGATCCCCCAGCCCCATTGCCTCTTGTAA
    TCTGGCATGGCATGGGTGATTCATGCTGTAATCCCCTCAGCATGGGGGCCATAAAGAAAATGGTAGAG
    AAAAAGATCCCAGGCATATACGTACTCTCACTTGAGATAGGGAAGACCTTGATGGAAGACGTTGAGAA
    CTCTTTTTTTCTCAATGTTAACTCCCAAGTCACTACGGTATGCCAGGCGCTGGCGAAAGATCCTAAAC
    TTCAACAGGGGTATAACGCCATGGGATTCTCTCAAGGTGGGCAGTTCCTTAGAGCGGTCGCCCAGAGG
    TGTCCATCCCCCCCAATGATTAATTTGATCAGTGTGGGTGGACAACATCAAGGCGTGTTTGGCCTCCC
    CAGATGTCCAGGTGAGTCATCTCATATCTGTGATTTCATCCGGAAGACATTGAACGCAGGTGCGTATT
    CAAAGGTAGTGCAGGAGAGGCTGGTGCAGGCTGAGTACTGGCACGACCCTATCAAGGAGGATGTCTAC
    CGAAATCACTCAATTTTCCTTGCAGACATCAACCAGGAGCGAGGCATTAATGAAAGCTATAAAAAGAA
    TTTGATGGCTTTGAAAAAATTTGTTATGGTCAAGTTCCTCAATGACTCAATAGTCGATCCGGTGGATT
    CCGAGTGGTTCGGGTTTTATCGGTCAGGTCAAGCGAAAGAAACAATTCCCCTTCAAGAAACGTCACTT
    TATACGCAGGACCGACTCGGCCTGAAGGAAATGGATAACGCTGGGCAACTGGTATTTTTGGCTACCGA
    GGGCGATCACCTGCAGTTGAGCGAGGAATGGTTCTACGCTCACATCATACCCTTTCTTGGGGACTACA
    AGGACGACGATGACAAGTA
    Figure US20230142867A1-20230511-P00040
    Figure US20230142867A1-20230511-P00041
    Figure US20230142867A1-20230511-P00042
    Figure US20230142867A1-20230511-P00043
    Figure US20230142867A1-20230511-P00044
    Figure US20230142867A1-20230511-P00045
  • The SMPD1 protein sequence is:
  • (SEQ ID NO: 14)
    MPRYGASLRQSCPRSGREQGQDGTAGAPGLLWMG
    LVLALALALALALALSDSRVLWAPAEAHPLSPQG
    HPARLHRIVPRLRDVFGWGNLTCPICKGLFTAIN
    LGLKKEPNVARVGSVAIKLCNLLKIAPPAVCQSI
    VHLFEDDMVEVWRRSVLSPSEACGLLLGSTCGHW
    DIFSSWNISLPTVPKPPPKPPSPPAPGAPVSRIL
    FLTDLHWDHDYLEGTDPDCADPLCCRRGSGLPPA
    SRPGAGYWGEYSKCDLPLRTLESLLSGLGPAGPF
    DMVYWTGDIPAHDVWHQTRQDQLRALTTVTALVR
    KFLGPVPVYPAVGNHESTPVNSFPPPFIEGNHSS
    RWLYEAMAKAWEPWLPAEALRTLRIGGFYALSPY
    PGLRLISLNMNFCSRENFWLLINSTDPAGQLQWL
    VGELQAAEDRGDKVHIIGHIPPGHCLKSWSWNYY
    RIVARYENTLAAQFFGHTHVDEFEVFYDEETLSR
    PLAVAFLAPSATTYIGLNPGYRVYQIDGNYSGSS
    HVVLDHETYILNLTQANIPGAIPHWQLLYRARET
    YGLPNTLPTAWHNLVYRMRGDMQLFQTFWFLYHK
    GHPPSEPCGTPCRLATLCAQLSARADSPALCRHL
    MPDGSLPEAQSLWPRPLFCDYKDDDDK
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for SMPD1 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • SMPD1-DNA
  • (SEQ ID NO: 15)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG AGCACTTACTAGCTAGCTGTGGGATCTTGA
    GCCACTTACTTAATCCTACTTTCCTTCTCCGTAAAATGGGGATAATTATAATACCGACTTCCAAATTT
    GTTGTGAGGATTAACGTAAGGCAGAGTCCTCAGAAAAGTACCTACAGAGGGTTATTCGGGCCAAGGGC
    CAAGGTTGAGGACTGCGACCCAGGACACACTTGCAAGTTGTCTTGGGGGAGCACTCTGGACAACAAAA
    GAGAGGCTCATATTTTCAAAGAAAAAAAGATGAATTAGGAGAGGAAATGAATACAAAAGTTGTTCATC
    AGGAATTCTCATTGTTTACAGAAATAACACTGGTTATACATTGTTTAGCGCTTGGCTATACATTGTTG
    AACTATAAGGTGTATGGCATTTAATGGCTACTTGGCATAGCAAGTGGTTTCAAGAGATAATTACTTAG
    CTCAAAGAAGAGTGAGAGTGACTGTTGCTACATTTTAAATGCCTGTCTGGGCCTGATAATTTAAAGGG
    GCTCATCTTCCTTAGATAAAAAGGTTTTGTTTTGTTTTGTTTTTCATAGTACATAATCTTAAAAATGC
    ATGCTAATTATTACTGTGAGTACTCTATCTGAGATGATGTCCCCAGAACTGGGGGGAAAAAACACTTG
    TAAAAAATTATCAAAGGAAAGAACTGACAGGACTTGATAGCTCTGAGGATTCCTCAGAAGCACTCGTC
    AAACAGCTTAGGGTTTAATTGCTATTCTGCCACTTAACTTCTTTGTGCCTCATCCTATCTGTAAAGGA
    GGGTAATAATAGTATTTGCAGGGACCAAGGGAAACTTTCCCTTCGCCCTCTGAAATTCTGCTGAAAAA
    TCAACCCACAAAAAACAGATTAATTGAAGAAAAGGCATAGAAATTGTATTAATGTGTCCACGGGGAAC
    CACAGAGTGACTACCCACTTCCCAGACGAGTTCAGAAGCTTAGGTGTCATTCAGGCAAAGGGGGTTGT
    GGGAGTGAGGGGAAGAGGAATTCTGCTGAGGGAGAAGGAATGGATCAGGGAACAGATTAACTAGCACA
    TTATCATGTGAAGGGGTCTGTTCAGGTGTGATTATATTCTTGGTTTTGTAGGGAGGGGAAGAAAAAAA
    CCGTTGTTCCTTTTGATGGGTCTGGACGCTAGGTAACGGCTTTGGGAGAGAGGGCGCGGGAGTAGGAA
    GGCCAGACACACCTTAAGGCTTAACTTAGTAAGTCAAACCACCATATTTTGGGTATGGATTTCTGAGC
    CCCAATATAATAGGTACTTCTTAGGGTTGTTGTGAGAATTGAAGATTATTAATTATAAAGTGCCTGGG
    TTGGTGCTTGGCACATCATAAACAACTCCATAAATGCTCGCTTTTACTGTCGTGGATACACGGAGGTG
    GGAGACCCAATTGGGTTCCCGCCCATGTTATTGGCCCCCACGGCTATATGGTCTTTACGGCTTCCTAG
    TCTCTTCACCATCTCTAAGGTTTGTTTCCTCATCTGAATCACAGGTGAGTGATTCTCAGGGTGACTCA
    AGAGGGCTGGTGCCAGTGGCAGAGCATGGGGAATCGAGGGAAGGCGCCTTCCTGTCAGAGGCAACACA
    CCCAGTGGGAGGATCCTGCCCGCCCTGCTCTCTCCTACCTCCCCTTCCCGGGCTAAACGCTGGGGTCG
    GTCTGGCAGCTATTCCGGGAATCTGAGCGCGGATTCTGACAAAGGAGACGTCTTCCACCGACCACATC
    ACATGGAGCTCCAAAGCACGAGCACGGCCTCCCGCGGCTGTGACCTCAAGGCGGAGCCCCTGGTGACC
    TCAGGGAGAGTCCCCACCCCCGCAGCCCGTGCGCCCGGGGCAGGGCGGGGGCAGGGAGAGGGGGCGGA
    ATCGGGGCGGTCCCGGGAGCGCCCCGCCCCCTCCGCCTCCGCAGCGTTGACAGCCGCCCGCCACCGAG
    AGATCAGCTGTCAGAGATCAGAGGAAGAGGAAGGGGCGGAGCTGCTTTGCGGCCGGCCGCGGAG
    Figure US20230142867A1-20230511-P00046
    Figure US20230142867A1-20230511-P00047
    ATGCCAAGGTATGGTGCTAGCTTGAGGCAATCCTGTCCTCG
    CTCTGGCAGGGAACAGGGACAGGATGGTACGGCGGGAGCCCCGGGCCTGCTGTGGATGGGGCTCGTGC
    TTGCATTGGCGCTCGCCCTTGCTCTGGCGTTGGCACTGTCTGACAGTAGAGTCCTTTGGGCGCCTGCA
    GAAGCCCACCCACTGTCTCCCCAAGGGCACCCCGCCCGACTCCATCGAATCGTACCTCGCCTGCGGGA
    TGTTTTCGGTTGGGGCAATTTGACCTGCCCAATATGTAAAGGCCTCTTTACGGCCATCAACCTCGGTC
    TCAAAAAAGAACCTAATGTCGCTCGCGTTGGCTCAGTTGCCATCAAGCTGTGTAATCTGTTGAAAATA
    GCTCCGCCGGCAGTGTGTCAAAGTATTGTGCATCTGTTCGAGGACGATATGGTTGAGGTATGGCGACG
    ATCCGTCTTGTCCCCCTCTGAGGCCTGCGGGTTGCTGTTGGGGTCTACATGTGGCCATTGGGACATAT
    TTAGTAGTTGGAACATCTCATTGCCGACGGTCCCGAAGCCCCCTCCTAAACCTCCTAGCCCGCCAGCG
    CCTGGTGCCCCTGTGTCTCGGATTCTCTTTTTGACAGATCTCCACTGGGATCATGATTATTTGGAAGG
    GACCGACCCCGACTGTGCTGATCCTCTTTGCTGCCGCCGCGGTAGTGGACTTCCTCCTGCCTCTCGAC
    CAGGGGCGGGCTACTGGGGGGAGTATAGCAAGTGCGACCTTCCCCTCCGAACCCTCGAGTCACTGTTG
    TCAGGGCTCGGCCCCGCAGGACCTTTTGACATGGTTTACTGGACAGGAGACATACCAGCACACGACGT
    TTGGCATCAGACTCGGCAGGATCAGCTGCGGGCGCTCACTACTGTTACAGCGTTGGTTCGCAAGTTTC
    TGGGTCCCGTCCCAGTGTACCCCGCTGTTGGAAACCACGAGAGTACGCCGGTGAACAGTTTCCCCCCT
    CCGTTTATAGAAGGAAATCACTCCTCTCGGTGGTTGTACGAAGCGATGGCAAAAGCCTGGGAGCCGTG
    GCTCCCTGCAGAAGCATTGCGGACATTGAGAATTGGCGGGTTTTACGCCCTGTCCCCTTATCCAGGCT
    TGCGGCTCATATCTTTGAATATGAACTTCTGCTCTCGGGAGAATTTCTGGCTTCTGATCAATTCTACC
    GACCCTGCAGGACAACTTCAATGGCTTGTAGGCGAACTGCAAGCGGCCGAAGACCGCGGTGACAAGGT
    GCATATAATCGGCCACATTCCGCCTGGGCACTGCCTTAAAAGTTGGAGTTGGAACTACTACCGGATCG
    TTGCACGGTATGAAAACACTCTTGCAGCGCAGTTCTTCGGGCATACGCATGTTGACGAGTTCGAAGTA
    TTTTACGACGAGGAAACTTTGAGTAGACCACTCGCCGTTGCATTCTTGGCGCCGTCAGCTACAACTTA
    CATAGGTCTGAACCCAGGGTACAGAGTATATCAAATAGACGGGAATTACAGCGGCAGTAGCCATGTAG
    TCCTGGACCACGAAACATATATACTCAACCTTACCCAGGCAAACATTCCGGGCGCGATTCCTCACTGG
    CAACTGCTGTACCGCGCACGGGAAACTTATGGTCTCCCCAACACACTTCCGACAGCCTGGCATAACCT
    CGTGTACCGCATGAGAGGGGACATGCAGCTTTTCCAGACATTTTGGTTTCTGTACCATAAGGGGCACC
    CCCCTTCTGAGCCTTGCGGTACACCCTGCCGCTTGGCCACCCTTTGTGCTCAACTCTCCGCACGGGCG
    GATAGCCCTGCATTGTGTAGGCATTTGATGCCAGATGGGAGCTTGCCTGAAGCCCAAAGTCTGTGGCC
    ACGGCCCCTGTTCTGTGATTACAAGGATGATGACGATAAGTA
    Figure US20230142867A1-20230511-P00048
    Figure US20230142867A1-20230511-P00049
    Figure US20230142867A1-20230511-P00050
    Figure US20230142867A1-20230511-P00051
    Figure US20230142867A1-20230511-P00052
  • The HEXB protein sequence is:
  • (SEQ ID NO: 16)
    MELCGLGLPRPPMLLALLLATLLAAMLALLTQVA
    LVVQVAEAARAPSVSAKPGPALWPLPLSVKMTPN
    LLHLAPENFYISHSPNSTAGPSCTLLEEAFRRYH
    GYIFGFYKWHHEPAEFQAKTQVQQLLVSITLOSE
    CDAFPNISSDESYTLLVKEPVAVLKANRVWGALR
    GLETFSQLVYQDSYGTFTINESTIIDSPRFSHRG
    ILIDTSRHYLPVKIILKTLDAMAFNKFNVLHWHI
    VDDQSFPYQSITFPELSNKGSYSLSHVYTPNDVR
    MVIEYARLRGIRVLPEFDTPGHTLSWGKGQKDLL
    TPCYSRQNKLDSFGPINPTLNTTYSFLTTFFKEI
    SEVFPDQFIHLGGDEVEFKCWESNPKIQDFMRQK
    GFGTDFKKLESFYIQKVLDIIATINKGSIVWQEV
    FDDKAKLAPGTIVEVWKDSAYPEELSRVTASGFP
    VILSAPWYLDLISYGQDWRKYYKVEPLDFGGTQK
    QKQLFIGGEACLWGEYVDATNLTPRLWPRASAVG
    ERLWSSKDVRDMDDAYDRLTRHRCRMVERGIAAQ
    PLYAGYCNHENMDYKDDDDK
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for HEXB and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • HEXB DNA
  • (SEQ ID NO: 17)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG 
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA 
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG 
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA 
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG GGAATGAAGTCCTCAAGAACCTGCCTGTCC
    AAGGATTCCTAAGGAAGCTGTTTATCCCGCTGTTTATCCCCAGAGGGTAAACTGTGTGGCTGGTTGCA
    GTATCTGTTCCAGACACATAACAGCCAGTCCCAAAAGCGCAGCTGAAACTCTGTTTCCTTTAACCATT
    CAGACATTTTTAAATGCATATTCTAAGAATTATTAATAAATGCAAATGAATGATTTGCCACAGATTCA
    TGGGCTAAAATACTAACCTGCACCTCCCCAACCAGATGTGAGCCCCACTATAAGACTGCACAGAAAAC
    CACAAAGGCAGGGGCATGACACATATACCTCATACCATTAGGTGATGCAACTAGTTTTCTTTCCTTTT
    TGGAATATAATTTTTTATGATGAGTAAACATGGCTGAGTGGAAGAAAGGAAGCAACAAAGTCCTAGAG
    CTCCAGTGTTAGGAAGACTTTACAAAGCACCCAGTAATTTCCACAGTGACTTGAGATATAAAAAATGG
    TGAGGAGCCAGCTTCACTCCCACCTCAAGCTAAGCATCTACTGTCTACTCCATTCTAACCAATGCCCA
    GTGTTTTGGAGAATGACCATATGAACCTATGGGAATGCATCAGGTTTCCAAAGGGTTAAAACATTCAT
    TAGCATTCTCAATGCTCCACCACTGTAGTTGCTATTTTAATTAGTACATTCAATCTGGAGTATCTGGA
    GGAAGACTTTAAACCTAAACCTCAATAGACTGTCTACTTACTCTTGTGAATCTGCCGCCTCTCTCAGA
    GCAAATGCTGCCCTGAGTGAAGGAAGAATGAGGCTCCCTGAAACTAGGTGCACAGGCACCAAAATGCT
    TACACATTGAGCTGGAATCATGCTAGGCCCAGAACACATTTCCTTGAACATTAAATTCAGGATGGTGC
    AAAACAACATTCACAACCAGACTTACACCTAAGAAAACAATTTGCTTAGAAGACCTTAGGATATAGGA
    TCTACATCTCTCCGCATCCTGTGTTTATGCTCCTGTATGTTCCAGGTTGGCCCAACAAGCCAGAAGGA
    AGAGCCTAGCCTGTCATCTTGGATACCACCCAGATGGAAAAGCACGTCACAGTCAAAATCACAATCAA
    CATTCTTGCAACAATGTTTGATTAAAATAAAAAGTAAACTTGGAAACCCAAGAGGAAATACTTGCTTT
    CTGGTTTAAAACTGTGCAAACCTCAAAATATCAGCAGAAACCATGTATACAAGTCACTGAACACACCT
    TCAACATAAAATGAATCATCTTTCGAGCATTTTTAAGCCCCCAGAGAAATCACATTTGCATACCAAGT
    AAGTGGCCAAGAAAATGCACACATGTACTTAACCTACGTCTGCAAAGAAGACAGAGAAAGGGCTGCTA
    TTACTGTGAACATCAAAAGGGCTTCAGTTACAGTAACTTCAGTTCACTTCATGCTCCTAACTCAGGCT
    CTCTGCCTCCAATGCTTAGCGTCCCGCACCCTTCTCCAGGTTTTGAACCTGATCTCAATCCAATCAAA
    AGAGAAACTAAGCAGATTTTTTTCTGGGCATTTACATTACCGGTAACAATAGAACATTATAAAGATCT
    TGTTAACAAAAGTAGCTTCTTGCCAGAGTCTTTTTATTAACACTTGCAACGCTATTTAACAAGCTGGC
    CATGCCTGTGTGACACCAAACCGCATGTGGATAAATTCAGTGTTTCATCTCCAGGCCCACCAATTCCT
    ATCCTTTGTAACTAATAATGTTAACACGACAGATCAAGAAACTAGTTTTTCAGAAAGCAGTTTTCATT
    CATTCTCTATAGACATTTCCGCCGCAGTGAACATGCCCGCACTCAGGGTGTCTGTGCATGGCCCCAAC
    TTTCCTCCAAGCCGGGGAGGATGCAACGTGGGCCAACCATCCCGCGCGCTCGGCCCTGGAGGCTGCGG
    GGAGCGACGCAGCGCGACATTCACCCCCAGGAAAGCGCTGCTTTCCGCAGAGGAGAGAGGCTGAC
    Figure US20230142867A1-20230511-P00053
    Figure US20230142867A1-20230511-P00054
    ATGGAGCTCTGCGGACTGGGTCTGCCTCGCCCGCCTATG
    CTTCTCGCGCTTCTGTTGGCCACTCTTCTCGCTGCGATGTTGGCTTTGTTGACACAGGTTGCTCTCGT
    AGTCCAAGTGGCAGAGGCCGCCAGGGCACCGTCCGTGAGCGCTAAACCCGGGCCCGCTTTGTGGCCGC
    TGCCTTTGTCTGTTAAAATGACTCCAAATTTGCTCCACCTTGCCCCGGAGAACTTTTACATAAGCCAT
    TCTCCCAACAGTACCGCAGGGCCGTCATGCACTCTTTTGGAGGAAGCATTCCGCAGATACCATGGATA
    TATCTTTGGTTTTTACAAGTGGCACCACGAGCCGGCCGAGTTCCAAGCTAAAACCCAAGTGCAGCAAC
    TGCTCGTATCTATTACACTCCAAAGTGAGTGTGACGCCTTCCCCAATATTAGTAGTGACGAGTCATAT
    ACCCTCCTCGTTAAGGAGCCCGTTGCTGTTCTGAAAGCTAATCGGGTCTGGGGTGCTCTGCGAGGCTT
    GGAGACTTTCAGCCAACTGGTGTATCAGGACTCATATGGCACCTTTACGATTAATGAGTCTACGATCA
    TTGATAGTCCCCGGTTCTCCCACCGCGGTATATTGATTGATACATCCCGGCATTACTTGCCAGTCAAA
    ATCATTCTGAAAACGCTGGACGCTATGGCGTTTAACAAATTCAACGTTCTTCATTGGCATATAGTAGA
    CGATCAATCATTCCCTTATCAGAGCATTACCTTTCCAGAACTGTCTAACAAGGGATCTTACTCACTCT
    CACATGTGTACACGCCTAATGACGTTAGAATGGTGATTGAATACGCGCGATTGAGGGGTATTAGAGTT
    CTCCCGGAGTTCGACACACCTGGCCATACTCTCAGCTGGGGCAAGGGTCAAAAGGACTTGCTTACACC
    TTGTTATTCCAGACAGAATAAACTTGATTCATTCGGTCCCATTAACCCTACGCTGAATACTACATATA
    GTTTCCTTACTACGTTCTTTAAGGAGATAAGTGAGGTTTTTCCCGACCAATTCATTCATTTGGGAGGT
    GACGAGGTGGAATTTAAGTGCTGGGAAAGCAATCCAAAGATTCAAGACTTTATGCGGCAGAAGGGTTT
    TGGCACCGACTTTAAGAAACTTGAATCCTTTTACATTCAGAAAGTTCTGGACATAATCGCCACAATTA
    ACAAGGGTAGTATCGTGTGGCAGGAGGTATTCGATGACAAAGCTAAACTGGCCCCGGGAACTATCGTA
    GAAGTCTGGAAGGATTCCGCTTACCCGGAGGAACTGAGCCGCGTCACAGCGAGTGGCTTTCCTGTGAT
    ATTGTCAGCACCTTGGTATCTGGACCTTATCTCCTATGGGCAAGACTGGCGCAAATATTATAAGGTTG
    AACCTCTGGATTTTGGCGGGACTCAAAAACAAAAACAACTCTTCATCGGTGGCGAAGCCTGTCTGTGG
    GGTGAATACGTAGACGCTACGAATCTCACTCCCAGACTCTGGCCACGAGCCTCTGCTGTCGGGGAACG
    ACTGTGGTCTTCAAAAGATGTAAGGGATATGGACGACGCCTATGACCGGCTGACTAGACACAGATGTA
    GAATGGTAGAGAGGGGAATTGCTGCACAGCCGCTTTACGCCGGGTATTGCAATCATGAGAACATGGAT
    TACAAAGACGATGACGACAAATA
    Figure US20230142867A1-20230511-P00055
    Figure US20230142867A1-20230511-P00056
    Figure US20230142867A1-20230511-P00057
    Figure US20230142867A1-20230511-P00058
    Figure US20230142867A1-20230511-P00059
    Figure US20230142867A1-20230511-P00060
  • The SGSH protein sequence is:
  • (SEQ ID NO: 18)
    MSCPVPACCALLLVLGLCRARPRNALLLLADDGG
    FESGAYNNSAIATPHLDALARRSLLFRNAFTSVS
    SCSPSRASLLTGLPQHQNGMYGLHQDVHHFNSFD
    KVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDF
    AYTEENGSVLQVGRNITRIKLLVRKFLQTQDDRP
    FFLYVAFHDPHRCGHSQPQYGTFCEKFGNGESGM
    GRIPDWTPQAYDPLDVLVPYFVPNTPAARADLAA
    QYTTVGRMDQGVGLVLQELRDAGVLNDTLVIFTS
    DNGIPFPSGRTNLYWPGTAEPLLVSSPEHPKRWG
    QVSEAYVSLLDLTPTILDWFSIPYPSYAIFGSKT
    IHLTGRSLLPALEAEPLWATVFGSQSHHEVTMSY
    PMRSVQHRHFRLVHNLNFKMPFPIDQDFYVSPTF
    QDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRS
    RDPHETQNLATDPRFAQLLEMLRDQLAKWQWETH
    DPWVCAPDGVLEEKLSPQCOPLHNELDYKDDDDK
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for SGSH and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • SGSH-DNA
  • (SEQ ID NO: 19)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG 
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA 
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG 
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA 
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT 
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG CACAGTCTCGGCTCACTGCAAGCTCCGCCT
    CCCGGGTTCACGCCATTCTCCTGCCTCAGCCTCCCAGGTAGCTGGGACTACAGGCACCCGCCACCACA
    CCTGGCTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCCTGTTAGCCAGGATGGTCTCCATCT
    CCTGACCTTGCGATCCGCCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCTTGAGCCACCGCACC
    CAGCCTAATTTTTGTATTTTTAGTGGAGACGGGGTTTCACCATTTTGGTCGGGCTGGTCTTGAACTCC
    TGAGGCCTCCATGAGTCATCTCGCTGGCCCCTCCATAACCAGCTCAGTGTCTGTGTGTGCCAAACAGG
    CATAGGAGCCTGGGTCGGCCCAGAGACAATCTGCGTCACACACTGTAAACCAAAGGTTCCTTGGGGCC
    CAGGGGCTGGAAACCTGGAATCGCTGATGGTGTCTGGGATTGATACTGTGTATTCTGCACTGTTTCTG
    AGATATGGCTGGCAGCAGGGTCAGACGCCTCACCTGGGGCGGGAGTCCAGCCACTTCAGCATAGGCCA
    GCGCCTGGGGAATGGCAGTGAGGCCAACTGAGAGGCCGGCGACGAAATCCATCTTCAGCCACTGCAGG
    GAGTAGCTGGGCAGCCACGCCAGGATGGGCAGCCTCCTCTGCAGGGCCGCAGGGGAGCAGCAGCAGGC
    GCTCGGGGCCATCCCGGGGCCAGAGGACCTGGCCTGACCCAGCGCCGTCACCGAAGAAGGCATCTCTA
    CGGTGGGGCTGGGGGGTGCAGGTGACACAGACCAGCAGTCAGAGCGTGGCCTTTGAAGAACCCGGCAA
    CACTGGTCAGGGTCGGCGGGTGTCAGGAGAGGTCACTATGGGTCTCCCCTCCCCAGGGGGGCGAACGG
    CTCTAAGAGGAAGCCAACAATGTGTGCAAGATGGGGACTGGGCCAAGACTGGAAGGCCCACCCCGTCC
    CTCATCACCCTCTCTCCCTGAATCCCCCCCGTGCAAATCCTGGGTGCCTTCTGTCCCTCCATCCTCAA
    CTCTCCCAGATAAGAAACTGATAAGCGATTCCTCATTGAGAAGTCTGAAGACTAATTAGGCTCAACAC
    TTCTGGCCCCTGGAGAACCGACTTCCTTTGCCTTTTTTTCCGTCTCCCCAGGGAATTTAACGGGGAGC
    AGCGCAGAGAGAAGCTGGGCCCGCGGGCGCACCGTAGGCTGGCGGGGGGTGCGGGTGGCAGCCAGGAG
    ATCCAGCGGGGCTCCGAGGTTTTCCAACTGCGCACCAAGGCGGGCTCCACCCGCTACGGTGGACTGCG
    GCGCTCGAGGGAGAGTGGGGGGGTCCTCCGCTCCTGAGGCCACGCCTCGCTCAGCCCCCACCTGTGGC
    TTTCACAGCCCTGGAGTTCTAGTTGCTTCTGGGAGGACATCAGAGGCGCCCGGTTAATTCAGGTGCGC
    AGCATCTTCCAGGGCCCCGGCGCCCGCGCGCAGAGCCGCCACCCTCACCCCCAGCCCCCCACGCCCCC
    GGCCGCTCCGCTCCGGGCAAAATCCACTCCCCGGCCCCCGGCCCCGCCCACAAGGCCGCTGGCTCCGG
    ATCCTGGGGTCTCCGGAGGAGACCCTGCGCCACCTGACCGTCCGCGGTCCCTTGGCCCCGAGCAAGGC
    CGAGGGCCGCGGGGACCCGGGTAGGGACGCGGCGGGGGAGTGGTCCCCACGGGAGGCCCCCGCACTTC
    AGGACTCCGCGCTGGCCCCCGCGTCGGCTGCCCCCGGCGACCCCGGCCACTGTCCGCTCCCCCTGCCC
    CTGGTCCCCGCTCGCCGCCCTCGCGGGCCACGGACTCACCCACTCGGCGATGCAGCTTGGGGTCTCCT
    GGCGTCGCCGCAGCTGGGTCCGCAGGAATCGCCGGGGACAGGCCGGGGCGGGGTCTGGCGCGCACGTG
    ACCGGGGTCGGGGCAGGGGGCGGGGCCGAGCGGGAGACCAGAGAGCCGGAGCCGGATCCCGATCC
    Figure US20230142867A1-20230511-P00061
    Figure US20230142867A1-20230511-P00062
    ATGTCTTGCCCGGTACCGGCCTGCTGTGCTCTCCTTCTG
    GTCCTCGGTCTGTGTCGCGCTCGCCCCCGAAATGCGTTGCTCTTGCTGGCCGACGACGGGGGCTTTGA
    AAGCGGTGCGTATAACAATAGTGCTATCGCAACCCCTCACCTTGATGCACTCGCACGCCGGTCTCTCC
    TGTTCAGGAACGCCTTTACATCCGTCAGTTCTTGTTCACCTAGTAGGGCTAGTCTCTTGACAGGCCTC
    CCCCAGCACCAAAACGGGATGTACGGACTGCATCAAGACGTGCATCATTTCAACAGTTTCGACAAAGT
    TAGATCCTTGCCCCTCTTGCTTAGCCAGGCGGGTGTGAGGACCGGCATAATTGGAAAAAAACATGTGG
    GGCCTGAGACTGTCTACCCTTTCGACTTCGCTTACACGGAGGAAAATGGTTCAGTCCTGCAAGTAGGG
    AGAAACATTACCAGAATTAAATTGCTTGTGAGAAAATTCCTGCAAACCCAAGATGATCGCCCATTTTT
    TCTGTATGTCGCCTTTCATGACCCTCACCGCTGTGGTCACAGTCAGCCTCAATACGGTACATTCTGTG
    AAAAGTTTGGCAACGGGGAAAGTGGCATGGGCCGGATTCCTGATTGGACACCACAAGCGTATGATCCA
    CTGGACGTCCTTGTCCCGTATTTTGTTCCAAACACCCCGGCTGCTCGGGCAGATTTGGCAGCACAATA
    TACTACTGTCGGGCGAATGGATCAGGGAGTTGGTCTCGTTCTGCAAGAGCTGCGGGACGCAGGTGTCC
    TGAACGATACGCTGGTAATCTTTACCTCAGACAATGGTATCCCGTTCCCATCAGGGAGGACGAACCTT
    TACTGGCCTGGTACTGCAGAGCCTCTGTTGGTGTCCTCCCCAGAACATCCCAAGAGGTGGGGCCAGGT
    GTCCGAAGCTTATGTTTCTCTTCTTGACCTTACCCCGACCATACTTGATTGGTTTTCCATCCCCTATC
    CGTCTTATGCAATATTTGGTTCTAAAACGATACATCTTACCGGCCGATCTCTTCTGCCGGCGTTGGAG
    GCGGAACCCCTGTGGGCCACTGTATTTGGTTCACAAAGTCACCATGAAGTCACGATGAGTTACCCGAT
    GCGAAGTGTCCAGCACAGGCACTTCCGACTCGTGCATAATTTGAATTTCAAGATGCCGTTTCCAATCG
    ACCAGGATTTTTACGTAAGCCCGACTTTCCAAGACCTGCTGAACCGCACAACAGCGGGTCAACCAACG
    GGGTGGTACAAAGATTTGCGCCACTACTATTATCGAGCTCGATGGGAGCTGTACGATCGGAGTCGAGA
    TCCTCATGAAACTCAAAATCTGGCGACAGATCCTAGATTTGCGCAGCTCCTGGAGATGCTTCGGGATC
    AGCTTGCGAAATGGCAATGGGAAACCCATGATCCTTGGGTGTGTGCGCCAGATGGTGTACTCGAGGAG
    AAACTGTCCCCGCAATGTCAGCCCCTGCACAATGAGCTTGATTACAAGGACGACGACGACAAGTA
    Figure US20230142867A1-20230511-P00063
    Figure US20230142867A1-20230511-P00064
    Figure US20230142867A1-20230511-P00065
    Figure US20230142867A1-20230511-P00066
    Figure US20230142867A1-20230511-P00067
    Figure US20230142867A1-20230511-P00068
  • The HEXA protein sequence is:
  • (SEQ ID NO: 20)
    MTSSRLWFSLLLAAAFAGRATALWPWPQNFQTSD
    QRYVLYPNNFQFQYDVSSAAQPGCSVLDEAFQRY
    RDLLFGSGSWPRPYLTGKRHTLEKNVLVVSVVTP
    GCNQLPTLESVENYTLTINDDQCLLLSETVWGAL
    RGLETFSQLVWKSAEGTFFINKTEIEDFPRFPHR
    GLLLDTSRHYLPLSSILDTLDVMAYNKLNVFHWH
    LVDDPSFPYESFTFPELMRKGSYNPVTHIYTAQD
    VKEVIEYARLRGIRVLAEFDTPGHTLSWGPGIPG
    LLTPCYSGSEPSGTFGPVNPSLNNTYEFMSTFFL
    EVSSVFPDFYLHLGGDEVDFTCWKSNPEIQDFMR
    KKGFGEDFKQLESFYIQTLLDIVSSYGKGYVVWQ
    EVFDNKVKIQPDTIIQVWREDIPVNYMKELELVT
    KAGFRALLSAPWYLNRISYGPDWKDFYIVEPLAF
    EGTPEQKALVIGGEACMWGEYVDNTNLVPRLWPR
    AGAVAERLWSNKLTSDLTFAYERLSHFRCELLRR
    GVQAQPLNVGFCEQEFEQTDYKDDDDK
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for HEXA and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • HEXA-DNA
  • (SEQ ID NO: 21)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG 
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA 
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG 
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA 
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT 
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG CCATCTCAAAAAAGAAAAAAATAAAGAGAG
    ATGGGATTTCCTCGTATTACCCATGCTGATCTTGAACTCCTGGGCTCAAGTGATCCTCCCATCTTGGC
    CTTCCACAGTGCTGGAATTGCAGGTGTGAGCCACCATCCCCAGCCTGTGATCCATTTTTAATTAAGTT
    TGTGTATGCTGTGAGGTAGGGGATCATAGTTCATTTTTCCCACATAGGGATATCTATTTTCTTCATCA
    CCTTTTATTGAAAATAACTTGCCTTCCCCACTAAATTAAATTGGTGTCTTCATTGAAATGGAGTAATA
    ATCAAATGGCTCTTTTCCTGGACCCTCTATTCCGTTCCATCTATTTGTCTGCCATTATGCCAAAACCC
    CACTCTAGTGTATAGTAAGGGGGGATATTTTGGAAGATCAACAAATTTATCTTGTGTAACTTACTTTT
    ATGTATGCATAAGAATATTTGGTTTAAAAATCTGGTTAAGTCTAAAAGAGCTCTTTCAGTAATAATTA
    AATAACTGAATCCCTGTAATCCACCATGCTCAGCCCCATTAGTAATTTTTATAGCGTCTTTATTTGGA
    TCTAGCTCTTTCATCTTGTGTAACTCTTTTAAGTAAATGACTCCTCCCTTCTGGTACAGGTTGAATAT
    CCATAAAAGGGTGCCTTGGCATGTGCTCAGTCCTGAAATTCTACTTTGATCTACTGCTCTCATCTTTG
    AAACTGCTCTAAACACTCCCCCAAATTATTGATCCTGTGGGCCTGATTATGAACTTGGCTCTTCTGAC
    AATACTTCTCAAATCCTGTACCTTTGGTAACATCTCTCCTGACTGGCAGAGTACCCAGACACTTAATT
    AATGCTGATGAAAATTAAGATTTAAGCAAAGAAGAAATTGGTGTAAGAATGCAAGCCTTAGGATCTGA
    AACTTGATGACTGTGGGATCTAATTTCTGCTTCTTTTGCTACAAGCTCTGTTTTTGAATAATCAACTT
    TCAAAGACAAATTGCTTGATGAATCTCTGTATTGTCAGAATGTTTCTTTGATTTCCCATCAAAGCCAT
    GTATGTGTAACTAGTAATTAGGTACCTACGACCTGAATATAAATTATCATTATTTCAGTAGCCCTTTA
    TTTGTTTTTGAGAAAGGGTTTCACTATACTGCCCAGGCTAAACTTGAACTCCTTGGTTCAAGTGATCC
    TGCCGCCTCAGCCTCCTGAATAGCTGGGAGTAGCTGGGACTACAGGGATGTGTTATCATGCCCAGTGA
    AAGCCCTTTATTTTTAAAAGTTCATTTTCCTGATCTTTACATGTATTTTAATACTAAAATTAAATTTA
    AAAATTTTAAAAAGTACCTACTAACCAAATAATGCCACCCAATAACAATAGCATTATTTGTATTATGT
    TCTGGTAATATAGTGGTGAACAAGGAAGACAGGGTCTCTGATAACTGTGGAATTCTCCTTGATCACTT
    CTCAAGCCTCTCTTCCGGATACTATTTCTTGGATCTACAGCTGTCTTCCTTCCCCACTTCTGTAGTTC
    AGTCTCTCATTATTCTCTACCTCCAATAAAAAGGGGCCTCCTAACTGGTTTCTTATGCAACTACTCAA
    CATGGTATTTCTAAAATGCAGATCTTTCGGGATGATTTTCAATTGCTCAGAGAATAAATTCCTTGACT
    TTCTACGTAAGGCCCTGCAGGACCAGGCCCCTGTCATTATCTCTGAGCACTCTCCATCTTGCACCCAA
    AGCCCCAGCAATATCTTACTGCTTTCAGTAAATACACACCGGGCTATGTGGGGTTTTGTCTGGGGAAA
    GGGGATGTGAGTTGTAACTAAGAGATTTATAGTCAATCTTGAAGGGAAGAGCTCATATTGGAGGCAAT
    GGAATGCCTCAAGCTGGCTCTGAAAGTACACATTTTTTCCAGTCATAAGTAGATCACATTTGTGTGAT
    CTTGTGATCCTCTCTCTAGCAGAGCCGTGCAGACTACCTACTTCCGGCCCTTTCTTGCCCCACGAAAT
    TTCACATCACAACGACTTGTGGTTTTAATCCTCCGTTTTTCTGCTTCTGAAGTTACTTCAGCCTGGCA
    AGTCCTTTACCTCCCCGTAGGCCTGGCGAGCTGCATCACAACATTCAAGATTCACCCTAGAGCCATCT
    GGGAAACTTTCTTCTCCAGGTCGCCCTGCGTCCTCGCCTCCCCACCCCGTTCTTCTCGAGTCGGGTGA
    GCTGTCTAGTTCCATCACGGCCGGCACGGCCGCAGGGGTGGCCGGTTATTTACTGCTCTACTGGGCCC
    GTGAACAGTCTGGCGAGCCGAGCAGTTGCCGACGCCCGGCACAATCCGCTGCACGTAGCAGGAGCCTC
    AGGTCCAGGCCGGAAGTGAAAGGGCAGGGTGTGGGTCCTCCTGGGGTCGCAGGCGCAGAGCCGCCTCT
    GGTCACGTGATTCGCCGATAAGTCACGGGGGCGCCGCTCACCTGACCAGGG
    Figure US20230142867A1-20230511-P00069
    Figure US20230142867A1-20230511-P00070
    ATGACGTCTAGCAGATTGTGGTTCAGTCTGTTGCTCGCCGCGGCGTTTGCCGG
    CCGAGCCACCGCATTGTGGCCCTGGCCCCAGAATTTTCAAACTTCCGATCAAAGATACGTCCTCTATC
    CTAACAATTTCCAATTTCAATATGATGTGAGTTCAGCTGCACAGCCGGGATGTAGCGTCTTGGACGAG
    GCGTTCCAAAGATACAGAGATCTCTTGTTTGGCTCAGGCTCATGGCCACGGCCGTACCTTACTGGCAA
    GCGGCATACACTTGAAAAGAACGTGCTCGTCGTAAGTGTAGTAACTCCGGGTTGTAATCAGTTGCCCA
    CGCTCGAGAGCGTCGAAAACTACACACTCACGATTAACGACGATCAGTGTCTGCTCCTTTCAGAGACC
    GTATGGGGGGCGCTGAGGGGCCTTGAAACATTCTCACAGCTTGTTTGGAAATCAGCGGAGGGAACGTT
    TTTTATCAACAAAACGGAAATAGAAGATTTCCCCAGATTTCCACATAGGGGCCTGTTGCTGGACACCA
    GTCGCCACTACCTTCCACTCTCATCCATACTTGATACCTTGGACGTAATGGCCTATAACAAACTGAAT
    GTTTTTCACTGGCACCTTGTAGATGATCCATCCTTCCCTTACGAAAGTTTCACTTTTCCCGAGCTCAT
    GCGGAAGGGCTCCTATAACCCTGTTACACATATATATACAGCTCAAGATGTAAAAGAGGTGATCGAAT
    ACGCACGCTTGAGGGGGATACGAGTGCTCGCTGAGTTTGACACACCAGGTCACACACTTAGTTGGGGG
    CCTGGCATTCCTGGGCTTTTGACACCCTGCTATAGCGGATCAGAGCCCTCAGGGACTTTTGGACCTGT
    GAATCCATCTCTGAATAATACGTACGAATTTATGAGCACATTTTTCCTGGAGGTCAGCTCCGTGTTTC
    CCGATTTCTATCTGCATCTGGGAGGCGACGAGGTGGATTTTACCTGTTGGAAATCCAACCCCGAAATC
    CAGGATTTCATGCGGAAGAAGGGCTTCGGTGAAGATTTTAAGCAACTCGAGAGTTTCTACATACAGAC
    CCTTCTTGATATCGTCTCTAGCTACGGTAAGGGGTATGTTGTATGGCAAGAAGTATTTGACAACAAGG
    TAAAAATACAGCCCGACACCATAATCCAGGTCTGGAGGGAAGATATCCCGGTTAATTATATGAAGGAG
    CTCGAGTTGGTTACAAAAGCAGGTTTCCGAGCGCTTCTCTCTGCTCCTTGGTATCTGAACAGGATCAG
    TTATGGTCCGGATTGGAAGGACTTTTATATAGTTGAGCCATTGGCATTTGAGGGGACACCTGAACAAA
    AAGCTCTTGTCATCGGAGGGGAAGCCTGTATGTGGGGCGAATACGTAGATAATACGAACCTCGTGCCG
    CGATTGTGGCCCAGGGCCGGGGCGGTAGCGGAAAGGTTGTGGTCAAACAAACTTACGTCAGATCTCAC
    CTTTGCCTACGAGCGATTGTCCCATTTCCGCTGCGAATTGCTCAGGCGGGGAGTCCAAGCCCAACCCC
    TCAATGTAGGTTTCTGCGAGCAGGAGTTCGAACAAACCGATTATAAAGATGACGATGACAAATA
    Figure US20230142867A1-20230511-P00071
    Figure US20230142867A1-20230511-P00072
    Figure US20230142867A1-20230511-P00073
    Figure US20230142867A1-20230511-P00074
    Figure US20230142867A1-20230511-P00075
    Figure US20230142867A1-20230511-P00076
  • The IDS protein sequence is:
  • (SEQ ID NO: 22)
    MPPPRTGRGLLWLGLVLSSVCVALGSETQANSTT
    DALNVLLIIVDDLRPSLGCYGDKLVRSPNIDQLA
    SHSLLFQNAFAQQAVCAPSRVSFLTGRRPDTTRL
    YDFNSYWRVHAGNFSTIPQYFKENGYVTMSVGKV
    FHPGISSNHTDDSPYSWSFPPYHPSSEKYENTKT
    CRGPDGELHANLLCPVDVLDVPEGTLPDKQSTEQ
    AIQLLEKMKTSASPFFLAVGYHKPHIPFRYPKEF
    QKLYPLENITLAPDPEVPDGLPPVAYNPWMDIRQ
    REDVQALNISVPYGPIPVDFQRKIRQSYFASVSY
    LDTQVGRLLSALDDLQLANSTIIAFTSDHGWALG
    EHGEWAKYSNFDVATHVPLIFYVPGRTASLPEAG
    EKLFPYLDPFDSASQLMEPGROSMDLVELVSLFP
    TLAGLAGLQVPPRCPVPSFHVELCREGKNLLKHF
    RFRDLEEDPYLPGNPRELIAYSQYPRPSDIPQWN
    SDKPSLKDIKIMGYSIRTIDYRYTVWVGFNPDEF
    LANFSDIHAGELYFVDSDPLQDHNMYNDSQGGDL
    FQLLMPDYKDDDDK
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for IDS and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • IDS-DNA
  • (SEQ ID NO: 23)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG 
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA 
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG 
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA 
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT 
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG AAATAAGATCACATTCTGAGGTACTGGGTG
    TTAGGACTCCAACATACAATTTTTGGGGGGAGGACACTAACACCTAAGTCAAAAAGGACCAAAAGACA
    TACCATACAAAGCAAAGGCCCTCTGCCCTGCACCTTCCCTCTTCTGTTCTCATTTTAAATGTTTCTAC
    TTTCATTTCTTCTAGTAGATTGTCCCTGTCTTTCTAAATACTATGCTATTTCTATATCAACTTTCTAA
    TGTTTTCAACTTTAGACATTGCTGACTTCTTCCTATGATAGATGAGATCACTTATCCATTCCTCTTTT
    TATGTACATATTTTTAACCTTGGATTTTGGAAATTGATCAAACCTATACAAGATTACAGAAAATAGTA
    TAATAAACCCCAGGTGTTCATCAATGAACTTCATTGGTTATCAACTCATGACAAATTTTGTTTCATCT
    CTCCCTGCCATTTTCCCTTCTCCTTTTACTATATAGATTTATTGCCATTTTTAAGTTTTCTATTGTTT
    ACTTTTGTATCTTTAAGTAATATCCCTAAATCTGTATTTCTCTTTCTATGAACTGGGGACAGCGACTA
    TTGATTTATTATTATTATTATTATTATTTTTTTGCGATAGGCTGGGGCAAAGGAGACAATCCTAAATT
    GCTAACCCTAAATTAATGTGCCTACATTTCCTGCCATCTCCCTTCCTCCTCCCAGCTTTTTGTCAACT
    GTTGCTGCAGGCCAGGTTCCCAAGGAAGCTGACTAAGATGTAGAGTAATGTGCAGGAAGTTTCTTAGG
    GAGTACTCTTAGGATCAACACCTGTAGAAGGGGAGGGGAGGGAAGAAATCAAAATTCAACAGACAGAG
    AAAGTGGGCAGTCTCAAAAAAGGCTTCAGTTAACCCCACAGGGAGCTCTGAAGCTGAAATGGCCTTTC
    AGAGATGTCCCAAGTTGGGGTAAGGGGCTGCACCTTTGTACCTTCAAGTTGAATCACTGATGTGGGCT
    TCCTGAGAACAGGGGTGACCTTGGCTGAGGTACCTATCTTTTTCTAAGACAATTTCTGATGAGGGCTA
    ACGACAGCTAACAGCTCTCAAGCCAGCTAGGGAAATAAAATTTTTGGTGTTGAAGGGAGATCTGAGCA
    GTGCACCACAGCACCATCACAGCTGTATTTGTACCTTCCCATTGTCCATTGTTGTTTAGTCATAATTA
    AGTCATCTGGGCTTTGTTATAGATTGATTCTCAAAGTTGAAAATCAAACTGTGTTTGCAGTATTAGGA
    CGGTATAATAGTATTCAGTACAGTACAGTGTAGGGCTAGGATTCCATCTCCCCACAATTGCTGCTCTC
    TCTGTTTGGCGCACCCTTCTTGCAGACCTCAATGCGGCTTGCTCCCTCACTTGCTTCAAAATTTTGTT
    CAAATGCCACTTCACTAGTGAGGCCCTGCCTGGCCACCCCATAGAAAATAACAAGCATGCCCCGCACC
    CACGCTTTATTTTTCTCCAGACCGGGTATGCCCACCGACACTCTCTATATTTTTTTATTGTTTTAAGC
    TCCACCAAAGCAGGGACATAGAGCCCCAGTGCCACCAACAGCGCCTGGTATACATATTTGCGATACTT
    AATAAGAATTAGTTGAAGAAATGAATAGGAAAGGGAGTCCCTTTAGATGGAGGAAGGGACTTTCTCAA
    AAGGCCTCATGGAAGGAGAATGGCATTGAGGCTGGCTTACAAGAAGTGTGTCTTTTATAAAAAGAAGG
    AAGGGCAGGGCTTCCAGGTGAAAGGCATAGCACAAGCAAAGGCAAAAAGACGGGTAACTGCTAGTTGT
    TAAGGGCAGTGGCTGAATCGCTATACCCCTAGCAGTTCTACACCTAGGGGCCTGCGGGCCGCTGGGTC
    TGCGAAGCTTGCAAGGCAACTGGCCCGCCCCCGCTCTGCGCCTGTCTCTCGGCCACGCCTATTGCTGC
    AGGATGACGCGCACCTCTAGAACCCGCCCCGGAGGGGAGGGACGCAGGGAAGAGTCGCACGGACG
    Figure US20230142867A1-20230511-P00077
    Figure US20230142867A1-20230511-P00078
    ATGCCGCCTCCTCGGACGGGGCGCGGGCTTCTGTGGCTC
    GGTTTGGTATTGTCTTCAGTGTGCGTGGCCCTGGGATCAGAAACTCAGGCGAACTCCACGACTGATGC
    ATTGAACGTATTGTTGATAATAGTTGATGACCTGAGACCCAGTCTCGGCTGCTACGGTGATAAACTCG
    TACGAAGCCCCAACATTGATCAATTGGCAAGTCATTCCCTCTTGTTTCAGAACGCTTTTGCCCAACAG
    GCGGTTTGCGCACCGAGCAGGGTTTCCTTCCTCACAGGCCGCCGCCCTGACACTACCCGCTTGTACGA
    CTTTAATTCATACTGGCGCGTCCATGCCGGTAACTTCTCTACGATACCGCAATACTTTAAGGAAAACG
    GTTATGTCACTATGAGCGTAGGTAAGGTATTTCACCCGGGGATCAGTTCCAACCACACCGACGATTCT
    CCTTACTCTTGGAGTTTTCCTCCATATCACCCAAGCAGCGAAAAGTATGAGAATACAAAAACTTGTAG
    AGGCCCCGATGGTGAGCTTCACGCAAATCTGCTTTGCCCAGTCGATGTGCTCGATGTCCCAGAGGGCA
    CTCTGCCGGATAAGCAATCCACGGAGCAGGCTATACAGCTGCTCGAAAAGATGAAAACTTCAGCGAGC
    CCGTTTTTCCTCGCGGTGGGTTACCACAAGCCGCATATCCCGTTCCGGTACCCTAAGGAGTTTCAAAA
    ACTTTATCCGTTGGAGAATATCACACTCGCGCCCGACCCCGAAGTCCCAGATGGACTTCCTCCGGTCG
    CTTATAATCCCTGGATGGACATTCGCCAGAGGGAAGATGTACAAGCTCTCAATATCTCTGTTCCATAC
    GGGCCCATTCCGGTTGACTTTCAGAGGAAGATTAGACAATCATACTTCGCCAGCGTATCCTATCTTGA
    CACCCAAGTTGGACGCCTGCTCTCCGCGCTTGATGACCTGCAACTGGCAAACAGTACCATAATAGCCT
    TTACCAGTGACCACGGTTGGGCACTCGGAGAACATGGAGAGTGGGCCAAGTATAGTAATTTTGACGTG
    GCTACCCATGTACCATTGATTTTTTATGTTCCCGGCCGGACAGCTAGTCTCCCGGAGGCTGGAGAGAA
    ACTGTTTCCATACCTTGACCCCTTTGATTCCGCGTCTCAGCTTATGGAACCGGGCCGCCAAAGCATGG
    ACTTGGTTGAACTGGTTAGTCTCTTCCCGACGTTGGCTGGCCTTGCAGGGCTCCAAGTTCCACCGCGG
    TGTCCCGTACCTAGTTTCCACGTTGAACTCTGTCGGGAGGGGAAGAACCTCCTTAAGCACTTCCGCTT
    TCGGGATCTTGAGGAGGACCCCTACTTGCCAGGTAATCCTAGAGAGCTGATAGCGTACTCCCAGTACC
    CACGACCGTCTGATATTCCACAGTGGAACTCCGACAAACCTAGTCTGAAAGACATTAAGATCATGGGC
    TACAGTATTAGAACGATTGACTACAGGTATACCGTGTGGGTCGGCTTTAATCCAGACGAATTTCTCGC
    GAACTTTAGCGATATCCATGCGGGTGAGCTTTATTTTGTTGATTCCGATCCGCTCCAGGATCATAATA
    TGTATAATGATTCACAAGGAGGTGATTTGTTTCAACTTCTCATGCCGGACTACAAGGATGATGACGAC
    AAGTA
    Figure US20230142867A1-20230511-P00079
    Figure US20230142867A1-20230511-P00080
    Figure US20230142867A1-20230511-P00081
    Figure US20230142867A1-20230511-P00082
    Figure US20230142867A1-20230511-P00083
    Figure US20230142867A1-20230511-P00084
  • The NEU1 protein sequence is:
  • (SEQ ID NO: 24)
    MTGERPSTALPDRRWGPRILGFWGGCRVWVFAAIFLLLSLAASWSKAEND
    FGLVQPLVTMEQLLWVSGRQIGSVDTFRIPLITATPRGTLLAFAEARKMS
    SSDEGAKFIALRRSMDQGSTWSPTAFIVNDGDVPDGLNLGAVVSDVETGV
    VFLFYSLCAHKAGCQVASTMLVWSKDDGVSWSTPRNLSLDIGTEVFAPGP
    GSGIQKQREPRKGRLIVCGHGTLERDGVFCLLSDDHGASWRYGSGVSGIP
    YGQPKQENDFNPDECQPYELPDGSVVINARNQNNYHCHCRIVLRSYDACD
    TLRPRDVTFDPELVDPVVAAGAVVTSSGIVFFSNPAHPEFRVNLTLRWSF
    SNGTSWRKETVQLWPGPSGYSSLATLEGSMDGEEQAPQLYVLYEKGRNHY
    TESISVAKISVYGTLDYKDDDDK 
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for NEU1 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • NEU1-DNA
  • (SEQ ID NO: 25)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG ATCATGGTGAGTGACTCCCCTCTCTGCTGC
    TCCACCCCCAACTCCCCAGAGGAACCCAATAACCCCAACGGGTCATGTCTTCCAGACCTCCATCCTGG
    GGGCCTATGTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGC
    TTCCGTGAGTGACACCCCCAGCCACCTGTCCCCAACCTTAAAGTACTGAGCCGTTCAAGCATTTTTTT
    TTTTTTTTGGAGATTTGGTGCTTCTAGAAAGAATGCCTTTGTGAGCTAAACGAACCAGAGGGAGCTTC
    TTGTGACATCCTGAGGCCAGGCGTTTAGGGAAGAAGGAAGAGAGTGAGCCCCCAGGCCGGTAGGCTCT
    CCAAAGAAGGAGCATTCTGATGGAGAGGTCTCTGCTATCTTCCCTAAGAAATCTCGGGTAGAAGTTAG
    CCATTAGAGGCCAGGCATGGTGGCTCACGCCTGTCATGCCAGCACTTTGGGGGACCGAGGTGGGCAGA
    TCACCCTGAGGTCAGGAGTTCGAGACCAGCCTGGCCCACATGGTGAAACCCCTCTCTACTAAAAATAC
    AAAAAATTAGCTGGGCGCGGTGGCGGGCGCCTGTAATCCCAGCCACTTGGGAGGCTGAAGCAGGAGAA
    TCGCTTGAACTCAGGAGGCGGAGGTTGCAGCGAGCCGAGATTGCGCTACTGCACTTCAGCTTGGGCGA
    CAGCAAGACTCCGTCTCAAAACAAAGAAAAAAGATTAAAAAAAAAAAAGAGCCATTAGAGAGACTTCC
    CAGCAGGTGGCTGTAGCGGCAGGGGATTTTCTTACTCTACAGCATGCATCCATTCAGGGTTTGCCAGT
    CTCCTCCTAAGCGACTGACATGCATTTTTTCCGCTAATTCTTATCATCGCTCTATGCAACATAATATT
    CCCGACCCCACTTCACAGGAGGAAACTGAGATTACATAACCTGCCCGGGGGTAAGGGTGGTGGTGCAG
    CTTGGATTCCAAATCCCGCAGGCAGCTTGCCTCCCAGTCCCCGATTTCCTGGCCCCAGGGTCCGCTCT
    AACCGGTCTCTGTTGCGGCTCAGTGGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACT
    ACATGTCCAAGAGCCTTCTAAAGATTCTGGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGG
    AAGAAGTGACAGCTCCGGCCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCA
    CTTCGCCTTACAGGTCTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTG
    TAATCCAACACTTTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCC
    AACATGGTGAAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTC
    ATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGC
    CGAGATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA
    AAGATTTTATTAAAGATATTTTGTTAACTCAGTAAATCTGTGACTCATTCTCTCCACGACGACAGGCG
    GCCTTTTTTTTTCCGCGCCCGGCTCTTTAAAATCGGCCCCGCCCCTGTGACGTGTCCTGGTGCCGCGG
    CCAATCAAAAAGCCCAATAAGGATGGTGCCAGTCCCTTCTTCACCCCAGCCAGTCCGCTTCCCGTTCG
    GGTCACGCGCTCCGGGCCTCCAAGAGGGCCTCCAAGAGGGCCAATCGGAAGGGCAAGCTTCGAGATGC
    TGCGTGATCACGTGGGAGGAGATGAGCTACTTGAAGACCAATTAGAGTCCGGGAAGCGCGGCGGGGCC
    TCCAGACCGGGGCGGGCTTAAGGGTGACATCTGCGCTTTAAAGGGTCCGGGTCAGCTGACTCCCG
    Figure US20230142867A1-20230511-P00085
    Figure US20230142867A1-20230511-P00086
    ATGACAGGCGAACGCCCTAGCACTGCATTGCCTGACAGA
    CGGTGGGGCCCTCGAATACTCGGGTTCTGGGGAGGGTGCAGAGTATGGGTTTTTGCGGCCATATTCCT
    CCTGTTGAGCCTGGCTGCATCCTGGTCAAAAGCGGAGAACGACTTTGGGCTGGTGCAACCGTTGGTTA
    CGATGGAGCAGCTCCTGTGGGTTAGCGGCAGACAGATCGGGTCAGTAGATACGTTCCGAATACCACTG
    ATCACAGCCACTCCGCGAGGCACACTCCTTGCTTTTGCAGAAGCCAGGAAGATGAGCTCATCAGACGA
    GGGCGCTAAATTCATCGCATTGCGGCGATCCATGGACCAGGGATCTACATGGAGCCCGACGGCATTTA
    TAGTGAACGATGGGGACGTGCCGGATGGACTTAACCTTGGAGCGGTTGTATCAGATGTGGAGACAGGC
    GTTGTGTTTTTGTTCTACTCACTTTGCGCACACAAGGCGGGCTGCCAGGTTGCTAGCACTATGCTCGT
    ATGGTCCAAGGATGACGGCGTGTCCTGGTCTACGCCACGAAATCTCTCACTCGACATTGGTACTGAAG
    TATTTGCACCAGGGCCGGGTTCCGGTATCCAGAAGCAACGGGAACCTCGGAAAGGTAGACTTATAGTA
    TGCGGACACGGAACTCTTGAGCGGGACGGCGTTTTCTGTCTCTTGTCTGACGATCACGGGGCATCCTG
    GAGGTATGGTAGTGGGGTTAGTGGGATACCCTATGGCCAGCCTAAACAGGAGAACGATTTTAATCCGG
    ACGAGTGCCAGCCTTATGAACTTCCCGACGGCTCAGTGGTCATAAATGCGCGCAATCAAAATAACTAT
    CACTGCCACTGTAGGATTGTGTTGCGGAGTTACGATGCTTGTGATACACTGCGACCGCGAGATGTGAC
    ATTCGACCCAGAATTGGTGGACCCCGTAGTAGCGGCGGGGGCCGTGGTCACAAGTTCAGGGATAGTTT
    TTTTCAGTAACCCTGCTCATCCCGAGTTTAGAGTGAATCTCACTCTTCGGTGGTCTTTTTCCAATGGG
    ACCAGTTGGAGAAAAGAAACGGTACAACTCTGGCCTGGACCGTCCGGTTATTCTAGTCTCGCGACGTT
    GGAGGGGTCAATGGACGGAGAGGAGCAGGCCCCGCAACTCTACGTCCTGTACGAGAAGGGGCGGAACC
    ATTACACCGAGTCAATAAGTGTCGCTAAAATCTCCGTTTACGGCACATTGGATTACAAGGACGACGAT
    GACAAGTA
    Figure US20230142867A1-20230511-P00087
    Figure US20230142867A1-20230511-P00088
    Figure US20230142867A1-20230511-P00089
    Figure US20230142867A1-20230511-P00090
    Figure US20230142867A1-20230511-P00091
    Figure US20230142867A1-20230511-P00092
  • The SUMF1 protein sequence is:
  • (SEQ ID NO: 26)
    MVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEFEK
    FVNSTGYLTEAEKFGDSEVEEGMLSEQVKTNIQQAVAAAPWWLPVKGA
    NWRHPEGPDSTILHRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYS
    CRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVD
    AFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKK
    GGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPTMDDYKD
    DDDK
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for SUMF1 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • SUMF1-DNA
  • (SEQ ID NO: 27)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG ATAGCTGGACCAGGAATCAAAGAAGTTCAG
    GTCTCTGAAGCGCAGCTGACACAGCATGTGCTGAGACTGATATTTGAAGGAGGCGAACAAAGATTACC
    GTTGACCTAGACCTTGACTATATATTATGCACAGTAAATACATACAACCTTGGTTAAGGCATTGACCT
    TTTAGCTTCATCAATTTTGAAATGAGTATAGTAATAATACATTTCTTGCCAAGTGGTTGTAATCTCAA
    ATGAGAAAATGTGTGAAAGGCCTTGGCAAACTGTAAAATAGTCAACAAGGAGAAAAGGCCTTTTTTTT
    TTTTTTGAGAGATGGTCTCACTTTGTCACCCAGGCTGGAGTGCAGTGGCACAATCATGGCTCACTGCA
    GCCTCTGCCTACCAGGCTCAAGTGATCCTCCTGCCTCAGCCTCCGAAGTAGCCAGGACCACCGGTGTG
    CGTCACCATGGCCAGTTAATTTTCTTAATTTTTTGTAGAGACAAGGTCTTGCTGTGTTGCCCAGGCTG
    GTCTCAAGCTCCTGGCCTCTAGCGATTCTCCTGCCTCGGCCTCCTAAAGTGCTGGGATTACAGGCTTG
    AGACACGGTGCCTGGCTGGTCTTTTCTTTATGCTAAAAAAATTCAAACAGATGCTACATTAGAATAGT
    GCAATAAACACCCAGATGCCCATCAACCAGCTATGACAGTTATCAACTCATGGCCAATCATGCTTCAT
    CTTTAGCCAGACCCTGATTACCCATTTTTTGGGAAGCAAATCTTTGACTGCCTATATTTTTATCCATA
    TATATATATGTGTGTGTGTGTGTGTGTGTGTGTGTATACCTCAGTGAGTGTGTGTGTACCTCAGTGAG
    GGGATTCTTTTAAAAAATATAGCCACAAATACCATTATTACTCCTAAAAGCCTAAAAACAAAATTTAC
    AGCAATTCCTTAATTTTATCAAATATTATAATTTGACTTAGTGGTTAAATTTCCCTGATTATCTCAAA
    AATTTTGAACTCAGTAGGTTTGAATTAGTATTTAGATAAGGTTCACATATTGCAGCTGGCTGCTATGT
    TTTTTTTTTTTTTTTTTTTTTCAGATGGAGTCTTGCTCTGTCCCCAGGCTGGAGTGCGGTGGCGCCAT
    CTCGGCTCACTGCAAGCTCCACCTCCCGGATTCATGCCATTCTCCTGCCTCAGCCTCCCGAGTAGCTG
    GGACTACAGGCGCGTGCCACCACGCCGGGCTAATTTTTTGTATTTTTTAGTAGAGATGGGGTTTCACC
    GTGTTAGCCAGGATGGTCTCGATCTCCTGACCTCGTGATCCACCCGCCTCGGCCTCCCAAAGTGCTGG
    GATTACAGGCTTGAGCCACCGCGCCCGGCCTGCTATGTTGTTTTTTAATCTGTAAGTTTCTCCTTCAT
    CTTCTTTTTGTTTTGGTTTTGCATTTTGTTCGATTACGGAACTGTAGAACAGTGGTTCTTAAACTTAA
    GTGTGGATTAGAATCATCTGGAGAGCTTGTGAAACCCTGGATTGCTGTGTCCAGTCCCCACGGATTCT
    GATTCCGTATTGTATTTCTCTTTTTTTTTTGAGACGGAGTCTTGCTCTGTCGCCGAGGCTGGAGTGCA
    GTGGCGGGATCTGGGCTCACTGAAACCTCCGCCTCCCAGGTTCAAACGATTCTCCTGCCTTGGCCTCC
    CGACTAGCTGGGACTACAAGCGCGCCATTACTGGGTCTTTGTGAGGATTAAATGAGATTTTAAAAAGT
    GAAGTGCTTAGCATAGTACTCTGGCACTTACCAAATGCCCAGCAAATTTGACTGTTATTTTTCAACCA
    AGAAATCATAAAGGGGTGCACGGTCTGTGGAAGGCCTTGAGTTTTGTGGAGTTTTGTGTTTTGCTGCG
    TTTTGACGTTAGCACAGGCAAGCAGCAGCGCCAAAGAAGTACCTGAAGAAAGGTGAAGAGCAACTCTC
    CTGGTCACTTCGTAAGTCAGCTCAAAATTCTGCCCCTTTAAGGAGGGCTCAG
    Figure US20230142867A1-20230511-P00093
    Figure US20230142867A1-20230511-P00094
    ATGGTTCCAATCCCCGCCGGGGTGTTCACCATGGGGACGGACGACCCACAGAT
    TAAGCAGGATGGCGAAGCTCCAGCGAGAAGAGTCACCATAGACGCTTTTTACATGGATGCTTACGAAG
    TGTCTAACACGGAGTTTGAAAAATTTGTTAATAGCACAGGTTATTTGACGGAGGCGGAGAAGTTCGGT
    GACAGCTTTGTCTTCGAGGGCATGCTTTCTGAACAGGTGAAGACTAACATTCAGCAGGCAGTTGCAGC
    GGCACCTTGGTGGCTCCCAGTTAAGGGGGCAAATTGGCGACACCCAGAGGGACCAGACTCTACTATAT
    TGCACCGGCCTGACCACCCGGTTCTTCACGTATCCTGGAATGACGCCGTTGCCTACTGCACCTGGGCA
    GGAAAACGACTGCCGACCGAAGCAGAGTGGGAGTATTCTTGTAGAGGAGGACTGCATAATCGGCTTTT
    TCCCTGGGGTAACAAGCTTCAGCCCAAAGGCCAACACTATGCTAACATCTGGCAAGGGGAATTTCCCG
    TCACGAACACTGGGGAGGACGGATTTCAGGGGACCGCACCTGTAGATGCGTTTCCCCCCAACGGGTAT
    GGTCTCTACAACATCGTCGGTAATGCCTGGGAATGGACCAGCGACTGGTGGACGGTGCACCATTCTGT
    TGAAGAAACATTGAATCCCAAAGGGCCACCAAGCGGAAAGGATCGAGTGAAGAAAGGCGGATCTTATA
    TGTGTCATCGGTCCTACTGCTACCGGTATCGGTGTGCTGCTCGCTCTCAGAATACACCCGACTCAAGC
    GCAAGCAATCTTGGTTTCCGGTGTGCGGCCGATCGCTTGCCGACAATGGATGACTACAAGGACGATGA
    CGACAAATA
    Figure US20230142867A1-20230511-P00095
    Figure US20230142867A1-20230511-P00096
    Figure US20230142867A1-20230511-P00097
    Figure US20230142867A1-20230511-P00098
    Figure US20230142867A1-20230511-P00099
    Figure US20230142867A1-20230511-P00100
  • The GAT1 protein sequence is:
  • (SEQ ID NO: 28)
    MATNGSKVADGQISTEVSEAPVANDKPKTLVVKVQKKAADLPDRDTWKG
    RFDFLMSCVGYAIGLGNVWRFPYLCGKNGGGAFLIPYFLTLIFAGVPLF
    LLECSLGQYTSIGGLGVWKLAPMFKGVGLAAAVLSFWLNIYYIVIISWA
    IYYLYNSFTTTLPWKQCDNPWNTDRCFSNYSMVNTTNMTSAVVEFWERN
    MHQMTDGLDKPGQIRWPLAITLAIAWILVYFCIWKGVGWTGKVVYFSAT
    YPYIMLIILFFRGVTLPGAKEGILFYITPNFRKLSDSEVWLDAATQIFF
    SYGLGLGSLIALGSYNSFHNNVYRDSIIVCCINSCTSMFAGFVIFSIVG
    FMAHVTKRSIADVAASGPGLAFLAYPEAVTQLPISPLWAILFFSMLLML
    GIDSQFCTVEGFITALVDEYPRLLRNRRELFIAAVCIISYLIGLSNITQ
    GGIYVFKLFDYYSASGMSLLFLVFFECVSISWFYGVNRFYDNIQEMVGS
    RPCIWWKLCWSFFTPIIVAGVFIFSAVQMTPLTMGNYVFPKWGQGVGWL
    MALSSMVLIPGYMAYMFLTLKGSLKQRIQVMVQPSEDIVRPENGPEQPQ
    AGSSTSKEAYIDYKDDDDK 
  • GAT1 DNA
  • (SEQ ID NO: 29)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG AGGTCGGGTCTGCTCAAGTTCTGCAAAGAT
    GGCTGCCAACACCTCTATGCTTAGGAGCTTTGGTGGAGATTGAGTTTCTGTTTGACAATAGTTTTAGC
    AAAAGTCCCAGACAGAGTTCTCATTGGCCCAGCATATGTCACACGTCCCCTCATTGAGGTCAGAGTGA
    TGTGCTCTGATTATCTGTGCCCTGGGTCACATGTCTGTCTCCAGGAGGAGGGAGCCCCACCCAACCAT
    ATTGAATATGAGTGAAGGAGGGGAAGTTTCCCAAATGAAAATTGAGGTTCTGTCACCCAAAGAAGTTG
    AAATGAATGTCAGGCAAGCAAAAGTAGCCTTCATTTTTGCTACCCTTGTATCTCCACTCAAAGCTATG
    CCCCACATCTCCATCCAGACAGCTCCTACTCATCCTTCAGACCTCAGTTCAAATGGCGCTTCCTTGAG
    GAAACCTTCCCAAACCATCCCACACTAGACTGGAATCCCCTGCTGGACTAGAATCCCCTGCATGTGTG
    CTCCTGGCAGAGCTTGTCATTCTGGGTTTGTGCGGTTATTTGATGAAAGCCTGTCTCCCATACTGGAC
    TGGCATCACCTTGAAAGCTGTGGCCTGTCTTGTCCATCGCTCTGTCCCTGGAGCCTAGGACAGTGCCT
    GGCACACAGGTGGAACTCGGGAAATATTTATTTGAAATACCAACGGAAGTCCACAGCACATATGACCT
    GGTGATCAGTGGTGGGAGGTGAATGGGTCAAGAAAGACACCAAGATTCCCTGCTCATGAGTGAGAGTG
    TGGGGAAACAGAAGAAGTCAGGGTGACCTTGGTGAGCACTGCCCTGGGGGACTCGAGGATGGATGAAA
    GCAGGGAGGAGGCCAGATTAGGTGGGGTAGAGGACCTGAGGTCCATCAGCAAAGGCCATCCCTGGGGA
    CAAAGAGAGGAGGGGAAGAGTAGGTCTGGGCTGGAGAGAAGGAATCTTTTCTCTGAGTTCCTGGGGAC
    CCCAGAGGGAAGGGCAGATGTGGCCCCATTCCCAGAAGCCTTTATAAAAGAGGGTGGGGAGGTCAGGC
    CTTGGCAAGGACAAGGGAAGGAGTCCCTACGTGCTGGATTGGGATTGGCTGGAAAGGGGACTTCTCTG
    GGAGAATCCCTCAAACCTCAAGAACTGAGAGAAGGGTGTCTGGGGCTCCTGCCACCATCCCTGTTTCC
    CTTTTAAGTAATCTGTTTCCCCATCTGTCCATCCATACACACAGCCACTTGTGTCTCCATGACCAACC
    GCTGGCAGTGGAAGGGTGTCCTTCCCACCCCCACTCTTACACACACTCCCAGCTGGTACCCAGAGCCT
    GGTCACCCCAGGCCAGGCCTGTGTTTCCAGGTGTAACGGGCAGCAGACGCTGCCCTAGGACTAGAGCA
    GGGAGGGGGCACGGGCCCACCCCAACCCACAGCGACCCACAGAGGGCGAAAAGAGGACGACCGCAGAG
    AGAAACGGAAAGGACAGGCCAACGGAAGCAGTACTGCAAGGCTGGAAGGAGAAAAGCCAGGAGGGGAG
    TGCTTGCTGTGAAAGACAGGGAGACAGAGACCAAGACGGACAGGCAGACAGGCTGGTGACCCAGGATG
    AGGCCGGAAAGAGCCATCAAAGGAAGGAGAAGGAAGGAGAGAGATTGGAGCGGGACGGCGGGGCAGGC
    GAGGGAAGGAGGGGGTGGGGAGAGGGAGGGAGGAAGAGAGGGGAGAAAGAGGGAGGAAGAGAGGGGAG
    AAGGAGGGAGAAGAGAGCGGGAGAATGCGAGAGGAAAGAAGGGAGAGGGGAGGCGTAGAAGGGGAGAG
    GAGGTGAAGGGAAAAGGAGAGAGCCTGCTGGCGGCGAAGCTGCAAGAGGCAGCTGCGGAGGGAGCGCG
    CGGCGGGCCTGGGGGAGCGCTGGGCGGGGGCGGGCGGTGCGGGCAGGGCTATACCCGAGCTGGGCGGG
    CTCCGGGCGCCGCGGGCCCTGCCCTCCCCCTCCATCCCTCCGGACTCGCTCCCCCCTCCTCTCCCTTC
    CCCGCGACCCTCCGCC
    Figure US20230142867A1-20230511-P00101
    ATGGCAACTAACGGTAGTAA
    AGTCGCGGACGGTCAGATCTCCACAGAAGTTTCTGAAGCGCCGGTCGCAAATGATAAACCAAAGACCC
    TCGTAGTCAAGGTACAGAAAAAAGCTGCTGACCTTCCTGATCGAGATACCTGGAAGGGCAGGTTTGAT
    TTCCTCATGTCATGCGTTGGGTACGCCATCGGCCTCGGAAACGTATGGAGGTTCCCCTACCTGTGCGG
    TAAGAATGGTGGCGGAGCCTTCCTGATACCATACTTTTTGACTCTCATATTCGCAGGCGTGCCACTCT
    TCTTGCTCGAGTGCAGCTTGGGCCAGTACACGTCTATAGGCGGCTTGGGCGTATGGAAGCTGGCGCCC
    ATGTTCAAAGGTGTTGGCTTGGCCGCTGCTGTACTGTCCTTTTGGCTGAACATTTATTACATAGTGAT
    CATATCCTGGGCAATTTACTACCTCTACAATTCTTTTACGACAACACTGCCCTGGAAGCAATGCGATA
    ACCCATGGAATACGGACCGATGCTTTTCCAACTATTCAATGGTAAACACCACGAATATGACGTCTGCG
    GTAGTCGAATTTTGGGAACGGAACATGCACCAGATGACCGATGGTCTCGATAAACCGGGTCAGATTAG
    ATGGCCACTCGCAATAACTCTTGCCATCGCATGGATACTCGTATACTTTTGCATCTGGAAAGGTGTCG
    GATGGACAGGCAAAGTAGTCTATTTTAGTGCTACATACCCTTATATCATGTTGATTATACTGTTTTTC
    AGAGGAGTCACACTTCCTGGGGCGAAGGAGGGCATTCTCTTCTATATCACGCCAAACTTTCGAAAGCT
    GAGTGACTCAGAGGTTTGGCTCGATGCTGCAACTCAGATATTCTTCTCATACGGCCTTGGTCTGGGTA
    GTCTCATAGCCTTGGGTTCCTACAATTCCTTTCACAATAACGTGTACCGAGACAGCATTATAGTATGT
    TGTATTAATTCTTGCACTTCAATGTTCGCCGGTTTCGTCATTTTCAGCATTGTCGGATTTATGGCGCA
    TGTTACCAAAAGGAGCATTGCGGATGTGGCGGCTAGTGGGCCAGGACTCGCTTTCCTCGCCTACCCAG
    AAGCAGTTACTCAGTTGCCGATTAGCCCACTGTGGGCTATCTTGTTCTTTTCTATGCTCCTTATGCTT
    GGGATAGATTCCCAGTTTTGCACGGTAGAGGGTTTCATAACGGCACTTGTCGATGAATATCCGCGACT
    CCTGAGAAATAGACGCGAACTCTTTATAGCAGCGGTATGTATAATCTCATATTTGATAGGCTTGTCAA
    ACATAACACAAGGGGGAATATACGTTTTCAAATTGTTCGACTACTATAGTGCCTCCGGGATGTCACTT
    CTTTTCCTGGTATTTTTCGAGTGCGTTTCAATCAGTTGGTTTTATGGCGTGAATCGATTTTATGACAA
    CATTCAGGAGATGGTAGGAAGTAGACCGTGCATATGGTGGAAACTCTGTTGGTCTTTTTTCACGCCCA
    TCATCGTGGCCGGCGTGTTTATATTTAGCGCAGTGCAAATGACTCCTCTCACTATGGGGAACTATGTC
    TTCCCCAAATGGGGCCAAGGGGTAGGCTGGCTCATGGCGTTGAGTTCCATGGTCCTTATTCCAGGTTA
    TATGGCTTACATGTTCCTGACACTGAAAGGGAGTCTGAAACAACGGATTCAGGTGATGGTTCAACCTA
    GTGAAGATATTGTCAGACCTGAGAACGGTCCCGAGCAACCTCAGGCGGGAAGTAGTACCTCTAAGGAG
    GCTTATATAGACTATAAAGACGACGACGATAAATA
    Figure US20230142867A1-20230511-P00102
    Figure US20230142867A1-20230511-P00103
    Figure US20230142867A1-20230511-P00104
    Figure US20230142867A1-20230511-P00105
    Figure US20230142867A1-20230511-P00106
    Figure US20230142867A1-20230511-P00107
  • The FIG. 4 protein sequence is:
  • (SEQ ID NO: 30)
    MPTAAAPIISSVQKLVLYETRARYFLVGSNNAETKYRVLKIDRTEPKDLVII
    DDRHVYTQQEVRELLGRLDLGNRTKMGQKGSSGLFRAVSAFGVVGFVRFLEG
    YYIVLITKRRKMADIGGHAIYKVEDTNMIYIPNDSVRVTHPDEARYLRIFQN
    VDLSSNFYFSYSYDLSHSLQYNLTVLRMPLEMLKSEMTQNRQESFDIEEDEG
    LITQGGSGVFGICSEPYMKYVWNGELLDIIKSTVHRDWLLYIIHGFCGQSKL
    LIYGRPVYVTLIARRSSKFAGTRFLKRGANCEGDVANEVETEQILCDASVMS
    FTAGSYSSYVQVRGSVPLYWSQDISTMMPKPPITLDQADPFAHVAALHFDQM
    FQRFGSPIIILNLVKEREKRKHERILSEELVAAVTYLNQFLPPEHTIVYIPW
    DMAKYTKSKLCNVLDRLNVIAESVVKKTGFFVNRPDSYCSILRPDEKWNELG
    GCVIPTGRLQTGILRTNCVDCLDRTNTAQFMVGKCALAYQLYSLGLIDKPNL
    QFDTDAVRLFEELYEDHGDTLSLQYGGSQLVHRVKTYRKIAPWTQHSKDIMQ
    TLSRYYSNAFSDADRQDSINLFLGVFHPTEGKPHLWELPTDFYLHHKNTMRL
    LPTRRSYTYWWTPEVIKHLPLPYDEVICAVNLKKLIVKKFHKYEEEIDIHNE
    FFRPYELSSFDDTFCLAMTSSARDFMPKTVGIDPSPFTVRKPDETGKSVLGN
    KSNREEAVLQRKTAASAPPPPSEEAVSSSSEDDSGTDREEEGSVSQRSTPVK
    MTDAGDSAKVTENVVQPMKELYGINLSDGLSEEDFSIYSRFVQLGQSQHKQD
    KNSQQPCSRCSDGVIKLTPISAFSQDNIYEVQPPRVDRKSTEIFQAHIQASQ
    GIMQPLGKEDSSMYREYIRNRYLDYKDDDDK 
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for FIG. 4 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • FIG. 4-DNA Sequence
  • (SEQ ID NO: 31)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG GTAGCAGCTGCTATTAGTGCCTCCCCCATA
    TCCACTTGACATACACCTTTTCTATGCACGACCAGCCCAACTTCCAAAATGTTAGCATGTGCGACTTT
    TTGCCTAAGGGCTTTCTCTGGCTGCCAGAACCTACTCTCCCTGAACACAGGGTAGACTGGAAGTTCTA
    GGGCATCAATGTTCTGCCCTATCAGGCTTCATCCAATAACTGATGGAAGTTGGCGAATAAATACTCCA
    GCTTCCTTGCCCCTCAAGAGGGGATAACTGAGACATGCCTCCCAGTATTCCCTAGTGAATTCAGCCCA
    CAGAGAAATTTCTCTGGTAATATTTCCTTTATTAGTTTTCTTCCCCTCCCTGCTGCTGTTTCCTGGGA
    TTACGTCTCATATAAACAACCTGCCTTCAAAGTTTTGTCTTAGGATCTGATTCCGGGTGAAATAAACT
    AAGATAAAGTTAAAATGGAAGTGCTAAACTATAATAATTGAAAACTAGAATCCTTCAGGTATTTTAAG
    AGTTGCTATCCTAGATAAAAACACTGTGCAAACATTTAACTAATATTCAGAAGAGTTGGTTTCATTAT
    GAAACAGAACACAGCTAACTATTTTTCCCAGTTAAAAGATCCAAGTCCAAAATGCCAGAACGGACACG
    GCCATCCTTCTAGTTCTTAATAAAAGGAGTACCAAACTACAGGGGGAAAGAAGCGCAAAATAACATAG
    ACTTAAGTATTTCTTTACAATTTTTGTAATTTCTCAAATTTTAATTTACAATATGATCTTTGGTATTA
    CATGAATGTTATCTGGGTATGCTATAATTCTATAATTTTTTTTTAAAGTTTCCTTACACTTAGCAGTT
    GAGAAGCCCTCTAGCAGAGGTTTCTGAGGTTTAGAGGTTTCCTGGGAAACCTAGAATTGCCGCCTAAG
    ATAGGATAGCTGCCTCTCCCCTACATCTTATACTCTTGCAGTAGGTGTTCAGTAAATATTTGTGGAAT
    AAATACTTCAGTCGTTGAAAGGAGACGTGGTTTTAAAAAGTGGATTTACAAAGAGACCGTTTTGCAAG
    AAAAGAAGCTGGTCTGGGGGTCCCCGCAGGAGGCGGGGTTCCGTGCAACAGATACCAAGATGCCTCCA
    GGGCGGACCACGACCCAGCGGGCCGCCGGAAGGCTCCTTACCTGGCCGCAGCCGGACCCTCCTCAGGG
    AAGCAACTGCCTCCAGGGACTCCGCGGGGCTGTCGGGCGGTTGCCGGGGAGACCTCTGGCCGCCACGC
    ACCTGCTGCTTGGTGGGGCGGGGCCTCGAGAGGCGGGGCTGTAGTTCTTTGTCATTTCTTTTCCAATT
    CGAAATAGTCACTTTTGTACCTAATTTGGCATTTATAATTATAATACACTCATTTTCACAACTATGGA
    AGAGAGGGTACGCGCATGCCCATTTATGTCGCACACATGAAATGAAGAAACCTCATTTCAAGCGCGGC
    TCCGCCACACTGCGCTAGTCTCTGTCGACAACTTCAAGTGGCTAGCCTCACGCACAAACGTGTGGCAA
    AGCCACACTTTCCAGCTTTAGGTGATTTTAAAACAATTTCTTGGGCCACACAGCGTAGGTACAACATG
    GATTAGAGGTGAGGGGAAAACTGACACTCAGTGAAGGACCGTCAGAGGGCCGGCTTCCAGGAAAATGG
    GCCGCAGCAGGCAGGGAAAACACAAGGATCTATCCTGCAACTCGGAGAACCCTTTTCAAAAGAAGCTA
    CCCAATCCCAAACCTTCCGCTTTTTTCCAAAGAGAGGGAGGAGATAAATAGACTCAATTAATCCGTCG
    ACTTTTTCTCCGCCCATGTTTTCCTCCAATCCTTACCAAAGATGGTGCCCTTCGCTTCCTCTCTCGGC
    AGCACGCAGGTCCCGGGAGCCTCTACCCGACCTCTCTATGATACGTCCTCCAGCCCCGCTCCCGACGT
    GAGGGGCGGGGCTTGCCTGGAGGCGGGGCGCAGGGATCCGGAAACACC
    Figure US20230142867A1-20230511-P00108
    Figure US20230142867A1-20230511-P00109
    ATGCCCACGGCAGCGGCTCCAATAATCTCTTCCGTTCAAAAGCTTGTTTTGTATGA
    AACTCGCGCGCGGTATTTTCTTGTAGGTTCAAATAATGCGGAAACAAAGTACAGAGTACTGAAGATAG
    ACAGAACTGAGCCTAAGGACCTTGTCATAATCGATGATAGACATGTTTATACGCAACAGGAAGTGCGG
    GAACTCTTGGGCAGATTGGATTTGGGGAATAGAACAAAAATGGGGCAGAAAGGTAGCTCTGGGTTGTT
    CCGAGCTGTCTCAGCTTTCGGCGTAGTAGGCTTTGTGAGATTTCTGGAGGGCTATTACATAGTGCTTA
    TTACCAAAAGGCGCAAGATGGCGGATATAGGCGGCCATGCGATTTACAAGGTTGAGGATACAAACATG
    ATATACATACCAAATGATTCAGTGCGAGTCACCCATOOTGATGAAGCTCGCTACCTGCGCATATTTCA
    GAATGTCGATCTGTCTAGTAACTTCTATTTCTCCTATAGTTATGATCTTAGTCATAGTCTCCAATATA
    ATCTCACCGTCCTGCGGATGCCCTTGGAGATGCTCAAATCCGAGATGACACAGAACCGACAAGAAAGC
    TTCGATATTTTTGAAGACGAAGGTTTGATCACACAGGGAGGTAGCGGTGTATTCGGAATTTGTAGTGA
    ACCCTATATGAAGTACGTATGGAATGGCGAGCTCCTGGACATTATAAAAAGCACGGTCCACAGAGACT
    GGTTGCTTTACATCATTCATGGCTTCTGCGGCCAGTCTAAGTTGCTGATTTACGGGCGGCCTGTGTAC
    GTAACTCTGATCGCAAGGCGATCTTCTAAATTTGCGGGCACCAGATTTCTCAAGCGAGGCGCCAACTG
    CGAGGGAGACGTAGCTAACGAAGTTGAGACTGAACAAATTCTGTGCGACGCGTCCGTTATGAGTTTTA
    CAGCAGGTTCCTACTCCTCCTATGTACAAGTACGGGGCTCAGTTCCTCTCTACTGGAGTCAAGACATT
    AGCACTATGATGCCCAAACCCCCAATTACCCTCGACCAGGCCGACCCGTTCGCTCACGTTGCGGCCCT
    GCATTTTGATCAAATGTTCCAACGATTCGGCAGCCCGATTATCATTCTGAATCTTGTGAAGGAGCGGG
    AAAAGAGGAAGCACGAGCGGATCTTGTCTGAGGAATTGGTGGCAGCCGTGACATATCTGAACCAATTC
    CTGCCACCGGAGCACACGATAGTATATATTCCTTGGGACATGGCTAAGTACACAAAATCTAAATTGTG
    CAATGTGTTGGATCGCCTTAACGTGATCGCGGAAAGCGTAGTCAAGAAGACAGGTTTTTTCGTAAACA
    GGCCCGATTCATACTGTTCAATTCTCCGACCTGACGAGAAATGGAACGAACTGGGGGGATGCGTGATA
    CCTACTGGTCGCCTGCAGACTGGTATTCTCAGGACGAATTGTGTAGATTGCCTCGATCGGACGAATAC
    AGCGCAATTTATGGTAGGGAAGTGCGCGCTGGCCTACCAGCTGTATTCACTGGGGCTGATAGACAAGC
    CGAATTTGCAGTTCGACACCGATGCGGTGCGCCTCTTTGAGGAGCTCTATGAGGACCATGGAGATACA
    CTCAGTCTCCAGTATGGAGGGTCACAACTCGTACATCGAGTTAAAACATACAGGAAGATCGCCCCATG
    GACACAACATAGTAAAGATATTATGCAAACCTTGTCCAGATATTATTCCAACGCCTTCAGCGATGCCG
    ATAGGCAAGATTCAATCAATCTCTTTCTGGGGGTGTTCCATCCCACTGAAGGTAAGCCCCACTTGTGG
    GAGTTGCCTACAGATTTTTACCTTCATCATAAGAACACCATGAGACTGCTCCCAACACGGCGGTCTTA
    TACGTACTGGTGGACACCTGAAGTTATTAAGCACCTCCCCCTCCCGTATGACGAGGTGATCTGCGCGG
    TTAATCTCAAAAAATTGATTGTGAAGAAATTTCATAAATACGAAGAGGAAATTGATATCCATAACGAA
    TTTTTTCGCCCATACGAACTGTCCTCTTTTGATGATACATTCTGTCTTGCCATGACTAGTTCTGCGAG
    GGATTTTATGCCTAAAACGGTCGGAATCGACCCGAGTCCATTTACCGTTAGGAAACCTGACGAGACGG
    GGAAGTCAGTATTGGGGAATAAGAGCAACCGAGAAGAGGCGGTGTTGCAGCGGAAGACCGCAGCTTCT
    GCTCCTCCGCCCCCATCCGAGGAGGCTGTGAGTAGCTCTTCAGAAGACGATTCCGGCACCGACAGAGA
    GGAAGAGGGATCAGTTAGTCAGAGAAGCACACCGGTGAAAATGACTGACGCGGGCGACTCCGCTAAGG
    TAACTGAAAACGTCGTGCAACCAATGAAAGAGCTCTACGGTATCAACCTCAGCGACGGCCTTTCAGAA
    GAGGACTTCTCTATCTACAGCAGATTTGTTCAACTCGGTCAAAGCCAACACAAGCAAGACAAAAATTC
    CCAACAACCCTGCTCAAGATGCTCCGACGGCGTAATTAAACTGACTCCGATTAGCGCGTTTTCACAGG
    ATAACATTTACGAAGTACAACCTCCACGAGTTGACCGGAAAAGTACGGAGATATTTCAGGCTCACATC
    CAGGCCTCTCAAGGAATCATGCAACCCCTCGGAAAGGAGGATTCCTCAATGTACAGAGAATATATACG
    CAATCGATATTTGGATTACAAAGATGACGACGACAAGTA
    Figure US20230142867A1-20230511-P00110
    Figure US20230142867A1-20230511-P00111
    Figure US20230142867A1-20230511-P00112
    Figure US20230142867A1-20230511-P00113
    Figure US20230142867A1-20230511-P00114
  • The CLN5 protein sequence is:
  • (SEQ ID NO: 32)
    MAQEVDTAQGAEMRRGAGAARGRASWCWALALLWLAVVPGWSRVSGIPS
    RRHWPVPYKRFDFRPKPDPYCQAKYTFCPTGSPIPVMEGDDDIEVFRLQ
    APVWEFKYGDLLGHLKIMHDAIGFRSTLTGKNYTMEWYELFQLGNCTFP
    HLRPEMDAPFWCNQGAACFFEGIDDVHWKENGTLVQVATISGNMFNQMA
    WVKQDNETGIYYETWNVKASPEKGAETWFDSYDCSKFVLRTFNKLAEFG
    AEFKKNIETNYTRIFLYSGEPTYLGNETSVFGPTGNKTLGLAIKRFYYP
    FKPHLPTKEFLLSLLQIFDAVIVHKQFYLFYNFEYWFLPMKFPFIKITY
    EEIPLPIRNKTLSGLDYKDDDDK
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for CLN5 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • CLN5 DNA
  • (SEQ ID NO: 33)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG CTTGGTTATGTCAGTTCCTGTGTGTAGACA
    GTAAGAAAAAAAAAGCCATGCTATGTGTGACATTTTTTTCCAATGTTTATTTCCCACCAAAAAGTAAA
    TGCATTTTAACCCATTCTGTGGTTCACTGTAGCTTAAGGAAACCAAGCATATAGATGCATTAGTGATT
    TTGTTTATATTATGTAAAATGTAACGATCTATCTTAGAAATACCACAGTTTGTATTTTTCTTTAAGGA
    GTAAAGATTTGTCTTTAAAAATAACTTGATATTTTCCTGGCTTTCATTTAATACAATGGAAAATAAAG
    TATTACACGAAAAAAAAATTTTTTCTTAGAGACAGTATCTCATTATGTTACCCAGGCTGAAGTGCAGT
    GGCTATTCACAGGCACGATTATAGCTCACTATAGCCTCAAAGTCCTGGGTTCAAGCCTCCTGAGTAGC
    TAGAACTTCAGGTACATGCCACCTTGCCTGGCTTCCTGCTTTCTTATTCACTAAGAAACTTTGTTCTC
    TAAAATCATAGATTAGAGACAACTTCGTGTTTGAAATTACATCAGTAACAATAAAATATCCCACCTTT
    GCTTGAAGGACCTCAGTCATTCCACACAGAGAATTGGCCTCAATTTACAACCCAGATGTCAAGCTTCT
    GATAAGGGGTTTCCAGACACAAAACTCCATCGCTATTTTAACTTCATAGTCTCCAAGGAAATAGGCCC
    TTCATCCACTTTGCAGTCTAGACCTAGAGCCTGTGTGTTGATGGCAGCAGCAGGCCGTCTGGAGCAGC
    CACTACCATCATTGACATACAGCCTTGTTTTGCTTTGAGTCTACCAAAATACTGCTTATTTAAATTTC
    ATCTCAACTCCACACTTCCCCAGAATCCTATCACTCTATCTGGGGAGATGCCATGTGGTCCAGGCAGT
    CTTAGGCTGAATGGGCAAGGATCCAAGGGAAACCAATTAGATTGAAATTAGTTCTGTCTGTGGGCCCT
    CAAGATTTAAAAGCCATGGACAAGGGGGAAGCAACCAGTATCGACCAGCGATAATCTATGAGCACCTT
    CAGCTTGGGGTGTGGGGTCATAGACAAAAAGAATGGGGTTTAATCAGTGTTCCCATCCAAGGCAAAAC
    CAGACCAGAAGAATGGGAGCTACCAGGAGGCTGAGTTAAGACCAATACTAGGGCCAGGCATGGTGACT
    ACAGCCTGTAATCCCAGCACTTTTGGAGGCCGAGGCAGGTGGATTACTTGAGGTCAGGAGTTCGAGAC
    CAGCCTGGCCAACATGGTGAAACCTCGTCTCTACCAAAAAAAAAAAAAAAAATTACAACTGGCACACA
    CTTGTACCCAGCCACTAGGGAGGCTGAGGCAGGAGAATTGCTTGAACCCGGGAGGCGGAGGTTGCAGT
    GAGCCGAAATCACATCACTGCACTCCAGGCTGGGTGACAGAAGGAGACTCCGTCTCCAAAAAAATAAA
    AAATAAAAACCAATACTAGGAAGTGCTTTCTTACCATGAAAGCTCCCTGCACACAGAATGAAGAGACA
    GATGGCAGAAGCAAGACCAGAGGCCTCCACCACTGAACAAGGTCAACCTGAATTTGTTTTTGTAGCTC
    TGGGAAGTCCTGAACTGAATGGGAGGTTCTACTGTGGCCCTCTACTAACAGTAGATGGAACCTGGAAC
    CATCTACTGTTAATTGTCCTTTTAGTAGCCAAGACATGAAGTACCTATCTCCCTTCTTTGTATTTGGT
    GAAAAGATTTGAACTCGAGCCATCTCTGCTAAAGGCATGACTTTAGTCACCCCACTGCAGTTGCTTCT
    CTTTGCCTCCAAGAAAGTCCAATCTGATGGGCAGATTGTGCTTGCAGATGACTGGATTAAACTGCAAA
    TATCTCATGAAGCTGCTGCCTGCATCACCGCTCTCCAGGCAGCCACGGAGGCTCTGGTTGTTGAGGCC
    ATCTACTGTGGCCCTCATACATCATACTGTGTATGGTTCCAGGATTTGTAAATGCTCTAATCAATGTT
    TTTTTTTTTTTTTAAAGCTGTTCCTGCTGATTGCTTTTTAAGCTAGGCTCAAGTGTGGGGCCACGGAA
    TCCTTCAGGTGCCCTTCAAATCAGACATTACATTTCAGCTCTCAGGGCTAGAATGCTTCCCCCTCTTA
    TCTTGGGCAGGTCTTTGAAGAAGCCAGTTTAGCCTTGGTAATTTTGAATTTGGCCATAGTATACGGAG
    GAGAGCTAGGTAGCTATCAAAACGCTTAGACAAACGTATGGTAAAATGCAGCCCAGCCTGAACCTCCG
    CAAAACCCCCAAACCTGGAGCAGCAAGTGGGCAAAGGAGTGTGTGTTTGGGGCAGGGTACTGAGTCCC
    ATATGTAATCAAGAGACCTGGAGGGCCCAAGGAAGTCAGTGACACCCCCACAAGCTACATATCCTCCA
    ATGGATTGTGGTGGTCATCTCCGGCTACCGAAGCCACCTTGCTTATTCCCATAGGCTTACAAACATTA
    TGATGCTTCACACGAGTTGTGGGTGAAGTGAAGGAATTGGAAAGTCCCGGTCTAAGGCCCTGCAAAAC
    TGCTGTCTGGTAAAAACAAGTGACTATGGCAAGGCCCTGAATGTACTACGTATAGACTTAAGTCAGTA
    TTTGTGCCTAACAGCAAAGCATGCCACAAGAGGCTCCTCAAAAGCCGGGTAATTGATGCCATTGTTTT
    CTTGACCTAGTTGAAGAGTCAAAGAAGAGAAGGTGATGCAGAAACTTCTTTCCAGTGAAGTGACAGAT
    CCAAGCAAGGAACACCTGCCTGTGAGGGGAGCGGTAGGGGAGGGGGTAGATTCGGCAGCACTCGGGTA
    GTGGCAGGGACCACATATTGTCGGTGTTAACCACTGTTATCACTGGCCCAGGTCGAGTGAGGACGAGT
    CATTTCCCCACTAAGCACTTGGGGAGTGGCAGGGACCACATATTGTCAGTGTTAACCACTGTTATGAC
    TGGCCCAGGTCGGGTGACGACGAGCCATTTCCGCACTATTGTCCCTTCTTCCTCACTGCACCCCCAGC
    CCTGCACCCTCAGCTCCAAATACACATTCTCGTTCACTCAACGTTTGTTGACACCTAGCGTGGGCCCT
    GTGGGATGGGGTAAAAGTTCGGTGTTGGGAGGAGCCCCGGTGACAGATTTTTGGTTTTGGTGGTGGCT
    TTTGGGCAAGGACAGGTCAATTCATAAAAGATGACGCCCCAGGTCTGGATCGAATTCTCCTGGGAGCC
    AGACCAAAGCACCTTCCTGGACTGCAAAGTGTGGAAGCCGCCGCGGGCCGGGCGCGGGGAGGTGTCAT
    GCGCCGGAACCTGCGCTTGGGGCCAAGCTCTGGAGCTGACGCGCAGGGGCAAGGCGCCCCGCGTCCCG
    GACTGGCGGCTCCGCGCATGCTCCTCCCACCGGCGTCGCAGGCCTCGAGAGGCTCCGG
    Figure US20230142867A1-20230511-P00115
    Figure US20230142867A1-20230511-P00116
    ATGGCTCAGGAGGTCGATACAGGAGAAGGGGGAGAGATGGGGAGGG
    GAGCGGGAGCGGCGCGAGGCAGGGCAAGCTGGTGTTGGGCACTCGCACTGCTCTGGCTGGCGGTTGTC
    CCGGGATGGTCTAGAGTGAGCGGCATTCCAAGTAGGAGGCATTGGCCAGTGCCCTACAAACGATTTGA
    TTTCAGACCCAAGCCAGACCCGTACTGCCAAGCAAAGTATACATTCTGCCCCACTGGCAGCCCCATCC
    CTGTGATGGAGGGGGATGATGATATAGAGGTCTTTCGGCTCCAAGCCCCAGTATGGGAGTTTAAATAT
    GGAGATCTCTTGGGCCATCTGAAAATTATGCACGATGCGATCGGGTTTAGATCAACATTGACAGGCAA
    AAATTACACCATGGAGTGGTATGAACTCTTTCAGTTGGGAAACTGCACTTTTCCGCACCTTCGACCTG
    AAATGGATGCCCCTTTTTGGTGCAATCAGGGCGCGGCTTGTTTCTTTGAGGGTATAGACGATGTACAC
    TGGAAGGAGAATGGCACACTTGTACAGGTGGCAACAATCTCCGGTAATATGTTCAACCAAATGGCTAA
    ATGGGTTAAACAGGATAATGAGACTGGGATATATTACGAAACCTGGAATGTAAAGGCTAGCCCAGAAA
    AGGGTGCAGAGACGTGGTTTGATAGCTACGACTGCAGCAAGTTCGTCCTGCGGACCTTCAACAAATTG
    GCGGAGTTTGGAGCAGAGTTTAAAAACATAGAGACTAATTACACACGCATTTTTCTTTATAGCGGCGA
    GCCCACTTACCTCGGTAACGAGACATCAGTCTTCGGGCCCACCGGAAACAAGACCCTTGGTTTGGCGA
    TTAAAAGATTTTATTATCCATTCAAACCTCATTTGCCAACTAAGGAATTTCTCCTCTCTCTCCTGCAG
    ATATTCGATGCGGTCATTGTTCACAAGCAATTCTACTTGTTCTACAACTTCGAATATTGGTTCCTTCC
    AATGAAATTCCCATTCATTAAAATCACATATGAGGAGATTCCCTTGCCCATAAGGAACAAGACTCTCT
    CAGGGCTCGATTACAAAGATGATGACGACAAATA
    Figure US20230142867A1-20230511-P00117
    Figure US20230142867A1-20230511-P00118
    Figure US20230142867A1-20230511-P00119
    Figure US20230142867A1-20230511-P00120
    Figure US20230142867A1-20230511-P00121
    Figure US20230142867A1-20230511-P00122
  • The AGA protein sequence is:
  • (SEQ ID NO: 34)
    MARKSNLPVLLVPFLLCQALVRCSSPLPLVVNTWPFKNATEAAWRALA
    SGGSALDAVESGCAMCEREQCDGSVGFGGSPDELGETTLDAMIMDGTT
    MDVGAVGDLRRIKNAIGVARKVLEHTTHTLLVGESATTFAQSMGFINE
    DLSTTASQALHSDWLARNCQPNYWRNVIPDPSKYCGPYKPPGILKQDI
    PIHKETEDDRGHDTIGMVVIHKTGHIAAGTSTNGIKFKIHGRVGDSPI
    PGAGAYADDTAGAAAATGNGDILMRFLPSYQAVEYMRRGEDPTIACQK
    VISRIQKHFPEFFGAVICANVTGSYGAACNKLSTFTQFSFMVYNSEKN
    QPTEEKVDCIDYKDDDDK 
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for AGA and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • AGA-DNA
  • (SEQ ID NO: 35)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG6
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG TGTCTCTACTTAAAATACAAAAATTAGCCG
    GGCATGGTAGTAGGCAGTAGCAATCCCAGCTACTCAGGAGGCTGGGGCAAGAGAATCGCTTGAATGCA
    GGAGGTGGAGGTTGCAGTGAGCTGAGATTACACCAGTGAACTCCAGCCTGGGCGATAGAGCAAGACTC
    CATCTCAAAAAATAAATACATACATACATACATACATACAAATATCCATGGAAAAGTTGATATATTGA
    GACCTTCTATCAGCCATTCCCACTATAGTGTAATTTAGCTGTTTCTCTGACAGGAATTTTTATTTAAG
    AAGAAAGAATTTGGAGAAATAGACATAGTAGGAGGTAACAGATCAGGCTCTTTCCACCAGAAGAAACA
    CAAATACCTTAAAAAACAGGAAGAGGCTAGAGAGATCAGTGGGGTACTAAGACCAATTTTCCATGTAT
    TGTATTTTACAATTTTTGCTAAAGGCATGTCCTAATTCCTTAAGGCTGCATAAAAATACTCAGCCCAG
    GAGCTAAGGAAAATTTACTCAGAGGACAATGTATGTTTTTCTGGCTTTCTTACCCTGTTTCACCCAAG
    TTTCCAGTTTCACCTGCCATTGGGTTATGATTACTCCAGCACAGATGACTTGTTCCTCTTCCAAACCC
    CATGAAAGGAACTTAGAAAATTTGTCTTTATTATATCTTTTAATTAACACTTCCCCAACGGGATAATG
    ATACTTCTGGGCTATCTTATGTTCTTTAAATTATGTATTTAATTTCCTAGCAATCAATAAGAGCCTTG
    AAGGTTTTGTAAGTCTCACAACTCCTACCTAGTTTATATTTTTACATTTATTTATTTATTTATTTTTG
    AGACAGAATCTCACTCTGTTGTCCAGGCTGGAGTGAAGTGGTGCGATCTCGGCTCACCACAATCTCCA
    CCTCCCGGCTTCAAGCAATTCTGCCTCAGCCTCCCAAGTAGTTGGGACTACAGGCATGCACCACCACG
    CCCGGCTAATTTTTGTATTTTTAGTAGAGACGGGTTTCACTATGTTGGCCAGGCTGGTCTCGAACTCC
    TGACCTCGTGATCCACCCACCTCAGCATCCCAAAGTGTTGGGATTACAGGCGTGAGCCATCGTGCCCA
    GCCTAGTTCATATTTTTAAAGTTTGTATAGATTTATAGATGCTTTTAAAAATTATGTGATTATGTGTC
    TTGCCTTGGCAACCTGGGCAGGCATTCCAGACACCCAAAGGGATTATTAGTTCATGTTCGCACGTCTG
    CCCAGCAACAAAACTGAGCAGCCTCCACCAAGCACGTAAAATCCCCATTCCTCCTCTTATTCATTTAT
    TTAGCACTTAATAGTGCCAGGCAGGAAGGTAAAAATACATTTAAAAAGGAATGGAGGTAGCCGTCTTG
    CCCTGGAGGTGCAGGAAGCTGCACTTCAGTGTGCTAGGTAGAGGGTACATAGGCCGTTTCTGTTTTTC
    TTCCAGCTCAGGGATTTATAATGAGAGGAAGTGCTACAAAAAGCTGTGGTGCGAAAATGACGGACGCT
    ATGAATGGAAAAAAAATCTAAAATCAATCCGGTCCTAAGCAAACACGCTTCACCTACACTCATAACTA
    TTGCAGACCTCCGAGGCCTGGGATCCCCAAGAATATACTGAGTTTGACAAACTTTTCAACTTCAACTT
    TAAATTAAAAAGACAGTAAAAGAACCAAATCCATAGTACACAGCAATCGGCTAAAGTTCCTGGGGCCC
    TGCAACCCAGAGTTGAATAATTTTGTATTAAATTCCCAATATCAAGCTAAATCTATTTTAACCACAGG
    AAAAGCAGCAACTTGTCTGGCTATTTTAAAAATCTGAACAGCACTTAGGAAGAAGCACCTTAGCGCAG
    GGAACAGCTCAGTGCCCCGTGACACAACTCTCCCGCGGGCCCAGGGACGCCTCGTCTCGCGAGAGTTG
    AGGGACGCCTGAGCGAACCCCCGAGAGAGCGGGCGTGGGCGCCAGGCGGGCGGGGCACTGGGGAT
    Figure US20230142867A1-20230511-P00123
    Figure US20230142867A1-20230511-P00124
    ATGGCCCGCAAAAGTAACCTTCCCGTTCTCCTGGTACCG
    TTCTTGTTGTGTCAAGCACTTGTACGCTGCTCTTCACCGCTCCCGCTTGTCGTTAACACCTGGCCTTT
    CAAGAATGCAACAGAGGCGGCATGGCGAGCACTCGCATCTGGCGGGTCTGCCCTCGATGCAGTAGAGT
    CCGGCTGTGCCATGTGCGAAAGGGAACAGTGTGACGGGAGTGTGGGATTTGGCGGCTCTCCAGATGAG
    TTGGGAGAAACGACCCTTGATGCGATGATTATGGATGGAACAACCATGGATGTGGGGGCGGTCGGTGA
    CCTCCGCAGGATAAAAAATGCTATTGGGGTGGCGAGGAAAGTCCTGGAACACACCACCCATACTTTGC
    TCGTTGGTGAGTCTGCAACAACGTTCGCGCAAAGCATGGGCTTTATAAATGAGGACCTGTCCACAACT
    GCCTCTCAGGCTTTGCACTCTGACTGGTTGGCGAGGAATTGCCAACCCAATTACTGGAGAAATGTTAT
    TCCCGATCCATCCAAGTATTGCGGGCCTTATAAGCCACCGGGAATCCTTAAACAAGATATACCTATTC
    ATAAGGAAACCGAGGATGACCGGGGACATGATACGATCGGAATGGTAGTGATCCACAAGACAGGGCAT
    ATTGCGGCGGGGACCAGCACAAATGGTATCAAATTCAAAATTCACGGTCGAGTCGGTGACTCCCCCAT
    ACCTGGTGCCGGGGCTTATGCTGACGACACAGCTGGAGCTGCTGCCGCGACAGGTAATGGTGATATCC
    TGATGCGGTTTCTCCCATCATACCAGGCTGTCGAATATATGAGACGGGGTGAGGACCCCACTATAGCC
    TGTCAAAAGGTTATTTCTCGGATACAGAAGCATTTTCCGGAGTTTTTCGGTGCAGTTATCTGCGCCAA
    CGTTACTGGTAGCTATGGAGCTGCATGTAACAAGTTGAGTACGTTTACCCAATTTTCATTCATGGTTT
    ATAACTCCGAGAAAAATCAGCCGACTGAGGAAAAGGTTGACTGTATTGACTACAAAGACGACGACGAT
    AAATA
    Figure US20230142867A1-20230511-P00125
    Figure US20230142867A1-20230511-P00126
    Figure US20230142867A1-20230511-P00127
    Figure US20230142867A1-20230511-P00128
    Figure US20230142867A1-20230511-P00129
    Figure US20230142867A1-20230511-P00130
  • The IDUA protein sequence is:
  • (SEQ ID NO: 36)
    MRPLRPRAALLALLASLLAAPPVAPAEAPHLVHVDAARALWPLRRFWRST
    GFCPPLPHSQADQYVLSWDQQLNLAYVGAVPHRGIKQVRTHWLLELVTTR
    GSTGRGLSYNFTHLDGYLDLLRENQLLPGFELMGSASGHFTDFEDKQQVF
    EWKDLVSSLARRYIGRYGLAHVSKWNFETWNEPDHHDFDNVSMTMQGFLN
    YYDACSEGLRAASPALRLGGPGDSFHTPPRSPLSWGLLRHCHDGTNFFTG
    EAGVRLDYISLHRKGARSSISILEQEKVVAQQIRQLFPKFADTPIYNDEA
    DPLVGWSLPQPWRADVTYAAMVVKVIAQHQNLLLANTTSAFPYALLSNDN
    AFLSYHPHPFAQRTLTARFQVNNTRPPHVQLLRKPVLTAMGLLALLDEEQ
    LWAEVSQAGTVLDSNHTVGVLASAHRPQGPADAWRAAVLIYASDDTRAHP
    NRSVAVTLRLRGVPPGPGLVYVTRYLDNGLCSPDGEWRRLGRPVFPTAEQ
    FRRMRAAEDPVAAAPRPLPAGGRLTLRPALRLPSLLLVHVCARPEKPPGQ
    VTRLRALPLTQGQLVLVWSDEHVGSKCLWTYEIQFSQDGKAYTPVSRKPS
    TFNLFVFSPDTGAVSGSYRVRALDYWARPGPFSDPVPYLEVPVPRGPPSP 
    GNPDYKDDDDK
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for IDUA and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • IDUA-DNA
  • (SEQ ID NO: 37)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG TCCAGGGAAGGTGTACTTTTCTGTCCACAG
    GTGTCCCGGTGAGCGGCTCACCTTAATTCTCTCAGGGCTTGAGGTGATTTAAGGCTGGCTTCAGTCCC
    CACAAAGACAATTCTGTTGCTGGTTTACCTTTGCTTTTAGGGGATAGAACTTTGGGGTCCCAAGAAAA
    ACCCAAAGTCTTTACCAGGCCCCCCGCCTGAGCCTCGACTTCTGTCCTGTGGCCACAGTGTGAGGAAC
    GAACGCAGCAGCGTGTCGCCATGTCTTGTGTGGCTGGGATACGCTGCAGGGGTGTGGAGTCGCCGCCC
    CAAAGCGGGCCCCGCCCCCTCCCGGCTTGTCCCTGCAGAGCCCCGAGGCCTCAGAGGCTCCTCAGAGC
    TCCCAGTGCCCTGGACGGTTCCGTTCAGCGCTCTCCAGTTCTGATGGTTGCTCTCAGGAGGCGGCTGG
    TCTGAAACAATTCAGTTTCCATTGCTGTGTCCTGCACCACGGTGTGGGACTGGTGGTTTCTTACTGTT
    TTTCTCAGTTCTCTGACGCTTTGAGGAATTGTTTTTAAAGCGAGTTATCCATCATTTTAGTTGTTTTC
    TCTGGGGTCGTTGGACTGAGGGTCTTACCCCGCTGTTCCTGGAGACAGTTCTGTACTTGCATGGTTTC
    TCTCGCCTTTCCCTCCCTTTCTTACTTTTTAGTTTCTTAATGAATTTTTTTGTATTATGTTTTTCCTC
    CTTGCTAATTTGGAATATATACACACTATTTCACTTTTAAATGGCTAATCACTTTTAATGCAATTTCA
    ACATAAGAAGACCTAATGTTAAGCAATATTTTAACCCCTCTCCCCAAAATAAAAAGACCCAGAACACT
    GTAACTCTAATTATCCTTCTCCCAACTTAGAGAGGATTATTGTCCAGTATTTTCGTTTTGTTAGTAGA
    CTGTTCTTTTTTTCTTTTCTTTTTTTAGAGATGGGGTCTCATTTTGTCAACCAGGCTGGAATGTAGTG
    GTGTGATCATGGCCCGTCTCAGCCTTGAACTCCTAAGCTCAAGCAATCTCCCACCTCAATTTCGTGGG
    TAGCTGGGCCTCACAGGCATATACCATCGCATCTGGCTAAATTTTTTTTTTGTACAGATGGGGTCTTG
    GTACATCTCCCAGGCCGATTTCAAACTCCAGGCCTCAAGTGATCCTCCCGTGTTGGCCTGCAATTCCA
    AAGTGCTGGGATTGCAGGTGTGAGCCACCGCACCAAGCCTGTTCTTTCCATTATTATTTTTATACAAT
    GTTTGCTTGGATTCACACATGCATACATGGACTACACATGTACTACAGTTGTCCCATGGTATCCTCGG
    GGTATTGGTTCCAGGACCCTCCCAGATACCAAAATTTGCAGGTGCTCAAGTACTTGATATAAAATGGC
    ACAGTATTTGCACATTACCTACGCACATCCTACTGTGTAGTTTAAATCAAGATTACTTATAGTCCCTA
    AAACAATGTAAATGCTCTGTAAATAGCTGTTATACTGTATTGTTTTAAAATATGTATTATTTCGTATT
    GCTTTGGCTTTAAAACAATATTTTCAGTCTGCTTGGTTGAATCCATGGATCTGAAGCCAAGTGTAAGG
    AGGGCCCAGTGTGCTCTTCTGGGGTAATTCTCCTTCCTGCTAAAGCGCACGCTTTACTCAGGAGGCTG
    GGGTGAGAAAATCGCTGAAGCCCCGGAGATGGAGGTTGCAGTGAGCTGAGATCGCGCCACTGCACCTC
    AGCCTGGGCGACAAAGCAAGACTCTGTCTCAAAAACACACAAAAACAGAGAAAAACAAGACAGTAATG
    GCTCAACTCACATAGCACCAACGGGCGAAGCGTTCTTCTGAGCGCTTTCCGAGTCATCGGTCCTCAGA
    GCAGCCCCTGAGGCCCGCAAGGAAGCGGGGCTCCAAGCCCTGCCGTGCTCCCGGCTCCCCGAGGCTCC
    CCGAGGCCACCCAACCCCTCCCACCCGGCCATCGCCCCCTCACCAAGGCCCCGCCCCGCGGCGGCGGT
    CACATGGGGTGCGCGCCCAGACTCCGACCCGGAGGCGGAACCGG
    Figure US20230142867A1-20230511-P00131
    Figure US20230142867A1-20230511-P00132
    ATGAGGCCACTGAGGCCAAGAGCTGCACTTCTGGCGCTCTTGGCGAGTCTTCTCGCCGCA
    CCTCCCGTTGCCCCTGCGGAGGCACCACATCTCGTTCATGTTGACGCTGCGAGGGCTTTGTGGCCACT
    TAGGCGATTCTGGAGAAGTACGGGCTTCTGTCCACCGCTGCCGCACTCACAGGCAGACCAATATGTCT
    TGAGCTGGGATCAGCAACTCAACTTGGCGTATGTGGGAGCTGTCCCGCATCGCGGTATTAAGCAAGTG
    CGAACGCATTGGTTGCTTGAGTTGGTAACCACGAGAGGTTCCACTGGGCGAGGGCTCTCCTATAACTT
    CACACACCTCGATGGGTATCTGGACCTGCTGCGAGAAAATCAACTCCTTCCCGGGTTCGAACTGATGG
    GCTCCGCATCTGGCCACTTCACGGACTTCGAAGATAAGCAGCAGGTCTTCGAATGGAAAGATCTGGTG
    TCAAGTCTGGCTCGCAGGTATATTGGACGATACGGTCTGGCCCACGTCAGTAAGTGGAATTTTGAAAC
    GTGGAACGAACCAGACCACCATGACTTTGATAACGTATCTATGACTATGCAGGGATTTCTCAATTACT
    ATGACGCTTGCTCTGAAGGACTCCGGGCTGCAAGTCCAGCTCTCCGCTTGGGCGGGCCTGGGGATAGT
    TTCCATACACCTCCCCGATCCCCGTTGAGTTGGGGGCTCCTCCGACATTGTCACGACGGGACGAACTT
    TTTCACCGGTGAAGCGGGAGTTCGGCTCGATTACATCTCTCTGCATCGGAAGGGCGCAAGAAGCTCCA
    TAAGTATCTTGGAGCAAGAGAAAGTCGTGGCGCAGCAGATCAGACAGCTCTTCCCGAAGTTTGCGGAT
    ACACCTATATACAATGATGAGGCTGACCCACTGGTCGGCTGGAGCCTGCCCCAGCCGTGGAGGGCTGA
    CGTAACATACGCCGCGATGGTGGTTAAGGTAATAGCCCAGCACCAGAATCTTCTGTTGGCAAATACCA
    CGAGTGCATTCCCTTATGCTCTGTTGTCAAATGATAATGCCTTCTTGTCTTACCACCCGCATCCTTTC
    GCTCAACGCACGCTGACCGCCCGCTTTCAAGTAAATAACACTCGCCCACCCCACGTGCAGCTCCTGAG
    GAAGCCCGTTCTGACTGCTATGGGGCTCCTCGCGCTGCTGGATGAGGAGCAGTTGTGGGCTGAGGTAT
    CTCAAGCCGGGACTGTATTGGACTCCAATCACACTGTGGGGGTTCTTGCTAGCGCGCACCGCCCTCAA
    GGACCTGCCGACGCCTGGCGCGCTGCAGTGTTGATATATGCCAGTGATGATACGCGAGCCCACCCGAA
    TAGGTCTGTTGCCGTGACGTTGCGCCTCAGAGGCGTTCCCCCTGGTCCAGGACTCGTCTATGTTACTC
    GATATTTGGATAACGGCCTCTGTTCCCCTGACGGTGAATGGCGGAGATTGGGGAGACCTGTGTTTCCG
    ACAGCGGAGCAGTTCCGGCGCATGCGAGCGGCAGAAGACCCTGTTGCAGCTGCTCCACGACCGCTCCC
    TGCAGGCGGCAGGCTTACTTTGAGACCAGCTTTGAGGTTGCCGAGCCTCCTGTTGGTCCATGTATGCG
    CGCGGCCAGAGAAGCCACCCGGGCAAGTAACCCGGCTTCGCGCCCTCCCGCTCACCCAAGGCCAGCTC
    GTGCTTGTATGGAGCGACGAGCATGTTGGATCTAAGTGTTTGTGGACTTACGAGATACAATTTAGCCA
    GGATGGAAAGGCGTATACCCCTGTGAGTCGGAAGCCTTCTACATTCAACCTCTTCGTGTTCAGCCCCG
    ACACAGGTGCTGTTAGCGGGTCCTATCGGGTGCGGGCGCTGGATTACTGGGCACGGCCCGGGCCGTTT
    TCCGACCCTGTTCCCTACCTGGAAGTTCCCGTTCCGCGAGGCCCACCTAGCCCTGGGAATCCAGACTA
    TAAGGACGACGACGATAAATA
    Figure US20230142867A1-20230511-P00133
    Figure US20230142867A1-20230511-P00134
    Figure US20230142867A1-20230511-P00135
    Figure US20230142867A1-20230511-P00136
    Figure US20230142867A1-20230511-P00137
    Figure US20230142867A1-20230511-P00138
  • The GDNF protein sequence is:
  • (SEQ ID NO: 38)
    MKLWDVVAVCLVLLHTASAFPLPAGKRPPEAPAEDRSLGRRRAPFALS
    SDSNMPEDYPDQFDDVMDFIQATIKRLKRSPDKQMAVLPRRERNRQAA
    AANPENSRGKGRRGQRGKNRGCVLTAIHLNVTDLGLGYETKEELIFRY
    CSGSCDAAETTYDKILKNLSRNRRLVSDKVGQACCRPIAFDDDLSFLD
    DNLVYHILRKHSAKRCGCIDYKDDDDK 
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for GDNF and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • GDNF-DNA
  • (SEQ ID NO: 39)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG AAACCCTCTTAAGACGGAAGGGAAATCTAG
    TGCCTCTGTGGCTTCCTGCTGGCGGGCACCGCGATTTGGAATGGCGCATTAACCCCGCGCCTCCATGC
    TCCTCCGGTCGGACCGCCAGCCATTTCGCACGCACCCGGGAGCTGGCACCGGCGGGGGGCTTGAGGGG
    GGAAGCTCGCGTTCCCCAGGTCCTAGCTGCCAAAGTACCTTTCTGGGCTCATTTTGCATGGCCTGGTG
    CAGTTTTCCTGTGTCTGCACATCGCGACCCAGAACCTAGCTTTTTCCCGAGTTTGCAAACCAGCCCGC
    GAGGCAAGAGGCGCTCGGTGCTGCAGGTCACTAGGAGTTTCTAGTCCCTACACCCTCTCGAGCCCAAC
    AGCTGCATAGCGAACAAACAGACGCTCAGGAGATGTAAAATGCATTAGTCTCTGCAGAGGTGGGAGGT
    GGCGGCCGAATTAAAGGCTTCCCCGGGTTGCCTGCACCGGGACAGGGAGTGGGGTTCCGGGTAGAGAT
    GAGGACTGGAACCCTGGAATGGAGGCGGGGGTGCCTGTGAACTAATGGCTAGGAAAGGCACTGAGGTT
    TTTGCATTAACCTGGGTTTTGCATTTCCCTCCCGCATGGGAAAAAAAAAATCGAGTTTTTCCTAAGAA
    CCGTTTTATGGAGCCGGTCGAGGGGATAGGATAATCGACTGCACGAATCTCGCAGATTCCGCTTGAGG
    AGATTCTCTCTAGGTCACTAGTGCCCTGGAGACGAACCCTGGGATTAGGAAGGCACTGGATAACACTG
    CACCCCAAGATGCCTCGGTTCTTCCCCGTCTCCTCATTGGTTGGAACTCGCCCCCAAATTTACAGCCC
    CTTTTTCTAACCCACCCCCAGCTGCGTGGCCCTCCTTTGTAGGGGTGTGAGGATTGAGAAGCCAAACA
    GAGCCCACCTCGGGCTGAAAAGAGCTGAACCCCCTACTCTGCGCCGTACCACGGTCTAGGCCTTCCAG
    TGCCAGAGCACC
    Figure US20230142867A1-20230511-P00139
    ATGAAGCTCTGGGATGTCGTGGCG
    GTGTGTCTTGTCCTTCTCCATACTGCGAGTGCGTTTCCACTGCCAGCTGGGAAAAGACCCCCAGAGGC
    CCCTGCAGAAGACCGATCTCTTGGGAGGAGGAGAGCACCGTTCGCTCTGAGCAGCGACAGTAATATGC
    CGGAGGATTATCCCGATCAGTTCGACGACGTAATGGATTTTATCCAGGCAACGATAAAGCGGCTGAAA
    CGGAGTCCGGACAAACAAATGGCAGTGCTCCCGCGCCGGGAACGAAATAGACAGGCCGCCGCGGCAAA
    TCCGGAAAATAGTAGGGGCAAAGGAAGACGGGGGCAGAGAGGCAAAAACAGGGGCTGTGTATTGACTG
    CCATTCATTTGAACGTAACGGACCTGGGGCTTGGTTACGAGACTAAAGAAGAACTGATATTTAGATAC
    TGCAGTGGATCTTGCGATGCCGCTGAAACCACCTATGACAAGATTTTGAAGAATCTTTCTCGAAACAG
    AAGACTGGTTTCTGATAAGGTTGGTCAGGCGTGTTGCCGACCCATCGCGTTTGACGACGACCTTAGTT
    TTCTCGACGATAATCTTGTCTATCATATCCTTCGCAAGCACTCCGCTAAGAGGTGCGGATGCATTGAT
    TACAAAGATGATGATGACAAGTA
    Figure US20230142867A1-20230511-P00140
    Figure US20230142867A1-20230511-P00141
    Figure US20230142867A1-20230511-P00142
    Figure US20230142867A1-20230511-P00143
    Figure US20230142867A1-20230511-P00144
    Figure US20230142867A1-20230511-P00145
  • The GLB 1 protein sequence is:
  • (SEQ ID NO: 40)
    MPGFLVRILPLLLVLLLLGPTRGLRNATQRMFEIDYSRDSFLKDGQPFR
    YISGSIHYSRVPREYWKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQF
    SEDHDVEYFLRLAHELGLLVILRPGPYICAEWEMGGLPAWLLEKESILL
    RSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPVITVQVENEYGSYFACD
    FDYLRFLQKRFRHHLGDDVVLFTTDGAHKTFLKCGALQGLYTTVDFGTG
    SNITDAFLSQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLY
    DILARGASVNLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDL
    TEKYFALRNIIQKFEKVPEGPIPPSTPKFAYGKVTLEKLKTVGAALDIL
    CPSGPIKSLYPLTFIQVKQHYGFVLYRTTLPQDCSNPAPLSSPLNGVHD
    RAYVAVDGIPQGVLERNNVITLNITGKAGATLDLLVENMGRVNYGAYIN
    DFKGLVSNLTLSSNILTDWTIFPLDTEDAVCSHLGGWGHRDSGHHDEAW
    AHNSSNYTLPAFYMGNFSIPSGIPDLPQDTFIQFPGWTKGQVWINGFNL
    GRYWPARGPQLTLFVPQHILMTSAPNTITVLELEWAPCSSDDPELCAVT
    FVDRPVIGSSVTYDHPSKPVEKRLMPPPPQKNKDSWLDHVDYKDDDDK 
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for GLB1 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • GLB1-DNA
  • (SEQ ID NO: 41)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG AATCAATACACTAATACAAAACTATTATTT
    AATATTTAGACCGTGTACCATTATCAAAATCAGAAAGTTAACATTGATGCGGCCGGGCACAGTGGCTC
    ATGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGGCAAATCACTTGAGGTCAAGAGCTCCAGACC
    AGCCTGGCTAACATGGTGATACCCCATCTCTACTAAAAATACAAAAATTAGCCAAGCCTAGTGGTGGG
    TGCCTGTAATCCCAGCTACTTTGGAGGCTGAGGTGGGAGAATCACTTGAACCTGGGAGGCGGAGGTTG
    CAGTGAGCCGAGATTGTGCCACTGCACTCCAGCCTGGGTGACAGAGTGAGACTCTGTCCCCCCACCCC
    CCCAAAAAGGAAATTAACATTGATGCATGGTACTAATCTAACCTACAAGTCTAATTCTGATTTCATCA
    ACTGTCCCACTAACATCCTGTATATACCAAGCTTCTTATCGTCATCATCTCTGGTAATTTCCCCAATT
    ATCCAGTGAATTTGATATTATTAGCCCTGCTTTACCAACAAAGAAACTAAGCTCAGAGAGGTTAAGTA
    ATTTGCCCACAGTTACACAGACAGTAAGTGGCAGAGCTGGAATTCCAACTCAAGTGTACTTCCTCCAG
    AGCCCTTGTTCTTTAATACTGAACTAGAATTGCAGAGCTTGGAAAGGACTTCACAGTGCAAGGACACG
    GAATTGGTACTACAGGCATGAAAAGAAGCCAGGCACGGTGGCTCATGTCTGTAATTCCAGCATTTTGG
    GAGTTCCAGGCAGGTAGATTGCTTGATCCCAGAACTTCAAGACCTGCCTGGGAAACGTGGTGAGACCC
    TGTCTCTACAAAAAATACAAAAATTAGCCGGGTGTGGTGGCGCACGCCTGTAGTCCCAGCCACTTGGT
    AGGCTGAGACGGGAGAATTGCTTGAGCCCAGGAGGCCAAAGCTGCAGTAAGCCGTGATTGCGCCCCAC
    TCCAGCCTGGGTGACAGAGTGAGACCCTGTCTCAAAAAACACAAAAACAAAACAAAAAAACAACCCCC
    CAAACCAAAAAACAAAAACCTAAGAAAAGCCAGTAACTGATACTTTCTCAATCTCTCAATGAAATGTC
    CTTTCTGATACTTTCTCAATCTCTCAAGGACATGTCCGGTAGGAAAGGAGAAAGGAGTGAACAAATAG
    AATTTTGGCTACTTTGTTATACAATGTAAAAAGGCTTTTTGGAACACCAAGGCATAAACTAAGGTTAT
    TTTAAAAAAGAAAATTTTTTTTTGATACGGAATTTCGCTCTTATTGCCCAGGCTGGAGCGCAATGGCT
    CGATCTCGGTTTACTGCAACCTCCACCTCCCGGGTTCAAGCGATTCTCCTGTCTCAGCCTCCTCAGTA
    GCTGGGATTACAGGCGCCTGCCACCACGCCCGGCTAATTTTTGTATTTTTAGTAGAGACGGGGTTTCA
    TCATACTGGTCAGGCTGGTCTGGAACTCCTGACCTCAGGCGATCCGCCCGCCTCGGCCTCCCAAAGTG
    CTGGGATTACAGGCGTGAGCAGGATTTTCATTCTAACAAGTTCCAGGTGAGTTGATACAGTGGCTCCA
    GGGACCGACCACATTTTGCTAACCCCCGGCTTAGAGTTATTCAAAGAGCCCGTATATGAGACGCGGAT
    TCCATCTAGGGCGTTTAGGTTTAATGATTAACAATTTCCCTCTTCTGCTCTCTCAAGGCAGCCAGGGA
    ACAGGGAGACCATGATTCATGTCCAATCCCCGAGGCGCGTTATCAAGCTGCTGAAAGCAGGCCCTCCG
    GACTGCAGTTCCAAAGGGTCCCTTCCCAGGGAAGACGCCTGCAAAACCCAGATAGTACTATCCTGGAG
    TCACGCGGCGGCGCGCAGCCTCCTAGCCGCCCCCACCCGCCCGGCTCGG
    Figure US20230142867A1-20230511-P00146
    Figure US20230142867A1-20230511-P00147
    ATGCCGGGGTTCCTGGTGAGAATACTTCCCCTGTTGCTTGTGTTGCTCCTGCTTG
    GTCCCACCAGGGGTTTGAGAAACGCGACACAAAGGATGTTTGAGATTGATTATAGTAGGGACTCATTC
    CTCAAAGACGGGCAACCATTCCGCTACATCTCAGGCTCCATTCATTACAGCCGAGTACCTCGCTTCTA
    TTGGAAGGATCGGCTGTTGAAGATGAAAATGGCCGGGCTCAACGCGATCCAAACTTATGTCCCCTGGA
    ATTTTCATGAACCGTGGCCTGGCCAGTACCAGTTCTCCGAGGACCACGATGTGGAATACTTTTTGAGA
    CTTGCTCATGAGCTTGGCTTGCTTGTAATTTTGCGGCCGGGGCCTTATATATGTGCAGAGTGGGAGAT
    GGGCGGACTTCCAGCGTGGCTTCTTGAAAAAGAATCAATTCTGCTCCGGAGCTCAGACCCTGATTATC
    TTGCCGCTGTTGACAAATGGCTGGGGGTCTTGCTCCCTAAAATGAAACCACTCCTTTACCAGAACGGA
    GGTCCGGTAATTACCGTTCAGGTTGAAAACGAATACGGTTCCTACTTCGCCTGCGACTTCGACTACCT
    CCGCTTTTTGCAAAAGCGATTTCGGCACCACCTTGGAGATGACGTCGTCTTGTTTACGACTGACGGGG
    CTCACAAGACATTTTTGAAGTGTGGCGCCTTGCAAGGTTTGTATACCACCGTTGACTTTGGCACCGGT
    AGTAATATCACTGATGCATTTCTCTCTCAGAGAAAGTGTGAGCCAAAGGGTCCTTTGATCAATTCAGA
    GTTCTATACTGGATGGCTGGACCACTGGGGACAGCCTCATAGTACGATCAAGACTGAAGCCGTCGCGT
    CTAGCCTTTACGATATCCTCGCACGCGGAGCCTCCGTTAATCTGTACATGTTCATCGGTGGGACGAAC
    TTTGCGTACTGGAACGGAGCAAATTCTCCGTATGCTGCACAACCCACTAGCTATGATTATGATGCTCC
    TCTGAGTGAGGCGGGCGACCTTACAGAGAAATATTTCGCTCTCAGGAACATCATCCAGAAATTCGAGA
    AAGTTCCAGAGGGGCCCATCCCACCTTCAACACCAAAGTTCGCCTATGGGAAAGTCACACTGGAAAAA
    TTGAAAACAGTAGGAGCAGCTCTCGATATACTGTGCCCTTCCGGTCCGATAAAGTCTCTGTACCCACT
    TACATTCATTCAAGTTAAACAGCATTATGGTTTTGTGTTGTACAGGACTACCCTTCCCCAAGATTGTT
    CAAATCCTGCACCGTTGAGCTCACCACTTAATGGGGTGCATGATAGAGCGTATGTCGCGGTGGACGGT
    ATCCCCCAAGGGGTTTTGGAGCGAAATAACGTCATTACGCTCAATATAACTGGTAAGGCTGGGGCCAC
    CCTGGATTTGCTTGTAGAGAACATGGGGCGGGTTAACTATGGCGCGTACATCAACGATTTCAAGGGTC
    TGGTGTCAAACCTTACGCTGAGCAGTAACATACTGACTGATTGGACTATCTTTCCTCTTGATACGGAG
    GACGCTGTTTGTTCTCACCTTGGGGGCTGGGGACATCGCGACTCAGGGCATCATGATGAAGCTTGGGC
    ACATAACTCCTCTAACTACACGCTCCCCGCCTTTTACATGGGCAACTTCAGTATACCAAGTGGCATCC
    CCGACCTCCCTCAAGATACCTTTATTCAATTCCCAGGCTGGACAAAGGGTCAGGTCTGGATCAATGGT
    TTTAATTTGGGAAGATATTGGCCAGCCCGGGGACCCCAGCTGACTTTGTTTGTGCCCCAACATATTCT
    GATGACGAGCGCTCCCAACACTATAACCGTTCTCGAATTGGAGTGGGCCCCCTGTTCCAGTGACGATC
    CTGAGCTCTGCGCAGTCACCTTTGTAGACAGGCCAGTCATTGGTTCCTCTGTGACTTATGACCATCCA
    TCCAAGCCTGTCGAGAAGAGATTGATGCCGCCTCCTCCGCAGAAGAATAAGGATAGCTGGCTCGACCA
    TGTAGACTACAAAGACGACGACGATAAGTA
    Figure US20230142867A1-20230511-P00148
    Figure US20230142867A1-20230511-P00149
    Figure US20230142867A1-20230511-P00150
    Figure US20230142867A1-20230511-P00151
    Figure US20230142867A1-20230511-P00152
    Figure US20230142867A1-20230511-P00153
  • The PMP22/MFN2 protein sequence is:
  • (SEQ ID NO: 42)
    MLLLLLSIIVLHVAVLVLLFVSTIVSQWIVGNGHATDLWQNCSTSSSGN
    VHHCFSSSPNEWLQSVQATMILSIIFSILSLFLFFCQLFTLTKGGRFYI
    TGIFQILAGLCVMSAAAIYTVRHPEWHLNSDYSYGFAYILAWVAFPLAL
    LSGVIYVILRKREDYKDDDDK 
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for PMP22/MFN2 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • PMP22-DNA
  • (SEQ ID NO: 43)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG GGAGTTTTCCTCACCTGACTCTGGGAAGGG
    AAGAGAGTCTGGCCCAGATGGGTAAAGACAAGTGTGTTGGCATCTGGTAGACAGGGCCAGTGGCACAC
    AACAGTGAACAGCTCTACTGCTGACAGCCAACAGGGACTTCGGGGTTTGGCTGCTGGAGATGGAGGAC
    CTGGGCAAAAGAGACGTGAGGCCAAGAGAGCAGGTCTCAGAATGTGGAGCTCATGGAAGGAGAGAGGG
    AGGGCTAATCAGACTGCTCCTATCTGGCCAGAAGCCATGCCCCACACTCACCAGGCCCTACCCCTTTT
    CCTCTGAATCTTTAGCTCTTAGATCAGAAATAATCTTACGTGTCTTCAGGCTTGATCCTGGCCTGACG
    GACAGCAGCCTCTGTCGTGTAGCCTTTGGAGCTGATCATGGCAGTACCCAGGAAAAGACTGTCTGGAG
    AGGCTGCAAAAACCATGCTCACTGTTCCCAGTCCCCGTACAGACCATGTCAGAGGAGGACTGAGGGAG
    GCTCCTCTCTCTCTCTCTCTGTCACACACACGCACACGCGCACACACACACACACAGATACACGCGCA
    CACACACACACCGTTTTGCCTGGCAGAACAGACTTATAACTTGCATAATACTGCAGCAGCCCACACTG
    TACCTCATCTCCATTTGGGTTCCACAACAGAAGGGAGCTGGAAGGAAATCAGCGACAATCTAACTCCA
    CATGACAACCTGGCTGGACAACGAAACATCCAGCGGGGAGAAAGACACTGGAAATTACCCATACCTGA
    GCCCCATCCCAAAACACCTGAGTCAGCTTCTCTGACCATGGCCCAGGCTAGGTCATTTGCAAAGCTCT
    TGAGTGATTCTAAAGAACCACTGCTGTGAAAAATAAAAATGATTTCAGCAAAGAGAGAAAGGTGAAAC
    TATTTTATTGGCACTGGCTGTGATGCTGGCTGTGAGCAGATGCTGGCTGTGGGCAGAACAGGGGAGAG
    GGCCATTGAGAGGCCCTTGTGACTGGAGAGTCCAATCAAACGTAAAGTGTGTGACAGGGAGGGACTCC
    TCCAAATTCTGCAGGTCTCTCCAGACTGGGGATCCCTAACACAAGAGGAAGTCTTATGCCCCAAGTGC
    CAGAATTCCAGGAAACTCATGCAATGAAACCAGTCCTCTTGTGTCAGAGTTTGCAGCTAGAAGCGGTT
    CAAGGGATGGAGGACTCTCTGGGAGCTGGAGTGGTACGCTGCAAAGAACCTAAATATGCCATAACTTC
    CTTTACTTCCCTCTCCAAGAATTTCCAATTGCTCTTCCAGCCTATTCTGCCTGACAGGTTAGAAAGTT
    TTCAATAGTTTTCAATAGAGGTTCGTTTACTTGCATTGAATTATCTCATGAAGGTCAGCTTGATTCGG
    GCACATTAAAGACAGAGTCGGAAGAACACATTTCCCTAAGTTTTATTCTAATGTACAACACATGTGAT
    CATTTGCGATGGGGTCTAGGCAGACAAGGCAGGTAACAGGGAGTCCTTCCAACCAGGGGTTGGAACAA
    AGGAGGCTTGATTTGGTGACTCTTGAGACATTTGGCTCACTGCTGTGATGCTGTGAGAGATTAGCTGT
    GCAATGTTTGGGCTCCTTAAAAGGAAGTTTATTTAAAATAAAACTTACCTGCACGTATGTAACACTGT
    AGACACAGATCCTTCCCACAGTATATTTAATCTCTGCAGAATTCACTGGGAGGGGAGGGGAGCCAGTG
    GGACCTCTTGGCTATTACACAGGTTGGCACTTCCAGAGAGAACAGTCTTGGCATCACAGGCTTCAGGC
    ATACTCAAAGCTCTTCTCCCTTCTGATTCCAGTTTCTCCATGCCCTGCAGGGCCTCTTGGGATTATTG
    TATTCTGGAAAGCAAACAAAGTTGGACACTGTCTCTTTAAATAATAGAGGCTGAGAACCTCTCAGGCC
    ACCATGACATATCCCAGCATTGGACCAGCCCCTGAATAAACTGGAAAGACGCCTGGTCTGGCTT
    Figure US20230142867A1-20230511-P00154
    Figure US20230142867A1-20230511-P00155
    ATGCTTTTGCTGCTTCTCTCCATAATAGTGTTGCACGTAGC
    AGTCTTGGTCCTGCTCTTTGTTTCTACCATTGTATCACAGTGGATTGTGGGTAACGGCCACGCTACCG
    ATCTGTGGCAAAATTGTAGCACGTCCAGCTCTGGTAATGTACACCATTGTTTCAGCAGTTCACCTAAC
    GAGTGGCTCCAGTCAGTCCAAGCCACAATGATCCTGAGCATAATTTTTAGCATCCTCAGCCTTTTCCT
    GTTCTTCTGCCAGCTCTTTACATTGACAAAGGGTGGACGGTTCTACATTACAGGGATATTCCAAATTC
    TTGCGGGTCTGTGCGTGATGTCCGCCGCCGCCATCTACACCGTTAGGCACCCTGAATGGCATCTTAAC
    TCCGACTATAGTTATGGTTTCGCCTATATCCTCGCATGGGTGGCTTTCCCTCTTGCCCTGCTTAGCGG
    CGTAATCTATGTGATTCTCAGGAAGAGAGAAGATTATAAGGATGACGATGACAAATA
    Figure US20230142867A1-20230511-P00156
    Figure US20230142867A1-20230511-P00157
    Figure US20230142867A1-20230511-P00158
    Figure US20230142867A1-20230511-P00159
    Figure US20230142867A1-20230511-P00160
    Figure US20230142867A1-20230511-P00161
  • The MECP2 protein sequence is:
  • (SEQ ID NO: 44)
    MAAAAAAAPSGGGGGGEEERLEEKSEDQDLQGLKDKPLKFKKVKKDKKE
    EKEGKHEPVQPSAHHSAEPAEAGKAETSEGSGSAPAVPEASASPKQRRS
    IIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSK
    VELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQKPPKKPKSPKAPGT
    GRGRGRPKGSGTTRPKAATSEGVQVKRVLEKSPGKLLVKMPFQTSPGGK
    AEGGGATTSTQVMVIKRPGRKRKAEADPQAIPKKRGRKPGSVVAAAAAE
    AKKKAVKESSIRSVQETVLPIKKRKTRETVSIEVKEVVKPLLVSTLGEK
    SGKGLKTCKSPGRKSKESSPKGRSSSASSPPKKEHHHHHHHSESPKAPV
    PLLPPLPPPPPEPESSEDPTSPPEPQDLSSSVCKEEKMPRGGSLESDGC
    PKEPAKTQPAVATAATAAEKYKHRGEGERKDIVSSSMPRPNREEPVDSR
    TPVTERVSDYKDDDDK 
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for MECP2 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • MECP2-DNA
  • (SEQ ID NO: 45)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG TCCTTGTTTCTTCCCTTGCTCTATGCCCTA
    CATCCAATCCTTTTGTTCAGTTCTGAAAACATATCTTGAATCTCACAACTTCTCAGAGCTGCACTGCT
    GCCACCCTAGTTAAAGCTACCATCAACTCTTTCTCCTAGACTGTTCCAGGGCCTTGCAACTAGCCTTG
    TGCTGTAGTTTTGTTTCATCACGTCCAGTTCTCCACTCTACACCTGCAACATAGATCAGACAGCTCCT
    GGCTCAAAATCCTCTGAGGGCTTCTCATCTTAGAATAAACTCTCGGTTCTGGCCGGGTGCGGTGGCTT
    ACGCCTGTAGTCCCAGCACTTTGGGAGTCCGAGGCGGGCGGATCACTTGAACTCAGGAGTTTGAGACC
    AGCCTGGGCAACATGGTGAACTCCCATCTCTATCAAAAATACAAAAACTTAGCCAGGCGTGGTGGTTC
    GCATCTGTGGTCCCAGCTACTTAGGACGCTGAGGAGGGAGGATCGCTTGAGCTCAGGGTGGACGTTGC
    AGTGAGCCAAGATTGCGCCACTGCACTGCAGCCTGGGTGACAGAATGAGACCCCATCCCCACCCCCCC
    CAAAAAAGAATGAACTCCCAGTTCTCATAGTGGCCCCAGCTGCCTTTCCAATCACATTCCCTACCACT
    CTCCAGCAACACTGACTTCCTCGTTAGTCCCCAACATGCCAGGCATAGTCTCTCCTCATGTCCTTTGA
    ACTTGCCTGGAATGTTCTTTCCCCAGATATTCATATGAGGGAGTAAAATGAGGGTGAAAACCAGCAGA
    TATCTAAATAGCACCCCCTTCACTTAGTTTATCTTTCTCAAAGCCCTTATCACTATGTGAAATGATAT
    ATTATACTTATTTGTATGCTAGTATGAATCTTCCCGGCAAGAATGTTAGTTTGCTGTCTGTTCAGTAC
    CGTGCATCCAGAGCCTGGAAGAGTGCCTGGCACATAGCAGGTAGTCAATAAATGAATGGGGGCAAGCA
    GCCAAATCAGAATCAGGTTTTCTTGCTAAGCATAGAACTAACAGAAGGATCATTGAATGGATTGGATA
    ATGACTGGCATCAGGGTAAGGTCCCCTTAACAAACACTCCTGTCCTGAACACCTGGTTAGCTAACAGT
    TTTCTCATACTCTTATTTTCCCAAAACACAATTGCTGGATCTCAGCTCCAAATCAACTCTTCTAGGAA
    AGTGAAAAATTGCTGGATCTCAGCTCCCAAATCATATCTTCCAGGCAGAGCTAACATTGCCCCTTATT
    CACACCTCCACCAAACCATCTGATCCAACAGTGACAGGTGTCACGAGGCCTTGGCATGCACTCTCTTC
    CCCCGCCAGAGTTCTGCGAAAGCCAGGGTTGCGATTTGTTGTCAGTTTATTCCCCGCCTCTATGAGAG
    TGTGAGCACTGGGCAGGCTCGGATGAAATAATGCATTGAGTAGGCCTCTGAAACCAAGGCCCCTCAGC
    TGGGGCAACGTCAGGCTCCAGGGTGGGCAACTTTGCTGCTTCTGCCGAAGATAGTGATATTGAGAAAA
    TGTGGGTGCAATGAAACGCTTATTGCAGCGCACTCGGTGCATCTGTGGACAGAGGGTCAATCGCCCCT
    CAGAGCAGCGCAAACAGGCGTCCCAAGCCTAGGCCTTCACTTGCCCCAGCATCCGCAAGGGTCCATTA
    ATCCTTAACATTCAAATTCCGCCCACTAAACCAGTCCCTCCGCGCCCAAGCCGCCTCTTTTCCCCAAA
    CGACGGCCGAAAGCAGCCAATCAACAGCTGGAGGGGTCCGCCCCCTTTTCCCTGGCCGAAATGGACAG
    GAAATCTCGCCAATTGACGGCATCGCCGCTGAGACTCCCCCCTCCCCCGTCCTCCCCGTCCCAGCCCG
    GCCATCACAGCCAATGACGGGCGGGCTCGCAGCGGCGCCGAGGGCGGGGCGCGGGCGCGCAGGTGCAG
    CAGCGCGCGGGCCGGCCAAGAGGGCGGGGCGCGACGTCGGCCGTGCGGGGTCCCGGCGTCGGCGG
    Figure US20230142867A1-20230511-P00162
    Figure US20230142867A1-20230511-P00163
    ATGGCCGCCGCTGCTGCCGCGGCTCCCAGCGGCGGCGGT
    GGAGGTGGTGAAGAAGAAAGACTTGAGGAGAAATCAGAGGACCAGGATCTCCAAGGTCTCAAGGATAA
    GCCGTTGAAGTTCAAGAAGGTGAAGAAGGACAAGAAGGAAGAGAAAGAGGGCAAACACGAACCGGTCC
    AACCTAGCGCTCATCATTCCGCCGAACCAGCAGAAGCGGGCAAAGCAGAAACGAGTGAAGGATCAGGT
    AGCGCTCCTGCAGTTCCGGAAGCGAGCGCTTCACCGAAACAGCGCCGGTCTATCATCCGGGATCGGGG
    ACCTATGTACGACGATCCAACGCTTCCGGAGGGATGGACGAGAAAGTTGAAGCAACGCAAGTCCGGAA
    GAAGTGCTGGCAAGTACGATGTGTACCTGATTAACCCCCAAGGAAAGGCTTTCAGGTCTAAAGTAGAG
    CTCATAGCTTATTTTGAAAAAGTGGGCGACACCTCTCTCGACCCTAACGATTTCGATTTCACCGTCAC
    TGGCAGAGGCAGTCCTTCTAGACGCGAACAGAAACCCCCTAAAAAACCCAAGAGCCCCAAAGCCCCCG
    GAACGGGACGAGGTCGCGGCCGCCCAAAGGGTAGCGGAACAACAAGACCAAAAGCGGCTACATCCGAA
    GGTGTGCAAGTTAAGCGCGTGCTGGAAAAAAGCCCAGGAAAGCTCTTGGTCAAAATGCCATTCCAGAC
    CTCACCTGGTGGCAAGGCAGAAGGTGGGGGCGCGACAACGTCCACCCAGGTAATGGTGATCAAAAGAC
    CAGGCCGCAAGCGAAAGGCAGAGGCAGATCCGCAAGCGATCCCCAAAAAACGGGGCCGAAAGCCAGGA
    AGTGTTGTCGCTGCTGCCGCGGCCGAAGCAAAGAAAAAAGCTGTCAAAGAGAGCTCCATTAGGTCAGT
    ACAGGAAACCGTGCTGCCAATCAAAAAAAGAAAAACTCGGGAAACCGTCTCCATAGAAGTAAAGGAGG
    TTGTCAAGCCCCTTCTTGTTTCCACCCTGGGAGAAAAATCAGGAAAAGGTCTCAAAACGTGTAAAAGC
    CCGGGTCGAAAGTCAAAAGAATCATCACCAAAAGGCCGGAGTAGTTCAGCCAGTTCTCCGCCAAAGAA
    AGAACACCACCACCATCACCACCACTCAGAATCCCCTAAGGCCCCAGTGCCTTTGCTCCCACCCCTTC
    CTCCGCCGCCGCCAGAACCGGAGAGTTCTGAAGACCCCACTTCACCTCCGGAGCCACAGGACCTGTCC
    TCTTCAGTTTGCAAAGAGGAGAAGATGCCTCGAGGCGGATCTCTGGAATCTGACGGGTGCCCGAAAGA
    ACCCGCCAAGACGCAACCGGCGGTAGCAACCGCAGCAACCGCGGCCGAGAAATATAAGCATCGAGGAG
    AAGGAGAACGCAAAGACATCGTTAGCTCTTCTATGCCACGGCCAAACCGGGAGGAACCAGTGGATTCA
    CGAACGCCGGTCACAGAACGGGTCAGCGATTATAAGGACGATGACGACAAATA
    Figure US20230142867A1-20230511-P00164
    Figure US20230142867A1-20230511-P00165
    Figure US20230142867A1-20230511-P00166
    Figure US20230142867A1-20230511-P00167
    Figure US20230142867A1-20230511-P00168
    Figure US20230142867A1-20230511-P00169
  • The LAMP2 protein sequence is:
  • (SEQ ID NO: 46)
     MVCFRLFPVPGSGLVLVCLVLGAVRSYALELNLTDSENATCLYAKWQMNF
    TVRYETTNKTYKTVTISDHGTVTYNGSICGDDQNGPKIAVQFGPGFSWIA
    NFTKAASTYSIDSVSFSYNTGDNTTFPDAEDKGILTVDELLAIRIPLNDL
    FRCNSLSTLEKNDVVQHYWDVLVQAFVQNGTVSTNEFLCDKDKTSTVAPT
    IHTTVPSPTTTPTPKEKPEAGTYSVNNGNDTCLLATMGLQLNITODKVAS
    VININPNTTHSTGSCRSHTALLRLNSSTIKYLDFVFAVKNENRFYLKEVN
    ISMYLVNGSVFSIANNNLSYWDAPLGSSYMCNKEQTVSVSGAFQINTFDL
    RVQPFNVTQGKYSTAEECSADSDLNFLIPVAVGVALGFLIIVVFISYMIG
    RRKSRTGYQSVDYKDDDDK 
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for LAMP2 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • LAMP2-DNA
  • (SEQ ID NO: 47)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG GTTTGTTAAAGAAGTGATTAATAAGACTGC
    CACAGACCTACTGTTTCAGAATCTCTGGGGAATGAGGTACAGGAATCTGTATGTTAAAAAAAAAAAAC
    CCAAAACCAGATGGTTCTTTTGGAAACTAACACTTGAGAATCAGTGCTTTATTCCAACTCAAGATTTT
    TAAATGCTTTCATCTGTTGAATATTTTAGGTTTTGTCTGTTCATCTATCAAGTATAGCAAACAGTATG
    CTTAGTATGAACTCATTACAGAATATTTGTGTGTATGTACATAGGAAAAATATCTGGAAGTATAGATG
    TGGGTGTGTGTATATATGTGTGTAAATATATATAATAATGTTAATAGTAGGAATACAGATGATCTCTA
    TTTTTCCGCTTTATCCTTCTGTTTTATTTCTCTCATTTTTTTCTTCCTGTGTTTAAAAAAGATGCTTC
    TTGTTTCTGAGCTCAATTCCCCCACCTGAGAACAGGGTGCTGTAACTAGGTGAATTTCAAGATCATGT
    CTTGTTCCGATGGTCCGTGACTCCATACTACCCCCTAGGTTCCCTCGTCCATCTCACAGCATGAACTG
    CATAGGTAACAACAGTGTTTGGAGAATAAGCATGTATTATTTTTGTATGTATTACGTTTGAGATGCCT
    AGCAACCGTTTTTGTGCCCGCAAAATTTTTCTCTTCTTTGTCAAGGAGCCAAAACGGAAATAGTCTAC
    TATCTACGGCCCTCAAAAGTTCCTCCTCTCTCCTTAGCCTTAGAAAATACCGTCCGCCATAGGCATGG
    GCAAAGACCTCATGGCTAAAACACCAAAAGCAATCGCAACAAAAGTCAAAATTGACAAATGGGATCTA
    ATTAAAAGAACTTCTGCACAGCAAAAGAAATTAGCATTAGCGTGAACAGGCAACCTGCAGAATGGGAG
    AAAATTTTTGCAATCTACCCATCTGACAAAGGTCTAATATCCAGAATCTACAAGGACCTTAAACAAAT
    TTACAAGAAAAAAACAACCCCATCAAAAAGTGGGCAAAGGATATGACCAGACACTTCTCAAAAAAAGA
    CATTTATGCAGCAACAAACATGAAAAAAAGCTCATTATCACTGATCATTAGAGAAAGGCAAATCAAAA
    CCACAATGAGATACCATCTCACGCCAGTCAGAATGGTGATTATTAAAAAATCAGGAAACAATAGATGC
    TGGCGAGGCTGAGGAAAAATAGGAATGCTTTTACACTGTTGGTGGGAGTGTAAATTAGTTCAACTATT
    GTGGCAGACAGTGTGGCGATTCCTCAAAAATCTAGAACCAGAAATACCATTTGACGCATGCACACGTA
    TGTTTATTGCAGCACTATTTACAATAGCAAAGACTTGGAACCAACCCAAATGCCCATCAATGATAGAC
    TAGATAAAGAAAACGTGGCACATATATGCCATGGAATACTATGCAGCCATAAAAAAATGAGATCATGT
    CCTTTGCAGGGACATGGATGAAGCTGGAAGCCATCATCCTTAGCAAACTAACACAGGAACAGAAAACA
    AAACACTGCATGTTCTCACTCATAAGTGGGAGTTGAACAATGAGAACACATGGACACAGGGAGGGGAA
    CAACACACACGGGGGCCTGTCAAGGGGTGGGGGCCAAGGGGAGGGAGAGCATTAGGAGAAATACCTAA
    TGCATGTGGGGCTTAAAACCTAGATGACGGGTTGATAGGTGCAGTAAACCACCATGGCACATGTATAC
    CTATGTAACAAAGCTGCACATTCTGCACATGTATCCCAGAACTTAAAGTAAAATTAAAAAAAAAAAAA
    AAAAAGAAAGAAAAAGAAAAAGAAAATGCCGTCCGCGCATATCTCTCAGGAGCATAGGAAGGGCTGTG
    AACCAAAAAGCCAGGGCGTGGTCGCCCCGCCCACCGGCCGCTACATCACCACCCCTCTCGCCGCTTTC
    AAGCGCGGAGATTGGCTGTAAGCAAGAAAGAGCCCCGCCCC
    Figure US20230142867A1-20230511-P00170
    Figure US20230142867A1-20230511-P00171
    ATGGTCTGCTTTAGACTGTTTCCAGTACCAGGTTCTGGACTCGTGCTGGTGTGTCTTGTACTG
    GGTGCAGTCCGATCTTACGCCTTGGAACTTAACCTCACTGATTCTGAGAATGCCACTTGTCTTTACGC
    AAAGTGGCAGATGAACTTTACGGTGCGGTACGAGACCACGAATAAAACTTATAAGACCGTAACGATAA
    GCGATCATGGGACCGTCACTTACAACGGCTCCATCTGCGGGGACGATCAAAATGGCCCAAAAATCGCT
    GTCCAATTCGGCCCAGGGTTCAGTTGGATAGCCAACTTCACTAAAGCTGCATCAACCTATAGCATAGA
    TTCTGTATCCTTCTCTTACAATACCGGAGATAACACAACGTTTCCCGATGCAGAAGACAAAGGCATAC
    TCACCGTGGACGAGTTGCTTGCTATTAGGATACCGCTTAACGACTTGTTTCGCTGCAATAGTTTGAGC
    ACCCTTGAAAAGAATGATGTGGTTCAGCATTATTGGGACGTACTCGTTCAAGCTTTTGTCCAGAATGG
    TACGGTTTCAACCAATGAATTTCTCTGCGATAAGGACAAAACTTCCACAGTTGCCCCAACTATTCACA
    CGACCGTTCCAAGCCCCACAACGACCCCGACTCCGAAAGAAAAACCAGAGGCCGGAACCTACTCAGTG
    AATAATGGTAATGACACATGCCTCCTTGCTACGATGGGGCTCCAGTTGAACATCACACAAGACAAAGT
    TGCGTCTGTCATAAATATTAACCCCAATACCACCCATAGTACCGGGTCCTGTCGATCTCATACTGCGC
    TTCTCAGACTTAATAGCAGCACCATCAAGTACCTGGATTTCGTTTTCGCTGTAAAAAACGAGAATCGG
    TTTTATCTCAAAGAAGTAAATATTAGTATGTACTTGGTAAATGGTAGTGTGTTCTCTATAGCGAACAA
    TAACCTCTCATACTGGGATGCGCCTCTTGGTAGTTCCTACATGTGCAACAAGGAGCAAACAGTGAGTG
    TCAGTGGGGCATTTCAAATCAATACGTTTGATCTTAGAGTCCAGCCATTCAATGTGACTCAGGGGAAG
    TACTCTACGGCGGAAGAGTGTTCCGCGGACTCCGATCTTAATTTTTTGATCCCTGTCGCCGTGGGGGT
    CGCCCTGGGATTTCTGATAATTGTCGTTTTTATAAGCTACATGATAGGCAGGAGGAAGAGTCGGACGG
    GATACCAGTCTGTGGATTACAAAGATGATGATGACAAATA
    Figure US20230142867A1-20230511-P00172
    Figure US20230142867A1-20230511-P00173
    Figure US20230142867A1-20230511-P00174
    Figure US20230142867A1-20230511-P00175
    Figure US20230142867A1-20230511-P00176
  • The NAGLU protein sequence is:
  • (SEQ ID NO: 48)
    MEAVAVAAAVGVLLLAGAGGAAGDEAREAAAVRALVARLLGPGPAADFS
    VSVERALAAKPGLDTYSLGGGGAARVRVRGSTGVAAAAGLHRYLRDFCG
    CHVAWSGSQLRLPRPLPAVPGELTEATPNRYRYYONVCTQSYSFVWWDW
    ARWEREIDWMALNGINLALAWSGQEAIWQRVYLALGLTQAEINEFFTGP
    AFLAWGRMGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAF
    AGHVPEAVTRVFPQVNVTKMGSWGHFNCSYSCSFLLAPEDPIFPIIGSL
    FLRELIKEFGTDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMTAVDT
    EAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYT
    RTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVGT
    GMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDA
    GAAWRLLLRSVYNCSGEACRGHNRSPLVRRPSLQMNTSIWYNRSDVFEA
    WRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKEL
    ASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADF
    YEQNSRYQLTLWGPEGNILDYANKQLAGLVANYYTPRWRLFLEALVDSV
    AQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKYY
    PRWVAGSWDYKDDDDK 
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for NAGLU and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • NAGLU -DNA
  • (SEQ ID NO: 49)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG GGACAACATGGTGAAATCCCATCCCTACAA
    AGGTTAACTGGGCTTGGTGGCACGTGCCTGTAATCCCAGCTACTCAGGAGACTGAGACAGGAGAATGG
    CTTGAACCTGGGTGGCTGAGGTTCCCGTGAGCTGAGATTAAAAAAAAAAATCATACTTCCCTAGTATC
    TAAAGTTGATATCTGATTCCAGGCCTCTTTCAACTTTTTTTGTTTGTTTGTTTGTTTTTTTGATATGG
    AGTCTCGCTCTGTCATCCAGGCTGGAATGCAAATGGCACGATCTTGGCTCACTGCAACTTCCACCTCC
    AGCGTTCAAGCGATTCTCCTGCCTCAGCTTCCCAAGTAGCTGGGACTACAGGCGTGTGCCACCAAGCC
    TGGCTAACTTTTTTTCTTTTTTTTTTTTTTGAGAGGGAGTCTTGCTCTGTCCCCCAGGCTGGAGTGCA
    GTGGTGTGATCTTGGCTCACTGCAACCTCCGCCTCTCCGGTTCACGCCATTCTCCTGCCTTAGCCTCC
    CGAGTAGCTGGGAGTACAGGCGCTCGCCAACACGCCCGGCTAATTTTTTGTATTTTTAGTAGAGATGG
    AGTTTCACCGTGTTAGCCAGGAGGGTCTTGATCTCCTGACCTCGTATCCGCCCGTCTTGGCCTCCCAA
    AGTGCTGGGATTACAGGCGTGAGCCACCGCGCCCAGCACCTGGCTAACTTTTGTATTTTTAGTACAGA
    CGGGGTTTCACTGTATGTTGGCCAGGCTGGTCTCAAACCCCTGACTTGAGGTGATCTGCAAGCCTCAG
    CCTCCCAGAGTGCTGGGATTACAGGCGTAAGCCACCGCTCCTGGCCTAAGGTTGGCTATTTTTATGGT
    TATTTCTTGATTATATGATAAACAAGGGGTGGGTTAGTAATGAATTTTTCAGAAAAGGGGTGGGGATC
    CCCCCCAACTGAAGGTTCCTCCACTGTTTAGACCATATAGGGTAACTTCTGGACGTTGCCATGGCATT
    TGTAAACTGCCTGGCGCTGCTAGGAGTGTCTTTAGCATACTAATGCATTATAATTAGCGTATAATGAG
    CAGTGAGGACGATCAGAGGTCACCTTCCTGTCTTGGTTTTGGCAGGTTTTGACCAGTTTCTTTGCTGC
    ATTCTGTTTTATCAGCGGGGTCTTGTGACCTTTTATCTTGTGCTGACCTCCTGTCTCATCCTGTGACG
    AAGGCCTAACCTCCTGGGAATTCAGCCCAGCAGGTCTCTGCCTCATTTTACCCAGCCCCTGTTCAAGA
    TGGAGTCGCTCTGGTTGGAAACTTCTGACAAAATGACAGCTCCTGTTATGTTGCTGCTGCTGCCGCCA
    ATGGACAGCCTTTAACGTGCCCGCCAGCCCTGCTCCACCGCCGGCCTGGGCTCACATGGCCCCATCCC
    TCCTCGAACCTCCTAGCCTGTTAGTTACTCAAATCTGCAAGCTCTCTGCCTTCTCAGGGCCTTCAATA
    AATGCATTTCTTCTGTCTGGAAGGCTCTTCCTTTCCCTCTTCTAGCCAATTCCTATTCATCCCTGAGT
    TTCAGATTAAAAGTCACTTCCTTTGGAAACCTTACTTCGCTACTTCGCTACTTACTGCACTACTTCGC
    AGCATCACAACTATGATGGAAATCCTTACTTACGTTAAATATCTGGTTTCTAGGTCACCTCCCTGACG
    GGGACGGTAGGGACCGTCTTCTCGTTCATCAGTAGGGAAGTAGCTATGGCAGTGCCTGATACAAAATA
    AACTCCAAATGTGTATTTATTAGATGGTTGGATGGAAGTTATTTGCGTGTGAAAGCGCGTTTTACCCG
    AAGGCGCTCTGTGAGGGCCAGCGGGTCCCCTTCGGCCCTGGAGCCGGGGTCACACGCTCCCCACCGCG
    TGCGGTCACGAGACGCCCCCAAGGGAGTATCCTGGTACCCGGAAGCCGCGACTCCTGGCCCTGAGCCC
    GGGCTTAGCCTTCGGGTCCACGTGGCCGGAGGCCGGCAGCTGATTGGACGCGGGCCGCCCCACC
    Figure US20230142867A1-20230511-P00177
    Figure US20230142867A1-20230511-P00178
    ATGGAGGCGGTAGCCGTTGCCGCTGCCGTCGGAGTACTT
    TTGCTCGCAGGAGCAGGTGGTGCAGCTGGTGACGAGGCTCGGGAGGCAGCGGCCGTACGCGCACTGGT
    AGCTCGGCTGCTGGGGCCAGGTCCCGCAGCTGACTTTAGCGTGAGTGTGGAGCGGGCCCTCGCTGCGA
    AGCCAGGCCTTGATACCTACTCTCTGGGCGGGGGAGGAGCGGCACGCGTGAGAGTTCGGGGAAGTACC
    GGGGTGGCTGCGGCGGCGGGCTTGCATCGGTATCTTAGGGACTTTTGTGGGTGTCACGTCGCCTGGTC
    AGGTAGCCAACTCAGGCTGCCACGACCTTTGCCAGCAGTACCCGGCGAGTTGACGGAAGCGACGCCAA
    ATAGGTATAGGTACTATCAAAACGTCTGTACCCAGTCTTATTCTTTTGTATGGTGGGATTGGGCTCGC
    TGGGAACGAGAAATAGACTGGATGGCACTTAACGGCATCAATCTTGCGTTGGCCTGGAGCGGGCAGGA
    GGCAATCTGGCAGCGAGTGTATCTCGCATTGGGGCTCACACAGGCTGAAATAAATGAGTTTTTTACCG
    GTCCCGCCTTCTTGGCTTGGGGCAGGATGGGTAATCTCCATACGTGGGATGGACCGCTCCCACCTTCA
    TGGCATATAAAGCAGCTGTACCTGCAACATAGGGTGTTGGACCAAATGAGGAGCTTTGGAATGACACC
    GGTACTTCCGGCATTCGCCGGTCACGTTCCTGAAGCGGTCACGCGAGTATTTCCGCAAGTGAATGTAA
    CAAAGATGGGTTCATGGGGACACTTCAATTGTTCATATTCATGCTCTTTCCTGCTTGCGCCGGAAGAT
    CCTATTTTTCCTATCATTGGAAGTCTGTTCCTTCGGGAATTGATCAAGGAGTTTGGAACAGATCACAT
    ATATGGAGCAGACACTTTTAACGAAATGCAACCTCCGTCTAGCGAGCCCTCTTATCTGGCAGCTGCGA
    CCACGGCGGTGTATGAAGCGATGACAGCTGTCGATACGGAAGCAGTCTGGCTCCTTCAAGGTTGGCTG
    TTTCAACACCAACCGCAATTTTGGGGACCAGCCCAGATTAGGGCTGTGTTGGGTGCTGTGCCCCGGGG
    TAGGCTCTTGGTGTTGGATCTTTTTGCGGAGAGTCAGCCTGTGTATACCCGAACGGCGAGCTTTCAAG
    GTCAGCCGTTTATCTGGTGTATGCTTCATAACTTTGGCGGGAACCATGGTCTTTTCGGTGCCCTTGAA
    GCCGTAAATGGGGGGCCAGAGGCTGCTCGATTGTTTCCTAACAGCACTATGGTAGGAACAGGTATGGC
    ACCCGAGGGTATTAGTCAAAATGAGGTCGTATACAGTTTGATGGCAGAACTTGGGTGGCGCAAAGATC
    CCGTACCTGATCTGGCAGCGTGGGTTACGAGCTTCGCGGCTCGAAGGTACGGAGTAAGTCACCCAGAC
    GCTGGGGCCGCCTGGCGGTTGCTCCTGCGATCCGTCTACAACTGCTCAGGTGAGGCATGCCGCGGGCA
    CAATAGAAGCCCACTCGTCCGACGACCATCCCTTCAAATGAACACTAGTATCTGGTACAACAGGTCTG
    ACGTATTCGAGGCATGGCGCCTCCTCCTCACTAGCGCACCGAGTCTGGCTACCTCCCCCGCGTTTAGG
    TACGACCTTCTGGACTTGACCAGACAAGCTGTGCAGGAGCTCGTGTCACTTTACTACGAAGAGGCCAG
    ATCTGCATATCTGTCTAAGGAACTGGCATCCTTGCTTAGAGCGGGTGGGGTTCTGGCTTACGAACTCC
    TTCCAGCACTGGACGAGGTGCTCGCGAGTGACAGTAGGTTTCTCCTCGGGTCATGGCTGGAACAAGCA
    AGGGCCGCGGCCGTTTCTGAAGCCGAGGCTGACTTTTACGAACAGAACTCTCGATACCAGTTGACGCT
    GTGGGGACCCGAGGGGAATATCTTGGATTACGCGAACAAGCAGCTGGCAGGTCTGGTAGCCAACTACT
    ATACGCCGAGGTGGAGGCTCTTCCTCGAGGCTCTTGTCGATTCCGTCGCCCAAGGGATACCCTTTCAG
    CAGCACCAATTCGACAAGAACGTTTTTCAACTGGAACAGGCTTTCGTACTTTCCAAACAACGCTATCC
    CAGCCAGCCTCGGGGTGACACAGTTGACCTGGCAAAAAAAATCTTCCTTAAATACTATCCAAGATGGG
    TCGCCGGTAGTTGGGATTATAAAGATGATGATGACAAGTA
    Figure US20230142867A1-20230511-P00179
    Figure US20230142867A1-20230511-P00180
    Figure US20230142867A1-20230511-P00181
    Figure US20230142867A1-20230511-P00182
    Figure US20230142867A1-20230511-P00183
  • The ARSB protein sequence is:
  • (SEQ ID NO: 50)
    MGPRGAASLPRGPGPRRLLLPVVLPLLLLLLLAPPGSGAGASRPPHLVFLL
    ADDLGWNDVGFHGSRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGR
    YQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRK
    ECLPTRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVAT
    GYKNMYSTNIFTKRAIALITNHPPEKPLFLYLALQSVHEPLQVPEEYLKPY
    DFIQDKNRHHYAGMVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTL
    AGGNNWPLRGRKWSLWEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTL
    VKLARGHTNGTKPLDGFDVWKTISEGSPSPRIELLHNIDPNFVDSSPCPRN
    SMAPAKDDSSLPEYSAFNTSVHAAIRHGNWKLLTGYPGCGYWFPPPSQYNV
    SEIPSSDPPTKTLWLFDIDRDPEERHDLSREYPHIVTKLLSRLQFYHKHSV
    PVYFPAQDPRCDPKATGVWGPWMDYKDDDDK
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for ARSB and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • ARSB-DNA
  • (SEQ ID NO: 51)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG AAATATTTATTAATTACAAGAAGAAAATAT
    TAACCTTACAATGAGGAAATGTTTACCTTAACAAAGTTATCCCATAAGAAAGCAATTCAGCATCATGT
    GCCTCTTAATAGGAAGCACCGAGAAACATACCACATCACTTTTTGTGGTATTCCTGCACAAATCCATA
    ACCTCTATCTAATTGTGAAGAAACATCAAACTCCAATTGAGAGATTTTATAAAATAACTAGCTTGTAC
    TGCACATTATGTGTTCAATACATAGATGGAAAATAGAATTAACTCTTCAATTTGCACATTTCCAGAGC
    ACTATTAACATATGCTTTTATTTTAAAAACATAGGACGTAAGAATCTGTGCAAGTTTAGCTAGTTTCA
    GGGGAAGTTGGTTAAAGGAGGAATTGTTCCTGCAGATTAGCACCCATCAATTATTTCTTCTTAATTTT
    TATCAAGGAGAATAGGAAGGATTACGGGGGAGAAAGACACAAAACCACAGCGGAAATTCTCAGTCCTT
    CAAGGTCTTCTACAGGGTGTTTAAATGAAAAGTTCAACCCTTGACATAATGAGTCAAACAGTTCTCTC
    AAAAGTTATTCTTTGAATTTAATGCTAAAATTTTCACTTTTTTCTGCTAAATTTCATTGATTCCACTA
    GAGATTAAAGTGCATACAACATAAACTAATCTACTTCATGCAAAGAAAAAAAGTGTATGTAAGAGGCA
    AACATTTCATGTGTTAAAGGATTCTTGGGGAAAAAAAGGTAAAATCATGACCCTCATACAAATATAAA
    AGTGAATACATGATTTTATTTAACTCATTAATAAGGAAATTGGTAAGGTGTTAAAACCAATTCAAAGG
    ACAATCCAAAGAACAGATCAGGAATACTAAAATAAATATGCAAGCGGAGGTGAAACTGTTTTCCTTGG
    TAGTGGTGGAGGGGAAGGATTGCTACTCCGCTGGATAAAGTTCATTTGTGTATATATAAATAAGAATT
    ATTTTCCATTGTTATTTATCTATAACTTATAAAGTTGTAAACAACTTCCACGGAATCAGACTCAACCT
    GGAAGGGTATGGTCTCTAGGCAATGCAAAAATTTTCCCCTACACCTGTTAACAACTATAATATCTCCA
    GACAGAGTAGACAGAAAGTCTGGATGGCAACGGGAATCTACTGGTCATACGGCTAACTTCCTAATTCA
    ATAAGCACGTGACTAAAGGATTTTTTCCTTCCACTCAGATATTTCAGGCTAACTAGATACTGTGTGCT
    TCTTAGTGTCACTGCTTAGTGGGGGAGCCAGCTCTGAGTGGGGTCATATCCGGACAAGCGAATGAGCT
    ATTTATTCAATGACCACGCAACACTCCAAATCCTCCCAGGGCAACTTGAAAGTAACCGCACCTTCCAA
    AGGGCACCGTGCAATCAGACTGTGTGTTTGGCCTCCTGTTTGCTAGTGGGGAGGAAGCGGCTTCATGG
    GTGTACACTACGCATAAATGAATGTGAAAGGCTATTTAGACCTCTGCCTTTTCACCGTCCTCCCACCT
    GCCACAGGCTGGGCTCTTGTGCTAGAAATGACTTGCTAGCTAGACATCATGGTTCAGGATCTGAGTCA
    GAGGTTTAACCATTTATAAGCTTTTTTCTTATGAAAAATTGGCACTAATTATAATGTCTAACTGTCAG
    AGTTGTTGCAGGCTTTACAGGAGACGCGGGCTGTGAAGATGCTTTGTAAATTGTGAAGCGTTATTAAA
    GAACACATCTTTTTTTTTTAGGAAACCACAGTGCAAATTTAATTGCCGGGGAAGATAACGGGCCTTGG
    TGCCCTCCAAGCGTCAGCTGAGTTTCCAAGAAGCCGGGCAGCGGGCGCCCGCGGGTTCGTCTCTGGCT
    CCTCCTCCGCCACAGCAGCCGGGGGCCCGGGTCGGAGGCGGCGGGGGCCGAGCGCCCGGCCTCGCAAG
    CCCACGGCCCGCTGGGGGTGCCGTCCCGCGCCGGGGCGGAGCAGGCCCCGGCAGCCCAGTTCCT
    Figure US20230142867A1-20230511-P00184
    Figure US20230142867A1-20230511-P00185
    ATGGGCCCCCGGGGCGCAGCGAGCCTCCCCAGAGGGCCAGG
    ACCAAGGAGGCTCCTTTTGCCAGTCGTCTTGCCGCTTTTGCTCCTTTTGCTTCTCGCGCCACCCGGCA
    GTGGGGCAGGAGCTTCTCGCCCACCTCACCTTGTCTTCCTGCTTGCTGACGACCTTGGATGGAATGAT
    GTGGGTTTTCACGGTTCCCGGATTCGAACACCACACTTGGATGCTCTGGCTGCGGGGGGCGTACTGCT
    CGATAATTATTACACACAACCGTTGTGCACACCGTCTCGGAGTCAGCTTCTGACGGGACGATATCAAA
    TAAGGACTGGGCTGCAACATCAGATTATATGGCCATGTCAACCCAGTTGCGTGCCCCTTGACGAGAAA
    CTCCTGCCTCAGCTGTTGAAGGAGGCTGGATACACGACACATATGGTCGGCAAGTGGCATCTGGGAAT
    GTACAGGAAAGAGTGTCTCCCAACCCGGCGCGGATTCGATACGTATTTTGGGTACTTGTTGGGCTCAG
    AAGATTACTATTCACACGAAAGGTGCACTCTTATCGACGCACTCAACGTTACGCGCTGCGCACTTGAT
    TTCCGAGATGGAGAGGAAGTTGCAACTGGGTATAAGAATATGTATTCAACAAACATCTTTACTAAGCG
    GGCAATAGCCCTCATTACTAATCATCCCCCCGAAAAACCGCTGTTTCTCTATCTCGCCCTGCAGTCAG
    TCCATGAGCCGCTGCAGGTGCCTGAGGAGTACCTCAAACCATACGACTTCATACAAGATAAGAACCGA
    CACCATTATGCGGGTATGGTCTCTCTGATGGACGAGGCTGTCGGGAATGTTACTGCAGCATTGAAATC
    TTCCGGCCTTTGGAACAACACAGTCTTCATATTTTCTACGGACAATGGCGGGCAGACTCTTGCCGGTG
    GCAATAACTGGCCTCTTAGGGGTCGCAAGTGGTCCCTCTGGGAAGGAGGCGTCCGCGGTGTAGGGTTT
    GTAGCGAGCCCCCTCCTTAAACAAAAAGGAGTTAAAAACAGGGAACTTATTCACATTTCCGATTGGTT
    GCCCACGCTCGTCAAGCTCGCGAGGGGTCATACAAACGGGACAAAACCGCTTGACGGGTTTGATGTAT
    GGAAAACCATTTCTGAGGGGTCTCCTAGCCCCAGGATCGAGCTGTTGCATAACATAGACCCAAACTTT
    GTAGACTCCTCACCCTGTCCCCGAAATTCAATGGCCCCTGCCAAAGATGATTCATCCCTCCCTGAGTA
    CTCCGCTTTTAACACAAGCGTACATGCCGCTATTAGGCACGGCAATTGGAAGCTGCTGACCGGGTATC
    CTGGTTGTGGTTATTGGTTCCCTCCCCCCTCACAATACAACGTGTCTGAAATACCGAGTAGTGATCCG
    CCGACAAAAACCCTGTGGCTGTTCGATATTGATCGAGACCCTGAAGAACGCCACGATCTCTCCCGGGA
    ATACCCCCACATTGTCACCAAGCTTCTTTCACGATTGCAATTTTACCACAAGCATTCAGTACCCGTTT
    ACTTCCCCGCCCAGGACCCAAGGTGCGATCCTAAAGCAACGGGGGTTTGGGGACCATGGATGGACTAC
    AAAGATGATGACGATAAATAG
    Figure US20230142867A1-20230511-P00186
    Figure US20230142867A1-20230511-P00187
    Figure US20230142867A1-20230511-P00188
    Figure US20230142867A1-20230511-P00187
    Figure US20230142867A1-20230511-P00189
    Figure US20230142867A1-20230511-P00190
  • The GUSB protein sequence is:
  • (SEQ ID NO: 52)
    MARGSAVAWAALGPLLWGCALGLQGGMLYPQESPSRECKELDGLWSFRAD
    FSDNRRRGFEEQWYRRPLWESGPTVDMPVPSSFNDISQDWRLRHFVGWVW
    YEREVILPERWTQDLRTRVVLRIGSAHSYAIVWVNGVDTLEHEGGYLPFE
    ADISNLVQVGPLPSRLRITIAINNTLTPTTLPPGTIQYLTDTSKYPKGYF
    VQNTYFDFFNYAGLQRSVLLYTTPTTYIDDITVTTSVEQDSGLVNYQISV
    KGSNLFKLEVRLLDAENKVVANGTGTQGQLKVPGVSLWWPYLMHERPAYL
    YSLEVQLTAQTSLGPVSDFYTLPVGIRTVAVTKSQFLINGKPFYFHGVNK
    HEDADIRGKGFDWPLLVKDFNLLRWLGANAFRTSHYPYAEEVMQMCDRYG
    IVVIDECPGVGLALPQFFNNVSLHHHMQVMEEVVRRDKNHPAVVMWSVAN
    EPASHLESAGYYLKMVIAHTKSLDPSRPVTFVSNSNYAADKGAPYVDVIC
    LNSYYSWYHDYGHLELIQLQLATQFENWYKKYQKPIIQSEYGAETIAGFH
    QDPPLMFTEEYQKSLLEQYHLGLDQKRRKYVVGELIWNFADFMTEQSPTR
    VLGNKKGIFTRQRQPKSAAFLLRERYWKIANETRYPHSVAKSOCLENSLF
    TDYKDDDDK 
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for GUSB and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • GUSB-DNA
  • (SEQ ID NO: 53)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG TGCCCAGCTAAATATGGAGATGCGGTTTCT
    CCATATTGGTCAGGCTGGTCTCGAACTCCCAACCTCAGGTGATCTGCCTGCTTCAGCCTCCCAAAGTG
    CTGGGATGACAGGCGTGAGCCACCGCACCTGGCCAAGGCACTTGGTTTCTTAGATCACCCACTTGGCC
    CTCTTCCAAGTTGTACTTTCCTTCTTTCCTTCCTGTTCTAAAGTTTTTTAATATTTTTTTTTTTCCTG
    AAATGGCATCTCACTCTGTAGCCCAGGCTGGAGTGCGGTGGCGTGATCTCAGCTCACTGCAACCTCTG
    CCTCCTGGGTTAAAGCATTTCTCCTGCCTCAGCCTCCCAAATAGTAGGGATTACAGGTGCCTGCCACC
    ACGCCTGGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCTCCACGTTGGCCAGGTTGGTCTCAAA
    CTCCTGATATCAAGTGATCCTCCCGCCTTGGCCTCCCAAAGTGCTGGAGTTTGGAGTTACAGGCCTGA
    ACCTCTGTGCCTGGCCTGCTTTTTTTTTTAGACTGAGTCTTGCTCTCTCGCCAGGCTGTAGTGCTGTG
    GTGCGATCTCGGCTGACTGCAACCTCTGCCTCCCGCGTTCAATCGATTCTCCTGCCTCAGCCTCCCGA
    GTAGCTGGGACTACGGGCGCATGCCACCATGCCCAGCTAATTTTTTAGTAGAGATGGGGTTTCATCAT
    ATTGGCCAGGCTGGTCTCAAACCCCTGACCTCGTGATCCCCCCACCTCCGCCTCCCAAAGTGCTGGGA
    TTACAGGTGTGAGCCACTACGCTCAGCACTGGCCTGCTTTTTAACAAGCTTTTACTACTGCTCTGCAG
    CTTACCTCCCTCTGGCTTTCTGCCTTATGCCCCTCAGTGCAATTCTTTCTTCTGAGGAGGCAAGAATT
    GAGGTTGTTGCTGATGTGTAGGGATTCACCACCCGGAACTCCACCAGTAACAGGTCGAGGATGCTAGT
    TGAAAATGTTATGTAAGCTGCATGCTTTTTACAAATGGTAGTGGTTCTCATGTCCAGCAATGGCCACT
    GCACCGTCCCTGTATGAAAGTCCCCTCCATAAATCTATGTCTCCTTTGCTGTCTCCAGATCTCCTCCT
    CTACCTCTCCCACACGGTGCCTTCCCTACTGGAGTAAACTGGGGTCCCGCAGGGCACCCTTTCCCTGC
    AATTTTTTTTTTTTTTTTGAGACAGAGTCTCGCTCTGTTGCCCAGGCTGGAGTGCAGTGGTGCGATCT
    CGGCTTACTGCAGCCTTTGCCTCCCGGGTTCAAGCGATTCTCCTACCTCAGCTTCCTGAGTAGCTGGG
    ATTACAGGCACGCACCATCACGCCCCGCTAATTCTTTTTTTTTTTTTTGTATGTTTACAAAATAGGGA
    CGGGGTTTCACTATCGTTGGCCAGGCTGGTCTCGAACTCCTGGCCTCAAGTGATCTGCCCCCCTCGGG
    TTCCCAAAGTGCACCTGGCCTCTCTCCCTGCATCTTTATGTGCCTGCAACACCAAGAGGGAGCTCTTG
    GCCCCTGGGTTAAGGTCAGGGCTCCAGAAGCCCAGGGTCAAGGCTATGGCCGCATCCCAAGGACGTGC
    ATCCCTGCAAGAGGCTGCATCAGACAGGCATGGAAGAGGCGCCGCTGGGGCCTCCTTGAATTCCTGCT
    GGGAAAAGCAAGTGGAGGTGCTCCTTGAAGAAACAGGGGGATCCCACCGATCTCAGGGGTTCTGTTCT
    GGCCTGCGGCCCTGGATCGTCCAGCCTGGGTCGGGGTGGGGAGCAGACCTCGCCCTTATCGGCTGGGG
    CTGAGGGTGAGGGTCCCGTTTCCCCAAAGGCCTAGCCTGGGGTTCCAGCCACAAGCCCTACCGGGCAG
    CGCCCGGCCCCGCCCCTCCAGGCCTGGCACTCGTCCTCAACCAAGATGGCGCGGATGGCTTCAGGCGC
    ATCACGACACCGGCGCGTCACGCGACCCGCCCTACGGGCACCTCCCGCGCTTTTCTTAGCGCCG
    Figure US20230142867A1-20230511-P00191
    Figure US20230142867A1-20230511-P00192
    ATGGCGAGAGGCAGTGCAGTAGCCTGGGCTGCTTTGGGACC
    ATTGTTGTGGGGGTGCGCTCTCGGACTTCAGGGCGGTATGCTCTACCCTCAAGAGAGTCCCTCCAGGG
    AATGTAAGGAGTTGGACGGCCTCTGGTCCTTCAGAGCCGATTTCAGTGACAATCGCCGCAGAGGGTTC
    GAGGAGCAGTGGTACCGCCGACCGCTCTGGGAGTCAGGCCCAACCGTAGACATGCCAGTCCCATCTTC
    ATTCAATGACATAAGCCAGGACTGGAGGCTGCGACACTTCGTCGGGTGGGTATGGTATGAGCGCGAAG
    TGATCCTCCCTGAACGGTGGACTCAGGACTTGCGGACGAGGGTAGTACTTCGGATAGGTAGTGCTCAC
    TCCTATGCTATTGTCTGGGTGAACGGAGTGGATACTCTGGAGCATGAGGGTGGCTATCTTCCTTTTGA
    AGCTGACATCAGTAACCTGGTGCAAGTGGGGCCGCTGCCTTCCAGACTGCGAATCACCATCGCTATCA
    ATAATACTCTGACCCCCACTACGTTGCCGCCAGGCACCATCCAGTATTTGACCGACACCTCTAAATAC
    CCCAAGGGCTATTTTGTGCAAAACACTTACTTTGATTTCTTCAACTATGCCGGCCTGCAAAGATCTGT
    GCTGCTTTACACCACTCCGACGACCTATATAGATGACATAACCGTGACGACGAGTGTCGAACAAGACT
    CAGGTCTCGTCAACTACCAGATTAGTGTAAAAGGCAGTAATCTTTTTAAGCTCGAAGTGAGGCTTTTG
    GATGCTGAAAATAAGGTGGTGGCAAATGGAACGGGCACACAAGGCCAACTTAAAGTGCCGGGGGTAAG
    TCTGTGGTGGCCGTATCTTATGCATGAACGACCCGCTTATCTCTACAGTCTCGAAGTCCAATTGACGG
    CACAAACAAGTCTTGGTCCTGTGAGTGATTTCTATACGTTGCCTGTGGGCATAAGGACTGTAGCTGTG
    ACAAAGTCCCAATTTCTTATAAACGGTAAGCCCTTCTACTTTCATGGCGTCAACAAACATGAGGACGC
    TGATATTCGCGGAAAAGGGTTTGATTGGCCTCTCCTGGTCAAGGACTTCAACCTTCTCAGATGGTTGG
    GCGCGAACGCCTTTAGAACAAGCCACTATCCGTATGCAGAGGAGGTCATGCAAATGTGCGACAGATAT
    GGGATTGTTGTAATCGATGAATGTCCCGGCGTGGGCTTGGCGCTCCCCCAATTTTTTAATAACGTGTC
    TCTTCACCACCATATGCAAGTAATGGAGGAGGTAGTACGCCGCGACAAGAACCACCCCGCGGTAGTTA
    TGTGGTCAGTAGCGAATGAACCAGCTTCTCATCTGGAAAGTGCCGGTTACTACCTCAAGATGGTGATA
    GCCCATACAAAGTCTCTCGACCCTTCCCGCCCCGTGACATTCGTATCAAACAGCAATTATGCAGCCGA
    TAAAGGCGCCCCATACGTAGACGTTATATGTCTTAATAGCTATTACTCCTGGTATCACGACTACGGTC
    ATTTGGAACTCATTCAGCTGCAGCTCGCCACGCAATTCGAGAACTGGTACAAGAAATATCAGAAGCCA
    ATTATACAGAGTGAATATGGTGCTGAAACAATCGCTGGATTCCATCAAGACCCACCATTGATGTTTAC
    CGAGGAGTACCAGAAAAGTCTCCTCGAGCAATATCATCTTGGACTGGACCAAAAGCGACGAAAATACG
    TAGTAGGGGAACTTATTTGGAATTTTGCCGATTTTATGACGGAACAGAGCCCTACACGAGTTTTGGGG
    AACAAAAAGGGCATTTTCACCCGGCAACGGCAACCCAAGAGTGCCGCCTTCCTTCTTAGAGAAAGATA
    CTGGAAGATTGCTAACGAAACACGCTATCCCCACTCTGTTGCTAAGAGCCAATGCTTGGAAAACTCCT
    TGTTTACAGACTACAAGGATGATGACGATAAATA
    Figure US20230142867A1-20230511-P00193
    Figure US20230142867A1-20230511-P00194
    Figure US20230142867A1-20230511-P00195
    Figure US20230142867A1-20230511-P00196
    Figure US20230142867A1-20230511-P00197
  • The SLC19A3 protein sequence is:
  • (SEQ ID NO: 54)
    MDCYRTSLSSSWIYPTVILCLFGEFSMMRPSEPF
    LIPYLSGPDKNLTSAEITNEIFPVWTYSYLVLLL
    PVFVLTDYVRYKPVIILQGISFIITWLLLLFGQG
    VKTMQVVEFFYGMVTAAEVAYYAYIYSVVSPEHY
    QRVSGYCRSVTLAAYTAGSVLAQLLVSLANMSYF
    YLNVISLASVSVAFLFSLFLPMPKKSMFFHAKPS
    REIKKSSSVNPVLEETHEGEAPGCEEQKPTSEIL
    STSGKLNKGQLNSLKPSNVTVDVFVQWFQDLKEC
    YSSKRLFYWSLWWAFATAGFNQVLNYVQILWDYK
    APSQDSSIYNGAVEAIATFGGAVAAFAVGYVKVN
    WDLLGELALVVFSVVNAGSLFLMHYTANIWACYA
    GYLIFKSSYMLLITIAVFQIAVNLNVERYALVFG
    INTFIALVIQTIMTVIVVDQRGLNLPVSIQFLVY
    GSYFAVIAGIFLMRSMYITYSTKSQKDVQSPAPS
    ENPDVSHPEEESNIIMSTKLDYKDDDDK
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for SLC19A3 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • SLC19A-DNA
  • (SEQ ID NO: 55)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCG CTATTACCATGCTATGCATGAATGAATGAATTCCTCAATAT
    CTCTCTCTGATTTAGACTGTGGCTATGAGATATTAGAGAATCAGAGTAAACTTCTCAAGGTCCAATTT
    GAACTCATTTTCTTCAAGAGGTGTGAAGGAAATCATGACACCTTCAGCCTTTAAGAAAATTTCACAAT
    ATTCTTTGGGAGGACCCAGCAATTTGCAAGGTTGCACAATGAGCAGGGCACTCCTACACCTAAGGTTC
    AGCTCAGGACTTTTCACAAATCATAGGAAGATGAAAATAGAAAACGCAGAGTCGTAGGACAGAGAACC
    AAGACACCTCGGGGTGTTTTAATCCATTTCACTTCCACTGGTAGCCCTTAGAAGCCACGCAGTGTTTA
    TCTTTAGGGAGGGTTTTTCAAGTCCACCAAGAGATGAACAGAAATAAAAATATTTTTTCCATTTTAAC
    CAATAAGAACCTTGTAATGCTTTAAAAAAATTCTCTGTTTAATAAGAAGTTTCACTTTAATATATAAA
    CTCTTTGCAGGCTAAGCGTAGTGGCTCATGCCTGTAATCCCAACACTTTGGGCGGGCAAGGCAGGAGG
    ATCCTTTGAGCCCAGGAGTTCAAGACCAGCCTGGGCAAGATAGTGAGACTCTGTCTCTACCAAAAAAA
    AAAAACAAAAAAATTAATGTGACGTGGTGGCACCCACCTGCAAGCCCAGCTACTTGGGAGGCTGAGGC
    AGGAGGATCAGTGTGCCTGGGAGATTGAGGTTGCAGTGAGCTGGGGTTGCACCACTGCACTCCAGCCT
    GGGAGATAAAGTGAGACTTTGTCAAAAAAAAAAAAAAAAGAAAGAAAGAAAGAGAAAAGAAAAGAAAA
    GAGGCCAGGTGGCCGGGTGCTGTGGCTCACACCTGTAATCCTAGCACTTTGGGAGGCCGAGGCGGGTG
    GATCACCTGAGGTCAGGAGTTTGAGACCAGCCTGGACAACATGGTGAAACCCCATCTCTACTAAAAAT
    ACAAAAATTAGCTGGGTGTGGTGACGGGCGCCTGTAATCCCAGCTACTAGGGAGGCTGAGGCAGAAGA
    ATTGCTGGAACCCGGGAGGTGGAGGTTGCAGTGAGCCAAGATCGTGCCATTGCACTCCAGCCCGGGGC
    CAACAACAGCGAGGCTCCATCTCAAAAAAAAAAAGAAAGAAAGAAAGAAAGAAAAAAGACCAGGCGCG
    GTGGCTCACGCCTGTAATCTCAGCACTTTGAGAGGCCGAGGCAGGCGGATCACGAGGCCAGGAGTTCG
    AGATGAGCCTGGCCAATATGGTGAAACCAAACCCCGTCTCTACTAAAAACACAAAAATTAACTGGGCG
    TTAATTTTTGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCGGAGGGTTGTAGTGAGCCGAG
    ATCGCGCCACTGCACTCCATCCTGGGCGACAGAGGGAGACTCTGTCTCAAAAAAAAAAAAAAGAAAGA
    AAGAAAAGAGAAAAGAAAAGAAAAAGAAGAAGAAAAGAAAAAATTGTTTGTATCAGATTTTCAAAGTG
    CCAGGAAACCATCCCACTCTACCTCCTTTAAAAAAAAAAAGTGCAGAGTGATTATAACCACTGGCTCT
    GGAGCCAGGAGCCCCAACTAAAACCCAGGCTTCTCTATTTACTAGGCCTACTTCCTTCCGCCTCACTT
    TCTTCATGTGTAGAGTGGGGGTGACAGCCAGCCAGCCCTCTCTCCTAAGCTTTTGGAGAGAAATAAAT
    GGATTCATAAGAAAGGACATATTTGTAAATAAAGCAAACGATCGCTGTTGGATTCAGGCCCTCTTGCT
    TCATTATGCTGAATGTCAGTGTATTACTTAAGTTGTTCCTTTCAGCAGAGTTGACTGTTTCTGCCCAG
    AAAACCGGCCCTGGGCCAGGAAAATCTGGGCTTCCACACTGGGATGAAAGGTCTGGACGCCCGGATTC
    GCATATGCAAAGCGTGGGGGCGTGGCCCCGGGCTCCGGGCCAGGCAGGCTCCGCCCCCTTCCCACC
    Figure US20230142867A1-20230511-P00198
    Figure US20230142867A1-20230511-P00199
    ATGGACTGTTACCGGACCTCACTCTCTAGCAGTTGGATC
    TACCCCACAGTCATCTTGTGCCTCTTTGGATTTTTCTCCATGATGCGGCCAAGCGAGCCGTTCCTTAT
    ACCATACCTTAGCGGACCCGACAAAAACCTTACGAGCGCGGAAATTACGAACGAGATCTTCCCGGTAT
    GGACCTACAGTTATCTTGTGCTTTTGTTGCCTGTCTTTGTCCTGACTGATTACGTTCGGTACAAGCCA
    GTTATAATACTGCAGGGGATTTCCTTCATCATTACCTGGTTGTTGCTGCTCTTTGGACAGGGCGTAAA
    AACTATGCAAGTAGTAGAGTTTTTCTACGGCATGGTTACCGCCGCAGAAGTGGCCTACTATGCCTACA
    TCTACAGTGTGGTAAGCCCAGAGCATTACCAAAGAGTGAGTGGATATTGTAGATCAGTTACCCTTGCC
    GCCTACACCGCGGGTTCCGTCTTGGCGCAGCTCCTGGTCTCACTTGCCAATATGTCCTACTTCTATCT
    TAACGTTATTAGCCTCGCCTCCGTTAGCGTCGCCTTTTTGTTTTCCTTGTTTCTGCCAATGCCGAAAA
    AAAGTATGTTTTTTCACGCAAAACCAAGTAGGGAGATCAAGAAGAGCAGCTCTGTTAATCCCGTTCTG
    GAAGAGACACACGAGGGAGAGGCTCCAGGCTGCGAAGAACAGAAACCTACCAGCGAGATACTGTCTAC
    ATCAGGAAAGTTGAACAAGGGACAGTTGAACTCACTCAAGCCAAGTAACGTCACAGTTGATGTGTTTG
    TGCAATGGTTCCAGGATCTCAAAGAATGTTACAGTTCAAAAAGACTTTTTTACTGGAGCTTGTGGTGG
    GCATTCGCAACCGCCGGTTTCAATCAAGTGCTTAACTATGTTCAAATCCTTTGGGATTACAAAGCCCC
    AAGTCAGGACTCATCCATCTATAACGGCGCAGTCGAGGCGATTGCTACATTCGGAGGCGCTGTTGCAG
    CGTTCGCCGTGGGCTACGTCAAGGTCAACTGGGATTTGCTGGGTGAGCTGGCTCTCGTGGTATTCTCC
    GTCGTGAACGCAGGATCTCTTTTCTTGATGCACTACACTGCAAACATCTGGGCGTGCTACGCCGGGTA
    CCTTATCTTTAAATCCAGTTACATGCTTCTCATCACTATCGCGGTTTTCCAGATTGCGGTAAATCTCA
    ATGTCGAACGATACGCGCTCGTATTCGGCATAAATACTTTCATAGCGTTGGTTATTCAGACAATAATG
    ACGGTGATCGTTGTAGATCAAAGAGGGCTGAACCTCCCCGTGTCCATTCAGTTCCTGGTGTATGGTAG
    CTACTTCGCTGTTATAGCTGGAATATTCCTTATGCGCTCAATGTACATTACATACAGCACCAAATCCC
    AGAAAGATGTGCAGTCTCCGGCTCCGTCTGAAAACCCCGACGTCAGTCATCCGGAGGAGGAGTCTAAC
    ATCATAATGTCAACGAAACTTGATTATAAGGATGACGATGATAAATA
    Figure US20230142867A1-20230511-P00200
    Figure US20230142867A1-20230511-P00201
    Figure US20230142867A1-20230511-P00202
    Figure US20230142867A1-20230511-P00203
    Figure US20230142867A1-20230511-P00204
  • The PLP1 protein sequence is:
  • (SEQ ID NO: 56)
    MGLLECCARCLVGAPFASLVATGLCFFGVALFCG
    CGHEALTGTEKLIETYFSKNYQDYEYLINVIHAE
    QYVIYGTASFFFLYGALLLAEGFYTTGAVRQIFG
    DYKTTICGKGLSATVTGGQKGRGSRGQHQAHSLE
    RVCHCLGKWLGHPDKFVGITYALTVVWLLVFACS
    AVPVYIYFNTWTTCQSIAFPSKTSASIGSLCADA
    RMYGVLPWNAFPGKVCGSNLLSICKTAEFQMTFH
    LFIAAFVGAAATLVSLLTFMIAATYNFAVLKLMG
    RGTKFDYKDDDDK
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for PLP1 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • PLP1-DNA
  • (SEQ ID NO: 57)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG GGAGTTGGGAACCTTAAGGCACTGGGATGA
    GTGAGACACGGTCACCTGGCTGGGTCCCTTTTCTGGCCCTGGGCATAAGAACAAGCATAAGAACAAGG
    CGAAGACAGGGAATGTGGAGAGAGAAAAAGGAAACAGAGACAGACACATAGAGAGAAACAGAGACAGA
    GACAGGTAACAGAGACAAAGAGATTGAGACTTGGGGAGAGGATAGAATGAATATTAAGGCAATGGAGA
    TGGAAGAAAGGAGAAGAGAAATGTGTGTTAAGGGAATGTGGTTGATAAATTGGTATCTGTGGGTTTAT
    TGGGAACACATATTTTCTTTCCAGTAGAGTTGCTGTCCCCAGAGCTGCTGCTGAGAACTCAGGGGAAG
    CTTTGTGAGAGGGCTCAGTCTGAGTGATGTCCCCTGTGTGGAGAGCTGTCTCATCTATTCGAGCTGTG
    AAGGATCATTTGCCTCATCCTCTAAATCTGAACTCCCAAAGTTTGGGCTCACTCCTGCCATCTGATGA
    GCTGAGGAGACACATTTGGCGATTTCTTGCTCTTGGTCCTCCCCCTTCTCCAGGGCCATTCCGACAGT
    GACCATCCAGAAGTTTATTGCTGGCTCAATCCTACAGTTGAGCTTCCAAAGCAGGCATAGTCTGTGGC
    TGGCGAGCAGGGCTGGGGAGGGCGAGGGGCTCTAGGACCCTTTTCCATCAGTCACATGGCCTTAGTCT
    CGTCTGCTCTGGAAAGCTACTATTATACCGTTTTGCAAGGGGCAGCATTTCCAGAGATCCTTTTTTCT
    TGGGGCTGATACAAGAGCAAAGGATCTAGAGTTCTAGTACTCTAAGCAAGCCTCAAACGGTGCAGGAT
    TTGGCCCCAGTGGGCCCACAGGGGCATCTGCCAAAGACTCGTCCATTTCCTAACAGCAGAGCCCAAGC
    CAGTAACATGTCCAAAGTCACAGCCCAAAGAGAAAACTGTAAGACACAATCTTGCCTTTCTCCCACCC
    CAGGAAGGTACTAGTCTCTGCTCAGTATCTCCCTCCTCTTCTCTCCTCTCCCAAGGCAAAATCACAGG
    AAAAGTTCCAGGAGACCATACCTTCCTTCTCCGGAGTCTTCCAACAGGTGCCCCTCTTATTATATGCC
    AGCCAGTTGTTTTAAAGGCCATCCATGGGAGGACACCGGCCTCTAGGTCAAGGGAGCTGTGAAGGGAG
    ACAGGACTGACAAGGAGCCCAGACAGACACTGTTTAAGATTCACCTGTTCTAGAAAACCTTCCCAGGC
    TGATCCCATCTAGCAGTGGGCATTTAAATACCTTCACAGTCTTTAGAAATAGCTTATTGCTAAGGCAG
    GTTTTATATAGTATTTCCCTGTTTTTGCTGCTTAGCCATTTTTGTCTTACCGCTTCTTTCTCTGTGGC
    AAGGACTATATTTCGTTTCTTCTCTCCACCCTCACTGCCACCTGCTGCTTCACTCTAACCCACTCCCT
    GCCTCTTGTCTGCCTGGCTTAAGCTCTCTGATGAAGATATTACACTCTCAGGATAGAATACTGACTAC
    CTGATTCCCAGATCGTGTTCTTGACTTGTCTCGAAGTCGATTATCCTTCTTGGAAATCACCCACATCT
    TCAACGCTGGCCTTTTCTTTAACCTAGTCTGTGAACTTGCATAGGTCTTGCCCACTTCTGGGTCTTTA
    TTATTGTAGCTATAAAATTATTACAATTCTAGCCTCCCCCAGCCTAATCACAATCTGTTCATCGAATT
    GACAGCCTGCATTTTGTCTTCATGCAGTTGAAACTGAGAAGGATTTGGAGGAAATTATCTTAAAAGAC
    AATTTTTTCTTCTCTATTGTGTTCGTAGAGCTGGGGGTGGGGGTGGGGGAGGAACGCTTATTTTCCAA
    GGAATCGGGAGGGGAAAGGTGGAGAGGGCCAAGGGCAGCTAGGAGTGATGTGGGGAGCTGGAGCAGAT
    TCGGAAGGACTTTGGGAGCTAATATCTAGGTTTTTGACTCTGAGCCCCTGTTGGGGCTCTCACTTCAT
    GGCTTCTCACGCTTGTGCTGCATATCCCACACCAATTAGACCCAAGGATCAGTTGGAAGTTTCCAGGA
    CATCTTCATTTTATTTCCACCCTCAATCCACATTTCCAGATGTCTCTGCAGCAAAGCGAAATTCCAGG
    CAAGCCTTAGGGAAAAAAGGAAAAACAAAGAAAATGAAACAATTGGCAGTGAAAGGCAGAAAGAGAAG
    ATGGAGCCCTTAGAGAAGGGAGTATCCCTGAGTAGGTGGGGAAAAGGGGAGGAGAAGGGAGGAGGAGA
    GGAGGAGGAAAGCAGGCCTGTCCCTTTAAGGGGGTTGGCTGTCAATCAGAAAGC
    Figure US20230142867A1-20230511-P00205
    Figure US20230142867A1-20230511-P00206
    TGGGTCTCCTTGAATGCTGTGCTCGATGTCTCGTAGGAGCGCCGTTCGCC
    TCTCTCGTCGCCACCGGATTGTGTTTTTTCGGTGTTGCGCTCTTTTGCGGCTGTGGTCACGAAGCTCT
    TACAGGTACGGAGAAACTCATTGAGACTTATTTCAGTAAGAATTATCAAGATTATGAGTATCTGATAA
    ATGTAATTCATGCATTCCAGTATGTTATTTACGGAACGGCTTCTTTCTTTTTCCTGTATGGAGCCCTT
    CTCCTGGCTGAGGGTTTCTATACAACAGGTGCAGTGCGCCAAATCTTTGGCGATTATAAAACTACTAT
    ATGTGGAAAGGGGTTGTCCGCGACCGTTACAGGCGGCCAAAAGGGTCGGGGTAGCAGAGGTCAACACC
    AGGCACACAGTCTGGAGCGAGTATGCCACTGTCTCGGAAAGTGGTTGGGTCACCCAGATAAATTTGTA
    GGGATTACTTACGCTTTGACGGTTGTATGGTTGCTGGTTTTCGCGTGCTCAGCTGTACCCGTTTACAT
    CTATTTTAATACCTGGACGACCTGTCAGTCTATAGCGTTCCCAAGTAAGACCTCTGCATCAATCGGTA
    GTCTGTGTGCCGACGCGAGGATGTATGGCGTGCTGCCATGGAATGCTTTTCCAGGGAAGGTGTGTGGG
    AGCAACCTCCTTAGTATATGTAAAACTGCCGAGTTTCAAATGACGTTTCACTTGTTCATAGCGGCGTT
    CGTGGGAGCAGCAGCAACCCTGGTATCTCTCCTCACATTTATGATCGCTGCTACGTACAACTTTGCAG
    TACTGAAACTTATGGGACGGGGCACTAAATTCGACTACAAGGACGACGACGATAAATA
    Figure US20230142867A1-20230511-P00207
    Figure US20230142867A1-20230511-P00208
    Figure US20230142867A1-20230511-P00209
    Figure US20230142867A1-20230511-P00210
    Figure US20230142867A1-20230511-P00211
    Figure US20230142867A1-20230511-P00212
  • The TPP1/CLN2 protein sequence is:
  • (SEQ ID NO: 58)
    MGLQACLLGLFALILSGKCSYSPEPDQRRTLPPG
    WVSLGRADPEEELSLTFALRQQNVERLSELVQAV
    SDPSSPQYGKYLTLENVADLVRPSPLTLHTVQKW
    LLAAGAQKCHSVITQDFLTCWLSIRQAELLLPGA
    EFHHYVGGPTETHVVRSPHPYQLPQALAPHVDFV
    GGLHRFPPTSSLRQRPEPQVTGTVGLHLGVTPSV
    IRKRYNLTSQDVGSGTSNNSQACAQFLEQYFHDS
    DLAQFMRLFGGNFAHQASVARVVGQQGRGRAGIE
    ASLDVQYLMSAGANISTWVYSSPGRHEGQEPFLQ
    WLMLLSNESALPHVHTVSYGDDEDSLSSAYIQRV
    NTELMKAAARGLTLLFASGDSGAGCWSVSGRHQF
    RPTFPASSPYVTTVGGTSFQEPFLITNEIVDYIS
    GGGFSNVFPRPSYQEEAVTKFLSSSPHLPPSSYF
    NASGRAYPDVAALSDGYWVVSNRVPIPWVSGTSA
    STPVFGGILSLINEHRILSGRPPLGFLNPRLYQQ
    HGAGLFDVTRGCHESCLDEEVEGQGFCSGPGWDP
    VIGWGTPNFPALLKTLLNPDYKDDDDK
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for TPP1/CLN2 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • TPP1-DNA
  • (SEQ ID NO: 59)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG CTCCTTTGGGTTCCACTGACACCCCTGCCC
    CTGCCCCTGCCCAGAACCAAGTGCCATTTCTCACTCTGGAGCCTTAATAAACTGCAATTTGTATCCAG
    TCTCCAGCTTTGTTCTATAGGGATGACTGGAAGACACCTGGCAGAGTATTGTGGATACCTCAGGAGGC
    ACAGAGGGGTGGTGGGGTGGGGATGGCACTGGGAGCTCAGGAGCCTAGCCTGAGGCTGATGATCACAT
    CTCTCTGCAGCTATTCTTTTCCCTCTAATATGGGCAAAGAGGCCAGCAGGGCCTTGAGAGGGATCAGT
    TGGGTGGGAGAGGGGATCTTGGGGATGAGCAGCTCTGGGCCCGAAGTTCTTAAAATAAGTTGGGGTGG
    GTACAAGAGCAGTCAAGTGGTTTATTTGTTCTTCTTGGGGGCCACTAAAAATCCACATTGTAAAAAGA
    GCTCAATTTGTTGAGCCTTTGTCATGTCAGGCTCTCTGCAGGTGGCTGCCTTGCTTAATCTCAGTGTA
    AATTTTCACAGTCATACAAAGGGATTTGGGAACCACTGGTGTTGCCTGTTCTTTGGGGAGCCAGGCTG
    TGAGGGAAAGGCAATTGGCCAGTGTTTGAGAAGTTCATGAGATTGAGGAAATTAGAAAAAAAAAAAAA
    AGTTGTGGTCAAAGGGAAATGTTTCTGAGCATGATGCTAGTAAATACAGCTCTAGGATCCACGCTTAG
    TGGGAAACTGGAGGGATCAATGGACAGAGAGAAGTCAGATTAATAGGAGTGAGGAAAAACAATGGCTT
    GAAAAAACAGAAGGTTATAGTCTAAGAAAGCATTCTATAATTATAATTTCAAAGGTAGAGCAGTTTTG
    AGAGATAGCAAGCCCAATGCAGCCATGGGACTGGTTTGCTTAAGTGAAGTGAAGGTAATGCTCACTGG
    GTTTAATGAGATGAAGAAATGGAAGGTTAGGGTGTTGAATGGGTCTTTTACCTAGATATTTTGATCAT
    CTAGGTCAAATGATCGCAGGAGTTGGGATGGACAGAAAGGTGAAGTGCCATAATCTTCAGTGTGGGAA
    TATAATCTATAGGTTAATGGAAGCCAGTGGCCTGTAGGACAACAGCAGGAGTAGAGGGATCTATCTAA
    AAAGTATTGAGGTCAGAACAAGGGATTTCACATGAGTATTGGATTAATGGTTTTGGAAGTTAAAATGA
    CACTGCTGACACCATCTTCCATATGGATTTTTGGGGGGAAAAAAAGGAAGACTGGAAGGGAATTGAAT
    CCTCAGGGTTTTAACTCTCATTGAAAATCCTCTTGTAATGACCCCAGTTGGAAGTAGTAAACTCAGAC
    CTCCTTGTGGGCTTTGAGTTACCATTAGAAGAAGCAAAGGTCCGTTTCCATTTTATCCACCTGAGCCC
    ACTAAACAAGATGCAAGATTCCTCCATGGCCCTTAGAACAAAGTTGATTTCCTTGAAAACCTTGGGAA
    CCCATAATAAACAGGACCCCAGCTGATTGTAGATCTCCTGGGAAGCTACTTAAAACTGGAAACTGATT
    TGAATAAAAGGATCCTGGTCTTGCAAAAGCTTTACTCTGTTCATAAATGTAGTCGTAAAATGAAGACA
    AAATTGATAATATGGGGACATAAAGGATAAAAGAGGAAAAAAATTTTTTCACTGCAATCTTTCGAGGC
    AGATCACTTTTATATAAGTCTCCATATTGTAAATGCAGAAACCTAGGCTCTGAGAGGTTATTTTGCTG
    ATGTCATACAGCTAGGAAGAACCCCAGTCTGTGGGTCTCTACAGCCCACTCACTTGTGGTGATTCCCA
    GGGGACCAGGCTCAGGACGCAGGTGGGGAGCCCTGGGCTTACTCAGTGCTTGACTGGCCAGAGGGGAG
    AATCCGGGTGGCGGCCCCACCCTTGCCTAGCATTTGGGACCACCCATGCAAGGGAGGAGCCAGTACCG
    TCACTAGTTACTAGGCAGAGGGGTAGTGGTGGTGGAATATAGAGCTCATGTGATCCGTCACATGAC
    Figure US20230142867A1-20230511-P00213
    Figure US20230142867A1-20230511-P00214
    ATGGGCTTGCAAGCGTGCCTGTTGGGGCTCTTCGCATTG
    ATTTTGAGTGGAAAGTGTTCCTACTCACCCGAGCCCGACCAGCGCCGCACACTCCCTCCTGGATGGGT
    AAGCCTTGGCCGGGCCGATCCAGAAGAAGAGCTTAGTCTTACTTTTGCATTGCGACAACAGAATGTCG
    AAAGGCTTTCCGAACTGGTACAAGCTGTGTCAGATCCGTCTTCACCTCAGTATGGTAAATACTTGACT
    CTCGAAAACGTCGCAGATCTCGTACGCCCGTCTCCCCTTACTTTGCATACTGTTCAGAAGTGGCTGCT
    TGCGGCTGGTGCTCAAAAGTGCCACTCCGTGATAACTCAAGATTTTCTGACCTGTTGGCTTAGCATCC
    GGCAAGCAGAATTGCTCCTGCCCGGAGCGGAGTTCCACCACTATGTCGGTGGTCCTACAGAGACGCAC
    GTGGTGAGATCACCTCACCCCTACCAGTTGCCACAAGCGTTGGCCCCTCATGTCGACTTTGTAGGAGG
    ACTCCATAGGTTTCCCCCAACTTCAAGTTTGCGCCAGCGGCCAGAACCACAAGTCACTGGTACAGTTG
    GACTGCATTTGGGAGTCACCCCCAGCGTCATACGCAAAAGATACAACCTTACTTCACAAGATGTCGGC
    TCCGGGACGTCCAATAACTCTCAGGCATGCGCGCAGTTCCTGGAGCAGTACTTTCATGACTCCGATTT
    GGCGCAATTTATGAGGCTTTTCGGAGGTAATTTTGCTCACCAAGCCTCTGTAGCAAGGGTCGTTGGTC
    AACAAGGACGGGGGCGGGCTGGCATCGAAGCAAGTCTTGATGTACAATACCTGATGTCAGCGGGAGCT
    AATATTTCTACGTGGGTTTATAGCTCCCCCGGCCGCCATGAGGGTCAGGAACCGTTTCTTCAATGGCT
    GATGTTGTTGAGCAATGAGAGCGCGCTTCCTCATGTCCACACGGTCTCCTACGGAGATGACGAAGATA
    GCCTGTCATCCGCATACATACAACGGGTCAACACAGAACTCATGAAAGCCGCGGCCAGGGGCCTCACC
    TTGTTGTTTGCTTCTGGAGATAGCGGAGCCGGCTGCTGGTCTGTGTCTGGGCGACATCAATTCAGGCC
    AACATTCCCGGCGTCCAGTCCCTATGTCACTACCGTGGGGGGCACCTCCTTTCAAGAGCCGTTCCTCA
    TAACCAACGAAATTGTCGACTACATTAGTGGAGGCGGCTTCTCAAATGTGTTCCCCCGGCCGTCATAC
    CAGGAGGAGGCAGTTACAAAATTCCTCTCTAGTAGCCCGCATTTGCCTCCATCCAGCTACTTCAATGC
    AAGTGGGCGCGCGTATCCCGATGTAGCAGCCCTTTCAGACGGCTATTGGGTAGTCAGTAATCGCGTCC
    CCATACCCTGGGTTTCTGGCACTTCCGCCTCAACGCCTGTGTTCGGGGGCATATTGTCACTGATAAAC
    GAACACAGGATATTGTCCGGCAGGCCGCCCTTGGGATTTTTGAACCCTAGATTGTATCAACAACACGG
    GGCGGGTTTGTTCGACGTAACCCGAGGCTGCCATGAGTCTTGCTTGGACGAAGAGGTAGAAGGTCAAG
    GGTTCTGTAGTGGGCCAGGCTGGGACCCTGTAACTGGATGGGGCACCCCGAACTTTCCCGCTCTGCTT
    AAAACGCTTCTCAACCCTGACTATAAGGATGACGACGACAAGTA
    Figure US20230142867A1-20230511-P00215
    Figure US20230142867A1-20230511-P00216
    Figure US20230142867A1-20230511-P00216
    Figure US20230142867A1-20230511-P00217
    Figure US20230142867A1-20230511-P00218
  • The ACY2/ASPA protein sequence is:
  • (SEQ ID NO: 60)
    MTSCHIAEEHIQKVAIFGGTHGNELTGVFLVKHW
    LENGAEIQRTGLEVKPFITNPRAVKKCTRYIDCD
    LNRIFDLENLGKKMSEDLPYEVRRAQEINHLFGP
    KDSEDSYDIIFDLHNTTSNMGCTLILEDSRNNFL
    IQMFHYIKTSLAPLPCYVYLIEHPSLKYATTRSI
    AKYPVGIEVGPQPQGVLRADILDQMRKMIKHALD
    FIHHFNEGKEFPPCAIEVYKIIEKVDYPRDENGE
    IAAIIHPNLQDQDWKPLHPGDPMFLTLDGKTIPL
    GGDCTVYPVFVNEAAYYEKKEAFAKTTKLTLNAK
    SIRCCLHDYKDDDDK
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for ACY2/ASPA and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • ACY2/ASPA-DNA
  • (SEQ ID NO: 61)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG TTTTTTGGGTCATCCCAAAAATCAAGTACA
    CTTTGGGAAACAGGGAGTTAAAAATCTTTCGGAAGTTAACTTGGTATAAGAGTATGAAAAGCCTCTGT
    TGCTCTACTTCTGAGGAAATGCTCTCCTTTTTCCTAATAAAGCTTTCCACAGAAGACAAACTCTGTAC
    ACAAAAAACAAGGTAAGTATGGGAAATAATTGCTGCAATTCATTAACTCTCAAAAAAGTTATTCATTT
    TGACATGTAAGAGAAACTTTATTTATGAATGTGAAATGTGCTTCAAAGTGCTGATCTGTTAAAATTCC
    ATAAAACAAGCTTCAAAATGCAAGTCATTTGTGCCAGCTACGAGTAAATAACTGAAGCAGCATTTCCA
    GAGAGACTAGGGCAAAGGACAGAACAAACTACAAATCAGCAGAAAAGAAACCAAGCATAGTAGAGTGT
    TAATAATTCACCATGGTTACTGCTCTTAAAACCAAAGCAACTAAACAACTGTCATCTGTTTACTTTTA
    CGTGTGTTTTCATCATCTCTACACCGTCTGAAGTGTTCAGTGTACATGGATGTGGACAGACAGTCAAA
    GCCTACAGGAGCTTTTTTTTTCTATTAGAACAGTCATTTTAATGCATAGATTTGATGTGGTTATTTTG
    CTTTCTCTATCTCTGCATTATAAGCTCTTAAAGGAAGAACATTCAGTGGGTTTATTAAATATTAGGAA
    GCATTCATATTTTATTTTTCCACCATGTATTTGGTTTGTATGCATGTTTGTTTCCTGAGCACACTGTA
    ATGGCTGTTTATGTGAATTCTGAAAGAGCAAAACTAGAACTGTGAAGAAGACCAAAAATAACAGAGAG
    GCACCCTATTGTCCTGTGAAAAACACTAGACTTGGGATGAGAAACATGGGTCTCAGGCCTAGGTTCTC
    CTACATCCCTGCAGAACCTTAACTGAGTTACTTAACCTCTCTTTCGTTTACCCCTGCAGTGGGGATGT
    CAACACCTAACCCACCCTATCGTATAAAATTAATGTATGAATAAACTGAGTGAAAGGACATTATAAAT
    TAAACTCTGCAATTGTAATAAATTACTCTTATTGTACCCTCTATCAAACAACCTGGCGTTACTAGTAC
    ATGGTATTCCCAGTACTTGAACATATGTGACTTTCTGGGTGACTGAGCATGAAATCAAATCAGATTTG
    GCTCAGAGTCATTTCTCCAAAGGGATGTAATCATCTTAGTATTCTGGAAAATTTCCCCTAATAAGACC
    CCTTTTTGAGTAACACAGTCCAGTGGAGAGTCCCCGCTGACAGTGAATAAGGCTGTGTCTATGTAAGT
    TCACATGATGTGGTTAGCACATGCAGTGTGTCCCAAGACATTTTCAGGCATCACAAAAACTTGAAAAG
    TACTAACAGTTGTCTTTCCTTTGGTATTTAGGCAAAAAATGAGGAAGATTCATAAGTAAGTTAACACA
    GAATTACCAAAAACCTCATGAGAAAGAAATGAAGGGATGTTTCAGGGTAGCTTTTCGAGTCCCAAGTG
    ATCATTGATTTGGCCCTTCTGTTTCTGATTGTGAAAAAGCATTCTGAAAGAATGGGCGCCGGTTCAAG
    AGGAGGTCTCTGAGGGTCAGCCTTTGGCTGTTTATAGAGGTGAGATTCAAATGCCCTCAGATCATGAA
    AAGTGCTTGAACTCATTTCAGAGAAGAAATGTTTTGAAATGCCCGTGGAAATACTGAGTGGGCCACTT
    TTTTATTTTGTGAAATTAGAAACGTATAAACTATTTTTATTATTACAGATAATTGTGAAAGCCTCACT
    GGATGACATCCTAGCTGAGGAAAAGAATGGAATTTTTGTATTAGTATGAAATGCTAATTTATTACTGT
    ATTTAGATCTACTGACCATCTGAGGGAGTTAGAAGTTAAAGTAAACAGCTGTATCTCTAGTTGATCTT
    TGCATATTTTAATCCAAATATGGCAAAGGGCAGGGCTAAAGAAGGGAGTGTCCATAAACGGGGCTC
    Figure US20230142867A1-20230511-P00219
    Figure US20230142867A1-20230511-P00220
    ATGACAAGTTGTCACATAGCTGAGGAACACATCCAGAAG
    GTTGCTATTTTTGGCGGCACTCACGGTAACGAACTGACAGGGGTCTTCCTCGTCAAGCACTGGCTTGA
    GAATGGGGCTGAAATTCAGCGAACAGGACTTGAAGTAAAACCGTTTATAACGAACCCACGCGCTGTAA
    AAAAGTGCACGCGATATATCGATTGCGACCTTAACCGGATATTTGATCTGGAGAATCTTGGGAAGAAA
    ATGTCAGAGGACCTCCCTTATGAGGTGAGACGAGCGCAGGAGATTAACCACCTCTTTGGTCCCAAGGA
    TTCCGAAGACTCCTACGATATAATTTTCGATCTCCACAACACGACCTCTAACATGGGTTGTACCTTGA
    TCCTCGAGGATAGTCGCAACAATTTCTTGATTCAAATGTTCCACTACATAAAAACATCTCTCGCTCCC
    CTTCCTTGCTACGTATACCTCATTGAGCATCCAAGTCTTAAGTACGCAACCACCCGGAGCATAGCGAA
    GTATCCAGTGGGAATTGAAGTCGGGCCTCAACCCCAGGGAGTTTTGCGGGCGGACATCCTCGACCAGA
    TGAGGAAAATGATTAAGCACGCGCTGGACTTCATACACCATTTTAATGAAGGGAAAGAGTTCCCGCCA
    TGTGCCATAGAGGTTTATAAGATTATTGAGAAGGTAGATTACCCTAGGGACGAGAATGGGGAAATCGC
    TGCCATCATACACCCCAATCTCCAGGACCAAGATTGGAAGCCACTTCATCCCGGCGACCCAATGTTCT
    TGACGCTGGACGGCAAAACGATTCCCCTTGGGGGTGATTGCACGGTATATCCAGTGTTTGTGAACGAG
    GCTGCTTATTACGAAAAGAAAGAAGCATTCGCCAAAACCACGAAATTGACACTGAATGCAAAATCTAT
    CAGATGCTGTCTCCACGACTACAAGGATGACGATGATAAATA
    Figure US20230142867A1-20230511-P00221
    Figure US20230142867A1-20230511-P00222
    Figure US20230142867A1-20230511-P00223
    Figure US20230142867A1-20230511-P00224
    Figure US20230142867A1-20230511-P00225
  • The MANBA protein sequence is:
  • (SEQ ID NO: 62)
    MRLHLLLLLALCGAGTTAAELSYSLRGNWSICNG
    NGSLELPGAVPGCVHSALFQQGLIQDSYYRFNDL
    NYRWVSLDNWTYSKEFKIPFEISKWQKVNLILEG
    VDTVSKILFNEVTIGETDNMFNRYSFDITNVVRD
    VNSIELRFQSAVLYAAQQSKAHTRYQVPPDCPPL
    VQKGECHVNFVRKEQCSFSWDWGPSFPTQGIWKD
    VRIEAYNICHLNYFTFSPIYDKSAQEWNLEIEST
    FDVVSSKPVGGQVIVAIPKLQTQQTYSIELQPGK
    RIVELFVNISKNITVETWWPHGHGNQTGYNMTVL
    FELDGGLNIEKSAKVYFRTVELIEEPIKGSPGLS
    FYFKINGFPIFLKGSNWIPADSFQDRVTSELLRL
    LLQSVVDANMNTLRVWGGGIYEQDEFYELCDELG
    IMVWQDFMFACALYPTDQGFLDSVTAEVAYQIKR
    LKSHPSIIIWSGNNENEEALMMNWYHISFTDRPI
    YIKDYVTLYVKNIRELVLAGDKSRPFITSSPTNG
    AETVAEAWVSQNPNSNYFGDVHFYDYISDCWNWK
    VFPKARFASEYGYQSWPSFSTLEKVSSTEDWSFN
    SKFSLHRQHHEGGNKQMLYQAGLHFKLPQSTDPL
    RTFKDTIYLTQVMQAQCVKTETEFYRRSRSEIVD
    QQGHTMGALYWQLNDIWQAPSWASLEYGGKWKML
    HYFAQNFFAPLLPVGFENENTFYIYGVSDLHSDY
    SMTLSVRVHTWSSLEPVCSRVTERFVMKGGEAVC
    LYEEPVSELLRRCGNCTRESCVVSFYLSADHELL
    SPTNYHFLSSPKEAVGLCKAQITAIISQQGDIFV
    FDLETSAVAPFVWLDVGSIPGRFSDNGFLMTEKT
    RTILFYPWEPTSKNELEQSFHVTSLTDIYDYKDD
    DDK 
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for MANBA and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • MANBA-DNA
  • (SEQ ID NO: 63)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG GATTATTCTTTGATGTTCTCTGTTGTCAAA
    GTGAAGTAACCTAAATGATGTTATGCTTGCATCCTGGAATGAAAGGAGGAATGAGAATCTTCTATTAT
    TGTTTTTCTAAAAATTATATTTCTTAATAAACAGAATATATCCATATGTATCATCAAGGATGAATTTT
    CCTCTGATTTGGCTTAATGAAATCTCTGGGAATTATGAAAAAATTCCTTTGCCGCTCTTGTTGCCATA
    AACCTTGTGTTATCAGTGTTGCACAGACCATGGCTAGCCATTTAATGCCGTGAATTATGGCATATAAT
    TTGAATTGAGAGCTTTGTTTCTATCAAAGGTAAAAATGCTTCTGGAGTGAGTGAGGAAGGCTTTGAAG
    CAGTGAGAGCGTTTCCCCTATAGCATCATTGGGCTCTGACAGTAGGACCTTGAAGAGGAGGGTTCTAA
    TCCTGTCTGCCACTATGTGACAGGCAAGTCACCTGATCTTTTTTTTTTTTTTTTTTTTAGAGTCTTTC
    TCTATAGCCCAGGCTGGAGTGCAGTAGTGCCATCTCGGCTTACTGCAACCTCTGCTTCCTGGGTTCAA
    GCGGTTCTCCTGCCTCAGTTTCCCGAGCACCTGCGACTACAGGCACGTGCCACCACGCGCCCAGCTAA
    TTTTTGTATTTTTAATAGAGACAGGGTTTCACCATGTTGGCCAGGCTGGCCTCGAACTCCTGACCTCA
    AGTGATCCACCCGCCTCGGCCTCCCAGAAGTGCTGGGATTACAGACGTGAGCCACCGCGCCTGGCCCC
    GATCTCTTTGAACTCAGTTCCCTCATCTGTAAAGCCAGCGGCTTGGGTTGGATAATCCATACTGTCCT
    TTAAGCCAAGATATTCAGGGCTTGTAAATAATGCTGACTTTGCAAAGTAAGATATTGCCCGGTGTCTG
    CAGAATCAGGCAATTCCAAGGCCATTGTAAGGCGAAACAAAGCTCAGATTTGATGAGAGAAAATTTCT
    TTTCAAGGACTTCTTCCGAAGCATAAATATACGGCAGATAGCGTCAACACAGGAGAAAGGCAGTCTAT
    TTTATGCAATGTAAAATTCCGTCTTAACTTCTAGGATTACTGTTACATCCTATTGCAAAAATGGGACT
    CAGGTAGACCATTTGACTACAAGACCCGAAATAAGTCAGTGGTATTTCTTGGGTACTCTGGCATTGTT
    AGGGACGATCACATATGTTTACTCATTTTATACTTATAACACCTTGAAAACTTCAAGAAACGGAGATG
    GAGATTTTGCGGCTTGCCCAGGTCCCACAGCTAGTGAGTGCAGAGGATTTTAAACTCAGCTGCGCGCC
    TAGTGCAGGTTCCAGAACCTCAGAAACCTCCGTGGGGCTTTATGAAACATAAGGACGCAGGCCGCTAG
    TAACAATAGCTAACGTGAATGCCTTCTCTGGGCCAGGCTCCATGCTGCATGCTATAGTCGCATTTTCT
    TGGTTAGCCCTCAGAACAACCTTCTGATGGAGATACTGCTAGTACTTGTTTACTCTTTGAAGGGCGGG
    GTAGGCTCTGAGACCCGCGAGTTGAGCTGCAGCGGGGTGAGTGTTACGCTACCAGGTAGACGAGGGTC
    TGTGTAAATTTGGCGGAAGGGAAGTGGTGGGTTGTCTCCCAGTTTGGGTTCTCTGAAGGAGTGAGGGG
    ACTCTCCAGCTATGGTCTCCACGTACAGTCGGTCTCTGATCTCCCGAGGCTTTGTGCCACTGCGCGCC
    CAGGGATCTTTCTGTCCCAATTACCGTGCAAGATTCTTGCCTCCAGTTGTATGTACTGAGGCGGCGGG
    AAGAAGCGCCGAGGGAAGGCTCAGACCGCAATGCCCCCCAGCCTGTAGCTATTCTCGGGGGCTGCAGC
    TCCCGCACCCGCGACGCCCGGCTTTCAGCGGGTGCAGGCGCAGCCTCCTCGTTCCCCTCCCGCCCGCG
    GCTGCGCCCGGAGAGCGCTCAGCTGACCTAGGGGCGGGGCCGGGGCGGGGACCTAAAGCCTCGGCTAC
    TTCCGAGGGGG
    Figure US20230142867A1-20230511-P00226
    ATGAGGCTTCACCTGCTGCTCCTTC
    TCGCGTTGTGTGGCGCGGGTACAACCGCCGCGGAATTGAGTTATTCCTTGAGGGGGAATTGGTCCATC
    TGCAATGGCAACGGCAGCCTGGAACTTCCGGGCGCTGTACCGGGTTGTGTGCATTCTGCTCTTTTCCA
    ACAAGGGCTCATTCAAGATTCATACTACAGATTTAACGACCTTAACTATAGGTGGGTGAGCCTTGATA
    ACTGGACATATTCCAAGGAGTTCAAAATCCCTTTCGAAATATCAAAATGGCAAAAGGTTAATCTCATC
    CTTGAAGGCGTTGACACCGTTTCTAAAATACTGTTCAACGAGGTGACTATCGGAGAAACCGACAATAT
    GTTCAATCGATATTCTTTCGACATTACTAACGTGGTTCGAGACGTAAACAGCATTGAATTGAGGTTTC
    AGTCAGCAGTGCTCTATGCCGCCCAGCAAAGCAAAGCCCATACCCGGTATCAGGTACCTCCAGACTGC
    CCTCCTCTCGTCCAGAAAGGTGAATGCCATGTCAACTTCGTGAGGAAAGAGCAGTGCAGTTTCTCTTG
    GGACTGGGGTCCATCCTTTCCGACCCAGGGCATTTGGAAAGATGTTCGAATTGAGGCCTACAATATTT
    GCCATCTCAATTACTTTACCTTCTCCCCGATATACGACAAGTCCGCCCAGGAGTGGAATCTCGAGATA
    GAATCCACGTTTGACGTAGTTAGCTCTAAGCCTGTTGGCGGCCAAGTGATCGTAGCCATACCAAAGCT
    TCAAACACAACAGACGTACTCAATCGAGCTCCAACCAGGAAAGAGAATAGTGGAACTTTTCGTTAATA
    TTAGCAAAAATATAACCGTTGAGACCTGGTGGCCTCACGGCCATGGAAATCAAACGGGTTACAACATG
    ACTGTTTTGTTCGAGCTTGACGGGGGCCTGAATATAGAGAAAAGCGCCAAAGTTTACTTCCGGACCGT
    AGAACTTATCGAAGAACCGATTAAAGGTTCCCCGGGCCTGAGCTTTTATTTCAAAATAAACGGCTTTC
    CTATCTTTCTTAAAGGCTCCAATTGGATACCAGCAGATTCCTTTCAGGATCGAGTAACATCTGAATTG
    TTGCGACTGTTGCTGCAATCAGTAGTAGATGCCAATATGAATACGCTGAGAGTCTGGGGAGGCGGGAT
    TTACGAACAAGACGAGTTTTACGAACTCTGTGATGAGCTTGGGATTATGGTGTGGCAAGATTTCATGT
    TTGCATGCGCACTTTATCCCACTGACCAGGGTTTCTTGGACTCCGTCACTGCGGAGGTAGCATATCAA
    ATTAAAAGGTTGAAGAGTCACCCCTCAATTATAATATGGTCCGGAAACAATGAAAATGAAGAGGCTTT
    GATGATGAATTGGTATCACATTTCCTTTACCGATCGGCCAATTTATATAAAAGATTACGTGACGCTTT
    ACGTGAAGAACATACGGGAGCTTGTGTTGGCTGGAGACAAGTCCAGACCATTCATCACCAGTTCACCT
    ACGAATGGGGCGGAAACCGTAGCCGAGGCATGGGTATCACAAAATCCTAATTCCAATTACTTCGGCGA
    CGTGCATTTTTATGACTACATAAGTGATTGTTGGAACTGGAAGGTTTTCCCGAAGGCAAGGTTTGCGT
    CCGAATATGGCTATCAGAGTTGGCCAAGTTTTTCCACTCTCGAGAAAGTTAGTAGCACTGAAGACTGG
    AGCTTCAACTCCAAATTCAGTTTGCATCGGCAGCACCACGAAGGCGGAAATAAGCAAATGCTTTATCA
    GGCTGGACTCCACTTTAAACTGCCCCAGTCCACGGACCCGCTCCGAACGTTTAAAGACACTATATACC
    TGACGCAAGTCATGCAAGCACAATGTGTTAAGACGGAAACCGAATTTTACAGACGATCTAGGAGTGAG
    ATTGTGGATCAACAGGGCCACACGATGGGAGCTCTGTACTGGCAGCTTAATGACATTTGGCAGGCGCC
    CTCTTGGGCTAGTTTGGAGTATGGAGGTAAGTGGAAGATGCTTCATTATTTTGCTCAAAACTTCTTCG
    CGCCCCTCCTTCCCGTCGGTTTTGAAAACGAAAACACATTTTATATCTATGGGGTCAGCGATCTGCAC
    TCCGACTACTCAATGACGCTTAGCGTTCGAGTGCATACCTGGTCAAGTCTCGAGCCCGTCTGCTCTAG
    AGTCACGGAACGCTTTGTCATGAAGGGTGGGGAGGCCGTATGCTTGTACGAAGAACCTGTATCCGAGC
    TTTTGCGGAGGTGTGGTAATTGTACTCGAGAAAGCTGTGTAGTGAGCTTCTATCTTTCTGCTGACCAC
    GAGTTGCTTAGTCCTACCAATTATCATTTTCTGAGTAGCCCCAAGGAAGCGGTTGGCCTGTGCAAGGC
    GCAGATTACGGCGATTATTAGCCAACAAGGGGATATCTTTGTGTTCGACCTCGAGACCTCCGCCGTTG
    CCCCCTTCGTGTGGCTTGATGTGGGCAGTATCCCGGGAAGATTCTCCGACAACGGATTCCTGATGACA
    GAAAAAACTCGCACAATACTTTTTTATCCATGGGAACCGACTTCCAAGAACGAACTTGAGCAATCATT
    TCATGTGACGAGCTTGACGGATATATACGATTATAAGGACGATGACGATAAGTA
    Figure US20230142867A1-20230511-P00227
    Figure US20230142867A1-20230511-P00228
    Figure US20230142867A1-20230511-P00229
    Figure US20230142867A1-20230511-P00230
    Figure US20230142867A1-20230511-P00231
  • The LIPA protein sequence is:
  • (SEQ ID NO: 64)
    MKMRFLGLVVCLVLWTLHSEGSGGKLTAVDPETN
    MNVSEIISYWGFPSEEYLVETEDGYILCLNRIPH
    GRKNHSDKGPKPVVFLQHGLLADSSNWVTNLANS
    SLGFILADAGFDVWMGNSRGNTWSRKHKTLSVSQ
    DEFWAFSYDEMAKYDLPASINFILNKTGQEQVYY
    VGHSQGTTIGFIAFSQIPELAKRIKMFFALGPVA
    SVAFCTSPMAKLGRLPDHLIKDLFGDKEFLPQSA
    FLKWLGTHVCTHVILKELCGNLCFLLCGFNERNL
    NMSRVDVYTTHSPAGTSVQNMLHWSQAVKFQKFQ
    AFDWGSSAKNYFHYNQSYPPTYNVKDMLVPTAVW
    SGGHDWLADVYDVNILLTQITNLVFHESIPEWEH
    LDFIWGLDAPWRLYNKIINLMRKYQDYKDDDDK
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for LIPA and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • LIPA-DNA
  • (SEQ ID NO: 65)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG TGTATCTAAGAAGCAACTAACTTGCTTTTG
    ATTTTACCAGCTAATAGGTGGAAGGGACTTGCCTTGTCTCAGGTGAGACTTTGGACTATGGACTTTTG
    AGTTAATGCTGAAATGAGTTAAGACTTTGGGGGACTACTGGGAAGGCATGATTGGTTTTGAAATGTGA
    GGACATGAGATTTGGGAGTGGCCAAGGGCAGAATGATATGGTTTGGCTGTGTCCCCACCCAAATCTCA
    TCTTGAATTATAGCTCCTGTAATTCCCACATGTTGTGGGAGGGACCCGGTGGGAGATAACTGAATCAT
    GGGGGCAGTCTAGCCCATACTGTTCTTGTGGTGGTAAATAAGTCTCACAAGGTCTGATGGTTTTATAA
    GAGGAAAATCCTTTTACTTGGTTCTTTCTCTCACATGTGCCGCCATGTAAGACAAGCCTTTTGCCTTC
    TGCCATGATTGTGAGGCCTCCCCAGTCCTGTGAAACTGTGAGTCAGTTAAACCTTTTTCCTTCATAAA
    TTACCCAGTCTTGGGTATGTCTTTATTAGCAGCATGAGAACAGACTAATACACCTTCTAATTAGTCTC
    CCTGCTTCTGTAGTTCAGAATGATCTTTCTAAACTAGATTATGCTGCCCTTCTATTGAAAATATTACA
    CTGGCTTCTCATCTTACTCAGAAGACACGGTAAGATCTTGACAAGTGGCTCCCAAGGCACCAGACTTC
    TAGGACTTCATGTTCTATTACTTTCTACTATGTACATTCTGTTTTAGCCTCAAAGACCTCACCACTCT
    GGCTAGGACCCACCAGGAATATTCCTACCACAGGGCCTTTGTACCTGCTATTTTCACCCAGAATGTGC
    TTTCTCCAGTATCTAAGGCTGTTAACTACCTCCTTGCCTGTGCTCAGGTGTCACTTCATAAGAGTCCT
    TTCCTGATTACCATATACATATCATCCTCCTTATCCCTGTTTTTTTTTTTTTTTGGGGGATGGAGTCT
    CACTCTGTTGTCCATGCTGGAGTGCAGTGGCATGATCTCCGCTCACTGCAACCTCCACCTACCTGGTT
    CAAGCCATTTTCCTGCCTCAGCCTCCCAAGGAGCTGGGATTAGAGGCACACACCACCACACCCGCTAA
    TTTTGTATTTTTAGTAGAGATGGGGTTTCACCAAGTTGGCCAGGCTGGTCCTGAACTCCTGTCCTCAG
    GTGATCCGCCCTCCTCAGCTTCCCAAACTGCTGGGATTACAGGCGTGAGCCACTGTGCCCAGCCCCTT
    GTTGTTTGTTGTTGTTGTTGTTTGTTTGTTTTTGTTGTTATTTATAGCACTTATCACCACCTCATGTA
    TTATATAGTTATTCATGATGTGTCTCTTCCCATCAGAATATATACAAGAGAACAGGAACCTTGTTTGG
    TTGGTTTCTATACCTCCACTGTCTTACTAATGCCTGGCATTTAGTAGGTGCTCAATAAAATCTTTGTT
    TTAGAAAATGATTAATAAATAATATGTACATATTGAATATTGTTTCCTCTTTTAACAAATCTTTAATA
    ATAATAAAAAATTTAAAGCAGCAAGATTGCTTTGCATCAGGCACAATGATAAGCGCTTTAAATGGAGC
    CCATGATTCACTTTCATTTATAGAGCCCACATGCTCTCTGTCAGGCGCAGAAGGTGCCAGCCCTGCAG
    GAGCTTCGGGGGCGGAGCAGAAGGCCGCCTAAAACAGCCTTTGCTAAGAGAGCATGCGTAGGCGACGC
    GCTGGTAGAGCTGTGGACCTGCCAGCCTGCGAGGCGGAGGACGGGCTCCATCTCTTAGAAACGCCTAC
    GGCGCATGCTCTATGGGGTCAACTGGGGGGCTGGCAAGCGGCAGCGCTGGTCTGGGGCGGAGTCTCCG
    AGGCACTTCCCGGTGGCTGGCTGCTCTGATTGGCTGAACAAATAGTCCGAGGGTGGTGGGCATCCGCC
    CTCCCGACAAGGCAGACCAGGCCCCCTGCAGGTCCCCTATCCGCACCCCGGCCCCTGAGAGCTGG
    Figure US20230142867A1-20230511-P00232
    Figure US20230142867A1-20230511-P00233
    ATGAAGATGCGCTTCCTTGGCCTCGTAGTTTGTCTGGTT
    CTGTGGACACTGCACAGTGAGGGTAGCGGCGGTAAATTGACGGCTGTAGATCCCGAAACTAATATGAA
    TGTTAGTGAAATAATCTCTTACTGGGGATTTCCATCCGAGGAGTATCTTGTAGAAACAGAAGATGGCT
    ATATACTTTGTCTCAACCGGATACCACACGGACGGAAAAACCACAGCGATAAGGGGCCAAAGCCGGTG
    GTTTTCCTTCAACACGGTCTTCTGGCAGATTCTAGTAATTGGGTTACCAACCTTGCAAACTCTTCACT
    TGGATTCATTCTTGCGGATGCTGGATTTGACGTATGGATGGGGAACTCCAGAGGGAATACGTGGTCCA
    GAAAACACAAGACATTGTCTGTATCTCAAGATGAGTTCTGGGCATTCAGTTACGACGAAATGGCGAAG
    TATGATCTCCCAGCATCCATAAATTTCATCCTGAACAAGACTGGTCAAGAACAAGTATATTACGTCGG
    ACACAGTCAAGGCACTACTATTGGCTTTATCGCTTTCTCCCAAATCCCGGAATTGGCGAAAAGAATCA
    AGATGTTCTTCGCATTGGGTCCGGTGGCGTCTGTCGCTTTTTGCACCAGCCCCATGGCGAAACTTGGG
    AGGTTGCCGGATCATCTCATAAAGGACCTCTTTGGAGATAAAGAGTTTCTTCCTCAAAGCGCATTTCT
    TAAGTGGTTGGGAACACATGTCTGCACTCACGTGATCTTGAAAGAACTTTGTGGAAACCTGTGTTTCC
    TCCTGTGCGGGTTCAATGAACGAAATCTTAATATGTCCAGGGTGGACGTCTATACCACCCACAGCCCG
    GCAGGTACGTCAGTCCAGAATATGTTGCATTGGAGCCAAGCGGTAAAGTTCCAAAAATTCCAGGCATT
    TGACTGGGGTTCTTCAGCTAAGAATTACTTCCACTATAATCAGAGCTACCCGCCCACTTATAATGTCA
    AGGATATGTTGGTTCCAACTGCAGTTTGGAGCGGCGGGCACGATTGGCTGGCGGATGTGTATGATGTG
    AATATCCTTCTCACACAAATTACTAATCTCGTGTTCCACGAGAGCATACCAGAGTGGGAACATTTGGA
    TTTTATATGGGGACTGGATGCTCCATGGCGGCTCTACAATAAAATCATCAATCTGATGCGGAAATACC
    AAGACTACAAAGACGACGATGATAAGTA
    Figure US20230142867A1-20230511-P00234
    Figure US20230142867A1-20230511-P00235
    Figure US20230142867A1-20230511-P00236
    Figure US20230142867A1-20230511-P00237
    Figure US20230142867A1-20230511-P00238
    Figure US20230142867A1-20230511-P00239
  • The CTNS protein sequence is:
  • (SEQ ID NO: 66)
    MIRNWLTIFILFPLKLVEKCESSVSLTVPPVVKL
    ENGSSTNVSLTLRPPLNATLVITFEITFRSKNIT
    ILELPDEVVVPPGVTNSSFQVTSQNVGQLTVYLH
    GNHSNQTGPRIRFLVIRSSAISIINQVIGWIYFV
    AWSISFYPQVIMNWRRKSVIGLSFDFVALNLTGF
    VAYSVFNIGLLWVPYIKEQFLLKYPNGVNPVNSN
    DVFFSLHAVVLTLIIIVQCCLYERGGQRVSWPAI
    GFLVLAWLFAFVTMIVAAVGVTTWLQFLFCFSYI
    KLAVTLVKYFPQAYMNFYYKSTEGWSIGNVLLDF
    TGGSFSLLQMFLQSYNNDQWTLIFGDPTKFGLGV
    FSIVFDVVFFIQHFCLYRKRPGLQAARTGSGSRL
    RQDWAPSLQPKALPQTTSVSASSLKGDYKDDDDK
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for CTNS and the 5′ and 3′ UTR sequences are listed below using the same
  • CTNS-DNA
  • (SEQ ID NO: 67)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAG
    AACCAGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGA
    CGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG
    TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGA
    CGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTT
    GGCAGTACATCTACGTATTAGTCATCGCTATTACCATG AAACTGAGGCACGAAAATCACTTGAACCCG
    GGAGGCAGAGGTTTCAGTGAGCAGAGATTGCACCACTGCACTCCAGCCTGGGTGACAGAGCGAGACCC
    TGTCTCAAAAAAAAAAAAAAAAAAAAAAGAAGTGCTCTATTTCAGGAGAAACTGGCACTTTCTGAGCC
    TACTCTCCCCTAATGCCAGCTCTCCTGCTCACCCCACCAGGGTCAGAGCCAACTTTGCCTCCAATTCA
    TAGTCCTTTAAGTAAGAATCCTTTTAATATGCCCTAATGTCCCAACCAAACTAATCTTGAAAGCTTCT
    ATGTAGATACAAAGTGCTCCTGAAATCCCTATCCTCAGAAATGCTTCTGAGCCAAATGGGCTCTGAAC
    CCTAAACAACCGTGTCCATGTATGTGGCAAGAGCTTGTGAAAAACAAAGCTGGGCCAGGCGCAGTGAC
    TCACAACTGTAATCCTAGCACTTTGGGAGGCTGAAGTGGGCAGATCACTTGAGGTCAGGAGTTCAAGA
    CCAGTCTGGCGAACATGGCGAAACCCTGTCTCTACTAAAAATACAAAAAGTAGCCGGGCGCGGTGGCT
    CACACCTGTAGTCCCAGCTACTCGGGAGGCTGAAGCAGGAGAATCACTTGAATCCAGTTGGCGGAGGT
    TGCAGTGAGCCCAGATCACGCCACTGTACTCCAGCCTGGGCAACAGAGCGAGACTTGGTAAGAAAGAG
    AAAGAAAGGAAAGAATGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAG
    GGAAGGAAGGGAAGGAGTCTCGCTCTGTCACCCAGGCTGGAGTGCAACGGAGCGATCTCGACTCACTG
    CAAGCTCCGCCTCCCGGGTTCGCGCCATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGCG
    CCCGCCACCACGCCCCGCTAATTTTTTGTATTTTTAGTACAGACGGGGTTTCACCGTGTTAGCCAGGA
    TGGTCTCGATCTCCTGACCTCGTGATCCGCCCGCCTCGGCCTCCCAAAGCGCTGGGATTACAGGCGTG
    AGCCACCGCGCCCGGCTGACCAAAGGTTTCTTGGTCCGCATTCTGCTTCTGTGGAATGAGCCAGGAGC
    CAGTTAGGCCTGATTTGACATCTGATTTCCGGAGGAAAACCCAGACTCTGCCCTGGGCAACAAACTGA
    ATCCTGAACTTGAGGTCACAGGGCAGGTGTGAGGAGCGGAGAGCAGCAAGAGTGAAAGGGAGGCCTGT
    GGTCATTCCATACACACAAGAGATCAGTTCCTCCAAGGTCAGGGGACAGAGAGCACAGGGATCCAGCG
    CCAAGCGCAAGGCCCCCAGAAGAAGCCAGAGAGTCGGGGAGGGGGCGGGGGGGAATCGGTCCCAGCAG
    GTGGGAAGGATTCTGGGACCAGACCTAAGGGATCATGAGCACAGCTGCTGCAGGCAGACGGGCCCCTG
    GAGAAGCTGGGGACAAGCTGGAATAGAGACTTCATTGCGGGAAGGGCTGTCAGGGAGGCCTCCTGGGG
    TGGAAAAGGGTGGTCAGGAGGCTCCTGGAGGCGGCGCGGCCCCGGGGGTCCAACTCACCTGGGGCCCG
    GCCACCGCGCTCTCGACCGCCGCCTCTGCCCGCGCAGCACGGGCACAGCTCGCCAGCACTGCGAACCC
    GGATGGGTCGTCGGGCGCGGCCCTCAGCAGAGCTGCCTTCACAGATGTGGTGCCCAGGTCAATGCCGA
    GGGTGATCGGCCGCGCAGCCATTATCTCCCTGACCCGCGCAGCTCCAGTCTGCAGCCAGCGGCCCCAC
    AAGTCCGCGCTCTTCGCCCAGGGGGGCGGGGCAGGGGCGGGGAGTCGCCTGCCAATCTTTCAGCCACA
    CCCAACATGGAGGCTTCTCGTCTTCCCACTGGCCGGGGAAGGCGAGCTTCCACGCAACCTCTCGGCGG
    GCCCCGGCTATAGGCGGAGAGGCGGCGGAAGGCGGGACCTAAAGGGGGCCCCGCCCCACGGGCTC
    Figure US20230142867A1-20230511-P00240
    Figure US20230142867A1-20230511-P00241
    ATGATCCGAAACTGGCTGACCATATTCATCCTTTTTCCC
    CTCAAACTTGTCGAAAAATGCGAGTCTAGCGTCAGCCTCACAGTCCCTCCGGTCGTGAAACTCGAAAA
    TGGCTCCAGCACCAATGTGAGTCTGACGCTTAGGCCCCCCTTGAACGCTACCCTCGTTATTACGTTTG
    AGATTACTTTTAGGTCTAAAAATATCACTATACTGGAACTGCCTGATGAAGTGGTTGTGCCTCCTGGC
    GTCACCAATTCAAGTTTCCAGGTTACCTCACAAAACGTAGGGCAACTTACAGTCTACTTGCATGGAAA
    CCATAGCAATCAGACTGGGCCGCGCATCCGCTTCCTCGTTATTCGAAGTTCAGCGATATCAATTATCA
    ATCAAGTGATAGGATGGATATACTTTGTCGCCTGGAGTATTAGCTTTTATCCCCAGGTAATTATGAAT
    TGGAGAAGAAAATCTGTTATAGGGTTGTCATTTGACTTCGTGGCACTCAATCTTACGGGCTTTGTGGC
    ATATTCCGTATTTAACATAGGACTCCTTTGGGTGCCGTATATCAAAGAGCAATTTCTTCTCAAGTACC
    CCAATGGGGTAAATCCAGTGAACTCTAATGACGTCTTTTTCTCTTTGCATGCGGTCGTGCTTACACTG
    ATCATAATCGTACAGTGCTGCCTTTATGAGCGGGGAGGCCAGAGGGTATCATGGCCTGCAATCGGTTT
    TTTGGTCCTTGCCTGGCTTTTCGCTTTCGTTACTATGATCGTTGCGGCAGTGGGCGTAACAACATGGC
    TTCAATTTCTCTTTTGTTTTTCTTATATAAAGCTGGCAGTCACTCTGGTCAAGTATTTTCCCCAAGCA
    TACATGAACTTTTATTACAAGTCTACTGAGGGGTGGTCCATTGGGAACGTCCTCCTTGATTTTACAGG
    TGGCTCCTTCAGTTTGTTGCAAATGTTTCTCCAAAGCTATAATAATGACCAGTGGACATTGATATTTG
    GCGACCCGACTAAATTCGGGCTCGGTGTATTCTCTATCGTTTTCGACGTTGTATTCTTTATACAGCAC
    TTCTGTCTCTATAGAAAACGCCCTGGTCTTCAGGCTGCCAGAACCGGTAGCGGTTCTCGGCTGAGACA
    AGATTGGGCGCCATCTTTGCAACCCAAGGCGTTGCCACAAACAACTTCTGTAAGCGCATCCTCTTTGA
    AGGGAGACTATAAAGATGATGACGATAAGTA
    Figure US20230142867A1-20230511-P00242
    Figure US20230142867A1-20230511-P00243
    Figure US20230142867A1-20230511-P00244
    Figure US20230142867A1-20230511-P00245
    Figure US20230142867A1-20230511-P00246
  • The GNS protein sequence is:
  • (SEQ ID NO: 68)
    MRLLPLAPGRLRRGSPRHLPSCSPALLLLVLGGC
    LGVFGVAAGTRRPNVVLLLTDDQDEVLGGMTPLK
    KTKALIGEMGMTFSSAYVPSALCCPSRASILTGK
    YPHNHHVVNNTLEGNCSSKSWQKIQEPNTFPAIL
    RSMCGYQTFFAGKYLNEYGAPDAGGLEHVPLGWS
    YWYALEKNSKYYNYTLSINGKARKHGENYSVDYL
    TDVLANVSLDFLDYKSNFEPFFMMIATPAPHSPW
    TAAPQYQKAFQNVFAPRNKNFNIHGTNKHWLIRQ
    AKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKL
    VKRLEFTGELNNTYIFYTSDNGYHTGQFSLPIDK
    RQLYEFDIKVPLLVRGPGIKPNQTSKMLVANIDL
    GPTILDIAGYDLNKTQMDGMSLLPILRGASNLTW
    RSDVLVEYQGEGRNVTDPTCPSLSPGVSQCFPDC
    VCEDAYNNTYACVRTMSALWNLQYCEFDDQEVFV
    EVYNLTADPDQITNIAKTIDPELLGKMNYRLMML
    QSCSGPTCRTPGVFDPGYRFDPRLMFSNRGSVRT
    RRFSKHLLDYKDDDDK
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for GNS and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • GNS-DNA
  • (SEQ ID NO: 69)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGA
    GGAACTTGGTTAGGTACCTTCTGAGGCGGAAAGAA
    CCAGCGTTACATAACTTACGGTAAATGGCCCGCCT
    GGCTGACCGCCCAACGACCCCCGCCCATTGACGTC
    AATAATGACGTATGTTCCCATAGTAACGCCAATAG
    GGACTTTCCATTGACGTCAATGGGTGGAGTATTTA
    CGGTAAACTGCCCACTTGGCAGTACATCAAGTGTA
    TCATATGCCAAGTACGCCCCCTATTGACGTCAATG
    ACGGTAAATGGCCCGCCTGGCATTATGCCCAGTAC
    ATGACCTTATGGGACTTTCCTACTTGGCAGTACAT
    CTACGTATTAGTCATCGCTATTACCATG
    TCTTGCCTAGGCTAGATTGCAGTGGTGCAGTCATA
    GCTCACTGCAGCCTCGAACTTCTGGGCACAAACAA
    TCCTCCCACCTCAGCCTCTTAAGTAGCTGGGACTA
    CGGGCATGCACTACCATGCCCAGCTAATTTTTATA
    TTTTTTGTAGAGCCAGGGTCTTGCTGTGTTGCCCA
    GACTGGTCTCAAACTCCTGGCCTCAAGCAGTCCTC
    CTGCCTCTGCCTTCTAAAGTGCTGGGATAACAGGT
    GTGAGCCACTGCACCGGGCCTAAAGTCTAAATTCC
    TATGTGGCCCTGCATCACTGACTCCTGCCTGCCTC
    TCTGACCTCATCTTACCTCACTCTCCCTTTTCCTC
    CCTAAGCTGCTTCCATCACCACACTGGCTTCTCTG
    TTTCCTCCCAGGTGCCTCTCTCCTTCCTCTGGCAG
    CCCTTGTACTTGCTATATACCCAGCCTGAAACATC
    TTCTGAATTTTTGCATGACTGACTCCTCATCGCAT
    GGCTCTCCGTACTGAAGTTACCTCCTGAGAGGCCT
    TTTCTGACCACCCATCTAAAGTAGTTCCCCTTTCT
    TCCCACCAGCTGTAATCACATCATCCTGGTTTATT
    CTCTTCATAGCACTTACCACCACCTGAAATGACTG
    ATTGTTTACCTGTTAGTTGTTTTAAGTCCCCAAGA
    GTGCAGAGATCATGAGGGCGAGGACCATCTATCCT
    CTGTCCACCACTGCCTAGATTAATGCCTGGCACGT
    GTTGTGTTTAATAAATGTTATTACACAGTCTTACT
    GTAAATCAGATATTCTTTCATAATTTTACATATTT
    ATCAGACAGTTTGACAATTAAGTCCCATACAGTCA
    ACTCTGGTCCTTTTCTCCACTTACTCTTTGAATCC
    TCATTTCACCTTCAAATGTCCTCACAGTAACTCAG
    GTGTTTTATCAAACCAATGTCTCATGAAGCCCAGT
    ATTCTTTTCCTGAGGATGGTACCCAGTGATGCTTT
    GAGGAGAGTATGGCTGACGACCTTGCTTCGGAAGG
    CTGGCGAAGATCTTCTGCTTCTCTAGCGTCTCTTG
    GCAGCCTCTTCTGGATCTACTCATGAAATTATCAG
    AGCTTTTTTTGGATAAGTTAGAATCCTCATCCTTT
    TGAGTTTATGGATTCCGTAGGTTTTGTATTTATAT
    AAAACAATGGGGAAAAGTCATTTATACATTATTGA
    AAATGAGGCTCCTGTTAACCTACTTATGTATTTAG
    ATTAGCAATCTTTAAGGAATTATTGCACAACAAAA
    TATACTTAAAACTGCATTCAATATGGTCTTAGGTT
    TCAAAGGTATCTCAGAATCATCGTGAACTGACCCA
    TATTACAACTAAAATGACCTCTTCAATGGTGCTTT
    TTGCTGTCTGGAATCTGAAGAATATTAGTGTTCCT
    TTTATTCGTATTTAAATGACTGTCTTCATACTTTT
    GTCCTCATTCTAGAAGGAAGTTAATTGTTCTCTGT
    ACTTGTAAGGATCTTAGATTTTGAACAAAAGATGA
    AAAGCAGGACATAAAATAGTTTCTGAGTAGCCTTT
    TGCCTGACAGTGTTCTGTGTACACCTGCTTGCCAC
    CTGGTTCAAGTCCTTATCCTGAGGTGCTTGTGGGT
    CTATGCAGACAACAGTAGTTGGTAGTTGAACAGTG
    TGTGTCTGGCCTCCACCACTGAGTGTGTGAGGTTT
    GAATCAGTCTCGGAGTCCTGGGGCCTTGTCTGGCT
    TTCAGAATCCAACTGTTGTCTTACTTCTGTTCTAT
    AGGGCGGAAGGCTCCCTTTATATCATTTCCCACAT
    CCTACTTCTAAAGATTAATTCATGGAGATAGGTCT
    GCCCTATCTGCCTTATCTGGGCGCTGATGACTAAT
    AAGGGCACAAAGATGTTATTGCCATCAACTGAACT
    TTCCCTTCCTGCAAGCATTTTGACTTCCTGCACTT
    TTATACAGGATGGAGCTGGCCGGAACCAAACTTGT
    CTCTAGAATCATACAAGACTCTCTTTGGCTGTTGC
    TTAGAGCTGACCATTTGGGAATAAAAGATTGAGTA
    AGATGTAACAAAACAGGAATTCTGGGCTGTGTGTC
    TCAATTTATTGGAAGAAATTGTTTAATGGATAATT
    ACTTACTTTTAC
    Figure US20230142867A1-20230511-P00247
    Figure US20230142867A1-20230511-P00248
    ATGAGGCTCCTGCCGCTCGCCCCGGGACGGCTCAG
    AAGAGGGTCTCCAAGGCACCTCCCAAGTTGTTCCC
    CCGCGCTTCTCTTGCTTGTGCTGGGAGGATGTCTC
    GGAGTTTTTGGCGTTGCGGCGGGAACACGACGGCC
    AAACGTTGTCCTCCTCTTGACGGATGATCAAGACG
    AGGTGCTGGGGGGTATGACACCGTTGAAGAAGACC
    AAAGCACTGATAGGTGAGATGGGTATGACTTTCAG
    CAGCGCTTATGTTCCGTCTGCCCTTTGCTGCCCTA
    GCCGCGCCAGTATCCTTACGGGGAAGTACCCACAT
    AATCACCATGTAGTTAACAACACCTTGGAAGGTAA
    TTGCTCTTCAAAGTCCTGGCAGAAAATTCAGGAAC
    CCAATACCTTCCCGGCCATACTTCGATCTATGTGT
    GGATATCAAACATTTTTCGCCGGGAAGTATCTCAA
    TGAATACGGTGCTCCCGATGCGGGCGGACTCGAAC
    ACGTTCCCCTGGGTTGGAGCTATTGGTACGCACTC
    GAGAAAAATTCAAAGTACTATAATTACACCTTGTC
    TATAAACGGAAAAGCCCGGAAACATGGTGAGAACT
    ATTCTGTCGACTATCTGACGGACGTGCTGGCGAAT
    GTAAGCCTCGATTTTCTTGATTACAAGAGCAATTT
    CGAACCTTTCTTTATGATGATCGCAACGCCTGCTC
    CCCATAGTCCATGGACTGCGGCACCACAATATCAA
    AAAGCGTTTCAGAATGTCTTTGCCCCACGAAACAA
    AAACTTTAATATACACGGCACCAATAAGCATTGGC
    TGATAAGACAAGCGAAGACACCCATGACCAATTCA
    AGCATACAGTTCCTGGACAACGCATTCCGCAAGAG
    GTGGCAAACGCTGCTCTCAGTTGACGATTTGGTTG
    AGAAATTGGTAAAAAGACTTGAATTTACTGGCGAA
    CTGAACAATACGTATATCTTCTATACATCTGATAA
    CGGATACCACACTGGTCAGTTTAGTCTCCCCATAG
    ATAAACGGCAGCTCTACGAGTTCGATATCAAAGTA
    CCCTTGTTGGTAAGGGGACCGGGAATAAAACCTAA
    CCAGACGTCCAAAATGCTGGTGGCAAACATCGACC
    TCGGCCCAACGATTCTGGACATCGCTGGGTATGAC
    CTTAACAAGACACAAATGGATGGCATGTCCCTGCT
    GCCAATCCTCAGAGGGGCCAGCAATCTTACATGGC
    GCTCAGATGTCCTGGTAGAATATCAAGGTGAGGGA
    CGGAATGTAACAGACCCTACCTGTCCCAGCTTGAG
    CCCTGGTGTATCTCAGTGTTTTCCTGATTGTGTTT
    GTGAAGACGCATACAATAACACATACGCTTGTGTA
    AGGACGATGAGTGCACTTTGGAACTTGCAGTATTG
    CGAGTTTGACGATCAGGAGGTGTTTGTTGAGGTGT
    ATAACTTGACGGCAGATCCGGACCAAATTACCAAT
    ATAGCTAAAACCATTGATCCGGAACTCCTCGGCAA
    GATGAATTATAGACTTATGATGCTGCAGTCATGCA
    GTGGTCCCACATGCCGAACGCCAGGAGTGTTCGAC
    CCTGGGTACCGGTTCGATCCCCGCCTGATGTTCTC
    CAATAGGGGTTCAGTTCGGACGAGGAGATTCTCAA
    AGCACCTCCTTGACTATAAAGATGACGACGACAAG
    TA
    Figure US20230142867A1-20230511-P00249
    Figure US20230142867A1-20230511-P00250
    Figure US20230142867A1-20230511-P00251
    Figure US20230142867A1-20230511-P00252
    Figure US20230142867A1-20230511-P00253
    Figure US20230142867A1-20230511-P00254
  • The HGSNAT protein sequence is:
  • (SEQ ID NO: 70)
    MSGAGRALAALLLAASVLSAALLAPGGSSGRDAQAAPPRDLDKKRHAELK
    MDQALLLIHNELLWTNLTVYWKSECCYHCLFQVLVNVPQSPKAGKPSAAA
    ASVSTQHGSILQLNDTLEEKEVCRLEYRFGEFGNYSLLVKNIHNGVSEIA
    CDLAVNEDPVDSNLPVSIAFLIGLAVIIVISFLRLLLSLDDFNNWISKAI
    SSRETDRLINSELGSPSRTDPLDGDVQPATWRLSALPPRLRSVDTFRGIA
    LILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSI
    LQRGCSKFRLLGKIAWRSFLLICIGIIIVNPNYCLGPLSWDKVRIPGVLQ
    RLGVTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLE
    GLWLGLTFLLPVPGCPTGYLGPGGIGDFGKYPNCTGGAAGYIDRLLLGDD
    HLYQHPSSAVLYHTEVAYDPEGILGTINSIVMAFLGVQAGKILLYYKART
    KDILIRFTAWCCILGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFA
    FFILLVLYPVVDVKGLWTGTPFFYPGMNSILVYVGHEVFENYFPFQWKLK
    DNQSHKEHLTQNIVATALWVLIAYILYRKKIFWKIDYKDDDDK 
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for HGSNAT and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • HGSNAT-DNA
  • (SEQ ID NO: 71)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGA
    GGAACTTGGTTAGGTACCTTCTGAGGCGGAAAGAA
    CCAGCGTTACATAACTTACGGTAAATGGCCCGCCT
    GGCTGACCGCCCAACGACCCCCGCCCATTGACGTC
    AATAATGACGTATGTTCCCATAGTAACGCCAATAG
    GGACTTTCCATTGACGTCAATGGGTGGAGTATTTA
    CGGTAAACTGCCCACTTGGCAGTACATCAAGTGTA
    TCATATGCCAAGTACGCCCCCTATTGACGTCAATG
    ACGGTAAATGGCCCGCCTGGCATTATGCCCAGTAC
    ATGACCTTATGGGACTTTCCTACTTGGCAGTACAT
    CTACGTATTAGTCATCGCTATTACCATG
    CAAAAGGGTAAAGTTGTTTACCATTCGGACTTTTG
    GCCCCCTTCTCCGTGTACAAATTGGTAAAAAGCCT
    CAGGATTTTTGAGCTGTCCTTATCCCTCCCCTTGT
    TTCTTTTCTTTTCTTTTTTTTTTTGAGACAGGGTC
    TCACTCTGTCGCCCAGGCTGGAGTGCAGTGATGTG
    GTCTTGGCTCACTGCAACCTCCGTTCAAACGATTC
    TCCTGCCTCAACCTCCTAAGTAGCTGGGAGTACAG
    GCAGGCACCACCACTGCTGGCTAATTTTTGTATTT
    TTAGTAGAGACAGGGTTTCACCATGTTGGCCAGGC
    TGGTCTTGAACTCCTGACCTGCCAGGCTGGAGTGC
    AGTGGCATAATCTTGGCTCACTGCAAACTCTGCCT
    CCCGGATTCAAGTGATTCTCCTGCCTCAGCCTCCC
    AAGTAGCTGGGACTACAGGTACTTGCCATGACGCC
    CGGCTAATTTTTTGTATTTTTAGTAGTGACAGGGT
    TTCACCATGTTGGCCAGGATGGTCTCGATTTCTTG
    ACCTCGTGATCCACCTGCCTTGGCCTCCCAAAGTG
    CTGGGATTACAGGCGTGAGCCACTGTGCCCGGCCA
    TATGTGTTTAATAACCCAGTTTGTCTCTTCTTGCC
    TTCAGGACATCAAACGGTCATGCTACTGGAAGCTC
    GGACCATGGCCCCTTCTGCTGGGCACCCTTAGATA
    GGCCTCTGAGGGAGATCTGATTGCCATTTTCCCGA
    AACAGTGCCCCCTATCAGCAGGAAGCAGTTAAGAT
    CTGTCTTTGTCCCTATTCTTATCCTAGTGGCAGTT
    AGATAGACTTCTTTAGAGCGAGGAATGAGCCAGAT
    GGGAAGGGTTCCCTGGTAAAACTCCAACTGGCCTG
    TGCACTGCTCTGGGGTGGAGCCACAGAAGTTCGCT
    CCTCAGTTTGCAGCTGGGAGGAGCCTGGCCCTCCT
    CTTCCTCGGGAGAACCTGGGATTTAATCTGTGAGG
    TGGGAAGCCTACAACTAGGAGAACTCTGGCTCTGC
    TGAGTCCCTGTTCCCCTTTTTCTTCCTTTACACCC
    AATAAAACCCTGCCTTACCCTTCAAATTGTCTGCG
    AGCCTAATCTTTCCTGGCGTGTGACAAGGACCTCC
    ACCGTCTTTAGCTGAACTAAGGAAAAGTCCCGCAA
    CATTTTCATGTGTGCCTTTGTATTTTTATTACATA
    TATGGATGTCCCTAAACAACATACATTATTGTTTT
    GCTTTATATATTAAAAAGGCATTCTGCAATTTGCA
    TTTTCCCCAGCCTTGTGTTTTGAGATTTATCCATG
    TGGATGCGGGAAGCTGTAGAGCACTTCCCCTGCTG
    TGGGGCAGTTTGTTTTCTCGCTGGCGGACATTCAG
    CTTGTTGACAAGGTTTTGCAATTACAAGCACCTCT
    GCTATAGGTATTACTGTACCTGTCCCTTATACCTG
    CGATAGTGTCTCCCGGCATACAGCCAAGAATGGGT
    GTGCACACTCCTCGACCTGCTCCCTTCCACTTCGT
    CAGCCTGGCCGGGGGCTCTTTCAAGAGGTGGGGCC
    GCTTGGTGGCTTCTAAGTCGTGCCTCTCCCTCTGG
    GTAATAAACGCCTTGGGGGCAGGGACTTTGCAGTC
    TTGCTTATTGAGGCGTTTTCCAAAACAGGGACCGA
    GGGCTCTAACACCCGTGCGGAGGGTGGCCGGCGGG
    GGCGGGAGTTTGGCAGAATGCGGCTGGAGAAGGTG
    GGGCGAGGGAAGAAGGCTCCTGGCCTGCAGGGGCC
    TCTCTTCCGGCTGTTTTTCCTCCCGAAGACAAACA
    CTCCCAACGCAGGGGCGCAGAGCCTCGGCTTCTTT
    CCTGGCCAGCTCCCTCAAGGCAGGGCCTAGCATGG
    CCCGGCCCAGCCCCTAGGTTCCCCCGCCCGCGGCT
    CCCGGGCGCCTCCGCCTCCCGCGGTGACTCAGGCG
    GCGGTGACGGCAGCGGAGGGGGCGATGACGGGCGC
    GCGGGCGTCCGCGGCGGAGCAGCGCAGGGCGGGGC
    GCAG
    Figure US20230142867A1-20230511-P00255
    Figure US20230142867A1-20230511-P00256
    ATGAGCGGCGCGGGCAGAGCTTTGGCGGCCCTTCT
    TCTGGCGGCATCTGTGCTTAGTGCAGCACTCTTGG
    CTCCTGGGGGAAGTAGTGGTCGAGACGCTCAGGCC
    GCGCCACCGAGAGATCTGGATAAGAAACGGCACGC
    AGAGCTCAAGATGGACCAGGCACTTCTGCTCATAC
    ACAACGAATTGCTTTGGACCAATTTGACGGTATAT
    TGGAAGAGTGAGTGTTGTTACCATTGTCTCTTCCA
    GGTTCTTGTGAACGTGCCACAGAGTCCTAAAGCTG
    GGAAGCCTAGTGCTGCGGCGGCATCTGTTTCTACA
    CAGCATGGTAGCATACTGCAACTTAACGACACTCT
    GGAGGAAAAAGAGGTATGTCGATTGGAATATAGAT
    TTGGCGAGTTTGGCAATTATTCATTGCTGGTTAAA
    AATATCCACAACGGTGTTAGCGAGATTGCATGCGA
    TCTCGCTGTTAATGAAGATCCGGTCGATTCTAACC
    TGCCGGTCTCAATAGCTTTTCTGATAGGGCTGGCA
    GTGATTATCGTAATCTCATTCTTGAGGTTGCTCCT
    CTCACTCGACGATTTTAACAACTGGATCAGCAAGG
    CAATTAGTTCCAGAGAAACTGACAGGCTCATTAAC
    AGCGAACTTGGTTCACCCAGTCGCACAGACCCCCT
    TGACGGCGATGTGCAACCTGCGACATGGAGATTGT
    CCGCCCTCCCTCCGCGACTGCGATCAGTGGATACC
    TTCAGAGGAATTGCGCTCATTCTTATGGTTTTCGT
    AAATTACGGAGGGGGTAAATACTGGTATTTTAAAC
    ATGCCAGTTGGAATGGACTCACCGTTGCAGATCTC
    GTGTTCCCGTGGTTCGTCTTCATTATGGGCTCATC
    CATTTTTCTCAGCATGACCTCCATCCTCCAGCGAG
    GATGCTCCAAATTTCGCCTCCTCGGAAAGATAGCC
    TGGCGAAGTTTCCTGTTGATTTGTATTGGGATTAT
    TATCGTAAACCCCAACTACTGTCTGGGGCCTTTGT
    CATGGGATAAAGTAAGGATACCTGGAGTTCTCCAA
    AGACTCGGCGTGACTTATTTCGTCGTTGCAGTTCT
    CGAACTCCTCTTTGCCAAGCCGGTCCCCGAGCACT
    GCGCATCAGAACGCTCATGCCTCAGCCTTCGCGAC
    ATAACAAGCAGTTGGCCTCAGTGGCTGCTTATCCT
    TGTCCTGGAAGGCCTTTGGCTTGGTCTTACCTTTT
    TGCTTCCCGTACCAGGATGCCCAACTGGCTACCTC
    GGGCCCGGAGGGATCGGCGACTTTGGAAAGTATCC
    CAATTGTACCGGAGGAGCCGCGGGCTATATTGATC
    GACTTCTCTTGGGGGATGACCATCTTTACCAACAC
    CCGTCTAGTGCAGTACTTTACCATACCGAGGTAGC
    CTATGACCCGGAGGGAATCCTTGGTACCATTAACT
    CAATAGTCATGGCGTTCCTTGGTGTACAGGCTGGA
    AAAATTCTCCTTTACTACAAAGCTCGAACGAAGGA
    TATTCTTATAAGATTTACAGCGTGGTGCTGCATCC
    TTGGACTGATTTCTGTGGCCCTGACGAAGGTATCC
    GAGAACGAGGGATTCATCCCCGTTAATAAGAATCT
    GTGGAGCCTTTCCTATGTCACCACGCTCAGCTCTT
    TTGCTTTCTTCATTTTGCTTGTTCTCTATCCAGTA
    GTGGATGTAAAGGGCCTGTGGACAGGGACTCCGTT
    CTTCTACCCGGGGATGAACAGCATACTTGTTTATG
    TGGGACATGAAGTATTTGAAAACTATTTCCCCTTT
    CAATGGAAACTTAAAGATAATCAAAGTCACAAAGA
    GCACTTGACACAAAACATTGTAGCGACAGCGTTGT
    GGGTCCTCATTGCATATATCCTCTACAGAAAGAAG
    ATATTTTGGAAGATAGACTATAAAGATGATGATGA
    TAAGTA
    Figure US20230142867A1-20230511-P00257
    Figure US20230142867A1-20230511-P00258
    Figure US20230142867A1-20230511-P00259
    Figure US20230142867A1-20230511-P00260
    Figure US20230142867A1-20230511-P00261
    Figure US20230142867A1-20230511-P00262
  • The NeuroD1 protein sequence is:
  • (SEQ ID NO: 72)
    MTKSYSESGLMGEPQPQGPPSWTDECLSSQDEEHEADKKEDDLETMNAEE
    DSLRNGGEEEDEDEDLEEEEEEEEEDDDQKPKRRGPKKKKMTKARLERFK
    LRRMKANARERNRMHGLNAALDNLRKVVPCYSKTQKLSKIETLRLAKNYI
    WALSEILRSGKSPDLVSFVQTLCKGLSQPTTNLVAGCLQLNPRTFLPEQN
    QDMPPHLPTASASFPVHPYSYQSPGLPSPPYGTMDSSHVFHVKPPPHAYS
    AALEPFFESPLTDCTSPSFDGPLSPPLSINGNFSFKHEPSAEFEKNYAFT
    MHYPAATLAGAQSHGSIFSGTAAPRCEIPIDNIMSFDSHSHHERVMSAQL
    NAIFHDDYKDDDDK 
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for NeuroD1 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • NeuroD1-DNA
  • (SEQ ID NO: 73)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGA
    GGAACTTGGTTAGGTACCTTCTGAGGCGGAAAGAA
    CCAGCGTTACATAACTTACGGTAAATGGCCCGCCT
    GGCTGACCGCCCAACGACCCCCGCCCATTGACGTC
    AATAATGACGTATGTTCCCATAGTAACGCCAATAG
    GGACTTTCCATTGACGTCAATGGGTGGAGTATTTA
    CGGTAAACTGCCCACTTGGCAGTACATCAAGTGTA
    TCATATGCCAAGTACGCCCCCTATTGACGTCAATG
    ACGGTAAATGGCCCGCCTGGCATTATGCCCAGTAC
    ATGACCTTATGGGACTTTCCTACTTGGCAGTACAT
    CTACGTATTAGTCATCGCTATTACCATG
    TGTCGCGAACTGGAGGTTTCTGCTTTCTGGGGGTG
    AGGACAAGGATTTAGCAGAGGGACCTCTAGGCTAC
    CCCTGACCTCGAAGCTAGCTGGAGTCCTTTCTCCC
    ACTCCGTCCCCAGTTCTCCTCCGGAACTCCCCAAA
    GTTGGAAAAATGAGAGCCTAGGCACGAGTTGCATA
    TGCCTGCGGCATAGTGTGCTGCCAGTCGGGGCAGC
    TGCCGGGTGAGGGGCAGTACTTAACCTGGCCCTGC
    CGCAAACGTCGGGCTGTGTACTATCTTCTAAGGGC
    TAGTACTCCGACGCCCTCGAGTTTTAGCCCAACAC
    TGAAGATACAGACAGTACAGGGGTAAATCCTGGGC
    CCAGGCTCCTGGGTGAATTATTTTATCCTCCCCTA
    TGCGCTTTGTCAGCCAAGTGAATGTAGAAAAGCAA
    GCAAAACTCCAGAACCCCACAACGATTCACGCCAC
    GCTTCGGTGGCACAACTCTTTTCTTCCCCTTTTAG
    ATTAGTACTGTGCAGGGAAGGCTGCACCTGCTACC
    TTCAGACACCGGGGATCTGGCTAGGACCCTCTTCC
    TCTACCCCTACTCTCTCCTCCCGTCCCCACCCTCT
    CCCAGATGCACCCCAATCCAGCCTCTCAACCCTGT
    CGCTGTTTGCAAACTCGAGGGATGTTTAGTGCAAT
    TGCCTTTAAATAAGCAATTCACGTAAGAGGTATTT
    ATTGCAGAGGGTTTCAGGTCTGGGAGGGGGTAAGG
    AGTTAGTTAATGAAGGATTAGAGGCAAAGCTGGGA
    AAGAACGAGTAGACAGGCCAGTTTCTTTCTTTCTT
    TTTAAAATCTATTATTCTGGAAAACCCTTGTTCGA
    AAAGAGTAAAGAATTGCTATTTGGGCAACAATATG
    GCTCCTTCAGAACTGGTCGAGAATTTCTGTGGTCT
    TTGCTTGACCTTCAAGTTGGGAGTTTCAAATCAGA
    GGCTTTGAAGTCTGGAGGAGTTCGCAGCCATTAAT
    CCTTCCGGGGTTGTGCTAAAGGCGTTTCTAGAAAG
    AGATCGGAAAGACCCGTTCAGCACAGGCCCATTGC
    ATCTTCTCTTTGAAAACGCTTGTCTGAATCAAGTA
    TCTTATACAGACCCGAGACTGAGGGGAGCAAGAGC
    GCAGGTCCGGCTGTGGCGAGGTGTCCACCACCGTA
    CTTGTGGTCAATGCCTGGACTAGAGAGCCGAGGCC
    GTCCAGGGCGGGCTGGCCAGGGAACCAGAAGCGTC
    CCTGGACTGGCTGAGTGAGGGTGCCCTTTCCAGCA
    ACCCAGTGGCCATGGTTAAAAGGCAGAATACAACC
    GTCAAAGAAGTCCCCAGCTCCCACATAAGAGAGCG
    CAAAGGCCAATGGAACCAAGGCCTCCTCGCGAGAA
    CTAGCTCGCTTTGAGGACAAGATCCTGGGGATGGA
    GGTGGGGACTGACGTAGTGAGAGGGTCTGGAGGAA
    CAGTCGTAGCTGAAGGTCAGGACATGAAGAATTGC
    ACGTATGGTAGGGATAGAATGGCAGGTACAACTTT
    TCTGTACTGAAGGAACTGGGAGAGGACGATCCGGT
    TAGGGAGGTTGGGGAACTAATCTCAACGCTGCGTT
    TACAGATGAAGCCGCTTTTATATGGCGTATATGTT
    TGCTTAGAGGGGCCGACGGAGATTAGGAGAAGCCA
    TCCTTTGGCGCCAATGATCAAAGCGTCTGCCAAGG
    AGAAGAAGCCAAGGGATGGGCCTTTCAGAGAGGGC
    AAGGAGTCATGCTGCTCTGGATGCCAGTGTCAGGA
    CAAGAAATCGAAAGGAGCGAGGACTCTTCACTGCG
    TGCCTCAGTCTCCCCGCTTCTGCCTCTTTCACCTC
    TGTCCTACTTCCGGCGCGAAAGCAGGCCACTCGCT
    CTGATCTAGACCTAGTTAACATTAGCTTTTCCCTT
    CCTTCCCTCAAATCCCCTCCCCCCTCTATCCCCGT
    CCCTTCTGCCGCCTGAAAGGGTTAATCTCTCCTGC
    GGGTAAAAACAGGTCCGCGGAGTCTCTAACTGGCG
    ACAGATGGGCCACTTTCTTCTGGCCACAAAGGGGC
    CGGAATGGAGCGCTCCGCGGCATACAAATGGGCAG
    GTCACGTGGTTCCAGGCTCTTGGCTGGACCGGGAA
    GACCATATGGCGCATGCCGGGGAGGAAGGAGGAGG
    GGCGGGGGTAGGGGTGGAGGGTGAGGGGAGCGGTT
    GTCGGAGGAGGGCGGGAGACGAGCAAGGCGTGGGG
    AGAAGTGGGGAGGAGGGGAGAACGGGGAGCGCACA
    GCCTGGACGCGTGCGCAGGCGTCAGGCGCATAGAC
    CTGCTAGCCCCTCAGCTAGCGGCCCCGCCCGCGCT
    TAGCATCACTAACTGGGCTATATAACCTGAGCGCC
    CGCGCGGCCACGA
    Figure US20230142867A1-20230511-P00263
    ATGACGAAAAGCTACAGCGAGTCAGGGTTGATGGG
    GGAGCCTCAACCCCAAGGGCCCCCATCTTGGACGG
    ATGAATGCCTGTCATCACAAGACGAGGAACACGAG
    GCAGACAAGAAGGAGGACGACCTGGAAACGATGAA
    CGCTGAGGAAGACAGCCTCAGGAACGGAGGCGAAG
    AAGAAGATGAAGACGAAGACCTGGAGGAAGAGGAG
    GAAGAAGAGGAAGAGGATGACGACCAGAAGCCCAA
    ACGGCGCGGGCCCAAAAAGAAGAAAATGACTAAAG
    CCAGGTTGGAGCGGTTCAAACTCAGGAGGATGAAG
    GCTAATGCAAGAGAGAGGAATCGCATGCATGGACT
    CAACGCCGCTCTCGATAACCTCCGGAAGGTCGTGC
    CATGTTATTCTAAAACACAAAAACTGTCCAAAATC
    GAGACCCTGAGGTTGGCAAAGAACTACATATGGGC
    GCTCAGCGAGATTCTTAGATCTGGTAAGTCTCCGG
    ACCTCGTTTCCTTCGTCCAAACCCTGTGTAAAGGT
    CTGTCACAGCCAACCACCAACCTTGTAGCTGGGTG
    CTTGCAGTTGAACCCCCGCACTTTCCTTCCTGAAC
    AAAATCAGGATATGCCTCCGCACCTCCCTACCGCC
    AGTGCATCTTTTCCGGTACATCCCTACTCATACCA
    AAGCCCGGGTCTGCCTAGTCCGCCGTATGGTACAA
    TGGATTCCTCCCACGTGTTCCACGTAAAACCCCCA
    CCTCATGCCTATTCCGCTGCCTTGGAGCCTTTTTT
    TGAGTCTCCGTTGACAGACTGCACCAGTCCTTCTT
    TTGATGGGCCCCTGTCACCTCCTCTTAGCATCAAT
    GGAAACTTCTCTTTCAAACACGAGCCAAGTGCAGA
    GTTCGAGAAAAATTATGCTTTTACTATGCACTACC
    CCGCAGCTACCCTGGCAGGTGCGCAAAGTCATGGA
    TCTATCTTTTCAGGAACGGCTGCGCCGAGGTGTGA
    GATACCCATTGATAATATCATGAGTTTCGATAGCC
    ACAGCCATCACGAGCGCGTGATGAGCGCTCAACTG
    AATGCAATTTTTCATGACGACTATAAAGATGACGA
    CGATAAGTA
    Figure US20230142867A1-20230511-P00264
    Figure US20230142867A1-20230511-P00265
    Figure US20230142867A1-20230511-P00266
    Figure US20230142867A1-20230511-P00267
    Figure US20230142867A1-20230511-P00268
    Figure US20230142867A1-20230511-P00269
  • The SLC17A5 protein sequence is:
  • (SEQ ID NO: 74)
    MRSPVRDLARNDGEESTDRTPLLPGAPRAEAAPVCCSARYNLAILAFFGF
    FIVYALRVNLSVALVDMVDSNTTLEDNRTSKACPEHSAPIKVHHNQTGKK
    YQWDAETQGWILGSFFYGYIITQIPGGYVASKIGGKMLLGFGILGTAVLT
    LFTPIAADLGVGPLIVLRALEGLGEGVTFPAMHAMWSSWAPPLERSKLLS
    ISYAGAQLGTVISLPLSGIICYYMNWTYVFYFFGTIGIFWFLLWIWLVSD
    TPQKHKRISHYEKEYILSSLRNQLSSQKSVPWVPILKSLPLWAIVVAHFS
    YNWTFYTLLTLLPTYMKEILRFNVQENGFLSSLPYLGSWLCMILSGQAAD
    NLRAKWNFSTLCVRRIFSLIGMIGPAVFLVAAGFIGCDYSLAVAFLTIST
    TLGGFCSSGFSINHLDIAPSYAGILLGITNTFATIPGMVGPVIAKSLTPD
    NTVGEWQTVFYIAAAINVFGAIFFTLFAKGEVQNWALNDHHGHRHDYKDD 
    DDK
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for SLC17A5 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • SLC17A5-DNA
  • (SEQ ID NO: 75)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGA
    GGAACTTGGTTAGGTACCTTCTGAGGCGGAAAGAA
    CCAGCGTTACATAACTTACGGTAAATGGCCCGCCT
    GGCTGACCGCCCAACGACCCCCGCCCATTGACGTC
    AATAATGACGTATGTTCCCATAGTAACGCCAATAG
    GGACTTTCCATTGACGTCAATGGGTGGAGTATTTA
    CGGTAAACTGCCCACTTGGCAGTACATCAAGTGTA
    TCATATGCCAAGTACGCCCCCTATTGACGTCAATG
    ACGGTAAATGGCCCGCCTGGCATTATGCCCAGTAC
    ATGACCTTATGGGACTTTCCTACTTGGCAGTACAT
    CTACGTATTAGTCATCGCTATTACCATG
    GTCAAACTAAGCAAAGGGTATGGAGAACTCACTGT
    ACTATCTTTGCAACGTTTATGTAAACCTAAAATTT
    CTAAAATAAAAAATATTAAAAAAAAGAATTCCTCT
    GGTTTGAAGTCTTTCAAGTCTGTTGCAACTATATT
    CCCTGGCCCAAGTTATCTATAACCTTGGTCATCTG
    GGAGCTTGTTAGCAATACAGAATCTCAGGCACATC
    CCAGATGTACTGGGCTTCTCAGAGAAACAGCCCCA
    ATTGGGTAGTTTGGGTAATAAAGTTTTGGCTTTCA
    CAATATAGGCTGGCCAACTCGACCCAAAACAGATG
    ATGGTGCACATGCCTGGAGACCCAGGAGGACCTGT
    GGTGCAGATGAAGTCTGAGGACACTCTGCTGGAGA
    AGTCTCCCTTGCTTGGGCAAGAGAAGTGGGGGAGT
    CGGATTTTTTATTCTAGATAGGCCTTCAACTGATT
    GAAGGAAGCCCACCCAGATTATGGAAGGCAGCCAT
    CTTTACCCAGAGTTCAATGATTTTAGTGTTAACCT
    CATCCAAAGACAACCAGCAAGTTAACACATAAAAT
    TAACCATCACACCAGATCTGCTGAATCTGAATCTA
    CATTTTAAAAGGATTTGCCAGGTGACTCTGCATAA
    TCCAGTTTGAAAATTGCATTTTTCTACAACCCTTA
    GAGAGTGTAAAAATCTACATGGTAATTCAGTAACT
    CAGAAACAAGGCTTGATTTGCTGCTTTAACTTGAG
    AACTTTTCTAGCATTTACTGATCTGGAGGTAGGGA
    TGCCTACAGAAATCCTCGCCCTCCCTTCTAGGTAA
    AGGACCCTCCCCTTTGTTCTCACGTGAACCTCACT
    AAGCTTCTAACCTGGTTTACTGAATTAGTGGCTTG
    TGGGTCTTCCCTGTCTACACCAAACTCTCCGAGAG
    CATCTACATCTTATTTTCTACCTCCACAGAGACAG
    CCTTTATTGGCGGATCCAGATCTGCCCTGCCAAGG
    CAAGCTGGGTTTTCCTAGTTTCACTTTCAGAGTCC
    GCAAATGCTAGCACAATCTATCCAAATTCTATCGT
    GGAGATAAAGCATGCACACATACATCAGAGGATTC
    CATAGGTCGAACCATATGAAACTACCTATGTTGAT
    TACTTTTACCTATAAAACCAGCAACATCATAGGAC
    TCAACAAATTTCCATTGCTGTTTTTCCGCCTAAAA
    GCTATTTTAGGACCAGGAGCAGTGGCTCACGCCTG
    TAATCCCAGCACTTTGGGAGGCCGAGGAGGATTGC
    TTGAGCCCAGGAGTTCGAAACCAGCTTGGGCAATA
    TAATAAGACCAAGCCTCTACAAAAATAAAAAGTAG
    AAACAAAATTAGCCAGATGTGGTGGCATGCTCCTG
    TAGTCCCAGCTACTTGGGGGAGCTGAAGAGAATCG
    CTTGAGTCCAGGAGTTGGAGGTTGCAGTGAGCTGC
    GATTGCGCCACTGCATTCCAGCTTGGGCAACAGAG
    CAAGACCCTGCCTGGGAGAAAAAAAAAAAAAAAAA
    AAAGCTATTTTAGGGACAATATGCCGCCACAGTCA
    CGAGGTAAATATGCCGCCACAGTCACGAGGTAGAT
    CGTACAAAGGTTGCTAACAGTATGGCAACTGACCG
    CTAACAAGCGAATGTCAAGTTGTGAAATGCACACC
    ACCCACTTCCCTAGTTCGGTCGAAGCCCTCCCCTT
    AATTATGTGCAATTCAAGTCCCCACTGCCCGCCCG
    CAAGCCCCCACTCATCCTCGCTGCGGGCAGGGTGG
    CCCCTGCACTTTACAAGGGGGTGCAGGAGCGGGAG
    ACGGTCGTCCGAACACCGGCTCCCCGGCATGCGTA
    GACCGGCGGGCGGAGCGGGCTCACTTTGCGCCAAT
    CCTACGAGAACTCCCAGAACTCCGCTTCCCTAGTC
    CAACCCAAGCCAGAGTTGCCCACACCTAAGATGGC
    GGCGGGGGCGGAGTCGGCGCGGCCGCCTCTGGGCG
    GGACCGCGGGGACTAGACGTGGCCGCGGGGCGGTGT
    CAT
    Figure US20230142867A1-20230511-P00270
    Figure US20230142867A1-20230511-P00271
    ATGCGGAGCCCCGTGAGAGACCTGGCCAGAAACGA
    TGGGGAAGAGTCTACAGACCGGACCCCGTTGCTTC
    CGGGTGCTCCCAGGGCAGAGGCCGCCCCTGTCTGT
    TGTAGCGCACGCTACAACCTCGCCATACTTGCTTT
    TTTCGGGTTCTTCATAGTTTACGCCCTGCGGGTGA
    ACCTTAGCGTAGCTCTTGTAGATATGGTCGACTCT
    AACACTACTCTGGAAGACAACAGAACCTCTAAAGC
    GTGTCCAGAGCACTCAGCCCCTATTAAGGTACACC
    ATAACCAAACCGGAAAAAAGTATCAGTGGGACGCG
    GAAACCCAGGGGTGGATACTCGGCTCATTTTTCTA
    CGGTTATATTATCACCCAGATTCCTGGAGGCTATG
    TAGCCTCCAAGATCGGTGGGAAAATGCTCTTGGGA
    TTCGGCATTTTGGGTACTGCGGTACTTACCCTGTT
    CACTCCTATAGCAGCCGATCTTGGGGTTGGCCCAT
    TGATCGTTCTCAGAGCCCTCGAAGGTCTGGGAGAG
    GGCGTTACATTTCCAGCAATGCACGCGATGTGGTC
    AAGTTGGGCGCCACCCTTGGAGCGAAGCAAACTTC
    TCTCCATTTCATATGCAGGGGCCCAGCTGGGAACT
    GTAATAAGTCTGCCGCTGAGCGGTATAATCTGCTA
    CTATATGAATTGGACATACGTCTTTTATTTTTTCG
    GAACAATAGGGATTTTCTGGTTTCTCTTGTGGATA
    TGGCTGGTTAGCGATACCCCGCAGAAGCACAAACG
    CATAAGTCACTATGAAAAAGAGTACATCTTGTCTT
    CACTTCGCAATCAACTGTCTTCTCAAAAGAGCGTG
    CCTTGGGTACCGATATTGAAGAGCCTGCCTCTCTG
    GGCAATCGTAGTAGCGCACTTTTCATATAATTGGA
    CCTTTTACACACTGTTGACTCTTCTCCCAACGTAT
    ATGAAGGAAATTCTCCGATTTAACGTGCAAGAAAA
    TGGATTCTTGTCATCACTTCCTTACTTGGGTTCAT
    GGCTCTGCATGATTTTGTCTGGTCAGGCGGCCGAT
    AATCTGCGCGCCAAATGGAACTTTTCTACCTTGTG
    TGTAAGACGAATCTTTTCCTTGATCGGCATGATAG
    GCCCCGCTGTGTTTTTGGTAGCAGCCGGCTTCATT
    GGGTGCGATTACTCACTGGCTGTCGCGTTTCTCAC
    AATTTCTACCACACTTGGCGGTTTTTGTAGCTCTG
    GCTTCAGCATAAACCATCTCGACATAGCACCTTCC
    TACGCGGGTATACTGTTGGGTATCACAAATACCTT
    TGCCACAATCCCAGGTATGGTCGGCCCAGTTATAG
    CCAAATCTCTGACTCCTGACAATACTGTAGGAGAA
    TGGCAAACAGTATTCTATATCGCCGCTGCTATCAA
    TGTTTTCGGGGCAATTTTCTTCACGTTGTTCGCGA
    AGGGGGAAGTCCAAAACTGGGCTCTTAATGACCAT
    CATGGGCACAGGCACGACTACAAGGATGATGATGA 
    TAAGTA
    Figure US20230142867A1-20230511-P00272
    Figure US20230142867A1-20230511-P00273
    Figure US20230142867A1-20230511-P00274
    Figure US20230142867A1-20230511-P00275
    Figure US20230142867A1-20230511-P00276
    Figure US20230142867A1-20230511-P00277
  • The CLN6 protein sequence is:
  • (SEQ ID NO: 76)
    MEATRRRQHLGATGGPGAQLGASFLQARHGSVSADEAARTAPFHLDLWF
    YFTLQNWVLDFGRPIAMLVFPLEWFPLNKPSVGDYFHMAYNVITPFLLL
    KLIERSPRTLPRSITYVSIIIFIMGASIHLVGDSVNHRLLFSGYQHHLS
    VRENPIIKNLKPETLIDSFELLYYYDEYLGHCMWYIPFFLILFMYFSGC
    FTASKAESLIPGPALLLVAPSGLYYWYLVTEGQIFILFIFTFFAMLALV
    LHQKRKRLFLDSNGLFLFSSFALTLLLVALWVAWLWNDPVLRKKYPGVI
    YVPEPWAFYTLHVSSRHDYKDDDDK 
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for CLN6 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • CLN6-DNA
  • (SEQ ID NO: 77)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGA
    GGAACTTGGTTAGGTACCTTCTGAGGCGGAAAGAA
    CCAGCGTTACATAACTTACGGTAAATGGCCCGCCT
    GGCTGACCGCCCAACGACCCCCGCCCATTGACGTC
    AATAATGACGTATGTTCCCATAGTAACGCCAATAG
    GGACTTTCCATTGACGTCAATGGGTGGAGTATTTA
    CGGTAAACTGCCCACTTGGCAGTACATCAAGTGTA
    TCATATGCCAAGTACGCCCCCTATTGACGTCAATG
    ACGGTAAATGGCCCGCCTGGCATTATGCCCAGTAC
    ATGACCTTATGGGACTTTCCTACTTGGCAGTACAT
    CTACGTATTAGTCATCGCTATTACCATG
    AAAGGCAGATGTGTTTGTATTTAAGAGATTGGAGG
    ATTAAATGAGCTAGAGAGGCTCCCTCCACACGGCC
    TCATCCTCTGATGGCTGGAAGCCCACCTCTGAGTG
    TTATTAAATCGATGATGACATCATGGGCACTGCAT
    GCCCCTCCCTCTGGAGCCCAGCAGGCTTGCAAAGC
    CTGCAGAGGATAAGAAATTGGGGTCAGAGCGGGAA
    GCCTCAATCCTTCTTCTCTGGTGTATACCGAGATC
    GATGTGTAGATCAGATTCACCAGAATTAAATCTGG
    GGCCCCGTCTGAGTGCTGTCCTTGGCTGACCCTTG
    GGCACAGTACTTCAGCTCAGGGCCTAGGTTTTTCT
    CTGCTACAAAACAGAGACAATCACCTGTTCTCCTT
    ACGACCTGGTGTGTGTGGATCAAATGAAACAAGGA
    AAGCTTTAAAGCAAATGTGTTCTTCAGGGGACCAG
    GGGCGGAGGGGGGGGATTGAGGTGTGGCCTGTGGG
    CTGCTAGAGCCCAGGATGGGGCTGATGGAAAATGA
    CTCCCAGGCAGGCAGTCCTAGCCTCCCACAGCCAA
    GAAAGGACATGCAGGTTCTGACCAAAGGGGAAAGA
    AATAAATCCAATAAACATCTCCCAAGAAGAAAGAC
    AGGAAAGAAAATAGAATGGAAATAGGCATAGCTCA
    GACCCCTCATGGTCTGGGAAGCAGACCAAGGATAC
    TGGGCAGGGGAATGGCACATTGCATTTGCATCTGA
    GGCACATCCCTTCCAAGAGGCACATCCCTTCTGCA
    GCGGCTTGAGGGCTGGGTGAGAGGAGACAGGACTG
    GAGACCTGCACATGTCGTGTGCGCCTGGTGTAGCA
    CGGGGGTGGTGGGGAGAATGGGAGTTGGTTGCAGT
    TGTTCTAGTTAGCGGCCTGGGCTGCTGACAGTGAC
    CGTGGGGAGGAGAGAGTTTGTGACATTGACAGTGG
    TGTGGAAGGTAACCTTTCTGTGGAGGGAAACCTCA
    GGTCTCGAGACGGGCTATTGTAAATAACTTAGTCC
    ACCAGTGCTGGGCTCTCTGCTAAGAGTTTCATATA
    GTCATCACAACTCCAATGTAGTAGATGCAATAGGC
    ATCTTCTTGTTACAGATGAGTAAACTGAAGCTGTG
    AGGGGTTGAGCCATTTGCCCGCATCACAGCTAGTT
    CAAAGTGGTACCACTAATGAAGTCCTGGCAATCTG
    CCCTGCAGGCTGTTCTCAGGTTTGACCTTTGGTTC
    TGTTTTTCACTTGAGGATGCTGTCCCCTTAGGTTC
    TGTCTGTCTTGTATGCTCCGGTGACTCACGAGCCT
    GTCCACTGGGTTCCCAGGTCTCGCCTCAGCTTCAG
    ACCACCGGTCCAACAGCCTTCCTGACCCACCTCAA
    GCTTAATGCGTTCAAAGATGAATCTTTATTTACAC
    GAAATCTGCTCTAGCCCACTTCCCGGGGGGTGCCT
    CCACTATCCACCCAGGGGCAGAATCCGGCCACTGC
    CTGAACTCTTCTCCCCTCACCCCCCACATCCAGAG
    TCCCAGTCACCATACCTCTCACACGACTCTAGATG
    TCACCAACTCGCTCCTTCCCACTGTCACCGCCAGC
    CCCTCACTAGAACATCTGCAACCCACGTGCTGCAC
    CACGCGGGTGCTACACGTGCAGGTCTCCAGTCCTG
    CCTCCTCTCCTCCACCCGTAAGAACAAGGGAAATC
    CCACAGAACCCTCTGCCCCGGGGAGCATTTAAATC
    AGCTTAGTCTATCTCCAGGTCGCTACCCCGCCCCA
    GCCCAACACCCAGGCCTCAGTCACCCTAGACAGAG
    ATTCCCGTACCCAGGCAGGCAAGGGACCCTAACAC
    ATCCCAGCCCGCTGGTGGTCACGCGCGCAGCCCGG
    ACAGGCCAGCAGCCCCCGCCCCGCGAGGACGCCAC
    ACTCGCAGCGCCCGGCCCTCCCCTAGCGGAGCCCG
    CCCCGTCTCCCTCCGCCCCGCCTCCCTCCGCTCCG
    CTCCGCTCCGCTCCGCCCCGCCCCGGGTCCGCAGC
    CCCG
    Figure US20230142867A1-20230511-P00278
    Figure US20230142867A1-20230511-P00279
    ATGGAAGCCACTAGAAGAAGACAACACTTGGGAGC
    GACAGGTGGACCCGGTGCGCAGTTGGGAGCCTCCT
    TTCTTCAAGCTAGGCATGGTAGCGTAAGTGCTGAC
    GAAGCCGCACGCACTGCACCTTTCCACTTGGATTT
    GTGGTTTTATTTCACCCTCCAGAATTGGGTTCTGG
    ATTTCGGGCGACCCATCGCAATGCTGGTGTTCCCG
    CTTGAGTGGTTCCCCCTTAATAAACCCAGCGTGGG
    AGACTATTTTCACATGGCATACAACGTGATAACCC
    CTTTCTTGCTCCTTAAACTCATAGAGCGCAGCCCC
    AGAACGCTTCCCAGAAGCATAACGTATGTGTCCAT
    TATTATCTTCATAATGGGTGCCTCAATACACCTTG
    TCGGAGATAGCGTAAATCACCGACTCCTCTTTTCA
    GGATACCAACACCACCTGAGTGTTAGGGAAAACCC
    GATTATTAAGAACCTCAAACCTGAGACCCTTATTG
    ATTCTTTCGAGCTGCTGTACTATTACGACGAATAC
    TTGGGTCATTGTATGTGGTATATTCCATTCTTTCT
    CATTCTCTTCATGTATTTTTCTGGGTGCTTTACGG
    CAAGCAAGGCGGAGAGCCTTATCCCAGGCCCCGCG
    CTCCTTCTTGTTGCCCCAAGCGGTCTCTATTATTG
    GTATCTGGTAACCGAGGGGCAAATATTTATACTTT
    TTATTTTCACCTTCTTTGCCATGCTCGCCTTGGTT
    TTGCACCAGAAGAGGAAGAGATTGTTCCTCGATAG
    CAACGGACTTTTTCTGTTCTCTTCCTTCGCGCTTA
    CCCTTCTCTTGGTAGCCTTGTGGGTCGCCTGGCTG
    TGGAATGACCCAGTTTTGCGGAAAAAGTATCCCGG
    TGTGATCTACGTGCCTGAACCTTGGGCATTCTACA
    CTCTTCATGTGTCCTCTAGACACGATTATAAGGAT
    GATGATGACAAATA
    Figure US20230142867A1-20230511-P00280
    Figure US20230142867A1-20230511-P00281
    Figure US20230142867A1-20230511-P00282
    Figure US20230142867A1-20230511-P00283
    Figure US20230142867A1-20230511-P00284
  • The CLN8 protein sequence is:
  • (SEQ ID NO: 78)
    MNPASDGGTSESIFDLDYASWGIRSTLMVAGFVFYLGVFVVCHQLSSSLN
    ATYRSLVAREKVFWDLAATRAVFGVQSTAAGLWALLGDPVLHADKARGQQ
    NWCWFHITTATGFFCFENVAVHLSNLIFRTFDLFLVIHHLFAFLGFLGCL
    VNLQAGHYLAMTTLLLEMSTPFTCVSWMLLKAGWSESLFWKLNQWLMIHM
    FHCRMVLTYHMWWVCFWHWDGLVSSLYLPHLTLFLVGLALLTLIINPYWT
    HKKTQQLLNPVDWNFAQPEAKSRPEGNGQLLRKKRPDYKDDDDK 
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for CLN8 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • CLN8-DNA
  • (SEQ ID NO: 79)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGA
    GGAACTTGGTTAGGTACCTTCTGAGGCGGAAAGAA
    CCAGCGTTACATAACTTACGGTAAATGGCCCGCCT
    GGCTGACCGCCCAACGACCCCCGCCCATTGACGTC
    AATAATGACGTATGTTCCCATAGTAACGCCAATAG
    GGACTTTCCATTGACGTCAATGGGTGGAGTATTTA
    CGGTAAACTGCCCACTTGGCAGTACATCAAGTGTA
    TCATATGCCAAGTACGCCCCCTATTGACGTCAATG
    ACGGTAAATGGCCCGCCTGGCATTATGCCCAGTAC
    ATGACCTTATGGGACTTTCCTACTTGGCAGTACAT
    CTACGTATTAGTCATCGCTATTACCATG
    TGAAATCGTATGGATCAAAGCTGTCCTCTTGGCCT
    GAGCCGGTTGCTGAGTGCGGGCCACAAGATCAGAT
    GAGCCAGTTTATCCATCTGGATGGTCCCAGCTAAT
    TCAGCAAGTGCAAGGTCTGCAAAATATCTCAAGCA
    CTGTTGTTAGGTTTTTCAATAGTGATGTTATCCCC
    AGGAGCAATTTAGGGAGGGTCAGAATCTTGTAATC
    TCCAGCTGCATGAATTCTAAACCATAATTTCTAAT
    CTTGTGGCTAATTTGTTAGTCCTACAAAGGCAGTC
    TAGTCTCCAGACAAGAAAAGGATTTGTTTCGGGAA
    ACCATAAACTAAGTTTCTTCCAAAGTTAGTTTGGC
    CTACACCCAGGAATGAACAAGGACAGTTTGGAGGT
    TAAAAGTAGGATGGAGTTGGTTAGGTCAGATCTCT
    CACTGTCTCAGTTATAATTTTGCAATGGTGGTTTC
    ACTGGTTTTAAGATGGAGTTTATTTTATTTTATTT
    TTATTTTTTTGAGACAGAGTTTCATTCTTGTTGCC
    CAGGCTAGAGTGCCATGGCGTGATCTCAGCTCACT
    GCAACCTCTGCCTCCTGGGTTCAAGCAATTCTGCT
    TCAGCCTCCTGAGTAGCTGAGATTACAGGTGCCCA
    CCACCACACCCGGCTAATTTTTTTCTCCCAAACTG
    CTGGGATTACAGGCTTGAGCCACCGCGCCCGCCAA
    GATGGGTTGTTGATTTTAAAACGATGTCACCCTGG
    CTCTCCTAGGTTCCTGTTCCCCTAACATCATGAAG
    GCTGCTGGCAGTTCTCTGGCCTCCAGGGAGGCCTC
    CACACACATTTACCCATGTTCTTAGACAATCCTAG
    GGACTCAAGGCTTCAACAAACCTAAATGAGCCAGC
    AGAAAAATAAACTTGAACATGAATGTCAACGTTTT
    CCCACATGTACTAAAGGGTTATAATAATTTTGAAA
    TGTTTATTTCAAAGAGCGTTGGTAATTTAAACGTC
    CTATTTAATCCCCAAAACAGTCCTGAGGGGGAGAT
    AAGGCAGTTATCTTCATAGTACAGGAAAGGAAAAA
    AGCGAGGGTCCAAGGCCGACTATACCCTCAGCTCC
    ATTAGCCCCCGAGGCCTCCCTGACAGGCGGGGCGG
    ACAATCCCAGTGCAGATGCTCTGTATCGATCGCAT
    GCTATCGGTTCTTTCAAGGAACGTGTATTGATCAT
    CAATTAAGTGGTGAGTACTCCTCTAGATGTCGATT
    CTAAAGAGGGAACAAAACACATTAGACAGAAGAAA
    CATAAGTAACTCGAATAACAACCTTAGAGAGAAGA
    AACATAAGTAACTCGAATAACAACCTTAGCAAACT
    GCGGAAACTCCTACAGACAAAAACTCAGGTGTGGG
    CGCAGAAGGGCCGGGGATGCGCTCGGTCAAGACTT
    GAAGGTCCGGGGCTTGGGCCCTGTGTCCTCACCGA
    GCCCCGGGGAGGCTCCGGCTCCGTCCACACCCGGA
    GCGCGACTCAGCCGCGGCCGGGGCTCTTCTGGAGA
    ACGGCGCGGTCAGGCTCTGCCGCAGCTTTCAGGTC
    CCCTCGGCCCCGTCCGGGTGCAGCGCAACCAGGCT
    GCCCGCGGAGCCCCACCGCCCCCATGCCGCCGCCC
    GCCATGGTTCAGCCCGGCCCCGCCCCTCCCGCCGG
    CCGTGGTTCAGCGCGTCCACCCCGCCGCCCCACAG
    CGCCCGCCGCGCCGCGCCCCGCCCCCCGCCGCGCC
    CCGCCCCCCGCCGCGCCCCGCCCCCCGCCGCGCCC
    CGCCCCCCGCCGCGCCCCGCCCCCCGCCGCGCCCC
    GCCCCCCGCCGCGCCCCGCCCCCCGCCGCGCCCCG
    CCCCCCGCCGCGCCCCGCCCCCCGCCGCGCCCCGC
    CCCCCGCCGCGCCCCGCCCCCCGCCGCGCCCCGCC
    CCCCGCCGCGCCCCGCCCCCCGCGCCGCGCCCCGC
    CCCCCGCCGCGCCCCTCCCGCGCCCGCCGCCCCGC
    CCCTCCCGCGCCCGCCGCCGTCGTAGCGCCTAGGC
    GGGGCCTCTCGTAGCGACGCCCCCAGGGGGGCAGA
    ACGCGTGCGCACGCGCGTGCGAACGCGCGTGCGCG
    GGCGGTGTTTGAGGCCGGC
    Figure US20230142867A1-20230511-P00285
    ATGAATCCCGCGAGTGATGGCGGCACTTCAGAGAG
    TATCTTTGACTTGGACTATGCATCATGGGGGATAC
    GCTCAACTCTTATGGTTGCAGGGTTCGTATTTTAT
    CTGGGGGTTTTTGTTGTCTGTCACCAACTTTCCAG
    CAGTCTCAACGCCACTTATAGATCCCTTGTGGCTA
    GGGAGAAAGTCTTTTGGGACTTGGCCGCAACCCGA
    GCCGTTTTTGGTGTCCAGTCCACGGCGGCTGGACT
    GTGGGCCCTCCTTGGTGATCCTGTTCTGCACGCTG
    ATAAAGCCAGAGGCCAACAGAACTGGTGTTGGTTC
    CATATAACTACGGCGACTGGCTTCTTTTGTTTTGA
    AAACGTAGCCGTACACCTGTCCAATTTGATCTTCA
    GAACATTCGACCTCTTTCTGGTAATCCACCATCTT
    TTCGCCTTTCTCGGCTTTCTTGGATGTCTTGTCAA
    TCTCCAAGCCGGTCACTACCTCGCAATGACTACTT
    TGTTGCTCGAAATGTCAACCCCATTTACTTGCGTA
    TCATGGATGCTGCTGAAAGCGGGATGGAGTGAATC
    ACTTTTTTGGAAGTTGAATCAGTGGTTGATGATTC
    ATATGTTTCACTGTCGAATGGTACTGACATATCAT
    ATGTGGTGGGTCTGTTTCTGGCACTGGGACGGGCT
    GGTATCTTCACTTTATCTCCCTCACCTGACCCTTT
    TCCTCGTAGGCTTGGCGCTGCTCACACTGATCATT
    AATCCTTACTGGACCCACAAGAAAACGCAGCAACT
    GCTGAACCCTGTGGATTGGAACTTCGCACAGCCCG
    AGGCCAAGTCACGGCCTGAAGGCAACGGTCAACTG
    TTGAGGAAGAAGAGACCGGACTATAAGGATGATGA
    CGACAAATA
    Figure US20230142867A1-20230511-P00286
    Figure US20230142867A1-20230511-P00287
    Figure US20230142867A1-20230511-P00288
    Figure US20230142867A1-20230511-P00289
    Figure US20230142867A1-20230511-P00290
    Figure US20230142867A1-20230511-P00291
  • The GM2A protein sequence is:
  • (SEQ ID NO: 80)
    MQSLMQAPLLIALGLLLAAPAQAHLKKPSQLSSFSWDNCDEGKDPAVIRS
    LTLEPDPIIVPGNVTLSVMGSTSVPLSSPLKVDLVLEKEVAGLWIKIPCT
    DYIGSCTFEHFCDVLDMLIPTGEPCPEPLRTYGLPCHCPFKEGTYSLPKS
    EFVVPDLELPSWLTTGNYRIESVLSSSGKRLGCIKIAASLKGIDYKDDDD
    K
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for GM2A and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • GM2A-DNA
  • (SEQ ID NO: 81)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGA
    GGAACTTGGTTAGGTACCTTCTGAGGCGGAAAGAA
    CCAGCGTTACATAACTTACGGTAAATGGCCCGCCT
    GGCTGACCGCCCAACGACCCCCGCCCATTGACGTC
    AATAATGACGTATGTTCCCATAGTAACGCCAATAG
    GGACTTTCCATTGACGTCAATGGGTGGAGTATTTA
    CGGTAAACTGCCCACTTGGCAGTACATCAAGTGTA
    TCATATGCCAAGTACGCCCCCTATTGACGTCAATG
    ACGGTAAATGGCCCGCCTGGCATTATGCCCAGTAC
    ATGACCTTATGGGACTTTCCTACTTGGCAGTACAT
    CTACGTATTAGTCATCGCTATTACCATG
    CCCAGATACAGAATCTTCTTGGGTCCAAATACCCC
    CTAGAGGTTTCCCATTGGCCACTTGGTGCTCACCT
    CATGTAAATGAAGTGGTGGCCCGCAATCAGTCTGA
    CTGGTTAATCAGAGGCCGAAGTGAAGTTACAAAGG
    TCATGCTCCTGTGCAAACATCTGATTGGTTGTGGA
    AAGCAACCAATCAGGCTGAAGTGAAGTTACAAAGT
    TGCACTTCTATGCAATTTTAACATATAAACTGAAA
    TGTAGCATAGAAATCGAAATGTACTTTCAATTTCC
    CATCTGCTGCACAGAAAAGGTGGGGGGTTGTAAAG
    AGAGTAGCCTCTGGTCCTCTTGTTACTTAGGTGTG
    GAAAGTTAGGGTTTTCCTTTCGATTTAGTTCTAGG
    AAGTCAGCGTGAAATGGCCTTAGGTTCCCTGCCTG
    CAGACTCTATTCTCCTACCTCAGTTGGGCTAGGTG
    AAAGGGAAGAGTTGATCAGAATGGGAGGATCATTA
    TGTAAAGTATGGAGCTTAGTTAATAATAGTGTACC
    AATGTCATGCCCTCAGATTTAAAAAAAATTTTTTA
    ATTTTTTATTCTTTTGCCACCTCCTTGTCGCTGGC
    AGTGCCCCCCAGTTATGCACAAGCACAAGCTCTGG
    GTCTGCAGTCTGAACCCCCGGCCAGGCCACGGGTG
    TCTTATCAACAATAGAGGAAACGAACAGAACTTTG
    TAGTATTTTTGCAACTTTTCTGTAAACCAAAATAA
    AAAGTTTACTTTAAAAAAGATGATCAGGTGAGTTT
    TAAATTCTACACATGCATTCATGTACTCAACAGAT
    TGTCTATTATGCATCAGGCACTGGGTATAAATGAG
    TGCCTGCATCATGATGCCACCCCTTGTGTAGTTTA
    CAATCCTGTGGTAAATTGGCGGTATTGGTAACAAC
    ATCCCTGTTTTTTCTAAAAGCAAATCTCCAACTAG
    TATATTAACCTGACAAAATCTTGGCTGATGGGGTG
    TCTAAAAAGAAGTCTTGCTCTATTCCTTCCTAGCT
    GTGCGACCTTGGCAGAGCTAACCTCACTGTGTATC
    AGTTTCCTCAGTGCTAAAATTGGGGAAGAAAAAAA
    AAAAGCATTCCCTTGTGCGTCCTTAGCATGCTTTC
    AAGCAGTTACTAAATGTTAGTTTTTATAACGGTTA
    TAATTATTAAAAAGATACGGATTCGTGAAGTCAAG
    AGATCAAGACCGTCCTGGACAACACGGTGAAACCC
    CGTTTCTACTAAAAACACAAAAATTAGCTGGGCGT
    GTGGCGCGTGCCTGTAGTCCCAGCTAGTCGAGAGG
    CTGAGGCGGAAGAATCGCTTGAACCTGGGAGGCAG
    AGGTTGCAGTGAGCCGAGATCGCGCCACTGCACTC
    CAGCTCGGGCGACAGAGCGAGAGTCCGTCTCAAAG
    CAAAAAAAAAAAAAAAAAAGACACACATATGGGGA
    AGCGTAACATGGCAGAGAGTCTGACATTTTTGTCC
    GGAAGGACTTGGGGATGAAAACACTGAATCCGCGG
    TTTCCCAAAGGTCCTTCCAAAGGTGACGGCCCCGG
    CACGAGTTCACAAACCACAGCTCCGCCCCATCCCC
    TGCCCGGCCCCGCCCTTCCCAGGAGCTCTGCTCCG
    CCCCGCCCCCGACTCCGCTCGGCCCCTCCTCGCGC
    CACGCCCCCGCCCCTCCCGCGCGCGCAGCTCCGCC
    CCGCCCTCGTGATCCCGCCCCAACCTCGTGCTAAG
    CTCCCTTCCGTTCCAGCCGCCTTCAGCCCCGCCGC
    GCTCAGTTCCGCCCCTTTCTCTTCCAACTCCGCCT
    ACGCCCCAGAGAGGGAAGGCCTTGACACCTACGCT
    AGAGAGACAGGCTCCAATGAAAAGAGCGAGGGGGC
    GGAGCCGGGAGGAATCGGTCCAATTCTCGGCCGGG
    AAGTCTCTGGCCGGGACGGGGCAGGGCGAACCTGC
    CAGTGACTGGACTCAGCTTCTTTGCGTAACCAATA
    CTGGAAGGCATTTAAAGGCACCTCTGCCGCCACAG
    ACCT
    Figure US20230142867A1-20230511-P00292
    Figure US20230142867A1-20230511-P00293
    ATGCAGTCTCTCATGCAAGCTCCTCTCTTGATCGC
    CCTTGGTCTGCTTTTGGCTGCCCCAGCACAAGCTC
    ACCTGAAAAAACCTTCCCAACTTAGCTCCTTTTCC
    TGGGATAATTGCGATGAAGGAAAGGACCCGGCTGT
    AATCCGAAGTTTGACACTCGAACCTGACCCCATCA
    TCGTCCCGGGCAATGTCACGCTTTCAGTAATGGGT
    AGCACGAGCGTTCCGCTCTCTAGCCCATTGAAAGT
    AGACCTCGTGTTGGAAAAAGAAGTAGCCGGGTTGT
    GGATAAAAATTCCATGCACGGATTATATTGGCTCA
    TGTACCTTTGAACACTTTTGCGATGTTTTGGACAT
    GTTGATCCCCACAGGCGAGCCATGTCCAGAGCCCC
    TTAGGACGTATGGGCTTCCGTGTCACTGCCCATTC
    AAGGAAGGTACATATAGCCTGCCGAAGTCAGAATT
    TGTCGTCCCTGATCTCGAATTGCCTAGTTGGCTGA
    CAACCGGCAACTATCGAATTGAATCTGTACTTTCT
    TCAAGCGGTAAACGGCTCGGCTGTATCAAAATAGC
    CGCCTCCCTGAAGGGGATAGACTACAAGGACGACG
    ACGATAAATA
    Figure US20230142867A1-20230511-P00294
    Figure US20230142867A1-20230511-P00295
    Figure US20230142867A1-20230511-P00296
    Figure US20230142867A1-20230511-P00297
    Figure US20230142867A1-20230511-P00298
    Figure US20230142867A1-20230511-P00299
  • The HYAL1 protein sequence is:
  • (SEQ ID NO: 82)
    MAAHLLPICALFLTLLDMAQGFRGPLLPNRPFTTVWNANTQWCLERHGVD
    VDVSVFDVVANPGQTFRGPDMTIFYSSQLGTYPYYTPTGEPVFGGLPQNA
    SLIAHLARTFQDILAAIPAPDFSGLAVIDWEAWRPRWAFNWDTKDIYRQR
    SRALVQAQHPDWPAPQVEAVAQDQFQGAARAWMAGTLQLGRALRPRGLWG
    FYGFPDCYNYDFLSPNYTGQCPSGIRAQNDQLGWLWGQSRALYPSIYMPA
    VLEGTGKSQMYVQHRVAEAFRVAVAAGDPNLPVLPYVQIFYDTTNHFLPL
    DELEHSLGESAAQGAAGVVLWVSWENTRTKESCQAIKEYMDTTLGPFILN
    VTSGALLCSQALCSGHGRCVRRTSHPKALLLLNPASFSIQLTPGGGPLSL
    RGALSLEDQAQMAVEFKCRCYPGWQAPWCERKSMWDYKDDDDK 
  • The proprietary viral enhancer sequences, native promoter sequences, coding sequence for HYAL1 and the 5′ and 3′ UTR sequences are listed below using the same nomenclature.
  • HYAL1-DNA
  • (SEQ ID NO: 83)
    GCGCAGCACCATGGCCTGAAATAACCTCTGAAAGA
    GGAACTTGGTTAGGTACCTTCTGAGGCGGAAAGAA
    CCAGCGTTACATAACTTACGGTAAATGGCCCGCCT
    GGCTGACCGCCCAACGACCCCCGCCCATTGACGTC
    AATAATGACGTATGTTCCCATAGTAACGCCAATAG
    GGACTTTCCATTGACGTCAATGGGTGGAGTATTTA
    CGGTAAACTGCCCACTTGGCAGTACATCAAGTGTA
    TCATATGCCAAGTACGCCCCCTATTGACGTCAATG
    ACGGTAAATGGCCCGCCTGGCATTATGCCCAGTAC
    ATGACCTTATGGGACTTTCCTACTTGGCAGTACAT
    CTACGTATTAGTCATCGCTATTACCATG
    GGACTTGCTGGTGACACAGCCTCAGGAAGTCCTGA
    GGACATGTGCCCAGGGCGGTTGGGGCACAGCTTGG
    TTTTATAGATTTTAGGGAGACATGAGACATTAATC
    AATATGTAAGAAGTACATTAGTTCCAGAAAGAAAG
    GTGGAGACTGCTCAAATCAAGGCTCCCAGGCTCAA
    AGCACTGGGGGCTTCCAGGTCACAGATAGGTGAGA
    GACAGATGGTTGCATTCTTTTGAGTTTCTGGTAAG
    TCTTTCCAAAGGAGGCAATCAGAATATGCATCTAT
    CTCTGTGAGCAAAAGGATGACTTGAATAGAATGGG
    AGGCAGATTTGTCCTGAGCAGTTCCCAGCTTGAAG
    AGGCCCAAGATACTTTCCTTTCACATTTACCCCAT
    TTTCTTTTTCAAAATCTTTTGGAGAAAGCATTTTG
    CAAGAAAATGAGTATCTGGTCTCAGGTTTCATCTG
    ATCTCTCATTGCTAGATAAGTAGGTCCGGAAAGCT
    CATTTTTAGCAGGTTGTAAAGTCTCATGCAGTGTG
    AAGAGAAAATAGGGAGAAGGAAGGAAGAGAAAAAA
    AAAACAGCAAAAGAACAATCCCAGCCCTGGCGGGG
    TGGCTCATGCCTGTAATCCCAACAATTTGGGAGGC
    TGAGGCGGGTGGATCACCTGCGGTTGGGAGTTCGA
    GAACAGCCTGACCAACATGGAGAAACTCTGTCTCT
    ACTGAAAATACAAAAAACTAGCCAGGCATGGTGGC
    TCATGCCTGTAATCCCAGCTACTCAGGAGGCTGAG
    ACAGGAGAATCACTTGAACCCAGGAGGCGGAGGTT
    GCAGCAACCTGACATTGCGCCATTGCACTCCAGCC
    TGGGCAACAAGAGTGAAACTCCATCTCAAAAAAAA
    AAAAAAAAAAAATCCTGGGAAAATATAGGCCACAT
    TACTCTGAAGTCCATACATTGGTAGGCAGGTATGA
    AAGTGGCTTATGTATGTACATAAACAGGTTACTGT
    TACTTTCTTCTGAAGTGTAAGTTGTCTGACTTTAG
    TTGACACGCTTTTAAGAAACCACAGCTAGGGCCGG
    GCGCAGTGACTAACGCCTGTAATCCCAGCACTTCG
    AAGAGGCCGAGGCGGCCGGATCACAAGGTCAGGGT
    TCGAGACCAGCCTGGTCAATATGGTGAAACCCTGT
    CTCTACTAAAAATACAAAAATTAGCCAGGCATGGT
    AGCAAGCGCCTGTAGTCCCAGCTACTCGGGAGGCT
    GAGGCAGGAGAATTGCTTGAACCTGGGAGGCATAA
    GTTGCAGTGAGCTGAGATCGCGCCACTGCACTCCA
    GCCTGGGTGACAGAGCGAGACTCCATCTCAAAAAA
    AAAAAAAAAAGAAACCACAGTGGCCGATTGCAGTG
    GCTCCCCCTCCCCCTCCCCCTCCCCTCCCCCTCCC
    CCTCCCCCTCTCCCTCCACGGTCTCCCTCTGATGC
    CGAGCGGAAGCTGGACTGTACTGCTGCCATCTCGG
    CTCACTGCAACCTCCCTGCCTGATTCTCCTGCCTC
    AGCCTGCCGAGTGCCTGCGATTGCAGGCACGCGCC
    GCCACGCCTGACTGGTTTTCGTATTTTTTGGGTGG
    AGACGGGGTTTCGCTGTGTTGGCCGGGCTGGTCTC
    CAGCTCCTAACCGTGAGTGATCTGCCAGCCTCGGC
    CTCCCGAGGTGCTGGGATTGCAGACGGAGTCTCTC
    AATGGTGCCCAGGCTGGAGTGCAGTGGCGTGATCT
    CGGCTCGCTACAACATCCACCTCCCAGCAGCCTGC
    CTTGGCCTCCCAAAGTGCCGAGATTGCAGCCTCTG
    CCCGGCCGCCACCCCGTCTGGGAAGTGAGGAGCGT
    CTCTGCCTGGCCGCCCATCGTCTGGGATGTGAGGA
    GCCCCTCTGCCTGGCTGCCCAGTCTGGAAAGTGAG
    GAGCGTCTCTGCCCAGCCGCCATCCCATCTAGGAA
    GTGAGCAGCGCCTCTTCCCGGCCGCCATCCCATCT
    AGGAAGTGCGGAGCCTCTCTGCCCGGCCACCCATC
    GTCT
    Figure US20230142867A1-20230511-P00300
    Figure US20230142867A1-20230511-P00301
    ATGGCAGCGCATCTCCTCCCGATATGTGCACTTTT
    TCTGACTCTTCTTGATATGGCACAAGGATTTCGGG
    GTCCGCTCTTGCCTAATCGACCTTTTACAACAGTA
    TGGAATGCTAACACACAGTGGTGCCTTGAAAGGCA
    CGGTGTCGACGTTGACGTCTCCGTCTTCGACGTAG
    TAGCCAATCCAGGCCAAACCTTCAGGGGGCCAGAT
    CATGACTATCTTCTATAGTTCTCAACTGGGGACTA
    TCCCTACTACACCCCCACGGGTGAACCTGTATTCG
    GCGGGTTGCCACAGAATGCTAGCCTTATCGCGCAC
    CTGGCCAGGACGTTTCAAGACATACTCGCCGCCAT
    TCCAGCACCCGACTTCTCTGGTCTGGCTGTGATCG
    ATTGGGAGGCCTGGCGACCTAGGTGGGCTTTCAAC
    TGGGATACCAAGGACATCTATAGGCAGCGGAGCCG
    AGCATTGGTGCAAGCCCAGCATCCGGATTGGCCAG
    CGCCTCAGGTGGAAGCGGTAGCGCAGGATCAATTC
    CAGGGCGCCGCGAGAGCTTGGATGGCCGGGACATT
    GCAACTTGGGAGAGCCTTGAGGCCTCGCGGCCTGT
    GGGGTTTTTATGGATTCCCAGATTGCTACAACTAT
    GATTTTCTCTCTCCAAATTACACAGGCCAATGTCC
    AAGCGGCATACGCGCACAGAACGATCAGCTTGGCT
    GGCTCTGGGGACAATCTCGAGCCTTGTACCCTAGC
    ATCTATATGCCAGCTGTCTTGGAGGGCACCGGAAA
    ATCTCAGATGTACGTGCAACATAGGGTTGCAGAGG
    CCTTCCGGGTTGCCGTCGCAGCCGGAGATCCGAAT
    CTGCCTGTTCTTCCCTATGTACAGATCTTTTATGA
    TACCACAAATCACTTTTTGCCGTTGGACGAGCTTG
    AACATAGTTTGGGTGAAAGTGCGGCCCAAGGTGCG
    GCGGGAGTAGTTCTTTGGGTTTCCTGGGAAAATAC
    ACGGACTAAAGAGTCATGTCAGGCCATAAAGGAGT
    ACATGGATACCACACTGGGCCCTTTCATTTTGAAT
    GTTACGAGTGGCGCGTTGCTTTGTAGTCAGGCACT
    GTGCAGCGGACACGGCAGGTGTGTGCGGCGGACGT
    CACATCCAAAGGCCCTGCTGCTGCTGAACCCGGCG
    TCATTTTCAATTCAACTTACTCCCGGTGGGGGGCC
    CCTCTCACTTAGGGGCGCGCTCTCCCTGGAAGACC
    AGGCCCAGATGGCAGTAGAGTTTAAGTGCAGATGT
    TACCCAGGGTGGCAGGCGCCTTGGTGCGAGCGCAA
    ATCCATGTGGGATTACAAAGATGATGATGATAAGT
    A
    Figure US20230142867A1-20230511-P00302
    Figure US20230142867A1-20230511-P00303
    Figure US20230142867A1-20230511-P00304
    Figure US20230142867A1-20230511-P00305
    Figure US20230142867A1-20230511-P00306
    Figure US20230142867A1-20230511-P00307
  • In certain embodiment, any of the GOI described herein are codon optimized for manufacturing, optimal expression in a host cell, such as in a human cell. Further Codon optimization of GOI can be done based on codon optimization table publically available, such as the Codon and Codon Pair Usage Tables (CoCoPUTs) maintained at the George Washington University (GWU). The CoCoPUTs is derived from all available GenBank and RefSeq data source.
  • In certain embodiment, the GOI's are codon optimized for expression in neuronal tissues or specific cell types in neuronal tissues, such as neurons, depending on the target cell type the GOI's are to be expressed. This can be done using TissueCoCoPUTs, which are human tissue-specific codon and codon pair usage tables derived from human genomic codon usage information and tissue-specific transcriptomic data. The TissueCoCoPUTs maintained by the GWU website represent 52 human tissues. Transcriptomic data are derived from the Genotype Tissue Expression (GTEx) Portal.
  • 3. Delivery Methods
  • In certain embodiments, the viral vectors (such as AAV) of the invention are delivered to neuronal target cells by intravenous injection or direct intraparenchymal injection or direct intracranial injection. Such delivery means effectively bypass the BBB, and are common means for the administration of gene therapeutics to the brain. In certain embodiments, the neuronal target cells intended to receive gene therapy through intraparenchymal injection are clustered in a focal location, and are this particularly suitable for focal delivery of the viral vector.
  • In certain embodiments, the viral vector (such as AAV) of the invention is delivered locally or compartmentally, such as direct intraocular injection, or injection in the cochlea.
  • In certain embodiments, the viral vectors of the invention are delivered to neuronal target cells by intravenous injection.
  • In certain embodiments, the neuronal target cells intended to receive gene therapy through intravenous injection are globally located in the CNS, or in multiple CNS regions. Such global delivery across the BBB is achieved by using selected AAV serotypes (such as AAV9) that have been shown to deliver genes across the BBB after intravenous injection.
  • In certain embodiments, the viral vector (such as AAV) of the invention is delivered to achieve widespread delivery for diseases of extensive pathology including glioblastoma multiforme (GBM), lysosomal storage diseases (LSDs), Alzheimer's disease, Parkinson's disease and Canavan's disease. In certain embodiments, the viral vectors are delivered intracerebroventricularly, intrathecally, intranasally, or systemicaly (e.g., intravenous and intra-arterial injections).
  • The systemic delivery route for gene delivery vehicles is very promising, and has the potential to be a conduit to the entire CNS (every cell in the brain is a maximum distance of 40 μm from an endothelial cell), provided that the issue of BBB blockage and the filtration systems of the spleen and liver are adequately dealt with.
  • In certain embodiments, the viral vectors (such as AAV) of the invention are delivered to neuronal target cells by intravascular, intrathecal, and/or cerebrospinal fluid (CSF) administration. Many AAV capsids, such as AAV9, AAV2.5, rAAVrh.39, rAAVrh.43, rAAV7, rAAVrh.8 and rAAVrh.10, have remarkable CNS tropism after vascular (Foust et al., Nat Biotechnol. 27:59-65, 2009; Gray et al., Gene Ther. 20:450-459, 2013; Zhang et al., Mol Ther. 19:1440-1448, 2011; Bevan et al., Mol Ther. 19:1971-1980, 2011; Yang et al., Mol Ther. 22:1299-1309, 2014; Duque et al., Mol Ther. 17:1187-1196, 2009), and CSF administration (Samaranch et al., Hum Gene Ther. 24:526-532, 2013; Gray, supra; Samaranch et al., Hum Gene Ther. 23:382-389, 2012).
  • In certain embodiments, the viral vectors (such as AAV) of the invention are delivered to neuronal target cells by direct intracranial injection. A large range of AAV capsids, including pseudotypes rAAV2/1, rAAV2/2, and rAAV2/5, hu.32, hu.37, pi.2, hu.11, rh.8, hu.48R3, bb2, cy5, rh20, rh39, rh43, AAV7, AAV8, and AAV9, have been demonstrated for their CNS gene transfer properties by direct intracranial injection (Burger et al., Mol Ther. 10:302-317, 2004; Cearley et al., Mol Ther. 16:1710-1718, 2008; Cearley and Wolfe, Mol Ther. 13:528-537, 2006; Lawlor et al., Mol Ther. 17:1692-1702, 2009). Vectors based on distinct AAV serotypes or pseudotypes can be chosen for specific applications in the nervous system, depending on their preferential or global tropism within the different regions of the brain and target neuronal tissues.
  • In certain embodiments, the viral vector is AAV9. The vast majority (>90%) of cells in the CNS transduced by intravenously injected AAV9 using strong promoters like CBA are astrocytes and neurons. The distribution of vector is widespread with transduced cells detected in all areas of the brain from the olfactory bulb to cerebellum.
  • In certain embodiments, delivery of the viral vectors of the invention across the BBB is further enhanced by a variety of techniques including: intra-nasal administration and/or intracranial NSC transplantation that bypasses the BBB; transiently permeating the BBB using osmotic manipulation; magnetic resonance imaging guided focused ultrasound (MRIgFUS) to selectively open the BBB and allow therapeutics to pass into the parenchyma; convection enhanced delivery (Debinski and Tatter, Curr Opin Mol Ther. 12:647-653, 2010); introduction into a fluid space, e.g., vasculature, brain ventricles or intrathecal space in the spinal cord, such that cells lining the space produce the deficient protein (Chen et al., Nat Med. 15:1215-1218, 2009); injection into regions of the brain that serve as a nexus of interconnections with retrograde transport of vectors, such as the thalamus, e.g., for AAV (McCurdy et al., Sci Transl Med. 6:231ra48, 2014); and/or by taking advantage of the “bystander effect” whereby proteins released from the producer cells can be taken up by the deficient cells (McCurdy; Chen, Supra).
  • In certain embodiments, the viral vector (such as AAV) of the invention comprises a targeting peptide that bind selectively to the brain endothelial cells or a brain endothelial cell receptor (such as the low-density lipoprotein (LDL) and other related receptors that can be targeted by Apolipoprotein E-derived peptides) to target the viral vector to the brain vasculature.
  • 4. Viral Production
  • Another aspect of the invention provides a method of producing the rAAV vector of the invention, comprising introducing the rAAV vector of the invention into a packaging cell line that constitutively or inducibly provides rep/cap proteins in trans.
  • In certain embodiments, the packaging cell line is a HEK293 cell line, a HeLa cell, or an A549 cell.
  • In general, the subject rAAV vector can be produced using any of many art-recognized approach. In certain embodiments, the rAAV vector is produced based on the helper-virus-free transient transfection method with all cis and trans components (e.g., vector plasmid and packaging plasmids, along with helper genes isolated from adenovirus) in host cells such as 293 cells. In certain embodiments, the rAAV vector is produced using recombinant herpes simplex virus (rHSV)-based AAV production system, which utilizes rHSV vectors to bring the AAV vector and the Rep and Cap genes into the cells. In certain embodiments, the rAAV vector is produced based on baculovirus system which requires simultaneous infection of insect cells with several baculovirus vectors to deliver the rAAV vector cassette and the Rep and Cap genes. In certain embodiments, the rAAV vector is produced based on the AAV producer cell lines derived from HeLa or A549, which stably harbored AAV Rep/cap genes. The AAV vector cassette can either be stably integrated in the host genome or be introduced by an adenovirus that contained the cassette.

Claims (23)

1. A recombinant adeno-associated viral (rAAV) vector, comprising:
(1) a polynucleotide comprising a gene of interest (GOI), under the transcriptional control of the native/endogenous promoter of said GOI;
(2) a heterologous enhancer that enhances transcription of the GOI from the native/endogenous promoter; and,
(3) a pair of inverted terminal repeat (ITR) flanking the polynucleotide.
2. The rAAV vector of claim 1, wherein GOI is a neuronal gene, or is a gene expressed in neuronal tissues (CNS and/or PNS).
3. The rAAV vector of claim 1 or 2, wherein the GOI is defective (e.g., loss-of-function) in a neuronal disease or condition.
4. The rAAV vector of claim 3, wherein the neuronal disease or condition is a CNS disease or condition.
5. The rAAV vector of claim 4, wherein the CNS disease or condition is: Ceroid-Lipofuscinosis, Fucosidosis, Giant Axonal Neuropathy, Globoid cell leukodystrophy, Mucolipidiosis Type IV, Neuronal Ceroid Lipofusinoses, Niemann-Pick Disease, Sandhoff Disease, Sanfilippo syndrome, Tay-Sachs Disease, Hunter Syndrome, Sialidosis, Multiple Sulfatase Deficiency, Childhood Epilepsy, CMT Peripheral Neuropathy, Neuronal Ceroid Lipofusinoses, Aspartylglycosaminuria, Mucopolysaccharidoses, Parkinsons Disease/Symptoms, GM1-Gangliosidosis, Charcot-Marie-Tooth Type 1A, Retts Syndrome, Dannon Disease, Mucopolysaccharidoses (due to GNS, HGSNAT, NAGLU, or ARSB), Sly Syndrome, Biotin basal ganglia disease, Pelizaeus-Merzbacher disease, Neuronal Ceroid Lipofusinoses, Canavan Disease, Beta-Mannosidosis, Wolman Disease, Cystinosis, Alzheimer's Disease/Symptom, Salla Disease, Jansky-Bielschowsky disease, Neuronal ceroid lipofuscinoses, or GM2 gangliosidosis.
6. The rAAV vector of any one of claims 1-5, wherein the GOI is: CLN3, FUCA1, GAN, GALC, MCOLN1, PPT1, SMPD1, HEXB, SGSH, HEXA, IDS, NEU1, SUMF1, GAT1/SLCA1, FIG. 4 , CLNS, AGA, IDUA, GDNF, GLB1, PMP22/MFN2, MECP2, LAMP2, NAGLU, ARSB, GUSB, SLC19A3, PLP1, TPP1/CLN2, ACY2/ASPA, MANBA, LIPA, CTNS, GNS, HGSNAT, NEUROD1, SLC17A5, CLN6, CLN8, or GM2A.
7. The rAAV vector of any one of claims 1-6, wherein the rAAV has a tropism for neuronal cells, or for a cell in neuronal tissue.
8. The rAAV vector of claim 7, wherein the rAAV is AAV9, AAV2.5, rAAVrh.39, rAAVrh.43, rAAV7, rAAVrh.8 or rAAVrh.10, preferably AAV9.
9. The rAAV vector of claim 7, wherein the rAAV comprises AAV9, AAV2.5, rAAVrh.39, rAAVrh.43, rAAV7, rAAVrh.8 and rAAVrh.10 capsids, preferably AAV9 capsids.
10. The rAAV vector of any one of claims 1-9, wherein the polynucleotide further comprises a 5′-UTR coding region, and a 3′-UTR coding region.
11. The rAAV vector of any one of claims 1-10, wherein the heterologous enhancer is a viral enhancer.
12. The rAAV vector of claim 11, wherein the viral enhancer is CMV enhancer.
13. The rAAV vector of claim 11, wherein the viral enhancer is SV-40 enhancer.
14. The rAAV vector of claim 11, wherein the heterologous enhancer comprises a polynucleotide of SEQ ID NO: 1.
15. A pharmaceutical composition comprising the rAAV vector of any one of claims 1-14.
16. The pharmaceutical composition of claim 15, which is formulated for intravenous administration (e.g., to the blood) or intrathecal administration (e.g., to spinal fluid).
17. A cell infected with the rAAV vector of any one of claims 1-14.
18. A recombinant AAV virus comprising the rAAV vector of any one of claims 1-14, wherein the serotype or pseudotype of the recombinant AAV virus is for preferential infection of a neuronal cell or a cell in neuronal tissue.
19. A method of treating a (genetic) disease or disorder in a subject, the method comprising administering the recombinant AAV virus of claim 18 into the subject.
20. The method of claim 19, wherein the genetic disease or disorder is a neuronal disease or disorder, and/or wherein the recombinant AAV virus preferentially infects a neuronal cell or a cell in neuronal tissue of the subject, thereby treating the neuronal disease or disorder.
21. The method of claim 19 or 20, wherein the rAAV virus is administered to the subject intravenously or intrathecally.
22. A method of producing the rAAV vector of any one of claims 1-14, comprising introducing the rAAV vector of any one of claims 1-14 into a packaging cell line that constitutively or inducibly provides rep/cap proteins in trans.
23. The method of claim 22, wherein said rep/cap proteins comprise one or more AAV9 capsid proteins.
US17/920,483 2020-04-23 2021-04-23 Aav native-neuro platform and use for neuronal disease gene therapy Pending US20230142867A1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
KR1020200049578A KR20210131149A (en) 2020-04-23 2020-04-23 Printed circuit board and mehod of manufacturing thereof
KR10-2020-0049578 2020-04-23
PCT/US2021/028803 WO2021216975A1 (en) 2020-04-23 2021-04-23 Aav native-neuro platform and use for neuronal disease gene therapy

Publications (1)

Publication Number Publication Date
US20230142867A1 true US20230142867A1 (en) 2023-05-11

Family

ID=78269827

Family Applications (2)

Application Number Title Priority Date Filing Date
US17/920,483 Pending US20230142867A1 (en) 2020-04-23 2021-04-23 Aav native-neuro platform and use for neuronal disease gene therapy
US17/920,495 Pending US20230171886A1 (en) 2020-04-23 2021-04-23 Circuit board

Family Applications After (1)

Application Number Title Priority Date Filing Date
US17/920,495 Pending US20230171886A1 (en) 2020-04-23 2021-04-23 Circuit board

Country Status (4)

Country Link
US (2) US20230142867A1 (en)
KR (1) KR20210131149A (en)
CN (1) CN115700016A (en)
WO (1) WO2021215879A1 (en)

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH05327163A (en) * 1992-05-20 1993-12-10 Fujitsu Ltd Mounting structure of electronic component
JP4803918B2 (en) * 2001-04-27 2011-10-26 京セラ株式会社 Manufacturing method of multilayer wiring board
JP5592459B2 (en) * 2012-11-07 2014-09-17 日本特殊陶業株式会社 Wiring board manufacturing method
CN105210460B (en) * 2013-05-22 2019-01-11 三菱制纸株式会社 The manufacturing method of circuit board
JP6092752B2 (en) * 2013-10-30 2017-03-08 京セラ株式会社 Wiring board

Also Published As

Publication number Publication date
US20230171886A1 (en) 2023-06-01
CN115700016A (en) 2023-02-03
KR20210131149A (en) 2021-11-02
WO2021215879A1 (en) 2021-10-28

Similar Documents

Publication Publication Date Title
US20220049272A1 (en) Gene therapy for juvenile batten disease
RU2588667C2 (en) Vectors and sequences for treating diseases
Ruzo et al. Correction of pathological accumulation of glycosaminoglycans in central nervous system and peripheral tissues of MPSIIIA mice through systemic AAV9 gene transfer
US20240009327A1 (en) Gene therapy for mucopolysaccharidosis iiib
US20220193261A1 (en) Compositions useful for treatment of pompe disease
JP2021508241A5 (en)
Pan et al. An engineered galactosylceramidase construct improves AAV gene therapy for krabbe disease in twitcher mice
Snyder et al. Viral vector-mediated gene transfer for CNS disease
US20230142867A1 (en) Aav native-neuro platform and use for neuronal disease gene therapy
US20220204950A1 (en) A new type of enzyme composition
US20210095314A1 (en) Bicistronic aav vectors encoding hexosaminidase alpha and beta-subunits and uses thereof
WO2021216975A1 (en) Aav native-neuro platform and use for neuronal disease gene therapy
US20230174994A1 (en) Engineered parkin and uses thereof
CA3177954A1 (en) Compositions useful for treatment of pompe disease
Wirth et al. Gene technology based therapies in the brain
WO2023061499A1 (en) Recombinant adeno-associated viral vectors for treating spinal muscular atrophy
US20230374483A1 (en) Modified hexosaminidase and uses thereof
US20220389450A1 (en) Vector system
US20230104357A1 (en) Combination of Retromer Pharmacological Chaperones and Exogenous Retromer for the Treatment of Alzheimer's Disease and Other Neurodegenerative Diseases and Disorders
EP4341284A1 (en) Granulin/epithelin modules and combinations thereof to treat neurodegenerative disease
JP2021520204A (en) Gene therapy for oxidative stress
JP2024506860A (en) Compositions and methods for treating Niemann-Pick disease type A
RU2021101685A (en) OPTIMIZED CLN5 GENES AND CARTRIDGES FOR THEIR EXPRESSION AND THEIR APPLICATIONS

Legal Events

Date Code Title Description
STPP Information on status: patent application and granting procedure in general

Free format text: APPLICATION UNDERGOING PREEXAM PROCESSING

AS Assignment

Owner name: AAV GENE THERAPEUTICS, INC., MASSACHUSETTS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:RAMU, SENTHIL;REEL/FRAME:063862/0817

Effective date: 20220216

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION