KR20230009444A

KR20230009444A - Gene therapy by dysferin double vector

Info

Publication number: KR20230009444A
Application number: KR1020227042999A
Authority: KR
Inventors: 루이즈 로디노-클라팍
Original assignee: 더 리서치 인스티튜트 앳 네이션와이드 칠드런스 하스피탈
Priority date: 2020-05-13
Filing date: 2021-05-12
Publication date: 2023-01-17
Also published as: AU2023206111A1; IL297656A; MX2022014066A; JP2023519762A; US20230279065A1; CA3172664A1; JP2024032967A; WO2021231577A1; AU2021270526A1; EP4138999A4; CL2022003099A1; BR112022020387A2; EP4138999A1; CN115605266A; CO2022016968A2; NZ793468A; TW202208407A; AR123826A1; AU2021270526B2

Abstract

본원에는 인간 디스페린 단백질(human dysferlin protein)의 단편을 암호화하는 재조합 폴리뉴클레오티드가 기재된다. 또한, 이러한 재조합 폴리뉴클레오티드를 포함하는 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 세포, 및 조성물이 추가로 기재된다. 이러한 재조합 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 세포, 및 조성물은 디스페린병증을 치료하는 데 사용될 수 있다.Described herein are recombinant polynucleotides encoding fragments of the human dysferlin protein. Also further described are plasmids, viral vectors, dual vector systems, cells, and compositions comprising such recombinant polynucleotides. Such recombinant polynucleotides, plasmids, viral vectors, dual vector systems, cells, and compositions can be used to treat dysferinopathy.

Description

Gene therapy by dysferin double vector

본원은 2020년 5월 13일자로 출원된 미국 가출원 제63/024,338호의 35 U.S.C.§119(e) 하에서 우선권을 주장하며, 그 내용은 본원에 참조로 포함된다.This application claims priority under 35 U.S.C. §119(e) of US Provisional Application Serial No. 63/024,338, filed May 13, 2020, the contents of which are incorporated herein by reference.

전자적으로 제출된 자료의 참조에 의한 포함Inclusion by reference of electronically submitted material

본원은 본 개시의 별도의 부분으로서, 그 전체가 참조로 포함되고 파일명: 106887_8070_SL.txt; 크기: 129,123 바이트; 생성일: 2021년 5월 11일과 같이 식별되는 컴퓨터 판독가능한 형태의 서열 목록을 포함한다.This application is a separate part of this disclosure, which is incorporated by reference in its entirety and has the following filename: 106887_8070_SL.txt; Size: 129,123 bytes; Creation Date: May 11, 2021 Contains a listing of sequences in computer readable form identified as such.

본 개시내용은 디스페린 결핍, 예를 들어 사지 근이영양증(limb girdle muscular dystrophy) 유형 2B, 미요시 근병증(miyoshi myopathy), 및 원위 전방 구획 근병증(distal anterior compartment myopathy)을 갖는 대상체를 치료하기 위한, 인간 디스페린(dysferlin) 유전자 및 플라스미드의 단편을 포함하는 폴리뉴클레오티드, 이러한 폴리뉴클레오티드를 포함하는 바이러스 벡터, 세포, 및 조성물, 및 상기 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 및 조성물을 사용하는 방법을 제공한다.The present disclosure provides human dystrophy for treating subjects with dysferrin deficiency, e.g., limb girdle muscular dystrophy type 2B, miyoshi myopathy, and distal anterior compartment myopathy. Provided are polynucleotides comprising fragments of the dysferlin gene and plasmids, viral vectors, cells, and compositions comprising such polynucleotides, and methods of using the polynucleotides, plasmids, viral vectors, and compositions.

디스페르리노병증(dysferlinopathies)은 사지 근이영양증 유형 2B(limb girdle muscular dystrophy type 2B, LGMD2B), 미요시 근병증, 및 원위 전방 구획 근병증(집합적으로 디스페르리노병증으로 알려짐)을 포함한 상염색체 열성 장애이다. 사지 근이영양증 유형 2B(LGMD2B)는 1/100,000-1/200,000의 발생률의 전세계적인 보고와 함께 미국에서 가장 흔한 LGMD 중의 하나를 나타낸다. 미요시 근병증(Miyoshi Myopathy)은 디스페리노병증(dysferlinopathy)의 더 한정된 원위 하부 말단 형태이다. 사실상, 질병 스펙트럼을 고려할 때, LGMD2B는 종종 비복근의 위축으로 원위에서 시작되고, 그 후 근위 근육에 영향을 미치도록 경시적으로 확산된다. 디스페린의 손실은 만성 근섬유 손실, 염증, 지방 대체 및 섬유증을 동반하는 점진적인 형태의 이영양증을 초래하고, 이들은 모두 근육 약화를 악화시킨다. Dysferlinopathies are autosomal recessive disorders including limb girdle muscular dystrophy type 2B (LGMD2B), Miyoshi myopathy, and distal anterior compartment myopathy (collectively known as dysferlinopathies). Limb muscular dystrophy type 2B (LGMD2B) represents one of the most common LGMD in the United States, with worldwide reports of an incidence of 1/100,000-1/200,000. Miyoshi Myopathy is a more limited distal lower extremity form of dysferlinopathy. Indeed, given the disease spectrum, LGMD2B often begins distally with gastrocnemius atrophy and then spreads over time to affect the proximal muscles. Loss of dysferin results in a progressive form of dystrophy with chronic muscle fiber loss, inflammation, fat replacement and fibrosis, all of which exacerbate muscle weakness.

디스페린 유전자는 크며, 지금까지 확인된 55개의 엑손(exons)은 적어도 150 kb의 게놈 DNA에 걸쳐 있다. 이들 엑손은 대략 6.5 kb의 cDNA 및 2,088개 아미노산의 단백질을 예측한다. 디스페린은 C-말단 소수성 막관통 도메인 및 다수의 C2 도메인을 갖는 보다 긴 세포질 배향 친수성 영역으로 구성된 237 kDa의 단백질이다. 성장하는 신체의 연구는 디스페린의 손실이 골격근에서 Ca² ⁺-의존성 막 복구를 손상시킨다는 것을 보여주었다[문헌 「Song 등, Proc. Natl. Acad. Sci USA 98: 4084-4088, 2001」; 「Schnepp 등, J. Virol. 77:3495-3504, 2003」 참조]. 또한, 디스페린은 아넥신 A1 및 A2, AHNAK, 및 카베올린-3을 포함하는 막 복구에 관여하는 다른 단백질과 상호작용하는 것으로 나타났다. 이러한 시스템의 중요성은 골격근이 기계적으로 활동적이고 손상되기 쉬운 것을 고려할 때 강조되며; 따라서, 강건한 막 재밀봉 메커니즘이 존재해야 한다. 디스페린의 부재 또는 돌연변이는 손상된 막 복구 및 근섬유 괴사로 시작하여 근섬유 손실 및 진행성 사지 약화를 초래하는 일련의 사건으로 이어진다. 근섬유 재생 능력의 손실은 디스페린 결핍의 기여 결과로서 생각된다. 디스페린은 또한 소포 트래피킹(vesicle trafficking) 및 엔도시토시스(endocytosis), T관(T tubule) 형성 등과 관련되어 있다.The dysferrin gene is large, with 55 exons identified so far spanning at least 150 kb of genomic DNA. These exons predict a cDNA of approximately 6.5 kb and a protein of 2,088 amino acids. Dysferin is a 237 kDa protein composed of a C-terminal hydrophobic transmembrane domain and a longer cytoplasmically oriented hydrophilic region with multiple C2 domains. Studies in the growing body have shown that loss of ^{dysperin impairs Ca 2+} ^-dependent membrane repair in skeletal muscle [Song et al., Proc. Natl. Acad. Sci USA 98: 4084-4088, 2001; "Schnepp et al., J. Virol. 77:3495-3504, 2003]. Dysferin has also been shown to interact with other proteins involved in membrane repair, including annexins A1 and A2, AHNAK, and caveolin-3. The importance of these systems is emphasized given that skeletal muscle is mechanically active and prone to damage; Thus, a robust membrane resealing mechanism must exist. Absence or mutation of dysferin leads to a cascade of events that begins with damaged membrane repair and myofibrillar necrosis, leading to myofibrillar loss and progressive limb weakness. Loss of muscle fiber regeneration capacity is thought to be a contributing result of dysferrin deficiency. Dysferin is also involved in vesicle trafficking, endocytosis, and T tubule formation.

디스페린 유전자의 돌연변이는 사지 근육 이영양증 유형 2B(LGMD2B), 미요시 근병증 및 원위 전방 구획 근병증(집합적으로 디스페르리노병증으로 알려짐)을 포함하는 대립유전자 상염색체 열성 장애를 유발한다[예를 들어 문헌 「Grose 등, PloS one 7:e39233,2012」; 「Bansal 등, Nature 423, 168-172,2003, Moore」; 「Moore, SA, 등, J Neuropathol Exp. Neurol 65: 995-1003, 2006」; 「Rosales 등, Muscle Nerv 42:14-21, 2010」; 「Sondergaard 등, Anns of Clin Trans Neurol 2:256-270, 2015」; 「Evesson 등, J Biol Chem 285: 28529-28539, 2010」; 및 「Klinge 등, Soc Exp Biol 21: 1768-1776, 2007」 등을 참조하며, 이들 각각은 그 전문이 참조로 포함됨]. 디스페린 결핍증의 덜 흔한 표현형은 경추 증후군을 나타낸다[문헌 「Klinge 등, Muscle Nerve 41: 166-173, 2010」을 참조하며, 이는 그 전문이 참조로 포함됨]. 전형적으로, 환자는 그의 20대 초기에 서서히 진행성 쇠약 및 높은 혈청 크레아틴 키나제(creatine kinase; CK)를 나타낸다. 환자의 대략 1/3은 발병 15년 이내에 휠체어-의존성으로 된다. 임상적으로 심장은 보호되고 인지 기능은 영향을 받지 않는다. 근육 약화의 비교적 제한된 분포를 갖는 표현형 변이체는 이러한 장애에 대한 삶의 질에 크게 영향을 미칠 수 있는 잠재적 부위 유전자 대체 요법에 대한 단계를 설정한다[문헌 「Grose 등, PLoS One 7:e39233, 2012」, 「Barton 등, Muscle Nerve 42:22-29, 2010」 참조]. 단일 뉴클레오티드 변화는 전형적인 DYSF 유전자 돌연변이이며, 이는 또한 면역거부반응으로부터 전이유전자 생성물을 보호하는 역할을 하는 유전자 전달에서의 성공을 선호한다[문헌 「Rodine-Klapac 등, Mol Ther 18: 109-117, 2010」, 「Mendell 등, N Eng J Med 363: 1429-1437, 2010」, 「Mendell 등, Ann Neurol 66:290-297, 2009」을 참조하며, 이들 각각은 그 전문이 참조로 포함됨).Mutations in the dysferin gene cause allelic autosomal recessive disorders including limb muscular dystrophy type 2B (LGMD2B), Miyoshi myopathy and distal anterior compartment myopathy (collectively known as dysperrinopathies) [see, e.g., "Grose et al., PloS one 7:e39233,2012"; "Bansal et al., Nature 423, 168-172, 2003, Moore"; "Moore, SA, et al., J Neuropathol Exp. Neurol 65: 995-1003, 2006; "Rosales et al., Muscle Nerv 42:14-21, 2010"; "Sondergaard et al., Anns of Clin Trans Neurol 2:256-270, 2015"; "Evesson et al., J Biol Chem 285: 28529-28539, 2010"; and "Klinge et al., Soc Exp Biol 21: 1768-1776, 2007", each of which is incorporated by reference in its entirety]. A less common phenotype of dysferin deficiency is cervical syndrome (see Klinge et al., Muscle Nerve 41: 166-173, 2010, incorporated by reference in its entirety). Typically, patients present with slowly progressive decline and high serum creatine kinase (CK) in their early twenties. Approximately one-third of patients become wheelchair-dependent within 15 years of onset. Clinically, the heart is protected and cognitive function is unaffected. A phenotypic variant with a relatively limited distribution of muscle weakness sets the stage for potential site gene replacement therapy that could significantly impact quality of life for this disorder [Grose et al., PLoS One 7:e39233, 2012] , "Barton et al., Muscle Nerve 42:22-29, 2010"]. Single nucleotide changes are typical DYSF gene mutations, which favor success in gene transfer, which also serves to protect the transgene product from immune rejection [Rodine-Klapac et al., Mol Ther 18: 109-117, 2010 ”, “Mendell et al., N Eng J Med 363: 1429-1437, 2010”, “Mendell et al., Ann Neurol 66:290-297, 2009”, each of which is incorporated by reference in its entirety).

디스페르리노병증에 대한 치유 또는 치료법은 존재하지 않는다. 전체적으로, 유전자 대체 또는 대용 유전자 대체를 평가한 전임상 연구는 다수의 전략이 막 복구를 회복시키는 데 일부 효능을 나타낸다는 것을 보여주었다. 디스페린 유전자는 6.5 kb에서 이의 회합된 cDNA와 함께 150 kb의 게놈 DNA를 포괄하는 55개의 엑손을 포함한다. 그러나, 유전자 대체를 위해, AAV의 패키징 한계는 4.7 kb이며, 이는 6.5 kb에서 디스페린의 cDNA 서열 미만이다. 따라서, 필요로 하는 대상체에게 기능성 전체-길이 디스페린 단백질을 전달할 수 있는 새로운 메커니즘을 필요로 하는 LGMD2B에 대한 치료법이 요구된다.There is no cure or cure for dysperrinopathies. Overall, preclinical studies evaluating gene replacement or surrogate gene replacement have shown that a number of strategies show some efficacy in restoring membrane repair. The dysferrin gene contains 55 exons covering 150 kb of genomic DNA with its associated cDNA at 6.5 kb. However, for gene replacement, AAV's packaging limit is 4.7 kb, which is less than the cDNA sequence of dysferrin at 6.5 kb. Therefore, there is a need for treatments for LGMD2B that require new mechanisms to deliver functional full-length dysferrin protein to subjects in need thereof.

인간 디스페린(human dysferlin, hDYSF) 단백질의 단편을 암호화하는 재조합 폴리뉴클레오티드가 본원에 개시되며, 여기서 상기 재조합 폴리뉴클레오티드는 제 1 뉴클레오티드 서열을 포함하되, 상기 제 1 뉴클레오티드 서열은: (a) 서열번호 1, 6, 또는 18의 뉴클레오티드 서열; (b) 서열번호 1, 6, 또는 18의 그들 각각의 전체 길이(full length)에 걸쳐 서열번호 1, 서열번호 6, 또는 서열번호 18의 뉴클레오티드 서열에 대해 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (c) 서열번호 13 또는 15의 뉴클레오티드 서열; (d) 서열번호 13 또는 15의 그들 각각의 전체 길이에 걸쳐 서열번호 13 또는 15의 뉴클레오티드 서열에 대해 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (e) hDYSF 단백질의 단편을 암호화하는 뉴클레오티드 서열, 상기 단편은 서열번호 9의 아미노산 서열로 구성됨; 또는 (f) 상기 (e)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (e)의 뉴클레오티드 서열에 대해 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열;로 구성된다.Disclosed herein is a recombinant polynucleotide encoding a fragment of a human dysferlin (hDYSF) protein, wherein the recombinant polynucleotide comprises a first nucleotide sequence, wherein the first nucleotide sequence comprises: (a) SEQ ID NO: a nucleotide sequence of 1, 6, or 18; (b) at least 90%, 91%, 92% of the nucleotide sequence of SEQ ID NO: 1, SEQ ID NO: 6, or SEQ ID NO: 18 over their respective full length of SEQ ID NO: 1, 6, or 18; 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences; (c) the nucleotide sequence of SEQ ID NO: 13 or 15; (d) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the nucleotide sequence of SEQ ID NO: 13 or 15 over their respective full length of SEQ ID NO: 13 or 15; , 98%, or 99% identical nucleotide sequences; (e) a nucleotide sequence encoding a fragment of the hDYSF protein, the fragment consisting of the amino acid sequence of SEQ ID NO: 9; or (f) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the nucleotide sequence of (e) over the entire length of the nucleotide sequence of (e), 98%, or 99% identical nucleotide sequences;

일부 구현예에서, 재조합 폴리뉴클레오티드는 역전된 말단 반복부(inverted terminal repeat; ITR), 프로모터(promoter), 인트론(intron), 선택 마커(selection marker), 또는 복제기점(origin of replication; ORI)으로부터 선택된 하나 이상의 추가 뉴클레오티드 서열을 더 포함한다.In some embodiments, the recombinant polynucleotide is derived from an inverted terminal repeat (ITR), promoter, intron, selection marker, or origin of replication (ORI). It further comprises one or more additional nucleotide sequences selected.

일부 구현예에서, 재조합 폴리뉴클레오티드는 ITR을 포함하는 추가의 뉴클레오티드 서열을 더 포함한다. 일부 구현예에서, ITR은 AAV ITR이다. 일부 구현예에서, AAV ITR은 AAV2 ITR 또는 AAV3 ITR이다. 일부 구현예에서, 재조합 폴리뉴클레오티드는 2개의 ITR을 포함한다. 일부 구현예에서, ITR은 서열번호 3 또는 서열번호 17의 뉴클레오티드 서열을 포함한다.In some embodiments, the recombinant polynucleotide further comprises an additional nucleotide sequence comprising an ITR. In some embodiments, the ITR is an AAV ITR. In some embodiments, the AAV ITR is an AAV2 ITR or AAV3 ITR. In some embodiments, a recombinant polynucleotide comprises two ITRs. In some embodiments, the ITR comprises the nucleotide sequence of SEQ ID NO: 3 or SEQ ID NO: 17.

일부 구현예에서, 재조합 폴리뉴클레오티드는 프로모터를 포함하는 추가의 뉴클레오티드 서열을 더 포함한다. 일부 구현예에서, 프로모터는 근육-특이적 프로모터이다. 일부 구현예에서, 근육-특이적 프로모터는 인간 골격 액틴 유전자 요소, 심장 액틴 유전자 요소, 데스민 프로모터, 골격 알파-액틴(ASKA) 프로모터, 트로포닌 I(TNNI2) 프로모터, 근세포-특이적 인핸서 결합 인자 mef 결합 요소, 근육 크레아틴 키나제(MCK) 프로모터, 절단된 MCK(tMCK) 프로모터, 미오신 중쇄(MHC) 프로모터, 혼성 a-미오신 중쇄 인핸서-/MCK 인핸서-프로모터(MHCK7) 프로모터, C5-12 프로모터, 뮤린 크레아틴 키나제 인핸서 요소, 골격 빠른-트위치(twitch, 연축) 트로포닌 c 유전자 요소, 느린-트위치 심장 트로포닌 c 유전자 요소(slow-twitch cardiac troponin c gene element), 느린-트위치 트로포닌 i 유전자 요소, 저산소증-유도성 핵 인자(hypoxia-inducible nuclear factor)로부터 선택된다. 일부 구현예에서, 근육-특이적 프로모터는 MHCK7 프로모터이다. 일부 구현예에서, 프로모터는 재조합 프로모터이다. 일부 구현예에서, 재조합 프로모터는 재조합 근육-특이적 프로모터이다. 일부 구현예에서, 재조합 근육-특이적 프로모터는 재조합 미오신 중쇄-크레아틴 키나제 근육-특이적 프로모터이다. 일부 구현예에서, 프로모터는 서열번호 4의 뉴클레오티드 서열을 포함한다.In some embodiments, the recombinant polynucleotide further comprises an additional nucleotide sequence comprising a promoter. In some embodiments, the promoter is a muscle-specific promoter. In some embodiments, the muscle-specific promoter is a human skeletal actin gene element, a cardiac actin gene element, a desmin promoter, a skeletal alpha-actin (ASKA) promoter, a troponin I (TNNI2) promoter, a myocyte-specific enhancer binding factor mef binding element, muscle creatine kinase (MCK) promoter, truncated MCK (tMCK) promoter, myosin heavy chain (MHC) promoter, hybrid a-myosin heavy chain enhancer-/MCK enhancer-promoter (MHCK7) promoter, C5-12 promoter, murine Creatine kinase enhancer element, skeletal fast-twitch troponin c gene element, slow-twitch cardiac troponin c gene element, slow-twitch troponin i gene urea, hypoxia-inducible nuclear factor. In some embodiments, the muscle-specific promoter is the MHCK7 promoter. In some embodiments, the promoter is a recombinant promoter. In some embodiments, the recombinant promoter is a recombinant muscle-specific promoter. In some embodiments, the recombinant muscle-specific promoter is a recombinant myosin heavy chain-creatine kinase muscle-specific promoter. In some embodiments, the promoter comprises the nucleotide sequence of SEQ ID NO:4.

일부 구현예에서, 재조합 폴리뉴클레오티드는 인트론을 포함하는 추가의 뉴클레오티드 서열을 더 포함한다. 일부 구현예에서, 인트론은 5' 공여체 부위, 분지점, 및/또는 3' 스플라이스 부위를 포함한다. 일부 구현예에서, 인트론은 키메라 인트론이다. 일부 구현예에서, 인트론은 인간 β-글로빈 유전자로부터의 5' 공여체 부위를 포함한다. 일부 구현예에서, 인트론은 면역글로불린 G(IgG) 중쇄로부터의 분지점을 포함한다. 일부 구현예에서, 인트론은 면역글로불린 G(IgG) 중쇄로부터의 3' 스플라이스 수용체 부위를 포함한다. 일부 구현예에서, 인트론은 서열번호 5의 뉴클레오티드 서열을 포함한다.In some embodiments, the recombinant polynucleotide further comprises additional nucleotide sequences comprising introns. In some embodiments, an intron comprises a 5' donor site, branch point, and/or 3' splice site. In some embodiments, an intron is a chimeric intron. In some embodiments, the intron comprises a 5' donor region from the human β-globin gene. In some embodiments, an intron comprises a branch point from an immunoglobulin G (IgG) heavy chain. In some embodiments, an intron comprises a 3' splice acceptor site from an immunoglobulin G (IgG) heavy chain. In some embodiments, an intron comprises the nucleotide sequence of SEQ ID NO:5.

일부 구현예에서, 재조합 폴리뉴클레오티드는 선택 마커를 포함하는 추가의 뉴클레오티드 서열을 더 포함한다. 일부 구현예에서, 선택 마커는 항생제 내성 유전자(antibiotic resistance gene)이다. 일부 구현예에서, 항생제 내성 유전자는 β-락타마제 유전자 또는 카나마이신 내성 유전자(kanamycin resistance gene)이다. 일부 구현예에서, 재조합 폴리뉴클레오티드는 서열번호 6 또는 서열번호 18의 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 재조합 뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the recombinant polynucleotide further comprises an additional nucleotide sequence comprising a selectable marker. In some embodiments, the selectable marker is an antibiotic resistance gene. In some embodiments, the antibiotic resistance gene is a β-lactamase gene or a kanamycin resistance gene. In some embodiments, the recombinant polynucleotide comprises the nucleotide sequence of SEQ ID NO:6 or SEQ ID NO:18. In some embodiments, the recombinant nucleotide further does not include a second polynucleotide sequence encoding a second fragment of hDYSF protein.

일부 구현예에서, 재조합 뉴클레오티드는 하나 이상의 ITR 이외의 AAV 서열을 포함하지 않는다.In some embodiments, the recombinant nucleotides do not contain AAV sequences other than one or more ITRs.

일부 구현예에서, 재조합 뉴클레오티드는 하나 이상의 ITR 이외의 바이러스 서열을 포함하지 않는다.In some embodiments, the recombinant nucleotides do not contain viral sequences other than one or more ITRs.

인간 디스페린 단백질의 단편을 암호화하는 재조합 폴리뉴클레오티드 서열이 본원에 개시되며, 여기서 상기 재조합 폴리뉴클레오티드는 제 2 뉴클레오티드 서열을 포함하되, 상기 제 2 뉴클레오티드 서열은: (a) 서열번호 2, 8, 또는 19의 뉴클레오티드 서열; (b) 서열번호 2, 8, 또는 19의 그들 각각의 전체 길이에 걸쳐 서열번호 2, 8, 또는 19의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (c) 서열번호 14 또는 16의 뉴클레오티드 서열; (d) 서열번호 14 또는 16의 그들 각각의 전체 길이에 걸쳐 서열번호 14 또는 16의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (e) hDYSF 단백질의 단편을 암호화하는 폴리뉴클레오티드 서열, 상기 hDYSF 단백질의 단편은 서열번호 10의 아미노산 서열로 구성됨; 또는 (f) 상기 (d)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (e)의 폴리뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 폴리뉴클레오티드 서열;로 구성된다.Disclosed herein is a recombinant polynucleotide sequence encoding a fragment of human dysferrin protein, wherein the recombinant polynucleotide comprises a second nucleotide sequence, wherein the second nucleotide sequence comprises: (a) SEQ ID NO: 2, 8, or nucleotide sequence of 19; (b) at least 90%, 91%, 92%, 93%, 94%, 95% of the nucleotide sequence of SEQ ID NO: 2, 8, or 19 over the entire length of each of SEQ ID NO: 2, 8, or 19; 96%, 97%, 98%, or 99% identical nucleotide sequences; (c) the nucleotide sequence of SEQ ID NO: 14 or 16; (d) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the nucleotide sequence of SEQ ID NO: 14 or 16 over their respective full length of SEQ ID NO: 14 or 16; 98%, or 99% identical nucleotide sequences; (e) a polynucleotide sequence encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 10; or (f) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the polynucleotide sequence of (e) over the entire length of the nucleotide sequence of (d); 98%, or 99% identical polynucleotide sequences;

일부 구현예에서, 재조합 폴리뉴클레오티드는 역전된 말단 반복부(ITR), 선택 마커, 복제기점(ORI), 비번역 영역(untranslated region; UTR), 또는 폴리아데닐화(polyadenylation; 폴리A) 신호를 포함하는 하나 이상의 추가 뉴클레오티드 서열을 더 포함한다.In some embodiments, the recombinant polynucleotide comprises an inverted terminal repeat (ITR), selectable marker, origin of replication (ORI), untranslated region (UTR), or polyadenylation (polyA) signal. It further includes one or more additional nucleotide sequences that

일부 구현예에서, 재조합 폴리뉴클레오티드는 ITR을 포함하는 추가의 뉴클레오티드 서열을 더 포함한다. 일부 구현예에서, ITR은 AAV ITR이다. 일부 구현예에서, AAV ITR은 AAV2 ITR 또는 AAV3 ITR이다. 일부 구현예에서, 재조합 뉴클레오티드는 2개의 ITR을 포함한다. 일부 구현예에서, ITR은 서열번호 3 또는 17의 뉴클레오티드 서열을 포함한다.In some embodiments, the recombinant polynucleotide further comprises an additional nucleotide sequence comprising an ITR. In some embodiments, the ITR is an AAV ITR. In some embodiments, the AAV ITR is an AAV2 ITR or AAV3 ITR. In some embodiments, a recombinant nucleotide comprises two ITRs. In some embodiments, the ITR comprises the nucleotide sequence of SEQ ID NO: 3 or 17.

일부 구현예에서, 재조합 폴리뉴클레오티드는 폴리A 신호를 포함하는 뉴클레오티드 서열을 더 포함한다. 일부 구현예에서, 폴리A 신호는 인공 폴리A 신호이다. 일부 구현예에서, 폴리A 신호는 서열번호 7의 뉴클레오티드 서열을 포함한다.In some embodiments, the recombinant polynucleotide further comprises a nucleotide sequence comprising a polyA signal. In some embodiments, the polyA signal is an artificial polyA signal. In some embodiments, the polyA signal comprises the nucleotide sequence of SEQ ID NO:7.

일부 구현예에서, 재조합 폴리뉴클레오티드는 선택 마커를 포함하는 추가의 뉴클레오티드 서열을 더 포함한다. 일부 구현예에서, 선택 마커는 항생제 내성 유전자이다. 일부 구현예에서, 항생제 내성 유전자는 β-락타마제 유전자 또는 카나마이신 내성 유전자이다. 일부 구현예에서, 재조합 폴리뉴클레오티드는 서열번호 8 또는 서열번호 19의 뉴클레오티드 서열을 포함한다.In some embodiments, the recombinant polynucleotide further comprises an additional nucleotide sequence comprising a selectable marker. In some embodiments, the selectable marker is an antibiotic resistance gene. In some embodiments, the antibiotic resistance gene is a β-lactamase gene or a kanamycin resistance gene. In some embodiments, the recombinant polynucleotide comprises the nucleotide sequence of SEQ ID NO: 8 or SEQ ID NO: 19.

일부 구현예에서, 재조합 뉴클레오티드는 hDYSF 단백질의 제 1 단편을 암호화하는 제 1 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the recombinant nucleotide further does not comprise a first polynucleotide sequence encoding a first fragment of hDYSF protein.

본원에서는 추가로, 이중 아데노-연관 바이러스(adeno-associated viral, AAV) 벡터 시스템이 개시되며, 상기 벡터 시스템은:Further disclosed herein is a dual adeno-associated viral (AAV) vector system comprising:

(a) 제 1 AAV 벡터, 여기서 상기 제 1 AAV 벡터는 인간 디스페린(hDYSF) 단백질의 N-말단 단편을 암호화하는 제 1 재조합 폴리뉴클레오티드를 포함하되, 상기 제 1 재조합 폴리뉴클레오티드는 제 1 뉴클레오티드 서열을 포함하고, 상기 제 1 뉴클레오티드 서열은: (i) 서열번호 1, 6, 또는 18의 뉴클레오티드 서열; (ii) 서열번호 1, 6, 또는 18의 그들 각각의 전체 길이에 걸쳐 서열번호 1, 서열번호 6, 또는 서열번호 18의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 13 또는 15의 뉴클레오티드 서열; (iv) 서열번호 13 또는 15의 그들 각각의 전체 길이에 걸쳐 서열번호 13 또는 15의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 뉴클레오티드 서열, 상기 단편은 서열번호 9의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열;로 구성됨; 및 (a) a first AAV vector, wherein the first AAV vector comprises a first recombinant polynucleotide encoding an N-terminal fragment of human dysferrin (hDYSF) protein, wherein the first recombinant polynucleotide comprises a first nucleotide sequence wherein the first nucleotide sequence comprises: (i) the nucleotide sequence of SEQ ID NO: 1, 6, or 18; (ii) at least 90%, 91%, 92%, 93%, 94% of the nucleotide sequence of SEQ ID NO: 1, SEQ ID NO: 6, or SEQ ID NO: 18 over their respective full length of SEQ ID NO: 1, 6, or 18; , 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 13 or 15; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the nucleotide sequence of SEQ ID NO: 13 or 15 over their respective full length of SEQ ID NO: 13 or 15; 98%, or 99% identical nucleotide sequences; (v) a nucleotide sequence encoding a fragment of the hDYSF protein, said fragment consisting of the amino acid sequence of SEQ ID NO: 9; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of (v) over the entire length of the nucleotide sequence of (v). %, or 99% identical nucleotide sequences; and

(b) 제 2 AAV 벡터, 여기서 상기 제 2 AAV 벡터는 인간 디스페린 단백질의 C-말단 단편을 암호화하는 제 2 재조합 폴리뉴클레오티드를 포함하되, 상기 제 2 재조합 폴리뉴클레오티드는 제 2 뉴클레오티드 서열을 포함하고, 상기 제 2 뉴클레오티드 서열은: (i) 서열번호 2, 8, 또는 19의 뉴클레오티드 서열; (ii) 서열번호 2, 8, 또는 19의 그들 각각의 전체 길이에 걸쳐 서열번호 2, 8, 또는 19의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 14 또는 16의 뉴클레오티드 서열; (iv) 서열번호 14 또는 16의 그들 각각의 전체 길이에 걸쳐 서열번호 14 또는 16의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 폴리뉴클레오티드 서열, 상기 hDYSF 단백질의 단편은 서열번호 10의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 폴리뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 폴리뉴클레오티드 서열;로 구성됨;(b) a second AAV vector, wherein the second AAV vector comprises a second recombinant polynucleotide encoding a C-terminal fragment of human dysferrin protein, wherein the second recombinant polynucleotide comprises a second nucleotide sequence; , wherein the second nucleotide sequence is: (i) the nucleotide sequence of SEQ ID NO: 2, 8, or 19; (ii) at least 90%, 91%, 92%, 93%, 94%, 95% of the nucleotide sequence of SEQ ID NO: 2, 8, or 19 over the entire length of each of SEQ ID NO: 2, 8, or 19; 96%, 97%, 98%, or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 14 or 16; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the nucleotide sequence of SEQ ID NO: 14 or 16 over their respective full length of SEQ ID NO: 14 or 16; 98%, or 99% identical nucleotide sequences; (v) a polynucleotide sequence encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 10; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the polynucleotide sequence of (v) over the entire length of the nucleotide sequence of (v); 98%, or 99% identical polynucleotide sequences;

을 포함한다.includes

본원에서 추가로, 본원에 개시된 임의의 재조합 폴리뉴클레오티드를 포함하는 아데노-연관 바이러스(adeno-associated viral, AAV) 벡터가 개시된다. 일부 구현예에서, 재조합 폴리뉴클레오티드는 인간 디스페린 단백질의 N-말단 단편을 암호화한다. 일부 구현예에서, 재조합 폴리뉴클레오티드는 인간 디스페린 단백질의 C-말단 단편을 암호화한다. 일부 구현예에서, AAV 벡터는 AAV-1, AAV-2, AAV-3, AAV-4, AAV-5, AAV-6, AAV-7, AAV-8, AAV-9, AAV-10, AAV-11, AAV-12, AAV-13, AAVrh10, AAVrh20, 또는 AAVrh74이다. 일부 구현예에서, AAV 벡터는 AAVrh74이다.Further disclosed herein are adeno-associated viral (AAV) vectors comprising any of the recombinant polynucleotides disclosed herein. In some embodiments, the recombinant polynucleotide encodes an N-terminal fragment of human dysferrin protein. In some embodiments, the recombinant polynucleotide encodes a C-terminal fragment of human dysferrin protein. In some embodiments, the AAV vector is AAV-1, AAV-2, AAV-3, AAV-4, AAV-5, AAV-6, AAV-7, AAV-8, AAV-9, AAV-10, AAV- 11, AAV-12, AAV-13, AAVrh10, AAVrh20, or AAVrh74. In some embodiments, the AAV vector is AAVrh74.

본원에서는 추가로, 본원에 개시된 임의의 AAV 벡터를 포함하는 조성물이 개시된다.Further disclosed herein are compositions comprising any of the AAV vectors disclosed herein.

본원에서는 추가로, 조성물이 개시되며, 상기 조성물은:Further disclosed herein is a composition comprising:

(a) 제 1 재조합 아데노-연관 바이러스(rAAV) 벡터, 여기서 상기 제 1 rAAV 벡터는 인간 디스페린(hDYSF) 단백질의 N-말단 단편을 암호화하는 제 1 재조합 폴리뉴클레오티드를 포함하되, 상기 제 1 재조합 폴리뉴클레오티드는 제 1 뉴클레오티드 서열을 포함하고, 상기 제 1 뉴클레오티드 서열은: (i) 서열번호 1, 6, 또는 18의 뉴클레오티드 서열; (ii) 서열번호 1, 6, 또는 18의 그들 각각의 전체 길이에 걸쳐 서열번호 1, 서열번호 6, 또는 서열번호 18의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 13 또는 15의 뉴클레오티드 서열; (iv) 서열번호 13 또는 15의 그들 각각의 전체 길이에 걸쳐 서열번호 13 또는 15의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 뉴클레오티드 서열, 상기 단편은 서열번호 9의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열;로 구성됨; 및 (a) a first recombinant adeno-associated virus (rAAV) vector, wherein the first rAAV vector comprises a first recombinant polynucleotide encoding an N-terminal fragment of human dysferrin (hDYSF) protein, wherein the first recombinant The polynucleotide comprises a first nucleotide sequence comprising: (i) the nucleotide sequence of SEQ ID NO: 1, 6, or 18; (ii) at least 90%, 91%, 92%, 93%, 94% of the nucleotide sequence of SEQ ID NO: 1, SEQ ID NO: 6, or SEQ ID NO: 18 over their respective full length of SEQ ID NO: 1, 6, or 18; , 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 13 or 15; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the nucleotide sequence of SEQ ID NO: 13 or 15 over their respective full length of SEQ ID NO: 13 or 15; 98%, or 99% identical nucleotide sequences; (v) a nucleotide sequence encoding a fragment of the hDYSF protein, said fragment consisting of the amino acid sequence of SEQ ID NO: 9; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of (v) over the entire length of the nucleotide sequence of (v). %, or 99% identical nucleotide sequences; and

(b) 제 2 rAAV 벡터, 여기서 상기 제 2 rAAV 벡터는 인간 디스페린 단백질의 C-말단 단편을 암호화하는 제 2 재조합 폴리뉴클레오티드를 포함하되, 상기 제 2 재조합 폴리뉴클레오티드는 제 2 뉴클레오티드 서열을 포함하고, 상기 제 2 뉴클레오티드 서열은: (i) 서열번호 2, 8, 또는 19의 뉴클레오티드 서열; (ii) 서열번호 2, 8, 또는 19의 그들 각각의 전체 길이에 걸쳐 서열번호 2, 8, 또는 19의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 14 또는 16의 뉴클레오티드 서열; (iv) 서열번호 14 또는 16의 그들 각각의 전체 길이에 걸쳐 서열번호 14 또는 16의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 폴리뉴클레오티드 서열, 상기 hDYSF 단백질의 단편은 서열번호 10의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 폴리뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 폴리뉴클레오티드 서열;로 구성됨;(b) a second rAAV vector, wherein the second rAAV vector comprises a second recombinant polynucleotide encoding a C-terminal fragment of human dysferrin protein, wherein the second recombinant polynucleotide comprises a second nucleotide sequence; , wherein the second nucleotide sequence is: (i) the nucleotide sequence of SEQ ID NO: 2, 8, or 19; (ii) at least 90%, 91%, 92%, 93%, 94%, 95% of the nucleotide sequence of SEQ ID NO: 2, 8, or 19 over the entire length of each of SEQ ID NO: 2, 8, or 19; 96%, 97%, 98%, or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 14 or 16; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the nucleotide sequence of SEQ ID NO: 14 or 16 over their respective full length of SEQ ID NO: 14 or 16; 98%, or 99% identical nucleotide sequences; (v) a polynucleotide sequence encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 10; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the polynucleotide sequence of (v) over the entire length of the nucleotide sequence of (v); 98%, or 99% identical polynucleotide sequences;

를 포함한다.includes

일부 구현예에서, 제 1 및 제 2 rAAV 벡터의 몰비는 약 100:1-100, 약 10:1-1:10, 약 2:1-1:2, 또는 약 1:1이다.In some embodiments, the molar ratio of the first and second rAAV vectors is about 100:1-100, about 10:1-1:10, about 2:1-1:2, or about 1:1.

본원에서는 추가로, 아데노-연관 바이러스(Adeno-associated virus, AAV) 벡터가 개시되며, 상기 AAV 벡터는:Further disclosed herein is an Adeno-associated virus (AAV) vector, wherein the AAV vector:

(a) 제 1 역전된 말단 반복부(inverted terminal repeat; ITR); (a) a first inverted terminal repeat (ITR);

(b) 인간 디스페린(hDYSF) 단백질의 단편을 암호화하는 폴리뉴클레오티드, 여기서 상기 폴리뉴클레오티드는, (i) 서열번호 1 또는 6의 뉴클레오티드 서열; (ii) 서열번호 1 또는 6의 전체 길이에 걸쳐 서열번호 1 또는 6의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 13 또는 15의 뉴클레오티드 서열; (iv) 서열번호 13 또는 15의 전체 길이에 걸쳐 서열번호 13 또는 15의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 뉴클레오티드 서열, 여기서 상기 hDYSF 단백질의 단편은 서열번호 9의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열;로 구성됨; 및 (b) a polynucleotide encoding a fragment of human dysferrin (hDYSF) protein, wherein the polynucleotide comprises (i) a nucleotide sequence of SEQ ID NO: 1 or 6; (ii) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 1 or 6 over the entire length of SEQ ID NO: 1 or 6; or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 13 or 15; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 13 or 15 over the entire length of SEQ ID NO: 13 or 15; or 99% identical nucleotide sequences; (v) a nucleotide sequence encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 9; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of (v) over the entire length of the nucleotide sequence of (v). %, or 99% identical nucleotide sequences; and

(c) 제 2 ITR;(c) a second ITR;

을 포함하되, 상기 폴리뉴클레오티드는 상기 제 1 및 제 2 ITR에 의해 플랭킹된다(flanked)(또는 '옆에 배치된다').Including, wherein the polynucleotide is flanked (or 'next to') by the first and second ITRs.

일부 구현예에서, ITR은 AAV ITR이다. 일부 구현예에서, AAV ITR은 AAV2 ITR 또는 AAV3 ITR이다. 일부 구현예에서, 제 1 및/또는 제 2 ITR은 서열번호 3 또는 17의 뉴클레오티드 서열을 포함한다.In some embodiments, the ITR is an AAV ITR. In some embodiments, the AAV ITR is an AAV2 ITR or AAV3 ITR. In some embodiments, the first and/or second ITR comprises the nucleotide sequence of SEQ ID NO: 3 or 17.

일부 구현예에서, AAV 벡터는 프로모터, 인트론, 선택 마커, 또는 복제기점(ORI)을 포함하는 하나 이상의 추가의 폴리뉴클레오티드 서열을 더 포함한다.In some embodiments, the AAV vector further comprises one or more additional polynucleotide sequences comprising a promoter, intron, selectable marker, or origin of replication (ORI).

일부 구현예에서, AAV 벡터는 프로모터를 포함하는 추가의 폴리뉴클레오티드 서열을 더 포함한다. 일부 구현예에서, 프로모터는 근육-특이적 프로모터이다. 일부 구현예에서, 근육-특이적 프로모터는 미오신 중쇄 복합체-E 박스 근육 크레아틴 키나제 융합 인핸서/프로모터이다. 일부 구현예에서, 프로모터는 재조합 프로모터이다. 일부 구현예에서, 재조합 프로모터는 재조합 근육-특이적 프로모터이다. 일부 구현예에서, 재조합 근육-특이적 프로모터는 MHCK7 프로모터이다. 일부 구현예에서, 프로모터는 서열번호 4의 뉴클레오티드 서열을 포함한다.In some embodiments, the AAV vector further comprises an additional polynucleotide sequence comprising a promoter. In some embodiments, the promoter is a muscle-specific promoter. In some embodiments, the muscle-specific promoter is a myosin heavy chain complex-E box muscle creatine kinase fusion enhancer/promoter. In some embodiments, the promoter is a recombinant promoter. In some embodiments, the recombinant promoter is a recombinant muscle-specific promoter. In some embodiments, the recombinant muscle-specific promoter is the MHCK7 promoter. In some embodiments, the promoter comprises the nucleotide sequence of SEQ ID NO:4.

일부 구현예에서, AAV 벡터는 인트론을 포함하는 추가의 폴리뉴클레오티드 서열을 더 포함한다. 일부 구현예에서, 인트론은 5' 공여체 부위(donor site), 분지점, 및/또는 3' 스플라이스 부위(splice site)를 포함한다. 일부 구현예에서, 인트론은 키메라 인트론이다. 일부 구현예에서, 인트론은 인간 β-글로빈 유전자로부터의 5' 공여체 부위를 포함한다. 일부 구현예에서, 인트론은 면역글로불린 G(IgG) 중쇄로부터의 분지점을 포함한다. 일부 구현예에서, 인트론은 면역글로불린 G(IgG) 중쇄로부터의 3' 스플라이스 수용체 부위를 포함한다. 일부 구현예에서, 인트론은 서열번호 5의 뉴클레오티드 서열을 포함한다.In some embodiments, the AAV vector further comprises additional polynucleotide sequences comprising introns. In some embodiments, an intron comprises a 5' donor site, branch point, and/or 3' splice site. In some embodiments, an intron is a chimeric intron. In some embodiments, the intron comprises a 5' donor region from the human β-globin gene. In some embodiments, an intron comprises a branch point from an immunoglobulin G (IgG) heavy chain. In some embodiments, an intron comprises a 3' splice acceptor site from an immunoglobulin G (IgG) heavy chain. In some embodiments, an intron comprises the nucleotide sequence of SEQ ID NO:5.

일부 구현예에서, AAV 벡터는 선택 마커를 포함하는 추가의 폴리뉴클레오티드 서열을 더 포함한다. 일부 구현예에서, 선택 마커는 항생제 내성 유전자이다. 일부 구현예에서, 항생제 내성 유전자는 β-락타마제 유전자 또는 카나마이신 내성 유전자이다. 일부 구현예에서, AAV 벡터는 서열번호 6 또는 서열번호 15의 뉴클레오티드 서열을 포함한다.In some embodiments, the AAV vector further comprises an additional polynucleotide sequence comprising a selectable marker. In some embodiments, the selectable marker is an antibiotic resistance gene. In some embodiments, the antibiotic resistance gene is a β-lactamase gene or a kanamycin resistance gene. In some embodiments, the AAV vector comprises the nucleotide sequence of SEQ ID NO:6 or SEQ ID NO:15.

일부 구현예에서, AAV 벡터는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the AAV vector does not further comprise a second polynucleotide sequence encoding a second fragment of hDYSF protein.

일부 구현예에서, AAV 벡터는 하나 이상의 ITR 이외의 AAV 서열을 포함하지 않는다.In some embodiments, the AAV vector does not contain AAV sequences other than one or more ITRs.

일부 구현예에서, AAV 벡터는 하나 이상의 ITR 이외의 바이러스 서열을 포함하지 않는다.In some embodiments, the AAV vector does not contain viral sequences other than one or more ITRs.

본원에서는 추가로, 아데노-연관 바이러스(AAV) 벡터가 개시되며, 상기 아데노-연관 바이러스(AAV) 벡터는:Further disclosed herein is an adeno-associated viral (AAV) vector, wherein the adeno-associated viral (AAV) vector:

(a) 제 1 역전된 말단 반복부(ITR); (a) a first inverted terminal repeat (ITR);

(b) 인간 디스페린 단백질의 단편을 암호화하는 폴리뉴클레오티드, 여기서 상기 폴리뉴클레오티드 서열은: (i) 서열번호 2 또는 8의 뉴클레오티드 서열; (ii) 서열번호 2 또는 8의 전체 길이에 걸쳐 서열번호 2 또는 8의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 14 또는 16의 뉴클레오티드 서열; (iv) 서열번호 14 또는 16의 전체 길이에 걸쳐 서열번호 14 또는 16의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 폴리뉴클레오티드, 여기서 상기 hDYSF 단백질의 단편은 서열번호 10의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 폴리뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 폴리뉴클레오티드;로 구성됨; 및 (b) a polynucleotide encoding a fragment of human dysferrin protein, wherein the polynucleotide sequence comprises: (i) the nucleotide sequence of SEQ ID NO: 2 or 8; (ii) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 2 or 8 over the entire length of SEQ ID NO: 2 or 8; or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 14 or 16; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 14 or 16 over the entire length of SEQ ID NO: 14 or 16; or 99% identical nucleotide sequences; (v) a polynucleotide encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 10; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the polynucleotide sequence of (v) over the entire length of the nucleotide sequence of (v); 98%, or 99% identical polynucleotides; and

(c) 제 2 ITR;(c) a second ITR;

을 포함하되, 상기 폴리뉴클레오티드는 상기 제 1 및 제 2 ITR에 의해 플랭킹된다.Including, wherein the polynucleotide is flanked by the first and second ITRs.

일부 구현예에서, AAV 벡터는 선택 마커, 복제기점(ORI), 비번역 영역(UTR), 또는 폴리아데닐화(폴리A) 신호를 포함하는 하나 이상의 폴리뉴클레오티드 서열을 더 포함한다.In some embodiments, the AAV vector further comprises one or more polynucleotide sequences comprising a selectable marker, origin of replication (ORI), untranslated region (UTR), or polyadenylation (polyA) signal.

일부 구현예에서, ITR은 AAV ITR이다. 일부 구현예에서, AAV ITR은 AAV2 ITR 또는 AAV3 ITR이다. 일부 구현예에서, ITR은 서열번호 3 또는 17의 뉴클레오티드 서열을 포함한다.In some embodiments, the ITR is an AAV ITR. In some embodiments, the AAV ITR is an AAV2 ITR or AAV3 ITR. In some embodiments, the ITR comprises the nucleotide sequence of SEQ ID NO: 3 or 17.

일부 구현예에서, AAV 벡터는 폴리A 신호를 포함하는 추가의 폴리뉴클레오티드 서열을 더 포함한다. 일부 구현예에서, 폴리A 신호는 인공 폴리A 신호이다. 일부 구현예에서, 폴리A 신호는 서열번호 7의 뉴클레오티드 서열을 포함한다.In some embodiments, the AAV vector further comprises an additional polynucleotide sequence comprising a polyA signal. In some embodiments, the polyA signal is an artificial polyA signal. In some embodiments, the polyA signal comprises the nucleotide sequence of SEQ ID NO:7.

일부 구현예에서, AAV 벡터는 선택 마커를 포함하는 추가의 폴리뉴클레오티드 서열을 더 포함한다. 일부 구현예에서, 선택 마커는 항생제 내성 유전자이다. 일부 구현예에서, 항생제 내성 유전자는 β-락타마제 유전자 또는 카나마이신 내성 유전자이다.In some embodiments, the AAV vector further comprises an additional polynucleotide sequence comprising a selectable marker. In some embodiments, the selectable marker is an antibiotic resistance gene. In some embodiments, the antibiotic resistance gene is a β-lactamase gene or a kanamycin resistance gene.

일부 구현예에서, AAV 벡터는 서열번호 8 또는 서열번호 16의 뉴클레오티드 서열을 포함한다.In some embodiments, the AAV vector comprises the nucleotide sequence of SEQ ID NO: 8 or SEQ ID NO: 16.

일부 구현예에서, AAV 벡터는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다. In some embodiments, the AAV vector does not further comprise a second polynucleotide sequence encoding a second fragment of hDYSF protein.

본원에서는, 이중 아데노-연관 바이러스(AAV) 벡터 시스템이 개시되며, 상기 이중 AAV 벡터 시스템은:Disclosed herein is a dual adeno-associated virus (AAV) vector system, wherein the dual AAV vector system:

(I) 제 1 AAV 벡터, 여기서 상기 제 1 AAV 벡터는, (I) a first AAV vector, wherein the first AAV vector comprises:

(a) 제 1 역전된 말단 반복부(ITR)를 포함하는 제 1 AAV 벡터; (a) a first AAV vector comprising a first inverted terminal repeat (ITR);

(b) 인간 디스페린(hDYSF) 단백질의 단편을 암호화하는 폴리뉴클레오티드, 여기서 상기 폴리뉴클레오티드는, (i) 서열번호 1 또는 6의 뉴클레오티드 서열; (ii) 서열번호 1 또는 6의 전체 길이에 걸쳐 서열번호 1 또는 6의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 13 또는 15의 뉴클레오티드 서열; (iv) 서열번호 13 또는 15의 전체 길이에 걸쳐 서열번호 13 또는 15의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 뉴클레오티드 서열, 여기서 상기 hDYSF 단백질의 단편은 서열번호 9의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열;을 포함하는 제 1 AAV 벡터임; 및 (b) a polynucleotide encoding a fragment of human dysferrin (hDYSF) protein, wherein the polynucleotide comprises (i) a nucleotide sequence of SEQ ID NO: 1 or 6; (ii) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 1 or 6 over the entire length of SEQ ID NO: 1 or 6; or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 13 or 15; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 13 or 15 over the entire length of SEQ ID NO: 13 or 15; or 99% identical nucleotide sequences; (v) a nucleotide sequence encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 9; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of (v) over the entire length of the nucleotide sequence of (v). %, or 99% identical nucleotide sequences; and

(c) 제 2 ITR;(c) a second ITR;

를 포함하되, 상기 폴리뉴클레오티드는 상기 제 1 ITR 및 제 2 ITR에 의해 플랭킹됨; 및 Including, wherein the polynucleotide is flanked by the first ITR and the second ITR; and

(II) 제 2 AAV 벡터, 여기서 상기 제 2 AAV 벡터는, (II) a second AAV vector, wherein the second AAV vector,

(a) 제 3 역전된 말단 반복부(ITR)를 포함하는 제 2 AAV 벡터; (a) a second AAV vector comprising a third inverted terminal repeat (ITR);

(b) 인간 디스페린 단백질의 단편을 암호화하는 폴리뉴클레오티드, 여기서 상기 폴리뉴클레오티드는, (i) 서열번호 2 또는 8의 뉴클레오티드 서열; (ii) 서열번호 2 또는 8의 전체 길이에 걸쳐 서열번호 2 또는 8의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 14 또는 16의 뉴클레오티드 서열; (iv) 서열번호 14 또는 16의 전체 길이에 걸쳐 서열번호 14 또는 16의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 폴리뉴클레오티드, 여기서 상기 hDYSF 단백질의 단편은 서열번호 10의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 폴리뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 폴리뉴클레오티드임;를 포함하는 제 2 AAV 벡터임; 및 (b) a polynucleotide encoding a fragment of human dysferrin protein, wherein the polynucleotide comprises: (i) a nucleotide sequence of SEQ ID NO: 2 or 8; (ii) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 2 or 8 over the entire length of SEQ ID NO: 2 or 8; or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 14 or 16; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 14 or 16 over the entire length of SEQ ID NO: 14 or 16; or 99% identical nucleotide sequences; (v) a polynucleotide encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 10; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the polynucleotide sequence of (v) over the entire length of the nucleotide sequence of (v); are 98%, or 99% identical polynucleotides; and

(c) 제 4 ITR;(c) a fourth ITR;

를 포함하되, 상기 폴리뉴클레오티드는 제 3 ITR 및 제 4 ITR에 의해 플랭킹됨;wherein the polynucleotide is flanked by a third ITR and a fourth ITR;

을 포함한다.includes

본원에서는 추가로, 아데노-연관 바이러스(AAV) 패키징 시스템이 개시되며, 상기 시스템은: (a) 본원에 개시된 재조합 폴리뉴클레오티드를 포함하는 플라스미드; (b) 아데노바이러스 헬퍼 플라스미드; 및 (c) rep-cap 플라스미드;를 포함한다. 일부 경우에, 아데노바이러스 헬퍼 플라스미드는 pHELP 플라스미드를 포함한다.Further disclosed herein is an adeno-associated virus (AAV) packaging system comprising: (a) a plasmid comprising a recombinant polynucleotide disclosed herein; (b) an adenovirus helper plasmid; and (c) rep-cap plasmid; In some cases, the adenoviral helper plasmid includes a pHELP plasmid.

본원에서는 추가로, (a) 재조합 폴리뉴클레오티드를 포함하는 플라스미드; 및 (b) 아데노바이러스 헬퍼 플라스미드를 포함하는 아데노-연관 바이러스 패키징 시스템이 본원에 개시된다. 일부 경우에, 아데노바이러스 헬퍼 플라스미드는 pHELP 플라스미드를 포함한다.Further provided herein are: (a) a plasmid comprising a recombinant polynucleotide; and (b) an adenovirus helper plasmid. In some cases, the adenoviral helper plasmid includes a pHELP plasmid.

본원에서는 추가로, 세포를 AAV 패키징 시스템과 접촉시키는 것을 포함하는, 아데노-연관 바이러스(AAV) 벡터를 생산하는 방법이 개시되며, 상기 AAV 패키징 시스템은, (a) 본원에 개시된 재조합 폴리뉴클레오티드를 포함하는 플라스미드; (b) 아데노바이러스 헬퍼 플라스미드; 및 (c) rep-cap 플라스미드;를 포함한다. 일부 경우에, 세포는 숙주 세포, 임의로 포유동물 숙주 세포, 추가로 임의로 HEK293이다.Further disclosed herein is a method of producing an adeno-associated virus (AAV) vector comprising contacting a cell with an AAV packaging system, wherein the AAV packaging system comprises (a) a recombinant polynucleotide disclosed herein. a plasmid; (b) an adenovirus helper plasmid; and (c) rep-cap plasmid; In some cases, the cell is a host cell, optionally a mammalian host cell, and further optionally HEK293.

본원에서는 추가로, 패키징 세포주를 AAV 패키징 시스템으로 형질도입(transducing)하는 것을 포함하는, 아데노-연관 바이러스(AAV) 벡터를 생산하는 방법이 개시되며, 상기 AAV 패키징 시스템은: (a) 본원에 개시된 재조합 폴리뉴클레오티드 중의 임의의 것을 포함하는 AAV 발현 카세트를 포함하는 플라스미드; 및 (b) 아데노바이러스 헬퍼 플라스미드(adenovirus helper plasmid);를 포함하되, 여기서 상기 패키징 세포주는 아데노-연관 바이러스 rep 및 cap 유전자를 발현한다. 일부 경우에, AAV rep 유전자는 Rep78이다. 일부 경우에, AAV cap 유전자는 Rh74 cap 유전자이다.Further disclosed herein is a method of producing an adeno-associated virus (AAV) vector comprising transducing a packaging cell line with an AAV packaging system, the AAV packaging system comprising: (a) the disclosed herein a plasmid comprising an AAV expression cassette comprising any of the recombinant polynucleotides; and (b) an adenovirus helper plasmid; wherein the packaging cell line expresses adeno-associated virus rep and cap genes. In some cases, the AAV rep gene is Rep78. In some cases, the AAV cap gene is the Rh74 cap gene.

본원에서는 추가로, 본원에 개시된 재조합 폴리뉴클레오티드 중의 임의의 것을 포함하는 세포가 개시된다.Further disclosed herein are cells comprising any of the recombinant polynucleotides disclosed herein.

본원에서는 추가로, 본원에 개시된 재조합 폴리뉴클레오티드 중의 임의의 것을 포함하는 AAV 발현 카세트를 포함하는 세포가 개시된다. 일부 경우에, 플라스미드는 서열번호 18 또는 19와 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 폴리뉴클레오티드를 포함한다. 일부 경우에, 플라스미드는 서열번호 18 또는 19의 폴리뉴클레오티드를 포함한다.Further disclosed herein are cells comprising an AAV expression cassette comprising any of the recombinant polynucleotides disclosed herein. In some cases, the plasmid comprises a polynucleotide that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 18 or 19. In some cases, the plasmid comprises a polynucleotide of SEQ ID NO: 18 or 19.

본원에서는 추가로, 디스페리노병증의 치료 방법이 개시되며, 상기 방법은, 디스페리노병증의 치료를 필요로 하는 대상체에게,Further disclosed herein is a method of treating dysferinopathy, the method comprising:

(a) 인간 디스페린(hDYSF) 단백질의 N-말단을 암호화하는 제 1 폴리뉴클레오티드 서열을 포함하는 유효량의 제 1 재조합 폴리뉴클레오티드, 여기서 상기 제 1 폴리뉴클레오티드 서열은, (i) 서열번호 1, 6, 또는 18의 뉴클레오티드 서열; (ii) 서열번호 1, 6, 또는 18의 그들 각각의 전체 길이에 걸쳐 서열번호 1, 서열번호 6, 또는 서열번호 18의 뉴클레오티드 서열에 대해 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 13 또는 15의 뉴클레오티드 서열; (iv) 서열번호 13 또는 15의 그들 각각의 전체 길이에 걸쳐 서열번호 13 또는 15의 뉴클레오티드 서열에 대해 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 뉴클레오티드 서열, 상기 hDYSF 단백질의 단편은 서열번호 9의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 뉴클레오티드 서열에 대해 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열;로 구성됨; 및 (a) an effective amount of a first recombinant polynucleotide comprising a first polynucleotide sequence encoding the N-terminus of human dysferrin (hDYSF) protein, wherein said first polynucleotide sequence comprises (i) SEQ ID NOs: 1, 6 , or a nucleotide sequence of 18; (ii) at least 90%, 91%, 92%, 93%, 94 to the nucleotide sequence of SEQ ID NO: 1, SEQ ID NO: 6, or SEQ ID NO: 18 over their respective full length of SEQ ID NO: 1, 6, or 18; %, 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 13 or 15; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the nucleotide sequence of SEQ ID NO: 13 or 15 over their respective full length of SEQ ID NO: 13 or 15; , 98%, or 99% identical nucleotide sequences; (v) a nucleotide sequence encoding a fragment of the hDYSF protein, wherein the fragment of the hDYSF protein consists of the amino acid sequence of SEQ ID NO: 9; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the nucleotide sequence of (v) over the entire length of the nucleotide sequence of (v), 98%, or 99% identical nucleotide sequences; and

(b) 인간 디스페린 단백질의 C-말단 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 포함하는 유효량의 제 2 재조합 폴리뉴클레오티드, 여기서 상기 제 2 폴리뉴클레오티드 서열은, (i) 서열번호 2, 8, 또는 19의 뉴클레오티드 서열; (ii) 서열번호 2, 8, 또는 19의 그들 각각의 전체 길이에 걸쳐 서열번호 2, 8, 또는 19의 뉴클레오티드 서열에 대해 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 14 또는 16의 뉴클레오티드 서열; (iv) 서열번호 14 또는 16의 그들 각각의 전체 길이에 걸쳐 서열번호 14 또는 16의 뉴클레오티드 서열에 대해 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 폴리뉴클레오티드 서열, 상기 hDYSF 단백질의 단편은 서열번호 10의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 폴리뉴클레오티드 서열에 대해 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 폴리뉴클레오티드 서열;로 구성됨;(b) an effective amount of a second recombinant polynucleotide comprising a second polynucleotide sequence encoding a C-terminal fragment of human dysferrin protein, wherein the second polynucleotide sequence comprises: (i) SEQ ID NO: 2, 8, or nucleotide sequence of 19; (ii) at least 90%, 91%, 92%, 93%, 94%, 95% of the nucleotide sequence of SEQ ID NO: 2, 8, or 19 over the entire length of each of SEQ ID NO: 2, 8, or 19; , 96%, 97%, 98%, or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 14 or 16; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% relative to the nucleotide sequence of SEQ ID NO: 14 or 16 over their respective full length of SEQ ID NO: 14 or 16; , 98%, or 99% identical nucleotide sequences; (v) a polynucleotide sequence encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 10; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the polynucleotide sequence of (v) over the entire length of the nucleotide sequence of (v). , 98%, or 99% identical polynucleotide sequences;

을 투여하는 것을 포함한다.It includes administering

일부 구현예에서, 제 1 폴리뉴클레오티드는 근육내로 또는 정맥내로 투여된다. 일부 구현예에서, 제 2 폴리뉴클레오티드는 근육내로 또는 정맥내로 투여된다.In some embodiments, the first polynucleotide is administered intramuscularly or intravenously. In some embodiments, the second polynucleotide is administered intramuscularly or intravenously.

일부 구현예에서, 제 1 및 제 2 폴리뉴클레오티드는 동시에 또는 순차적으로 투여된다.In some embodiments, the first and second polynucleotides are administered simultaneously or sequentially.

일부 구현예에서, 디스페리노병증은 사지 근이영양증 유형 2B(LGMD2B) 또는 미요시 근병증이다.In some embodiments, the dysferinopathy is limb muscular dystrophy type 2B (LGMD2B) or Miyoshi myopathy.

본원에서는 추가로, 디스페리노병증의 치료를 필요로 하는 대상체에게, In addition, in the present application, to a subject in need of treatment for dysperinopathies,

(a) 유효량의 제 1 아데노-연관 바이러스(AAV) 벡터, 여기서 상기 제 1 AAV 벡터는 인간 디스페린(hDYSF) 단백질의 N-말단을 암호화하는 제 1 폴리뉴클레오티드를 포함하되, 상기 제 1 폴리뉴클레오티드는, (i) 서열번호 1 또는 6의 뉴클레오티드 서열; (ii) 서열번호 1 또는 6의 전체 길이에 걸쳐 서열번호 1 또는 6의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 13 또는 15의 뉴클레오티드 서열; (iv) 서열번호 13 또는 15의 전체 길이에 걸쳐 서열번호 13 또는 15의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 뉴클레오티드 서열, 상기 hDYSF 단백질의 단편은 서열번호 9의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열;로 구성됨; 및 (a) an effective amount of a first adeno-associated virus (AAV) vector, wherein the first AAV vector comprises a first polynucleotide encoding the N-terminus of a human dysferrin (hDYSF) protein, wherein the first polynucleotide is, (i) the nucleotide sequence of SEQ ID NO: 1 or 6; (ii) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 1 or 6 over the entire length of SEQ ID NO: 1 or 6; or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 13 or 15; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 13 or 15 over the entire length of SEQ ID NO: 13 or 15; or 99% identical nucleotide sequences; (v) a nucleotide sequence encoding a fragment of the hDYSF protein, wherein the fragment of the hDYSF protein consists of the amino acid sequence of SEQ ID NO: 9; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of (v) over the entire length of the nucleotide sequence of (v). %, or 99% identical nucleotide sequences; and

(b) 유효량의 제 2 AAV 벡터, 여기서 상기 제 2 AAV 벡터는 인간 디스페린 단백질의 C-말단 단편을 암호화하는 제 2 폴리뉴클레오티드를 포함하되, 상기 제 2 폴리뉴클레오티드는, (i) 서열번호 2 또는 8의 뉴클레오티드 서열; (ii) 서열번호 2 또는 8의 전체 길이에 걸쳐 서열번호 2 또는 8의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 14 또는 16의 뉴클레오티드 서열; (iv) 서열번호 14 또는 16의 전체 길이에 걸쳐 서열번호 14 또는 16의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 폴리뉴클레오티드, 상기 hDYSF 단백질의 단편은 서열번호 10의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 폴리뉴클레오티드에 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 폴리뉴클레오티드;로 구성됨;(b) an effective amount of a second AAV vector, wherein the second AAV vector comprises a second polynucleotide encoding a C-terminal fragment of human dysferrin protein, wherein the second polynucleotide comprises: (i) SEQ ID NO: 2 or a nucleotide sequence of 8; (ii) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 2 or 8 over the entire length of SEQ ID NO: 2 or 8; or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 14 or 16; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 14 or 16 over the entire length of SEQ ID NO: 14 or 16; or 99% identical nucleotide sequences; (v) a polynucleotide encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 10; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the polynucleotide of (v) over the entire length of the nucleotide sequence of (v). %, or 99% identical polynucleotides;

을 투여하는 것을 포함하는, 디스페리노병증의 치료 방법이 개시된다.A method of treating dysperinopathies, comprising administering a, is disclosed.

일부 구현예에서, 제 1 AAV 벡터는 근육내로 또는 정맥내로 투여된다. 일부 구현예에서, 제 2 AAV 벡터는 근육내로 또는 정맥내로 투여된다. 일부 구현예에서, 제 1 및 제 2 AAV 벡터는 동시에 투여된다.In some embodiments, the first AAV vector is administered intramuscularly or intravenously. In some embodiments, the second AAV vector is administered intramuscularly or intravenously. In some embodiments, the first and second AAV vectors are administered simultaneously.

일부 구현예에서, 본원에서는 디스페리노증의 치료를 필요로 하는 대상체에게 유효량의 본원에 개시된 임의의 AAV 이중 벡터 시스템을 투여하는 것을 포함하는, 디스페리노증을 치료하는 방법이 개시된다.In some embodiments, disclosed herein are methods of treating dysferinosis comprising administering to a subject in need thereof an effective amount of any of the AAV dual vector systems disclosed herein.

일부 구현예에서, AAV 이중 벡터 시스템은 근육내로 또는 정맥내로 투여된다.In some embodiments, the AAV dual vector system is administered intramuscularly or intravenously.

일부 구현예에서, 디스페리노증은 사지 근이영양증 유형 2B(LGMD2B) 또는 미요시 근병증이다.In some embodiments, the dysferinosis is limb muscular dystrophy type 2B (LGMD2B) or Miyoshi myopathy.

일부 구현예에서, 본원에서는 디스페리노증의 치료를 필요로 하는 대상체에게 유효량의 본원에 개시된 임의의 조성물을 투여하는 것을 포함하는, 디스페리노증을 치료하는 방법이 개시된다.In some embodiments, disclosed herein are methods of treating dysferinosis comprising administering to a subject in need thereof an effective amount of any of the compositions disclosed herein.

일부 구현예에서, 조성물은 근육내로 또는 정맥내로 투여된다.In some embodiments, the composition is administered intramuscularly or intravenously.

본원에서는 추가로, 디스페리노병증의 치료를 필요로 하는 대상체에서 디스페리노병증을 치료하기 위한 의약의 제조에서의 조성물의 용도가 개시되어 있고, 여기서 상기 조성물은, Further disclosed herein is the use of the composition in the manufacture of a medicament for treating dysferinopathy in a subject in need thereof, wherein the composition comprises:

(a) 인간 디스페린(hDYSF) 단백질의 N-말단을 암호화하는 제 1 폴리뉴클레오티드 서열을 포함하는 제 1 재조합 폴리뉴클레오티드, 여기서 상기 제 1 폴리뉴클레오티드 서열은: (i) 서열번호 1, 6, 또는 18의 뉴클레오티드 서열; (ii) 서열번호 1, 6, 또는 18의 그들 각각의 전체 길이에 걸쳐 서열번호 1, 서열번호 6, 또는 서열번호 18의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 13 또는 15의 뉴클레오티드 서열; (iv) 서열번호 13 또는 15의 그들 각각의 전체 길이에 걸쳐 서열번호 13 또는 15의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 뉴클레오티드 서열, 여기서 상기 hDYSF 단백질의 단편은 서열번호 9의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열;로 구성됨; 및 (a) a first recombinant polynucleotide comprising a first polynucleotide sequence encoding the N-terminus of a human dysferrin (hDYSF) protein, wherein the first polynucleotide sequence is: (i) SEQ ID NO: 1, 6, or a nucleotide sequence of 18; (ii) at least 90%, 91%, 92%, 93%, 94% of the nucleotide sequence of SEQ ID NO: 1, SEQ ID NO: 6, or SEQ ID NO: 18 over their respective full length of SEQ ID NO: 1, 6, or 18; , 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 13 or 15; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the nucleotide sequence of SEQ ID NO: 13 or 15 over their respective full length of SEQ ID NO: 13 or 15; 98%, or 99% identical nucleotide sequences; (v) a nucleotide sequence encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 9; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of (v) over the entire length of the nucleotide sequence of (v). %, or 99% identical nucleotide sequences; and

(b) 인간 디스페린 단백질의 C-말단 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 포함하는 제 2 재조합 폴리뉴클레오티드, 여기서 상기 제 2 폴리뉴클레오티드 서열은: (i) 서열번호 2, 8, 또는 19의 뉴클레오티드 서열; (ii) 서열번호 2, 8, 또는 19의 그들 각각의 전체 길이에 걸쳐 서열번호 2, 8, 또는 19의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 14 또는 16의 뉴클레오티드 서열; (iv) 서열번호 14 또는 16의 그들 각각의 전체 길이에 걸쳐 서열번호 14 또는 16의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 폴리뉴클레오티드 서열, 여기서 상기 hDYSF 단백질의 단편은 서열번호 10의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 폴리뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 폴리뉴클레오티드 서열;로 구성됨;(b) a second recombinant polynucleotide comprising a second polynucleotide sequence encoding a C-terminal fragment of human dysferrin protein, wherein said second polynucleotide sequence is: (i) of SEQ ID NO: 2, 8, or 19 nucleotide sequence; (ii) at least 90%, 91%, 92%, 93%, 94%, 95% of the nucleotide sequence of SEQ ID NO: 2, 8, or 19 over the entire length of each of SEQ ID NO: 2, 8, or 19; 96%, 97%, 98%, or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 14 or 16; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the nucleotide sequence of SEQ ID NO: 14 or 16 over their respective full length of SEQ ID NO: 14 or 16; 98%, or 99% identical nucleotide sequences; (v) a polynucleotide sequence encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 10; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the polynucleotide sequence of (v) over the entire length of the nucleotide sequence of (v); 98%, or 99% identical polynucleotide sequences;

을 포함한다.includes

일부 구현예에서, 제 1 폴리뉴클레오티드는 근육내로 또는 정맥내로 투여된다. 일부 구현예에서, 제 2 폴리뉴클레오티드는 근육내로 또는 정맥내로 투여된다. 일부 구현예에서, 제 1 및 제 2 폴리뉴클레오티드는 동시에 투여된다.In some embodiments, the first polynucleotide is administered intramuscularly or intravenously. In some embodiments, the second polynucleotide is administered intramuscularly or intravenously. In some embodiments, the first and second polynucleotides are administered simultaneously.

(a) 유효량의 제 1 아데노-연관 바이러스(AAV) 벡터, 여기서 상기 제 1 AAV 벡터는 인간 디스페린(hDYSF) 단백질의 N-말단을 암호화하는 제 1 폴리뉴클레오티드 서열을 포함하되, 상기 제 1 폴리뉴클레오티드는, (i) 서열번호 1 또는 6의 뉴클레오티드 서열; (ii) 서열번호 1 또는 6의 전체 길이에 걸쳐 서열번호 1 또는 6의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 13 또는 15의 뉴클레오티드 서열; (iv) 서열번호 13 또는 15의 전체 길이에 걸쳐 서열번호 13 또는 15의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 뉴클레오티드 서열, 여기서 상기 hDYSF 단백질의 단편은 서열번호 9의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열;로 구성됨; 및(a) an effective amount of a first adeno-associated virus (AAV) vector, wherein the first AAV vector comprises a first polynucleotide sequence encoding the N-terminus of a human dysferrin (hDYSF) protein, wherein the first polynucleotide sequence The nucleotides include (i) the nucleotide sequence of SEQ ID NO: 1 or 6; (ii) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 1 or 6 over the entire length of SEQ ID NO: 1 or 6; or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 13 or 15; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 13 or 15 over the entire length of SEQ ID NO: 13 or 15; or 99% identical nucleotide sequences; (v) a nucleotide sequence encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 9; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of (v) over the entire length of the nucleotide sequence of (v). %, or 99% identical nucleotide sequences; and

(b) 유효량의 제 2 아데노-연관 바이러스(AAV) 벡터, 여기서 상기 제 2 AAV 벡터는 인간 디스페린 단백질의 C-말단 단편을 암호화하는 제 2 폴리뉴클레오티드를 포함하되, 상기 제 2 폴리뉴클레오티드는, (i) 서열번호 2 또는 8의 뉴클레오티드 서열; (ii) 서열번호 2 또는 8의 전체 길이에 걸쳐 서열번호 2 또는 8의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 14 또는 16의 뉴클레오티드 서열; (iv) 서열번호 14 또는 16의 전체 길이에 걸쳐 서열번호 14 또는 16의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 폴리뉴클레오티드, 상기 hDYSF 단백질의 단편은 서열번호 10의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 폴리뉴클레오티드와 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 폴리뉴클레오티드;로 구성됨;(b) an effective amount of a second adeno-associated virus (AAV) vector, wherein the second AAV vector comprises a second polynucleotide encoding a C-terminal fragment of human dysferrin protein, wherein the second polynucleotide comprises: (i) the nucleotide sequence of SEQ ID NO: 2 or 8; (ii) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 2 or 8 over the entire length of SEQ ID NO: 2 or 8; or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 14 or 16; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 14 or 16 over the entire length of SEQ ID NO: 14 or 16; or 99% identical nucleotide sequences; (v) a polynucleotide encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 10; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the polynucleotide of (v) over the entire length of the nucleotide sequence of (v). %, or 99% identical polynucleotides;

을 포함한다.includes

본원에서는 디스페리노병증의 치료를 필요로 하는 대상체에서 디스페리노병증을 치료하기 위한 의약의 제조에서의 조성물의 용도가 개시되며, 여기서 상기 조성물은 본원에 개시된 임의의 AAV 이중 벡터 시스템을 포함한다.Disclosed herein is the use of a composition in the manufacture of a medicament for treating dysferinopathy in a subject in need thereof, wherein the composition comprises any AAV dual vector system disclosed herein. .

본원에서는 디스페리노병증의 치료를 필요로 하는 대상체에서 디스페리노병증을 치료하기 위한 의약의 제조에서의 임의의 개시된 조성물의 용도가 개시된다.Disclosed herein is the use of any disclosed composition in the manufacture of a medicament for the treatment of dysferinopathy in a subject in need thereof.

일부 구현예에서, 제 1 AAV 벡터의 유효량은 정량화 표준으로서 슈퍼코일 DNA 또는 플라스미드를 기준으로 하여 약 1×10⁶ 내지 1×10¹⁶ vg/kg, 약 1×10⁸ 내지 1×10¹⁵ vg/kg, 또는 약 1×10¹⁰ 내지 1×10¹⁴ vg/kg이다.In some embodiments, the effective amount of the first AAV vector is about 1×10 ⁶ to 1×10 ¹⁶ vg/kg, about 1×10 ⁸ to 1×10 ¹⁵ vg/kg, based on supercoil DNA or plasmid as a quantification standard. kg, or about 1×10 ¹⁰ to 1×10 ¹⁴ vg/kg.

일부 구현예에서, 제 2 AAV 벡터의 유효량은 정량화 표준으로서 슈퍼코일 DNA 또는 플라스미드를 기준으로 하여 약 1×10⁶ 내지 1×10¹⁶ vg/kg, 약 1×10⁸ 내지 1×10¹⁵ vg/kg, 또는 약 1×10¹⁰ 내지 1×10¹⁴ vg/kg이다.In some embodiments, the effective amount of the second AAV vector is about 1×10 ⁶ to 1×10 ¹⁶ vg/kg, about 1×10 ⁸ to 1×10 ¹⁵ vg/kg, based on supercoil DNA or plasmid as a quantification standard. kg, or about 1×10 ¹⁰ to 1×10 ¹⁴ vg/kg.

일부 구현예에서, 제 1 AAV 벡터는 적어도 1, 2, 3, 4, 또는 5회 투여된다.In some embodiments, the first AAV vector is administered at least 1, 2, 3, 4, or 5 times.

일부 구현예에서, 제 2 AAV 벡터는 적어도 1, 2, 3, 4, 또는 5회 투여된다.In some embodiments, the second AAV vector is administered at least 1, 2, 3, 4, or 5 times.

일부 구현예에서, AAV 이중 벡터 시스템의 유효량은 약 1×10¹⁰ 내지 1×10¹³ 벡터 게놈(vg), 약 1×10¹¹ 내지 1×10¹³ vg, 1×10¹² 내지 1×10¹³ vg이다.In some embodiments, the effective amount of the AAV dual vector system is about 1×10 ¹⁰ to 1×10 ¹³ vector genome (vg), about 1×10 ¹¹ to 1×10 ¹³ vg, 1×10 ¹² to 1×10 ¹³ vg. to be.

일부 구현예에서, AAV 이중 벡터 시스템은 적어도 1, 2, 3, 4, 또는 5회 투여된다.In some embodiments, the AAV dual vector system is administered at least 1, 2, 3, 4, or 5 times.

일부 구현예에서, 조성물의 유효량은 약 1×10¹⁰ 내지 1×10¹³ 벡터 게놈(vg), 약 1×10¹¹ 내지 1×10¹³ vg, 1×10¹² 내지 1×10¹³ vg이다.In some embodiments, the effective amount of the composition is about 1×10 ¹⁰ to 1×10 ¹³ vector genome (vg), about 1×10 ¹¹ to 1×10 ¹³ vg, 1×10 ¹² to 1×10 ¹³ vg.

일부 구현예에서, 조성물은 적어도 1, 2, 3, 4, 또는 5회 투여된다.In some embodiments, the composition is administered at least 1, 2, 3, 4, or 5 times.

본원에서는 인간 디스페린(hDYSF) 단백질을 암호화하는 재조합 폴리뉴클레오티드가 개시되며, 여기서 상기 재조합 폴리뉴클레오티드 서열은 서열번호 20의 뉴클레오티드 서열 또는 서열번호 20의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 재조합 폴리뉴클레오티드 서열은 서열번호 20의 뉴클레오티드 서열을 포함한다.Disclosed herein is a recombinant polynucleotide encoding a human dysferrin (hDYSF) protein, wherein the recombinant polynucleotide sequence is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences. In some embodiments, the recombinant polynucleotide sequence comprises the nucleotide sequence of SEQ ID NO: 20.

본원에서는 인간 디스페린(hDYSF) 단백질을 암호화하는 재조합 폴리뉴클레오티드를 제조하는 방법이 개시되며, 여기서 상기 재조합 폴리뉴클레오티드 서열은 서열번호 20의 뉴클레오티드 서열 또는 서열번호 20의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열을 포함하고, 상기 방법은, 세포를,Disclosed herein is a method for preparing a recombinant polynucleotide encoding human dysferrin (hDYSF) protein, wherein the recombinant polynucleotide sequence is at least 90%, 91% of the nucleotide sequence of SEQ ID NO: 20 or the nucleotide sequence of SEQ ID NO: 20 , 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences, the method comprising:

인간 디스페린(hDYSF) 단백질의 N-말단을 암호화하는 제 1 폴리뉴클레오티드 서열을 포함하는 재조합 폴리뉴클레오티드, 여기서 상기 제 1 폴리뉴클레오티드 서열은: (i) 서열번호 1, 6, 또는 18의 뉴클레오티드 서열; (ii) 서열번호 1, 6, 또는 18의 그들 각각의 전체 길이에 걸쳐 서열번호 1, 6, 또는 18의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 13 또는 15의 뉴클레오티드 서열; (iv) 서열번호 13 또는 15의 그들 각각의 전체 길이에 걸쳐 서열번호 13 또는 15의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 뉴클레오티드 서열, 여기서 상기 hDYSF 단백질의 단편은 서열번호 9의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열;로 구성됨; 및 A recombinant polynucleotide comprising a first polynucleotide sequence encoding the N-terminus of a human dysferrin (hDYSF) protein, wherein the first polynucleotide sequence comprises: (i) the nucleotide sequence of SEQ ID NO: 1, 6, or 18; (ii) at least 90%, 91%, 92%, 93%, 94%, 95% of the nucleotide sequence of SEQ ID NO: 1, 6, or 18 over the entire length of each of SEQ ID NO: 1, 6, or 18; 96%, 97%, 98%, or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 13 or 15; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the nucleotide sequence of SEQ ID NO: 13 or 15 over their respective full length of SEQ ID NO: 13 or 15; 98%, or 99% identical nucleotide sequences; (v) a nucleotide sequence encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 9; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of (v) over the entire length of the nucleotide sequence of (v). %, or 99% identical nucleotide sequences; and

과 접촉시키는 것을 포함한다.including contact with

본원에서는 인간 디스페린(hDYSF) 단백질을 암호화하는 재조합 폴리뉴클레오티드를 제조하는 방법이 개시되며, 여기서 상기 재조합 폴리뉴클레오티드는 서열번호 20의 뉴클레오티드 서열 또는 서열번호 20의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열을 포함하고, 상기 방법은, 세포를,Disclosed herein is a method for producing a recombinant polynucleotide encoding a human dysferrin (hDYSF) protein, wherein the recombinant polynucleotide is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences, the method comprising:

(a) 제 1 역전된 말단 반복부(first inverted terminal repeat; ITR); (a) a first inverted terminal repeat (ITR);

(b) 인간 디스페린(hDYSF) 단백질의 단편을 암호화하는 폴리뉴클레오티드, 여기서 상기 폴리뉴클레오티드는, (i) 서열번호 1 또는 6의 뉴클레오티드 서열; (ii) 서열번호 1 또는 6의 전체 길이에 걸쳐 서열번호 1 또는 6의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 13 또는 15의 뉴클레오티드 서열; (iv) 서열번호 13 또는 15의 전체 길이에 걸쳐 서열번호 13 또는 15의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 뉴클레오티드 서열, 여기서 상기 hDYSF 단백질의 단편은 서열번호 9의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열;로 구성됨; 및(b) a polynucleotide encoding a fragment of human dysferrin (hDYSF) protein, wherein the polynucleotide comprises (i) a nucleotide sequence of SEQ ID NO: 1 or 6; (ii) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 1 or 6 over the entire length of SEQ ID NO: 1 or 6; or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 13 or 15; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 13 or 15 over the entire length of SEQ ID NO: 13 or 15; or 99% identical nucleotide sequences; (v) a nucleotide sequence encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 9; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of (v) over the entire length of the nucleotide sequence of (v). %, or 99% identical nucleotide sequences; and

(c) 제 2 ITR;(c) a second ITR;

을 포함하되, 상기 폴리뉴클레오티드는 상기 제 1 및 제 2 ITR에 의해서 플랭킹되고, 상기 제 2 AAV 벡터는 상기 (a) 제 1 역전된 말단 반복부(ITR)을 포함함; 및 wherein said polynucleotide is flanked by said first and second ITRs, said second AAV vector comprising said (a) a first inverted terminal repeat (ITR); and

(II) 제 2 AAV 벡터, 여기서 상기 제 2 AAV 벡터는,(II) a second AAV vector, wherein the second AAV vector,

(a) 제 3 역전된 말단 반복부(ITR);(a) a third inverted terminal repeat (ITR);

(b) 인간 디스페린 단백질의 단편을 암호화하는 폴리뉴클레오티드, 여기서 상기 폴리뉴클레오티드 서열은: (i) 서열번호 2 또는 8의 뉴클레오티드 서열; (ii) 서열번호 2 또는 8의 전체 길이에 걸쳐 서열번호 2 또는 8의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 14 또는 16의 뉴클레오티드 서열; (iv) 서열번호 14 또는 16의 전체 길이에 걸쳐 서열번호 14 또는 16의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 폴리뉴클레오티드 서열, 여기서 상기 hDYSF 단백질의 단편은 서열번호 10의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 폴리뉴클레오티드와 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 폴리뉴클레오티드 서열;로 구성됨; 및(b) a polynucleotide encoding a fragment of human dysferrin protein, wherein the polynucleotide sequence comprises: (i) the nucleotide sequence of SEQ ID NO: 2 or 8; (ii) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 2 or 8 over the entire length of SEQ ID NO: 2 or 8; or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 14 or 16; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 14 or 16 over the entire length of SEQ ID NO: 14 or 16; or 99% identical nucleotide sequences; (v) a polynucleotide sequence encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 10; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the polynucleotide of (v) over the entire length of the nucleotide sequence of (v). %, or 99% identical polynucleotide sequences; consisting of; and

(c) 제 4 ITR;(c) a fourth ITR;

을 포함하되, 상기 폴리뉴클레오티드는 상기 제 3 및 제 4 ITR에 의해서 플랭킹됨;Including, wherein the polynucleotide is flanked by the third and fourth ITRs;

을 포함하는, 이중 AAV 벡터 시스템과 접촉시키는 것을 포함한다. Including contacting with a dual AAV vector system, including.

일부 구현예에서, 세포는 진핵 세포이다. 일부 구현예에서, 세포는 근육 세포, 심장 세포, 줄기 세포, 위성 세포, 및/또는 간 세포이다.In some embodiments, the cell is a eukaryotic cell. In some embodiments, the cells are muscle cells, heart cells, stem cells, satellite cells, and/or liver cells.

본원에서는 인간 디스페린(hDYSF) 단백질을 암호화하는 재조합 폴리뉴클레오티드를 제조하는 방법이 개시되며, 여기서 상기 재조합 폴리뉴클레오티드 서열은 서열번호 20의 뉴클레오티드 서열 또는 서열번호 20의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열을 포함하고, 여기서 상기 방법은, 대상체에게, Disclosed herein is a method for preparing a recombinant polynucleotide encoding human dysferrin (hDYSF) protein, wherein the recombinant polynucleotide sequence is at least 90%, 91% of the nucleotide sequence of SEQ ID NO: 20 or the nucleotide sequence of SEQ ID NO: 20 , 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences, wherein the method comprises:

인간 디스페린(hDYSF) 단백질의 N-말단을 암호화하는 제 1 폴리뉴클레오티드 서열, 여기서 상기 제 1 폴리뉴클레오티드 서열은: (i) 서열번호 1, 6, 또는 18의 뉴클레오티드 서열; (ii) 서열번호 1, 6, 또는 18의 그들 각각의 전체 길이에 걸쳐 서열번호 1, 6, 또는 18의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 13 또는 15의 뉴클레오티드 서열; (iv) 서열번호 13 또는 15의 그들 각각의 전체 길이에 걸쳐 서열번호 13 또는 15의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 뉴클레오티드 서열, 여기서 상기 hDYSF 단백질의 단편은 서열번호 9의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열;로 구성됨; 및 A first polynucleotide sequence encoding the N-terminus of a human dysferrin (hDYSF) protein, wherein the first polynucleotide sequence comprises: (i) the nucleotide sequence of SEQ ID NO: 1, 6, or 18; (ii) at least 90%, 91%, 92%, 93%, 94%, 95% of the nucleotide sequence of SEQ ID NO: 1, 6, or 18 over the entire length of each of SEQ ID NO: 1, 6, or 18; 96%, 97%, 98%, or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 13 or 15; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the nucleotide sequence of SEQ ID NO: 13 or 15 over their respective full length of SEQ ID NO: 13 or 15; 98%, or 99% identical nucleotide sequences; (v) a nucleotide sequence encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 9; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of (v) over the entire length of the nucleotide sequence of (v). %, or 99% identical nucleotide sequences; and

(b) 인간 디스페린 단백질의 C-말단 단편을 암호화하는 제 2 폴리뉴클레오티드 서열, 여기서 상기 제 2 폴리뉴클레오티드 서열은: (i) 서열번호 2, 8, 또는 19의 뉴클레오티드 서열; (ii) 서열번호 2, 8, 또는 19의 그들 각각의 전체 길이에 걸쳐 서열번호 2, 8, 또는 19의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 14 또는 16의 뉴클레오티드 서열; (iv) 서열번호 14 또는 16의 그들 각각의 전체 길이에 걸쳐 서열번호 14 또는 16의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 폴리뉴클레오티드 서열, 여기서 상기 hDYSF 단백질의 단편은 서열번호 10의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 폴리뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 폴리뉴클레오티드 서열로 구성됨;(b) a second polynucleotide sequence encoding a C-terminal fragment of human dysferrin protein, wherein the second polynucleotide sequence comprises: (i) the nucleotide sequence of SEQ ID NO: 2, 8, or 19; (ii) at least 90%, 91%, 92%, 93%, 94%, 95% of the nucleotide sequence of SEQ ID NO: 2, 8, or 19 over the entire length of each of SEQ ID NO: 2, 8, or 19; 96%, 97%, 98%, or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 14 or 16; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the nucleotide sequence of SEQ ID NO: 14 or 16 over their respective full length of SEQ ID NO: 14 or 16; 98%, or 99% identical nucleotide sequences; (v) a polynucleotide sequence encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 10; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the polynucleotide sequence of (v) over the entire length of the nucleotide sequence of (v); consisting of polynucleotide sequences that are 98%, or 99% identical;

을 포함하는 재조합 폴리뉴클레오티드로 투여하는 것을 포함한다. It includes administering a recombinant polynucleotide comprising a.

본원에서는 인간 디스페린(hDYSF) 단백질을 암호화하는 재조합 폴리뉴클레오티드를 제조하는 방법이 개시되며, 여기서 상기 재조합 폴리뉴클레오티드는 서열번호 20의 뉴클레오티드 서열 또는 서열번호 20의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열을 포함하고, 상기 방법은, 대상체에게,Disclosed herein is a method for producing a recombinant polynucleotide encoding a human dysferrin (hDYSF) protein, wherein the recombinant polynucleotide is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences, the method comprising:

(c) 제 2 ITR;(c) a second ITR;

(b) 인간 디스페린 단백질의 단편을 암호화하는 폴리뉴클레오티드, 여기서 상기 폴리뉴클레오티드 서열은: (i) 서열번호 2 또는 8의 뉴클레오티드 서열; (ii) 서열번호 2 또는 8의 전체 길이에 걸쳐 서열번호 2 또는 8의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 14 또는 16의 뉴클레오티드 서열; (iv) 서열번호 14 또는 16의 전체 길이에 걸쳐 서열번호 14 또는 16의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 폴리뉴클레오티드 서열, 여기서 상기 hDYSF 단백질의 단편은 서열번호 10의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 폴리뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 폴리뉴클레오티드 서열;로 구성됨; 및(b) a polynucleotide encoding a fragment of human dysferrin protein, wherein the polynucleotide sequence comprises: (i) the nucleotide sequence of SEQ ID NO: 2 or 8; (ii) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 2 or 8 over the entire length of SEQ ID NO: 2 or 8; or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 14 or 16; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 14 or 16 over the entire length of SEQ ID NO: 14 or 16; or 99% identical nucleotide sequences; (v) a polynucleotide sequence encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 10; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the polynucleotide sequence of (v) over the entire length of the nucleotide sequence of (v); 98%, or 99% identical polynucleotide sequences; and

(c) 제 4 ITR;(c) a fourth ITR;

을 포함하는, 이중 AAV 벡터 시스템을 투여하는 것을 포함한다. It includes administering a dual AAV vector system, comprising.

본원에서는 인간 디스페린(hDYSF) 단백질을 암호화하는 재조합 폴리뉴클레오티드의 발현을 포함하는, 대상체의 근이영양증을 치료하는 방법이 개시되며, 여기서 상기 재조합 폴리뉴클레오티드 서열은 서열번호 20의 뉴클레오티드 서열 또는 대상체에서 서열번호 20의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열을 포함한다.Disclosed herein is a method of treating muscular dystrophy in a subject comprising expression of a recombinant polynucleotide encoding human dysferrin (hDYSF) protein, wherein the recombinant polynucleotide sequence is the nucleotide sequence of SEQ ID NO: 20 or SEQ ID NO: 20 in a subject a nucleotide sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to the nucleotide sequence of 20.

일부 구현예에서, 상기 방법은 대상체에게, In some embodiments, the method provides a subject with

일부 구현예에서, 상기 방법은, 대상체에게,In some embodiments, the method comprises:

(c) 제 2 ITR;(c) a second ITR;

(b) 인간 디스페린 단백질의 단편을 암호화하는 폴리뉴클레오티드, 여기서 상기 폴리뉴클레오티드는, (i) 서열번호 2 또는 8의 뉴클레오티드 서열; (ii) 서열번호 2 또는 8의 전체 길이에 걸쳐 서열번호 2 또는 8의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (iii) 서열번호 14 또는 16의 뉴클레오티드 서열; (iv) 서열번호 14 또는 16의 전체 길이에 걸쳐 서열번호 14 또는 16의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질의 단편을 암호화하는 폴리뉴클레오티드, 여기서 상기 hDYSF 단백질의 단편은 서열번호 10의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 폴리뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 폴리뉴클레오티드 서열;로 구성됨; 및(b) a polynucleotide encoding a fragment of human dysferrin protein, wherein the polynucleotide comprises: (i) a nucleotide sequence of SEQ ID NO: 2 or 8; (ii) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 2 or 8 over the entire length of SEQ ID NO: 2 or 8; or 99% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 14 or 16; (iv) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of SEQ ID NO: 14 or 16 over the entire length of SEQ ID NO: 14 or 16; or 99% identical nucleotide sequences; (v) a polynucleotide encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 10; or (vi) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the polynucleotide sequence of (v) over the entire length of the nucleotide sequence of (v); 98%, or 99% identical polynucleotide sequences; and

(c) 제 4 ITR;(c) a fourth ITR;

일부 구현예에서, 대상체는 인간, 비-인간 영장류, 개(canine), 양(ovine), 말(horse), 돼지(porcine), 뮤린(murine), 래트(rat), 토끼(rabbit), 소(bovine), 또는 고양이(feline)로부터 선택된 포유동물이다.In some embodiments, the subject is a human, non-human primate, canine, ovine, horse, porcine, murine, rat, rabbit, cow (bovine), or a mammal selected from feline.

일부 구현예에서, 대상체는 디스페리노병증(dysferlinopathy)을 앓고 있다. 일부 구현예에서, 디스페리노병증은 사지 근이영양증 유형 2B(LGMD2B) 또는 미요시 근병증이다.In some embodiments, the subject suffers from dysferlinopathy. In some embodiments, the dysferinopathy is limb muscular dystrophy type 2B (LGMD2B) or Miyoshi myopathy.

도 1은 디스페리노병증을 치료하기 위한 이중 AAV 벡터 시스템의 개략도를 제공한다. 5' 벡터(예를 들어 5' hDYSF AAV 벡터), pAAV.MHCK7.DYSF5'.PTG(PTG=프로모터/전이유전자)는 근육 특이적 MHCK7 프로모터, 키메라 인트론, 공통(consensus) 코작(Kozak) 서열 및 서열번호 12의 아미노산 1-1113에 상응하는 DYSF cDNA의 5' 부분을 함유한다. 3' 벡터(예를 들어 3' hDYSF AAV 벡터), pAAV.DYSF3'.POLYA는 서열번호 12의 아미노산 794-2080에 상응하는 DYSF cDNA의 3' 부분 및 폴리아데닐화 신호를 보유하는 DYSF 3'UTR을 함유한다.
도 2는 pAAV.MHCK7.DYSF5'.PTG DNA 벡터 플라스미드 맵을 제공한다.
도 3은 pAAV.DYSF3'.POLYA 벡터 DNA 플라스미드 맵을 제공한다.
도 4a 내지 도 4d는 이중 벡터 시스템의 전달 후 디스페린 발현을 나타낸다. 강력한 전체-길이 디스페린 발현(robust full-length dysferlin expression)은 면역 염색(immune staining)(도 4a) 및 웨스턴 블롯(western blot)(도 4c)에 의한 두 벡터의 전달 후에 나타났다. 어느 하나 벡터의 단독 전달은 비정상적인 디스페린 발현을 갖지 않았다(도 4b: 면역 염색, 도 4d: 웨스턴 블롯). 3222는 전체-길이 제어이다.
도 5a 내지 5c는 rAAVrh.74.MHCK7.DYSF.DV 전달 이후의 디스페린 발현의 시간 경과의 결과를 나타낸다. 도 5a는 좌측 전경골(LTA)으로의 이중 벡터의 전달 후에 보여지는 디스페린 면역표지에 의한 전체-길이 디스페린 발현(상부 패널)을 입증한다. 디스페린 발현은 병리학에서 비정상적인 반응이 없는 치료 후 1, 3, 및 6개월 동안 지속되었다(H&E, 하부 패널). 스케일 바(scale bar), 100 ㎛. 시점당 N=4. 도 5b는 주사된 LTA에서 전체-길이 디스페린의 발현을 입증하는 1, 3, 6 개월 샘플에 대한 웨스턴 블롯을 나타낸다(그룹 당 2개). 도 5c는 다양한 조직에 대한 주사 후 3 및 6개월에 ㎍ 벡터 게놈 당 게놈 DNA의 생체분포 플롯을 도시한다. 주의: LTA를 처리하였다; 로그 축.
도 6은 3 또는 12개월의 종말점(endpoints)에서 rAAVrh.74.MHCK7.DYSF.DV로 근육내 주사에 의해 치료된 4 마리의 개별 동물로부터의 표적 근육(LTA) 및 비표적 조직의 웨스턴 블롯 분석을 나타낸다.
도 7a 내지 7c는 AAVrh.74.MHCK7.DYSF.DV의 전신 전달(systemic delivery) 후 디스페린 발현을 나타낸다. 도 7a는 6×10¹² vg(2.4e13 vg/kg, 정량 표준으로서 슈퍼코일 DNA 또는 플라스미드를 기준으로 함) AAVrh.74.MHCK7.DYSF.DV의 전신 전달 후 조직의 디스페린 면역표지를 나타낸다(n=6/용량(dose)). 도시된 근육은 심장, 비복근, 횡격막 및 사두근, 치료된 (AAV.DV) 및 야생형(WT) 조직에 대한 것이다. 도 7b는 전경골(LTA), 비복근(RGAS), 사두근(LQD), 삼두근(RTri) 및 횡격막에서의 중앙화된 핵의 정량화를 도시한다. *p<0.05 샘플과 야생형 사이의 유의한 차이, # 샘플과 야생형 사이의 유의한 차이 없음. 도 7c는 로딩 대조군으로서 포함된 237 kD의 γ-튜불린(tubulin)에서 전체-길이 디스페린 밴드를 입증하는 조직 용해물(H: 심장, G: 비복근, Q: 사두근, D: 횡격막)의 웨스턴 블롯을 나타낸다.
도 8은 AAVrh.74.DYSF.DV 전달 후의 용량 의존적 막 재밀봉 활성을 나타낸다.
도 9는 AAVrh.74.MHCK7.DYSF.DV의 전신 전달 후 섬유증 및 염증의 역전을 보여준다. BlaJ 마우스를 정량 표준으로서 슈퍼코일 DNA 또는 플라스미드를 기준으로 6×10¹² vg(2.4e13 vg/kg)로 처리하였다. AAVrh.74.MHCK7.DYSF.DV(n=6). 돼지 근육을 제거하고, 섬유증(중간 칼럼) 및 CD8 단핵 세포의 존재에 대해 분석하였다. 유전자 전달 후 두 파라미터 모두에서 유의한 감소가 있었다.
도 10a 내지 도 10b는 rAAVrh.74.MHCK7.DYSF.DV의 전신 전달을 입증한다. Dysf-/-마우스에서 기능 결함을 회복시켰다. 도 10a: 설사 근육 스트립을 수확하고 특정 힘을 평가하기 위한 프로토콜에 적용하였다. 처리된 다이어프램은 2e12 vg 총 AAV.DYSF.DV(정량 표준으로서 수퍼코일 DNA 또는 플라스미드를 기준으로 8e13 vg/kg), 또는 6e12 vg 총 AAV.DYSF.DV(정량 표준으로서 수퍼코일 DNA 또는 플라스미드를 기준으로 2.4e13 vg/kg) 용량 둘 다에서 야생형 힘과 상이하지 않은 힘(**P>0.01, ANOVA)의 유의한 개선을 입증하였다. 낮은 용량에서 유의한 개선은 없었다.
도 11a 내지 도 11d는 AAV 캡시드 및 디스페린에 대한 T 세포 반응의 모니터링으로부터의 결과를 보여준다. 말초 혈액 단핵 세포를 단리하고, AAV5 및 AAVrh.74 캡시드(청색 막대) 뿐만 아니라 인간 디스페린(녹색)을 포함하는 펩티드에 노출시켰다. AAV5 캡시드 및 디스페린에 대한 T 세포 반응을 3개월(도 11a) 및 6개월(도 11b)에서 모니터링하였다. AAVrh.74 캡디스 및 디스페린에 대한 T 세포 반응을 3개월(도 11c) 및 6개월 (도 11d)에 모니터링하였다.
도 12a 내지 도 12c는 인간이 아닌 영장류에서 디스페린 발현을 나타낸다. 도 12a는 AAV5.DYSF 또는 AAVrh.74.DYSF.DV의 주사 후 3 및 6개월째에 NHP 조직의 조직학(H&E) 및 디스페린 면역형광(IF) 이미지를 나타낸다. H&E 염색된 절편은 섬유의 면역 침윤 및 괴사의 결여를 나타낸다. 도 12b는 AAV5.DYSF 및 AAVrh.74.DYSF.DV 둘 다에 대한 주사 후 3 및 6개월로부터의 조직의 웨스턴 블롯 이미지를 나타낸다. 중요하게는, 주사된 조직은 샴 대조군과 비교하여 디스페린의 과발현을 입증한다. 양성(+) 대조군은 야생형 마우스 조직이고, 음성(-) 대조군은 129Dysf-/-비주사된 조직이다. 도 12c는 좌측 TA 내로 AAVrh.74.DYSF.DV를 사용한 IM 주사 후 벡터 게놈의 생체분포(biodistribution)를 나타내며, 로그 스케일(logarithmic scale)에 주목한다.
도 13은 벡터 유래된 디스페린 발현을 확인하기 위한 항-FLAG의 사용을 입증한다. N 말단 FLAG 태그를 사용하여 내인성 및 AAV 유래 디스페린을 구별하였다.1 provides a schematic diagram of a dual AAV vector system for treating dysferinopathies. The 5' vector (e.g. 5' hDYSF AAV vector), pAAV.MHCK7.DYSF5'.PTG (PTG=promoter/transgene) contains the muscle specific MHCK7 promoter, chimeric intron, consensus Kozak sequence and It contains the 5' portion of DYSF cDNA corresponding to amino acids 1-1113 of SEQ ID NO: 12. A 3' vector (e.g., a 3' hDYSF AAV vector), pAAV.DYSF3'.POLYA, is a DYSF 3'UTR containing the 3' portion of the DYSF cDNA corresponding to amino acids 794-2080 of SEQ ID NO: 12 and a polyadenylation signal. contains
Figure 2 provides the pAAV.MHCK7.DYSF5'.PTG DNA vector plasmid map.
Figure 3 provides the pAAV.DYSF3'.POLYA vector DNA plasmid map.
4A-4D show dysferrin expression after delivery of the dual vector system. Robust full-length dysferlin expression was shown after delivery of both vectors by immunostaining (Fig. 4a) and western blot (Fig. 4c). Delivery of either vector alone had no aberrant dysferrin expression (FIG. 4B: immunostaining, FIG. 4D: Western blot). 3222 is full-length control.
5A to 5C show the results of the time course of dysferin expression after rAAVrh.74.MHCK7.DYSF.DV delivery. 5A demonstrates full-length dysferrin expression by dysferrin immunolabeling (upper panel) seen after delivery of dual vectors into the left tibialis anterior (LTA). Dysferin expression persisted for 1, 3, and 6 months after treatment with no abnormal response in pathology (H&E, lower panel). Scale bar, 100 μm. N=4 per time point. 5B shows Western blots for 1, 3, and 6 month samples demonstrating expression of full-length dysferrin in injected LTA (n=2 per group). 5C depicts biodistribution plots of genomic DNA per μg vector genome at 3 and 6 months post injection for various tissues. Note: LTA was treated; log axis.
6 Western blot analysis of target muscle (LTA) and non-target tissues from 4 separate animals treated by intramuscular injection with rAAVrh.74.MHCK7.DYSF.DV at endpoints of 3 or 12 months. indicates
7A-7C show dysferrin expression after systemic delivery of AAVrh.74.MHCK7.DYSF.DV. Figure 7a shows dysferrin immunolabeling of tissues after systemic delivery of 6×10 ¹² vg (2.4e13 vg/kg, based on supercoil DNA or plasmid as quantitative standard) AAVrh.74.MHCK7.DYSF.DV ( n=6/dose). Muscles shown are for cardiac, gastrocnemius, diaphragm and quadriceps, treated (AAV.DV) and wild type (WT) tissues. 7B shows quantification of centralized nuclei in the tibialis anterior (LTA), gastrocnemius (RGAS), quadriceps (LQD), triceps (RTri) and diaphragm. *p<0.05 significant difference between sample and wild type, # no significant difference between sample and wild type. Figure 7C is a Western image of tissue lysates (H: heart, G: gastrocnemius, Q: quadriceps, D: diaphragm) demonstrating a full-length dysperin band at 237 kD γ-tubulin included as a loading control. represents a blot.
Figure 8 shows dose-dependent membrane resealing activity after AAVrh.74.DYSF.DV delivery.
9 shows reversal of fibrosis and inflammation following systemic delivery of AAVrh.74.MHCK7.DYSF.DV. BlaJ mice were treated with 6×10 ¹² vg (2.4e13 vg/kg) based on supercoil DNA or plasmid as a quantification standard. AAVrh.74.MHCK7.DYSF.DV (n=6). Porcine muscles were removed and analyzed for fibrosis (middle column) and the presence of CD8 mononuclear cells. There was a significant decrease in both parameters after gene transfer.
10A-10B demonstrate systemic delivery of rAAVrh.74.MHCK7.DYSF.DV. Functional defects were restored in Dysf−/− mice. 10A: Diarrheal muscle strips were harvested and subjected to a protocol to evaluate specific forces. The treated diaphragm contained 2e12 vg total AAV.DYSF.DV (8e13 vg/kg based on supercoiled DNA or plasmid as quantification standard), or 6e12 vg total AAV.DYSF.DV (based on supercoiled DNA or plasmid as quantification standard). 2.4e13 vg/kg) demonstrated significant improvement in force (**P>0.01, ANOVA) not different from wild-type force at both doses. There was no significant improvement at the lower dose.
11A-11D show results from monitoring of T cell responses to AAV capsids and dysferrin. Peripheral blood mononuclear cells were isolated and exposed to peptides including AAV5 and AAVrh.74 capsids (blue bars) as well as human dysferrin (green). T cell responses to AAV5 capsid and dysferrin were monitored at 3 months (FIG. 11A) and 6 months (FIG. 11B). T cell responses to AAVrh.74 capdis and dysferrin were monitored at 3 months (FIG. 11C) and 6 months (FIG. 11D).
12A-12C show dysferrin expression in non-human primates. 12A shows histology (H&E) and dysferrin immunofluorescence (IF) images of NHP tissue at 3 and 6 months after injection of AAV5.DYSF or AAVrh.74.DYSF.DV. H&E stained sections show immune infiltration of fibers and lack of necrosis. 12B shows Western blot images of tissues from 3 and 6 months post injection for both AAV5.DYSF and AAVrh.74.DYSF.DV. Importantly, the injected tissues demonstrate overexpression of dysferin compared to sham controls. The positive (+) control is wild type mouse tissue, and the negative (-) control is 129Dysf−/− non-injected tissue. 12C shows the biodistribution of the vector genome after IM injection with AAVrh.74.DYSF.DV into the left TA, note the logarithmic scale.
13 demonstrates the use of anti-FLAG to confirm vector derived dysferrin expression. An N-terminal FLAG tag was used to distinguish endogenous and AAV-derived dysferrin.

정의Justice

달리 정의되지 않는 한, 본원에 사용된 모든 용어(기술적 및 과학적 용어 포함)는 본 발명이 속하는 분야의 당업자에 의해 일반적으로 이해되는 것과 동일한 의미를 갖는다. 추가로, 일반적으로 사용되는 사전에서 정의된 것과 같은 용어는 본원 및 관련 분야의 맥락에서 그의 의미와 일치하는 의미를 갖는 것으로 해석되어야 하고, 본원에 명시적으로 그렇게 정의되지 않는 한 이상적이거나 지나치게 형식적인 의미로 해석되지 않아야 한다는 것으로 이해될 것이다. 하기에서 명시적으로 정의되지 않지만, 이러한 용어는 그의 일반적인 의미에 따라 해석되어야 한다.Unless defined otherwise, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Additionally, terms as defined in commonly used dictionaries are to be construed to have a meaning consistent with their meaning in the context of this application and related fields, and are not to be idealized or overly formal unless expressly so defined herein. It will be understood that it should not be interpreted in a meaningful way. Although not explicitly defined below, these terms should be interpreted according to their ordinary meaning.

본원의 기술에서 사용된 용어는 단지 특정 구현예를 설명하기 위한 것이고, 본 발명을 제한하는 것으로 의도되지 않는다. 본원에 언급된 모든 간행물, 특허 출원, 특허 및 다른 참고문헌은 그 전문이 참조로 포함된다.The terminology used in the description herein is for the purpose of describing specific embodiments only and is not intended to limit the present invention. All publications, patent applications, patents and other references mentioned herein are incorporated by reference in their entirety.

본 기술의 실시는 달리 지시되지 않는 한, 당업계의 기술 내에 있는 조직 배양, 면역학, 분자 생물학, 미생물학, 세포 생물학, 및 재조합 DNA의 통상적인 기술을 이용할 것이다.The practice of the present technology will utilize, unless otherwise indicated, conventional techniques of tissue culture, immunology, molecular biology, microbiology, cell biology, and recombinant DNA within the skill of the art.

문맥상 달리 지시되지 않는 한, 본원에 기재된 본 발명의 다양한 특징은 임의의 조합으로 사용될 수 있는 것으로 구체적으로 의도된다. 더욱이, 본 개시내용은 또한 일부 구현예에서, 본원에 제시된 임의의 특징 또는 특징의 조합이 배제되거나 생략될 수 있는 것으로 고려된다. 예시를 위해, 본원 명세서가 복합체가 성분 A, B 및 C를 포함하는 것으로 언급하는 경우, A, B 또는 C 중의 임의의 것, 또는 그의 조합은 단수로 또는 임의의 조합으로 생략되고 구별될 수 있는 것으로 구체적으로 의도된다.Unless the context dictates otherwise, it is specifically intended that the various features of the invention described herein may be used in any combination. Moreover, the present disclosure also contemplates that in some embodiments, any feature or combination of features set forth herein may be excluded or omitted. For illustrative purposes, when the specification refers to a complex as comprising components A, B, and C, any of A, B, or C, or combinations thereof, singularly or in any combination, are omitted and identified as possible. specifically intended

명시적으로 달리 지시되지 않는 한, 모든 명시된 구현예, 특징, 및 용어는 인용된 구현예, 특징, 또는 용어 및 그의 생물학적 등가물 둘 다를 포함하는 것으로 의도된다.Unless expressly indicated otherwise, all stated embodiments, features, and terms are intended to include both the recited embodiments, features, or terms and their biological equivalents.

모든 수치 지정, 예를 들어 pH, 온도, 시간, 농도, 및 분자량은 적절한 경우 1.0 또는 0.1의 증분에 의해, 또는 대안적으로 +/- 15%, 또는 대안적으로 10%, 또는 대안적으로 5%, 또는 대안적으로 2%의 변동에 의해 변하는 근사치이고, 이러한 범위는 포함된다. 항상 명시적으로 언급되지는 않지만, 모든 수치 지정은 용어 "약"에 의해 선행되는 것으로 이해되어야 한다. 또한, 항상 명시적으로 언급되지는 않지만, 본원에 기재된 시약은 단지 예시적이고, 이러한 등가물은 당업계에 공지되어 있는 것으로 이해되어야 한다.All numerical designations, such as pH, temperature, time, concentration, and molecular weight, as appropriate, by increments of 1.0 or 0.1, or alternatively +/- 15%, or alternatively 10%, or alternatively 5%. %, or alternatively an approximation varying by a variance of 2%, such ranges are inclusive. Although not always explicitly stated, it should be understood that all numerical designations are preceded by the term “about”. Also, although not always explicitly stated, it is to be understood that the reagents described herein are merely illustrative, and such equivalents are known in the art.

본 기술의 실시는 달리 지시되지 않는 한, 당업계의 기술 내에 있는 유기 화학, 약리학, 면역학, 분자 생물학, 미생물학, 세포 생물학 및 재조합 DNA의 통상적인 기술을 이용할 것이다. 예를 들어 문헌 「Sambrook, Fritsch and Maniatis, Molecular Cloning: A Laboratory Manual, 2nd edition (1989)」; 「Current Protocols In Molecular Biology (F M Ausubel, 등, eds, (1987))」; 「Methods in the series Methods in Enzymology (Academic Press, Inc): PCR 2: A Practical Approach (M J MacPherson, B D Hames and G R Taylor eds (1995)), Harlow and Lane, eds (1988) Antibodies, a Laboratory Manual, and Animal Cell Culture (R I Freshney, ed (1987)」을 참조한다.The practice of the present technology will employ, unless otherwise indicated, conventional techniques of organic chemistry, pharmacology, immunology, molecular biology, microbiology, cell biology, and recombinant DNA within the skill of the art. See, eg, Sambrook, Fritsch and Maniatis, Molecular Cloning: A Laboratory Manual, 2nd edition (1989); "Current Protocols In Molecular Biology (F M Ausubel, et al., eds, (1987))"; 「Methods in the series Methods in Enzymology (Academic Press, Inc): PCR 2: A Practical Approach (M J MacPherson, B D Hames and G R Taylor eds (1995)), Harlow and Lane, eds (1988) Antibodies, a Laboratory Manual, and Animal Cell Culture (R I Freshney, ed (1987)).

본원에서 사용되는 용어 "증가된", "감소된", "높은", "낮은" 또는 이의 임의의 문법적 변형은 참조 조성물, 폴리펩티드, 단백질 등의 약 90%, 80%, 50%, 20%, 10%, 5%, 1%, 0.5%, 또는 심지어 0.1%의 변형을 지칭한다.As used herein, the terms "increased", "reduced", "high", "low" or any grammatical variations thereof refer to about 90%, 80%, 50%, 20%, A variation of 10%, 5%, 1%, 0.5%, or even 0.1%.

본원에 개시된 임의의 성분, 범위, 투여 형태 등의 선택을 설명하기 위해 사용되는 용어 "허용가능한", "유효한", 또는 "충분한"은 상기 성분, 범위, 투여 형태 등이 개시된 목적에 적합하다는 것으로 의도된다.The terms “acceptable,” “effective,” or “sufficient,” when used to describe a selection of any ingredient, range, dosage form, etc. disclosed herein, mean that the ingredient, range, dosage form, etc. is suitable for the disclosed purpose. it is intended

또한, 본원에서 사용되는 용어 "및/또는"은 하나 이상의 연관된 열거된 항목의 임의의 및 모든 가능한 조합, 뿐만 아니라 대안으로 해석될 때 조합의 결여("또는")를 지칭하고 이를 포괄한다.Also, as used herein, the term “and/or” refers to and encompasses any and all possible combinations of one or more associated listed items, as well as the lack of combinations (“or”) when alternatively interpreted.

명백한 인용 없이 그리고 달리 의도되지 않는 한, 본 개시내용이 폴리펩티드, 단백질, 폴리뉴클레오티드 또는 항체에 관한 것일 때, 이들의 등가물 또는 생물학적 등가물이 본 개시의 범위 내에 속하는 것으로 추론되어야 한다. 본원에서 사용되는 용어 "이의 생물학적 등가물"은 참조 단백질, 항체, 폴리펩티드 또는 핵산을 지칭할 때 "이의 등가물"과 동의어인 것으로 의도되고, 원하는 구조 또는 기능성을 여전히 유지하면서 최소의 서열 동일성을 갖는 것을 의도한다. 본원에서 구체적으로 언급되지 않는 한, 본원에 언급된 임의의 폴리뉴클레오티드, 폴리펩티드 또는 단백질이 또한 이의 등가물을 포함하는 것으로 고려된다. 예를 들어 등가물은 참조 서열의 길이에 걸쳐 적어도 약 70% 상동성 또는 동일성, 또는 적어도 80% 상동성 또는 동일성, 및 대안적으로, 적어도 약 85%, 또는 대안적으로 적어도 약 90%, 또는 대안적으로 적어도 약 95%, 또는 대안적으로 98% 상동성 또는 동일성을 의도하고, 참조 단백질, 폴리펩티드 또는 핵산에 대해 실질적으로 등가인 생물학적 활성을 나타낸다. 대안적으로, 폴리뉴클레오티드를 지칭할 때, 이의 등가물은 한 측면에서, 엄격한 조건 하에 참조 폴리뉴클레오티드 또는 추가 측면에서 참조 폴리뉴클레오티드 또는 이의 상보체와 동일하거나 유사한 활성 또는 기능을 갖는 참조 폴리뉴클레오티드 또는 이의 상보체에 혼성화하는 폴리뉴클레오티드이다.Unless expressly cited and otherwise intended, when this disclosure is directed to polypeptides, proteins, polynucleotides or antibodies, equivalents or bioequivalents thereof are to be inferred to fall within the scope of this disclosure. As used herein, the term "biological equivalent thereof", when referring to a reference protein, antibody, polypeptide or nucleic acid, is intended to be synonymous with "equivalent thereof", and is intended to have minimal sequence identity while still retaining the desired structure or functionality. do. Unless specifically stated herein, any polynucleotide, polypeptide or protein mentioned herein is also contemplated to include equivalents thereof. For example, an equivalent may have at least about 70% homology or identity, or at least 80% homology or identity, and alternatively, at least about 85%, or alternatively, at least about 90%, or alternatively, at least about 90% homology or identity over the length of the reference sequence. Typically at least about 95%, or alternatively, 98% homology or identity is intended, and exhibits biological activity that is substantially equivalent to a reference protein, polypeptide or nucleic acid. Alternatively, when referring to a polynucleotide, an equivalent thereof is, in one aspect, a reference polynucleotide or its complement that has the same or similar activity or function as the reference polynucleotide or its complement, in a further aspect, under stringent conditions. It is a polynucleotide that hybridizes to the body.

단백질 또는 폴리펩티드의 등가물 (본원에서 참조로서 지칭됨)은 참조에 대해 적어도 50% (또는 적어도 60%, 또는 적어도 70%, 또는 적어도 80%, 또는 적어도 90%) 동일성을 공유하고, 참조의 기능 및 제조가능성을 유지한다.Equivalents of proteins or polypeptides (referred to herein as references) share at least 50% (or at least 60%, or at least 70%, or at least 80%, or at least 90%) identity to the reference, and the functions of the reference and maintain manufacturability.

본원에서 사용되는 용어 "기능", "활성", 및 "효소 활성"은 상호교환가능하게 사용된다. 디스페린의 손실은 골격근에서 Ca² ⁺-의존성 막 복구를 손상시키는 것으로 나타났다[문헌 「Song 등, Proc Natl Acad Sci USA 98:4084-4088, 2001」 및 「Schnepp 등, J Virol 77:3495-3504, 2003」 참조]. 또한, 디스페린은 아넥신 A1 및 A2, AHNAK, 및 카베올린-3을 포함한, 막 복구에 관여하는 다른 단백질과 상호작용하는 것으로 나타났다[문헌 「Schnepp 등, J Virol 77:3495-3504, 2003」, 「Duan 등, J Virol 72:8568-8577, 1998」, 「Donsante 등, Gene Ther 8:1343-1346, 2001」, 및 「Monahan 등, Expert Opin Drug Saf 1:79-91, 2002」 참조]. 근섬유 재생 능력의 손실은 디스페린 결핍의 기여 결과로서 생각된다[문헌 「Song 등, Gene Ther 8:1299-1306, 2001」 참조]. 디스페린은 또한 소포 트래피킹 및 엔도시토시스 및 T 세관 형성과 연관되어 있다[문헌 「Eveson 등, The Journal of Biological Chemistry 285:28529-28539, 2010」, 「Klinge 등, FASEB Journal: Official Publication of the Federation of American Societies for Experimental Biology 21:1768-1776, 2007」, 및 「Klinge 등, Muscle & Nerve 41:166-173, 2010」 참조]. 따라서, 디스페린의 활성의 예는 골격근에서 막 복구, 예를 들어 근섬유 재생 능력의 막 재밀봉, 예방 또는 복구, 소포 트래피킹, 엔도시토시스, 및 횡방향 (T-) 세관 형성을 포함하나 이에 제한되지는 않는다. 막 복구 검정, 소포 트래피킹 검정, 및 튜브 형성 검정은 관련 기술분야에 공지되어 있고, 시험관내에서 디스페린 활성을 측정하는데 사용될 수 있다. 예를 들어 문헌 「Carmeille 등, Methods Mol Biol 1668:195-207, 2017」, 「Vassilieva and Nusrat, Methods Mol Biol 440:3-14, 2008」, 「Demonbreun 등, Am J Pathol 184(1):248-59, 2014」을 참조하며, 이들 각각은 그 전문이 참조로 포함된다. 디스페린의 활성을 측정하기 위한 추가의 방법은, 예를 들어 문헌 「Grose 등, PLoS One 7:e39233, 2012」 및 「Sondergaard 등, Anns of Clin Trans Neurol 2:256-270, 2015」에서 발견된다.As used herein, the terms "function", "activity", and "enzymatic activity" are used interchangeably. Loss of ^{dysferrin has been shown to impair Ca 2+} ^-dependent membrane repair in skeletal muscle [Song et al., Proc Natl Acad Sci USA 98:4084-4088, 2001] and [Schnepp et al., J Virol 77:3495-3504 , 2003]. Dysferin has also been shown to interact with other proteins involved in membrane repair, including annexins A1 and A2, AHNAK, and caveolin-3 [Schnepp et al., J Virol 77:3495-3504, 2003] , "Duan et al., J Virol 72:8568-8577, 1998", "Donsante et al., Gene Ther 8:1343-1346, 2001", and "Monahan et al., Expert Opin Drug Saf 1:79-91, 2002"] . Loss of muscle fiber regeneration capacity is thought to be a contributing result of dysferrin deficiency (see Song et al., Gene Ther 8:1299-1306, 2001). Dysferin has also been implicated in vesicle trafficking and endocytosis and T tubule formation [Eveson et al., The Journal of Biological Chemistry 285:28529-28539, 2010], Klinge et al., FASEB Journal: Official Publication of the Federation of American Societies for Experimental Biology 21:1768-1776, 2007, and Klinge et al., Muscle & Nerve 41:166-173, 2010]. Thus, examples of activities of dysferrin include, but are not limited to, membrane repair in skeletal muscle, e.g., membrane resealing, prevention or restoration of muscle fiber regenerative capacity, vesicle trafficking, endocytosis, and transverse (T-) tubule formation. Not limited. Membrane repair assays, vesicle trafficking assays, and tube formation assays are known in the art and can be used to measure dysferrin activity in vitro. See, eg, Carmeille et al., Methods Mol Biol 1668:195-207, 2017, Vassilieva and Nusrat, Methods Mol Biol 440:3-14, 2008, Demonbreun et al., Am J Pathol 184(1):248 -59, 2014”, each of which is incorporated by reference in its entirety. Additional methods for measuring the activity of dysferrin are found, for example, in Grose et al., PLoS One 7:e39233, 2012 and Sondergaard et al., Anns of Clin Trans Neurol 2:256-270, 2015. .

폴리뉴클레오티드의 등가물(본원에서 참조로서 지칭됨)은 참조에 대해 적어도 50% (또는 적어도 60%, 또는 적어도 70%, 또는 적어도 80%, 또는 적어도 90%)의 동일성을 공유하고, 참조에 의해 암호화된 것과 동일한 폴리펩티드를 암호화하거나, 참조에 의해 암호화된 폴리펩티드의 등가물을 암호화한다.Equivalents of polynucleotides (referred to herein as references) share at least 50% (or at least 60%, or at least 70%, or at least 80%, or at least 90%) identity to the reference, and are encoded by the reference. encodes the same polypeptide as described above, or encodes an equivalent of the polypeptide encoded by reference.

참조 서열의 아미노산/뉴클레오티드 잔기 또는 연속적인 세그먼트에 등가인 (또는 그에 상응하는) 시험 서열의 위치 또는 연속적인 세그먼트에 도달하기 위해, 서열 정렬(sequence alignment)이 시험 서열과 참조 서열 사이에서 수행된다. 서로에 대해 정렬된 위치 또는 세그먼트는 등가물로서 결정된다.A sequence alignment is performed between the test sequence and the reference sequence to arrive at a position or contiguous segment of the test sequence that is equivalent to (or equivalent to) an amino acid/nucleotide residue or contiguous segment of the reference sequence. Positions or segments aligned with respect to each other are determined as equivalents.

용어 "친화도 태그"는 친화성 태그에 결합할 수 있는, 즉 친화성 태그에 대해 친화성을 갖는 리간드를 사용하여 융합 단백질의 검출 및/또는 세포 환경으로부터 융합 단백질의 정제를 허용하기 위해 융합 단백질 내에 포함될 수 있는 폴리펩티드를 지칭한다. 리간드는 항체, 수지, 또는 상보성 폴리펩티드일 수 있지만, 이에 제한되지는 않는다. 친화성 태그는 작은 펩티드, 통상적으로 대략 4 내지 16개의 아미노산 길이의 펩티드를 포함할 수 있거나, 더 큰 폴리펩티드를 포함할 수 있다. 통상적으로 사용되는 친화성 태그는 특히 폴리아르기닌, FLAG, V5, 폴리히스티딘, c-Myc, Strep II, 말토스 결합 단백질(MBP), N-유틸화 물질 단백질 A(NusA), 티오레독신(Trx), 및 글루타티온 S-전이효소(GST)를 포함한다[예를 들어 문헌 「GST Gene Fusion System Handbook - Sigma-Aldrich」 참조]. 한 구현예에서, 친화성 태그는 폴리히스티딘 태그, 예를 들어 His6 태그(서열번호 21)이다. 융합 단백질 내에 친화성 태그의 포함은 친화성 태그에 단단하고 특이적으로 결합할 수 있는 친화성 매질을 사용하여 융합 단백질이 친화성 정제에 의해 세포 환경으로부터 정제되도록 한다. 친화성 매질은 예를 들어 금속-충전된 수지 또는 아가로스 또는 금속 비드와 같은 고정상 (매트릭스)에 공유 연결된 리간드를 포함할 수 있다. 예를 들어 폴리히스티딘 태그부착된 융합 단백질(또한 His 태그부착된 융합 단백질로서 지칭됨)은 Ni² ⁺ 또는 Co² ⁺ 로딩된 수지를 사용하여 고정된 금속 이온 크로마토그래피에 의해 회수될 수 있고, 항-FLAG 친화성 겔은 FLAG 태그부착된 융합 단백질을 포획하기 위해 사용될 수 있고, 아가로스와 같은 고체 지지체에 가교된 글루타티온은 GST 태그부착된 융합 단백질을 포획하기 위해 사용될 수 있다. 한 측면에서, 친화성 태그는 정제 태그 또는 마커이다.The term "affinity tag" refers to a fusion protein to allow detection of the fusion protein and/or purification of the fusion protein from the cellular environment using a ligand capable of binding to the affinity tag, i.e., having affinity for the affinity tag. Refers to a polypeptide that can be included within. A ligand can be, but is not limited to, an antibody, resin, or complementary polypeptide. Affinity tags can include small peptides, typically about 4 to 16 amino acids in length, or they can include larger polypeptides. Commonly used affinity tags include polyarginine, FLAG, V5, polyhistidine, c-Myc, Strep II, maltose binding protein (MBP), N-utilizing substance protein A (NusA), thioredoxin (Trx), among others. ), and glutathione S-transferase (GST) (see, eg, "GST Gene Fusion System Handbook - Sigma-Aldrich"). In one embodiment, the affinity tag is a polyhistidine tag, such as a His6 tag (SEQ ID NO: 21). Inclusion of an affinity tag within the fusion protein allows the fusion protein to be purified from the cellular environment by affinity purification using an affinity medium capable of binding tightly and specifically to the affinity tag. The affinity medium may include, for example, a metal-filled resin or a ligand covalently linked to a stationary phase (matrix) such as agarose or metal beads. For example, polyhistidine-tagged fusion proteins (also referred to as His-tagged fusion proteins ^{) can be recovered by immobilized metal ion chromatography using Ni 2+ or Co 2+} ^loaded ^resins , ^and A -FLAG affinity gel can be used to capture FLAG tagged fusion proteins, and glutathione cross-linked to a solid support such as agarose can be used to capture GST tagged fusion proteins. In one aspect, the affinity tag is a purification tag or marker.

본원에서 사용되는 용어 "정제", "정제하는" 또는 "분리하는"은 복합 혼합물, 예를 들어 세포 용해물 또는 폴리펩티드의 혼합물로부터 1종 이상의 생체물질(예를 들어 폴리뉴클레오티드, 폴리펩티드 또는 바이러스 벡터)을 단리하는 공정을 지칭한다. 정제, 분리 또는 단리는 완료될 필요는 없으며, 즉 복합 혼합물의 일부 성분은 정제 공정 후에 1종 이상의 생체물질(예를 들어 폴리뉴클레오티드, 폴리펩티드 또는 바이러스 벡터)과 함께 남아있을 수 있다. 그러나, 정제의 생성물은 정제 전에 복합 혼합물에 비해 1종 이상의 생체물질 (예를 들어 폴리뉴클레오티드, 폴리펩티드 또는 바이러스 벡터)에 대해 농축되어야 하며, 복합 혼합물 내에 초기에 존재하는 다른 성분의 유의한 부분은 정제 공정에 의해 제거되어야 한다.As used herein, the term "purifying", "purifying" or "isolating" refers to one or more biomaterials (eg, polynucleotides, polypeptides or viral vectors) from a complex mixture, such as a cell lysate or mixture of polypeptides. Refers to the process of isolating. The purification, separation or isolation need not be complete, ie some components of the complex mixture may remain with one or more biomaterials (eg polynucleotides, polypeptides or viral vectors) after the purification process. However, the product of purification must be enriched for one or more biomaterials (e.g. polynucleotides, polypeptides or viral vectors) relative to the complex mixture prior to purification, and a significant fraction of the other components initially present in the complex mixture are purified. It must be removed by the process.

본원에서 사용되는 용어 "세포"는 임의로 대상체 또는 상업적으로 입수가능한 공급원으로부터 수득된 원핵 또는 진핵 세포를 지칭할 수 있다. 일부 경우에, 세포는 숙주 세포, 예를 들어 포유동물 세포 또는 포유동물 숙주 세포이다. 일부 경우에, 숙주 세포는 또한 본원에서 생산 세포 또는 패키징 세포로서 지칭된다. 일부 경우에, 세포주는 패키징 세포주이다.As used herein, the term “cell” may refer to a prokaryotic or eukaryotic cell, optionally obtained from a subject or from a commercially available source. In some cases, a cell is a host cell, eg, a mammalian cell or mammalian host cell. In some cases, host cells are also referred to herein as production cells or packaging cells. In some cases, the cell line is a packaging cell line.

"유핵 세포"는 모네라(monera)를 제외한 모든 생명체를 포함한다. 이들은 막-결합된 핵을 통해 쉽게 구별될 수 있다. 동물, 식물, 진균 및 원생생물은 세포가 내부 막 및 세포골격에 의해 복합 구조로 조직화되는 진핵생물 또는 유기체이다. 가장 특징적인 막-결합된 구조는 핵이다. 구체적으로 언급되지 않는 한, 용어 "숙주"는 예를 들어 효모, 고등 식물, 곤충 및 포유동물 세포를 포함한 진핵 숙주를 포함한다. 진핵 세포 또는 숙주의 비제한적 예는 유인원, 소, 돼지, 뮤린, 래트, 조류, 파충류 및 인간, 예를 들어 HEK293 세포, 차이니즈 햄스터 난소(CHO) 세포, CHO-S 세포, CHO-K1 세포, 293T 세포, HeLa 세포, 바비 햄스터 신장(BHK) 세포, Sf9 세포, 줄기 세포, 위성 세포 및 근육 세포를 포함한다. 근육 세포의 예는 골격근 세포, 심근 세포 및 평활근 세포를 포함하나 이에 제한되지는 않는다.A "nucleated cell" includes all living organisms except monera. They can be readily distinguished through their membrane-bound nuclei. Animals, plants, fungi and protists are eukaryotes or organisms in which cells are organized into complex structures by internal membranes and cytoskeletons. The most characteristic membrane-bound structure is the nucleus. Unless specifically stated otherwise, the term "host" includes eukaryotic hosts including, for example, yeast, higher plants, insects and mammalian cells. Non-limiting examples of eukaryotic cells or hosts include simian, bovine, pig, murine, rat, avian, reptile and human, e.g. HEK293 cells, Chinese Hamster Ovary (CHO) cells, CHO-S cells, CHO-K1 cells, 293T cells, HeLa cells, barbie hamster kidney (BHK) cells, Sf9 cells, stem cells, satellite cells and muscle cells. Examples of muscle cells include, but are not limited to, skeletal muscle cells, cardiac muscle cells, and smooth muscle cells.

일반적으로 핵 또는 임의의 다른 막-결합된 소기관이 결여되고 2개의 도메인, 박테리아 및 고세균으로 분할되는 "유핵 세포". 염색체 DNA에 더하여, 이들 세포는 또한 에피솜으로 불리는 원형 루프에 유전 정보를 함유할 수 있다. 박테리아 세포는 매우 작고, 대략 동물 미토콘드리아의 크기 (약 1 내지 2 ㎛의 직경 및 10 ㎛의 길이)이다. 유핵 세포는 3개의 주요 형상: 막대 형상, 구형 및 나선형을 특징으로 한다. 진핵생물과 같은 정교한 복제 과정을 거치는 대신에, 박테리아 세포는 이원 핵분열에 의해 분열된다. 예로는 바실러스 박테리아(Bacillus　bacteria), 대장균 박테리아(E. coli 　bacterium) 및 살모넬라 박테리아(Salmonella　bacterium)를 포함하나, 이에 한정되지 않는다.A "nucleated cell" that usually lacks a nucleus or any other membrane-bound organelle and divides into two domains, bacteria and archaea. In addition to chromosomal DNA, these cells may also contain genetic information in circular loops called episomes. Bacterial cells are very small, about the size of animal mitochondria (about 1-2 μm in diameter and 10 μm in length). Nucleated cells are characterized by three main shapes: rod-shaped, spherical and spiral. Instead of going through an elaborate replication process like eukaryotes, bacterial cells divide by binary fission. Examples include Bacillus bacteria, E. coli bacteria　bacterium) and Salmonella bacteria ( Salmonella bacterium), but are not limited thereto.

핵산 서열에 적용되는 바와 같은 용어 "암호화(encode)"는, 천연 상태에서 또는 당업자에게 널리 공지된 방법에 의해 조작되는 경우에 폴리펩티드를 "암호화"하는 폴리뉴클레오티드를 지칭하며, 전사 및/또는 번역되어 폴리펩티드 및/또는 그의 단편에 대한 mRNA를 생성할 수 있다. 안티센스 가닥은 이러한 핵산의 상보체이며, 암호화 서열은 그로부터 추론될 수 있다.The term "encode" as applied to a nucleic acid sequence refers to a polynucleotide that "encodes" a polypeptide in its native state or when manipulated by methods well known to those of ordinary skill in the art and is transcribed and/or translated. mRNA for the polypeptide and/or fragments thereof can be produced. The antisense strand is the complement of these nucleic acids, from which the coding sequence can be deduced.

용어 "동등한" 또는 "생물학적 등가물"은 특정 분자, 생물학적 또는 세포 물질을 지칭하는 경우에 상호교환가능하게 사용되며, 원하는 구조 또는 기능성(예를 들어 유사한 기능 또는 활성을 가짐)을 여전히 유지하면서 최소의 상동성을 갖는 것을 의도한다. 참조 폴리펩티드, 단백질 또는 폴리뉴클레오티드에 대한 등가물 또는 생물학적 등가물을 지칭하는 경우에 등가물 또는 생물학적 등가물이 참조 폴리펩티드, 단백질 또는 폴리뉴클레오티드 및 등가물 또는 실질적으로 등가인 생물학적 활성에 대해 언급된 구조적 관계를 갖는다는 것이 명백히 언급되지는 않는 것으로 이해되어야 한다. 예를 들어 등가 폴리펩티드, 단백질 또는 폴리뉴클레오티드의 비제한적인 예는 참조 폴리펩티드, 폴리뉴클레오티드 또는 단백질의 길이에 걸쳐 폴리펩티드, 폴리뉴클레오티드 또는 단백질 서열에 대해 또는 그와 적어도 60%, 또는 대안적으로 적어도 65%, 또는 대안적으로 적어도 70%, 또는 대안적으로 적어도 75%, 또는 대안적으로 적어도 80%, 또는 대안적으로 적어도 85%, 또는 대안적으로 적어도 90%, 또는 대안적으로 적어도 95% 동일성을 갖는 폴리펩티드, 단백질 또는 폴리뉴클레오티드를 포함한다. 대안적으로, 등가 폴리펩티드는 이러한 참조 폴리펩티드 서열을 암호화하는 폴리뉴클레오티드에 대해 높은 엄격도의 조건 하에 혼성화되고 실질적으로 등가이거나 등가인 생물학적 활성을 갖는 폴리뉴클레오티드 또는 그의 상보체에 의해 암호화되는 것이다. 높은 엄격도의 조건이 본원에 기재되어 있으며, 본원에 참조로 포함된다. 대안적으로, 그의 등가물은 참조 폴리뉴클레오티드, 예를 들어 야생형 폴리뉴클레오티드에 대해 참조 폴리뉴클레오티드의 길이에 걸쳐 적어도 70%, 또는 대안적으로 적어도 75%, 또는 대안적으로 적어도 80%, 또는 대안적으로 적어도 85%, 또는 대안적으로 적어도 90%, 또는 대안적으로 적어도 95% 동일성, 또는 적어도 97% 서열 동일성을 갖는 폴리뉴클레오티드 또는 그의 상보체에 의해 암호화되는 폴리펩티드이다. 이러한 등가 폴리펩티드는 참조 폴리뉴클레오티드에 의해 암호화되는 폴리펩티드와 동일한 생물학적 활성을 갖는다.The terms "equivalent" or "biological equivalent" are used interchangeably when referring to a particular molecule, biological or cellular material, which is intended to provide a minimum of the desired structure or functionality (e.g., having a similar function or activity) while still retaining the desired structure or functionality. It is intended to have homology. When referring to an equivalent or biological equivalent to a reference polypeptide, protein or polynucleotide, it should be made clear that the equivalent or biological equivalent has the stated structural relationship with respect to the reference polypeptide, protein or polynucleotide and the equivalent or substantially equivalent biological activity. It should be understood that not mentioned. For example, non-limiting examples of equivalent polypeptides, proteins or polynucleotides are at least 60%, or alternatively at least 65% relative to or relative to a polypeptide, polynucleotide or protein sequence over the length of the reference polypeptide, polynucleotide or protein. , or alternatively at least 70%, or alternatively at least 75%, or alternatively at least 80%, or alternatively at least 85%, or alternatively at least 90%, or alternatively at least 95% identity Polypeptides, proteins or polynucleotides with Alternatively, an equivalent polypeptide is one that hybridizes under conditions of high stringency to a polynucleotide encoding such a reference polypeptide sequence and is encoded by a polynucleotide that has substantially equivalent or equivalent biological activity, or its complement. High stringency conditions are described herein and incorporated herein by reference. Alternatively, the equivalent thereof is at least 70%, or alternatively at least 75%, or alternatively at least 80%, or alternatively, over the length of the reference polynucleotide relative to the reference polynucleotide, e.g., a wild-type polynucleotide. A polypeptide encoded by a polynucleotide or its complement having at least 85%, or alternatively at least 90%, or alternatively at least 95% identity, or at least 97% sequence identity. Such equivalent polypeptides have the same biological activity as the polypeptide encoded by the reference polynucleotide.

등가 폴리뉴클레오티드의 비제한적인 예는 참조 폴리뉴클레오티드에 대해 적어도 60%, 또는 대안적으로 적어도 65%, 또는 대안적으로 적어도 70%, 또는 대안적으로 적어도 75%, 또는 대안적으로 적어도 80%, 또는 대안적으로 적어도 85%, 또는 대안적으로 적어도 90%, 또는 대안적으로 적어도 95%, 또는 대안적으로 적어도 97% 동일성을 갖는 폴리뉴클레오티드를 포함한다. 등가물은 또한 참조 폴리뉴클레오티드에 대해 높은 엄격도의 조건 하에 혼성화되는 폴리뉴클레오티드 또는 그의 상보체를 의도한다. 이러한 등가 폴리뉴클레오티드는 참조 폴리뉴클레오티드와 동일한 생물학적 활성을 갖는다.Non-limiting examples of equivalent polynucleotides are at least 60%, or alternatively at least 65%, or alternatively at least 70%, or alternatively at least 75%, or alternatively at least 80%, relative to the reference polynucleotide. or alternatively at least 85%, or alternatively at least 90%, or alternatively at least 95%, or alternatively at least 97% identity. Equivalent is also intended a polynucleotide or its complement that hybridizes under conditions of high stringency to a reference polynucleotide. Such equivalent polynucleotides have the same biological activity as the reference polynucleotide.

또 다른 서열에 대한 "서열 동일성"의 특정 백분율(예를 들어 80%, 85%, 90%, 또는 95%)을 갖는 폴리뉴클레오티드 또는 폴리뉴클레오티드 영역 (또는 폴리펩티드 또는 폴리펩티드 영역)은 정렬될 때, 참조 폴리뉴클레오티드의 길이에 걸쳐 2개의 서열을 비교할 때 염기 (또는 아미노산)의 백분율이 동일하다는 것을 의미한다. 정렬 및 퍼센트 상동성 또는 서열 동일성은 관련 기술분야에 공지된 소프트웨어 프로그램, 예를 들어 문헌 「Current Protocols in Molecular Biology (Ausubel 등, eds 1987) Supplement 30, section 7.7.18, Table 7.7.1」에 기재된 것을 사용하여 결정될 수 있다. 비제한적인 예시적인 정렬 프로그램은 디폴트 파라미터를 사용하는 BLAST이다. 특히, 예시적인 프로그램은 하기 디폴트 파라미터를 사용하는 BLASTN 및 BLASTP를 포함한다: 유전자 코드=표준; 필터=none; 가닥=nboth; 컷오프=60; 예상=10; 매트릭스=BLOSUM62; 설명= 50개의 서열; 분류=높은 점수; 데이터베이스=비-중복, GenBank+EMBL+DDBJ+PDB+GenBank CDS 번역+SwissProtein+SPupdate+PIR. 이들 프로그램의 세부사항은 하기 인터넷 주소: ncbimnihgov/cgi-bin/BLAST에서 찾을 수 있다. 서열 동일성 및 퍼센트 동일성은 이들을 clustalW(웹 주소:genomejp/tools/clustalw/, 2017년 1월 13일에 마지막으로 액세스됨) 또는 Clustal Omega(ebiacuk/Tools/msa/clustalo/에서 입수가능함) 내로 혼입시킴으로써 결정될 수 있다.Polynucleotides or polynucleotide regions (or polypeptides or polypeptide regions) that have a certain percentage (e.g., 80%, 85%, 90%, or 95%) of "sequence identity" to another sequence, when aligned, refer to It means that the percentage of bases (or amino acids) is the same when comparing two sequences over the length of a polynucleotide. Alignment and percent homology or sequence identity can be performed using software programs known in the art, such as those described in Current Protocols in Molecular Biology (Ausubel et al., eds 1987) Supplement 30, section 7.7.18, Table 7.7.1. can be determined using A non-limiting exemplary alignment program is BLAST using default parameters. In particular, exemplary programs include BLASTN and BLASTP using the following default parameters: genetic code=standard; filter=none; strand=nboth; cutoff=60; expected=10; matrix=BLOSUM62; Description = 50 sequences; classification=high score; Database=non-redundant, GenBank+EMBL+DDBJ+PDB+GenBank CDS Translation+SwissProtein+SPupdate+PIR. Details of these programs can be found at the following Internet address: ncbimnihgov/cgi-bin/BLAST. Sequence identity and percent identity were determined by incorporating them into clustalW (web address: genomejp/tools/clustalw/, last accessed on January 13, 2017) or Clustal Omega (available at ebiacuk/Tools/msa/clustalo/). can be determined

"상동성" 또는 "동일성" 또는 "유사성"은 2개의 펩티드 사이의 또는 2개의 핵산 분자 사이의 서열 유사성을 지칭한다. 상동성은 비교의 목적을 위해 정렬될 수 있는 각각의 서열에서의 위치를 비교함으로써 결정될 수 있다. 비교된 서열에서의 위치가 동일한 염기 또는 아미노산에 의해 점유될 때, 분자는 그 위치에서 상동성이다. 서열 사이의 상동성의 정도는 서열에 의해 공유되는 매칭 또는 상동성 위치의 수의 함수이다. "비관련" 또는 "비-상동성" 서열은 본 개시의 서열 중의 하나와 40% 미만의 동일성, 또는 대안적으로 25% 미만의 동일성을 공유한다.“Homology” or “identity” or “similarity” refers to sequence similarity between two peptides or between two nucleic acid molecules. Homology can be determined by comparing positions in each sequence that can be aligned for purposes of comparison. When a position in the compared sequences is occupied by the same base or amino acid, the molecules are homologous at that position. The degree of homology between sequences is a function of the number of matching or homologous positions shared by the sequences. An “unrelated” or “non-homologous” sequence shares less than 40% identity, or alternatively less than 25% identity, with one of the sequences of the present disclosure.

본원에서 사용되는 용어 "적어도 90% 동일한"은 약 90% 내지 약 100%의 2개의 비교된 서열 (폴리뉴클레오티드 또는 폴리펩티드)의 동일성을 지칭한다. 이는 또한 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 약 91% 내지 약 100%, 약 92% 내지 약 100%, 약 93% 내지 약 100%, 약 94% 내지 약 100%, 약 95% 내지 약 100%, 약 96% 내지 약 100%, 약 97% 내지 약 100%, 약 98% 내지 약 100%, 또는 약 99% 내지 약 100%의 동일성을 포함한다.As used herein, the term "at least 90% identical" refers to about 90% to about 100% identity of two compared sequences (polynucleotides or polypeptides). It is also at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, about 91% to about 100%, about 92% to about 92% About 100%, about 93% to about 100%, about 94% to about 100%, about 95% to about 100%, about 96% to about 100%, about 97% to about 100%, about 98% to about 100 %, or from about 99% to about 100% identity.

본원에서 사용되는 용어 "함유한다", "유사하다" 및 "유사한"은 폴리뉴클레오티드, 단백질 및/또는 펩티드의 기능, 활성 또는 기능적 활성을 설명하면서 상호교환적으로 사용되며, 참조 단백질, 폴리뉴클레오티드 및/또는 펩티드의 활성의 적어도 약 20%(적어도 약 30%, 적어도 약 40%, 적어도 약 50%, 적어도 약 60%, 적어도 약 70%, 적어도 약 80%, 적어도 약 90%, 적어도 약 95%, 적어도 약 97%, 또는 약 100%를 포함하나 이에 제한되지 않음)의 기능적 활성에 관한 것이다.As used herein, the terms "comprises," "is similar to," and "similar to" are used interchangeably while describing a function, activity, or functional activity of a polynucleotide, protein, and/or peptide, and refer to a protein, polynucleotide, and/or peptide. / or at least about 20% (at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 95%) of the activity of the peptide , at least about 97%, or about 100%) functional activity.

"혼성화"는 하나 이상의 폴리뉴클레오티드가 반응하여 뉴클레오티드 잔기의 염기 사이의 수소 결합을 통해 안정화되는 복합체를 형성하는 반응을 지칭한다. 수소 결합은 Watson-Crick 염기 짝짓기(base pairing), Hoogstein 결합에 의해, 또는 임의의 다른 서열-특이적 방식으로 일어날 수 있다. 복합체는 듀플렉스 구조(duplex structure)를 형성하는 2개의 가닥, 다중-가닥 복합체를 형성하는 3개 이상의 가닥, 단일 자가-혼성화 가닥, 또는 이들의 임의의 조합을 포함할 수 있다. 혼성화 반응은 PCR 반응의 개시, 또는 리보자임(ribozyme)에 의한 폴리뉴클레오티드의 효소적 절단과 같은 보다 광범위한 공정에서의 단계를 구성할 수 있다. “Hybridization” refers to a reaction in which one or more polynucleotides react to form a complex that is stabilized through hydrogen bonds between the bases of nucleotide residues. Hydrogen bonding can occur by Watson-Crick base pairing, Hoogstein bonding, or in any other sequence-specific manner. The complex may include two strands forming a duplex structure, three or more strands forming a multi-stranded complex, a single self-hybridizing strand, or any combination thereof. A hybridization reaction may constitute a step in a broader process, such as initiation of a PCR reaction or enzymatic cleavage of a polynucleotide by a ribozyme.

엄격한 혼성화 조건(stringent hybridization conditions)의 예는 약 25℃ 내지 약 37℃의 인큐베이션 온도; 약 6×SSC 내지 약 10×SSC의 혼성화 완충제 농도; 약 0% 내지 약 25%의 포름아미드 농도; 및 약 4×SSC 내지 약 8×SSC의 세척 용액을 포함한다. 적당한 혼성화 조건(moderate hybridization conditions)의 예는 약 40℃ 내지 약 50℃의 인큐베이션 온도; 약 9×SSC 내지 약 2×SSC의 혼성화 완충제 농도; 약 30% 내지 약 50%의 포름아미드 농도; 및 약 5×SSC 내지 약 2×SSC의 세척 용액을 포함한다. 매우 엄격한 혼성화 조건(high stringency conditions)의 예는 약 55℃ 내지 약 68℃의 인큐베이션 온도; 약 1×SSC 내지 약 0.1×SSC의 혼성화 완충제 농도; 약 55% 내지 약 75%의 포름아미드 농도; 및 약 1×SSC, 0.1×SSC의 세척 용액, 또는 탈이온수를 포함한다. 일반적으로, 혼성화 인큐베이션 시간은 1회, 2회 또는 그 초과의 세척 단계로 5분 내지 24시간이고, 세척 인큐베이션 시간은 약 1, 2 또는 15분이다. SSC는 0.15 M NaCl 및 15 mM 시트레이트 완충액이다. 다른 완충제 시스템을 사용하는 SSC의 등가물이 사용될 수 있는 것으로 이해되어야 한다. 한 측면에서, 동등한 폴리뉴클레오티드는 엄격한 조건 하에 참조 폴리뉴클레오티드 또는 그의 상보체에 혼성화하는 것이다. 또 다른 측면에서, 동등한 폴리펩티드는 엄격한 조건 하에 참조 폴리뉴클레오티드 또는 그의 상보체에 혼성화하는 폴리뉴클레오티드에 의해 암호화되는 폴리펩티드이다.Examples of stringent hybridization conditions include an incubation temperature of about 25° C. to about 37° C.; a hybridization buffer concentration of about 6xSSC to about 10xSSC; a formamide concentration of about 0% to about 25%; and a washing solution from about 4xSSC to about 8xSSC. Examples of suitable hybridization conditions include an incubation temperature of about 40° C. to about 50° C.; a hybridization buffer concentration of about 9xSSC to about 2xSSC; a formamide concentration of about 30% to about 50%; and about 5xSSC to about 2xSSC wash solution. Examples of high stringency conditions include an incubation temperature of about 55° C. to about 68° C.; a hybridization buffer concentration of about 1 x SSC to about 0.1 x SSC; a formamide concentration of about 55% to about 75%; and a washing solution of about 1xSSC, 0.1xSSC, or deionized water. Generally, the hybridization incubation time is from 5 minutes to 24 hours with one, two or more wash steps, and the wash incubation time is about 1, 2 or 15 minutes. SSC is 0.15 M NaCl and 15 mM citrate buffer. It should be understood that equivalents of SSC using other buffer systems may be used. In one aspect, an equivalent polynucleotide is one that hybridizes under stringent conditions to a reference polynucleotide or its complement. In another aspect, an equivalent polypeptide is a polypeptide encoded by a polynucleotide that hybridizes under stringent conditions to a reference polynucleotide or its complement.

본원에서 사용되는 용어 "발현"은 폴리뉴클레오티드가 mRNA로 전사되는 과정 및/또는 전사된 mRNA가 후속적으로 펩티드, 폴리펩티드 또는 단백질로 번역되는 과정을 지칭한다. 폴리뉴클레오티드가 게놈 DNA로부터 유래되는 경우, 발현은 진핵 세포에서 mRNA의 스플라이싱을 포함할 수 있다.As used herein, the term “expression” refers to the process by which a polynucleotide is transcribed into mRNA and/or the transcribed mRNA is subsequently translated into a peptide, polypeptide or protein. When the polynucleotide is derived from genomic DNA, expression may include splicing of mRNA in eukaryotic cells.

본원에서 사용되는 용어 "기능성"은 임의의 분자, 생물학적, 또는 세포성 물질을 특정의 특정 효과를 달성하는 것을 의도하도록 변형시키기 위해 사용될 수 있다. As used herein, the term "functional" can be used to modify any molecular, biological, or cellular material with the intention of achieving a particular particular effect.

본원에서 사용되는 용어 "핵산 서열" 및 "폴리뉴클레오티드"는 리보뉴클레오티드 또는 데옥시리보뉴클레오티드인 임의의 길이의 뉴클레오티드의 중합체 형태를 지칭하도록 상호교환가능하게 사용된다. 따라서 상기 용어는 단일-, 이중-, 또는 다중-가닥 DNA 또는 RNA, 게놈 DNA, 상보성 DNA(cDNA), DNA-RNA 혼성, 또는 퓨린 및 피리미딘 염기 또는 다른 천연, 화학적으로 또는 생화학적으로 변형된, 비-천연, 또는 유도체화된 뉴클레오티드 염기를 포함하는 중합체를 포함하나 이에 제한되지는 않는다. 특정 구현예에서, 폴리뉴클레오티드는 메신저 RNA(mRNA), 짧은 헤어핀(short hairpin) RNA, 및/또는 작은 헤어핀(small hairpin) RNA를 포함하고/하거나 암호화한다. 한 구현예에서, 폴리뉴클레오티드는 mRNA이거나 이를 암호화한다. 특정 구현예에서, 폴리뉴클레오티드는 이중-가닥(ds) DNA, 예를 들어 단일-가닥 RNA로부터 합성된 조작된 ds DNA 또는 ds cDNA이다. As used herein, the terms "nucleic acid sequence" and "polynucleotide" are used interchangeably to refer to a polymeric form of nucleotides of any length, either ribonucleotides or deoxyribonucleotides. Accordingly, the term refers to single-, double-, or multi-stranded DNA or RNA, genomic DNA, complementary DNA (cDNA), DNA-RNA hybrids, or purine and pyrimidine bases or other natural, chemically or biochemically modified , polymers comprising non-natural, or derivatized nucleotide bases. In certain embodiments, a polynucleotide comprises and/or encodes messenger RNA (mRNA), short hairpin RNA, and/or small hairpin RNA. In one embodiment, the polynucleotide is or encodes an mRNA. In certain embodiments, the polynucleotide is double-stranded (ds) DNA, eg engineered ds DNA or ds cDNA synthesized from single-stranded RNA.

용어 "단백질", "펩티드" 및 "폴리펩티드"는 상호교환가능하게 사용되고, 그의 가장 넓은 의미에서 아미노산, 아미노산 유사체 또는 펩티드모방체(peptidomimetics)의 2개 이상의 서브유닛의 화합물을 지칭한다. 서브유닛은 펩티드 결합에 의해 연결될 수 있다. 또 다른 측면에서, 서브유닛은 다른 결합, 예를 들어 에스테르, 에테르 등에 의해 연결될 수 있다. 단백질 또는 펩티드는 적어도 2개의 아미노산을 함유하여야 하고, 단백질 또는 펩티드의 서열을 포함할 수 있는 아미노산의 최대 수에 어떠한 제한도 두지 않는다. 본원에서 사용되는 용어 "아미노산"은 글리신 및 D 및 L 광학 이성질체 둘 다, 아미노산 유사체 및 펩티드모방체를 포함한 천연 및/또는 비천연 또는 합성 아미노산을 지칭한다.The terms "protein", "peptide" and "polypeptide" are used interchangeably and in their broadest sense refer to compounds of two or more subunits of amino acids, amino acid analogs or peptidomimetics. Subunits may be linked by peptide bonds. In another aspect, the subunits may be linked by other linkages, such as esters, ethers, and the like. A protein or peptide must contain at least 2 amino acids, and there is no limit to the maximum number of amino acids that a protein or peptide sequence can contain. As used herein, the term “amino acid” refers to natural and/or unnatural or synthetic amino acids, including glycine and both D and L optical isomers, amino acid analogs and peptidomimetics.

본원에 사용되는 연속 아미노산 서열은 적어도 2개의 아미노산을 갖는 서열을 지칭한다. 그러나, 제 1 부분 및 제 2 부분의 연속 아미노산 서열은 제 1 부분이 제 2 부분에 직접적으로 접합된 것을 갖도록 아미노산 서열을 제한하지 않는다는 것에 주목한다. 제 1 부분이 제 3 부분, 예를 들어 연결(link)을 통해 제 2 부분에 연결되어, 이에 따라 1개의 연속 아미노산 서열을 형성하는 것이 또한 가능하다. A contiguous amino acid sequence as used herein refers to a sequence having at least two amino acids. Note, however, that the contiguous amino acid sequence of the first part and the second part does not limit the amino acid sequence such that the first part is directly conjugated to the second part. It is also possible for a first part to be connected to a third part, eg to a second part via a link, thus forming one contiguous amino acid sequence.

본원에서 사용되는 용어 "접합체", "접합된", "접합하는", 및 "접합"은 분자 사이의, 특히 2개의 아미노산 서열 및/또는 2개의 폴리펩티드 사이의 결합의 형성을 지칭한다. 접합은 직접적(즉, 결합) 또는 간접적(즉, 추가의 분자를 통해)일 수 있다. 접합은 공유적 또는 비-공유적일 수 있다.As used herein, the terms “conjugate,” “conjugated,” “conjugating,” and “conjugation” refer to the formation of a linkage between molecules, particularly between two amino acid sequences and/or two polypeptides. Conjugation can be direct (ie, binding) or indirect (ie, through an additional molecule). Conjugation can be covalent or non-covalent.

본원에 사용되는 연속 아미노산 서열은 서로 직접적으로 또는 간접적으로 (예를 들어 링커(linker)를 통해) 접합된 2개 이상의 폴리펩티드를 포함할 수 있다.As used herein, a contiguous amino acid sequence may include two or more polypeptides joined directly or indirectly (eg, via a linker) to each other.

본원에서 사용되는 용어 "재조합 발현 시스템"은 재조합에 의해 형성된 특정 유전 물질의 발현을 위한 유전자 구조체(genetic construct)(들)를 지칭한다.As used herein, the term "recombinant expression system" refers to a genetic construct(s) for the expression of specific genetic material formed by recombination.

"유전자 전달 비히클"은 삽입된 폴리뉴클레오티드를 숙주 세포 내로 운반할 수 있는 임의의 분자로서 정의된다. 유전자 전달 비히클의 예는 리포솜, 천연 중합체 및 합성 중합체를 비롯한 생체적합성 중합체인 미셀; 지질단백질; 지질 나노입자; 폴리펩티드; 다당류; 리포폴리사카라이드; 인공 바이러스 외피; 금속 입자; 및 박테리아, 또는 바이러스, 예를 들어 광견병 바이러스, 플라비바이러스, 렌티바이러스, 배큘로바이러스, 아데노바이러스 및 레트로바이러스, 박테리오파지, 코스미드, 플라스미드, 진균 벡터 및 다양한 진핵 및 원핵 숙주에서의 발현에 대해 기재된 당업계에서 전형적으로 사용되는 다른 재조합 비히클이며, 유전자 요법 뿐만 아니라 단순 단백질 발현에 사용될 수 있다.A “gene transfer vehicle” is defined as any molecule capable of delivering an inserted polynucleotide into a host cell. Examples of gene delivery vehicles include micelles, which are biocompatible polymers including liposomes, natural and synthetic polymers; lipoprotein; lipid nanoparticles; polypeptide; polysaccharides; lipopolysaccharide; artificial viral envelope; metal particles; and bacteria, or viruses such as rabies viruses, flaviviruses, lentiviruses, baculoviruses, adenoviruses and retroviruses, bacteriophages, cosmids, plasmids, fungal vectors, and expression in a variety of eukaryotic and prokaryotic hosts. It is another recombinant vehicle typically used in the art and can be used for simple protein expression as well as gene therapy.

본원에 개시된 폴리뉴클레오티드는 유전자 전달 비히클을 사용하여 세포 또는 조직에 전달될 수 있다. 본원에 사용되는 "유전자 전달(gene delivery)", "유전자 전이(gene transfer)", "mRNA-기반 전달", "형질도입(transducing)" 등은 도입에 사용되는 방법에 관계없이, 숙주 세포 내로의 외인성 폴리뉴클레오티드 (때때로 "전이유전자"로 지칭됨)의 도입을 지칭하는 용어이다. 이러한 방법은 다양하게 널리 공지된 기술, 예를 들어 벡터-매개 유전자 전달 (예를 들어 바이러스 감염/형질주입(transfection), 또는 예를 들어 프로타민 복합체, 지질 나노입자, 중합체 나노입자, 지질-중합체 혼성 나노입자, 및 무기 나노입자, 또는 그의 조합을 비롯한 다양한 다른 단백질-기반 또는 지질-기반 유전자 전달 복합체에 의한 것) 뿐만 아니라 "네이키드(naked)" 폴리뉴클레오티드의 전달을 용이하게 하는 기술 (예를 들어 전기천공, "유전자 총" 전달 및 폴리뉴클레오티드의 도입에 사용되는 다양한 다른 기술)을 포함한다. 도입된 폴리뉴클레오티드는 변형되지 않을 수 있거나 또는 하나 이상의 변형을 포함할 수 있고; 예를 들어 변형된 mRNA는 ARCA 캡핑; 100-250 아데노신 잔기(서열번호 22)의 테일(tail)을 부가하기 위한 효소적 폴리아데닐화; 및 시티딘 중의 하나 또는 둘 다의 5-메틸시티딘 및/또는 우리딘의 슈도우리딘(pseudouridine)으로의 치환을 포함할 수 있다. 도입된 폴리뉴클레오티드는 숙주 세포에서 안정적으로 또는 일시적으로 유지될 수 있다. 안정한 유지는 전형적으로 도입된 폴리뉴클레오티드가 숙주 세포와 상용성인 복제기점을 함유하거나 또는 숙주 세포의 레플리콘, 예를 들어 염색체외 레플리콘(extrachromosomal replicon) (예를 들어 플라스미드) 또는 핵 또는 미토콘드리아 염색체 내로 통합되는 것을 필요로 한다. 다수의 벡터는 당업계에 공지되고 본원에 기재된 바와 같이 포유동물 세포로의 유전자의 전달을 매개할 수 있는 것으로 공지되어 있다.The polynucleotides disclosed herein can be delivered to cells or tissues using gene delivery vehicles. As used herein, “gene delivery,” “gene transfer,” “mRNA-based delivery,” “transducing,” and the like refer to any method used for introduction into a host cell. A term that refers to the introduction of an exogenous polynucleotide (sometimes referred to as a “transgene”) of Such methods may be accomplished using a variety of well-known techniques, such as vector-mediated gene delivery (eg viral infection/transfection, or eg protamine complexes, lipid nanoparticles, polymer nanoparticles, lipid-polymer hybridization). nanoparticles, and inorganic nanoparticles, or a variety of other protein-based or lipid-based gene delivery complexes including combinations thereof) as well as technologies that facilitate the delivery of "naked" polynucleotides (eg eg electroporation, "gene gun" delivery, and various other techniques used for introduction of polynucleotides). An introduced polynucleotide may be unmodified or may contain one or more modifications; For example, modified mRNAs can be ARCA capped; enzymatic polyadenylation to add a tail of 100-250 adenosine residues (SEQ ID NO: 22); and substitution of one or both of the cytidines with 5-methylcytidine and/or uridine with pseudouridine. An introduced polynucleotide may be stably or transiently maintained in a host cell. Stable maintenance typically requires that the introduced polynucleotide contains an origin of replication compatible with the host cell or a replicon of the host cell, such as an extrachromosomal replicon (eg a plasmid) or a nuclear or mitochondrial replicon. It requires integration into a chromosome. A number of vectors are known in the art and capable of mediating the transfer of genes into mammalian cells as described herein.

"플라스미드"는 염색체 DNA와는 독립적으로 복제할 수 있는 염색체 DNA로부터 분리된 염색체외 DNA 분자이다. 많은 경우에, 이는 원형 및 이중-가닥이다. 플라스미드는 미생물의 집단 내에서의 수평 유전자 전달을 위한 메카니즘을 제공하고, 전형적으로 주어진 환경 상태 하에 선택적 이점을 제공한다. 플라스미드는 경쟁적 환경 니치(competitive environmental niche)에서 자연 발생 항생제에 대한 내성을 제공하는 유전자를 운반할 수 있거나, 또는 대안적으로 생성된 단백질은 유사한 환경 하에 독소로서 작용할 수 있다.A "plasmid" is an extrachromosomal DNA molecule isolated from chromosomal DNA that is capable of replicating independently of chromosomal DNA. In many cases, they are circular and double-stranded. Plasmids provide a mechanism for horizontal gene transfer within a population of microorganisms and typically provide a selective advantage under given environmental conditions. Plasmids can carry genes conferring resistance to naturally occurring antibiotics in a competitive environmental niche, or alternatively the resulting protein can act as a toxin under similar circumstances.

유전자 조작에 사용되는 "플라스미드"는 "플라스미드 벡터"로 지칭된다. 많은 플라스미드가 이러한 용도를 위해 상업적으로 이용가능하다. 복제될 유전자는 특정 항생제 및 다중 클로닝 부위(MCS, 또는 폴리링커)에 내성인 세포를 만드는 유전자를 함유하는 플라스미드의 카피 내로 삽입되며, 이는 이러한 위치에서 DNA 단편의 용이한 삽입을 가능하게 하는 몇몇 통상적으로 사용되는 제한 부위를 함유하는 짧은 영역이다. 플라스미드의 또 다른 주요 용도는 다량의 단백질을 만드는 것이다. 이러한 경우에, 연구자들은 관심 유전자를 보유하는 플라스미드를 함유하는 박테리아를 성장시킨다. 박테리아가 그의 항생제 내성을 부여하는 단백질을 생산하는 것과 마찬가지로, 삽입된 유전자로부터 다량의 단백질을 생산하도록 유도될 수도 있다.A "plasmid" used for genetic engineering is referred to as a "plasmid vector". Many plasmids are commercially available for this use. The gene to be cloned is inserted into a copy of a plasmid containing the gene that renders the cell resistant to certain antibiotics and multiple cloning sites (MCS, or polylinkers), which allows for the facile insertion of DNA fragments at these locations. is a short region containing restriction sites used as Another major use of plasmids is to make large amounts of protein. In this case, researchers grow bacteria containing a plasmid carrying the gene of interest. Just as bacteria produce proteins that confer their antibiotic resistance, they can also be induced to produce large amounts of proteins from inserted genes.

"효모 인공 염색체(yeast artificial chromosome)" 또는 "YAC"는 큰 DNA 단편(100 kb 초과 및 3000 kb 이하)을 클로닝하기 위해 사용되는 벡터를 지칭한다. 이는 인공적으로 구축된 염색체이며, 효모 세포에서 복제 및 보존을 위해 필요한 텔로머(telomeric), 센트로메릭(centromeric), 및 복제 기원 서열을 함유한다. 초기 원형 플라스미드를 사용하는 빌딩은 제한 효소를 사용함으로써 선형화되고, 이어서 DNA 리가제(ligase)는 응집성 말단의 사용에 의해 선형 분자 내에 관심 서열 또는 유전자를 부가할 수 있다. 효모 발현 벡터, 예를 들어 YACs, YIps(효모 통합 플라스미드), 및 YEps(효모 에피솜 플라스미드)는 효모가 그 자체로 진핵 세포이기 때문에 번역후 변형을 갖는 진핵 단백질 생성물을 얻을 수 있으므로 매우 유용하지만, YAC는 BAC보다 더 불안정한 것으로 밝혀져서, 키메라 효과를 생성한다."Yeast artificial chromosome" or "YAC" refers to vectors used for cloning large DNA fragments (greater than 100 kb and less than 3000 kb). It is an artificially constructed chromosome and contains telomeric, centromeric, and origin of replication sequences necessary for replication and preservation in yeast cells. Building using an initial circular plasmid is linearized using restriction enzymes, then DNA ligase can add the sequence or gene of interest into the linear molecule by use of cohesive ends. Yeast expression vectors such as YACs, YIps (yeast integrating plasmids), and YEps (yeast episomal plasmids) are very useful because yeast is itself a eukaryotic cell, allowing eukaryotic protein products with post-translational modifications to be obtained, YAC was found to be more labile than BAC, producing a chimeric effect.

본원에서 사용되는 용어 "바이러스 캡시드" 또는 "캡시드"는 바이러스 입자의 단백질성 쉘 또는 코트를 지칭한다. 캡시드는 바이러스 게놈을 숙주 세포 내로 캡시드화, 보호, 수송, 및 방출하는 기능을 한다. 캡시드는 일반적으로 단백질의 올리고머 구조 서브유닛("캡시드 단백질")으로 구성된다. 본원에서 사용되는 용어 "캡시드화된"은 바이러스 캡시드 내에 봉입된 것을 의미한다.As used herein, the term “viral capsid” or “capsid” refers to the proteinaceous shell or coat of a viral particle. The capsid functions to encapsidate, protect, transport, and release the viral genome into the host cell. Capsids are generally composed of oligomeric structural subunits of proteins ("capsid proteins"). As used herein, the term “encapsidated” means enclosed within a viral capsid.

본원에서 바이러스 또는 플라스미드와 관련하여 사용되는 용어 "헬퍼(helper)"는 바이러스 입자 또는 재조합 바이러스 입자, 예를 들어 본원에 개시된 변형된 AAV의 복제 및 패키징을 위해 필요한 추가의 성분을 제공하기 위해 사용되는 바이러스 또는 플라스미드를 지칭한다. 헬퍼 바이러스에 의해 암호화되는 성분은 비리온 어셈블리(virion assembly), 캡시드화, 게놈 복제, 및/또는 패키징을 위해 요구되는 임의의 유전자를 포함할 수 있다. 예를 들어 헬퍼 바이러스는 바이러스 게놈의 복제를 위해 필요한 효소를 암호화할 수 있다. AAV 구조체와 함께 사용하기에 적합한 헬퍼 바이러스 및 플라스미드의 비제한적 예는 pHELP(플라스미드), 아데노바이러스(바이러스), 또는 헤르페스바이러스(바이러스)를 포함한다.As used herein, the term "helper" in reference to a virus or plasmid is used to provide additional components necessary for replication and packaging of a viral particle or recombinant viral particle, e.g., a modified AAV disclosed herein. Refers to a virus or plasmid. Components encoded by helper viruses may include any genes required for virion assembly, encapsidation, genome replication, and/or packaging. For example, helper viruses may encode enzymes necessary for replication of the viral genome. Non-limiting examples of helper viruses and plasmids suitable for use with AAV constructs include pHELP (plasmid), adenovirus (virus), or herpesvirus (virus).

본원에 사용되는 생물학적 샘플, 또는 샘플은 대상체, 세포주 또는 배양된 세포 또는 조직으로부터 수득될 수 있다. 예시적인 샘플은 세포 샘플, 조직 샘플, 액체 샘플, 예를 들어 혈액 및 생물학적 기원의 다른 액체 샘플(안액(ocular fluids)(수성 및 유리액(vitreous humor)), 말초 혈액, 혈청, 혈장, 복수, 소변, 뇌척수액(cerebrospinal fluid; CSF), 객담(sputum), 타액, 골수, 활액, 수성 액, 양수, 뇌척수, 모유, 기관지 폐포 세척액(broncheoalveolar lavage fluid), 정액, 전립선 유체, 소의 유체(cowper's fluid) 또는 쿠퍼액(pre-ejaculatory fluid), 여성 사정액, 땀, 눈물, 낭종 유체, 흉막 및 복막 유체, 심막 유체, 복수, 림프, 라임, 킬(chyle), 담즙, 간질액, 월경, 고름, 피지, 구토, 질 분비물/홍조, 활액, 점막 분비물, 대변 물, 췌장 주스, 부비동 공동으로부터의 세척액, 기관지폐 흡인물, 블라스토실 공동 유체, 또는 제대혈을 포함하나 이에 제한되지는 않는다.A biological sample, or sample, as used herein may be obtained from a subject, cell line, or cultured cell or tissue. Exemplary samples include cell samples, tissue samples, liquid samples such as blood and other liquid samples of biological origin (ocular fluids (aqueous and vitreous humor), peripheral blood, serum, plasma, ascites, Urine, cerebrospinal fluid (CSF), sputum, saliva, bone marrow, synovial fluid, aqueous fluid, amniotic fluid, cerebrospinal fluid, breast milk, broncheoalveolar lavage fluid, semen, prostate fluid, cow's fluid or pre-ejaculatory fluid, female ejaculatory fluid, sweat, tears, cyst fluid, pleural and peritoneal fluid, pericardial fluid, ascites, lymph, lime, chyle, bile, interstitial fluid, menstruation, pus, sebum, include but are not limited to vomiting, vaginal discharge/flushing, synovial fluid, mucosal secretions, feces, pancreatic juice, lavage fluid from the sinus cavity, bronchopulmonary aspirate, blastosyl cavity fluid, or umbilical cord blood.

본원에서 사용되는 용어 "검출가능한 마커"는 검출가능한 신호를 직접적으로 또는 간접적으로 생성할 수 있는 적어도 1종의 마커를 지칭한다. 이러한 마커의 비-포괄적 목록은 예를 들어 비색측정, 형광, 발광, 예를 들어 양고추냉이 퍼옥시다제(horseradish peroxidase), 알칼리성 포스파타제, β-갈락토시다제, 글루코스6 포스페이트 탈수소효소, 발색단, 예를 들어 형광, 발광 염료, 전자 현미경에 의해 또는 그의 전기적 특성, 예를 들어 전도도, 전류측정, 전압전류측정, 임피던스에 의해 검출가능한 신호를 생성하는 효소를 포함하며, 예를 들어 그의 분자가 그의 물리적 및/또는 화학적 특성에서 검출가능한 변형을 유도하기에 충분한 크기를 갖는 검출가능한 기는 광학 방법, 예를 들어 회절, 표면 플라즈몬 공명, 표면 변동, 접촉각 변화 또는 물리적 방법, 예를 들어 원자력 분광법, 터널 효과, 또는 방사성 분자, 예를 들어 32P, 35S, 89Zr 또는 ¹²⁵I에 의해 달성될 수 있다.As used herein, the term “detectable marker” refers to at least one marker capable of directly or indirectly producing a detectable signal. A non-exhaustive list of such markers includes, for example, colorimetric, fluorescence, luminescence, e.g., horseradish peroxidase, alkaline phosphatase, β-galactosidase, glucose 6 phosphate dehydrogenase, chromophore, It includes, for example, an enzyme that produces a signal detectable by fluorescence, luminescent dyes, electron microscopy or by its electrical properties, such as conductivity, amperometry, voltammetry, impedance, e.g. a molecule thereof A detectable group having a size sufficient to induce a detectable modification in physical and/or chemical properties is selected from optical methods such as diffraction, surface plasmon resonance, surface fluctuations, contact angle changes or physical methods such as atomic force spectroscopy, tunnel effect. , or by radioactive molecules such as 32P, 35S, 89Zr or ¹²⁵ I.

본원에서 사용되는 용어 "정제 마커"는 정제 또는 확인에 유용한 적어도 1종의 마커를 지칭한다. 이러한 마커의 비-포괄적 목록은 His, lacZ, GST, 말토스-결합 단백질, NusA, BCCP, c-myc, CaM, FLAG, GFP, YFP, 체리, 티오레독신, 폴리(NANP), V5, Snap, HA, 키틴-결합 단백질, Softag 1, Softag 3, Strep, 또는 S-단백질을 포함한다. 적합한 직접 또는 간접 형광 마커는 FLAG, GFP, YFP, RFP, dTomato, 체리, Cy3, Cy 5, Cy 5.5, Cy 7, DNP, AMCA, 비오틴, 디곡시제닌(Digoxigenin), 타마(Tamra), 텍사스 레드(Texas Red), 로다민(rhodamine), 알렉사 플루오르(Alexa fluors), FITC, TRITC 또는 임의의 다른 형광 염료 또는 합텐(hapten)을 포함한다.As used herein, the term “purification marker” refers to at least one marker useful for purification or identification. A non-exhaustive list of these markers is His, lacZ, GST, maltose-binding protein, NusA, BCCP, c-myc, CaM, FLAG, GFP, YFP, cherry, thioredoxin, poly(NANP), V5, Snap , HA, chitin-binding protein, Softag 1, Softag 3, Strep, or S-protein. Suitable direct or indirect fluorescent markers include FLAG, GFP, YFP, RFP, dTomato, Cherry, Cy3, Cy 5, Cy 5.5, Cy 7, DNP, AMCA, Biotin, Digoxigenin, Tamra, Texas Red (Texas Red), rhodamine, Alexa fluors, FITC, TRITC or any other fluorescent dye or hapten.

본원에 사용된 에피토프 태그는 항체에 의해 인식되는 항원으로서 작용하는 생물학적 구조 또는 서열, 예를 들어 단백질 또는 탄수화물이다. 특정 구현예에서, 에피토프 태그는 정제 마커 및/또는 친화성 태그와 상호교환가능하게 사용된다.As used herein, an epitope tag is a biological structure or sequence that serves as an antigen recognized by an antibody, such as a protein or carbohydrate. In certain embodiments, epitope tags are used interchangeably with purification markers and/or affinity tags.

"조성물"은 2종 이상의 화합물의 조합물, 예를 들어 활성 폴리펩티드, 폴리뉴클레오티드, 바이러스 벡터 또는 항체 및 또 다른 화합물 또는 조성물, 불활성(예를 들어 검출가능한 표지) 또는 활성(예를 들어 유전자 전달 비히클)의 조합물을 의미하는 것으로 의도된다.A "composition" is a combination of two or more compounds, e.g., active polypeptides, polynucleotides, viral vectors or antibodies, and another compound or composition, inactive (e.g., detectable label) or active (e.g., gene transfer vehicle). ) is intended to mean a combination of

"약학 조성물"은 활성 폴리펩티드, 폴리뉴클레오티드 또는 항체와 담체, 불활성 또는 활성, 예를 들어 고체 지지체의 조합물을 포함하는 것으로 의도되며, 이는 조성물을 시험관내, 생체내 또는 생체외에서 진단 또는 치료 용도에 적합하게 만든다."Pharmaceutical composition" is intended to include a combination of an active polypeptide, polynucleotide or antibody with a carrier, inactive or active, eg a solid support, which is intended to include the composition for diagnostic or therapeutic use in vitro, in vivo or ex vivo. make it fit

본원에서 사용되는 용어 "약학적으로 허용가능한 담체"는 표준 약학적 담체 중의 임의의 것, 예를 들어 포스페이트 완충 염수 용액, 물, 및 에멀젼, 예를 들어 오일/물 또는 물/오일 에멀젼, 및 다양한 유형의 습윤제를 포괄한다. 조성물은 또한 안정화제 및 보존제를 포함할 수 있다. 담체, 안정화제 및 아주반트의 예는 문헌 「Martin (1975) Remington's Pharm Sci, 15th Ed (Mack Publ Co, Easton)」을 참조한다.As used herein, the term “pharmaceutically acceptable carrier” refers to any of the standard pharmaceutical carriers, such as phosphate buffered saline solution, water, and emulsions such as oil/water or water/oil emulsions, and various Covers types of humectants. The composition may also include stabilizers and preservatives. For examples of carriers, stabilizers and adjuvants, see Martin (1975) Remington's Pharm Sci, 15th Ed (Mack Publ Co, Easton).

"대상체", "개체" 또는 "환자"는 본원에서 상호교환가능하게 사용되며, 척추동물, 바람직하게는 포유동물, 보다 바람직하게는 인간을 지칭한다. 포유동물은 비-인간 영장류, 뮤린, 래트, 토끼, 원숭이, 소, 양, 돼지, 개, 고양이, 농장 동물, 스포츠 동물, 애완동물, 말, 및 영장류, 특히 인간을 포함하나, 이에 제한되지는 않는다. 인간 치료에 유용한 것 이외에도, 본 발명은 또한 포유동물, 설치류 등을 비롯한 반려 포유동물, 외래 동물 및 가축의 수의학적 치료에 유용하다. 한 구현예에서, 포유동물은 말, 개 및 고양이를 포함한다. 본 발명의 또 다른 구현예에서, 인간은 18세 연령 미만의 청소년 또는 유아이다."Subject", "individual" or "patient" are used interchangeably herein and refer to a vertebrate, preferably a mammal, more preferably a human. Mammals include, but are not limited to, non-human primates, murines, rats, rabbits, monkeys, cows, sheep, pigs, dogs, cats, farm animals, sport animals, pets, horses, and primates, particularly humans. don't In addition to being useful for human treatment, the present invention is also useful for veterinary treatment of companion mammals, exotic animals and livestock, including mammals, rodents, and the like. In one embodiment, mammals include horses, dogs and cats. In another embodiment of the invention, the human is an adolescent or toddler under the age of 18 years.

질환의 "치료함" 또는 "치료"는, (1) 질환에 걸리기 쉬울 수 있지만 아직 질환의 증상을 경험하거나 나타내지 않는 환자에서 질환을 예방하는 것, 즉 질환의 임상적 증상이 발생하지 않도록 하는 것; (2) 질환을 억제하는 것, 즉 질환 또는 그의 임상적 증상의 발생을 저지 또는 감소시키는 것; 또는 (3) 질환을 완화시키는 것, 즉 질환 또는 그의 임상적 증상의 퇴행을 유발하는 것을 포함한다. 한 측면에서, 용어 "치료"는 예방(prevention) 또는 예방(prophylaxis)을 배제한다."Treatment" or "treatment" of a disease means (1) preventing a disease in a patient who may be predisposed to the disease but who is not yet experiencing or displaying symptoms of the disease, i.e., preventing clinical symptoms of the disease from occurring. ; (2) inhibiting the disease, ie arresting or reducing the occurrence of the disease or its clinical symptoms; or (3) alleviating the disease, ie causing regression of the disease or its clinical symptoms. In one aspect, the term "treatment" excludes prevention or prophylaxis.

용어 "치료"와 관련하여 용어 "앓는(suffering)"은 질환에 걸렸거나 걸리기 쉬운 것으로 진단된 환자 또는 개체를 지칭한다.The term "suffering" in relation to the term "treatment" refers to a patient or individual who has been diagnosed as suffering from or susceptible to a disease.

"유효량"은 유익한 또는 원하는 결과를 달성하기에 충분한 양이다. 유효량은 1회 이상의 투여, 적용 또는 투여량(dosage)으로 투여될 수 있다. 이러한 전달은 개별 투여 단위가 사용되는 기간, 치료제의 생체이용률, 투여 경로 등을 비롯한 다수의 변수에 따라 달라진다. 그러나, 임의의 특정 대상체에 대한 본 발명의 치료제의 특정 용량 수준은 사용되는 특정 화합물의 활성, 대상체의 연령, 체중, 일반적인 건강, 성별, 및 식이, 투여 시간, 배출 속도, 약물 조합, 및 치료되는 특정 장애의 중증도 및 투여 형태를 비롯한 다양한 인자에 따라 달라진다는 것으로 이해되어야 한다. 치료 투여량은 일반적으로 안전성 및 효능을 최적화하기 위해 적정될 수 있다. 한 측면에서, 유효량은 치료 유효량이다. 전형적으로, 시험관내 및/또는 생체내 시험으로부터의 용량-효과 관계는 초기에 환자 투여를 위한 적절한 투여량에 대한 유용한 지침을 제공할 수 있다. 일반적으로, 시험관내에서 효과적인 것으로 밝혀진 농도와 상응하는 혈청 수준을 달성하기에 효과적인 화합물의 양을 투여하고자 할 것이다. 이들 파라미터의 결정은 관련 기술분야의 통상의 기술 내에 속한다. 이들 고려사항, 뿐만 아니라 효과적인 제제 및 투여 절차는 관련 기술분야에 널리 공지되어 있고, 표준 교재에 기재되어 있다. 본원에 사용된 바와 같은 이러한 정의와 일치하게, 용어 "치료 유효량"은 RNA 바이러스 복제를 생체외, 시험관내 또는 생체내에서 억제하기에 충분한 양이다.An "effective amount" is an amount sufficient to achieve beneficial or desired results. An effective amount can be administered in one or more administrations, applications or dosages. Such delivery depends on a number of variables, including the length of time the individual dosage unit is used, the bioavailability of the therapeutic agent, the route of administration, and the like. However, the specific dosage level of a therapeutic agent of the present invention for any particular subject depends on the activity of the particular compound employed, the age, weight, general health, sex, and diet of the subject, the time of administration, rate of excretion, drug combination, and the combination of drugs being treated. It should be understood that this will depend on a variety of factors including the severity of the particular disorder and the form of administration. Therapeutic dosages can generally be titrated to optimize safety and efficacy. In one aspect, an effective amount is a therapeutically effective amount. Typically, dose-effect relationships from in vitro and/or in vivo studies can initially provide useful guidance on appropriate dosages for patient administration. Generally, one will attempt to administer an amount of compound effective to achieve serum levels commensurate with the concentration found to be effective in vitro. Determination of these parameters is within the ordinary skill in the art. These considerations, as well as effective formulations and administration procedures, are well known in the art and described in standard textbooks. Consistent with this definition as used herein, the term "therapeutically effective amount" is an amount sufficient to inhibit RNA viral replication ex vivo, in vitro or in vivo.

용어 투여는 비제한적으로 경구, 비경구(예를 들어 근육내, 복강내, 정맥내, ICV, 수조내 주사 또는 주입, 피하 주사, 또는 임플란트), 흡입 스프레이 비강, 질, 직장, 설하, 요도(예를 들어 요도 좌제) 또는 국소 투여 경로(예를 들어 겔, 연고, 크림, 에어로졸 등)에 의한 투여를 포함할 것이고, 단독으로 또는 함께, 각각의 투여 경로에 적절한 통상적인 무-독성(non-toxic) 약학적으로 허용가능한 담체, 아주반트(adjuvants), 부형제, 및 비히클을 함유하는 적합한 투여량 단위 제제(dosage unit formulations)로 제제화될 수 있다. 본 발명은 투여 경로, 제제 또는 투여 스케줄에 의해 제한되지 않는다.The term administration includes, but is not limited to oral, parenteral (eg intramuscular, intraperitoneal, intravenous, ICV, intracisternal injection or infusion, subcutaneous injection, or implant), inhalation spray nasal, vaginal, rectal, sublingual, urethral ( eg urethral suppositories) or by topical routes of administration (eg gels, ointments, creams, aerosols, etc.), alone or in combination, conventional non-toxic (non-toxic) appropriate for each route of administration. toxic) in suitable dosage unit formulations containing pharmaceutically acceptable carriers, adjuvants, excipients, and vehicles. The present invention is not limited by route of administration, formulation or schedule of administration.

본원에서 사용되는 용어 "AAV"는 아데노-연관 바이러스에 대한 표준 약어이다. 아데노-연관 바이러스는 특정 기능이 보조인자 바이러스를 공동-감염시킴으로써 제공되는 세포에서만 성장하는 단일-가닥 DNA 파보바이러스이다. 현재, 특성화된 AAV의 13종의 혈청형이 존재한다. AAV의 일반적 정보 및 검토는 예를 들어 문헌 「Carter, Handbook of Parvoviruses 1:169-228, 1989」 및 「Berns, Virology 1743-1764, 1999」에서 찾아볼 수 있다. 그러나, 이들 동일한 원리는 다양한 혈청형이 구조적으로 및 기능적으로도 유전자 수준에서 매우 밀접하게 관련되어 있다는 것이 널리 공지되어 있기 때문에 추가의 AAV 혈청형에 적용가능할 것으로 완전히 예상된다. 예를 들어 문헌 「Blacklowe, Parvoviruses and Human Disease 165-174, 1988, J R Pattison, ed」; 및 「Rose, Comprehensive Virology 3:1-61, 1974」을 참조한다. 예를 들어 모든 AAV 혈청형은 명백하게 상동성 rep 유전자에 의해 매개되는 매우 유사한 복제 특성을 나타내며; 모두 AAV2에서 발현되는 것과 같은 3개의 관련된 캡시드 단백질을 갖는다. 관련성의 정도는 게놈의 길이를 따라 혈청형 사이의 광범위한 교차-혼성화를 나타내는 헤테로듀플렉스 분석; 및 "역전된 말단 반복 서열"(ITR)에 상응하는 말단에서 유사한 자기-어닐링 세그먼트의 존재에 의해 추가로 제안된다. 유사한 감염성 패턴은 또한 각각의 혈청형에서 복제 기능이 유사한 조절 제어 하에 있음을 시사한다.As used herein, the term "AAV" is a standard abbreviation for adeno-associated virus. Adeno-associated viruses are single-stranded DNA parvoviruses that grow only in cells whose specific function is served by co-infecting a cofactor virus. Currently, there are 13 serotypes of AAV that have been characterized. General information and reviews of AAV can be found, for example, in Carter, Handbook of Parvoviruses 1:169-228, 1989 and Berns, Virology 1743-1764, 1999. However, these same principles are fully expected to be applicable to additional AAV serotypes as it is well known that the various serotypes are structurally and functionally very closely related at the genetic level. See, eg, Blacklowe, Parvoviruses and Human Disease 165-174, 1988, J R Pattison, ed; and "Rose, Comprehensive Virology 3:1-61, 1974". For example, all AAV serotypes display very similar replication properties mediated by apparently homologous rep genes; All have three related capsid proteins as expressed in AAV2. The degree of relatedness can be assessed by heteroduplex analysis indicating extensive cross-hybridization between serotypes along the length of the genome; and the presence of similar self-annealing segments at the ends corresponding to "inverted terminal repeats" (ITRs). Similar infectivity patterns also suggest that replication function in each serotype is under similar regulatory control.

본원에 사용되는 용어 "AAV 발현 카세트"는 AAV 말단 반복 서열 (ITR)에 의해 플랭킹된 하나 이상의 관심 폴리뉴클레오티드(또는 전이유전자)를 포함하는 뉴클레오티드 서열을 지칭한다. 이러한 AAV 발현 카세트는 rep 및 cap 유전자 생성물을 암호화하고 발현하는 벡터로 형질주입된(transfected) 숙주 세포에 존재할 때 복제되고 감염성 바이러스 입자(예를 들어 AAV 벡터) 내로 패키징될 수 있다.As used herein, the term "AAV expression cassette" refers to a nucleotide sequence comprising one or more polynucleotides (or transgenes) of interest flanked by AAV terminal repeat sequences (ITRs). Such AAV expression cassettes can be replicated and packaged into infectious viral particles (eg AAV vectors) when present in host cells transfected with vectors encoding and expressing the rep and cap gene products.

"AAV 비리온" 또는 "AAV 벡터" 또는 "AAV 바이러스 입자" 또는 "AAV 벡터 입자"는 적어도 하나의 AAV 캡시드 단백질 및 캡시드화된 폴리뉴클레오티드 AAV 발현 카세트로 구성된 바이러스 입자를 지칭한다. 입자가 이종 폴리뉴클레오티드(즉, 포유동물 세포에 전달될 전이유전자와 같은 야생형 AAV 게놈 이외의 폴리뉴클레오티드)를 포함하는 경우에, 이는 전형적으로 "AAV 벡터 입자" 또는 간단히 "AAV 벡터"로 지칭된다. 따라서, AAV 벡터 입자의 생산은 반드시 AAV 발현 카세트의 생산을 포함하며, 이는 이러한 플라스미드가 AAV 벡터 입자 내에 함유되기 때문이다."AAV virion" or "AAV vector" or "AAV viral particle" or "AAV vector particle" refers to a viral particle composed of at least one AAV capsid protein and an encapsidated polynucleotide AAV expression cassette. When a particle comprises a heterologous polynucleotide (i.e., a polynucleotide other than the wild-type AAV genome, such as a transgene to be delivered to a mammalian cell), it is typically referred to as an "AAV vector particle" or simply "AAV vector". Thus, production of an AAV vector particle necessarily includes production of an AAV expression cassette, since such plasmid is contained within the AAV vector particle.

아데노-연관 바이러스(AAV)는 복제-결함 파보바이러스이며, 이의 단일-가닥 DNA 게놈은 145 뉴클레오티드 역전된 말단 반복부(ITR)를 포함하여 약 4.7 kb 길이이다. AAV의 다중 혈청형이 존재한다. AAV 혈청형의 게놈의 뉴클레오티드 서열은 공지되어 있다. 예를 들어 AAV 혈청형 2 (AAV2) 게놈의 뉴클레오티드 서열은 문헌 「Ruffing 등, J Gen Virol, 75: 3385-3392 (1994)」에 의해 보정된 바와 같이 문헌 「Srivastava 등, J Virol, 45:555-564 (1983)」에 제시되어 있다. 다른 예로서, AAV-1의 완전한 게놈은 진뱅크(GenBank) 수탁번호 NC_002077에 제공되고; AAV-3의 완전한 게놈은 진뱅크 수탁번호 NC_1829에 제공되며; AAV-4의 완전한 게놈은 진뱅크 수탁번호 NC_001829에 제공되고; AAV-5의 완전한 게놈은 진뱅크 수탁번호 AF085716에 제공되며; AAV-6의 완전한 게놈은 진뱅크 수탁번호 NC_001862에 제공되고; AAV-7 및 AAV-8 게놈의 적어도 일부는 각각 진뱅크 수탁번호 AX753246 및 AX753249에 제공되며(또한 AAV-8에 관한 미국특허 제7,282,199호 및 제7,790,449호 참조); AAV-9 게놈은 문헌 「Gao 등, J Virol, 78:6381-6388 (2004)」에 제공되고; AAV-10 게놈은 문헌 「Mol Ther, 13(1):67-76 (2006)」에 제공되며; AAV-11 게놈은 문헌 「Virology, 330(2):375-383 (2004)」에 제공된다. AAVrh74 혈청형의 클로닝은 문헌 「Rodino-Klapac, 등, Journal of translational medicine 5, 45 (2007)」에 기재되어 있다. 바이러스 DNA 복제(rep), 캡시드화/패키징 및 숙주 세포 염색체 통합을 지시하는 시스-작용 서열이 ITR 내에 함유된다. 3개의 AAV 프로모터(그의 상대적인 지도 위치에 대해 p5, p19, 및 p40로 명명됨)는 rep 및 cap 유전자를 암호화하는 2개의 AAV 내부 오픈 리딩 프레임의 발현을 구동한다. 단일 AAV 인트론의 차별적 스플라이싱(예를 들어 AAV2 뉴클레오티드 2107 및 2227에서)과 커플링된 2개의 rep 프로모터(p5 및 p19)는 rep 유전자로부터 4개의 rep 단백질(rep 78, rep 68, rep 52, 및 rep 40)의 생산을 초래한다. Rep 단백질은 궁극적으로 바이러스 게놈을 복제하는 것을 담당하는 다중 효소 특성을 보유한다. 캡 유전자는 p40 프로모터로부터 발현되고, 이는 3개의 캡시드 단백질 VP1, VP2, 및 VP3을 암호화한다. 대안적 스플라이싱 및 비-컨센서스 번역 개시 부위는 3개의 관련된 캡시드 단백질의 생성을 담당한다. 단일 컨센서스 폴리아데닐화 부위는 AAV 게놈의 지도 위치 95에 위치한다. AAV의 생활 주기 및 유전학은 문헌 「Muzyczka, Current Topics in Microbiology and Immunology, 158: 97-129 (1992)」에서 검토된다.Adeno-associated virus (AAV) is a replication-defective parvovirus, whose single-stranded DNA genome is approximately 4.7 kb in length, including a 145 nucleotide inverted terminal repeat (ITR). Multiple serotypes of AAV exist. The nucleotide sequences of the genomes of AAV serotypes are known. For example, the nucleotide sequence of the AAV serotype 2 (AAV2) genome is described in Srivastava et al., J Virol, 45:555 as corrected by Ruffing et al., J Gen Virol, 75: 3385-3392 (1994). -564 (1983)”. As another example, the complete genome of AAV-1 is provided under GenBank accession number NC_002077; The complete genome of AAV-3 is provided under Genbank Accession No. NC_1829; The complete genome of AAV-4 is provided under Genbank Accession No. NC_001829; The complete genome of AAV-5 is provided under Genbank Accession No. AF085716; The complete genome of AAV-6 is provided under Genbank Accession No. NC_001862; at least portions of the AAV-7 and AAV-8 genomes are provided under GenBank Accession Nos. AX753246 and AX753249, respectively (see also US Pat. Nos. 7,282,199 and 7,790,449 for AAV-8); The AAV-9 genome is provided by Gao et al., J Virol, 78:6381-6388 (2004); The AAV-10 genome is provided in Mol Ther, 13(1):67-76 (2006); The AAV-11 genome is provided in Virology, 330(2):375-383 (2004). Cloning of the AAVrh74 serotype is described in Rodino-Klapac, et al., Journal of translational medicine 5, 45 (2007). Contained within the ITRs are cis-acting sequences that direct viral DNA replication (rep), encapsidation/packaging and host cell chromosomal integration. Three AAV promoters (named p5, p19, and p40 for their relative map positions) drive expression of two AAV internal open reading frames encoding the rep and cap genes. Two rep promoters (p5 and p19) coupled with differential splicing of a single AAV intron (e.g. at AAV2 nucleotides 2107 and 2227) generate four rep proteins (rep 78, rep 68, rep 52, and rep 40). The Rep protein possesses multiple enzymatic properties that are ultimately responsible for replicating the viral genome. The cap gene is expressed from the p40 promoter and encodes the three capsid proteins VP1, VP2, and VP3. Alternative splicing and non-consensus translation initiation sites are responsible for the production of three related capsid proteins. A single consensus polyadenylation site is located at map position 95 of the AAV genome. The life cycle and genetics of AAV are reviewed in Muzyczka, Current Topics in Microbiology and Immunology, 158: 97-129 (1992).

본 개시의 재조합 AAV 게놈은 본 발명의 핵산 분자 및 핵산 분자 측면에 위치하는 하나 이상의 AAV ITR을 포함한다. rAAV 게놈 내의 AAV DNA는 AAV 혈청형 AAVrh74, AAVrh10, AAVrh20, AAV-1, AAV-2, AAV-3, AAV-4, AAV-5, AAV-6, AAV-7, AAV-8, AAV-9, AAV-10, AAV-11, AAV-12 및 AAV-13을 포함하나 이에 제한되지 않는 재조합 바이러스가 유래될 수 있는 임의의 AAV 혈청형으로부터 유래될 수 있다. 슈도타입(pseudotyped) rAAV의 생산은 예를 들어 국제공개공보 제WO 2001/83692호에 개시되어 있다. 다른 유형의 rAAV 변이체, 예를 들어 캡시드 돌연변이를 갖는 rAAV가 또한 고려된다. 예를 들어 문헌 「Marsic 등, Molecular Therapy, 22(11): 1900-1909 (2014)」을 참조한다. 상기 배경기술에서 언급한 바와 같이, 다양한 AAV 혈청형의 게놈의 뉴클레오티드 서열은 관련 기술분야에 공지되어 있다. 일부 구현예에서, 골격근 특이적 발현을 촉진하기 위해, AAV1, AAV6, AAV8 또는 AAVrh74가 사용된다.A recombinant AAV genome of the present disclosure comprises a nucleic acid molecule of the present invention and one or more AAV ITRs flanking the nucleic acid molecule. AAV DNA within the rAAV genome is composed of AAV serotypes AAVrh74, AAVrh10, AAVrh20, AAV-1, AAV-2, AAV-3, AAV-4, AAV-5, AAV-6, AAV-7, AAV-8, AAV-9 , AAV-10, AAV-11, AAV-12 and AAV-13, including but not limited to, from any AAV serotype from which recombinant viruses can be derived. Production of pseudotyped rAAV is disclosed, for example, in WO 2001/83692. Other types of rAAV variants are also contemplated, such as rAAV with capsid mutations. See, for example, Marsic et al., Molecular Therapy, 22(11): 1900-1909 (2014). As noted in the background section above, the nucleotide sequences of the genomes of various AAV serotypes are known in the art. In some embodiments, to promote skeletal muscle specific expression, AAV1, AAV6, AAV8 or AAVrh74 is used.

본원 명세서 및 청구범위에 사용되는 단수 형태 "하나(a, an)" 및 "상기(the)"는 문맥상 달리 명확하게 지시하지 않는한 복수의 것을 지칭한다. 예를 들어 용어 "세포"는 그의 혼합물을 비롯한 복수의 세포를 의미한다.As used in this specification and claims, the singular forms “a, an” and “the” refer to plural unless the context clearly dictates otherwise. For example, the term "cell" means a plurality of cells, including mixtures thereof.

본원에서 사용되는 용어 "포함하는" 또는 "구비한다"는 조성물 및 방법이 언급된 요소를 포함하지만 다른 것을 배제하지 않는 것을 의미하는 것으로 이해되어야 한다. 조성물 및 방법을 정의하기 위해 사용되는 경우에 "본질적으로 구성된"은 언급된 목적을 위한 조합에 대한 임의의 본질적인 중요성의 다른 요소를 배제하는 것을 의미한다. 따라서, 본원에 정의된 바와 같은 요소로 본질적으로 구성되는 조성물은 단리 및 정제 방법으로부터의 미량 오염물 및 약학적으로 허용가능한 담체, 예를 들어 포스페이트 완충 염수, 보존제 등을 배제하지 않는다. "단리하는"은 조성물을 생산하거나 의도된 결과를 달성하기 위해 본 발명의 조성물을 투여하기 위한 다른 성분 및 실질적인 방법 단계의 미량보다 많은 요소를 배제하는 것을 의미할 것이다. 각각의 이들 전이 용어에 의해 정의된 구현예는 본 발명의 범주 내에 있다.As used herein, the terms "comprising" or "comprising" should be understood to mean that the compositions and methods include the recited elements but do not exclude others. “Consisting essentially of” when used to define compositions and methods means excluding any other elements of essential importance to the combination for the stated purpose. Thus, compositions consisting essentially of the elements as defined herein do not exclude trace contaminants from isolation and purification methods and pharmaceutically acceptable carriers such as phosphate buffered saline, preservatives, and the like. "Isolating" shall mean excluding more than trace elements of other components and substantial process steps for administering a composition of the present invention to produce the composition or achieve an intended result. Embodiments defined by each of these transition terms are within the scope of this invention.

핵산, 예를 들어 DNA 또는 RNA와 관련하여 본원에서 사용되는 용어 "단리된"은 거대분자의 천연 공급원에 존재하는 다른 DNA 또는 RNA로부터 각각 분리된 분자를 지칭한다. 용어 "단리된 핵산"은 단편으로서 자연 발생하지 않는 핵산 단편을 포함하는 것을 의미한다. 용어 "단리된"은 또한 다른 세포 단백질로부터 단리되고 정제된 폴리펩티드 및 재조합 폴리펩티드를 둘 다 포괄하는 폴리펩티드, 단백질 및/또는 숙주 세포를 지칭하기 위해 본원에 사용된다. 다른 구현예에서, 용어 "단리된"은 구성성분, 세포 및 달리 자연에서 통상적으로 연관된 세포, 조직, 폴리뉴클레오티드, 펩티드, 폴리펩티드, 단백질, 항체 또는 그의 단편(들)으로부터 분리된 것을 의미한다. 예를 들어 단리된 세포는 비유사 표현형(dissimilar phenotype) 또는 유전자형(genotype)의 조직 또는 세포로부터 분리된 세포이다. 관련 기술분야의 통상의 기술자에게 명백한 바와 같이, 비-자연 발생 폴리뉴클레오티드, 펩티드, 폴리펩티드, 단백질, 항체 또는 그의 단편(들)은 그의 자연 발생 대응물과 구별하기 위해 "단리"를 필요로 하지 않는다.The term "isolated" as used herein in reference to a nucleic acid, eg DNA or RNA, refers to a molecule that has been separated from other DNA or RNA, respectively, present in the macromolecule's natural source. The term "isolated nucleic acid" is meant to include nucleic acid fragments that do not occur naturally as fragments. The term "isolated" is also used herein to refer to polypeptides, proteins and/or host cells, encompassing both polypeptides and recombinant polypeptides that have been isolated and purified from other cellular proteins. In another embodiment, the term "isolated" means separated from components, cells and otherwise normally associated cells, tissues, polynucleotides, peptides, polypeptides, proteins, antibodies or fragment(s) thereof in nature. For example, an isolated cell is a cell that has been separated from a tissue or cell of a dissimilar phenotype or genotype. As will be apparent to those skilled in the art, non-naturally occurring polynucleotides, peptides, polypeptides, proteins, antibodies or fragment(s) thereof do not require "isolation" to distinguish them from their naturally occurring counterparts. .

폴리펩티드 또는 폴리뉴클레오티드, 예를 들어 DNA 또는 RNA와 관련하여 본원에서 사용되는 용어 "재조합"은 재조합의 실험실 방법, 예를 들어 분자 클로닝에 의해 형성된 분자를 지칭한다. 분자 클로닝 기술은 관련 기술분야에 공지되어 있고, 폴리뉴클레오티드의 PCR 증폭, 폴리뉴클레오티드의 효소적 소화, 발현 카세트(예를 들어 포유동물 발현 카세트) 내로의 폴리뉴클레오티드의 결찰(ligation), 폴리뉴클레오티드를 사용한 세포의 형질전환(transformation), 형질주입(transfection) 또는 형질도입(transduction), 및 폴리펩티드를 생산하기 위한 폴리뉴클레오티드의 발현을 포함할 수 있지만, 이에 제한되지는 않는다. 예를 들어 문헌 「Green and Sambrook, Molecular Cloning: A Laboratory Manual, 2012」을 참조한다. 용어 "재조합 폴리뉴클레오티드"는 단백질-암호화 폴리뉴클레오티드의 단편을 포함하는 것을 의미한다. 예를 들어 재조합 폴리뉴클레오티드는 인간 디스페린 단백질을 암호화하는 폴리뉴클레오티드의 단편을 포함할 수 있다. 재조합 폴리뉴클레오티드는 단백질-암호화 폴리뉴클레오티드의 단편의 PCR 증폭에 의해 생산될 수 있다. 재조합 폴리펩티드는 1종 이상의 재조합 폴리뉴클레오티드의 발현에 의해 생산될 수 있다.The term “recombinant” as used herein in reference to a polypeptide or polynucleotide, such as DNA or RNA, refers to a molecule formed by laboratory methods of recombination, such as molecular cloning. Molecular cloning techniques are known in the art and include PCR amplification of polynucleotides, enzymatic digestion of polynucleotides, ligation of polynucleotides into expression cassettes (eg mammalian expression cassettes), transformation, transfection or transduction of cells, and expression of polynucleotides to produce polypeptides, but is not limited thereto. See, for example, Green and Sambrook, Molecular Cloning: A Laboratory Manual, 2012. The term “recombinant polynucleotide” is meant to include fragments of protein-encoding polynucleotides. For example, the recombinant polynucleotide may include a fragment of a polynucleotide encoding the human dysferrin protein. Recombinant polynucleotides can be produced by PCR amplification of fragments of protein-encoding polynucleotides. Recombinant polypeptides can be produced by expression of one or more recombinant polynucleotides.

인간 디스페린(hDSYSF) 단백질의 단편을 암호화하는 폴리뉴클레오티드가 본원에 개시된다. 인간 디스페린(hDSYSF) 단백질의 단편을 암호화하는 폴리뉴클레오티드를 포함하는 플라스미드, 바이러스 벡터, 벡터 시스템, 바이러스 패키징 시스템, 세포, 및 조성물이 본원에 추가로 개시된다. 이러한 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 벡터 시스템, 바이러스 패키징 시스템, 세포, 및 조성물의 제조 및 사용 방법이 또한 본원에 개시된다.Polynucleotides encoding fragments of the human dysferrin (hDSYSF) protein are disclosed herein. Further disclosed herein are plasmids, viral vectors, vector systems, viral packaging systems, cells, and compositions comprising polynucleotides encoding fragments of the human dysferrin (hDSYSF) protein. Methods of making and using such polynucleotides, plasmids, viral vectors, vector systems, viral packaging systems, cells, and compositions are also disclosed herein.

본원에서는 인간 디스페린(hDYSF) 단백질의 단편을 암호화하는 재조합 폴리뉴클레오티드가 개시되고, 여기서 상기 재조합 폴리뉴클레오티드는 제 1 뉴클레오티드 서열을 포함하되, 상기 제 1 뉴클레오티드 서열은, (a) 서열번호 1, 6, 또는 18의 뉴클레오티드 서열; (b) 서열번호 1, 6, 또는 18의 그들 각각의 전체 길이에 걸쳐 서열번호 1, 6, 또는 18의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (c) 서열번호 13 또는 15의 뉴클레오티드 서열; (d) 서열번호 13 또는 15의 그들 각각의 전체 길이에 걸쳐 서열번호 13 또는 15의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (e) hDYSF 단백질을 암호화하는 뉴클레오티드 서열, 상기 hDYSF 단백질은 서열번호 9의 아미노산 서열로 구성됨; 또는 (f) 상기 (e)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (e)의 뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열;로 구성된다.Disclosed herein is a recombinant polynucleotide encoding a fragment of human dysferrin (hDYSF) protein, wherein the recombinant polynucleotide comprises a first nucleotide sequence, wherein the first nucleotide sequence comprises (a) SEQ ID NOs: 1, 6 , or a nucleotide sequence of 18; (b) at least 90%, 91%, 92%, 93%, 94%, 95% of the nucleotide sequence of SEQ ID NO: 1, 6, or 18 over the entire length of each of SEQ ID NO: 1, 6, or 18; 96%, 97%, 98%, or 99% identical nucleotide sequences; (c) the nucleotide sequence of SEQ ID NO: 13 or 15; (d) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the nucleotide sequence of SEQ ID NO: 13 or 15 over their respective full length of SEQ ID NO: 13 or 15; 98%, or 99% identical nucleotide sequences; (e) a nucleotide sequence encoding the hDYSF protein, wherein the hDYSF protein consists of the amino acid sequence of SEQ ID NO: 9; or (f) at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88% of the nucleotide sequence of (e) over the entire length of the nucleotide sequence of (e). %, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical nucleotide sequences;

추가로 본원에서는 인간 디스페린 단백질의 단편을 암호화하는 재조합 폴리뉴클레오티드 서열이 개시되며, 여기서 상기 재조합 폴리뉴클레오티드는 제 1 뉴클레오티드 서열을 포함하되, 상기 제 1 뉴클레오티드 서열은, (a) 서열번호 2, 8, 또는 19의 뉴클레오티드 서열; (b) 서열번호 2, 8, 또는 19의 그들 각각의 전체 길이에 걸쳐 서열번호 2, 8, 또는 19의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (c) 서열번호 14 또는 16의 뉴클레오티드 서열; (d) 서열번호 14 또는 16의 그들 각각의 전체 길이에 걸쳐 서열번호 14 또는 16의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열; (e) hDYSF 단백질을 암호화하는 폴리뉴클레오티드 서열, 상기 hDYSF 단백질은 서열번호 10의 아미노산 서열로 구성됨; 또는 (f) 상기 (e)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (e)의 폴리뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 폴리뉴클레오티드 서열;로 구성된다.Further disclosed herein is a recombinant polynucleotide sequence encoding a fragment of the human dysferrin protein, wherein the recombinant polynucleotide comprises a first nucleotide sequence, wherein the first nucleotide sequence comprises (a) SEQ ID NOs: 2, 8 , or a nucleotide sequence of 19; (b) at least 90%, 91%, 92%, 93%, 94%, 95% of the nucleotide sequence of SEQ ID NO: 2, 8, or 19 over the entire length of each of SEQ ID NO: 2, 8, or 19; 96%, 97%, 98%, or 99% identical nucleotide sequences; (c) the nucleotide sequence of SEQ ID NO: 14 or 16; (d) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the nucleotide sequence of SEQ ID NO: 14 or 16 over their respective full length of SEQ ID NO: 14 or 16; 98%, or 99% identical nucleotide sequences; (e) a polynucleotide sequence encoding a hDYSF protein, wherein the hDYSF protein consists of the amino acid sequence of SEQ ID NO: 10; or (f) at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87% of the polynucleotide sequence of (e) over the entire length of the nucleotide sequence of (e); 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical polynucleotide sequences;

추가로 본원에서는 아데노-연관 바이러스(AAV) 벡터가 개시된다. 일부 구현예에서, 아데노-연관 바이러스(AAV) 벡터는, (a) 제 1 역전된 말단 반복부(ITR); (b) 인간 디스페린(hDYSF) 단백질의 단편을 암호화하는 폴리뉴클레오티드, 여기서 상기 폴리뉴클레오티드는, (i) 서열번호 1 또는 6의 뉴클레오티드 서열; (ii) 서열번호 1 또는 6의 전체 길이에 걸쳐 서열번호 1 또는 6의 뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열; (iii) 서열번호 13 또는 15의 뉴클레오티드 서열; (iv) 서열번호 13 또는 15의 전체 길이에 걸쳐 서열번호 13 또는 15의 뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질을 암호화하는 뉴클레오티드 서열, 상기 hDYSF 단백질은 서열번호 9의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열;로 구성된다.Further disclosed herein are adeno-associated virus (AAV) vectors. In some embodiments, an adeno-associated virus (AAV) vector comprises (a) a first inverted terminal repeat (ITR); (b) a polynucleotide encoding a fragment of human dysferrin (hDYSF) protein, wherein the polynucleotide comprises (i) a nucleotide sequence of SEQ ID NO: 1 or 6; (ii) at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88% of the nucleotide sequence of SEQ ID NO: 1 or 6 over the entire length of SEQ ID NO: 1 or 6; a nucleotide sequence that is 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical; (iii) the nucleotide sequence of SEQ ID NO: 13 or 15; (iv) at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88% of the nucleotide sequence of SEQ ID NO: 13 or 15 over the entire length of SEQ ID NO: 13 or 15; a nucleotide sequence that is 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical; (v) a nucleotide sequence encoding hDYSF protein, wherein the hDYSF protein consists of the amino acid sequence of SEQ ID NO: 9; or (vi) at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88% of the nucleotide sequence of (v) over the entire length of the nucleotide sequence of (v). %, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical nucleotide sequences;

일부 구현예에서, 아데노-연관 바이러스(AAV) 벡터는, In some embodiments, an adeno-associated virus (AAV) vector is

(b) 인간 디스페린(hDYSF) 단백질의 단편을 암호화하는 폴리뉴클레오티드, 여기서 상기 폴리뉴클레오티드는, (i) 서열번호 2 또는 8의 뉴클레오티드 서열; (ii) 서열번호 2 또는 8의 전체 길이에 걸쳐 서열번호 2 또는 8의 뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열; (iii) 서열번호 14 또는 16의 뉴클레오티드 서열; (iv) 서열번호 14 또는 16의 전체 길이에 걸쳐 서열번호 14 또는 16의 뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질을 암호화하는 뉴클레오티드 서열, 여기서 상기 hDYSF 단백질은 서열번호 10의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열;로 구성됨; 및 (b) a polynucleotide encoding a fragment of human dysferrin (hDYSF) protein, wherein the polynucleotide comprises (i) a nucleotide sequence of SEQ ID NO: 2 or 8; (ii) at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88% of the nucleotide sequence of SEQ ID NO: 2 or 8 over the entire length of SEQ ID NO: 2 or 8; a nucleotide sequence that is 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical; (iii) the nucleotide sequence of SEQ ID NO: 14 or 16; (iv) at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88% of the nucleotide sequence of SEQ ID NO: 14 or 16 over the entire length of SEQ ID NO: 14 or 16; a nucleotide sequence that is 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical; (v) a nucleotide sequence encoding a hDYSF protein, wherein the hDYSF protein consists of the amino acid sequence of SEQ ID NO: 10; or (vi) at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88% of the nucleotide sequence of (v) over the entire length of the nucleotide sequence of (v). %, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical nucleotide sequences; and

(c) 제 2 ITR;(c) a second ITR;

을 포함하되, 상기 폴리뉴클레오티드는 상기 제 1 ITR 및 상기 제 2 ITR에 의해 플랭킹된다.Including, wherein the polynucleotide is flanked by the first ITR and the second ITR.

본원에서는 추가로, 아데노-연관 바이러스(AAV) 발현 카세트가 개시된다. 일부 구현예에서, AAV 발현 카세트는, (a) 제 1 역전된 말단 반복부(ITR), 여기서 상기 제 1 ITR은 본원에 개시된 ITR 중의 임의의 ITR을 포함함; (b) 본원에 개시된 5' hDYSF 폴리뉴클레오티드 중의 임의의 5' hDYSF 폴리뉴클레오티드; 및 (c) 제 2 ITR, 여기서 상기 제 2 ITR은 본원에 개시된 ITR 중의 임의의 ITR을 포함함;을 포함하되, 상기 (b)의 5' hDYSF 폴리뉴클레오티드는 상기 (a) 및 (c)의 제 1 ITR 및 제 2 ITR에 의해 플랭킹된다.Further disclosed herein are adeno-associated virus (AAV) expression cassettes. In some embodiments, an AAV expression cassette comprises (a) a first inverted terminal repeat (ITR), wherein the first ITR comprises any of the ITRs disclosed herein; (b) any of the 5' hDYSF polynucleotides disclosed herein; and (c) a second ITR, wherein the second ITR comprises any of the ITRs disclosed herein; wherein the 5' hDYSF polynucleotide of (b) is one of (a) and (c) above It is flanked by a first ITR and a second ITR.

일부 구현예에서, AAV 발현 카세트는, (a) 제 1 역전된 말단 반복부(ITR), 여기서 상기 제 1 ITR은 본원에 개시된 ITR 중의 임의의 ITR을 포함함; (b) 본원에 개시된 3' hDYSF 폴리뉴클레오티드 중의 임의의 3' hDYSF 폴리뉴클레오티드; 및 (c) 제 2 ITR, 여기서 상기 제 2 ITR은 본원에 개시된 ITR 중의 임의의 ITR을 포함함;을 포함하되, 상기 (b)의 3' hDYSF 폴리뉴클레오티드는 (a) 및 (c)의 제 1 ITR 및 제 2 ITR에 의해 플랭킹된다.In some embodiments, an AAV expression cassette comprises (a) a first inverted terminal repeat (ITR), wherein the first ITR comprises any of the ITRs disclosed herein; (b) any of the 3' hDYSF polynucleotides disclosed herein; and (c) a second ITR, wherein the second ITR comprises any of the ITRs disclosed herein; wherein the 3' hDYSF polynucleotide of (b) is the second ITR of (a) and (c) It is flanked by 1 ITR and the 2nd ITR.

본원에서는 추가로, 아데노-연관 바이러스(AAV) 벡터가 개시된다. 일부 구현예에서, AAV 벡터는, (a) 제 1 역전된 말단 반복부(ITR), 여기서 상기 제 1 ITR은 본원에 개시된 임의의 ITR을 포함함; (b) 본원에 개시된 임의의 5' hDYSF 폴리뉴클레오티드; 및 (c) 제 2 ITR, 여기서 상기 제 2 ITR은 본원에 개시된 임의의 ITR을 포함함;을 포함하되, 상기 (b)의 5' hDYSF 폴리뉴클레오티드는 상기 (a) 및 (c)의 제 1 및 제 2 ITR에 의해 플랭킹된다.Further disclosed herein are adeno-associated virus (AAV) vectors. In some embodiments, an AAV vector comprises (a) a first inverted terminal repeat (ITR), wherein the first ITR comprises any ITR disclosed herein; (b) any 5' hDYSF polynucleotide disclosed herein; and (c) a second ITR, wherein the second ITR comprises any ITR disclosed herein; wherein the 5′ hDYSF polynucleotide of (b) is the first ITR of (a) and (c) above and flanked by a second ITR.

일부 구현예에서, AAV 벡터는, (a) 제 1 역전된 말단 반복부(ITR), 여기서 상기 제 1 ITR은 본원에 개시된 임의의 ITR을 포함함; (b) 본원에 개시된 임의의 3' hDYSF 폴리뉴클레오티드; 및 (c) 제 2 ITR, 여기서 상기 제 2 ITR은 본원에 개시된 임의의 ITR을 포함함;을 포함하되, 상기 (b)의 3' hDYSF 폴리뉴클레오티드는 상기 (a) 및 (c)의 제 1 및 제 2 ITR에 의해 플랭킹된다.In some embodiments, an AAV vector comprises (a) a first inverted terminal repeat (ITR), wherein the first ITR comprises any ITR disclosed herein; (b) any 3' hDYSF polynucleotide disclosed herein; and (c) a second ITR, wherein the second ITR comprises any ITR disclosed herein; wherein the 3' hDYSF polynucleotide of (b) is the first ITR of (a) and (c) above and flanked by a second ITR.

본원에서는 추가로, 이중 아데노-연관 바이러스(AAV) 벡터 시스템이 개시된다. 일부 구현예에서, 이중 AAV 벡터 시스템은, (a) 제 1 AAV 벡터, 여기서 상기 제 1 AAV 벡터는 본원에 개시된 임의의 5' hDYSF 폴리뉴클레오티드를 포함함; 및 (b) 제 2 AAV 벡터, 여기서 상기 제 2 AAV 벡터는 본원에 개시된 임의의 3' hDYSF 폴리뉴클레오티드를 포함함;을 포함한다.Further disclosed herein is a dual adeno-associated virus (AAV) vector system. In some embodiments, the dual AAV vector system comprises (a) a first AAV vector, wherein the first AAV vector comprises any 5' hDYSF polynucleotide disclosed herein; and (b) a second AAV vector, wherein the second AAV vector comprises any 3' hDYSF polynucleotide disclosed herein.

본원에서는 추가로, 이중 아데노-연관 바이러스(AAV) 벡터 시스템이 개시된다. 일부 구현예에서, 이중 AAV 벡터 시스템은, (a) 제 1 AAV 벡터, 여기서 상기 제 1 AAV 벡터는 본원에 개시된 임의의 5' hDYSF AAV 벡터를 포함하거나, 이로 구성되거나, 또는 이로 본질적으로 구성됨; 및 (b) 제 2 AAV 벡터, 여기서 상기 제 2 AAV 벡터는 본원에 개시된 임의의 3' hDYSF AAV 벡터를 포함하거나, 이로 구성되거나, 또는 이로 본질적으로 구성됨;을 포함하거나, 이로 구성되거나, 또는 이로 본질적으로 구성된다. Further disclosed herein is a dual adeno-associated virus (AAV) vector system. In some embodiments, the dual AAV vector system comprises (a) a first AAV vector, wherein the first AAV vector comprises, consists of, or consists essentially of any 5' hDYSF AAV vector disclosed herein; and (b) a second AAV vector, wherein the second AAV vector comprises, consists of, or consists essentially of any 3' hDYSF AAV vector disclosed herein; is essentially composed of

본원에서는 추가로, 아데노-연관 바이러스(AAV) 벡터가 개시된다. 일부 구현예에서, AAV 벡터는 본원에 개시된 임의의 5' hDYSF 폴리뉴클레오티드를 포함한다.Further disclosed herein are adeno-associated virus (AAV) vectors. In some embodiments, an AAV vector comprises any 5' hDYSF polynucleotide disclosed herein.

일부 구현예에서, AAV 벡터는 본원에 개시된 임의의 3' hDYSF 폴리뉴클레오티드를 포함한다.In some embodiments, an AAV vector comprises any 3' hDYSF polynucleotide disclosed herein.

일부 구현예에서, 폴리뉴클레오티드, 플라스미드, 바이러스 벡터(예를 들어 바이러스 또는 바이러스 입자), 벡터 시스템, 바이러스 패키징 시스템, 세포, 및 조성물은, 역전된 말단 반복부(ITR), 프로모터, 인트론, 선택 마커, 또는 복제기점(ORI)을 포함하거나, 이로 구성되거나, 또는 이로 본질적으로 구성되는 하나 이상의 뉴클레오티드 서열을 더 포함한다.In some embodiments, polynucleotides, plasmids, viral vectors (eg viruses or viral particles), vector systems, viral packaging systems, cells, and compositions may include inverted terminal repeats (ITRs), promoters, introns, selectable markers , or one or more nucleotide sequences comprising, consisting of, or consisting essentially of an origin of replication (ORI).

일부 구현예에서, 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 벡터 시스템, 바이러스 패키징 시스템, 세포, 및 조성물은, 역전된 말단 반복부(ITR), 선택 마커, 복제기점(ORI), 비번역 영역(UTR), 또는 폴리아데닐화(폴리A) 신호를 포함하는 하나 이상의 추가 뉴클레오티드 서열을 더 포함한다.In some embodiments, polynucleotides, plasmids, viral vectors, vector systems, viral packaging systems, cells, and compositions comprise inverted terminal repeats (ITRs), selectable markers, origins of replication (ORIs), untranslated regions (UTRs) , or one or more additional nucleotide sequences comprising a polyadenylation (polyA) signal.

본원에서는 추가로, 디스페리노병증을 치료하는 방법이 본원에 추가로 개시된다. 일부 구현예에서, 디스페리노병증을 치료하는 방법은 본원에 개시된 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 벡터 시스템, 바이러스 패키징 시스템, 세포, 및 조성물 중의 임의의 것을 이를 필요로 하는 대상체에게 투여하는 것을 포함한다.Further disclosed herein are methods of treating dysperinopathies. In some embodiments, the method of treating dysferinopathy comprises administering to a subject in need thereof any of the polynucleotides, plasmids, viral vectors, vector systems, viral packaging systems, cells, and compositions disclosed herein. do.

본원에서는 추가로, 디스페리노병증의 치료를 위한 의약의 제조에서의 본원에 개시된 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 벡터 시스템, 바이러스 패키징 시스템, 세포, 및 조성물 중의 임의의 것의 용도가 개시된다.Further disclosed herein is the use of any of the polynucleotides, plasmids, viral vectors, vector systems, viral packaging systems, cells, and compositions disclosed herein in the manufacture of a medicament for the treatment of dysferinopathy.

상동 재조합 및 Homologous recombination and hDYSFhDYSF 단편 snippet

AAV-매개 유전자 요법은 다수의 질환에 대한 바람직한 치료 전략을 제시하지만, 이는 AAV 비리온의 제한적인 4.7 kb 패키징 한계에 의해 방해받는다. 현재 치유되지 않거나 효과적인 요법, 예를 들어 디스페리노병증을 갖지 않는 질환이 특히 관심의 대상이다. 2개의 부분적 게놈의 상동 재조합(homologous recombination)에 의해 전체 길이의 디스페린 유전자를 제조 또는 생산하는 방법이 본원에 개시된다. 한 예에서, 2개의 부분적으로 패키징된 게놈은 pAAV.MHCK7.DYSF5'.PTG 및 pAAV.DYSF3'.POLYA로서 도 1에 제시된다. 2개의 게놈이 AAV 벡터 내로 패키징됨으로써 바이러스 전달을 통해든지 또는 비-바이러스 방법 (예를 들어 LNP)을 통해든지 간에, 세포 (예를 들어 근세포)에 전달되면, 이들은 전체-길이 디스페린 암호화 영역을 포함하는 전사체(transcript)를 생성하여, 기능적 디스페린 단백질의 발현을 유도한다. 2개의 폴리뉴클레오티드 사이의 중첩 영역은 전체-길이 디스페린 유전자를 함유하는 전사체를 유도하는 상동 재조합을 용이하게 한다. 전체-길이 디스페린 유전자를 2개의 부분적으로 패키징된 게놈으로 분리함으로써, 상기 방법은 AAV 패키징 제한을 성공적으로 우회하고 기능적 전체-길이 디스페린 유전자를 생산한다. 한 구현예에서, 전사체는 서열번호 20의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 서열을 포함하는 발현 카세트이다. 한 구현예에서, 전사체는 서열번호 20의 서열을 포함하는 발현 카세트이다.AAV-mediated gene therapy represents a desirable treatment strategy for a number of diseases, but it is hampered by the restrictive 4.7 kb packaging limitation of AAV virions. Of particular interest are diseases that are currently incurable or do not have an effective therapy, such as dysperinopathies. Methods for making or producing a full-length dysferrin gene by homologous recombination of two partial genomes are disclosed herein. In one example, two partially packaged genomes are shown in Figure 1 as pAAV.MHCK7.DYSF5'.PTG and pAAV.DYSF3'.POLYA. When the two genomes are packaged into an AAV vector and delivered to a cell (eg myocyte), whether via viral transfer or via a non-viral method (eg LNP), they contain the full-length dysferrin coding region. By generating a transcript that contains, expression of a functional dysferrin protein is induced. The overlapping region between the two polynucleotides facilitates homologous recombination leading to a transcript containing the full-length dysferrin gene. By separating the full-length dysferrin gene into two partially packaged genomes, the method successfully bypasses AAV packaging restrictions and produces a functional full-length dysferrin gene. In one embodiment, the transcript comprises a sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to the nucleotide sequence of SEQ ID NO: 20. It is an expression cassette that In one embodiment, the transcript is an expression cassette comprising the sequence of SEQ ID NO:20.

일부 구현예에서, 인간 디스페린(hDYSF) 단백질을 암호화하는 재조합 폴리뉴클레오티드가 본원에 개시되며, 여기서 상기 재조합 폴리뉴클레오티드 서열은 서열번호 20의 뉴클레오티드 서열 또는 서열번호 20의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열을 포함한다. 일부 경우에, 서열번호 20의 뉴클레오티드 서열을 포함하는 재조합 폴리뉴클레오티드가 본원에 개시된다. 일부 경우에, 재조합 폴리펩티드를 제조하는 방법이 본원에 개시된다. 일부 경우에, 상기 방법은 세포를 hDYSF 단백질의 5' 단편을 암호화하는 재조합 폴리뉴클레오티드 및 hDYSF 단백질의 3' 단편을 암호화하는 제 2 재조합 폴리뉴클레오티드와 접촉시키는 것을 포함한다. 일부 경우에, 상기 방법은 세포를 본원에 기재된 이중 AAV 벡터 시스템과 접촉시키는 것을 포함한다. 일부 경우에, 세포는 진핵 세포, 임의로 근육 세포, 심장 세포, 및/또는 간 세포이다. 일부 경우에, 상기 방법은 hDYSF 단백질의 5' 단편을 암호화하는 재조합 폴리뉴클레오티드 및 hDYSF 단백질의 3' 단편을 암호화하는 제 2 재조합 폴리뉴클레오티드를 대상체에게 투여하는 것을 포함한다. 일부 경우에, 상기 방법은 대상체에게 본원에 기재된 이중 AAV 벡터 시스템을 투여하는 것을 포함한다. 일부 경우에, 상기 방법은 서열번호 20 또는 서열번호 20의 뉴클레오티드 서열과 적어도 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열을 포함하는 재조합 폴리뉴클레오티드를 발현시킴으로써 대상체의 근이영양증을 치료하는 단계를 포함한다. 일부 경우에, 대상체는 임의로 LGMD2B 또는 미요시 근병증으로부터 선택된 디스페리노병증을 갖는다.In some embodiments, disclosed herein is a recombinant polynucleotide encoding human dysferrin (hDYSF) protein, wherein the recombinant polynucleotide sequence is at least 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences. In some cases, recombinant polynucleotides comprising the nucleotide sequence of SEQ ID NO: 20 are disclosed herein. In some instances, methods of making recombinant polypeptides are disclosed herein. In some cases, the method includes contacting the cell with a recombinant polynucleotide encoding a 5' fragment of hDYSF protein and a second recombinant polynucleotide encoding a 3' fragment of hDYSF protein. In some cases, the methods include contacting the cells with a dual AAV vector system described herein. In some cases, the cells are eukaryotic cells, optionally muscle cells, cardiac cells, and/or liver cells. In some cases, the method comprises administering to the subject a recombinant polynucleotide encoding a 5' fragment of hDYSF protein and a second recombinant polynucleotide encoding a 3' fragment of hDYSF protein. In some cases, the methods include administering to the subject a dual AAV vector system described herein. In some cases, the method is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 20 or the nucleotide sequence of SEQ ID NO: 20. treating muscular dystrophy in a subject by expressing a recombinant polynucleotide comprising the nucleotide sequence. In some cases, the subject has a dysferinopathy optionally selected from LGMD2B or Miyoshi myopathy.

또한, 위에서 기술한 바와 같은 상동 재조합에 의해 전체-길이 디스페린 유전자의 생산을 유도할 수 있는, 인간 디스페린(hDYSF) 단백질의 단편을 각각 암호화하는 2개의 재조합 폴리뉴클레오티드가 본원에 개시된다. 일부 구현예에서, 재조합 폴리뉴클레오티드는 hDYSF 단백질의 5' 단편을 암호화한다. 일부 구현예에서, hDYSF 단백질의 5' 단편을 암호화하는 재조합 폴리뉴클레오티드는 5' hDYSF 폴리뉴클레오티드로 지칭된다. 일부 구현예에서, 재조합 폴리뉴클레오티드는 hDYSF 단백질의 3' 단편을 암호화한다. 일부 구현예에서, hDYSF 단백질의 3' 단편을 암호화하는 재조합 폴리뉴클레오티드는 3' hDSYF 폴리뉴클레오티드로 지칭된다.Also disclosed herein are two recombinant polynucleotides, each encoding a fragment of the human dysferrin (hDYSF) protein, capable of directing production of the full-length dysferrin gene by homologous recombination as described above. In some embodiments, the recombinant polynucleotide encodes a 5' fragment of the hDYSF protein. In some embodiments, a recombinant polynucleotide encoding a 5' fragment of an hDYSF protein is referred to as a 5' hDYSF polynucleotide. In some embodiments, the recombinant polynucleotide encodes a 3' fragment of the hDYSF protein. In some embodiments, a recombinant polynucleotide encoding a 3' fragment of hDYSF protein is referred to as a 3' hDSYF polynucleotide.

5' 5' hDYSFhDYSF 폴리뉴클레오티드 polynucleotide

일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는, 서열번호 11의 뉴클레오티드 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150-3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370-3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, 또는 377-3716개 사이의 영역의 3330-3365, 3330-3360, 3330-3355, 3335-3365, 3335-3350, 3340-3365, 3340-3360, 또는 3340-3355개 사이의 연속 뉴클레오티드를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150-3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370-3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, 또는 377-3716개 사이의 영역의 3360, 3359, 3358, 3357, 3356, 3355, 3354, 3353, 3352, 3351, 3350, 3349, 3348, 3347, 3346, 3345, 3344, 3343, 3342, 3341, 또는 3340개 또는 그 미만의 연속 뉴클레오티드(consecutive nucleotides)를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지는 않는다.In some embodiments, a 5' hDYSF polynucleotide comprises nucleotides 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150 of SEQ ID NO: 11 -3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000 ; -3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, or 3330-3365, 3330-3360, 3330-3355, 3335-3365, 3335- It comprises, consists of, or consists essentially of between 3350, 3340-3365, 3340-3360, or 3340-3355 contiguous nucleotides. In some embodiments, the 5' hDYSF polynucleotide comprises nucleotides 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150- 3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370- 3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, or 3360, 3359, 3358, 3357, 3356, 3355, 3354, 3353, 33512, 33512, 377-3716 , 3350, 3349, 3348, 3347, 3346, 3345, 3344, 3343, 3342, 3341, or 3340 or less consecutive nucleotides, consisting of, or consisting essentially of. . In some embodiments, the 5' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150-3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370-3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, 또는 377-3716개 사이의 영역의 3330-3365, 3330-3360, 3330-3355, 3335-3365, 3335-3350, 3340-3365, 3340-3360, 또는 3340-3355개 사이의 연속 뉴클레오티드를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150-3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370-3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, 또는 377-3716개 사이의 영역의 전체 길이에 걸쳐 서열번호 11의 뉴클레오티드 서열과 적어도 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, 99%, 또는 100%이다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150-3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370-3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, 또는 377-3716개 사이의 영역의 3360, 3359, 3358, 3357, 3356, 3355, 3354, 3353, 3352, 3351, 3350, 3349, 3348, 3347, 3346, 3345, 3344, 3343, 3342, 3341, 또는 3340개 또는 그 미만의 연속 뉴클레오티드를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150-3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370-3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, 또는 377-3716개 사이의 영역의 전체 길이에 걸쳐 서열번호 11의 뉴클레오티드 서열과 적어도 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, 99%, 또는 100%이다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150-3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370-3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, 또는 377-3716개 사이의 영역의 전체 길이에 걸쳐 서열번호 11의 뉴클레오티드 서열에 대해 적어도 85%이다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150-3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370-3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, 또는 377-3716개 사이의 영역의 전체 길이에 걸쳐 서열번호 11의 뉴클레오티드 서열에 대해 적어도 90%이다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150-3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370-3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, 또는 377-3716개 사이의 영역의 전체 길이에 걸쳐 서열번호 11의 뉴클레오티드 서열에 대해 적어도 95%이다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다. In some embodiments, the 5' hDYSF polynucleotide comprises nucleotides 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150- 3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370- 3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, or 3330-3365, 3330-3360, 3330-3355, 3335-3365, 3335-3350 in the range between 377-3716 , 3340-3365, 3340-3360, or 3340-3355 contiguous nucleotides, wherein the 5' hDYSF polynucleotide comprises nucleotides 100-4000 of SEQ ID NO: 11 ,100-3900,100-3800,100-3750,100-3716,150-4000,150-3900,150-3800,150-3750,150-3716,200-4000,200-3900,200-3800,200 -3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000 , 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370-3750, 370-3716, 377-4000, 377 at least 80%, 82%, 85%, 87%, 88%, 90% of the nucleotide sequence of SEQ ID NO: 11 over the entire length of the region between -3900, 377-3800, 377-3750, or 377-3716; 92%, 95%, 96%, 97%, 98%, 99%, or 100%. In some embodiments, the 5' hDYSF polynucleotide comprises nucleotides 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150- 3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370- 3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, or 3360, 3359, 3358, 3357, 3356, 3355, 3354, 3353, 33512, 33512, 377-3716 , 3350, 3349, 3348, 3347, 3346, 3345, 3344, 3343, 3342, 3341, or 3340 or less consecutive nucleotides, consisting of, or consisting essentially of, wherein said 5' The hDYSF polynucleotide has nucleotides 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150-3750, 150-3716, 200 of SEQ ID NO: 11 -4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800 , 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, nucleotide sequence of SEQ ID NO: 11 and at least 80 %, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, 99%, or 100%. In some embodiments, the 5' hDYSF polynucleotide comprises nucleotides 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150- 3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370- 3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, or at least 85% relative to the nucleotide sequence of SEQ ID NO: 11 over the entire length of the region between 377-3716. In some embodiments, the 5' hDYSF polynucleotide comprises nucleotides 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150- 3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370- 3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, or at least 90% relative to the nucleotide sequence of SEQ ID NO: 11 over the entire length of the region between 377-3716. In some embodiments, the 5' hDYSF polynucleotide comprises nucleotides 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150- 3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370- 3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, or at least 95% relative to the nucleotide sequence of SEQ ID NO: 11 over the entire length of the region between 377-3716. In some embodiments, the 5' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150-3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370-3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, 또는 377-3716개 사이의 영역의 3330-3365, 3330-3360, 3330-3355, 3335-3365, 3335-3350, 3340-3365, 3340-3360, 또는 3340-3355개 사이의 연속 뉴클레오티드 미스매치(mismatches)를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150-3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370-3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, 또는 377-3716개 사이의 영역에서 30, 25, 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1개 또는 그 미만의 연속 뉴클레오티드 미스매치를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150-3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370-3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, 또는 377-3716개 사이의 영역의 3360, 3359, 3358, 3357, 3356, 3355, 3354, 3353, 3352, 3351, 3350, 3349, 3348, 3347, 3346, 3345, 3344, 3343, 3342, 3341, 또는 3340개 또는 그 미만의 연속 뉴클레오티드를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150-3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370-3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, 또는 377-3716개 사이의 영역에서 30, 25, 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1개 또는 그 미만의 연속 뉴클레오티드를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150-3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370-3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, 또는 377-3716개 사이의 영역에서 15개 또는 그 미만의 뉴클레오티드 미스매치를 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150-3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370-3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, 또는 377-3716개 사이의 영역에서 10개 또는 그 미만의 뉴클레오티드 미스매치를 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150-3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370-3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, 또는 377-3716개 사이의 영역에서 5개 또는 그 미만의 뉴클레오티드 미스매치를 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150-3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370-3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, 또는 377-3716개 사이의 영역에서 1개의 뉴클레오티드 미스매치를 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150-3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370-3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, 또는 377-3716개 사이의 영역에서 적어도 1개의 뉴클레오티드 미스매치를 포함한다. 일부 구현예에서, 5' 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 377-3716 사이의 영역에서 적어도 1개의 뉴클레오티드 미스매치를 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 5' hDYSF polynucleotide comprises nucleotides 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150- 3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370- 3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, or 3330-3365, 3330-3360, 3330-3355, 3335-3365, 3335-3350 in the range between 377-3716 , 3340-3365, 3340-3360, or 3340-3355 consecutive nucleotide mismatches between, consisting of, or consisting essentially of, wherein the 5' hDYSF polynucleotide is SEQ ID NO: 11 nucleotides 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150-3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300- 3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370-3750, 370-37 30, 25, 20, 19, 18, 17, 16, 15, 14, 13, 12, 11 in the range between 16, 377-4000, 377-3900, 377-3800, 377-3750, or 377-3716 , 10, 9, 8, 7, 6, 5, 4, 3, 2, 1 or less consecutive nucleotide mismatches. In some embodiments, the 5' hDYSF polynucleotide comprises nucleotides 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150- 3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370- 3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, or 3360, 3359, 3358, 3357, 3356, 3355, 3354, 3353, 33512, 33512, 377-3716 , 3350, 3349, 3348, 3347, 3346, 3345, 3344, 3343, 3342, 3341, or 3340 or less consecutive nucleotides, consisting of, or consisting essentially of, wherein said 5' The hDYSF polynucleotide has nucleotides 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150-3750, 150-3716, 200 of SEQ ID NO: 11 -4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800 , 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 30, 25, 20, 19, 18, 17, 16 in the range between 370-3800, 370-3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, or 377-3716 , 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1 or less consecutive nucleotides, consisting essentially of these consists of In some embodiments, the 5' hDYSF polynucleotide comprises nucleotides 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150- 3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370- 3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, or 15 or fewer nucleotide mismatches in the region between 377-3716. In some embodiments, the 5' hDYSF polynucleotide comprises nucleotides 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150- 3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370- 3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, or 10 or fewer nucleotide mismatches in the region between 377-3716. In some embodiments, the 5' hDYSF polynucleotide comprises nucleotides 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150- 3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370- 3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, or 5 or fewer nucleotide mismatches in the region between 377-3716. In some embodiments, the 5' hDYSF polynucleotide comprises nucleotides 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150- 3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370- contains 1 nucleotide mismatch in the region between 3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, or 377-3716. In some embodiments, the 5' hDYSF polynucleotide comprises nucleotides 100-4000, 100-3900, 100-3800, 100-3750, 100-3716, 150-4000, 150-3900, 150-3800, 150- 3750, 150-3716, 200-4000, 200-3900, 200-3800, 200-3750, 200-3716, 250-4000, 250-3900, 250-3800, 250-3750, 250-3716, 300-4000, 300-3900, 300-3800, 300-3750, 300-3716, 350-4000, 350-3900, 350-3800, 350-3750, 350-3716, 370-4000, 370-3900, 370-3800, 370- 3750, 370-3716, 377-4000, 377-3900, 377-3800, 377-3750, or at least one nucleotide mismatch in the region between 377-3716. In some embodiments, the 5' polynucleotide comprises at least one nucleotide mismatch in the region between nucleotides 377-3716 of SEQ ID NO: 11. In some embodiments, the 5' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 3330-3365, 3330-3360, 3330-3355, 3335-3365, 3335-3350, 3340-3365, 3340-3360, 또는 3340-3355개의 연속 뉴클레오티드를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 여기서 상기 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 위치 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390-3650, 3390-3600, 3390-3550, 3390-3500, 3380-3716, 3380-3700, 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-3600, 3371-3550, 또는 3371-3500을 포함하는 영역을 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 3360, 3359, 3358, 3357, 3356, 3355, 3354, 3353, 3352, 3351, 3350, 3349, 3348, 3347, 3346, 3345, 3344, 3343, 3342, 3341, 또는 3340개 또는 그 미만의 연속 뉴클레오티드를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 여기서 상기 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 위치 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390-3650, 3390-3600, 3390-3550, 3390-3500, 3380-3716, 3380-3700, 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-3600, 3371-3550, 또는 3371-3500을 포함하는 영역을 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 위치 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390-3650, 3390-3600, 3390-3550, 3390-3500, 3380-3716, 3380-3700, 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-3600, 3371-3550, 또는 3371-3500을 포함하는 영역의 전체 길이에 걸쳐 상기 영역의 뉴클레오티드 서열과 적어도 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 위치 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390-3650, 3390-3600, 3390-3550, 3390-3500, 3380-3716, 3380-3700, 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-3600, 3371-3550, 또는 3371-3500을 포함하는 영역의 전체 길이에 걸쳐 상기 영역의 뉴클레오티드 서열과 적어도 90% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 위치 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390-3650, 3390-3600, 3390-3550, 3390-3500, 3380-3716, 3380-3700, 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-3600, 3371-3550, 또는 3371-3500을 포함하는 영역의 전체 길이에 걸쳐 상기 영역의 뉴클레오티드 서열과 적어도 95% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 위치 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390-3650, 3390-3600, 3390-3550, 3390-3500, 3380-3716, 3380-3700, 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-3600, 3371-3550, 또는 3371-3500을 포함하는 영역의 전체 길이에 걸쳐 상기 영역의 뉴클레오티드 서열과 적어도 99% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 5' hDYSF polynucleotide is 3330-3365, 3330-3360, 3330-3355, 3335-3365, 3335-3350, 3340-3365, 3340-3360, or 3340-3355 contiguous nucleotides of SEQ ID NO: 11 comprising, consisting of, or consisting essentially of, wherein the 5' hDYSF polynucleotide is at nucleotide positions 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550 of SEQ ID NO: 11 ; -3700, 3371-3650, 3371-3600, 3371-3550, or the region containing 3371-3500. In some embodiments, the 5' hDYSF polynucleotide is 3360, 3359, 3358, 3357, 3356, 3355, 3354, 3353, 3352, 3351, 3350, 3349, 3348, 3347, 3346, 3345, 33434, 33434 of SEQ ID NO: 11 , 3342, 3341, or 3340 or fewer contiguous nucleotides, consisting of, or consisting essentially of, wherein the 5' hDYSF polynucleotide is at nucleotide positions 3400-3716, 3400 of SEQ ID NO: 11 -3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390-3650, 3390-3600, 3390-3550, 3390-3500, 3380-303716 , 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-3600, 3371-3550, or 3371-3500. In some embodiments, the 5' hDYSF polynucleotide has nucleotide positions 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390 of SEQ ID NO: 11 -3650, 3390-3600, 3390-3550, 3390-3500, 3380-3716, 3380-3700, 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-350710 , or at least 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, 99% of the nucleotide sequence of the region over the entire length of the region comprising 3371-3500 , or 100% identical nucleotide sequences. In some embodiments, the 5' hDYSF polynucleotide has nucleotide positions 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390 of SEQ ID NO: 11 -3650, 3390-3600, 3390-3550, 3390-3500, 3380-3716, 3380-3700, 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-350710 , or a nucleotide sequence that is at least 90% identical to the nucleotide sequence of the region over the entire length of the region comprising 3371-3500. In some embodiments, the 5' hDYSF polynucleotide has nucleotide positions 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390 of SEQ ID NO: 11 -3650, 3390-3600, 3390-3550, 3390-3500, 3380-3716, 3380-3700, 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-350710 , or a nucleotide sequence that is at least 95% identical to the nucleotide sequence of the region over the entire length of the region comprising 3371-3500. In some embodiments, the 5' hDYSF polynucleotide has nucleotide positions 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390 of SEQ ID NO: 11 -3650, 3390-3600, 3390-3550, 3390-3500, 3380-3716, 3380-3700, 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-350710 , or a nucleotide sequence that is at least 99% identical to the nucleotide sequence of the region over the entire length of the region comprising 3371-3500. In some embodiments, the 5' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 위치 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390-3650, 3390-3600, 3390-3550, 3390-3500, 3380-3716, 3380-3700, 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-3600, 3371-3550, 또는 3371-3500을 포함하는 영역의 뉴클레오티드 서열에 대해 10, 9, 8, 7, 6, 5, 4, 3, 2, 1개 또는 그 미만의 뉴클레오티드 미스매치를 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 위치 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390-3650, 3390-3600, 3390-3550, 3390-3500, 3380-3716, 3380-3700, 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-3600, 3371-3550, 또는 3371-3500을 포함하는 영역의 뉴클레오티드 서열에 대해 5개 또는 그 미만의 뉴클레오티드 미스매치를 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 위치 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390-3650, 3390-3600, 3390-3550, 3390-3500, 3380-3716, 3380-3700, 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-3600, 3371-3550, 또는 3371-3500을 포함하는 영역의 뉴클레오티드 서열에 대해 2개 또는 그 미만의 뉴클레오티드 미스매치를 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 위치 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390-3650, 3390-3600, 3390-3550, 3390-3500, 3380-3716, 3380-3700, 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-3600, 3371-3550, 또는 3371-3500을 포함하는 영역의 뉴클레오티드 서열에 대해 1개 또는 그 미만의 뉴클레오티드 미스매치를 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 5' hDYSF polynucleotide has nucleotide positions 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390 of SEQ ID NO: 11 -3650, 3390-3600, 3390-3550, 3390-3500, 3380-3716, 3380-3700, 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-350710 , or 10, 9, 8, 7, 6, 5, 4, 3, 2, 1 or less nucleotide mismatches to the nucleotide sequence of the region comprising 3371-3500. In some embodiments, the 5' hDYSF polynucleotide has nucleotide positions 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390 of SEQ ID NO: 11 -3650, 3390-3600, 3390-3550, 3390-3500, 3380-3716, 3380-3700, 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-350710 , or 5 or fewer nucleotide mismatches to the nucleotide sequence of the region comprising 3371-3500. In some embodiments, the 5' hDYSF polynucleotide has nucleotide positions 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390 of SEQ ID NO: 11 -3650, 3390-3600, 3390-3550, 3390-3500, 3380-3716, 3380-3700, 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-350710 , or 2 or fewer nucleotide mismatches to the nucleotide sequence of the region comprising 3371-3500. In some embodiments, the 5' hDYSF polynucleotide has nucleotide positions 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390 of SEQ ID NO: 11 -3650, 3390-3600, 3390-3550, 3390-3500, 3380-3716, 3380-3700, 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-350710 , or 1 or less nucleotide mismatches to the nucleotide sequence of the region comprising 3371-3500. In some embodiments, the 5' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 3360, 3359, 3358, 3357, 3356, 3355, 3354, 3353, 3352, 3351, 3350, 3349, 3348, 3347, 3346, 3345, 3344, 3343, 3342, 3341, 또는 3340개 또는 그 미만의 연속 뉴클레오티드를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 위치 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390-3650, 3390-3600, 3390-3550, 3390-3500, 3380-3716, 3380-3700, 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-3600, 3371-3550, 또는 3371-3500을 포함하는 영역을 포함하고, 상기 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 위치 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390-3650, 3390-3600, 3390-3550, 3390-3500, 3380-3716, 3380-3700, 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-3600, 3371-3550, 또는 3371-3500를 포함하는 영역의 전체 길이에 걸쳐 서열변호 11의 뉴클레오티드 서열에 대해 적어도 90%, 92%, 95%, 96%, 97%, 98%, 99%, 또는 100%이다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 5' hDYSF polynucleotide is 3360, 3359, 3358, 3357, 3356, 3355, 3354, 3353, 3352, 3351, 3350, 3349, 3348, 3347, 3346, 3345, 33434, 33434 of SEQ ID NO: 11 , 3342, 3341, or 3340 or less contiguous nucleotides, wherein the 5' hDYSF polynucleotide comprises SEQ ID NO: 11 at nucleotide positions 3400-3716, 3400- 3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390-3650, 3390-3600, 3390-3550, 3390-3500, 3380-303716 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, 3371-3600, 3371-3550, or 3371-3500, wherein the 5' hDYSF polynucleotide is SEQ ID NO: 11 nucleotide positions 3400-3716, 3400-3700, 3400-3650, 3400-3600, 3400-3550, 3400-3500, 3390-3716, 3390-3700, 3390-3650, 3390-3600, 3390-3550, , 3380-3716, 3380-3700, 3380-3650, 3380-3500, 3371-3716, 3371-3700, 3371-3650, over the entire length of the region containing at least 90%, 92%, 95%, 96%, 97%, 98%, 99%, or 100% of the nucleotide sequence of SEQ ID NO: 11. In some embodiments, the 5' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 3330-3365, 3330-3360, 3330-3355, 3335-3365, 3335-3350, 3340-3365, 3340-3360, 또는 3340-3355개의 연속 뉴클레오티드를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 위치 1-100, 1-200, 1-300, 1-350, 1-375, 1-376, 100-200, 100-300, 100-350, 100-375, 100-376, 200-300, 200-350, 200-375, 또는 200-376을 포함하는 영역을 포함하지 않는다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 1-376의 뉴클레오티드 위치를 포함하는 영역을 포함하지 않는다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 5' hDYSF polynucleotide is 3330-3365, 3330-3360, 3330-3355, 3335-3365, 3335-3350, 3340-3365, 3340-3360, or 3340-3355 contiguous nucleotides of SEQ ID NO: 11 comprising, consisting of, or consisting essentially of, wherein the 5' hDYSF polynucleotide comprises nucleotide positions 1-100, 1-200, 1-300, 1-350, 1-375 of SEQ ID NO: 11; 1-376, 100-200, 100-300, 100-350, 100-375, 100-376, 200-300, 200-350, 200-375, or the region containing 200-376 is not included. In some embodiments, the 5' hDYSF polynucleotide does not include a region comprising nucleotide positions 1-376 of SEQ ID NO: 11. In some embodiments, the 5' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 3360, 3359, 3358, 3357, 3356, 3355, 3354, 3353, 3352, 3351, 3350, 3349, 3348, 3347, 3346, 3345, 3344, 3343, 3342, 3341, 또는 3340개의 연속 뉴클레오티드를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 위치 1-100, 1-200, 1-300, 1-350, 1-375, 100-200, 100-300, 100-350, 100-375, 200-300, 200-350, 200-375로 구성되는 영역을 포함하지 않는다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 위치 1-376으로 구성되는 영역을 포함하지 않는다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 5' hDYSF polynucleotide is 3360, 3359, 3358, 3357, 3356, 3355, 3354, 3353, 3352, 3351, 3350, 3349, 3348, 3347, 3346, 3345, 33434, 33434 of SEQ ID NO: 11 , 3342, 3341, or 3340 contiguous nucleotides, wherein the 5' hDYSF polynucleotide comprises nucleotide positions 1-100, 1-200, 1- 300, 1-350, 1-375, 100-200, 100-300, 100-350, 100-375, 200-300, 200-350, 200-375. In some embodiments, the 5' hDYSF polynucleotide does not comprise a region consisting of nucleotide positions 1-376 of SEQ ID NO:11. In some embodiments, the 5' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 1의 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 1의 전체 길이에 걸쳐 서열번호 1의 뉴클레오티드 서열과 적어도 80%, 82%, 85%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 1의 전체 길이에 걸쳐 서열번호 1의 뉴클레오티드 서열과 적어도 90% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 1의 전체 길이에 걸쳐 서열번호 1의 뉴클레오티드 서열과 적어도 92% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 1의 전체 길이에 걸쳐 서열번호 1의 뉴클레오티드 서열과 적어도 95% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 5' hDYSF polynucleotide comprises, consists of, or consists essentially of the nucleotide sequence of SEQ ID NO: 1. In some embodiments, the 5' hDYSF polynucleotide is at least 80%, 82%, 85%, 88%, 89%, 90%, 91%, 92% of the nucleotide sequence of SEQ ID NO: 1 over the entire length of SEQ ID NO: 1 , 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences, consisting of, or consisting essentially of. In some embodiments, the 5' hDYSF polynucleotide comprises, consists of, or consists essentially of a nucleotide sequence that is at least 90% identical to the nucleotide sequence of SEQ ID NO: 1 over its entire length. In some embodiments, the 5' hDYSF polynucleotide comprises, consists of, or consists essentially of a nucleotide sequence that is at least 92% identical to the nucleotide sequence of SEQ ID NO: 1 over the entire length of SEQ ID NO: 1. In some embodiments, the 5' hDYSF polynucleotide comprises, consists of, or consists essentially of a nucleotide sequence that is at least 95% identical to the nucleotide sequence of SEQ ID NO: 1 over its entire length. In some embodiments, the 5' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 13의 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 13의 전체 길이에 걸쳐 서열번호 13의 뉴클레오티드 서열과 적어도 80%, 82%, 85%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 13의 전체 길이에 걸쳐 서열번호 13의 뉴클레오티드 서열과 적어도 90% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 13의 전체 길이에 걸쳐 서열번호 13의 뉴클레오티드 서열과 적어도 92% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 13의 전체 길이에 걸쳐 서열번호 13의 뉴클레오티드 서열과 적어도 95% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다. In some embodiments, the 5' hDYSF polynucleotide comprises, consists of, or consists essentially of the nucleotide sequence of SEQ ID NO: 13. In some embodiments, the 5' hDYSF polynucleotide is at least 80%, 82%, 85%, 88%, 89%, 90%, 91%, 92% of the nucleotide sequence of SEQ ID NO: 13 over the entire length of SEQ ID NO: 13 , 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences, consisting of, or consisting essentially of. In some embodiments, the 5' hDYSF polynucleotide comprises, consists of, or consists essentially of a nucleotide sequence that is at least 90% identical to the nucleotide sequence of SEQ ID NO: 13 over the entire length of SEQ ID NO: 13. In some embodiments, the 5' hDYSF polynucleotide comprises, consists of, or consists essentially of a nucleotide sequence that is at least 92% identical to the nucleotide sequence of SEQ ID NO: 13 over the entire length of SEQ ID NO: 13. In some embodiments, the 5' hDYSF polynucleotide comprises, consists of, or consists essentially of a nucleotide sequence that is at least 95% identical to the nucleotide sequence of SEQ ID NO: 13 over the entire length of SEQ ID NO: 13. In some embodiments, the 5' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 야생형 hDSYF 단백질의 N-말단 영역을 포함하는 hDYSF 단백질의 단편을 암호화하는 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 야생형 hDSYF 단백질의 N-말단 영역을 포함하는 hDYSF 단백질의 단편은 N-말단 hDYSF 단백질로서 지칭된다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 N-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 N-말단 hDYSF 단백질은 서열번호 12의 아미노산 잔기 1-1113, 200-1113, 400-1113, 500-1113, 600-1113, 650-113, 650-1100, 700-1100, 700-1113, 700-1050, 700-1000, 800-1113, 800-1100, 800-1050, 900-1113, 900-1100, 1000-1113, 또는 1000-1100개를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되는 영역을 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 N-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열의 전체 길이에 걸쳐 N-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열과 적어도 80%, 82%, 85%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 N-말단 hDYSF 단백질은 서열번호 12의 아미노산 잔기 1-1113, 200-1113, 400-1113, 500-1113, 600-1113, 650-113, 650-1100, 700-1100, 700-1113, 700-1050, 700-1000, 800-1113, 800-1100, 800-1050, 900-1113, 900-1100, 1000-1113, 또는 1000-1100개를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되는 영역을 포함한다. 일부 구현예에서, N-말단 hDYSF 단백질은 아미노산 잔기 999-1113, 999-1100, 1000-1113, 또는 1000-1100개를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되는 영역을 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, a 5' hDYSF polynucleotide comprises, consists of, or consists essentially of a nucleotide sequence encoding a fragment of hDYSF protein comprising the N-terminal region of wild-type hDSYF protein. In some embodiments, fragments of hDYSF protein comprising the N-terminal region of wild-type hDSYF protein are referred to as N-terminal hDYSF proteins. In some embodiments, the 5' hDYSF polynucleotide comprises, consists of, or consists essentially of a nucleotide sequence encoding an N-terminal hDYSF protein, wherein the N-terminal hDYSF protein comprises amino acids of SEQ ID NO: 12 Residues 1-1113, 200-1113, 400-1113, 500-1113, 600-1113, 650-113, 650-1100, 700-1100, 700-1113, 700-1050, 700-1000, 800-1113, 800 -1100, 800-1050, 900-1113, 900-1100, 1000-1113, or 1000-1100. In some embodiments, a 5' hDYSF polynucleotide is at least 80%, 82%, 85%, 88%, comprises, consists of, or consists essentially of 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences; Wherein the N-terminal hDYSF protein comprises amino acid residues 1-1113, 200-1113, 400-1113, 500-1113, 600-1113, 650-113, 650-1100, 700-1100, 700- comprising, consisting of, or consisting essentially of 1113, 700-1050, 700-1000, 800-1113, 800-1100, 800-1050, 900-1113, 900-1100, 1000-1113, or 1000-1100; includes the area composed of them. In some embodiments, the N-terminal hDYSF protein comprises a region comprising, consisting of, or consisting essentially of amino acid residues 999-1113, 999-1100, 1000-1113, or 1000-1100. . In some embodiments, the 5' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 N-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 N-말단 hDYSF 단백질은 적어도 1000개의 아미노산 길이이고, 여기서 상기 N-말단 hDYSF 단백질은 서열번호 12의 아미노산 잔기 999-1113, 999-1100, 1000-1113, 또는 1000-1100개를 포함하는 영역을 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 N-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열의 전체 길이에 걸쳐 N-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열과 적어도 80%, 82%, 85%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 N-말단 hDYSF 단백질은 적어도 1000개의 아미노산 길이이고, 여기서 N-말단 hDYSF 단백질은 서열번호 12의 아미노산 잔기 999-1113, 999-1100, 1000-1113, 또는 1000-1100개를 포함하는 영역을 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 5' hDYSF polynucleotide comprises, consists of, or consists essentially of a nucleotide sequence encoding an N-terminal hDYSF protein, wherein the N-terminal hDYSF protein is at least 1000 amino acids in length wherein the N-terminal hDYSF protein comprises a region comprising amino acid residues 999-1113, 999-1100, 1000-1113, or 1000-1100 of SEQ ID NO: 12. In some embodiments, a 5' hDYSF polynucleotide is at least 80%, 82%, 85%, 88%, comprises, consists of, or consists essentially of 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences; wherein the N-terminal hDYSF protein is at least 1000 amino acids in length, wherein the N-terminal hDYSF protein comprises amino acid residues 999-1113, 999-1100, 1000-1113, or 1000-1100 of SEQ ID NO: 12 contains the area In some embodiments, the 5' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 N-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성되되, 상기 N-말단 hDYSF 단백질은 서열번호 9의 아미노산 서열을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성된다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 5' hDYSF polynucleotide comprises, consists of, or consists essentially of a nucleotide sequence encoding an N-terminal hDYSF protein, wherein the N-terminal hDYSF protein comprises amino acids of SEQ ID NO: 9 It comprises, consists of, or consists essentially of a sequence. In some embodiments, the 5' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 N-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열의 전체 길이에 걸쳐 N-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열과 적어도 80%, 82%, 85%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성되되, 상기 N-말단 hDYSF 단백질은 서열번호 9의 아미노산 서열을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성된다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 N-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열의 전체 길이에 걸쳐 N-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열과 적어도 90% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성되되, 상기 N-말단 hDYSF 단백질은 서열번호 9의 아미노산 서열을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성된다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 N-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열의 전체 길이에 걸쳐 N-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열과 적어도 92% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성되되, 상기 N-말단 hDYSF 단백질은 서열번호 9의 아미노산 서열을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성된다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 N-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열의 전체 길이에 걸쳐 N-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열과 적어도 95% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성되되, 상기 N-말단 hDYSF 단백질은 서열번호 9의 아미노산 서열을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성된다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 N-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열의 전체 길이에 걸쳐 N-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열과 적어도 97% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성되되, 상기 N-말단 hDYSF 단백질은 서열번호 9의 아미노산 서열을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성된다. 일부 구현예에서, N-말단 hDYSF 단백질은 서열번호 9의 전체 길이에 걸쳐 서열번호 9의 아미노산 서열과 적어도 80%, 82%, 85%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 아미노산 서열을 포함한다. 일부 구현예에서, N-말단 hDYSF 단백질은 서열번호 9의 전체 길이에 걸쳐 서열번호 9의 아미노산 서열과 적어도 90% 동일한 아미노산 서열을 포함한다. 일부 구현예에서, N-말단 hDYSF 단백질은 서열번호 9의 전체 길이에 걸쳐 서열번호 9의 아미노산 서열과 적어도 92% 동일한 아미노산 서열을 포함한다. 일부 구현예에서, N-말단 hDYSF 단백질은 서열번호 9의 전체 길이에 걸쳐 서열번호 9의 아미노산 서열과 적어도 95% 동일한 아미노산 서열을 포함한다. 일부 구현예에서, N-말단 hDYSF 단백질은 서열번호 9의 전체 길이에 걸쳐 서열번호 9의 아미노산 서열과 적어도 97% 동일한 아미노산 서열을 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, a 5' hDYSF polynucleotide is at least 80%, 82%, 85%, 88%, comprises, consists of, or consists essentially of a nucleotide sequence that is 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical; Wherein the N-terminal hDYSF protein comprises, consists of, or consists essentially of the amino acid sequence of SEQ ID NO: 9. In some embodiments, the 5' hDYSF polynucleotide comprises or consists of a nucleotide sequence that is at least 90% identical to a nucleotide sequence encoding an N-terminal hDYSF protein over the entire length of the nucleotide sequence encoding the N-terminal hDYSF protein is, or consists essentially of, wherein the N-terminal hDYSF protein comprises, consists of, or consists essentially of the amino acid sequence of SEQ ID NO: 9. In some embodiments, the 5' hDYSF polynucleotide comprises or consists of a nucleotide sequence that is at least 92% identical to a nucleotide sequence encoding an N-terminal hDYSF protein over the entire length of the nucleotide sequence encoding the N-terminal hDYSF protein is, or consists essentially of, wherein the N-terminal hDYSF protein comprises, consists of, or consists essentially of the amino acid sequence of SEQ ID NO: 9. In some embodiments, the 5' hDYSF polynucleotide comprises or consists of a nucleotide sequence that is at least 95% identical to a nucleotide sequence encoding an N-terminal hDYSF protein over the entire length of the nucleotide sequence encoding the N-terminal hDYSF protein is, or consists essentially of, wherein the N-terminal hDYSF protein comprises, consists of, or consists essentially of the amino acid sequence of SEQ ID NO: 9. In some embodiments, the 5' hDYSF polynucleotide comprises or consists of a nucleotide sequence that is at least 97% identical to a nucleotide sequence encoding an N-terminal hDYSF protein over the entire length of the nucleotide sequence encoding the N-terminal hDYSF protein is, or consists essentially of, wherein the N-terminal hDYSF protein comprises, consists of, or consists essentially of the amino acid sequence of SEQ ID NO: 9. In some embodiments, the N-terminal hDYSF protein is at least 80%, 82%, 85%, 88%, 89%, 90%, 91%, 92% of the amino acid sequence of SEQ ID NO: 9 over the entire length of SEQ ID NO: 9 , 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical amino acid sequences. In some embodiments, the N-terminal hDYSF protein comprises an amino acid sequence that is at least 90% identical to the amino acid sequence of SEQ ID NO:9 over the entire length of SEQ ID NO:9. In some embodiments, the N-terminal hDYSF protein comprises an amino acid sequence that is at least 92% identical to the amino acid sequence of SEQ ID NO:9 over the entire length of SEQ ID NO:9. In some embodiments, the N-terminal hDYSF protein comprises an amino acid sequence that is at least 95% identical to the amino acid sequence of SEQ ID NO:9 over the entire length of SEQ ID NO:9. In some embodiments, the N-terminal hDYSF protein comprises an amino acid sequence that is at least 97% identical to the amino acid sequence of SEQ ID NO:9 over the entire length of SEQ ID NO:9. In some embodiments, the 5' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 6의 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 6의 전체 길이에 걸쳐 서열번호 6의 뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 6의 전체 길이에 걸쳐 서열번호 6의 뉴클레오티드 서열과 적어도 85% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 6의 전체 길이에 걸쳐 서열번호 6의 뉴클레오티드 서열과 적어도 90% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 5' hDYSF polynucleotide comprises the nucleotide sequence of SEQ ID NO:6. In some embodiments, the 5' hDYSF polynucleotide is at least 80%, 81%, 82%, 83%, 84%, 85%, 87%, 88% of the nucleotide sequence of SEQ ID NO: 6 over the entire length of SEQ ID NO: 6 , 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical nucleotide sequences. In some embodiments, the 5' hDYSF polynucleotide comprises a nucleotide sequence that is at least 85% identical to the nucleotide sequence of SEQ ID NO:6 over the entire length of SEQ ID NO:6. In some embodiments, the 5' hDYSF polynucleotide comprises a nucleotide sequence that is at least 90% identical to the nucleotide sequence of SEQ ID NO:6 over the entire length of SEQ ID NO:6. In some embodiments, the 5' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 15의 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 15의 전체 길이에 걸쳐 서열번호 6의 뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 15의 전체 길이에 걸쳐 서열번호 15의 뉴클레오티드 서열과 적어도 85% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 서열번호 15의 전체 길이에 걸쳐 서열번호 15의 뉴클레오티드 서열과 적어도 90% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 5' hDYSF polynucleotide comprises the nucleotide sequence of SEQ ID NO: 15. In some embodiments, the 5' hDYSF polynucleotide is at least 80%, 81%, 82%, 83%, 84%, 85%, 87%, 88% of the nucleotide sequence of SEQ ID NO: 6 over the entire length of SEQ ID NO: 15 , 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical nucleotide sequences. In some embodiments, the 5' hDYSF polynucleotide comprises a nucleotide sequence that is at least 85% identical to the nucleotide sequence of SEQ ID NO: 15 over the entire length of SEQ ID NO: 15. In some embodiments, the 5' hDYSF polynucleotide comprises a nucleotide sequence that is at least 90% identical to the nucleotide sequence of SEQ ID NO: 15 over the entire length of SEQ ID NO: 15. In some embodiments, the 5' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

3' 3' hDYSFhDYSF 폴리뉴클레오티드 polynucleotide

일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6780, 2750-6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-6725, 2754-6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, 또는 2754-6619개 사이의 영역의 3500-4100, 3500-4000, 3500-3900, 3500-3880, 3500-3870, 3600-4100, 3600-4000, 3600-3900, 3600-3880, 3600-3870, 3700-4100, 3700-4000, 3700-3900, 3700-3880, 3700-3870, 3800-4100, 3800-4000, 또는 3800-3900개 사이의 연속 뉴클레오티드를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6780, 2750-6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-6725, 2754-6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, 또는 2754-6619개 사이의 영역의 4100, 4000, 3900, 3880, 또는 3870개 또는 그 미만의 연속 뉴클레오티드를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 3' hDYSF polynucleotide comprises nucleotides 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700- 6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750, 67-250,6780 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-27 2754 3500-4100, 3500-4000, 3500-3900, 3500-3880, 3500-3870, 3600-4 in the range of 6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, or 2754-6619 , 3600-4000, 3600-3900, 3600-3880, 3600-3870, 3700-4100, 3700-4000, 3700-3900, 3700-3880, 3700-3870, 3800-4100, 3800-4000- or 38000- comprises, consists of, or consists essentially of consecutive nucleotides between In some embodiments, the 3' hDYSF polynucleotide comprises nucleotides 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700- 6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750, 67-250,6780 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-27 2754 6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, or 4100, 4000, 3900, 3880, or 3870 or less contiguous nucleotides of the region between, or consisting of consists of, or consists essentially of. In some embodiments, the 3' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6780, 2750-6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-6725, 2754-6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, 또는 2754-6619개 사이의 영역의 3500-4100, 3500-4000, 3500-3900, 3500-3880, 3500-3870, 3600-4100, 3600-4000, 3600-3900, 3600-3880, 3600-3870, 3700-4100, 3700-4000, 3700-3900, 3700-3880, 3700-3870, 3800-4100, 3800-4000, 또는 3800-3900개 사이의 연속 뉴클레오티드를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 3' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6780, 2750-6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-6725, 2754-6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, 또는 2754-6619개 사이의 영역의 전체 길이에 걸쳐 서열번호 11의 뉴클레오티드 서열에 대해 적어도 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, 99%, 또는 100%이다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6780, 2750-6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-6725, 2754-6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, 또는 2754-6619개 사이의 영역의 4100, 4000, 3900, 3880, 또는 3870개의 연속 뉴클레오티드를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 3' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6780, 2750-6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-6725, 2754-6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, 또는 2754-6619개 사이의 영역의 전체 길이에 걸쳐 서열번호 11의 뉴클레오티드 서열에 대해 적어도 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, 99%, 또는 100%이다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6780, 2750-6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-6725, 2754-6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, 또는 2754-6619개 사이의 영역의 전체 길이에 걸쳐 서열번호 11의 뉴클레오티드 서열에 대해 적어도 85%이다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6780, 2750-6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-6725, 2754-6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, 또는 2754-6619개 사이의 영역의 전체 길이에 걸쳐 서열번호 11의 뉴클레오티드 서열에 대해 적어도 90%이다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6780, 2750-6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-6725, 2754-6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, 또는 2754-6619개 사이의 영역의 전체 길이에 걸쳐 서열번호 11의 뉴클레오티드 서열에 대해 적어도 95%이다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 3' hDYSF polynucleotide comprises nucleotides 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700- 6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6750, 67-250,6780 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-27 2754 3500-4100, 3500-4000, 3500-3900, 3500-3880, 3500-3870, 10600-4 in the range of 6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, or 2754-6619 , 3600-4000, 3600-3900, 3600-3880, 3600-3870, 3700-4100, 3700-4000, 3700-3900, 3700-3880, 3700-3870, 3800-4100, 3800-4000-, or 3800-38000 wherein the 3' hDYSF polynucleotide comprises nucleotides 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600 of SEQ ID NO: 11 -6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6700, 61620 , 2750-6850, 2750-6800, 2750-6780, 2750 -6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-2754, 2754-27540 , 2754-6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, or at least 80%, 82%, 85 relative to the nucleotide sequence of SEQ ID NO: 11 over the entire length of the region between %, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, 99%, or 100%. In some embodiments, the 3' hDYSF polynucleotide comprises nucleotides 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700- 6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6750, 67-250,6780 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-27 2754 comprises, consists of, or consists of 4100, 4000, 3900, 3880, or 3870 contiguous nucleotides of the region between 6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, or 2754-6619, or Consisting essentially of these, wherein the 3' hDYSF polynucleotide comprises nucleotides 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800 of SEQ ID NO: 11 ; -6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-2754, 2754-27540 , 2754-6700, 2754-6680, 2754-6650, at least 80%, 82%, 85%, 87%, 88%, 90%, 92% of the nucleotide sequence of SEQ ID NO: 11 over the entire length of the region between 2754-6625, 2754-6620, or 2754-6619 , 95%, 96%, 97%, 98%, 99%, or 100%. In some embodiments, the 3' hDYSF polynucleotide comprises nucleotides 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700- 6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6750, 67-250,6780 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-27 2754 6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, or at least 85% relative to the nucleotide sequence of SEQ ID NO: 11 over the entire length of the region between 2754-6619. In some embodiments, the 3' hDYSF polynucleotide comprises nucleotides 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700- 6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6750, 67-250,6780 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-27 2754 6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, or at least 90% relative to the nucleotide sequence of SEQ ID NO: 11 over the entire length of the region between 2754-6619. In some embodiments, the 3' hDYSF polynucleotide comprises nucleotides 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700- 6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6750, 67-250,6780 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-27 2754 6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, or at least 95% relative to the nucleotide sequence of SEQ ID NO: 11 over the entire length of the region between 2754-6619. In some embodiments, the 3' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6780, 2750-6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-6725, 2754-6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, 또는 2754-6619개 사이의 영역의 3500-4100, 3500-4000, 3500-3900, 3500-3880, 3500-3870, 3600-4100, 3600-4000, 3600-3900, 3600-3880, 3600-3870, 3700-4100, 3700-4000, 3700-3900, 3700-3880, 3700-3870, 3800-4100, 3800-4000, 또는 3800-3900개 사이의 연속 뉴클레오티드를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 3' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6780, 2750-6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-6725, 2754-6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, 또는 2754-6619개 사이의 영역에 30, 25, 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1개 또는 그 미만의 연속 뉴클레오티드 미스매치를 포함한다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6780, 2750-6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-6725, 2754-6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, 또는 2754-6619개 사이의 영역의 4100, 4000, 3900, 3880, 또는 3870개 또는 그 미만의 연속 뉴클레오티드를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 3' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6780, 2750-6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-6725, 2754-6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, 또는 2754-6619개 사이의 영역에 30, 25, 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1개 또는 그 미만의 연속 뉴클레오티드 미스매치를 포함한다. 일부 구현예에서, 5' 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6780, 2750-6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-6725, 2754-6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, 또는 2754-6619개 사이의 영역에서 15개 또는 그 미만의 뉴클레오티드 미스매치를 포함한다. 일부 구현예에서, 5' 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6780, 2750-6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-6725, 2754-6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, 또는 2754-6619개 사이의 영역에서 10개 또는 그 미만의 뉴클레오티드 미스매치를 포함한다. 일부 구현예에서, 5' 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6780, 2750-6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-6725, 2754-6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, 또는 2754-6619개 사이의 영역에서 5개 또는 그 미만의 뉴클레오티드 미스매치를 포함한다. 일부 구현예에서, 5' 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6780, 2750-6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-6725, 2754-6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, 또는 2754-6619개 사이의 영역에서 1개의 뉴클레오티드 미스매치를 포함한다. 일부 구현예에서, 5' 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6780, 2750-6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-6725, 2754-6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, 또는 2754-6619개 사이의 영역에서 적어도 1개의 뉴클레오티드 미스매치를 포함한다. 일부 구현예에서, 5' 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 2754-6619 사이의 영역에서 적어도 1개의 뉴클레오티드 미스매치를 포함한다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 3' hDYSF polynucleotide comprises nucleotides 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700- 6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6750, 67-250,6780 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-27 2754 3500-4100, 3500-4000, 3500-3900, 3500-3880, 3500-3870, 10600-4 in the range of 6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, or 2754-6619 , 3600-4000, 3600-3900, 3600-3880, 3600-3870, 3700-4100, 3700-4000, 3700-3900, 3700-3880, 3700-3870, 3800-4100, 3800-4000-, or 3800-38000 wherein the 3' hDYSF polynucleotide comprises nucleotides 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600 of SEQ ID NO: 11 -6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6700, 61620 , 2750-6850, 2750-6800, 2750-6780, 2750 -6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-2754, 2754-27540 , 30, 25, 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1 or less consecutive nucleotide mismatches. In some embodiments, the 3' hDYSF polynucleotide comprises nucleotides 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700- 6780, 2700-6750, 2700-6725, 2700-6700, 2700-6680, 2700-6650, 2700-6625, 2700-6620, 2700-6619, 2750-6850, 2750-6800, 2750-6750, 67-250,6780 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-27 2754 6700, 2754-6680, 2754-6650, 2754-6625, 2754-6620, or 4100, 4000, 3900, 3880, or 3870 or less contiguous nucleotides of the region between, or consisting of Consisting of, or consisting essentially of, wherein the 3' hDYSF polynucleotide comprises nucleotides 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850 of SEQ ID NO: 11 ; -6780, 2750-6750, 2750-6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-66780 , 2754-6725, 2754-6700, 2754-6680, 27 30, 25, 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8 in any range between 54-6650, 2754-6625, 2754-6620, or 2754-6619 , 7, 6, 5, 4, 3, 2, 1 or less consecutive nucleotide mismatches. In some embodiments, the 5' polynucleotide comprises nucleotides 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780 of SEQ ID NO: 11 ; -6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-670725 , 2754-6680, 2754-6650, 2754-6625, 2754-6620, or 2754-6619 in the region between 15 or less nucleotide mismatches. In some embodiments, the 5' polynucleotide comprises nucleotides 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780 of SEQ ID NO: 11 ; -6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-670725 , 2754-6680, 2754-6650, 2754-6625, 2754-6620, or 2754-6619 in the region between 10 or less nucleotide mismatches. In some embodiments, the 5' polynucleotide comprises nucleotides 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780 of SEQ ID NO: 11 ; -6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-670725 , 2754-6680, 2754-6650, 2754-6625, 2754-6620, or 2754-6619 in the region between 5 or less nucleotide mismatches. In some embodiments, the 5' polynucleotide comprises nucleotides 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780 of SEQ ID NO: 11 ; -6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-670725 , 2754-6680, 2754-6650, 2754-6625, 2754-6620, or 2754-6619. In some embodiments, the 5' polynucleotide comprises nucleotides 2600-6850, 2600-6800, 2600-6780, 2600-6750, 2600-6725, 2600-6700, 2700-6850, 2700-6800, 2700-6780 of SEQ ID NO: 11 ; -6725, 2750-6700, 2750-6680, 2750-6650, 2750-6625, 2750-6620, 2750-6619, 2754-6850, 2754-6800, 2754-6780, 2754-6750, 2754-670725 , 2754-6680, 2754-6650, 2754-6625, 2754-6620, or 2754-6619. In some embodiments, the 5' polynucleotide comprises at least one nucleotide mismatch in the region between nucleotides 2754-6619 of SEQ ID NO: 11. In some embodiments, the 3' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 11의 3500-4100, 3500-4000, 3500-3900, 3500-3880, 3500-3870, 3600-4100, 3600-4000, 3600-3900, 3600-3880, 3600-3870, 3700-4100, 3700-4000, 3700-3900, 3700-3880, 3700-3870, 3800-4100, 3800-4000, 또는 3800-3900개 사이의 연속 뉴클레오티드를 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성되되, 상기 5' hDSYF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 위치 6620-6914, 6620-6900, 6620-6800, 6620-6700, 6700-6914, 6700-6800, 6800-6914, 또는 6800-6900로 구성되는 영역을 포함하지 않는다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 위치 6620-6914로 구성되는 영역을 포함하지 않는다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 3' hDYSF polynucleotide is 3500-4100, 3500-4000, 3500-3900, 3500-3880, 3500-3870, 3600-4100, 3600-4000, 3600-3900, 3600-3880 of SEQ ID NO: 11 , 3600-3870, 3700-4100, 3700-4000, 3700-3900, 3700-3880, 3700-3870, 3800-4100, 3800-4000, or 3800-3900 contiguous nucleotides, or , or consisting essentially of these, wherein the 5' hDSYF polynucleotide comprises nucleotide positions 6620-6914, 6620-6900, 6620-6800, 6620-6700, 6700-6914, 6700-6800, 6800-6914 of SEQ ID NO: 11; or the region consisting of 6800-6900. In some embodiments, the 3' hDYSF polynucleotide does not comprise a region consisting of nucleotide positions 6620-6914 of SEQ ID NO:11. In some embodiments, the 3' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 11의 4100, 4000, 3900, 3880, 또는 3870개 또는 그 미만의 연속 뉴클레오티드를 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성되되, 상기 3' hDSYF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 위치 6620-6914, 6620-6900, 6620-6800, 6620-6700, 6700-6914, 6700-6800, 6800-6914, 또는 6800-6900으로 구성되는 영역을 포함하지 않는다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 11의 뉴클레오티드 위치 6620-6914로 구성되는 영역을 포함하지 않는다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 3' hDYSF polynucleotide comprises, consists of, or consists essentially of 4100, 4000, 3900, 3880, or 3870 or fewer contiguous nucleotides of SEQ ID NO: 11, The 3' hDSYF polynucleotide comprises a region consisting of nucleotide positions 6620-6914, 6620-6900, 6620-6800, 6620-6700, 6700-6914, 6700-6800, 6800-6914, or 6800-6900 of SEQ ID NO: 11 I never do that. In some embodiments, the 3' hDYSF polynucleotide does not comprise a region consisting of nucleotide positions 6620-6914 of SEQ ID NO:11. In some embodiments, the 3' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 2의 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 2의 전체 길이에 걸쳐 서열번호 2의 뉴클레오티드 서열과 적어도 80%, 82%, 85%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 2의 전체 길이에 걸쳐 서열번호 2의 뉴클레오티드 서열과 적어도 90% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 2의 전체 길이에 걸쳐 서열번호 2의 뉴클레오티드 서열과 적어도 92% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 2의 전체 길이에 걸쳐 서열번호 2의 뉴클레오티드 서열과 적어도 95% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 3' hDYSF polynucleotide comprises, consists of, or consists essentially of the nucleotide sequence of SEQ ID NO: 2. In some embodiments, the 3' hDYSF polynucleotide is at least 80%, 82%, 85%, 88%, 89%, 90%, 91%, 92% of the nucleotide sequence of SEQ ID NO: 2 over the entire length of SEQ ID NO: 2 , 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences, consisting of, or consisting essentially of. In some embodiments, the 3' hDYSF polynucleotide comprises, consists of, or consists essentially of a nucleotide sequence that is at least 90% identical to the nucleotide sequence of SEQ ID NO:2 over its entire length. In some embodiments, the 3' hDYSF polynucleotide comprises, consists of, or consists essentially of a nucleotide sequence that is at least 92% identical to the nucleotide sequence of SEQ ID NO:2 over its entire length. In some embodiments, the 3' hDYSF polynucleotide comprises, consists of, or consists essentially of a nucleotide sequence that is at least 95% identical to the nucleotide sequence of SEQ ID NO:2 over its entire length. In some embodiments, the 3' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 14의 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 14의 전체 길이에 걸쳐 서열번호 14의 뉴클레오티드 서열과 적어도 80%, 82%, 85%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 14의 전체 길이에 걸쳐 서열번호 14의 뉴클레오티드 서열과 적어도 90% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 14의 전체 길이에 걸쳐 서열번호 14의 뉴클레오티드 서열과 적어도 92% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 14의 전체 길이에 걸쳐 서열번호 14의 뉴클레오티드 서열과 적어도 95% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 3' hDYSF polynucleotide comprises, consists of, or consists essentially of the nucleotide sequence of SEQ ID NO: 14. In some embodiments, the 3' hDYSF polynucleotide is at least 80%, 82%, 85%, 88%, 89%, 90%, 91%, 92% of the nucleotide sequence of SEQ ID NO: 14 over the entire length of SEQ ID NO: 14 , 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences, consisting of, or consisting essentially of. In some embodiments, the 3' hDYSF polynucleotide comprises, consists of, or consists essentially of a nucleotide sequence that is at least 90% identical to the nucleotide sequence of SEQ ID NO: 14 over the entire length of SEQ ID NO: 14. In some embodiments, the 3' hDYSF polynucleotide comprises, consists of, or consists essentially of a nucleotide sequence that is at least 92% identical to the nucleotide sequence of SEQ ID NO: 14 over the entire length of SEQ ID NO: 14. In some embodiments, the 3' hDYSF polynucleotide comprises, consists of, or consists essentially of a nucleotide sequence that is at least 95% identical to the nucleotide sequence of SEQ ID NO: 14 over the entire length of SEQ ID NO: 14. In some embodiments, the 3' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 야생형 hDSYF 단백질의 C-말단 영역을 포함하는 hDYSF 단백질의 단편을 암호화하는 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 야생형 hDSYF 단백질의 C-말단 영역을 포함하는 hDYSF 단백질의 단편은 C-말단 hDYSF 단백질로 지칭된다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 C-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 C-말단 hDYSF 단백질은 서열번호 12의 아미노산 잔기 750-2080, 750-2000, 750-1900, 775-2080, 775-2000, 775-1900, 794-2080, 794-2000, 또는 794-1900개를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되는 영역을 포함한다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 C-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열의 전체 길이에 걸쳐 C-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열과 적어도 80%, 82%, 85%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 C-말단 hDYSF 단백질은 서열번호 12의 아미노산 잔기 750-2080, 750-2000, 750-1900, 775-2080, 775-2000, 775-1900, 794-2080, 794-2000, 또는 794-1900개를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되는 영역을 포함한다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, a 3' hDYSF polynucleotide comprises, consists of, or consists essentially of a nucleotide sequence encoding a fragment of an hDYSF protein comprising a C-terminal region of a wild-type hDSYF protein. In some embodiments, a fragment of a hDYSF protein comprising the C-terminal region of a wild-type hDSYF protein is referred to as a C-terminal hDYSF protein. In some embodiments, the 3' hDYSF polynucleotide comprises, consists of, or consists essentially of a nucleotide sequence encoding a C-terminal hDYSF protein, wherein the C-terminal hDYSF protein comprises amino acids of SEQ ID NO: 12 comprises, consists of, or consists essentially of residues 750-2080, 750-2000, 750-1900, 775-2080, 775-2000, 775-1900, 794-2080, 794-2000, or 794-1900; It includes a region composed of them. In some embodiments, the 3' hDYSF polynucleotide is at least 80%, 82%, 85%, 88%, comprises, consists of, or consists essentially of 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences; wherein the C-terminal hDYSF protein comprises amino acid residues 750-2080, 750-2000, 750-1900, 775-2080, 775-2000, 775-1900, 794-2080, 794-2000, or 794 of SEQ ID NO: 12 -Includes a region that contains, consists of, or consists essentially of 1900. In some embodiments, the 3' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 C-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 C-말단 hDYSF 단백질은 길이가 1400, 1350, 1325, 1300, 1290, 또는 1287개 또는 그 미만의 아미노산을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되고, 상기 C-말단 hDYSF 단백질은 서열번호 12의 아미노산 잔기 750-2080, 750-2000, 750-1900, 775-2080, 775-2000, 775-1900, 794-2080, 794-2000, 또는 794-1900개를 포함하는 영역을 포함한다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 C-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 C-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열과 적어도 80%, 82%, 85%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성되되, 상기 C-말단 hDYSF 단백질은 길이가 1400, 1350, 1325, 1300, 1290, 또는 1287개 또는 그 미만의 아미노산이고, 상기 C-말단 hDYSF 단백질은 서열번호 12의 아미노산 잔기 750-2080, 750-2000, 750-1900, 775-2080, 775-2000, 775-1900, 794-2080, 794-2000, 또는 794-1900개를 포함하는 영역을 포함한다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 3' hDYSF polynucleotide comprises, consists of, or consists essentially of a nucleotide sequence encoding a C-terminal hDYSF protein, wherein the C-terminal hDYSF protein has a length of 1400, 1350 , 1325, 1300, 1290, or 1287 or less amino acids, wherein the C-terminal hDYSF protein comprises amino acid residues 750-2080, 750 of SEQ ID NO: 12 -2000, 750-1900, 775-2080, 775-2000, 775-1900, 794-2080, 794-2000, or 794-1900. In some embodiments, the 3' hDYSF polynucleotide is at least 80%, 82%, 85%, 88% of the nucleotide sequence encoding the C-terminal hDYSF protein over the entire length of the nucleotide sequence encoding the C-terminal hDYSF protein. , 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences, consisting of, or consisting essentially of wherein the C-terminal hDYSF protein is 1400, 1350, 1325, 1300, 1290, or 1287 amino acids or less in length, and the C-terminal hDYSF protein comprises amino acid residues 750-2080 of SEQ ID NO: 12; 750-2000, 750-1900, 775-2080, 775-2000, 775-1900, 794-2080, 794-2000, or 794-1900. In some embodiments, the 3' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 C-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성되되, 상기 C-말단 hDYSF 단백질은 길이가 1400, 1350, 1325, 1300, 1290, 또는 1287개 또는 그 미만의 아미노산을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성되되, 여기서 상기 C-말단 hDYSF 단백질은 서열번호 12의 아미노산 잔기 678-793, 678-750, 678-725, 또는 678-700개를 포함하는 영역을 포함하지 않는다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 3' hDYSF polynucleotide comprises, consists of, or consists essentially of a nucleotide sequence encoding a C-terminal hDYSF protein, wherein the C-terminal hDYSF protein has a length of 1400, 1350 , 1325, 1300, 1290, or 1287 or less amino acids comprising, consisting of, or consisting essentially of, wherein the C-terminal hDYSF protein comprises amino acid residues 678-793 of SEQ ID NO: 12; It does not include regions containing 678-750, 678-725, or 678-700. In some embodiments, the 3' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 C-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성되되, 상기 C-말단 hDYSF 단백질은 서열번호 10의 아미노산 서열을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성된다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 3' hDYSF polynucleotide comprises, consists of, or consists essentially of a nucleotide sequence encoding a C-terminal hDYSF protein, wherein the C-terminal hDYSF protein comprises amino acids of SEQ ID NO: 10 It comprises, consists of, or consists essentially of a sequence. In some embodiments, the 3' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 C-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 C-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열과 적어도 80%, 82%, 85%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성되되, 상기 C-말단 hDYSF 단백질은 서열번호 10의 아미노산 서열을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성된다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 C-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 C-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열과 적어도 90% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 C-말단 hDYSF 단백질은 서열번호 10의 아미노산 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 C-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 C-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열과 적어도 92% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 C-말단 hDYSF 단백질은 서열번호 10의 아미노산 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 C-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 C-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열과 적어도 95% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 C-말단 hDYSF 단백질은 서열번호 10의 아미노산 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 C-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 C-말단 hDYSF 단백질을 암호화하는 뉴클레오티드 서열과 적어도 97% 동일한 뉴클레오티드 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되되, 상기 C-말단 hDYSF 단백질은 서열번호 10의 아미노산 서열을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, C-말단 hDYSF 단백질은 서열번호 10의 전체 길이에 걸쳐 서열번호 10의 아미노산 서열과 적어도 80%, 82%, 85%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 아미노산 서열을 포함한다. 일부 구현예에서, C-말단 hDYSF 단백질은 서열번호 10의 전체 길이에 걸쳐 서열번호 10의 아미노산 서열과 적어도 90% 동일한 아미노산 서열을 포함한다. 일부 구현예에서, C-말단 hDYSF 단백질은 서열번호 10의 전체 길이에 걸쳐 서열번호 10의 아미노산 서열과 적어도 92% 동일한 아미노산 서열을 포함한다. 일부 구현예에서, C-말단 hDYSF 단백질은 서열번호 10의 전체 길이에 걸쳐 서열번호 10의 아미노산 서열과 적어도 95% 동일한 아미노산 서열을 포함한다. 일부 구현예에서, C-말단 hDYSF 단백질은 서열번호 10의 전체 길이에 걸쳐 서열번호 10의 아미노산 서열과 적어도 97% 동일한 아미노산 서열을 포함한다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 3' hDYSF polynucleotide is at least 80%, 82%, 85%, 88% of the nucleotide sequence encoding the C-terminal hDYSF protein over the entire length of the nucleotide sequence encoding the C-terminal hDYSF protein. , 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences, consisting of, or consisting essentially of Wherein the C-terminal hDYSF protein comprises, consists of, or consists essentially of the amino acid sequence of SEQ ID NO: 10. In some embodiments, the 3' hDYSF polynucleotide comprises or consists of a nucleotide sequence that is at least 90% identical to the nucleotide sequence encoding the C-terminal hDYSF protein over the entire length of the nucleotide sequence encoding the C-terminal hDYSF protein. Consisting of, or consisting essentially of, wherein the C-terminal hDYSF protein comprises, consists of, or consists essentially of the amino acid sequence of SEQ ID NO: 10. In some embodiments, the 3' hDYSF polynucleotide comprises or consists of a nucleotide sequence that is at least 92% identical to the nucleotide sequence encoding the C-terminal hDYSF protein over the entire length of the nucleotide sequence encoding the C-terminal hDYSF protein. Consisting of, or consisting essentially of, wherein the C-terminal hDYSF protein comprises, consists of, or consists essentially of the amino acid sequence of SEQ ID NO: 10. In some embodiments, the 3' hDYSF polynucleotide comprises or consists of a nucleotide sequence that is at least 95% identical to the nucleotide sequence encoding the C-terminal hDYSF protein over the entire length of the nucleotide sequence encoding the C-terminal hDYSF protein. Consisting of, or consisting essentially of, wherein the C-terminal hDYSF protein comprises, consists of, or consists essentially of the amino acid sequence of SEQ ID NO: 10. In some embodiments, the 3' hDYSF polynucleotide comprises, or consists of, a nucleotide sequence that is at least 97% identical to a nucleotide sequence encoding a C-terminal hDYSF protein over the entire length of the nucleotide sequence encoding the C-terminal hDYSF protein. Consisting of, or consisting essentially of, wherein the C-terminal hDYSF protein comprises, consists of, or consists essentially of the amino acid sequence of SEQ ID NO: 10. In some embodiments, the C-terminal hDYSF protein is at least 80%, 82%, 85%, 88%, 89%, 90%, 91%, 92% of the amino acid sequence of SEQ ID NO: 10 over the entire length of SEQ ID NO: 10 , 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical amino acid sequences. In some embodiments, the C-terminal hDYSF protein comprises an amino acid sequence that is at least 90% identical to the amino acid sequence of SEQ ID NO: 10 over the entire length of SEQ ID NO: 10. In some embodiments, the C-terminal hDYSF protein comprises an amino acid sequence that is at least 92% identical to the amino acid sequence of SEQ ID NO: 10 over the entire length of SEQ ID NO: 10. In some embodiments, the C-terminal hDYSF protein comprises an amino acid sequence that is at least 95% identical to the amino acid sequence of SEQ ID NO: 10 over the entire length of SEQ ID NO: 10. In some embodiments, the C-terminal hDYSF protein comprises an amino acid sequence that is at least 97% identical to the amino acid sequence of SEQ ID NO: 10 over the entire length of SEQ ID NO: 10. In some embodiments, the 3' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 8의 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 8의 전체 길이에 걸쳐 서열번호 8의 뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 8의 전체 길이에 걸쳐 서열번호 8의 뉴클레오티드 서열과 적어도 85% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 8의 전체 길이에 걸쳐 서열번호 8의 뉴클레오티드 서열과 적어도 90% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다. In some embodiments, the 3' hDYSF polynucleotide comprises the nucleotide sequence of SEQ ID NO:8. In some embodiments, the 3' hDYSF polynucleotide is at least 80%, 81%, 82%, 83%, 84%, 85%, 87%, 88% of the nucleotide sequence of SEQ ID NO: 8 over the entire length of SEQ ID NO: 8 , 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical nucleotide sequences. In some embodiments, the 3' hDYSF polynucleotide comprises a nucleotide sequence that is at least 85% identical to the nucleotide sequence of SEQ ID NO:8 over the entire length of SEQ ID NO:8. In some embodiments, the 3' hDYSF polynucleotide comprises a nucleotide sequence that is at least 90% identical to the nucleotide sequence of SEQ ID NO:8 over the entire length of SEQ ID NO:8. In some embodiments, the 3' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 16의 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 16의 전체 길이에 걸쳐 서열번호 16의 뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 16의 전체 길이에 걸쳐 서열번호 16의 뉴클레오티드 서열과 적어도 85% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 서열번호 16의 전체 길이에 걸쳐 서열번호 16의 뉴클레오티드 서열과 적어도 90% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다.In some embodiments, the 3' hDYSF polynucleotide comprises the nucleotide sequence of SEQ ID NO: 16. In some embodiments, the 3' hDYSF polynucleotide is at least 80%, 81%, 82%, 83%, 84%, 85%, 87%, 88% of the nucleotide sequence of SEQ ID NO: 16 over the entire length of SEQ ID NO: 16 , 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical nucleotide sequences. In some embodiments, the 3' hDYSF polynucleotide comprises a nucleotide sequence that is at least 85% identical to the nucleotide sequence of SEQ ID NO: 16 over the entire length of SEQ ID NO: 16. In some embodiments, the 3' hDYSF polynucleotide comprises a nucleotide sequence that is at least 90% identical to the nucleotide sequence of SEQ ID NO: 16 over the entire length of SEQ ID NO: 16. In some embodiments, the 3' hDYSF polynucleotide does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

일부 구현예에서, 5' hDSYF 폴리뉴클레오티드 및 3' hDSYF 폴리뉴클레오티드의 서열은 적어도 500, 600, 700, 800, 900, 950, 960, 또는 963개의 뉴클레오티드의 중첩(overlap)을 포함한다. 일부 구현예에서, N-말단 hDSYF 단백질 및 C-말단 hDSYF 단백질의 서열은 적어도 50, 100, 150, 200, 250, 300 또는 320개 아미노산의 중첩을 포함한다.In some embodiments, the sequences of the 5' hDSYF polynucleotide and the 3' hDSYF polynucleotide comprise an overlap of at least 500, 600, 700, 800, 900, 950, 960, or 963 nucleotides. In some embodiments, the sequence of the N-terminal hDSYF protein and the C-terminal hDSYF protein comprises an overlap of at least 50, 100, 150, 200, 250, 300 or 320 amino acids.

역전된 말단 inverted end 반복부repetition

일부 구현예에서, 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 벡터 시스템, 바이러스 패키징 시스템, 세포 및 조성물은 하나 이상의 역전된 말단 반복부(ITRS)를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되는 뉴클레오티드 서열을 더 포함한다. 일부 구현예에서, 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 벡터 시스템, 바이러스 패키징 시스템, 세포 및 조성물은 2개, 3개, 4개, 5개 또는 6개 이상의 ITR을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성되는 2개, 3개, 4개, 5개 또는 6개 이상의 뉴클레오티드 서열을 더 포함한다. 일부 구현예에서, 2개 이상의 ITR은 동일하다. 일부 구현예에서, 2개 이상의 ITR은 상이하다.In some embodiments, polynucleotides, plasmids, viral vectors, vector systems, viral packaging systems, cells, and compositions comprise, consist of, or consist essentially of one or more inverted terminal repeats (ITRS). It further includes a nucleotide sequence. In some embodiments, polynucleotides, plasmids, viral vectors, vector systems, viral packaging systems, cells and compositions comprise, consist of, or consist of 2, 3, 4, 5, 6 or more ITRs; and a sequence of 2, 3, 4, 5, 6 or more nucleotides consisting essentially of them. In some embodiments, two or more ITRs are the same. In some embodiments, two or more ITRs are different.

일부 구현예에서, 재조합 폴리뉴클레오티드는 2개 이상의 ITR에 의해 플랭킹된다. 일부 구현예에서, 5' hDYSF 폴리뉴클레오티드는 제 1 쌍의 ITR에 의해 플랭킹된다. 일부 구현예에서, 3' hDYSF 폴리뉴클레오티드는 제 2 쌍의 ITR에 의해 플랭킹된다. 일부 구현예에서, 제 1 쌍의 ITR 내의 ITR은 동일하다. 일부 구현예에서, 제 1 쌍의 ITR 내의 ITR은 상이하다. 일부 구현예에서, 제 2 쌍의 ITR 내의 ITR은 동일하다. 일부 구현예에서, 제 2 쌍의 ITR 내의 ITR은 상이하다. 일부 구현예에서, 제 1 쌍의 ITR 내의 ITR은 제 2 쌍의 ITR 내의 ITR과 동일하다. 일부 구현예에서, 제 1 쌍의 ITR 내의 적어도 1개의 ITR은 제 2 쌍의 ITR 내의 적어도 1개의 ITR과 동일하다. 일부 구현예에서, 제 1 쌍의 ITR 내의 ITR은 제 2 쌍의 ITR 내의 ITR과 상이하다. 일부 구현예에서, 제 1 쌍의 ITR 내의 적어도 1개의 ITR은 제 2 쌍의 ITR 내의 적어도 1개의 ITR과 상이하다.In some embodiments, a recombinant polynucleotide is flanked by two or more ITRs. In some embodiments, a 5' hDYSF polynucleotide is flanked by a first pair of ITRs. In some embodiments, the 3' hDYSF polynucleotide is flanked by a second pair of ITRs. In some embodiments, the ITRs within the first pair of ITRs are the same. In some embodiments, the ITRs within the first pair of ITRs are different. In some embodiments, the ITRs within the second pair of ITRs are the same. In some embodiments, the ITRs within the second pair of ITRs are different. In some implementations, the ITRs in the ITRs of the first pair are identical to the ITRs in the ITRs of the second pair. In some implementations, at least one ITR in the first pair of ITRs is identical to at least one ITR in the second pair of ITRs. In some implementations, the ITRs in the first pair of ITRs are different from the ITRs in the second pair of ITRs. In some implementations, at least one ITR in the first pair of ITRs is different from at least one ITR in the second pair of ITRs.

일부 구현예에서, ITR은 바이러스 ITR이다. 일부 구현예에서, ITR은 AAV ITR이다. 일부 구현예에서, AAV ITR은 AAV 혈청형 AAVrh20, AAV-1, AAV-2, AAV-3, AAV-4, AAV-5, AAV-6, AAV-7, AAVrh74, AAV-8, AAV-9, AAV-10, AAVrh10, AAV-11, AAV-12 및 AAV-13 중의 적어도 하나로부터의 ITR로부터 선택된다. 일부 구현예에서, AAV ITR은 AAV2 ITR이다. 일부 구현예에서, AAV ITR은 AAV5 ITR이다. AAV1-6에 대한 ITR 서열은 예를 들어 문헌 「Grimm 등, J Virol80(1):426-39, 2006」에서 찾아볼 수 있으며, 상기 문헌은 그 전체가 참조로 포함된다. In some embodiments, the ITR is a viral ITR. In some embodiments, the ITR is an AAV ITR. In some embodiments, an AAV ITR is an AAV serotype AAVrh20, AAV-1, AAV-2, AAV-3, AAV-4, AAV-5, AAV-6, AAV-7, AAVrh74, AAV-8, AAV-9 , ITRs from at least one of AAV-10, AAVrh10, AAV-11, AAV-12 and AAV-13. In some embodiments, an AAV ITR is an AAV2 ITR. In some embodiments, the AAV ITR is an AAV5 ITR. ITR sequences for AAV1-6 can be found, for example, in Grimm et al., J Virol 80(1):426-39, 2006, which is incorporated by reference in its entirety.

일부 구현예에서, 재조합 폴리뉴클레오티드는 역전된 말단 반복부(ITR) 이외의 AAV 서열을 포함하지 않는다.In some embodiments, the recombinant polynucleotide does not contain AAV sequences other than inverted terminal repeats (ITRs).

일부 구현예에서, 재조합 폴리뉴클레오티드는 역전된 말단 반복부(ITR) 이외의 바이러스 서열을 포함하지 않는다.In some embodiments, the recombinant polynucleotide contains no viral sequences other than inverted terminal repeats (ITRs).

일부 구현예에서, ITR은 서열번호 3의 뉴클레오티드 서열을 포함한다. 일부 구현예에서, ITR은 서열번호 3의 전체 길이에 걸쳐 서열번호 3의 뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, ITR은 서열번호 3의 뉴클레오티드 서열을 포함한다. 일부 구현예에서, ITR은 서열번호 3의 전체 길이에 걸쳐 서열번호 3의 뉴클레오티드 서열과 적어도 90% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, ITR은 서열번호 3의 전체 길이에 걸쳐 서열번호 3의 뉴클레오티드 서열과 적어도 95% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, ITR은 서열번호 3의 뉴클레오티드 서열과 10, 9, 8, 7, 6, 5, 4, 3, 2, 또는 1개 또는 그 미만의 뉴클레오티드 미스매치를 포함하는 뉴클레오티드 서열을 포함한다. 일부 구현예에서, ITR은 서열번호 3의 뉴클레오티드 서열과 5개 또는 그 미만의 뉴클레오티드 미스매치를 포함하는 뉴클레오티드 서열을 포함한다.In some embodiments, the ITR comprises the nucleotide sequence of SEQ ID NO:3. In some embodiments, the ITR is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences. In some embodiments, the ITR comprises the nucleotide sequence of SEQ ID NO:3. In some embodiments, the ITR comprises a nucleotide sequence that is at least 90% identical to the nucleotide sequence of SEQ ID NO:3 over the entire length of SEQ ID NO:3. In some embodiments, the ITR comprises a nucleotide sequence that is at least 95% identical to the nucleotide sequence of SEQ ID NO:3 over the entire length of SEQ ID NO:3. In some embodiments, the ITR comprises a nucleotide sequence comprising 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 or less nucleotide mismatches with the nucleotide sequence of SEQ ID NO: 3 . In some embodiments, the ITR comprises a nucleotide sequence comprising 5 or fewer nucleotide mismatches with the nucleotide sequence of SEQ ID NO:3.

프로모터promoter

일부 구현예에서, 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 벡터 시스템, 바이러스 패키징 시스템, 세포, 및 조성물은 하나 이상의 프로모터를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되는 뉴클레오티드 서열을 더 포함한다. 일부 구현예에서, 프로모터는 진핵 프로모터이다. 진핵 프로모터의 예는 사이토메갈로바이러스(CMV) 프로모터, 신장 인자 1 알파(EF1a) 프로모터, CAG 프로모터, 포스포글리세레이트 키나제 유전자(PGK) 프로모터, 테트라사이클린 반응 요소(TRE) 프로모터, 인간 U6 핵(U6) 프로모터, 및 UAS 프로모터를 포함하지만, 이에 제한되지 않는다. 일부 구현예에서, 프로모터는 포유동물 프로모터이다. 일부 구현예에서, 프로모터는 구성적 프로모터이다. 일부 구현예에서, 프로모터는 유도성 프로모터(inducible promoter)이다.In some embodiments, the polynucleotides, plasmids, viral vectors, vector systems, viral packaging systems, cells, and compositions further comprise a nucleotide sequence comprising, consisting of, or consisting essentially of one or more promoters. . In some embodiments, the promoter is a eukaryotic promoter. Examples of eukaryotic promoters are the cytomegalovirus (CMV) promoter, elongation factor 1 alpha (EF1a) promoter, CAG promoter, phosphoglycerate kinase gene (PGK) promoter, tetracycline response element (TRE) promoter, human U6 nuclear (U6 ) promoter, and the UAS promoter. In some embodiments, the promoter is a mammalian promoter. In some embodiments, a promoter is a constitutive promoter. In some embodiments, the promoter is an inducible promoter.

일부 구현예에서, 프로모터는 조직-특이적 프로모터이다. 조직의 예는 근육, 상피, 결합, 및 신경 조직을 포함하지만, 이에 제한되지 않는다. 조직-특이적 프로모터의 예는 B29 프로모터, CD14 프로모터, CD43 프로모터, CD45 프로모터, CD68 프로모터, 데스민 프로모터, 엘라스타제-1 프로모터, 엔도글린 프로모터, 피브로넥틴 프로모터, Flt-1 프로모터, GFAP 프로모터, ICAM-2 프로모터, INF-β 프로모터, Mb 프로모터, NphsI 프로모터, OG-2 프로모터, SP-B 프로모터, SYN1 프로모터, WASP 프로모터, SV40/bAlb 프로모터, SV40/hAlb 프로모터, SV40/CD43 프로모터, SV40/CD45 프로모터, 및 NSE/RU5' 프로모터를 포함하지만, 이에 제한되지 않는다.In some embodiments, the promoter is a tissue-specific promoter. Examples of tissues include, but are not limited to, muscle, epithelial, connective, and neural tissue. Examples of tissue-specific promoters are the B29 promoter, CD14 promoter, CD43 promoter, CD45 promoter, CD68 promoter, Desmin promoter, Elastase-1 promoter, Endoglin promoter, Fibronectin promoter, Flt-1 promoter, GFAP promoter, ICAM -2 promoter, INF-β promoter, Mb promoter, NphsI promoter, OG-2 promoter, SP-B promoter, SYN1 promoter, WASP promoter, SV40/bAlb promoter, SV40/hAlb promoter, SV40/CD43 promoter, SV40/CD45 promoter , and the NSE/RU5' promoter.

일부 구현예에서, 프로모터는 근육-특이적 프로모터이다. 일부 구현예에서, 근육-특이적 프로모터는 미오신 중쇄 복합체-E 박스 근육 크레아틴 키나제 융합 인핸서/프로모터이다.In some embodiments, the promoter is a muscle-specific promoter. In some embodiments, the muscle-specific promoter is a myosin heavy chain complex-E box muscle creatine kinase fusion enhancer/promoter.

일부 구현예에서, 프로모터는 재조합 프로모터이다. 일부 구현예에서, 재조합 프로모터는 재조합 근육-특이적 프로모터이다. 일부 구현예에서, 재조합 근육-특이적 프로모터는 재조합 미오신 중쇄-크레아틴 키나제 근육-특이적 프로모터이다. 또 다른 구현예에서, 근육-특이적 프로모터는 인간 골격 액틴 유전자 요소, 심장 액틴 유전자 요소, 데스민 프로모터, 골격 알파-액틴(ASKA) 프로모터, 트로포닌 I(TNNI2) 프로모터, 근세포-특이적 인핸서 결합 인자 mef 결합 요소, 근육 크레아틴 키나제(MCK) 프로모터, 절단된 MCK(tMCK) 프로모터, 미오신 중쇄(MHC) 프로모터, 혼성 a-미오신 중쇄 인핸서-/MCK 인핸서-프로모터(MHCK7) 프로모터, C5-12 프로모터, 뮤린 크레아틴 키나제 인핸서 요소, 골격 빠른-트위치 트로포닌 c 유전자 요소, 느린-트위치 심장 트로포닌 c 유전자 요소, 느린-트위치 트로포닌 i 유전자 요소, 저산소증-유도성 핵 인자를 포함한다.In some embodiments, the promoter is a recombinant promoter. In some embodiments, the recombinant promoter is a recombinant muscle-specific promoter. In some embodiments, the recombinant muscle-specific promoter is a recombinant myosin heavy chain-creatine kinase muscle-specific promoter. In another embodiment, the muscle-specific promoter is a human skeletal actin gene element, a cardiac actin gene element, a desmin promoter, a skeletal alpha-actin (ASKA) promoter, a troponin I (TNNI2) promoter, a myocyte-specific enhancer binding factor mef binding element, muscle creatine kinase (MCK) promoter, truncated MCK (tMCK) promoter, myosin heavy chain (MHC) promoter, hybrid a-myosin heavy chain enhancer-/MCK enhancer-promoter (MHCK7) promoter, C5-12 promoter, murine creatine kinase enhancer element, skeletal fast-twitch troponin c gene element, slow-twitch cardiac troponin c gene element, slow-twitch troponin i gene element, hypoxia-inducible nuclear factor.

일부 구현예에서, 프로모터는 서열번호 4의 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 프로모터는 서열번호 4의 전체 길이에 걸쳐 서열번호 4의 뉴클레오티드 서열과 적어도 80%, 82%, 85%, 87%, 90%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열을 포함한다.In some embodiments, the promoter comprises the nucleotide sequence of SEQ ID NO:4. In some embodiments, the promoter is at least 80%, 82%, 85%, 87%, 90%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical nucleotide sequences.

인트론intron

일부 구현예에서, 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 벡터 시스템, 바이러스 패키징 시스템, 세포, 및 조성물은 하나 이상의 인트론을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성되는 뉴클레오티드 서열을 더 포함한다. 일부 구현예에서, 인트론은 진핵 인트론(eukaryotic intron)이다. 일부 구현예에서, 인트론은 포유동물 인트론이다. 일부 구현예에서, 인트론은 합성 인트론이다. 일부 구현예에서, 인트론은 키메라 인트론이다. 일부 구현예에서, 인트론은 비-암호화 엑손으로부터 유래된다. 일부 구현예에서, 인트론은 5' hDYSF 폴리뉴클레오티드의 상류 또는 5'에 있다.In some embodiments, polynucleotides, plasmids, viral vectors, vector systems, viral packaging systems, cells, and compositions further comprise nucleotide sequences comprising, consisting of, or consisting essentially of one or more introns. . In some embodiments, an intron is a eukaryotic intron. In some embodiments, the intron is a mammalian intron. In some embodiments, an intron is a synthetic intron. In some embodiments, an intron is a chimeric intron. In some embodiments, introns are derived from non-coding exons. In some embodiments, the intron is upstream or 5' of the 5' hDYSF polynucleotide.

일부 구현예에서, 인트론은 5' 공여체 부위, 분지점, 또는 3' 스플라이스 부위 중의 적어도 하나를 포함한다. 일부 구현예에서, 인트론은 5' 공여체 부위, 분지점, 또는 3' 스플라이스 부위 중의 2개 이상을 포함한다. 일부 구현예에서, 인트론은 5' 공여체 부위, 분지점, 및 3' 스플라이스 부위를 포함한다.In some embodiments, an intron comprises at least one of a 5' donor site, branch point, or 3' splice site. In some embodiments, an intron comprises two or more of a 5' donor site, branch point, or 3' splice site. In some embodiments, an intron comprises a 5' donor site, a branch point, and a 3' splice site.

일부 구현예에서, 인트론은 인간 β-글로빈 유전자로부터의 5' 공여체 부위를 포함한다.In some embodiments, the intron comprises a 5' donor region from the human β-globin gene.

일부 구현예에서, 인트론은 면역글로불린 G(IgG) 중쇄로부터의 분지점을 포함한다.In some embodiments, an intron comprises a branch point from an immunoglobulin G (IgG) heavy chain.

일부 구현예에서, 인트론은 면역글로불린 G(IgG) 중쇄로부터의 3' 스플라이스 수용체 부위를 포함한다.In some embodiments, an intron comprises a 3' splice acceptor site from an immunoglobulin G (IgG) heavy chain.

일부 구현예에서, 인트론은 서열번호 5의 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 인트론은 서열번호 5의 전체 길이에 걸쳐 서열번호 5의 뉴클레오티드 서열과 적어도 80%, 82%, 85%, 87%, 90%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열을 포함한다.In some embodiments, the intron comprises the nucleotide sequence of SEQ ID NO:5. In some embodiments, the intron is at least 80%, 82%, 85%, 87%, 90%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical nucleotide sequences.

선택 select 마커marker

일부 구현예에서, 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 벡터 시스템, 바이러스 패키징 시스템, 세포, 및 조성물은 하나 이상의 선택 마커를 포함하거나, 이들로 구성되거나, 또는 본질적으로 구성되는 뉴클레오티드 서열을 더 포함한다. 일부 구현예에서, 선택 마커는 박테리아 선택 마커이다. 일부 구현예에서, 선택 마커는 항생제 내성 유전자이다. 항생제 내성 유전자의 예는 β-락타마제, 카나마이신 내성 유전자, Tn5로부터의 네오 유전자, 대장균 게놈으로부터의 돌연변이체 FabI 유전자, 및 URA3 (효모로부터의 오로티딘-5' 포스페이트 데카르복실라제)을 포함하나, 이에 제한되지는 않는다. 일부 구현예에서, 항생제 내성 유전자는 β-락타마제 유전자이다. 일부 구현예에서, 항생제 내성 유전자는 카나마이신 내성 유전자이다.In some embodiments, the polynucleotides, plasmids, viral vectors, vector systems, viral packaging systems, cells, and compositions further comprise a nucleotide sequence comprising, consisting of, or consisting essentially of one or more selectable markers. In some embodiments, the selectable marker is a bacterial selectable marker. In some embodiments, the selectable marker is an antibiotic resistance gene. Examples of antibiotic resistance genes include β-lactamase, kanamycin resistance gene, neo gene from Tn5, mutant FabI gene from E. coli genome, and URA3 (orotidine-5' phosphate decarboxylase from yeast) , but is not limited thereto. In some embodiments, the antibiotic resistance gene is a β-lactamase gene. In some embodiments, the antibiotic resistance gene is a kanamycin resistance gene.

폴리아데닐화polyadenylation 신호 signal

일부 구현예에서, 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 벡터 시스템, 바이러스 패키징 시스템, 세포, 및 조성물은 하나 이상의 폴리아데닐화(폴리A) 신호를 포함하거나, 이들로 구성되거나, 또는 본질적으로 구성되는 뉴클레오티드 서열을 더 포함한다. 일부 구현예에서, 폴리A 신호는 인공 폴리A 신호이다.In some embodiments, polynucleotides, plasmids, viral vectors, vector systems, viral packaging systems, cells, and compositions comprise, consist of, or consist essentially of one or more polyadenylation (polyA) signals. It further contains the sequence. In some embodiments, the polyA signal is an artificial polyA signal.

일부 구현예에서, 폴리A 신호는 서열번호 7의 뉴클레오티드 서열을 포함한다. 일부 구현예에서, 폴리A 신호는 서열번호 7의 전체 길이에 걸쳐 서열번호 7의 뉴클레오티드 서열과 적어도 80%, 82%, 85%, 87%, 90%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열을 포함한다.In some embodiments, the polyA signal comprises the nucleotide sequence of SEQ ID NO:7. In some embodiments, the polyA signal is at least 80%, 82%, 85%, 87%, 90%, 92%, 93%, 94%, 95% of the nucleotide sequence of SEQ ID NO: 7 over the entire length of SEQ ID NO: 7. %, 96%, 97%, 98%, 99%, or 100% identical nucleotide sequences.

발현 카세트 및 패키징 시스템Expression Cassettes and Packaging Systems

본원에서는 추가로, 아데노-연관 바이러스(AAV) 발현 카세트가 개시된다. 일부 구현예에서, AAV 발현 카세트는, (a) 제 1 역전된 말단 반복부(ITR), 여기서 상기 제 1 ITR은 본원에 개시된 임의의 ITR을 포함함; (b) 5' hDYSF 폴리뉴클레오티드 중의 임의의 것; 및 (c) 제 2 ITR, 여기서 상기 제 2 ITR는 본원에 개시된 임의의 ITR을 포함함;을 포함하되, 상기 (b)의 5' hDYSF 폴리뉴클레오티드는 상기 (a) 및 (c)의 제 1 및 제 2 ITR에 의해 플랭킹된다. 일부 구현예에서, 본원에 개시된 5' hDYSF 폴리뉴클레오티드 중의 임의의 것을 포함하는 AAV 발현 카세트는 5' hDYSF AAV 발현 카세트로 지칭된다.Further disclosed herein are adeno-associated virus (AAV) expression cassettes. In some embodiments, an AAV expression cassette comprises (a) a first inverted terminal repeat (ITR), wherein the first ITR comprises any ITR disclosed herein; (b) any of the 5' hDYSF polynucleotides; and (c) a second ITR, wherein the second ITR comprises any ITR disclosed herein; wherein the 5' hDYSF polynucleotide of (b) is the first ITR of (a) and (c) above and flanked by a second ITR. In some embodiments, an AAV expression cassette comprising any of the 5' hDYSF polynucleotides disclosed herein is referred to as a 5' hDYSF AAV expression cassette.

본원에서는 추가로, 아데노-연관 바이러스(AAV) 플라스미드가 개시된다. 일부 구현예에서, AAV 발현 카세트는, (a) 제 1 역전된 말단 반복부(ITR), 여기서 상기 제 1 ITR은 본원에 개시된 임의의 ITR을 포함함; (b) 본원에 개시된 임의의 3' hDYSF 폴리뉴클레오티드; 및 (c) 제 2 ITR, 여기서 상기 제 2 ITR은 본원에 개시된 임의의 ITR을 포함함;을 포함하되, 상기 (b)의 3' hDYSF 폴리뉴클레오티드는 상기 (a) 및 (c)의 제 1 및 제 2 ITR에 의해 플랭킹된다. 일부 구현예에서, 본원에 개시된 임의의 3' hDYSF 폴리뉴클레오티드를 포함하는 AAV 발현 카세트는 3' hDYSF AAV 발현 카세트로 지칭된다.Further disclosed herein are adeno-associated virus (AAV) plasmids. In some embodiments, an AAV expression cassette comprises (a) a first inverted terminal repeat (ITR), wherein the first ITR comprises any ITR disclosed herein; (b) any 3' hDYSF polynucleotide disclosed herein; and (c) a second ITR, wherein the second ITR comprises any ITR disclosed herein; wherein the 3' hDYSF polynucleotide of (b) is the first ITR of (a) and (c) above and flanked by a second ITR. In some embodiments, an AAV expression cassette comprising any 3' hDYSF polynucleotide disclosed herein is referred to as a 3' hDYSF AAV expression cassette.

일부 구현예에서, 아데노-연관 바이러스(AAV) 발현 카세트는, (a) 제 1 역전된 말단 반복부(ITR); (b) 인간 디스페린(hDYSF) 단백질의 단편을 암호화하는 폴리뉴클레오티드 서열, 상기 폴리뉴클레오티드 서열은: (i) 서열번호 1의 뉴클레오티드 서열; (ii) 서열번호 1의 전체 길이에 걸쳐 서열번호 1의 뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열; (iii) 서열번호 13의 뉴클레오티드 서열; (iv) 서열번호 13의 전체 길이에 걸쳐 서열번호 13의 뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질을 암호화하는 뉴클레오티드 서열, 여기서 상기 hDYSF 단백질은 서열번호 9의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열;로 구성됨; 및 (c) 제 2 ITR;을 포함하되, 상기 폴리뉴클레오티드 서열은 제 1 및 제 2 ITR에 의해 플랭킹된다. 일부 구현예에서, 본원에 개시된 임의의 AAV 발현 카세트는 프로모터, 인트론, 선택 마커, 또는 복제기점(ORI)을 포함하는 1종 이상의 추가의 폴리뉴클레오티드 서열을 더 포함한다. 일부 구현예에서, AAV 발현 카세트는 서열번호 6의 뉴클레오티드 서열을 포함한다. 일부 구현예에서, AAV 발현 카세트는 서열번호 6의 전체 길이에 걸쳐 서열번호 6의 뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, AAV 발현 카세트는 서열번호 15의 뉴클레오티드 서열을 포함한다. 일부 구현예에서, AAV 발현 카세트는 서열번호 15의 전체 길이에 걸쳐 서열번호 15의 뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, AAV 발현 카세트는 hDYSF 단백질의 제 2 단편을 암호화하는 제 2 폴리뉴클레오티드 서열을 더 포함하지 않는다. 일부 구현예에서, AAV 발현 카세트는 역전된 말단 반복부(ITR) 이외의 AAV 서열을 포함하지 않는다. 일부 구현예에서, AAV 발현 카세트는 역전된 말단 반복부(ITR) 이외의 바이러스 서열을 포함하지 않는다.In some embodiments, an adeno-associated virus (AAV) expression cassette comprises (a) a first inverted terminal repeat (ITR); (b) a polynucleotide sequence encoding a fragment of human dysferrin (hDYSF) protein, the polynucleotide sequence comprising: (i) the nucleotide sequence of SEQ ID NO: 1; (ii) at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90 of the nucleotide sequence of SEQ ID NO: 1 over the entire length of SEQ ID NO: 1; %, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 13; (iv) at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90 of the nucleotide sequence of SEQ ID NO: 13 over the entire length of SEQ ID NO: 13; %, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical nucleotide sequences; (v) a nucleotide sequence encoding a hDYSF protein, wherein the hDYSF protein consists of the amino acid sequence of SEQ ID NO: 9; or (vi) at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88% of the nucleotide sequence of (v) over the entire length of the nucleotide sequence of (v). %, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical nucleotide sequences; and (c) a second ITR; wherein the polynucleotide sequence is flanked by first and second ITRs. In some embodiments, any AAV expression cassette disclosed herein further comprises one or more additional polynucleotide sequences comprising a promoter, intron, selectable marker, or origin of replication (ORI). In some embodiments, the AAV expression cassette comprises the nucleotide sequence of SEQ ID NO:6. In some embodiments, the AAV expression cassette is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88% of the nucleotide sequence of SEQ ID NO:6 over the entire length of SEQ ID NO:6. %, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical nucleotide sequences. In some embodiments, the AAV expression cassette comprises the nucleotide sequence of SEQ ID NO: 15. In some embodiments, the AAV expression cassette is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88% of the nucleotide sequence of SEQ ID NO: 15 over the entire length of SEQ ID NO: 15 %, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical nucleotide sequences. In some embodiments, the AAV expression cassette does not further comprise a second polynucleotide sequence encoding a second fragment of hDYSF protein. In some embodiments, the AAV expression cassette contains no AAV sequences other than inverted terminal repeats (ITRs). In some embodiments, the AAV expression cassette contains no viral sequences other than inverted terminal repeats (ITRs).

일부 구현예에서, 아데노-연관 바이러스(AAV) 발현 카세트는, (a) 제 1 역전된 말단 반복부(ITR); (b) 인간 디스페린 단백질의 단편을 암호화하는 폴리뉴클레오티드 서열, 여기서 상기 폴리뉴클레오티드 서열은: (i) 서열번호 2의 뉴클레오티드 서열; (ii) 서열번호 2의 전체 길이에 걸쳐 서열번호 2의 뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열; (iii) 서열번호 14의 뉴클레오티드 서열; (ii) 서열번호 14의 전체 길이에 걸쳐 서열번호 14의 뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열; (v) hDYSF 단백질을 암호화하는 폴리뉴클레오티드 서열, 여기서 상기 hDYSF 단백질은 서열번호 10의 아미노산 서열로 구성됨; 또는 (vi) 상기 (v)의 뉴클레오티드 서열의 전체 길이에 걸쳐 상기 (v)의 폴리뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 폴리뉴클레오티드 서열;로 구성됨; 및 (c) 제 2 ITR;을 포함하되, 상기 폴리뉴클레오티드 서열은 제 1 및 제 2 ITR에 의해 플랭킹된다. 일부 구현예에서, AAV 발현 카세트는 서열번호 8의 뉴클레오티드 서열을 포함한다. 일부 구현예에서, AAV 발현 카세트는 서열번호 8의 전체 길이에 걸쳐 서열번호 8의 뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, AAV 발현 카세트는 서열번호 16의 뉴클레오티드 서열을 포함한다. 일부 구현예에서, AAV 발현 카세트는 서열번호 16의 전체 길이에 걸쳐 서열번호 16의 뉴클레오티드 서열과 적어도 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 동일한 뉴클레오티드 서열을 포함한다. 일부 구현예에서, AAV 발현 카세트는 선택 마커, 복제기점(ORI), 비번역 영역(UTR), 또는 폴리아데닐화(폴리A) 신호를 포함하는 하나 이상의 폴리뉴클레오티드 서열을 더 포함한다.In some embodiments, an adeno-associated virus (AAV) expression cassette comprises (a) a first inverted terminal repeat (ITR); (b) a polynucleotide sequence encoding a fragment of human dysferrin protein, wherein the polynucleotide sequence comprises: (i) the nucleotide sequence of SEQ ID NO: 2; (ii) at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90 of the nucleotide sequence of SEQ ID NO: 2 over the entire length of SEQ ID NO: 2; %, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical nucleotide sequences; (iii) the nucleotide sequence of SEQ ID NO: 14; (ii) at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90 of the nucleotide sequence of SEQ ID NO: 14 over the entire length of SEQ ID NO: 14; %, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical nucleotide sequences; (v) a polynucleotide sequence encoding a hDYSF protein, wherein the hDYSF protein consists of the amino acid sequence of SEQ ID NO: 10; or (vi) at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87% of the polynucleotide sequence of (v) over the entire length of the nucleotide sequence of (v); 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical polynucleotide sequences; and (c) a second ITR; wherein the polynucleotide sequence is flanked by first and second ITRs. In some embodiments, the AAV expression cassette comprises the nucleotide sequence of SEQ ID NO:8. In some embodiments, the AAV expression cassette is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88% of the nucleotide sequence of SEQ ID NO: 8 over the entire length of SEQ ID NO: 8. %, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical nucleotide sequences. In some embodiments, the AAV expression cassette comprises the nucleotide sequence of SEQ ID NO: 16. In some embodiments, the AAV expression cassette is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88% of the nucleotide sequence of SEQ ID NO: 16 over the entire length of SEQ ID NO: 16 %, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical nucleotide sequences. In some embodiments, the AAV expression cassette further comprises one or more polynucleotide sequences comprising a selectable marker, origin of replication (ORI), untranslated region (UTR), or polyadenylation (polyA) signal.

본원에서는 추가로, 아데노-연관 바이러스(AAV) 패키징 시스템이 개시된다. 일부 구현예에서, AAV 패키징 시스템은 (a) 본원에 개시된 5' hDYSF AAV 발현 카세트 중의 임의의 것; (b) 아데노바이러스 헬퍼 플라스미드; 및 (c) rep-cap 플라스미드;를 포함한다. 일부 구현예에서, 아데노바이러스 헬퍼 플라스미드는 아데노바이러스로부터의 하나 이상의 유전자를 포함한다. 일부 구현예에서, 아데노바이러스로부터의 하나 이상의 유전자는 AAV 복제를 매개한다. 일부 구현예에서, 아데노바이러스로부터의 하나 이상의 유전자는 E4, E2a, 및 VA로부터 선택된다. 일부 구현예에서, rep-cap 플라스미드는 아데노-연관 바이러스 rep 및 cap 유전자를 암호화하는 하나 이상의 폴리뉴클레오티드를 포함한다. 일부 구현예에서, rep 유전자는 Rep78, Rep68, Rep62, 및 Rep40으로부터 선택된 생활 주기 단백질 중 하나 이상을 암호화한다. 일부 구현예에서, cap 유전자는 VP1, VP2, 및 VP3으로부터 선택된 캡시드 단백질 중 하나 이상을 암호화한다. 일부 구현예에서, 5' hDYSF AAV 발현 카세트는 하나 이상의 ITR을 포함한다. 일부 구현예에서, ITR은 AAV ITR이다. 일부 구현예에서, AAV ITR의 혈청형은 AAV 캡시드 단백질의 혈청형과 동일하다. 일부 구현예에서, AAV ITR의 혈청형은 AAV 캡시드 단백질의 혈청형과 상이하다. 일부 구현예에서, AAV rep 유전자의 혈청형은 AAV 캡시드 단백질의 혈청형과 동일하다. 일부 구현예에서, AAV rep 유전자의 혈청형은 AAV 캡시드 단백질의 혈청형과 상이하다. 일부 구현예에서, 본원에 개시된 5' hDYSF AAV 발현 카세트 중의 임의의 것을 포함하는 AAV 패키징 시스템은 5' hDYSF AAV 패키징 시스템으로서 지칭된다.Further disclosed herein is an adeno-associated virus (AAV) packaging system. In some embodiments, the AAV packaging system comprises (a) any of the 5' hDYSF AAV expression cassettes disclosed herein; (b) an adenovirus helper plasmid; and (c) rep-cap plasmid; In some embodiments, an adenovirus helper plasmid comprises one or more genes from an adenovirus. In some embodiments, one or more genes from adenovirus mediate AAV replication. In some embodiments, the one or more genes from adenovirus are selected from E4, E2a, and VA. In some embodiments, a rep-cap plasmid comprises one or more polynucleotides encoding adeno-associated virus rep and cap genes. In some embodiments, the rep gene encodes one or more of the life cycle proteins selected from Rep78, Rep68, Rep62, and Rep40. In some embodiments, the cap gene encodes one or more of the capsid proteins selected from VP1, VP2, and VP3. In some embodiments, a 5' hDYSF AAV expression cassette comprises one or more ITRs. In some embodiments, the ITR is an AAV ITR. In some embodiments, the serotype of the AAV ITR is the same as the serotype of the AAV capsid protein. In some embodiments, the serotype of the AAV ITR is different from the serotype of the AAV capsid protein. In some embodiments, the serotype of the AAV rep gene is the same as the serotype of the AAV capsid protein. In some embodiments, the serotype of the AAV rep gene is different from the serotype of the AAV capsid protein. In some embodiments, an AAV packaging system comprising any of the 5' hDYSF AAV expression cassettes disclosed herein is referred to as a 5' hDYSF AAV packaging system.

일부 구현예에서, AAV 패키징 시스템은 (a) 본원에 개시된 3' hDYSF AAV 발현 카세트 중의 임의의 것; (b) 아데노바이러스 헬퍼 플라스미드; 및 (c) rep-cap 플라스미드;를 포함한다. 일부 구현예에서, 아데노바이러스 헬퍼 플라스미드는 아데노바이러스로부터의 하나 이상의 유전자를 포함한다. 일부 구현예에서, 아데노바이러스로부터의 하나 이상의 유전자는 AAV 복제를 매개한다. 일부 구현예에서, 아데노바이러스로부터의 하나 이상의 유전자는 E4, E2a, 및 VA로부터 선택된다. 일부 구현예에서, rep-cap 플라스미드는 아데노-연관 바이러스 rep 및 cap 유전자를 암호화하는 하나 이상의 폴리뉴클레오티드를 포함한다. 일부 구현예에서, rep 유전자는 Rep78, Rep68, Rep62 및 Rep40으로부터 선택된 하나 이상의 생활 주기 단백질을 암호화한다. 일부 구현예에서, 캡 유전자는 VP1, VP2, 및 VP3으로부터 선택된 캡시드 단백질 중 하나 이상을 암호화한다. 일부 구현예에서, 3' hDYSF AAV 발현 카세트는 하나 이상의 ITR을 포함한다. 일부 구현예에서, ITR은 AAV ITR이다. 일부 구현예에서, AAV ITR의 혈청형은 AAV 캡시드 단백질의 혈청형과 동일하다. 일부 구현예에서, AAV ITR의 혈청형은 AAV 캡시드 단백질의 혈청형과 상이하다. 일부 구현예에서, AAV rep 유전자의 혈청형은 AAV 캡시드 단백질의 혈청형과 동일하다. 일부 구현예에서, AAV rep 유전자의 혈청형은 AAV 캡시드 단백질의 혈청형과 상이하다. 일부 구현예에서, 본원에 개시된 3' hDYSF AAV 발현 카세트 중의 임의의 것을 포함하는 AAV 패키징 시스템은 3' hDYSF AAV 패키징 시스템으로 지칭된다.In some embodiments, the AAV packaging system comprises (a) any of the 3' hDYSF AAV expression cassettes disclosed herein; (b) an adenovirus helper plasmid; and (c) rep-cap plasmid; In some embodiments, an adenovirus helper plasmid comprises one or more genes from an adenovirus. In some embodiments, one or more genes from adenovirus mediate AAV replication. In some embodiments, the one or more genes from adenovirus are selected from E4, E2a, and VA. In some embodiments, a rep-cap plasmid comprises one or more polynucleotides encoding adeno-associated virus rep and cap genes. In some embodiments, the rep gene encodes one or more life cycle proteins selected from Rep78, Rep68, Rep62 and Rep40. In some embodiments, the cap gene encodes one or more of the capsid proteins selected from VP1, VP2, and VP3. In some embodiments, a 3' hDYSF AAV expression cassette comprises one or more ITRs. In some embodiments, the ITR is an AAV ITR. In some embodiments, the serotype of the AAV ITR is the same as the serotype of the AAV capsid protein. In some embodiments, the serotype of the AAV ITR is different from the serotype of the AAV capsid protein. In some embodiments, the serotype of the AAV rep gene is the same as the serotype of the AAV capsid protein. In some embodiments, the serotype of the AAV rep gene is different from the serotype of the AAV capsid protein. In some embodiments, an AAV packaging system comprising any of the 3' hDYSF AAV expression cassettes disclosed herein is referred to as a 3' hDYSF AAV packaging system.

일부 구현예에서, 아데노-연관 바이러스 패키징 시스템은 (a) 본원에 개시된 5' hDYSF AAV 발현 카세트 중의 임의의 것; 및 (b) 아데노바이러스 헬퍼 플라스미드;를 포함한다. 일부 구현예에서,m 아데노바이러스 헬퍼 플라스미드는 아데노바이러스로부터의 하나 이상의 유전자를 포함한다. 일부 구현예에서, 아데노바이러스로부터의 하나 이상의 유전자는 AAV 복제를 매개한다. 일부 구현예에서, 아데노바이러스로부터의 하나 이상의 유전자는 E4, E2a, 및 VA로부터 선택된다.In some embodiments, the adeno-associated virus packaging system comprises (a) any of the 5' hDYSF AAV expression cassettes disclosed herein; and (b) an adenovirus helper plasmid. In some embodiments, the m adenovirus helper plasmid comprises one or more genes from an adenovirus. In some embodiments, one or more genes from adenovirus mediate AAV replication. In some embodiments, the one or more genes from adenovirus are selected from E4, E2a, and VA.

일부 구현예에서, 아데노-연관 바이러스 패키징 시스템은 (a) 본원에 개시된 3' hDYSF AAV 발현 카세트 중의 임의의 것; 및 (b) 아데노바이러스 헬퍼 플라스미드;를 포함한다. 일부 구현예에서, 아데노바이러스 헬퍼 플라스미드는 아데노바이러스로부터의 하나 이상의 유전자를 포함한다. 일부 구현예에서, 아데노바이러스로부터의 하나 이상의 유전자는 AAV 복제를 매개한다. 일부 구현예에서, 아데노바이러스로부터의 하나 이상의 유전자는 E4, E2a, 및 VA로부터 선택된다.In some embodiments, the adeno-associated virus packaging system comprises (a) any of the 3' hDYSF AAV expression cassettes disclosed herein; and (b) an adenovirus helper plasmid. In some embodiments, an adenovirus helper plasmid comprises one or more genes from an adenovirus. In some embodiments, one or more genes from adenovirus mediate AAV replication. In some embodiments, the one or more genes from adenovirus are selected from E4, E2a, and VA.

바이러스 벡터virus vector

본원에서는 추가로, 아데노-연관 바이러스(AAV) 벡터(예를 들어 AAV 바이러스 또는 AAV 입자)가 개시된다. 일부 구현예에서, AAV 벡터는 본원에 개시된 임의의 5' hDYSF 폴리뉴클레오티드를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 본원에 개시된 5' hDYSF 폴리뉴클레오티드 중의 임의의 것을 포함하는 AAV 벡터는 5' hDYSF AAV 벡터로서 지칭된다.Further disclosed herein are adeno-associated virus (AAV) vectors (eg, AAV viruses or AAV particles). In some embodiments, an AAV vector comprises, consists of, or consists essentially of any 5' hDYSF polynucleotide disclosed herein. In some embodiments, an AAV vector comprising any of the 5' hDYSF polynucleotides disclosed herein is referred to as a 5' hDYSF AAV vector.

일부 구현예에서, 5' hDYSF AAV 벡터는 본원에 개시된 5' hDYSF AAV 발현 카세트 중의 임의의 것을 포함한다.In some embodiments, a 5' hDYSF AAV vector comprises any of the 5' hDYSF AAV expression cassettes disclosed herein.

일부 구현예에서, AAV 벡터는 본원에 개시된 임의의 3' hDYSF 폴리뉴클레오티드를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. 일부 구현예에서, 본원에 개시된 임의의 3' hDYSF 폴리뉴클레오티드를 포함하는 AAV 벡터는 3' hDYSF AAV 벡터로서 지칭된다.In some embodiments, an AAV vector comprises, consists of, or consists essentially of any 3' hDYSF polynucleotide disclosed herein. In some embodiments, an AAV vector comprising any 3' hDYSF polynucleotide disclosed herein is referred to as a 3' hDYSF AAV vector.

일부 구현예에서, 3' hDYSF AAV 벡터는 본원에 개시된 3' hDYSF AAV 발현 카세트 중의 임의의 것을 포함한다.In some embodiments, a 3' hDYSF AAV vector comprises any of the 3' hDYSF AAV expression cassettes disclosed herein.

일부 구현예에서, AAV 벡터는 혈청형 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, rh10, rh20, 또는 rh74의 AAV이다. 일부 구현예에서, AAV 벡터는 혈청형 rh74의 AAV이다. 일부 구현예에서, AAV 벡터는 혈청형 5의 AAV가 아니다.In some embodiments, the AAV vector is an AAV of serotype 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, rh10, rh20, or rh74. In some embodiments, the AAV vector is AAV of serotype rh74. In some embodiments, the AAV vector is not serotype 5 AAV.

본원에서는 추가로, 본원에 개시된 2개 이상의 AAV 벡터를 포함하는 이중 아데노-연관 바이러스(AAV) 벡터 시스템이 개시된다. 일부 구현예에서, 이중 AAV 벡터 시스템은 (a) 제 1 AAV 벡터, 여기서 상기 제 1 AAV 벡터는 본원에 개시된 5' hDYSF 폴리뉴클레오티드 중의 임의의 것을 포함함; 및 (b) 제 2 AAV 벡터, 여기서 상기 제 2 AAV 벡터는 본원에 개시된 3' hDYSF 폴리뉴클레오티드 중의 임의의 것을 포함함;을 포함한다.Further disclosed herein is a dual adeno-associated virus (AAV) vector system comprising two or more AAV vectors disclosed herein. In some embodiments, the dual AAV vector system comprises (a) a first AAV vector, wherein the first AAV vector comprises any of the 5' hDYSF polynucleotides disclosed herein; and (b) a second AAV vector, wherein the second AAV vector comprises any of the 3' hDYSF polynucleotides disclosed herein.

일부 구현예에서, 이중 AAV 벡터 시스템은 (a) 제 1 AAV 벡터, 여기서 상기 제 1 AAV 벡터는 본원에 개시된 5' hDYSF AAV 벡터 중의 임의의 것을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성됨; 및 (b) 제 2 AAV 벡터, 여기서 상기 제 2 AAV 벡터는 본원에 개시된 3' hDYSF AAV 벡터 중의 임의의 것을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성됨;을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성된다. In some embodiments, the dual AAV vector system comprises (a) a first AAV vector, wherein the first AAV vector comprises, consists of, or consists essentially of any of the 5' hDYSF AAV vectors disclosed herein. configured; and (b) a second AAV vector, wherein the second AAV vector comprises, consists of, or consists essentially of any of the 3' hDYSF AAV vectors disclosed herein; be, or consist essentially of.

조성물composition

본원에서는 추가로, 본원에 개시된 5' hDYSF 폴리뉴클레오티드 중의 임의의 것을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되는 조성물이 개시된다. 본원에서는 추가로, 본원에 개시된 임의의 3' hDYSF 폴리뉴클레오티드를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되는 조성물이 개시된다. 본원에서는 추가로, 본원에 개시된 5' hDYSF 플라스미드 중의 임의의 것을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되는 조성물이 개시된다. 본원에서는 추가로, 본원에 개시된 임의의 3' hDYSF 플라스미드를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되는 조성물이 개시된다. 본원에서는 추가로, 본원에 개시된 임의의 이중 AAV 벡터 시스템을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되는 조성물이 개시된다. 본원에서는 추가로, 본원에 개시된 임의의 AAV 벡터를 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성되는 조성물이 개시된다.Further disclosed herein are compositions comprising, consisting of, or consisting essentially of any of the 5' hDYSF polynucleotides disclosed herein. Further disclosed herein are compositions comprising, consisting of, or consisting essentially of any of the 3' hDYSF polynucleotides disclosed herein. Further disclosed herein are compositions comprising, consisting of, or consisting essentially of any of the 5' hDYSF plasmids disclosed herein. Further disclosed herein are compositions comprising, consisting of, or consisting essentially of any of the 3' hDYSF plasmids disclosed herein. Further disclosed herein are compositions comprising, consisting of, or consisting essentially of any of the dual AAV vector systems disclosed herein. Further disclosed herein are compositions comprising, consisting of, or consisting essentially of any of the AAV vectors disclosed herein.

본원에서는 추가로, (a) 재조합 아데노-연관 바이러스(rAAV) 벡터, 여기서 상기 rAAV는 본원에 개시된 5' hDYSF 폴리뉴클레오티드 중의 임의의 것을 포함하거나, 이들로 구성되거나, 또는 이로 본질적으로 구성됨; 및 (b) 약학적으로 허용가능한 담체, 희석제, 부형제 또는 아주반트;를 포함하거나, 이들로 구성되거나, 또는 이로 본질적으로 구성되는 조성물이 개시된다.Further provided herein are (a) a recombinant adeno-associated virus (rAAV) vector, wherein the rAAV comprises, consists of, or consists essentially of any of the 5' hDYSF polynucleotides disclosed herein; and (b) a pharmaceutically acceptable carrier, diluent, excipient or adjuvant;

본원에서는 추가로, (a) 재조합 아데노-연관 바이러스(rAAV) 벡터, 여기서 상기 rAAV는 본원에 개시된 3' hDYSF 폴리뉴클레오티드 중의 임의의 것을 포함하거나, 이들로 구성되거나, 또는 본질적으로 이들로 구성됨; 및 (b) 약학적으로 허용가능한 담체, 희석제, 부형제 또는 아주반트;를 포함하는 조성물이 개시된다.Further provided herein are (a) a recombinant adeno-associated virus (rAAV) vector, wherein the rAAV comprises, consists of, or consists essentially of any of the 3' hDYSF polynucleotides disclosed herein; and (b) a pharmaceutically acceptable carrier, diluent, excipient or adjuvant.

본원에서는 추가로, (a) 제 1 재조합 아데노-연관 바이러스(rAAV), 여기서 상기 제 1 rAAV는 본원에 개시된 5' hDYSF 폴리뉴클레오티드 중의 임의의 것을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성됨; 및 (b) 제 2 재조합 아데노-연관 바이러스(rAAV), 여기서 상기 제 2 rAAV는 본원에 개시된 3' hDYSF 폴리뉴클레오티드 중의 임의의 것을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성됨;을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성되는 조성물이 개시된다.Further disclosed herein is (a) a first recombinant adeno-associated virus (rAAV), wherein the first rAAV comprises, consists of, or consists essentially of any of the 5' hDYSF polynucleotides disclosed herein. configured; and (b) a second recombinant adeno-associated virus (rAAV), wherein the second rAAV comprises, consists of, or consists essentially of any of the 3' hDYSF polynucleotides disclosed herein; A composition comprising, consisting of, or consisting essentially of is disclosed.

본원에서는 추가로, (a) 제 1 아데노-연관 바이러스(AAV) 입자, 여기서 상기 제 1 AAV 입자는 본원에 개시된 5' hDYSF AAV 벡터 중의 임의의 것을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성됨; 및 (b) 제 2 아데노-연관 바이러스(AAV) 입자, 여기서 상기 제 2 AAV 입자는 본원에 개시된 3' hDYSF AAV 벡터 중의 임의의 것을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성됨;을 포함하거나, 이들로 구성되거나, 또는 이들로 본질적으로 구성되는 조성물이 개시된다.Further provided herein are (a) a first adeno-associated virus (AAV) particle, wherein the first AAV particle comprises, consists of, or is essentially of any of the 5' hDYSF AAV vectors disclosed herein. made up of; and (b) a second adeno-associated virus (AAV) particle, wherein the second AAV particle comprises, consists of, or consists essentially of any of the 3' hDYSF AAV vectors disclosed herein; Compositions comprising, consisting of, or consisting essentially of are disclosed.

일부 구현예에서, 본원에 개시된 임의의 조성물 중의 임의의 것은 약학적으로 허용가능한 담체, 희석제, 부형제 또는 아주반트 중의 적어도 1종을 더 포함한다. 허용가능한 담체, 희석제 및 아주반트는 수용체에 대해 무독성이고, 바람직하게는 사용된 투여량 및 농도에서 불활성이고, 완충제 및 계면활성제, 예를 들어 플루로닉(pluronics)을 포함한다. 허용가능한 담체의 예는 포스페이트 완충 염수, 보존제 등을 포함하나, 이에 제한되지는 않는다.In some embodiments, any of any of the compositions disclosed herein further comprises at least one of a pharmaceutically acceptable carrier, diluent, excipient or adjuvant. Acceptable carriers, diluents and adjuvants are nontoxic to recipients and preferably inert at the dosages and concentrations employed, and include buffers and surfactants such as pluronics. Examples of acceptable carriers include, but are not limited to, phosphate buffered saline, preservatives, and the like.

약학적으로 허용가능한 담체, 희석제 또는 부형제는 주사가능한 용도에 적합할 수 있다. 주사가능한 용도에 적합한 약학적으로 허용가능한 담체, 희석제 또는 부형제의 예는 멸균 주사가능한 용액 또는 분산액의 즉석 제조를 위한 멸균 수용액 또는 분산액 및 멸균 분말을 포함한다. 모든 경우에, 그 형태는 멸균이어야 하고, 용이한 시링가능성(syringability)이 존재하는 정도로 유체여야 한다. 이는 제조 및 저장 조건 하에 안정해야 하고, 미생물, 예를 들어 박테리아 및 진균의 오염 작용에 대해 보존되어야 한다. 담체는, 예를 들어 물, 에탄올, 폴리올(예를 들어 글리세롤, 프로필렌 글리콜, 액체 폴리에틸렌 글리콜 등), 그의 적합한 혼합물 및 식물성 오일을 함유하는 용매 또는 분산 매질일 수 있다. 적절한 유동성은, 예를 들어 코팅, 예를 들어 레시틴의 사용에 의해, 분산액의 경우에 요구되는 입자 크기의 유지에 의해, 및 계면활성제의 사용에 의해 유지될 수 있다. 미생물의 작용의 방지는 다양한 항박테리아제 및 항진균제, 예를 들어 파라벤, 클로로부탄올, 페놀, 소르브산, 티메로살 등에 의해 유발될 수 있다. 많은 경우에, 등장화제, 예를 들어 당 또는 염화나트륨을 포함하는 것이 바람직할 것이다. 주사가능한 조성물의 연장된 흡수는 흡수를 지연시키는 작용제, 예를 들어 알루미늄 모노스테아레이트 및 젤라틴의 사용에 의해 야기될 수 있다.A pharmaceutically acceptable carrier, diluent or excipient may be suitable for injectable use. Examples of pharmaceutically acceptable carriers, diluents or excipients suitable for injectable use include sterile aqueous solutions or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersion. In all cases, the form must be sterile and must be fluid to the extent that easy syringability exists. It must be stable under the conditions of manufacture and storage and must be preserved against the contaminating action of microorganisms, such as bacteria and fungi. The carrier may be a solvent or dispersion medium containing, for example, water, ethanol, polyols (eg glycerol, propylene glycol, liquid polyethylene glycols, etc.), suitable mixtures thereof and vegetable oils. Proper fluidity can be maintained, for example, by the use of coatings, for example lecithin, by maintenance of the required particle size in the case of dispersions, and by the use of surfactants. Prevention of the action of microorganisms can be caused by various antibacterial and antifungal agents, such as parabens, chlorobutanol, phenol, sorbic acid, thimerosal and the like. In many cases it will be desirable to include a tonicity agent such as sugar or sodium chloride. Prolonged absorption of the injectable compositions can be brought about by the use of agents delaying absorption, for example, aluminum monostearate and gelatin.

멸균 주사가능한 용액은 본원에 개시된 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 또는 이중 벡터 시스템을 상기 열거된 바와 같은 다양한 다른 성분과 함께 적절한 용매 중에서 필요한 양으로 혼입시킨 후, 필터 멸균시킴으로써 제조된다. 일반적으로, 분산액은 멸균된 활성 성분을 염기성 분산 매질 및 상기 열거된 것으로부터의 필요한 다른 성분을 함유하는 멸균 비히클 내로 혼입시킴으로써 제조된다. 멸균 주사가능한 용액의 제조를 위한 멸균 분말의 경우에, 바람직한 제조 방법은 활성 성분 및 그의 이전에 멸균-여과된 용액으로부터의 임의의 추가의 원하는 성분의 분말을 생성하는 진공 건조 및 동결 건조 기술이다.Sterile injectable solutions are prepared by incorporating the polynucleotide, plasmid, viral vector, or dual vector system disclosed herein in the required amount in an appropriate solvent with various other ingredients, as enumerated above, followed by filter sterilization. Generally, dispersions are prepared by incorporating the sterilized active ingredient into a sterile vehicle that contains a basic dispersion medium and the required other ingredients from those enumerated above. In the case of sterile powders for the preparation of sterile injectable solutions, preferred methods of preparation are vacuum drying and freeze drying techniques which result in a powder of the active ingredient and any additional desired ingredients from a previously sterile-filtered solution thereof.

AAVAAV 벡터를 제조하는 방법 How to make a vector

아데노-연관 바이러스(AAV) 벡터(예를 들어 바이러스 또는 바이러스 입자)를 생산하는 방법이 본원에 개시된다. AAV 벡터를 생산하는 방법은 관련 기술분야에 공지되어 있다. 예를 들어 상기 방법은 예를 들어 국제공개공보 제WO 01/83692호에 개시되어 있으며, 이는 그 전문이 본원에 참조로 포함된다. AAV 생산의 일반적 원리는 예를 들어 문헌 「Carter, Current Opinions in Biotechnology 1533-1539, 1992」; 및 「Muzyczka, Curr. Topics in Microbial and Immunol 158:97-129, 1992」에 검토되어 있으며, 이들 각각은 그 전문이 참조로 포함된다. AAV를 생산하기 위한 다양한 접근법이 문헌 「Ratschin 등, Mol. Cell. Biol. 4:2072, 1984」; 「Hermonat 등, Proc. Natl. Acad. Sci. USA, 81:6466, 1984」; 「Tratschin 등, Mol. Cell. Biol. 5:3251, 1985」; 「McLaughlin 등, J. Virol., 62:1963, 1988」; 및 「Lebkowski 등, Mol. Cell. Biol., 7:349, 1988」; 「Samulski 등, J. Virol., 63:3822-3828, 1989」; 미국특허 제5,173,414호; 국제공개공보 제WO 95/13365호; 및 대응하는 미국특허 제5,658,776호; 국제공개공보 제WO 95/13392호; 제WO 96/17947호; 국제출원 제PCT/US98/18600호; 국제공개공보 제WO 97/09441호(국제출원 제PCT/US96/14423호); 제WO 97/21825호(제PCT/US96/20777호); 제WO 97/06243호 (제PCT/FR96/01064호); 제WO 99/11764호; 문헌 「Perrin 등, Vaccine 13:1244-1250, 1995」; 「Paul 등, Human Gene Therapy 4:609-615, 1993」; Clark 등, 「Gene Therapy 3:1124-1132, 1996」; 미국특허 제5,786,211호; 미국특허 제5,871,982호; 및 미국특허 제6,258,595호에 개시되어 있으며. 이들 각각은 그 전문이 참조로 포함된다.Methods of producing adeno-associated virus (AAV) vectors (eg, viruses or viral particles) are disclosed herein. Methods for producing AAV vectors are known in the art. Such methods are disclosed, for example, in WO 01/83692, which is incorporated herein by reference in its entirety. General principles of AAV production are described, for example, in Carter, Current Opinions in Biotechnology 1533-1539, 1992; and "Muzyczka, Curr. Topics in Microbial and Immunol 158:97-129, 1992”, each of which is incorporated by reference in its entirety. Various approaches for producing AAV are described in Ratschin et al., Mol. Cell. Biol. 4:2072, 1984; "Hermonat et al., Proc. Natl. Acad. Sci. USA, 81:6466, 1984; “Tratschin et al., Mol. Cell. Biol. 5:3251, 1985”; McLaughlin et al., J. Virol., 62:1963, 1988; and Lebkowski et al., Mol. Cell. Biol., 7:349, 1988; Samulski et al., J. Virol., 63:3822-3828, 1989; U.S. Patent No. 5,173,414; International Publication No. WO 95/13365; and corresponding U.S. Patent Nos. 5,658,776; International Publication No. WO 95/13392; WO 96/17947; International Application No. PCT/US98/18600; International Publication No. WO 97/09441 (International Application No. PCT/US96/14423); WO 97/21825 (PCT/US96/20777); WO 97/06243 (PCT/FR96/01064); WO 99/11764; See Perrin et al., Vaccine 13:1244-1250, 1995; "Paul et al., Human Gene Therapy 4:609-615, 1993"; Clark et al., Gene Therapy 3:1124-1132, 1996; U.S. Patent No. 5,786,211; U.S. Patent No. 5,871,982; and U.S. Patent No. 6,258,595. Each of these is incorporated by reference in its entirety.

일부 구현예에서, 아데노-연관 바이러스(AAV) 벡터를 생산하는 방법은 본원에 개시된 임의의 AAV 패키징 시스템으로 세포를 형질도입하는 것을 포함한다. 일부 구현예에서, 세포는 진핵 세포이다. 일부 구현예에서, 세포는 포유동물 세포이다. 일부 구현예에서, 세포는 아데노-연관 바이러스 rep 및 cap 유전자를 안정적으로 발현하는 재조합 세포이다. 일부 구현예에서, 상기 방법은 상기 세포를 배양하여 형질도입된 세포의 집단을 생산하는 단계를 추가로 포함한다. 일부 구현예에서, 상기 방법은 형질도입된 세포의 집단으로부터 상청액을 수집하는 단계를 추가로 포함한다. 일부 구현예에서, 상기 방법은 상청액을 하나 이상의 정제 단계에 적용하여 정제된 AAV 벡터 샘플을 생성하는 단계를 추가로 포함하되, 상기 AAV 벡터 샘플에는 세포 파편 및 단백질이 실질적으로 존재하지 않는다. 대안적으로, 또는 추가적으로, 상기 방법은 형질도입된 세포의 집단을 용해시켜 세포 용해물을 생산하는 단계를 추가로 포함한다. 일부 구현예에서, 상기 방법은 세포 용해물을 하나 이상의 정제 단계에 적용하여 정제된 AAV 벡터 샘플을 생성하는 것을 추가로 포함하되, 상기 AAV 벡터 샘플은 세포 파편 및 단백질이 실질적으로 없다. 일부 구현예에서, 정제된 AAV 벡터 샘플의 순도는 적어도 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% 또는 100% 순수하다.In some embodiments, a method of producing an adeno-associated virus (AAV) vector comprises transducing a cell with any of the AAV packaging systems disclosed herein. In some embodiments, the cell is a eukaryotic cell. In some embodiments, the cell is a mammalian cell. In some embodiments, the cell is a recombinant cell that stably expresses adeno-associated virus rep and cap genes. In some embodiments, the method further comprises culturing the cells to produce a population of transduced cells. In some embodiments, the method further comprises collecting supernatant from the population of transduced cells. In some embodiments, the method further comprises subjecting the supernatant to one or more purification steps to produce a purified AAV vector sample, wherein the AAV vector sample is substantially free of cellular debris and proteins. Alternatively, or additionally, the method further comprises lysing the population of transduced cells to produce a cell lysate. In some embodiments, the method further comprises subjecting the cell lysate to one or more purification steps to produce a purified AAV vector sample, wherein the AAV vector sample is substantially free of cellular debris and protein. In some embodiments, the purity of the purified AAV vector sample is at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% pure.

일부 구현예에서, 아데노-연관 바이러스(AAV) 벡터를 생산하는 방법은 본원에 개시된 5' hDYSF AAV 패키징 시스템 중의 임의의 시스템으로 세포를 형질도입하는 것을 포함한다. 일부 구현예에서, 세포는 진핵 세포이다. 일부 구현예에서, 세포는 포유동물 세포이다. 일부 구현예에서, 세포는 아데노-연관 바이러스 rep 및 cap 유전자를 안정적으로 발현하는 재조합 세포이다. 일부 구현예에서, 상기 방법은 세포를 배양하여 형질도입된 세포의 집단을 생산하는 것을 더 포함한다. 일부 구현예에서, 방법은 형질도입된 세포의 집단으로부터 상청액을 수집하는 것을 더 포함한다. 일부 구현예에서, 상기 방법은 상청액을 하나 이상의 정제 단계에 적용하여 정제된 AAV 벡터 샘플을 생산하는 것을 추가로 포함하되, 상기 AAV 벡터 샘플은 세포 파편 및 단백질이 실질적으로 없다. 대안적으로, 또는 추가적으로, 상기 방법은 형질도입된 세포의 집단을 용해시켜 세포 용해물을 생산하는 것을 더 포함한다. 일부 구현예에서, 상기 방법은 세포 용해물을 하나 이상의 정제 단계에 적용하여 정제된 AAV 벡터 샘플을 생산하는 것을 추가로 포함하되, 상기 AAV 벡터 샘플은 세포 파편 및 단백질이 실질적으로 없다. 일부 구현예에서, 정제된 AAV 벡터 샘플의 순도는 적어도 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% 또는 100% 순수하다.In some embodiments, a method of producing an adeno-associated virus (AAV) vector comprises transducing a cell with any of the 5' hDYSF AAV packaging systems disclosed herein. In some embodiments, the cell is a eukaryotic cell. In some embodiments, the cell is a mammalian cell. In some embodiments, the cell is a recombinant cell that stably expresses adeno-associated virus rep and cap genes. In some embodiments, the method further comprises culturing the cells to produce a population of transduced cells. In some embodiments, the method further comprises collecting supernatant from the population of transduced cells. In some embodiments, the method further comprises subjecting the supernatant to one or more purification steps to produce a purified AAV vector sample, wherein the AAV vector sample is substantially free of cellular debris and protein. Alternatively, or additionally, the method further comprises lysing the population of transduced cells to produce a cell lysate. In some embodiments, the method further comprises subjecting the cell lysate to one or more purification steps to produce a purified AAV vector sample, wherein the AAV vector sample is substantially free of cell debris and protein. In some embodiments, the purity of the purified AAV vector sample is at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% pure.

일부 구현예에서, 아데노-연관 바이러스(AAV) 벡터를 생산하는 방법은 본원에 개시된 3' hDYSF AAV 패키징 시스템 중의 임의의 시스템으로 세포를 형질도입하는 것을 포함한다. 일부 구현예에서, 세포는 진핵 세포이다. 일부 구현예에서, 세포는 포유동물 세포이다. 일부 구현예에서, 세포는 아데노-연관 바이러스 rep 및 cap 유전자를 안정적으로 발현하는 재조합 세포이다. 일부 구현예에서, 상기 방법은 세포를 배양하여 형질도입된 세포의 집단을 생산하는 것을 더 포함한다. 일부 구현예에서, 상기 방법은 형질도입된 세포의 집단으로부터 상청액을 수집하는 것을 더 포함한다. 일부 구현예에서, 상기 방법은 상청액을 하나 이상의 정제 단계에 적용하여 정제된 AAV 벡터 샘플을 생산하는 것을 추가로 포함하되, 상기 AAV 벡터 샘플은 세포 파편 및 단백질이 실질적으로 없다. 대안적으로, 또는 추가적으로, 상기 방법은 형질도입된 세포의 집단을 용해시켜 세포 용해물을 생산하는 것을 더 포함한다. 일부 구현예에서, 상기 방법은 세포 용해물을 하나 이상의 정제 단계에 적용하여 정제된 AAV 벡터 샘플을 생산하는 것을 추가로 포함하되, 상기 AAV 벡터 샘플은 세포 파편 및 단백질이 실질적으로 없다. 일부 구현예에서, 정제된 AAV 벡터 샘플의 순도는 적어도 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% 또는 100% 순수하다.In some embodiments, a method of producing an adeno-associated virus (AAV) vector comprises transducing a cell with any of the 3' hDYSF AAV packaging systems disclosed herein. In some embodiments, the cell is a eukaryotic cell. In some embodiments, the cell is a mammalian cell. In some embodiments, the cell is a recombinant cell that stably expresses adeno-associated virus rep and cap genes. In some embodiments, the method further comprises culturing the cells to produce a population of transduced cells. In some embodiments, the method further comprises collecting supernatant from the population of transduced cells. In some embodiments, the method further comprises subjecting the supernatant to one or more purification steps to produce a purified AAV vector sample, wherein the AAV vector sample is substantially free of cellular debris and protein. Alternatively, or additionally, the method further comprises lysing the population of transduced cells to produce a cell lysate. In some embodiments, the method further comprises subjecting the cell lysate to one or more purification steps to produce a purified AAV vector sample, wherein the AAV vector sample is substantially free of cell debris and protein. In some embodiments, the purity of the purified AAV vector sample is at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% pure.

세포cell

본원에서는 추가로, 본원에 개시된 5' hDYSF 폴리뉴클레오티드 중의 임의의 것을 포함하는 세포가 개시된다. 세포는 원핵 또는 진핵 세포일 수 있다. 진핵 세포의 비제한적 예는 포유동물, 예를 들어 햄스터, 뮤린, 래트, 개, 양 또는 인간 세포를 포함한다. 일부 구현예에서, 세포는 본원에 개시된 5' hDYSF 폴리뉴클레오티드 중의 임의의 것을 포함하는 플라스미드로 형질주입된다. 일부 구현예에서, 세포는 본원에 개시된 임의의 5' hDYSF AAV 발현 카세트로 형질도입된다. 일부 구현예에서, 세포는 본원에 개시된 5' hDYSF AAV 벡터 중의 임의의 벡터로 감염된다.Further disclosed herein are cells comprising any of the 5' hDYSF polynucleotides disclosed herein. Cells may be prokaryotic or eukaryotic. Non-limiting examples of eukaryotic cells include mammalian, eg hamster, murine, rat, dog, sheep or human cells. In some embodiments, cells are transfected with a plasmid comprising any of the 5' hDYSF polynucleotides disclosed herein. In some embodiments, cells are transduced with any of the 5' hDYSF AAV expression cassettes disclosed herein. In some embodiments, cells are infected with any of the 5' hDYSF AAV vectors disclosed herein.

본원에서는 추가로, 본원에 개시된 임의의 3' hDYSF 폴리뉴클레오티드를 포함하는 세포가 개시된다. 일부 구현예에서, 세포는 본원에 개시된 임의의 3' hDYSF 폴리뉴클레오티드를 포함하는 플라스미드로 형질주입된다. 일부 구현예에서, 세포는 본원에 개시된 임의의 3' hDYSF AAV 발현 카세트로 형질도입된다. 일부 구현예에서, 세포는 본원에 개시된 임의의 3' hDYSF AAV 벡터로 감염된다.Further disclosed herein are cells comprising any of the 3' hDYSF polynucleotides disclosed herein. In some embodiments, cells are transfected with a plasmid comprising any of the 3' hDYSF polynucleotides disclosed herein. In some embodiments, cells are transduced with any of the 3' hDYSF AAV expression cassettes disclosed herein. In some embodiments, the cells are infected with any of the 3' hDYSF AAV vectors disclosed herein.

본원에 개시된 임의의 세포는 감염성 rAAV를 생산하는 패키징 세포일 수 있다. 일부 구현예에서, 패키징 세포는 안정적으로 형질전환된 암 세포, 예를 들어 HeLa 세포, 293 세포 및 PerC6 세포(동족 293 라인)이다. 또 다른 구현예에서, 패키징 세포는 저 계대(low passage) 293 세포(아데노바이러스의 E1으로 형질전환된 인간 태아 신장 세포), MRC-5 세포(인간 태아 섬유모세포), WI-38 세포(인간 태아 섬유모세포), 베로 세포(Vero cells)(원숭이 신장 세포) 및 FRhL-2 세포(레서스 태아 폐 세포)와 같은, 형질전환된 암 세포가 아닌 세포이다. 원핵 세포의 비제한적인 예는 박테리아 세포(예를 들어 에스케리키아 콜라이(Escherichia coli)) 및 고세균 세포를 포함한다. 본 개시의 세포는 세포 은행, 예를 들어 비-GMP 목적을 위한 수탁 세포 은행(ACB) 또는 GMP 마스터 세포 은행(MCB)을 생산하는 데 사용될 수 있다. 세포의 분취량은, 일 구현예에서, 생산을 위한 생물반응기에서의 배양 전에 원래의 접종물로부터 더 큰 부피로 확장된다.Any of the cells disclosed herein may be packaging cells that produce infectious rAAV. In some embodiments, the packaging cells are stably transformed cancer cells, such as HeLa cells, 293 cells and PerC6 cells (cognate 293 line). In another embodiment, the packaging cells are low passage 293 cells (human fetal kidney cells transformed with E1 of adenovirus), MRC-5 cells (human fetal fibroblasts), WI-38 cells (human fetal cells) fibroblasts), Vero cells (monkey kidney cells) and FRhL-2 cells (rhesus fetal lung cells). Non-limiting examples of prokaryotic cells include bacterial cells (eg Escherichia coli) and archaeal cells. Cells of the present disclosure can be used to produce a cell bank, such as an depository cell bank (ACB) or a GMP master cell bank (MCB) for non-GMP purposes. An aliquot of cells, in one embodiment, is expanded to a larger volume from the original inoculum prior to culture in a bioreactor for production.

치료 방법treatment method

본원에서는 추가로, 디스페리노병증을 치료하는 방법이 개시된다. 일부 구현예에서, 디스페리노병증을 치료하는 방법은 이를 필요로 하는 대상체에게, (a) 유효량의 제 1 폴리뉴클레오티드, 여기서 상기 제 1 폴리뉴클레오티드는 본원에 개시된 5' hDYSF 폴리뉴클레오티드 중의 임의임; 및 (b) 유효량의 제 2 폴리뉴클레오티드, 여기서 상기 제 2 폴리뉴클레오티드는 본원에 개시된 3' hDYSF 폴리뉴클레오티드 중의 임의의 것임;을 투여하는 것을 포함하거나, 이로 구성되거나, 또는 본질적으로 이로 구성된다. 일부 구현예에서, 제 1 폴리뉴클레오티드는 경구로, 비경구로(예를 들어 근육내, 복강내, 정맥내, ICV, 흉골내 주사 또는 주입, 피하 주사, 또는 임플란트), 흡입 스프레이 비강, 질, 직장, 설하, 요도(예를 들어 요도 좌제) 또는 국소 투여 경로(예를 들어 겔, 연고, 크림, 에어로졸 등)에 의해 투여된다. 일부 구현예에서, 제 1 폴리뉴클레오티드는 근육내로 또는 정맥내로 투여된다. 일부 구현예에서, 제 2 폴리뉴클레오티드는 경구로, 비경구로(예를 들어 근육내, 복강내, 정맥내, ICV, 흉골내 주사 또는 주입, 피하 주사, 또는 임플란트), 흡입 스프레이 비강, 질, 직장, 설하, 요도(예를 들어 요도 좌제) 또는 국소 투여 경로(예를 들어 겔, 연고, 크림, 에어로졸 등)에 의해 투여된다. 일부 구현예에서, 제 2 폴리뉴클레오티드는 근육내로 또는 정맥내로 투여된다. 일부 구현예에서, 제 1 및 제 2 폴리뉴클레오티드는 동시에 투여된다. 일부 구현예에서, 제 1 및 제 2 폴리뉴클레오티드는 순차적으로 투여된다. 일부 구현예에서, 디스페리노병증은 사지 근이영양증 유형 2B(LGMD2B) 또는 미요시 근병증이다.Further disclosed herein are methods of treating dysferinopathies. In some embodiments, a method of treating dysferinopathies comprises providing a subject in need thereof with (a) an effective amount of a first polynucleotide, wherein the first polynucleotide is any of the 5' hDYSF polynucleotides disclosed herein; and (b) an effective amount of a second polynucleotide, wherein the second polynucleotide is any of the 3' hDYSF polynucleotides disclosed herein. In some embodiments, the first polynucleotide is administered orally, parenterally (eg intramuscularly, intraperitoneally, intravenously, ICV, intrasternal injection or infusion, subcutaneous injection, or implant), inhalation spray nasal, vaginal, rectal , sublingually, urethral (eg urethral suppositories) or topical routes of administration (eg gels, ointments, creams, aerosols, etc.). In some embodiments, the first polynucleotide is administered intramuscularly or intravenously. In some embodiments, the second polynucleotide is administered orally, parenterally (e.g. intramuscular, intraperitoneal, intravenous, ICV, intrasternal injection or infusion, subcutaneous injection, or implant), inhalation spray nasal, vaginal, rectal , sublingually, urethral (eg urethral suppositories) or topical routes of administration (eg gels, ointments, creams, aerosols, etc.). In some embodiments, the second polynucleotide is administered intramuscularly or intravenously. In some embodiments, the first and second polynucleotides are administered simultaneously. In some embodiments, the first and second polynucleotides are administered sequentially. In some embodiments, the dysferinopathy is limb muscular dystrophy type 2B (LGMD2B) or Miyoshi myopathy.

일부 구현예에서, 디스페리노병증을 치료하는 방법은 이를 필요로 하는 대상체에게, (a) 유효량의 제 1 아데노-연관 바이러스(AAV) 벡터, 여기서 상기 제 1 AAV 벡터든 본원에 개시된 5' hDYSF AAV 벡터 중의 임의의 것임; 및 (b) 유효량의 제 2 아데노-연관 바이러스(AAV) 벡터, 여기서 상기 제 2 AAV 벡터는 본원에 개시된 3' hDYSF AAV 벡터 중의 임의의 것임;을 투여하는 것을 포함하거나, 이로 구성되거나, 또는 본질적으로 이로 구성된다. 일부 구현예에서, 제 1 AAV 벡터는 경구로, 비경구로(예를 들어 근육내, 복강내, 정맥내, ICV, 흉골내 주사 또는 주입, 피하 주사, 또는 임플란트), 흡입 스프레이 비강, 질, 직장, 설하, 요도(예를 들어 요도 좌제) 또는 국소 투여 경로(예를 들어 겔, 연고, 크림, 에어로졸 등)에 의해 투여된다. 일부 구현예에서, 제 1 AAV 벡터는 근육내로 또는 정맥내로 투여된다. 일부 구현예에서, 제 2 AAV 벡터는 경구로, 비경구로(예를 들어 근육내, 복강내, 정맥내, ICV, 흉골내 주사 또는 주입, 피하 주사, 또는 임플란트), 흡입 스프레이 비강, 질, 직장, 설하, 요도(예를 들어 요도 좌제) 또는 국소 투여 경로(예를 들어 겔, 연고, 크림, 에어로졸 등)에 의해 투여된다. 일부 구현예에서, 제 2 AAV 벡터는 근육내 또는 정맥내 투여된다. 일부 구현예에서, 제 1 및 제 2 AAV 벡터는 동시에 투여된다. 일부 구현예에서, 제 1 및 제 2 AAV 벡터는 순차적으로 투여된다. 일부 구현예에서, 디스페리노병증은 사지 근이영양증 유형 2B(LGMD2B) 또는 미요시 근병증이다.In some embodiments, the method of treating dysferinopathy comprises administering to a subject in need thereof (a) an effective amount of a first adeno-associated virus (AAV) vector, wherein said first AAV vector is a 5' hDYSF disclosed herein; is any of the AAV vectors; and (b) an effective amount of a second adeno-associated virus (AAV) vector, wherein the second AAV vector is any of the 3' hDYSF AAV vectors disclosed herein; consists of this In some embodiments, the first AAV vector is administered orally, parenterally (eg intramuscularly, intraperitoneally, intravenously, ICV, intrasternal injection or infusion, subcutaneous injection, or implant), inhalation spray nasal, vaginal, rectal , sublingually, urethral (eg urethral suppositories) or topical routes of administration (eg gels, ointments, creams, aerosols, etc.). In some embodiments, the first AAV vector is administered intramuscularly or intravenously. In some embodiments, the second AAV vector is administered orally, parenterally (e.g. intramuscular, intraperitoneal, intravenous, ICV, intrasternal injection or infusion, subcutaneous injection, or implant), inhalation spray nasal, vaginal, rectal , sublingually, urethral (eg urethral suppositories) or topical routes of administration (eg gels, ointments, creams, aerosols, etc.). In some embodiments, the second AAV vector is administered intramuscularly or intravenously. In some embodiments, the first and second AAV vectors are administered simultaneously. In some embodiments, the first and second AAV vectors are administered sequentially. In some embodiments, the dysferinopathy is limb muscular dystrophy type 2B (LGMD2B) or Miyoshi myopathy.

일부 구현예에서, 디스페리노병증의 치료 방법은 이를 필요로 하는 대상체에게, (a) 유효량의 제 1 AAV 발현 카세트, 여기서 상기 제 1 AAV 발현 카세트는 본원에 개시된 5' hDYSF AAV 발현 카세트 중의 임의의 것임; 및 (b) 유효량의 제 2 AAV 발현 카세트, 여기서 상기 본원에 개시된 제 2 AAV 발현 카세트는 3' hDSYF AAV 발현 카세트 중의 임의의 것임;을 투여하는 것을 포함하거나, 이로 구성되거나, 또는 본질적으로 이로 구성된다. 일부 구현예에서, 제 1 AAV 발현 카세트는 경구로, 비경구로(예를 들어 근육내, 복강내, 정맥내, ICV, 흉골내 주사 또는 주입, 피하 주사, 또는 임플란트), 흡입 스프레이 비강, 질, 직장, 설하, 요도(예를 들어 요도 좌제) 또는 국소 투여 경로(예를 들어 겔, 연고, 크림, 에어로졸 등)에 의해 투여된다. 일부 구현예에서, 제 1 AAV 발현 카세트는 근육내 또는 정맥내 투여된다. 일부 구현예에서, 제 2 AAV 발현 카세트는 경구로, 비경구로(예를 들어 근육내, 복강내, 정맥내, ICV, 흉골내 주사 또는 주입, 피하 주사, 또는 임플란트), 흡입 스프레이 비강, 질, 직장, 설하, 요도(예를 들어 요도 좌제) 또는 국소 투여 경로(예를 들어 겔, 연고, 크림, 에어로졸 등)에 의해 투여된다. 일부 구현예에서, 제 2 AAV 발현 카세트는 근육내 또는 정맥내 투여된다. 일부 구현예에서, 제 1 및 제 2 AAV 발현 카세트는 동시에 투여된다. 일부 구현예에서, 제 1 및 제 2 AAV 발현 카세트는 순차적으로 투여된다. 일부 구현예에서, 디스페리노병증은 사지 근이영양증 유형 2B(LGMD2B) 또는 미요시 근병증이다.In some embodiments, the method of treating dysferinopathy comprises administering to a subject in need thereof (a) an effective amount of a first AAV expression cassette, wherein the first AAV expression cassette is any of the 5' hDYSF AAV expression cassettes disclosed herein. belongs to; and (b) an effective amount of a second AAV expression cassette, wherein the second AAV expression cassette disclosed herein above is any of the 3' hDSYF AAV expression cassettes; do. In some embodiments, the first AAV expression cassette is administered orally, parenterally (eg intramuscularly, intraperitoneally, intravenously, ICV, intrasternal injection or infusion, subcutaneous injection, or implant), inhalation spray nasally, vaginally, Administered by rectal, sublingual, urethral (eg urethral suppositories) or topical routes of administration (eg gels, ointments, creams, aerosols, etc.). In some embodiments, the first AAV expression cassette is administered intramuscularly or intravenously. In some embodiments, the second AAV expression cassette is administered orally, parenterally (eg intramuscularly, intraperitoneally, intravenously, ICV, intrasternal injection or infusion, subcutaneous injection, or implant), inhalation spray nasally, vaginally, Administered by rectal, sublingual, urethral (eg urethral suppositories) or topical routes of administration (eg gels, ointments, creams, aerosols, etc.). In some embodiments, the second AAV expression cassette is administered intramuscularly or intravenously. In some embodiments, the first and second AAV expression cassettes are administered simultaneously. In some embodiments, the first and second AAV expression cassettes are administered sequentially. In some embodiments, the dysferinopathy is limb muscular dystrophy type 2B (LGMD2B) or Miyoshi myopathy.

일부 구현예에서, 디스페리노병증의 치료 방법은 이를 필요로 하는 대상체에게, (a) 본원에 개시된 5' hDYSF 폴리뉴클레오티드 중의 임의의 것 및 본원에 개시된 3' hDYSF 폴리뉴클레오티드 중의 임의의 것; (b) 본원에 개시된 5' hDYSF AAV 벡터 중의 임의의 것 및 본원에 개시된 3' hDYSF AAV 벡터 중의 임의의 것; (c) 본원에 개시된 5' hDYSF AAV 발현 카세트 중의 임의의 것 및 본원에 개시된 3' hDYSF AAV 발현 카세트 중의 임의의 것; 또는 (d) 본원에 개시된 이중 AAV 벡터 시스템 중의 임의의 것;을 포함하는 유효량의 조성물을 투여하는 것을 포함하거나, 이로 구성되거나, 또는 본질적으로 이로 구성된다. 일부 구현예에서, 상기 조성물은 경구로, 비경구로(예를 들어 근육내, 복강내, 정맥내, ICV, 흉골내 주사 또는 주입, 피하 주사, 또는 임플란트), 흡입 스프레이 비강, 질, 직장, 설하, 요도(예를 들어 요도 좌제) 또는 국소 투여 경로(예를 들어 겔, 연고, 크림, 에어로졸 등)에 의해 투여된다. 일부 구현예에서, 상기 조성물은 근육내로 또는 정맥내로 투여된다. 일부 구현예에서, 디스페리노병증은 사지 근이영양증 유형 2B(LGMD2B) 또는 미요시 근병증이다.In some embodiments, a method of treating dysferinopathies comprises administering to a subject in need thereof (a) any of the 5' hDYSF polynucleotides disclosed herein and any of the 3' hDYSF polynucleotides disclosed herein; (b) any of the 5' hDYSF AAV vectors disclosed herein and any of the 3' hDYSF AAV vectors disclosed herein; (c) any of the 5' hDYSF AAV expression cassettes disclosed herein and any of the 3' hDYSF AAV expression cassettes disclosed herein; or (d) any of the dual AAV vector systems disclosed herein; In some embodiments, the composition is administered orally, parenterally (eg intramuscular, intraperitoneal, intravenous, ICV, intrasternal injection or infusion, subcutaneous injection, or implant), inhalation spray nasal, vaginal, rectal, sublingual , urethral (eg urethral suppositories) or topical routes of administration (eg gels, ointments, creams, aerosols, etc.). In some embodiments, the composition is administered intramuscularly or intravenously. In some embodiments, the dysferinopathy is limb muscular dystrophy type 2B (LGMD2B) or Miyoshi myopathy.

본원에서는 추가로, 임의의 재조합 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 벡터 시스템, 및 조성물의, 디스페리노병증의 치료를 필요로 하는 대상체에서 디스페리노병증을 치료하기 위한 의약의 제조에서의 용도가 개시된다. 본원에서는, 디스페리노병증의 치료를 필요로 하는 대상체에서 디스페리노병증을 치료하기 위한 의약의 제조에서의, (a) 본원에 개시된 5' hDYSF 폴리뉴클레오티드 중의 임의의 것 및 본원에 개시된 3' hDYSF 폴리뉴클레오티드 중의 임의의 것; (b) 본원에 개시된 5' hDYSF AAV 벡터 중의 임의의 것 및 본원에 개시된 3' hDYSF AAV 벡터 중의 임의의 것; (c) 본원에 개시된 5' hDYSF AAV 발현 카세트 중의 임의의 것 및 본원에 개시된 3' hDYSF AAV 발현 카세트 중의 임의의 것; 또는 (e) (d) 본원에 개시된 이중 AAV 벡터 시스템 중의 임의의 것;을 포함하는 조성물의 용도가 개시된다. 일부 구현예에서, 조성물은 경구로, 비경구로(예를 들어 근육내, 복강내, 정맥내, ICV, 흉골내 주사 또는 주입, 피하 주사, 또는 임플란트), 흡입 스프레이 비강, 질, 직장, 설하, 요도(예를 들어 요도 좌제) 또는 국소 투여 경로(예를 들어 겔, 연고, 크림, 에어로졸 등)에 의해 투여된다. 일부 구현예에서, 조성물은 근육내로 또는 정맥내로 투여된다. 일부 구현예에서, 디스페리노병증은 사지 근이영양증 유형 2B(LGMD2B) 또는 미요시 근병증이다. Further herein, the use of any of the recombinant polynucleotides, plasmids, viral vectors, vector systems, and compositions in the manufacture of a medicament for treating dysferinopathy in a subject in need thereof is provided. is initiated As used herein, (a) any of the 5' hDYSF polynucleotides disclosed herein and the 3' any of the hDYSF polynucleotides; (b) any of the 5' hDYSF AAV vectors disclosed herein and any of the 3' hDYSF AAV vectors disclosed herein; (c) any of the 5' hDYSF AAV expression cassettes disclosed herein and any of the 3' hDYSF AAV expression cassettes disclosed herein; or (e) (d) any of the dual AAV vector systems disclosed herein. In some embodiments, the composition is administered orally, parenterally (e.g. intramuscular, intraperitoneal, intravenous, ICV, intrasternal injection or infusion, subcutaneous injection, or implant), inhalation spray nasal, vaginal, rectal, sublingual, Administered by urethral (eg urethral suppositories) or topical routes of administration (eg gels, ointments, creams, aerosols, etc.). In some embodiments, the composition is administered intramuscularly or intravenously. In some embodiments, the dysferinopathy is limb muscular dystrophy type 2B (LGMD2B) or Miyoshi myopathy.

본 발명의 방법에서 투여될 AAV 벡터의 역가(titers)는, 예를 들어 특정 AAV, 투여 방식, 치료 목표, 개체, 및 표적화되는 세포 유형(들)에 의존하고, 관련 기술분야의 표준 방법에 의해 결정될 수 있다. AAV의 역가는 ㎖ 당 적어도 약 1×10⁶, 약 1×10⁷, 약 1×10⁸, 약 1×10⁹, 약 1×10¹⁰, 약 1×10¹¹, 약 1×10¹², 약 1×10¹³내지 약 1×10¹⁴개 또는 그 초과의 DNase 저항성 입자(DRP)의 범위일 수 있다. 또한, 투여량은 바이러스 게놈(vg)의 단위로 표현될 수 있다. 예를 들어 AAV의 투여량은 적어도 약 1×10⁶, 약 1×10⁷, 약 1×10⁸, 약 1×10⁹, 약 1×10¹⁰, 약 1×10¹¹, 약 1×10¹², 약 2×10¹², 약 3×10¹², 약 4×10¹², 약 5×10¹², 약 6×10¹², 약 7×10¹², 약 8×10¹², 약 9×10¹², 약 1×10¹³내지 약 1×10¹⁴개의 바이러스 게놈의 범위일 수 있다.The titers of the AAV vectors to be administered in the methods of the present invention depend, for example, on the particular AAV, mode of administration, target of treatment, subject, and cell type(s) to be targeted, and are determined by standard methods in the art. can be determined The AAV titer per ml is at least about 1×10 ⁶ , about 1×10 ⁷ , about 1×10 ⁸ , about 1×10 ⁹ , about 1×10 ¹⁰ , about 1×10 ¹¹ , about 1×10 ¹² , about 1×10 ¹³ to about 1×10 ¹⁴ or more DNase resistant particles (DRPs). Also, the dosage can be expressed in units of viral genome (vg). For example, a dose of AAV may be at least about 1×10 ⁶ , about 1×10 ⁷ , about 1×10 ⁸ , about 1×10 ⁹ , about 1×10 ¹⁰ , about 1×10 ¹¹ , about 1×10 ¹² , about 2×10 ¹² , about 3×10 ¹² , about 4×10 ¹² , about 5×10 ¹² , about 6×10 ¹² , about 7×10 ¹² , about 8×10 ¹² , about 9×10 ¹² , about It may range from 1×10 ¹³ to about 1×10 ¹⁴ viral genomes.

AAV 투여량은 ELISA, 역전사효소 활성의 평가, FACS, 형질도입 검정 노던 블롯팅(예를 들어 반-정량적 노던), 도트 블롯 분석 또는 PCR(예를 들어 qPCR)을 포함하나 이에 제한되지는 않는 다중 방법에 의해 결정될 수 있다. AAV 투여량은 정량적 실시간 PCR (qPCR)로 AAV 벡터 게놈을 측정함으로써 결정될 수 있다는 것이 널리 공지되어 있다. 이러한 qPCR 방법은 통상적인 형질도입 검정으로부터의 불일치 또는 임의적 결과를 극복한다. PCR 투여량 결정의 한 구현예에서, 플라스미드 DNA는 보정 표준으로서 사용된다. 플라스미드의 형태는 qPCR 방법으로부터의 투여량 결과에 영향을 줄 수 있다. 한 구현예에서, 원형 또는 수퍼코일 DNA 또는 플라스미드는 정량화 표준으로서 사용된다.AAV dosing can be performed by multiple assays including but not limited to ELISA, assessment of reverse transcriptase activity, FACS, transduction assay Northern blotting (eg semi-quantitative Northern), dot blot analysis or PCR (eg qPCR). method can be determined. It is well known that AAV dosage can be determined by measuring the AAV vector genome with quantitative real-time PCR (qPCR). This qPCR method overcomes inconsistent or random results from conventional transduction assays. In one embodiment of PCR dosage determination, plasmid DNA is used as a calibration standard. The form of the plasmid can affect dosage results from qPCR methods. In one embodiment, circular or supercoiled DNA or plasmids are used as quantification standards.

일부 구현예에서, 투여량은 정량화 표준으로서 수퍼코일 DNA 또는 플라스미드를 기준으로 하여 vg/kg의 단위로 표현될 수 있다. 예를 들어 AAV의 투여량은 정량화 표준으로서 수퍼코일 DNA 또는 플라스미드를 기준으로 하여 약 1×106-1×1016 vg/kg, 약 1×10⁸ 내지 1×10¹⁵ vg/kg, 또는 약 1×10¹⁰ 내지 1×10¹⁴ vg/kg이다. 또 다른 구현예에서, 투여량은 적어도 약 1×10⁶, 약 1×10⁷, 약 1×10⁸, 약 1×10⁹, 약 1×10¹⁰, 약 1×10¹¹, 약 1×10¹², 약 2×10¹², 약 4×10¹², 약 6×10¹², 약 8×10¹², 약 1×10¹³, 약 2×10¹³, 약 2.4×10¹³, 약 3×10¹³, 약 4×10¹³, 약 5×10¹³, 약 6×10¹³, 약 7×10¹³, 약 8×10¹³, 약 9×10¹³, 약 1×10¹⁴, 약 1×10¹⁵, 또는 적어도 약 1×10¹⁶ vg/kg이다. 한 구현예에서, 투여량은 정량화 표준으로서 수퍼코일 DNA 또는 플라스미드를 기준으로 하여 적어도 2×10¹², 4×10¹², 6×10¹², 8×10¹², 1×10¹³, 2×10¹³, 2.4×10¹³, 3×10¹³, 4×10¹³, 5×10¹³, 6×10¹³, 7×10¹³, 또는 8×10¹³ vg/kg이다.In some embodiments, dosages can be expressed in units of vg/kg, based on supercoiled DNA or plasmid as a quantification standard. For example, the dose of AAV may be about 1×10 6 -1×10 16 vg/kg, about 1×10 ⁸ to 1×10 ¹⁵ vg/kg, or about 1×10 15 vg/kg, based on supercoil DNA or plasmid as a quantification standard. 10 ¹⁰ to 1×10 ¹⁴ vg/kg. In another embodiment, the dosage is at least about 1×10 ⁶ , about 1×10 ⁷ , about 1×10 ⁸ , about 1×10 ⁹ , about 1×10 ¹⁰ , about 1×10 ¹¹ , about 1×10 ¹² , about 2×10 ¹² , about 4×10 ¹² , about 6×10 ¹² , about 8×10 ¹² , about 1×10 ¹³ , about 2×10 ¹³ , about 2.4×10 ¹³ , about 3×10 ¹³ , about 4×10 ¹³ , about 5×10 ¹³ , about 6×10 ¹³ , about 7×10 ¹³ , about 8×10 ¹³ , about 9×10 ¹³ , about 1×10 ¹⁴ , about 1×10 ¹⁵ , or at least It is about 1×10 ¹⁶ vg/kg. In one embodiment, the dose is at least 2×10 ¹² , 4×10 ¹² , 6×10 ¹² , 8×10 ¹² , 1×10 ¹³ , 2×10 ¹³ based on supercoil DNA or plasmid as a quantification standard. , 2.4×10 ¹³ , 3×10 ¹³ , 4×10 ¹³ , 5×10 ¹³ , 6×10 ¹³ , 7×10 ¹³ , or 8×10 ¹³ vg/kg.

일부 구현예에서, 본원에 개시된 방법은 적어도 약 1×10⁶, 약 1×10⁷, 약 1×10⁸, 약 1×10⁹, 약 1×10¹⁰, 약 1×10¹¹, 약 1×10¹², 약 2×10¹², 약 3×10¹², 약 4×10¹², 약 5×10¹², 약 6×10¹², 약 7×10¹², 약 8×10¹², 약 9×10¹², 약 1×10¹³ vg을 주사당 1.5 ml의 총 부피로 투여하는 것을 포함한다. 일부 구현예에서, 본원에 개시된 방법은 적어도 약 1×10⁶, 약 1×10⁷, 약 1×10⁸, 약 1×10⁹, 약 1×10¹⁰, 약 1×10¹¹, 약 1×10¹², 약 2×10¹², 약 3×10¹², 약 4×10¹², 약 5×10¹², 약 6×10¹², 약 7×10¹², 약 8×10¹², 약 9×10¹², 약 1×10¹³, 약 2×10¹³, 약 5×10¹³, 약 7×10¹³, 약 1×10¹⁴ vg의 총 1일 용량을 투여하는 것을 포함한다. 캡시드화된 벡터 게놈 역가를 결정하는 한 예시적인 방법은 정량적 PCR, 예를 들어 문헌 「Pozsgai 등, Mol. Ther. 25(4): 855-869, 2017」에 기재된 방법을 사용하며, 이는 그 전문이 참조로 포함된다.In some embodiments, a method disclosed herein can be used for at least about 1×10 ⁶ , about 1×10 ⁷ , about 1×10 ⁸ , about 1×10 ⁹ , about 1×10 ¹⁰ , about 1×10 ¹¹ , about 1× 10 ¹² , about 2×10 ¹² , about 3×10 ¹² , about 4×10 ¹² , about 5×10 ¹² , about 6×10 ¹² , about 7×10 ¹² , about 8×10 ¹² , about 9×10 ¹² , about 1×10 ¹³ vg in a total volume of 1.5 ml per injection. In some embodiments, a method disclosed herein can be used for at least about 1×10 ⁶ , about 1×10 ⁷ , about 1×10 ⁸ , about 1×10 ⁹ , about 1×10 ¹⁰ , about 1×10 ¹¹ , about 1× 10 ¹² , about 2×10 ¹² , about 3×10 ¹² , about 4×10 ¹² , about 5×10 ¹² , about 6×10 ¹² , about 7×10 ¹² , about 8×10 ¹² , about 9×10 ¹² , about 1×10 ¹³ , about 2×10 ¹³ , about 5×10 ¹³ , about 7×10 ¹³ , about 1×10 ¹⁴ vg total daily dose. One exemplary method for determining encapsidated vector genome titer is quantitative PCR, eg, Pozsgai et al., Mol. Ther. 25(4): 855-869, 2017”, which is incorporated by reference in its entirety.

일부 구현예에서, 본원에 개시된 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 또는 조성물 중의 임의의 것은 대상체에게 매일 적어도 1, 2, 3, 4, 5, 6, 7, 8, 9, 또는 10회 투여된다. 일부 구현예에서, 본원에 개시된 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 또는 조성물 중의 임의의 것은 대상체에게 주(week) 당 적어도 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 또는 20회 투여된다. 일부 구현예에서, 본원에 개시된 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 또는 조성물 중의 임의의 것은 대상체에게 1개월 당 적어도 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 또는 31회 투여된다. 일부 구현예에서, 본원에 개시된 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 또는 조성물 중의 임의의 것은 대상체에게 적어도 1, 2, 3, 4, 5, 6, 7, 8, 9, 또는 10일마다 투여된다. 일부 구현예에서, 본원에 개시된 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 또는 조성물 중의 임의의 것은 대상체에게 적어도 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 또는 14주마다 투여된다. 일부 구현예에서, 본원에 개시된 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 또는 조성물 중의 임의의 것은 대상체에게 적어도 1, 2, 3, 4, 5, 6, 7, 8, 9, 또는 10일의 기간 동안 투여된다. 일부 구현예에서, 본원에 개시된 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 또는 조성물 중의 임의의 것은 대상체에게 적어도 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 또는 20주의 기간 동안 투여된다. 일부 구현예에서, 본원에 개시된 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 또는 조성물 중의 임의의 것은 대상체에게 적어도 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 또는 20개월의 기간 동안 투여된다.In some embodiments, any of the polynucleotides, plasmids, viral vectors, dual vector systems, or compositions disclosed herein are administered to a subject at least 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 times daily. dosed twice. In some embodiments, any of the polynucleotides, plasmids, viral vectors, dual vector systems, or compositions disclosed herein are administered to a subject at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 administrations. In some embodiments, any of the polynucleotides, plasmids, viral vectors, dual vector systems, or compositions disclosed herein are administered to a subject at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, or 31 times. In some embodiments, any of the polynucleotides, plasmids, viral vectors, dual vector systems, or compositions disclosed herein are administered to a subject for at least 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 days. dosed every In some embodiments, any of the polynucleotides, plasmids, viral vectors, dual vector systems, or compositions disclosed herein can be administered to a subject at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , administered every 12, 13, or 14 weeks. In some embodiments, any of the polynucleotides, plasmids, viral vectors, dual vector systems, or compositions disclosed herein are administered to a subject for at least 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 days. administered for a period of In some embodiments, any of the polynucleotides, plasmids, viral vectors, dual vector systems, or compositions disclosed herein can be administered to a subject at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, 15, 16, 17, 18, 19, or 20 weeks. In some embodiments, any of the polynucleotides, plasmids, viral vectors, dual vector systems, or compositions disclosed herein can be administered to a subject at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, 15, 16, 17, 18, 19, or 20 months.

일부 구현예에서, 본원에 개시된 방법은 본원에 개시된 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 또는 조성물 중의 임의의 것을 전신 투여하는 것을 포함한다. 예를 들어 전신 투여는 전체 신체가 영향을 받도록 순환계 내로의 투여이다. 전신 투여는 장관 투여, 예를 들어 위장관을 통한 흡수 및 주사, 주입 또는 이식을 통한 비경구 투여를 포함한다.In some embodiments, the methods disclosed herein include systemic administration of any of the polynucleotides, plasmids, viral vectors, dual vector systems, or compositions disclosed herein. Systemic administration, for example, is administration into the circulatory system such that the entire body is affected. Systemic administration includes enteral administration, eg absorption through the gastrointestinal tract and parenteral administration via injection, infusion or implantation.

일부 구현예에서, 본원에 개시된 방법은 본원에 개시된 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 또는 조성물 중의 임의의 것을 국소적으로 투여하는 것을 포함한다. 일부 구현예에서, 본원에 개시된 방법은 본원에 개시된 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 또는 조성물 중의 임의의 것을 하나 이상의 조직에 투여하는 것을 포함한다. 일부 구현예에서, 조직은 근육, 상피, 결합 및 신경 조직으로부터 선택된다. 일부 구현예에서, 조직은 근육 조직이다.In some embodiments, the methods disclosed herein include topically administering any of the polynucleotides, plasmids, viral vectors, dual vector systems, or compositions disclosed herein. In some embodiments, the methods disclosed herein include administering any of the polynucleotides, plasmids, viral vectors, dual vector systems, or compositions disclosed herein to one or more tissues. In some embodiments, the tissue is selected from muscle, epithelial, connective and neural tissue. In some embodiments, the tissue is muscle tissue.

일부 구현예에서, 본원에 개시된 방법은 본원에 개시된 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 또는 조성물 중의 임의의 것을 대상체의 발에 투여하는 것을 포함한다. 일부 구현예에서, 본원에 개시된 방법은 본원에 개시된 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 또는 조성물 중의 임의의 것을 대상체의 신근 브레비스 (EDB) 근육에 투여하는 것을 포함한다.In some embodiments, the methods disclosed herein include administering to a foot of a subject any of the polynucleotides, plasmids, viral vectors, dual vector systems, or compositions disclosed herein. In some embodiments, the methods disclosed herein comprise administering any of the polynucleotides, plasmids, viral vectors, dual vector systems, or compositions disclosed herein to an extensor brevis (EDB) muscle of a subject.

또한 조합 요법이 본 발명에 의해 고려된다. 본원에 사용된 바와 같은 조합은 동시 치료 및 순차적 치료 둘 다를 포함한다. 본 발명의 방법과 표준 의학적 치료(예를 들어 코르티코스테로이드)의 조합이 신규 요법과의 조합과 같이 구체적으로 고려된다.Combination therapies are also contemplated by the present invention. Combinations as used herein include both simultaneous and sequential treatments. Combinations of the methods of the present invention with standard medical treatments (eg corticosteroids) are specifically contemplated as combinations with novel therapies.

일부 구현예에서, 본원에 개시된 방법은 본원에 개시된 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 또는 조성물 중의 임의의 것을 대상체에게 투여하기 전에 또는 그 후에 대상체에서 디스페린 유전자에서의 돌연변이의 존재 또는 부재를 검출하는 것을 더 포함한다. 일부 구현예에서, 본원에 개시된 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 또는 조성물 중의 임의의 것은 디스페린 유전자에서의 돌연변이의 존재의 검출 시에 대상체에게 투여된다. 예시적인 디스페린 돌연변이는 c1392dupA, c3035GA (pW1012X), c2858dupT, c2779del G, c5594delG, c4201dupA, c1795_1799dupTACT, c3832CT (pQ1278X), c757CT (pR253W), c855+1delG, c3126GA (pW1042X), c1663CT (pR555W), c610CT (pR204X), c3112CT (pR1038X), c1368CG (pC456W), c5713CT (pR1905X), c3826CG (pI1276V), c3843+1GA, c4167+1GC, c2643+1GA, c797TC (pI266P), c4876delG, c3477CA (pY1159X), c3137GA (pR1046H), c509CA (pA170E), c3967CT (pQ1323X), 3191_3196dupGAGGCG, c3992GT (pR1331L), c3516_3517delTT, c247delG, c1180+11CT, c896GA (pG299E), c5078GA (pR1693Q), c5979dupA, c3348+1_3348+4delGTAT, c5314_5318delAGCCC, 및 c565CG (pL189V)를 포함하지만, 이에 제한되지 않는다. 일부 경우에, 디스페린 유전자는 c1392dupA, c3035GA (pW1012X), c2858dupT, c2779delG, c5594delG, c4201dupA, c1795_1799dupTACT, c3832CT (pQ1278X), c757CT (pR253W), c855+1delG, c3126GA (pW1042X), c1663CT (pR555W), c610CT (pR204X), c3112CT (pR1038X), c1368CG (pC456W), c5713CT (pR1905X), c3826CG (pI1276V), c3843+1GA, c4167+1GC, c2643+1GA, c797TC (pI266P), c4876delG, c3477CA (pY1159X), c3137GA (pR1046H), c509CA (pA170E), c3967CT (pQ1323X), 3191_3196dupGAGGCG, c3992GT (pR1331L), c3516_3517delTT, c247delG, c1180+11CT, c896GA (pG299E), c5078GA (pR1693Q), c5979dupA, c3348+1_3348+4delGTAT, c5314_5318delAGCCC, 및 c565CG (pL189V)를 포함하는 하나 이상의 돌연변이를 포함하지만 이에 제한되지는 않는다.In some embodiments, the methods disclosed herein detect the presence or It further includes detecting the absence. In some embodiments, any of the polynucleotides, plasmids, viral vectors, dual vector systems, or compositions disclosed herein are administered to a subject upon detection of the presence of a mutation in the dysferrin gene. 예시적인 디스페린 돌연변이는 c1392dupA, c3035GA (pW1012X), c2858dupT, c2779del G, c5594delG, c4201dupA, c1795_1799dupTACT, c3832CT (pQ1278X), c757CT (pR253W), c855+1delG, c3126GA (pW1042X), c1663CT (pR555W), c610CT ( pR204X), c3112CT (pR1038X), c1368CG (pC456W), c5713CT (pR1905X), c3826CG (pI1276V), c3843+1GA, c4167+1GC, c2643+1GA, c797TC (pI266P), c4876delG, c3477CA (pY1159X), c3137GA (pR1046H ), c509CA (pA170E), c3967CT (pQ1323X), 3191_3196dupGAGGCG, c3992GT (pR1331L), c3516_3517delTT, c247delG, c1180+11CT, c896GA (pG299E), c5078GA (pR1693Q), c5979dupA, c3348+1_3348+4delGTAT, c5314_5318delAGCCC, 및 c565CG ( pL189V), but is not limited thereto. 일부 경우에, 디스페린 유전자는 c1392dupA, c3035GA (pW1012X), c2858dupT, c2779delG, c5594delG, c4201dupA, c1795_1799dupTACT, c3832CT (pQ1278X), c757CT (pR253W), c855+1delG, c3126GA (pW1042X), c1663CT (pR555W), c610CT (pR204X), c3112CT (pR1038X), c1368CG (pC456W), c5713CT (pR1905X), c3826CG (pI1276V), c3843+1GA, c4167+1GC, c2643+1GA, c797TC (pI266P), c4876delG, c4876delG, c347 pR1046H), c509CA (pA170E), c3967CT (pQ1323X), 3191_3196dupGAGGCG, c3992GT (pR1331L), c3516_3517delTT, c247delG, c1180+11CT, c896GA (pG299E), c5078GA (pR1693Q), c5979dupA, c3348+1_3348+4delGTAT, c5314_5318delAGCCC, 및 c565CG (pL189V).

일부 구현예에서, 본원에 개시된 방법은 대상체에게 본원에 개시된 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 또는 조성물 중의 임의의 것을 투여하기 전에 또는 그 후에 대상체에서 디스페린 단백질의 수준을 검출하는 것을 더 포함한다. 일부 구현예에서, 본원에 개시된 방법은 대상체에게 본원에 개시된 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 또는 조성물 중의 임의의 것을 투여한 후에 대상체에서 디스페린 단백질의 수준을 검출하는 것을 더 포함한다. 일부 구현예에서, 디스페린의 수준을 검출하는 것은 디스페린 유전자의 발현을 검출하는 것을 포함한다. 디스페린 유전자의 발현을 검출하는 것은 디스페린 DNA 또는 RNA 수준을 정량화하는 것을 포함할 수 있다. 대안적으로, 또는 추가적으로, 디스페린 단백질의 수준을 검출하는 것은 디스페린 단백질의 수준을 정량화하는 것을 포함한다. 일부 구현예에서, 디스페린 단백질의 수준은 대상체로부터의 샘플에서 검출된다. 일부 구현예에서, 샘플은 체액 샘플이다. 체액 샘플의 예는 혈액, 소변, 땀, 타액, 대변, 및 활액을 포함하나 이에 제한되지는 않는다. 일부 구현예에서, 혈액 샘플은 혈장 또는 혈청 샘플이다. 일부 경우에, 상기 방법은 예를 들어 Athena Diagnostics (CPT: 81408(1))로부터의 디스페린 DNA 서열분석 시험을 더 포함한다. In some embodiments, a method disclosed herein comprises detecting the level of dysferrin protein in a subject prior to or after administering to the subject any of the polynucleotides, plasmids, viral vectors, dual vector systems, or compositions disclosed herein. contains more In some embodiments, the methods disclosed herein further comprise detecting the level of dysferrin protein in the subject after administering to the subject any of the polynucleotides, plasmids, viral vectors, dual vector systems, or compositions disclosed herein. . In some embodiments, detecting the level of dysferrin comprises detecting expression of the dysferrin gene. Detecting expression of the dysferrin gene may include quantifying dysferrin DNA or RNA levels. Alternatively, or additionally, detecting the level of dysferrin protein includes quantifying the level of dysferrin protein. In some embodiments, the level of dysferrin protein is detected in a sample from the subject. In some embodiments, the sample is a bodily fluid sample. Examples of bodily fluid samples include, but are not limited to, blood, urine, sweat, saliva, feces, and synovial fluid. In some embodiments, the blood sample is a plasma or serum sample. In some cases, the method further comprises a dysferrin DNA sequencing test, eg, from Athena Diagnostics (CPT: 81408(1)).

일부 구현예에서, 본원에 개시된 방법은 대상체에게 투여되는 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템 또는 조성물 중의 임의의 것의 용량 또는 투여 빈도를 변형시키는 것을 더 포함한다. 일부 구현예에서, 투여량 또는 투여 빈도의 변경은 디스페린 단백질 수준의 검출에 기초한다. 일부 구현예에서, 용량 또는 투여 빈도는 대상체에서 디스페린 단백질 수준이 더 이른 시점(예를 들면, 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 또는 조성물의 투여 전, 또는 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 또는 조성물의 초기 투여량의 투여 후, 그러나 폴리뉴클레오티드, 플라스미드, 바이러스 벡터, 이중 벡터 시스템, 또는 조성물의 후속 용량의 투여 전)으로부터 대상체에서 디스페린 단백질 수준과 비교하여 증가할 때 감소된다.In some embodiments, the methods disclosed herein further comprise modifying the dose or frequency of administration of any of the polynucleotides, plasmids, viral vectors, dual vector systems or compositions administered to the subject. In some embodiments, alteration of dosage or frequency of administration is based on detection of dysferrin protein levels. In some embodiments, the dose or frequency of administration is determined at an earlier time in the subject's dysferrin protein level (e.g., prior to administration of the polynucleotide, plasmid, viral vector, dual vector system, or composition, or to a polynucleotide, plasmid, virus increase compared to the dysferrin protein level in a subject after administration of an initial dose of the vector, dual vector system, or composition, but prior to administration of subsequent doses of the polynucleotide, plasmid, viral vector, dual vector system, or composition) is reduced when

키트kit

또 다른 측면에서, 폴리뉴클레오티드, 폴리펩티드, 벡터, 세포 및 시스템 중의 임의의 하나 이상, 또는 조성물, 및 사용 설명서를 포함하거나, 대안적으로 이들로 본질적으로 구성되거나, 또 추가로 이들로 구성되는 키트가 제공된다. 한 측면에서, 폴리뉴클레오티드, 폴리펩티드, 벡터, 세포 및 시스템 중의 임의의 하나 이상, 또는 조성물은 검출가능하게 표지되거나 정제 또는 검출가능한 마커를 추가로 포함한다. 일부 경우에, 상기 키트는, a) 제 1 폴리뉴클레오티드, 여기서 상기 제 1 폴리뉴클레오티드는 본원에 기재된 재조합 폴리뉴클레오티드임, 제 2 폴리뉴클레오티드, 여기서 상기 제 2 폴리뉴클레오티드는 본원에 기재된 재조합 폴리뉴클레오티드임; 또는 b) 제 1 아데노-연관 바이러스(AAV) 벡터, 여기서 상기 제 1 AAV 벡터는 본원에 기재된 AAV 벡터임, 제 2 아데노-연관 바이러스(AAV) 벡터, 여기서 상기 제 2 AAV 벡터는 본원에 기재된 AAV 벡터임; 또는 c) 본원에 기재된 AAV 이중 벡터 시스템; 또는 d) 본원에 기재된 조성물; 또는 e) 본원에 기재된 세포(예를 들어 숙주 세포, 임의로 포유동물 세포); 및 임의로 사용 지침서;를 포함한다.In another aspect, a kit comprising, or alternatively consisting essentially of, or further consisting of any one or more of polynucleotides, polypeptides, vectors, cells and systems, or compositions, and instructions for use is provided. Provided. In one aspect, any one or more of the polynucleotides, polypeptides, vectors, cells and systems, or compositions are detectably labeled or further comprise a purified or detectable marker. In some cases, the kit comprises: a) a first polynucleotide, wherein the first polynucleotide is a recombinant polynucleotide described herein, and a second polynucleotide, wherein the second polynucleotide is a recombinant polynucleotide described herein; or b) a first adeno-associated virus (AAV) vector, wherein said first AAV vector is an AAV vector described herein, and a second adeno-associated virus (AAV) vector, wherein said second AAV vector is an AAV vector described herein. is a vector; or c) an AAV dual vector system described herein; or d) a composition described herein; or e) a cell described herein (eg a host cell, optionally a mammalian cell); and, optionally, instructions for use.

실시예Example

실시예Example 1: 이중 1: double AAVAAV 벡터 시스템의 생성 Creation of vector systems

본 실시예는 본원에 개시된 이중 AAV 벡터 시스템을 생산하기 위한 예시적인 방법을 제공한다. 본 실시예에서는, 이중 AAV 벡터, rAAVrh74.MHCK7.DYSF.DV가 생산된다. rAAVrh.74.MHCK7.DYSF.DV 근육 특이적 MHCK7 프로모터의 제어 하에 이중 벡터 (DV)로부터 인간 디스페린을 발현하는 비-복제 재조합 AAV 혈청형 rh74 (AAVrh74)이다. 이중 벡터는 디스페린 cDNA 서열의 5' 부분 또는 3' 부분을 함유하고, 이들 부분은 ~1 kb 만큼 중첩되어 재조합을 촉진하고 전체 길이 인간 디스페린 유전자를 생성한다. 인간 디스페린 cDNA의 일부를 함유하는 발현 카세트는 AAV2 역 말단 반복 서열 (ITR)에 의해 플랭킹된다(도 1).This example provides an exemplary method for producing the dual AAV vector system disclosed herein. In this example, a double AAV vector, rAAVrh74.MHCK7.DYSF.DV, is produced. rAAVrh.74.MHCK7.DYSF.DV is a non-cloning recombinant AAV serotype rh74 (AAVrh74) that expresses human dysferrin from a dual vector (DV) under the control of the muscle specific MHCK7 promoter. The dual vector contains either the 5' or 3' portions of the dysferrin cDNA sequence, and these portions overlap by -1 kb to facilitate recombination and create the full-length human dysferrin gene. An expression cassette containing a portion of the human dysferrin cDNA is flanked by AAV2 inverted terminal repeat sequences (ITRs) (FIG. 1).

rAAVrh.74.MHCK7.DYSFDV를 구축하기 위해, 인간 디스페린 cDNA를 AAV의 패키징 용량(4.7kb)에 부착된 2개의 구조체로 분할하였다. 5' 벡터(예를 들어 5' hDYSF AAV 벡터), pAAV.MHCK7.DYSF5'.PTG(PTG=프로모터/전이유전자)는 근육 특이적 MHCK7 프로모터, 키메라 인트론, 공통 코작 서열 및 디스페린 아미노산 서열의 아미노산 1-1113에 상응하는 DYSF cDNA의 5' 부분을 함유한다. 3' 벡터(예를 들어 3' hDYSF AAV 벡터), pAAV.DYSF3'.POLYA는, 폴리아데닐화 신호를 보유하는(harboring) DYSF 3'UTR 및 디스페린 아미노산 서열의 아미노산 794-2080에 상응하는 DYSF cDNA의 3' 부분을 함유한다. 5' hDYSF AAV 벡터 및 3' hDYSF AAV 벡터의 발현 카세트의 서열은 각각 서열번호 6 및 8로서 개시된다.To construct rAAVrh.74.MHCK7.DYSFDV, human dysferin cDNA was split into two constructs attached to the packaging capacity of AAV (4.7 kb). The 5' vector (e.g. 5' hDYSF AAV vector), pAAV.MHCK7.DYSF5'.PTG (PTG=promoter/transgene) contains the muscle-specific MHCK7 promoter, chimeric intron, consensus Kozak sequence and the amino acids of the dysferrin amino acid sequence. It contains the 5' portion of the DYSF cDNA corresponding to 1-1113. A 3' vector (e.g. 3' hDYSF AAV vector), pAAV.DYSF3'.POLYA, is a DYSF 3'UTR harboring a polyadenylation signal and DYSF corresponding to amino acids 794-2080 of the disferrin amino acid sequence contains the 3' portion of cDNA. The sequences of the expression cassettes of the 5' hDYSF AAV vector and the 3' hDYSF AAV vector are disclosed as SEQ ID NOs: 6 and 8, respectively.

이전 연구는 MHCK7 프로모터(그 전문이 참조로 포함되는 문헌 「Salva 등, Mol Ther 15, 320-329 (2007)」) 및 AAVrh.74를 사용하여 심장 발현을 입증하여 골격, 횡격막, 및 심근 발현을 달성하였다(그 전문이 참조로 포함되는 문헌 「Sondergaard 등, Annals of clinical and Transl Neurology 2, 256-270 (2015)」). 5' hDYSF AAV 벡터 및 3' hDSYF AAV 벡터를 별개의 AAVrh.74 비리온으로 캡시드화하였다. AAVrh.74 혈청형의 분자 클론을 레서스 마카크 림프절로부터 클로닝하고, 이는 그 전문이 참조로 포함되는 문헌 「Rodino-Klapac 등, Journal of Translational medicine 5, 45 (2007)」에서 논의된다.Previous studies have demonstrated cardiac expression using the MHCK7 promoter (Salva et al., Mol Ther 15, 320-329 (2007), incorporated by reference in its entirety) and AAVrh.74 to achieve skeletal, diaphragmatic, and myocardial expression. It was achieved ("Sondergaard et al., Annals of Clinical and Transl Neurology 2, 256-270 (2015)", which is incorporated by reference in its entirety). The 5' hDYSF AAV vector and the 3' hDSYF AAV vector were encapsidated into separate AAVrh.74 virions. A molecular clone of the AAVrh.74 serotype was cloned from rhesus macaque lymph nodes and is discussed in Rodino-Klapac et al., Journal of Translational medicine 5, 45 (2007), which is incorporated by reference in its entirety.

5' 5' hDYSFhDYSF AAVAAV 벡터( vector( AAVAAV 벡터 플라스미드 vector plasmid pAAVpAAV .. MHCK7.DYSF5MHCK7. DYSF5 '.'. PTGPTG ))

제 1 재조합 단일-가닥 AAV 벡터를 AAV 벡터 DNA 플라스미드 pAAV.MHCK7.DYSF5'.PTG를 사용하여 생성되었다. 인간 디스페린 부분 cDNA 서열(인간 cDNA, Genbank 수탁번호 NM_003494.3)의 5' 부분을 구동하는 MHCK7 발현 카세트를 벡터 백본(vector backbone) pAAV-CMV(클론테크(Clontech))에 삽입함으로써 플라스미드를 구축하였다(플라스미드 맵에 대해 도 2 및 특이적 서열 정보에 대해 표 1 참조). 키메라 인트론이 존재하고, 인간 β-글로빈 유전자의 제 1 인트론으로부터의 5' 공여체 부위 및 면역글로불린 유전자 중쇄 가변 영역의 몸체(body)와 리더(leader) 사이에 있는 인트론으로부터의 분지점 및 3' 스플라이스 수용체 부위로 구성되었다. 이 벡터에 포함된 유일한 바이러스 서열은 rAAV 벡터 게놈의 바이러스 DNA 복제 및 패키징 둘 다에 필요한 AAV2의 역전된 말단 반복체이다. 2개의 ITR 사이의 서열은 AAVrh.74 비리온으로 캡시드화된 DNA의 부분이다.A first recombinant single-stranded AAV vector was generated using the AAV vector DNA plasmid pAAV.MHCK7.DYSF5'.PTG. Plasmid construction by inserting the MHCK7 expression cassette driving the 5' portion of the human dysferrin partial cDNA sequence (human cDNA, Genbank Accession No. NM_003494.3) into the vector backbone pAAV-CMV (Clontech) (see Figure 2 for plasmid map and Table 1 for specific sequence information). A chimeric intron exists, and the 5' donor site from the first intron of the human β-globin gene and the branch point and 3' spleen from the intron between the body and the leader of the heavy chain variable region of the immunoglobulin gene Consists of the Rice receptor site. The only viral sequence contained in this vector is the inverted terminal repeat of AAV2 required for both viral DNA replication and packaging of the rAAV vector genome. The sequence between the two ITRs is the portion of DNA encapsidated into the AAVrh.74 virion.

[표 1][Table 1]

3' 3' hDYSFhDYSF AAVAAV 벡터( vector( AAVAAV 벡터 플라스미드 vector plasmid pAAVpAAV .. DYSF3DYSF3 '.'. POLYPOLY ))

제 2 재조합 단일-가닥 AAV 벡터를 AAV 벡터 DNA 플라스미드 pAAV.DYSF3'.POLYA를 사용하여 생산하였다. 인간 디스페린 부분 cDNA 서열(인간 cDNA, Genbank 수탁번호 NM_003494.3)을 벡터 백본 pAAV-CMV (Clontech)에 삽입함으로써 플라스미드를 구축하였다(플라스미드 맵에 대해 도 3 및 특이적 서열 정보에 대해 표 2 참조). 내인성 디스페린 3' 비번역 영역 및 폴리A 신호 서열이 효율적인 전사 종결(transcription termination)을 위해 사용되었다. 이 벡터에 포함된 유일한 바이러스 서열은 rAAV 벡터 게놈의 바이러스 DNA 복제 및 패키징 둘 다에 필요한 AAV2의 역전된 말단 반복체이다. 2개의 ITR 사이의 서열은 AAVrh.74 비리온으로 캡시드화된 DNA의 부분이다.A second recombinant single-stranded AAV vector was produced using the AAV vector DNA plasmid pAAV.DYSF3'.POLYA. A plasmid was constructed by inserting the human dysferrin partial cDNA sequence (human cDNA, Genbank Accession No. NM_003494.3) into the vector backbone pAAV-CMV (Clontech) (see Figure 3 for plasmid map and Table 2 for specific sequence information). ). Endogenous dysferrin 3' untranslated region and polyA signal sequence were used for efficient transcription termination. The only viral sequence contained in this vector is the inverted terminal repeat of AAV2 required for both viral DNA replication and packaging of the rAAV vector genome. The sequence between the two ITRs is the portion of DNA encapsidated into the AAVrh.74 virion.

[표 2][Table 2]

AAVAAV 헬퍼helper 플라스미드 ( plasmid ( pNLRep2pNLRep2 -- Caprh74Caprh74 ))

모 플라스미드(parent plasmid), pNLrep는 p5E18 및 pCLR3K로부터 구축되었다. [문헌 「Bansal, D., 등, Defective membrane repair in dysferlin-deficient muscular dystrophy. Nature 423, 168-172 (2003)」을 참조하며, 상기 문헌은 그 전체가 참조로 포함된다]. p5E18은 pAAV/Ad를 기반으로 한다. 그것은 AAV2 rep 및 캡 유전자를 함유하며, p5 프로모터가 rep의 5' 말단에서 제거되고 캡의 3' 말단에 배치되며, 이는 p5와 rep 사이에 3 kb 스페이서 서열의 존재를 초래한다(특이적 서열 정보에 대해서는 표 3 참조). pCLR3K를 생성하기 위해, 인간 콜라겐 인트론을 PCR에 의해 증폭시킨 다음, 위치 1.052에서 pAd/AAV로 클로닝하였다. pNLrep를 구축하기 위해, p5E18의 BamHI/XbaI 단편을 pCLR3K로부터의 3 kb 콜라겐 인트론을 함유하는 BamHI/XbaI 단편으로 대체하였다. rh74 캡 유전자를 PCR 증폭시키고, Swa I/Not I 제한 부위를 사용하여 pNLrep에서 AAV2 캡 유전자 대신에 클로닝하여 pNLRep2-Caprh74를 수득하였다. AAV rh74 캡시드 유전자의 동일성은 DNA 플라스미드 서열분석에 의해 확인되었다.The parent plasmid, pNLrep, was constructed from p5E18 and pCLR3K. [Document "Bansal, D. , et al., Defective membrane repair in dysferlin-deficient muscular dystrophy. Nature 423 , 168-172 (2003), which is incorporated by reference in its entirety]. p5E18 is based on pAAV/Ad. It contains the AAV2 rep and cap genes, the p5 promoter is removed from the 5' end of rep and placed at the 3' end of cap, which results in the presence of a 3 kb spacer sequence between p5 and rep (specific sequence information see Table 3). To generate pCLR3K, a human collagen intron was amplified by PCR and then cloned into pAd/AAV at position 1.052. To construct pNLrep, the BamHI/XbaI fragment of p5E18 was replaced with a BamHI/XbaI fragment containing the 3 kb collagen intron from pCLR3K. The rh74 cap gene was PCR amplified and cloned in place of the AAV2 cap gene in pNLrep using Swa I/Not I restriction sites to obtain pNLRep2-Caprh74. The identity of the AAV rh74 capsid gene was confirmed by DNA plasmid sequencing.

[표 3][Table 3]

아데노바이러스adenovirus 헬퍼helper 플라스미드( Plasmid ( pHELPpHELP ))

플라스미드 pHELP는 Applied Viromics(Fremont, CA 94538)로부터 얻었고, 크기는 11,635 bp이다(구체적인 서열 정보에 대해서는 표 4 참조). 플라스미드는 AAV 복제에 중요한 아데노바이러스 게놈의 영역, 즉 E2A, E4 및 VA RNA를 함유한다(아데노바이러스 E1 기능은 293개의 세포에 의해 제공됨). 플라스미드는 pBluescript 백본을 기반으로 하고, 또한 암피실린(ampicillin)(10,182-11,042 bp), 박테리아 ColE1 복제기점(9,315- 10,167 bp) 및 f1 단일-가닥 DNA 복제기점(11,172-11,627 bp)에 내성을 부여하는 TEM-1 β-락타마아제 유전자를 암호화하는 bla 유전자를 함유한다. 이 플라스미드에 존재하는 아데노바이러스 서열은 단지 아데노바이러스 게놈의 약 ~28%(9,280 / 35,938)를 나타내고, 복제에 중요한 시스(cis) 요소, 예를 들어 역전된 말단 반복부를 함유하지 않는다. 이들 3개의 아데노바이러스 유전자의 동일성은 수행된 DNA 플라스미드 서열분석에 의해 확인하였다. DNA 분석은 3개의 아데노바이러스 유형 5 유전자 영역(그 전문이 참조로 포함되는 GenBank 수탁번호 AF369965)과 100% 상동성을 나타내었다.Plasmid pHELP was obtained from Applied Viromics (Fremont, CA 94538) and is 11,635 bp in size (see Table 4 for specific sequence information). The plasmid contains regions of the adenovirus genome important for AAV replication, namely the E2A, E4 and VA RNAs (adenovirus E1 functions are provided by 293 cells). The plasmid is based on the pBluescript backbone, and also confers resistance to ampicillin (10,182-11,042 bp), the bacterial ColE1 origin of replication (9,315-10,167 bp) and the f1 single-stranded DNA origin of replication (11,172-11,627 bp). It contains the bla gene encoding the TEM-1 β-lactamase gene. The adenoviral sequences present on this plasmid represent only about 28% (9,280 / 35,938) of the adenovirus genome and do not contain cis elements important for replication, such as inverted terminal repeats. The identity of these three adenovirus genes was confirmed by DNA plasmid sequencing performed. DNA analysis showed 100% homology to three adenovirus type 5 gene regions (GenBank accession number AF369965, incorporated by reference in its entirety).

[표 4][Table 4]

실시예Example 2: 이중 2: double AAVAAV 벡터 시스템을 이용한 바이러스 생성물의 제조 Preparation of viral products using vector systems

HEK 293 세포를 최적화된 칼슘 포스페이트 공침전 방법을 사용하여 3개의 생산 플라스미드((i) AAV 벡터 플라스미드, 예를 들어 5' hDYSF AAV 벡터 또는 3' hDYSF AAV 벡터; (ii) 아데노바이러스(Ad) 헬퍼 플라스미드; 및 (iii) AAV 헬퍼 플라스미드)로 형질주입시켰다. 세포를 형질주입시키는 것은 AAV 벡터 플라스미드, Ad 헬퍼 플라스미드, AAV 헬퍼 플라스미드 및 CaCl₂를 함유하는 DNA/칼슘 용액을 제조하고, 동일 부피의 2X HEPES 완충 염수와 혼합하여 최적 침전물을 수득하는 것을 포함한다. 이어서, 침전물을 HEK 293 세포에 첨가하고, 인큐베이션하였다. 이어서, 침전물을 HEK 293 세포에 첨가하고, 인큐베이션하였다. 배양 후, 배지를 교환하고, 이때 핵산 분해 효소(nuclease)를 첨가하였다.HEK 293 cells were transfected with three production plasmids ((i) an AAV vector plasmid, e.g., a 5' hDYSF AAV vector or a 3' hDYSF AAV vector; (ii) an adenovirus (Ad) helper. plasmid; and (iii) an AAV helper plasmid). Transfecting the cells involved preparing a DNA/calcium solution containing AAV vector plasmid, Ad helper plasmid, AAV helper plasmid and CaCl ₂ and mixing with an equal volume of 2X HEPES buffered saline to obtain an optimal precipitate. The precipitate was then added to HEK 293 cells and incubated. The precipitate was then added to HEK 293 cells and incubated. After culturing, the medium was exchanged, and at this time, nuclease was added.

실시예Example 3: 3: rAAVrhrAAVrh .. 74.MHCK7.DYSF74.MHCK7.DYSF .. DVDV 근육내intramuscular 전달의 효능 결정 Determining the efficacy of delivery

~1kb의 중첩 서열을 갖는 MHCK7.DYSF 카세트의 5' 및 3' 부분을 함유하는 2개의 AAV 발현 카세트를 생성하였다(도 1 참조). 플라스미드를 AAVrh74 벡터 내로 패키징하였다. 4주령의 Dysf^-/- 마우스를 전경골근(tibialis anterior muscle) 내로의 근육내 주사에 의해 1×10¹¹ vg의 각각의 벡터로 처리하고 1개월에 부검하였다. 강력한 전체-길이 디스페린 발현은 면역 염색(도 4a) 및 웨스턴 블롯(도 4c)에 의해 두 벡터 모두의 전달 후에 나타났다. 어느 하나의 벡터 단독의 전달은 비정상적인 디스페린 발현을 갖지 않았다(도 4b, 면역 염색, 및 도 4d, 웨스턴 블롯). 3222는 전체-길이 대조군이다. 디스페린을 발현하는 근섬유의 수를 정량화하고 표 5에 나타내었다.Two AAV expression cassettes were generated containing the 5' and 3' portions of the MHCK7.DYSF cassette with ~1 kb of overlapping sequence (see Figure 1). The plasmid was packaged into an AAVrh74 vector. 4-week-old Dysf ^-/- mice were treated with 1×10 ¹¹ vg of each vector by intramuscular injection into the tibialis anterior muscle and necropsied at 1 month. Robust full-length dysferrin expression was shown after delivery of both vectors by immunostaining (FIG. 4A) and Western blot (FIG. 4C). Delivery of either vector alone did not result in aberrant dysferrin expression (FIG. 4B, immunostaining, and FIG. 4D, Western blot). 3222 is the full-length control. The number of myofibers expressing dysferrin was quantified and shown in Table 5.

[표 5][Table 5]

전경골근에 대한 근육내 주사 후 안전성을 평가하기 위한 시간 경과 연구를 개시하였다(표 6). 단백질 발현 및 벡터 생체분포를 또한 평가하였다. 1, 3, 6, 9, 및 12개월 종말점에서, 동물을 충분히 부검하고, 근육 및 비표적 기관에 대한 디스페린 발현(도 5a 내지 도 5c(1, 3, 및 6개월이 표시됨), 벡터 생체분포(vector biodistribution)(표 7) 및 조직병리학을 분석하였다. 경골근(tibialis muscle; TA)을 주사하였다. 각각의 동물에 대한 조직병리학에 대해 분석된 조직은 다음을 포함하였다: 생식선, 간, 심장, 폐, 비장, 신장, 횡격막, 좌측(처리된) 및 우측 전경골근. 어떠한 발견도 확인되지 않았다.A time course study was initiated to evaluate safety after intramuscular injection into the tibialis anterior muscle (Table 6). Protein expression and vector biodistribution were also evaluated. At the 1, 3, 6, 9, and 12 month endpoints, animals were thoroughly necropsied and dysferin expression on muscle and non-target organs (Figures 5A-5C (1, 3, and 6 months indicated), vector biopsy Analysis of vector biodistribution (Table 7) and histopathology.Tibialis muscle (TA) was injected.Tissues analyzed for histopathology for each animal included: gonads, liver, heart , lung, spleen, kidney, diaphragm, left (treated) and right tibialis anterior muscle.No finding was confirmed.

[표 6][Table 6]

[표 7][Table 7]

근육내 주사 후, 디스페린의 발현이 주사된 경골 전근에서 발견되었다(도 6 참조). 정맥내 전달 후, 디스페린의 발현이 골격근 및 심장 근육에서 발견되었다(도 7a 참조). After intramuscular injection, expression of dysferin was found in the injected tibialis anterior muscle (see Figure 6). After intravenous delivery, expression of dysferrin was found in skeletal and cardiac muscle (see FIG. 7A ).

마우스의 추가 코호트를 처리하여 막 복구를 위한 최소 유효 용량을 결정하였다. AAV 벡터의 3회 용량을 8주령의 129Dysf-/- 마우스의 FDB에 주사하였다(그룹당 n=6). 대조군 Dysf-/- 그룹은 염수를 투여받았고, 129WT 마우스의 그룹은 균주 특이적 정상 대조군으로서 기능하였다. 도 8에 도시된 바와 같이, AAVrh.74.DYSF.DV 처리는 용량 의존성 막 재밀봉을 나타냈다. 병행 발현 연구는 높은 용량이 섬유 형질도입의 발현>50%를 초래함을 보여준다. 이 용량은 근육 중량에 대해 정규화될 때 발현 및 안전성 연구를 위해 전경골근에 제공된 것과 동등하다(도 4a 내지 도 5c 참조).Additional cohorts of mice were treated to determine the minimum effective dose for membrane repair. Three doses of AAV vectors were injected into the FDB of 8-week-old 129Dysf−/− mice (n=6 per group). A control Dysf-/- group received saline, and a group of 129WT mice served as strain-specific normal controls. As shown in Figure 8, AAVrh.74.DYSF.DV treatment showed dose dependent membrane resealing. Parallel expression studies show that high doses result in >50% expression of fiber transduction. This dose, when normalized to muscle weight, is equivalent to that given to the tibialis anterior muscle for expression and safety studies (see FIGS. 4A-5C ).

[표 8][Table 8]

rAAVrhrAAVrh .. 74.MHCK7.DYSF74.MHCK7.DYSF .. DV의DV's 전신 전달 systemic delivery

이중 벡터 전달의 전신 전달의 타당성/유효성을 시험하기 위해 용량 발견 연구(dose finding study)를 수행하였다. BlaJ 마우스(BL6 마우스에 역교배된 AJ 디스페린 결핍 마우스)를 이 균주에서 확립된 기능적 MRI/MRS 결과에 기초한 연구를 위해 사용하였다. 3 그룹의 마우스(그룹 당 n=6)를 6주령에서 염수, 2e12 vg 총 AAV.DYSF DV(정량 표준으로서 슈퍼코일 DNA 또는 플라스미드에 기초한 8e13 vg/kg), 또는 6e12 vg 총 AAV.DYSF DV(정량 표준으로서 슈퍼코일 DNA 또는 플라스미드에 기초한 2.4e13 vg/kg)로 꼬리 정맥 주사로 처리하였다. 종말점 분석은 3개월에 이루어졌고, 횡격막 생리학(diaphragm physiology), FDB의 막 복구 분석, 및 디스페린 발현을 정량화하고 조직병리학을 평가하기 위한 완전한 부검(necropsies)을 포함하였다.⁴ A dose finding study was conducted to test the feasibility/effectiveness of systemic delivery of dual vector delivery. BlaJ mice (AJ dysferrin deficient mice backcrossed to BL6 mice) were used for studies based on functional MRI/MRS results established in this strain. Three groups of mice (n=6 per group) were injected at 6 weeks of age with saline, 2e12 vg total AAV.DYSF DV (8e13 vg/kg based on supercoil DNA or plasmid as quantification standard), or 6e12 vg total AAV.DYSF DV ( 2.4e13 vg/kg) based on supercoiled DNA or plasmid as quantification standard) by tail vein injection. Endpoint analyzes were performed at 3 months and included diaphragm physiology, analysis of membrane repair of FDB, and complete necropsies to quantify dysferin expression and assess histopathology. ⁴

4개월의 연구 종말점에서, 완전한 부검이 수행되었다. 횡격막(diaphragm)은 힘을 측정하고, FDB 근육은 막 복구의 회복을 위해 시험하고, 근육 및 기관을 발현, 벡터 생체분포 및 조직병리학을 위해 수확하였다. 고 용량에서, 디스페린 발현은 모든 근육에서 광범위한 반면(도 7a 및 7b), 저 용량은 낮은 수준의 가변 발현을 가졌다. 20주에서, BlaJ는 조직학적으로 매우 온화한 표현형을 갖는다. 대조군에 비해 재생의 마커로서 중심 핵에서 유의한 증가가 존재한다. 처리된 마우스는 고 용량에서 중심 핵형성에서 유의한 감소를 나타내었다(도 7b). 가장 영향을 많이 받은 근육인 소아(psoa)는 고 용량에서 처리시 섬유증 및 염증이 감소하는 것으로 나타났다(6e12 vg)(도 9). 횡격막의 힘 결손은 고 용량 및 저 용량 둘 다에서 회복되었고(도 10a), FDB 근육의 막 복구에서 용량 반응이 존재하였다(도 10b).At the study endpoint of 4 months, a complete necropsy was performed. The diaphragm was force measured, FDB muscle was tested for restoration of membrane repair, and muscle and organs were harvested for expression, vector biodistribution and histopathology. At high doses, dysferrin expression was extensive in all muscles (Figures 7A and 7B), whereas low doses had low levels of variable expression. At 20 weeks, BlaJ has a histologically very mild phenotype. There is a significant increase in the central nucleus as a marker of regeneration compared to control. Treated mice showed a significant reduction in central nucleation at high doses (FIG. 7B). The most affected muscle, the psoa, showed reduced fibrosis and inflammation when treated at high doses (6e12 vg) (FIG. 9). The diaphragmatic force deficit was restored at both high and low doses (FIG. 10A), and there was a dose response in membrane repair of the FDB muscle (FIG. 10B).

실시예Example 4: 4: NHP에서in the NHP AAV5AAV5 및 and AavrhAavrh .. 74.Mhck7.Dysf74. Mhck7. Dysf .. DvDv 전달에 의한 전체-길이 디스페린 발현의 안전성 및 효능 Safety and Efficacy of Full-Length Dysferin Expression by Delivery

방법: 근육내 주사에 의해 AAV.MHCK7.DYSF.FLAG로 4개의 NHP를 처리하였다. 2개의 NHP를 AAV5.MHCK7.DYSF로 처리하고, 2개를 AAVrh.74.MHCK7.DYSF.FLAG로 처리하였다. 본 발명자들의 임상 시험 설계를 반영하여, 한마리의 동물을 3개월에, 두마리는 6개월에 분석하였다. 동물들은 AAV5와 rh.74 캡시드 및 디스페린(도 11a 내지 도 11d)과 항-AAV Ab 역가(표 9)에 대해 T 세포를 측정하기 위한 ELISpot 분석을 포함한 기본 화학 및 면역학 연구를 받았다.Methods: Four NHPs were treated with AAV.MHCK7.DYSF.FLAG by intramuscular injection. Two NHPs were treated with AAV5.MHCK7.DYSF and two with AAVrh.74.MHCK7.DYSF.FLAG. Reflecting our clinical trial design, one animal was analyzed at 3 months and two at 6 months. Animals underwent basic chemistry and immunology studies including ELISpot assays to measure T cells for AAV5 and rh.74 capsid and dysferin (FIGS. 11A-11D) and anti-AAV Ab titers (Table 9).

PBMC를 자극하는데 사용되는 펩티드 풀(peptide pools)은 가능한 모든 항원 에피토프를 포획할 수 있도록 10개의 아미노산으로 중첩된 15개의 긴 아미노산으로 설계되었다. 펩티드에 반응하는 세포는 ELISpot 분석을 통해 스폿(spots)으로서 정량화된 인터페론-γ를 방출한다. 백만개의 세포당 스폿은 양성 반응 임계치(positive reaction threshold)로서 50개의 스폿/1×10⁶개의 세포로 계수되었다. 어떠한 지속적인 면역 반응도 관찰되지 않았다. 모든 동물은 연구 종말점에서 발현이 있었다(도 16a). 이들 연구는 전체 연구에 대해 2주마다 반복되었다. 연구 종말점에서 유전자 발현 연구에 더하여 생체 기관 조직에 대한 조직병리학 및 생체분포 연구를 포함하는 동물에 대한 전체 부검이 수행되었다.Peptide pools used to stimulate PBMCs were designed to be 15 amino acids long with an overlap of 10 amino acids to capture all possible antigenic epitopes. Cells responding to the peptide release interferon-γ quantified as spots via ELISpot assay. Spots per million cells were counted as 50 spots/1×10 ⁶ cells as a positive reaction threshold. No sustained immune response was observed. All animals had expression at the end of the study (FIG. 16A). These studies were repeated every 2 weeks for the entire study. At study endpoint, a full necropsy was performed on the animals, which included histopathology and biodistribution studies on living organ tissues in addition to gene expression studies.

결과: 어떠한 관찰가능한 독성도 발견되지 않았다. 본 출원인은 디스페린의 과발현을 입증하기 위해 레서스(rhesus)와 인간 디스페린을 구별하지 않는 항-디스페린 항체를 사용하였다(도 12a 내지 도 12c). 또한 벡터 유래된 디스페린 발현을 확인하기 위해 항-FLAG 면역 염색을 수행하였다(도 13). AAV5.DYSF 주사된 TA의 경우, 근육은 디스페린의 104.9%(3mo) 및 122.6%(6mo) 과발현을 나타낸 반면, AAVrh74.DYSF.DV 주사된 TA는 주사되지 않은 대조군과 비교하여 122.0%(3mo) 및 115.2% 6mo) 과발현을 나타냈다. 염증 또는 근섬유 괴사가 결여된 NHP의 조직 수준에서 어떠한 독성도 관찰되지 않았다. 면역학적 분석은 ELISpot에 의한 캡시드 또는 전이유전자에 대한 어떠한 비정상적인 반응을 나타내지 않았다(도 11a 내지 도 11d). 또한, 충분히 완전한 혈액 계수(complete blood count) 및 화학 패널은 임의의 마카크에서 비정상 값을 나타내지 않았다. 예상한 바와 같이, 항-AAV 항체 역가는 유전자 전달 후에 상승되었다. 종말점 항-AAVrh74 역가는 항-AAV5에 대한 것보다 더 낮았다.Results: No observable toxicity was found. Applicants used an anti-dysferin antibody that does not discriminate between rhesus and human dysferin to demonstrate overexpression of dysferin (FIGS. 12a to 12c). In addition, anti-FLAG immunostaining was performed to confirm expression of vector-derived dysferin (FIG. 13). For AAV5.DYSF injected TA, muscle showed 104.9% (3mo) and 122.6% (6mo) overexpression of dysferin, whereas AAVrh74.DYSF.DV injected TA showed 122.0% (3mo) overexpression of dysferin compared to non-injected controls. ) and 115.2% 6mo) overexpression. No toxicity was observed at the tissue level in NHPs lacking inflammation or myofibrillar necrosis. Immunological analysis did not reveal any abnormal response to the capsid or transgene by ELISpot (Figs. 11a to 11d). Also, sufficiently complete blood counts and chemistry panels did not show abnormal values in any macaques. As expected, anti-AAV antibody titers were elevated after gene transfer. Endpoint anti-AAVrh74 titers were lower than those for anti-AAV5.

[표 9][Table 9]

등가물equivalent

달리 정의되지 않는 한, 본원 명세서에서 사용되는 모든 기술적 및 과학적 용어는 이 기술 분야의 당업자에 의해 일반적으로 이해되는 것과 동일한 의미를 갖는다.Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art.

본원에서 예시적으로 기술된 본 기술은 본원에 구체적으로 개시되지 않은 임의의 요소 또는 요소들, 제한 또는 제한들의 부재 하에 적합하게 실시될 수 있다. 따라서 예를 들어 "포함하는(comprising)", "포함하는(including)", "함유하는(containing)" 등의 용어는 제한 없이 포괄적으로 읽혀져야 한다. 또한, 본원 명세서에서 사용되는 용어 및 표현들은 설명의 목적으로 사용된 것이지 제한할 목적으로 사용된 것이 아니며, 도시 및 설명된 특징들 또는 그 일부의 균등물들을 배제하는 이러한 용어들 및 표현들의 사용의 의도는 없으며, 청구된 본 기술의 범위 내에서 다양한 수정들이 가능하다는 것이 인식된다.The technology illustratively described herein may suitably be practiced in the absence of any element or elements, limitation or limitations, not specifically disclosed herein. Thus, for example, the terms "comprising", "including", "containing" and the like are to be read inclusively and without limitation. Further, the terms and expressions used herein are used for purposes of description and not limitation, and the use of such terms and expressions to exclude equivalents of the features shown and described or portions thereof is prohibited. Without intent, it is recognized that various modifications are possible within the scope of the claimed subject matter.

따라서, 본원에 제공된 재료, 방법 및 실시예는 바람직한 구현예를 대표하고, 예시적인 것이며, 본 기술의 범위에 대한 제한으로서 의도되지 않는다는 것이 이해되어야 한다.Accordingly, it should be understood that the materials, methods, and examples provided herein represent preferred embodiments, are illustrative, and are not intended as limitations on the scope of the present technology.

본 기술은 광범위하게 그리고 포괄적으로 본원 명세서에서 설명되었다. 일반 개시물 내에 속하는 각각의 더 좁은 종 및 하위-범용 그룹은 또한 본 기술의 일부를 형성한다. 이는 절제된 물질이 본원에서 구체적으로 언급되는지 여부에 관계없이, 속으로부터 임의의 주제를 제거하는 단서 또는 부정적 제한을 갖는 본 기술의 일반적인 설명을 포함한다.The technology has been broadly and comprehensively described herein. Each narrower species and sub-universal group falling within the general disclosure also forms part of the present description. This includes general descriptions of the subject matter with a proviso or negative limitation removing any subject matter from the genus, regardless of whether or not the excised material is specifically recited herein.

또한, 본 기술의 특징 또는 구현예가 마쿠쉬 그룹의 관점에서 기술되는 경우, 당업자는 본 기술이 또한 마쿠쉬 그룹의 구성원의 임의의 개별 구성원 또는 서브그룹의 관점에서 기술된다는 것을 인식할 것이다.Further, where a feature or embodiment of the present technology is described in terms of a Markush group, one skilled in the art will recognize that the present technology is also described in terms of any individual member or subgroup of members of the Markush group.

본원 명세서에 언급된 모든 간행물, 특허 출원, 특허, 및 다른 참고문헌은 각각이 개별적으로 참고로 포함되는 것과 동일한 정도로 그 전체가 참고로 명백히 포함된다. 충돌의 경우에, 정의를 포함하는 본원 명세서는 조절될 것이다.All publications, patent applications, patents, and other references mentioned in this specification are expressly incorporated by reference in their entirety to the same extent as if each were individually incorporated by reference. In case of conflict, the present specification, including definitions, will control.

다른 구현예들은 다음의 청구항들 내에 제시된다.Other implementations are presented in the following claims.

SEQUENCE LISTING <110> RESEARCH INSTITUTE AT NATIONWIDE CHILDREN'S HOSPITAL <120> GENE THERAPY WITH DYSFERLIN DUAL VECTORS <130> 106887-8070 <140> <141> <150> 63/024,338 <151> 2020-05-13 <160> 22 <170> PatentIn version 3.5 <210> 1 <211> 3340 <212> DNA <213> Homo sapiens <220> <223> 5' hDYSF DNA fragment v1 <400> 1 atgctgaggg tcttcatcct ctatgccgag aacgtccaca cacccgacac cgacatcagc 60 gatgcctact gctccgcggt gtttgcaggg gtgaagaaga gaaccaaagt catcaagaac 120 agcgtgaacc ctgtatggaa tgagggattt gaatgggacc tcaagggcat ccccctggac 180 cagggctctg agcttcatgt ggtggtcaaa gaccatgaga cgatggggag gaacaggttc 240 ctgggggaag ccaaggtccc actccgagag gtcctcgcca cccctagtct gtccgccagc 300 ttcaatgccc ccctgctgga caccaagaag cagcccacag gggcctcgct ggtcctgcag 360 gtgtcctaca caccgctgcc tggagctgtg cccctgttcc cgccccctac tcctctggag 420 ccctccccga ctctgcctga cctggatgta gtggcagaca caggaggaga ggaagacaca 480 gaggaccagg gactcactgg agatgaggcg gagccattcc tggatcaaag cggaggcccg 540 ggggctccca ccaccccaag gaaactacct tcacgtcctc cgccccacta ccccgggatc 600 aaaagaaagc gaagtgcgcc tacatctaga aagctgctgt cagacaaacc gcaggatttc 660 cagatcaggg tccaggtgat cgaggggcgc cagctgccgg gggtgaacat caagcctgtg 720 gtcaaggtta ccgctgcagg gcagaccaag cggacgcgga tccacaaggg aaacagccca 780 ctcttcaatg agactctttt cttcaacttg tttgactctc ctggggagct gtttgatgag 840 cccatcttta tcacggtggt agactctcgt tctctcagga cagatgctct cctcggggag 900 ttccggatgg acgtgggcac catttacaga gagccccggc acgcctatct caggaagtgg 960 ctgctgctct cagaccctga tgacttctct gctggggcca gaggctacct gaaaacaagc 1020 ctttgtgtgc tggggcctgg ggacgaagcg cctctggaga gaaaagaccc ctctgaagac 1080 aaggaggaca ttgaaagcaa cctgctccgg cccacaggcg tagccctgcg aggagcccac 1140 ttctgcctga aggtcttccg ggccgaggac ttgccgcaga tggacgatgc cgtgatggac 1200 aacgtgaaac agatctttgg cttcgagagt aacaagaaga acttggtgga cccctttgtg 1260 gaggtcagct ttgcggggaa aatgctgtgc agcaagatct tggagaagac ggccaaccct 1320 cagtggaacc agaacatcac actgcctgcc atgtttccct ccatgtgcga aaaaatgagg 1380 attcgtatca tagactggga ccgcctgact cacaatgaca tcgtggctac cacctacctg 1440 agtatgtcga aaatctctgc ccctggagga gaaatagaag aggagcctgc aggtgctgtc 1500 aagccttcga aagcctcaga cttggatgac tacctgggct tcctccccac ttttgggccc 1560 tgctacatca acctctatgg cagtcccaga gagttcacag gcttcccaga cccctacaca 1620 gagctcaaca caggcaaggg ggaaggtgtg gcttatcgtg gccggcttct gctctccctg 1680 gagaccaagc tggtggagca cagtgaacag aaggtggagg accttcctgc ggatgacatc 1740 ctccgggtgg agaagtacct taggaggcgc aagtactccc tgtttgcggc cttctactca 1800 gccaccatgc tgcaggatgt ggatgatgcc atccagtttg aggtcagcat cgggaactac 1860 gggaacaagt tcgacatgac ctgcctgccg ctggcctcca ccactcagta cagccgtgca 1920 gtctttgacg ggtgccacta ctactaccta ccctggggta acgtgaaacc tgtggtggtg 1980 ctgtcatcct actgggagga catcagccat agaatcgaga ctcagaacca gctgcttggg 2040 attgctgacc ggctggaagc tggcctggag caggtccacc tggccctgaa ggcgcagtgc 2100 tccacggagg acgtggactc gctggtggct cagctgacgg atgagctcat cgcaggctgc 2160 agccagcctc tgggtgacat ccatgagaca ccctctgcca cccacctgga ccagtacctg 2220 taccagctgc gcacccatca cctgagccaa atcactgagg ctgccctggc cctgaagctc 2280 ggccacagtg agctccctgc agctctggag caggcggagg actggctcct gcgtctgcgt 2340 gccctggcag aggagcccca gaacagcctg ccggacatcg tcatctggat gctgcaggga 2400 gacaagcgtg tggcatacca gcgggtgccc gcccaccaag tcctcttctc ccggcggggt 2460 gccaactact gtggcaagaa ttgtgggaag ctacagacaa tctttctgaa atatccgatg 2520 gagaaggtgc ctggcgcccg gatgccagtg cagatacggg tcaagctgtg gtttgggctc 2580 tctgtggatg agaaggagtt caaccagttt gctgagggga agctgtctgt ctttgctgaa 2640 acctatgaga acgagactaa gttggccctt gttgggaact ggggcacaac gggcctcacc 2700 taccccaagt tttctgacgt cacgggcaag atcaagctac ccaaggacag cttccgcccc 2760 tcggccggct ggacctgggc tggagattgg ttcgtgtgtc cggagaagac tctgctccat 2820 gacatggacg ccggtcacct gagcttcgtg gaagaggtgt ttgagaacca gacccggctt 2880 cccggaggcc agtggatcta catgagtgac aactacaccg atgtgaacgg ggagaaggtg 2940 cttcccaagg atgacattga gtgcccactg ggctggaagt gggaagatga ggaatggtcc 3000 acagacctca accgggctgt cgatgagcaa ggctgggagt atagcatcac catccccccg 3060 gagcggaagc cgaagcactg ggtccctgct gagaagatgt actacacaca ccgacggcgg 3120 cgctgggtgc gcctgcgcag gagggatctc agccaaatgg aagcactgaa aaggcacagg 3180 caggcggagg cggagggcga gggctgggag tacgcctctc tttttggctg gaagttccac 3240 ctcgagtacc gcaagacaga tgccttccgc cgccgccgct ggcgccgtcg catggagcca 3300 ctggagaaga cggggcctgc agctgtgttt gcccttgagg 3340 <210> 2 <211> 3866 <212> DNA <213> Homo sapiens <220> <223> 3' hDYSF DNA fragment v1 <400> 2 tcgtcatctg gatgctgcag ggagacaagc gtgtggcata ccagcgggtg cccgcccacc 60 aagtcctctt ctcccggcgg ggtgccaact actgtggcaa gaattgtggg aagctacaga 120 caatctttct gaaatatccg atggagaagg tgcctggcgc ccggatgcca gtgcagatac 180 gggtcaagct gtggtttggg ctctctgtgg atgagaagga gttcaaccag tttgctgagg 240 ggaagctgtc tgtctttgct gaaacctatg agaacgagac taagttggcc cttgttggga 300 actggggcac aacgggcctc acctacccca agttttctga cgtcacgggc aagatcaagc 360 tacccaagga cagcttccgc ccctcggccg gctggacctg ggctggagat tggttcgtgt 420 gtccggagaa gactctgctc catgacatgg acgccggtca cctgagcttc gtggaagagg 480 tgtttgagaa ccagacccgg cttcccggag gccagtggat ctacatgagt gacaactaca 540 ccgatgtgaa cggggagaag gtgcttccca aggatgacat tgagtgccca ctgggctgga 600 agtgggaaga tgaggaatgg tccacagacc tcaaccgggc tgtcgatgag caaggctggg 660 agtatagcat caccatcccc ccggagcgga agccgaagca ctgggtccct gctgagaaga 720 tgtactacac acaccgacgg cggcgctggg tgcgcctgcg caggagggat ctcagccaaa 780 tggaagcact gaaaaggcac aggcaggcgg aggcggaggg cgagggctgg gagtacgcct 840 ctctttttgg ctggaagttc cacctcgagt accgcaagac agatgccttc cgccgccgcc 900 gctggcgccg tcgcatggag ccactggaga agacggggcc tgcagctgtg tttgcccttg 960 agggggccct gggcggcgtg atggatgaca agagtgaaga ttccatgtcc gtctccacct 1020 tgagcttcgg tgtgaacaga cccacgattt cctgcatatt cgactatggg aaccgctacc 1080 atctacgctg ctacatgtac caggcccggg acctggctgc gatggacaag gactcttttt 1140 ctgatcccta tgccatcgtc tccttcctgc accagagcca gaagacggtg gtggtgaaga 1200 acacccttaa ccccacctgg gaccagacgc tcatcttcta cgagatcgag atctttggcg 1260 agccggccac agttgctgag caaccgccca gcattgtggt ggagctgtac gaccatgaca 1320 cttatggtgc agacgagttt atgggtcgct gcatctgtca accgagtctg gaacggatgc 1380 cacggctggc ctggttccca ctgacgaggg gcagccagcc gtcgggggag ctgctggcct 1440 cttttgagct catccagaga gagaagccgg ccatccacca tattcctggt tttgaggtgc 1500 aggagacatc aaggatcctg gatgagtctg aggacacaga cctgccctac ccaccacccc 1560 agagggaggc caacatctac atggttcctc agaacatcaa gccagcgctc cagcgtaccg 1620 ccatcgagat cctggcatgg ggcctgcgga acatgaagag ttaccagctg gccaacatct 1680 cctcccccag cctcgtggta gagtgtgggg gccagacggt gcagtcctgt gtcatcagga 1740 acctccggaa gaaccccaac tttgacatct gcaccctctt catggaagtg atgctgccca 1800 gggaggagct ctactgcccc cccatcaccg tcaaggtcat cgataaccgc cagtttggcc 1860 gccggcctgt ggtgggccag tgtaccatcc gctccctgga gagcttcctg tgtgacccct 1920 actcggcgga gagtccatcc ccacagggtg gcccagacga tgtgagccta ctcagtcctg 1980 gggaagacgt gctcatcgac attgatgaca aggagcccct catccccatc caggaggaag 2040 agttcatcga ttggtggagc aaattctttg cctccatagg ggagagggaa aagtgcggct 2100 cctacctgga gaaggatttt gacaccctga aggtctatga cacacagctg gagaatgtgg 2160 aggcctttga gggcctgtct gacttttgta acaccttcaa gctgtaccgg ggcaagacgc 2220 aggaggagac agaagatcca tctgtgattg gtgaatttaa gggcctcttc aaaatttatc 2280 ccctcccaga agacccagcc atccccatgc ccccaagaca gttccaccag ctggccgccc 2340 agggacccca ggagtgcttg gtccgtatct acattgtccg agcatttggc ctgcagccca 2400 aggaccccaa tggaaagtgt gatccttaca tcaagatctc catagggaag aaatcagtga 2460 gtgaccagga taactacatc ccctgcacgc tggagcccgt atttggaaag atgttcgagc 2520 tgacctgcac tctgcctctg gagaaggacc taaagatcac tctctatgac tatgacctcc 2580 tctccaagga cgaaaagatc ggtgagacgg tcgtcgacct ggagaacagg ctgctgtcca 2640 agtttggggc tcgctgtgga ctcccacaga cctactgtgt ctctggaccg aaccagtggc 2700 gggaccagct ccgcccctcc cagctcctcc acctcttctg ccagcagcat agagtcaagg 2760 cacctgtgta ccggacagac cgtgtaatgt ttcaggataa agaatattcc attgaagaga 2820 tagaggctgg caggatccca aacccacacc tgggcccagt ggaggagcgt ctggctctgc 2880 atgtgcttca gcagcagggc ctggtcccgg agcacgtgga gtcacggccc ctctacagcc 2940 ccctgcagcc agacatcgag caggggaagc tgcagatgtg ggtcgaccta tttccgaagg 3000 ccctggggcg gcctggacct cccttcaaca tcaccccacg gagagccaga aggtttttcc 3060 tgcgttgtat tatctggaat accagagatg tgatcctgga tgacctgagc ctcacggggg 3120 agaagatgag cgacatttat gtgaaaggtt ggatgattgg ctttgaagaa cacaagcaaa 3180 agacagacgt gcattatcgt tccctgggag gtgaaggcaa cttcaactgg aggttcattt 3240 tccccttcga ctacctgcca gctgagcaag tctgtaccat tgccaagaag gatgccttct 3300 ggaggctgga caagactgag agcaaaatcc cagcacgagt ggtgttccag atctgggaca 3360 atgacaagtt ctcctttgat gattttctgg gctccctgca gctcgatctc aaccgcatgc 3420 ccaagccagc caagacagcc aagaagtgct ccttggacca gctggatgat gctttccacc 3480 cagaatggtt tgtgtccctt tttgagcaga aaacagtgaa gggctggtgg ccctgtgtag 3540 cagaagaggg tgagaagaaa atactggcgg gcaagctgga aatgaccttg gagattgtag 3600 cagagagtga gcatgaggag cggcctgctg gccagggccg ggatgagccc aacatgaacc 3660 ctaagcttga ggacccaagg cgccccgaca cctccttcct gtggtttacc tccccataca 3720 agaccatgaa gttcatcctg tggcggcgtt tccggtgggc catcatcctc ttcatcatcc 3780 tcttcatcct gctgctgttc ctggccatct tcatctacgc cttcccgaac tatgctgcca 3840 tgaagctggt gaagcccttc agctga 3866 <210> 3 <211> 128 <212> DNA <213> Adeno-associated virus 2 <220> <223> ITR <400> 3 gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg 60 tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact ccatcactag 120 gggttcct 128 <210> 4 <211> 792 <212> DNA <213> Artificial Sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <223> promoter <400> 4 aagcttgcat gtctaagcta gacccttcag attaaaaata actgaggtaa gggcctgggt 60 aggggaggtg gtgtgagacg ctcctgtctc tcctctatct gcccatcggc cctttgggga 120 ggaggaatgt gcccaaggac taaaaaaagg ccatggagcc agaggggcga gggcaacaga 180 cctttcatgg gcaaaccttg gggccctgct gtctagcatg ccccactacg ggtctaggct 240 gcccatgtaa ggaggcaagg cctggggaca cccgagatgc ctggttataa ttaacccaga 300 catgtggctg cccccccccc cccaacacct gctgcctcta aaaataaccc tgtccctggt 360 ggatcccctg catgcgaaga tcttcgaaca aggctgtggg ggactgaggg caggctgtaa 420 caggcttggg ggccagggct tatacgtgcc tgggactccc aaagtattac tgttccatgt 480 tcccggcgaa gggccagctg tcccccgcca gctagactca gcacttagtt taggaaccag 540 tgagcaagtc agcccttggg gcagcccata caaggccatg gggctgggca agctgcacgc 600 ctgggtccgg ggtgggcacg gtgcccgggc aacgagctga aagctcatct gctctcaggg 660 gcccctccct ggggacagcc cctcctggct agtcacaccc tgtaggctcc tctatataac 720 ccaggggcac aggggctgcc ctcattctac caccacctcc acagcacaga cagacactca 780 ggagcagcca gc 792 <210> 5 <211> 148 <212> DNA <213> Artificial Sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <223> Intron <400> 5 aggtaagttt agtctttttg tcttttattt caggtcccgg atccggtggt ggtgcaaatc 60 aaagaactgc tcctcagtgg atgttgcctt tacttctagg cctgtacgga agtgttactt 120 ctgctctaaa agctgcggaa ttgtaccc 148 <210> 6 <211> 4689 <212> DNA <213> Artificial Sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <223> DYSF F5' Expression Cassette v1 <400> 6 gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg 60 tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact ccatcactag 120 gggttccttg tagttaatga ttaacccgcc atgctctaga gtttaagctt gcatgtctaa 180 gctagaccct tcagattaaa aataactgag gtaagggcct gggtagggga ggtggtgtga 240 gacgctcctg tctctcctct atctgcccat cggccctttg gggaggagga atgtgcccaa 300 ggactaaaaa aaggccatgg agccagaggg gcgagggcaa cagacctttc atgggcaaac 360 cttggggccc tgctgtctag catgccccac tacgggtcta ggctgcccat gtaaggaggc 420 aaggcctggg gacacccgag atgcctggtt ataattaacc cagacatgtg gctgcccccc 480 cccccccaac acctgctgcc tctaaaaata accctgtccc tggtggatcc cctgcatgcg 540 aagatcttcg aacaaggctg tgggggactg agggcaggct gtaacaggct tgggggccag 600 ggcttatacg tgcctgggac tcccaaagta ttactgttcc atgttcccgg cgaagggcca 660 gctgtccccc gccagctaga ctcagcactt agtttaggaa ccagtgagca agtcagccct 720 tggggcagcc catacaaggc catggggctg ggcaagctgc acgcctgggt ccggggtggg 780 cacggtgccc gggcaacgag ctgaaagctc atctgctctc aggggcccct ccctggggac 840 agcccctcct ggctagtcac accctgtagg ctcctctata taacccaggg gcacaggggc 900 tgccctcatt ctaccaccac ctccacagca cagacagaca ctcaggagca gccagcggcg 960 cgcccaggta agtttagtct ttttgtcttt tatttcaggt cccggatccg gtggtggtgc 1020 aaatcaaaga actgctcctc agtggatgtt gcctttactt ctaggcctgt acggaagtgt 1080 tacttctgct ctaaaagctg cggaattgta cccgcggccg cggctagcca ccatgctgag 1140 ggtcttcatc ctctatgccg agaacgtcca cacacccgac accgacatca gcgatgccta 1200 ctgctccgcg gtgtttgcag gggtgaagaa gagaaccaaa gtcatcaaga acagcgtgaa 1260 ccctgtatgg aatgagggat ttgaatggga cctcaagggc atccccctgg accagggctc 1320 tgagcttcat gtggtggtca aagaccatga gacgatgggg aggaacaggt tcctggggga 1380 agccaaggtc ccactccgag aggtcctcgc cacccctagt ctgtccgcca gcttcaatgc 1440 ccccctgctg gacaccaaga agcagcccac aggggcctcg ctggtcctgc aggtgtccta 1500 cacaccgctg cctggagctg tgcccctgtt cccgccccct actcctctgg agccctcccc 1560 gactctgcct gacctggatg tagtggcaga cacaggagga gaggaagaca cagaggacca 1620 gggactcact ggagatgagg cggagccatt cctggatcaa agcggaggcc cgggggctcc 1680 caccacccca aggaaactac cttcacgtcc tccgccccac taccccggga tcaaaagaaa 1740 gcgaagtgcg cctacatcta gaaagctgct gtcagacaaa ccgcaggatt tccagatcag 1800 ggtccaggtg atcgaggggc gccagctgcc gggggtgaac atcaagcctg tggtcaaggt 1860 taccgctgca gggcagacca agcggacgcg gatccacaag ggaaacagcc cactcttcaa 1920 tgagactctt ttcttcaact tgtttgactc tcctggggag ctgtttgatg agcccatctt 1980 tatcacggtg gtagactctc gttctctcag gacagatgct ctcctcgggg agttccggat 2040 ggacgtgggc accatttaca gagagccccg gcacgcctat ctcaggaagt ggctgctgct 2100 ctcagaccct gatgacttct ctgctggggc cagaggctac ctgaaaacaa gcctttgtgt 2160 gctggggcct ggggacgaag cgcctctgga gagaaaagac ccctctgaag acaaggagga 2220 cattgaaagc aacctgctcc ggcccacagg cgtagccctg cgaggagccc acttctgcct 2280 gaaggtcttc cgggccgagg acttgccgca gatggacgat gccgtgatgg acaacgtgaa 2340 acagatcttt ggcttcgaga gtaacaagaa gaacttggtg gacccctttg tggaggtcag 2400 ctttgcgggg aaaatgctgt gcagcaagat cttggagaag acggccaacc ctcagtggaa 2460 ccagaacatc acactgcctg ccatgtttcc ctccatgtgc gaaaaaatga ggattcgtat 2520 catagactgg gaccgcctga ctcacaatga catcgtggct accacctacc tgagtatgtc 2580 gaaaatctct gcccctggag gagaaataga agaggagcct gcaggtgctg tcaagccttc 2640 gaaagcctca gacttggatg actacctggg cttcctcccc acttttgggc cctgctacat 2700 caacctctat ggcagtccca gagagttcac aggcttccca gacccctaca cagagctcaa 2760 cacaggcaag ggggaaggtg tggcttatcg tggccggctt ctgctctccc tggagaccaa 2820 gctggtggag cacagtgaac agaaggtgga ggaccttcct gcggatgaca tcctccgggt 2880 ggagaagtac cttaggaggc gcaagtactc cctgtttgcg gccttctact cagccaccat 2940 gctgcaggat gtggatgatg ccatccagtt tgaggtcagc atcgggaact acgggaacaa 3000 gttcgacatg acctgcctgc cgctggcctc caccactcag tacagccgtg cagtctttga 3060 cgggtgccac tactactacc taccctgggg taacgtgaaa cctgtggtgg tgctgtcatc 3120 ctactgggag gacatcagcc atagaatcga gactcagaac cagctgcttg ggattgctga 3180 ccggctggaa gctggcctgg agcaggtcca cctggccctg aaggcgcagt gctccacgga 3240 ggacgtggac tcgctggtgg ctcagctgac ggatgagctc atcgcaggct gcagccagcc 3300 tctgggtgac atccatgaga caccctctgc cacccacctg gaccagtacc tgtaccagct 3360 gcgcacccat cacctgagcc aaatcactga ggctgccctg gccctgaagc tcggccacag 3420 tgagctccct gcagctctgg agcaggcgga ggactggctc ctgcgtctgc gtgccctggc 3480 agaggagccc cagaacagcc tgccggacat cgtcatctgg atgctgcagg gagacaagcg 3540 tgtggcatac cagcgggtgc ccgcccacca agtcctcttc tcccggcggg gtgccaacta 3600 ctgtggcaag aattgtggga agctacagac aatctttctg aaatatccga tggagaaggt 3660 gcctggcgcc cggatgccag tgcagatacg ggtcaagctg tggtttgggc tctctgtgga 3720 tgagaaggag ttcaaccagt ttgctgaggg gaagctgtct gtctttgctg aaacctatga 3780 gaacgagact aagttggccc ttgttgggaa ctggggcaca acgggcctca cctaccccaa 3840 gttttctgac gtcacgggca agatcaagct acccaaggac agcttccgcc cctcggccgg 3900 ctggacctgg gctggagatt ggttcgtgtg tccggagaag actctgctcc atgacatgga 3960 cgccggtcac ctgagcttcg tggaagaggt gtttgagaac cagacccggc ttcccggagg 4020 ccagtggatc tacatgagtg acaactacac cgatgtgaac ggggagaagg tgcttcccaa 4080 ggatgacatt gagtgcccac tgggctggaa gtgggaagat gaggaatggt ccacagacct 4140 caaccgggct gtcgatgagc aaggctggga gtatagcatc accatccccc cggagcggaa 4200 gccgaagcac tgggtccctg ctgagaagat gtactacaca caccgacggc ggcgctgggt 4260 gcgcctgcgc aggagggatc tcagccaaat ggaagcactg aaaaggcaca ggcaggcgga 4320 ggcggagggc gagggctggg agtacgcctc tctttttggc tggaagttcc acctcgagta 4380 ccgcaagaca gatgccttcc gccgccgccg ctggcgccgt cgcatggagc cactggagaa 4440 gacggggcct gcagctgtgt ttgcccttga gggcggccgc aataaaagat ctttattttc 4500 attagatctg tgtgttggtt ttttgtgtgt ctagagcatg gcgggttaat cattaactac 4560 aaggaacccc tagtgatgga gttggccact ccctctctgc gcgctcgctc gctcactgag 4620 gccgggcgac caaaggtcgc ccgacgcccg ggctttgccc gggcggcctc agtgagcgag 4680 cgagcgcgc 4689 <210> 7 <211> 49 <212> DNA <213> Artificial Sequence <220> <223> Description of Artificial Sequence: Synthetic oligonucleotide <220> <223> PolyA Signal <400> 7 aataaaagat ctttattttc attagatctg tgtgttggtt ttttgtgtg 49 <210> 8 <211> 4592 <212> DNA <213> Artificial Sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <223> DYSF F3' Expression Cassette v1 <400> 8 gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg 60 tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact ccatcactag 120 gggttccttg tagttaatga ttaacccgcc atgctactta tctacgtagc catgctctgg 180 tcgactctag agttttcgtc atctggatgc tgcagggaga caagcgtgtg gcataccagc 240 gggtgcccgc ccaccaagtc ctcttctccc ggcggggtgc caactactgt ggcaagaatt 300 gtgggaagct acagacaatc tttctgaaat atccgatgga gaaggtgcct ggcgcccgga 360 tgccagtgca gatacgggtc aagctgtggt ttgggctctc tgtggatgag aaggagttca 420 accagtttgc tgaggggaag ctgtctgtct ttgctgaaac ctatgagaac gagactaagt 480 tggcccttgt tgggaactgg ggcacaacgg gcctcaccta ccccaagttt tctgacgtca 540 cgggcaagat caagctaccc aaggacagct tccgcccctc ggccggctgg acctgggctg 600 gagattggtt cgtgtgtccg gagaagactc tgctccatga catggacgcc ggtcacctga 660 gcttcgtgga agaggtgttt gagaaccaga cccggcttcc cggaggccag tggatctaca 720 tgagtgacaa ctacaccgat gtgaacgggg agaaggtgct tcccaaggat gacattgagt 780 gcccactggg ctggaagtgg gaagatgagg aatggtccac agacctcaac cgggctgtcg 840 atgagcaagg ctgggagtat agcatcacca tccccccgga gcggaagccg aagcactggg 900 tccctgctga gaagatgtac tacacacacc gacggcggcg ctgggtgcgc ctgcgcagga 960 gggatctcag ccaaatggaa gcactgaaaa ggcacaggca ggcggaggcg gagggcgagg 1020 gctgggagta cgcctctctt tttggctgga agttccacct cgagtaccgc aagacagatg 1080 ccttccgccg ccgccgctgg cgccgtcgca tggagccact ggagaagacg gggcctgcag 1140 ctgtgtttgc ccttgagggg gccctgggcg gcgtgatgga tgacaagagt gaagattcca 1200 tgtccgtctc caccttgagc ttcggtgtga acagacccac gatttcctgc atattcgact 1260 atgggaaccg ctaccatcta cgctgctaca tgtaccaggc ccgggacctg gctgcgatgg 1320 acaaggactc tttttctgat ccctatgcca tcgtctcctt cctgcaccag agccagaaga 1380 cggtggtggt gaagaacacc cttaacccca cctgggacca gacgctcatc ttctacgaga 1440 tcgagatctt tggcgagccg gccacagttg ctgagcaacc gcccagcatt gtggtggagc 1500 tgtacgacca tgacacttat ggtgcagacg agtttatggg tcgctgcatc tgtcaaccga 1560 gtctggaacg gatgccacgg ctggcctggt tcccactgac gaggggcagc cagccgtcgg 1620 gggagctgct ggcctctttt gagctcatcc agagagagaa gccggccatc caccatattc 1680 ctggttttga ggtgcaggag acatcaagga tcctggatga gtctgaggac acagacctgc 1740 cctacccacc accccagagg gaggccaaca tctacatggt tcctcagaac atcaagccag 1800 cgctccagcg taccgccatc gagatcctgg catggggcct gcggaacatg aagagttacc 1860 agctggccaa catctcctcc cccagcctcg tggtagagtg tgggggccag acggtgcagt 1920 cctgtgtcat caggaacctc cggaagaacc ccaactttga catctgcacc ctcttcatgg 1980 aagtgatgct gcccagggag gagctctact gcccccccat caccgtcaag gtcatcgata 2040 accgccagtt tggccgccgg cctgtggtgg gccagtgtac catccgctcc ctggagagct 2100 tcctgtgtga cccctactcg gcggagagtc catccccaca gggtggccca gacgatgtga 2160 gcctactcag tcctggggaa gacgtgctca tcgacattga tgacaaggag cccctcatcc 2220 ccatccagga ggaagagttc atcgattggt ggagcaaatt ctttgcctcc ataggggaga 2280 gggaaaagtg cggctcctac ctggagaagg attttgacac cctgaaggtc tatgacacac 2340 agctggagaa tgtggaggcc tttgagggcc tgtctgactt ttgtaacacc ttcaagctgt 2400 accggggcaa gacgcaggag gagacagaag atccatctgt gattggtgaa tttaagggcc 2460 tcttcaaaat ttatcccctc ccagaagacc cagccatccc catgccccca agacagttcc 2520 accagctggc cgcccaggga ccccaggagt gcttggtccg tatctacatt gtccgagcat 2580 ttggcctgca gcccaaggac cccaatggaa agtgtgatcc ttacatcaag atctccatag 2640 ggaagaaatc agtgagtgac caggataact acatcccctg cacgctggag cccgtatttg 2700 gaaagatgtt cgagctgacc tgcactctgc ctctggagaa ggacctaaag atcactctct 2760 atgactatga cctcctctcc aaggacgaaa agatcggtga gacggtcgtc gacctggaga 2820 acaggctgct gtccaagttt ggggctcgct gtggactccc acagacctac tgtgtctctg 2880 gaccgaacca gtggcgggac cagctccgcc cctcccagct cctccacctc ttctgccagc 2940 agcatagagt caaggcacct gtgtaccgga cagaccgtgt aatgtttcag gataaagaat 3000 attccattga agagatagag gctggcagga tcccaaaccc acacctgggc ccagtggagg 3060 agcgtctggc tctgcatgtg cttcagcagc agggcctggt cccggagcac gtggagtcac 3120 ggcccctcta cagccccctg cagccagaca tcgagcaggg gaagctgcag atgtgggtcg 3180 acctatttcc gaaggccctg gggcggcctg gacctccctt caacatcacc ccacggagag 3240 ccagaaggtt tttcctgcgt tgtattatct ggaataccag agatgtgatc ctggatgacc 3300 tgagcctcac gggggagaag atgagcgaca tttatgtgaa aggttggatg attggctttg 3360 aagaacacaa gcaaaagaca gacgtgcatt atcgttccct gggaggtgaa ggcaacttca 3420 actggaggtt cattttcccc ttcgactacc tgccagctga gcaagtctgt accattgcca 3480 agaaggatgc cttctggagg ctggacaaga ctgagagcaa aatcccagca cgagtggtgt 3540 tccagatctg ggacaatgac aagttctcct ttgatgattt tctgggctcc ctgcagctcg 3600 atctcaaccg catgcccaag ccagccaaga cagccaagaa gtgctccttg gaccagctgg 3660 atgatgcttt ccacccagaa tggtttgtgt ccctttttga gcagaaaaca gtgaagggct 3720 ggtggccctg tgtagcagaa gagggtgaga agaaaatact ggcgggcaag ctggaaatga 3780 ccttggagat tgtagcagag agtgagcatg aggagcggcc tgctggccag ggccgggatg 3840 agcccaacat gaaccctaag cttgaggacc caaggcgccc cgacacctcc ttcctgtggt 3900 ttacctcccc atacaagacc atgaagttca tcctgtggcg gcgtttccgg tgggccatca 3960 tcctcttcat catcctcttc atcctgctgc tgttcctggc catcttcatc tacgccttcc 4020 cgaactatgc tgccatgaag ctggtgaagc ccttcagctg aggactctcc tgccctgtag 4080 aaggggccgt ggggtcccct ccagcatggg actggcctgc ctcctccgcc cagctcggcg 4140 agctcctcca gacctcctag gcctgattgt cctgccaggg tgggcagaca gacagatgga 4200 ccggcccaca ctcccagagt tgctaacatg gagctctgag atcaccccac ttccatcatt 4260 tccttctccc ccaacccaac gcttttttgg atcagctcag acatatttca gtataaaaca 4320 gttggaacca caaaaaaaaa aaaaaaaagt cgacgcggcc gcaataaaag atctttattt 4380 tcattagatc tgtgtgttgg ttttttgtgt gtctagagca tggctacgta gataagtagc 4440 atggcgggtt aatcattaac tacaaggaac ccctagtgat ggagttggcc actccctctc 4500 tgcgcgctcg ctcgctcact gaggccgggc gaccaaaggt cgcccgacgc ccgggctttg 4560 cccgggcggc ctcagtgagc gagcgagcgc gc 4592 <210> 9 <211> 1113 <212> PRT <213> Homo sapiens <220> <223> 5' hDYSF protein fragment <400> 9 Met Leu Arg Val Phe Ile Leu Tyr Ala Glu Asn Val His Thr Pro Asp 1 5 10 15 Thr Asp Ile Ser Asp Ala Tyr Cys Ser Ala Val Phe Ala Gly Val Lys 20 25 30 Lys Arg Thr Lys Val Ile Lys Asn Ser Val Asn Pro Val Trp Asn Glu 35 40 45 Gly Phe Glu Trp Asp Leu Lys Gly Ile Pro Leu Asp Gln Gly Ser Glu 50 55 60 Leu His Val Val Val Lys Asp His Glu Thr Met Gly Arg Asn Arg Phe 65 70 75 80 Leu Gly Glu Ala Lys Val Pro Leu Arg Glu Val Leu Ala Thr Pro Ser 85 90 95 Leu Ser Ala Ser Phe Asn Ala Pro Leu Leu Asp Thr Lys Lys Gln Pro 100 105 110 Thr Gly Ala Ser Leu Val Leu Gln Val Ser Tyr Thr Pro Leu Pro Gly 115 120 125 Ala Val Pro Leu Phe Pro Pro Pro Thr Pro Leu Glu Pro Ser Pro Thr 130 135 140 Leu Pro Asp Leu Asp Val Val Ala Asp Thr Gly Gly Glu Glu Asp Thr 145 150 155 160 Glu Asp Gln Gly Leu Thr Gly Asp Glu Ala Glu Pro Phe Leu Asp Gln 165 170 175 Ser Gly Gly Pro Gly Ala Pro Thr Thr Pro Arg Lys Leu Pro Ser Arg 180 185 190 Pro Pro Pro His Tyr Pro Gly Ile Lys Arg Lys Arg Ser Ala Pro Thr 195 200 205 Ser Arg Lys Leu Leu Ser Asp Lys Pro Gln Asp Phe Gln Ile Arg Val 210 215 220 Gln Val Ile Glu Gly Arg Gln Leu Pro Gly Val Asn Ile Lys Pro Val 225 230 235 240 Val Lys Val Thr Ala Ala Gly Gln Thr Lys Arg Thr Arg Ile His Lys 245 250 255 Gly Asn Ser Pro Leu Phe Asn Glu Thr Leu Phe Phe Asn Leu Phe Asp 260 265 270 Ser Pro Gly Glu Leu Phe Asp Glu Pro Ile Phe Ile Thr Val Val Asp 275 280 285 Ser Arg Ser Leu Arg Thr Asp Ala Leu Leu Gly Glu Phe Arg Met Asp 290 295 300 Val Gly Thr Ile Tyr Arg Glu Pro Arg His Ala Tyr Leu Arg Lys Trp 305 310 315 320 Leu Leu Leu Ser Asp Pro Asp Asp Phe Ser Ala Gly Ala Arg Gly Tyr 325 330 335 Leu Lys Thr Ser Leu Cys Val Leu Gly Pro Gly Asp Glu Ala Pro Leu 340 345 350 Glu Arg Lys Asp Pro Ser Glu Asp Lys Glu Asp Ile Glu Ser Asn Leu 355 360 365 Leu Arg Pro Thr Gly Val Ala Leu Arg Gly Ala His Phe Cys Leu Lys 370 375 380 Val Phe Arg Ala Glu Asp Leu Pro Gln Met Asp Asp Ala Val Met Asp 385 390 395 400 Asn Val Lys Gln Ile Phe Gly Phe Glu Ser Asn Lys Lys Asn Leu Val 405 410 415 Asp Pro Phe Val Glu Val Ser Phe Ala Gly Lys Met Leu Cys Ser Lys 420 425 430 Ile Leu Glu Lys Thr Ala Asn Pro Gln Trp Asn Gln Asn Ile Thr Leu 435 440 445 Pro Ala Met Phe Pro Ser Met Cys Glu Lys Met Arg Ile Arg Ile Ile 450 455 460 Asp Trp Asp Arg Leu Thr His Asn Asp Ile Val Ala Thr Thr Tyr Leu 465 470 475 480 Ser Met Ser Lys Ile Ser Ala Pro Gly Gly Glu Ile Glu Glu Glu Pro 485 490 495 Ala Gly Ala Val Lys Pro Ser Lys Ala Ser Asp Leu Asp Asp Tyr Leu 500 505 510 Gly Phe Leu Pro Thr Phe Gly Pro Cys Tyr Ile Asn Leu Tyr Gly Ser 515 520 525 Pro Arg Glu Phe Thr Gly Phe Pro Asp Pro Tyr Thr Glu Leu Asn Thr 530 535 540 Gly Lys Gly Glu Gly Val Ala Tyr Arg Gly Arg Leu Leu Leu Ser Leu 545 550 555 560 Glu Thr Lys Leu Val Glu His Ser Glu Gln Lys Val Glu Asp Leu Pro 565 570 575 Ala Asp Asp Ile Leu Arg Val Glu Lys Tyr Leu Arg Arg Arg Lys Tyr 580 585 590 Ser Leu Phe Ala Ala Phe Tyr Ser Ala Thr Met Leu Gln Asp Val Asp 595 600 605 Asp Ala Ile Gln Phe Glu Val Ser Ile Gly Asn Tyr Gly Asn Lys Phe 610 615 620 Asp Met Thr Cys Leu Pro Leu Ala Ser Thr Thr Gln Tyr Ser Arg Ala 625 630 635 640 Val Phe Asp Gly Cys His Tyr Tyr Tyr Leu Pro Trp Gly Asn Val Lys 645 650 655 Pro Val Val Val Leu Ser Ser Tyr Trp Glu Asp Ile Ser His Arg Ile 660 665 670 Glu Thr Gln Asn Gln Leu Leu Gly Ile Ala Asp Arg Leu Glu Ala Gly 675 680 685 Leu Glu Gln Val His Leu Ala Leu Lys Ala Gln Cys Ser Thr Glu Asp 690 695 700 Val Asp Ser Leu Val Ala Gln Leu Thr Asp Glu Leu Ile Ala Gly Cys 705 710 715 720 Ser Gln Pro Leu Gly Asp Ile His Glu Thr Pro Ser Ala Thr His Leu 725 730 735 Asp Gln Tyr Leu Tyr Gln Leu Arg Thr His His Leu Ser Gln Ile Thr 740 745 750 Glu Ala Ala Leu Ala Leu Lys Leu Gly His Ser Glu Leu Pro Ala Ala 755 760 765 Leu Glu Gln Ala Glu Asp Trp Leu Leu Arg Leu Arg Ala Leu Ala Glu 770 775 780 Glu Pro Gln Asn Ser Leu Pro Asp Ile Val Ile Trp Met Leu Gln Gly 785 790 795 800 Asp Lys Arg Val Ala Tyr Gln Arg Val Pro Ala His Gln Val Leu Phe 805 810 815 Ser Arg Arg Gly Ala Asn Tyr Cys Gly Lys Asn Cys Gly Lys Leu Gln 820 825 830 Thr Ile Phe Leu Lys Tyr Pro Met Glu Lys Val Pro Gly Ala Arg Met 835 840 845 Pro Val Gln Ile Arg Val Lys Leu Trp Phe Gly Leu Ser Val Asp Glu 850 855 860 Lys Glu Phe Asn Gln Phe Ala Glu Gly Lys Leu Ser Val Phe Ala Glu 865 870 875 880 Thr Tyr Glu Asn Glu Thr Lys Leu Ala Leu Val Gly Asn Trp Gly Thr 885 890 895 Thr Gly Leu Thr Tyr Pro Lys Phe Ser Asp Val Thr Gly Lys Ile Lys 900 905 910 Leu Pro Lys Asp Ser Phe Arg Pro Ser Ala Gly Trp Thr Trp Ala Gly 915 920 925 Asp Trp Phe Val Cys Pro Glu Lys Thr Leu Leu His Asp Met Asp Ala 930 935 940 Gly His Leu Ser Phe Val Glu Glu Val Phe Glu Asn Gln Thr Arg Leu 945 950 955 960 Pro Gly Gly Gln Trp Ile Tyr Met Ser Asp Asn Tyr Thr Asp Val Asn 965 970 975 Gly Glu Lys Val Leu Pro Lys Asp Asp Ile Glu Cys Pro Leu Gly Trp 980 985 990 Lys Trp Glu Asp Glu Glu Trp Ser Thr Asp Leu Asn Arg Ala Val Asp 995 1000 1005 Glu Gln Gly Trp Glu Tyr Ser Ile Thr Ile Pro Pro Glu Arg Lys 1010 1015 1020 Pro Lys His Trp Val Pro Ala Glu Lys Met Tyr Tyr Thr His Arg 1025 1030 1035 Arg Arg Arg Trp Val Arg Leu Arg Arg Arg Asp Leu Ser Gln Met 1040 1045 1050 Glu Ala Leu Lys Arg His Arg Gln Ala Glu Ala Glu Gly Glu Gly 1055 1060 1065 Trp Glu Tyr Ala Ser Leu Phe Gly Trp Lys Phe His Leu Glu Tyr 1070 1075 1080 Arg Lys Thr Asp Ala Phe Arg Arg Arg Arg Trp Arg Arg Arg Met 1085 1090 1095 Glu Pro Leu Glu Lys Thr Gly Pro Ala Ala Val Phe Ala Leu Glu 1100 1105 1110 <210> 10 <211> 1287 <212> PRT <213> Homo sapiens <220> <223> 3' hDYSF protein fragment <400> 10 Val Ile Trp Met Leu Gln Gly Asp Lys Arg Val Ala Tyr Gln Arg Val 1 5 10 15 Pro Ala His Gln Val Leu Phe Ser Arg Arg Gly Ala Asn Tyr Cys Gly 20 25 30 Lys Asn Cys Gly Lys Leu Gln Thr Ile Phe Leu Lys Tyr Pro Met Glu 35 40 45 Lys Val Pro Gly Ala Arg Met Pro Val Gln Ile Arg Val Lys Leu Trp 50 55 60 Phe Gly Leu Ser Val Asp Glu Lys Glu Phe Asn Gln Phe Ala Glu Gly 65 70 75 80 Lys Leu Ser Val Phe Ala Glu Thr Tyr Glu Asn Glu Thr Lys Leu Ala 85 90 95 Leu Val Gly Asn Trp Gly Thr Thr Gly Leu Thr Tyr Pro Lys Phe Ser 100 105 110 Asp Val Thr Gly Lys Ile Lys Leu Pro Lys Asp Ser Phe Arg Pro Ser 115 120 125 Ala Gly Trp Thr Trp Ala Gly Asp Trp Phe Val Cys Pro Glu Lys Thr 130 135 140 Leu Leu His Asp Met Asp Ala Gly His Leu Ser Phe Val Glu Glu Val 145 150 155 160 Phe Glu Asn Gln Thr Arg Leu Pro Gly Gly Gln Trp Ile Tyr Met Ser 165 170 175 Asp Asn Tyr Thr Asp Val Asn Gly Glu Lys Val Leu Pro Lys Asp Asp 180 185 190 Ile Glu Cys Pro Leu Gly Trp Lys Trp Glu Asp Glu Glu Trp Ser Thr 195 200 205 Asp Leu Asn Arg Ala Val Asp Glu Gln Gly Trp Glu Tyr Ser Ile Thr 210 215 220 Ile Pro Pro Glu Arg Lys Pro Lys His Trp Val Pro Ala Glu Lys Met 225 230 235 240 Tyr Tyr Thr His Arg Arg Arg Arg Trp Val Arg Leu Arg Arg Arg Asp 245 250 255 Leu Ser Gln Met Glu Ala Leu Lys Arg His Arg Gln Ala Glu Ala Glu 260 265 270 Gly Glu Gly Trp Glu Tyr Ala Ser Leu Phe Gly Trp Lys Phe His Leu 275 280 285 Glu Tyr Arg Lys Thr Asp Ala Phe Arg Arg Arg Arg Trp Arg Arg Arg 290 295 300 Met Glu Pro Leu Glu Lys Thr Gly Pro Ala Ala Val Phe Ala Leu Glu 305 310 315 320 Gly Ala Leu Gly Gly Val Met Asp Asp Lys Ser Glu Asp Ser Met Ser 325 330 335 Val Ser Thr Leu Ser Phe Gly Val Asn Arg Pro Thr Ile Ser Cys Ile 340 345 350 Phe Asp Tyr Gly Asn Arg Tyr His Leu Arg Cys Tyr Met Tyr Gln Ala 355 360 365 Arg Asp Leu Ala Ala Met Asp Lys Asp Ser Phe Ser Asp Pro Tyr Ala 370 375 380 Ile Val Ser Phe Leu His Gln Ser Gln Lys Thr Val Val Val Lys Asn 385 390 395 400 Thr Leu Asn Pro Thr Trp Asp Gln Thr Leu Ile Phe Tyr Glu Ile Glu 405 410 415 Ile Phe Gly Glu Pro Ala Thr Val Ala Glu Gln Pro Pro Ser Ile Val 420 425 430 Val Glu Leu Tyr Asp His Asp Thr Tyr Gly Ala Asp Glu Phe Met Gly 435 440 445 Arg Cys Ile Cys Gln Pro Ser Leu Glu Arg Met Pro Arg Leu Ala Trp 450 455 460 Phe Pro Leu Thr Arg Gly Ser Gln Pro Ser Gly Glu Leu Leu Ala Ser 465 470 475 480 Phe Glu Leu Ile Gln Arg Glu Lys Pro Ala Ile His His Ile Pro Gly 485 490 495 Phe Glu Val Gln Glu Thr Ser Arg Ile Leu Asp Glu Ser Glu Asp Thr 500 505 510 Asp Leu Pro Tyr Pro Pro Pro Gln Arg Glu Ala Asn Ile Tyr Met Val 515 520 525 Pro Gln Asn Ile Lys Pro Ala Leu Gln Arg Thr Ala Ile Glu Ile Leu 530 535 540 Ala Trp Gly Leu Arg Asn Met Lys Ser Tyr Gln Leu Ala Asn Ile Ser 545 550 555 560 Ser Pro Ser Leu Val Val Glu Cys Gly Gly Gln Thr Val Gln Ser Cys 565 570 575 Val Ile Arg Asn Leu Arg Lys Asn Pro Asn Phe Asp Ile Cys Thr Leu 580 585 590 Phe Met Glu Val Met Leu Pro Arg Glu Glu Leu Tyr Cys Pro Pro Ile 595 600 605 Thr Val Lys Val Ile Asp Asn Arg Gln Phe Gly Arg Arg Pro Val Val 610 615 620 Gly Gln Cys Thr Ile Arg Ser Leu Glu Ser Phe Leu Cys Asp Pro Tyr 625 630 635 640 Ser Ala Glu Ser Pro Ser Pro Gln Gly Gly Pro Asp Asp Val Ser Leu 645 650 655 Leu Ser Pro Gly Glu Asp Val Leu Ile Asp Ile Asp Asp Lys Glu Pro 660 665 670 Leu Ile Pro Ile Gln Glu Glu Glu Phe Ile Asp Trp Trp Ser Lys Phe 675 680 685 Phe Ala Ser Ile Gly Glu Arg Glu Lys Cys Gly Ser Tyr Leu Glu Lys 690 695 700 Asp Phe Asp Thr Leu Lys Val Tyr Asp Thr Gln Leu Glu Asn Val Glu 705 710 715 720 Ala Phe Glu Gly Leu Ser Asp Phe Cys Asn Thr Phe Lys Leu Tyr Arg 725 730 735 Gly Lys Thr Gln Glu Glu Thr Glu Asp Pro Ser Val Ile Gly Glu Phe 740 745 750 Lys Gly Leu Phe Lys Ile Tyr Pro Leu Pro Glu Asp Pro Ala Ile Pro 755 760 765 Met Pro Pro Arg Gln Phe His Gln Leu Ala Ala Gln Gly Pro Gln Glu 770 775 780 Cys Leu Val Arg Ile Tyr Ile Val Arg Ala Phe Gly Leu Gln Pro Lys 785 790 795 800 Asp Pro Asn Gly Lys Cys Asp Pro Tyr Ile Lys Ile Ser Ile Gly Lys 805 810 815 Lys Ser Val Ser Asp Gln Asp Asn Tyr Ile Pro Cys Thr Leu Glu Pro 820 825 830 Val Phe Gly Lys Met Phe Glu Leu Thr Cys Thr Leu Pro Leu Glu Lys 835 840 845 Asp Leu Lys Ile Thr Leu Tyr Asp Tyr Asp Leu Leu Ser Lys Asp Glu 850 855 860 Lys Ile Gly Glu Thr Val Val Asp Leu Glu Asn Arg Leu Leu Ser Lys 865 870 875 880 Phe Gly Ala Arg Cys Gly Leu Pro Gln Thr Tyr Cys Val Ser Gly Pro 885 890 895 Asn Gln Trp Arg Asp Gln Leu Arg Pro Ser Gln Leu Leu His Leu Phe 900 905 910 Cys Gln Gln His Arg Val Lys Ala Pro Val Tyr Arg Thr Asp Arg Val 915 920 925 Met Phe Gln Asp Lys Glu Tyr Ser Ile Glu Glu Ile Glu Ala Gly Arg 930 935 940 Ile Pro Asn Pro His Leu Gly Pro Val Glu Glu Arg Leu Ala Leu His 945 950 955 960 Val Leu Gln Gln Gln Gly Leu Val Pro Glu His Val Glu Ser Arg Pro 965 970 975 Leu Tyr Ser Pro Leu Gln Pro Asp Ile Glu Gln Gly Lys Leu Gln Met 980 985 990 Trp Val Asp Leu Phe Pro Lys Ala Leu Gly Arg Pro Gly Pro Pro Phe 995 1000 1005 Asn Ile Thr Pro Arg Arg Ala Arg Arg Phe Phe Leu Arg Cys Ile 1010 1015 1020 Ile Trp Asn Thr Arg Asp Val Ile Leu Asp Asp Leu Ser Leu Thr 1025 1030 1035 Gly Glu Lys Met Ser Asp Ile Tyr Val Lys Gly Trp Met Ile Gly 1040 1045 1050 Phe Glu Glu His Lys Gln Lys Thr Asp Val His Tyr Arg Ser Leu 1055 1060 1065 Gly Gly Glu Gly Asn Phe Asn Trp Arg Phe Ile Phe Pro Phe Asp 1070 1075 1080 Tyr Leu Pro Ala Glu Gln Val Cys Thr Ile Ala Lys Lys Asp Ala 1085 1090 1095 Phe Trp Arg Leu Asp Lys Thr Glu Ser Lys Ile Pro Ala Arg Val 1100 1105 1110 Val Phe Gln Ile Trp Asp Asn Asp Lys Phe Ser Phe Asp Asp Phe 1115 1120 1125 Leu Gly Ser Leu Gln Leu Asp Leu Asn Arg Met Pro Lys Pro Ala 1130 1135 1140 Lys Thr Ala Lys Lys Cys Ser Leu Asp Gln Leu Asp Asp Ala Phe 1145 1150 1155 His Pro Glu Trp Phe Val Ser Leu Phe Glu Gln Lys Thr Val Lys 1160 1165 1170 Gly Trp Trp Pro Cys Val Ala Glu Glu Gly Glu Lys Lys Ile Leu 1175 1180 1185 Ala Gly Lys Leu Glu Met Thr Leu Glu Ile Val Ala Glu Ser Glu 1190 1195 1200 His Glu Glu Arg Pro Ala Gly Gln Gly Arg Asp Glu Pro Asn Met 1205 1210 1215 Asn Pro Lys Leu Glu Asp Pro Arg Arg Pro Asp Thr Ser Phe Leu 1220 1225 1230 Trp Phe Thr Ser Pro Tyr Lys Thr Met Lys Phe Ile Leu Trp Arg 1235 1240 1245 Arg Phe Arg Trp Ala Ile Ile Leu Phe Ile Ile Leu Phe Ile Leu 1250 1255 1260 Leu Leu Phe Leu Ala Ile Phe Ile Tyr Ala Phe Pro Asn Tyr Ala 1265 1270 1275 Ala Met Lys Leu Val Lys Pro Phe Ser 1280 1285 <210> 11 <211> 6914 <212> DNA <213> Homo sapiens <220> <223> hDYSF DNA full length <400> 11 gcggccgccg cccagccagg tgcaaaatgc cgtgtcattg ggagactccg cagccggagc 60 attagattac agctcgacgg agctcgggaa gggcggcggg ggtggaagat gagcagaagc 120 ccctgttctc ggaacgccgg ctgacaagcg gggtgagcgc agccggggcg gggacccagc 180 ctagcccact ggagcagccg ggggtggccc gttccccttt aagagcaact gctctaagcc 240 aggagccaga gattcgagcc ggcctcgccc agccagccct ctccagcgag gggacccaca 300 agcggcgcct cggccctccc gacctttccg agccctcttt gcgccctggg cgcacggggc 360 cctacacgcg ccaagcatgc tgagggtctt catcctctat gccgagaacg tccacacacc 420 cgacaccgac atcagcgatg cctactgctc cgcggtgttt gcaggggtga agaagagaac 480 caaagtcatc aagaacagcg tgaaccctgt atggaatgag ggatttgaat gggacctcaa 540 gggcatcccc ctggaccagg gctctgagct tcatgtggtg gtcaaagacc atgagacgat 600 ggggaggaac aggttcctgg gggaagccaa ggtcccactc cgagaggtcc tcgccacccc 660 tagtctgtcc gccagcttca atgcccccct gctggacacc aagaagcagc ccacaggggc 720 ctcgctggtc ctgcaggtgt cctacacacc gctgcctgga gctgtgcccc tgttcccgcc 780 ccctactcct ctggagccct ccccgactct gcctgacctg gatgtagtgg cagacacagg 840 aggagaggaa gacacagagg accagggact cactggagat gaggcggagc cattcctgga 900 tcaaagcgga ggcccggggg ctcccaccac cccaaggaaa ctaccttcac gtcctccgcc 960 ccactacccc gggatcaaaa gaaagcgaag tgcgcctaca tctagaaagc tgctgtcaga 1020 caaaccgcag gatttccaga tcagggtcca ggtgatcgag gggcgccagc tgccgggggt 1080 gaacatcaag cctgtggtca aggttaccgc tgcagggcag accaagcgga cgcggatcca 1140 caagggaaac agcccactct tcaatgagac tcttttcttc aacttgtttg actctcctgg 1200 ggagctgttt gatgagccca tctttatcac ggtggtagac tctcgttctc tcaggacaga 1260 tgctctcctc ggggagttcc ggatggacgt gggcaccatt tacagagagc cccggcacgc 1320 ctatctcagg aagtggctgc tgctctcaga ccctgatgac ttctctgctg gggccagagg 1380 ctacctgaaa acaagccttt gtgtgctggg gcctggggac gaagcgcctc tggagagaaa 1440 agacccctct gaagacaagg aggacattga aagcaacctg ctccggccca caggcgtagc 1500 cctgcgagga gcccacttct gcctgaaggt cttccgggcc gaggacttgc cgcagatgga 1560 cgatgccgtg atggacaacg tgaaacagat ctttggcttc gagagtaaca agaagaactt 1620 ggtggacccc tttgtggagg tcagctttgc ggggaaaatg ctgtgcagca agatcttgga 1680 gaagacggcc aaccctcagt ggaaccagaa catcacactg cctgccatgt ttccctccat 1740 gtgcgaaaaa atgaggattc gtatcataga ctgggaccgc ctgactcaca atgacatcgt 1800 ggctaccacc tacctgagta tgtcgaaaat ctctgcccct ggaggagaaa tagaagagga 1860 gcctgcaggt gctgtcaagc cttcgaaagc ctcagacttg gatgactacc tgggcttcct 1920 ccccactttt gggccctgct acatcaacct ctatggcagt cccagagagt tcacaggctt 1980 cccagacccc tacacagagc tcaacacagg caagggggaa ggtgtggctt atcgtggccg 2040 gcttctgctc tccctggaga ccaagctggt ggagcacagt gaacagaagg tggaggacct 2100 tcctgcggat gacatcctcc gggtggagaa gtaccttagg aggcgcaagt actccctgtt 2160 tgcggccttc tactcagcca ccatgctgca ggatgtggat gatgccatcc agtttgaggt 2220 cagcatcggg aactacggga acaagttcga catgacctgc ctgccgctgg cctccaccac 2280 tcagtacagc cgtgcagtct ttgacgggtg ccactactac tacctaccct ggggtaacgt 2340 gaaacctgtg gtggtgctgt catcctactg ggaggacatc agccatagaa tcgagactca 2400 gaaccagctg cttgggattg ctgaccggct ggaagctggc ctggagcagg tccacctggc 2460 cctgaaggcg cagtgctcca cggaggacgt ggactcgctg gtggctcagc tgacggatga 2520 gctcatcgca ggctgcagcc agcctctggg tgacatccat gagacaccct ctgccaccca 2580 cctggaccag tacctgtacc agctgcgcac ccatcacctg agccaaatca ctgaggctgc 2640 cctggccctg aagctcggcc acagtgagct ccctgcagct ctggagcagg cggaggactg 2700 gctcctgcgt ctgcgtgccc tggcagagga gccccagaac agcctgccgg acatcgtcat 2760 ctggatgctg cagggagaca agcgtgtggc ataccagcgg gtgcccgccc accaagtcct 2820 cttctcccgg cggggtgcca actactgtgg caagaattgt gggaagctac agacaatctt 2880 tctgaaatat ccgatggaga aggtgcctgg cgcccggatg ccagtgcaga tacgggtcaa 2940 gctgtggttt gggctctcag tggatgagaa ggagttcaac cagtttgctg aggggaagct 3000 gtctgtcttt gctgaaacct atgagaacga gactaagttg gcccttgttg ggaactgggg 3060 cacaacgggc ctcacctacc ccaagttttc tgacgtcacg ggcaagatca agctacccaa 3120 ggacagcttc cgcccctcgg ccggctggac ctgggctgga gattggttcg tgtgtccgga 3180 gaagactctg ctccatgaca tggacgccgg tcacctgagc ttcgtggaag aggtgtttga 3240 gaaccagacc cggcttcccg gaggccagtg gatctacatg agtgacaact acaccgatgt 3300 gaacggggag aaggtgcttc ccaaggatga cattgagtgc ccactgggct ggaagtggga 3360 agatgaggaa tggtccacag acctcaaccg ggctgtcgat gagcaaggct gggagtatag 3420 catcaccatc cccccggagc ggaagccgaa gcactgggtc cctgctgaga agatgtacta 3480 cacacaccga cggcggcgct gggtgcgcct gcgcaggagg gatctcagcc aaatggaagc 3540 actgaaaagg cacaggcagg cggaggcgga gggcgagggc tgggagtacg cctctctttt 3600 tggctggaag ttccacctcg agtaccgcaa gacagatgcc ttccgccgcc gccgctggcg 3660 ccgtcgcatg gagccactgg agaagacggg gcctgcagct gtgtttgccc ttgagggggc 3720 cctgggcggc gtgatggatg acaagagtga agattccatg tccgtctcca ccttgagctt 3780 cggtgtgaac agacccacga tttcctgcat attcgactat gggaaccgct accatctacg 3840 ctgctacatg taccaggccc gggacctggc tgcgatggac aaggactctt tttctgatcc 3900 ctatgccatc gtctccttcc tgcaccagag ccagaagacg gtggtggtga agaacaccct 3960 taaccccacc tgggaccaga cgctcatctt ctacgagatc gagatctttg gcgagccggc 4020 cacagttgct gagcaaccgc ccagcattgt ggtggagctg tacgaccatg acacttatgg 4080 tgcagacgag tttatgggtc gctgcatctg tcaaccgagt ctggaacgga tgccacggct 4140 ggcctggttc ccactgacga ggggcagcca gccgtcgggg gagctgctgg cctcttttga 4200 gctcatccag agagagaagc cggccatcca ccatattcct ggttttgagg tgcaggagac 4260 atcaaggatc ctggatgagt ctgaggacac agacctgccc tacccaccac cccagaggga 4320 ggccaacatc tacatggttc ctcagaacat caagccagcg ctccagcgta ccgccatcga 4380 gatcctggca tggggcctgc ggaacatgaa gagttaccag ctggccaaca tctcctcccc 4440 cagcctcgtg gtagagtgtg ggggccagac ggtgcagtcc tgtgtcatca ggaacctccg 4500 gaagaacccc aactttgaca tctgcaccct cttcatggaa gtgatgctgc ccagggagga 4560 gctctactgc ccccccatca ccgtcaaggt catcgataac cgccagtttg gccgccggcc 4620 tgtggtgggc cagtgtacca tccgctccct ggagagcttc ctgtgtgacc cctactcggc 4680 ggagagtcca tccccacagg gtggcccaga cgatgtgagc ctactcagtc ctggggaaga 4740 cgtgctcatc gacattgatg acaaggagcc cctcatcccc atccaggagg aagagttcat 4800 cgattggtgg agcaaattct ttgcctccat aggggagagg gaaaagtgcg gctcctacct 4860 ggagaaggat tttgacaccc tgaaggtcta tgacacacag ctggagaatg tggaggcctt 4920 tgagggcctg tctgactttt gtaacacctt caagctgtac cggggcaaga cgcaggagga 4980 gacagaagat ccatctgtga ttggtgaatt taagggcctc ttcaaaattt atcccctccc 5040 agaagaccca gccatcccca tgcccccaag acagttccac cagctggccg cccagggacc 5100 ccaggagtgc ttggtccgta tctacattgt ccgagcattt ggcctgcagc ccaaggaccc 5160 caatggaaag tgtgatcctt acatcaagat ctccataggg aagaaatcag tgagtgacca 5220 ggataactac atcccctgca cgctggagcc cgtatttgga aagatgttcg agctgacctg 5280 cactctgcct ctggagaagg acctaaagat cactctctat gactatgacc tcctctccaa 5340 ggacgaaaag atcggtgaga cggtcgtcga cctggagaac aggctgctgt ccaagtttgg 5400 ggctcgctgt ggactcccac agacctactg tgtctctgga ccgaaccagt ggcgggacca 5460 gctccgcccc tcccagctcc tccacctctt ctgccagcag catagagtca aggcacctgt 5520 gtaccggaca gaccgtgtaa tgtttcagga taaagaatat tccattgaag agatagaggc 5580 tggcaggatc ccaaacccac acctgggccc agtggaggag cgtctggctc tgcatgtgct 5640 tcagcagcag ggcctggtcc cggagcacgt ggagtcacgg cccctctaca gccccctgca 5700 gccagacatc gagcagggga agctgcagat gtgggtcgac ctatttccga aggccctggg 5760 gcggcctgga cctcccttca acatcacccc acggagagcc agaaggtttt tcctgcgttg 5820 tattatctgg aataccagag atgtgatcct ggatgacctg agcctcacgg gggagaagat 5880 gagcgacatt tatgtgaaag gttggatgat tggctttgaa gaacacaagc aaaagacaga 5940 cgtgcattat cgttccctgg gaggtgaagg caacttcaac tggaggttca ttttcccctt 6000 cgactacctg ccagctgagc aagtctgtac cattgccaag aaggatgcct tctggaggct 6060 ggacaagact gagagcaaaa tcccagcacg agtggtgttc cagatctggg acaatgacaa 6120 gttctccttt gatgattttc tgggctccct gcagctcgat ctcaaccgca tgcccaagcc 6180 agccaagaca gccaagaagt gctccttgga ccagctggat gatgctttcc acccagaatg 6240 gtttgtgtcc ctttttgagc agaaaacagt gaagggctgg tggccctgtg tagcagaaga 6300 gggtgagaag aaaatactgg cgggcaagct ggaaatgacc ttggagattg tagcagagag 6360 tgagcatgag gagcggcctg ctggccaggg ccgggatgag cccaacatga accctaagct 6420 tgaggaccca aggcgccccg acacctcctt cctgtggttt acctccccat acaagaccat 6480 gaagttcatc ctgtggcggc gtttccggtg ggccatcatc ctcttcatca tcctcttcat 6540 cctgctgctg ttcctggcca tcttcatcta cgccttcccg aactatgctg ccatgaagct 6600 ggtgaagccc ttcagctgag gactctcctg ccctgtagaa ggggccgtgg ggtcccctcc 6660 agcatgggac tggcctgcct cctccgccca gctcggcgag ctcctccaga cctcctaggc 6720 ctgattgtcc tgccagggtg ggcagacaga cagatggacc ggcccacact cccagagttg 6780 ctaacatgga gctctgagat caccccactt ccatcatttc cttctccccc aacccaacgc 6840 ttttttggat cagctcagac atatttcagt ataaaacagt tggaaccaca aaaaaaaaaa 6900 aaaaaaaaaa aaaa 6914 <210> 12 <211> 2080 <212> PRT <213> Homo sapiens <220> <223> hDYSF protein full length <400> 12 Met Leu Arg Val Phe Ile Leu Tyr Ala Glu Asn Val His Thr Pro Asp 1 5 10 15 Thr Asp Ile Ser Asp Ala Tyr Cys Ser Ala Val Phe Ala Gly Val Lys 20 25 30 Lys Arg Thr Lys Val Ile Lys Asn Ser Val Asn Pro Val Trp Asn Glu 35 40 45 Gly Phe Glu Trp Asp Leu Lys Gly Ile Pro Leu Asp Gln Gly Ser Glu 50 55 60 Leu His Val Val Val Lys Asp His Glu Thr Met Gly Arg Asn Arg Phe 65 70 75 80 Leu Gly Glu Ala Lys Val Pro Leu Arg Glu Val Leu Ala Thr Pro Ser 85 90 95 Leu Ser Ala Ser Phe Asn Ala Pro Leu Leu Asp Thr Lys Lys Gln Pro 100 105 110 Thr Gly Ala Ser Leu Val Leu Gln Val Ser Tyr Thr Pro Leu Pro Gly 115 120 125 Ala Val Pro Leu Phe Pro Pro Pro Thr Pro Leu Glu Pro Ser Pro Thr 130 135 140 Leu Pro Asp Leu Asp Val Val Ala Asp Thr Gly Gly Glu Glu Asp Thr 145 150 155 160 Glu Asp Gln Gly Leu Thr Gly Asp Glu Ala Glu Pro Phe Leu Asp Gln 165 170 175 Ser Gly Gly Pro Gly Ala Pro Thr Thr Pro Arg Lys Leu Pro Ser Arg 180 185 190 Pro Pro Pro His Tyr Pro Gly Ile Lys Arg Lys Arg Ser Ala Pro Thr 195 200 205 Ser Arg Lys Leu Leu Ser Asp Lys Pro Gln Asp Phe Gln Ile Arg Val 210 215 220 Gln Val Ile Glu Gly Arg Gln Leu Pro Gly Val Asn Ile Lys Pro Val 225 230 235 240 Val Lys Val Thr Ala Ala Gly Gln Thr Lys Arg Thr Arg Ile His Lys 245 250 255 Gly Asn Ser Pro Leu Phe Asn Glu Thr Leu Phe Phe Asn Leu Phe Asp 260 265 270 Ser Pro Gly Glu Leu Phe Asp Glu Pro Ile Phe Ile Thr Val Val Asp 275 280 285 Ser Arg Ser Leu Arg Thr Asp Ala Leu Leu Gly Glu Phe Arg Met Asp 290 295 300 Val Gly Thr Ile Tyr Arg Glu Pro Arg His Ala Tyr Leu Arg Lys Trp 305 310 315 320 Leu Leu Leu Ser Asp Pro Asp Asp Phe Ser Ala Gly Ala Arg Gly Tyr 325 330 335 Leu Lys Thr Ser Leu Cys Val Leu Gly Pro Gly Asp Glu Ala Pro Leu 340 345 350 Glu Arg Lys Asp Pro Ser Glu Asp Lys Glu Asp Ile Glu Ser Asn Leu 355 360 365 Leu Arg Pro Thr Gly Val Ala Leu Arg Gly Ala His Phe Cys Leu Lys 370 375 380 Val Phe Arg Ala Glu Asp Leu Pro Gln Met Asp Asp Ala Val Met Asp 385 390 395 400 Asn Val Lys Gln Ile Phe Gly Phe Glu Ser Asn Lys Lys Asn Leu Val 405 410 415 Asp Pro Phe Val Glu Val Ser Phe Ala Gly Lys Met Leu Cys Ser Lys 420 425 430 Ile Leu Glu Lys Thr Ala Asn Pro Gln Trp Asn Gln Asn Ile Thr Leu 435 440 445 Pro Ala Met Phe Pro Ser Met Cys Glu Lys Met Arg Ile Arg Ile Ile 450 455 460 Asp Trp Asp Arg Leu Thr His Asn Asp Ile Val Ala Thr Thr Tyr Leu 465 470 475 480 Ser Met Ser Lys Ile Ser Ala Pro Gly Gly Glu Ile Glu Glu Glu Pro 485 490 495 Ala Gly Ala Val Lys Pro Ser Lys Ala Ser Asp Leu Asp Asp Tyr Leu 500 505 510 Gly Phe Leu Pro Thr Phe Gly Pro Cys Tyr Ile Asn Leu Tyr Gly Ser 515 520 525 Pro Arg Glu Phe Thr Gly Phe Pro Asp Pro Tyr Thr Glu Leu Asn Thr 530 535 540 Gly Lys Gly Glu Gly Val Ala Tyr Arg Gly Arg Leu Leu Leu Ser Leu 545 550 555 560 Glu Thr Lys Leu Val Glu His Ser Glu Gln Lys Val Glu Asp Leu Pro 565 570 575 Ala Asp Asp Ile Leu Arg Val Glu Lys Tyr Leu Arg Arg Arg Lys Tyr 580 585 590 Ser Leu Phe Ala Ala Phe Tyr Ser Ala Thr Met Leu Gln Asp Val Asp 595 600 605 Asp Ala Ile Gln Phe Glu Val Ser Ile Gly Asn Tyr Gly Asn Lys Phe 610 615 620 Asp Met Thr Cys Leu Pro Leu Ala Ser Thr Thr Gln Tyr Ser Arg Ala 625 630 635 640 Val Phe Asp Gly Cys His Tyr Tyr Tyr Leu Pro Trp Gly Asn Val Lys 645 650 655 Pro Val Val Val Leu Ser Ser Tyr Trp Glu Asp Ile Ser His Arg Ile 660 665 670 Glu Thr Gln Asn Gln Leu Leu Gly Ile Ala Asp Arg Leu Glu Ala Gly 675 680 685 Leu Glu Gln Val His Leu Ala Leu Lys Ala Gln Cys Ser Thr Glu Asp 690 695 700 Val Asp Ser Leu Val Ala Gln Leu Thr Asp Glu Leu Ile Ala Gly Cys 705 710 715 720 Ser Gln Pro Leu Gly Asp Ile His Glu Thr Pro Ser Ala Thr His Leu 725 730 735 Asp Gln Tyr Leu Tyr Gln Leu Arg Thr His His Leu Ser Gln Ile Thr 740 745 750 Glu Ala Ala Leu Ala Leu Lys Leu Gly His Ser Glu Leu Pro Ala Ala 755 760 765 Leu Glu Gln Ala Glu Asp Trp Leu Leu Arg Leu Arg Ala Leu Ala Glu 770 775 780 Glu Pro Gln Asn Ser Leu Pro Asp Ile Val Ile Trp Met Leu Gln Gly 785 790 795 800 Asp Lys Arg Val Ala Tyr Gln Arg Val Pro Ala His Gln Val Leu Phe 805 810 815 Ser Arg Arg Gly Ala Asn Tyr Cys Gly Lys Asn Cys Gly Lys Leu Gln 820 825 830 Thr Ile Phe Leu Lys Tyr Pro Met Glu Lys Val Pro Gly Ala Arg Met 835 840 845 Pro Val Gln Ile Arg Val Lys Leu Trp Phe Gly Leu Ser Val Asp Glu 850 855 860 Lys Glu Phe Asn Gln Phe Ala Glu Gly Lys Leu Ser Val Phe Ala Glu 865 870 875 880 Thr Tyr Glu Asn Glu Thr Lys Leu Ala Leu Val Gly Asn Trp Gly Thr 885 890 895 Thr Gly Leu Thr Tyr Pro Lys Phe Ser Asp Val Thr Gly Lys Ile Lys 900 905 910 Leu Pro Lys Asp Ser Phe Arg Pro Ser Ala Gly Trp Thr Trp Ala Gly 915 920 925 Asp Trp Phe Val Cys Pro Glu Lys Thr Leu Leu His Asp Met Asp Ala 930 935 940 Gly His Leu Ser Phe Val Glu Glu Val Phe Glu Asn Gln Thr Arg Leu 945 950 955 960 Pro Gly Gly Gln Trp Ile Tyr Met Ser Asp Asn Tyr Thr Asp Val Asn 965 970 975 Gly Glu Lys Val Leu Pro Lys Asp Asp Ile Glu Cys Pro Leu Gly Trp 980 985 990 Lys Trp Glu Asp Glu Glu Trp Ser Thr Asp Leu Asn Arg Ala Val Asp 995 1000 1005 Glu Gln Gly Trp Glu Tyr Ser Ile Thr Ile Pro Pro Glu Arg Lys 1010 1015 1020 Pro Lys His Trp Val Pro Ala Glu Lys Met Tyr Tyr Thr His Arg 1025 1030 1035 Arg Arg Arg Trp Val Arg Leu Arg Arg Arg Asp Leu Ser Gln Met 1040 1045 1050 Glu Ala Leu Lys Arg His Arg Gln Ala Glu Ala Glu Gly Glu Gly 1055 1060 1065 Trp Glu Tyr Ala Ser Leu Phe Gly Trp Lys Phe His Leu Glu Tyr 1070 1075 1080 Arg Lys Thr Asp Ala Phe Arg Arg Arg Arg Trp Arg Arg Arg Met 1085 1090 1095 Glu Pro Leu Glu Lys Thr Gly Pro Ala Ala Val Phe Ala Leu Glu 1100 1105 1110 Gly Ala Leu Gly Gly Val Met Asp Asp Lys Ser Glu Asp Ser Met 1115 1120 1125 Ser Val Ser Thr Leu Ser Phe Gly Val Asn Arg Pro Thr Ile Ser 1130 1135 1140 Cys Ile Phe Asp Tyr Gly Asn Arg Tyr His Leu Arg Cys Tyr Met 1145 1150 1155 Tyr Gln Ala Arg Asp Leu Ala Ala Met Asp Lys Asp Ser Phe Ser 1160 1165 1170 Asp Pro Tyr Ala Ile Val Ser Phe Leu His Gln Ser Gln Lys Thr 1175 1180 1185 Val Val Val Lys Asn Thr Leu Asn Pro Thr Trp Asp Gln Thr Leu 1190 1195 1200 Ile Phe Tyr Glu Ile Glu Ile Phe Gly Glu Pro Ala Thr Val Ala 1205 1210 1215 Glu Gln Pro Pro Ser Ile Val Val Glu Leu Tyr Asp His Asp Thr 1220 1225 1230 Tyr Gly Ala Asp Glu Phe Met Gly Arg Cys Ile Cys Gln Pro Ser 1235 1240 1245 Leu Glu Arg Met Pro Arg Leu Ala Trp Phe Pro Leu Thr Arg Gly 1250 1255 1260 Ser Gln Pro Ser Gly Glu Leu Leu Ala Ser Phe Glu Leu Ile Gln 1265 1270 1275 Arg Glu Lys Pro Ala Ile His His Ile Pro Gly Phe Glu Val Gln 1280 1285 1290 Glu Thr Ser Arg Ile Leu Asp Glu Ser Glu Asp Thr Asp Leu Pro 1295 1300 1305 Tyr Pro Pro Pro Gln Arg Glu Ala Asn Ile Tyr Met Val Pro Gln 1310 1315 1320 Asn Ile Lys Pro Ala Leu Gln Arg Thr Ala Ile Glu Ile Leu Ala 1325 1330 1335 Trp Gly Leu Arg Asn Met Lys Ser Tyr Gln Leu Ala Asn Ile Ser 1340 1345 1350 Ser Pro Ser Leu Val Val Glu Cys Gly Gly Gln Thr Val Gln Ser 1355 1360 1365 Cys Val Ile Arg Asn Leu Arg Lys Asn Pro Asn Phe Asp Ile Cys 1370 1375 1380 Thr Leu Phe Met Glu Val Met Leu Pro Arg Glu Glu Leu Tyr Cys 1385 1390 1395 Pro Pro Ile Thr Val Lys Val Ile Asp Asn Arg Gln Phe Gly Arg 1400 1405 1410 Arg Pro Val Val Gly Gln Cys Thr Ile Arg Ser Leu Glu Ser Phe 1415 1420 1425 Leu Cys Asp Pro Tyr Ser Ala Glu Ser Pro Ser Pro Gln Gly Gly 1430 1435 1440 Pro Asp Asp Val Ser Leu Leu Ser Pro Gly Glu Asp Val Leu Ile 1445 1450 1455 Asp Ile Asp Asp Lys Glu Pro Leu Ile Pro Ile Gln Glu Glu Glu 1460 1465 1470 Phe Ile Asp Trp Trp Ser Lys Phe Phe Ala Ser Ile Gly Glu Arg 1475 1480 1485 Glu Lys Cys Gly Ser Tyr Leu Glu Lys Asp Phe Asp Thr Leu Lys 1490 1495 1500 Val Tyr Asp Thr Gln Leu Glu Asn Val Glu Ala Phe Glu Gly Leu 1505 1510 1515 Ser Asp Phe Cys Asn Thr Phe Lys Leu Tyr Arg Gly Lys Thr Gln 1520 1525 1530 Glu Glu Thr Glu Asp Pro Ser Val Ile Gly Glu Phe Lys Gly Leu 1535 1540 1545 Phe Lys Ile Tyr Pro Leu Pro Glu Asp Pro Ala Ile Pro Met Pro 1550 1555 1560 Pro Arg Gln Phe His Gln Leu Ala Ala Gln Gly Pro Gln Glu Cys 1565 1570 1575 Leu Val Arg Ile Tyr Ile Val Arg Ala Phe Gly Leu Gln Pro Lys 1580 1585 1590 Asp Pro Asn Gly Lys Cys Asp Pro Tyr Ile Lys Ile Ser Ile Gly 1595 1600 1605 Lys Lys Ser Val Ser Asp Gln Asp Asn Tyr Ile Pro Cys Thr Leu 1610 1615 1620 Glu Pro Val Phe Gly Lys Met Phe Glu Leu Thr Cys Thr Leu Pro 1625 1630 1635 Leu Glu Lys Asp Leu Lys Ile Thr Leu Tyr Asp Tyr Asp Leu Leu 1640 1645 1650 Ser Lys Asp Glu Lys Ile Gly Glu Thr Val Val Asp Leu Glu Asn 1655 1660 1665 Arg Leu Leu Ser Lys Phe Gly Ala Arg Cys Gly Leu Pro Gln Thr 1670 1675 1680 Tyr Cys Val Ser Gly Pro Asn Gln Trp Arg Asp Gln Leu Arg Pro 1685 1690 1695 Ser Gln Leu Leu His Leu Phe Cys Gln Gln His Arg Val Lys Ala 1700 1705 1710 Pro Val Tyr Arg Thr Asp Arg Val Met Phe Gln Asp Lys Glu Tyr 1715 1720 1725 Ser Ile Glu Glu Ile Glu Ala Gly Arg Ile Pro Asn Pro His Leu 1730 1735 1740 Gly Pro Val Glu Glu Arg Leu Ala Leu His Val Leu Gln Gln Gln 1745 1750 1755 Gly Leu Val Pro Glu His Val Glu Ser Arg Pro Leu Tyr Ser Pro 1760 1765 1770 Leu Gln Pro Asp Ile Glu Gln Gly Lys Leu Gln Met Trp Val Asp 1775 1780 1785 Leu Phe Pro Lys Ala Leu Gly Arg Pro Gly Pro Pro Phe Asn Ile 1790 1795 1800 Thr Pro Arg Arg Ala Arg Arg Phe Phe Leu Arg Cys Ile Ile Trp 1805 1810 1815 Asn Thr Arg Asp Val Ile Leu Asp Asp Leu Ser Leu Thr Gly Glu 1820 1825 1830 Lys Met Ser Asp Ile Tyr Val Lys Gly Trp Met Ile Gly Phe Glu 1835 1840 1845 Glu His Lys Gln Lys Thr Asp Val His Tyr Arg Ser Leu Gly Gly 1850 1855 1860 Glu Gly Asn Phe Asn Trp Arg Phe Ile Phe Pro Phe Asp Tyr Leu 1865 1870 1875 Pro Ala Glu Gln Val Cys Thr Ile Ala Lys Lys Asp Ala Phe Trp 1880 1885 1890 Arg Leu Asp Lys Thr Glu Ser Lys Ile Pro Ala Arg Val Val Phe 1895 1900 1905 Gln Ile Trp Asp Asn Asp Lys Phe Ser Phe Asp Asp Phe Leu Gly 1910 1915 1920 Ser Leu Gln Leu Asp Leu Asn Arg Met Pro Lys Pro Ala Lys Thr 1925 1930 1935 Ala Lys Lys Cys Ser Leu Asp Gln Leu Asp Asp Ala Phe His Pro 1940 1945 1950 Glu Trp Phe Val Ser Leu Phe Glu Gln Lys Thr Val Lys Gly Trp 1955 1960 1965 Trp Pro Cys Val Ala Glu Glu Gly Glu Lys Lys Ile Leu Ala Gly 1970 1975 1980 Lys Leu Glu Met Thr Leu Glu Ile Val Ala Glu Ser Glu His Glu 1985 1990 1995 Glu Arg Pro Ala Gly Gln Gly Arg Asp Glu Pro Asn Met Asn Pro 2000 2005 2010 Lys Leu Glu Asp Pro Arg Arg Pro Asp Thr Ser Phe Leu Trp Phe 2015 2020 2025 Thr Ser Pro Tyr Lys Thr Met Lys Phe Ile Leu Trp Arg Arg Phe 2030 2035 2040 Arg Trp Ala Ile Ile Leu Phe Ile Ile Leu Phe Ile Leu Leu Leu 2045 2050 2055 Phe Leu Ala Ile Phe Ile Tyr Ala Phe Pro Asn Tyr Ala Ala Met 2060 2065 2070 Lys Leu Val Lys Pro Phe Ser 2075 2080 <210> 13 <211> 3340 <212> DNA <213> Homo sapiens <220> <223> 5' hDYSF DNA fragment v2 <400> 13 atgctgaggg tcttcatcct ctatgccgag aacgtccaca cacccgacac cgacatcagc 60 gatgcctact gctccgcggt gtttgcaggg gtgaagaaga gaaccaaagt catcaagaac 120 agcgtgaacc ctgtatggaa tgagggattt gaatgggacc tcaagggcat ccccctggac 180 cagggctctg agcttcatgt ggtggtcaaa gaccatgaga cgatggggag gaacaggttc 240 ctgggggaag ccaaggtccc actccgagag gtcctcgcca cccctagtct gtccgccagc 300 ttcaatgccc ccctgctgga caccaagaag cagcccacag gggcctcgct ggtcctgcag 360 gtgtcctaca caccgctgcc tggagctgtg cccctgttcc cgccccctac tcctctggag 420 ccctccccga ctctgcctga cctggatgta gtggcagaca caggaggaga ggaagacaca 480 gaggaccagg gactcactgg agatgaggcg gagccattcc tggatcaaag cggaggcccg 540 ggggctccca ccaccccaag gaaactacct tcacgtcctc cgccccacta ccccgggatc 600 aaaagaaagc gaagtgcgcc tacatctaga aagctgctgt cagacaaacc gcaggatttc 660 cagatcaggg tccaggtgat cgaggggcgc cagctgccgg gggtgaacat caagcctgtg 720 gtcaaggtta ccgctgcagg gcagaccaag cggacgcgga tccacaaggg aaacagccca 780 ctcttcaatg agactctttt cttcaacttg tttgactctc ctggggagct gtttgatgag 840 cccatcttta tcacggtggt agactctcgt tctctcagga cagatgctct cctcggggag 900 ttccggatgg acgtgggcac catttacaga gagccccggc acgcctatct caggaagtgg 960 ctgctgctct cagaccctga tgacttctct gctggggcca gaggctacct gaaaacaagc 1020 ctttgtgtgc tggggcctgg ggacgaagcg cctctggaga gaaaagaccc ctctgaagac 1080 aaggaggaca ttgaaagcaa cctgctccgg cccacaggcg tagccctgcg aggagcccac 1140 ttctgcctga aggtcttccg ggccgaggac ttgccgcaga tggacgatgc cgtgatggac 1200 aacgtgaaac agatctttgg cttcgagagt aacaagaaga acttggtgga cccctttgtg 1260 gaggtcagct ttgcggggaa aatgctgtgc agcaagatct tggagaagac ggccaaccct 1320 cagtggaacc agaacatcac actgcctgcc atgtttccct ccatgtgcga aaaaatgagg 1380 attcgtatca tagactggga ccgcctgact cacaatgaca tcgtggctac cacctacctg 1440 agtatgtcga aaatctctgc ccctggagga gaaatagaag aggagcctgc aggtgctgtc 1500 aagccttcga aagcctcaga cttggatgac tacctgggct tcctccccac ttttgggccc 1560 tgctacatca acctctatgg cagtcccaga gagttcacag gcttcccaga cccctacaca 1620 gagctcaaca caggcaaggg ggaaggtgtg gcttatcgtg gccggcttct gctctccctg 1680 gagaccaagc tggtggagca cagtgaacag aaggtggagg accttcctgc ggatgacatc 1740 ctccgggtgg agaagtacct taggaggcgc aagtactccc tgtttgcggc cttctactca 1800 gccaccatgc tgcaggatgt ggatgatgcc atccagtttg aggtcagcat cgggaactac 1860 gggaacaagt tcgacatgac ctgcctgccg ctggcctcca ccactcagta cagccgtgca 1920 gtctttgacg ggtgccacta ctactaccta ccctggggta acgtgaaacc tgtggtggtg 1980 ctgtcatcct actgggagga catcagccat agaatcgaga ctcagaacca gctgcttggg 2040 attgctgacc ggctggaagc tggcctggag caggtccacc tggccctgaa ggcgcagtgc 2100 tccacggagg acgtggactc gctggtggct cagctgacgg atgagctcat cgcaggctgc 2160 agccagcctc tgggtgacat ccatgagaca ccctctgcca cccacctgga ccagtacctg 2220 taccagctgc gcacccatca cctgagccaa atcactgagg ctgccctggc cctgaagctc 2280 ggccacagtg agctccctgc agctctggag caggcggagg actggctcct gcgtctgcgt 2340 gccctggcag aggagcccca gaacagcctg ccggacatcg tcatctggat gctgcaggga 2400 gacaagcgtg tggcatacca gcgggtgccc gcccaccaag tcctcttctc ccggcggggt 2460 gccaactact gtggcaagaa ttgtgggaag ctacagacaa tctttctgaa atatccgatg 2520 gagaaggtgc ctggcgcccg gatgccagtg cagatacggg tcaagctgtg gtttgggctc 2580 tcagtggatg agaaggagtt caaccagttt gctgagggga agctgtctgt ctttgctgaa 2640 acctatgaga acgagactaa gttggccctt gttgggaact ggggcacaac gggcctcacc 2700 taccccaagt tttctgacgt cacgggcaag atcaagctac ccaaggacag cttccgcccc 2760 tcggccggct ggacctgggc tggagattgg ttcgtgtgtc cggagaagac tctgctccat 2820 gacatggacg ccggtcacct gagcttcgtg gaagaggtgt ttgagaacca gacccggctt 2880 cccggaggcc agtggatcta catgagtgac aactacaccg atgtgaacgg ggagaaggtg 2940 cttcccaagg atgacattga gtgcccactg ggctggaagt gggaagatga ggaatggtcc 3000 acagacctca accgggctgt cgatgagcaa ggctgggagt atagcatcac catccccccg 3060 gagcggaagc cgaagcactg ggtccctgct gagaagatgt actacacaca ccgacggcgg 3120 cgctgggtgc gcctgcgcag gagggatctc agccaaatgg aagcactgaa aaggcacagg 3180 caggcggagg cggagggcga gggctgggag tacgcctctc tttttggctg gaagttccac 3240 ctcgagtacc gcaagacaga tgccttccgc cgccgccgct ggcgccgtcg catggagcca 3300 ctggagaaga cggggcctgc agctgtgttt gcccttgagg 3340 <210> 14 <211> 3866 <212> DNA <213> Homo sapiens <220> <223> 3' hDYSF DNA fragment v2 <400> 14 tcgtcatctg gatgctgcag ggagacaagc gtgtggcata ccagcgggtg cccgcccacc 60 aagtcctctt ctcccggcgg ggtgccaact actgtggcaa gaattgtggg aagctacaga 120 caatctttct gaaatatccg atggagaagg tgcctggcgc ccggatgcca gtgcagatac 180 gggtcaagct gtggtttggg ctctcagtgg atgagaagga gttcaaccag tttgctgagg 240 ggaagctgtc tgtctttgct gaaacctatg agaacgagac taagttggcc cttgttggga 300 actggggcac aacgggcctc acctacccca agttttctga cgtcacgggc aagatcaagc 360 tacccaagga cagcttccgc ccctcggccg gctggacctg ggctggagat tggttcgtgt 420 gtccggagaa gactctgctc catgacatgg acgccggtca cctgagcttc gtggaagagg 480 tgtttgagaa ccagacccgg cttcccggag gccagtggat ctacatgagt gacaactaca 540 ccgatgtgaa cggggagaag gtgcttccca aggatgacat tgagtgccca ctgggctgga 600 agtgggaaga tgaggaatgg tccacagacc tcaaccgggc tgtcgatgag caaggctggg 660 agtatagcat caccatcccc ccggagcgga agccgaagca ctgggtccct gctgagaaga 720 tgtactacac acaccgacgg cggcgctggg tgcgcctgcg caggagggat ctcagccaaa 780 tggaagcact gaaaaggcac aggcaggcgg aggcggaggg cgagggctgg gagtacgcct 840 ctctttttgg ctggaagttc cacctcgagt accgcaagac agatgccttc cgccgccgcc 900 gctggcgccg tcgcatggag ccactggaga agacggggcc tgcagctgtg tttgcccttg 960 agggggccct gggcggcgtg atggatgaca agagtgaaga ttccatgtcc gtctccacct 1020 tgagcttcgg tgtgaacaga cccacgattt cctgcatatt cgactatggg aaccgctacc 1080 atctacgctg ctacatgtac caggcccggg acctggctgc gatggacaag gactcttttt 1140 ctgatcccta tgccatcgtc tccttcctgc accagagcca gaagacggtg gtggtgaaga 1200 acacccttaa ccccacctgg gaccagacgc tcatcttcta cgagatcgag atctttggcg 1260 agccggccac agttgctgag caaccgccca gcattgtggt ggagctgtac gaccatgaca 1320 cttatggtgc agacgagttt atgggtcgct gcatctgtca accgagtctg gaacggatgc 1380 cacggctggc ctggttccca ctgacgaggg gcagccagcc gtcgggggag ctgctggcct 1440 cttttgagct catccagaga gagaagccgg ccatccacca tattcctggt tttgaggtgc 1500 aggagacatc aaggatcctg gatgagtctg aggacacaga cctgccctac ccaccacccc 1560 agagggaggc caacatctac atggttcctc agaacatcaa gccagcgctc cagcgtaccg 1620 ccatcgagat cctggcatgg ggcctgcgga acatgaagag ttaccagctg gccaacatct 1680 cctcccccag cctcgtggta gagtgtgggg gccagacggt gcagtcctgt gtcatcagga 1740 acctccggaa gaaccccaac tttgacatct gcaccctctt catggaagtg atgctgccca 1800 gggaggagct ctactgcccc cccatcaccg tcaaggtcat cgataaccgc cagtttggcc 1860 gccggcctgt ggtgggccag tgtaccatcc gctccctgga gagcttcctg tgtgacccct 1920 actcggcgga gagtccatcc ccacagggtg gcccagacga tgtgagccta ctcagtcctg 1980 gggaagacgt gctcatcgac attgatgaca aggagcccct catccccatc caggaggaag 2040 agttcatcga ttggtggagc aaattctttg cctccatagg ggagagggaa aagtgcggct 2100 cctacctgga gaaggatttt gacaccctga aggtctatga cacacagctg gagaatgtgg 2160 aggcctttga gggcctgtct gacttttgta acaccttcaa gctgtaccgg ggcaagacgc 2220 aggaggagac agaagatcca tctgtgattg gtgaatttaa gggcctcttc aaaatttatc 2280 ccctcccaga agacccagcc atccccatgc ccccaagaca gttccaccag ctggccgccc 2340 agggacccca ggagtgcttg gtccgtatct acattgtccg agcatttggc ctgcagccca 2400 aggaccccaa tggaaagtgt gatccttaca tcaagatctc catagggaag aaatcagtga 2460 gtgaccagga taactacatc ccctgcacgc tggagcccgt atttggaaag atgttcgagc 2520 tgacctgcac tctgcctctg gagaaggacc taaagatcac tctctatgac tatgacctcc 2580 tctccaagga cgaaaagatc ggtgagacgg tcgtcgacct ggagaacagg ctgctgtcca 2640 agtttggggc tcgctgtgga ctcccacaga cctactgtgt ctctggaccg aaccagtggc 2700 gggaccagct ccgcccctcc cagctcctcc acctcttctg ccagcagcat agagtcaagg 2760 cacctgtgta ccggacagac cgtgtaatgt ttcaggataa agaatattcc attgaagaga 2820 tagaggctgg caggatccca aacccacacc tgggcccagt ggaggagcgt ctggctctgc 2880 atgtgcttca gcagcagggc ctggtcccgg agcacgtgga gtcacggccc ctctacagcc 2940 ccctgcagcc agacatcgag caggggaagc tgcagatgtg ggtcgaccta tttccgaagg 3000 ccctggggcg gcctggacct cccttcaaca tcaccccacg gagagccaga aggtttttcc 3060 tgcgttgtat tatctggaat accagagatg tgatcctgga tgacctgagc ctcacggggg 3120 agaagatgag cgacatttat gtgaaaggtt ggatgattgg ctttgaagaa cacaagcaaa 3180 agacagacgt gcattatcgt tccctgggag gtgaaggcaa cttcaactgg aggttcattt 3240 tccccttcga ctacctgcca gctgagcaag tctgtaccat tgccaagaag gatgccttct 3300 ggaggctgga caagactgag agcaaaatcc cagcacgagt ggtgttccag atctgggaca 3360 atgacaagtt ctcctttgat gattttctgg gctccctgca gctcgatctc aaccgcatgc 3420 ccaagccagc caagacagcc aagaagtgct ccttggacca gctggatgat gctttccacc 3480 cagaatggtt tgtgtccctt tttgagcaga aaacagtgaa gggctggtgg ccctgtgtag 3540 cagaagaggg tgagaagaaa atactggcgg gcaagctgga aatgaccttg gagattgtag 3600 cagagagtga gcatgaggag cggcctgctg gccagggccg ggatgagccc aacatgaacc 3660 ctaagcttga ggacccaagg cgccccgaca cctccttcct gtggtttacc tccccataca 3720 agaccatgaa gttcatcctg tggcggcgtt tccggtgggc catcatcctc ttcatcatcc 3780 tcttcatcct gctgctgttc ctggccatct tcatctacgc cttcccgaac tatgctgcca 3840 tgaagctggt gaagcccttc agctga 3866 <210> 15 <211> 4689 <212> DNA <213> Artificial Sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <223> DYSF F5' Expresson Cassette v2 <400> 15 gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg 60 tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact ccatcactag 120 gggttccttg tagttaatga ttaacccgcc atgctctaga gtttaagctt gcatgtctaa 180 gctagaccct tcagattaaa aataactgag gtaagggcct gggtagggga ggtggtgtga 240 gacgctcctg tctctcctct atctgcccat cggccctttg gggaggagga atgtgcccaa 300 ggactaaaaa aaggccatgg agccagaggg gcgagggcaa cagacctttc atgggcaaac 360 cttggggccc tgctgtctag catgccccac tacgggtcta ggctgcccat gtaaggaggc 420 aaggcctggg gacacccgag atgcctggtt ataattaacc cagacatgtg gctgcccccc 480 cccccccaac acctgctgcc tctaaaaata accctgtccc tggtggatcc cctgcatgcg 540 aagatcttcg aacaaggctg tgggggactg agggcaggct gtaacaggct tgggggccag 600 ggcttatacg tgcctgggac tcccaaagta ttactgttcc atgttcccgg cgaagggcca 660 gctgtccccc gccagctaga ctcagcactt agtttaggaa ccagtgagca agtcagccct 720 tggggcagcc catacaaggc catggggctg ggcaagctgc acgcctgggt ccggggtggg 780 cacggtgccc gggcaacgag ctgaaagctc atctgctctc aggggcccct ccctggggac 840 agcccctcct ggctagtcac accctgtagg ctcctctata taacccaggg gcacaggggc 900 tgccctcatt ctaccaccac ctccacagca cagacagaca ctcaggagca gccagcggcg 960 cgcccaggta agtttagtct ttttgtcttt tatttcaggt cccggatccg gtggtggtgc 1020 aaatcaaaga actgctcctc agtggatgtt gcctttactt ctaggcctgt acggaagtgt 1080 tacttctgct ctaaaagctg cggaattgta cccgcggccg cggctagcca ccatgctgag 1140 ggtcttcatc ctctatgccg agaacgtcca cacacccgac accgacatca gcgatgccta 1200 ctgctccgcg gtgtttgcag gggtgaagaa gagaaccaaa gtcatcaaga acagcgtgaa 1260 ccctgtatgg aatgagggat ttgaatggga cctcaagggc atccccctgg accagggctc 1320 tgagcttcat gtggtggtca aagaccatga gacgatgggg aggaacaggt tcctggggga 1380 agccaaggtc ccactccgag aggtcctcgc cacccctagt ctgtccgcca gcttcaatgc 1440 ccccctgctg gacaccaaga agcagcccac aggggcctcg ctggtcctgc aggtgtccta 1500 cacaccgctg cctggagctg tgcccctgtt cccgccccct actcctctgg agccctcccc 1560 gactctgcct gacctggatg tagtggcaga cacaggagga gaggaagaca cagaggacca 1620 gggactcact ggagatgagg cggagccatt cctggatcaa agcggaggcc cgggggctcc 1680 caccacccca aggaaactac cttcacgtcc tccgccccac taccccggga tcaaaagaaa 1740 gcgaagtgcg cctacatcta gaaagctgct gtcagacaaa ccgcaggatt tccagatcag 1800 ggtccaggtg atcgaggggc gccagctgcc gggggtgaac atcaagcctg tggtcaaggt 1860 taccgctgca gggcagacca agcggacgcg gatccacaag ggaaacagcc cactcttcaa 1920 tgagactctt ttcttcaact tgtttgactc tcctggggag ctgtttgatg agcccatctt 1980 tatcacggtg gtagactctc gttctctcag gacagatgct ctcctcgggg agttccggat 2040 ggacgtgggc accatttaca gagagccccg gcacgcctat ctcaggaagt ggctgctgct 2100 ctcagaccct gatgacttct ctgctggggc cagaggctac ctgaaaacaa gcctttgtgt 2160 gctggggcct ggggacgaag cgcctctgga gagaaaagac ccctctgaag acaaggagga 2220 cattgaaagc aacctgctcc ggcccacagg cgtagccctg cgaggagccc acttctgcct 2280 gaaggtcttc cgggccgagg acttgccgca gatggacgat gccgtgatgg acaacgtgaa 2340 acagatcttt ggcttcgaga gtaacaagaa gaacttggtg gacccctttg tggaggtcag 2400 ctttgcgggg aaaatgctgt gcagcaagat cttggagaag acggccaacc ctcagtggaa 2460 ccagaacatc acactgcctg ccatgtttcc ctccatgtgc gaaaaaatga ggattcgtat 2520 catagactgg gaccgcctga ctcacaatga catcgtggct accacctacc tgagtatgtc 2580 gaaaatctct gcccctggag gagaaataga agaggagcct gcaggtgctg tcaagccttc 2640 gaaagcctca gacttggatg actacctggg cttcctcccc acttttgggc cctgctacat 2700 caacctctat ggcagtccca gagagttcac aggcttccca gacccctaca cagagctcaa 2760 cacaggcaag ggggaaggtg tggcttatcg tggccggctt ctgctctccc tggagaccaa 2820 gctggtggag cacagtgaac agaaggtgga ggaccttcct gcggatgaca tcctccgggt 2880 ggagaagtac cttaggaggc gcaagtactc cctgtttgcg gccttctact cagccaccat 2940 gctgcaggat gtggatgatg ccatccagtt tgaggtcagc atcgggaact acgggaacaa 3000 gttcgacatg acctgcctgc cgctggcctc caccactcag tacagccgtg cagtctttga 3060 cgggtgccac tactactacc taccctgggg taacgtgaaa cctgtggtgg tgctgtcatc 3120 ctactgggag gacatcagcc atagaatcga gactcagaac cagctgcttg ggattgctga 3180 ccggctggaa gctggcctgg agcaggtcca cctggccctg aaggcgcagt gctccacgga 3240 ggacgtggac tcgctggtgg ctcagctgac ggatgagctc atcgcaggct gcagccagcc 3300 tctgggtgac atccatgaga caccctctgc cacccacctg gaccagtacc tgtaccagct 3360 gcgcacccat cacctgagcc aaatcactga ggctgccctg gccctgaagc tcggccacag 3420 tgagctccct gcagctctgg agcaggcgga ggactggctc ctgcgtctgc gtgccctggc 3480 agaggagccc cagaacagcc tgccggacat cgtcatctgg atgctgcagg gagacaagcg 3540 tgtggcatac cagcgggtgc ccgcccacca agtcctcttc tcccggcggg gtgccaacta 3600 ctgtggcaag aattgtggga agctacagac aatctttctg aaatatccga tggagaaggt 3660 gcctggcgcc cggatgccag tgcagatacg ggtcaagctg tggtttgggc tctcagtgga 3720 tgagaaggag ttcaaccagt ttgctgaggg gaagctgtct gtctttgctg aaacctatga 3780 gaacgagact aagttggccc ttgttgggaa ctggggcaca acgggcctca cctaccccaa 3840 gttttctgac gtcacgggca agatcaagct acccaaggac agcttccgcc cctcggccgg 3900 ctggacctgg gctggagatt ggttcgtgtg tccggagaag actctgctcc atgacatgga 3960 cgccggtcac ctgagcttcg tggaagaggt gtttgagaac cagacccggc ttcccggagg 4020 ccagtggatc tacatgagtg acaactacac cgatgtgaac ggggagaagg tgcttcccaa 4080 ggatgacatt gagtgcccac tgggctggaa gtgggaagat gaggaatggt ccacagacct 4140 caaccgggct gtcgatgagc aaggctggga gtatagcatc accatccccc cggagcggaa 4200 gccgaagcac tgggtccctg ctgagaagat gtactacaca caccgacggc ggcgctgggt 4260 gcgcctgcgc aggagggatc tcagccaaat ggaagcactg aaaaggcaca ggcaggcgga 4320 ggcggagggc gagggctggg agtacgcctc tctttttggc tggaagttcc acctcgagta 4380 ccgcaagaca gatgccttcc gccgccgccg ctggcgccgt cgcatggagc cactggagaa 4440 gacggggcct gcagctgtgt ttgcccttga gggcggccgc aataaaagat ctttattttc 4500 attagatctg tgtgttggtt ttttgtgtgt ctagagcatg gcgggttaat cattaactac 4560 aaggaacccc tagtgatgga gttggccact ccctctctgc gcgctcgctc gctcactgag 4620 gccgggcgac caaaggtcgc ccgacgcccg ggctttgccc gggcggcctc agtgagcgag 4680 cgagcgcgc 4689 <210> 16 <211> 4592 <212> DNA <213> Artificial Sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <223> DYSF F3' Expression Cassette v2 <400> 16 gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg 60 tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact ccatcactag 120 gggttccttg tagttaatga ttaacccgcc atgctactta tctacgtagc catgctctgg 180 tcgactctag agttttcgtc atctggatgc tgcagggaga caagcgtgtg gcataccagc 240 gggtgcccgc ccaccaagtc ctcttctccc ggcggggtgc caactactgt ggcaagaatt 300 gtgggaagct acagacaatc tttctgaaat atccgatgga gaaggtgcct ggcgcccgga 360 tgccagtgca gatacgggtc aagctgtggt ttgggctctc agtggatgag aaggagttca 420 accagtttgc tgaggggaag ctgtctgtct ttgctgaaac ctatgagaac gagactaagt 480 tggcccttgt tgggaactgg ggcacaacgg gcctcaccta ccccaagttt tctgacgtca 540 cgggcaagat caagctaccc aaggacagct tccgcccctc ggccggctgg acctgggctg 600 gagattggtt cgtgtgtccg gagaagactc tgctccatga catggacgcc ggtcacctga 660 gcttcgtgga agaggtgttt gagaaccaga cccggcttcc cggaggccag tggatctaca 720 tgagtgacaa ctacaccgat gtgaacgggg agaaggtgct tcccaaggat gacattgagt 780 gcccactggg ctggaagtgg gaagatgagg aatggtccac agacctcaac cgggctgtcg 840 atgagcaagg ctgggagtat agcatcacca tccccccgga gcggaagccg aagcactggg 900 tccctgctga gaagatgtac tacacacacc gacggcggcg ctgggtgcgc ctgcgcagga 960 gggatctcag ccaaatggaa gcactgaaaa ggcacaggca ggcggaggcg gagggcgagg 1020 gctgggagta cgcctctctt tttggctgga agttccacct cgagtaccgc aagacagatg 1080 ccttccgccg ccgccgctgg cgccgtcgca tggagccact ggagaagacg gggcctgcag 1140 ctgtgtttgc ccttgagggg gccctgggcg gcgtgatgga tgacaagagt gaagattcca 1200 tgtccgtctc caccttgagc ttcggtgtga acagacccac gatttcctgc atattcgact 1260 atgggaaccg ctaccatcta cgctgctaca tgtaccaggc ccgggacctg gctgcgatgg 1320 acaaggactc tttttctgat ccctatgcca tcgtctcctt cctgcaccag agccagaaga 1380 cggtggtggt gaagaacacc cttaacccca cctgggacca gacgctcatc ttctacgaga 1440 tcgagatctt tggcgagccg gccacagttg ctgagcaacc gcccagcatt gtggtggagc 1500 tgtacgacca tgacacttat ggtgcagacg agtttatggg tcgctgcatc tgtcaaccga 1560 gtctggaacg gatgccacgg ctggcctggt tcccactgac gaggggcagc cagccgtcgg 1620 gggagctgct ggcctctttt gagctcatcc agagagagaa gccggccatc caccatattc 1680 ctggttttga ggtgcaggag acatcaagga tcctggatga gtctgaggac acagacctgc 1740 cctacccacc accccagagg gaggccaaca tctacatggt tcctcagaac atcaagccag 1800 cgctccagcg taccgccatc gagatcctgg catggggcct gcggaacatg aagagttacc 1860 agctggccaa catctcctcc cccagcctcg tggtagagtg tgggggccag acggtgcagt 1920 cctgtgtcat caggaacctc cggaagaacc ccaactttga catctgcacc ctcttcatgg 1980 aagtgatgct gcccagggag gagctctact gcccccccat caccgtcaag gtcatcgata 2040 accgccagtt tggccgccgg cctgtggtgg gccagtgtac catccgctcc ctggagagct 2100 tcctgtgtga cccctactcg gcggagagtc catccccaca gggtggccca gacgatgtga 2160 gcctactcag tcctggggaa gacgtgctca tcgacattga tgacaaggag cccctcatcc 2220 ccatccagga ggaagagttc atcgattggt ggagcaaatt ctttgcctcc ataggggaga 2280 gggaaaagtg cggctcctac ctggagaagg attttgacac cctgaaggtc tatgacacac 2340 agctggagaa tgtggaggcc tttgagggcc tgtctgactt ttgtaacacc ttcaagctgt 2400 accggggcaa gacgcaggag gagacagaag atccatctgt gattggtgaa tttaagggcc 2460 tcttcaaaat ttatcccctc ccagaagacc cagccatccc catgccccca agacagttcc 2520 accagctggc cgcccaggga ccccaggagt gcttggtccg tatctacatt gtccgagcat 2580 ttggcctgca gcccaaggac cccaatggaa agtgtgatcc ttacatcaag atctccatag 2640 ggaagaaatc agtgagtgac caggataact acatcccctg cacgctggag cccgtatttg 2700 gaaagatgtt cgagctgacc tgcactctgc ctctggagaa ggacctaaag atcactctct 2760 atgactatga cctcctctcc aaggacgaaa agatcggtga gacggtcgtc gacctggaga 2820 acaggctgct gtccaagttt ggggctcgct gtggactccc acagacctac tgtgtctctg 2880 gaccgaacca gtggcgggac cagctccgcc cctcccagct cctccacctc ttctgccagc 2940 agcatagagt caaggcacct gtgtaccgga cagaccgtgt aatgtttcag gataaagaat 3000 attccattga agagatagag gctggcagga tcccaaaccc acacctgggc ccagtggagg 3060 agcgtctggc tctgcatgtg cttcagcagc agggcctggt cccggagcac gtggagtcac 3120 ggcccctcta cagccccctg cagccagaca tcgagcaggg gaagctgcag atgtgggtcg 3180 acctatttcc gaaggccctg gggcggcctg gacctccctt caacatcacc ccacggagag 3240 ccagaaggtt tttcctgcgt tgtattatct ggaataccag agatgtgatc ctggatgacc 3300 tgagcctcac gggggagaag atgagcgaca tttatgtgaa aggttggatg attggctttg 3360 aagaacacaa gcaaaagaca gacgtgcatt atcgttccct gggaggtgaa ggcaacttca 3420 actggaggtt cattttcccc ttcgactacc tgccagctga gcaagtctgt accattgcca 3480 agaaggatgc cttctggagg ctggacaaga ctgagagcaa aatcccagca cgagtggtgt 3540 tccagatctg ggacaatgac aagttctcct ttgatgattt tctgggctcc ctgcagctcg 3600 atctcaaccg catgcccaag ccagccaaga cagccaagaa gtgctccttg gaccagctgg 3660 atgatgcttt ccacccagaa tggtttgtgt ccctttttga gcagaaaaca gtgaagggct 3720 ggtggccctg tgtagcagaa gagggtgaga agaaaatact ggcgggcaag ctggaaatga 3780 ccttggagat tgtagcagag agtgagcatg aggagcggcc tgctggccag ggccgggatg 3840 agcccaacat gaaccctaag cttgaggacc caaggcgccc cgacacctcc ttcctgtggt 3900 ttacctcccc atacaagacc atgaagttca tcctgtggcg gcgtttccgg tgggccatca 3960 tcctcttcat catcctcttc atcctgctgc tgttcctggc catcttcatc tacgccttcc 4020 cgaactatgc tgccatgaag ctggtgaagc ccttcagctg aggactctcc tgccctgtag 4080 aaggggccgt ggggtcccct ccagcatggg actggcctgc ctcctccgcc cagctcggcg 4140 agctcctcca gacctcctag gcctgattgt cctgccaggg tgggcagaca gacagatgga 4200 ccggcccaca ctcccagagt tgctaacatg gagctctgag atcaccccac ttccatcatt 4260 tccttctccc ccaacccaac gcttttttgg atcagctcag acatatttca gtataaaaca 4320 gttggaacca caaaaaaaaa aaaaaaaagt cgacgcggcc gcaataaaag atctttattt 4380 tcattagatc tgtgtgttgg ttttttgtgt gtctagagca tggctacgta gataagtagc 4440 atggcgggtt aatcattaac tacaaggaac ccctagtgat ggagttggcc actccctctc 4500 tgcgcgctcg ctcgctcact gaggccgggc gaccaaaggt cgcccgacgc ccgggctttg 4560 cccgggcggc ctcagtgagc gagcgagcgc gc 4592 <210> 17 <211> 128 <212> DNA <213> Artificial Sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <223> 3' ITR <400> 17 aggaacccct agtgatggag ttggccactc cctctctgcg cgctcgctcg ctcactgagg 60 ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc 120 gagcgcgc 128 <210> 18 <211> 8341 <212> DNA <213> Artificial Sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <223> Plasmid sequence for DYS F5 Expression Cassette <400> 18 gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg 60 tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact ccatcactag 120 gggttccttg tagttaatga ttaacccgcc atgctctaga gtttaagctt gcatgtctaa 180 gctagaccct tcagattaaa aataactgag gtaagggcct gggtagggga ggtggtgtga 240 gacgctcctg tctctcctct atctgcccat cggccctttg gggaggagga atgtgcccaa 300 ggactaaaaa aaggccatgg agccagaggg gcgagggcaa cagacctttc atgggcaaac 360 cttggggccc tgctgtctag catgccccac tacgggtcta ggctgcccat gtaaggaggc 420 aaggcctggg gacacccgag atgcctggtt ataattaacc cagacatgtg gctgcccccc 480 cccccccaac acctgctgcc tctaaaaata accctgtccc tggtggatcc cctgcatgcg 540 aagatcttcg aacaaggctg tgggggactg agggcaggct gtaacaggct tgggggccag 600 ggcttatacg tgcctgggac tcccaaagta ttactgttcc atgttcccgg cgaagggcca 660 gctgtccccc gccagctaga ctcagcactt agtttaggaa ccagtgagca agtcagccct 720 tggggcagcc catacaaggc catggggctg ggcaagctgc acgcctgggt ccggggtggg 780 cacggtgccc gggcaacgag ctgaaagctc atctgctctc aggggcccct ccctggggac 840 agcccctcct ggctagtcac accctgtagg ctcctctata taacccaggg gcacaggggc 900 tgccctcatt ctaccaccac ctccacagca cagacagaca ctcaggagca gccagcggcg 960 cgcccaggta agtttagtct ttttgtcttt tatttcaggt cccggatccg gtggtggtgc 1020 aaatcaaaga actgctcctc agtggatgtt gcctttactt ctaggcctgt acggaagtgt 1080 tacttctgct ctaaaagctg cggaattgta cccgcggccg cggctagcca ccatgctgag 1140 ggtcttcatc ctctatgccg agaacgtcca cacacccgac accgacatca gcgatgccta 1200 ctgctccgcg gtgtttgcag gggtgaagaa gagaaccaaa gtcatcaaga acagcgtgaa 1260 ccctgtatgg aatgagggat ttgaatggga cctcaagggc atccccctgg accagggctc 1320 tgagcttcat gtggtggtca aagaccatga gacgatgggg aggaacaggt tcctggggga 1380 agccaaggtc ccactccgag aggtcctcgc cacccctagt ctgtccgcca gcttcaatgc 1440 ccccctgctg gacaccaaga agcagcccac aggggcctcg ctggtcctgc aggtgtccta 1500 cacaccgctg cctggagctg tgcccctgtt cccgccccct actcctctgg agccctcccc 1560 gactctgcct gacctggatg tagtggcaga cacaggagga gaggaagaca cagaggacca 1620 gggactcact ggagatgagg cggagccatt cctggatcaa agcggaggcc cgggggctcc 1680 caccacccca aggaaactac cttcacgtcc tccgccccac taccccggga tcaaaagaaa 1740 gcgaagtgcg cctacatcta gaaagctgct gtcagacaaa ccgcaggatt tccagatcag 1800 ggtccaggtg atcgaggggc gccagctgcc gggggtgaac atcaagcctg tggtcaaggt 1860 taccgctgca gggcagacca agcggacgcg gatccacaag ggaaacagcc cactcttcaa 1920 tgagactctt ttcttcaact tgtttgactc tcctggggag ctgtttgatg agcccatctt 1980 tatcacggtg gtagactctc gttctctcag gacagatgct ctcctcgggg agttccggat 2040 ggacgtgggc accatttaca gagagccccg gcacgcctat ctcaggaagt ggctgctgct 2100 ctcagaccct gatgacttct ctgctggggc cagaggctac ctgaaaacaa gcctttgtgt 2160 gctggggcct ggggacgaag cgcctctgga gagaaaagac ccctctgaag acaaggagga 2220 cattgaaagc aacctgctcc ggcccacagg cgtagccctg cgaggagccc acttctgcct 2280 gaaggtcttc cgggccgagg acttgccgca gatggacgat gccgtgatgg acaacgtgaa 2340 acagatcttt ggcttcgaga gtaacaagaa gaacttggtg gacccctttg tggaggtcag 2400 ctttgcgggg aaaatgctgt gcagcaagat cttggagaag acggccaacc ctcagtggaa 2460 ccagaacatc acactgcctg ccatgtttcc ctccatgtgc gaaaaaatga ggattcgtat 2520 catagactgg gaccgcctga ctcacaatga catcgtggct accacctacc tgagtatgtc 2580 gaaaatctct gcccctggag gagaaataga agaggagcct gcaggtgctg tcaagccttc 2640 gaaagcctca gacttggatg actacctggg cttcctcccc acttttgggc cctgctacat 2700 caacctctat ggcagtccca gagagttcac aggcttccca gacccctaca cagagctcaa 2760 cacaggcaag ggggaaggtg tggcttatcg tggccggctt ctgctctccc tggagaccaa 2820 gctggtggag cacagtgaac agaaggtgga ggaccttcct gcggatgaca tcctccgggt 2880 ggagaagtac cttaggaggc gcaagtactc cctgtttgcg gccttctact cagccaccat 2940 gctgcaggat gtggatgatg ccatccagtt tgaggtcagc atcgggaact acgggaacaa 3000 gttcgacatg acctgcctgc cgctggcctc caccactcag tacagccgtg cagtctttga 3060 cgggtgccac tactactacc taccctgggg taacgtgaaa cctgtggtgg tgctgtcatc 3120 ctactgggag gacatcagcc atagaatcga gactcagaac cagctgcttg ggattgctga 3180 ccggctggaa gctggcctgg agcaggtcca cctggccctg aaggcgcagt gctccacgga 3240 ggacgtggac tcgctggtgg ctcagctgac ggatgagctc atcgcaggct gcagccagcc 3300 tctgggtgac atccatgaga caccctctgc cacccacctg gaccagtacc tgtaccagct 3360 gcgcacccat cacctgagcc aaatcactga ggctgccctg gccctgaagc tcggccacag 3420 tgagctccct gcagctctgg agcaggcgga ggactggctc ctgcgtctgc gtgccctggc 3480 agaggagccc cagaacagcc tgccggacat cgtcatctgg atgctgcagg gagacaagcg 3540 tgtggcatac cagcgggtgc ccgcccacca agtcctcttc tcccggcggg gtgccaacta 3600 ctgtggcaag aattgtggga agctacagac aatctttctg aaatatccga tggagaaggt 3660 gcctggcgcc cggatgccag tgcagatacg ggtcaagctg tggtttgggc tctctgtgga 3720 tgagaaggag ttcaaccagt ttgctgaggg gaagctgtct gtctttgctg aaacctatga 3780 gaacgagact aagttggccc ttgttgggaa ctggggcaca acgggcctca cctaccccaa 3840 gttttctgac gtcacgggca agatcaagct acccaaggac agcttccgcc cctcggccgg 3900 ctggacctgg gctggagatt ggttcgtgtg tccggagaag actctgctcc atgacatgga 3960 cgccggtcac ctgagcttcg tggaagaggt gtttgagaac cagacccggc ttcccggagg 4020 ccagtggatc tacatgagtg acaactacac cgatgtgaac ggggagaagg tgcttcccaa 4080 ggatgacatt gagtgcccac tgggctggaa gtgggaagat gaggaatggt ccacagacct 4140 caaccgggct gtcgatgagc aaggctggga gtatagcatc accatccccc cggagcggaa 4200 gccgaagcac tgggtccctg ctgagaagat gtactacaca caccgacggc ggcgctgggt 4260 gcgcctgcgc aggagggatc tcagccaaat ggaagcactg aaaaggcaca ggcaggcgga 4320 ggcggagggc gagggctggg agtacgcctc tctttttggc tggaagttcc acctcgagta 4380 ccgcaagaca gatgccttcc gccgccgccg ctggcgccgt cgcatggagc cactggagaa 4440 gacggggcct gcagctgtgt ttgcccttga gggcggccgc aataaaagat ctttattttc 4500 attagatctg tgtgttggtt ttttgtgtgt ctagagcatg gcgggttaat cattaactac 4560 aaggaacccc tagtgatgga gttggccact ccctctctgc gcgctcgctc gctcactgag 4620 gccgggcgac caaaggtcgc ccgacgcccg ggctttgccc gggcggcctc agtgagcgag 4680 cgagcgcgca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg 4740 ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag 4800 cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag 4860 gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc 4920 tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc 4980 agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc 5040 tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt 5100 cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg 5160 ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat 5220 ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag 5280 ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt 5340 ggtggcctaa ctacggctac actagaagaa cagtatttgg tatctgcgct ctgctgaagc 5400 cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta 5460 gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag 5520 atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga 5580 ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa 5640 gttttaaatc aatctaaagt atatatgagt aaaaatattc cggaattgcc agctggggcg 5700 ccctctggta aggttgggaa gccctgcaaa gtaaactgga tggctttctt gccgccaagg 5760 atctgatggc gcaggggatc aagatctgat caagagacag gatgaggatc gtttcgcatg 5820 attgaacaag atggattgca cgcaggttct ccggccgctt gggtggagag gctattcggc 5880 tatgactggg cacaacagac aatcggctgc tctgatgccg ccgtgttccg gctgtcagcg 5940 caggggcgcc cggttctttt tgtcaagacc gacctgtccg gtgccctgaa tgaactgcag 6000 gacgaggcag cgcggctatc gtggctggcc acgacgggcg ttccttgcgc agctgtgctc 6060 gacgttgtca ctgaagcggg aagggactgg ctgctattgg gcgaagtgcc ggggcaggat 6120 ctcctgtcat cccaccttgc tcctgccgag aaagtatcca tcatggctga tgcaatgcgg 6180 cggctgcata cgcttgatcc ggctacctgc ccattcgacc accaagcgaa acatcgcatc 6240 gagcgagcac gtactcggat ggaagccggt cttgtcgatc aggatgatct ggacgaagag 6300 catcaggggc tcgcgccagc cgaactgttc gccaggctca aggcgcgcat gcccgacggc 6360 gaggatctcg tcgtgaccca tggcgatgcc tgcttgccga atatcatggt ggaaaatggc 6420 cgcttttctg gattcatcga ctgtggccgg ctgggtgtgg cggaccgcta tcaggacata 6480 gcgttggcta cccgtgatat tgctgaagag cttggcggcg aatgggctga ccgcttcctc 6540 gtgctttacg gtatcgccgc tcccgattcg cagcgcatcg ccttctatcg ccttcttgac 6600 gagttcttct gaaccggtaa tattattgaa gcatttatca gggttattgt ctcatgagcg 6660 gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc acatttcccc 6720 gaaaagtgcc acctgacgtc taagaaacca ttattatcat gacattaacc tataaaaata 6780 ggcgtatcac gaggcccttt cgtctcgcgc gtttcggtga tgacggtgaa aacctctgac 6840 acatgcagct cccggagacg gtcacagctt gtctgtaagc ggatgccggg agcagacaag 6900 cccgtcaggg cgcgtcagcg ggtgttggcg ggtgtcgggg ctggcttaac tatgcggcat 6960 cagagcagat tgtactgaga gtgcaccata tgcggtgtga aataccgcac agatgcgtaa 7020 ggagaaaata ccgcatcagg cgattccaac atccaataaa tcatacaggc aaggcaaaga 7080 attagcaaaa ttaagcaata aagcctcaga gcataaagct aaatcggttg taccaaaaac 7140 attatgaccc tgtaatactt ttgcgggaga agcctttatt tcaacgcaag gataaaaatt 7200 tttagaaccc tcatatattt taaatgcaat gcctgagtaa tgtgtaggta aagattcaaa 7260 cgggtgagaa aggccggaga cagtcaaatc accatcaata tgatattcaa ccgttctagc 7320 tgataaattc atgccggaga gggtagctat ttttgagagg tctctacaaa ggctatcagg 7380 tcattgcctg agagtctgga gcaaacaaga gaatcgatga acggtaatcg taaaactagc 7440 atgtcaatca tatgtacccc ggttgataat cagaaaagcc ccaaaaacag gaagattgta 7500 taagcaaata tttaaattgt aagcgttaat attttgttaa aattcgcgtt aaatttttgt 7560 taaatcagct cattttttaa ccaataggcc gaaatcggca aaatccctta taaatcaaaa 7620 gaatagaccg agatagggtt gagtgttgtt ccagtttgga acaagagtcc actattaaag 7680 aacgtggact ccaacgtcaa agggcgaaaa accgtctatc agggcgatgg cccactacgt 7740 gaaccatcac cctaatcaag ttttttgggg tcgaggtgcc gtaaagcact aaatcggaac 7800 cctaaaggga gcccccgatt tagagcttga cggggaaagc cggcgaacgt ggcgagaaag 7860 gaagggaaga aagcgaaagg agcgggcgct agggcgctgg caagtgtagc ggtcacgctg 7920 cgcgtaacca ccacacccgc cgcgcttaat gcgccgctac agggcgcgta ctatggttgc 7980 tttgacgagc acgtataacg tgctttcctc gttagaatca gagcgggagc taaacaggag 8040 gccgattaaa gggattttag acaggaacgg tacgccagaa tcctgagaag tgtttttata 8100 atcagtgagg ccaccgagta aaagagtctg tccatcacgc aaattaaccg ttgtcgcaat 8160 acttctttga ttagtaataa catcacttgc ctgagtagaa gaactcaaac tatcggcctt 8220 gctggtaata tccagaacaa tattaccgcc agccattgca acggaatcgc cattcgccat 8280 tcaggctgcg caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc 8340 t 8341 <210> 19 <211> 8244 <212> DNA <213> Artificial Sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <223> Plasmid sequence for DYS F3 Expression Cassette <400> 19 gtagccatgc tctggtcgac tctagagttt tcgtcatctg gatgctgcag ggagacaagc 60 gtgtggcata ccagcgggtg cccgcccacc aagtcctctt ctcccggcgg ggtgccaact 120 actgtggcaa gaattgtggg aagctacaga caatctttct gaaatatccg atggagaagg 180 tgcctggcgc ccggatgcca gtgcagatac gggtcaagct gtggtttggg ctctctgtgg 240 atgagaagga gttcaaccag tttgctgagg ggaagctgtc tgtctttgct gaaacctatg 300 agaacgagac taagttggcc cttgttggga actggggcac aacgggcctc acctacccca 360 agttttctga cgtcacgggc aagatcaagc tacccaagga cagcttccgc ccctcggccg 420 gctggacctg ggctggagat tggttcgtgt gtccggagaa gactctgctc catgacatgg 480 acgccggtca cctgagcttc gtggaagagg tgtttgagaa ccagacccgg cttcccggag 540 gccagtggat ctacatgagt gacaactaca ccgatgtgaa cggggagaag gtgcttccca 600 aggatgacat tgagtgccca ctgggctgga agtgggaaga tgaggaatgg tccacagacc 660 tcaaccgggc tgtcgatgag caaggctggg agtatagcat caccatcccc ccggagcgga 720 agccgaagca ctgggtccct gctgagaaga tgtactacac acaccgacgg cggcgctggg 780 tgcgcctgcg caggagggat ctcagccaaa tggaagcact gaaaaggcac aggcaggcgg 840 aggcggaggg cgagggctgg gagtacgcct ctctttttgg ctggaagttc cacctcgagt 900 accgcaagac agatgccttc cgccgccgcc gctggcgccg tcgcatggag ccactggaga 960 agacggggcc tgcagctgtg tttgcccttg agggggccct gggcggcgtg atggatgaca 1020 agagtgaaga ttccatgtcc gtctccacct tgagcttcgg tgtgaacaga cccacgattt 1080 cctgcatatt cgactatggg aaccgctacc atctacgctg ctacatgtac caggcccggg 1140 acctggctgc gatggacaag gactcttttt ctgatcccta tgccatcgtc tccttcctgc 1200 accagagcca gaagacggtg gtggtgaaga acacccttaa ccccacctgg gaccagacgc 1260 tcatcttcta cgagatcgag atctttggcg agccggccac agttgctgag caaccgccca 1320 gcattgtggt ggagctgtac gaccatgaca cttatggtgc agacgagttt atgggtcgct 1380 gcatctgtca accgagtctg gaacggatgc cacggctggc ctggttccca ctgacgaggg 1440 gcagccagcc gtcgggggag ctgctggcct cttttgagct catccagaga gagaagccgg 1500 ccatccacca tattcctggt tttgaggtgc aggagacatc aaggatcctg gatgagtctg 1560 aggacacaga cctgccctac ccaccacccc agagggaggc caacatctac atggttcctc 1620 agaacatcaa gccagcgctc cagcgtaccg ccatcgagat cctggcatgg ggcctgcgga 1680 acatgaagag ttaccagctg gccaacatct cctcccccag cctcgtggta gagtgtgggg 1740 gccagacggt gcagtcctgt gtcatcagga acctccggaa gaaccccaac tttgacatct 1800 gcaccctctt catggaagtg atgctgccca gggaggagct ctactgcccc cccatcaccg 1860 tcaaggtcat cgataaccgc cagtttggcc gccggcctgt ggtgggccag tgtaccatcc 1920 gctccctgga gagcttcctg tgtgacccct actcggcgga gagtccatcc ccacagggtg 1980 gcccagacga tgtgagccta ctcagtcctg gggaagacgt gctcatcgac attgatgaca 2040 aggagcccct catccccatc caggaggaag agttcatcga ttggtggagc aaattctttg 2100 cctccatagg ggagagggaa aagtgcggct cctacctgga gaaggatttt gacaccctga 2160 aggtctatga cacacagctg gagaatgtgg aggcctttga gggcctgtct gacttttgta 2220 acaccttcaa gctgtaccgg ggcaagacgc aggaggagac agaagatcca tctgtgattg 2280 gtgaatttaa gggcctcttc aaaatttatc ccctcccaga agacccagcc atccccatgc 2340 ccccaagaca gttccaccag ctggccgccc agggacccca ggagtgcttg gtccgtatct 2400 acattgtccg agcatttggc ctgcagccca aggaccccaa tggaaagtgt gatccttaca 2460 tcaagatctc catagggaag aaatcagtga gtgaccagga taactacatc ccctgcacgc 2520 tggagcccgt atttggaaag atgttcgagc tgacctgcac tctgcctctg gagaaggacc 2580 taaagatcac tctctatgac tatgacctcc tctccaagga cgaaaagatc ggtgagacgg 2640 tcgtcgacct ggagaacagg ctgctgtcca agtttggggc tcgctgtgga ctcccacaga 2700 cctactgtgt ctctggaccg aaccagtggc gggaccagct ccgcccctcc cagctcctcc 2760 acctcttctg ccagcagcat agagtcaagg cacctgtgta ccggacagac cgtgtaatgt 2820 ttcaggataa agaatattcc attgaagaga tagaggctgg caggatccca aacccacacc 2880 tgggcccagt ggaggagcgt ctggctctgc atgtgcttca gcagcagggc ctggtcccgg 2940 agcacgtgga gtcacggccc ctctacagcc ccctgcagcc agacatcgag caggggaagc 3000 tgcagatgtg ggtcgaccta tttccgaagg ccctggggcg gcctggacct cccttcaaca 3060 tcaccccacg gagagccaga aggtttttcc tgcgttgtat tatctggaat accagagatg 3120 tgatcctgga tgacctgagc ctcacggggg agaagatgag cgacatttat gtgaaaggtt 3180 ggatgattgg ctttgaagaa cacaagcaaa agacagacgt gcattatcgt tccctgggag 3240 gtgaaggcaa cttcaactgg aggttcattt tccccttcga ctacctgcca gctgagcaag 3300 tctgtaccat tgccaagaag gatgccttct ggaggctgga caagactgag agcaaaatcc 3360 cagcacgagt ggtgttccag atctgggaca atgacaagtt ctcctttgat gattttctgg 3420 gctccctgca gctcgatctc aaccgcatgc ccaagccagc caagacagcc aagaagtgct 3480 ccttggacca gctggatgat gctttccacc cagaatggtt tgtgtccctt tttgagcaga 3540 aaacagtgaa gggctggtgg ccctgtgtag cagaagaggg tgagaagaaa atactggcgg 3600 gcaagctgga aatgaccttg gagattgtag cagagagtga gcatgaggag cggcctgctg 3660 gccagggccg ggatgagccc aacatgaacc ctaagcttga ggacccaagg cgccccgaca 3720 cctccttcct gtggtttacc tccccataca agaccatgaa gttcatcctg tggcggcgtt 3780 tccggtgggc catcatcctc ttcatcatcc tcttcatcct gctgctgttc ctggccatct 3840 tcatctacgc cttcccgaac tatgctgcca tgaagctggt gaagcccttc agctgaggac 3900 tctcctgccc tgtagaaggg gccgtggggt cccctccagc atgggactgg cctgcctcct 3960 ccgcccagct cggcgagctc ctccagacct cctaggcctg attgtcctgc cagggtgggc 4020 agacagacag atggaccggc ccacactccc agagttgcta acatggagct ctgagatcac 4080 cccacttcca tcatttcctt ctcccccaac ccaacgcttt tttggatcag ctcagacata 4140 tttcagtata aaacagttgg aaccacaaaa aaaaaaaaaa aaagtcgacg cggccgcaat 4200 aaaagatctt tattttcatt agatctgtgt gttggttttt tgtgtgtcta gagcatggct 4260 acgtagataa gtagcatggc gggttaatca ttaactacaa ggaaccccta gtgatggagt 4320 tggccactcc ctctctgcgc gctcgctcgc tcactgaggc cgggcgacca aaggtcgccc 4380 gacgcccggg ctttgcccgg gcggcctcag tgagcgagcg agcgcgcagc tgcattaatg 4440 aatcggccaa cgcgcgggga gaggcggttt gcgtattggg cgctcttccg cttcctcgct 4500 cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc 4560 ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg 4620 ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg 4680 cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg 4740 actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac 4800 cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca 4860 tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt 4920 gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc 4980 caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag 5040 agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac 5100 tagaagaaca gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt 5160 tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa 5220 gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg 5280 gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa 5340 aaggatcttc acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat 5400 atatgagtaa aaatattccg gaattgccag ctggggcgcc ctctggtaag gttgggaagc 5460 cctgcaaagt aaactggatg gctttcttgc cgccaaggat ctgatggcgc aggggatcaa 5520 gatctgatca agagacagga tgaggatcgt ttcgcatgat tgaacaagat ggattgcacg 5580 caggttctcc ggccgcttgg gtggagaggc tattcggcta tgactgggca caacagacaa 5640 tcggctgctc tgatgccgcc gtgttccggc tgtcagcgca ggggcgcccg gttctttttg 5700 tcaagaccga cctgtccggt gccctgaatg aactgcagga cgaggcagcg cggctatcgt 5760 ggctggccac gacgggcgtt ccttgcgcag ctgtgctcga cgttgtcact gaagcgggaa 5820 gggactggct gctattgggc gaagtgccgg ggcaggatct cctgtcatcc caccttgctc 5880 ctgccgagaa agtatccatc atggctgatg caatgcggcg gctgcatacg cttgatccgg 5940 ctacctgccc attcgaccac caagcgaaac atcgcatcga gcgagcacgt actcggatgg 6000 aagccggtct tgtcgatcag gatgatctgg acgaagagca tcaggggctc gcgccagccg 6060 aactgttcgc caggctcaag gcgcgcatgc ccgacggcga ggatctcgtc gtgacccatg 6120 gcgatgcctg cttgccgaat atcatggtgg aaaatggccg cttttctgga ttcatcgact 6180 gtggccggct gggtgtggcg gaccgctatc aggacatagc gttggctacc cgtgatattg 6240 ctgaagagct tggcggcgaa tgggctgacc gcttcctcgt gctttacggt atcgccgctc 6300 ccgattcgca gcgcatcgcc ttctatcgcc ttcttgacga gttcttctga accggtaata 6360 ttattgaagc atttatcagg gttattgtct catgagcgga tacatatttg aatgtattta 6420 gaaaaataaa caaatagggg ttccgcgcac atttccccga aaagtgccac ctgacgtcta 6480 agaaaccatt attatcatga cattaaccta taaaaatagg cgtatcacga ggccctttcg 6540 tctcgcgcgt ttcggtgatg acggtgaaaa cctctgacac atgcagctcc cggagacggt 6600 cacagcttgt ctgtaagcgg atgccgggag cagacaagcc cgtcagggcg cgtcagcggg 6660 tgttggcggg tgtcggggct ggcttaacta tgcggcatca gagcagattg tactgagagt 6720 gcaccatatg cggtgtgaaa taccgcacag atgcgtaagg agaaaatacc gcatcaggcg 6780 attccaacat ccaataaatc atacaggcaa ggcaaagaat tagcaaaatt aagcaataaa 6840 gcctcagagc ataaagctaa atcggttgta ccaaaaacat tatgaccctg taatactttt 6900 gcgggagaag cctttatttc aacgcaagga taaaaatttt tagaaccctc atatatttta 6960 aatgcaatgc ctgagtaatg tgtaggtaaa gattcaaacg ggtgagaaag gccggagaca 7020 gtcaaatcac catcaatatg atattcaacc gttctagctg ataaattcat gccggagagg 7080 gtagctattt ttgagaggtc tctacaaagg ctatcaggtc attgcctgag agtctggagc 7140 aaacaagaga atcgatgaac ggtaatcgta aaactagcat gtcaatcata tgtaccccgg 7200 ttgataatca gaaaagcccc aaaaacagga agattgtata agcaaatatt taaattgtaa 7260 gcgttaatat tttgttaaaa ttcgcgttaa atttttgtta aatcagctca ttttttaacc 7320 aataggccga aatcggcaaa atcccttata aatcaaaaga atagaccgag atagggttga 7380 gtgttgttcc agtttggaac aagagtccac tattaaagaa cgtggactcc aacgtcaaag 7440 ggcgaaaaac cgtctatcag ggcgatggcc cactacgtga accatcaccc taatcaagtt 7500 ttttggggtc gaggtgccgt aaagcactaa atcggaaccc taaagggagc ccccgattta 7560 gagcttgacg gggaaagccg gcgaacgtgg cgagaaagga agggaagaaa gcgaaaggag 7620 cgggcgctag ggcgctggca agtgtagcgg tcacgctgcg cgtaaccacc acacccgccg 7680 cgcttaatgc gccgctacag ggcgcgtact atggttgctt tgacgagcac gtataacgtg 7740 ctttcctcgt tagaatcaga gcgggagcta aacaggaggc cgattaaagg gattttagac 7800 aggaacggta cgccagaatc ctgagaagtg tttttataat cagtgaggcc accgagtaaa 7860 agagtctgtc catcacgcaa attaaccgtt gtcgcaatac ttctttgatt agtaataaca 7920 tcacttgcct gagtagaaga actcaaacta tcggccttgc tggtaatatc cagaacaata 7980 ttaccgccag ccattgcaac ggaatcgcca ttcgccattc aggctgcgca actgttggga 8040 agggcgatcg gtgcgggcct cttcgctatt acgccagctg cgcgctcgct cgctcactga 8100 ggccgcccgg gcaaagcccg ggcgtcgggc gacctttggt cgcccggcct cagtgagcga 8160 gcgagcgcgc agagagggag tggccaactc catcactagg ggttccttgt agttaatgat 8220 taacccgcca tgctacttat ctac 8244 <210> 20 <211> 7906 <212> DNA <213> Artificial Sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <223> full DYF expression cassette <400> 20 gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg 60 tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact ccatcactag 120 gggttccttg tagttaatga ttaacccgcc atgctctaga gtttaagctt gcatgtctaa 180 gctagaccct tcagattaaa aataactgag gtaagggcct gggtagggga ggtggtgtga 240 gacgctcctg tctctcctct atctgcccat cggccctttg gggaggagga atgtgcccaa 300 ggactaaaaa aaggccatgg agccagaggg gcgagggcaa cagacctttc atgggcaaac 360 cttggggccc tgctgtctag catgccccac tacgggtcta ggctgcccat gtaaggaggc 420 aaggcctggg gacacccgag atgcctggtt ataattaacc cagacatgtg gctgcccccc 480 cccccccaac acctgctgcc tctaaaaata accctgtccc tggtggatcc cctgcatgcg 540 aagatcttcg aacaaggctg tgggggactg agggcaggct gtaacaggct tgggggccag 600 ggcttatacg tgcctgggac tcccaaagta ttactgttcc atgttcccgg cgaagggcca 660 gctgtccccc gccagctaga ctcagcactt agtttaggaa ccagtgagca agtcagccct 720 tggggcagcc catacaaggc catggggctg ggcaagctgc acgcctgggt ccggggtggg 780 cacggtgccc gggcaacgag ctgaaagctc atctgctctc aggggcccct ccctggggac 840 agcccctcct ggctagtcac accctgtagg ctcctctata taacccaggg gcacaggggc 900 tgccctcatt ctaccaccac ctccacagca cagacagaca ctcaggagca gccagcggcg 960 cgcccaggta agtttagtct ttttgtcttt tatttcaggt cccggatccg gtggtggtgc 1020 aaatcaaaga actgctcctc agtggatgtt gcctttactt ctaggcctgt acggaagtgt 1080 tacttctgct ctaaaagctg cggaattgta cccgcggccg cggctagcca ccatgctgag 1140 ggtcttcatc ctctatgccg agaacgtcca cacacccgac accgacatca gcgatgccta 1200 ctgctccgcg gtgtttgcag gggtgaagaa gagaaccaaa gtcatcaaga acagcgtgaa 1260 ccctgtatgg aatgagggat ttgaatggga cctcaagggc atccccctgg accagggctc 1320 tgagcttcat gtggtggtca aagaccatga gacgatgggg aggaacaggt tcctggggga 1380 agccaaggtc ccactccgag aggtcctcgc cacccctagt ctgtccgcca gcttcaatgc 1440 ccccctgctg gacaccaaga agcagcccac aggggcctcg ctggtcctgc aggtgtccta 1500 cacaccgctg cctggagctg tgcccctgtt cccgccccct actcctctgg agccctcccc 1560 gactctgcct gacctggatg tagtggcaga cacaggagga gaggaagaca cagaggacca 1620 gggactcact ggagatgagg cggagccatt cctggatcaa agcggaggcc cgggggctcc 1680 caccacccca aggaaactac cttcacgtcc tccgccccac taccccggga tcaaaagaaa 1740 gcgaagtgcg cctacatcta gaaagctgct gtcagacaaa ccgcaggatt tccagatcag 1800 ggtccaggtg atcgaggggc gccagctgcc gggggtgaac atcaagcctg tggtcaaggt 1860 taccgctgca gggcagacca agcggacgcg gatccacaag ggaaacagcc cactcttcaa 1920 tgagactctt ttcttcaact tgtttgactc tcctggggag ctgtttgatg agcccatctt 1980 tatcacggtg gtagactctc gttctctcag gacagatgct ctcctcgggg agttccggat 2040 ggacgtgggc accatttaca gagagccccg gcacgcctat ctcaggaagt ggctgctgct 2100 ctcagaccct gatgacttct ctgctggggc cagaggctac ctgaaaacaa gcctttgtgt 2160 gctggggcct ggggacgaag cgcctctgga gagaaaagac ccctctgaag acaaggagga 2220 cattgaaagc aacctgctcc ggcccacagg cgtagccctg cgaggagccc acttctgcct 2280 gaaggtcttc cgggccgagg acttgccgca gatggacgat gccgtgatgg acaacgtgaa 2340 acagatcttt ggcttcgaga gtaacaagaa gaacttggtg gacccctttg tggaggtcag 2400 ctttgcgggg aaaatgctgt gcagcaagat cttggagaag acggccaacc ctcagtggaa 2460 ccagaacatc acactgcctg ccatgtttcc ctccatgtgc gaaaaaatga ggattcgtat 2520 catagactgg gaccgcctga ctcacaatga catcgtggct accacctacc tgagtatgtc 2580 gaaaatctct gcccctggag gagaaataga agaggagcct gcaggtgctg tcaagccttc 2640 gaaagcctca gacttggatg actacctggg cttcctcccc acttttgggc cctgctacat 2700 caacctctat ggcagtccca gagagttcac aggcttccca gacccctaca cagagctcaa 2760 cacaggcaag ggggaaggtg tggcttatcg tggccggctt ctgctctccc tggagaccaa 2820 gctggtggag cacagtgaac agaaggtgga ggaccttcct gcggatgaca tcctccgggt 2880 ggagaagtac cttaggaggc gcaagtactc cctgtttgcg gccttctact cagccaccat 2940 gctgcaggat gtggatgatg ccatccagtt tgaggtcagc atcgggaact acgggaacaa 3000 gttcgacatg acctgcctgc cgctggcctc caccactcag tacagccgtg cagtctttga 3060 cgggtgccac tactactacc taccctgggg taacgtgaaa cctgtggtgg tgctgtcatc 3120 ctactgggag gacatcagcc atagaatcga gactcagaac cagctgcttg ggattgctga 3180 ccggctggaa gctggcctgg agcaggtcca cctggccctg aaggcgcagt gctccacgga 3240 ggacgtggac tcgctggtgg ctcagctgac ggatgagctc atcgcaggct gcagccagcc 3300 tctgggtgac atccatgaga caccctctgc cacccacctg gaccagtacc tgtaccagct 3360 gcgcacccat cacctgagcc aaatcactga ggctgccctg gccctgaagc tcggccacag 3420 tgagctccct gcagctctgg agcaggcgga ggactggctc ctgcgtctgc gtgccctggc 3480 agaggagccc cagaacagcc tgccggacat cgtcatctgg atgctgcagg gagacaagcg 3540 tgtggcatac cagcgggtgc ccgcccacca agtcctcttc tcccggcggg gtgccaacta 3600 ctgtggcaag aattgtggga agctacagac aatctttctg aaatatccga tggagaaggt 3660 gcctggcgcc cggatgccag tgcagatacg ggtcaagctg tggtttgggc tctctgtgga 3720 tgagaaggag ttcaaccagt ttgctgaggg gaagctgtct gtctttgctg aaacctatga 3780 gaacgagact aagttggccc ttgttgggaa ctggggcaca acgggcctca cctaccccaa 3840 gttttctgac gtcacgggca agatcaagct acccaaggac agcttccgcc cctcggccgg 3900 ctggacctgg gctggagatt ggttcgtgtg tccggagaag actctgctcc atgacatgga 3960 cgccggtcac ctgagcttcg tggaagaggt gtttgagaac cagacccggc ttcccggagg 4020 ccagtggatc tacatgagtg acaactacac cgatgtgaac ggggagaagg tgcttcccaa 4080 ggatgacatt gagtgcccac tgggctggaa gtgggaagat gaggaatggt ccacagacct 4140 caaccgggct gtcgatgagc aaggctggga gtatagcatc accatccccc cggagcggaa 4200 gccgaagcac tgggtccctg ctgagaagat gtactacaca caccgacggc ggcgctgggt 4260 gcgcctgcgc aggagggatc tcagccaaat ggaagcactg aaaaggcaca ggcaggcgga 4320 ggcggagggc gagggctggg agtacgcctc tctttttggc tggaagttcc acctcgagta 4380 ccgcaagaca gatgccttcc gccgccgccg ctggcgccgt cgcatggagc cactggagaa 4440 gacggggcct gcagctgtgt ttgcccttga gggggccctg ggcggcgtga tggatgacaa 4500 gagtgaagat tccatgtccg tctccacctt gagcttcggt gtgaacagac ccacgatttc 4560 ctgcatattc gactatggga accgctacca tctacgctgc tacatgtacc aggcccggga 4620 cctggctgcg atggacaagg actctttttc tgatccctat gccatcgtct ccttcctgca 4680 ccagagccag aagacggtgg tggtgaagaa cacccttaac cccacctggg accagacgct 4740 catcttctac gagatcgaga tctttggcga gccggccaca gttgctgagc aaccgcccag 4800 cattgtggtg gagctgtacg accatgacac ttatggtgca gacgagttta tgggtcgctg 4860 catctgtcaa ccgagtctgg aacggatgcc acggctggcc tggttcccac tgacgagggg 4920 cagccagccg tcgggggagc tgctggcctc ttttgagctc atccagagag agaagccggc 4980 catccaccat attcctggtt ttgaggtgca ggagacatca aggatcctgg atgagtctga 5040 ggacacagac ctgccctacc caccacccca gagggaggcc aacatctaca tggttcctca 5100 gaacatcaag ccagcgctcc agcgtaccgc catcgagatc ctggcatggg gcctgcggaa 5160 catgaagagt taccagctgg ccaacatctc ctcccccagc ctcgtggtag agtgtggggg 5220 ccagacggtg cagtcctgtg tcatcaggaa cctccggaag aaccccaact ttgacatctg 5280 caccctcttc atggaagtga tgctgcccag ggaggagctc tactgccccc ccatcaccgt 5340 caaggtcatc gataaccgcc agtttggccg ccggcctgtg gtgggccagt gtaccatccg 5400 ctccctggag agcttcctgt gtgaccccta ctcggcggag agtccatccc cacagggtgg 5460 cccagacgat gtgagcctac tcagtcctgg ggaagacgtg ctcatcgaca ttgatgacaa 5520 ggagcccctc atccccatcc aggaggaaga gttcatcgat tggtggagca aattctttgc 5580 ctccataggg gagagggaaa agtgcggctc ctacctggag aaggattttg acaccctgaa 5640 ggtctatgac acacagctgg agaatgtgga ggcctttgag ggcctgtctg acttttgtaa 5700 caccttcaag ctgtaccggg gcaagacgca ggaggagaca gaagatccat ctgtgattgg 5760 tgaatttaag ggcctcttca aaatttatcc cctcccagaa gacccagcca tccccatgcc 5820 cccaagacag ttccaccagc tggccgccca gggaccccag gagtgcttgg tccgtatcta 5880 cattgtccga gcatttggcc tgcagcccaa ggaccccaat ggaaagtgtg atccttacat 5940 caagatctcc atagggaaga aatcagtgag tgaccaggat aactacatcc cctgcacgct 6000 ggagcccgta tttggaaaga tgttcgagct gacctgcact ctgcctctgg agaaggacct 6060 aaagatcact ctctatgact atgacctcct ctccaaggac gaaaagatcg gtgagacggt 6120 cgtcgacctg gagaacaggc tgctgtccaa gtttggggct cgctgtggac tcccacagac 6180 ctactgtgtc tctggaccga accagtggcg ggaccagctc cgcccctccc agctcctcca 6240 cctcttctgc cagcagcata gagtcaaggc acctgtgtac cggacagacc gtgtaatgtt 6300 tcaggataaa gaatattcca ttgaagagat agaggctggc aggatcccaa acccacacct 6360 gggcccagtg gaggagcgtc tggctctgca tgtgcttcag cagcagggcc tggtcccgga 6420 gcacgtggag tcacggcccc tctacagccc cctgcagcca gacatcgagc aggggaagct 6480 gcagatgtgg gtcgacctat ttccgaaggc cctggggcgg cctggacctc ccttcaacat 6540 caccccacgg agagccagaa ggtttttcct gcgttgtatt atctggaata ccagagatgt 6600 gatcctggat gacctgagcc tcacggggga gaagatgagc gacatttatg tgaaaggttg 6660 gatgattggc tttgaagaac acaagcaaaa gacagacgtg cattatcgtt ccctgggagg 6720 tgaaggcaac ttcaactgga ggttcatttt ccccttcgac tacctgccag ctgagcaagt 6780 ctgtaccatt gccaagaagg atgccttctg gaggctggac aagactgaga gcaaaatccc 6840 agcacgagtg gtgttccaga tctgggacaa tgacaagttc tcctttgatg attttctggg 6900 ctccctgcag ctcgatctca accgcatgcc caagccagcc aagacagcca agaagtgctc 6960 cttggaccag ctggatgatg ctttccaccc agaatggttt gtgtcccttt ttgagcagaa 7020 aacagtgaag ggctggtggc cctgtgtagc agaagagggt gagaagaaaa tactggcggg 7080 caagctggaa atgaccttgg agattgtagc agagagtgag catgaggagc ggcctgctgg 7140 ccagggccgg gatgagccca acatgaaccc taagcttgag gacccaaggc gccccgacac 7200 ctccttcctg tggtttacct ccccatacaa gaccatgaag ttcatcctgt ggcggcgttt 7260 ccggtgggcc atcatcctct tcatcatcct cttcatcctg ctgctgttcc tggccatctt 7320 catctacgcc ttcccgaact atgctgccat gaagctggtg aagcccttca gctgaggact 7380 ctcctgccct gtagaagggg ccgtggggtc ccctccagca tgggactggc ctgcctcctc 7440 cgcccagctc ggcgagctcc tccagacctc ctaggcctga ttgtcctgcc agggtgggca 7500 gacagacaga tggaccggcc cacactccca gagttgctaa catggagctc tgagatcacc 7560 ccacttccat catttccttc tcccccaacc caacgctttt ttggatcagc tcagacatat 7620 ttcagtataa aacagttgga accacaaaaa aaaaaaaaaa aagtcgacgc ggccgcaata 7680 aaagatcttt attttcatta gatctgtgtg ttggtttttt gtgtgtctag agcatggcta 7740 cgtagataag tagcatggcg ggttaatcat taactacaag gaacccctag tgatggagtt 7800 ggccactccc tctctgcgcg ctcgctcgct cactgaggcc gggcgaccaa aggtcgcccg 7860 acgcccgggc tttgcccggg cggcctcagt gagcgagcga gcgcgc 7906 <210> 21 <211> 6 <212> PRT <213> Artificial Sequence <220> <223> Description of Artificial Sequence: Synthetic 6xHis tag <400> 21 His His His His His His 1 5 <210> 22 <211> 250 <212> RNA <213> Artificial Sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <221> misc_feature <222> (1)..(250) <223> This sequence may encompass 100-250 nucleotides <400> 22 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 60 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 120 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 180 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 240 aaaaaaaaaa 250 SEQUENCE LISTING <110> RESEARCH INSTITUTE AT NATIONWIDE CHILDREN'S HOSPITAL <120> GENE THERAPY WITH DYSFERLIN DUAL VECTORS <130> 106887-8070 <140> <141> <150> 63/024,338 <151> 2020-05-13 <160> 22 <170> PatentIn version 3.5 <210> 1 <211> 3340 <212> DNA <213> Homo sapiens <220> <223> 5' hDYSF DNA fragment v1 <400> 1 atgctgaggg tcttcatcct ctatgccgag aacgtccaca cacccgacac cgacatcagc 60 gatgcctact gctccgcggt gtttgcaggg gtgaagaaga gaaccaaagt catcaagaac 120 agcgtgaacc ctgtatggaa tgagggattt gaatgggacc tcaagggcat ccccctggac 180 cagggctctg agcttcatgt ggtggtcaaa gaccatgaga cgatggggag gaacaggttc 240 ctgggggaag ccaaggtccc actccgagag gtcctcgcca cccctagtct gtccgccagc 300 ttcaatgccc ccctgctgga caccaagaag cagccacag gggcctcgct ggtcctgcag 360 gtgtcctaca caccgctgcc tggagctgtg cccctgttcc cgccccctac tcctctggag 420 ccctccccga ctctgcctga cctggatgta gtggcagaca caggaggaga ggaagacaca 480 gaggaccagg gactcactgg agatgaggcg gagccattcc tggatcaaag cggaggcccg 540 ggggctccca ccaccccaag gaaactacct tcacgtcctc cgccccacta ccccgggatc 600 aaaagaaagc gaagtgcgcc tacatctaga aagctgctgt cagacaaacc gcaggatttc 660 cagatcaggg tccaggtgat cgaggggcgc cagctgccgg gggtgaacat caagcctggg 720 gtcaaggtta ccgctgcagg gcagaccaag cggacgcgga tccacaaggg aaacagccca 780 ctcttcaatg agactctttt cttcaacttg tttgactctc ctggggagct gtttgatgag 840 cccatcttta tcacggtggt agactctcgt tctctcagga cagatgctct cctcggggag 900 ttccggatgg acgtgggcac catttacaga gagccccggc acgcctatct caggaagtgg 960 ctgctgctct cagaccctga tgacttctct gctggggcca gaggctacct gaaaacaagc 1020 ctttgtgtgc tggggcctgg ggacgaagcg cctctggaga gaaaagaccc ctctgaagac 1080 aaggaggaca ttgaaagcaa cctgctccgg cccacaggcg tagccctgcg aggagcccac 1140 ttctgcctga aggtcttccg ggccgaggac ttgccgcaga tggacgatgc cgtgatggac 1200 aacgtgaaac agatctttgg cttcgagagt aacaagaaga acttggtgga cccctttggg 1260 gaggtcagct ttgcgggggaa aatgctgtgc agcaagatct tggagaagac ggccaaccct 1320 cagtggaacc agaacatcac actgcctgcc atgtttccct ccatgtgcga aaaaatgagg 1380 attcgtatca tagactggga ccgcctgact cacaatgaca tcgtggctac cacctacctg 1440 agtatgtcga aaatctctgc ccctggagga gaaatagaag aggagcctgc aggtgctgtc 1500 aagccttcga aagcctcaga cttggatgac tacctgggct tcctccccac ttttgggccc 1560 tgctacatca acctctatgg cagtcccaga gagttcacag gcttcccaga cccctacaca 1620 gagctcaaca caggcaaggg ggaagggtg gcttatcgtg gccggcttct gctctccctg 1680 gagaccaagc tggtggagca cagtgaacag aaggtggagg accttcctgc ggatgacatc 1740 ctccgggtgg agaagtacct taggaggcgc aagtactccc tgtttgcggc cttctactca 1800 gccaccatgc tgcaggatgt ggatgatgcc atccagtttg aggtcagcat cgggaactac 1860 gggaacaagt tcgacatgac ctgcctgccg ctggcctcca ccactcagta cagccgtgca 1920 gtctttgacg ggtgccacta ctactaccta ccctggggta acgtgaaacc tgtggtggtg 1980 ctgtcatcct actgggagga catcagccat agaatcgaga ctcagaacca gctgcttggg 2040 attgctgacc ggctggaagc tggcctggag caggtccacc tggccctgaa ggcgcagtgc 2100 tccacggagg acgtggactc gctggtggct cagctgacgg atgagctcat cgcaggctgc 2160 agccagcctc tgggtgacat ccatgagaca ccctctgcca cccacctgga ccagtacctg 2220 taccagctgc gcacccatca cctgagccaa atcactgagg ctgccctggc cctgaagctc 2280 ggccacagtg agctccctgc agctctggag caggcggagg actggctcct gcgtctgcgt 2340 gccctggcag aggagcccca gaacagcctg ccggacatcg tcatctggat gctgcaggga 2400 gacaagcgtg tggcatacca gcgggtgccc gcccaccaag tcctcttctc ccggcggggt 2460 gccaactact gtggcaagaa ttgtgggaag ctacagacaa tctttctgaa atatccgatg 2520 gagaaggtgc ctggcgcccg gatgccagtg cagatacggg tcaagctggg gtttgggctc 2580 tctgtggatg agaaggagtt caaccagttt gctgagggga agctgtctgt ctttgctgaa 2640 acctatgaga acgagactaa gttggccctt gttgggaact ggggcacaac gggcctcacc 2700 taccccaagt tttctgacgt cacgggcaag atcaagctac ccaaggacag cttccgcccc 2760 tcggccggct ggacctgggc tggagattgg ttcgtgtgtc cggagaagac tctgctccat 2820 gacatggacg ccggtcacct gagcttcgtg gaagaggtgt ttgagaacca gacccggctt 2880 cccggaggcc agtggatcta catgagtgac aactacaccg atgtgaacgg ggagaaggtg 2940 cttcccaagg atgacattga gtgcccactg ggctggaagt gggaagatga ggaatggtcc 3000 acagacctca accgggctgt cgatgagcaa ggctgggagt atagcatcac catccccccg 3060 gagcggaagc cgaagcactg ggtccctgct gagaagatgt actacacaca ccgacggcgg 3120 cgctgggtgc gcctgcgcag gagggatctc agccaaatgg aagcactgaa aaggcacagg 3180 caggcggagg cggagggcga gggctggggag tacgcctctc tttttggctg gaagttccac 3240 ctcgagtacc gcaagacaga tgccttccgc cgccgccgct ggcgccgtcg catggagcca 3300 ctggagaaga cggggcctgc agctgtgttt gcccttgagg 3340 <210> 2 <211> 3866 <212> DNA <213> Homo sapiens <220> <223> 3' hDYSF DNA fragment v1 <400> 2 tcgtcatctg gatgctgcag ggagacaagc gtgtggcata ccagcgggtg cccgcccacc 60 aagtcctctt ctcccggcgg ggtgccaact actgtggcaa gaattgtggg aagctacaga 120 caatctttct gaaatatccg atggagaagg tgcctggcgc ccggatgcca gtgcagatac 180 gggtcaagct gtggtttggg ctctctgtgg atgagaagga gttcaaccag tttgctgagg 240 ggaagctgtc tgtctttgct gaaacctatg agaacgagac taagttggcc cttgttggga 300 actggggcac aacgggcctc acctacccca agttttctga cgtcacgggc aagatcaagc 360 tacccaagga cagcttccgc ccctcggccg gctggacctg ggctggagat tggttcgtgt 420 gtccggagaa gactctgctc catgacatgg acgccggtca cctgagcttc gtggaagagg 480 tgtttgagaa ccagacccgg cttcccggag gccagtggat ctacatgagt gacaactaca 540 ccgatgtgaa cggggagaag gtgcttccca aggatgacat tgagtgccca ctgggctgga 600 agtgggaaga tgaggaatgg tccacagacc tcaaccgggc tgtcgatgag caaggctggg 660 agtatagcat caccatcccc ccggagcgga agccgaagca ctgggtccct gctgagaaga 720 tgtactacac acaccgacgg cggcgctggg tgcgcctgcg caggagggat ctcagccaaa 780 tggaagcact gaaaaggcac aggcaggcgg aggcggaggg cgagggctgg gagtacgcct 840 ctctttttgg ctggaagttc cacctcgagt accgcaagac agatgccttc cgccgccgcc 900 gctggcgccg tcgcatggag ccactggaga agacggggcc tgcagctgtg tttgcccttg 960 agggggccct gggcggcgtg atggatgaca agagtgaaga ttccatgtcc gtctccacct 1020 tgagcttcgg tgtgaacaga cccacgattt cctgcatatt cgactatggg aaccgctacc 1080 atctacgctg ctacatgtac caggcccggg acctggctgc gatggacaag gactcttttt 1140 ctgatcccta tgccatcgtc tccttcctgc accagagcca gaagacggtg gtggtgaaga 1200 acacccttaa ccccacctgg gaccagacgc tcatcttcta cgagatcgag atctttggcg 1260 agccggccac agttgctgag caaccgccca gcattgtggt ggagctgtac gaccatgaca 1320 cttatggtgc agacgagttt atgggtcgct gcatctgtca accgagtctg gaacggatgc 1380 cacggctggc ctggttccca ctgacgaggg gcagccagcc gtcgggggag ctgctggcct 1440 cttttgagct catccagaga gagaagccgg ccatccacca tattcctggt tttgaggtgc 1500 aggagacatc aaggatcctg gatgagtctg aggacacaga cctgccctac ccaccacccc 1560 agagggaggc cacacatctac atggttcctc agaacatcaa gccagcgctc cagcgtaccg 1620 ccatcgagat cctggcatgg ggcctgcgga acatgaagag ttaccagctg gccaacatct 1680 cctcccccag cctcgtggta gagtgtgggg gccagacggt gcagtcctgt gtcatcagga 1740 acctccggaa gaaccccaac tttgacatct gcaccctctt catggaagtg atgctgccca 1800 gggaggagct ctactgcccc cccatcaccg tcaaggtcat cgataaccgc cagtttggcc 1860 gccggcctgt ggtgggccag tgtaccatcc gctccctgga gagcttcctg tgtgacccct 1920 actcggcgga gagtccatcc ccacagggtg gcccagacga tgtgagccta ctcagtcctg 1980 gggaagacgt gctcatcgac attgatgaca aggagcccct catcccccatc caggaggaag 2040 agttcatcga ttggtggagc aaattctttg cctccatagg ggagagggaa aagtgcggct 2100 cctacctgga gaaggatttt gacaccctga aggtctatga cacacagctg gagaatgtgg 2160 aggcctttga gggcctgtct gacttttgta acaccttcaa gctgtaccgg ggcaagacgc 2220 aggagggagac agaagatcca tctgtgattg gtgaatttaa gggcctcttc aaaatttatc 2280 ccctcccaga agacccagcc atccccatgc ccccaagaca gttccaccag ctggccgccc 2340 agggacccca ggaggtgcttg gtccgtatct acattgtccg agcatttggc ctgcagccca 2400 aggaccccaa tggaaagtgt gatccttaca tcaagatctc catagggaag aaatcagtga 2460 gtgaccagga taactacatc ccctgcacgc tggagcccgt atttggaaag atgttcgagc 2520 tgacctgcac tctgcctctg gagaaggacc taaagatcac tctctatgac tatgacctcc 2580 tctccaagga cgaaaagatc ggtgagacgg tcgtcgacct ggagaacagg ctgctgtcca 2640 agtttggggc tcgctgtgga ctcccacaga cctactgtgt ctctggaccg aaccagtggc 2700 gggaccagct ccgcccctcc cagctcctcc acctcttctg ccagcagcat agagtcaagg 2760 cacctgtgta ccggacagac cgtgtaatgt ttcaggataa agaatattcc attgaagaga 2820 tagaggctgg caggatccca aacccacacc tgggcccagt ggaggagcgt ctggctctgc 2880 atgtgcttca gcagcagggc ctggtcccgg agcacgtgga gtcacggccc ctctacagcc 2940 ccctgcagcc agacatcgag caggggaagc tgcagatgtg ggtcgaccta tttccgaagg 3000 ccctggggcg gcctggacct cccttcaaca tcaccccacg gagagccaga aggtttttcc 3060 tgcgttgtat tatctggaat accagagatg tgatcctgga tgacctgagc ctcacggggg 3120 agaagatgag cgacatttat gtgaaaggtt ggatgattgg ctttgaagaa cacaagcaaa 3180 agacagacgt gcattatcgt tccctgggag gtgaaggcaa cttcaactgg aggttcattt 3240 tccccttcga ctacctgcca gctgagcaag tctgtaccat tgccaagaag gatgccttct 3300 ggaggctgga caagactgag agcaaaatcc cagcacgagt ggtgttccag atctggggaca 3360 atgacaagtt ctcctttgat gattttctgg gctccctgca gctcgatctc aaccgcatgc 3420 ccaagccagc caagacagcc aagaagtgct ccttggacca gctggatgat gctttccacc 3480 cagaatggtt tgtgtccctt tttgagcaga aaacagtgaa gggctggtgg ccctggtgtag 3540 cagaagaggg tgagaagaaa atactggcgg gcaagctgga aatgaccttg gagattgtag 3600 cagagagtga gcatgaggag cggcctgctg gccagggccg ggatgagccc aacatgaacc 3660 ctaagcttga ggacccaagg cgccccgaca cctccttcct gtggtttacc tccccataca 3720 agaccatgaa gttcatcctg tggcggcgtt tccggtgggc catcatcctc ttcatcatcc 3780 tcttcatcct gctgctgttc ctggccatct tcatctacgc cttcccgaac tatgctgcca 3840 tgaagctggt gaagcccttc agctga 3866 <210> 3 <211> 128 <212> DNA <213> Adeno-associated virus 2 <220> <223> ITR <400> 3 gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg 60 tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact ccatcactag 120 gggttcct 128 <210> 4 <211> 792 <212> DNA <213> artificial sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <223> promoter <400> 4 aagcttgcat gtctaagcta gacccttcag attaaaaata actgaggtaa gggcctgggt 60 aggggaggtg gtgtgagacg ctcctgtctc tcctctatct gcccatcggc cctttgggga 120 ggaggaatgt gcccaaggac taaaaaaagg ccatggagcc agaggggcga gggcaacaga 180 cctttcatgg gcaaaccttg gggccctgct gtctagcatg ccccactacg ggtctaggct 240 gcccatgtaa ggaggcaagg cctggggaca cccgagatgc ctggttataa ttaacccaga 300 catgtggctg cccccccccc cccaacacct gctgcctcta aaaataaccc tgtccctggt 360 ggatcccctg catgcgaaga tcttcgaaca aggctgtggg ggactgaggg caggctgtaa 420 caggcttggg ggccagggct tatacgtgcc tgggactccc aaagtattac tgttccatgt 480 tcccggcgaa gggccagctg tccccccgcca gctagactca gcacttagtt taggaaccag 540 tgagcaagtc agcccttggg gcagcccata caaggccatg gggctgggca agctgcacgc 600 ctgggtccgg ggtgggcacg gtgcccgggc aacgagctga aagctcatct gctctcaggg 660 gcccctccct ggggacagcc cctcctggct agtcacaccc tgtaggctcc tctatataac 720 ccaggggcac aggggctgcc ctcattctac caccacctcc acagcacaga cagacactca 780 ggagcagcca gc 792 <210> 5 <211> 148 <212> DNA <213> artificial sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <223> Intron <400> 5 aggtaagttt agtctttttg tctttattt caggtcccgg atccggtggt ggtgcaaatc 60 aaagaactgc tcctcagtgg atgttgcctt tacttctagg cctgtacgga agtgttactt 120 ctgctctaaa agctgcggaa ttgtaccc 148 <210> 6 <211> 4689 <212> DNA <213> artificial sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <223> DYSF F5' Expression Cassette v1 <400> 6 gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg 60 tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact ccatcactag 120 gggttccttg tagttaatga ttaacccgcc atgctctaga gtttaagctt gcatgtctaa 180 gctagaccct tcagattaaa aataactgag gtaagggcct gggtagggga ggtggtgtga 240 gacgctcctg tctctcctct atctgcccat cggccctttg gggaggagga atgtgcccaa 300 ggactaaaaa aaggccatgg agccagaggg gcgagggcaa cagacctttc atgggcaaac 360 cttggggccc tgctgtctag catgccccac tacgggtcta ggctgcccat gtaaggaggc 420 aaggcctggg gacacccgag atgcctggtt ataattaacc cagacatgtg gctgcccccc 480 cccccccaac acctgctgcc tctaaaaata accctgtccc tggtggatcc cctgcatgcg 540 aagatcttcg aacaaggctg tgggggactg agggcaggct gtaacaggct tgggggccag 600 ggctttatacg tgcctgggac tcccaaagta ttactgttcc atgttcccgg cgaagggcca 660 gctgtccccc gccagctaga ctcagcactt agtttaggaa ccagtgagca agtcagccct 720 tggggcagcc catacaaggc catggggctg ggcaagctgc acgcctgggt ccggggtggg 780 cacggtgccc gggcaacgag ctgaaagctc atctgctctc aggggcccct ccctggggac 840 agcccctcct ggctagtcac accctgtagg ctcctctata taacccaggg gcacaggggc 900 tgccctcatt ctaccaccac ctccacagca cagacagaca ctcaggagca gccagcggcg 960 cgcccaggta agtttagtct ttttgtcttt tatttcaggt cccggatccg gtggtggtgc 1020 aaatcaaaga actgctcctc agtggatgtt gcctttactt ctaggcctgt acggaagtgt 1080 tacttctgct ctaaaagctg cggaattgta cccgcggccg cggctagcca ccatgctgag 1140 ggtcttcatc ctctatgccg agaacgtcca cacacccgac accgacatca gcgatgccta 1200 ctgctccgcg gtgtttgcag gggtgaagaa gagaaccaaa gtcatcaaga acagcgtgaa 1260 ccctgtatgg aatgagggat ttgaatggga cctcaagggc atccccctgg accagggctc 1320 tgagcttcat gtggtggtca aagaccatga gacgatgggg aggaacaggt tcctggggga 1380 agccaaggtc ccactccgag aggtcctcgc cacccctagt ctgtccgcca gcttcaatgc 1440 ccccctgctg gacaccaaga agcagcccac aggggcctcg ctggtcctgc aggtgtccta 1500 cacaccgctg cctggagctg tgcccctgtt cccgccccct actcctctgg agccctcccc 1560 gactctgcct gacctggatg tagtggcaga cacaggagga gaggaagaca cagaggacca 1620 gggactcact ggagatgagg cggagccatt cctggatcaa agcggaggcc cgggggctcc 1680 caccacccca aggaaactac cttcacgtcc tccgccccac taccccggga tcaaaagaaa 1740 gcgaagtgcg cctacatcta gaaagctgct gtcagacaaa ccgcaggatt tccagatcag 1800 ggtccaggtg atcgaggggc gccagctgcc gggggtgaac atcaagcctg tggtcaaggt 1860 taccgctgca gggcagacca agcggacgcg gatccacaag ggaaacagcc cactcttcaa 1920 tgagactctt ttcttcaact tgtttgactc tcctggggag ctgtttgatg agcccatctt 1980 tatcacggtg gtagactctc gttctctcag gacagatgct ctcctcgggg agttccggat 2040 ggacgtgggc accatttaca gagagccccg gcacgcctat ctcaggaagt ggctgctgct 2100 ctcagaccct gatgacttct ctgctggggc cagaggctac ctgaaaacaa gcctttgtgt 2160 gctggggcct ggggacgaag cgcctctgga gagaaaagac ccctctgaag acaaggagga 2220 cattgaaagc aacctgctcc ggcccacagg cgtagccctg cgaggagccc acttctgcct 2280 gaaggtcttc cgggccgagg acttgccgca gatggacgat gccgtgatgg acaacgtgaa 2340 acagatcttt ggcttcgaga gtaacaagaa gaacttggtg gacccctttg tggaggtcag 2400 ctttgcgggg aaaatgctgt gcagcaagat cttggagaag acggccaacc ctcagtgggaa 2460 ccagaacatc acactgcctg ccatgtttcc ctccatgtgc gaaaaaatga ggattcgtat 2520 catagactgg gaccgcctga ctcacaatga catcgtggct accacctacc tgagtatgtc 2580 gaaaatctct gcccctggag gagaaataga agaggagcct gcaggtgctg tcaagccttc 2640 gaaagcctca gacttggatg actacctggg cttcctcccc acttttgggc cctgctacat 2700 caacctctat ggcagtccca gagagttcac aggcttccca gacccctaca cagagctcaa 2760 2820 gctggtggag cacagtgaac agaaggtgga ggaccttcct gcggatgaca tcctccgggt 2880 ggagaagtac cttaggaggc gcaagtactc cctgtttgcg gccttctact cagccaccat 2940 gctgcaggat gtggatgatg ccatccagtt tgaggtcagc atcgggaact acgggaacaa 3000 gttcgacatg acctgcctgc cgctggcctc caccactcag tacagccgtg cagtctttga 3060 cgggtgccac tactactacc taccctgggg taacgtgaaa cctgtggtgg tgctgtcatc 3120 ctactgggag gacatcagcc atagaatcga gactcagaac cagctgcttg ggattgctga 3180 ccggctggaa gctggcctgg agcaggtcca cctggccctg aaggcgcagt gctccacgga 3240 ggacgtggac tcgctggtgg ctcagctgac ggatgagctc atcgcaggct gcagccagcc 3300 tctgggtgac atccatgaga caccctctgc cacccacctg gaccagtacc tgtaccagct 3360 gcgcacccat cacctgagcc aaatcactga ggctgccctg gccctgaagc tcggccacag 3420 tgagctccct gcagctctgg agcaggcgga ggactggctc ctgcgtctgc gtgccctggc 3480 agaggagccc cagaacagcc tgccggacat cgtcatctgg atgctgcagg gagacaagcg 3540 tgtggcatac cagcgggtgc ccgccaccca agtcctcttc tcccggcggg gtgccaacta 3600 ctgtggcaag aattgtggga agctacagac aatctttctg aaatatccga tggagaaggt 3660 gcctggcgcc cggatgccag tgcagatacg ggtcaagctg tggtttgggc tctctgtgga 3720 tgagaaggag ttcaaccagt ttgctgaggg gaagctgtct gtctttgctg aaacctatga 3780 gaacgagact aagttggccc ttgttgggaa ctggggcaca acgggcctca cctaccccaa 3840 gttttctgac gtcacgggca agatcaagct acccaaggac agcttccgcc cctcggccgg 3900 ctggacctgg gctggagatt ggttcgtgtg tccggagaag actctgctcc atgacatgga 3960 cgccggtcac ctgagcttcg tggaagaggt gtttgagaac cagacccggc ttcccggagg 4020 ccagtggatc tacatgagtg acaactacac cgatgtgaac ggggagaagg tgcttcccaa 4080 ggatgacatt gagtgcccac tgggctggaa gtgggaagat gaggaatggt ccacagacct 4140 caaccgggct gtcgatgagc aaggctggga gtatagcatc accatccccc cggagcggaa 4200 gccgaagcac tgggtccctg ctgagaagat gtactacaca caccgacggc ggcgctgggt 4260 gcgcctgcgc aggagggatc tcagccaaat ggaagcactg aaaaggcaca ggcaggcgga 4320 ggcggagggc gagggctggg agtacgcctc tctttttggc tggaagttcc acctcgagta 4380 ccgcaagaca gatgccttcc gccgccgccg ctggcgccgt cgcatggagc cactggagaa 4440 gacggggcct gcagctgtgt ttgcccttga gggcggccgc aataaaagat ctttattttc 4500 attagatctg tgtgttggtt ttttgtgtgt ctagagcatg gcgggttaat cattaactac 4560 aaggaacccc tagtgatgga gttggccact ccctctctgc gcgctcgctc gctcactgag 4620 gccgggcgac caaaggtcgc ccgacgcccg ggctttgccc gggcggcctc agtgagcgag 4680 cgagcgcgc 4689 <210> 7 <211> 49 <212> DNA <213> artificial sequence <220> <223> Description of Artificial Sequence: Synthetic oligonucleotide <220> <223> PolyA signal <400> 7 aataaaagat ctttattttc attagatctg tgtgttggtt ttttgtgtg 49 <210> 8 <211> 4592 <212> DNA <213> artificial sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <223> DYSF F3' Expression Cassette v1 <400> 8 gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg 60 tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact ccatcactag 120 gggttccttg tagttaatga ttaacccgcc atgctactta tctacgtagc catgctctgg 180 tcgactctag agttttcgtc atctggatgc tgcagggaga caagcgtgtg gcataccagc 240 gggtgcccgc ccaccaagtc ctcttctccc ggcggggtgc caactactgt ggcaagaatt 300 gtgggaagct acagacaatc tttctgaaat atccgatgga gaaggtgcct ggcgcccgga 360 tgccagtgca gatacgggtc aagctgtggt ttgggctctc tgtggatgag aaggagttca 420 accagtttgc tgaggggaag ctgtctgtct ttgctgaaac ctatgagaac gagactaagt 480 tggcccttgt tgggaactgg ggcacaacgg gcctcaccta ccccaagttt tctgacgtca 540 cgggcaagat caagctaccc aaggacagct tccgcccctc ggccggctgg acctgggctg 600 gagattggtt cgtgtgtccg gagaagactc tgctccatga catggacgcc ggtcacctga 660 gcttcgtgga agaggtgttt gagaaccaga cccggcttcc cggaggccag tggatctaca 720 tgagtgacaa ctacaccgat gtgaacgggg agaaggtgct tcccaaggat gacattgagt 780 gcccactggg ctggaagtgg gaagatgagg aatggtccac agacctcaac cgggctgtcg 840 atgagcaagg ctgggagtat agcatcacca tccccccgga gcggaagccg aagcactggg 900 tccctgctga gaagatgtac tacacacacc gacggcggcg ctgggtgcgc ctgcgcagga 960 gggatctcag ccaaatggaa gcactgaaaa ggcacaggca ggcggaggcg gagggcgagg 1020 gctgggagta cgcctctctt tttggctgga agttccacct cgagtaccgc aagacagatg 1080 ccttccgccg ccgccgctgg cgccgtcgca tggagccact ggagaagacg gggcctgcag 1140 ctgtgtttgc ccttgagggg gccctgggcg gcgtgatgga tgacaagagt gaagattcca 1200 tgtccgtctc caccttgagc ttcggtgtga acagaccac gatttcctgc atattcgact 1260 atgggaaccg ctaccatcta cgctgctaca tgtaccaggc ccgggacctg gctgcgatgg 1320 acaaggactc tttttctgat ccctatgcca tcgtctcctt cctgcaccag agccagaaga 1380 cggtggtggt gaagaacacc cttaacccca cctgggacca gacgctcatc ttctacgaga 1440 tcgagatctt tggcgagccg gccacagttg ctgagcaacc gcccagcatt gtggtggagc 1500 tgtacgacca tgacacttat ggtgcagacg agtttatggg tcgctgcatc tgtcaaccga 1560 gtctggaacg gatgccacgg ctggcctggt tccccactgac gaggggcagc cagccgtcgg 1620 gggagctgct ggcctctttt gagctcatcc agagagagaa gccggccatc caccatattc 1680 ctggttttga ggtgcaggag acatcaagga tcctggatga gtctgaggac acagacctgc 1740 cctacccacc accccagagg gaggccaaca tctacatggt tcctcagaac atcaagccag 1800 cgctccagcg taccgccatc gagatcctgg catggggcct gcggaacatg aagagttacc 1860 agctggccaa catctcctcc cccagcctcg tggtagagtg tgggggccag acggtgcagt 1920 cctgtgtcat caggaacctc cggaagaacc ccaactttga catctgcacc ctcttcatgg 1980 aagtgatgct gcccagggag gagctctact gcccccccat caccgtcaag gtcatcgata 2040 accgccagtt tggccgccgg cctgtggtgg gccagtgtac catccgctcc ctggagagct 2100 tcctgtgtga cccctactcg gcggagagtc catccccaca gggtggccca gacgatgtga 2160 gcctactcag tcctgggggaa gacgtgctca tcgacattga tgacaaggag cccctcatcc 2220 ccatccagga ggaagagttc atcgattggt ggagcaaatt ctttgcctcc ataggggaga 2280 gggaaaagtg cggctcctac ctggagaagg attttgacac cctgaaggtc tatgacacac 2340 agctggagaa tgtggaggcc tttgagggcc tgtctgactt ttgtaacacc ttcaagctgt 2400 accggggcaa gacgcaggag gagacagaag atccatctgt gattggtgaa tttaagggcc 2460 tcttcaaaat ttatcccctc ccagaagacc cagccatccc catgccccca agacagttcc 2520 accagctggc cgcccaggga ccccaggagt gcttggtccg tatctacatt gtccgagcat 2580 ttggcctgca gcccaaggac cccaatggaa agtgtgatcc ttacatcaag atctccatag 2640 ggaagaaatc agtgagtgac caggataact acatcccctg cacgctggag cccgtatttg 2700 gaaagatgtt cgagctgacc tgcactctgc ctctggagaa ggacctaaag atcactctct 2760 atgactatga cctcctctcc aaggacgaaa agatcggtga gacggtcgtc gacctggaga 2820 acaggctgct gtccaagttt ggggctcgct gtggactccc acagacctac tgtgtctctg 2880 gaccgaacca gtggcgggac cagctccgcc cctcccagct cctccacctc ttctgccagc 2940 agcatagagt caaggcacct gtgtaccgga cagaccgtgt aatgtttcag gataaagaat 3000 attccattga agagatagag gctggcagga tcccaaaccc acacctgggc ccagtggagg 3060 agcgtctggc tctgcatgtg cttcagcagc agggcctggt cccggagcac gtggagtcac 3120 ggcccctcta cagccccctg cagccagaca tcgagcaggg gaagctgcag atgtgggtcg 3180 acctatttcc gaaggccctg gggcggcctg gacctccctt caacatcacc ccacggagag 3240 ccagaaggtt tttcctgcgt tgtattatct ggaataccag agatgtgatc ctggatgacc 3300 tgagcctcac gggggagaag atgagcgaca tttatgtgaa aggttggatg attggctttg 3360 aagaacacaa gcaaaagaca gacgtgcatt atcgttccct gggaggtgaa ggcaacttca 3420 actggaggtt cattttcccc ttcgactacc tgccagctga gcaagtctgt accattgcca 3480 agaaggatgc cttctggagg ctggacaaga ctgagagcaa aatcccagca cgagtggtgt 3540 tccagatctg ggacaatgac aagttctcct ttgatgattt tctgggctcc ctgcagctcg 3600 atctcaaccg catgcccaag ccagccaaga cagccaagaa gtgctccttg gaccagctgg 3660 atgatgcttt ccacccagaa tggtttgtgt ccctttttga gcagaaaaca gtgaagggct 3720 ggtggccctg tgtagcagaa gagggtgaga agaaaatact ggcgggcaag ctggaaatga 3780 ccttggagat tgtagcagag agtgagcatg aggagcggcc tgctggccag ggccgggatg 3840 agcccaacat gaaccctaag cttgaggacc caaggcgccc cgacacctcc ttcctgtggt 3900 ttacctcccc atacaagacc atgaagttca tcctgtggcg gcgtttccgg tgggccatca 3960 tcctcttcat catcctcttc atcctgctgc tgttcctggc catcttcatc tacgccttcc 4020 cgaactatgc tgccatgaag ctggtgaagc ccttcagctg aggactctcc tgccctgtag 4080 aaggggccgt ggggtcccct ccagcatggg actggcctgc ctcctccgcc cagctcggcg 4140 agctcctcca gacctcctag gcctgattgt cctgccaggg tgggcagaca gacagatgga 4200 ccggcccaca ctcccagagt tgctaacatg gagctctgag atcaccccac ttccatcatt 4260 tccttctccc ccaacccaac gcttttttgg atcagctcag acatatttca gtataaaaca 4320 gttggaacca caaaaaaaaa aaaaaaaagt cgacgcggcc gcaataaaag atctttattt 4380 tcattagatc tgtgtgttgg ttttttgtgt gtctagagca tggctacgta gataagtagc 4440 atggcgggtt aatcattaac tacaaggaac ccctagtgat ggagttggcc actccctctc 4500 tgcgcgctcg ctcgctcact gaggccgggc gaccaaaggt cgcccgacgc ccgggctttg 4560 cccgggcggc ctcagtgagc gagcgagcgc gc 4592 <210> 9 <211> 1113 <212> PRT <213> Homo sapiens <220> <223> 5' hDYSF protein fragment <400> 9 Met Leu Arg Val Phe Ile Leu Tyr Ala Glu Asn Val His Thr Pro Asp 1 5 10 15 Thr Asp Ile Ser Asp Ala Tyr Cys Ser Ala Val Phe Ala Gly Val Lys 20 25 30 Lys Arg Thr Lys Val Ile Lys Asn Ser Val Asn Pro Val Trp Asn Glu 35 40 45 Gly Phe Glu Trp Asp Leu Lys Gly Ile Pro Leu Asp Gln Gly Ser Glu 50 55 60 Leu His Val Val Val Lys Asp His Glu Thr Met Gly Arg Asn Arg Phe 65 70 75 80 Leu Gly Glu Ala Lys Val Pro Leu Arg Glu Val Leu Ala Thr Pro Ser 85 90 95 Leu Ser Ala Ser Phe Asn Ala Pro Leu Leu Asp Thr Lys Lys Gln Pro 100 105 110 Thr Gly Ala Ser Leu Val Leu Gln Val Ser Tyr Thr Pro Leu Pro Gly 115 120 125 Ala Val Pro Leu Phe Pro Pro Pro Thr Pro Leu Glu Pro Ser Pro Thr 130 135 140 Leu Pro Asp Leu Asp Val Val Ala Asp Thr Gly Gly Glu Glu Asp Thr 145 150 155 160 Glu Asp Gln Gly Leu Thr Gly Asp Glu Ala Glu Pro Phe Leu Asp Gln 165 170 175 Ser Gly Gly Pro Gly Ala Pro Thr Thr Pro Arg Lys Leu Pro Ser Arg 180 185 190 Pro Pro Pro His Tyr Pro Gly Ile Lys Arg Lys Arg Ser Ala Pro Thr 195 200 205 Ser Arg Lys Leu Leu Ser Asp Lys Pro Gln Asp Phe Gln Ile Arg Val 210 215 220 Gln Val Ile Glu Gly Arg Gln Leu Pro Gly Val Asn Ile Lys Pro Val 225 230 235 240 Val Lys Val Thr Ala Ala Gly Gln Thr Lys Arg Thr Arg Ile His Lys 245 250 255 Gly Asn Ser Pro Leu Phe Asn Glu Thr Leu Phe Phe Asn Leu Phe Asp 260 265 270 Ser Pro Gly Glu Leu Phe Asp Glu Pro Ile Phe Ile Thr Val Val Asp 275 280 285 Ser Arg Ser Leu Arg Thr Asp Ala Leu Leu Gly Glu Phe Arg Met Asp 290 295 300 Val Gly Thr Ile Tyr Arg Glu Pro Arg His Ala Tyr Leu Arg Lys Trp 305 310 315 320 Leu Leu Leu Ser Asp Pro Asp Asp Phe Ser Ala Gly Ala Arg Gly Tyr 325 330 335 Leu Lys Thr Ser Leu Cys Val Leu Gly Pro Gly Asp Glu Ala Pro Leu 340 345 350 Glu Arg Lys Asp Pro Ser Glu Asp Lys Glu Asp Ile Glu Ser Asn Leu 355 360 365 Leu Arg Pro Thr Gly Val Ala Leu Arg Gly Ala His Phe Cys Leu Lys 370 375 380 Val Phe Arg Ala Glu Asp Leu Pro Gln Met Asp Asp Ala Val Met Asp 385 390 395 400 Asn Val Lys Gln Ile Phe Gly Phe Glu Ser Asn Lys Lys Asn Leu Val 405 410 415 Asp Pro Phe Val Glu Val Ser Phe Ala Gly Lys Met Leu Cys Ser Lys 420 425 430 Ile Leu Glu Lys Thr Ala Asn Pro Gln Trp Asn Gln Asn Ile Thr Leu 435 440 445 Pro Ala Met Phe Pro Ser Met Cys Glu Lys Met Arg Ile Arg Ile Ile 450 455 460 Asp Trp Asp Arg Leu Thr His Asn Asp Ile Val Ala Thr Thr Tyr Leu 465 470 475 480 Ser Met Ser Lys Ile Ser Ala Pro Gly Gly Glu Ile Glu Glu Glu Pro 485 490 495 Ala Gly Ala Val Lys Pro Ser Lys Ala Ser Asp Leu Asp Asp Tyr Leu 500 505 510 Gly Phe Leu Pro Thr Phe Gly Pro Cys Tyr Ile Asn Leu Tyr Gly Ser 515 520 525 Pro Arg Glu Phe Thr Gly Phe Pro Asp Pro Tyr Thr Glu Leu Asn Thr 530 535 540 Gly Lys Gly Glu Gly Val Ala Tyr Arg Gly Arg Leu Leu Leu Ser Leu 545 550 555 560 Glu Thr Lys Leu Val Glu His Ser Glu Gln Lys Val Glu Asp Leu Pro 565 570 575 Ala Asp Asp Ile Leu Arg Val Glu Lys Tyr Leu Arg Arg Arg Lys Tyr 580 585 590 Ser Leu Phe Ala Ala Phe Tyr Ser Ala Thr Met Leu Gln Asp Val Asp 595 600 605 Asp Ala Ile Gln Phe Glu Val Ser Ile Gly Asn Tyr Gly Asn Lys Phe 610 615 620 Asp Met Thr Cys Leu Pro Leu Ala Ser Thr Thr Gln Tyr Ser Arg Ala 625 630 635 640 Val Phe Asp Gly Cys His Tyr Tyr Tyr Leu Pro Trp Gly Asn Val Lys 645 650 655 Pro Val Val Val Leu Ser Ser Tyr Trp Glu Asp Ile Ser His Arg Ile 660 665 670 Glu Thr Gln Asn Gln Leu Leu Gly Ile Ala Asp Arg Leu Glu Ala Gly 675 680 685 Leu Glu Gln Val His Leu Ala Leu Lys Ala Gln Cys Ser Thr Glu Asp 690 695 700 Val Asp Ser Leu Val Ala Gln Leu Thr Asp Glu Leu Ile Ala Gly Cys 705 710 715 720 Ser Gln Pro Leu Gly Asp Ile His Glu Thr Pro Ser Ala Thr His Leu 725 730 735 Asp Gln Tyr Leu Tyr Gln Leu Arg Thr His His Leu Ser Gln Ile Thr 740 745 750 Glu Ala Ala Leu Ala Leu Lys Leu Gly His Ser Glu Leu Pro Ala Ala 755 760 765 Leu Glu Gln Ala Glu Asp Trp Leu Leu Arg Leu Arg Ala Leu Ala Glu 770 775 780 Glu Pro Gln Asn Ser Leu Pro Asp Ile Val Ile Trp Met Leu Gln Gly 785 790 795 800 Asp Lys Arg Val Ala Tyr Gln Arg Val Pro Ala His Gln Val Leu Phe 805 810 815 Ser Arg Arg Gly Ala Asn Tyr Cys Gly Lys Asn Cys Gly Lys Leu Gln 820 825 830 Thr Ile Phe Leu Lys Tyr Pro Met Glu Lys Val Pro Gly Ala Arg Met 835 840 845 Pro Val Gln Ile Arg Val Lys Leu Trp Phe Gly Leu Ser Val Asp Glu 850 855 860 Lys Glu Phe Asn Gln Phe Ala Glu Gly Lys Leu Ser Val Phe Ala Glu 865 870 875 880 Thr Tyr Glu Asn Glu Thr Lys Leu Ala Leu Val Gly Asn Trp Gly Thr 885 890 895 Thr Gly Leu Thr Tyr Pro Lys Phe Ser Asp Val Thr Gly Lys Ile Lys 900 905 910 Leu Pro Lys Asp Ser Phe Arg Pro Ser Ala Gly Trp Thr Trp Ala Gly 915 920 925 Asp Trp Phe Val Cys Pro Glu Lys Thr Leu Leu His Asp Met Asp Ala 930 935 940 Gly His Leu Ser Phe Val Glu Glu Val Phe Glu Asn Gln Thr Arg Leu 945 950 955 960 Pro Gly Gly Gln Trp Ile Tyr Met Ser Asp Asn Tyr Thr Asp Val Asn 965 970 975 Gly Glu Lys Val Leu Pro Lys Asp Asp Ile Glu Cys Pro Leu Gly Trp 980 985 990 Lys Trp Glu Asp Glu Glu Trp Ser Thr Asp Leu Asn Arg Ala Val Asp 995 1000 1005 Glu Gln Gly Trp Glu Tyr Ser Ile Thr Ile Pro Pro Glu Arg Lys 1010 1015 1020 Pro Lys His Trp Val Pro Ala Glu Lys Met Tyr Tyr Thr His Arg 1025 1030 1035 Arg Arg Arg Trp Val Arg Leu Arg Arg Arg Asp Leu Ser Gln Met 1040 1045 1050 Glu Ala Leu Lys Arg His Arg Gln Ala Glu Ala Glu Gly Glu Gly 1055 1060 1065 Trp Glu Tyr Ala Ser Leu Phe Gly Trp Lys Phe His Leu Glu Tyr 1070 1075 1080 Arg Lys Thr Asp Ala Phe Arg Arg Arg Arg Trp Arg Arg Arg Met 1085 1090 1095 Glu Pro Leu Glu Lys Thr Gly Pro Ala Ala Val Phe Ala Leu Glu 1100 1105 1110 <210> 10 <211> 1287 <212> PRT <213> Homo sapiens <220> <223> 3' hDYSF protein fragment <400> 10 Val Ile Trp Met Leu Gln Gly Asp Lys Arg Val Ala Tyr Gln Arg Val 1 5 10 15 Pro Ala His Gln Val Leu Phe Ser Arg Arg Gly Ala Asn Tyr Cys Gly 20 25 30 Lys Asn Cys Gly Lys Leu Gln Thr Ile Phe Leu Lys Tyr Pro Met Glu 35 40 45 Lys Val Pro Gly Ala Arg Met Pro Val Gln Ile Arg Val Lys Leu Trp 50 55 60 Phe Gly Leu Ser Val Asp Glu Lys Glu Phe Asn Gln Phe Ala Glu Gly 65 70 75 80 Lys Leu Ser Val Phe Ala Glu Thr Tyr Glu Asn Glu Thr Lys Leu Ala 85 90 95 Leu Val Gly Asn Trp Gly Thr Thr Gly Leu Thr Tyr Pro Lys Phe Ser 100 105 110 Asp Val Thr Gly Lys Ile Lys Leu Pro Lys Asp Ser Phe Arg Pro Ser 115 120 125 Ala Gly Trp Thr Trp Ala Gly Asp Trp Phe Val Cys Pro Glu Lys Thr 130 135 140 Leu Leu His Asp Met Asp Ala Gly His Leu Ser Phe Val Glu Glu Val 145 150 155 160 Phe Glu Asn Gln Thr Arg Leu Pro Gly Gly Gln Trp Ile Tyr Met Ser 165 170 175 Asp Asn Tyr Thr Asp Val Asn Gly Glu Lys Val Leu Pro Lys Asp Asp 180 185 190 Ile Glu Cys Pro Leu Gly Trp Lys Trp Glu Asp Glu Glu Trp Ser Thr 195 200 205 Asp Leu Asn Arg Ala Val Asp Glu Gln Gly Trp Glu Tyr Ser Ile Thr 210 215 220 Ile Pro Pro Glu Arg Lys Pro Lys His Trp Val Pro Ala Glu Lys Met 225 230 235 240 Tyr Tyr Thr His Arg Arg Arg Arg Trp Val Arg Leu Arg Arg Arg Asp 245 250 255 Leu Ser Gln Met Glu Ala Leu Lys Arg His Arg Gln Ala Glu Ala Glu 260 265 270 Gly Glu Gly Trp Glu Tyr Ala Ser Leu Phe Gly Trp Lys Phe His Leu 275 280 285 Glu Tyr Arg Lys Thr Asp Ala Phe Arg Arg Arg Arg Trp Arg Arg Arg 290 295 300 Met Glu Pro Leu Glu Lys Thr Gly Pro Ala Ala Val Phe Ala Leu Glu 305 310 315 320 Gly Ala Leu Gly Gly Val Met Asp Asp Lys Ser Glu Asp Ser Met Ser 325 330 335 Val Ser Thr Leu Ser Phe Gly Val Asn Arg Pro Thr Ile Ser Cys Ile 340 345 350 Phe Asp Tyr Gly Asn Arg Tyr His Leu Arg Cys Tyr Met Tyr Gln Ala 355 360 365 Arg Asp Leu Ala Ala Met Asp Lys Asp Ser Phe Ser Asp Pro Tyr Ala 370 375 380 Ile Val Ser Phe Leu His Gln Ser Gln Lys Thr Val Val Val Lys Asn 385 390 395 400 Thr Leu Asn Pro Thr Trp Asp Gln Thr Leu Ile Phe Tyr Glu Ile Glu 405 410 415 Ile Phe Gly Glu Pro Ala Thr Val Ala Glu Gln Pro Pro Ser Ile Val 420 425 430 Val Glu Leu Tyr Asp His Asp Thr Tyr Gly Ala Asp Glu Phe Met Gly 435 440 445 Arg Cys Ile Cys Gln Pro Ser Leu Glu Arg Met Pro Arg Leu Ala Trp 450 455 460 Phe Pro Leu Thr Arg Gly Ser Gln Pro Ser Gly Glu Leu Leu Ala Ser 465 470 475 480 Phe Glu Leu Ile Gln Arg Glu Lys Pro Ala Ile His His Ile Pro Gly 485 490 495 Phe Glu Val Gln Glu Thr Ser Arg Ile Leu Asp Glu Ser Glu Asp Thr 500 505 510 Asp Leu Pro Tyr Pro Pro Pro Gln Arg Glu Ala Asn Ile Tyr Met Val 515 520 525 Pro Gln Asn Ile Lys Pro Ala Leu Gln Arg Thr Ala Ile Glu Ile Leu 530 535 540 Ala Trp Gly Leu Arg Asn Met Lys Ser Tyr Gln Leu Ala Asn Ile Ser 545 550 555 560 Ser Pro Ser Leu Val Val Glu Cys Gly Gly Gln Thr Val Gln Ser Cys 565 570 575 Val Ile Arg Asn Leu Arg Lys Asn Pro Asn Phe Asp Ile Cys Thr Leu 580 585 590 Phe Met Glu Val Met Leu Pro Arg Glu Glu Leu Tyr Cys Pro Pro Ile 595 600 605 Thr Val Lys Val Ile Asp Asn Arg Gln Phe Gly Arg Arg Pro Val Val 610 615 620 Gly Gln Cys Thr Ile Arg Ser Leu Glu Ser Phe Leu Cys Asp Pro Tyr 625 630 635 640 Ser Ala Glu Ser Pro Ser Pro Gln Gly Gly Pro Asp Asp Val Ser Leu 645 650 655 Leu Ser Pro Gly Glu Asp Val Leu Ile Asp Ile Asp Asp Lys Glu Pro 660 665 670 Leu Ile Pro Ile Gln Glu Glu Glu Phe Ile Asp Trp Trp Ser Lys Phe 675 680 685 Phe Ala Ser Ile Gly Glu Arg Glu Lys Cys Gly Ser Tyr Leu Glu Lys 690 695 700 Asp Phe Asp Thr Leu Lys Val Tyr Asp Thr Gln Leu Glu Asn Val Glu 705 710 715 720 Ala Phe Glu Gly Leu Ser Asp Phe Cys Asn Thr Phe Lys Leu Tyr Arg 725 730 735 Gly Lys Thr Gln Glu Glu Thr Glu Asp Pro Ser Val Ile Gly Glu Phe 740 745 750 Lys Gly Leu Phe Lys Ile Tyr Pro Leu Pro Glu Asp Pro Ala Ile Pro 755 760 765 Met Pro Pro Arg Gln Phe His Gln Leu Ala Ala Gln Gly Pro Gln Glu 770 775 780 Cys Leu Val Arg Ile Tyr Ile Val Arg Ala Phe Gly Leu Gln Pro Lys 785 790 795 800 Asp Pro Asn Gly Lys Cys Asp Pro Tyr Ile Lys Ile Ser Ile Gly Lys 805 810 815 Lys Ser Val Ser Asp Gln Asp Asn Tyr Ile Pro Cys Thr Leu Glu Pro 820 825 830 Val Phe Gly Lys Met Phe Glu Leu Thr Cys Thr Leu Pro Leu Glu Lys 835 840 845 Asp Leu Lys Ile Thr Leu Tyr Asp Tyr Asp Leu Leu Ser Lys Asp Glu 850 855 860 Lys Ile Gly Glu Thr Val Val Asp Leu Glu Asn Arg Leu Leu Ser Lys 865 870 875 880 Phe Gly Ala Arg Cys Gly Leu Pro Gln Thr Tyr Cys Val Ser Gly Pro 885 890 895 Asn Gln Trp Arg Asp Gln Leu Arg Pro Ser Gln Leu Leu His Leu Phe 900 905 910 Cys Gln Gln His Arg Val Lys Ala Pro Val Tyr Arg Thr Asp Arg Val 915 920 925 Met Phe Gln Asp Lys Glu Tyr Ser Ile Glu Glu Ile Glu Ala Gly Arg 930 935 940 Ile Pro Asn Pro His Leu Gly Pro Val Glu Glu Arg Leu Ala Leu His 945 950 955 960 Val Leu Gln Gln Gln Gly Leu Val Pro Glu His Val Glu Ser Arg Pro 965 970 975 Leu Tyr Ser Pro Leu Gln Pro Asp Ile Glu Gln Gly Lys Leu Gln Met 980 985 990 Trp Val Asp Leu Phe Pro Lys Ala Leu Gly Arg Pro Gly Pro Pro Phe 995 1000 1005 Asn Ile Thr Pro Arg Arg Ala Arg Arg Phe Phe Leu Arg Cys Ile 1010 1015 1020 Ile Trp Asn Thr Arg Asp Val Ile Leu Asp Asp Leu Ser Leu Thr 1025 1030 1035 Gly Glu Lys Met Ser Asp Ile Tyr Val Lys Gly Trp Met Ile Gly 1040 1045 1050 Phe Glu Glu His Lys Gln Lys Thr Asp Val His Tyr Arg Ser Leu 1055 1060 1065 Gly Gly Glu Gly Asn Phe Asn Trp Arg Phe Ile Phe Pro Phe Asp 1070 1075 1080 Tyr Leu Pro Ala Glu Gln Val Cys Thr Ile Ala Lys Lys Asp Ala 1085 1090 1095 Phe Trp Arg Leu Asp Lys Thr Glu Ser Lys Ile Pro Ala Arg Val 1100 1105 1110 Val Phe Gln Ile Trp Asp Asn Asp Lys Phe Ser Phe Asp Asp Phe 1115 1120 1125 Leu Gly Ser Leu Gln Leu Asp Leu Asn Arg Met Pro Lys Pro Ala 1130 1135 1140 Lys Thr Ala Lys Lys Cys Ser Leu Asp Gln Leu Asp Asp Ala Phe 1145 1150 1155 His Pro Glu Trp Phe Val Ser Leu Phe Glu Gln Lys Thr Val Lys 1160 1165 1170 Gly Trp Trp Pro Cys Val Ala Glu Glu Gly Glu Lys Lys Ile Leu 1175 1180 1185 Ala Gly Lys Leu Glu Met Thr Leu Glu Ile Val Ala Glu Ser Glu 1190 1195 1200 His Glu Glu Arg Pro Ala Gly Gln Gly Arg Asp Glu Pro Asn Met 1205 1210 1215 Asn Pro Lys Leu Glu Asp Pro Arg Arg Pro Asp Thr Ser Phe Leu 1220 1225 1230 Trp Phe Thr Ser Pro Tyr Lys Thr Met Lys Phe Ile Leu Trp Arg 1235 1240 1245 Arg Phe Arg Trp Ala Ile Ile Leu Phe Ile Ile Leu Phe Ile Leu 1250 1255 1260 Leu Leu Phe Leu Ala Ile Phe Ile Tyr Ala Phe Pro Asn Tyr Ala 1265 1270 1275 Ala Met Lys Leu Val Lys Pro Phe Ser 1280 1285 <210> 11 <211> 6914 <212> DNA <213> Homo sapiens <220> <223> hDYSF DNA full length <400> 11 gcggccgccg cccagccagg tgcaaaatgc cgtgtcattg ggagactccg cagccggagc 60 attagattac agctcgacgg agctcgggaa gggcggcggg ggtggaagat gagcagaagc 120 ccctgttctc ggaacgccgg ctgacaagcg gggtgagcgc agccggggcg gggacccagc 180 ctagcccact ggagcagccg ggggtggccc gttccccttt aagagcaact gctctaagcc 240 aggagccaga gattcgagcc ggcctcgccc agccagccct ctccagcgag gggacccaca 300 agcggcgcct cggccctccc gacctttccg agccctcttt gcgccctggg cgcacggggc 360 cctacacgcg ccaagcatgc tgagggtctt catcctctat gccgagaacg tccacacacc 420 cgacaccgac atcagcgatg cctactgctc cgcggtgttt gcaggggtga agaagagaac 480 caaagtcatc aagaacagcg tgaaccctgt atggaatgag ggatttgaat gggacctcaa 540 gggcatcccc ctggaccagg gctctgagct tcatgtggtg gtcaaagacc atgagacgat 600 ggggaggaac aggttcctgg gggaagccaa ggtcccactc cgagaggtcc tcgccacccc 660 tagtctgtcc gccagcttca atgcccccct gctggacacc aagaagcagc ccacaggggc 720 ctcgctggtc ctgcaggtgt cctacacacc gctgcctgga gctgtgcccc tgttcccgcc 780 ccctactcct ctggagccct ccccgactct gcctgacctg gatgtagtgg cagacacagg 840 aggagagggaa gacacagagg accagggact cactggagat gaggcggagc cattcctgga 900 tcaaagcgga ggcccggggg ctcccaccac cccaaggaaa ctaccttcac gtcctccgcc 960 ccactacccc gggatcaaaa gaaagcgaag tgcgcctaca tctagaaagc tgctgtcaga 1020 caaaccgcag gatttccaga tcagggtcca ggtgatcgag gggcgccagc tgccgggggt 1080 gaacatcaag cctgtggtca aggttaccgc tgcagggcag accaagcgga cgcggatcca 1140 caagggaaac agcccactct tcaatgagac tcttttcttc aacttgtttg actctcctgg 1200 ggagctgttt gatgagccca tctttatcac ggtggtagac tctcgttctc tcaggacaga 1260 tgctctcctc ggggagttcc ggatggacgt gggcaccatt tacagagagc cccggcacgc 1320 ctatctcagg aagtggctgc tgctctcaga ccctgatgac ttctctgctg gggccagagg 1380 ctacctgaaa acaagccttt gtgtgctggg gcctggggac gaagcgcctc tggagagaaa 1440 agacccctct gaagacaagg aggacattga aagcaacctg ctccggccca caggcgtagc 1500 cctgcgagga gcccacttct gcctgaaggt cttccgggcc gaggacttgc cgcagatgga 1560 cgatgccgtg atggacaacg tgaaacagat ctttggcttc gagagtaaca agaagaactt 1620 ggtggacccc tttgtggagg tcagctttgc ggggaaaatg ctgtgcagca agatcttgga 1680 gaagacggcc aaccctcagt ggaaccagaa catcacactg cctgccatgt ttccctccat 1740 gtgcgaaaaa atgaggattc gtatcataga ctgggaccgc ctgactcaca atgacatcgt 1800 ggctaccacc tacctgagta tgtcgaaaat ctctgcccct ggaggagaaa tagaagagga 1860 gcctgcaggt gctgtcaagc cttcgaaagc ctcagacttg gatgactacc tgggcttcct 1920 ccccactttt gggccctgct acatcaacct ctatggcagt cccagagagt tcacaggctt 1980 cccagacccc tacacagagc tcaacacagg caagggggaa ggtgtggctt atcgtggccg 2040 gcttctgctc tccctggaga ccaagctggt ggagcacagt gaacagaagg tggaggacct 2100 tcctgcggat gacatcctcc gggtggagaa gtaccttagg aggcgcaagt actccctgtt 2160 tgcggccttc tactcagcca ccatgctgca ggatgtggat gatgccatcc agtttgaggt 2220 cagcatcggg aactacggga acaagttcga catgacctgc ctgccgctgg cctccaccac 2280 tcagtacagc cgtgcagtct ttgacgggtg ccactactac tacctaccct ggggtaacgt 2340 gaaacctgtg gtggtgctgt catcctactg ggaggacatc agccatagaa tcgagactca 2400 gaaccagctg cttgggattg ctgaccggct ggaagctggc ctggagcagg tccacctggc 2460 cctgaaggcg cagtgctcca cggaggacgt ggactcgctg gtggctcagc tgacggatga 2520 gctcatcgca ggctgcagcc agcctctggg tgacatccat gagacaccct ctgccaccca 2580 cctggaccag tacctgtacc agctgcgcac ccatcacctg agccaaatca ctgaggctgc 2640 cctggccctg aagctcggcc acagtgagct ccctgcagct ctggagcagg cggaggactg 2700 gctcctgcgt ctgcgtgccc tggcagagga gccccagaac agcctgccgg acatcgtcat 2760 ctggatgctg cagggagaca agcgtgtggc ataccagcgg gtgcccgccc accaagtcct 2820 cttctcccgg cggggtgcca actactgtgg caagaattgt gggaagctac agacaatctt 2880 tctgaaatat ccgatggaga aggtgcctgg cgcccggatg ccaggtgcaga tacgggtcaa 2940 gctgtggttt gggctctcag tggatgagaa ggagttcaac cagtttgctg aggggaagct 3000 gtctgtcttt gctgaaacct atgagaacga gactaagttg gcccttgttg ggaactgggg 3060 cacaacgggc ctcacctacc ccaagttttc tgacgtcacg ggcaagatca agctacccaa 3120 ggacagcttc cgcccctcgg ccggctggac ctgggctgga gattggttcg tgtgtccgga 3180 gaagactctg ctccatgaca tggacgccgg tcacctgagc ttcgtggaag aggtgtttga 3240 gaaccagacc cggcttcccg gaggccagtg gatctacatg agtgacaact acaccgatgt 3300 gaacggggag aaggtgcttc ccaaggatga cattgagtgc ccactgggct ggaagtggga 3360 agatgagggaa tggtccacag acctcaaccg ggctgtcgat gagcaaggct gggagtatag 3420 catcaccatc cccccggagc ggaagccgaa gcactgggtc cctgctgaga agatgtacta 3480 cacacaccga cggcggcgct gggtgcgcct gcgcaggagg gatctcagcc aaatggaagc 3540 actgaaaagg cacaggcagg cggaggcgga gggcgagggc tgggagtacg cctctctttt 3600 tggctggaag ttccacctcg agtaccgcaa gacagatgcc ttccgccgcc gccgctggcg 3660 ccgtcgcatg gagccactgg agaagacggg gcctgcagct gtgtttgccc ttgagggggc 3720 cctgggcggc gtgatggatg acaagagtga agattccatg tccgtctcca ccttgagctt 3780 cggtgtgaac agacccacga tttcctgcat attcgactat gggaaccgct accatctacg 3840 ctgctacatg taccaggccc gggacctggc tgcgatggac aaggactctt tttctgatcc 3900 ctatgccatc gtctccttcc tgcaccagag ccagaagacg gtggtggtga agaacaccct 3960 taaccccacc tgggaccaga cgctcatctt ctacgagatc gagatctttg gcgagccggc 4020 cacagttgct gagcaaccgc ccagcattgt ggtggagctg tacgaccatg acacttatgg 4080 tgcagacgag tttatgggtc gctgcatctg tcaaccgagt ctggaacgga tgccacggct 4140 ggcctggttc ccactgacga ggggcagcca gccgtcgggg gagctgctgg cctcttttga 4200 gctcatccag agagagaagc cggccatcca ccatattcct ggttttgagg tgcaggagac 4260 atcaaggatc ctggatgagt ctgaggacac agacctgccc tacccaccac cccagaggga 4320 ggccaacatc tacatggttc ctcagaacat caagccagcg ctccagcgta ccgccatcga 4380 gatcctggca tggggcctgc ggaacatgaa gagttaccag ctggccaaca tctcctcccc 4440 cagcctcgtg gtagagtgtg ggggccagac ggtgcagtcc tgtgtcatca ggaacctccg 4500 gaagaacccc aactttgaca tctgcaccct cttcatggaa gtgatgctgc ccagggagga 4560 gctctactgc ccccccatca ccgtcaaggt catcgataac cgccagtttg gccgccggcc 4620 tgtggtgggc cagtgtacca tccgctccct ggagagcttc ctgtgtgacc cctactcggc 4680 ggagagtcca tccccacagg gtggcccaga cgatgtgagc ctactcagtc ctggggaaga 4740 cgtgctcatc gacattgatg acaaggagcc cctcatcccc atccaggagg aagagttcat 4800 cgattggtgg agcaaattct ttgcctccat aggggagagg gaaaagtgcg gctcctacct 4860 ggagaaggat tttgacaccc tgaaggtcta tgacacacag ctggagaatg tggagggcctt 4920 tgagggcctg tctgactttt gtaacacctt caagctgtac cggggcaaga cgcaggagga 4980 gacagaagat ccatctgtga ttggtgaatt taagggcctc ttcaaaattt atcccctccc 5040 agaagaccca gccatcccca tgcccccaag acagttccac cagctggccg cccagggacc 5100 ccaggagtgc ttggtccgta tctacattgt ccgagcattt ggcctgcagc ccaaggaccc 5160 caatggaaag tgtgatcctt acatcaagat ctccataggg aagaaatcag tgagtgacca 5220 ggataactac atcccctgca cgctggagcc cgtatttgga aagatgttcg agctgacctg 5280 cactctgcct ctggagaagg acctaaagat cactctctat gactatgacc tcctctccaa 5340 ggacgaaaag atcggtgaga cggtcgtcga cctggagaac aggctgctgt ccaagtttgg 5400 ggctcgctgt ggactcccac agacctactg tgtctctgga ccgaaccagt ggcgggacca 5460 gctccgcccc tcccagctcc tccacctctt ctgccagcag catagagtca aggcacctgt 5520 gtaccggaca gaccgtgtaa tgtttcagga taaagaatat tccattgaag agatagaggc 5580 tggcaggatc ccaaacccac acctgggccc agtggaggag cgtctggctc tgcatgtgct 5640 tcagcagcag ggcctggtcc cggagcacgt ggagtcacgg cccctctaca gccccctgca 5700 gccagacatc gagcagggga agctgcagat gtgggtcgac ctatttccga aggccctggg 5760 gcggcctgga cctcccttca acatcacccc acggagagcc agaaggtttt tcctgcgttg 5820 tattatctgg aataccagag atgtgatcct ggatgacctg agcctcacgg gggagaagat 5880 gagcgacatt tatgtgaaag gttggatgat tggctttgaa gaacacaagc aaaagacaga 5940 cgtgcattat cgttccctgg gaggtgaagg caacttcaac tggaggttca ttttcccctt 6000 cgactacctg ccagctgagc aagtctgtac cattgccaag aaggatgcct tctggaggct 6060 ggacaagact gagagcaaaa tcccagcacg agtggtgttc cagatctggg acaatgacaa 6120 gttctccttt gatgattttc tgggctccct gcagctcgat ctcaaccgca tgcccaagcc 6180 agccaagaca gccaagaagt gctccttgga ccagctggat gatgctttcc acccagaatg 6240 gtttgtgtcc ctttttgagc agaaaacagt gaagggctgg tggccctgtg tagcagaaga 6300 gggtgagaag aaaatactgg cgggcaagct ggaaatgacc ttggagattg tagcagagag 6360 tgagcatgag gagcggcctg ctggccaggg ccgggatgag cccaacatga accctaagct 6420 tgaggaccca aggcgccccg acacctcctt cctgtggttt acctccccat acaagaccat 6480 gaagttcatc ctgtggcggc gtttccggtg ggccatcatc ctcttcatca tcctcttcat 6540 cctgctgctg ttcctggcca tcttcatcta cgccttcccg aactatgctg ccatgaagct 6600 ggtgaagccc ttcagctgag gactctcctg ccctgtagaa ggggccgtgg ggtcccctcc 6660 agcatgggac tggcctgcct cctccgccca gctcggcgag ctcctccaga cctcctaggc 6720 ctgattgtcc tgccagggtg ggcagacaga cagatggacc ggcccacact cccagagttg 6780 ctaacatgga gctctgagat caccccactt ccatcatttc cttctccccc aacccaacgc 6840 ttttttggat cagctcagac atatttcagt ataaaacagt tggaaccaca aaaaaaaaaa 6900 aaaaaaaaaa aaaa 6914 <210> 12 <211> 2080 <212> PRT <213> Homo sapiens <220> <223> hDYSF protein full length <400> 12 Met Leu Arg Val Phe Ile Leu Tyr Ala Glu Asn Val His Thr Pro Asp 1 5 10 15 Thr Asp Ile Ser Asp Ala Tyr Cys Ser Ala Val Phe Ala Gly Val Lys 20 25 30 Lys Arg Thr Lys Val Ile Lys Asn Ser Val Asn Pro Val Trp Asn Glu 35 40 45 Gly Phe Glu Trp Asp Leu Lys Gly Ile Pro Leu Asp Gln Gly Ser Glu 50 55 60 Leu His Val Val Val Lys Asp His Glu Thr Met Gly Arg Asn Arg Phe 65 70 75 80 Leu Gly Glu Ala Lys Val Pro Leu Arg Glu Val Leu Ala Thr Pro Ser 85 90 95 Leu Ser Ala Ser Phe Asn Ala Pro Leu Leu Asp Thr Lys Lys Gln Pro 100 105 110 Thr Gly Ala Ser Leu Val Leu Gln Val Ser Tyr Thr Pro Leu Pro Gly 115 120 125 Ala Val Pro Leu Phe Pro Pro Pro Thr Pro Leu Glu Pro Ser Pro Thr 130 135 140 Leu Pro Asp Leu Asp Val Val Val Ala Asp Thr Gly Gly Glu Glu Asp Thr 145 150 155 160 Glu Asp Gln Gly Leu Thr Gly Asp Glu Ala Glu Pro Phe Leu Asp Gln 165 170 175 Ser Gly Gly Pro Gly Ala Pro Thr Thr Pro Arg Lys Leu Pro Ser Arg 180 185 190 Pro Pro Pro His Tyr Pro Gly Ile Lys Arg Lys Arg Ser Ala Pro Thr 195 200 205 Ser Arg Lys Leu Leu Ser Asp Lys Pro Gln Asp Phe Gln Ile Arg Val 210 215 220 Gln Val Ile Glu Gly Arg Gln Leu Pro Gly Val Asn Ile Lys Pro Val 225 230 235 240 Val Lys Val Thr Ala Ala Gly Gln Thr Lys Arg Thr Arg Ile His Lys 245 250 255 Gly Asn Ser Pro Leu Phe Asn Glu Thr Leu Phe Phe Asn Leu Phe Asp 260 265 270 Ser Pro Gly Glu Leu Phe Asp Glu Pro Ile Phe Ile Thr Val Val Asp 275 280 285 Ser Arg Ser Leu Arg Thr Asp Ala Leu Leu Gly Glu Phe Arg Met Asp 290 295 300 Val Gly Thr Ile Tyr Arg Glu Pro Arg His Ala Tyr Leu Arg Lys Trp 305 310 315 320 Leu Leu Leu Ser Asp Pro Asp Asp Phe Ser Ala Gly Ala Arg Gly Tyr 325 330 335 Leu Lys Thr Ser Leu Cys Val Leu Gly Pro Gly Asp Glu Ala Pro Leu 340 345 350 Glu Arg Lys Asp Pro Ser Glu Asp Lys Glu Asp Ile Glu Ser Asn Leu 355 360 365 Leu Arg Pro Thr Gly Val Ala Leu Arg Gly Ala His Phe Cys Leu Lys 370 375 380 Val Phe Arg Ala Glu Asp Leu Pro Gln Met Asp Asp Ala Val Met Asp 385 390 395 400 Asn Val Lys Gln Ile Phe Gly Phe Glu Ser Asn Lys Lys Asn Leu Val 405 410 415 Asp Pro Phe Val Glu Val Ser Phe Ala Gly Lys Met Leu Cys Ser Lys 420 425 430 Ile Leu Glu Lys Thr Ala Asn Pro Gln Trp Asn Gln Asn Ile Thr Leu 435 440 445 Pro Ala Met Phe Pro Ser Met Cys Glu Lys Met Arg Ile Arg Ile Ile 450 455 460 Asp Trp Asp Arg Leu Thr His Asn Asp Ile Val Ala Thr Thr Tyr Leu 465 470 475 480 Ser Met Ser Lys Ile Ser Ala Pro Gly Gly Glu Ile Glu Glu Glu Glu Pro 485 490 495 Ala Gly Ala Val Lys Pro Ser Lys Ala Ser Asp Leu Asp Asp Tyr Leu 500 505 510 Gly Phe Leu Pro Thr Phe Gly Pro Cys Tyr Ile Asn Leu Tyr Gly Ser 515 520 525 Pro Arg Glu Phe Thr Gly Phe Pro Asp Pro Tyr Thr Glu Leu Asn Thr 530 535 540 Gly Lys Gly Glu Gly Val Ala Tyr Arg Gly Arg Leu Leu Leu Ser Leu 545 550 555 560 Glu Thr Lys Leu Val Glu His Ser Glu Gln Lys Val Glu Asp Leu Pro 565 570 575 Ala Asp Asp Ile Leu Arg Val Glu Lys Tyr Leu Arg Arg Arg Arg Lys Tyr 580 585 590 Ser Leu Phe Ala Ala Phe Tyr Ser Ala Thr Met Leu Gln Asp Val Asp 595 600 605 Asp Ala Ile Gln Phe Glu Val Ser Ile Gly Asn Tyr Gly Asn Lys Phe 610 615 620 Asp Met Thr Cys Leu Pro Leu Ala Ser Thr Thr Gln Tyr Ser Arg Ala 625 630 635 640 Val Phe Asp Gly Cys His Tyr Tyr Tyr Leu Pro Trp Gly Asn Val Lys 645 650 655 Pro Val Val Val Leu Ser Ser Tyr Trp Glu Asp Ile Ser His Arg Ile 660 665 670 Glu Thr Gln Asn Gln Leu Leu Gly Ile Ala Asp Arg Leu Glu Ala Gly 675 680 685 Leu Glu Gln Val His Leu Ala Leu Lys Ala Gln Cys Ser Thr Glu Asp 690 695 700 Val Asp Ser Leu Val Ala Gln Leu Thr Asp Glu Leu Ile Ala Gly Cys 705 710 715 720 Ser Gln Pro Leu Gly Asp Ile His Glu Thr Pro Ser Ala Thr His Leu 725 730 735 Asp Gln Tyr Leu Tyr Gln Leu Arg Thr His His Leu Ser Gln Ile Thr 740 745 750 Glu Ala Ala Leu Ala Leu Lys Leu Gly His Ser Glu Leu Pro Ala Ala 755 760 765 Leu Glu Gln Ala Glu Asp Trp Leu Leu Arg Leu Arg Ala Leu Ala Glu 770 775 780 Glu Pro Gln Asn Ser Leu Pro Asp Ile Val Ile Trp Met Leu Gln Gly 785 790 795 800 Asp Lys Arg Val Ala Tyr Gln Arg Val Pro Ala His Gln Val Leu Phe 805 810 815 Ser Arg Arg Gly Ala Asn Tyr Cys Gly Lys Asn Cys Gly Lys Leu Gln 820 825 830 Thr Ile Phe Leu Lys Tyr Pro Met Glu Lys Val Pro Gly Ala Arg Met 835 840 845 Pro Val Gln Ile Arg Val Lys Leu Trp Phe Gly Leu Ser Val Asp Glu 850 855 860 Lys Glu Phe Asn Gln Phe Ala Glu Gly Lys Leu Ser Val Phe Ala Glu 865 870 875 880 Thr Tyr Glu Asn Glu Thr Lys Leu Ala Leu Val Gly Asn Trp Gly Thr 885 890 895 Thr Gly Leu Thr Tyr Pro Lys Phe Ser Asp Val Thr Gly Lys Ile Lys 900 905 910 Leu Pro Lys Asp Ser Phe Arg Pro Ser Ala Gly Trp Thr Trp Ala Gly 915 920 925 Asp Trp Phe Val Cys Pro Glu Lys Thr Leu Leu His Asp Met Asp Ala 930 935 940 Gly His Leu Ser Phe Val Glu Glu Val Phe Glu Asn Gln Thr Arg Leu 945 950 955 960 Pro Gly Gly Gln Trp Ile Tyr Met Ser Asp Asn Tyr Thr Asp Val Asn 965 970 975 Gly Glu Lys Val Leu Pro Lys Asp Asp Ile Glu Cys Pro Leu Gly Trp 980 985 990 Lys Trp Glu Asp Glu Glu Trp Ser Thr Asp Leu Asn Arg Ala Val Asp 995 1000 1005 Glu Gln Gly Trp Glu Tyr Ser Ile Thr Ile Pro Pro Glu Arg Lys 1010 1015 1020 Pro Lys His Trp Val Pro Ala Glu Lys Met Tyr Tyr Thr His Arg 1025 1030 1035 Arg Arg Arg Trp Val Arg Leu Arg Arg Arg Asp Leu Ser Gln Met 1040 1045 1050 Glu Ala Leu Lys Arg His Arg Gln Ala G lu Ala Glu Gly Glu Gly 1055 1060 1065 Trp Glu Tyr Ala Ser Leu Phe Gly Trp Lys Phe His Leu Glu Tyr 1070 1075 1080 Arg Lys Thr Asp Ala Phe Arg Arg Arg Arg Trp Arg Arg Arg Met 1085 1090 1095 Glu Pro Leu Glu Lys Thr Gly Pro Ala Ala Val Phe Ala Leu Glu 1100 1105 1110 Gly Ala Leu Gly Gly Val Met Asp Asp Lys Ser Glu Asp Ser Met 1115 1120 1125 Ser Val Ser Thr Leu Ser Phe Gly Val Asn Arg Pro Thr Ile Ser 1130 1135 1140 Cys Ile Phe Asp Tyr Gly Asn Arg Tyr His Leu Arg Cys Tyr Met 1145 1150 1155 Tyr Gln Ala Arg Asp Leu Ala Ala Met Asp Lys Asp Ser Phe Ser 1160 1165 1170 Asp Pro Tyr Ala Ile Val Ser Phe Leu His Gln Ser Gln Lys Thr 1175 1180 1185 Val Val Val Lys Asn Thr Leu Asn Pro Thr Trp Asp Gln Thr Leu 1190 1195 1200 Ile Phe Tyr Glu Ile Glu Ile Phe Gly Glu Pro Ala Thr Val Ala 1205 1210 1215 Glu Gln Pro Pro Ser Ile Val Val Glu Leu Tyr Asp His Asp Thr 1220 1225 1230 Tyr Gly Ala Asp Glu Phe Met Gly Arg Cys Ile Cys Gln Pro Ser 1235 1240 1245 Leu Glu Arg Met Pro Arg Leu Ala Trp Phe Pro Leu Thr Arg Gly 12 50 1255 1260 Ser Gln Pro Ser Gly Glu Leu Leu Ala Ser Phe Glu Leu Ile Gln 1265 1270 1275 Arg Glu Lys Pro Ala Ile His His Ile Pro Gly Phe Glu Val Gln 1280 1285 1290 Glu Thr Ser Arg Ile Leu Asp Glu Ser Glu Asp Thr Asp Leu Pro 1295 1300 1305 Tyr Pro Pro Pro Gln Arg Glu Ala Asn Ile Tyr Met Val Pro Gln 1310 1315 1320 Asn Ile Lys Pro Ala Leu Gln Arg Thr Ala Ile Glu Ile Leu Ala 1325 1330 1335 Trp Gly Leu Arg Asn Met Lys Ser Tyr Gln Leu Ala Asn Ile Ser 1340 1345 1350 Ser Pro Ser Leu Val Val Glu Cys Gly Gly Gln Thr Val Gln Ser 1355 1360 1365 Cys Val Ile Arg Asn Leu Arg Lys Asn Pro Asn Phe Asp Ile Cys 1370 1375 1380 Thr Leu Phe Met Glu Val Met Leu Pro Arg Glu Glu Leu Tyr Cys 1385 1390 1395 Pro Pro Ile Thr Val Lys Val Ile Asp Asn Arg Gln Phe Gly Arg 1400 1405 1410 Arg Pro Val Val Gly Gln Cys Thr Ile Arg Ser Leu Glu Ser Phe 1415 1420 1425 Leu Cys Asp Pro Tyr Ser Ala Glu Ser Pro Ser Pro Gln Gly Gly 1430 1435 1440 Pro Asp Asp Val Ser Leu Leu Ser Pro Gly Glu Asp Val Leu Ile 1445 1450 1455 Asp Ile Asp Asp Lys Glu Pro Leu Ile Pro Ile Gln Glu Glu Glu 1460 1465 1470 Phe Ile Asp Trp Trp Ser Lys Phe Phe Ala Ser Ile Gly Glu Arg 1475 1480 1485 Glu Lys Cys Gly Ser Tyr Leu Glu Lys Asp Phe Asp Thr Leu Lys 1490 1495 1500 Val Tyr Asp Thr Gln Leu Glu Asn Val Glu Ala Phe Glu Gly Leu 1505 1510 1515 Ser Asp Phe Cys Asn Thr Phe Lys Leu Tyr Arg Gly Lys Thr Gln 1520 1525 1530 Glu Glu Thr Glu Asp Pro Ser Val Ile Gly Glu Phe Lys Gly Leu 1535 1540 1545 Phe Lys Ile Tyr Pro Leu Pro Glu Asp Pro Ala Ile Pro Met Pro 1550 1555 1560 Pro Arg Gln Phe His Gln Leu Ala Ala Gln Gly Pro Gln Glu Cys 1565 1570 1575 Leu Val Arg Ile Tyr Ile Val Arg Ala Phe Gly Leu Gln Pro Lys 1580 1585 1590 Asp Pro Asn Gly Lys Cys Asp Pro Tyr Ile Lys Ile Ser Ile Gly 1595 1600 1605 Lys Lys Ser Val Ser Asp Gln Asp Asn Tyr Ile Pro Cys Thr Leu 1610 1615 1620 Glu Pro Val Phe Gly Lys Met Phe Glu Leu Thr Cys Thr Leu Pro 1625 1630 1635 Leu Glu Lys Asp Leu Lys Ile Thr Leu Tyr Asp Tyr Asp Leu Leu 1640 1645 1650 Ser Lys Asp Glu Lys Ile Gly Glu Thr V al Val Asp Leu Glu Asn 1655 1660 1665 Arg Leu Leu Ser Lys Phe Gly Ala Arg Cys Gly Leu Pro Gln Thr 1670 1675 1680 Tyr Cys Val Ser Gly Pro Asn Gln Trp Arg Asp Gln Leu Arg Pro 1685 1690 1695 Ser Gln Leu Leu His Leu Phe Cys Gln Gln His Arg Val Lys Ala 1700 1705 1710 Pro Val Tyr Arg Thr Asp Arg Val Met Phe Gln Asp Lys Glu Tyr 1715 1720 1725 Ser Ile Glu Glu Ile Glu Ala Gly Arg Ile Pro Asn Pro His Leu 1730 1735 1740 Gly Pro Val Glu Glu Arg Leu Ala Leu His Val Leu Gln Gln Gln 1745 1750 1755 Gly Leu Val Pro Glu His Val Glu Ser Arg Pro Leu Tyr Ser Pro 1760 1765 1770 Leu Gln Pro Asp Ile Glu Gln Gly Lys Leu Gln Met Trp Val Asp 1775 1780 1785 Leu Phe Pro Lys Ala Leu Gly Arg Pro Gly Pro Pro Phe Asn Ile 1790 1795 1800 Thr Pro Arg Arg Ala Arg Arg Phe Phe Leu Arg Cys Ile Ile Trp 1805 1810 1815 Asn Thr Arg Asp Val Ile Leu Asp Asp Leu Ser Leu Thr Gly Glu 1820 1825 1830 Lys Met Ser Asp Ile Tyr Val Lys Gly Trp Met Ile Gly Phe Glu 1835 1840 1845 Glu His Lys Gln Lys Thr Asp Val His Tyr Arg Ser Leu Gly Gly 18 50 1855 1860 Glu Gly Asn Phe Asn Trp Arg Phe Ile Phe Pro Phe Asp Tyr Leu 1865 1870 1875 Pro Ala Glu Gln Val Cys Thr Ile Ala Lys Lys Asp Ala Phe Trp 1880 1885 1890 Arg Leu Asp Lys Thr Glu Ser Lys Ile Pro Ala Arg Val Val Phe 1895 1900 1905 Gln Ile Trp Asp Asn Asp Lys Phe Ser Phe Asp Asp Phe Leu Gly 1910 1915 1920 Ser Leu Gln Leu Asp Leu Asn Arg Met Pro Lys Pro Ala Lys Thr 1925 1930 1935 Ala Lys Lys Cys Ser Leu Asp Gln Leu Asp Asp Ala Phe His Pro 1940 1945 1950 Glu Trp Phe Val Ser Leu Phe Glu Gln Lys Thr Val Lys Gly Trp 1955 1960 1965 Trp Pro Cys Val Ala Glu Glu Gly Glu Lys Lys Ile Leu Ala Gly 1970 1975 1980 Lys Leu Glu Met Thr Leu Glu Ile Val Ala Glu Ser Glu His Glu 1985 1990 1995 Glu Arg Pro Ala Gly Gln Gly Arg Asp Glu Pro Asn Met Asn Pro 2000 2005 2010 Lys Leu Glu Asp Pro Arg Arg Pro Asp Thr Ser Phe Leu Trp Phe 2015 2020 2025 Thr Ser Pro Tyr Lys Thr Met Lys Phe Ile Leu Trp Arg Arg Phe 2030 2035 2040 Arg Trp Ala Ile Ile Leu Phe Ile Ile Leu Phe Ile Leu Leu Leu 2045 2050 2055 Phe Leu Ala Ile Phe Ile Tyr Ala Phe Pro Asn Tyr Ala Ala Met 2060 2065 2070Lys Leu Val Lys Pro Phe Ser 2075 2080 <210> 13 <211> 3340 <212> DNA <213> Homo sapiens <220> <223> 5' hDYSF DNA fragment v2 <400> 13 atgctgaggg tcttcatcct ctatgccgag aacgtccaca cacccgacac cgacatcagc 60 gatgcctact gctccgcggt gtttgcaggg gtgaagaaga gaaccaaagt catcaagaac 120 agcgtgaacc ctgtatggaa tgagggattt gaatgggacc tcaagggcat ccccctggac 180 cagggctctg agcttcatgt ggtggtcaaa gaccatgaga cgatggggag gaacaggttc 240 ctgggggaag ccaaggtccc actccgagag gtcctcgcca cccctagtct gtccgccagc 300 ttcaatgccc ccctgctgga caccaagaag cagccacag gggcctcgct ggtcctgcag 360 gtgtcctaca caccgctgcc tggagctgtg cccctgttcc cgccccctac tcctctggag 420 ccctccccga ctctgcctga cctggatgta gtggcagaca caggaggaga ggaagacaca 480 gaggaccagg gactcactgg agatgaggcg gagccattcc tggatcaaag cggaggcccg 540 ggggctccca ccaccccaag gaaactacct tcacgtcctc cgccccacta ccccgggatc 600 aaaagaaagc gaagtgcgcc tacatctaga aagctgctgt cagacaaacc gcaggatttc 660 cagatcaggg tccaggtgat cgaggggcgc cagctgccgg gggtgaacat caagcctggg 720 gtcaaggtta ccgctgcagg gcagaccaag cggacgcgga tccacaaggg aaacagccca 780 ctcttcaatg agactctttt cttcaacttg tttgactctc ctggggagct gtttgatgag 840 cccatcttta tcacggtggt agactctcgt tctctcagga cagatgctct cctcggggag 900 ttccggatgg acgtgggcac catttacaga gagccccggc acgcctatct caggaagtgg 960 ctgctgctct cagaccctga tgacttctct gctggggcca gaggctacct gaaaacaagc 1020 ctttgtgtgc tggggcctgg ggacgaagcg cctctggaga gaaaagaccc ctctgaagac 1080 aaggaggaca ttgaaagcaa cctgctccgg cccacaggcg tagccctgcg aggagcccac 1140 ttctgcctga aggtcttccg ggccgaggac ttgccgcaga tggacgatgc cgtgatggac 1200 aacgtgaaac agatctttgg cttcgagagt aacaagaaga acttggtgga cccctttggg 1260 gaggtcagct ttgcgggggaa aatgctgtgc agcaagatct tggagaagac ggccaaccct 1320 cagtggaacc agaacatcac actgcctgcc atgtttccct ccatgtgcga aaaaatgagg 1380 attcgtatca tagactggga ccgcctgact cacaatgaca tcgtggctac cacctacctg 1440 agtatgtcga aaatctctgc ccctggagga gaaatagaag aggagcctgc aggtgctgtc 1500 aagccttcga aagcctcaga cttggatgac tacctgggct tcctccccac ttttgggccc 1560 tgctacatca acctctatgg cagtcccaga gagttcacag gcttcccaga cccctacaca 1620 gagctcaaca caggcaaggg ggaagggtg gcttatcgtg gccggcttct gctctccctg 1680 gagaccaagc tggtggagca cagtgaacag aaggtggagg accttcctgc ggatgacatc 1740 ctccgggtgg agaagtacct taggaggcgc aagtactccc tgtttgcggc cttctactca 1800 gccaccatgc tgcaggatgt ggatgatgcc atccagtttg aggtcagcat cgggaactac 1860 gggaacaagt tcgacatgac ctgcctgccg ctggcctcca ccactcagta cagccgtgca 1920 gtctttgacg ggtgccacta ctactaccta ccctggggta acgtgaaacc tgtggtggtg 1980 ctgtcatcct actgggagga catcagccat agaatcgaga ctcagaacca gctgcttggg 2040 attgctgacc ggctggaagc tggcctggag caggtccacc tggccctgaa ggcgcagtgc 2100 tccacggagg acgtggactc gctggtggct cagctgacgg atgagctcat cgcaggctgc 2160 agccagcctc tgggtgacat ccatgagaca ccctctgcca cccacctgga ccagtacctg 2220 taccagctgc gcacccatca cctgagccaa atcactgagg ctgccctggc cctgaagctc 2280 ggccacagtg agctccctgc agctctggag caggcggagg actggctcct gcgtctgcgt 2340 gccctggcag aggagcccca gaacagcctg ccggacatcg tcatctggat gctgcaggga 2400 gacaagcgtg tggcatacca gcgggtgccc gcccaccaag tcctcttctc ccggcggggt 2460 gccaactact gtggcaagaa ttgtgggaag ctacagacaa tctttctgaa atatccgatg 2520 gagaaggtgc ctggcgcccg gatgccagtg cagatacggg tcaagctggg gtttgggctc 2580 tcagtggatg agaaggagtt caaccagttt gctgagggga agctgtctgt ctttgctgaa 2640 acctatgaga acgagactaa gttggccctt gttgggaact ggggcacaac gggcctcacc 2700 taccccaagt tttctgacgt cacgggcaag atcaagctac ccaaggacag cttccgcccc 2760 tcggccggct ggacctgggc tggagattgg ttcgtgtgtc cggagaagac tctgctccat 2820 gacatggacg ccggtcacct gagcttcgtg gaagaggtgt ttgagaacca gacccggctt 2880 cccggaggcc agtggatcta catgagtgac aactacaccg atgtgaacgg ggagaaggtg 2940 cttcccaagg atgacattga gtgcccactg ggctggaagt gggaagatga ggaatggtcc 3000 acagacctca accgggctgt cgatgagcaa ggctgggagt atagcatcac catccccccg 3060 gagcggaagc cgaagcactg ggtccctgct gagaagatgt actacacaca ccgacggcgg 3120 cgctgggtgc gcctgcgcag gagggatctc agccaaatgg aagcactgaa aaggcacagg 3180 caggcggagg cggagggcga gggctggggag tacgcctctc tttttggctg gaagttccac 3240 ctcgagtacc gcaagacaga tgccttccgc cgccgccgct ggcgccgtcg catggagcca 3300 ctggagaaga cggggcctgc agctgtgttt gcccttgagg 3340 <210> 14 <211> 3866 <212> DNA <213> Homo sapiens <220> <223> 3' hDYSF DNA fragment v2 <400> 14 tcgtcatctg gatgctgcag ggagacaagc gtgtggcata ccagcgggtg cccgcccacc 60 aagtcctctt ctcccggcgg ggtgccaact actgtggcaa gaattgtggg aagctacaga 120 caatctttct gaaatatccg atggagaagg tgcctggcgc ccggatgcca gtgcagatac 180 gggtcaagct gtggtttggg ctctcagtgg atgagaagga gttcaaccag tttgctgagg 240 ggaagctgtc tgtctttgct gaaacctatg agaacgagac taagttggcc cttgttggga 300 actggggcac aacgggcctc acctacccca agttttctga cgtcacgggc aagatcaagc 360 tacccaagga cagcttccgc ccctcggccg gctggacctg ggctggagat tggttcgtgt 420 gtccggagaa gactctgctc catgacatgg acgccggtca cctgagcttc gtggaagagg 480 tgtttgagaa ccagacccgg cttcccggag gccagtggat ctacatgagt gacaactaca 540 ccgatgtgaa cggggagaag gtgcttccca aggatgacat tgagtgccca ctgggctgga 600 agtgggaaga tgaggaatgg tccacagacc tcaaccgggc tgtcgatgag caaggctggg 660 agtatagcat caccatcccc ccggagcgga agccgaagca ctgggtccct gctgagaaga 720 tgtactacac acaccgacgg cggcgctggg tgcgcctgcg caggagggat ctcagccaaa 780 tggaagcact gaaaaggcac aggcaggcgg aggcggaggg cgagggctgg gagtacgcct 840 ctctttttgg ctggaagttc cacctcgagt accgcaagac agatgccttc cgccgccgcc 900 gctggcgccg tcgcatggag ccactggaga agacggggcc tgcagctgtg tttgcccttg 960 agggggccct gggcggcgtg atggatgaca agagtgaaga ttccatgtcc gtctccacct 1020 tgagcttcgg tgtgaacaga cccacgattt cctgcatatt cgactatggg aaccgctacc 1080 atctacgctg ctacatgtac caggcccggg acctggctgc gatggacaag gactcttttt 1140 ctgatcccta tgccatcgtc tccttcctgc accagagcca gaagacggtg gtggtgaaga 1200 acacccttaa ccccacctgg gaccagacgc tcatcttcta cgagatcgag atctttggcg 1260 agccggccac agttgctgag caaccgccca gcattgtggt ggagctgtac gaccatgaca 1320 cttatggtgc agacgagttt atgggtcgct gcatctgtca accgagtctg gaacggatgc 1380 cacggctggc ctggttccca ctgacgaggg gcagccagcc gtcgggggag ctgctggcct 1440 cttttgagct catccagaga gagaagccgg ccatccacca tattcctggt tttgaggtgc 1500 aggagacatc aaggatcctg gatgagtctg aggacacaga cctgccctac ccaccacccc 1560 agagggaggc cacacatctac atggttcctc agaacatcaa gccagcgctc cagcgtaccg 1620 ccatcgagat cctggcatgg ggcctgcgga acatgaagag ttaccagctg gccaacatct 1680 cctcccccag cctcgtggta gagtgtgggg gccagacggt gcagtcctgt gtcatcagga 1740 acctccggaa gaaccccaac tttgacatct gcaccctctt catggaagtg atgctgccca 1800 gggaggagct ctactgcccc cccatcaccg tcaaggtcat cgataaccgc cagtttggcc 1860 gccggcctgt ggtgggccag tgtaccatcc gctccctgga gagcttcctg tgtgacccct 1920 actcggcgga gagtccatcc ccacagggtg gcccagacga tgtgagccta ctcagtcctg 1980 gggaagacgt gctcatcgac attgatgaca aggagcccct catcccccatc caggaggaag 2040 agttcatcga ttggtggagc aaattctttg cctccatagg ggagagggaa aagtgcggct 2100 cctacctgga gaaggatttt gacaccctga aggtctatga cacacagctg gagaatgtgg 2160 aggcctttga gggcctgtct gacttttgta acaccttcaa gctgtaccgg ggcaagacgc 2220 aggagggagac agaagatcca tctgtgattg gtgaatttaa gggcctcttc aaaatttatc 2280 ccctcccaga agacccagcc atccccatgc ccccaagaca gttccaccag ctggccgccc 2340 agggacccca ggaggtgcttg gtccgtatct acattgtccg agcatttggc ctgcagccca 2400 aggaccccaa tggaaagtgt gatccttaca tcaagatctc catagggaag aaatcagtga 2460 gtgaccagga taactacatc ccctgcacgc tggagcccgt atttggaaag atgttcgagc 2520 tgacctgcac tctgcctctg gagaaggacc taaagatcac tctctatgac tatgacctcc 2580 tctccaagga cgaaaagatc ggtgagacgg tcgtcgacct ggagaacagg ctgctgtcca 2640 agtttggggc tcgctgtgga ctcccacaga cctactgtgt ctctggaccg aaccagtggc 2700 gggaccagct ccgcccctcc cagctcctcc acctcttctg ccagcagcat agagtcaagg 2760 cacctgtgta ccggacagac cgtgtaatgt ttcaggataa agaatattcc attgaagaga 2820 tagaggctgg caggatccca aacccacacc tgggcccagt ggaggagcgt ctggctctgc 2880 atgtgcttca gcagcagggc ctggtcccgg agcacgtgga gtcacggccc ctctacagcc 2940 ccctgcagcc agacatcgag caggggaagc tgcagatgtg ggtcgaccta tttccgaagg 3000 ccctggggcg gcctggacct cccttcaaca tcaccccacg gagagccaga aggtttttcc 3060 tgcgttgtat tatctggaat accagagatg tgatcctgga tgacctgagc ctcacggggg 3120 agaagatgag cgacatttat gtgaaaggtt ggatgattgg ctttgaagaa cacaagcaaa 3180 agacagacgt gcattatcgt tccctgggag gtgaaggcaa cttcaactgg aggttcattt 3240 tccccttcga ctacctgcca gctgagcaag tctgtaccat tgccaagaag gatgccttct 3300 ggaggctgga caagactgag agcaaaatcc cagcacgagt ggtgttccag atctggggaca 3360 atgacaagtt ctcctttgat gattttctgg gctccctgca gctcgatctc aaccgcatgc 3420 ccaagccagc caagacagcc aagaagtgct ccttggacca gctggatgat gctttccacc 3480 cagaatggtt tgtgtccctt tttgagcaga aaacagtgaa gggctggtgg ccctggtgtag 3540 cagaagaggg tgagaagaaa atactggcgg gcaagctgga aatgaccttg gagattgtag 3600 cagagagtga gcatgaggag cggcctgctg gccagggccg ggatgagccc aacatgaacc 3660 ctaagcttga ggacccaagg cgccccgaca cctccttcct gtggtttacc tccccataca 3720 agaccatgaa gttcatcctg tggcggcgtt tccggtgggc catcatcctc ttcatcatcc 3780 tcttcatcct gctgctgttc ctggccatct tcatctacgc cttcccgaac tatgctgcca 3840 tgaagctggt gaagcccttc agctga 3866 <210> 15 <211> 4689 <212> DNA <213> artificial sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <223> DYSF F5' Expresson Cassette v2 <400> 15 gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg 60 tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact ccatcactag 120 gggttccttg tagttaatga ttaacccgcc atgctctaga gtttaagctt gcatgtctaa 180 gctagaccct tcagattaaa aataactgag gtaagggcct gggtagggga ggtggtgtga 240 gacgctcctg tctctcctct atctgcccat cggccctttg gggaggagga atgtgcccaa 300 ggactaaaaa aaggccatgg agccagaggg gcgagggcaa cagacctttc atgggcaaac 360 cttggggccc tgctgtctag catgccccac tacgggtcta ggctgcccat gtaaggaggc 420 aaggcctggg gacacccgag atgcctggtt ataattaacc cagacatgtg gctgcccccc 480 cccccccaac acctgctgcc tctaaaaata accctgtccc tggtggatcc cctgcatgcg 540 aagatcttcg aacaaggctg tgggggactg agggcaggct gtaacaggct tgggggccag 600 ggctttatacg tgcctgggac tcccaaagta ttactgttcc atgttcccgg cgaagggcca 660 gctgtccccc gccagctaga ctcagcactt agtttaggaa ccagtgagca agtcagccct 720 tggggcagcc catacaaggc catggggctg ggcaagctgc acgcctgggt ccggggtggg 780 cacggtgccc gggcaacgag ctgaaagctc atctgctctc aggggcccct ccctggggac 840 agcccctcct ggctagtcac accctgtagg ctcctctata taacccaggg gcacaggggc 900 tgccctcatt ctaccaccac ctccacagca cagacagaca ctcaggagca gccagcggcg 960 cgcccaggta agtttagtct ttttgtcttt tatttcaggt cccggatccg gtggtggtgc 1020 aaatcaaaga actgctcctc agtggatgtt gcctttactt ctaggcctgt acggaagtgt 1080 tacttctgct ctaaaagctg cggaattgta cccgcggccg cggctagcca ccatgctgag 1140 ggtcttcatc ctctatgccg agaacgtcca cacacccgac accgacatca gcgatgccta 1200 ctgctccgcg gtgtttgcag gggtgaagaa gagaaccaaa gtcatcaaga acagcgtgaa 1260 ccctgtatgg aatgagggat ttgaatggga cctcaagggc atccccctgg accagggctc 1320 tgagcttcat gtggtggtca aagaccatga gacgatgggg aggaacaggt tcctggggga 1380 agccaaggtc ccactccgag aggtcctcgc cacccctagt ctgtccgcca gcttcaatgc 1440 ccccctgctg gacaccaaga agcagcccac aggggcctcg ctggtcctgc aggtgtccta 1500 cacaccgctg cctggagctg tgcccctgtt cccgccccct actcctctgg agccctcccc 1560 gactctgcct gacctggatg tagtggcaga cacaggagga gaggaagaca cagaggacca 1620 gggactcact ggagatgagg cggagccatt cctggatcaa agcggaggcc cgggggctcc 1680 caccacccca aggaaactac cttcacgtcc tccgccccac taccccggga tcaaaagaaa 1740 gcgaagtgcg cctacatcta gaaagctgct gtcagacaaa ccgcaggatt tccagatcag 1800 ggtccaggtg atcgaggggc gccagctgcc gggggtgaac atcaagcctg tggtcaaggt 1860 taccgctgca gggcagacca agcggacgcg gatccacaag ggaaacagcc cactcttcaa 1920 tgagactctt ttcttcaact tgtttgactc tcctggggag ctgtttgatg agcccatctt 1980 tatcacggtg gtagactctc gttctctcag gacagatgct ctcctcgggg agttccggat 2040 ggacgtgggc accatttaca gagagccccg gcacgcctat ctcaggaagt ggctgctgct 2100 ctcagaccct gatgacttct ctgctggggc cagaggctac ctgaaaacaa gcctttgtgt 2160 gctggggcct ggggacgaag cgcctctgga gagaaaagac ccctctgaag acaaggagga 2220 cattgaaagc aacctgctcc ggcccacagg cgtagccctg cgaggagccc acttctgcct 2280 gaaggtcttc cgggccgagg acttgccgca gatggacgat gccgtgatgg acaacgtgaa 2340 acagatcttt ggcttcgaga gtaacaagaa gaacttggtg gacccctttg tggaggtcag 2400 ctttgcgggg aaaatgctgt gcagcaagat cttggagaag acggccaacc ctcagtgggaa 2460 ccagaacatc acactgcctg ccatgtttcc ctccatgtgc gaaaaaatga ggattcgtat 2520 catagactgg gaccgcctga ctcacaatga catcgtggct accacctacc tgagtatgtc 2580 gaaaatctct gcccctggag gagaaataga agaggagcct gcaggtgctg tcaagccttc 2640 gaaagcctca gacttggatg actacctggg cttcctcccc acttttgggc cctgctacat 2700 caacctctat ggcagtccca gagagttcac aggcttccca gacccctaca cagagctcaa 2760 2820 gctggtggag cacagtgaac agaaggtgga ggaccttcct gcggatgaca tcctccgggt 2880 ggagaagtac cttaggaggc gcaagtactc cctgtttgcg gccttctact cagccaccat 2940 gctgcaggat gtggatgatg ccatccagtt tgaggtcagc atcgggaact acgggaacaa 3000 gttcgacatg acctgcctgc cgctggcctc caccactcag tacagccgtg cagtctttga 3060 cgggtgccac tactactacc taccctgggg taacgtgaaa cctgtggtgg tgctgtcatc 3120 ctactgggag gacatcagcc atagaatcga gactcagaac cagctgcttg ggattgctga 3180 ccggctggaa gctggcctgg agcaggtcca cctggccctg aaggcgcagt gctccacgga 3240 ggacgtggac tcgctggtgg ctcagctgac ggatgagctc atcgcaggct gcagccagcc 3300 tctgggtgac atccatgaga caccctctgc cacccacctg gaccagtacc tgtaccagct 3360 gcgcacccat cacctgagcc aaatcactga ggctgccctg gccctgaagc tcggccacag 3420 tgagctccct gcagctctgg agcaggcgga ggactggctc ctgcgtctgc gtgccctggc 3480 agaggagccc cagaacagcc tgccggacat cgtcatctgg atgctgcagg gagacaagcg 3540 tgtggcatac cagcgggtgc ccgccaccca agtcctcttc tcccggcggg gtgccaacta 3600 ctgtggcaag aattgtggga agctacagac aatctttctg aaatatccga tggagaaggt 3660 gcctggcgcc cggatgccag tgcagatacg ggtcaagctg tggtttgggc tctcagtgga 3720 tgagaaggag ttcaaccagt ttgctgaggg gaagctgtct gtctttgctg aaacctatga 3780 gaacgagact aagttggccc ttgttgggaa ctggggcaca acgggcctca cctaccccaa 3840 gttttctgac gtcacgggca agatcaagct acccaaggac agcttccgcc cctcggccgg 3900 ctggacctgg gctggagatt ggttcgtgtg tccggagaag actctgctcc atgacatgga 3960 cgccggtcac ctgagcttcg tggaagaggt gtttgagaac cagacccggc ttcccggagg 4020 ccagtggatc tacatgagtg acaactacac cgatgtgaac ggggagaagg tgcttcccaa 4080 ggatgacatt gagtgcccac tgggctggaa gtgggaagat gaggaatggt ccacagacct 4140 caaccgggct gtcgatgagc aaggctggga gtatagcatc accatccccc cggagcggaa 4200 gccgaagcac tgggtccctg ctgagaagat gtactacaca caccgacggc ggcgctgggt 4260 gcgcctgcgc aggagggatc tcagccaaat ggaagcactg aaaaggcaca ggcaggcgga 4320 ggcggagggc gagggctggg agtacgcctc tctttttggc tggaagttcc acctcgagta 4380 ccgcaagaca gatgccttcc gccgccgccg ctggcgccgt cgcatggagc cactggagaa 4440 gacggggcct gcagctgtgt ttgcccttga gggcggccgc aataaaagat ctttattttc 4500 attagatctg tgtgttggtt ttttgtgtgt ctagagcatg gcgggttaat cattaactac 4560 aaggaacccc tagtgatgga gttggccact ccctctctgc gcgctcgctc gctcactgag 4620 gccgggcgac caaaggtcgc ccgacgcccg ggctttgccc gggcggcctc agtgagcgag 4680 cgagcgcgc 4689 <210> 16 <211> 4592 <212> DNA <213> artificial sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <223> DYSF F3' Expression Cassette v2 <400> 16 gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg 60 tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact ccatcactag 120 gggttccttg tagttaatga ttaacccgcc atgctactta tctacgtagc catgctctgg 180 tcgactctag agttttcgtc atctggatgc tgcagggaga caagcgtgtg gcataccagc 240 gggtgcccgc ccaccaagtc ctcttctccc ggcggggtgc caactactgt ggcaagaatt 300 gtgggaagct acagacaatc tttctgaaat atccgatgga gaaggtgcct ggcgcccgga 360 tgccagtgca gatacgggtc aagctgtggt ttgggctctc agtggatgag aaggagttca 420 accagtttgc tgaggggaag ctgtctgtct ttgctgaaac ctatgagaac gagactaagt 480 tggcccttgt tgggaactgg ggcacaacgg gcctcaccta ccccaagttt tctgacgtca 540 cgggcaagat caagctaccc aaggacagct tccgcccctc ggccggctgg acctgggctg 600 gagattggtt cgtgtgtccg gagaagactc tgctccatga catggacgcc ggtcacctga 660 gcttcgtgga agaggtgttt gagaaccaga cccggcttcc cggaggccag tggatctaca 720 tgagtgacaa ctacaccgat gtgaacgggg agaaggtgct tcccaaggat gacattgagt 780 gcccactggg ctggaagtgg gaagatgagg aatggtccac agacctcaac cgggctgtcg 840 atgagcaagg ctgggagtat agcatcacca tccccccgga gcggaagccg aagcactggg 900 tccctgctga gaagatgtac tacacacacc gacggcggcg ctgggtgcgc ctgcgcagga 960 gggatctcag ccaaatggaa gcactgaaaa ggcacaggca ggcggaggcg gagggcgagg 1020 gctgggagta cgcctctctt tttggctgga agttccacct cgagtaccgc aagacagatg 1080 ccttccgccg ccgccgctgg cgccgtcgca tggagccact ggagaagacg gggcctgcag 1140 ctgtgtttgc ccttgagggg gccctgggcg gcgtgatgga tgacaagagt gaagattcca 1200 tgtccgtctc caccttgagc ttcggtgtga acagaccac gatttcctgc atattcgact 1260 atgggaaccg ctaccatcta cgctgctaca tgtaccaggc ccgggacctg gctgcgatgg 1320 acaaggactc tttttctgat ccctatgcca tcgtctcctt cctgcaccag agccagaaga 1380 cggtggtggt gaagaacacc cttaacccca cctgggacca gacgctcatc ttctacgaga 1440 tcgagatctt tggcgagccg gccacagttg ctgagcaacc gcccagcatt gtggtggagc 1500 tgtacgacca tgacacttat ggtgcagacg agtttatggg tcgctgcatc tgtcaaccga 1560 gtctggaacg gatgccacgg ctggcctggt tccccactgac gaggggcagc cagccgtcgg 1620 gggagctgct ggcctctttt gagctcatcc agagagagaa gccggccatc caccatattc 1680 ctggttttga ggtgcaggag acatcaagga tcctggatga gtctgaggac acagacctgc 1740 cctacccacc accccagagg gaggccaaca tctacatggt tcctcagaac atcaagccag 1800 cgctccagcg taccgccatc gagatcctgg catggggcct gcggaacatg aagagttacc 1860 agctggccaa catctcctcc cccagcctcg tggtagagtg tgggggccag acggtgcagt 1920 cctgtgtcat caggaacctc cggaagaacc ccaactttga catctgcacc ctcttcatgg 1980 aagtgatgct gcccagggag gagctctact gcccccccat caccgtcaag gtcatcgata 2040 accgccagtt tggccgccgg cctgtggtgg gccagtgtac catccgctcc ctggagagct 2100 tcctgtgtga cccctactcg gcggagagtc catccccaca gggtggccca gacgatgtga 2160 gcctactcag tcctgggggaa gacgtgctca tcgacattga tgacaaggag cccctcatcc 2220 ccatccagga ggaagagttc atcgattggt ggagcaaatt ctttgcctcc ataggggaga 2280 gggaaaagtg cggctcctac ctggagaagg attttgacac cctgaaggtc tatgacacac 2340 agctggagaa tgtggaggcc tttgagggcc tgtctgactt ttgtaacacc ttcaagctgt 2400 accggggcaa gacgcaggag gagacagaag atccatctgt gattggtgaa tttaagggcc 2460 tcttcaaaat ttatcccctc ccagaagacc cagccatccc catgccccca agacagttcc 2520 accagctggc cgcccaggga ccccaggagt gcttggtccg tatctacatt gtccgagcat 2580 ttggcctgca gcccaaggac cccaatggaa agtgtgatcc ttacatcaag atctccatag 2640 ggaagaaatc agtgagtgac caggataact acatcccctg cacgctggag cccgtatttg 2700 gaaagatgtt cgagctgacc tgcactctgc ctctggagaa ggacctaaag atcactctct 2760 atgactatga cctcctctcc aaggacgaaa agatcggtga gacggtcgtc gacctggaga 2820 acaggctgct gtccaagttt ggggctcgct gtggactccc acagacctac tgtgtctctg 2880 gaccgaacca gtggcgggac cagctccgcc cctcccagct cctccacctc ttctgccagc 2940 agcatagagt caaggcacct gtgtaccgga cagaccgtgt aatgtttcag gataaagaat 3000 attccattga agagatagag gctggcagga tcccaaaccc acacctgggc ccagtggagg 3060 agcgtctggc tctgcatgtg cttcagcagc agggcctggt cccggagcac gtggagtcac 3120 ggcccctcta cagccccctg cagccagaca tcgagcaggg gaagctgcag atgtgggtcg 3180 acctatttcc gaaggccctg gggcggcctg gacctccctt caacatcacc ccacggagag 3240 ccagaaggtt tttcctgcgt tgtattatct ggaataccag agatgtgatc ctggatgacc 3300 tgagcctcac gggggagaag atgagcgaca tttatgtgaa aggttggatg attggctttg 3360 aagaacacaa gcaaaagaca gacgtgcatt atcgttccct gggaggtgaa ggcaacttca 3420 actggaggtt cattttcccc ttcgactacc tgccagctga gcaagtctgt accattgcca 3480 agaaggatgc cttctggagg ctggacaaga ctgagagcaa aatcccagca cgagtggtgt 3540 tccagatctg ggacaatgac aagttctcct ttgatgattt tctgggctcc ctgcagctcg 3600 atctcaaccg catgcccaag ccagccaaga cagccaagaa gtgctccttg gaccagctgg 3660 atgatgcttt ccacccagaa tggtttgtgt ccctttttga gcagaaaaca gtgaagggct 3720 ggtggccctg tgtagcagaa gagggtgaga agaaaatact ggcgggcaag ctggaaatga 3780 ccttggagat tgtagcagag agtgagcatg aggagcggcc tgctggccag ggccgggatg 3840 agcccaacat gaaccctaag cttgaggacc caaggcgccc cgacacctcc ttcctgtggt 3900 ttacctcccc atacaagacc atgaagttca tcctgtggcg gcgtttccgg tgggccatca 3960 tcctcttcat catcctcttc atcctgctgc tgttcctggc catcttcatc tacgccttcc 4020 cgaactatgc tgccatgaag ctggtgaagc ccttcagctg aggactctcc tgccctgtag 4080 aaggggccgt ggggtcccct ccagcatggg actggcctgc ctcctccgcc cagctcggcg 4140 agctcctcca gacctcctag gcctgattgt cctgccaggg tgggcagaca gacagatgga 4200 ccggcccaca ctcccagagt tgctaacatg gagctctgag atcaccccac ttccatcatt 4260 tccttctccc ccaacccaac gcttttttgg atcagctcag acatatttca gtataaaaca 4320 gttggaacca caaaaaaaaa aaaaaaaagt cgacgcggcc gcaataaaag atctttattt 4380 tcattagatc tgtgtgttgg ttttttgtgt gtctagagca tggctacgta gataagtagc 4440 atggcgggtt aatcattaac tacaaggaac ccctagtgat ggagttggcc actccctctc 4500 tgcgcgctcg ctcgctcact gaggccgggc gaccaaaggt cgcccgacgc ccgggctttg 4560 cccgggcggc ctcagtgagc gagcgagcgc gc 4592 <210> 17 <211> 128 <212> DNA <213> artificial sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <223> 3' ITR <400> 17 aggaacccct agtgatggag ttggccactc cctctctgcg cgctcgctcg ctcactgagg 60 ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc 120 gagcgcgc 128 <210> 18 <211> 8341 <212> DNA <213> artificial sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <223> Plasmid sequence for DYS F5 Expression Cassette <400> 18 gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg 60 tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact ccatcactag 120 gggttccttg tagttaatga ttaacccgcc atgctctaga gtttaagctt gcatgtctaa 180 gctagaccct tcagattaaa aataactgag gtaagggcct gggtagggga ggtggtgtga 240 gacgctcctg tctctcctct atctgcccat cggccctttg gggaggagga atgtgcccaa 300 ggactaaaaa aaggccatgg agccagaggg gcgagggcaa cagacctttc atgggcaaac 360 cttggggccc tgctgtctag catgccccac tacgggtcta ggctgcccat gtaaggaggc 420 aaggcctggg gacacccgag atgcctggtt ataattaacc cagacatgtg gctgcccccc 480 cccccccaac acctgctgcc tctaaaaata accctgtccc tggtggatcc cctgcatgcg 540 aagatcttcg aacaaggctg tgggggactg agggcaggct gtaacaggct tgggggccag 600 ggctttatacg tgcctgggac tcccaaagta ttactgttcc atgttcccgg cgaagggcca 660 gctgtccccc gccagctaga ctcagcactt agtttaggaa ccagtgagca agtcagccct 720 tggggcagcc catacaaggc catggggctg ggcaagctgc acgcctgggt ccggggtggg 780 cacggtgccc gggcaacgag ctgaaagctc atctgctctc aggggcccct ccctggggac 840 agcccctcct ggctagtcac accctgtagg ctcctctata taacccaggg gcacaggggc 900 tgccctcatt ctaccaccac ctccacagca cagacagaca ctcaggagca gccagcggcg 960 cgcccaggta agtttagtct ttttgtcttt tatttcaggt cccggatccg gtggtggtgc 1020 aaatcaaaga actgctcctc agtggatgtt gcctttactt ctaggcctgt acggaagtgt 1080 tacttctgct ctaaaagctg cggaattgta cccgcggccg cggctagcca ccatgctgag 1140 ggtcttcatc ctctatgccg agaacgtcca cacacccgac accgacatca gcgatgccta 1200 ctgctccgcg gtgtttgcag gggtgaagaa gagaaccaaa gtcatcaaga acagcgtgaa 1260 ccctgtatgg aatgagggat ttgaatggga cctcaagggc atccccctgg accagggctc 1320 tgagcttcat gtggtggtca aagaccatga gacgatgggg aggaacaggt tcctggggga 1380 agccaaggtc ccactccgag aggtcctcgc cacccctagt ctgtccgcca gcttcaatgc 1440 ccccctgctg gacaccaaga agcagcccac aggggcctcg ctggtcctgc aggtgtccta 1500 cacaccgctg cctggagctg tgcccctgtt cccgccccct actcctctgg agccctcccc 1560 gactctgcct gacctggatg tagtggcaga cacaggagga gaggaagaca cagaggacca 1620 gggactcact ggagatgagg cggagccatt cctggatcaa agcggaggcc cgggggctcc 1680 caccacccca aggaaactac cttcacgtcc tccgccccac taccccggga tcaaaagaaa 1740 gcgaagtgcg cctacatcta gaaagctgct gtcagacaaa ccgcaggatt tccagatcag 1800 ggtccaggtg atcgaggggc gccagctgcc gggggtgaac atcaagcctg tggtcaaggt 1860 taccgctgca gggcagacca agcggacgcg gatccacaag ggaaacagcc cactcttcaa 1920 tgagactctt ttcttcaact tgtttgactc tcctggggag ctgtttgatg agcccatctt 1980 tatcacggtg gtagactctc gttctctcag gacagatgct ctcctcgggg agttccggat 2040 ggacgtgggc accatttaca gagagccccg gcacgcctat ctcaggaagt ggctgctgct 2100 ctcagaccct gatgacttct ctgctggggc cagaggctac ctgaaaacaa gcctttgtgt 2160 gctggggcct ggggacgaag cgcctctgga gagaaaagac ccctctgaag acaaggagga 2220 cattgaaagc aacctgctcc ggcccacagg cgtagccctg cgaggagccc acttctgcct 2280 gaaggtcttc cgggccgagg acttgccgca gatggacgat gccgtgatgg acaacgtgaa 2340 acagatcttt ggcttcgaga gtaacaagaa gaacttggtg gacccctttg tggaggtcag 2400 ctttgcgggg aaaatgctgt gcagcaagat cttggagaag acggccaacc ctcagtgggaa 2460 ccagaacatc acactgcctg ccatgtttcc ctccatgtgc gaaaaaatga ggattcgtat 2520 catagactgg gaccgcctga ctcacaatga catcgtggct accacctacc tgagtatgtc 2580 gaaaatctct gcccctggag gagaaataga agaggagcct gcaggtgctg tcaagccttc 2640 gaaagcctca gacttggatg actacctggg cttcctcccc acttttgggc cctgctacat 2700 caacctctat ggcagtccca gagagttcac aggcttccca gacccctaca cagagctcaa 2760 2820 gctggtggag cacagtgaac agaaggtgga ggaccttcct gcggatgaca tcctccgggt 2880 ggagaagtac cttaggaggc gcaagtactc cctgtttgcg gccttctact cagccaccat 2940 gctgcaggat gtggatgatg ccatccagtt tgaggtcagc atcgggaact acgggaacaa 3000 gttcgacatg acctgcctgc cgctggcctc caccactcag tacagccgtg cagtctttga 3060 cgggtgccac tactactacc taccctgggg taacgtgaaa cctgtggtgg tgctgtcatc 3120 ctactgggag gacatcagcc atagaatcga gactcagaac cagctgcttg ggattgctga 3180 ccggctggaa gctggcctgg agcaggtcca cctggccctg aaggcgcagt gctccacgga 3240 ggacgtggac tcgctggtgg ctcagctgac ggatgagctc atcgcaggct gcagccagcc 3300 tctgggtgac atccatgaga caccctctgc cacccacctg gaccagtacc tgtaccagct 3360 gcgcacccat cacctgagcc aaatcactga ggctgccctg gccctgaagc tcggccacag 3420 tgagctccct gcagctctgg agcaggcgga ggactggctc ctgcgtctgc gtgccctggc 3480 agaggagccc cagaacagcc tgccggacat cgtcatctgg atgctgcagg gagacaagcg 3540 tgtggcatac cagcgggtgc ccgccaccca agtcctcttc tcccggcggg gtgccaacta 3600 ctgtggcaag aattgtggga agctacagac aatctttctg aaatatccga tggagaaggt 3660 gcctggcgcc cggatgccag tgcagatacg ggtcaagctg tggtttgggc tctctgtgga 3720 tgagaaggag ttcaaccagt ttgctgaggg gaagctgtct gtctttgctg aaacctatga 3780 gaacgagact aagttggccc ttgttgggaa ctggggcaca acgggcctca cctaccccaa 3840 gttttctgac gtcacgggca agatcaagct acccaaggac agcttccgcc cctcggccgg 3900 ctggacctgg gctggagatt ggttcgtgtg tccggagaag actctgctcc atgacatgga 3960 cgccggtcac ctgagcttcg tggaagaggt gtttgagaac cagacccggc ttcccggagg 4020 ccagtggatc tacatgagtg acaactacac cgatgtgaac ggggagaagg tgcttcccaa 4080 ggatgacatt gagtgcccac tgggctggaa gtgggaagat gaggaatggt ccacagacct 4140 caaccgggct gtcgatgagc aaggctggga gtatagcatc accatccccc cggagcggaa 4200 gccgaagcac tgggtccctg ctgagaagat gtactacaca caccgacggc ggcgctgggt 4260 gcgcctgcgc aggagggatc tcagccaaat ggaagcactg aaaaggcaca ggcaggcgga 4320 ggcggagggc gagggctggg agtacgcctc tctttttggc tggaagttcc acctcgagta 4380 ccgcaagaca gatgccttcc gccgccgccg ctggcgccgt cgcatggagc cactggagaa 4440 gacggggcct gcagctgtgt ttgcccttga gggcggccgc aataaaagat ctttattttc 4500 attagatctg tgtgttggtt ttttgtgtgt ctagagcatg gcgggttaat cattaactac 4560 aaggaacccc tagtgatgga gttggccact ccctctctgc gcgctcgctc gctcactgag 4620 gccgggcgac caaaggtcgc ccgacgcccg ggctttgccc gggcggcctc agtgagcgag 4680 cgagcgcgca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg 4740 ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag 4800 cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag 4860 gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc 4920 tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc 4980 agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc 5040 tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt 5100 cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg 5160 ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat 5220 ccggtaacta tcgtcttgag tccaacccgg taagacacga ctttcgcca ctggcagcag 5280 ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt 5340 ggtggcctaa ctacggctac actagaagaa cagtatttgg tatctgcgct ctgctgaagc 5400 cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta 5460 gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag 5520 atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga 5580 ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa 5640 gttttaaatc aatctaaagt atatatgagt aaaaatattc cggaattgcc agctggggcg 5700 ccctctggta aggttgggaa gccctgcaaa gtaaactgga tggctttctt gccgccaagg 5760 atctgatggc gcaggggatc aagatctgat caagagacag gatgaggatc gtttcgcatg 5820 attgaacaag atggattgca cgcaggttct ccggccgctt gggtggagag gctattcggc 5880 tatgactggg cacaacagac aatcggctgc tctgatgccg ccgtgttccg gctgtcagcg 5940 caggggcgcc cggttctttt tgtcaagacc gacctgtccg gtgccctgaa tgaactgcag 6000 gacgaggcag cgcggctatc gtggctggcc acgacgggcg ttccttgcgc agctgtgctc 6060 gacgttgtca ctgaagcggg aagggactgg ctgctattgg gcgaagtgcc ggggcaggat 6120 ctcctgtcat cccaccttgc tcctgccgag aaagtatcca tcatggctga tgcaatgcgg 6180 cggctgcata cgcttgatcc ggctacctgc ccattcgacc accaagcgaa acatcgcatc 6240 gagcgagcac gtactcggat ggaagccggt cttgtcgatc aggatgatct ggacgaagag 6300 catcaggggc tcgcgccagc cgaactgttc gccaggctca aggcgcgcat gcccgacggc 6360 gaggatctcg tcgtgaccca tggcgatgcc tgcttgccga atatcatggt ggaaaatggc 6420 cgcttttctg gattcatcga ctgtggccgg ctgggtgtgg cggaccgcta tcaggacata 6480 gcgttggcta cccgtgatat tgctgaagag cttggcggcg aatgggctga ccgcttcctc 6540 gtgctttacg gtatcgccgc tcccgattcg cagcgcatcg ccttctatcg ccttcttgac 6600 gagttcttct gaaccggtaa tattattgaa gcatttatca gggttattgt ctcatgagcg 6660 gatacatatt tgaatgtatt tagaaaata aacaaatagg ggttccgcgc acatttcccc 6720 gaaaagtgcc acctgacgtc taagaaacca ttattatcat gacattaacc tataaaaata 6780 ggcgtatcac gaggcccttt cgtctcgcgc gtttcggtga tgacggtgaa aacctctgac 6840 acatgcagct cccggagacg gtcacagctt gtctgtaagc ggatgccggg agcagacaag 6900 cccgtcaggg cgcgtcagcg ggtgttggcg ggtgtcgggg ctggcttaac tatgcggcat 6960 cagagcagat tgtactgaga gtgcaccata tgcggtgtga aataccgcac agatgcgtaa 7020 ggagaaaata ccgcatcagg cgattccaac atccaataaa tcatacaggc aaggcaaaga 7080 attagcaaaa ttaagcaata aagcctcaga gcataaagct aaatcggttg taccaaaaac 7140 attatgaccc tgtaatactt ttgcgggaga agcctttatt tcaacgcaag gataaaaatt 7200 tttagaaccc tcatatattt taaatgcaat gcctgagtaa tgtgtaggta aagattcaaa 7260 cgggtgagaa aggccggaga cagtcaaatc accatcaata tgatattcaa ccgttctagc 7320 tgataaattc atgccggaga gggtagctat ttttgagagg tctctacaaa ggctatcagg 7380 tcattgcctg agagtctgga gcaaacaaga gaatcgatga acggtaatcg taaaactagc 7440 atgtcaatca tatgtacccc ggttgataat cagaaaagcc ccaaaaacag gaagatgta 7500 taagcaaata tttaaattgt aagcgttaat attttgttaa aattcgcgtt aaatttttgt 7560 taaatcagct cattttttaa ccaataggcc gaaatcggca aaatccctta taaatcaaaa 7620 gaatagaccg agatagggtt gagtgttgtt ccagtttgga acaagagtcc actattaaag 7680 aacgtggact ccaacgtcaa agggcgaaaa accgtctatc agggcgatgg cccactacgt 7740 gaaccatcac cctaatcaag ttttttgggg tcgaggtgcc gtaaagcact aaatcggaac 7800 cctaaaggga gcccccgatt tagagcttga cggggaaagc cggcgaacgt ggcgagaaag 7860 gaagggaaga aagcgaaagg agcgggcgct agggcgctgg caagtgtagc ggtcacgctg 7920 cgcgtaacca ccacacccgc cgcgcttaat gcgccgctac agggcgcgta ctatggttgc 7980 tttgacgagc acgtataacg tgctttcctc gttagaatca gagcgggagc taaacaggag 8040 gccgattaaa gggattttag acaggaacgg tacgccagaa tcctgagaag tgtttttata 8100 atcagtgagg ccaccgagta aaagagtctg tccatcacgc aaattaaccg ttgtcgcaat 8160 acttctttga ttagtaataa catcacttgc ctgagtagaa gaactcaaac tatcggcctt 8220 gctggtaata tccagaacaa tattaccgcc agccattgca acggaatcgc cattcgccat 8280 tcaggctgcg caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc 8340 t8341 <210> 19 <211> 8244 <212> DNA <213> artificial sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <223> Plasmid sequence for DYS F3 Expression Cassette <400> 19 gtagccatgc tctggtcgac tctagagttt tcgtcatctg gatgctgcag ggagacaagc 60 gtgtggcata ccagcgggtg cccgcccacc aagtcctctt ctcccggcgg ggtgccaact 120 actgtggcaa gaattgtggg aagctacaga caatctttct gaaatatccg atggagaagg 180 tgcctggcgc ccggatgcca gtgcagatac gggtcaagct gtggtttggg ctctctgtgg 240 atgagaagga gttcaaccag tttgctgagg ggaagctgtc tgtctttgct gaaacctatg 300 agaacgagac taagttggcc cttgttggga actggggcac aacgggcctc acctacccca 360 agtttctga cgtcacgggc aagatcaagc tacccaagga cagcttccgc ccctcggccg 420 gctggacctg ggctggagat tggttcgtgt gtccggagaa gactctgctc catgacatgg 480 acgccggtca cctgagcttc gtggaagagg tgtttgagaa ccagacccgg cttcccggag 540 gccagtggat ctacatgagt gacaactaca ccgatgtgaa cggggagaag gtgcttccca 600 aggatgacat tgagtgccca ctgggctgga agtgggaaga tgaggaatgg tccacagacc 660 tcaaccgggc tgtcgatgag caaggctggg agtatagcat caccatcccc ccggagcgga 720 agccgaagca ctgggtccct gctgagaaga tgtactacac acaccgacgg cggcgctggg 780 tgcgcctgcg caggagggat ctcagccaaa tggaagcact gaaaaggcac aggcaggcgg 840 aggcggaggg cgagggctgg gagtacgcct ctctttttgg ctggaagttc cacctcgagt 900 accgcaagac agatgccttc cgccgccgcc gctggcgccg tcgcatggag ccactggaga 960 agacggggcc tgcagctgtg tttgcccttg agggggccct gggcggcgtg atggatgaca 1020 agagtgaaga ttccatgtcc gtctccacct tgagcttcgg tgtgaacaga cccacgattt 1080 cctgcatatt cgactatggg aaccgctacc atctacgctg ctacatgtac caggcccggg 1140 acctggctgc gatggacaag gactcttttt ctgatcccta tgccatcgtc tccttcctgc 1200 accagagcca gaagacggtg gtggtgaaga acacccttaa ccccacctgg gaccagacgc 1260 tcatcttcta cgagatcgag atctttggcg agccggccac agttgctgag caaccgccca 1320 gcattgtggt ggagctgtac gaccatgaca cttatggtgc agacgagttt atgggtcgct 1380 gcatctgtca accgagtctg gaacggatgc cacggctggc ctggttccca ctgacgaggg 1440 gcagccagcc gtcgggggag ctgctggcct cttttgagct catccagaga gagaagccgg 1500 ccatccacca tattcctggt tttgaggtgc aggagacatc aaggatcctg gatgagtctg 1560 aggacacaga cctgccctac ccaccacccc agagggaggc caacatctac atggttcctc 1620 agaacatcaa gccagcgctc cagcgtaccg ccatcgagat cctggcatgg ggcctgcgga 1680 acatgaagag ttaccagctg gccaacatct cctcccccag cctcgtggta gagtgtgggg 1740 gccagacggt gcagtcctgt gtcatcagga acctccggaa gaaccccaac tttgacatct 1800 gcaccctctt catggaagtg atgctgccca gggaggagct ctactgcccc cccatcaccg 1860 tcaaggtcat cgataaccgc cagtttggcc gccggcctgt ggtgggccag tgtaccatcc 1920 gctccctgga gagcttcctg tgtgacccct actcggcgga gagtccatcc ccacagggtg 1980 gcccagacga tgtgagccta ctcagtcctg gggaagacgt gctcatcgac attgatgaca 2040 aggagcccct catccccatc caggaggaag agttcatcga ttggtggagc aaattctttg 2100 cctccatagg ggagagggaa aagtgcggct cctacctgga gaaggatttt gacaccctga 2160 aggtctatga cacacagctg gagaatgtgg aggcctttga gggcctgtct gacttttgta 2220 acaccttcaa gctgtaccgg ggcaagacgc aggagggagac agaagatcca tctgtgattg 2280 gtgaatttaa gggcctcttc aaaatttatc ccctcccaga agacccagcc atccccatgc 2340 ccccaagaca gttccaccag ctggccgccc agggacccca ggagtgcttg gtccgtatct 2400 acattgtccg agcatttggc ctgcagccca aggaccccaa tggaaagtgt gatccttaca 2460 tcaagatctc catagggaag aaatcagtga gtgaccagga taactacatc ccctgcacgc 2520 tggagcccgt atttggaaag atgttcgagc tgacctgcac tctgcctctg gagaaggacc 2580 taaagatcac tctctatgac tatgacctcc tctccaagga cgaaaagatc ggtgagacgg 2640 tcgtcgacct ggagaacagg ctgctgtcca agtttggggc tcgctgtgga ctcccacaga 2700 cctactgtgt ctctggaccg aaccagtggc gggaccagct ccgcccctcc cagctcctcc 2760 acctcttctg ccagcagcat agagtcaagg cacctgtgta ccggacagac cgtgtaatgt 2820 ttcaggataa agaatattcc attgaagaga tagaggctgg caggatccca aacccacacc 2880 tgggcccagt ggaggagcgt ctggctctgc atgtgcttca gcagcagggc ctggtcccgg 2940 agcacgtgga gtcacggccc ctctacagcc ccctgcagcc agacatcgag caggggaagc 3000 tgcagatgtg ggtcgaccta tttccgaagg ccctggggcg gcctggacct cccttcaaca 3060 tcaccccacg gagagccaga aggtttttcc tgcgttgtat tatctggaat accagagatg 3120 tgatcctgga tgacctgagc ctcacggggg agaagatgag cgacatttat gtgaaaggtt 3180 ggatgattgg ctttgaagaa cacaagcaaa agacagacgt gcattatcgt tccctggggag 3240 gtgaaggcaa cttcaactgg aggttcattt tccccttcga ctacctgcca gctgagcaag 3300 tctgtaccat tgccaagaag gatgccttct ggaggctgga caagactgag agcaaaatcc 3360 cagcacgagt ggtgttccag atctggggaca atgacaagtt ctcctttgat gattttctgg 3420 gctccctgca gctcgatctc aaccgcatgc ccaagccagc caagacagcc aagaagtgct 3480 ccttggacca gctggatgat gctttccacc cagaatggtt tgtgtccctt tttgagcaga 3540 aaacagtgaa gggctggtgg ccctgtgtag cagaagaggg tgagaagaaa atactggcgg 3600 gcaagctgga aatgaccttg gagattgtag cagagagtga gcatgaggag cggcctgctg 3660 gccagggccg ggatgagccc aacatgaacc ctaagcttga ggacccaagg cgccccgaca 3720 cctccttcct gtggtttacc tccccataca agaccatgaa gttcatcctg tggcggcgtt 3780 tccggtgggc catcatcctc ttcatcatcc tcttcatcct gctgctgttc ctggccatct 3840 tcatctacgc cttcccgaac tatgctgcca tgaagctggt gaagcccttc agctgaggac 3900 tctcctgccc tgtagaaggg gccgtggggt cccctccagc atgggactgg cctgcctcct 3960 ccgcccagct cggcgagctc ctccagacct cctaggcctg attgtcctgc cagggtggggc 4020 agacagacag atggaccggc ccacactccc agagttgcta acatggagct ctgagatcac 4080 cccacttcca tcatttcctt ctcccccaac ccaacgcttt tttggatcag ctcagacata 4140 tttcagtata aaacagttgg aaccacaaaa aaaaaaaaaa aaagtcgacg cggccgcaat 4200 aaaagatctt tattttcatt agatctgtgt gttggttttt tgtgtgtcta gagcatggct 4260 acgtagataa gtagcatggc gggttaatca ttaactacaa ggaaccccta gtgatggagt 4320 tggccactcc ctctctgcgc gctcgctcgc tcactgaggc cgggcgacca aaggtcgccc 4380 gacgcccggg ctttgcccgg gcggcctcag tgagcgagcg agcgcgcagc tgcattaatg 4440 aatcggccaa cgcgcgggga gaggcggttt gcgtattggg cgctcttccg cttcctcgct 4500 cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc 4560 ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg 4620 ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg 4680 cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg 4740 actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac 4800 cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca 4860 tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt 4920 gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc 4980 caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag 5040 agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac 5100 tagaagaaca gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt 5160 tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa 5220 gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg 5280 gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa 5340 aaggatcttc acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat 5400 atatgagtaa aaatattccg gaattgccag ctggggcgcc ctctggtaag gttgggaagc 5460 cctgcaaagt aaactggatg gctttcttgc cgccaaggat ctgatggcgc aggggatcaa 5520 gatctgatca agagacagga tgaggatcgt ttcgcatgat tgaacaagat ggattgcacg 5580 caggttctcc ggccgcttgg gtggagaggc tattcggcta tgactgggca caacagacaa 5640 tcggctgctc tgatgccgcc gtgttccggc tgtcagcgca ggggcgcccg gttctttttg 5700 tcaagaccga cctgtccggt gccctgaatg aactgcagga cgaggcagcg cggctatcgt 5760 ggctggccac gacgggcgtt ccttgcgcag ctgtgctcga cgttgtcact gaagcgggaa 5820 gggactggct gctattgggc gaagtgccgg ggcaggatct cctgtcatcc caccttgctc 5880 ctgccgagaa agtatccatc atggctgatg caatgcggcg gctgcatacg cttgatccgg 5940 ctacctgccc attcgaccac caagcgaaac atcgcatcga gcgagcacgt actcggatgg 6000 aagccggtct tgtcgatcag gatgatctgg acgaagagca tcaggggctc gcgccagccg 6060 aactgttcgc caggctcaag gcgcgcatgc ccgacggcga ggatctcgtc gtgacccatg 6120 gcgatgcctg cttgccgaat atcatggtgg aaaatggccg cttttctgga ttcatcgact 6180 gtggccggct gggtgtggcg gaccgctatc aggacatagc gttggctacc cgtgatattg 6240 ctgaagagct tggcggcgaa tgggctgacc gcttcctcgt gctttacggt atcgccgctc 6300 ccgattcgca gcgcatcgcc ttctatcgcc ttcttgacga gttcttctga accggtaata 6360 ttatattgaagc atttatcagg gttattgtct catgagcgga tacatatttg aatgtattta 6420 gaaaaataaa caaatagggg ttccgcgcac atttccccga aaagtgccac ctgacgtcta 6480 agaaaccatt attatcatga cattaaccta taaaaatagg cgtatcacga ggccctttcg 6540 tctcgcgcgt ttcggtgatg acggtgaaaa cctctgacac atgcagctcc cggagacggt 6600 cacagcttgt ctgtaagcgg atgccgggag cagacaagcc cgtcagggcg cgtcagcggg 6660 tgttggcggg tgtcggggct ggcttaacta tgcggcatca gagcagattg tactgagagt 6720 gcaccatatg cggtgtgaaa taccgcacag atgcgtaagg agaaaatacc gcatcaggcg 6780 attccaacat ccaataaatc atacaggcaa ggcaaagaat tagcaaaatt aagcaataaa 6840 gcctcagagc ataaagctaa atcggttgta ccaaaaacat tatgaccctg taatactttt 6900 gcgggagaag cctttatttc aacgcaagga taaaaatttt tagaaccctc atatatttta 6960 aatgcaatgc ctgagtaatg tgtaggtaaa gattcaaacg ggtgagaaag gccggagaca 7020 gtcaaatcac catcaatatg atattcaacc gttctagctg ataaattcat gccggagagg 7080 gtagctattt ttgagaggtc tctacaaagg ctatcaggtc attgcctgag agtctggagc 7140 aaacaagaga atcgatgaac ggtaatcgta aaactagcat gtcaatcata tgtaccccgg 7200 ttgataatca gaaaagcccc aaaaacagga agattgtata agcaaatatt taaattgtaa 7260 gcgttaatat tttgttaaaa ttcgcgttaa atttttgtta aatcagctca ttttttaacc 7320 aataggccga aatcggcaaa atcccttata aatcaaaaga atagaccgag atagggttga 7380 gtgttgttcc agtttggaac aagagtccac tattaaagaa cgtggactcc aacgtcaaag 7440 ggcgaaaaac cgtctatcag ggcgatggcc cactacgtga accatcaccc taatcaagtt 7500 ttttggggtc gaggtgccgt aaagcactaa atcggaaccc taaagggagc ccccgattta 7560 gagcttgacg gggaaagccg gcgaacgtgg cgagaaagga agggaagaaa gcgaaaggag 7620 cgggcgctag ggcgctggca aggttagcgg tcacgctgcg cgtaaccacc acacccgccg 7680 cgcttaatgc gccgctacag ggcgcgtact atggttgctt tgacgagcac gtataacgtg 7740 ctttcctcgt tagaatcaga gcgggagcta aacaggaggc cgattaaagg gattttagac 7800 aggaacggta cgccagaatc ctgagaagtg tttttataat cagtgaggcc accgagtaaa 7860 agagtctgtc catcacgcaa attaaccgtt gtcgcaatac ttctttgatt agtaataaca 7920 tcacttgcct gagtagaaga actcaaacta tcggccttgc tggtaatatc cagaacaata 7980 ttaccgccag ccattgcaac ggaatcgcca ttcgccattc aggctgcgca actgttggga 8040 agggcgatcg gtgcgggcct cttcgctatt acgccagctg cgcgctcgct cgctcactga 8100 ggccgcccgg gcaaagcccg ggcgtcgggc gacctttggt cgcccggcct cagtgagcga 8160 gcgagcgcgc agagagggag tggccaactc catcactagg ggttccttgt agttaatgat 8220 taacccgcca tgctacttat ctac 8244 <210> 20 <211> 7906 <212> DNA <213> artificial sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <223> full DYF expression cassette <400> 20 gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg 60 tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact ccatcactag 120 gggttccttg tagttaatga ttaacccgcc atgctctaga gtttaagctt gcatgtctaa 180 gctagaccct tcagattaaa aataactgag gtaagggcct gggtagggga ggtggtgtga 240 gacgctcctg tctctcctct atctgcccat cggccctttg gggaggagga atgtgcccaa 300 ggactaaaaa aaggccatgg agccagaggg gcgagggcaa cagacctttc atgggcaaac 360 cttggggccc tgctgtctag catgccccac tacgggtcta ggctgcccat gtaaggaggc 420 aaggcctggg gacacccgag atgcctggtt ataattaacc cagacatgtg gctgcccccc 480 cccccccaac acctgctgcc tctaaaaata accctgtccc tggtggatcc cctgcatgcg 540 aagatcttcg aacaaggctg tgggggactg agggcaggct gtaacaggct tgggggccag 600 ggctttatacg tgcctgggac tcccaaagta ttactgttcc atgttcccgg cgaagggcca 660 gctgtccccc gccagctaga ctcagcactt agtttaggaa ccagtgagca agtcagccct 720 tggggcagcc catacaaggc catggggctg ggcaagctgc acgcctgggt ccggggtggg 780 cacggtgccc gggcaacgag ctgaaagctc atctgctctc aggggcccct ccctggggac 840 agcccctcct ggctagtcac accctgtagg ctcctctata taacccaggg gcacaggggc 900 tgccctcatt ctaccaccac ctccacagca cagacagaca ctcaggagca gccagcggcg 960 cgcccaggta agtttagtct ttttgtcttt tatttcaggt cccggatccg gtggtggtgc 1020 aaatcaaaga actgctcctc agtggatgtt gcctttactt ctaggcctgt acggaagtgt 1080 tacttctgct ctaaaagctg cggaattgta cccgcggccg cggctagcca ccatgctgag 1140 ggtcttcatc ctctatgccg agaacgtcca cacacccgac accgacatca gcgatgccta 1200 ctgctccgcg gtgtttgcag gggtgaagaa gagaaccaaa gtcatcaaga acagcgtgaa 1260 ccctgtatgg aatgagggat ttgaatggga cctcaagggc atccccctgg accagggctc 1320 tgagcttcat gtggtggtca aagaccatga gacgatgggg aggaacaggt tcctggggga 1380 agccaaggtc ccactccgag aggtcctcgc cacccctagt ctgtccgcca gcttcaatgc 1440 ccccctgctg gacaccaaga agcagcccac aggggcctcg ctggtcctgc aggtgtccta 1500 cacaccgctg cctggagctg tgcccctgtt cccgccccct actcctctgg agccctcccc 1560 gactctgcct gacctggatg tagtggcaga cacaggagga gaggaagaca cagaggacca 1620 gggactcact ggagatgagg cggagccatt cctggatcaa agcggaggcc cgggggctcc 1680 caccacccca aggaaactac cttcacgtcc tccgccccac taccccggga tcaaaagaaa 1740 gcgaagtgcg cctacatcta gaaagctgct gtcagacaaa ccgcaggatt tccagatcag 1800 ggtccaggtg atcgaggggc gccagctgcc gggggtgaac atcaagcctg tggtcaaggt 1860 taccgctgca gggcagacca agcggacgcg gatccacaag ggaaacagcc cactcttcaa 1920 tgagactctt ttcttcaact tgtttgactc tcctggggag ctgtttgatg agcccatctt 1980 tatcacggtg gtagactctc gttctctcag gacagatgct ctcctcgggg agttccggat 2040 ggacgtgggc accatttaca gagagccccg gcacgcctat ctcaggaagt ggctgctgct 2100 ctcagaccct gatgacttct ctgctggggc cagaggctac ctgaaaacaa gcctttgtgt 2160 gctggggcct ggggacgaag cgcctctgga gagaaaagac ccctctgaag acaaggagga 2220 cattgaaagc aacctgctcc ggcccacagg cgtagccctg cgaggagccc acttctgcct 2280 gaaggtcttc cgggccgagg acttgccgca gatggacgat gccgtgatgg acaacgtgaa 2340 acagatcttt ggcttcgaga gtaacaagaa gaacttggtg gacccctttg tggaggtcag 2400 ctttgcgggg aaaatgctgt gcagcaagat cttggagaag acggccaacc ctcagtgggaa 2460 ccagaacatc acactgcctg ccatgtttcc ctccatgtgc gaaaaaatga ggattcgtat 2520 catagactgg gaccgcctga ctcacaatga catcgtggct accacctacc tgagtatgtc 2580 gaaaatctct gcccctggag gagaaataga agaggagcct gcaggtgctg tcaagccttc 2640 gaaagcctca gacttggatg actacctggg cttcctcccc acttttgggc cctgctacat 2700 caacctctat ggcagtccca gagagttcac aggcttccca gacccctaca cagagctcaa 2760 2820 gctggtggag cacagtgaac agaaggtgga ggaccttcct gcggatgaca tcctccgggt 2880 ggagaagtac cttaggaggc gcaagtactc cctgtttgcg gccttctact cagccaccat 2940 gctgcaggat gtggatgatg ccatccagtt tgaggtcagc atcgggaact acgggaacaa 3000 gttcgacatg acctgcctgc cgctggcctc caccactcag tacagccgtg cagtctttga 3060 cgggtgccac tactactacc taccctgggg taacgtgaaa cctgtggtgg tgctgtcatc 3120 ctactgggag gacatcagcc atagaatcga gactcagaac cagctgcttg ggattgctga 3180 ccggctggaa gctggcctgg agcaggtcca cctggccctg aaggcgcagt gctccacgga 3240 ggacgtggac tcgctggtgg ctcagctgac ggatgagctc atcgcaggct gcagccagcc 3300 tctgggtgac atccatgaga caccctctgc cacccacctg gaccagtacc tgtaccagct 3360 gcgcacccat cacctgagcc aaatcactga ggctgccctg gccctgaagc tcggccacag 3420 tgagctccct gcagctctgg agcaggcgga ggactggctc ctgcgtctgc gtgccctggc 3480 agaggagccc cagaacagcc tgccggacat cgtcatctgg atgctgcagg gagacaagcg 3540 tgtggcatac cagcgggtgc ccgccaccca agtcctcttc tcccggcggg gtgccaacta 3600 ctgtggcaag aattgtggga agctacagac aatctttctg aaatatccga tggagaaggt 3660 gcctggcgcc cggatgccag tgcagatacg ggtcaagctg tggtttgggc tctctgtgga 3720 tgagaaggag ttcaaccagt ttgctgaggg gaagctgtct gtctttgctg aaacctatga 3780 gaacgagact aagttggccc ttgttgggaa ctggggcaca acgggcctca cctaccccaa 3840 gttttctgac gtcacgggca agatcaagct acccaaggac agcttccgcc cctcggccgg 3900 ctggacctgg gctggagatt ggttcgtgtg tccggagaag actctgctcc atgacatgga 3960 cgccggtcac ctgagcttcg tggaagaggt gtttgagaac cagacccggc ttcccggagg 4020 ccagtggatc tacatgagtg acaactacac cgatgtgaac ggggagaagg tgcttcccaa 4080 ggatgacatt gagtgcccac tgggctggaa gtgggaagat gaggaatggt ccacagacct 4140 caaccgggct gtcgatgagc aaggctggga gtatagcatc accatccccc cggagcggaa 4200 gccgaagcac tgggtccctg ctgagaagat gtactacaca caccgacggc ggcgctgggt 4260 gcgcctgcgc aggagggatc tcagccaaat ggaagcactg aaaaggcaca ggcaggcgga 4320 ggcggagggc gagggctggg agtacgcctc tctttttggc tggaagttcc acctcgagta 4380 ccgcaagaca gatgccttcc gccgccgccg ctggcgccgt cgcatggagc cactggagaa 4440 gacggggcct gcagctgtgt ttgcccttga gggggccctg ggcggcgtga tggatgacaa 4500 gagtgaagat tccatgtccg tctccacctt gagcttcggt gtgaacagac ccacgatttc 4560 ctgcatattc gactatggga accgctacca tctacgctgc tacatgtacc aggcccggga 4620 cctggctgcg atggacaagg actctttttc tgatccctat gccatcgtct ccttcctgca 4680 ccagagccag aagacggtgg tggtgaagaa cacccttaac cccacctggg accagacgct 4740 catcttctac gagatcgaga tctttggcga gccggccaca gttgctgagc aaccgcccag 4800 cattgtggtg gagctgtacg accatgacac ttatggtgca gacgagttta tgggtcgctg 4860 catctgtcaa ccgagtctgg aacggatgcc acggctggcc tggttcccac tgacgagggg 4920 cagccagccg tcgggggagc tgctggcctc ttttgagctc atccagagag agaagccggc 4980 catccaccat attcctggtt ttgaggtgca ggagacatca aggatcctgg atgagtctga 5040 ggacacagac ctgccctacc caccacccca gagggaggcc aacatctaca tggttcctca 5100 gaacatcaag ccagcgctcc agcgtaccgc catcgagatc ctggcatggg gcctgcggaa 5160 catgaagagt taccagctgg ccaacatctc ctcccccagc ctcgtggtag agtgtggggg 5220 ccagacggtg cagtcctgtg tcatcaggaa cctccggaag aaccccaact ttgacatctg 5280 caccctcttc atggaagtga tgctgcccag ggaggagctc tactgccccc ccatcaccgt 5340 caaggtcatc gataaccgcc agtttggccg ccggcctgtg gtgggccagt gtaccatccg 5400 ctccctggag agcttcctgt gtgaccccta ctcggcggag agtccatccc cacagggtgg 5460 cccagacgat gtgagcctac tcagtcctgg ggaagacgtg ctcatcgaca ttgatgacaa 5520 ggagcccctc atccccatcc aggaggaaga gttcatcgat tggtggagca aattctttgc 5580 ctccataggg gagagggaaa agtgcggctc ctacctggag aaggattttg acaccctgaa 5640 ggtctatgac acacagctgg agaatgtgga ggcctttgag ggcctgtctg acttttgtaa 5700 caccttcaag ctgtaccggg gcaagacgca ggaggagaca gaagatccat ctgtgattgg 5760 tgaatttaag ggcctcttca aaatttatcc cctcccagaa gacccagcca tccccatgcc 5820 cccaagacag ttccaccagc tggccgccca gggaccccag gagtgcttgg tccgtatcta 5880 cattgtccga gcatttggcc tgcagcccaa ggaccccaat ggaaagtgtg atccttacat 5940 caagatctcc atagggaaga aatcagtgag tgaccaggat aactacatcc cctgcacgct 6000 ggagcccgta tttgggaaaga tgttcgagct gacctgcact ctgcctctgg agaaggacct 6060 aaagatcact ctctatgact atgacctcct ctccaaggac gaaaagatcg gtgagacggt 6120 cgtcgacctg gagaacaggc tgctgtccaa gtttggggct cgctgtggac tcccacagac 6180 ctactgtgtc tctggaccga accagtggcg ggaccagctc cgcccctccc agctcctcca 6240 cctcttctgc cagcagcata gagtcaaggc acctgtgtac cggacagacc gtgtaatgtt 6300 tcaggataaa gaatattcca ttgaagagat agaggctggc aggatcccaa acccacacct 6360 gggcccagtg gaggagcgtc tggctctgca tgtgcttcag cagcagggcc tggtcccgga 6420 gcacgtggag tcacggcccc tctacagccc cctgcagcca gacatcgagc aggggaagct 6480 gcagatgtgg gtcgacctat ttccgaaggc cctggggcgg cctggacctc ccttcaacat 6540 caccccacgg agagccagaa ggtttttcct gcgttgtatt atctggaata ccagagatgt 6600 gatcctggat gacctgagcc tcacggggga gaagatgagc gacatttatg tgaaaggttg 6660 gatgattggc tttgaagaac acaagcaaaa gacagacgtg cattatcgtt ccctgggagg 6720 tgaaggcaac ttcaactgga ggttcatttt ccccttcgac tacctgccag ctgagcaagt 6780 ctgtaccatt gccaagaagg atgccttctg gaggctggac aagactgaga gcaaaatccc 6840 agcacgagtg gtgttccaga tctggggacaa tgacaagttc tcctttgatg attttctggg 6900 ctccctgcag ctcgatctca accgcatgcc caagccagcc aagacagcca agaagtgctc 6960 cttggaccag ctggatgatg ctttccaccc agaatggttt gtgtcccttt ttgagcagaa 7020 aacagtgaag ggctggtggc cctggttagc agaagagggt gagaagaaaa tactggcggg 7080 caagctgggaa atgaccttgg agattgtagc agagagtgag catgaggagc ggcctgctgg 7140 ccagggccgg gatgagccca acatgaaccc taagcttgag gacccaaggc gccccgacac 7200 ctccttcctg tggtttacct ccccatacaa gaccatgaag ttcatcctgt ggcggcgttt 7260 ccggtgggcc atcatcctct tcatcatcct cttcatcctg ctgctgttcc tggccatctt 7320 catctacgcc ttcccgaact atgctgccat gaagctggtg aagcccttca gctgaggact 7380 ctcctgccct gtagaagggg ccgtggggtc ccctccagca tgggactggc ctgcctcctc 7440 cgcccagctc ggcgagctcc tccagacctc ctaggcctga ttgtcctgcc agggtgggca 7500 gacagacaga tggaccggcc cacactccca gagttgctaa catggagctc tgagatcacc 7560 ccacttccat catttccttc tcccccaacc caacgctttt ttggatcagc tcagacatat 7620 ttcagtataa aacagttgga accacaaaaa aaaaaaaaaa aagtcgacgc ggccgcaata 7680 aaagatcttt attttcatta gatctgtgtg ttggtttttt gtgtgtctag agcatggcta 7740 cgtagataag tagcatggcg ggttaatcat taactacaag gaacccctag tgatggagtt 7800 ggccactccc tctctgcgcg ctcgctcgct cactgaggcc gggcgaccaa aggtcgcccg 7860 acgcccgggc tttgcccggg cggcctcagt gagcgagcga gcgcgc 7906 <210> 21 <211> 6 <212> PRT <213> artificial sequence <220> <223> Description of Artificial Sequence: Synthetic 6xHis tag <400> 21 His His His His His His 1 5 <210> 22 <211> 250 <212> RNA <213> artificial sequence <220> <223> Description of Artificial Sequence: Synthetic polynucleotide <220> <221> misc_feature <222> (1)..(250) <223> This sequence may encompass 100-250 nucleotides <400> 22 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 60 120 180 240 aaaaaaaaaa 250

Claims

A recombinant polynucleotide encoding a fragment of human dysferlin (hDYSF) protein,
The recombinant polynucleotide comprises a first nucleotide sequence, wherein the first nucleotide sequence is:
(a) the nucleotide sequence of SEQ ID NO: 1, 6, or 18;
(b) at least 90%, 91%, 92% of the nucleotide sequence of SEQ ID NO: 1, SEQ ID NO: 6, or SEQ ID NO: 18 over their respective full length of SEQ ID NO: 1, 6, or 18; 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical nucleotide sequences;
(c) the nucleotide sequence of SEQ ID NO: 13 or 15;
(d) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the nucleotide sequence of SEQ ID NO: 13 or 15 over their respective full length of SEQ ID NO: 13 or 15; , 98%, or 99% identical nucleotide sequences;
(e) a nucleotide sequence encoding a fragment of the hDYSF protein, the fragment consisting of the amino acid sequence of SEQ ID NO: 9; or
(f) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the nucleotide sequence of (e) over the entire length of the nucleotide sequence of (e); %, or 99% identical nucleotide sequences;
Consisting of, a recombinant polynucleotide.

According to claim 1,
Further comprising one or more additional nucleotide sequences selected from an inverted terminal repeat (ITR), promoter, intron, selection marker, or origin of replication (ORI) To, recombinant polynucleotide.

According to claim 2,
wherein the ITR is an AAV ITR.

According to claim 3,
wherein the AAV ITR is an AAV2 ITR or an AAV3 ITR.

According to any one of claims 2 to 4,
Wherein the recombinant polynucleotide comprises two ITRs.

According to any one of claims 2 to 5,
Wherein the ITR comprises the nucleotide sequence of SEQ ID NO: 3 or SEQ ID NO: 17.

According to any one of claims 2 to 6,
The promoter is a recombinant polynucleotide comprising a muscle-specific promoter.

According to claim 7,
The muscle-specific promoter includes human skeletal actin gene element, cardiac actin gene element, desmin promoter, skeletal alpha-actin (ASKA) promoter, troponin I (TNNI2) promoter, myocyte-specific enhancer binding factor mef binding element, muscle creatine kinase (MCK) promoter, truncated MCK (tMCK) promoter, myosin heavy chain (MHC) promoter, hybrid a-myosin heavy chain enhancer-/MCK enhancer-promoter (MHCK7) promoter, C5-12 promoter, murine creatine kinase enhancer element , skeletal fast-twitch troponin c gene element, slow-twitch cardiac troponin c gene element, slow-twitch troponin i gene element, hypoxia-inducible nuclear factor ( A recombinant polynucleotide selected from hypoxia-inducible nuclear factor).

According to claim 7,
Wherein the muscle-specific promoter comprises the MHCK7 promoter.

According to any one of claims 2 to 6,
The promoter comprises a recombinant promoter, recombinant polynucleotide.

According to claim 10,
Wherein the recombinant promoter comprises a recombinant muscle-specific promoter.

According to any one of claims 2 to 11,
The promoter comprises the nucleotide sequence of SEQ ID NO: 4, recombinant polynucleotide.

According to any one of claims 2 to 12,
The recombinant polynucleotide of claim 1, wherein the intron comprises a 5' donor site, a branch point, and/or a 3' splice site.

According to any one of claims 2 to 13,
The recombinant polynucleotide, wherein the intron includes a chimeric intron.

According to any one of claims 2 to 14,
The recombinant polynucleotide of claim 1, wherein the intron comprises a 5' donor site from a human β-globin gene.

According to any one of claims 2 to 15,
The intron comprises a branch point from an immunoglobulin G (IgG) heavy chain.

According to any one of claims 2 to 16,
The recombinant polynucleotide of claim 1, wherein the intron comprises a 3' splice acceptor site from an immunoglobulin G (IgG) heavy chain.

According to any one of claims 2 to 17,
The intron comprises the nucleotide sequence of SEQ ID NO: 5, recombinant polynucleotide.

According to any one of claims 2 to 18,
The selection marker is an antibiotic resistance gene, recombinant polynucleotide.

According to claim 19,
The antibiotic resistance gene is a β-lactamase gene or kanamycin resistance gene, recombinant polynucleotide.

According to claim 1,
The recombinant polynucleotide comprises a nucleotide sequence of SEQ ID NO: 6 or SEQ ID NO: 18.

According to any one of claims 1 to 21,
Wherein the recombinant nucleotide further does not include a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

23. The method of any one of claims 1 to 22,
wherein the recombinant nucleotide does not contain AAV sequences other than one or more ITRs.

23. The method of any one of claims 1 to 22,
wherein the recombinant nucleotide does not contain viral sequences other than one or more ITRs.

A recombinant polynucleotide sequence encoding a fragment of human dysferrin protein,
The recombinant polynucleotide comprises a second nucleotide sequence, wherein the second nucleotide sequence:
(a) the nucleotide sequence of SEQ ID NO: 2, 8 or 19;
(b) at least 90%, 91%, 92%, 93%, 94%, 95%, 96% of the nucleotide sequence of SEQ ID NO: 2, 8 or 19 over their entire length of SEQ ID NO: 2, 8 or 19; , 97%, 98%, or 99% identical nucleotide sequences;
(c) the nucleotide sequence of SEQ ID NO: 14 or 16;
(d) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% of the nucleotide sequence of SEQ ID NO: 14 or 16 over their respective full length of SEQ ID NO: 14 or 16; 98%, or 99% identical nucleotide sequences;
(e) a polynucleotide sequence encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 10; or
(f) at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% of the polynucleotide sequence of (e) above over the entire length of the nucleotide sequence of (d) above %, or 99% identical polynucleotide sequences;
A recombinant polynucleotide sequence consisting of.

26. The method of claim 25,
Further comprising one or more additional nucleotides comprising an inverted terminal repeat (ITR), selectable marker, origin of replication (ORI), untranslated region (UTR), or polyadenylation (polyA) signal , Recombinant Polynucleotide.

27. The method of claim 26,
wherein the ITR is an AAV ITR.

28. The method of claim 27,
wherein the AAV ITR is an AAV2 ITR or an AAV3 ITR.

The method of any one of claims 25, 27 and 28,
Wherein the recombinant nucleotide further comprises two ITRs.

According to any one of claims 26 to 29,
The ITR comprises a nucleotide sequence of SEQ ID NO: 3 or 17, recombinant polynucleotide.

According to any one of claims 26 to 30,
The polyA signal is an artificial polyA signal, recombinant polynucleotide.

According to any one of claims 26 to 31,
The polyA signal comprises the nucleotide sequence of SEQ ID NO: 7, recombinant polynucleotide.

According to any one of claims 26 to 32,
The recombinant polynucleotide of claim 1, wherein the selectable marker comprises an antibiotic resistance gene.

34. The method of claim 33,
The antibiotic resistance gene comprises a β-lactamase gene or a kanamycin resistance gene.

26. The method of claim 25,
The recombinant polynucleotide comprises a nucleotide sequence of SEQ ID NO: 8 or SEQ ID NO: 19.

According to any one of claims 25 to 35,
Wherein the recombinant nucleotide further does not include a first polynucleotide sequence encoding a first fragment of the hDYSF protein.

According to any one of claims 25 to 36,
wherein the recombinant nucleotide does not contain AAV sequences other than one or more ITRs.

According to any one of claims 25 to 36,
wherein the recombinant nucleotide does not contain viral sequences other than one or more ITRs.

A dual adeno-associated viral (AAV) vector system comprising:
(a) a first AAV vector, wherein the first AAV vector comprises the recombinant polynucleotide of any one of claims 1-24; and
(b) a second AAV vector, wherein the second AAV vector comprises the recombinant polynucleotide of any one of claims 25-38;
A double adeno-associated virus (AAV) vector system comprising a.

25. An adeno-associated virus (AAV) vector comprising the recombinant polynucleotide of any one of claims 1-24.

41. The method of claim 40,
The AAV vectors are AAV-1, AAV-2, AAV-3, AAV-4, AAV-5, AAV-6, AAV-7, AAV-8, AAV-9, AAV-10, AAV-11, AAV- 12, AAV-13, AAVrh.10, AAVrh.20, or AAVrh.74.

41. The method of claim 40,
wherein the AAV vector is AAVrh.74.

39. An adeno-associated virus (AAV) vector comprising the recombinant polynucleotide of any one of claims 25-38.

44. The method of claim 43,
The AAV vectors are AAV-1, AAV-2, AAV-3, AAV-4, AAV-5, AAV-6, AAV-7, AAV-8, AAV-9, AAV-10, AAV-11, AAV- 12, AAV-13, AAVrh.10, AAVrh.20, or AAVrh.74.

44. The method of claim 43,
wherein the AAV vector is AAVrh.74.

A composition comprising the recombinant polynucleotide of any one of claims 1 - 38 or the AAV vector of any one of claims 40 - 45 .

As a composition,
(a) a first recombinant adeno-associated viral (rAAV) vector, wherein the first rAAV vector comprises the AAV vector of any one of claims 40-42; and
(b) a second rAAV vector, wherein the second rAAV vector comprises the AAV vector of any one of claims 43-45;
A composition comprising a.

48. The method of claim 47,
wherein the molar ratio of the first and second rAAV vectors is about 100:1-1:100, about 10:1-1:10, about 2:1-1:2, or about 1:1.

As an adeno-associated virus (AAV) vector, the AAV vector:
(a) a first inverted terminal repeat (ITR);
(b) a polynucleotide encoding a fragment of a human dysferlin (hDYSF) protein, wherein the polynucleotide comprises:
i. the nucleotide sequence of SEQ ID NO: 1 or 6;
ii. at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% of the nucleotide sequence of SEQ ID NO: 1 or 6 over the entire length of SEQ ID NO: 1 or 6 identical nucleotide sequences;
iii. the nucleotide sequence of SEQ ID NO: 13 or 15;
iv. at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% of the nucleotide sequence of SEQ ID NO: 13 or 15 over the entire length of SEQ ID NO: 13 or 15 identical nucleotide sequences;
v. A nucleotide sequence encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 9; or
vi. at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% of the nucleotide sequence of (v) over the entire length of the nucleotide sequence of (v) % identical nucleotide sequence;
made up of; and
(c) a second ITR;
Including,
wherein the polynucleotide is flanked by the first and second ITRs.

50. The method of claim 49,
An AAV vector, further comprising one or more additional polynucleotides selected from a promoter, intron, selectable marker, or origin of replication (ORI).

51. The method of claim 49 or 50,
wherein the ITR is an AAV ITR.

51. The method of claim 51,
wherein the AAV ITR is an AAV2 ITR or an AAV3 ITR.

The method of any one of claims 49 to 52,
wherein the first and/or second ITR comprises the nucleotide sequence of SEQ ID NO: 3 or 17.

The method of any one of claims 50 to 53,
AAV vector, wherein the promoter is a muscle-specific promoter.

55. The method of claim 54,
The muscle-specific promoter is a myosin heavy chain complex-E box muscle creatine kinase fusion enhancer/promoter, AAV vector.

The method of any one of claims 50 to 53,
The promoter is a recombinant promoter, AAV vector.

57. The method of claim 56,
The AAV vector, wherein the recombinant promoter is a recombinant muscle-specific promoter.

58. The method of claim 57,
AAV vector, wherein the recombinant muscle-specific promoter is the MHCK7 promoter.

59. The method of any one of claims 50 to 58,
AAV vector, wherein the promoter comprises the nucleotide sequence of SEQ ID NO: 4.

60. The method of any one of claims 50 to 59,
wherein the intron comprises a 5' donor site, a branch point, and/or a 3' splice site.

According to any one of claims 50 to 60,
The AAV vector, wherein the intron is a chimeric intron.

According to any one of claims 50 to 61,
wherein the intron comprises a 5' donor site from the human β-globin gene.

63. The method of any one of claims 50 to 62,
wherein the intron comprises a branch point from an immunoglobulin G (IgG) heavy chain.

The method of any one of claims 50 to 63,
wherein the intron comprises a 3' splice acceptor site from an immunoglobulin G (IgG) heavy chain.

The method of any one of claims 50 to 64,
The AAV vector, wherein the intron comprises the nucleotide sequence of SEQ ID NO: 5.

66. The method of any one of claims 50 to 65,
AAV vector, wherein the selectable marker is an antibiotic resistance gene.

67. The method of claim 66,
The antibiotic resistance gene is a β-lactamase gene or a kanamycin resistance gene, AAV vector.

The method of any one of claims 49 to 67,
The AAV vector comprises the nucleotide sequence of SEQ ID NO: 6 or SEQ ID NO: 15.

69. The method of any one of claims 49 to 68,
Wherein the AAV vector does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

According to any one of claims 49 to 69,
wherein the AAV vector does not contain AAV sequences other than one or more ITRs.

According to any one of claims 49 to 69,
wherein the AAV vector does not contain viral sequences other than one or more ITRs.

As an adeno-associated virus (AAV) vector, the AAV vector:
(a) a first inverted terminal repeat (ITR);
(b) a polynucleotide encoding a fragment of the human dysferrin protein, wherein the polynucleotide comprises:
i. the nucleotide sequence of SEQ ID NO: 2 or 8;
ii. at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% of the nucleotide sequence of SEQ ID NO: 2 or 8 over the entire length of SEQ ID NO: 2 or 8 identical nucleotide sequences;
iii. the nucleotide sequence of SEQ ID NO: 14 or 16;
iv. at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% of the nucleotide sequence of SEQ ID NO: 14 or 16 over the entire length of SEQ ID NO: 14 or 16 identical nucleotide sequences;
v. A polynucleotide encoding a fragment of hDYSF protein, wherein the fragment of hDYSF protein consists of the amino acid sequence of SEQ ID NO: 10; or
vi. at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical polynucleotides;
made up of; and
(c) a second ITR;
Including,
wherein the polynucleotide is flanked by the first and second ITRs.

73. The method of claim 72,
An AAV vector, further comprising one or more polynucleotides selected from a selectable marker, an origin of replication (ORI), an untranslated region (UTR), or a polyadenylation (polyA) signal.

The method of claim 72 or 73,
wherein the ITR is an AAV ITR.

75. The method of claim 74,
wherein the AAV ITR is an AAV2 ITR or an AAV3 ITR.

76. The method of any one of claims 72 to 75,
wherein the ITR comprises the nucleotide sequence of SEQ ID NO: 3 or 17.

The method of any one of claims 73 to 76,
The polyA signal is an artificial polyA signal, AAV vector.

78. The method of any one of claims 73 to 77,
The polyA signal comprises the nucleotide sequence of SEQ ID NO: 7, AAV vector.

79. The method of any one of claims 73 to 78,
AAV vector, wherein the selectable marker is an antibiotic resistance gene.

80. The method of claim 79,
The antibiotic resistance gene is a β-lactamase gene or a kanamycin resistance gene, AAV vector.

The method of any one of claims 72 to 80,
The AAV vector comprises the nucleotide sequence of SEQ ID NO: 8 or SEQ ID NO: 16.

The method of any one of claims 72 to 81,
Wherein the AAV vector does not further comprise a second polynucleotide sequence encoding a second fragment of the hDYSF protein.

The method of any one of claims 72 to 82,
wherein the AAV vector does not contain AAV sequences other than one or more ITRs.

The method of any one of claims 72 to 82,
wherein the AAV vector does not contain viral sequences other than one or more ITRs.

A dual adeno-associated virus (AAV) vector system comprising:
(a) a first AAV vector, wherein the first AAV vector comprises the AAV vector of any one of claims 40-42 and 49-71; and
(b) a second AAV vector, wherein the second AAV vector comprises the AAV vector of any one of claims 43-45, claims 72-84;
A double adeno-associated virus (AAV) vector system comprising a.

86. The method of claim 85,
The AAV vectors are AAV-1, AAV-2, AAV-3, AAV-4, AAV-5, AAV-6, AAV-7, AAV-8, AAV-9, AAV-10, AAV-11, AAV- 12, AAV-13, AAVrh.10, AAVrh.20, or AAVrh.74, a dual adeno-associated virus (AAV) vector system.

86. The method of claim 85,
The AAV vector is AAVrh.74, a dual adeno-associated virus (AAV) vector system.

An adeno-associated virus (AAV) packaging system comprising:
(a) a plasmid comprising the recombinant polynucleotide of any one of claims 1 to 38;
(b) an adenovirus helper plasmid; and
(c) rep-cap plasmid;
, Adeno-associated virus (AAV) packaging system comprising a.

An adeno-associated virus packaging system comprising:
(a) a plasmid comprising the recombinant polynucleotide of any one of claims 1 to 38; and
(b) an adenovirus helper plasmid;
Including, adeno-associated virus packaging system.

The method of claim 88 or 89,
The adeno-associated virus packaging system of claim 1, wherein the adenovirus helper plasmid comprises a pHELP plasmid.

A method for producing an adeno-associated virus (AAV) vector,
89. The method comprising contacting a cell with a plasmid comprising the recombinant polynucleotide of any one of claims 1-38, or the AAV packaging system of claim 88.

92. The method of claim 91,
wherein the cell is a host cell, optionally a mammalian host cell, and further optionally a HEK293.

A method for producing an adeno-associated virus (AAV) vector,
The method comprises transducing a packaging cell line with a plasmid comprising the recombinant polynucleotide of any one of claims 1 to 38, wherein the packaging cell line rep. adeno-associated virus And to express the cap gene, the method.

94. The method of claim 93,
Wherein the AAV rep gene is Rep78.

94. The method of claim 93,
Wherein the AAV cap gene is a Rh74 cap gene.

A cell comprising the recombinant polynucleotide of any one of claims 1-38.

A cell comprising a plasmid comprising the recombinant polynucleotide of any one of claims 1 to 38.

The method of any one of claims 88 to 97,
wherein the plasmid comprises a polynucleotide that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 18 or 19. , method, or cell.

The method of any one of claims 88 to 97,
wherein the plasmid comprises a polynucleotide of SEQ ID NO: 18 or 19, an AAV packaging system, method, or cell.

As a method of treating dysferlinopathy,
The method is directed to a subject in need thereof:
(a) an effective amount of a first polynucleotide, wherein the first polynucleotide is the recombinant polynucleotide of any one of claims 1-24; and
(b) an effective amount of a second polynucleotide, wherein the second polynucleotide is the recombinant polynucleotide of any one of claims 25-38;
A method comprising administering a

101. The method of claim 100,
wherein the first polynucleotide and/or the second polynucleotide is administered intramuscularly or intravenously.

101. The method of claim 100 or 101,
wherein the first and second polynucleotides are administered simultaneously or sequentially.

As a method of treating dysferinopathy,
The method is directed to a subject in need thereof:
(a) an effective amount of a first adeno-associated virus (AAV) vector, wherein the first AAV vector is the AAV vector of any one of claims 40-42 and 49-71; and
(b) an effective amount of a second AAV vector, wherein the second AAV vector is the AAV vector of any one of claims 43-45 and 72-84;
A method comprising administering.

104. The method of claim 103,
wherein the first AAV vector and/or the second AAV vector is administered intramuscularly or intravenously.

104. The method of claim 103 or 104,
wherein the first and second AAV vectors are administered simultaneously or sequentially.

As a method of treating dysferinopathy,
88. A method comprising administering to a subject in need thereof an effective amount of the AAV dual vector system of any one of claims 39 or 85-87.

107. The method of claim 106,
wherein the AAV dual vector system is administered intramuscularly or intravenously.

As a method of treating dysferinopathy,
49. A method comprising administering to a subject in need thereof an effective amount of the composition of any one of claims 46-48.

108. The method of claim 108,
wherein the composition is administered intramuscularly or intravenously.

According to any one of claims 100 to 109,
The method of claim 1, wherein the dysperinopathy is limb muscular dystrophy type 2B (LGMD2B) or Miyoshi myopathy.

Use of the composition in the manufacture of a medicament for the treatment of dysferinopathy in a subject in need thereof, the composition comprising:
(a) a first polynucleotide, wherein the first polynucleotide is the recombinant polynucleotide of any one of claims 1-24; and
(b) a second polynucleotide, wherein the second polynucleotide is the recombinant polynucleotide of any one of claims 25-38;
Including, uses.

Use of the composition in the manufacture of a medicament for the treatment of dysferinopathy in a subject in need thereof, the composition comprising:
(a) an effective amount of a first adeno-associated virus (AAV) vector, wherein the first AAV vector is the AAV vector of any one of claims 40-42 and 49-71; and
(b) an effective amount of a second AAV vector, wherein the second AAV vector is the AAV vector of any one of claims 43-45 and 72-84;
Including, uses.

A use of the composition in the manufacture of a medicament for the treatment of dysferinopathy in a subject in need thereof, wherein the composition comprises the AAV dual of any one of claims 39 or 85-87. Uses, including vector systems.

Use of the composition of any one of claims 46 to 48 in the manufacture of a medicament for the treatment of dysferinopathy in a subject in need thereof.

The method of any one of claims 111 to 114,
Wherein the composition is administered intramuscularly or intravenously.

The method of any one of claims 111 to 114,
Wherein the dysperinopathy is limb muscular dystrophy type 2B (LGMD2B) or Miyoshi myopathy.

According to any one of claims 100 to 110,
An effective amount of the first AAV vector is about 1×10 ⁶ to 1×10 ¹⁶ vg/kg, about 1×10 ⁸ to 1×10 ¹⁵ vg/kg, or about 1×10 ¹⁰ to 1×10 ¹⁴ vg/kg. wherein the dose is determined by a supercoiled PCR quantification standard.

117. The method of claim 117,
wherein the effective amount of the first AAV vector is 1.2e13 vg/kg, wherein the dosage is determined by a supercoil PCR quantification standard.

The method of any one of claims 100 to 110 and 117,
An effective amount of the second AAV vector is about 1×10 ⁶ to 1×10 ¹⁶ vg/kg, about 1×10 ⁸ to 1×10 ¹⁵ vg/kg, or about 1×10 ¹⁰ to 1×10 ¹⁴ vg/kg. Wherein the dose is determined by a supercoiled PCR standard.

119. The method of claim 119,
wherein the effective amount of the second AAV vector is 1.2e13 vg/kg, wherein the dosage is determined by a supercoil PCR quantification standard.

The method of any one of claims 117 to 120,
wherein the total dose of the AAV vector is 2.4e13 vg/kg, wherein the dose is determined by supercoil PCR standards.

As a recombinant polynucleotide encoding human dysferrin (hDYSF) protein,
The recombinant polynucleotide sequence is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% of the nucleotide sequence of SEQ ID NO: 20 or the nucleotide sequence of SEQ ID NO: 20 A recombinant polynucleotide comprising % identical nucleotide sequences.

122. The method of claim 122,
The recombinant polynucleotide sequence comprises the nucleotide sequence of SEQ ID NO: 20.

A method for producing the recombinant polynucleotide of claim 122,
A method comprising contacting a cell with the recombinant polynucleotide of any one of claims 1 - 24 and the recombinant polynucleotide of any one of claims 25 - 38 .

A method for producing the recombinant polynucleotide of claim 122,
A method comprising contacting a cell with the dual AAV vector system of any one of claims 39 and 85-87.

125. The method of claim 124 or 125,
The method of claim 1, wherein the cell is a eukaryotic cell.

125. The method of claim 124 or 125,
wherein the cells are muscle cells, cardiac cells, stem cells, satellite cells, and/or liver cells.

A method for producing the recombinant polynucleotide of claim 122,
A method comprising administering to a subject the recombinant polynucleotide of any one of claims 1 and 24 and the recombinant polynucleotide of any one of claims 25 and 38.

A method for producing the recombinant polynucleotide of claim 122,
A method comprising administering to a subject the dual AAV vector system of any one of claims 39 and 85-87 or the composition of claims 47 or 48.

As a method of treating muscular dystrophy in a subject,
A method comprising expressing the recombinant polynucleotide of claim 122 in the subject.

130. The method of claim 130,
A method comprising administering to a subject the recombinant polynucleotide of any one of claims 1 to 24 and the recombinant polynucleotide of any one of claims 25 to 38 or the composition of claims 47 or 48.

130. The method of claim 130,
88. A method comprising administering to a subject the dual AAV vector system of any one of claims 39 and 85-87.

The method of any one of claims 128 to 132,
The subject is human, non-human primate, canine, ovine, horse, porcine, murine, rat, rabbit, bovine, or a mammal selected from a feline.

The method of any one of claims 128 to 133,
Wherein the subject is suffering from dysperinopathies.

134. The method of claim 134,
The method of claim 1, wherein the dysperinopathy is limb muscular dystrophy type 2B (LGMD2B) or Miyoshi myopathy.