KR102319845B1 - CRISPR-CAS system for avian host cells - Google Patents

CRISPR-CAS system for avian host cells Download PDF

Info

Publication number
KR102319845B1
KR102319845B1 KR1020197000858A KR20197000858A KR102319845B1 KR 102319845 B1 KR102319845 B1 KR 102319845B1 KR 1020197000858 A KR1020197000858 A KR 1020197000858A KR 20197000858 A KR20197000858 A KR 20197000858A KR 102319845 B1 KR102319845 B1 KR 102319845B1
Authority
KR
South Korea
Prior art keywords
polynucleotide
host cell
sequence
guide
target
Prior art date
Application number
KR1020197000858A
Other languages
Korean (ko)
Other versions
KR20190025910A (en
Inventor
제임스 케이시 립메이어
옐레나 베츠
요하네스 안드리스 라우보스
레네 베르발
Original Assignee
디에스엠 아이피 어셋츠 비.브이.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 디에스엠 아이피 어셋츠 비.브이. filed Critical 디에스엠 아이피 어셋츠 비.브이.
Publication of KR20190025910A publication Critical patent/KR20190025910A/en
Application granted granted Critical
Publication of KR102319845B1 publication Critical patent/KR102319845B1/en

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/102Mutagenizing nucleic acids
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/111General methods applicable to biologically active non-coding nucleic acids
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • C12N15/86Viral vectors
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • C12N9/22Ribonucleases RNAses, DNAses
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P21/00Preparation of peptides or proteins
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2310/00Structure or type of the nucleic acid
    • C12N2310/10Type of nucleic acid
    • C12N2310/20Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2800/00Nucleic acids vectors
    • C12N2800/80Vectors containing sites for inducing double-stranded breaks, e.g. meganuclease restriction sites
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2521/00Reaction characterised by the enzymatic activity
    • C12Q2521/30Phosphoric diester hydrolysing, i.e. nuclease
    • C12Q2521/301Endonuclease

Abstract

본 발명은 분자 생물학 및 세포 생물학의 분야에 관한 것이다. 보다 구체적으로, 본 발명은 라비린툴로마이세테스(Labyrinthulomycetes) 숙주 세포에 대한 CRISPR-Cas 시스템에 관한 것이다.The present invention relates to the fields of molecular biology and cell biology. More particularly, the present invention relates to the CRISPR-Cas system for Labyrinthulomycetes host cells.

Description

조류 숙주 세포에 대한 CRISPR-CAS 시스템CRISPR-CAS system for avian host cells

관련 출원의 상호 참조Cross-referencing of related applications

본원은 2016년 7월 13일자로 출원된 미국 가 특허출원 제 62/361,741 호를 우선권 주장하며, 상기 출원의 내용은 전체가 본 명세서에 참고로 인용된다.This application claims priority to U.S. Provisional Patent Application No. 62/361,741, filed on July 13, 2016, the contents of which are incorporated herein by reference in their entirety.

기술 분야technical field

본 명세는 분자 생물학 및 세포 생물학의 분야에 관한 것이다. 보다 구체적으로, 본 발명은 조류 라비린툴로마이세테스(Labyrinthulomycetes) 숙주 세포에 대한 CRISPR-Cas 시스템에 관한 것이다.The present disclosure relates to the fields of molecular biology and cell biology. More specifically, the present invention relates to a CRISPR-Cas system for avian Labyrinthulomycetes host cells.

게놈 기법 및 분석 방법에 있어서 최근의 진보는 다양한 범위의 생물학적 기능 및 질병과 관련된 유전인자를 분류하고 맵핑하는 능력을 현저하게 가속화하였다. 개별적인 유전요소의 선택적인 섭동을 허용함으로써 원인 유전변이의 체계적인 역공학을 가능하게 할 뿐만 아니라 합성 생물학, 생물공학, 및 의학적 응용을 진보시키기 위해서는 정밀 게놈 공학 기술이 필요하다. 게놈-편집 기법, 예를 들어 디자이너 아연 집게, 전사 활성자-형 효과기 뉴클레아제(TALEN), 또는 귀소 메가뉴클레아제를 표적화된 게놈 섭동의 생성에 이용할 수 있지만, 입수 가능하고, 셋업이 용이하며, 규모조정이 가능하고, 게놈내 다중 위치의 표적화가 가능한 신규의 게놈 공학 기술이 여전히 필요하다. 상기 메가뉴클레아제의 공학은 상기 효소의 DNA 인식 및 절단 기능이 단일 도메인 중에 뒤얽혀있기 때문에 대부분의 학술 연구가들에게는 도전이었다. 조작된 아연 집게 배열의 확고한 제작도 또한 배열 중 개별적인 집게 도메인들간의 상황 의존적인 영향을 해명할 필요성으로 인해 많은 실험실들의 경우 어려운 것으로 입증되었다. Recent advances in genomic techniques and analytical methods have significantly accelerated the ability to classify and map genetic factors associated with a diverse range of biological functions and diseases. Precise genome engineering techniques are needed to advance synthetic biology, biotechnology, and medical applications, as well as to enable systematic reverse engineering of causal genetic variations by allowing selective perturbation of individual genetic elements. Genome-editing techniques, such as designer zinc tweezers, transcription activator-type effector nucleases (TALENs), or homing meganucleases, are available for generation of targeted genomic perturbations, but are available and easy to set up. There is still a need for novel genome engineering techniques that are scalable and capable of targeting multiple locations within the genome. Engineering of the meganuclease has been a challenge for most academic researchers because the DNA recognition and cleavage functions of the enzyme are intertwined in a single domain. Robust fabrication of engineered zinc clamp arrays has also proven difficult for many laboratories due to the need to elucidate context-dependent influences between individual clamp domains during arrays.

따라서 다수의 응용과 함께 숙주 세포내 특정 서열의 표적화를 위한 대안의 확고한 기법에 대한 급박한 필요성이 존재한다. 상기 기술적 문제에 대한 해법은 특허청구범위에서 특성화된 실시태양들에 의해 제공된다.There is therefore an urgent need for alternative robust techniques for targeting specific sequences in host cells with numerous applications. The solution to the above technical problem is provided by the embodiments characterized in the claims.

본원은, 표적-특이적인 서열에 대한 맞춤 단백질의 생성이 필요한 것이 아니라 오히려 특정한 폴리뉴클레오타이드 표적을 인식하도록 안내-폴리뉴클레오타이드에 의해 프로그램화될 수 있는 단일 Cas 효소를 필요로 하는 CRISPR-Cas 시스템을 기본으로 한다; 즉 상기 Cas 효소를 상기 안내-폴리뉴클레오타이드 분자를 사용하여 특정한 폴리뉴클레오타이드 표적에 모집할 수 있다. 상기 CRISPR-Cas 시스템을 게놈 기법 및 분석 방법의 레퍼토리에 첨가하는 것은 분자 생물학 분야에서 기존의 방법들을 현저하게 단순화시킬 수 있다.The present application is based on the CRISPR-Cas system, which does not require the creation of a custom protein for a target-specific sequence, but rather a single Cas enzyme that can be programmed by a guide-polynucleotide to recognize a specific polynucleotide target. to do; That is, the Cas enzyme can be recruited to a specific polynucleotide target using the guide-polynucleotide molecule. Adding the CRISPR-Cas system to the repertoire of genomic techniques and analysis methods can significantly simplify existing methods in the field of molecular biology.

본 발명은 안내-폴리뉴클레오타이드 및 Cas 단백질을 포함하는 CRISPR-Cas 시스템의 소스를 포함하는 비-천연 또는 조작된 조성물을 제공하며, 여기에서 상기 안내-폴리뉴클레오타이드는 근본적으로 숙주 세포 중의 표적-폴리뉴클레오타이드의 역 보체인 서열을 포함하고 상기 안내-폴리뉴클레오타이드는 상기 숙주 세포 중의 표적-폴리뉴클레오타이드에서 상기 Cas 단백질의 결합을 지시하여 CRISPR-Cas 복합체를 형성시킬 수 있다.The present invention provides a non-natural or engineered composition comprising a source of a CRISPR-Cas system comprising a guide-polynucleotide and a Cas protein, wherein the guide-polynucleotide is essentially a target-polynucleotide in a host cell. and the guide-polynucleotide can direct binding of the Cas protein to the target-polynucleotide in the host cell to form a CRISPR-Cas complex.

본 발명은 또한 숙주 세포를 본 발명에 따른 조성물과 접촉시킴을 포함하는, 세포에서 폴리뉴클레오타이드의 발현을 조절하는 방법에 관한 것이며, 여기에서 상기 안내-폴리뉴클레오타이드는 상기 숙주 세포 중의 표적-폴리뉴클레오타이드에서 상기 Cas 단백질의 결합을 지시하여 CRISPR-Cas 복합체를 형성시킨다.The invention also relates to a method for modulating the expression of a polynucleotide in a cell comprising contacting the host cell with a composition according to the invention, wherein said guide-polynucleotide is at a target-polynucleotide in said host cell. It directs the binding of the Cas protein to form a CRISPR-Cas complex.

본 발명은 또한 본 발명에 따른 조성물을 포함하는 숙주 세포에 관한 것이다.The invention also relates to a host cell comprising a composition according to the invention.

본 발명은 또한 숙주 세포를 본 발명에 따른 조성물과 접촉시킴을 포함하는, 숙주 세포의 생성 방법에 관한 것이며, 여기에서 상기 안내-폴리뉴클레오타이드는 상기 숙주 세포 중의 표적-폴리뉴클레오타이드에서 상기 Cas 단백질의 결합을 지시하여 CRISPR-Cas 복합체를 형성시킨다.The invention also relates to a method for producing a host cell, comprising contacting the host cell with a composition according to the invention, wherein said guide-polynucleotide binds said Cas protein to a target-polynucleotide in said host cell. to form a CRISPR-Cas complex.

본 발명은 또한 관심 화합물에 도움이 되는 조건하에서 본 발명에 따른 숙주 세포를 배양하고 임의로 상기 관심 화합물을 정제 또는 단리시킴을 포함하는, 상기 관심 화합물의 생성 방법에 관한 것이다.The present invention also relates to a method for producing a compound of interest, comprising culturing a host cell according to the invention under conditions conducive to the compound of interest and optionally purifying or isolating said compound of interest.

본 명세의 성질, 목적, 및 장점을 더욱 이해하기 위해서, 하기 도면과 함께 하기 상세한 설명을 참조할 것이며, 여기에서 같은 참조번호는 같은 요소를 나타낸다.
도 1은 전형적인 안내-폴리뉴클레오타이드의 예들을 묘사한다. 2개의 안내-폴리뉴클레오타이드는 모두 안내-서열(crRNA) 및 안내-폴리뉴클레오타이드 구조 성분을 포함하는 안내-RNA들이다. 상부 도면에서, 상기 안내-폴리뉴클레오타이드 구조 성분은 서로에 대해 하이브리드화된 2개의 별도의 분자들로 구성되며; 상기 개별적인 성분을 tracr 서열 및 tracr-짝 서열이라 칭할 수 있다. 하부 도면에서, 상기 안내-폴리뉴클레오타이드 구조 성분은 내부 하이브리드화를 갖는 단일 분자로 구성된다. 본 도면은 문헌[Sander and Joung, 2014 and Mali et al., Nat Methods.10(10):957-63, 2013]으로부터 적합화되었다.
도 2는 안내-폴리뉴클레오타이드(gRSR로서 약기되는 안내 RNA 자기-가공 리보자임)가 어떻게 형성되는 지를 묘사한다. 햄머헤드 리보자임 및 HDV 리보자임은 RNA 분자를 절단시켜 최종 및 기능 성숙 안내-폴리뉴클레오타이드(안내-RNA)를 형성시킨다.
도 3은 1% 아가로스 젤상에서 실행된 절단된 플라스미드 및 PCR-증폭된 단편을 도시한다.
도 4는 pYB32 및 pYB33 세균 형질전환으로부터 생성되는 형질전환체의 콜로니 PCR의 결과를 도시한다.
도 5는 T188 및 T189 형질전환체의 게놈 DNA로부터 유래된 증폭산물들이 분리된 아가로스 젤을 도시하며, 프로모터 및 부분 Cas9의 존재에 대해 시험되었다.
도 6은 Cas9의 OrfA/Pfa1 유전자좌에의 통합을 입증하는 아가로스 젤상에서 실행된 T188 및 T189 형질전환체의 게놈 DNA로부터 유래된 증폭산물들을 도시한다.
도 7은 1% 아가로스 젤상에서 실행된 게놈 DNA로부터 증폭된 스키조키트리움(Schizochytrium) 지방산 신타제(FAS) 유전자의 일부를 도시한다.
도 8은 벡터 pCL400 중의 FAS 유전자좌내로의 파로모마이신 및 gRNA 카세트의 PCR 클로닝의 결과를 도시한다.
도 9는 하나의 증폭산물로서 첫 번째 3 CS gRNA 표적을 포함하는 전체 영역을 증폭시키기 위해 수행된 PCR 및 gRNA3 CS4 표적을 별도로 증폭시키기 위한 또 다른 PCR의 결과들을 도시한다.
도 10은 T206 형질전환체의 게놈 DNA로부터 gRNA3 CS1 표적을 별도로 증폭시키기 위해 수행된 PCR의 결과를 도시한다.
도 11은 pLC122-Cas9 및 pYB30 플라스미드의 절단의 결과를 도시한다.
도 12는 pYB61 세균 형질전환으로부터 생성되는 형질전환체의 콜로니 PCR의 결과를 도시한다.
도 13은 1% 아가로스 젤상의 pCL122 플라스미드의 절단 결과를 도시한다.
도 14는 1% 아가로스 젤상의 pYB66 클로닝을 위한 gRNA3 CS1 카세트의 증폭산물을 도시한다.
도 15는 1% 아가로스 젤상의 gRNA3 CS1(pYB66 단편)의 절단 결과를 도시한다.
도 16은 pYB66 세균 형질전환으로부터 생성되는 형질전환체의 콜로니 PCR의 결과를 묘사한다.
도 17은 1% 아가로스 젤상의 pYB61 플라스미드의 절단 결과를 도시한다.
도 18은 1% 아가로스 젤상의 pYB73 클로닝을 위한 gRNA CS1 카세트의 증폭산물을 도시한다.
도 19는 pYB73 세균 형질전환으로부터 생성되는 형질전환체의 콜로니 PCR의 결과를 도시한다.
도 20은 T212 형질전환체의 게놈 DNA로부터 유래된 증폭산물들이 분리된 아가로스 젤을 도시하며, CarG 유전자의 존재에 대해 시험되었다.
도 21은 T280, T281, T285 및 T286 형질전환체의 게놈 DNA로부터 유래된 증폭산물들이 분리된 아가로스 젤을 도시하며, 부분적인 프로모터 및 Cas9 서열의 존재에 대해 시험되었다.
도 22는 T281 및 T286 형질전환체의 게놈 DNA로부터 유래된 증폭산물들이 분리된 아가로스 젤을 도시하며, gRNA 카세트의 존재에 대해 시험되었다.
도 23은 T282 및 T287 형질전환체의 게놈 DNA로부터 유래된 증폭산물들이 분리된 아가로스 젤을 도시하며, 부분적인 gRNA 카세트의 존재에 대해 시험되었다.
도 24는 T281, T282, T286 및 T287 형질전환체의 게놈 DNA로부터 gRNA3 CS1 표적을 별도로 증폭시키도록 수행된 PCR의 결과를 도시한다.
서열 목록의 설명
서열번호 1은 pCL122-Cas9 벡터 뉴클레오타이드 서열을 제시한다.
서열번호 2는 pYB31 벡터 뉴클레오타이드 서열을 제시한다.
서열번호 3은 pYB32 벡터 뉴클레오타이드 서열을 제시한다.
서열번호 4는 pYB33 벡터 뉴클레오타이드 서열을 제시한다.
서열번호 5는 pCL399 벡터 뉴클레오타이드 서열을 제시한다.
서열번호 6은 pCL400 벡터 뉴클레오타이드 서열을 제시한다.
서열번호 7은 pCL401 벡터 뉴클레오타이드 서열을 제시한다.
서열번호 8은 pCL402 벡터 뉴클레오타이드 서열을 제시한다.
서열번호 9는 pYB36 벡터 뉴클레오타이드 서열을 제시한다.
서열번호 10은 pYB37 벡터 뉴클레오타이드 서열을 제시한다.
서열번호 11은 pYB38 벡터 뉴클레오타이드 서열을 제시한다.
서열번호 12는 pYB39 벡터 뉴클레오타이드 서열을 제시한다.
서열번호 13은 121 Tub seq F 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 14는 pYB32/3 CR1 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 15는 CS 프로 Kpn IF F1 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 16은 CS 프로 BamH IF R1 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 17은 CS 프로 BamH IF F2 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 18은 CS 프로 Nde IF R2 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 19는 O A1-KO F 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 20은 pYB32/3 SV40 R1 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 21은 O A1-KO R 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 22는 pYB32/3 C F1 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 23은 5' FAS PmeNde 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 24는 3' FAS PmeHpa 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 25는 pCL402 IF F 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 26은 pCL402 IF R 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 27은 pYB36 CS1 F 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 28은 pYB36 CS1 R 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 29는 pYB36 CS3 R 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 30은 pYB36 CS4 F 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 31은 pYB36 CS4 R 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 32는 pYB30 벡터 뉴클레오타이드 서열을 제시한다.
서열번호 33은 pYB61 벡터 뉴클레오타이드 서열을 제시한다.
서열번호 34는 pYB66 벡터 뉴클레오타이드 서열을 제시한다.
서열번호 35는 pYB73 벡터 뉴클레오타이드 서열을 제시한다.
서열번호 36은 pCL310 벡터 뉴클레오타이드 서열을 제시한다.
서열번호 37은 pCL122 벡터 뉴클레오타이드 서열을 제시한다.
서열번호 38은 pYB66 BamBgl F 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 39는 pYB66 Nde R 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 40은 pYB66 EF1seq F 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 41은 pCL122 OrfC R 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 42는 pYB73 gRNA Pst Kpn IF F 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 43은 pYB73 gRNA Xho Pst IF R 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 44는 pYB73 seq F 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 45는 pYB73 seq R 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 46은 pYB13 pYB1 seq F 프라이머 뉴클레오타이드 서열을 제시한다.
서열번호 47은 TT pYB73 HDV R 프라이머 뉴클레오타이드 서열을 제시한다.
For a further understanding of the nature, objects, and advantages of the present disclosure, reference will be made to the following detailed description in conjunction with the following drawings, wherein like reference numerals refer to like elements.
1 depicts examples of a typical guide-polynucleotide. Both guide-polynucleotides are guide-RNAs comprising a guide-sequence (crRNA) and a guide-polynucleotide structural component. In the upper figure, the guide-polynucleotide structural component consists of two separate molecules hybridized to each other; These individual components may be referred to as a tracr sequence and a tracr-pair sequence. In the lower figure, the guide-polynucleotide structural component consists of a single molecule with internal hybridization. This figure was adapted from Sander and Joung, 2014 and Mali et al ., Nat Methods.10(10):957-63, 2013.
2 depicts how guide-polynucleotides (guide RNA self-processing ribozymes abbreviated as gRSRs) are formed. Hammerhead ribozymes and HDV ribozymes cleave RNA molecules to form final and functional mature guide-polynucleotides (guide-RNAs).
Figure 3 depicts cleaved plasmids and PCR-amplified fragments run on 1% agarose gels.
Figure 4 shows the results of colony PCR of transformants generated from pYB32 and pYB33 bacterial transformation.
Fig. 5 shows an agarose gel in which amplification products derived from the genomic DNA of T188 and T189 transformants were isolated, and tested for the presence of promoter and partial Cas9.
6 depicts amplification products derived from genomic DNA of T188 and T189 transformants run on agarose gels demonstrating the integration of Cas9 into the OrfA/Pfa1 locus.
7 depicts a portion of the Schizochytrium fatty acid synthase (FAS) gene amplified from genomic DNA run on 1% agarose gel.
Figure 8 depicts the results of PCR cloning of paromomycin and gRNA cassette into the FAS locus in vector pCL400.
9 shows the results of PCR performed to amplify the entire region including the first 3 CS gRNA target as one amplification product and another PCR for separately amplifying the gRNA3 CS4 target.
Figure 10 shows the results of PCR performed to amplify the gRNA3 CS1 target separately from the genomic DNA of the T206 transformant.
11 depicts the results of cleavage of pLC122-Cas9 and pYB30 plasmids.
12 shows the results of colony PCR of transformants resulting from pYB61 bacterial transformation.
Figure 13 shows the cleavage result of pCL122 plasmid on 1% agarose gel.
Figure 14 depicts the amplification product of the gRNA3 CS1 cassette for pYB66 cloning on 1% agarose gel.
15 shows the cleavage results of gRNA3 CS1 (pYB66 fragment) on 1% agarose gel.
16 depicts the results of colony PCR of transformants resulting from pYB66 bacterial transformation.
17 shows the results of cleavage of pYB61 plasmid on 1% agarose gel.
18 depicts the amplification product of the gRNA CS1 cassette for pYB73 cloning on 1% agarose gel.
19 shows the results of colony PCR of transformants resulting from pYB73 bacterial transformation.
20 shows an agarose gel in which amplification products derived from the genomic DNA of the T212 transformant were isolated, and tested for the presence of the CarG gene.
21 shows an agarose gel in which amplification products derived from the genomic DNA of T280, T281, T285 and T286 transformants were isolated and tested for the presence of partial promoter and Cas9 sequences.
22 shows an agarose gel in which amplification products derived from genomic DNA of T281 and T286 transformants were isolated and tested for the presence of a gRNA cassette.
23 shows an agarose gel in which amplification products derived from the genomic DNA of T282 and T287 transformants were isolated and tested for the presence of a partial gRNA cassette.
24 depicts the results of PCR performed to separately amplify the gRNA3 CS1 target from the genomic DNA of T281, T282, T286 and T287 transformants.
Description of Sequence Listing
SEQ ID NO: 1 shows the pCL122-Cas9 vector nucleotide sequence.
SEQ ID NO:2 shows the pYB31 vector nucleotide sequence.
SEQ ID NO: 3 shows the pYB32 vector nucleotide sequence.
SEQ ID NO: 4 shows the pYB33 vector nucleotide sequence.
SEQ ID NO: 5 shows the pCL399 vector nucleotide sequence.
SEQ ID NO: 6 shows the pCL400 vector nucleotide sequence.
SEQ ID NO: 7 shows the pCL401 vector nucleotide sequence.
SEQ ID NO:8 shows the pCL402 vector nucleotide sequence.
SEQ ID NO: 9 shows the pYB36 vector nucleotide sequence.
SEQ ID NO: 10 shows the pYB37 vector nucleotide sequence.
SEQ ID NO: 11 shows the pYB38 vector nucleotide sequence.
SEQ ID NO: 12 shows the pYB39 vector nucleotide sequence.
SEQ ID NO: 13 shows the 121 Tub seq F primer nucleotide sequence.
SEQ ID NO: 14 shows the pYB32/3 CR1 primer nucleotide sequence.
SEQ ID NO: 15 shows the CS pro Kpn IF F1 primer nucleotide sequence.
SEQ ID NO: 16 shows the CS pro BamH IF R1 primer nucleotide sequence.
SEQ ID NO: 17 shows the CS pro BamH IF F2 primer nucleotide sequence.
SEQ ID NO: 18 shows the CS pro Nde IF R2 primer nucleotide sequence.
SEQ ID NO: 19 shows the O A1-KO F primer nucleotide sequence.
SEQ ID NO: 20 shows the pYB32/3 SV40 R1 primer nucleotide sequence.
SEQ ID NO: 21 shows the O A1-KO R primer nucleotide sequence.
SEQ ID NO: 22 shows the pYB32/3 C F1 primer nucleotide sequence.
SEQ ID NO:23 shows the 5' FAS PmeNde primer nucleotide sequence.
SEQ ID NO: 24 shows the 3' FAS PmeHpa primer nucleotide sequence.
SEQ ID NO: 25 shows the pCL402 IF F primer nucleotide sequence.
SEQ ID NO:26 shows the pCL402 IF R primer nucleotide sequence.
SEQ ID NO: 27 shows the pYB36 CS1 F primer nucleotide sequence.
SEQ ID NO:28 shows the pYB36 CS1 R primer nucleotide sequence.
SEQ ID NO: 29 shows the pYB36 CS3 R primer nucleotide sequence.
SEQ ID NO: 30 shows the pYB36 CS4 F primer nucleotide sequence.
SEQ ID NO: 31 shows the pYB36 CS4 R primer nucleotide sequence.
SEQ ID NO: 32 shows the pYB30 vector nucleotide sequence.
SEQ ID NO: 33 shows the pYB61 vector nucleotide sequence.
SEQ ID NO:34 shows the pYB66 vector nucleotide sequence.
SEQ ID NO: 35 shows the pYB73 vector nucleotide sequence.
SEQ ID NO:36 shows the pCL310 vector nucleotide sequence.
SEQ ID NO:37 shows the pCL122 vector nucleotide sequence.
SEQ ID NO: 38 shows the pYB66 BamBgl F primer nucleotide sequence.
SEQ ID NO:39 shows the pYB66 Nde R primer nucleotide sequence.
SEQ ID NO: 40 shows the pYB66 EF1seq F primer nucleotide sequence.
SEQ ID NO: 41 shows the pCL122 OrfC R primer nucleotide sequence.
SEQ ID NO: 42 shows the pYB73 gRNA Pst Kpn IF F primer nucleotide sequence.
SEQ ID NO:43 shows the pYB73 gRNA Xho Pst IF R primer nucleotide sequence.
SEQ ID NO: 44 shows the pYB73 seq F primer nucleotide sequence.
SEQ ID NO: 45 shows the pYB73 seq R primer nucleotide sequence.
SEQ ID NO: 46 shows the pYB13 pYB1 seq F primer nucleotide sequence.
SEQ ID NO: 47 shows the TT pYB73 HDV R primer nucleotide sequence.

본 명세를 추가로 기재하기 전에, 특정 실시태양들의 변화를 수행할 수 있고 상기 변화가 여전히 첨부된 특허청구범위의 범위내에 있을 수 있기 때문에, 상기 명세가 하기에 기재되는 명세의 특정 실시태양들로 제한되지 않음을 알아야 한다. 또한, 사용되는 용어는 특정 실시태양들을 기재하기 위한 것이며 제한을 의도하는 것은 아님을 알아야 한다. 대신에, 본 명세의 범위는 첨부되는 특허청구범위에 의해 확립될 것이다.Before further describing this specification, the specification is directed to specific embodiments of the specification set forth below, as changes may be made in specific embodiments and such changes may still be within the scope of the appended claims. It should be noted that there are no restrictions. It is also to be understood that the terminology used is for the purpose of describing particular embodiments and is not intended to be limiting. Instead, the scope of the present disclosure will be established by the appended claims.

첫 번째 태양에서, 본 발명은 안내-폴리뉴클레오타이드 및 Cas 단백질을 포함하는 CRISPR-Cas 시스템의 소스를 포함하는 비-천연 또는 조작된 조성물을 제공하며, 여기에서 상기 안내-폴리뉴클레오타이드는 근본적으로 숙주 세포 중의 표적-폴리뉴클레오타이드의 역 보체인 안내-서열을 포함하고, 상기 안내-폴리뉴클레오타이드는 상기 숙주 세포 중의 표적-폴리뉴클레오타이드에서 상기 Cas 단백질의 결합을 지시하여 CRISPR-Cas 복합체를 형성시킬 수 있으며, 여기에서 상기 안내-서열은 근본적으로 상기 숙주 세포의 게놈 중 5'-(N)yPAM-3' 폴리뉴클레오타이드 서열 표적의 (N)y 부분의 역 보체이고, 여기에서 y는 8 내지 30, 보다 바람직하게는 10 내지 30, 보다 바람직하게는 15 내지 30, 보다 바람직하게 17 내지 27, 보다 바람직하게 17 내지 20, 보다 바람직하게 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 또는 27의 정수이며, 여기에서 PAM은 프로토스페이서 인접 모티프이고, 여기에서 상기 숙주 세포는 라비린툴로마이세테스 강, 바람직하게는 트라우스토키트리알레스(Thraustochytriales) 목, 보다 바람직하게는 트라우스토키트리아세아에(Thraustochytriaceae) 과, 보다 바람직하게는 아우란티오키트리움(Aurantiochytrium), 오블롱기키트리움(Oblongichytrium), 스키조키트리움(Schizochytrium), 트라우스토키트리움(Thraustochytrium) 및 울케니아(Ulkenia)로 이루어지는 그룹 중에서 선택된 속의 일원, 훨씬 더 바람직하게는 스키조키트리움 종 ATCC# 20888의 것이며, 여기에서 PAM은 바람직하게는 5′-XGG-3′, 5′-XGGXG-3′, 5′-XXAGAAW-3′, 5′-XXXXGATT-3′, 5′-XXAGAA-3′, 5′-XAAAAC-3′로 이루어지는 그룹 중에서 선택된 서열이고, 여기에서 X는 임의의 뉴클레오타이드 또는 그의 유사체일 수 있으며, 바람직하게 X는 임의의 뉴클레오타이드일 수 있고; W는 A 또는 T이다.In a first aspect, the present invention provides a non-natural or engineered composition comprising a source of a CRISPR-Cas system comprising a guide-polynucleotide and a Cas protein, wherein the guide-polynucleotide is essentially a host cell a guide-sequence that is the reverse complement of a target-polynucleotide in the host cell, wherein the guide-polynucleotide is capable of directing binding of the Cas protein to the target-polynucleotide in the host cell to form a CRISPR-Cas complex, wherein wherein said guide-sequence is essentially the reverse complement of the (N)y portion of the target 5'-(N)yPAM-3' polynucleotide sequence in the genome of said host cell, wherein y is 8 to 30, more preferably is 10 to 30, more preferably 15 to 30, more preferably 17 to 27, more preferably 17 to 20, more preferably 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, or an integer of 27, wherein PAM is a protospacer adjacent motif, wherein said host cell is of the class Labyrinthulomycetes, preferably of the order Thraustochytriales, more preferably of Thraustochytriales. Aseae (Thraustochytriaceae) family, more preferably Aurantiochytrium ( Aurantiochytrium ), Oblongichytrium , Schizochytrium , Thraustochytrium ) and Ulkenia consisting of Thraustochytrium and Ulkenia ) A member of the genus selected from the group, even more preferably of the Schizochytrium sp. ATCC# 20888, wherein the PAM is preferably 5'-XGG-3', 5'-XGGXG-3', 5'-XXAGAAW-3 ', 5'-XXXXGATT-3', 5'-XXAGAA-3', 5'-XAAAAC-3' is a sequence selected from the group consisting of, wherein X is any nucleotide or analogs thereof, preferably X can be any nucleotide; W is A or T.

본 명세서에서 상기 조성물, 소스, CRISPR-Cas 시스템, 안내-폴리뉴클레오타이드, Cas 단백질, 표적-폴리뉴클레오타이드, 숙주 세포 및 CRISPR-Cas 복합체는 본 발명에 따른 조성물, 소스, CRISPR-Cas 시스템, 안내-폴리뉴클레오타이드, Cas 단백질, 표적-폴리뉴클레오타이드, 숙주 세포 및 CRISPR-Cas 복합체로서 지칭된다. 완성을 위해서, "하나의"는 본 명세서의 다른 곳에서 "적어도 하나"로서 정의되기 때문에, 본 발명에 따른 하나의 조성물은 적어도 하나, 즉 1, 2, 3 또는 그 이상의 안내-폴리뉴클레오타이드 및/또는 적어도 하나, 즉 1, 2, 3 또는 그 이상의 Cas 단백질의 소스를 포함한다. 상응하게, 본 발명은 편의상 멀티플렉스 CRISPR-Cas 시스템을 제공한다. 상기와 같은 멀티플렉스 CRISPR-Cas 시스템은 편의상 공여체 폴리뉴클레오타이드의 도입, 폴리뉴클레오타이드의 결실 및 숙주 세포의 게놈내로의 폴리뉴클레오타이드 라이브러리 삽입에 사용될 수 있다. 본 명세서에서, 멀티플렉스 CRISPR-Cas 시스템은 하나 이상의 Cas 단백질, 하나 이상의 안내-폴리뉴클레오타이드 및/또는 하나 이상의 공여체 폴리뉴클레오타이드의 사용을 지칭할 수 있다.The composition, source, CRISPR-Cas system, guide-polynucleotide, Cas protein, target-polynucleotide, host cell and CRISPR-Cas complex herein are the composition, source, CRISPR-Cas system, guide-poly according to the present invention. referred to as nucleotides, Cas proteins, target-polynucleotides, host cells and CRISPR-Cas complexes. For completeness sake, since "a" is defined as "at least one" elsewhere herein, a composition according to the invention comprises at least one, ie, 1, 2, 3 or more intra-polynucleotides and/or or at least one, ie, 1, 2, 3 or more sources of Cas proteins. Correspondingly, the present invention conveniently provides a multiplex CRISPR-Cas system. The multiplex CRISPR-Cas system as described above can be conveniently used for introduction of a donor polynucleotide, deletion of the polynucleotide, and insertion of a polynucleotide library into the genome of a host cell. As used herein, a multiplex CRISPR-Cas system may refer to the use of one or more Cas proteins, one or more guide-polynucleotides and/or one or more donor polynucleotides.

"CRISPR 시스템", "CRISPR-Cas 시스템" 및 "CRISPR 효소 시스템"이란 용어들은 본 명세서에서 호환 가능하게 사용되며 본 발명의 모든 실시태양들의 상황에서 표적-폴리뉴클레오타이드와 함께 CRISPR-Cas 복합체를 형성하는데 필요한 요소들의 집합을 지칭하고; 이들 요소는 비제한적으로 Cas 단백질 및 안내-폴리뉴클레오타이드를 포함한다.The terms "CRISPR system", "CRISPR-Cas system" and "CRISPR enzyme system" are used interchangeably herein and in the context of all embodiments of the present invention together with a target-polynucleotide to form a CRISPR-Cas complex. refers to the set of necessary elements; These elements include, but are not limited to, Cas proteins and guide-polynucleotides.

"CRISPR-Cas 복합체"란 용어는 본 발명의 모든 실시태양들의 상황에서 표적-폴리뉴클레오타이드에 하이브리드화되고 Cas 단백질과 복합체화된 안내-폴리뉴클레오타이드를 포함하는 복합체를 지칭한다. 가장 간단한 형태로, 돌연변이되지 않은 Cas 단백질, 예를 들어 비제한적으로 스트렙토코커스 피오게네스(Streptococcus pyogenes)의 Cas9 단백질이 사용되는 경우, 상기 CRISPR-Cas 복합체의 형성은 표적-폴리뉴클레오타이드 중의 또는 상기 부근의(예를 들어 상기로부터 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 50, 또는 그 이상의 염기쌍내의) 하나 또는 2개의 폴리뉴클레오타이드 가닥 모두의 절단을 생성시킨다. 전형적으로, 본 발명에 따른(본 명세서에서 하기에 정의된) 표적-폴리뉴클레오타이드는 PAM 서열(본 명세서에서 하기에 정의됨)과 결합되고 상기 PAM 서열은 바람직하게는 상기 표적-폴리뉴클레오타이드의 바로 하류(3')에 있으며; 상기 CRISPR-Cas 복합체의 형성은 전형적으로 상기 PAM 서열의 3 염기쌍 상류(5')에서 하나 또는 2개 모두의 폴리뉴클레오타이드 가닥의 절단을 생성시킨다.The term "CRISPR-Cas complex" in the context of all embodiments of the invention refers to a complex comprising a guide-polynucleotide hybridized to a target-polynucleotide and complexed with a Cas protein. In its simplest form, when an unmutated Cas protein, such as, but not limited to , the Cas9 protein of Streptococcus pyogenes , is used, the formation of the CRISPR-Cas complex is in or near the target-polynucleotide. Produces cleavage of one or both polynucleotide strands (e.g., within 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 50, or more base pairs from above) . Typically, a target-polynucleotide according to the invention (as defined herein below) is associated with a PAM sequence (as defined herein below) and said PAM sequence is preferably immediately downstream of said target-polynucleotide. in (3'); Formation of the CRISPR-Cas complex typically results in cleavage of one or both polynucleotide strands 3 base pairs upstream (5') of the PAM sequence.

"비-천연 조성물"이란 용어는 본 발명의 모든 실시태양들의 상황에서 본 발명에 사용되는 그의 형태 중에 자연적으로 존재하지 않는 조성물을 지칭한다. 상기 개별적인 요소는 예를 들어 그대로 또는 자연 중의 다른 요소들과 함께 존재할 수 있지만, 상기 비-천연 조성물은 예를 들어 다소 천연 조성물 이외의 적어도 하나의 요소를 포함한다.The term “non-natural composition” refers to a composition that is not naturally present in its form as used in the present invention in the context of all embodiments of the present invention. The individual element may for example be present as is or in combination with other elements in nature, although the non-naturally occurring composition comprises at least one element, for example more or less than the natural composition.

"조작된 조성물"이란 용어는 본 발명의 모든 실시태양들의 상황에서 요소 중 적어도 하나가, 생성되는 요소가 자연적으로 존재하지 않도록 하는 방식으로 조작된, 즉 인간에 의해 변형된 조성물을 지칭한다. 적어도 하나의 조작된 요소를 포함함으로써, 조작된 조성물은 자연에 존재하지 않는 것이 된다.The term "engineered composition" in the context of all embodiments of the invention refers to a composition in which at least one of the elements has been engineered, ie, modified by a human, in such a way that the resulting element does not exist in nature. By including at least one engineered element, the engineered composition is non-existent in nature.

"폴리뉴클레오타이드", "뉴클레오타이드 서열" 및 "핵산"이란 용어는 본 명세서에서 호환 가능하게 사용되며 본 발명의 실시태양들의 상황에서 데옥시리보뉴클레오타이드이든 리보뉴클레오타이드이든, 임의의 길이의 뉴클레오타이드의 중합체성 형태, 또는 이들의 혼합물 또는 유사체를 지칭한다. 폴리뉴클레오타이드는 임의의 3차원 구조를 가질 수 있으며 공지되거나 공지되지 않은 임의의 기능을 수행할 수 있다. 하기는 폴리뉴클레오타이드의 비제한적인 예들이다: 유전자 또는 유전자 단편의 암호화 또는 비-암호화 영역, 연관 분석으로부터 한정된 유전자좌들(유전자좌), 엑손, 인트론, 메신저 RNA(mRNA), 전달 RNA(tRNA), 리보솜 RNA(rRNA), 짧은 간섭 RNA(siRNA), 짧은-헤어핀 RNA(shRNA), 미세-RNA(miRNA), 리보자임, cDNA, 재조합 폴리뉴클레오타이드, 분지된 폴리뉴클레오타이드, 플라스미드, 벡터, 임의의 서열의 단리된 DNA, 임의의 서열의 단리된 RNA, 핵산 탐침, 올리고뉴클레오타이드 및 프라이머. 폴리뉴클레오타이드는 하나 이상의 변형된 뉴클레오타이드, 예를 들어 메틸화된 뉴클레오타이드 및 뉴클레오타이드 유사체 또는 뉴클레오타이드 균등물을 포함할 수 있다(여기에서 뉴클레오타이드 유사체 또는 균등물은 변형된 염기 및/또는 변형된 주쇄, 및/또는 비-천연 뉴클레오사이드간 결합, 또는 이들 변형의 조합을 갖는 잔기로서 정의된다). 바람직한 뉴클레오타이드 유사체 및 균등물은 "일반적인 정의" 섹션에 기재되어 있다. 경우에 따라, 상기 뉴클레오타이드 구조에 대한 변형을 폴리뉴클레오타이드의 조립 전 또는 후에 도입시킬 수도 있다. 폴리뉴클레오타이드를 중합 후에, 예를 들어 표지화 화합물과의 접합에 의해 추가로 변형시킬 수도 있다.The terms "polynucleotide", "nucleotide sequence" and "nucleic acid" are used interchangeably herein and in the context of embodiments of the invention, a polymeric form of a nucleotide of any length, whether deoxyribonucleotide or ribonucleotide , or mixtures or analogs thereof. A polynucleotide may have any three-dimensional structure and may perform any function, known or unknown. The following are non-limiting examples of polynucleotides: coding or non-coding regions of a gene or gene fragment, loci defined from association analysis (locus), exons, introns, messenger RNA (mRNA), transfer RNA (tRNA), ribosome Isolation of RNA (rRNA), short interfering RNA (siRNA), short-hairpin RNA (shRNA), micro-RNA (miRNA), ribozyme, cDNA, recombinant polynucleotide, branched polynucleotide, plasmid, vector, arbitrary sequence DNA, isolated RNA of any sequence, nucleic acid probes, oligonucleotides and primers. A polynucleotide may comprise one or more modified nucleotides, such as methylated nucleotides and nucleotide analogs or nucleotide equivalents (wherein nucleotide analogs or equivalents include modified bases and/or modified backbones, and/or non -defined as residues having natural internucleoside bonds, or a combination of these modifications). Preferred nucleotide analogs and equivalents are described in the "General Definitions" section. Optionally, modifications to the nucleotide structure may be introduced before or after assembly of the polynucleotide. The polynucleotide may be further modified after polymerization, for example by conjugation with a labeling compound.

본 명세서에 사용되는 바와 같이, "폴리뉴클레오타이드"는 통상적인 포스포다이에스터 결합 또는 비-통상적인 결합(예를 들어 아미드 결합, 예를 들어 펩타이드 핵산(PNA) 중에서 발견되는)을 포함할 수 있다. 폴리뉴클레오타이드는 번역되지 않은 5' 및 3' 서열, 암호화 서열을 포함한, 완전길이 cDNA 서열의 뉴클레오타이드 서열뿐만 아니라 상기 핵산 서열의 단편, 에피토프, 도메인 및 변체를 함유할 수 있다. 상기 폴리뉴클레오타이드는 임의의 폴리리보뉴클레오타이드 또는 폴리데옥시리보뉴클레오타이드로 구성될 수 있으며, 이들은 변형되지 않은 RNA 또는 DNA 또는 변형된 RNA 또는 DNA일 수 있다. 예를 들어, 폴리뉴클레오타이드는 단일- 또는 이중-가닥 DNA, 단일- 및 이중-가닥 영역의 혼합물인 DNA, 단일- 및 이중-가닥 RNA, 및 단일- 및 이중-가닥 영역의 혼합물인 RNA, 단일 가닥 또는 보다 전형적으로 이중 가닥 또는 단일- 및 이중-가닥 영역의 혼합물일 수 있는 DNA 및 RNA를 포함하는 하이브리드 분자로 구성될 수 있다. 또한, 상기 폴리뉴클레오타이드는 RNA 또는 DNA 또는 RNA와 DNA 모두를 포함하는 삼중-가닥 영역으로 구성될 수 있다. 폴리뉴클레오타이드는 리보뉴클레오사이드(아데노신, 구아노신, 유리딘, 또는 시티딘; "RNA 분자") 또는 데옥시리보뉴클레오사이드(데옥시아데노신, 데옥시구아노신, 데옥시티미딘, 또는 데옥시시티딘; "DNA 분자"), 또는 이들의 임의의 포스포에스터 유사체, 예를 들어 포스포로티오에이트 및 티오에스터를 함유할 수 있다. 폴리뉴클레오타이드는 또한 안정성 또는 다른 이유로 변형된 하나 이상의 변형된 염기 또는 DNA 또는 RNA 주쇄를 함유할 수 있다. "변형된" 염기는 예를 들어 트리틸화된 염기 및 통상적이지 않은 염기, 예를 들어 이노신을 포함한다. DNA 및 RNA에 다양한 변형을 수행할 수 있으며; 따라서, "폴리뉴클레오타이드"는 화학적으로, 효소적으로 또는 대사적으로 변형된 형태를 포함한다. 핵산 분자란 용어는 오직 상기 분자의 1차 및 2차 구조만을 지칭하며, 상기를 임의의 특정한 3차 형태로 제한하지 않는다. 따라서, 상기 용어는 특히 선형 또는 환상 DNA 분자(예를 들어 제한 단편), 플라스미드, 및 염색체에서 발견되는 이중 가닥 DNA를 포함한다. 특정한 이중 가닥 DNA 분자 구조의 논의에서, 서열은 본 명세서에서 오직 DNA의 전사되지 않은 가닥(즉 mRNA와 상동성인 서열을 갖는 가닥)을 따라 5'에서 3' 방향의 서열만을 제공하는 통상적인 규약에 따라 기재될 수 있다.As used herein, a "polynucleotide" may include conventional phosphodiester linkages or non-conventional linkages (e.g., amide linkages, e.g., found in peptide nucleic acids (PNA)). . A polynucleotide may contain the nucleotide sequence of a full-length cDNA sequence, including untranslated 5' and 3' sequences, coding sequences, as well as fragments, epitopes, domains and variants of said nucleic acid sequence. The polynucleotide may consist of any polyribonucleotides or polydeoxyribonucleotides, which may be unmodified RNA or DNA or modified RNA or DNA. For example, polynucleotides include single- or double-stranded DNA, DNA that is a mixture of single- and double-stranded regions, single- and double-stranded RNA, and RNA that is a mixture of single- and double-stranded regions, single-stranded or more typically a hybrid molecule comprising DNA and RNA, which may be double-stranded or a mixture of single- and double-stranded regions. In addition, the polynucleotide may be composed of a triple-stranded region comprising RNA or DNA or both RNA and DNA. A polynucleotide may be a ribonucleoside (adenosine, guanosine, uridine, or cytidine; an “RNA molecule”) or a deoxyribonucleoside (deoxyadenosine, deoxyguanosine, deoxythymidine, or deoxy cytidine; "DNA molecule"), or any phosphoester analogs thereof, such as phosphorothioates and thioesters. Polynucleotides may also contain one or more modified bases or DNA or RNA backbones that have been modified for stability or other reasons. "Modified" bases include, for example, tritylated bases and unconventional bases, such as inosine. Various modifications can be made to DNA and RNA; Thus, "polynucleotide" includes chemically, enzymatically or metabolically modified forms. The term nucleic acid molecule refers only to the primary and secondary structure of the molecule and is not limited to any particular tertiary conformation. Thus, the term specifically includes double-stranded DNA found in linear or circular DNA molecules (eg restriction fragments), plasmids, and chromosomes. In the discussion of specific double-stranded DNA molecular structures, sequences are in the conventional convention of providing only sequences in the 5' to 3' direction along the untranscribed strand of DNA (i.e., the strand having a sequence homologous to the mRNA) herein. can be described accordingly.

"단리된" 핵산 분자란 용어는 그의 고유 환경으로부터 제거된 핵산 분자, DNA 또는 RNA를 지칭한다. 단리된 핵산 분자의 추가적인 예는 이종 숙주 세포 중에서 유지되는 재조합 폴리뉴클레오타이드 또는 용액 중 정제된(부분적으로 또는 실질적으로) 폴리뉴클레오타이드를 포함하는 핵산 분자를 포함한다. 단리된 RNA 분자는 본 발명의 폴리뉴클레오타이드의 생체내 또는 시험관내 RNA 전사물을 포함한다. 본 발명에 따른 단리된 핵산 분자는 합성적으로 생성된 상기와 같은 분자를 추가로 포함한다. 또한, 핵산 분자 또는 폴리뉴클레오타이드는 조절 요소, 예를 들어 프로모터, 리보솜 결합 부위, 또는 전사 종결자를 포함할 수 있다.The term “isolated” nucleic acid molecule refers to a nucleic acid molecule, DNA or RNA that has been removed from its native environment. Additional examples of isolated nucleic acid molecules include nucleic acid molecules comprising recombinant polynucleotides maintained in a heterologous host cell or purified (partially or substantially) polynucleotides in solution. Isolated RNA molecules include in vivo or in vitro RNA transcripts of polynucleotides of the invention. The isolated nucleic acid molecule according to the present invention further comprises such molecules produced synthetically. A nucleic acid molecule or polynucleotide may also include regulatory elements, such as promoters, ribosome binding sites, or transcription terminators.

"유전자"는 폴리펩타이드를 암호화하는 뉴클레오타이드의 조립체를 지칭하며 cDNA 및 게놈 DNA 핵산을 포함한다. "유전자"는 또한 개별적인 암호화 분절(엑손) 사이의 개재 서열(인트론)뿐만 아니라 암호화 서열에 선행(5' 비-암호화 서열) 및 후행(3' 비-암호화 서열)하는 조절 서열을 포함하여, 특정 단백질을 발현하는 핵산 단편을 지칭한다. "고유 유전자"는 그 자신의 조절 서열을 갖는, 자연에서 발견되는 바와 같은 유전자를 지칭한다.A “gene” refers to an assembly of nucleotides encoding a polypeptide and includes cDNA and genomic DNA nucleic acids. A "gene" also includes intervening sequences (introns) between individual coding segments (exons) as well as regulatory sequences preceding (5' non-coding sequences) and following (3' non-coding sequences) coding sequences, including, Refers to a nucleic acid fragment that expresses a protein. A “native gene” refers to a gene as found in nature, having its own regulatory sequences.

일부 실시태양에서, 상기 핵산 분자는 본 명세서에 보고된 폴리뉴클레오타이드 서열에 적어도 약 80%, 85%, 또는 90% 일치하는, 또는 본 명세서에 보고된 폴리뉴클레오타이드 서열에 적어도 약 95%, 96%, 97%, 98%, 99% 또는 100% 일치하는 폴리뉴클레오타이드 서열을 포함한다. 당해 분야에 공지된 바와 같이, "일치성 퍼센트"란 용어는 서열 비교에 의해 측정시, 2개 이상의 아미노산 서열 또는 2개 이상의 폴리뉴클레오타이드 서열간의 관계이다. 당해 분야에서, "일치성"은 또한 서열들의 스트링간의 합치에 의해 측정시, 경우에 따라, 아미노산 또는 폴리뉴클레오타이드 서열간의 서열 관련성 정도를 의미한다.In some embodiments, the nucleic acid molecule is at least about 80%, 85%, or 90% identical to a polynucleotide sequence as reported herein, or at least about 95%, 96% identical to a polynucleotide sequence as reported herein, polynucleotide sequences that are 97%, 98%, 99% or 100% identical. As is known in the art, the term "percent identity" is the relationship between two or more amino acid sequences or two or more polynucleotide sequences, as determined by sequence comparison. In the art, "identity" also refers to the degree of sequence relatedness between amino acid or polynucleotide sequences, as the case may be, as measured by the agreement between strings of sequences.

본 발명의 참조 폴리뉴클레오타이드 서열에 적어도, 예를 들어 95% "일치하는" 폴리뉴클레오타이드 서열을 갖는 핵산 분자의 경우, 상기 핵산 분자의 폴리뉴클레오타이드 서열은 상기 폴리뉴클레오타이드 서열이 상기 참조 폴리뉴클레오타이드 서열의 각 100 뉴클레오타이드당 5개 이하의 뉴클레오타이드 차이를 포함할 수 있음을 제외하고, 상기 참조 서열과 일치함을 의미한다. 즉, 참조 폴리뉴클레오타이드 서열에 적어도 95% 일치하는 폴리뉴클레오타이드 서열을 갖는 핵산 분자를 획득하기 위해서, 상기 참조 서열 중 뉴클레오타이드의 5% 이하가 결실되거나 또 다른 뉴클레오타이드로 치환될 수 있거나, 또는 상기 참조 서열의 전체 뉴클레오타이드의 5% 이하의 뉴클레오타이드의 수가 상기 참조 서열내에 삽입될 수 있다.For a nucleic acid molecule having a polynucleotide sequence that is at least, for example 95% "identical to" a reference polynucleotide sequence of the invention, the polynucleotide sequence of the nucleic acid molecule is such that the polynucleotide sequence is equal to each 100 of the reference polynucleotide sequence. is meant to match the reference sequence, except that it may contain a difference of up to 5 nucleotides per nucleotide. That is, in order to obtain a nucleic acid molecule having a polynucleotide sequence that is at least 95% identical to a reference polynucleotide sequence, 5% or less of the nucleotides in the reference sequence may be deleted or substituted with another nucleotide, or A number of nucleotides up to 5% of the total nucleotides may be inserted into the reference sequence.

실제적인 문제로서, 임의의 특정 폴리뉴클레오타이드 서열 또는 아미노산 서열이 본 발명의 폴리뉴클레오타이드 서열 또는 아미노산 서열에 적어도 80%, 85%, 90%, 95%, 96%, 97%, 98% 또는 99% 일치하는 지의 여부를 공지된 컴퓨터 프로그램을 사용하여 통상적으로 측정할 수 있다. 의문의 서열(본 발명의 서열)과 주 서열간의 최상의 전체적인 합치를 측정하는 방법을, 서열의 정렬 및 일치성 점수의 계산을 사용하여 결정할 수 있다. 상기 정렬은 참조 알고리즘 및 지니어스 참조 어셈블러(Geneious reference assembler)에 대한 지도와 함께 컴퓨터 프로그램 지니어스(www.geneious.com)를 사용하여 수행되었다.As a practical matter, any particular polynucleotide sequence or amino acid sequence is at least 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to a polynucleotide sequence or amino acid sequence of the present invention. Whether or not it can be measured can be conventionally performed using a known computer program. The method of determining the best overall match between the sequence in question (sequence of the invention) and the main sequence can be determined using the alignment of the sequences and calculation of the identity score. The alignment was performed using the computer program Genius (www.geneious.com) with a reference algorithm and a map to the Geneious reference assembler.

본 발명에 따른 안내-폴리뉴클레오타이드는 표적-폴리뉴클레오타이드와 하이브리드화할 수 있는 적어도 하나의 안내-서열을 포함하며 상기 표적-폴리뉴클레오타이드에의 상기 CRISPR-Cas 시스템의 서열-특이적 결합을 지시하여 CRISPR-Cas 복합체를 형성시킬 수 있다. 활성 CRISPR-Cas 복합체의 형성을 가능하게 하기 위해서, 상기 안내-폴리뉴클레오타이드는 바람직하게는 또한, 특이적인 2차 구조를 갖고 상기 Cas 단백질의 상기 안내-폴리뉴클레오타이드에의 결합을 허용하는 서열을 포함한다. 상기와 같은 서열은 당해 분야에 tracr RNA, tracr 서열, tracr 스캐폴드 또는 안내-폴리뉴클레오타이드 구조 성분으로서 공지되어 있으며, 이들 용어는 본 명세서에서 호환 가능하게 사용되고; 여기에서 상기 tracr은 전사촉진 CRISPR에 대한 약어이고; 따라서 tracrRNA는 전사촉진 CRISPR RNA를 의미한다. 원 CRISPR-Cas 시스템 중의 tracrRNA는 crRNA(안내-서열)를 Cas 뉴클레아제에 연결하며, 어떠한 crRNA에도 결합할 수 있는 내인성 세균 RNA이다. 안내-폴리뉴클레오타이드 구조 성분은 단일의 폴리뉴클레오타이드 분자로 구성되거나 또는 서로에 대해 하이브리드화된 2개 이상의 분자, 또는 Cas 단백질 또는 유사한 기능의 다른 뉴클레아제와 결합하는 2개 이상의 분자로 구성될 수 있다. 안내-폴리뉴클레오타이드 구조의 상기와 같은 성분을 tracr 서열 및 tracr-짝 서열로서 지칭할 수 있다.A guide-polynucleotide according to the present invention comprises at least one guide-sequence capable of hybridizing with a target-polynucleotide and directs sequence-specific binding of said CRISPR-Cas system to said target-polynucleotide to thereby direct CRISPR- Cas complexes can be formed. To enable the formation of an active CRISPR-Cas complex, the guide-polynucleotide preferably also comprises a sequence having a specific secondary structure and allowing binding of the Cas protein to the guide-polynucleotide . Such sequences are known in the art as tracr RNA, tracr sequences, tracr scaffolds or guide-polynucleotide structural components, and these terms are used interchangeably herein; wherein the tracr is an abbreviation for transactivation CRISPR; Therefore, tracrRNA means transcription-promoting CRISPR RNA. The tracrRNA in the native CRISPR-Cas system is an endogenous bacterial RNA capable of linking a crRNA (guide-sequence) to a Cas nuclease and can bind any crRNA. A guide-polynucleotide structural component may consist of a single polynucleotide molecule or may consist of two or more molecules hybridized to each other, or two or more molecules that bind Cas proteins or other nucleases of similar function. . Such components of the guide-polynucleotide structure may be referred to as tracr sequences and tracr-pair sequences.

상응하게, 상기 안내-폴리뉴클레오타이드는 바람직하게는 또한 tracr 서열 및/또는 tracr-짝 서열을 포함한다. 상기 안내-폴리뉴클레오타이드는 본 명세서에서 상기에 제시된 폴리뉴클레오타이드의 일반적인 정의에 따른 폴리뉴클레오타이드이며; 바람직한 안내-폴리뉴클레오타이드는 리보뉴클레오타이드를 포함하고, 보다 바람직한 안내-폴리뉴클레오타이드는 RNA(안내-RNA)이다. 안내-폴리뉴클레오타이드 구조의 2가지 예를 도 1에 묘사한다.Correspondingly, said guide-polynucleotide preferably also comprises a tracr sequence and/or a tracr-pair sequence. said guide-polynucleotide is a polynucleotide according to the general definition of polynucleotide set forth herein above; Preferred guide-polynucleotides include ribonucleotides, and more preferred guide-polynucleotides are RNA (guide-RNA). Two examples of guide-polynucleotide structures are depicted in FIG. 1 .

본 발명의 상황에서, 서열은 주 서열이 바람직하게는 숙주 세포에서와 같은 생리학적 조건하에서 표적-서열 또는 표적-폴리뉴클레오타이드와 하이브리드화할 수 있는 경우, 근본적으로 상기 표적-서열 또는 표적-폴리뉴클레오타이드의 역보체로서 지칭된다. 안내-서열과 그의 상응하는 표적-서열간의 상보성 정도는 적합한 정렬 알고리즘을 사용하여 최적으로 정렬될 때, 바람직하게는 50%, 60%, 75%, 80%, 85%, 90%, 95%, 97.5%, 99% 초과 서열 일치성이다. 상기 표적-폴리뉴클레오타이드가 이중가닥 폴리뉴클레오타이드인 경우, 안내-서열과 같은 주 서열은 상기 표적-폴리뉴클레오타이드의 어느 한 가닥, 예를 들어 암호화 가닥 또는 비-암호화 가닥과 하이브리드화할 수 있다.In the context of the present invention, a sequence essentially consists of a target-sequence or target-polynucleotide, if the main sequence is preferably capable of hybridizing with a target-sequence or target-polynucleotide under physiological conditions, such as in a host cell. referred to as reverse complement. The degree of complementarity between the guide-sequence and its corresponding target-sequence, when optimally aligned using a suitable alignment algorithm, is preferably 50%, 60%, 75%, 80%, 85%, 90%, 95%, 97.5%, greater than 99% sequence identity. When the target-polynucleotide is a double-stranded polynucleotide, a main sequence, such as a guide-sequence, can hybridize with either strand of the target-polynucleotide, eg, a coding strand or a non-coding strand.

바람직하게, 본 발명에 따른 안내-서열은 표적 중 독특한 표적-서열을 표적화한다. 바람직하게, 본 발명에 따른 안내-서열은 PAM 서열에 바로 인접한 표적-폴리뉴클레오타이드 중 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 또는 20, 보다 바람직하게는 8, 9, 10, 11 또는 12 뉴클레오타이드와 100% 서열 일치성을 갖는다.Preferably, the guide-sequence according to the invention targets a unique target-sequence among the targets. Preferably, the guide-sequence according to the present invention is 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20, more preferably in the target-polynucleotide immediately adjacent to the PAM sequence. has 100% sequence identity with 8, 9, 10, 11 or 12 nucleotides.

본 발명에 따른 안내-서열은 길이가 바람직하게는 8 내지 30, 보다 바람직하게는 10 내지 30, 보다 바람직하게는 15 내지 30, 보다 바람직하게는 17 내지 27, 보다 바람직하게는 17 내지 20, 보다 바람직하게는 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 또는 27 뉴클레오타이드이다. CRISPR-Cas 시스템의 표적-서열에의 서열-특이적 결합을 지시하여 CRISPR-Cas 복합체를 형성시키는 안내-서열의 능력을 임의의 적합한 분석에 의해 평가할 수 있다. 예를 들어, 시험하려는 안내-서열을 포함하여, CRISPR-Cas 복합체를 형성하기에 충분한 CRISPR 시스템의 성분들을, 예를 들어 상기 CRISPR-Cas 시스템의 성분들을 암호화하는 벡터에 의한 형질감염, 이어서 예를 들어 표준 서열 분석에 의한, 우선적인 절단 및/또는 상기 표적-서열내 세포 수복 기전에 의해 유도된 생성된 돌연변이의 평가에 의해, 상응하는 표적-서열을 갖는 숙주 세포에 제공할 수 있다. 표적-폴리뉴클레오타이드의 절단을 또한, 상기 표적-폴리뉴클레오타이드, 시험하려는 안내-서열 및 상기 시험 안내-서열과 상이한 대조용 안내-서열을 포함한 CRISPR-Cas 시스템의 성분들을 제공하고, 상기 시험 및 대조용 안내-서열 반응들간에 표적-서열에서의 절단률 또는 결합을 비교함으로써 시험관에서 평가할 수 있다. 다른 분석들이 가능하며, 이는 당해 분야의 숙련가에게 공지되어 있다.The guide-sequence according to the invention preferably has a length of 8 to 30, more preferably 10 to 30, more preferably 15 to 30, more preferably 17 to 27, more preferably 17 to 20, more preferably 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, or 27 nucleotides. The ability of a guide-sequence to direct sequence-specific binding to a target-sequence of the CRISPR-Cas system to form a CRISPR-Cas complex can be assessed by any suitable assay. Components of the CRISPR system sufficient to form a CRISPR-Cas complex, including, for example, the guide-sequence to be tested, for example by transfection with a vector encoding the components of the CRISPR-Cas system, followed by, for example, Host cells with the corresponding target-sequence can be presented to host cells, eg by standard sequencing, by evaluation of the resulting mutations induced by preferential cleavage and/or cell repair mechanisms in the target-sequence. Cleavage of the target-polynucleotide also provides components of the CRISPR-Cas system, including the target-polynucleotide, the guide-sequence to be tested, and a control guide-sequence different from the test guide-sequence, for the test and control. It can be assessed in vitro by comparing the rate of cleavage or binding at the target-sequence between guide-sequence reactions. Other assays are possible and known to those skilled in the art.

안내-폴리뉴클레오타이드 구조 성분은 활성 CRISPR-Cas 복합체의 형성에 필요한 것으로 여겨진다. 상기 안내-폴리뉴클레오타이드 구조 성분은 반드시 안내-서열에 작동적으로 연결되는 것은 아닌 것으로 여겨지나; 안내-폴리뉴클레오타이드 구조 성분은 안내-폴리뉴클레오타이드내의 안내-서열에 작동적으로 연결될 수도 있다. 본 발명에 따른 안내-폴리뉴클레오타이드 구조 성분[상기는 야생형 안내-폴리뉴클레오타이드 구조 성분의 전부 또는 일부(예를 들어 야생형 tracr-서열의 약 또는 약 20, 26, 32, 45, 48, 54, 63, 67, 85 초과, 또는 그 이상의 뉴클레오타이드)를 포함하거나 또는 상기로 이루어질 수 있다]은, 예를 들어 본 발명에 따른 tracr-서열의 적어도 일부의 본 발명에 따른 tracr-짝 서열의 전부 또는 일부에의 하이브리드화에 의해서 및 바람직하게는 본 발명에 따른 안내-서열에 작동적으로 연결되어 CRISPR-Cas 복합체 부분을 형성한다. 본 발명에 따른 tracr-서열은 바람직하게는 숙주 세포에서와 같은 생리학적 조건하에서 본 발명에 따른 tracr-짝 서열에 하이브리드화하고 CRISPR-Cas 복합체의 형성을 촉진하기에 충분한 상보성을 갖는다. 본 발명에 따른 표적-서열과 같이, 기능성이기에 충분한 상보성이 존재한다면, 완전한 상보성은 필요하지 않은 것으로 여겨진다. 바람직하게, 본 발명에 따른 tracr-서열은 최적으로 정렬될 때 본 발명에 따른 tracr-짝 서열의 길이를 따라 적어도 50%, 60%, 70%, 80%, 90%, 95% 또는 99% 서열 일치성을 갖는다. 최적의 정렬은 상기에 논의된 바와 같이 서열 정렬에 대한 임의의 적합한 알고리즘을 사용하여 측정될 수 있다.The guide-polynucleotide structural component is believed to be required for the formation of an active CRISPR-Cas complex. It is believed that the guide-polynucleotide structural component is not necessarily operatively linked to the guide-sequence; A guide-polynucleotide structural component may be operatively linked to a guide-sequence within the guide-polynucleotide. A guide-polynucleotide structural component according to the invention [which includes all or part of a wild-type guide-polynucleotide structural component (eg about or about 20, 26, 32, 45, 48, 54, 63, wild-type tracr-sequence, 67, 85, or more nucleotides)), for example, of at least a part of a tracr-sequence according to the invention to all or part of a tracr-pair sequence according to the invention. By hybridization and preferably operatively linked to the guide-sequence according to the invention, it forms part of the CRISPR-Cas complex. The tracr-sequence according to the invention preferably has sufficient complementarity to hybridize to the tracr-pair sequence according to the invention under physiological conditions such as in a host cell and promote the formation of the CRISPR-Cas complex. Perfect complementarity is not considered necessary if there is sufficient complementarity to be functional, such as a target-sequence according to the present invention. Preferably, the tracr-sequence according to the invention is at least 50%, 60%, 70%, 80%, 90%, 95% or 99% sequence along the length of the tracr-pair sequence according to the invention when optimally aligned. have consistency. Optimal alignment can be determined using any suitable algorithm for sequence alignment as discussed above.

일반적으로, 본 발명에 따른 tracr 짝 서열은 본 발명에 따른 tracr 서열과 표적-서열에서 CRISPR-Cas 복합체의 형성을 촉진하기에 충분한 상보성을 갖는 임의의 서열을 포함하며, 여기에서 상기 CRISPR-Cas 복합체는 본 발명에 따른 tracr 서열에 하이브리드화된 본 발명에 따른 tracr 짝 서열을 포함한다. 본 발명에 따른 tracr 서열과 본 발명에 따른 tracr 짝 서열의 상보성 정도는 바람직하게는 2개 서열 중 보다 짧은 서열의 길이를 따라 상기 tracr 짝 서열 및 tracr 서열의 최적의 정렬에 관해 정의된다. 최적의 정렬은 상기에 논의된 바와 같이 서열 정렬에 대한 임의의 적합한 알고리즘을 사용하여 측정될 수 있다.In general, a tracr pair sequence according to the present invention comprises any sequence having sufficient complementarity to promote the formation of a CRISPR-Cas complex in a target-sequence with a tracr sequence according to the present invention, wherein said CRISPR-Cas complex comprises a tracr partner sequence according to the invention hybridized to a tracr sequence according to the invention. The degree of complementarity of the tracr sequence according to the invention and the tracr pair sequence according to the invention is preferably defined with respect to the optimal alignment of the tracr pair sequence and the tracr sequence along the length of the shorter of the two sequences. Optimal alignment can be determined using any suitable algorithm for sequence alignment as discussed above.

바람직하게, 본 발명에 따른 tracr 짝 서열 및 본 발명에 따른 tracr 서열에 대해서, 2차 구조는 예를 들어 상기 tracr 서열 또는 tracr 짝 서열 중 어느 하나내에서의 자기-상보성을 고려한다. 바람직하게, 본 발명에 따른 tracr 서열 및 본 발명에 따른 tracr 짝 서열간의 상보성 정도는 상기 2개 서열 중 보다 짧은 서열의 길이를 따라 최적으로 정렬될 때 50%, 60%, 75%, 80%, 85%, 90%, 95%, 97.5%, 99% 초과의 서열 일치성이다. 바람직하게 본 발명에 따른 tracr 짝 서열은 길이가 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 40, 50 이상의 뉴클레오타이드이다. 바람직하게 본 발명에 따른 tracr 서열은 길이가 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 40, 50 이상의 뉴클레오타이드이다. 바람직하게, 본 발명에 따른 tracr 서열 및 본 발명에 따른 tracr 짝 서열, 즉 안내-폴리뉴클레오타이드 구조 성분은, 상기 둘간의 하이브리드화가 2차 구조를 포함하는 하이브리드화 복합체, 예를 들어 헤어핀을 생성시키도록 단일의 전사물내에 포함된다. 상기와 같은 하이브리드화 복합체는 또한 상기 tracr 서열 및 tracr 짝 서열이 단일의 전사물 중에 포함되지 않는 경우에 형성될 수도 있다. 헤어핀 구조의 형성을 위한 본 발명에 따른 tracr 서열 및/또는 본 발명에 따른 tracr 짝 서열 및/또는 본 발명에 따른 안내-폴리뉴클레오타이드 구조 성분에서 바람직한 고리 형성 서열은 길이가 4 뉴클레오타이드이고, 가장 바람직하게는 GAAA 서열을 가지며; 교체 서열일 수도 있는, 보다 길거나 보다 짧은 고리 서열을 사용할 수도 있다. 상기 고리 서열은 바람직하게는 뉴클레오타이드 트리플릿(예를 들어 AAA) 및 추가적인 뉴클레오타이드(예를 들어 C 또는 G)를 포함한다. 고리 형성 서열의 예는 CAAA 및 AAAG를 포함한다. 바람직하게 본 발명에 따른 tracr 서열 및/또는 본 발명에 따른 tracr 짝 서열 또는 그의 하이브리드화 복합체 및/또는 본 발명에 따른 안내-폴리뉴클레오타이드 구조 성분은 적어도 2개 이상의 헤어핀을 포함하거나 또는 상기를 형성할 수 있다. 보다 바람직하게, 본 발명에 따른 tracr 서열 및/또는 본 발명에 따른 tracr 짝 서열 또는 그의 하이브리드화 복합체 및/또는 본 발명에 따른 안내-폴리뉴클레오타이드 구조 성분은 2, 3, 4 또는 5개의 헤어핀을 포함하거나 또는 상기를 형성할 수 있다. 바람직하게, 본 발명에 따른 tracr 서열 및/또는 본 발명에 따른 tracr 짝 서열 또는 그의 하이브리드화 복합체 및/또는 본 발명에 따른 안내-폴리뉴클레오타이드 구조 성분은 5개 이하의 헤어핀을 포함하거나 또는 상기를 형성할 수 있다. 바람직하게, 본 발명에 따른 tracr 서열 및 본 발명에 따른 tracr-짝 서열 또는 본 발명에 따른 tracr 서열 및 본 발명에 따른 tracr 짝 서열의 하이브리드화 복합체 및/또는 본 발명에 따른 안내-폴리뉴클레오타이드 구조 성분의 단일 전사물은 전사 종결 서열을 추가로 포함하며; 바람직하게 상기는 폴리T 서열, 예를 들어 6개의 T 뉴클레오타이드이다. 말한 바와 같이, 안내-폴리뉴클레오타이드 구조 성분은 당해 분야의 숙련가에게 공지되어 있으며; 배경 정보는 문헌[Gaj et al., 2013]에서 찾을 수 있다.Preferably, for the tracr pair sequence according to the invention and the tracr sequence according to the invention, the secondary structure takes into account, for example, self-complementarity within either said tracr sequence or the tracr pair sequence. Preferably, the degree of complementarity between the tracr sequence according to the invention and the tracr partner sequence according to the invention is 50%, 60%, 75%, 80%, when optimally aligned along the length of the shorter of the two sequences, greater than 85%, 90%, 95%, 97.5%, 99% sequence identity. Preferably the tracr pair sequence according to the invention has a length of 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 40, 50 or more nucleotides. Preferably the tracr sequence according to the invention has a length of 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 40, 50 more than one nucleotide. Preferably, the tracr sequence according to the invention and the tracr pair sequence according to the invention, ie the guide-polynucleotide structural component, are such that hybridization between the two produces a hybridization complex comprising a secondary structure, for example a hairpin. contained within a single transcript. Such hybridization complexes may also be formed when the tracr sequence and the tracr pair sequence are not included in a single transcript. Preferred ring-forming sequences in the tracr sequence according to the invention and/or the tracr pair sequence according to the invention and/or the guide-polynucleotide structural component according to the invention for the formation of a hairpin structure are 4 nucleotides in length, most preferably has a GAAA sequence; Longer or shorter ring sequences, which may be replacement sequences, may also be used. The ring sequence preferably comprises a nucleotide triplet (eg AAA) and additional nucleotides (eg C or G). Examples of ring-forming sequences include CAAA and AAAG. Preferably the tracr sequence according to the invention and/or the tracr pair sequence according to the invention or hybridization complex thereof and/or the intra-polynucleotide structural component according to the invention comprises or forms at least two or more hairpins. can More preferably, the tracr sequence according to the invention and/or the tracr pair sequence according to the invention or hybridization complex thereof and/or the guide-polynucleotide structural component according to the invention comprises 2, 3, 4 or 5 hairpins or may form the above. Preferably, the tracr sequence according to the invention and/or the tracr pair sequence according to the invention or hybridization complex thereof and/or the guide-polynucleotide structural component according to the invention comprises or forms no more than 5 hairpins. can do. Preferably, a hybridization complex of a tracr sequence according to the invention and a tracr-pair sequence according to the invention or a tracr sequence according to the invention and a tracr pair sequence according to the invention and/or a guide-polynucleotide structural component according to the invention a single transcript of further comprises a transcription termination sequence; Preferably it is a polyT sequence, for example 6 T nucleotides. As stated, the guiding-polynucleotide structural components are known to those skilled in the art; Background information can be found in Gaj et al., 2013.

본 발명에 따른 모든 실시태양들의 상황에서, "표적-폴리뉴클레오타이드"란 용어는 본 발명에 따른 안내-서열이 상보성을 갖도록 설계된 본 발명에 따른 표적-서열을 지칭하며, 이때 본 발명에 따른 표적-서열과 본 발명에 따른 안내-서열간의 하이브리드화는 CRISPR-Cas 복합체의 형성을 촉진한다. CRISPR-Cas 복합체의 하이브리드화를 일으키고 상기 복합체의 형성을 촉진하기에 충분한 상보성이 존재하는 한, 완전한 상보성이 반드시 요구되는 것은 아니다. 바람직하게, 본 발명에 따른 안내-서열은 표적 중의 독특한 표적-서열을 표적화한다. 바람직하게, 본 발명에 따른 안내-서열은 PAM 서열에 바로 인접한 표적-폴리뉴클레오타이드 중의 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 또는 20, 보다 바람직하게는 8, 9, 10, 11 또는 12 뉴클레오타이드와 100% 서열 일치성을 갖는다. 본 발명에 따른 표적-폴리뉴클레오타이드는 임의의 폴리뉴클레오타이드, 예를 들어 DNA 또는 RNA 폴리뉴클레오타이드를 포함할 수 있으며 단일 또는 이중 가닥일 수 있다. 상기 표적-폴리뉴클레오타이드가 이중가닥 폴리뉴클레오타이드인 경우, 본 발명에 따른 안내-서열은 상기 표적-폴리뉴클레오타이드의 어느 한 가닥, 예를 들어 암호화 가닥 또는 비-암호화 가닥과 하이브리드화할 수 있다.In the context of all embodiments according to the present invention, the term "target-polynucleotide" refers to a target-sequence according to the present invention, wherein the guide-sequence according to the present invention is designed to have complementarity, wherein the target according to the present invention- Hybridization between the sequence and the guide-sequence according to the invention promotes the formation of the CRISPR-Cas complex. Complete complementarity is not necessarily required as long as there is sufficient complementarity to cause hybridization of the CRISPR-Cas complex and promote formation of the complex. Preferably, the guide-sequence according to the invention targets a unique target-sequence in the target. Preferably, the guide-sequence according to the invention is 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20, more preferably in the target-polynucleotide immediately adjacent to the PAM sequence. has 100% sequence identity with 8, 9, 10, 11 or 12 nucleotides. The target-polynucleotide according to the present invention may comprise any polynucleotide, for example a DNA or RNA polynucleotide, and may be single or double stranded. When the target-polynucleotide is a double-stranded polynucleotide, the guide-sequence according to the present invention can hybridize with either strand of the target-polynucleotide, for example a coding strand or a non-coding strand.

본 발명에 따른 표적-폴리뉴클레오타이드는 세포의 핵 또는 세포질 중에 위치할 수 있다. 본 발명에 따른 표적-폴리뉴클레오타이드는 숙주 세포의 세포소기관, 예를 들어 미토콘드리아 또는 색소체 중에 위치할 수도 있다. 본 발명에 따른 표적-폴리뉴클레오타이드는 게놈내에 포함되거나, 염색체내에 포함되거나 또는 염색체외에 존재할 수 있으며, 인공 염색체 중에 포함될 수도 있고, 임의의 염색체 개체 또는 염색체-외 개체, 예를 들어 상염색체 복제 개체, 예를 들어 에피솜 플라스미드 또는 벡터 중에 존재할 수도 있다. 본 발명에 따른 표적-폴리뉴클레오타이드는 숙주 세포에 고유이거나 외래일 수 있다.The target-polynucleotide according to the present invention may be located in the nucleus or cytoplasm of a cell. The target-polynucleotide according to the present invention may be located in an organelle of a host cell, for example in mitochondria or plastids. The target-polynucleotide according to the present invention may be contained within the genome, within the chromosome or present extrachromosomally, may also be contained within an artificial chromosome, any chromosomal entity or extrachromosomal entity, for example an autosomal replicating entity, For example, it may be present in an episomal plasmid or vector. The target-polynucleotide according to the present invention may be native to the host cell or foreign.

본 발명에 따른 표적-폴리뉴클레오타이드는 바람직하게는, CRISPR-Cas 복합체에 의해 인식되는 짧은 폴리뉴클레오타이드인 프로토스페이서 인접 모티프(PAM)와 결합된다. 바람직하게, 상기 표적-폴리뉴클레오타이드 및 PAM은 연결되며, 여기에서 상기 PAM은 바람직하게는 상기 표적-폴리뉴클레오타이드의 바로 하류(3')에 있다. 상기 PAM의 정확한 서열 및 길이는 다양할 수 있다, 예를 들어 상이한 Cas 단백질 및 유사한 기능의 뉴클레아제는 상이한 PAM 서열을 필요로 할 수 있다. 본 발명에 따른 바람직한 PAM은 2 내지 8 뉴클레오타이드 길이의 폴리뉴클레오타이드이다. 바람직한 PAM은 5′-XGG-3′, 5′-XGGXG-3′, 5′-XXAGAAW-3′, 5′-XXXXGATT-3′, 5′-XXAGAA-3′, 5′-XAAAAC-3′(여기에서 X는 임의의 뉴클레오타이드 또는 그의 유사체, 바람직하게는 임의의 뉴클레오타이드일 수 있고; W는 A 또는 T이다)로 이루어지는 그룹 중에서 선택된다. 보다 바람직한 PAM은 5'-XGG-3'이다. 상기 PAM은 바람직하게는 Cas 단백질과 합치된다. 가장 널리 사용되는 CAS/CRISPR 시스템은 스트렙토코커스 피오게네스로부터 유래되며 합치하는 PAM 서열 5'-XGG-3'은 표적-서열의 바로 하류(3')에 위치한다. 네이세리아 메닌지티디스(Neisseria meningitidis) Cas 단백질에 바람직한 PAM은 5′-XXXXGATT-3′이고; 스트렙토코커스 써모필루스(Streptococcus thermophilus) Cas 단백질에 바람직한 PAM은 5′-XXAGAA-3′이며; 트레포네마 덴티콜라(Treponema denticola)에 바람직한 PAM은 5′-XAAAAC-3′이다. 바람직한 PAM은 사용된 Cas 단백질과 합치한다. 본 발명에 따른 Cas 단백질을, 야생형 Cas 단백질과 합치하는 고유 PAM보다는 상이한 PAM과 합치하도록 조작할 수 있다. 이와 같이, 본 발명에 따른 CRISPR-Cas 시스템을 맞춤된 특이적 표적화에 사용할 수 있다.The target-polynucleotide according to the present invention is preferably associated with a protospacer adjacent motif (PAM), which is a short polynucleotide recognized by the CRISPR-Cas complex. Preferably, the target-polynucleotide and the PAM are linked, wherein the PAM is preferably immediately downstream (3′) of the target-polynucleotide. The exact sequence and length of the PAM may vary, eg, different Cas proteins and nucleases of similar function may require different PAM sequences. Preferred PAMs according to the invention are polynucleotides of 2 to 8 nucleotides in length. Preferred PAMs are 5'-XGG-3', 5'-XGGXG-3', 5'-XXAGAAW-3', 5'-XXXXGATT-3', 5'-XXAGAA-3', 5'-XAAAAC-3' (wherein X can be any nucleotide or analog thereof, preferably any nucleotide; W is A or T). A more preferred PAM is 5'-XGG-3'. The PAM is preferably compatible with a Cas protein. The most widely used CAS/CRISPR system is derived from Streptococcus pyogenes and the matching PAM sequence 5'-XGG-3' is located immediately downstream (3') of the target-sequence. A preferred PAM for Neisseria meningitidis Cas protein is 5′-XXXXGATT-3′; A preferred PAM for the Streptococcus thermophilus Cas protein is 5'-XXAGAA-3'; A preferred PAM for Treponema denticola is 5'-XAAAAC-3'. Preferred PAMs are compatible with the Cas protein used. Cas proteins according to the invention can be engineered to match a different PAM than a native PAM that matches the wild-type Cas protein. As such, the CRISPR-Cas system according to the present invention can be used for tailored specific targeting.

"하이브리드화"란 용어는 하나 이상의 폴리뉴클레오타이드가 반응하여, 뉴클레오타이드 잔기의 염기들간의 수소 결합을 통해 안정화되는 복합체를 형성하는 반응을 지칭한다. 상기 수소 결합은 와트슨 크리크 염기 짝짓기, 후그스테인 결합에 의해, 또는 임의의 다른 서열-특이적인 방식으로 발생할 수 있다. 상기 복합체는 듀플렉스 구조를 형성하는 2개의 가닥, 다중 가닥 복합체를 형성하는 3개 이상의 가닥, 단일의 자기-하이브리드화 가닥, 또는 이들의 임의의 조합을 포함할 수 있다. 하이브리드화 반응은 보다 광범위한 가공 중의 한 단계, 예를 들어 효소에 의한 폴리뉴클레오타이드의 절단을 구성할 수 있다. 바람직한 하이브리드화 조건은 본 발명에 따른 숙주 세포내에서와 같은 생리학적 조건이다.The term "hybridization" refers to a reaction in which one or more polynucleotides react to form a complex that is stabilized via hydrogen bonds between the bases of nucleotide residues. The hydrogen bonding may occur by Watson Creek base pairing, Hoogstein bonding, or in any other sequence-specific manner. The complex may comprise two strands forming a duplex structure, three or more strands forming a multi-stranded complex, a single self-hybridizing strand, or any combination thereof. A hybridization reaction may constitute one step in a broader process, for example, cleavage of a polynucleotide by an enzyme. Preferred hybridization conditions are physiological conditions, such as in a host cell according to the present invention.

본 발명의 모든 실시태양들의 상황에서 "소스"란 용어는 안내-폴리뉴클레오타이드 및 Cas 단백질을 포함하는 CRISPR-Cas 시스템의 임의의 소스를 지칭한다. 상기 안내-폴리뉴클레오타이드 및 Cas 단백질은 별도의 소스 중에 존재할 수 있다. 상기와 같은 경우에, 본 발명에 따른 조성물은 안내-폴리뉴클레오타이드의 소스 및 Cas-단백질의 소스를 포함하는 CRISPR-Cas 시스템을 포함한다. 임의의 소스는 상기 안내-폴리뉴클레오타이드 및 Cas 단백질이 CRISPR-Cas 시스템내에서 기능할 수 있는 형태로 그대로 존재할 수 있음을 의미한다. 상기 안내-폴리뉴클레오타이드 및/또는 Cas-단백질은 그의 활성 형태로 제공될 수 있으며 예를 들어 불활성 형태로부터 또는 또 다른 개체로부터 제공될 수 있다. 상기 안내-폴리뉴클레오타이드는 예를 들어 또 다른 폴리뉴클레오타이드상에 존재하거나 또는 실제 안내-폴리뉴클레오타이드를 제공하도록 전사된 폴리뉴클레오타이드에 의해 암호화될 수 있다. 상기 Cas 단백질은 실제 Cas 단백질을 제공하도록 전사되고/되거나 번역된 폴리뉴클레오타이드(예를 들어 DNA 또는 mRNA)에 의해 암호화될 수 있다. 암호화 폴리뉴클레오타이드는 본 명세서에 정의된 바와 같은 핵산 구조물 중에 및/또는 본 명세서에 정의된 바와 같은 벡터 중에 존재할 수 있다. 상기와 같은 핵산 구조물 및 벡터는 본 명세서에서 본 발명에 따른 핵산 구조물 및 본 발명에 따른 벡터로서 지칭된다.The term "source" in the context of all embodiments of the invention refers to any source of the CRISPR-Cas system comprising a guide-polynucleotide and a Cas protein. The guide-polynucleotide and Cas protein may be in separate sources. In such a case, the composition according to the present invention comprises a CRISPR-Cas system comprising a source of intra-polynucleotides and a source of Cas-proteins. Any source means that the guide-polynucleotide and Cas protein can exist as such in a form capable of functioning in the CRISPR-Cas system. The guide-polynucleotide and/or Cas-protein may be provided in its active form, for example from an inactive form or from another individual. Said guide-polynucleotide may for example be present on another polynucleotide or encoded by a transcribed polynucleotide to provide the actual guide-polynucleotide. The Cas protein may be encoded by a polynucleotide (eg DNA or mRNA) transcribed and/or translated to provide the actual Cas protein. The encoding polynucleotide may be present in a nucleic acid construct as defined herein and/or in a vector as defined herein. Such nucleic acid constructs and vectors are referred to herein as nucleic acid constructs according to the present invention and vectors according to the present invention.

바람직하게, 본 발명에 따른 조성물에서, 상기 Cas 단백질 또는 관련 기능의 뉴클레아제는 폴리뉴클레오타이드에 의해 암화되고/되거나 상기 안내-폴리뉴클레오타이드는 폴리뉴클레오타이드에 의해 암호화되거나 상기 상에 존재한다.Preferably, in the composition according to the present invention, said Cas protein or nuclease of related function is encoded by a polynucleotide and/or said guide-polynucleotide is encoded by or is present on said polynucleotide.

바람직하게, 본 발명에 따른 조성물에서, 상기 Cas 단백질 또는 관련 기능의 뉴클레아제는 폴리뉴클레오타이드에 의해 암호화되고/되거나 상기 안내-폴리뉴클레오타이드는 또 다른 폴리뉴클레오타이드에 의해 암호화되거나 상기 상에 존재하고 상기 폴리뉴클레오타이드 또는 폴리뉴클레오타이드들은 벡터 중에 포함된다.Preferably, in the composition according to the present invention, said Cas protein or nuclease of related function is encoded by a polynucleotide and/or said guide-polynucleotide is encoded by or present on another polynucleotide and is present on said polynucleotide Nucleotides or polynucleotides are included in the vector.

바람직하게, 본 발명에 따른 조성물에서, 상기 안내-폴리뉴클레오타이드는 실제 안내-폴리뉴클레오타이드를 제공하도록 전사된 폴리뉴클레오타이드에 의해 암호화된다. 상응하게, 하나의 실시태양에서, 본 발명에 따른 조성물에서, 바람직하게 상기 안내-폴리뉴클레오타이드는 상기 안내-폴리뉴클레오타이드를 암호화하는 폴리뉴클레오타이드의 형태로 존재하고 상기 안내-폴리뉴클레오타이드는 숙주 세포에서 상기 폴리뉴클레오타이드의 전사시에 수득된다.Preferably, in the composition according to the invention, the guide-polynucleotide is encoded by the transcribed polynucleotide to give the actual guide-polynucleotide. Correspondingly, in one embodiment, in the composition according to the invention, preferably said guide-polynucleotide is in the form of a polynucleotide encoding said guide-polynucleotide and said guide-polynucleotide is said polynucleotide in a host cell. obtained upon transcription of nucleotides.

바람직하게, 본 발명에 따른 조성물에서, 상기 Cas 단백질은 폴리뉴클레오타이드에 의해 암호화되고 상기 안내-폴리뉴클레오타이드는 또 다른 폴리뉴클레오타이드에 의해 암호화되거나 상기 상에 존재하고 상기 폴리뉴클레오타이드는 하나의 벡터 중에 포함된다.Preferably, in the composition according to the present invention, the Cas protein is encoded by a polynucleotide and the guide-polynucleotide is encoded by or present on another polynucleotide and the polynucleotide is comprised in one vector.

바람직하게, 본 발명에 따른 조성물에서, 상기 Cas 단백질은 하나의 벡터 중에 포함된 폴리뉴클레오타이드에 의해 암호화되고 상기 안내-폴리뉴클레오타이드는 또 다른 벡터 중에 포함된 또 다른 폴리뉴클레오타이드에 의해 암호화되거나 상기 상에 존재한다. 바람직하게, 상기 Cas 단백질을 암호화하는 벡터는 저 사본 벡터이고/이거나 상기 Cas 전사물의 발현을 구동하는 프로모터는 저-강도 프로모터이고 상기 안내-폴리뉴클레오타이드를 암호화하는 벡터는 고 사본 벡터이고/이거나 상기 gRNA 전사물의 발현을 구동하는 프로모터는 고-강도 프로모터이다. 이는 상기 Cas 단백질 및 안내-폴리뉴클레오타이드의 차별적인 발현을 허용하며; 상기 Cas 단백질은 예를 들어 상기 안내-폴리뉴클레오타이드보다 낮은 수준으로 발현될 수 있다. 프로모터 강도를 임의의 수단에 의해, 예를 들어 RNA 서열분석에 의해 평가할 수 있다. RNA 서열분석(RNAseq)은 상이한 조건하에서 전사체 전체에 걸친 발현을 측정하기 위한 매우 민감하고 정확한 도구이다. 상기는 RPKM 값(백만개의 맵핑된 판독당 전사물의 킬로염기당 판독)으로서 보고되는 전사물 수준에서의 유전자 발현의 정량적인 근사치를 허용한다. 스키조키트리움으로부터의 전형적인 유전자의 RPKM 값 및 상대적인 프로모터 강도를 표 1에 제공한다.Preferably, in the composition according to the invention, the Cas protein is encoded by a polynucleotide contained in one vector and the guide-polynucleotide is encoded by or present on another polynucleotide contained in another vector. do. Preferably, the vector encoding the Cas protein is a low copy vector and/or the promoter driving expression of the Cas transcript is a low-strength promoter and the vector encoding the guide-polynucleotide is a high copy vector and/or the gRNA The promoter driving expression of the transcript is a high-strength promoter. This allows for differential expression of the Cas protein and guide-polynucleotide; The Cas protein may be expressed, for example, at a lower level than the guide-polynucleotide. Promoter strength can be assessed by any means, for example by RNA sequencing. RNA sequencing (RNAseq) is a very sensitive and accurate tool for measuring expression throughout the transcriptome under different conditions. This allows for a quantitative approximation of gene expression at the transcript level, reported as RPKM values (reads per kilobase of transcript per million mapped reads). The RPKM values and relative promoter strengths of typical genes from Schizochytrium are provided in Table 1.

유전자gene 평균 RPKM 값Average RPKM value 상대적인 relative
프로모터 강도promoter strength
아르기나제 (EC 3.5.3.1)Arginase (EC 3.5.3.1) 3694.343694.34 강함strong 피루베이트 키나제 (EC 2.7.1.40)Pyruvate Kinase (EC 2.7.1.40) 2127.492127.49 강함strong 열 충격 단백질 70heat shock protein 70 1857.741857.74 강함strong 글리세르알데하이드 3-포스페이트 데하이드로게나제 (EC 1.2.1.12)Glyceraldehyde 3-phosphate dehydrogenase (EC 1.2.1.12) 1688.001688.00 강함strong 튜불린 알파 쇄tubulin alpha chain 696.85696.85 중간middle 단백질 번역 연장 인자 1A (EF-1A)Protein Translation Elongation Factor 1A (EF-1A) 682.30682.30 중간middle 아이소시트레이트 라이아제 (EC 4.1.3.1)Isocitrate lyase (EC 4.1.3.1) 510.45510.45 중간middle 아코니테이트 하이드라타제 2 (EC 4.2.1.3)Aconitate hydratase 2 (EC 4.2.1.3) 317.74317.74 중간middle 말레이트 데하이드로게나제 (EC 1.1.1.37)Maleate dehydrogenase (EC 1.1.1.37) 192.47192.47 중간middle 액포 ATP 신타제 서브유닛 D (EC 3.6.3.14)vacuole ATP synthase subunit D (EC 3.6.3.14) 163.74163.74 중간middle 아실-CoA 데하이드로게나제 (EC 1.3.99.3)Acyl-CoA dehydrogenase (EC 1.3.99.3) 94.4394.43 약함weakness 아세틸스페르미딘 데아세틸라제 (EC 3.5.1.48)Acetylspermidine deacetylase (EC 3.5.1.48) 90.6090.60 약함weakness 아세토락테이트 신타제 (EC 2.2.1.6)Acetolactate synthase (EC 2.2.1.6) 63.2163.21 약함weakness 말레일아세토아세테이트 아이소머라제 (EC 5.2.1.2)Maleylacetoacetate isomerase (EC 5.2.1.2) 39.4639.46 약함weakness 피토엔 데세튜라제 (EC 1.14.99.-) (카로틴 신타제)Phytoene deseturase (EC 1.14.99.-) (carotene synthase) 31.5631.56 약함weakness ATP-의존성 RNA 헬리카제ATP-dependent RNA helicase 28.4228.42 약함weakness 아세틸-CoA 아세틸트랜스퍼라제 (EC 2.3.1.9)Acetyl-CoA acetyltransferase (EC 2.3.1.9) 26.7426.74 약함weakness

저-강도(즉 약한) 스키조키트리움 프로모터의 예는 비제한적으로 카로티노이드 신타제의 발현을 구동하는 것들을 포함한다. 중간-강도(즉 중간) 스키조키트리움 프로모터의 예는 비제한적으로 알파-튜불린의 발현을 구동하는 것들을 포함한다. 고-강도(즉 강한) 스키조키트리움 프로모터의 예는 비제한적으로 연장 인자 1(EF-1)의 발현을 구동하는 것들을 포함한다. RPKM 값은 일반적으로 상대적인 프로모터 강도를 가리키는 것으로 간주되지만, 고유 게놈 환경에서의 프로모터는 발현 벡터 환경에서와 정확하게 동일한 강도를 갖지 않을 수도 있음은 당해 분야의 숙련가들에게 공지되어 있다. 따라서, 당해 분야의 숙련가는 표 1에 제공된 상대적인 프로모터 강도가 본 발명의 상황에서 변할 수 있음을 알 것이다.Examples of low-strength (ie weak) Schizochytrium promoters include, but are not limited to, those that drive expression of carotenoid synthase. Examples of medium-strength (ie, medium) Schizochytrium promoters include, but are not limited to, those driving expression of alpha-tubulin. Examples of high-strength (ie, strong) Schizochytrium promoters include, but are not limited to, those driving expression of elongation factor 1 (EF-1). Although RPKM values are generally considered to be indicative of relative promoter strength, it is known to those skilled in the art that a promoter in the native genomic environment may not have exactly the same strength as in the expression vector environment. Accordingly, those skilled in the art will appreciate that the relative promoter strengths provided in Table 1 may vary in the context of the present invention.

따라서 본 발명은 상기 안내-폴리뉴클레오타이드 및 Cas 단백질이 그대로 제공되거나, 또는 벡터상에서 암호화되거나 또는 벡터상에 존재할 가능성들을 제공한다. 후자의 경우에, 상기 암호화 폴리뉴클레오타이드는 각각 별도의 벡터상에 있거나 또는 둘 다 단일 벡터상에 있을 수 있다. 상응하게, 하나의 실시태양에서, 본 발명은 본 발명에 따른 조성물을 제공하며, 여기에서 본 발명에 따른 Cas 단백질을 암호화하는 폴리뉴클레오타이드, 안내-폴리뉴클레오타이드 또는 본 발명에 따른 안내-폴리뉴클레오타이드를 암호화하는 폴리뉴클레오타이드는 단일 벡터상에 존재하며, 상기 벡터는 프로모터 및 종결자 요소와 같이 상기 암호화된 생성물을 발현하는데 필요한 임의의 요소를 추가로 포함할 수 있다. 상기와 같은 단일(올인원) 벡터는 CRISPR-Cas 시스템에 필요한 모든 성분이 함께 존재한다는 장점을 가지며; 또한, 임의로 공여체 폴리뉴클레오타이드와 함께, 단일의 형질전환 사건이면 상기 성분들을 숙주 세포내로 도입시키기에 충분하다.Accordingly, the present invention provides possibilities in which the guide-polynucleotide and Cas protein are provided as such, or encoded on a vector or present on a vector. In the latter case, the encoding polynucleotides may each be on a separate vector or both may be on a single vector. Correspondingly, in one embodiment, the invention provides a composition according to the invention, wherein a polynucleotide encoding a Cas protein according to the invention, a guide-polynucleotide or a guide-polynucleotide encoding the guide-polynucleotide according to the invention The polynucleotides described herein are present on a single vector, which vector may further comprise any elements necessary to express the encoded product, such as promoter and terminator elements. Such a single (all-in-one) vector has the advantage that all components necessary for the CRISPR-Cas system are present together; Also, a single transformation event, optionally with a donor polynucleotide, is sufficient to introduce the components into the host cell.

벡터vector

본 발명의 모든 실시태양들의 상황에서, 벡터는 편의상 재조합 DNA 과정이 가해질 수 있고 본 발명에 따른 폴리뉴클레오타이드의 발현을 매개할 수 있는 임의의 벡터(예를 들어 플라스미드 또는 바이러스)일 수 있다. 상기 벡터의 선택은 전형적으로 상기 벡터와 상기 벡터가 도입되는 숙주 세포와의 적합성에 따라 변할 것이다. 바람직한 벡터는 본 명세서의 실시예들에 사용되는 벡터이다. 벡터는 선형 폴리뉴클레오타이드 또는 선형 또는 폐쇄된 환상 플라스미드일 수 있다. 벡터는 통합 또는 자율 복제 벡터, 즉 염색체 또는 염색체-외 개체로서 존재하고 그의 복제가 염색체 복제에 의존적이거나 독립적인 벡터, 예를 들어 플라스미드, 염색체외 요소, 미니-염색체, 또는 인공 염색체일 수 있다.In the context of all embodiments of the present invention, the vector may conveniently be any vector (eg plasmid or virus) capable of mediating the expression of a polynucleotide according to the present invention to which it has been subjected to recombinant DNA processing. The choice of vector will typically depend on the compatibility of the vector with the host cell into which it is introduced. A preferred vector is the vector used in the embodiments herein. A vector may be a linear polynucleotide or a linear or closed circular plasmid. A vector may be an integrative or autonomously replicating vector, ie a vector that exists as a chromosomal or extrachromosomal entity and whose replication is dependent or independent of chromosomal replication, eg, a plasmid, an extrachromosomal element, a mini-chromosome, or an artificial chromosome.

바람직하게 벡터는, 숙주 세포내로 도입될 때 게놈내에 통합되고 상기가 통합된 염색체(들)와 함께 복제되는 것일 수 있다. 통합 벡터는 숙주 세포의 염색체 중에서 무작위로 또는 소정의 표적 유전자좌에서 통합될 수 있다. 바람직한 통합 벡터는 DNA 단편을 포함하며, 상기 단편은 상기 벡터의 소정의 유전자좌에의 통합을 표적화하기 위해 숙주 세포의 게놈 중의 상기 소정의 표적 유전자좌 중의 DNA 서열에 상동성이다. 표적화된 통합을 촉진하기 위해서, 벡터를 바람직하게는 상기 세포의 형질전환에 앞서 선형화시킨다. 선형화를 바람직하게는 상기 벡터의 적어도 하나의 단부, 바람직하게는 어느 한 단부가 상기 표적 유전자좌에 상동성인 서열에 의해 측면 인접되도록 수행한다. 상기 표적 유전자좌에 측면 인접한 상동성 서열의 길이는 바람직하게 적어도 30 bp, 바람직하게 적어도 50 bp, 바람직하게 적어도 0.1 kb, 훨씬 바람직하게 적어도 0.2 kb, 보다 바람직하게 적어도 0.5 kb, 훨씬 더 바람직하게 적어도 1 kb, 가장 바람직하게 적어도 2 kb이다. 바람직하게, 상기 숙주 세포의 게놈내로의 표적화된 통합, 즉 소정의 표적 유전자좌 중의 통합의 효율은 상기 숙주 세포의 증대된 상동성 재조합 능력에 의해 증가된다.Preferably, the vector is one that is integrated into the genome when introduced into a host cell and is replicated together with the chromosome(s) into which it is integrated. The integration vector may integrate randomly in the chromosome of the host cell or at a given target locus. Preferred integration vectors include DNA fragments, said fragments homologous to a DNA sequence in said predetermined target locus in the genome of a host cell to target integration of said vector into said predetermined locus. To facilitate targeted integration, the vector is preferably linearized prior to transformation of said cells. Linearization is preferably carried out such that at least one end, preferably either end, of the vector is flanked by sequences homologous to the target locus. The length of the homologous sequence flanking the target locus is preferably at least 30 bp, preferably at least 50 bp, preferably at least 0.1 kb, even more preferably at least 0.2 kb, more preferably at least 0.5 kb, even more preferably at least 1 bp. kb, most preferably at least 2 kb. Preferably, the efficiency of targeted integration into the genome of said host cell, ie integration in a given target locus, is increased by the enhanced homologous recombination capacity of said host cell.

상기 벡터 중 상동성 측면 인접 DNA 서열(표적 유전자좌에 상동성이다)은 발현된 유전자좌로부터 유래할 수 있으며, 이는 상기 서열이 숙주 세포에서 발현이 가능한 유전자로부터 유래됨을 의미한다. 측면 인접 DNA 서열을, 배양 배지에 적합한 선택성 인자가 보충될 때 형질전환된 세포가 성장하도록 선택성 마커 유전자와 결합시킬 수 있다. 상기 측면 인접 DNA 서열을 당해 분야의 숙련가에게 공지된 임의의 수단에 의해 설계할 수 있으며; 하나의 바람직한 설계는 포화된 및/또는 다중불포화된 지방산 신타제(PUFA 신타제)를 암호화하는 유전자에의 상동성 재조합을 지시하고, 따라서 이들 신타제의 돌연변이 또는 붕괴는 포화된 및/또는 다중불포화된 지방산에 대한 영양요구성을 생성시킨다. 포화된 및/또는 다중불포화된 지방산에 영양요구성인 세포는 성장을 위한 보충제로서 포화된 및/또는 다중불포화된 지방산을 필요로 한다. 또 다른 바람직한 설계는 우성 선택성 마커 유전자의 형질전환체가 적합한 우성 선택성 인자의 존재하에서 성장할 수 있도록 상기 선택성 마커 유전자의 발현을 가능하게 한다.The homologous flanking DNA sequence (which is homologous to the target locus) in the vector may be derived from an expressed locus, meaning that the sequence is derived from a gene capable of expression in a host cell. A flanking DNA sequence can be ligated with a selectable marker gene so that transformed cells grow when the culture medium is supplemented with a suitable selectable factor. The flanking DNA sequences can be designed by any means known to those skilled in the art; One preferred design directs homologous recombination into genes encoding saturated and/or polyunsaturated fatty acid synthase (PUFA synthase), so that mutation or disruption of these synthases results in saturated and/or polyunsaturated fatty acid synthase (PUFA synthase). Creates an auxotroph for fatty acids. Cells that are auxotrophic for saturated and/or polyunsaturated fatty acids require the saturated and/or polyunsaturated fatty acids as a supplement for growth. Another preferred design allows expression of the selectable marker gene so that transformants of the dominant selectable marker gene can be grown in the presence of a suitable dominant selectable factor.

본 발명에 따른 폴리뉴클레오타이드의 하나 초과의 사본을 미생물 숙주 세포내에 삽입하여 상기 폴리뉴클레오타이드에 의해 암호화된 생성물의 생산을 매개할 수 있다. 이를, 바람직하게는 상기 폴리뉴클레오타이드의 다중 사본을 상기 숙주 세포의 게놈내에 통합시키거나 또는 상기 폴리뉴클레오타이드의 통합을 고도로 발현된 유전자좌에 작동 가능한 배열로 표적화함으로써 수행할 수 있다. 한편으로, 다중 사본의 통합을, 본 발명에 따른 폴리뉴클레오타이드를 갖는 증폭 가능한 선택성 마커 유전자를 포함시킴으로써 성취할 수 있으며, 따라서 상기 선택성 마커 유전자의 증폭된 사본(및 이에 의해 핵산 서열의 추가적인 사본)을 함유하는 세포를, 적합한 선택성 인자의 존재하에서 상기 세포를 배양함으로써 선택할 수 있다. 본 발명에 따른 폴리뉴클레오타이드의 사본수를 훨씬 더 증가시키기 위해서, WO98/46772에 기재된 바와 같은 유전자 전환의 기법을 사용할 수 있다.More than one copy of a polynucleotide according to the invention can be inserted into a microbial host cell to mediate production of a product encoded by said polynucleotide. This can preferably be done by integrating multiple copies of the polynucleotide into the genome of the host cell or by targeting the integration of the polynucleotide in an operable arrangement to a highly expressed locus. On the one hand, integration of multiple copies can be achieved by including an amplifiable selectable marker gene with a polynucleotide according to the present invention, thus generating an amplified copy of the selectable marker gene (and thereby an additional copy of the nucleic acid sequence). The containing cells can be selected by culturing the cells in the presence of a suitable selectivity factor. In order to increase the copy number of the polynucleotide according to the invention even more, the technique of gene conversion as described in WO98/46772 can be used.

본 발명에 따른 Cas 단백질 및/또는 본 발명에 따른 안내-폴리뉴클레오타이드를 암호화하는 본 발명에 따른 폴리뉴클레오타이드를 숙주 세포의 게놈내에 통합시킬 때, 예를 들어 목적하는 게놈 편집이 발생했을 때, 상기 폴리뉴클레오타이드를 상기 게놈으로부터 절제하는 것이 바람직할 수 있다. 폴리뉴클레오타이드의 절제를 당해 분야의 숙련가에게 공지된 임의의 수단에 의해 수행할 수 있으며; 하나의 바람직한 수단은 영양요구성이 치유된 세포가 상기 영양요구체에 의해 요구된 영양소가 생략된 배양 배지에서 생육에 의해 대신 선택될 수 있도록 영양요구성을 유도한 유전자 돌연변이 또는 붕괴를 복구하는 뉴클레오타이드에 의한 2차 형질전환에 의한다. 또 다른 절제 수단은 본 발명에 따른 CRISPR-Cas 시스템의 사용일 수 있다.Upon integration of a Cas protein according to the invention and/or a polynucleotide according to the invention encoding a guide-polynucleotide according to the invention into the genome of a host cell, for example when the desired genome editing has occurred, said polynucleotide It may be desirable to excise nucleotides from the genome. Excision of polynucleotides can be performed by any means known to those skilled in the art; One preferred means is a nucleotide that repairs the auxotrophic-induced gene mutation or disruption so that auxotrophic cured cells can be selected instead by growth in culture medium omitting the nutrients required by the auxotroph. by secondary transformation. Another means of ablation may be the use of the CRISPR-Cas system according to the present invention.

본 발명에 따른 벡터는 숙주 세포내로 도입되는 본 발명에 따른 폴리뉴클레오타이드를 함께 함유하는 단일 벡터 또는 플라스미드 또는 2개 이상의 벡터 또는 플라스미드를 포함하는 벡터 시스템일 수 있다.The vector according to the present invention may be a single vector or plasmid containing the polynucleotide according to the present invention together to be introduced into a host cell, or a vector system comprising two or more vectors or plasmids.

본 발명에 따른 벡터는, 형질전환된 세포의 용이한 선택을 허용하는 하나 이상의 선택성 마커를 함유할 수 있다. 하나의 실시태양에서, 본 발명에 따른 조성물에서 하나 이상의 또는 모든 벡터는 선택성 마커를 포함하며, 바람직하게 각각의 벡터는 별개의 선택성 마커를 포함한다. 선택성 마커는 생성물이 살생물제 또는 바이러스 내성, 중금속에 대한 내성, 영양요구체에 대한 원영양요구성 등을 제공하는 유전자이다. 상기 선택성 마커는 발현 카세트로서 상기 벡터상에서 세포내에 도입되거나 또는 별도의 벡터상에 도입될 수 있다.The vector according to the invention may contain one or more selectable markers which allow for easy selection of transformed cells. In one embodiment, one or more or all vectors in the composition according to the invention comprise a selectable marker, preferably each vector comprises a separate selectable marker. Selectable markers are genes whose products confer biocide or virus resistance, resistance to heavy metals, prototrophs for auxotrophs, and the like. The selectable marker may be introduced into cells on the vector as an expression cassette or may be introduced on a separate vector.

라비린툴로마이세트 세포에 사용하기 위한 선택성 마커는 비제한적으로 nptII(네오마이신 포스포트랜스퍼라제 II, 파로모마이신 내성을 부여한다), ALS(아세토락테이트 신타제, 설포메투론메틸 내성을 부여한다), bsd(블라스티시딘-S-데아미나제, 블라스티시딘 내성을 부여한다), 및 Sh ble(플레오마이신 결합, 제오신 내성을 부여한다)를 포함하는 그룹 중에서 선택될 수 있다.Selectable markers for use in labyrinthulomycet cells include, but are not limited to, nptII (neomycin phosphotransferase II, conferring paromomycin resistance), ALS (acetolactate synthase, sulfometuronmethyl resistance confer), bsd (blasticidin-S-deaminase, confer blasticidin resistance), and Sh ble (pleomycin binding, confer zeocin resistance) can

한편으로, 특이성 선택 마커, 예를 들어 앞서 논의된 균주로서 포화되거나 다중불포화된 지방산 신타제 유전자의 불활성화 돌연변이를 갖는 상응하는 돌연변이 숙주 세포를 요구하는 영양요구성 마커를 사용할 수 있다. 바람직한 실시태양에서, 상기 선택 마커는 선택 마커 유전자가 없는 폴리펩타이드를 생성시킬 수 있는 형질전환된 숙주 세포를 수득하기 위해서 발현 구조물의 도입 후에 형질전환된 숙주 세포로부터 결실된다.Alternatively, one can use a specific selectable marker, for example an auxotrophic marker that requires a corresponding mutant host cell bearing an inactivating mutation of the saturated or polyunsaturated fatty acid synthase gene as the strains discussed above. In a preferred embodiment, the selectable marker is deleted from the transformed host cell after introduction of the expression construct to obtain a transformed host cell capable of producing a polypeptide lacking the selectable marker gene.

본 발명에 따른 벡터를 구성하기 위해서 상술한 요소들을 결찰시키는데 사용되는 과정은 당해 분야의 숙련가에게 주지되어 있다(예를 들어 문헌[Sambrook & Russell, Molecular Cloning: A Laboratory Manual, 3rd Ed., CSHL Press, Cold Spring Harbor, NY, 2001]; 및 문헌[Ausubel et al., Current Protocols in Molecular Biology, Wiley InterScience, NY, 1995]을 참조하시오).The procedure used to ligate the elements described above to construct a vector according to the invention is well known to those skilled in the art (see, e.g., Sambrook & Russell, Molecular Cloning: A Laboratory Manual, 3rd Ed. , CSHL Press). , Cold Spring Harbor, NY, 2001; and Ausubel et al. , Current Protocols in Molecular Biology , Wiley InterScience, NY, 1995).

본 발명의 모든 실시태양들의 상황에서 Cas 단백질은 본 발명의 목적에 적합한 임의의 Cas 단백질을 지칭한다. Cas 단백질은 효소 활성을 포함하거나 효소 활성을 포함하지 않을 수도 있다. Cas 단백질의 비제한적인 예는 Cas1, Cas1B, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9(또한 Csn1 및 Csx12로서 공지됨), Cas10, Csy1, Csy2, Csy3, Cse1, Cse2, Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csx1, Csx1S, Csf1, Csf2, Csf3, Csf4, 그의 상동체 또는 그의 변형된 버전을 포함한다. 이들 Cas 단백질은 당해 분야의 숙련가에게 공지되어 있으며; 예를 들어 스트렙토코커스 피오게네스 Cas9 단백질의 아미노산 서열은 스위스프롯(SwissProt) 데이터베이스에서 수납번호 Q99ZW2하에 발견될 수 있다. 바람직하게, 본 발명에 따른 변형되지 않은 Cas 단백질, 예를 들어 Cas9는 DNA 절단 활성을 갖는다. 바람직하게, 상응하는 Cas 단백질은 Cas9이며, 스트렙토코커스 피오게네스 또는 스트렙토코커스 뉴모니아에로부터의 Cas9일 수 있다. 바람직하게, 본 발명에 따른 Cas 단백질은 표적-폴리뉴클레오타이드의 위치에서, 예를 들어 상기 표적-폴리뉴클레오타이드내 및/또는 상기 표적-폴리뉴클레오타이드의 역보체내에서 하나 또는 2개의 폴리뉴클레오타이드 가닥 모두의 절단을 지시한다. 상기 표적-폴리뉴클레오타이드의 위치는 본 명세서에서 표적-폴리뉴클레오타이드의 첫 번째 또는 최종 뉴클레오타이드로부터 약 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 50, 100, 200, 500, 또는 그 이상의 뉴클레오타이드 이내; 보다 바람직하게는 표적-폴리뉴클레오타이드의 첫 번째 또는 최종 뉴클레오타이드로부터 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 50, 100, 200, 500, 또는 그 이상의 뉴클레오타이드 이내; 훨씬 더 바람직하게는 표적-폴리뉴클레오타이드의 첫 번째 또는 최종 뉴클레오타이드로부터 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 50 뉴클레오타이드 이내로서 한정된다. 상응하게, 본 발명에 따른 Cas 단백질은 바람직하게는 표적-폴리뉴클레오타이드의 첫 번째 또는 최종 뉴클레오타이드로부터 약 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 50, 100, 200, 500, 또는 그 이상의 뉴클레오타이드 이내; 보다 바람직하게는 표적-폴리뉴클레오타이드의 첫 번째 또는 최종 뉴클레오타이드로부터 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 50, 100, 200, 500, 또는 그 이상의 뉴클레오타이드 이내; 훨씬 더 바람직하게는 표적-폴리뉴클레오타이드의 첫 번째 또는 최종 뉴클레오타이드로부터 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 50 뉴클레오타이드 이내에서 하나 또는 2개의 폴리뉴클레오타이드 가닥 모두의 절단을 지시한다. 전형적으로, 본 발명에 따른 표적-폴리뉴클레오타이드는 PAM 서열(본 명세서의 다른 어딘가에서 정의됨)과 결합되며 상기 PAM 서열은 바람직하게는 표적-서열의 바로 하류(3')에 있고; 상기 CRISPR-Cas 복합체의 형성은 전형적으로 상기 PAM 서열의 3 염기쌍 상류(5')에서 하나 또는 2개의 폴리뉴클레오타이드 가닥 모두의 절단을 생성시킨다.Cas protein in the context of all embodiments of the present invention refers to any Cas protein suitable for the purposes of the present invention. A Cas protein may or may not contain enzymatic activity. Non-limiting examples of Cas proteins include Cas1, Cas1B, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csn1 and Csx12), Cas10, Csy1, Csy2, Csy3, Cse1, Cse2, Csc1 , Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csf3, Csx10, Csf , Csf2, Csf3, Csf4, homologues thereof or modified versions thereof. These Cas proteins are known to those skilled in the art; For example, the amino acid sequence of the Streptococcus pyogenes Cas9 protein can be found under accession number Q99ZW2 in the SwissProt database. Preferably, the unmodified Cas protein according to the invention, for example Cas9, has DNA cleavage activity. Preferably, the corresponding Cas protein is Cas9, which may be Cas9 from Streptococcus pyogenes or Streptococcus pneumoniae. Preferably, the Cas protein according to the invention comprises cleavage of one or both polynucleotide strands at the position of the target-polynucleotide, for example in said target-polynucleotide and/or in the reverse complement of said target-polynucleotide. instruct The position of the target-polynucleotide herein is about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 50 from the first or last nucleotide of the target-polynucleotide. , within 100, 200, 500, or more nucleotides; more preferably 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 50, 100, 200, 500, or within more nucleotides; Even more preferably within 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 50 nucleotides from the first or last nucleotide of the target-polynucleotide. Correspondingly, the Cas protein according to the invention is preferably about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25 from the first or last nucleotide of the target-polynucleotide. , within 50, 100, 200, 500, or more nucleotides; more preferably 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 50, 100, 200, 500, or within more nucleotides; Even more preferably one or two within 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 50 nucleotides from the first or last nucleotide of the target-polynucleotide Directs cleavage of all polynucleotide strands. Typically, a target-polynucleotide according to the invention is associated with a PAM sequence (as defined elsewhere herein), said PAM sequence preferably immediately downstream (3') of the target-sequence; Formation of the CRISPR-Cas complex typically results in cleavage of one or both polynucleotide strands 3 base pairs upstream (5') of the PAM sequence.

바람직하게, 본 발명에 따른 조성물 중의 Cas 단백질은 상기 표적-폴리뉴클레오타이드의 위치에서 2개의 폴리뉴클레오타이드 가닥 모두의 절단을 지시하는 활성을 갖는다. Cas 뉴클레아제 활성은 전형적으로 2개의 분리된 촉매 도메인, 즉 RuvC 및 HNH에 의해 수행된다. 각각의 도메인은 하나의 폴리뉴클레오타이드 가닥을 절단하고 각각의 도메인은 단일 점 돌연변이에 의해 불활성화될 수 있다.Preferably, the Cas protein in the composition according to the present invention has an activity to direct the cleavage of both polynucleotide strands at the position of said target-polynucleotide. Cas nuclease activity is typically carried out by two separate catalytic domains, RuvC and HNH. Each domain cleaves one polynucleotide strand and each domain can be inactivated by a single point mutation.

따라서 본 발명에 따른 Cas 단백질은 편의상, 돌연변이된 Cas 단백질이 변경된 뉴클레아제 활성을 갖고 표적-폴리뉴클레오타이드의 하나 또는 2개의 가닥 모두를 절단하는 능력이 결여되도록 상응하는 야생형 Cas 단백질에 대해 돌연변이될 수 있다. 예를 들어, 스트렙토코커스 피오게네스로부터의 Cas9의 RuvC I 촉매 도메인에서 아스파테이트에서 알라닌으로의 치환(D10A)은 Cas9를 2개의 가닥을 모두 절단하는 뉴클레아제에서 니카제(상기는 본 명세서에서 표적-폴리뉴클레오타이드의 단일 가닥을 절단하는 Cas 단백질로서 정의된다)로 전환시킨다. Cas9를 니카제로 만드는 돌연변이의 다른 예는 비제한적으로 H840A, N854A, 및 N863A를 포함한다. 본 발명의 상황에서, 니카제 활성을 갖는 Cas 단백질을 상동성 재조합을 통한 게놈 편집, 바람직하게는 문헌[Ran et al., 2013]에 따른 이중 닉킹(nicking) 기법에 사용할 수 있다. 상응하게, 본 발명에 따른 바람직한 Cas 단백질은 상기 단백질이 상응하는 야생형 Cas 단백질에 비해 변경된 뉴클레아제 활성을 갖도록, 바람직하게는 상기 표적-서열의 위치에서 단일의 폴리뉴클레오타이드 가닥의 절단을 지시하는 활성을 갖는 적어도 하나의 돌연변이를 포함한다. 상기와 같은 소위 니카제 돌연변이체를 편의상 듀플렉스 셋업에, 즉 RuvC 돌연변이된 Cas 단백질 니카제 돌연변이체 및 NHN이 돌연변이된 Cas 단백질 니카제 돌연변이체를 포함하는 본 발명에 따른 조성물에 사용할 수 있으며, 따라서 상기 하나의 Cas 단백질 돌연변이체는 상기 폴리뉴클레오타이드 표적의 하나의 가닥을 닉킹하고 다른 Cas 단백질 돌연변이체는 상기 폴리뉴클레오타이드 표적의 다른 가닥을 닉킹한다. 상기 사용되는 2개의 안내-폴리뉴클레오타이드에 따라, 2개의 상이한 CRISPR-Cas 복합체는 상기 폴리뉴클레오타이드 표적 중에 2개의 단일-가닥 닉을 유효하게 생성시킬 것이며; 이들 닉은 5, 10, 20, 30 이상 떨어진 다수의 뉴클레오타이드들일 수 있다. 상기와 같은 이중 닉킹 방법은 비-상동성 말단 부착(NEJH)의 특이성을 크게 증대시킨다. 이중 닉킹에 대한 배경 정보를 예를 들어 문헌[Ran et al., 2013]에서 찾을 수 있다.Accordingly, the Cas protein according to the present invention may be conveniently mutated to the corresponding wild-type Cas protein such that the mutated Cas protein has altered nuclease activity and lacks the ability to cleave one or both strands of the target-polynucleotide. have. For example, aspartate to alanine substitution (D10A) in the RuvC I catalytic domain of Cas9 from Streptococcus pyogenes (D10A) results in a nickase in a nuclease that cleaves both strands of Cas9 (see It is defined as a Cas protein that cleaves a single strand of a target-polynucleotide). Other examples of mutations that render Cas9 nickase include, but are not limited to, H840A, N854A, and N863A. In the context of the present invention, a Cas protein with nickase activity can be used for genome editing via homologous recombination, preferably a double nicking technique according to Ran et al., 2013. Correspondingly, a preferred Cas protein according to the invention has an activity that directs cleavage of a single polynucleotide strand, preferably at the position of the target-sequence, such that said protein has an altered nuclease activity compared to the corresponding wild-type Cas protein. at least one mutation with Such so-called nickase mutants can be conveniently used in a duplex setup, i.e., in the composition according to the invention comprising a RuvC mutated Cas protein nickase mutant and a NHN mutated Cas protein nickase mutant, and thus One Cas protein mutant nicks one strand of the polynucleotide target and another Cas protein mutant nicks the other strand of the polynucleotide target. Depending on the two guide-polynucleotides used, two different CRISPR-Cas complexes will effectively generate two single-stranded nicks in the polynucleotide target; These nicks can be multiple nucleotides 5, 10, 20, 30 or more apart. This double nicking method greatly enhances the specificity of non-homologous end attachment (NEJH). Background information on double nicking can be found, for example, in Ran et al., 2013.

본 발명에 따른 Cas 단백질은 Cas9의 2개 이상의 돌연변이된 촉매 도메인, 예를 들어 RuvC I, RuvC II 및/또는 RuvC III를 포함하여 모든 DNA 절단 활성이 실질적으로 없는 돌연변이된 Cas9를 생성시킬 수 있다 일부 실시태양에서, D10A 돌연변이를 H840A, N854A, 또는 N863A 돌연변이 중 하나 이상과 병용하여 모든 DNA 절단 활성이 실질적으로 없는 Cas9 효소를 생성시킨다. 바람직하게, Cas 단백질은 상기 돌연변이된 효소의 DNA 절단 활성이 돌연변이되지 않은 형태에 대해서 약 25%, 10%, 5%, 1%, 0.1%, 0.01% 미만, 또는 그 이하일 때 모든 DNA 절단 활성이 실질적으로 없는 것으로 간주된다. 실질적으로 모든 효소 활성이 없는 Cas 단백질을, 상기 CRISPR-Cas 복합체가 상기 표적-폴리뉴클레오타이드로부터의 전사를 방해할 것이기 때문에, 편의상 유전자 침묵 또는 발현의 하향 조절에 사용할 수 있다. 다른 돌연변이들이 유용할 수 있으며; 상기 Cas9 또는 다른 Cas 단백질이 스트렙토코커스 피오게네스 이외의 종으로부터의 것인 경우, 상응하는 아미노산에서 돌연변이를 수행하여 유사한 효과를 성취할 수 있고; 당해 분야의 숙련가는 이들 상응하는 아미노산을 어떻게 식별하는 지를 안다.A Cas protein according to the present invention can generate a mutated Cas9 that is substantially free of any DNA cleavage activity comprising two or more mutated catalytic domains of Cas9, for example RuvC I, RuvC II and/or RuvC III Some In an embodiment, the D10A mutation is combined with one or more of the H840A, N854A, or N863A mutation to generate a Cas9 enzyme substantially free of all DNA cleavage activity. Preferably, the Cas protein has all DNA cleavage activity when the DNA cleavage activity of the mutated enzyme is less than about 25%, 10%, 5%, 1%, 0.1%, 0.01%, or less relative to the unmutated form. considered practically absent. Cas proteins lacking substantially all enzymatic activity can be conveniently used for gene silencing or down-regulation of expression, since the CRISPR-Cas complex will interfere with transcription from the target-polynucleotide. Other mutations may be useful; If the Cas9 or other Cas protein is from a species other than Streptococcus pyogenes, a similar effect can be achieved by performing mutations in the corresponding amino acids; A person skilled in the art knows how to identify these corresponding amino acids.

본 발명에 따른 Cas 단백질은 융합 단백질일 수 있으며 적어도 하나의 이종 기능성 도메인을 포함하고, 상기와 같은 도메인은 바람직하게는 문헌[Aggarwal, A. K.; Wah, D. A.; Hirsch, J. A.; Dorner, L. F.; Schildkraut, I. (1997). "Structure of the multimodular endonuclease FokI bound to DNA". Nature 388 (6637): 97-100]에 기재된 바와 같은 FokI 활성을 포함하는 도메인이다. 상기 효소 FokI은 플라보박테리움 오케아노코이테스(Flavobacterium okeanokoites)에서 자연적으로 발견되며 N-말단 DNA-결합 도메인 및 C-말단의 비-특이성 DNA 절단 도메인으로 이루어지는 세균성 IIS형 제한 엔도뉴클레아제이다(Durai et al., 2005). 상기 FokI 단백질이 5′-GGATG-3′:3′-CATCC-5′ 인식 부위에서 그의 DNA-결합 도메인을 통해 이중 가닥 DNA에 결합할 때, 상기 DNA 절단 도메인은 활성화되어 추가의 서열 특이성 없이 상기 인식 부위의 하류에서 첫 번째 가닥 9 뉴클레오타이드 및 상기 부위의 가장 가까운 뉴클레오타이드의 상류에서 두 번째 가닥 13 뉴클레오타이드를 절단한다(Wah et al., 1998). Cas9-FokI 융합은 특히 문헌[Guilinger et al., 2014]; 및 문헌[Tsai et al., 2014]에서 기재되었다.The Cas protein according to the present invention may be a fusion protein and comprises at least one heterologous functional domain, such domains are preferably described in Aggarwal, AK; Wah, DA; Hirsch, J. A.; Donner, L. F.; Schildkraut, I. (1997). "Structure of the multimodular endonuclease FokI bound to DNA". Nature 388 (6637): a domain containing 97-100] Fok I activity as described. The enzyme Fok I is a bacterial type IIS restriction endonuclease naturally found in Flavobacterium okeanokoites and consisting of an N-terminal DNA-binding domain and a C-terminal non-specific DNA cleavage domain. (Durai et al., 2005). When the Fok I protein binds to double-stranded DNA via its DNA-binding domain at the 5'-GGATG-3':3'-CATCC-5' recognition site, the DNA cleavage domain is activated without further sequence specificity. Cleave 9 nucleotides of the first strand downstream of the recognition site and 13 nucleotides of the second strand upstream of the nearest nucleotide of the site (Wah et al., 1998). Cas9- Fok I fusions are particularly described in Guilinger et al ., 2014; and Tsai et al ., 2014.

본 발명에 따른 Cas 융합 단백질은 상기 Cas 단백질 외에 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 또는 그 이상의 도메인을 포함할 수 있다. Cas 단백질에 융합될 수 있는 단백질 도메인의 예는 비제한적으로 에피토프 태그, 리포터 유전자 서열, 및 하기의 활성 중 하나 이상을 갖는 단백질 도메인을 포함한다: 메틸라제 활성, 데미틸라제 활성, 전사 활성화 활성, 전사 억제 활성, 전사 방출 인자 활성, 역사적인 변형 활성, RNA 절단 활성 및 핵산 결합 활성. 에피토프 태그의 비제한적인 예는 히스티딘(His) 태그, V5 태그, FLAG 태그, 인플루엔자 헤마글루티닌(HA) 태그, Myc 태그, VSV-G 태그, 및 티오레독신(Trx) 태그를 포함한다. 리포터 유전자의 예는 비제한적으로 글루타치온-S-트랜스퍼라제(GST), 양고추냉이 퍼옥시다제(HRP), 클로람페니콜 아세틸트랜스퍼라제(CAT) 베타-갈락토시다제, 베타-글루쿠로니다제, 루시페라제, 녹색 형광 단백질(GFP), HcRed, DsRed, 시안 형광 단백질(CFP), 황색 형광 단백질(YFP), 및 청색 형광 단백질(BFP)을 포함하는 자가형광 단백질을 포함한다. Cas 단백질을, DNA 분자에 결합하거나 다른 세포 분자에 결합하는 단백질 또는 단백질의 단편을 암호화하는 유전자 서열, 예를 들어 비제한적으로 말토스 결합 단백질(MBP), S-태그, Lex A DNA 결합 도메인(DBD) 융합물, GAL4 DNA 결합 도메인 융합물, 및 헤르페스 단순 바이러스(HSV) BP 16 단백질 융합물에 융합시킬 수 있다. CRISPR 효소를 포함하는 융합 단백질의 부분을 형성할 수 있는 추가적인 도메인은 US20110059502에 기재되어 있다. 태그된 Cas 단백질을 사용하여 표적-폴리뉴클레오타이드의 위치를 확인할 수 있다. 본 발명에 따른 바람직한 Cas 융합 단백질은 본 명세서에서 상기에 정의한 바와 같은 FokI 도메인을 포함한다.The Cas fusion protein according to the present invention may include 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more domains in addition to the Cas protein. Examples of protein domains that can be fused to a Cas protein include, but are not limited to, epitope tags, reporter gene sequences, and protein domains having one or more of the following activities: methylase activity, demitylase activity, transcriptional activation activity, Transcription repression activity, transcription release factor activity, historical modification activity, RNA cleavage activity and nucleic acid binding activity. Non-limiting examples of epitope tags include histidine (His) tags, V5 tags, FLAG tags, influenza hemagglutinin (HA) tags, Myc tags, VSV-G tags, and thioredoxin (Trx) tags. Examples of reporter genes include, but are not limited to, glutathione-S-transferase (GST), horseradish peroxidase (HRP), chloramphenicol acetyltransferase (CAT) beta-galactosidase, beta-glucuronidase, autofluorescent proteins including luciferase, green fluorescent protein (GFP), HcRed, DsRed, cyan fluorescent protein (CFP), yellow fluorescent protein (YFP), and blue fluorescent protein (BFP). A Cas protein is a gene sequence encoding a protein or fragment of a protein that binds to a DNA molecule or binds to another cellular molecule, including but not limited to maltose binding protein (MBP), S-tag, Lex A DNA binding domain ( DBD) fusion, a GAL4 DNA binding domain fusion, and a herpes simplex virus (HSV) BP 16 protein fusion. Additional domains capable of forming part of a fusion protein comprising a CRISPR enzyme are described in US20110059502. A tagged Cas protein can be used to determine the location of the target-polynucleotide. Preferred Cas fusion proteins according to the invention comprise a Fok I domain as hereinbefore defined.

본 발명에 따른 바람직한 Cas 단백질은 핵 국소화 서열, 바람직하게는 이종 핵 국소화 서열을 포함한다. 상기와 같은 핵 국소화 서열을 또한 핵 국소화 신호라 칭한다. 바람직하게, 상기와 같은 핵 국소화 신호는 상기 CRISPR-Cas 복합체에, 숙주 세포의 핵 중에서 검출 가능한 양으로 상기 CRISPR-Cas 복합체의 축적을 구동하기에 충분한 강도를 부여한다. 이론에 얽매이고자 하는 것은 아니지만, 핵 국소화 서열이 숙주 세포에서 CRISPR-Cas 활성에 필요하지 않지만 상기와 같은 서열을 포함하는 것은, 특히 상기 핵내로의 표적 핵산 분자에 대한 상기 시스템의 활성을 증대시키는 것으로 여겨진다. 상기와 같은 핵 국소화 서열은 바람직하게는 Cas 단백질 중에 존재하지만, 상기 CRISPR-Cas 시스템의 핵에의 표적화를 촉진하도록 다른 어딘가에 또한 존재할 수도 있다. 바람직한 핵 국소화 서열은 SV40 핵 국소화 서열이다.Preferred Cas proteins according to the invention comprise a nuclear localization sequence, preferably a heterologous nuclear localization sequence. Such nuclear localization sequences are also referred to as nuclear localization signals. Preferably, such a nuclear localization signal imparts to the CRISPR-Cas complex a strength sufficient to drive accumulation of the CRISPR-Cas complex in a detectable amount in the nucleus of a host cell. While not wishing to be bound by theory, although nuclear localization sequences are not required for CRISPR-Cas activity in a host cell, the inclusion of such sequences enhances the activity of the system, particularly for target nucleic acid molecules into the nucleus. It is considered Such nuclear localization sequences are preferably present in the Cas protein, but may also be present elsewhere to facilitate targeting of the CRISPR-Cas system to the nucleus. A preferred nuclear localization sequence is the SV40 nuclear localization sequence.

본 발명에 따른 조성물 및 임의의 다른 실시태양에서, Cas 단백질 암호화 폴리뉴클레오타이드를 바람직하게는 상기를 발현시키고자 하는 숙주 세포에 코돈 최적화시킨다, 보다 바람직하게 상기 Cas 단백질 암호화 폴리뉴클레오타이드를 코돈 쌍 최적화시킨다. 일반적으로, 코돈 최적화는 관심 숙주 세포에서의 증대된 발현을 위해, 고유 아미노산 서열을 유지시키면서 숙주 세포의 유전자에 보다 빈번하게 또는 가장 빈번하게 사용되는 코돈으로 상기 고유 서열 중 적어도 하나의 코돈(예를 들어 1, 2, 3, 4, 5, 10, 15, 20, 25, 50 초과, 또는 그 이상의 코돈)을 교체시킴으로써 핵산 서열을 변형시키는 과정을 지칭한다. 다양한 종들이 특정 아미노산의 몇몇 코돈에 대해 특정한 편향을 나타낸다. 코돈 편향(유기체들간의 코돈 사용의 차이)은 종종 메신저 RNA(mRNA)의 번역 효율과 상관있으며, 이는 차례로 다른 것들 중에서도 특히 번역되는 코돈의 성질 및 특정한 전달 RNA(tRNA) 분자의 입수성에 따라 변하는 것으로 여겨진다. 하나의 세포 중의 선택된 tRNA의 우세성은 일반적으로 펩타이드 합성에 가장 빈번하게 사용되는 코돈의 반영이다. 상응하게, 유전자를 코돈 최적화에 기반하여 주어진 유기체에서의 최적의 유전자 발현에 맞출 수 있다. 코돈 사용 표를, 예를 들어 "코돈 사용 데이터베이스"에서 쉽게 입수할 수 있으며 상기 표를 다양한 방식으로 적합화시킬 수 있다. 예를 들어 문헌[Nakamura, Y., et al., 2000]을 참조하시오. 특정한 숙주 세포에서 발현을 위해 특정한 서열을 코돈 최적화하기 위한 컴퓨터 알고리즘을 또한 입수할 수 있으며, 예를 들어 진 포지(Gene Forge)(앱타젠(Aptagen); 미국 펜실바니아주 저커버스 소재)를 또한 입수할 수 있다. 바람직하게 Cas 단백질을 암호화하는 서열 중 하나 이상의 코돈(예를 들어 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, 또는 그 이상, 또는 모든 코돈)은 특정 아미노산에 대해 가장 빈번히 사용되는 코돈에 상응한다. 코돈 최적화에 바람직한 방법은 WO2006/077258 및 WO2008/000632에 기재되어 있다. WO2008/000632는 코돈-쌍 최적화를 다룬다. 코돈-쌍 최적화는 폴리펩타이드를 암호화하는 뉴클레오타이드 서열의 개선된 발현 및/또는 상기 암호화된 폴리펩타이드의 개선된 생성을 획득하기 위해, 폴리펩타이드를 암호화하는 뉴클레오타이드 서열을 그의 코돈-사용, 특히 사용되는 코돈-쌍에 대해 변형시킨 방법이다. 코돈 쌍은 암호화 서열 중 2개의 후속적인 트리플릿(코돈)의 세트로서 정의된다. 본 발명에 따른 조성물에서 소스 중 Cas 단백질의 양은 변할 수 있으며 최적의 성능을 위해 최적화시킬 수 있다. 높은 수준의 Cas 단백질은, 안내-폴리뉴클레오타이드가 존재하지 않는다 하더라도, 숙주 세포에 대해 독성일 수 있기 때문에(예를 들어 문헌[Ryan et al., 2014] 및 문헌[Jacobs et al., 2014]을 참조하시오) 상기 숙주 세포에서 Cas 단백질의 너무 높은 수준은 피하는 것이 편할 수 있다. 당해 분야의 숙련가는 예를 들어 Cas 단백질의 발현을 위해 보다 약한 프로모터, 억제성 프로모터 또는 유도성 프로모터를 선택함으로써 발현 수준을 어떻게 조절하는 지를 안다. 단백질의 발현에 적합한 프로모터의 예는 본 명세서의 다른 어딘가에 묘사되어 있다.In the composition according to the invention and in any other embodiment, the Cas protein encoding polynucleotide is preferably codon optimized in the host cell in which it is to be expressed, more preferably the Cas protein encoding polynucleotide is codon pair optimized. In general, codon optimization refers to the codon of at least one of the native sequences (e.g., the codons used more frequently or most frequently in the gene of the host cell) for enhanced expression in the host cell of interest while maintaining the native amino acid sequence. for example, 1, 2, 3, 4, 5, 10, 15, 20, 25, 50 or more codons). Different species exhibit specific biases for some codons of specific amino acids. Codon bias (differences in codon usage between organisms) is often correlated with the translation efficiency of messenger RNA (mRNA), which in turn has been shown to vary with, among other things, the nature of the codon being translated and the availability of a particular transfer RNA (tRNA) molecule. It is considered The predominance of a selected tRNA in a cell is generally a reflection of the codons most frequently used in peptide synthesis. Correspondingly, genes can be tailored for optimal gene expression in a given organism based on codon optimization. Codon usage tables are readily available, for example, in the "Codon Usage Database" and the tables can be adapted in a variety of ways. See, eg, Nakamura, Y., et al., 2000. Computer algorithms for codon optimization of specific sequences for expression in specific host cells are also available, for example, from Gene Forge (Aptagen; Zuckers, PA). can do. Preferably, one or more codons (e.g. 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, or more, or all codons) in the sequence encoding the Cas protein are the most codons for a particular amino acid. Corresponds to frequently used codons. Preferred methods for codon optimization are described in WO2006/077258 and WO2008/000632. WO2008/000632 deals with codon-pair optimization. Codon-pair optimization is a nucleotide sequence encoding a polypeptide, in order to obtain improved expression of the nucleotide sequence encoding the polypeptide and/or improved production of the encoded polypeptide, its codon-use, in particular the codons used - A modified method for pairs. A codon pair is defined as a set of two successive triplets (codons) in a coding sequence. The amount of Cas protein in the source in the composition according to the invention can vary and can be optimized for optimal performance. High levels of Cas proteins can be toxic to host cells even in the absence of guide-polynucleotides (eg, Ryan et al ., 2014 and Jacobs et al ., 2014). see) it may be convenient to avoid too high levels of Cas protein in the host cell. The person skilled in the art knows how to control the expression level, for example by selecting a weaker, repressive or inducible promoter for the expression of the Cas protein. Examples of promoters suitable for expression of proteins are described elsewhere herein.

본 발명에 따른 안내-폴리뉴클레오타이드가 폴리뉴클레오타이드에 의해 암호화되는 본 발명에 따른 조성물에서, 상기 안내-폴리뉴클레오타이드의 발현을 암호화 폴리뉴클레오타이드에 작동적으로 연결된 프로모터에 의해 촉진할 수 있다. 상기와 같은 프로모터는 당해 분야의 숙련가에게 공지된 임의의 적합한 프로모터일 수 있다. 다수 유형의 프로모터를 사용할 수 있다. 편의상 RNA 폴리머라제 III 프로모터 또는 RNA 폴리머라제 II 프로모터를 사용할 수 있다. RNA 폴리머라제 III 및 그의 프로모터에 대한 배경 정보를 예를 들어 문헌[Marck et al., 2006]에서 찾을 수 있다. 상응하게, 편의상 RNA 폴리머라제 II 프로모터를 사용할 수 있으며, 상기는 당해 분야의 숙련가에게 공지되어 있고 예를 들어 문헌[Kornberg, 1999]에 리뷰되어 있다. 그러나, RNA II 폴리머라제로부터의 전사물은 종종 복잡한 전사 종결자를 가지며 전사물은 폴리아데닐화되고; 이는, 기능성 CRISPR-Cas 시스템을 생성시키기 위해 필요한 2차 구조를 성취하기 위해서 5' 및 3' 단부를 모두 정확하게 한정할 필요가 있기 때문에 상기 안내-폴리뉴클레오타이드의 요건을 방해할 수도 있다. 그러나 상기 결점은 회피될 수 있다. RNA 폴리머라제 II 프로모터가 사용되는 경우, 상기 안내-폴리뉴클레오타이드를 암호화하는 폴리뉴클레오타이드는 또한 자기-가공 리보자임을 암호화할 수 있고 RNA 폴리머라제 II 프로모터에 작동적으로 연결될 수 있으며; 이와 같이 상기 폴리뉴클레오타이드는 상기 안내-폴리뉴클레오타이드 및 자기-가공 리보자임을 포함하는 프리-안내-폴리뉴클레오타이드를 암호화하며, 여기에서 상기 안내-폴리뉴클레오타이드는 전사시 상기 프리-안내-폴리뉴클레오타이드 전사물로부터 자기-가공 리보자임에 의해 방출된다. RNA 폴리머라제 II 프로모터에 작동적으로 연결된 본 발명에 따른 프리-안내-폴리뉴클레오타이드를 암호화하는 폴리뉴클레오타이드를 포함하는 바람직한 구조물은 본 명세서에서 실시예 1 내지 4에 묘사된 것들이다. 상기와 같은 구조물에 대한 배경 정보를 예를 들어 문헌[Gao et al., 2014]에서 찾을 수 있다.In the composition according to the invention in which the guide-polynucleotide according to the invention is encoded by a polynucleotide, expression of the guide-polynucleotide can be promoted by a promoter operably linked to the encoding polynucleotide. Such promoters may be any suitable promoter known to those skilled in the art. Many types of promoters can be used. For convenience, an RNA polymerase III promoter or an RNA polymerase II promoter may be used. Background information on RNA polymerase III and its promoters can be found, for example, in Marck et al., 2006. Correspondingly, it is convenient to use the RNA polymerase II promoter, which is known to the person skilled in the art and is reviewed, for example, in Kornberg, 1999. However, transcripts from RNA II polymerases often have complex transcription terminators and transcripts are polyadenylated; This may interfere with the requirement of the guide-polynucleotide as it is necessary to precisely define both the 5' and 3' ends to achieve the secondary structure needed to generate a functional CRISPR-Cas system. However, this drawback can be avoided. When an RNA polymerase II promoter is used, the polynucleotide encoding the guide-polynucleotide may also encode a self-processing ribozyme and may be operably linked to an RNA polymerase II promoter; As such, the polynucleotide encodes the guide-polynucleotide and a pre-guide-polynucleotide comprising a self-processing ribozyme, wherein the guide-polynucleotide is transcribed from the pre-guide-polynucleotide transcript upon transcription. released by self-engineering ribozymes. Preferred constructs comprising a polynucleotide encoding a pre-guide-polynucleotide according to the invention operably linked to an RNA polymerase II promoter are those described in Examples 1-4 herein. Background information on such structures can be found, for example, in Gao et al., 2014.

바람직하게, 본 발명에 따른 조성물에서, 상기 안내-폴리뉴클레오타이드는 폴리뉴클레오타이드에 의해 암호화된다.Preferably, in the composition according to the invention, said guide-polynucleotide is encoded by a polynucleotide.

바람직하게, 상기 안내-폴리뉴클레오타이드가 폴리뉴클레오타이드에 의해 암호화되는 본 발명에 따른 조성물에서, 상기 폴리뉴클레오타이드는 RNA 폴리머라제 II 프로모터에 작동적으로 연결되며 상기 안내-폴리뉴클레오타이드 및 자기-가공 리보자임을 포함하는 프리-안내-폴리뉴클레오타이드를 암호화하고, 여기에서 상기 안내-폴리뉴클레오타이드는 전사시 상기 프리-안내-폴리뉴클레오타이드 전사물로부터 자기-가공 리보자임에 의해 방출된다. RNA 폴리머라제 II 프로모터에 작동적으로 연결된 본 발명에 따른 프리-안내-폴리뉴클레오타이드를 암호화하는 폴리뉴클레오타이드를 포함하는 바람직한 구조물은 본 명세서에서 실시예 1 내지 4에 묘사된 것들이다. 편의상, 다수의 프리-안내-폴리뉴클레오타이드 및 다수의 자기-가공 리보자임을 하나 이상의 RNA 폴리머라제 II 프로모터에 작동적으로 연결된 단일 폴리뉴클레오타이드에 의해 암호화할 수 있다.Preferably, in the composition according to the invention, wherein said guide-polynucleotide is encoded by a polynucleotide, said polynucleotide is operably linked to an RNA polymerase II promoter and comprises said guide-polynucleotide and a self-processing ribozyme and encodes a pre-guide-polynucleotide, wherein the guide-polynucleotide is released by a self-processing ribozyme from the pre-guide-polynucleotide transcript upon transcription. Preferred constructs comprising a polynucleotide encoding a pre-guide-polynucleotide according to the invention operably linked to an RNA polymerase II promoter are those described in Examples 1-4 herein. For convenience, multiple pre-guide-polynucleotides and multiple self-processing ribozymes may be encoded by a single polynucleotide operably linked to one or more RNA polymerase II promoters.

본 발명의 첫 번째 태양에 따른 조성물을 편의상 숙주 세포에서 폴리뉴클레오타이드의 발현을 조절하는데 사용할 수 있다. 상응하게, 두 번재 태양에서, 본 발명은 숙주 세포에서 폴리뉴클레오타이드의 발현을 조절하는 방법을 제공하며, 상기 방법은 숙주 세포를 본 발명의 첫 번째 태양에 따른 조성물과 접촉시킴을 포함하고, 여기에서 상기 안내-폴리뉴클레오타이드는 상기 숙주 세포 중의 표적-폴리뉴클레오타이드에서 Cas 단백질의 결합을 지시하여 CRISPR-Cas 복합체를 형성시킨다.The composition according to the first aspect of the present invention may conveniently be used to modulate the expression of polynucleotides in a host cell. Correspondingly, in a second aspect, the invention provides a method of modulating the expression of a polynucleotide in a host cell, said method comprising contacting the host cell with a composition according to the first aspect of the invention, wherein The guide-polynucleotide directs binding of the Cas protein to the target-polynucleotide in the host cell to form a CRISPR-Cas complex.

본 발명의 상황에서 "발현"이란 용어는 본 명세서에서 폴리뉴클레오타이드가 폴리뉴클레오타이드 주형으로부터 전사되는 과정(예를 들어 DNA 주형 폴리뉴클레오타이드가 mRNA 폴리뉴클레오타이드 전사물 또는 다른 RNA 전사물로 전사된다) 및/또는 mRNA 전사물이 후속적으로 펩타이드, 폴리펩타이드, 또는 단백질로 번역되는 과정으로서 정의된다. 전사물 및 암호화된 폴리펩타이드를 집합적으로 "유전자 산물"로서 지칭할 수 있다. 상기 폴리뉴클레오타이드 전사물이 게놈 주형 DNA로부터 유래되는 경우, 발현은 숙주 세포에서 상기 mRNA 전사물의 이어맞추기를 포함할 수 있다. "조절 발현"이란 용어는 본 명세서에서 동일한 조건을 사용하여 분석시 발현이 조절되지 않은 모 숙주 세포에 비해 증가되거나 감소된 발현을 지칭한다. 감소된 발현은 mRNA와 같은 전사물의 감소된 양 및/또는 폴리펩타이드와 같은 번역 산물의 감소된 양일 수 있다. 증가된 발현은 mRNA와 같은 전사물의 증대된 양 및/또는 폴리펩타이드와 같은 번역 산물의 증대된 양일 수 있음이 된다.The term "expression" in the context of the present invention refers herein to the process by which a polynucleotide is transcribed from a polynucleotide template (eg a DNA template polynucleotide is transcribed into an mRNA polynucleotide transcript or other RNA transcript) and/or It is defined as the process by which mRNA transcripts are subsequently translated into peptides, polypeptides, or proteins. Transcripts and encoded polypeptides may be referred to collectively as “gene products”. Where the polynucleotide transcript is derived from genomic template DNA, expression may include splicing of the mRNA transcript in a host cell. The term “regulated expression” refers herein to increased or decreased expression as compared to a parental host cell whose expression is not regulated when assayed using the same conditions. Reduced expression may be a reduced amount of a transcript such as mRNA and/or a reduced amount of a translation product such as a polypeptide. Increased expression may be an enhanced amount of a transcript such as mRNA and/or an enhanced amount of a translation product such as a polypeptide.

바람직하게, 상기 CRISPR-Cas 복합체는 상기 표적-폴리뉴클레오타이드의 위치에서 하나 또는 2개의 폴리뉴클레오타이드 가닥 모두를 절단하여 유전자 산물의 조절된 발현을 생성시킨다. 상기 CRISPR-Cas 복합체는 또한 변경된 뉴클레아제 활성을 가질 수 있으며 표적-폴리뉴클레오타이드의 하나 또는 2개의 가닥을 모두 절단하는 능력이 실질적으로 없다; 상기와 같은 경우에 발현은 표적-폴리뉴클레오타이드에 대한 상기 복합체의 결합에 의해 조절된다. 실질적으로 모든 효소 활성이 없는 Cas 단백질을 편의상 유전자 침묵화 또는 발현의 하향조절에 사용할 수 있는데, 그 이유는 상기 CRISPR-Cas 복합체가 상기 표적-폴리뉴클레오타이드로부터의 전사를 방해할 것이기 때문이다. 한편으로, Cas 단백질을 관심 유전자의 프로그램화 가능한 전사 활성화 또는 침묵화를 위한 전사 인자로 변형시킬 수 있다(문헌[Larson, et al., 2013]).Preferably, the CRISPR-Cas complex cleaves one or both polynucleotide strands at the location of the target-polynucleotide resulting in regulated expression of the gene product. The CRISPR-Cas complex may also have altered nuclease activity and is substantially devoid of the ability to cleave one or both strands of the target-polynucleotide; In such cases, expression is regulated by binding of the complex to the target-polynucleotide. Cas proteins lacking substantially all enzymatic activity can conveniently be used for gene silencing or downregulation of expression, since the CRISPR-Cas complex will interfere with transcription from the target-polynucleotide. Alternatively, the Cas protein can be modified as a transcription factor for programmable transcriptional activation or silencing of a gene of interest (Larson, et al., 2013).

본 발명의 첫 번째 태양에 따른 조성물을 편의상 폴리뉴클레오타이드의 결실에 사용할 수 있다. 하나의 실시태양에서, 본 발명의 첫 번째 태양에 따른 조성물이 적어도 하나 또는 2개의 안내-폴리뉴클레오타이드의 소스를 포함하고/하거나 적어도 하나의 Cas 단백질의 소스를 포함하는 경우, 상기 표적-폴리뉴클레오타이드의 하나의 위치 또는 상이한 위치에서 하나 또는 2개의 폴리뉴클레오타이드 가닥을 모두 절단하여 상기 표적-폴리뉴클레오타이드로부터 폴리뉴클레오타이드 단편의 결실을 생성시키는, 적어도 하나의 CRISPR-Cas 복합체 또는 2개의 상이한 CRISPR-Cas 복합체가 형성된다. 바람직하게, 적어도 하나 또는 2개의 안내-폴리뉴클레오타이드 및/또는 적어도 하나의 Cas 단백질의 소스를 포함하는 본 발명에 따른 상기와 같은 조성물은, 상기 안내-폴리뉴클레오타이드(들)에 의해 표적화된 적어도 하나 또는 2개의 표적-폴리뉴클레오타이드에 적어도 부분적으로 상보성인, 본 명세서에서 하기에 정의하는 바와 같은 외인성 폴리뉴클레오타이드를 추가로 포함한다. 결실되는 상기와 같은 폴리뉴클레오타이드 단편 또는 결실된 단편은 길이가 수 뉴클레오타이드 내지 수천 뉴클레오타이드일 수 있으며, 전체 유전자가 결실되거나 또는 유전자들의 클러스터가 결실될 수 있다. 상응하게, 본 발명은 숙주 세포에서 폴리뉴클레오타이드의 발현을 조절하는 방법을 제공하며, 여기에서 폴리뉴클레오타이드 단편은 표적-폴리뉴클레오타이드로부터 결실된다.The composition according to the first aspect of the present invention may be conveniently used for deletion of polynucleotides. In one embodiment, when the composition according to the first aspect of the invention comprises at least one or two sources of intra-polynucleotides and/or comprises at least one source of Cas proteins, the target-polynucleotides At least one CRISPR-Cas complex or two different CRISPR-Cas complexes is formed, which cleaves one or both polynucleotide strands at one or different positions resulting in deletion of a polynucleotide fragment from the target-polynucleotide do. Preferably, such a composition according to the invention comprising at least one or two guide-polynucleotides and/or a source of at least one Cas protein comprises at least one or and an exogenous polynucleotide, as defined herein below, that is at least partially complementary to the two target-polynucleotides. The polynucleotide fragment or the deleted fragment to be deleted may be several nucleotides to several thousand nucleotides in length, and the entire gene may be deleted or a cluster of genes may be deleted. Correspondingly, the present invention provides a method of modulating the expression of a polynucleotide in a host cell, wherein the polynucleotide fragment is deleted from the target-polynucleotide.

하나의 실시태양에서, 발현 조절 방법은 상기 표적-폴리뉴클레오타이드 중 적어도 하나의 위치에서 하나 또는 2개의 폴리뉴클레오타이드 가닥 모두의 절단에 이어서 외인성 폴리뉴클레오타이드와의 상동성 재조합에 의한 상기 표적-폴리뉴클레오타이드의 변형을 포함한다. 상기와 같은 경우에, 본 발명의 첫 번째 태양에 따른 조성물은 바람직하게는 상기와 같은 외인성 폴리뉴클레오타이드를 추가로 포함한다. 상기와 같은 변형은 상기 표적-폴리뉴클레오타이드 중의 적어도 하나의 뉴클레오타이드의 삽입, 결실 또는 치환을 생성시킬 수 있으며, 여기에서 상기 삽입 또는 치환 뉴클레오타이드는 상기 외인성 폴리뉴클레오타이드로부터 기원할 수 있다. 변형은 또한 상기 외인성 폴리뉴클레오타이드가 문헌[Dong et al.] 및 [Beetham et al.]에 기재된 바와 같은 비-통합 존재인 경우 이루어질 수 있으며; 이 경우에 상기 표적-폴리뉴클레오타이드는 변형되지만 상기 외인성 폴리뉴클레오타이드의 뉴클레오타이드는 상기 표적-폴리뉴클레오타이드내로 도입된다. 결과적으로, 상기 생성 숙주는 본 발명에 따른 Cas-단백질을 단백질로서 숙주 세포에 도입시키는 경우 비-재조합 숙주 세포이다. 상기 외인성 폴리뉴클레오타이드는 임의의 관심 폴리뉴클레오타이드, 예를 들어 본 명세서에서 하기에 정의하는 바와 같은 관심 화합물을 암호화하는 폴리뉴클레오타이드, 또는 상기와 같은 폴리뉴클레오타이드의 일부 또는 그의 변체일 수 있다. 상기와 같은 외인성 폴리뉴클레오타이드는 본 명세서에서 본 발명에 따른 외인성 폴리뉴클레오타이드로서 지칭되며 단일 가닥 또는 이중 가닥일 수 있다.In one embodiment, the expression control method comprises cleavage of one or both polynucleotide strands at at least one position in the target-polynucleotide followed by modification of the target-polynucleotide by homologous recombination with an exogenous polynucleotide. includes In such a case, the composition according to the first aspect of the present invention preferably further comprises such an exogenous polynucleotide. Such modifications may result in an insertion, deletion or substitution of at least one nucleotide in the target-polynucleotide, wherein the insertion or substitution nucleotide may originate from the exogenous polynucleotide. Modifications can also be made when the exogenous polynucleotide is a non-integrating entity as described in Dong et al. and Beetham et al.; In this case, the target-polynucleotide is modified but the nucleotides of the exogenous polynucleotide are introduced into the target-polynucleotide. Consequently, the production host is a non-recombinant host cell when the Cas-protein according to the invention is introduced into the host cell as a protein. The exogenous polynucleotide may be any polynucleotide of interest, for example a polynucleotide encoding a compound of interest as defined herein below, or a portion of such polynucleotide or a variant thereof. Such exogenous polynucleotides are referred to herein as exogenous polynucleotides according to the present invention and may be single-stranded or double-stranded.

다양한 용도가 본 발명에 따른 조성물 및 방법에 대해서 당해 분야의 숙련가에 의해 고려될 수 있다. 게놈 중 폴리뉴클레오타이드(또는 유전자)를 본 발명에 따른 조성물 및 방법을 사용하여 변형시키거나, 편집하거나 또는 붕괴시킬 수 있다. 예를 들어 상기 표적-폴리뉴클레오타이드의 2개 가닥 모두를 절단하는 완전 활성 Cas 단백질을 사용하는 경우 및 외인성 폴리뉴클레오타이드가 적합한 복구 주형으로서 존재하지 않는 경우, 이중가닥 절단이 비-상동성 말단 부착 복구(NHEJ)에 의해 복구된다. NHEJ 동안 하나 또는 다수의 뉴클레오타이드의 삽입 및/또는 결실(일부의 경우에 치환으로서 이해될 수 있다)이 발생할 수 있으며, 이들은 상기 복구 부위에 무작위로 삽입되거나 결실되고; 이는 NHEJ의 특징이다. 상기와 같은 삽입 및/또는 결실은 암호화 서열의 판독 프레임에 영향을 미칠 수 있으며, 이는 (조기)정지 코돈의 발생 또는 이어맞추기 부위의 변경의 경우에 유전자 산물 또는 심지어 절두된 단백질 중의 아미노산 변화를 생성시킬 수 있다.Various uses are contemplated by those skilled in the art for the compositions and methods according to the present invention. Polynucleotides (or genes) in the genome can be modified, edited or disrupted using the compositions and methods according to the invention. For example, when using a fully active Cas protein that cleaves both strands of the target-polynucleotide and when an exogenous polynucleotide is not present as a suitable repair template, double-strand breaks are NHEJ). During NHEJ insertions and/or deletions of one or multiple nucleotides (which may be understood as substitutions in some cases) may occur, which are randomly inserted or deleted at the repair site; This is characteristic of NHEJ. Such insertions and/or deletions can affect the reading frame of the coding sequence, which in the case of the occurrence of (early)stop codons or alteration of splicing sites produces amino acid changes in the gene product or even in the truncated protein. can do it

외인성 폴리뉴클레오타이드가 복구 주형으로서 존재하는 경우, 게놈 중 폴리뉴클레오타이드(또는 유전자)를 상동성 말단 부착 복구(HEJ)(또한 상동성-지시된 복구(HDR)로서 공지되어 있다)를 사용하는 본 발명에 따른 조성물 및 방법을 사용하여 변형시키거나, 편집하거나 붕괴시킬 수 있다. 예를 들어 상기 표적-폴리뉴클레오타이드에 일치하는 서열을 갖는 외인성 폴리뉴클레오타이드(즉 상기 이중 가닥 절단의 상류(5') 및 하류(3'))가 본 발명에 따른 CRISPR-Cas 시스템과 함께 존재하는 경우, HDR은 상기 표적-폴리뉴클레오타이드 중의 이중 가닥 절단에 상기 외인성 폴리뉴클레오타이드의 상응하는 뉴클레오타이드를 도입시킬 것이다(또는 실제로 재생성시킬 것이다). 바람직하게, 본 발명에 따른 외인성 폴리뉴클레오타이드는 상기 표적 서열 자체에 이어서 기능성 PAM 서열을 함유하지 않아 상기 외인성 폴리뉴클레오타이드 자체 또는 변형된 표적-폴리뉴클레오타이드가 상기 CRISPR-Cas 시스템에 의해 (재)절단될 위험을 피한다.When an exogenous polynucleotide is present as a repair template, a polynucleotide (or gene) in the genome is used in the present invention using homology end attachment repair (HEJ) (also known as homology-directed repair (HDR)). It can be modified, edited, or disrupted using the compositions and methods according to the present invention. For example, when an exogenous polynucleotide having a sequence consistent with the target-polynucleotide (ie upstream (5') and downstream (3') of the double-strand break) is present with the CRISPR-Cas system according to the present invention. , HDR will introduce (or actually regenerate) the corresponding nucleotides of the exogenous polynucleotide to a double strand break in the target-polynucleotide. Preferably, the exogenous polynucleotide according to the present invention does not contain a functional PAM sequence following the target sequence itself, so that the exogenous polynucleotide itself or the modified target-polynucleotide risk (re)cleavage by the CRISPR-Cas system avoid

본 발명의 실시태양에서, 본 발명에 따른 CRISPR-Cas 시스템이 외인성 폴리뉴클레오타이드(공여체 폴리뉴클레오타이드, 공여체 DNA, 복구 주형)를 포함하는 경우, 본 발명에 따른 CRISPR-Cas 시스템은 바람직하게는 하나 이상의 별도의 폴리뉴클레오타이드 또는 벡터에 의해 암호화되거나 또는 상기 상에 존재하는 2개 이상의 안내-폴리뉴클레오타이드를 포함하며, 2개 이상의 외인성 폴리뉴클레오타이드는 상기 CRISPR-Cas 시스템과 함께 제공되어 2개 이상의 CRISPR-Cas 복합체의 형성을 가능하게 한다. 본 발명에 따른 방법에서, 본 발명에 따른 상기와 같은 CRISPR-Cas 시스템을 편의상 2개 이상의 표적-폴리뉴클레오타이드에서의 발현을 조절하는데 사용할 수 있다, 즉 다수의 표적 부위를 표적화하는 방법에 사용할 수 있다. 본 발명에 따른 상기와 같은 CRISPR-Cas 시스템은 하나 이상의 표적-폴리뉴클레오타이드에서 우연히 1, 2 또는 그 이상의 CRISPR-Cas 복합체를 형성시킬 것이다. 상기와 같은 방법을 사용하여 숙주 세포의 게놈 중에, 임의로 하나 이상의 외인성 폴리뉴클레오타이드와 함께 하나 이상의 삽입, 결실, 치환을 생성시키거나, 또는 상기 형성된 CRISPR-Cas 복합체를 통해 유전자의 발현을 조절할 수 있다.In an embodiment of the invention, if the CRISPR-Cas system according to the invention comprises an exogenous polynucleotide (donor polynucleotide, donor DNA, repair template), the CRISPR-Cas system according to the invention preferably comprises one or more separate at least two guide-polynucleotides encoded by or present on the polynucleotide or vector of make it possible to form In the method according to the present invention, the CRISPR-Cas system as described above according to the present invention can be conveniently used to regulate expression in two or more target-polynucleotides, that is, it can be used in a method for targeting multiple target sites. . Such a CRISPR-Cas system according to the present invention will inadvertently form one, two or more CRISPR-Cas complexes in one or more target-polynucleotides. Such methods can be used to generate one or more insertions, deletions, substitutions, optionally together with one or more exogenous polynucleotides, in the genome of a host cell, or to regulate the expression of genes through the formed CRISPR-Cas complex.

숙주 세포host cell

본 발명의 상기 태양에 따른 방법에서, 바람직한 숙주 세포는 본 명세서의 다른 어딘가에서 정의된 바와 같은 관심 화합물을 암호화하는 폴리뉴클레오타이드를 포함한다.In the method according to this aspect of the invention, a preferred host cell comprises a polynucleotide encoding a compound of interest as defined elsewhere herein.

본 발명의 상기 태양에 따른 방법에서, 상기 숙주 세포는 재조합 숙주 세포이거나 또는 비-재조합 숙주 세포일 수 있다.In the method according to this aspect of the invention, the host cell may be a recombinant host cell or a non-recombinant host cell.

일부 실시태양에서, 상기 숙주 세포는 라비린툴로마이세트, 바람직하게는 트라우스토키트리알레스 목의 일원, 바람직하게는 트라우스토키트리아세아에 과의 일원, 보다 바람직하게는 아우란티오키트리움, 오블롱기키트리움, 스키조키트리움, 트라우스토키트리움 및 울케니아로 이루어지는 그룹 중에서 선택되는 속의 일원, 훨씬 더 바람직하게는 스키조키트리움 종 ATCC# 20888이다.In some embodiments, the host cell is a labyrinthulomycet, preferably a member of the order Troystochytriales, preferably a member of the family Thraustochytriaceae, more preferably Aurantiochytrium, Ober A member of the genus selected from the group consisting of Longichytrium, Schizochytrium, Troustochytrium and Ulkenia, even more preferably Schizochytrium sp. ATCC# 20888.

본 발명의 상기 태양에 따른 숙주 세포 중의 폴리뉴클레오타이드의 발현 조절 방법은, 바람직하게는 본 발명의 첫 번째 태양에 따른 조성물의 성분들을 포함하는 변형된 숙주 세포를 생성시킨다. 상응하게, 세 번째 태양에서, 본 발명은 본 발명의 첫 번째 태양에 따른 조성물을 포함하는 숙주 세포를 제공한다. 상기와 같은 숙주 세포는 본 명세서에 정의된 바와 같은 임의의 숙주 세포일 수 있으며 본 명세서에서 달리 어딘가에 정의된 바와 같은 관심 화합물을 암호화하는 폴리뉴클레오타이드를 추가로 포함할 수 있다.The method for regulating the expression of a polynucleotide in a host cell according to this aspect of the invention preferably results in a modified host cell comprising the components of the composition according to the first aspect of the invention. Correspondingly, in a third aspect, the invention provides a host cell comprising a composition according to the first aspect of the invention. Such host cell may be any host cell as defined herein and may further comprise a polynucleotide encoding a compound of interest as defined elsewhere herein.

네 번째 태양에서, 본 발명은 숙주 세포의 생성 방법을 제공하며, 상기 방법은 숙주 세포를 본 발명의 첫 번째 태양에 따른 조성물과 접촉시킴을 포함하고, 여기에서 상기 안내-폴리뉴클레오타이드는 상기 숙주 세포 중의 표적-폴리뉴클레오타이드에서 Cas 단백질의 결합을 지시하여 CRISPR-Cas 복합체를 형성시킨다. 하나의 실시태양에서, 본 발명의 첫 번째 태양에 따른 조성물과의 접촉을 2 단계로 수행할 수 있으며, 여기에서 상기 숙주 세포를 본 발명에 따른 Cas 단백질의 소스와 1차 접촉시키고 후속으로 상기 숙주 세포를 본 발명에 따른 안내-폴리뉴클레오타이드의 소스 및 임의로 본 발명에 따른 외인성 폴리뉴클레오타이드와 접촉시킨다. 본 발명의 상기 실시태양에서 숙주 세포는 본 명세서에서 정의된 바와 같은 임의의 유형의 숙주 세포일 수 있으며 본 명세서의 다른 어딘가에 정의된 바와 같은 관심 화합물을 암호화하는 폴리뉴클레오타이드를 포함할 수 있다. 본 발명에 따른 숙주 세포의 바람직한 생성 방법은 자손 숙주 세포를 생성시키는 단계를 포함하며, 여기에서 상기 자손 숙주 세포 중에 본 발명에 따른 CRISPR-Cas 시스템의 성분은 더 이상 존재하지 않는다.In a fourth aspect, the invention provides a method for producing a host cell, said method comprising contacting said host cell with a composition according to the first aspect of the invention, wherein said guide-polynucleotide comprises said host cell It directs the binding of Cas protein at the target-polynucleotide in the CRISPR-Cas complex to form. In one embodiment, the contacting with the composition according to the first aspect of the invention can be carried out in two steps, wherein said host cell is first contacted with a source of Cas protein according to the invention and subsequently said host The cell is contacted with a source of an intra-polynucleotide according to the invention and optionally an exogenous polynucleotide according to the invention. The host cell in this embodiment of the invention may be any type of host cell as defined herein and may comprise a polynucleotide encoding a compound of interest as defined elsewhere herein. A preferred method for producing a host cell according to the present invention comprises the step of generating a progeny host cell, wherein in said progeny host cell the component of the CRISPR-Cas system according to the present invention is no longer present.

본 발명의 첫 번째 태양에 따른 조성물은 본 명세서에 정의된 바와 같은 임의의 상기와 같은 조성물일 수 있다. 숙주 세포를 본 발명에 따른 조성물과 접촉시키는 것을 당해 분야의 숙련가에게 공지된 임의의 수단에 의해 수행할 수 있다. 본 발명에 따른 숙주 세포를 간단히 본 발명에 따른 조성물을 포함하는 용액으로 가져올 수 있다. 본 발명에 따른 조성물을 숙주 세포로 전달하는 특정한 수단을 사용할 수 있다. 당해 분야의 숙련가는 상기와 같은 방법들을 알고 있으며(예를 들어 상기 문헌[Sambrook & Russell]; 문헌[Ausubel]을 참조하시오), 여기에는 비제한적으로 일렉트로포레이션 방법, 입자 충격 또는 미립자가속장치 충격, 원형질체 방법 및 아그로박테리움(Agrobacterium) 매개된 형질전환(AMT)이 포함된다. 라비린툴로마이세테스를 당해 분야에 공지된 임의의 방법을 사용하여 형질전환시킬 수 있다. 트라우스토키트리드의 유전자 형질전환을 위한 일반적인 기법이 미국특허 제 7,001,772 호 및 미국특허 제 8,637,651 호 및 문헌[Cheng et al. (2012)](이들은 모두 내용 전체가 본 명세서에 참고로 인용된다)에 상세히 기재되어 있다.The composition according to the first aspect of the present invention may be any such composition as defined herein. Contacting the host cell with the composition according to the invention can be effected by any means known to the person skilled in the art. The host cells according to the invention can simply be brought into solution comprising the composition according to the invention. Any means of delivering a composition according to the invention to a host cell may be employed. Those skilled in the art are aware of such methods (see, for example, Sambrook & Russell, supra; Ausubel), and include, but are not limited to, electroporation methods, particle bombardment or particle accelerator bombardment. , it includes the protoplast method, and Agrobacterium (Agrobacterium) mediated transfection (AMT). Labyrinthulomycetes can be transformed using any method known in the art. General techniques for genetic transformation of thraustochytrids are described in US Pat. Nos. 7,001,772 and 8,637,651 and Cheng et al . (2012)], all of which are incorporated herein by reference in their entirety.

바람직하게, 상기 CRISPR-Cas 복합체는 상기 표적-폴리뉴클레오타이드의 위치에서 하나 또는 2개의 폴리뉴클레오타이드 가닥 모두를 절단하여 상기 유전자 산물의 조절된 발현을 생성시킨다. 상기 CRISPR-Cas 복합체는 또한 변경된 뉴클레아제 활성을 가질 수 있으며 표적-폴리뉴클레오타이드의 하나 또는 2개의 가닥 모두를 절단하는 능력이 없을 수 있고; 상기와 같은 경우에, 발현은 상기 복합체의 상기 표적-폴리뉴클레오타이드에의 결합에 의해 조절된다.Preferably, said CRISPR-Cas complex cleaves one or both polynucleotide strands at the location of said target-polynucleotide resulting in regulated expression of said gene product. The CRISPR-Cas complex may also have altered nuclease activity and may lack the ability to cleave one or both strands of the target-polynucleotide; In such cases, expression is regulated by binding of the complex to the target-polynucleotide.

하나의 실시태양에서, 본 발명의 첫 번째 태양에 따른 조성물이 적어도 하나 또는 2개의 안내-폴리뉴클레오타이드의 소스 및/또는 적어도 하나의 Cas 단백질의 소스를 포함하는 경우, 상기 표적-폴리뉴클레오타이드의 하나의 위치 또는 상이한 위치에서 하나 또는 2개의 폴리뉴클레오타이드 가닥을 모두 절단하여, 상기 표적-폴리뉴클레오타이드로부터 폴리뉴클레오타이드 단편의 결실을 생성시키는 적어도 하나의 CRISPR-Cas 복합체 또는 2개의 상이한 CRISPR-Cas 복합체가 형성된다. 바람직하게, 적어도 하나 또는 2개의 안내-폴리뉴클레오타이드 및/또는 적어도 하나의 Cas 단백질의 소스를 포함하는 본 발명에 따른 상기와 같은 조성물은 상기 안내-폴리뉴클레오타이드(들)에 의해 표적화된 상기 적어도 하나 또는 2개의 표적-폴리뉴클레오타이드에 적어도 부분적으로 상보성인, 본 명세서에서 하기에 정의하는 바와 같은 외인성 폴리뉴클레오타이드를 추가로 포함한다. 결실되는 상기와 같은 폴리뉴클레오타이드 단편 또는 결실된 단편은 길이가 수 뉴클레오타이드 내지 수천 뉴클레오타이드일 수 있으며, 전체 유전자가 결실되거나 또는 유전자들의 클러스터가 결실될 수 있다. 상응하게, 본 발명은 숙주 세포에서 폴리뉴클레오타이드의 발현을 조절하는 방법을 제공하며, 여기에서 폴리뉴클레오타이드 단편은 표적-폴리뉴클레오타이드로부터 결실된다.In one embodiment, when the composition according to the first aspect of the present invention comprises at least one or two sources of intra-polynucleotides and/or at least one source of Cas proteins, one of said target-polynucleotides At least one CRISPR-Cas complex or two different CRISPR-Cas complexes are formed that cleave one or both polynucleotide strands at a location or at a different location resulting in deletion of the polynucleotide fragment from the target-polynucleotide. Preferably, such a composition according to the invention comprising at least one or two guide-polynucleotides and/or a source of at least one Cas protein comprises said at least one or and an exogenous polynucleotide, as defined herein below, that is at least partially complementary to the two target-polynucleotides. The polynucleotide fragment or the deleted fragment to be deleted may be several nucleotides to several thousand nucleotides in length, and the entire gene may be deleted or a cluster of genes may be deleted. Correspondingly, the present invention provides a method of modulating the expression of a polynucleotide in a host cell, wherein the polynucleotide fragment is deleted from the target-polynucleotide.

하나의 실시태양에서, 폴리뉴클레오타이드 단편이 표적-폴리뉴클레오타이드로부터 결실되는, 숙주 세포 중의 폴리뉴클레오타이드의 발현 조절 방법은 숙주 세포를 본 명세서에 기재된 바와 같은 조성물과 접촉시킴을 포함하며, 여기에서 상기 안내-폴리뉴클레오타이드는 상기 숙주 세포 중의 표적-폴리뉴클레오타이드에서 상기 Cas 단백질의 결합을 지시하여 CRISPR-Cas 복합체를 형성시킨다. 바람직하게, 폴리뉴클레오타이드 단편이 표적-폴리뉴클레오타이드로부터 결실되는, 숙주 세포 중의 폴리뉴클레오타이드의 발현 조절 방법은 숙주 세포를 본 명세서에 기재된 바와 같은 조성물과 접촉시킴을 포함하며, 여기에서 상기 안내-폴리뉴클레오타이드는 상기 숙주 세포 중의 표적-폴리뉴클레오타이드에서 상기 Cas 단백질의 결합을 지시하여 CRISPR-Cas 복합체를 형성시킨다. 또 다른 바람직한 실시태양에서, 폴리뉴클레오타이드 단편이 표적-폴리뉴클레오타이드로부터 결실되는, 숙주 세포 중의 폴리뉴클레오타이드의 발현 조절 방법은 숙주 세포를 본 명세서에 기재된 바와 같은 조성물과 접촉시킴을 포함하며, 여기에서 상기 안내-폴리뉴클레오타이드는 상기 숙주 세포 중의 표적-폴리뉴클레오타이드에서 상기 Cas 단백질의 결합을 지시하여 CRISPR-Cas 복합체를 형성시키고, 여기에서 본 명세서에 기재된 바와 같은 조성물은 외인성 또는 공여체 폴리뉴클레오타이드를 포함하지 않는다. 숙주 세포에서 폴리뉴클레오타이드의 발현 조절 방법의 또 다른 실시태양에서, 상기 조성물은 자율적으로 복제하는 벡터 중에 포함된다.In one embodiment, a method of regulating the expression of a polynucleotide in a host cell, wherein the polynucleotide fragment is deleted from the target-polynucleotide, comprises contacting the host cell with a composition as described herein, wherein the guide- The polynucleotide directs the binding of the Cas protein to the target-polynucleotide in the host cell to form a CRISPR-Cas complex. Preferably, the method for regulating the expression of a polynucleotide in a host cell, wherein the polynucleotide fragment is deleted from the target-polynucleotide, comprises contacting the host cell with a composition as described herein, wherein the guide-polynucleotide comprises: The target-polynucleotide in the host cell directs the binding of the Cas protein to form a CRISPR-Cas complex. In another preferred embodiment, a method for regulating the expression of a polynucleotide in a host cell, wherein the polynucleotide fragment is deleted from the target-polynucleotide, comprises contacting the host cell with a composition as described herein, wherein the instructions -polynucleotide directs binding of said Cas protein at target-polynucleotide in said host cell to form a CRISPR-Cas complex, wherein the composition as described herein does not comprise an exogenous or donor polynucleotide. In another embodiment of a method of regulating the expression of a polynucleotide in a host cell, the composition is comprised in an autonomously replicating vector.

따라서, 본 발명은 하나의 실시태양에서 세포 중의 폴리뉴클레오타이드의 발현을 조절하는 방법에 관한 것으로, 여기에서 폴리뉴클레오타이드 단편이 표적-폴리뉴클레오타이드로부터 결실되며, 상기 방법은 숙주 세포를 본 명세서에 기재된 바와 같은 조성물과 접촉시킴을 포함하지만, 바람직하게는 본 명세서에 정의된 바와 같은 공여체 폴리뉴클레오타이드는 포함하지 않으며, 여기에서 상기 안내-폴리뉴클레오타이드는 상기 숙주 세포 중의 표적-폴리뉴클레오타이드에서 Cas 단백질의 결합을 지시하여 CRISPR-Cas 복합체를 형성시킨다.Accordingly, the present invention relates, in one embodiment, to a method of modulating the expression of a polynucleotide in a cell, wherein the polynucleotide fragment is deleted from the target-polynucleotide, wherein the method comprises treating the host cell as described herein. contacting with a composition, but preferably not comprising a donor polynucleotide as defined herein, wherein said guide-polynucleotide directs binding of a Cas protein at a target-polynucleotide in said host cell to thereby CRISPR-Cas complex is formed.

바람직한 실시태양에서, 상기 Cas 단백질은 상기 표적-서열의 위치에서 2개의 폴리뉴클레오타이드 가닥 모두의 절단을 지시하는 활성을 가지며, 여기에서 상기 절단은, 상기 Cas 단백질에 의해 절단시 서로 재결합하여 2개의 상동성 영역들 사이에 포함된 폴리뉴클레오타이드의 결실을 생성시키는 상기 영역들 사이에 포함된 게놈의 영역 중에서 발생한다. 바람직하게, 상기 2개의 상동성 영역간의 상동성 정도는 상동성 재조합을 허용하는 정도이다. 바람직하게, 상기 2개의 상동성 영역은 상기 상동성 영역의 전체 길이에 걸쳐 적어도 60%, 70%, 80%, 90%, 99% 또는 100% 서열 일치성을 갖는다. 놀랍게도, 상동성 영역의 길이는 라비린툴로마이세테스에서조차 매우 짧을 수 있으며, 여기에서 상동성 재조합을 허용하기 위해서 대개는 적어도 1 또는 수 kbp의 길이가 필요한 것으로 밝혀졌다. 따라서, 바람직한 실시태양에서, 상기 상동성 영역의 길이는 바람직하게는 1 kb 이하, 0.5 kb 이하, 100 bp 이하, 50 bp 이하, 40 bp 이하, 30 bp 이하, 20 bp 이하, 10 bp 이하이다.In a preferred embodiment, the Cas protein has the activity to direct cleavage of both polynucleotide strands at the position of the target-sequence, wherein the cleavage, upon cleavage by the Cas protein, rejoins each other to form two phases. Occurs among regions of the genome comprised between regions of homology that result in a deletion of the polynucleotide comprised between those regions. Preferably, the degree of homology between the two regions of homology is such that homologous recombination is permitted. Preferably, said two regions of homology have at least 60%, 70%, 80%, 90%, 99% or 100% sequence identity over the entire length of said regions of homology. Surprisingly, it has been found that the length of the regions of homology can be very short even in labyrinthulomycetes, where a length of at least 1 or several kbp is usually required to allow for homologous recombination. Thus, in a preferred embodiment, the length of the region of homology is preferably 1 kb or less, 0.5 kb or less, 100 bp or less, 50 bp or less, 40 bp or less, 30 bp or less, 20 bp or less, 10 bp or less.

바람직하게 상기 2개의 상동성 영역간의 거리는 10 kb 이하, 9 이하, 8 kb 이하, 7 kb 이하, 6 kb 이하, 5 kb 이하, 4 kb 이하, 3 kb 이하, 2 kb 이하, 1 kb 이하, 0.5 kb 이하, 100bp 이하, 50bp 이하, 40bp 이하, 30 bp 이하, 20 bp 이하, 10 bp 이하이다.Preferably, the distance between the two regions of homology is 10 kb or less, 9 or less, 8 kb or less, 7 kb or less, 6 kb or less, 5 kb or less, 4 kb or less, 3 kb or less, 2 kb or less, 1 kb or less, 0.5 kb or less, 100 bp or less, 50 bp or less, 40 bp or less, 30 bp or less, 20 bp or less, 10 bp or less.

하나의 태양에서, 본 발명은 하나 이상의 PAM 부위를 표적화하고 공여체 DNA의 사용 없이 폴리뉴클레오타이드의 결실을 생성시키는 방법을 설계하기 위해서 상기 PAM 부위의 이웃 중 약 7 내지 20 bp의 상동성 영역 사이에 포함된 게놈 중에서 PAM 부위를 식별할 수 있는 소프트웨어 알고리즘에 관한 것이다.In one aspect, the present invention provides for targeting one or more PAM sites and including between about 7-20 bp of homology regions of the neighbours of said PAM sites to design a method for generating deletions of polynucleotides without the use of donor DNA. It relates to a software algorithm that can identify PAM sites in the genome.

상기 방법을 설계된 방식으로 폴리뉴클레오타이드 서열의 효율적인 제거에 사용할 수 있다. 예를 들어, Cas9 발현 카세트를 상기 게놈 DNA에 도입시킬 때 및 상기 CRISPR/CAS9 시스템에 의해 매개되는 수회 라운드의 변형 후에, 상기 CAS9 발현 카세트를 상기 Cas9 발현 카세트 중의 한 부위를 표적화하는 gRNA의 도입에 의해 상기 게놈으로부터 제거할 수 있으며, 여기에서 상기 Cas9 발현 카세트는 상기 정의된 바와 같은 2개의 상동성 영역 사이에, 바람직하게 100-bp 길이로, 보다 바람직하게 20-bp, 15-bp 길이로 또는 더 짧게 포함되며, 상기 Cas9 개방 판독 프레임 또는 상기 발현 카세트의 큰 부분을 절단시킨다.The method can be used for efficient removal of polynucleotide sequences in a designed manner. For example, when introducing a Cas9 expression cassette into the genomic DNA and after several rounds of modification mediated by the CRISPR/CAS9 system, the CAS9 expression cassette can be used for introduction of a gRNA targeting a site in the Cas9 expression cassette. wherein the Cas9 expression cassette is between two regions of homology as defined above, preferably 100-bp in length, more preferably 20-bp, 15-bp in length or shorter, truncating a large portion of the Cas9 open reading frame or the expression cassette.

상기 방법을 또한 유전자의 일시적인 불활성화에 사용할 수 있다. 예를 들어, 각각 5'-단부 및 3'-단부에 2개의 상동성 영역을 포함하는 카로티노이드 신타제 또는 포화된 지방산 신타제 또는 다중불포화된 지방산 신타제 유전자의 ORF에 폴리뉴클레오타이드 서열을 삽입함으로써 상기 유전자(예를 들어 카로티노에드 신타제 또는 포화된 지방산 신타제 또는 다중불포화된 지방산 신타제)를 비-기능성으로 만들 수 있으며, 여기에서 바람직하게 상기 상동성 영역은 100-bp, 보다 바람직하게 20-bp, 15-bp 길이이거나 또는 더 짧다. 상기 언급한 신타제 유전자를 상술한 바와 같이 공여체 DNA 없이 CRISPR-Cas9 시스템을 사용하여 다시 기능성으로 만들 수 있다.The method can also be used for transient inactivation of genes. For example, by inserting a polynucleotide sequence into the ORF of a carotenoid synthase or saturated fatty acid synthase or polyunsaturated fatty acid synthase gene comprising two regions of homology at the 5′-end and the 3′-end, respectively. A gene (eg carotinoed synthase or saturated fatty acid synthase or polyunsaturated fatty acid synthase) may be rendered non-functional, wherein preferably said region of homology is 100-bp, more preferably 20 -bp, 15-bp long or shorter. The aforementioned synthase gene can be made functional again using the CRISPR-Cas9 system without donor DNA as described above.

하나의 실시태양에서, 상기 발현 조절 방법은 상기 표적-폴리뉴클레오타이드의 적어도 하나의 위치에서 하나 또는 2개의 폴리뉴클레오타이드 가닥 모두의 절단에 이어서 외인성 폴리뉴클레오타이드와의 상동성 재조합에 의한 상기 표적-폴리뉴클레오타이드의 변형을 포함한다. 상기와 같은 경우에, 본 발명의 첫 번째 태양에 따른 조성물은 바람직하게는 상기와 같은 외인성 폴리뉴클레오타이드를 추가로 포함한다. 상기와 같은 변형은 상기 표적-폴리뉴클레오타이드 중의 적어도 하나의 뉴클레오타이드의 삽입, 결실 또는 치환을 생성시킬 수 있으며, 여기에서 상기 삽입 또는 치환 뉴클레오타이드는 상기 외인성 폴리뉴클레오타이드로부터 기원할 수도 또는 기원하지 않을 수도 있다. 하나의 실시태양에서, 상기 외인성 폴리뉴클레오타이드는 상기 표적-폴리뉴클레오타이드와의 상동성 영역을 포함한다. 바람직하게, 이들 상동성 영역간의 상동성 정도는 상동성 재조합을 허용하는 정도이다. 바람직하게 상기 상동성 영역은 상기 상동성 영역의 전체 길이에 걸쳐 적어도 60%, 70%, 80%, 90%, 99% 또는 100% 서열 일치성을 갖는다. 상기 외인성 폴리뉴클레오타이드가 비-통합 개체인 경우 변형을 또한 수행할 수 있으며; 이 경우에 상기 표적-폴리뉴클레오타이드는 변형되지만 상기 외인성 폴리뉴클레오타이드의 뉴클레오타이드는 상기 표적-폴리뉴클레오타이드내로 도입되지 않는다. 결과적으로, 상기 생성되는 숙주는 본 발명에 따른 Cas-단백질을 단백질로서 상기 숙주 세포에 도입시킬 때 비-재조합 숙주이다. 따라서, 본 발명의 상기 태양에 따른 방법에서, 상기 숙주 세포는 재조합 숙주 세포이거나 또는 비-재조합 숙주 세포일 수 있다. 상기 외인성 폴리뉴클레오타이드는 임의의 관심 폴리뉴클레오타이드, 예를 들어 본 명세서에서 정의된 바와 같은 관심 화합물을 암호화하는 폴리뉴클레오타이드, 또는 상기와 같은 폴리뉴클레오타이드의 일부 또는 그의 변체일 수 있다.In one embodiment, the expression control method comprises cleavage of one or both polynucleotide strands at at least one position of the target-polynucleotide followed by homologous recombination with an exogenous polynucleotide of the target-polynucleotide. include transformations. In such a case, the composition according to the first aspect of the present invention preferably further comprises such an exogenous polynucleotide. Such modifications may result in an insertion, deletion or substitution of at least one nucleotide in the target-polynucleotide, wherein the insertion or substitution nucleotide may or may not originate from the exogenous polynucleotide. In one embodiment, the exogenous polynucleotide comprises a region of homology with the target-polynucleotide. Preferably, the degree of homology between these regions of homology is such that it permits homologous recombination. Preferably said regions of homology have at least 60%, 70%, 80%, 90%, 99% or 100% sequence identity over the entire length of said regions of homology. Modifications may also be effected when the exogenous polynucleotide is a non-integrating entity; In this case, the target-polynucleotide is modified but the nucleotides of the exogenous polynucleotide are not introduced into the target-polynucleotide. Consequently, the resulting host is a non-recombinant host when introducing the Cas-protein according to the invention as a protein into the host cell. Thus, in the method according to this aspect of the invention, the host cell may be a recombinant host cell or a non-recombinant host cell. The exogenous polynucleotide may be any polynucleotide of interest, for example a polynucleotide encoding a compound of interest as defined herein, or a portion of such polynucleotide or a variant thereof.

또 다른 태양에서, 본 발명은 관심 화합물의 제조 방법을 제공하며, 상기 방법은 상기 관심 화합물에 도움이 되는 조건하에서 본 발명의 세 번째 또는 네 번째 태양에 따른 숙주 세포 또는 본 발명의 두 번째 태양에 따른 방법에 의해 수득된 숙주 세포, 또는 본 발명의 네 번째 태양에 따른 방법에 의해 수득될 수 있는 숙주 세포를 배양하고 임의로 상기 관심 화합물을 정제 또는 단리시킴을 포함한다.In another aspect, the present invention provides a method for the preparation of a compound of interest, said method being in a host cell according to the third or fourth aspect of the invention or in the second aspect of the invention under conditions conducive to said compound of interest. culturing the host cell obtained by the method according to the present invention, or the host cell obtainable by the method according to the fourth aspect of the present invention, and optionally purifying or isolating said compound of interest.

본 발명의 모든 실시태양의 상황에서 관심 화합물은 임의의 생물학적 화합물일 수 있다. 상기 생물학적 화합물은 바이오매스 또는 생물중합체 또는 대사길항물질일 수 있다. 상기 생물학적 화합물은 단일 폴리뉴클레오타이드 또는 생합성 또는 대사 경로를 구성하는 일련의 폴리뉴클레오타이드에 의해 암호화되거나 또는 단일 폴리뉴클레오타이드의 생성물 또는 일련의 폴리뉴클레오타이드의 생성물의 직접적인 결과일 수 있으며, 상기 폴리뉴클레오타이드는 유전자일 수 있고, 상기 일련의 폴리뉴클레오타이드는 유전자 클러스터일 수 있다. 본 발명의 모든 실시태양에서, 상기 생물학적 관심 화합물을 암호화하는 단일 폴리뉴클레오타이드 또는 일련의 폴리뉴클레오타이드 또는 상기 생물학적 관심 화합물과 관련된 생합성 또는 대사 경로는 본 발명에 따른 조성물 및 방법에 바람직한 표적이다. 상기 생물학적 화합물은 상기 숙주 세포에 고유하거나 또는 상기 숙주 세포에 이종일 수 있다.In the context of all embodiments of the invention, the compound of interest may be any biological compound. The biological compound may be a biomass or a biopolymer or an antagonist. The biological compound may be encoded by a single polynucleotide or a series of polynucleotides constituting a biosynthetic or metabolic pathway or may be the product of a single polynucleotide or a direct result of the product of a series of polynucleotides, wherein the polynucleotide may be a gene and the series of polynucleotides may be a gene cluster. In all embodiments of the present invention, a single polynucleotide or series of polynucleotides encoding said compound of biological interest or a biosynthetic or metabolic pathway associated with said compound of biological interest is a preferred target for the compositions and methods according to the present invention. The biological compound may be native to the host cell or heterologous to the host cell.

"이종 생물학적 화합물"이란 용어는 본 명세서에서 상기 세포에 고유하지 않은 생물학적 화합물; 또는 구조적 변형이 이루어져 상기 고유의 생물학적 화합물을 변경시킨 고유의 생물학적 화합물로서 정의된다.The term "heterologous biological compound" as used herein refers to a biological compound that is not native to the cell; or as a native biological compound in which a structural modification has been made to alter the native biological compound.

"생물중합체"란 용어는 본 명세서에서 동일하거나, 유사하거나 또는 다른 서브유닛(단량체)의 쇄로서 정의된다. 상기 생물중합체는 임의의 생물중합체일 수 있다. 상기 생물중합체는 예를 들어 비제한적으로 핵산, 폴리아민, 폴리올, 폴리펩타이드(또는 폴리아미드), 또는 폴리사카라이드일 수 있다.The term "biopolymer" is defined herein as a chain of identical, similar or different subunits (monomers). The biopolymer may be any biopolymer. The biopolymer can be, for example, but not limited to, a nucleic acid, polyamine, polyol, polypeptide (or polyamide), or polysaccharide.

상기 생물중합체는 폴리펩타이드일 수 있다. 상기 폴리펩타이드는 관심 생물학적 활성을 갖는 임의의 폴리펩타이드일 수 있다. 상기 "폴리펩타이드"란 용어는 본 명세서에서 특정 길이의 암호화된 생성물을 지칭함을 의미하지 않으며, 따라서 펩타이드, 올리고펩타이드, 및 단백질을 포함한다. 폴리펩타이드란 용어는 임의의 길이의 아미노산의 중합체를 지칭한다. 상기 중합체는 선형이거나 분지될 수 있으며, 상기는 변형된 아미노산을 포함할 수 있고, 상기는 비-아미노산에 의해 중단될 수 있다. 상기 용어는 또한 변형된 아미노산 중합체를 포함한다; 예를 들어 다이설파이드 결합 형성, 글리코실화, 지질화, 아세틸화, 인산화, 또는 임의의 다른 조작, 예를 들어 표지화 성분과의 접합. 본 명세서에 사용되는 바와 같이, "아미노산"이란 용어는 천연 및/또는 비천연 또는 합성 아미노산, 예를 들어 글리신 및 D 또는 L 광학 이성질체 모두, 및 아미노산 유사체 및 펩티도유사물질을 포함한다. 폴리펩타이드는 천연 대립유전자 및 상기-언급한 폴리펩타이드의 조작된 변형 및 하이브리드 폴리펩타이드를 추가로 포함한다. 상기 폴리펩타이드는 상기 숙주 세포에 고유하거나 또는 이종일 수 있다. 상기 폴리펩타이드는 콜라겐 또는 젤라틴이거나, 또는 그의 변체 또는 하이브리드일 수 있다. 상기 폴리펩타이드는 항체 또는 그의 부분, 항원, 응고인자, 효소, 호르몬 또는 호르몬 변체, 수용체 또는 그의 부분, 조절 단백질, 구조 단백질, 리포터, 또는 수송 단배질, 분비 과정에 관련된 단백질, 폴딩 과정에 관련된 단백질, 셰프론, 펩타이드 아미노산 수송체, 글리코실화 인자, 전사 인자, 합성 펩타이드 또는 올리고펩타이드, 세포내 단백질일 수 있다. 상기 세포내 단백질은 효소, 예를 들어 프로테아제, 세라미다제, 에폭사이드 하이드롤라제, 아미노펩티다제, 아실라제, 알돌라제, 하이드록실라제, 아미노펩티다제, 리파제일 수 있다. 상기 폴리펩타이드는 또한 세포외로 분비된 효소일 수 있다. 상기와 같은 효소는 옥시도리덕타제, 트랜스퍼라제, 하이드롤라제, 라이아제, 아이소머라제, 리가제, 카탈라제, 셀룰라제, 키티나제, 큐티나제, 데옥시리보뉴클레아제, 덱스트라나제, 에스테라제의 그룹에 속할 수 있다. 상기 효소는 카보하이드라제, 예를 들어 셀룰라제, 예를 들어 엔도글루카나제, β-글루카나제, 셀로바이오하이드롤라제 또는 β-글루코시다제, 헤미셀룰라제 또는 펙틴분해 효소, 예를 들어 자일라나제, 자일로시다제, 만나나제, 갈락타나제, 갈락토시다제, 펙틴 메틸 에스테라제, 펙틴 라이아제, 펙테이트 라이아제, 엔도 폴리갈락튜로나제, 엑소폴리갈락튜로나제 람노갈락튜로나제, 아라바나제, 아라비노퓨라노시다제, 아라비녹실란 하이드롤라제, 갈락튜로나제, 라이아제, 또는 전분분해 효소; 하이드롤라제, 아이소머라제, 또는 리가제, 포스파타제, 예를 들어 피타제, 에스테라제, 예를 들어 리파제, 단백질분해 효소, 옥시도리덕타제, 예를 들어 옥시다제, 트랜스퍼라제, 또는 아이소머라제일 수 있다. 상기 효소는 피타제일 수 있다. 상기 효소는 아미노펩티다제, 아스파라기나제, 아밀라제, 말토제닉 아밀라제, 카보하이드라제, 카복시펩티다제, 엔도-프로테아제, 메탈로-프로테아제, 세린-프로테아제 카탈라제, 키티나제, 큐티나제, 사이클로덱스트린 글리코실트랜스퍼라제, 데옥시리보뉴클레아제, 에스테라제, 알파-갈락토시다제, 베타-갈락토시다제, 글루코아밀라제, 알파-글루코시다제, 베타-글루코시다제, 할로페록시다제, 단백질 데아미나제, 인베르타제, 라카제, 리파제, 만노시다제, 뮤타나제, 옥시다제, 펙틴분해 효소, 페록시다제, 포스포리파제, 갈락토리파제, 클로로필라제, 폴리페놀옥시다제, 리보뉴클레아제, 트랜스글루타미나제, 또는 글루코스 옥시다제, 헥소스 옥시다제, 모노옥시게나제일 수 있다.The biopolymer may be a polypeptide. The polypeptide may be any polypeptide having a biological activity of interest. The term "polypeptide" is not meant herein to refer to an encoded product of a particular length, and thus includes peptides, oligopeptides, and proteins. The term polypeptide refers to a polymer of amino acids of any length. The polymer may be linear or branched, it may comprise modified amino acids, and it may be interrupted by non-amino acids. The term also includes modified amino acid polymers; For example, disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, or any other manipulation, such as conjugation with a labeling component. As used herein, the term "amino acid" includes natural and/or unnatural or synthetic amino acids, such as glycine and both D or L optical isomers, and amino acid analogs and peptidomites. Polypeptides further include native alleles and engineered modifications of the above-mentioned polypeptides and hybrid polypeptides. The polypeptide may be native or heterologous to the host cell. The polypeptide may be collagen or gelatin, or a variant or hybrid thereof. The polypeptide is an antibody or portion thereof, antigen, coagulation factor, enzyme, hormone or hormonal variant, receptor or portion thereof, regulatory protein, structural protein, reporter, or transport protein, protein involved in the secretion process, protein involved in the folding process , chephrons, peptide amino acid transporters, glycosylation factors, transcription factors, synthetic peptides or oligopeptides, intracellular proteins. The intracellular protein may be an enzyme such as a protease, ceramidase, epoxide hydrolase, aminopeptidase, acylase, aldolase, hydroxylase, aminopeptidase, lipase. The polypeptide may also be an extracellularly secreted enzyme. Such enzymes are oxidoreductase, transferase, hydrolase, lyase, isomerase, ligase, catalase, cellulase, chitinase, cutinase, deoxyribonuclease, dextranase, S It may belong to the group of therase. Said enzyme is a carbohydrase, for example a cellulase, for example an endoglucanase, β-glucanase, cellobiohydrolase or β-glucosidase, hemicellulase or pectinase, for example For xylanase, xylosidase, mannanase, galactanase, galactosidase, pectin methyl esterase, pectin lyase, pectate lyase, endo polygalacturonase, exopolygalacturonase rhamnogalacturonase, arabanase, arabinofuranosidase, arabinoxylan hydrolase, galacturonase, lyase, or starch degrading enzyme; hydrolases, isomerases, or ligases, phosphatases such as phytases, esterases such as lipases, proteases, oxidoreductases such as oxidases, transferases, or isomeras can be the best The enzyme may be phytase. The enzymes are aminopeptidase, asparaginase, amylase, maltogenic amylase, carbohydrase, carboxypeptidase, endo-protease, metallo-protease, serine-protease catalase, chitinase, cutinase, cyclodextrin glycosyltransferase, deoxyribonuclease, esterase, alpha-galactosidase, beta-galactosidase, glucoamylase, alpha-glucosidase, beta-glucosidase, haloperoxidase, Protein deaminase, invertase, laccase, lipase, mannosidase, mutanase, oxidase, pectinase, peroxidase, phospholipase, galactolipase, chlorophyllase, polyphenoloxidase, ribo nuclease, transglutaminase, or glucose oxidase, hexose oxidase, monooxygenase.

본 발명에 따라, 관심 화합물은 WO2010/102982에 기재된 바와 같은 개선된 분비 특징을 갖는 폴리펩타이드 또는 효소일 수 있다. 본 발명에 따라, 관심 화합물은 또 다른 폴리펩타이드가 상기 폴리펩타이드 또는 그의 단편의 N-말단 또는 C-말단에 융합된, 융합된 또는 하이브리드 폴리펩타이드일 수 있다. 융합된 폴리펩타이드는 하나 이상의 폴리펩타이드를 암호화하는 핵산 서열(또는 그의 일부)을 또 다른 폴리펩타이드를 암호화하는 핵산 서열(또는 그의 일부)에 융합시킴으로써 생성된다.According to the present invention, the compound of interest may be a polypeptide or enzyme with improved secretion characteristics as described in WO2010/102982. According to the present invention, the compound of interest may be a fused, fused or hybrid polypeptide in which another polypeptide is fused to the N-terminus or C-terminus of said polypeptide or fragment thereof. A fused polypeptide is produced by fusing a nucleic acid sequence (or a portion thereof) encoding one or more polypeptides to a nucleic acid sequence (or portion thereof) encoding another polypeptide.

융합 폴리펩타이드의 생성 기법은 당해 분야에 공지되어 있으며, 상기 폴리펩타이드를 암호화하는 암호화 서열들을 이들이 인프레임이고 상기 융합된 폴리펩타이드의 발현이 동일한 프로모터(들) 및 종결자의 조절하에 있도록 결찰시킴을 포함한다. 상기 하이브리드 폴리펩타이드는 적어도 2개의 상이한 폴리펩타이드로부터 수득된 부분적인 또는 완전한 폴리펩타이드 서열의 조합을 포함할 수 있으며, 여기에서 하나 이상은 상기 숙주 세포에 대해 이종일 수 있다. 융합 폴리펩타이드 및 신호 서열 융합의 예는 예를 들어 WO2010/121933에 기재된 바와 같다.Techniques for generating fusion polypeptides are known in the art and include ligating the coding sequences encoding the polypeptide so that they are in frame and expression of the fused polypeptide is under the control of the same promoter(s) and terminator. do. The hybrid polypeptide may comprise a combination of partial or complete polypeptide sequences obtained from at least two different polypeptides, wherein one or more may be heterologous to the host cell. Examples of fusion polypeptides and signal sequence fusions are as described, for example, in WO2010/121933.

상기 생물중합체는 폴리사카라이드일 수 있다. 상기 폴리사카라이드는 임의의 폴리사카라이드, 예를 들어 비제한적으로 뮤코폴리사카라이드(예를 들어 헤파린 및 히아루론산) 및 질소-함유 폴리사카라이드(예를 들어 키틴)일 수 있다. 바람직한 옵션에서, 상기 폴리사카라이드는 히아루론산이다.The biopolymer may be a polysaccharide. The polysaccharide can be any polysaccharide, including but not limited to mucopolysaccharides (eg heparin and hyaluronic acid) and nitrogen-containing polysaccharides (eg chitin). In a preferred option, the polysaccharide is hyaluronic acid.

상기 관심 화합물을 암호화하거나 본 발명에 따른 상기 관심 화합물의 생성에 관련된 화합물을 암호화하는 폴리뉴클레오타이드는 1차 또는 2차 대사산물, 예를 들어 유기산, 카로티노이드, (베타-락탐) 항생제 및 비타민의 합성에 관련된 효소를 암호화할 수 있다. 상기와 같은 대사산물은 본 발명에 따른 생물학적 화합물로서 간주될 수 있다.Polynucleotides encoding said compound of interest or encoding a compound involved in the production of said compound of interest according to the present invention may be used in the synthesis of primary or secondary metabolites such as organic acids, carotenoids, (beta-lactam) antibiotics and vitamins. It can encode related enzymes. Such metabolites can be regarded as biological compounds according to the present invention.

"대사산물"이란 용어는 1차 및 2차 대사산물 모두를 포함하며; 상기 대사산물은 임의의 대사산물일 수 있다. 바람직한 대사산물은 불포화된 지방 및 지질(비제한적으로 지방산 도코사헥사엔산, 도코사펜타엔산, 에루크산, 파울린산, 바크센산(시스 또는 트랜스), 에이코사펜타엔산, 에이코사테트라엔산(n-3), 아라키돈산(n-6), 옥사데카펜타엔산, 스테아리돈산, 리놀렌산(n6 또는 n3), 리놀레산, 올레산, 팔미톨레산, 옥타코사옥타엔산, 및 이들 중에 포함된 지질을 포함한다), 포화된 지방 및 지질(비제한적으로 지방산: 카프로산, 카프릴산, 카프르산, 라우르산, 미리스트산, 팔미트산, 스테아르산, 아라키드산, 베헨산, 및 이들 중에 포함된 지질을 포함한다), 황산화된 지질, 소포로지질, 락톤, 세라미드, 인지질, 지방 알콜, 지방산 에스터, 지방산 에테르, 지방산 알데하이드, 스테롤, 카로티노이드, 옥시리핀, 레솔빈, 류코트리엔, 프로스타글란딘, 유기산(비제한적으로 아세트산, 부티르산, 시트르산, 글루콘산, 아디프산, 푸마르산, 이타콘산, 말산, 메발론산 및 숙신산을 포함한다), 당 알콜 및 당산이다.The term "metabolite" includes both primary and secondary metabolites; The metabolite may be any metabolite. Preferred metabolites are unsaturated fats and lipids, including but not limited to the fatty acids docosahexaenoic acid, docosapentaenoic acid, erucic acid, paulinic acid, baxenoic acid (cis or trans), eicosapentaenoic acid, eicosa. Tetraenoic acid (n-3), arachidonic acid (n-6), oxadecapentaenoic acid, stearidonic acid, linolenic acid (n6 or n3), linoleic acid, oleic acid, palmitoleic acid, octacosaoctaenoic acid, and these lipids), saturated fats and lipids (including but not limited to fatty acids: caproic acid, caprylic acid, capric acid, lauric acid, myristic acid, palmitic acid, stearic acid, arachidic acid, behenic acid, and lipids contained therein), sulfated lipids, sophorolipids, lactones, ceramides, phospholipids, fatty alcohols, fatty acid esters, fatty acid ethers, fatty acid aldehydes, sterols, carotenoids, oxylipins, resorbin , leukotrienes, prostaglandins, organic acids (including but not limited to acetic acid, butyric acid, citric acid, gluconic acid, adipic acid, fumaric acid, itaconic acid, malic acid, mevalonic acid and succinic acid), sugar alcohols and sugar acids.

대사산물은 예를 들어 생합성 또는 대사 경로 중의 하나 이상의 유전자에 의해 암호화될 수 있다. 1차 대사산물은 세포의 1차 또는 일반적인 대사의 산물이며, 이는 에너지 대사, 성장 및 구조에 관한 것이다. 2차 대사산물은 2차 대사의 산물이다(예를 들어 문헌[R. B. Herbert, The Biosynthesis of Secondary Metabolites, Chapman and Hall, New York, 1981]을 참조하시오).A metabolite may be encoded by, for example, one or more genes in a biosynthetic or metabolic pathway. Primary metabolites are products of the cell's primary or general metabolism, which relate to energy metabolism, growth and structure. Secondary metabolites are products of secondary metabolism (see, eg, R. B. Herbert, The Biosynthesis of Secondary Metabolites, Chapman and Hall, New York, 1981).

1차 대사산물은 비제한적으로 아미노산, 지방산, 뉴클레오사이드, 뉴클레오타이드, 당, 트리글리세라이드, 또는 비타민일 수 있다.The primary metabolite may be, but is not limited to, an amino acid, a fatty acid, a nucleoside, a nucleotide, a sugar, a triglyceride, or a vitamin.

2차 대사산물은 비제한적으로 알칼로이드, 쿠마린, 플라보노이드, 폴리케타이드, 퀴닌, 스테로이드, 펩타이드 또는 터펜일 수 있다. 상기 2차 대사산물은 항생제, 섭식저해물질, 유인물질, 살균제, 살진균제, 호르몬, 살충제, 또는 살서제일 수 있다.The secondary metabolite may be, but is not limited to, an alkaloid, coumarin, flavonoid, polyketide, quinine, steroid, peptide or terpene. The secondary metabolite may be an antibiotic, an eating inhibitor, an attractant, a bactericide, a fungicide, a hormone, an insecticide, or a rodenticide.

상기 생물학적 화합물은 또한 선택성 마커의 생성물일 수 있다. 선택성 마커는 생성물이 살생물제 또는 바이러스 내성, 중금속 내성, 영양요구체에 대한 원영양요구 등을 제공하는 관심 폴리뉴클레오타이드의 생성물이다. 선택성 마커는 비제한적으로 nptII(네오마이신 포스포트랜스퍼라제 II), ALS(아세토락테이트 신타제), bsd(블라스티시딘-S-데아미나제), 및 Sh ble(플레오마이신 결합)뿐만 아니라 이들의 균등물을 포함한다.The biological compound may also be the product of a selectable marker. A selectable marker is a product of a polynucleotide of interest for which the product provides biocide or viral resistance, heavy metal resistance, prototrophic auxotrophs, and the like. Selectable markers include, but are not limited to, nptII (neomycin phosphotransferase II), ALS (acetolactate synthase), bsd (blasticidin-S-deaminase), and Sh ble (pleomycin binding), as well as but also includes equivalents thereof.

본 발명에 따라, 관심 화합물은 바람직하게는 관심 화합물 목록에 기재된 바와 같은 폴리펩타이드이다.According to the invention, the compound of interest is preferably a polypeptide as described in the list of compounds of interest.

본 발명의 또 다른 실시태양에 따라, 관심 화합물은 바람직하게는 대사산물이다.According to another embodiment of the invention, the compound of interest is preferably a metabolite.

본 발명에 따른 숙주 세포는 이미 상기 관심 화합물을 생성할 수 있다. 상기 숙주 세포에는 또한 폴리펩타이드를 암호화하는 동종 또는 이종 핵산 구조물이 제공될 수 있으며, 여기에서 상기 폴리펩타이드는 상기 관심 화합물 또는 상기 관심 화합물의 생성에 관련된 폴리펩타이드일 수 있다. 당해 분야의 숙련가는 상기 관심 화합물을 생성할 수 있도록 조류 숙주 세포를 어떻게 변형시키는 지를 알고 있다.The host cell according to the invention is already capable of producing said compound of interest. The host cell may also be provided with a homologous or heterologous nucleic acid construct encoding a polypeptide, wherein the polypeptide may be the compound of interest or a polypeptide involved in the production of the compound of interest. One of ordinary skill in the art knows how to modify avian host cells to produce such compounds of interest.

일반적인 정의general definition

본 명세서 및 첨부된 특허청구범위 전체를 통해서, "포함하다", "함유하다" 및 "갖는"이란 단어 및 "포함하다", "포함하는", "함유하다" 및 "함유하는"과 같은 변형들은 포괄적으로 해석되어야 한다. 즉, 이들 단어는 문맥상 허용되는 경우 구체적으로 명시되지 않은 다른 요소들 또는 정수들의 가능한 포함을 전달하고자 한다.Throughout this specification and the appended claims, the words "comprises", "contains" and "having" and variations such as "comprises", "comprising", "contains" and "comprising" are used throughout. should be interpreted comprehensively. That is, these words are intended to convey the possible inclusion of other elements or integers not specifically specified where the context permits.

"하나의"란 용어는 본 명세서에서 상기 관사의 문법적 목적어 중 하나 또는 하나 초과(즉 하나 또는 적어도 하나)를 지칭한다. 예로서, "요소"는 하나의 요소 또는 하나 초과의 요소를 의미할 수 있다.The term “a” refers to one or more than one (ie, one or at least one) of the grammatical objects of the article herein. By way of example, “an element” may mean one element or more than one element.

"약" 또는 "대략적으로"란 단어는 수치와 함께 사용될 때(예를 들어 약 10) 바람직하게는 상기 값이 상기 주어진 값(10)의 1% 초과 또는 미만의 값일 수 있음을 의미한다.The word "about" or "approximately" when used in conjunction with a numerical value (eg about 10) preferably means that the value can be greater than or less than 1% of the given value (10) above.

달리 정의되지 않는 한, 본 명세서에 사용된 모든 과학기술 용어들은 본 명세가 속하는 분야의 통상적인 숙련가에게 통상적으로 이해되는 바와 동일한 의미를 갖는다.Unless defined otherwise, all scientific and technical terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this specification belongs.

바람직한 뉴클레오타이드 유사체 또는 균등물은 변형된 주쇄를 포함한다. 상기와 같은 주쇄의 예는 모폴리노 주쇄, 카바메이트 주쇄, 실록산 주쇄, 설파이드, 설폭사이드 및 설폰 주쇄, 포름아세틸 및 티오포름아세틸 주쇄, 메틸렌포름아세틸 주쇄, 리보아세틸 주쇄, 알켄 함유 주쇄, 설파메이트, 설포네이트 및 설폰아미드 주쇄, 메틸렌이미노 및 메틸렌하이드라지노 주쇄, 및 아미드 주쇄에 의해 제공된다. 주쇄 중의 잔기간의 결합은 인 원자를 포함하지 않는 것, 예를 들어 단쇄 알킬 또는 사이클로알킬 뉴클레오사이드간 결합, 혼합된 이종원자 및 알킬 또는 사이클로알킬 뉴클레오사이드간 결합, 또는 하나 이상의 단쇄 이종원자 또는 헤테로사이클릭 뉴클레오사이드간 결합에 의해 형성되는 것이 더욱 바람직하다.Preferred nucleotide analogs or equivalents include modified backbones. Examples of such main chains include morpholino main chain, carbamate main chain, siloxane main chain, sulfide, sulfoxide and sulfone main chain, formacetyl and thioformacetyl main chain, methyleneformacetyl main chain, riboacetyl main chain, alkene-containing main chain, sulfamate , sulfonate and sulfonamide backbones, methyleneimino and methylenehydrazino backbones, and amide backbones. Bonds between residues in the backbone do not contain phosphorus atoms, such as short chain alkyl or cycloalkyl internucleoside bonds, mixed heteroatoms and alkyl or cycloalkyl internucleoside bonds, or one or more short chain heteroatoms. Or it is more preferably formed by a heterocyclic internucleoside bond.

바람직한 뉴클레오타이드 유사체 또는 균등물은 변형된 폴리아미드 주쇄(문헌[Nielsen, et al. (1991) Science 254, 1497-1500]를 갖는 펩타이드 핵산(PNA)을 포함한다. PNA-계 분자는 염기쌍 인식에 관하여 DNA 분자의 진정한 유사물질이다. 상기 PNA의 주쇄는 펩타이드 결합에 의해 연결된 N-(2-아미노에틸)-글리신 단위로 구성되며, 여기에서 상기 핵염기는 메틸렌 카보닐 결합에 의해 주쇄에 연결된다. 또 다른 주쇄는 1-탄소 연장된 피롤리딘 PNA 단량체를 포함한다(문헌[Govindaraju and Kumar (2005) Chem. Commun, 495-497]). PNA 분자의 주쇄는 하전된 포스페이트기를 함유하지 않으므로, PNA-RNA 하이브리드가 각각 RNA-RNA 또는 RNA-DNA 하이브리드보다 대개는 더 안정하다(문헌[Egholm et al. (1993) Nature 365, 566-568]).Preferred nucleotide analogs or equivalents include peptide nucleic acids (PNAs) having a modified polyamide backbone (Nielsen, et al . (1991) Science 254, 1497-1500). PNA-based molecules are concerned with base pair recognition. It is a true analogue of a DNA molecule.The main chain of the PNA is composed of N-(2-aminoethyl)-glycine units linked by peptide bonds, wherein the nucleobase is linked to the main chain by a methylene carbonyl bond. Another backbone comprises a 1-carbon extended pyrrolidine PNA monomer (Govindaraju and Kumar (2005) Chem. Commun, 495-497). Since the backbone of the PNA molecule does not contain a charged phosphate group, the PNA -RNA hybrids are usually more stable than RNA-RNA or RNA-DNA hybrids, respectively (Egholm et al . (1993) Nature 365, 566-568).

더욱 바람직한 주쇄는 모폴리노 뉴클레오타이드 유사체 또는 균등물을 포함하며, 여기에서 리보스 또는 데옥시리보스 당이 6-원 모폴리노 고리에 의해 교체된다. 가장 바람직한 뉴클레오타이드 유사체 또는 균등물은 포스포로다이아미데이트 모폴리노 올리고머(PMO)를 포함하며, 여기에서 상기 리보스 또는 데옥시리보스 당은 6-원 모폴리노 고리에 의해 교체되고, 인접한 모폴리노 고리들간의 음이온성 포스포다이에스터 결합은 비-이온성 포스포로다이아미데이트 결합에 의해 교체된다.A more preferred backbone comprises a morpholino nucleotide analog or equivalent, wherein a ribose or deoxyribose sugar is replaced by a 6-membered morpholino ring. Most preferred nucleotide analogues or equivalents include phosphorodiamidate morpholino oligomers (PMOs), wherein the ribose or deoxyribose sugar is replaced by a 6-membered morpholino ring and adjacent morpholino rings Anionic phosphodiester bonds between the rings are replaced by non-ionic phosphorodiamidate bonds.

추가의 바람직한 뉴클레오타이드 유사체 또는 균등물은 상기 포스포다이에스터 결합 중의 비-가교 산소 중 적어도 하나의 치환을 포함한다. 상기 변형은 염기-짝짓기를 약간 탈안정화시키지만 뉴클레아제 분해에 상당한 내성을 가한다. 바람직한 뉴클레오타이드 유사체 또는 균등물은 포스포로티오에이트, 키랄 포스포로티오에이트, 포스포로다이티오에이트, 포스포트라이에스터, 아미노알킬포스포트라이에스터, H-포스포네이트, 3'-알킬렌 포스포네이트, 5'-알킬렌 포스포네이트 및 키랄 포스포네이트를 포함한 메틸 및 다른 알킬 포스포네이트, 포스피네이트, 3'-아미노 포스포아미데이트 및 아미노알킬포스포아미데이트를 포함한 포스포아미데이트, 티오노포스포아미데이트, 티오노알킬포스포네이트, 티오노알킬포스포트라이에스터, 셀레노포스페이트 또는 보라노포스페이트를 포함한다.Further preferred nucleotide analogs or equivalents comprise substitution of at least one of the non-bridging oxygens in the phosphodiester linkage. This modification slightly destabilizes the base-pairing but adds considerable resistance to nuclease degradation. Preferred nucleotide analogs or equivalents are phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphorotriesters, aminoalkylphosphotriesters, H-phosphonates, 3′-alkylene phosphonates, 5 methyl and other alkyl phosphonates, including '-alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates, including 3'-amino phosphoramidates and aminoalkylphosphoamidates, thionophos sporamidates, thionoalkylphosphonates, thionoalkylphosphotriesters, selenophosphates or boranophosphates.

더욱 바람직한 뉴클레오타이드 유사체 또는 균등물은 2', 3' 및/또는 5' 위치에서 일- 또는 이치환된 하나 이상의 당 부분, 예를 들어 -OH; -F; 하나 이상의 헤테로원자에 의해 중단될 수 있는, 치환되거나 비치환된, 선형 또는 분지된 저급(C1-C10) 알킬, 알케닐, 알키닐, 알크아릴, 알릴, 아릴, 또는 아르알킬; O-, S-, 또는 N-알킬; O-, S-, 또는 N-알케닐; O-, S- 또는 N-알키닐; O-, S-, 또는 N-알릴; O-알킬-O-알킬, -메톡시, -아미노프로폭시; 아미노옥시, 메톡시에톡시; -다이메틸아미노옥시에톡시; 및 -다이메틸아미노에톡시에톡시를 포함한다. 상기 당 부분은 피라노스 또는 그의 유도체, 또는 데옥시피라노스 또는 그의 유도체, 바람직하게는 리보스 또는 그의 유도체, 또는 데옥시리보스 또는 그의 유도체일 수 있다. 상기와 같은 바람직한 유도체화된 당 부분은 잠김 핵산(LNA)(여기에서 2'-탄소 원자는 당 고리의 3' 또는 4' 탄소 원저에 결합되어 바이사이클릭 당 부분을 형성한다)을 포함한다. 바람직한 LNA는 2'-O,4'-C-에틸렌-가교된 핵산을 포함한다(문헌[Morita et al. 2001. Nucleic Acid Res Supplement No. 1: 241-242]). 이들 치환은 상기 뉴클레오타이드 유사체 또는 균등물을 RNase H 및 뉴클레아제 내성으로 만들고 표적에 대한 친화성을 증가시킨다.More preferred nucleotide analogs or equivalents include one or more sugar moieties mono- or disubstituted at the 2', 3' and/or 5' positions, for example -OH; -F; substituted or unsubstituted, linear or branched lower (C1-C10) alkyl, alkenyl, alkynyl, alkaryl, allyl, aryl, or aralkyl, which may be interrupted by one or more heteroatoms; O-, S-, or N-alkyl; O-, S-, or N-alkenyl; O-, S- or N-alkynyl; O-, S-, or N-allyl; O-alkyl-O-alkyl, -methoxy, -aminopropoxy; aminooxy, methoxyethoxy; -dimethylaminooxyethoxy; and -dimethylaminoethoxyethoxy. The sugar moiety may be pyranose or a derivative thereof, or deoxypyranose or a derivative thereof, preferably ribose or a derivative thereof, or deoxyribose or a derivative thereof. Preferred such derivatized sugar moieties include locked nucleic acids (LNAs), wherein the 2'-carbon atom is bonded to the 3' or 4' carbon atom of the sugar ring to form a bicyclic sugar moiety. Preferred LNAs include 2'-0,4'-C-ethylene-crosslinked nucleic acids (Morita et al . 2001. Nucleic Acid Res Supplement No. 1: 241-242). These substitutions render the nucleotide analog or equivalent resistant to RNase H and nuclease and increase affinity for the target.

본 발명의 상황에서 아미노산- 또는 핵산-서열의 "서열 일치성" 또는 "일치성"은 본 명세서에서 서열들간의 비교에 의해 측정된 바와 같은, 2개 이상의 아미노산(펩타이드, 폴리펩타이드, 또는 단백질) 서열 또는 2개 이상의 핵산(뉴클레오타이드, 올리고뉴클레오타이드, 폴리뉴클레오타이드) 서열간의 관계로서 정의된다. 당해 분야에서, "일치성"은 또한 경우에 따라, 서열들의 스트링간의 합치에 의해 측정된 바와 같은, 아미노산 또는 뉴클레오타이드 서열들간의 서열 관련성 정도를 의미한다. 본 발명 내에서, 특정 서열과의 서열 일치성은 바람직하게는 상기 특정 폴리펩타이드 또는 폴리뉴클레오타이드 서열의 전체 길이에 걸친 서열 일치성을 의미한다."Sequence identity" or "identity" of an amino acid- or nucleic acid-sequence in the context of the present invention means two or more amino acids (peptides, polypeptides, or proteins), as determined by comparison between the sequences herein. It is defined as a sequence or relationship between two or more nucleic acid (nucleotide, oligonucleotide, polynucleotide) sequences. In the art, "identity" also refers to the degree of sequence relatedness between amino acid or nucleotide sequences, as the case may be, as measured by the agreement between strings of sequences. Within the present invention, sequence identity with a particular sequence preferably means sequence identity over the entire length of said particular polypeptide or polynucleotide sequence.

2개의 아미노산 서열간의 "유사성"은 하나의 펩타이드 또는 폴리펩타이드의 아미노산 서열 및 그의 보존된 아미노산 치환체를 제2 펩타이드 또는 폴리펩타이드의 서열에 비교함으로써 측정된다. 바람직한 실시태양에서, 일치성 또는 유사성은 본 명세서에서 확인된 바와 같은 전체 서열(서열번호)에 걸쳐 산정된다. "일치성" 및 "유사성"은 공지된 방법, 예를 들어 비제한적으로 문헌[Computational Molecular Biology, Lesk, A. M., ed., Oxford University Press, New York, 1988]; 문헌[Biocomputing: Informatics and Genome Projects, Smith, D. W., ed., Academic Press, New York, 1993]; 문헌[Computer Analysis of Sequence Data, Part I, Griffin, A. M., and Griffin, H. G., eds., Humana Press, New Jersey, 1994]; 문헌[Sequence Analysis in Molecular Biology, von Heine, G., Academic Press, 1987]; 및 문헌[Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991]; 및 문헌[Carillo, H., and Lipman, D., SIAM J. Applied Math., 48:1073 (1988)]에 기재된 것들에 의해 쉽게 산정될 수 있다."Similarity" between two amino acid sequences is determined by comparing the amino acid sequence of one peptide or polypeptide and its conserved amino acid substitutions to the sequence of a second peptide or polypeptide. In a preferred embodiment, identity or similarity is assessed over the entire sequence (SEQ ID NO:) as identified herein. “Identity” and “similarity” are defined by known methods, including, but not limited to, Computational Molecular Biology, Lesk, A. M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D. W., ed., Academic Press, New York, 1993; Computer Analysis of Sequence Data, Part I, Griffin, A. M., and Griffin, H. G., eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heine, G., Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991; and Carillo, H., and Lipman, D., SIAM J. Applied Math., 48:1073 (1988).

일치성을 측정하기에 바람직한 방법은 시험된 서열들간의 최대의 합치를 제공하도록 설계된다. 일치성 및 유사성을 측정하기 위한 방법은 공개적으로 입수할 수 있는 컴퓨터 프로그램에 체계적으로 정리되어 있다. 2개 서열간의 일치성 및 유사성을 측정하기에 바람직한 컴퓨터 프로그램 방법은 예를 들어 GCG 프로그램 패키지(문헌[Devereux, J., et al., Nucleic Acids Research 12 (1): 387 (1984)]), BestFit, BLASTP, BLASTN, 및 FASTA(문헌[Altschul, S. F. et al., J. Mol. Biol. 215:403-410 (1990)])를 포함한다. 상기 BLAST X 프로그램은 NCBI 및 다른 소스로부터 공개적으로 입수할 수 있다(문헌[BLAST Manual, Altschul, S., et al., NCBI NLM NIH Bethesda, MD 20894]; 문헌[Altschul, S., et al., J. Mol. Biol. 215:403-410 (1990)]). 주지된 스미스 워터맨(Smith Waterman) 알고리즘을 또한 일치성 측정에 사용할 수 있다.Preferred methods for determining identity are designed to provide maximum agreement between the sequences tested. Methods for measuring concordance and similarity are systematically documented in publicly available computer programs. Preferred computer program methods for determining identity and similarity between two sequences include, for example, the GCG program package (Devereux, J., et al ., Nucleic Acids Research 12 (1): 387 (1984)), BestFit, BLASTP, BLASTN, and FASTA (Altschul, SF et al ., J. Mol. Biol. 215:403-410 (1990)). The BLAST X program is publicly available from NCBI and other sources (BLAST Manual, Altschul, S., et al ., NCBI NLM NIH Bethesda, MD 20894; Altschul, S., et al . , J. Mol. Biol. 215:403-410 (1990)). The well-known Smith Waterman algorithm can also be used to measure concordance.

폴리펩타이드 서열 비교에 바람직한 매개변수는 하기와 같다: 알고리즘: 니들맨 앤드 분취(Needleman and Wunsch)(문헌[J. Mol. Biol. 48:443-453 (1970)]); 비교 행렬: BLOSSUM62(문헌[Hentikoff and Hentikoff, Proc. Natl. Acad. Sci. USA. 89:10915-10919 (1992)]); 끊김 벌점: 12; 및 끊김 길이 벌점: 4. 이들 매개변수에 유용한 프로그램을 미국 위스콘신주 매디슨에 위치한 제네틱스 컴퓨터 그룹(Genetics Computer Group)으로부터 "Ogap" 프로그램으로서 공개적으로 입수할 수 있다. 상기 언급한 매개변수는 아미노산 비교를 위한 디폴트 매개변수(단부 끊김에 대한 벌점 없음과 함께)이다.Preferred parameters for polypeptide sequence comparison are: Algorithm: Needleman and Wunsch (J. Mol. Biol. 48:443-453 (1970)); Comparison matrix: BLOSSUM62 (Hentikoff and Hentikoff, Proc. Natl. Acad. Sci. USA. 89:10915-10919 (1992)); Break Penalty: 12; and Break Length Penalties: 4. Programs useful for these parameters are publicly available as the "Ogap" program from the Genetics Computer Group of Madison, Wisconsin. The parameters mentioned above are the default parameters for amino acid comparison (with no penalty for end breaks).

핵산 비교에 바람직한 매개변수는 하기와 같다: 알고리즘: 니들맨 앤드 분취(문헌[J. Mol. Biol. 48:443-453 (1970)]); 비교 행렬: 합치 = +10, 오합치 = 0; 끊김 벌점: 50; 끊김 길이 벌점: 3. 미국 위스콘신주 매디슨에 위치한 제네틱스 컴퓨터 그룹으로부터 끊김 프로그램으로서 입수할 수 있다. 핵산 비교용 디폴트 매개변수는 상기에 제공되어 있다.Preferred parameters for nucleic acid comparison are: Algorithm: Needleman and Preparative (J. Mol. Biol. 48:443-453 (1970)); Comparison matrix: agreement = +10, mismatch = 0; Break Penalty: 50; Break Length Penalties: 3. Available as a break program from Genetics Computer Group, Madison, Wisconsin, USA. Default parameters for nucleic acid comparison are provided above.

임의로, 아미노산 유사성 정도의 측정에서, 숙련가는 또한 상기 숙련가에게 명백한 바와 같이, 소위 "보존적" 아미노산 치환을 고려할 수 있다. 보존적 아미노산 치환은 유사한 측쇄를 갖는 잔기들의 호환성을 지칭한다. 예를 들어, 지방족 측쇄를 갖는 아미노산의 그룹은 글리신, 알라닌, 발린, 류신 및 아이소류신이고; 지방족-하이드록실 측쇄를 갖는 아미노산의 그룹은 세린 및 쓰레오닌이며; 아미드-함유 측쇄를 갖는 아미노산의 그룹은 아스파라진 및 글루타민이고; 방향족 측쇄를 갖는 아미노산의 그룹은 페닐알라닌, 타이로신 및 트립토판이며; 염기성 측쇄를 갖는 아미노산의 그룹은 리신, 아르기닌 및 히스티딘이고; 황-함유 측쇄를 갖는 아미노산의 그룹은 시스테인 및 메티오닌이다. 바람직한 보존적 아미노산 치환기는 발린-류신-아이소류신, 페닐알라닌-타이로신, 리신-아르기닌, 알라닌-발린, 및 아스파라진-글루타민이다. 본 명세서에 개시된 아미노산 서열의 치환적 변체는 상기 개시된 서열 중 적어도 하나의 잔기가 제거되었고 상이한 잔기가 대신에 삽입된 것들이다. 바람직하게, 상기 아미노산 변화는 보존적이다. 각각의 천연 아미노산에 대한 바람직한 보존적 치환은 하기와 같다: Ala에서 ser로; Arg에서 lys로; Asn에서 gln 또는 his로; Asp에서 glu로; Cys에서 ser 또는 ala로; Gln에서 asn로; Glu에서 asp로; Gly에서 pro로; His에서 asn 또는 gln로; Ile에서 leu 또는 val로; Leu에서 ile 또는 val로; Lys에서 arg로; gln 또는 glu로; Met에서 leu 또는 ile로; Phe에서 met, leu 또는 tyr로; Ser에서 thr로; Thr에서 ser로; Trp에서 tyr로; Tyr에서 trp 또는 phe로; 및 Val에서 ile 또는 leu로.Optionally, in determining the degree of amino acid similarity, the skilled person may also consider so-called "conservative" amino acid substitutions, as will be clear to the skilled artisan. Conservative amino acid substitutions refer to compatibility of residues with similar side chains. For example, a group of amino acids having aliphatic side chains is glycine, alanine, valine, leucine and isoleucine; A group of amino acids having aliphatic-hydroxyl side chains are serine and threonine; A group of amino acids having amide-containing side chains are asparagine and glutamine; A group of amino acids having aromatic side chains is phenylalanine, tyrosine and tryptophan; A group of amino acids having basic side chains are lysine, arginine and histidine; A group of amino acids having sulfur-containing side chains are cysteine and methionine. Preferred conservative amino acid substituents are valine-leucine-isoleucine, phenylalanine-tyrosine, lysine-arginine, alanine-valine, and asparagine-glutamine. Substitutional variants of the amino acid sequences disclosed herein are those in which at least one residue in the disclosed sequences has been removed and a different residue inserted instead. Preferably, said amino acid change is conservative. Preferred conservative substitutions for each natural amino acid are: Ala to ser; Arg to lys; Asn to gln or his; Asp to Glu; Cys to ser or ala; Gln to asn; Glu to asp; Gly to pro; His to asn or gln; Ile to leu or val; Leu to ile or val; Lys to arg; with gln or glu; Met to leu or ile; Phe to met, leu or tyr; Ser to thr; Thr to ser; Trp to tyr; Tyr to trp or phe; and from Val to ile or leu.

본 발명에 따른 폴리뉴클레오타이드를 뉴클레오타이드 서열에 의해 나타낸다. 본 발명에 따른 폴리펩타이드를 아미노산 서열에 의해 나타낸다. 본 발명에 따른 핵산 구조물은 천연 유전자로부터 단리되거나 또는 달리 자연에는 존재하지 않는 방식으로 결합되거나 병치된 폴리뉴클레오타이드의 분절을 함유하도록 변형된 폴리뉴클레오타이드로서 정의된다. 임의로, 본 발명에 따른 핵산 구조물 중에 존재하는 폴리뉴클레오타이드는 하나 이상의 조절 서열에 작동적으로 연결되며, 상기 서열은 숙주 세포 또는 무세포 시스템에서 암호화된 생성물의 생성 또는 발현을 지시한다.A polynucleotide according to the present invention is represented by a nucleotide sequence. A polypeptide according to the present invention is represented by an amino acid sequence. Nucleic acid constructs according to the present invention are defined as polynucleotides isolated from native genes or modified to contain segments of polynucleotides that are joined or juxtaposed in a manner not otherwise present in nature. Optionally, the polynucleotides present in the nucleic acid construct according to the invention are operably linked to one or more regulatory sequences, said sequences directing the production or expression of the encoded product in a host cell or cell-free system.

본 명세서에 제공된 바와 같은 서열 정보는 잘못 식별된 염기의 포함을 요하는 것으로 좁게 이해되어서는 안 된다. 숙련가는 상기와 같은 잘못 식별된 염기를 식별할 수 있으며 상기와 같은 오류를 어떻게 보정하는 지를 알고 있다.Sequence information as provided herein should not be construed narrowly as requiring the inclusion of misidentified bases. The skilled person can identify such misidentified bases and knows how to correct such errors.

본 발명의 모든 실시태양, 즉 본 발명에 따른 조성물, 발현의 조절 방법, 본 발명에 따른 조성물을 포함하는 숙주 세포, 본 발명에 따른 숙주 세포의 생성 방법, 본 발명에 따른 숙주 세포, 및 본 발명에 따른 관심 화합물의 생성 방법은 바람직하게는 무세포 시험관내 시스템이 아닌 숙주 세포를 참조한다; 즉, 본 발명에 따른 CRISPR-Cas 시스템은 바람직하게는 무세포 시험관내 시스템이 아닌 숙주 세포 시스템이다.All embodiments of the present invention, namely a composition according to the invention, a method for regulating expression, a host cell comprising the composition according to the invention, a method for producing a host cell according to the invention, a host cell according to the invention, and the present invention The method for producing a compound of interest according to the present invention preferably refers to a host cell rather than a cell-free in vitro system; That is, the CRISPR-Cas system according to the present invention is preferably a host cell system rather than a cell-free in vitro system.

본 발명의 모든 실시태양, 예를 들어 본 발명에 따른 조성물, 발현의 조절 방법, 본 발명에 따른 조성물을 포함하는 숙주 세포, 본 발명에 따른 숙주 세포의 생성 방법, 본 발명에 따른 숙주 세포, 및 본 발명에 따른 관심 화합물의 생성 방법에서, 상기 숙주 세포는 반수체, 이배체 또는 배수체 숙주 세포일 수 있다.All embodiments of the invention, for example a composition according to the invention, a method for regulating expression, a host cell comprising the composition according to the invention, a method for producing a host cell according to the invention, a host cell according to the invention, and In the method for producing the compound of interest according to the present invention, the host cell may be a haploid, diploid or diploid host cell.

본 발명에 따른 숙주 세포는 라비린툴로마이세테스 숙주 세포, 바람직하게는 트라우스토키트리알레스 목, 보다 바람직하게는 트라우스토키트리아세아에 과, 보다 바람직하게는 아우란티오키트리움, 오블롱기키트리움, 스키조키트리움, 트라우스토키트리움 및 울케니아로 이루어지는 그룹 중에서 선택되는 속의 일원, 훨씬 더 바람직하게는 스키조키트리움 종 ATCC# 20888이다.The host cell according to the present invention is a labyrinthulomycetes host cell, preferably of the order Traustochytrales, more preferably of the family Traustochytriaceae, more preferably of Aurantiochytrium, Oblongikit a member of the genus selected from the group consisting of lyum, Schizochytrium, Thraustochytrium and Ulkenia, even more preferably Schizochytrium sp. ATCC# 20888.

바람직하게는 게놈 중 변형은 본 명세서에서 하나 이상의 변형으로서 이해된다. 바람직하게는 본 발명에 따른 숙주 세포의 게놈 중 변형은Preferably a modification in the genome is understood herein as one or more modifications. Preferably, the modification in the genome of the host cell according to the invention comprises:

a) 모 숙주 세포에 재조합 유전자 조작 기법을 가하고/가하거나a) subjecting the parental host cell to recombinant genetic engineering techniques and/or

b) 모 숙주 세포에 (고전적인) 돌연변이유발을 가하고/가하거나;b) subjecting (classical) mutagenesis to the parental host cell;

c) 모 숙주 세포에 억제 화합물 또는 조성물을 가함c) adding an inhibitory compound or composition to the parental host cell

으로써 수행될 수 있다.can be performed by

숙주 세포의 게놈의 변형은 본 명세서에서 상기 숙주 세포의 게놈 중 폴리뉴클레오타이드 서열의 변화를 생성시키는 임의의 사건으로서 정의된다.A modification of the genome of a host cell is defined herein as any event that results in a change in the polynucleotide sequence in the genome of said host cell.

바람직하게, 본 발명에 따른 숙주 세포는 바람직하게는, 동일한 조건하에서 분석시, 변형되지 않은 모 숙주 세포와 비교하는 경우 본 명세서에서 정의된 바와 같은 불필요한 화합물을 생성시키지 않거나 감소시키는 그의 게놈 중의 변형을 갖는다.Preferably, the host cell according to the invention preferably exhibits modifications in its genome that, when assayed under the same conditions, do not produce or reduce unnecessary compounds as defined herein when compared to an unmodified parental host cell. have

변형을 당해 분야의 숙련가에게 공지된 임의의 수단에 의해, 예를 들어 고전적인 균주 개선, 무작위 돌연변이유발에 이은 선택에 의해 도입시킬 수 있다. 변형을 또한 부위-지향된 돌연변이유발에 의해 도입시킬 수 있다.Modifications can be introduced by any means known to those skilled in the art, for example by classical strain improvement, random mutagenesis followed by selection. Modifications can also be introduced by site-directed mutagenesis.

변형을 폴리뉴클레오타이드 서열 중 하나 이상의 뉴클레오타이드의 도입(삽입), 치환(교체), 또는 제거(결실)에 의해 수행할 수 있다. 폴리펩타이드와 같은 불필요한 화합물을 암호화하는 폴리뉴클레오타이드의 전체 또는 부분 결실을 성취할 수 있다. 불필요한 화합물은 본 명세서에서 달리 어딘가에 나열된 임의의 불필요한 화합물일 수 있으며; 상기는 또한 대사산물과 같은 불필요한 화합물의 생물학적 합성 경로에서 단백질 및/또는 효소일 수 있다. 한편으로, 상기 불필요한 화합물을 암호화하는 폴리뉴클레오타이드를, 상기 불필요한 화합물을 암호화하지 않거나 또는 상기 불필요한 화합물의 부분적으로 또는 완전히 불활성인 형태를 암호화하는 폴리뉴클레오타이드 서열로 부분적으로 또는 완전히 교체할 수 있다. 또 다른 대안에서, 하나 이상의 뉴클레오타이드를 상기 불필요한 화합물을 암호화하는 폴리뉴클레오타이드내에 삽입시켜 상기 폴리뉴클레오타이드의 붕괴 및 결과적으로 상기 붕괴된 폴리뉴클레오타이드에 의해 암호화된 상기 불필요한 화합물의 부분적인 또는 완전한 불활성화를 생성시킬 수 있다.Modifications may be effected by introduction (insertion), substitution (replacement), or removal (deletion) of one or more nucleotides in the polynucleotide sequence. Whole or partial deletions of polynucleotides encoding unnecessary compounds, such as polypeptides, can be achieved. An unnecessary compound may be any unnecessary compound listed elsewhere herein; They may also be proteins and/or enzymes in the biological synthesis pathway of undesirable compounds such as metabolites. Alternatively, the polynucleotide encoding the unnecessary compound may be partially or completely replaced with a polynucleotide sequence that does not encode the unnecessary compound or encodes a partially or completely inactive form of the unnecessary compound. In another alternative, one or more nucleotides may be inserted into the polynucleotide encoding the undesired compound to result in disruption of the polynucleotide and consequently partial or complete inactivation of the undesired compound encoded by the disrupted polynucleotide. can

하나의 실시태양에서, 본 발명에 따른 돌연변이 미생물 숙주 세포는 In one embodiment, the mutant microbial host cell according to the invention comprises

a) 불필요한 화합물을 암호화하는 폴리뉴클레오타이드의 완전한 또는 부분적인 결실,a) a complete or partial deletion of a polynucleotide encoding an unnecessary compound;

b) 불필요한 화합물을 암호화하지 않거나 또는 상기 불필요한 화합물의 부분적으로 또는 완전히 불활성인 형태를 암호화하는 폴리뉴클레오타이드 서열에 의한, 상기 불필요한 화합물을 암호화하는 폴리뉴클레오타이드의 완전한 또는 부분적인 교체,b) complete or partial replacement of the polynucleotide encoding the unnecessary compound by a polynucleotide sequence that does not encode the unnecessary compound or which encodes a partially or completely inactive form of the unnecessary compound;

c) 불필요한 화합물을 암호화하는 폴리뉴클레오타이드 서열 중 하나 이상의 뉴클레오타이드의 삽입에 의한 상기 폴리뉴클레오타이드의 붕괴 및 결과적인 상기 붕괴된 폴리뉴클레오타이드에 의한 상기 불필요한 화합물의 부분적인 또는 완전한 불활성화c) disruption of said polynucleotide by insertion of one or more nucleotides in a polynucleotide sequence encoding an undesired compound and consequent partial or complete inactivation of said undesired compound by said disrupted polynucleotide

중에서 선택된 그의 게놈 중의 변형을 포함한다.a modification in its genome selected from

상기 변형은 예를 들어 상기 불필요한 화합물의 전사 또는 번역에 필요한 암호화 서열 또는 조절 요소 중에 있을 수 있다. 예를 들어, 뉴클레오타이드를 정지 코돈의 도입, 개시 코돈의 제거 또는 암호화 서열의 개방 판독 프레임의 변화 또는 프레임-이동을 생성시키기 위해 삽입하거나 제거할 수 있다. 상기 암호화 서열 또는 그의 조절 요소의 변형을 부위-지향된 또는 무작위 돌연변이유발, DNA 셔플링 방법, DNA 조립 방법, 유전자 합성(예를 들어 문헌[Young and Dong (2004), Nucleic Acids Research 32(7) electronic access at nar.oupjournals.org/cgi/reprint/32/7/e59 or Gupta et al. (1968), Proc. Natl. Acad. Sci USA, 60: 1338-1344]; 문헌[Scarpulla et al. (1982), Anal. Biochem. 121: 356-365; Stemmer et al. (1995), Gene 164: 49-53]을 참조하시오), 또는 당해 분야에 공지된 방법에 따른 PCR 생성된 돌연변이유발에 의해 수행할 수 있다. 무작위 돌연변이유발 과정의 예는 당해 분야에 주지되어 있다, 예를 들어 화학적(예를 들어 NTG) 돌연변이유발 또는 물리적(예를 들어 UV) 돌연변이유발. 부위-지향된 돌연변이유발 과정의 예는 퀵체인지(QuickChange)(상표) 부위-지향된 돌연변이유발 키트(스트라타진 클로닝 시스템(Stratagene Cloning Systems), 미국 캘리포니아주 라졸라 소재), 디 얼터드 사이츠('The Altered Sites)(등록상표) II 시험관내 돌연변이유발 시스템스(프로메가 코포레이션(Promega Corporation)) 또는 문헌[Ho et al., "Site-directed mutagenesis by overlap extension using the polymerase chain reaction", Gene, 1989 Apr 15, 77(1):51-9]에 기재된 바와 같은 PCR을 사용하거나 또는 문헌[Molecular Biology: Current Innovations and Future Trends., Eds. A.M. Griffin and H.G. Griffin. ISBN 1-898486-01-8; 1995 Horizon Scientific Press, PO Box 1, Wymondham, Norfolk, U.K.]에 기재된 바와 같은 PCR을 사용하는 오버랩 연장에 의한 것이다.Such modifications may be in, for example, coding sequences or regulatory elements necessary for the transcription or translation of the unnecessary compound. For example, nucleotides may be inserted or removed to create a frame-shift or introduction of a stop codon, removal of an initiation codon, or a change in the open reading frame of the coding sequence. Modification of the coding sequence or its regulatory elements can be performed by site-directed or random mutagenesis, DNA shuffling methods, DNA assembly methods, gene synthesis (see, e.g., Young and Dong (2004), Nucleic Acids Research 32(7) electronic access at nar.oupjournals.org/cgi/reprint/32/7/e59 or Gupta et al . (1968), Proc. Natl. Acad. Sci USA, 60: 1338-1344]; Scarpulla et al . (1982), Anal. Biochem. 121: 356-365; Stemmer et al . (1995), Gene 164: 49-53), or by PCR generated mutagenesis according to methods known in the art. Examples of random mutagenesis procedures are well known in the art, eg chemical (eg NTG) mutagenesis or physical (eg UV) mutagenesis. Examples of site-directed mutagenesis procedures include the QuickChange(TM) site-directed mutagenesis kit (Stratagene Cloning Systems, La Jolla, CA), The Altered Sites ( 'The Altered Sites® II in vitro mutagenesis systems (Promega Corporation) or Ho et al ., "Site-directed mutagenesis by overlap extension using the polymerase chain reaction", Gene, 1989 Apr 15, 77(1):51-9, or as described in Molecular Biology: Current Innovations and Future Trends., Eds. AM Griffin and HG Griffin. ISBN 1-898486-01-8; 1995 Horizon Scientific Press, PO Box 1, Wymondham, Norfolk, UK by overlap extension using PCR.

바람직한 변형 방법은 재조합 유전자 조작 기법, 예를 들어 부분 또는 완전한 유전자 교체 또는 부분 또는 완전한 유전자 결실을 기반으로 한다.Preferred methods of modification are based on recombinant genetic engineering techniques, for example partial or complete gene replacement or partial or complete gene deletion.

예를 들어, 폴리뉴클레오타이드, 핵산 구조물 또는 발현 카세트의 교체의 경우에, 적합한 DNA 서열을 교체하려는 표적 유전자좌에 도입시킬 수 있다. 적합한 DNA 서열은 바람직하게는 클로닝 벡터상에 존재한다. 바람직한 통합적인 클로닝 벡터는 DNA 단편을 포함하며, 상기 단편은 폴리뉴클레오타이드에 상동성이고/이거나, 소정의 유전자좌에의 클로닝 벡터의 통합을 표적화하기 위해 교체시키고자 하는 유전자좌에 측면 인접한 폴리뉴클레오타이드에 상동성을 갖는다. 표적화된 통합을 촉진하기 위해서, 상기 클로닝 벡터를 바람직하게는 세포의 형질전환에 앞서 선형화시킨다. 바람직하게, 선형화를 상기 클로닝 벡터의 적어도 하나, 바람직하게는 어느 한 단부가, 교체시키려는 DNA 서열(또는 측면에 인접한 서열)에 상동성인 서열에 측면 인접하도록 수행한다. 상기 과정을 상동성 재조합이라 칭하며 상기 기법을 또한 (부분적인) 유전자 결실을 성취하기 위해 사용할 수 있다.For example, in the case of replacement of a polynucleotide, nucleic acid construct or expression cassette, a suitable DNA sequence can be introduced at the target locus to be replaced. Suitable DNA sequences are preferably present on a cloning vector. Preferred integrative cloning vectors include DNA fragments that are homologous to polynucleotides and/or homologous to polynucleotides flanking the locus to be replaced in order to target integration of the cloning vector to a given locus. has To facilitate targeted integration, the cloning vector is preferably linearized prior to transformation of the cells. Preferably, the linearization is carried out so that at least one, preferably either end, of the cloning vector is flanking a sequence homologous to the DNA sequence (or flanking sequence) to be replaced. This process is called homologous recombination and the technique can also be used to achieve (partial) gene deletion.

예를 들어, 내인성 폴리뉴클레오타이드에 상응하는 폴리뉴클레오타이드를 결함성 폴리뉴클레오타이드, 즉 (완전히 기능성인) 폴리펩타이드를 생성시키지 못하는 폴리뉴클레오타이드에 의해 교체시킬 수 있다. 상동성 재조합에 의해, 상기 결함성 폴리뉴클레오타이드는 상기 내인성 폴리뉴클레오타이드를 대체한다. 상기 결함성 폴리뉴클레오타이드는 또한 마커를 암호화하는 것이 바람직할 수 있으며, 상기 마커는 핵산 서열이 변형된 형질전환체의 선택에 사용될 수 있다.For example, a polynucleotide corresponding to an endogenous polynucleotide can be replaced by a defective polynucleotide, ie a polynucleotide that fails to produce a (fully functional) polypeptide. By homologous recombination, the defective polynucleotide replaces the endogenous polynucleotide. The defective polynucleotide may also preferably encode a marker, and the marker may be used for selection of transformants in which the nucleic acid sequence is modified.

불필요한 화합물을 생성시키지 않거나 상기 화합물의 생성을 감소시키는 변형을, 상이한 방법에 의해, 예를 들어 상기와 같은 불필요한 화합물에 대한 항체 또는 화학적 억제제 또는 단백질 억제제 또는 물리적 억제제에 의해 획득할 수 있다(문헌[Tour O. et al., "Genetically targeted chromophore-assisted light inactivation", (2003) Nat. Biotech]). 한편으로, 또는 상기 언급한 기법과 함께, 불필요한 화합물의 감소된 생성 또는 생성되지 않음을 또한 예를 들어 UV 또는 화학적 돌연변이유발(문헌[Lian et al., "Increase of docosahexaenoic acid production by Schizochytrium sp. through mutagenesis and enzyme assay" (2010) Appl Biochem Biotechnol 162(4):935-941])에 의해 또는 본 명세서에 기재된 바와 같은 불필요한 폴리펩타이드의 효소 활성을 억제하는 억제제의 사용(예를 들어 β-글루코시다제의 억제제로서 기능하는 노지리마이신(문헌[Ren et al., "Effect of biotin and cerulenin addition on DHA production by Schizochytrium sp." (2012-01) Chinese J Bioprocess Engineering])에 의해 획득할 수 있다.Modifications that do not produce or reduce the production of undesired compounds can be obtained by different methods, for example by antibodies or chemical inhibitors or protein inhibitors or physical inhibitors against such undesired compounds (see [ Tour O. et al ., "Genetically targeted chromophore-assisted light inactivation", (2003) Nat. Biotech]). On the one hand, or in combination with the aforementioned techniques, reduced or no production of unnecessary compounds can also be achieved, for example, by UV or chemical mutagenesis (Lian et al ., "Increase of docosahexaenoic acid production by Schizochytrium sp. through mutagenesis and enzyme assay" (2010) Appl Biochem Biotechnol 162(4):935-941]) or the use of inhibitors that inhibit the enzymatic activity of unwanted polypeptides as described herein (e.g. β-glucosida It can be obtained by nojirimycin (Ren et al ., "Effect of biotin and cerulenin addition on DHA production by Schizochytrium sp." (2012-01) Chinese J Bioprocess Engineering), which functions as an agent inhibitor.

본 발명의 하나의 실시태양에서, 본 발명에 따른 숙주 세포의 게놈 중 변형은 불필요한 화합물을 암호화하는 폴리뉴클레오타이드의 적어도 하나의 위치에서의 변형이다.In one embodiment of the invention, the modification in the genome of the host cell according to the invention is a modification at at least one position of the polynucleotide encoding an unnecessary compound.

화합물, 예를 들어 불필요한 화합물, 예를 들어 불필요한 폴리펩타이드 및/또는 효소의 생성에서 세포의 결함은 본 명세서에서, 바람직하게는 게놈 중에서, 동일한 조건하에서 분석시 상기 세포가 변형되지 않은 모 숙주 세포에 비해, a) 상기 불필요한 화합물을 덜 생성시키거나 상기 불필요한 화합물을 실질적으로 생성시키지 않고/않거나 b) 감소된 활성 또는 감소된 비활성을 갖는 상기 불필요한 화합물, 또는 활성을 갖지 않거나 비활성을 갖지 않는 불필요한 화합물, 및 상기 가능성들 중 하나 이상의 조합을 생성시키는 표현형 특징을 생성시키도록 변형된 돌연변이 미생물 숙주 세포로서 정의된다. Defects in a cell in the production of a compound, e.g., an unwanted compound, e.g., an unwanted polypeptide and/or enzyme, are defined herein, preferably in the genome, in an unmodified parental host cell when the cell is analyzed under identical conditions. In comparison, a) less or substantially no production of said undesired compound and/or b) said undesired compound having reduced activity or reduced specific activity, or an undesired compound having no activity or no specific activity, and a mutant microbial host cell that has been modified to produce a phenotypic characteristic that results in a combination of one or more of the above possibilities.

바람직하게, 본 발명에 따른 변형된 숙주 세포는, 변형되지 않았고 동일한 조건하에서 측정된 모 숙주 세포에 비해 1% 미만의 상기 불필요한 화합물, 적어도 5% 미만의 상기 불필요한 화합물, 적어도 10% 미만의 상기 불필요한 화합물, 적어도 20% 미만의 상기 불필요한 화합물, 적어도 30% 미만의 상기 불필요한 화합물, 적어도 40% 미만의 상기 불필요한 화합물, 적어도 50% 미만의 상기 불필요한 화합물, 적어도 60% 미만의 상기 불필요한 화합물, 적어도 70% 미만의 상기 불필요한 화합물, 적어도 80% 미만의 상기 불필요한 화합물, 적어도 90% 미만의 상기 불필요한 화합물, 적어도 91% 미만의 상기 불필요한 화합물, 적어도 92% 미만의 상기 불필요한 화합물, 적어도 93% 미만의 상기 불필요한 화합물, 적어도 94% 미만의 상기 불필요한 화합물, 적어도 95% 미만의 상기 불필요한 화합물, 적어도 96% 미만의 상기 불필요한 화합물, 적어도 97% 미만의 상기 불필요한 화합물, 적어도 98% 미만의 상기 불필요한 화합물, 적어도 99% 미만의 상기 불필요한 화합물, 적어도 99.9% 미만의 상기 불필요한 화합물, 또는 가장 바람직하게는 100% 미만의 상기 불필요한 화합물을 생성시킨다.Preferably, the modified host cell according to the present invention has less than 1% of said undesired compound, at least 5% less than said undesired compound, at least 10% less than said undesired compound compared to a parental host cell that is not modified and measured under the same conditions. compound, at least less than 20% of the undesired compound, at least less than 30% of the undesired compound, at least less than 40% of the undesired compound, at least less than 50% of the undesired compound, at least less than 60% of the undesired compound, at least 70% less than the undesired compound, at least less than 80% of the undesired compound, at least less than 90% of the undesired compound, at least less than 91% of the undesired compound, at least less than 92% of the undesired compound, at least less than 93% of the undesired compound , at least less than 94% of the undesired compound, at least less than 95% of the undesired compound, at least less than 96% of the undesired compound, at least less than 97% of the undesired compound, at least less than 98% of the undesired compound, at least less than 99% of, at least 99.9% of the unwanted compounds, or most preferably less than 100% of the unwanted compounds.

본 명세서에 제공된 바와 같은 서열 정보는 잘못 식별된 염기의 포함을 요하는 것으로 좁게 이해되어서는 안 된다. 숙련가는 상기와 같은 잘못 식별된 염기를 식별할 수 있으며 상기와 같은 오류를 어떻게 보정하는 지를 알고 있다.Sequence information as provided herein should not be construed narrowly as requiring the inclusion of misidentified bases. The skilled person can identify such misidentified bases and knows how to correct such errors.

본 명세서에 제시된 각각의 참고문헌의 개시는 내용 전체가 본 명세서에 참고로 인용된다.The disclosure of each reference presented herein is incorporated herein by reference in its entirety.

본 발명을 하기의 실시예들에 의해 추가로 예시한다:The invention is further illustrated by the following examples:

실시예Example

실시예 1Example 1

형질전환을 위한 Cas9 벡터의 설계 및 형성Design and formation of Cas9 vectors for transformation

스트렙토코커스 피오게네스 MGAS5005 Cas9 서열을 스키조키트리움에서 발현을 위해 코돈-최적화하였다. 사용된 기본 Cas9 펩타이드 서열은 기재된 HA-태그 또는 GFP 융합 없이, 문헌[Jinek et al., 2013]에 사용된 바와 근본적으로 동일하였다(하기를 참조하시오). 상기 Cas9를 합성한 후에(DNA2.0, 미국 캘리포니아주 뉴워크 소재), 상기를 파로모마이신 선택 카세트를 함유하는 pCL122 벡터내로 클로닝하여 pCL122-Cas9(서열번호 )로 표기되는 벡터를 생성시켰다. OrfA/Pfa1 측면을 합성하고 pCL121(제오신 선택 카세트를 가졌다)내에 클로닝하고(젠스크립트(Genscript) USA, 미국 뉴저지주 피스카타웨이 소재) 최종 벡터를 pYB31(서열번호 2)로 표기하였다; 예를 들어 미국특허 제 8,940,884 호를 참조하시오. 상기 Cas9 유전자를, BamHI 및 NdeI 제한 부위 및 효소(뉴잉글랜드 바이오랩스(New England Biolabs), 미국 매사추세츠주 입스위치 소재)(도 3) 및 래피드 DNA 라이게이션(Rapid DNA Ligation) 키트(롯슈(Roche), 스위스 리치-로트크로이츠 소재)를 사용하여 절단 및 결찰을 통해 pCL122-Cas9 벡터로부터 pYB31 벡터내로 서브클로닝하였다. NEB10β 화학적 수용성 세포를 형질전환시키고(뉴잉글랜드 바이오랩스) 생성 콜로니를, 프라이머 121 Tub seq F(서열번호 13) 및 pYB32/3C R1(서열번호 14)(표 2) 및 5% DMSO(v:v)와 함께 선택된 콜로니, GoTaq 그린 마스터 믹스(프로메가(Promega), 미국 일리노이주 시카고 소재)를 사용하여 콜로니 PCR에 의해 선별하였으며 하기의 순환 조건을 상기 반응에 적용하였다: 95 ℃ 5분, (95 ℃ 30초, 59 ℃ 30초, 72 ℃ 1분) x 35주기, 72 ℃ 5분(도 4). 플라스미드를 PCR에 의해 양성 콜로니로부터 단리하고, 서열분석하고, 하나의 생성 벡터를 pYB32(서열번호 3)로 표기하였다.The Streptococcus pyogenes MGAS5005 Cas9 sequence was codon-optimized for expression in Schizochytrium. The basic Cas9 peptide sequence used was essentially identical to that used in Jinek et al., 2013, without the described HA-tag or GFP fusion (see below). After synthesizing the Cas9 (DNA2.0, Newark, CA), it was cloned into the pCL122 vector containing the paromomycin selection cassette to generate a vector designated as pCL122-Cas9 (SEQ ID NO: ). OrfA/Pfa1 flanking was synthesized and cloned into pCL121 (with Zeocin selection cassette) (Genscript USA, Piscataway, NJ) and the final vector was designated pYB31 (SEQ ID NO: 2); See, for example, US Pat. No. 8,940,884. The Cas9 gene was combined with BamHI and Nde I restriction sites and enzymes (New England Biolabs, Ipswich, MA) ( FIG. 3 ) and Rapid DNA Ligation kit (Rosche ( Roche), Ritchie-Rotkreuz, Switzerland) was subcloned from the pCL122-Cas9 vector into the pYB31 vector via cleavage and ligation. NEB10β chemically soluble cells were transformed (New England Biolabs) and the resulting colonies were harvested with primers 121 Tub seq F (SEQ ID NO: 13) and pYB32/3C R1 (SEQ ID NO: 14) (Table 2) and 5% DMSO (v:v). ) and selected colonies were selected by colony PCR using GoTaq Green Master Mix (Promega, Chicago, IL) and the following cycling conditions were applied to the reaction: 95° C. 5 min, (95 ℃ 30 sec, 59 ℃ 30 sec, 72 ℃ 1 min) x 35 cycles, 72 5 min (Fig. 4). Plasmids were isolated from positive colonies by PCR and sequenced, and one resulting vector was designated pYB32 (SEQ ID NO: 3).

프라이머 명칭Primer name 서열번호SEQ ID NO: 프라이머 서열, 5′→ 3′Primer sequence, 5'→ 3' 121 Tub seq F121 Tub seq F 1313 GGATCTCATGCTGGAGTTCTTCGGATCTCATGCTGGAGTTCTTC pYB32/3C R1pYB32/3C R1 1414 GTACTTCTCGTGGTAGGCAACCGTACTTCTCGTGGTAGGCAACC

앞서 언급한 바와 같이, 다른 연구들은 Cas9 발현의 수준이 형질전환체 생육성 및 뉴클레아제 활성의 최적화에 중요함을 지적하였다. 상기 pYB32 플라스미드에서, 상기 Cas9 유전자는 중간 강도 프로모터인 알파 튜불린 프로모터의 조절하에 있다. 카로틴 신타제(CS) 유전자로부터의 약한 프로모터를 또한 시험하고자 선택하였으며 인퓨전(InFusion) PCR 클로닝(클론테크/타카라 바이오 USA 인코포레이티드(Clontech/Takara Bio USA Inc.), 미국 캘리포니아주 마운틴뷰 소재)을 통해 pYB31내로 클로닝하였다. pYB31 플라스미드를 인퓨전 클로닝을 위한 준비에서 KpnI 및 NdeI 효소(뉴잉글랜드 바이오랩스)로 절단하였다. CS 프로모터를, 5% DMSO(v:v), KOD 핫스타트 마스터 믹스(HotStart Master Mix)(EMD 밀리포어(Millipore), 미국 매사추세츠주 빌레리카 소재) 및 주형으로서 사용된, CS 프로모터를 갖는 플라스미드 pTH043과 함께, CS 프로 KpnI IF F1(서열번호 15) 및 CS 프로 BamHI IF R1 프라이머(서열번호 16)(표 3)로 증폭시켰다. 반응 순환 조건은 하기와 같았다: 95 ℃ 2분, (95 ℃ 20초, 66 ℃ 10초, 70 ℃ 20초) x 35주기, 70 ℃ 2분. Cas9를, 5% DMSO(v:v), KOD 핫스타트 마스터 믹스, 및 주형으로서 사용된 pCL122-Cas9와 함께, CS 프로 BamHI IF F2(서열번호 17) 및 CS 프로 NdeI IF R2(서열번호 18) 프라이머(표 3)로 증폭시켰다. 반응 순환 조건은 하기와 같았다: 95 ℃ 2분, (95 ℃ 20초, 65 ℃ 10초, 70 ℃ 1분 45초) x 35주기, 70 ℃ 2분.As mentioned previously, other studies have indicated that the level of Cas9 expression is important for optimization of transformant viability and nuclease activity. In the pYB32 plasmid, the Cas9 gene is under the control of the alpha-tubulin promoter, which is a medium-strength promoter. A weak promoter from the carotene synthase (CS) gene was also selected for testing and was subjected to InFusion PCR cloning (Clontech/Takara Bio USA Inc., Mountain View, CA, USA). ) into pYB31. pYB31 the plasmid in preparation for infusion cloned was digested with Kpn I and Nde I enzyme (New England Bio Labs). Plasmid pTH043 with CS promoter, 5% DMSO (v:v), KOD HotStart Master Mix (EMD Millipore, Billerica, MA) and CS promoter used as template together with CS pro KpnI IF F1 (SEQ ID NO: 15) and CS pro BamHI IF R1 primers (SEQ ID NO: 16) (Table 3). Reaction cycle conditions were as follows: 95 °C 2 min, (95 °C 20 sec, 66 °C 10 sec, 70 °C 20 sec) x 35 cycles, 70 °C 2 min. Cas9, with 5% DMSO (v:v), KOD hotstart master mix, and pCL122-Cas9 used as template, CS pro BamHI IF F2 (SEQ ID NO: 17) and CS pro NdeI IF R2 (SEQ ID NO: 18) Amplified with primers (Table 3). Reaction cycle conditions were as follows: 95 °C 2 min, (95 °C 20 sec, 65 °C 10 sec, 70 °C 1 min 45 sec) x 35 cycles, 70 °C 2 min.

프라이머 명칭Primer name 서열 번호 SEQ ID NO: 프라이머 서열, 5′→ 3′Primer sequence, 5'→ 3' CS pro Kpn IF F1CS pro Kpn IF F1 1515 GTCTGAATTCCCGGGGTACCGAGCGGGCGATTCCACCGTCGTCTGAATTCCCGGGGTACCGAGCGGGCGATTCCACCGTC CS pro BamH IF R1CS pro BamH IF R1 1616 GTACTTCTTATCCATGGATCCCTCGGTCTCCGAGCGAGCGAGGTACTTCTTATCCATGGATCCCTCGGTCTCCGAGCGAGCGAG CS pro BamH IF F2CS pro BamH IF F2 1717 TCGCTCGCTCGGAGACCGAGGGATCCATGGATAAGAAGTACTCGCTCGCTCGGAGACCGAGGGATCCATGGATAAGAAGTAC CS pro Nde IF R2CS pro Nde IF R2 1818 GATTCACTAGTTTAGATCATATGTTAGACCTTGCGCTTCTTCTTAGGGTCCGATTCACTAGTTTAGATCATATGTTAGACCTTGCGCTTCTTCTTAGGGTCC

PCR 산물을 1% 아가로스 젤(론자(Lonza), 스위스 바젤 소재)상에서 실행시키고, 예상된 크기의 밴드를 절제하고 QIAquick 젤 추출 키트(퀴아겐(Qiagen), 독일 힐덴 소재)를 사용하여 젤 정제하였다(도 3). 인퓨전 클로닝에 대한 제조사의 프로토콜에 따라, 상기 Cas9 PCR 단편, CS 프로모터 PCR 단편 및 선형화된 pYB31 단편을 결찰하고, NEB10β 화학적 수용성 세포를 형질전환시키고, 생성 콜로니를 상술한 바와 같은 콜로니 PCR에 의해 선별하였다(도 4)(72 ℃에서 1분 30초 연장 시간으로). 서열 확인 후에, 상기 CS 프로모터에 작동적으로 연결된 Cas9를 함유하는 생성 벡터들 중 하나를 pYB33(서열번호 4)으로 표기하였다.PCR products were run on 1% agarose gel (Lonza, Basel, Switzerland), bands of the expected size were excised and gel purified using the QIAquick gel extraction kit (Qiagen, Hilden, Germany). (FIG. 3). Following the manufacturer's protocol for infusion cloning, the Cas9 PCR fragment, CS promoter PCR fragment and linearized pYB31 fragment were ligated, NEB10β chemically soluble cells were transformed, and the resulting colonies were selected by colony PCR as described above. (Fig. 4) (with an extension time of 1 min 30 s at 72 °C). After sequence confirmation, one of the production vectors containing Cas9 operably linked to the CS promoter was designated pYB33 (SEQ ID NO: 4).

pYB32 및 pYB33의 클로닝을 절단, 결찰 및 인퓨전 PCR 클로닝의 조합을 통해 수행하였다. 도 3에 도시된 바와 같이, 단편들이 모두 생성되었으며 컬럼 정제후 예상된 크기의 밴드를 가졌다. 절단된 플라스미드 및 PCR 증폭된 단편을 1% 아가로스 젤상에서 실행시켰다. 모든 레인에서 하기와 같이 예상된 크기의 단편들이 관찰되었다: pYB31-BamHI+NdeI=838 bp + 6555 bp, pYB31-KpnI+NdeI=1289bp + 6104bp, Cas9-BamHI+NdeI=4157bp + 5773bp, CS PCR 단편=1046 bp, Cas9 PCR 단편=4189 bp. 상기 젤 및 모든 후속 젤들에 사용된 분자량 마커는 DNA 콴티-래더(Quanti-Ladder)(오리진(Origene), 미국 메릴랜드주 록빌 소재)였으며, 단편 크기를 도 3의 우측 패널에 나타낸다.Cloning of pYB32 and pYB33 was performed through a combination of cleavage, ligation and infusion PCR cloning. As shown in FIG. 3 , all fragments were generated and had bands of the expected size after column purification. The digested plasmid and PCR amplified fragments were run on 1% agarose gel. Fragments of the expected size were observed in all lanes as follows: pYB31-BamHI+NdeI=838 bp+6555 bp, pYB31-KpnI+NdeI=1289bp+6104bp, Cas9-BamHI+NdeI=4157bp+5773bp, CS PCR fragment =1046 bp, Cas9 PCR fragment=4189 bp. The molecular weight marker used for this gel and all subsequent gels was DNA Quanti-Ladder (Origene, Rockville, MD), and the fragment size is shown in the right panel of FIG. 3 .

pYB32 및 pYB33 세균 형질전환으로부터 생성되는 콜로니를 콜로니 PCR에 의해 분석하였으며, 예상된 크기의 증폭산물을 갖는 콜로니들을 도 4에 굵은 글씨로 표시한다. 이들 콜로니는 pYB32-2, -4, -9, -14, -15, -24, -25, -28, 및 pYB33-16 및 pYB33D-1, -14이다. pYB32-2 및 pYB33-16이 정확한 서열을 갖는 것으로 확인된 것들 중에 있었으며 이들을 후속 연구에 사용하였다. pYB32 및 pYB33의 클로닝으로부터의 콜로니 PCR 결과들에 대해서, 예상된 증폭산물 크기는 하기와 같다: pYB32=1028 bp, pYB33=1584bp.Colonies generated from pYB32 and pYB33 bacterial transformation were analyzed by colony PCR, and colonies having an amplification product of the expected size are indicated in bold in FIG. 4 . These colonies are pYB32-2, -4, -9, -14, -15, -24, -25, -28, and pYB33-16 and pYB33D-1, -14. pYB32-2 and pYB33-16 were among those identified to have the correct sequence and were used for subsequent studies. For the colony PCR results from the cloning of pYB32 and pYB33, the expected amplification product sizes were: pYB32=1028 bp, pYB33=1584 bp.

실시예 2Example 2

Cas9 형질전환체의 형질전환 및 선택Transformation and selection of Cas9 transformants

스키조키트리움 종의 야생형 균주, ATCC 20888을 하기에 기재하는 바와 같은 입자 충격 방법 및 유전자총 장비(바이오래드(Bio-Rad), 미국 노쓰캐롤라이나주 롤리 소재)를 사용하여 pYB32 및 pYB33에 의한 형질전환에 사용하였다. 간단히, 20888을 27 ℃에서 250 ㎖ 편평-바닥 삼각 플라스크 중의 25 ㎖의 M50-20 배지(예를 들어 미국특허 제 8,003,772 호를 참조하시오)에서, 200 rpm에서 밤새 진탕시키면서 증식시켰다. 상기에 이어서, 상기 배양물을 250 ㎖ 진탕형 플라스크 중의 50 ㎖의 M2B 배지(예를 들어 미국특허 제 8,003,772 호를 참조하시오)에 1/100 희석하고 동일한 조건하에서 밤새 증식시켰다. 상기 스키조키트리움 배양물이 초기 로그기(0.6-2 OD 단위/㎖)에 있었을 때, 배양물을 3,220 x g에서 10분간 원심분리에 의해 수확하였다. 상등액을 경사분리하고, 펠릿을 M2B에 20 OD 단위/㎖의 최종 농도로 재현탁시키고 100 ㎕의 생성 세포 현탁액을 비-선택성 M2B 아가 플레이트(대략적으로 4 ㎝ 직경)의 대략 1/3상에 환상 움직임으로 도말하였다. 플라스미드 pYB32 및 pYB33을 25 ℃에서 밤새 SwaI로 절단하고 QIAquick PCR 정제 키트(퀴아겐, 독일 힐덴 소재)를 사용하여 정제하였다. 이어서 5 ㎍의 각각의 정제된 플라스미드를 50 ㎕의 2.5 M CaCl2, 20 ㎕의 0.1 M 스페르미딘 및 50 ㎕의 제조된 M10 텅스텐 비드(제조사의 프로토콜에 따라, 바이오 래드)와 혼합하고, 1분간 와동시키고, 이어서 실온에서 10분간 배양하여 상기 비드를 침전되게 하였다. DNA-코팅된 비드를 250 ㎕의 100% 에탄올로 1회 세척하고, 이어서 비드를 60 ㎕의 100% 에탄올에 재현탁시켰다. 각각의 제조된 거대담체(거대담체 조립 제조에 대한 제조사의 프로토콜에 따라)는 거대담체 디스크의 중심에 스폿팅된 에탄올 중의 코팅된 비드 10 ㎕를 가졌으며 에탄올은 건조되었다. 파열판 홀더는 70% 아이소프로판올 중에서 짧은 멸균 후 내부에 놓은 1,100 psi-지정된 파열판을 가졌으며, 조립된 거대담체를 꼭대기 선반 위치의 유전자총 플랫폼에 설치하고 스키조키트리움의 세포 패치가 있는 M2B 아가 플레이트를 상기 꼭대기로부터 세 번째 선반상에 세포쪽을 위로 하여 놓았다. 진공이 상기 유전자총 챔버내부에서 약 27 psi에 도달했으면, 파열판이 파손될 때까지 헬륨을 발사하고, 헬륨 흐름을 차단시키고, 상기 챔버를 대기로 배기시키고, 충격판을 상기 챔버로부터 제거하였다. 상기 충격 과정을 모든 샘플 및 대조용에 대해 반복하였으며, pYB32로 충격된 스키조키트리움 종 20888을 T188로 표기하고, pYB33으로 충격된 스키조키트리움 배양물은 T189로 표기하였다. 충격판을 27 ℃에서 4시간 동안 배양하고, 그 후에 세포를 상기 플레이트로부터 약 1 ㎖의 M2B로 세척하고 0.4% 무작위-메틸화된 β-사이클로덱스트린(CTD 홀딩스 인코포레이티드(Holdings, Inc.), 미국 플로리다주 하이스프링스 소재) 중의 0.5 mM DHA(누첵 프렙(Nu-Chek Prep), 미국 미네소타주 워터빌 소재) 및 50 ㎍/㎖ 제오신(써모피셔 사이언티픽(ThermoFisher Scientific), 미국 매사추세츠주 월썸 소재)을 함유하는 4개의 M2B 아가 플레이트들간에 균등하게 분배하였다. 세포를 3 ㎜ 멸균 유리 비드로 도말하고, 비드를 제거하고 플레이트를 싸고 27 ℃에서 약 7 내지 10일 동안 배양하였다. 콜로니의 크기가 2 내지 4 밀리미터에 도달했을 때, 각각의 콜로니를 DHA(상술한 바와 같이 무작위로 메틸화된 β-사이클로덱스트린과 복합체화되었다)와 함께 또는 상기 없이 100 ㎍/㎖ 제오신을 함유하는 M2B 아가 플레이트상에 반복해서 덧대어 OrfA/Pfa1 유전자의 붕괴에 의해 유도된 영양요구변이를 선택하였다.Transformation of a wild-type strain of Schizochytrium sp., ATCC 20888 with pYB32 and pYB33 using the particle bombardment method and gene gun equipment (Bio-Rad, Raleigh, NC) as described below was used for Briefly, 20888 was grown in 25 ml of M50-20 medium (see, eg, US Pat. No. 8,003,772) in a 250 ml flat-bottomed Erlenmeyer flask at 27° C. at 200 rpm with shaking overnight. Subsequently, the culture was diluted 1/100 in 50 ml of M2B medium (see, for example, US Pat. No. 8,003,772) in a 250 ml shake flask and grown overnight under the same conditions. When the Schizochytrium culture was in the initial log phase (0.6-2 OD units/ml), the culture was harvested by centrifugation at 3,220×g for 10 minutes. The supernatant is decanted, the pellet is resuspended in M2B to a final concentration of 20 OD units/ml and 100 μl of the resulting cell suspension is annularized onto approximately 1/3 of a non-selective M2B agar plate (approximately 4 cm diameter). smeared with movement. Plasmids pYB32 and pYB33 were digested with Swa I at 25° C. overnight and purified using the QIAquick PCR purification kit (Qiagen, Hilden, Germany). Then, 5 μg of each purified plasmid was mixed with 50 μl of 2.5 M CaCl 2 , 20 μl of 0.1 M spermidine and 50 μl of the prepared M10 tungsten beads (Bio-Rad, according to the manufacturer's protocol), and 1 The beads were allowed to settle by vortexing for minutes and then incubating for 10 minutes at room temperature. The DNA-coated beads were washed once with 250 μl of 100% ethanol, and then the beads were resuspended in 60 μl of 100% ethanol. Each prepared macrocarrier (according to the manufacturer's protocol for macrocarrier assembly preparation) had 10 μl of coated beads in ethanol spotted on the center of a macrocarrier disc and the ethanol was dried. The bursting disk holder had a 1,100 psi-designated bursting disk placed inside after short sterilization in 70% isopropanol, the assembled macrocarrier was installed on the gene gun platform in the top shelf position, and an M2B agar plate with cell patches of Schizochytrium was installed. Placed cell side up on the third shelf from the top. Once the vacuum reached about 27 psi inside the gene gun chamber, helium was fired until the rupture disk broke, the helium flow was shut off, the chamber vented to atmosphere, and the shock plate was removed from the chamber. The above bombardment procedure was repeated for all samples and controls, and the pYB32 bombarded Schizochytrium species 20888 was designated T188 and the pYB33 bombarded Schizochytrium culture was designated T189. The shock plate was incubated at 27° C. for 4 hours, after which time the cells were washed from the plate with about 1 ml of M2B and 0.4% random-methylated β-cyclodextrin (CTD Holdings, Inc.) , High Springs, FL, USA) in 0.5 mM DHA (Nu-Chek Prep, Waterville, Minn.) and 50 μg/ml Zeocin (ThermoFisher Scientific, Waltham, Mass.) material) was equally distributed among 4 M2B agar plates containing Cells were plated with 3 mm sterile glass beads, beads removed, plates were wrapped, and incubated at 27° C. for about 7-10 days. When the size of the colonies reached 2-4 millimeters, each colony was treated with or without DHA (which was complexed with randomly methylated β-cyclodextrin as described above) containing 100 μg/ml zeocin. The auxotrophs induced by the disruption of the OrfA/Pfa1 gene were selected by repeating over M2B agar plates.

DHA에 대해 영양요구성인 것으로 확인된 콜로니를 고르고, 250 ㎖ 편평 바닥 삼각 플라스크 중의 40% β-사이클로덱스트린 및 50 mM DHA 용액 500 ㎕가 보충된 50 ㎖의 M50-20내에 접종하고, 27 ℃ 진탕기에서 200 rpm에서 48시간 동안 두었다. 48시간 배양후에, 2 ㎖의 배양물을 5,000xg에서 10분간 원심분리에 의해 수집하고, 상등액을 경사분리하고, 펠릿을 변형된 페놀-클로로포름 추출에 따른 게놈 DNA 단리에 사용하였다. 게놈 DNA를 추출하고 GoTaq 그린 마스터믹스(프로메가, 미국 노쓰캐롤라이나주 더럼 소재)와 함께 PCR 주형으로서 사용하고 pYB32 및 pYB33에 대한 콜로니 PCR 선별에 대해 기재된 바와 같은 프라이머, GoTaq 그린 마스터 믹스 및 순환 조건을 사용하여 제오신 카세트 및 Cas9 유전자의 존재를 확인하였다(도 5). 상기 T188 및 T189 형질전환체 중의 OrfA/Pfa1 유전자의 붕괴를 PCR에 의해 확인하였다. 구체적으로, 5' 측면인접 영역을, O A1-KO F(서열번호 19) 및 pYB32/3 SV40 R1(서열번호 20) 프라이머(표 4), GoTaq 그린 마스터믹스, 5% DMSO(v:v) 및 주형으로서 T188/T189 gDNA와 함께 PCR에 의해 탐문하였으며, 순환 조건은 하기와 같았다: 95 ℃ 2분, (95 ℃ 30초, 60 ℃ 30초, 72 ℃ 2분 15초) x 35주기, 72 ℃ 5분. 더욱 또한, 3' 측면인접 영역을, O A1-KO R(서열번호 21) 및 pYB32/2 CF1(서열번호 22) 프라이머(표 4), GoTaq 그린 마스터믹스, 5% DMSO(v:v) 및 주형으로서 T188/T189 gDNA와 함께 PCR에 의해 탐문하였으며, 순환 조건은 하기와 같았다: 95 ℃ 2분, (95 ℃ 30초, 59 ℃ 30초, 72 ℃ 2분) x 35주기, 72 ℃ 5분(도 6).Colonies found to be auxotrophic for DHA are picked and inoculated into 50 ml M50-20 supplemented with 500 μl of 40% β-cyclodextrin and 50 mM DHA solution in 250 ml flat bottom Erlenmeyer flasks, 27° C. shaker at 200 rpm for 48 h. After 48 h of incubation, 2 ml of the culture was collected by centrifugation at 5,000×g for 10 min, the supernatant decanted and the pellet used for genomic DNA isolation following modified phenol-chloroform extraction. Genomic DNA was extracted and used as PCR template with GoTaq Green Mastermix (Promega, Durham, NC) and primers, GoTaq Green Master Mix and cycling conditions as described for colony PCR selection for pYB32 and pYB33 were applied. was used to confirm the presence of the zeocin cassette and Cas9 gene (FIG. 5). The disruption of the OrfA/Pfa1 gene in the T188 and T189 transformants was confirmed by PCR. Specifically, the 5' flanking region was prepared using O A1-KO F (SEQ ID NO: 19) and pYB32/3 SV40 R1 (SEQ ID NO: 20) primers (Table 4), GoTaq Green Mastermix, 5% DMSO (v: v) and T188/T189 gDNA as template, and cycled conditions were as follows: 95 °C 2 min, (95 °C 30 sec, 60 °C 30 sec, 72 °C 2 min 15 sec) x 35 cycles, 72 C 5 min. Furthermore, the 3' flanking region was prepared with O A1-KO R (SEQ ID NO: 21) and pYB32/2 CF1 (SEQ ID NO: 22) primers (Table 4), GoTaq Green Mastermix, 5% DMSO (v: v) and It was interrogated by PCR with T188/T189 gDNA as a template, and the cycling conditions were as follows: 95 °C 2 min, (95 °C 30 sec, 59 °C 30 sec, 72 °C 2 min) x 35 cycles, 72 °C 5 min (Fig. 6).

프라이머 명칭Primer name 서열번호SEQ ID NO: 프라이머 서열, 5′→ 3′Primer sequence, 5'→ 3' O A1-KO FO A1-KO F 1919 CCAAGTTCGCCAAGGCTTCCCAAGTTCGCCAAGGCTTC pYB32/3 SV40 R1pYB32/3 SV40 R1 2020 GTGGAATCGAAATCTCGTAGCACGTGGAATCGAAATCTCGTAGCAC O A1-KO RO A1-EN R 2121 GCTGTTGCAACTTTGCTCCACGCTGTTGCAACTTTGCTCCAC pYB32/3C F1pYB32/3C F1 2222 GTTAAGAAGACCGAGGTCCAGACGTTAAGAAGACCGAGGTCCAGAC

Cas9 유전자 통합 및 OrfA/Pfa1에서의 붕괴에 대한 PCR에 의한 양성 배양물을, 무작위로 메틸화된 β-사이클로덱스트린 및 DHA가 보충된 25 ㎖의 M50-20에 접종하고, 앞서 기재된 바와 같이 24시간 동안 증식시켰다. 상기 배양물의 OD600을 측정하고, 각 배양물의 2 OD 단위를, 250 ㎖ 진탕형 삼각 플라스크 중의 무작위로 메틸화된 β-사이클로덱스트린 및 DHA가 보충된 50 ㎖의 SPFM, pH 6.75에 접종하고 27 ℃ 진탕기에서 200 rpm으로 48시간 동안 증식시킨 후에 5% 글리세롤(v:v)을 저온보존을 위해 가하였다. 2개의 클론, 각각 형질전환 그룹으로부터 하나씩을 추가의 연구 - T188-1-20 및 T189-1-20을 위해 선택하였다.Positive cultures by PCR for Cas9 gene integration and disruption in OrfA/Pfa1 were inoculated into 25 ml of M50-20 supplemented with randomly methylated β-cyclodextrin and DHA and for 24 h as previously described. proliferated. The OD600 of the above cultures was measured, and 2 OD units of each culture were inoculated into 50 ml of SPFM, pH 6.75 supplemented with randomly methylated β-cyclodextrin and DHA in a 250 ml shaken Erlenmeyer flask, pH 6.75 and shaker at 27° C. After growth for 48 hours at 200 rpm, 5% glycerol (v:v) was added for cryopreservation. Two clones, one from each transgenic group, were selected for further studies - T188-1-20 and T189-1-20.

야생형 스키조키트리움 종 20888을 충격에 의해 pYB32 및 pYB33으로 형질전환시켜, 다중불포화된 지방산(PUFA) 신타제의 OrfA/PFA1 유전자좌에서 Cas9의 삽입에 기인하여 20888 중에 DHA 영양요구변이를 생성시켰다. 콜로니들을 카운트하고 DHA가 있거나 없는 선택성 플레이트상에서 반복하여 덧대었다. DHA에 대해 영양요구성인 콜로니를 상기 2개의 형질전환 그룹 모두로부터 무작위로 골라 추가의 특성화를 위해 사용하였다.Wild-type Schizochytrium sp. 20888 was transformed with pYB32 and pYB33 by shock to generate DHA auxotrophs during 20888 due to insertion of Cas9 at the OrfA/PFA1 locus of polyunsaturated fatty acid (PUFA) synthase. Colonies were counted and padded repeatedly on selectivity plates with or without DHA. Colonies auxotrophic for DHA were randomly selected from both transgenic groups and used for further characterization.

하기 표 5에 나타낸 바와 같이, pYB33 플라스미드에 의한 형질전환 효율이 pYB32 플라스미드의 경우보다 더 낮았으며, 발견된 영양요구성 콜로니들의 분획에 대해 유사한 성향이 관찰되었다.As shown in Table 5 below, the transformation efficiency with the pYB33 plasmid was lower than that of the pYB32 plasmid, and a similar tendency was observed for the fraction of the found auxotrophic colonies.

형질전환transformation 5 ㎍ DNA당 총 콜로니Total colonies per 5 μg DNA 영양요구성 콜로니auxotrophic colonies 녹아웃%knockout% T188: pYB32―Tub pro-Cas9T188: pYB32—Tub pro-Cas9 160160 8080 50%50% T189: pYB33―CS pro-Cas9T189: pYB33-CS pro-Cas9 9696 3737 38.5%38.5%

T188 및 T189 형질전환체를 Cas9에 결합된 프로모터의 존재 및 상기 Cas9 유전자의 상류 영역에 대해 탐문하였다. T188 클론에 대해서, 예상된 증폭산물 크기는 1028 bp이고, T189는 1584 bp였다. 도 5에 도시된 바와 같이, 클론 T189-1-30을 제외한 모두가 상기 예상된 증폭산물 크기를 생성시켰으며 이들을 추가의 시험에 사용하였다.T188 and T189 transformants were interrogated for the presence of a promoter bound to Cas9 and the region upstream of the Cas9 gene. For the T188 clone, the expected amplification product size was 1028 bp and for T189 was 1584 bp. As shown in Figure 5, all but clone T189-1-30 produced the expected amplification product size and these were used for further testing.

상기 OrfA/Pfa1 유전자좌 중 Cas9의 통합을 PCR에 의해 확인하였으며, 5' 및 3' 측면인접 영역을 모두 야생형-특이성 외부 프라이머 및 Cas9-특이성 내부 프라이머의 조합으로 증폭시켰고, 따라서 증폭산물은 오직 통합이 발생한 경우에만 성취될 수 있었다(도 6). 상기 Cas9 통합 부위의 5' 측면에 대한 예상 증폭산물 크기는 2229 bp였고, 3' 측면의 경우 1973 bp였다. 상기 시험된 균주들은 모두 OrfA/Pfa1 유전자좌에서 Cas9의 통합에 대해 양성이었다.Integration of Cas9 in the OrfA/Pfa1 locus was confirmed by PCR, and both the 5' and 3' flanking regions were amplified with a combination of a wild-type-specific outer primer and a Cas9-specific inner primer, so that the amplification product was only integrated It could only be achieved if it occurred ( FIG. 6 ). The expected amplification product size for the 5' side of the Cas9 integration site was 2229 bp and for the 3' side it was 1973 bp. All of the tested strains were positive for integration of Cas9 at the OrfA/Pfa1 locus.

실시예 3Example 3

gRNA 벡터의 설계 및 형성Design and formation of gRNA vectors

상기 안내 RNA(gRNA) 카세트는 모두 스키조키트리움으로부터 유래된 연장 인자 1α(EF-1α) 프로모터에 의해 발현되도록 설계되었다. 각각의 경우에, 상기 gRNA 서열은 2개의 리보자임 서열, 헴머헤드 및 HDV가 측면에 인접해 있다(문헌[Gao Y and Zhao Y "Self-processing of ribozyme-flanked RNAs into guide RNAs in vitro and in vivo for CRISPR-mediated genome editing" J Integr Plant Biol. 56(4):343-349 (2014)]). "NGG" 프로토스페이서-인접 모티프를 함유하는 4개의 표적 서열이 카로틴 신타제 유전자내에서 확인되었으며 각각의 바로 상류 20 bp가 표적 서열로서 선택되었다. 상기 두 리보자임 모두, 표적 서열 및 안내 RNA를 포함하는 하나의 카세트를 합성하고 BglII 및 NdeI 부위(DNA2.0)를 사용하여 pCL122 벡터내에 클로닝하여 pCL401-추가적인 클로닝을 위한 전구체 플라스미드(서열번호 7)로 표기된 벡터를 생성시켰다. 최종 gRNA 카세트 벡터를 제조하기 위해서, 플라스미드 pSP73(프로메가)을 NdeI 및 HPaI로 절단하고 생성되는 보다 큰 단편을 젤 정제하였다. 스키조키트리움 지방산 신타제 단편(FAS)의 단편을, 5' FAS PmeNde(서열번호 23) 및 3' FAS PmeHpa(서열번호 24) 프라이머(표 6), KOD 핫 스타트 마스터믹스, 및 5% DMSO와 함께 스키조키트리움 종 20888 게놈 DNA로부터 PCR에 의해, 하기의 순환 조건을 사용하여 증폭시켰다: 95 ℃ 2분, (95 ℃ 20초, 62 ℃ 10초, 70 ℃ 51초) x 40주기, 70 ℃ 5분(도 7). 상기 생성 단편을 HpaI 및 NdeI 제한 효소로 절단하고 정제하였다.All of the guide RNA (gRNA) cassettes were designed to be expressed by the elongation factor 1α (EF-1α) promoter derived from Schizochytrium. In each case, the gRNA sequence is flanked by two ribozyme sequences, a hemmerhead and HDV (see Gao Y and Zhao Y "Self-processing of ribozyme-flanked RNAs into guide RNAs in vitro and in vivo"). for CRISPR-mediated genome editing" J Integr Plant Biol. 56(4):343-349 (2014)]). Four target sequences containing the “NGG” protospacer-adjacent motif were identified within the carotene synthase gene and 20 bp immediately upstream of each was selected as the target sequence. For both ribozymes, one cassette containing the target sequence and guide RNA was synthesized and cloned into pCL122 vector using Bgl II and Nde I sites (DNA2.0) to pCL401-precursor plasmid for further cloning (SEQ ID NO: A vector denoted by 7) was generated. To prepare the final gRNA cassette vector, plasmid pSP73 (Promega) was digested with Nde I and HPa I and the resulting larger fragment was gel purified. A fragment of Schizochytrium fatty acid synthase fragment (FAS) was prepared with 5' FAS PmeNde (SEQ ID NO: 23) and 3' FAS PmeHpa (SEQ ID NO: 24) primers (Table 6), KOD Hot Start Mastermix, and 5% DMSO Together, from Schizochytrium sp. 20888 genomic DNA was amplified by PCR, using the following cycling conditions: 95 °C 2 min, (95 °C 20 sec, 62 °C 10 sec, 70 °C 51 sec) x 40 cycles, 70 °C 5 min (Figure 7). The resulting fragment was digested with Hpa I and Nde I restriction enzymes and purified.

프라이머 명칭Primer name 서열번호SEQ ID NO: 프라이머 서열, 5′→ 3′Primer sequence, 5'→ 3' 5′ FAS PmeNde5′ FAS PmeNde 2323 TAGCATATGTTTAAACTCGCGGCGTCTTTCGCTAGCATATGTTTAAACTCGCGGCGTCTTTCGC 3′ FAS PmeHpa3′ FAS PmeHpa 2424 AGTTAACGTTTAAACAGAGGAGGTGGCTGGCAGTTAACGTTTAAACAGAGGAGGTGGCTGGC

적합한 정제 및 절단된 pSP73 및 FAS 단편을 래피드 DNA 라이게이션 키트(롯슈)를 사용하여 결찰시키고 NEB10β 세포(뉴잉글랜드 바이오랩스)내로 형질전환시켜 벡터 pCL399(서열번호 5)를 생성시켰다. 이어서 상기 pCL399 벡터를 NdeI 엔도뉴클레아제 단독으로 절단하고, 단부를 녹두 뉴클레아제(뉴잉글랜드 바이오랩스)로 절두하고, 상기 벡터를 래피드 DNA 결찰 키트로 재결찰시켜 NdeI 부위를 제거하였다. 상기 생성 벡터를 pCL400(서열번호 6)으로 표기하였다. 파로모마이신 발현 카세트 및 gRNA 카세트를 인퓨전 PCR 클로닝에 의해, XhoI로 미리절단시킨 pCL400내로 삽입되는 주형으로서 pCL401(서열번호 7)을 사용하여 증폭시켜 벡터 pCL402(서열번호 8)를 생성시켰다. 간단히, pCL402 IF F(서열번호 25) 및 PCL402 IF R(서열번호 26) 프라이머(표 7), 5% DMSO(v:v), KOD 핫 스타트 마스터믹스, pCL401 플라스미드 주형을 사용하고 하기 순환 조건을 사용하여 pCL401로부터 목적하는 단편을 증폭시켰다: 95 ℃ 2분, (95 ℃ 20초, 60 ℃ 10초, 70 ℃ 1분 25초) x 35주기, 70 ℃ 5분(도 8). 상기 생성 벡터를 pCL402로 표기하였다.Appropriate purified and digested pSP73 and FAS fragments were ligated using a rapid DNA ligation kit (Roche) and transformed into NEB10β cells (New England Biolabs) to generate vector pCL399 (SEQ ID NO: 5). Next, the pCL399 vector was digested with Nde I endonuclease alone, the end was truncated with mung bean nuclease (New England Biolabs), and the vector was re-ligated with a rapid DNA ligation kit to remove the Nde I site. The resulting vector was designated as pCL400 (SEQ ID NO: 6). The paromomycin expression cassette and gRNA cassette were amplified by infusion PCR cloning, using pCL401 (SEQ ID NO: 7) as a template to be inserted into pCL400 pre-cleaved with Xho I, to generate vector pCL402 (SEQ ID NO: 8). Briefly, pCL402 IF F (SEQ ID NO: 25) and PCL402 IF R (SEQ ID NO: 26) primers (Table 7), 5% DMSO (v: v), KOD Hot Start Mastermix, pCL401 plasmid template were used and the following cycling conditions were used. was used to amplify the desired fragment from pCL401: 95 °C 2 min, (95 °C 20 sec, 60 °C 10 sec, 70 °C 1 min 25 sec) x 35 cycles, 70 °C 5 min (FIG. 8). The resulting vector was designated pCL402.

프라이머 명칭Primer name 서열번호SEQ ID NO: 프라이머 서열, 5′→ 3′Primer sequence, 5'→ 3' pCL402 IF FpCL402 IF F 2525 GAGGCGCTGACCGCCGGCCAAGCTTCCAATTTTAGGCCGAGGCGCTGACCGCCGGCCAAGCTTCCAATTTTAGGCC pCL402 IF RpCL402 IF R 2626 GCAGGTGCCGAGTTTCTCGAGAAGAATCTGAACTCACGTCGCAGGTGCCGAGTTTCTCGAGAAGAATCTGAACTCACGTC

4개의 게놈 카로틴 신타제 표적 서열을 BglII 및 NdeI 측면 인접 부위와 함께 합성하였다(유로핀스(Eurofins), 미국 노쓰캐롤라이나주 미베인 소재). 상기 단편을 gRNA3 CS1, CS2, CS3 및 CS4로 표기하였다. 4개의 단편 전부 및 벡터 pCL0402를 BglII및 NdeI로 절단하였다. 모든 적합한 단편을 젤 또는 컬럼 정제하고 이어서 래피드 DNA 라이게이션 키트(롯슈)를 사용하여 함께 결찰하고 생성되는 벡터들을 pYB36(gRNA3 CS1; 서열번호 9), pYB37(gRNA3 CS2; 서열번호 10), pYB38(gRNA3 CS3; 서열번호 11), 및 pYB39(gRNA3 CS4; 서열번호 12)로 표기하였다.Four genomic carotene synthase target sequences were synthesized with Bgl II and Nde I flanking regions (Eurofins, Mybain, NC). These fragments were designated gRNA3 CS1, CS2, CS3 and CS4. All four fragments and vector pCL0402 were digested with Bgl II and Nde I. All suitable fragments were gel or column purified and then ligated together using a rapid DNA ligation kit (Roche) and the resulting vectors were pYB36 (gRNA3 CS1; SEQ ID NO: 9), pYB37 (gRNA3 CS2; SEQ ID NO: 10), pYB38 ( gRNA3 CS3; SEQ ID NO: 11), and pYB39 (gRNA3 CS4; SEQ ID NO: 12).

스키조키트리움으로부터 유전자 편집 성분을 제거하기 위해서, Cas9 및 안내 RNA 카세트를 모두 ("녹아웃") 유전자를 붕괴시키도록 설계하고 이에 의해 영양요구변이를 유도하였다. 상기 Cas9 선택에 의한 영양요구변이는 DHA에 대한 것인 반면, gRNA 삽입에 대해 유도된 영양요구변이는 지방산 신타제 유전자좌(FAS)의 붕괴로 인해 팔미트산에 대한 것이었다. 상기 FAS 유전자의 부분을 스키조키트리움 종 20888 게놈 DNA로부터 증폭시키고 이어서 1% 아가로스 젤상에서 실행시켰다(도 8). 상기 증폭산물의 예상 크기는 2530 bp였다. 2개의 샘플을 제조하고 이들을 젤상에 로딩하였으며, 예상된 크기의 밴드는 상기 PCR의 주요 산물이었으나 다른 보다 작은 크기의 밴드들이 존재하였고, 이는 PCR 조건이 덜 이상적이었을 수도 있고 차후에 최적화시킬 필요가 있을 수도 있음을 암시한다. 2.5 kbp 밴드를 절단하고, 젤 정제하고 벡터 pCL399의 클로닝에 사용하였다.To remove gene editing components from Schizochytrium, both Cas9 and guide RNA cassettes (“knockouts”) were designed to disrupt the gene, thereby inducing auxotrophs. The auxotrophic mutation by Cas9 selection was for DHA, whereas the auxotroph induced for gRNA insertion was for palmitic acid due to disruption of the fatty acid synthase locus (FAS). A portion of the FAS gene was amplified from Schizochytrium sp. 20888 genomic DNA and then run on a 1% agarose gel ( FIG. 8 ). The expected size of the amplification product was 2530 bp. Two samples were prepared and loaded on the gel, the bands of the expected size were the main product of the PCR, but other smaller size bands were present, which could make the PCR conditions less ideal and may need to be optimized later. imply that there is The 2.5 kbp band was excised, gel purified and used for cloning vector pCL399.

PCR을 벡터 pCL400 중의 FAS 유전자좌내로의 파로모마이신 및 gRNA 카세트의 클로닝을 위해 셋업하였다. 상기 증폭산물의 예상 크기는 3376 bp였으며, 상기는 상기 젤상에 나타난 주요 밴드였다(도 8). 이어서 상기 단편을 pCL400내로 클로닝시켜 pCL402를 생성시켰다.PCR was set up for cloning of paromomycin and gRNA cassettes into the FAS locus in vector pCL400. The expected size of the amplification product was 3376 bp, which was the main band on the gel (FIG. 8). This fragment was then cloned into pCL400 to generate pCL402.

실시예 4Example 4

gRNA 구조물의 형질전환 및 선택Transformation and selection of gRNA constructs

T188-1-20 및 T189-1-20 균주(둘 다 PCR에 의해 상이한 프로모터, 각각 알파 튜불린 및 Cs의 조절하에서 Cas9 발현 카세트를 함유하는 것으로 확인되었으며 PUFA 신타제의 OrfA/PFA1 유전자좌에 삽입되었다)를 0.5 mM DNA와 함께 0.4% 무작위 메틸화된 β-사이클로덱스트린이 보충된 25 ㎖의 M50-20을 함유하는 편평바닥 삼각 플라스크에 접종하고 27 ℃, 200 rpm에서 밤새 증식시켰다. 상기 두 균주 모두 상술한 바와 같이 β-사이클로덱스트린 및 DHA가 보충된 50 ㎖의 M2B를 함유하는 진탕형 플라스크에 1/50 희석시키고 27 ℃에서 밤새 증식시켰다. T188 및 T189 배양물을 상술한 바와 같이 다음날 수확하고 충격을 위한 준비를 상술한 바와 같이 수행하였다. pYB36-39 플라스미드를 PmeI에 의한 충격에 앞서 절단하고 컬럼 정제하고, 4개의 절단된 플라스미드를 모두 사용하여 상기 2개의 균주를 형질전환시켜 T202-209로 표기된 형질전환을 생성시켰다(각각 pYB36-39를 갖는 T188, pYB36-39를 갖는 T189). 생성되는 형질전환체를 충격으로부터 회수 후 4시간째에 0.5 mM DHA, 0.5 mM C16:0(팔미트산, 시그마 알드리치(Sigma-Aldrich), 미국 미주리주 세인트 루이스 소재) 및 50 ㎍/㎖ 파로모마이신과 함께 0.4%의 β-사이클로덱스트린을 함유하는 M2B 플레이트상에 도말하였다. 플레이트를 싸고 27 ℃에서 7 내지 10일간 배양하였다. 일단 직경이 2 내지 4 ㎜가 되었으면, 콜로니를 0.5 mM 팔미트산의 존재 또는 부재하에 M2B+0.4% β-사이클로덱스트린+0.5 mM DHA+0.5 ㎍/㎖ 파로모마이신상에 반복해서 덧대어 FAS 유전자좌에 gRNA 카세트 삽입의 결과로서 생성된 팔미트산에 대한 영양요구변이를 확인하였다. 상기 DHA 및 팔미트산 모두에 대해 영양요구성인 것으로 확인된 8개 형질전환 각각으로부터 10개의 콜로니를 골라, 0.4% β-사이클로덱스트린, 0.5 mM DHA 및 0.5 mM 팔미트산이 보충된 25 ㎖의 M50-20을 함유하는 편평바닥 삼각 플라스크에 접종하고, 배양물을 27 ℃, 200 rpm에서 48시간 동안 증식시켰으며, 이때에 2 ㎖의 각 배양물을 게놈 DNA 제조를 위해 수집하였다. 게놈 DNA를 페놀:클로로포름으로 추출한 후에, PCR을 수행하여 gRNA 카세트가 표적화된 영역들을 함유하는 카로틴 신타제 유전자좌를 증폭시켰다. T202, 203, 204, 207, 208로부터 선택된 형질전환체로부터의 gDNA를 하기의 순환 조건을 사용하여 pYB36 CS1 F(서열번호 27) 및 pYB36 CS3 R(서열번호 29) 프라이머(표 8), 5% DMSO(v:v), KOD 핫 스타트 마스터믹스, 주형으로서 각각의 gDNA를 사용하여 PCR에 의해 증폭시켰다: 95 ℃ 2분, (95 ℃ 20초, 61 ℃ 10초, 70 ℃ 21초) x 35주기, 70 ℃ 5분(도 9). T206으로부터 gDNA를 하기의 순환 조건을 사용하여 pYB36 CS1 F 및 pYB36 CS1 R(서열번호 28) 프라이머(표 8), 5% DMSO(v:v), KOD 핫 스타트 마스터믹스로 증폭시켰다: 95 ℃ 2분, (95 ℃ 20초, 61 ℃ 10초, 70 ℃ 11초) x 35주기, 70 ℃ 5분(도 10). T205 및 T209로부터 gDNA를 하기의 순환 조건을 사용하여 pYB36 CS4 F(서열번호 30) 및 pYB36 CS4 R(서열번호 31) 프라이머(표 8), 5% DMSO(v:v), KOD 핫 스타트 마스터믹스로 증폭시켰다: 95 ℃ 2분, (95 ℃ 20초, 61 ℃ 10초, 70 ℃ 21초) x 35주기, 70 ℃ 5분(도 9). T188-1-20 and T189-1-20 strains (both were confirmed by PCR to contain Cas9 expression cassettes under the control of different promoters, alpha-tubulin and Cs, respectively, and were inserted into the OrfA/PFA1 locus of PUFA synthase ) were inoculated into flat-bottom Erlenmeyer flasks containing 25 ml of M50-20 supplemented with 0.4% random methylated β-cyclodextrin with 0.5 mM DNA and grown overnight at 27°C, 200 rpm. Both strains were diluted 1/50 in shake flasks containing 50 ml of M2B supplemented with β-cyclodextrin and DHA as described above and grown overnight at 27°C. T188 and T189 cultures were harvested the next day as described above and preparations for shock were performed as described above. The pYB36-39 plasmids were digested prior to bombardment with Pme I, column purified, and all four digested plasmids were used to transform the two strains to produce a transformation designated T202-209 (each pYB36-39, respectively). T188 with pYB36-39, T189 with pYB36-39). The resulting transformants were harvested 4 hours after recovery from shock with 0.5 mM DHA, 0.5 mM C16:0 (palmitic acid, Sigma-Aldrich, St. Louis, MO) and 50 μg/ml Paromo. Plates were plated on M2B plates containing 0.4% β-cyclodextrin along with mycin. The plate was wrapped and incubated at 27 °C for 7 to 10 days. Once 2-4 mm in diameter, colonies were repeatedly overlaid on M2B+0.4% β-cyclodextrin+0.5 mM DHA+0.5 μg/ml paromomycin in the presence or absence of 0.5 mM palmitic acid to the FAS locus. A auxotrophic mutation for palmitic acid produced as a result of gRNA cassette insertion was confirmed. 10 colonies were picked from each of the 8 transformations found to be auxotrophic for both DHA and palmitic acid, and 25 ml of M50- supplemented with 0.4% β-cyclodextrin, 0.5 mM DHA and 0.5 mM palmitic acid A flat-bottom Erlenmeyer flask containing 20 was inoculated, and the cultures were grown at 27° C., 200 rpm for 48 hours, at which time 2 ml of each culture was collected for genomic DNA preparation. After extraction of genomic DNA with phenol:chloroform, PCR was performed to amplify the carotene synthase locus containing the regions targeted to the gRNA cassette. gDNA from transformants selected from T202, 203, 204, 207, 208 was prepared using the following cycling conditions with pYB36 CS1 F (SEQ ID NO: 27) and pYB36 CS3 R (SEQ ID NO: 29) primers (Table 8), 5% Amplified by PCR using DMSO (v:v), KOD hot start mastermix, each gDNA as template: 95 °C 2 min, (95 °C 20 sec, 61 °C 10 sec, 70 °C 21 sec) x 35 Cycle, 70 °C 5 min (Figure 9). gDNA from T206 was amplified with pYB36 CS1 F and pYB36 CS1 R (SEQ ID NO: 28) primers (Table 8), 5% DMSO (v:v), KOD Hot Start Mastermix using the following cycling conditions: 95 °C 2 min, (95 °C 20 sec, 61 °C 10 sec, 70 °C 11 sec) x 35 cycles, 70 °C 5 min (Fig. 10). gDNA from T205 and T209 was prepared using the following cycling conditions with pYB36 CS4 F (SEQ ID NO: 30) and pYB36 CS4 R (SEQ ID NO: 31) primers (Table 8), 5% DMSO (v:v), KOD Hot Start Mastermix was amplified with: 95 °C 2 min, (95 °C 20 sec, 61 °C 10 sec, 70 °C 21 sec) x 35 cycles, 70 °C 5 min (FIG. 9).

프라이머 명칭Primer name 서열번호SEQ ID NO: 프라이머 서열, 5′→ 3′Primer sequence, 5'→ 3' pYB36 CS1 FpYB36 CS1 F 2727 GAGTCGAAGGAGACGTTGTCGGAGTCGAAGGAGACGTTGTCG pYB36 CS1 RpYB36 CS1 R 2828 GTCATTGCGAATGATGCGATATGGTCATTGCGAATGATGCGATATG pYB36 CS3 RpYB36 CS3 R 2929 GGTCATCATGGAATACAACGCAGGGTCATCATGGAATACAACGCAG pYB36 CS4 FpYB36 CS4 F 3030 CGAGCTCATTTGTGCTACACTCTATGCGAGCTCATTTGTGCTACACTCTATG pYB36 CS4 RpYB36 CS4 R 3131 CACAAGATTTGCAGGATTGATGCCACAAGATTTGCAGGATTGATGC

생성되는 PCR 증폭산물을 컬럼 정제하고, pJet1.2 벡터(써모피셔, 미국 매사추세츠주 월썸 소재)내에 클로닝하고, 생성되는 세균 형질전환체에서 플라스미드를 소량 추출하고, 삽입물을 상기 키트에 제공된 프라이머로 서열분석하였다. 서열을 지니어스(Geneious) 소프트웨어를 사용하여 정렬시켰다.The resulting PCR amplification product was column purified, cloned into pJet1.2 vector (Thermo Fisher, Waltham, Mass.), a small amount of plasmid was extracted from the resulting bacterial transformant, and the insert was sequenced with the primers provided in the kit. analyzed. Sequences were aligned using Geneious software.

gRNA를 암호화하는 벡터에 의한 2개의 Cas9 배경(T188 및 T189)의 형질전환이 성공적으로 수행되었으며 하기 표 9에 나타낸 바와 같은 다수의 이중 영양요구성 균주(T202-209로 표기됨)를 생성시켰다. 선택 균주를 나중에 유전자 수준으로 분석하여 카로티노이드 신타제 유전자좌의 임의의 돌연변이를 검출하였으며, 이는 비-상동성 말단 부착(NHEJ) 사건을 가리킨다.Transformation of two Cas9 backgrounds (T188 and T189) with vectors encoding gRNAs was successfully performed and resulted in a number of dual auxotrophic strains (designated T202-209) as shown in Table 9 below. Selected strains were later analyzed at the genetic level to detect any mutations in the carotenoid synthase locus, indicating a non-homologous end attachment (NHEJ) event.

형질전환transformation 총 콜로니total colonies
(3ug DNA 당)(per 3ug DNA)
16:0 영양요구체의 수16:0 Number of auxotrophs KO 퍼센트KO percent
T202 (T188+gRNA1)T202 (T188+gRNA1) 103103 17/3217/32 53%53% T203 (T188+gRNA2)T203 (T188+gRNA2) 9393 15/3215/32 46%46% T204 (T188+gRNA3)T204 (T188+gRNA3) 6565 15/3215/32 46%46% T205 (T188+gRNA4)T205 (T188+gRNA4) 5050 15/3215/32 46%46% T206 (T189+gRNA1)T206 (T189+gRNA1) 5353 12/3212/32 37%37% T207 (T189+gRNA2)T207 (T189+gRNA2) 2828 8/208/20 40%40% T208 (T189+gRNA3)T208 (T189+gRNA3) 3838 9/299/29 31%31% T209 (T189+gRNA4)T209 (T189+gRNA4) 2828 9/229/22 41%41%

gDNA를 영양요구성인 것으로 나타난(DNA 및 팔미트산 모두가 보충될 것을 요한다) gRNA + Cas9 형질전환체로부터 단리한 후에, 하나의 PCR을 사용하여 하나의 증폭산물로서 첫 번째 3 CS gRNA 표적을 포함하는 전체 영역을 증폭시키고 또 다른 PCR을 사용하여 별도로 gRNA3 CS4 표적을 증폭시켰다(도 9). gRNA3 CS1-3 형질전환체 DNA로부터의 증폭산물에 대한 예상 크기는 1040 bp였으며, gRNA 3 CS4 형질전환체로부터의 경우는 715 bp였다. gRNA CS1-3 표적에 대한 증폭산물의 경우에, 상기 T188 계통으로부터 온 샘플은 정확한 크기의 단일 밴드를 가졌다. 그러나 T189로부터 온 샘플(T189 모 균주 자체로부터의 gDNA 포함)은 가장 현저한 약 1 kbp 밴드 및 두 번째로 가장 현저한 약 1.5 kbp 밴드와 함께 다수의 밴드를 가졌다. 상기 1.5 kbp 밴드는 비-특이적인 증폭의 산물인 것으로 판단되었다. 비특이적인 밴드의 존재를 감소시키고자 상기 PCR의 최적화를 수행하지는 않았으며, 약 1 kbp 밴드를 상기 형질전환체 모두에 대해 상기 젤로부터 절단하고 pJet1.2 클로닝에 사용하였다. CS4 증폭산물은 모두 정확한 크기의 단일 밴드였다. 상기 생성되는, Cas9/gRNA 형질전환체 각각으로부터의 증폭산물을 갖는 다수의 pJet 형질전환체 콜로니를 서열분석을 위해 보내어 Cas9/gRNA가 CS 서열에 임의의 영향을 미치는 지의 여부를 측정하였다.After isolation of gDNA from gRNA + Cas9 transformants that were shown to be auxotrophic (requiring both DNA and palmitic acid to be supplemented), one PCR was used to contain the first 3 CS gRNA targets as one amplification product was amplified and the gRNA3 CS4 target was amplified separately using another PCR (Fig. 9). The expected size for the amplification product from the gRNA3 CS1-3 transformant DNA was 1040 bp and that from the gRNA 3 CS4 transformant was 715 bp. In the case of amplicons against the gRNA CS1-3 target, the sample from the T188 line had a single band of the correct size. However, samples from T189 (including gDNA from the T189 parent strain itself) had multiple bands with the most prominent about 1 kbp band and the second most prominent about 1.5 kbp band. The 1.5 kbp band was determined to be the product of non-specific amplification. Optimization of the PCR was not performed to reduce the presence of non-specific bands, and an approximately 1 kbp band was excised from the gel for all of the transformants and used for pJet1.2 cloning. All CS4 amplicons were single bands of the correct size. A large number of pJet transformant colonies having amplification products from each of the above generated Cas9/gRNA transformants were sent for sequencing to determine whether Cas9/gRNA had any effect on the CS sequence.

추가로, 클론의 T206 계통상에서 분석을 수행하였다. 도 9에 기재된 바와 같이, T206 형질전환체는 gDNA 추출되었으며 PCR용 주형으로서 사용되어 오직 gRNA3 CS1 표적 서열만을 증폭시켰고 예상되는 증폭산물 크기는 689 bp였다. 상기 크기의 밴드가 상기 젤상에서 관찰되었으며, 이는 다른 것들 가운데에서 주 밴드였고, 보다 큰 분자량을 갖는 것이었으며, 선행 도면상에서 관찰된 것들과 유사하였다. 예상된 약 700 bp 크기의 젤 단편을 절단하고 정제하고 상기 단편의 pJet1.2 클로닝 및 서열분석에 사용하여 Cas9-gDNA가 CS 유전자 중의 임의의 서열 변화를 유도하였는지를 측정하였다(도 10).Additionally, analysis was performed on the T206 line of clones. As shown in FIG. 9 , the T206 transformant was gDNA extracted and used as a template for PCR to amplify only the gRNA3 CS1 target sequence and the expected amplification product size was 689 bp. A band of this size was observed on the gel, which was the main band among others, was the one with the higher molecular weight, and was similar to that observed in the preceding figure. A gel fragment of the expected size of about 700 bp was cut and purified, and the fragment was used for pJet1.2 cloning and sequencing to determine whether Cas9-gDNA induced any sequence change in the CS gene (FIG. 10).

Cas9 및 gRNA 암호화 벡터 모두로 형질전환된 클론들로부터의 CS 유전자좌의 증폭산물을 함유하는 다수의 pJet 벡터를 서열분석할 때, 여러 종류의 변화가 발생함이 관찰되었다(표 10). 선행 연구는 CAS9가 DNA를 절단하고 PAM 서열 상류의 3 염기쌍 위치에서 이중 가닥 절단을 생성시킴을 입증하였다(문헌[Jinek et al. "A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity" Science 337(6096):816-821, 2012]). 상기 이중 가닥 절단 부위에서 다양한 길이의 삽입 및/또는 결실 돌연변이를 생성시킬 수도 있는 부정확한 비-상동성 말단 부착(NHEJ)-매개된 복구로부터 돌연변이가 발생할 수도 있음이 공지되어 있다(문헌[Sander and Joung, "CRISPR-Cas systems for editing, regulating and targeting genomes" Nat Biotechnol. 32(4):347-355, 2014]). 관찰된 변화는 모두 예상된 바와 같이 PAM에 비해 -3번 위치에서 발생한다(이 경우 AGG). Cas9+gRNA 형질전환체에서 검출된 스키조키트리움 종 20888의 카로티노이드 신타제의 예시적인 돌연변이를 표 10에 나타내며, 이때 결실은 대시로 도시하고 삽입은 밑줄로 도시되었다.When sequencing multiple pJet vectors containing amplification products of the CS locus from clones transformed with both Cas9 and gRNA encoding vectors, several types of changes were observed (Table 10). Previous studies have demonstrated that CAS9 cleaves DNA and generates a double-stranded break at a 3 base pair position upstream of the PAM sequence (Jinek et al. "A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity" Science 337 (6096):816-821, 2012]). It is known that mutations may arise from imprecise non-homologous end attachment (NHEJ)-mediated repair, which may result in insertion and/or deletion mutations of varying lengths at the double-stranded cleavage site (Sander and Joung, "CRISPR-Cas systems for editing, regulating and targeting genomes" Nat Biotechnol. 32(4):347-355, 2014]). All observed changes occur at position -3 relative to PAM (in this case AGG), as expected. Exemplary mutations of the carotenoid synthase of Schizochytrium sp. 20888 detected in Cas9+gRNA transformants are shown in Table 10, with deletions shown in dashes and insertions underlined.

야생형 뉴클레오타이드 위치 #wild-type nucleotide position # 1515 1616 1717 1818 1919 2020 PAM/NGGPAM/NGG 야생형wild type AA CC GG CC GG CC AGGAGG CS1 표적 변체 #1CS1 Target Variant #1 AA CC GG CC -- -- AGGAGG CS1 표적 변체 #2CS1 Target Variant #2 AA CC -- CC GG CC AGGAGG CS1 표적 변체 #3CS1 Target Variant #3 AA CC GG C C C C GG CC AGGAGG CS1 표적 변체 #4CS1 Target Variant #4 AA CC GG T C T C GG CC AGGAGG

본 명세서에 인용된 모든 참고문헌은 각각의 참고문헌이 구체적이고 개별적으로 참고로 인용됨을 가리키는 바와 같이 본 명세서에 참고로 인용된다. 임의의 참고문헌의 인용은 출원일 이전의 그의 명세에 대한 것이며 본 명세가 종래 발명 때문에 상기와 같은 참고문헌에 선행할 자격을 갖지 않음을 인정하는 것으로서 이해해서는 안 된다.All references cited herein are incorporated herein by reference as if each reference were specifically and individually indicated to be incorporated by reference. Citation of any reference is for its disclosure prior to the filing date and is not to be construed as an admission that this disclosure is not entitled to antedate such reference because of prior invention.

각각의 상술한 요소들, 또는 2개 이상의 요소가 함께 상술한 유형과 상이한 다른 유형의 방법에서 유용한 용도를 또한 발견할 수 있음을 알 것이다. 추가의 분석 없이, 상기는 본 명세의 요지를 충분히 밝혀서, 다른 이들이 현재의 지식을 적용함으로써, 종래 기술의 관점으로부터, 첨부된 특허청구범위에 제시된 본 명세의 일반적이거나 특정한 태양의 필수적인 특징을 공정하게 구성하는 특징들을 생략하지 않고서 상기 명세를 다양한 용도들에 쉽게 적합화시킬 수 있다. 상기 실시태양들은 단지 예로서 제공되며; 본 명세의 범위는 오직 하기 특허청구범위에 의해서만 제한되게 된다.It will be appreciated that each of the aforementioned elements, or two or more elements together, may also find useful use in other types of methods different from those described above. Without further analysis, the foregoing sufficiently sets forth the subject matter of the present disclosure so that others, by applying their present knowledge, from the standpoint of the prior art, fairly share the essential features of the general or specific aspects of the present disclosure set forth in the appended claims. The above specification can be readily adapted to various uses without omitting constituting features. The above embodiments are provided by way of example only; The scope of the present disclosure is limited only by the following claims.

실시예 5Example 5

표적화되지 않은 Cas9 벡터(pYB61)의 설계 및 형성Design and formation of untargeted Cas9 vector (pYB61)

앞서, Cas9 플라스미드를 PUFA 신타제 유전자(OrfA 서브유닛; pYB32 및 pYB33을 참조하시오)에서 표적화된 통합을 위해 설계하였다. 이러한 방식으로 설계된 Cas9 카세트의 통합은 상기 OrfA 유전자의 녹아웃 및 DHA 영양요구변이를 생성시켰다. 영양요구변이의 유도는 OrfA 유전자좌에서의 후속적인 통합을 위한 형질전환체의 보다 용이한 선별을 가능하게 하나, CRISPR-매개된 카로틴 신타제 녹아웃의 분화를 보다 어렵게 한다. 임의의 염색체 유전자좌에 표적화하지 않으면서 Cas9에 대한 발현 카세트를 갖는 플라스미드를 형성시켜 이소성, 무작위 통합의 영향을 시험하였다. 플라스미드 pCL122-Cas9(서열번호 1)를 BamHI-HF(NEB) 및 NdeI(NEB)로 절단시켜 2개의 단편, 크기 4140 bp 및 5780 bp를 생성시켰다. 상기 4140 bp 단편을 1% 아가로스 젤상에서 분리시키고, 절제하고, QIAquick 젤 추출 키트(퀴아겐)를 사용하여 정제시켰다(도 11). 플라스미드 pYB30(서열번호 32)은 GFP 및 제오신 내성의 발현을 구동하는 알파-튜불린 프로모터의 부분적인 단편을 함유한다. 상기 플라스미드를 BamHI-HF(NEB) 및 NdeI(NEB)로 절단시켜, 2개의 단편 838 bp 및 4724 bp를 생성시켰다. 상기 4724 bp 단편을 1% 아가로스 젤상에서 분리시키고, 절제하고, QIAquick 젤 추출 키트(퀴아겐)를 사용하여 정제시켰다(도 11). 상기 2개의 관심 단편을 래피드 DNA 라이게이션 키트(롯슈)를 사용하여 결찰시켰다. 이어서 NEB10β 화학적 수용성 세포(NEB)를 상기 결찰 반응의 일부로 형질전환시켰다. 생성되는 콜로니를 2x GoTaq 그린 마스터 믹스(프로메가), 프라이머 121 Tub seq F 및 pYB32/3C R1 및 5% 최종 DMSO와 함께 PCR에 의한 선별을 위해 주형으로서 사용하였다. 하기의 순환 조건을 적용하였다: 95 ℃ 5분, [95 ℃ 30초, 59 ℃ 30초, 72 ℃ 1분] 35주기 동안, 및 72 ℃ 5분(앞서 기재됨)(도 12). 플라스미드를 PCR에 의해 양성 형질전환체로서 확인된 콜로니로부터 추출하고, 서열분석하고, 생성 벡터를 pYB61(서열번호 33)로 표기하였다.Previously, a Cas9 plasmid was designed for targeted integration in the PUFA synthase gene (OrfA subunit; see pYB32 and pYB33). Integration of the Cas9 cassette designed in this way resulted in a knockout of the OrfA gene and a DHA auxotroph. Induction of auxotrophs allows easier selection of transformants for subsequent integration at the OrfA locus, but makes differentiation of CRISPR-mediated carotene synthase knockouts more difficult. Plasmids with an expression cassette for Cas9 without targeting to any chromosomal locus were formed to test the effect of ectopic, random integration. Plasmid pCL122-Cas9 (SEQ ID NO: 1) was digested with BamHI-HF (NEB) and NdeI (NEB) to generate two fragments, 4140 bp and 5780 bp in size. The 4140 bp fragment was isolated on 1% agarose gel, excised, and purified using the QIAquick gel extraction kit (Qiagen) (FIG. 11). Plasmid pYB30 (SEQ ID NO: 32) contains a partial fragment of the alpha-tubulin promoter driving expression of GFP and zeocin resistance. The plasmid was digested with BamHI-HF (NEB) and NdeI (NEB) to generate two fragments 838 bp and 4724 bp. The 4724 bp fragment was isolated on 1% agarose gel, excised, and purified using the QIAquick gel extraction kit (Qiagen) (FIG. 11). The two fragments of interest were ligated using a rapid DNA ligation kit (Roche). NEB10β chemically soluble cells (NEBs) were then transformed as part of the ligation reaction. The resulting colonies were used as templates for selection by PCR with 2x GoTaq Green Master Mix (Promega), primers 121 Tub seq F and pYB32/3C R1 and 5% final DMSO. The following cycling conditions were applied: 95° C. 5 min, [95° C. 30 sec, 59° C. 30 sec, 72° C. 1 min] for 35 cycles, and 72° C. 5 min (described previously) ( FIG. 12 ). The plasmid was extracted from the colonies identified as positive transformants by PCR, sequenced, and the resulting vector was designated pYB61 (SEQ ID NO: 33).

프라이머 명칭Primer name 서열번호SEQ ID NO: 프라이머 서열, 5'→3'Primer sequence, 5'→3' 121 Tub seq F121 Tub seq F 1313 GGATCTCATGCTGGAGTTCTTCGGATCTCATGCTGGAGTTCTTC pYB32/3C R1pYB32/3C R1 1414 GTACTTCTCGTGGTAGGCAACCGTACTTCTCGTGGTAGGCAACC

실시예 6Example 6

표적화되지 않은 gRNA 벡터(pYB66)의 설계 및 형성Design and formation of untargeted gRNA vector (pYB66)

앞서, gRNA 플라스미드를 지방산 신타제(FAS) 유전자 유전자좌에 대해 표적화하여, 팔미트산에 대한 영양요구성 요건을 생성시켰다. 상기 영양요구변이는 CRISPR-매개된 카로틴 신타제 불활성화로부터 생성되는 표현형을 가렸다. 카로티노이드 생합성에 관련된 유전자좌에서의 CRISPR-매개된 표현형 변화를 보다 잘 식별하기 위해서, 표적화되지 않은 gRNA 벡터를 제조하였다. 플라스미드 pCL122(서열번호 37)를 BamHI-HF(NEB) 및 NdeI(NEB)로 절단하여 GFP를 암호화하는 판독 프레임을 제거하여, 838 bp 및 5773 bp 크기의 2개의 단편을 생성시켰다. 상기 5773 bp 단편을 1% 아가로스 젤상에서 분리시키고, 절제하고, QIAquick 젤 추출 키트(퀴아겐)를 사용하여 정제하였다(도 13). gRNA3 CS1을 암호화하는 카세트를 2x KOD 핫 스타트 마스터 믹스(EMD 밀리포어), 프라이머 pYB66 BamBgl F 및 pYB66 Nde R, 및 5% 최종 DMSO와 함께 주형으로서 플라스미드 pYB36(서열번호 9)을 사용하여 PCR에 의해 증폭시켰다. 하기의 순환 조건을 적용하였다: 95 ℃ 2분, [95 ℃ 20초, 59 ℃ 10초, 70 ℃ 5초] 35주기 동안, 및 72 ℃ 2분(앞서 기재됨)(도 14). 상기 생성되는 PCR 단편을 QIAquick PCR 정제 키트(퀴아겐)를 사용하여 컬럼 정제하고 BamHI-HF(NEB) 및 NdeI(NEB)로 절단하였다. 절단에 이어서, 상기 생성되는 DNA 단편을 다시 QIAquick PCR 정제 키트를 사용하여 정제하였다(도 15). 상기 절단된 pCL122 및 gRNA 단편을 래피드 DNA 라이게이션 키트(롯슈)를 사용하여 결찰시켰다. 이어서 NEB10β 화학적 수용성 세포를 상기 결찰 반응의 일부로 형질전환시키고 생성되는 콜로니를 2x GoTaq 그린 마스터 믹스(프로메가), 프라이머 pYB66 EF1seq F 및 pCL122 OrfC R 및 5% 최종 부피의 DMSO를 사용하여, 하기의 순환 조건으로 콜로니 PCR에 의해 선별하였다: 95 ℃ 10분, [95 ℃ 30초, 60 ℃ 30초, 72 ℃ 45초] 35주기 동안, 및 72 ℃ 5분(도 16). 플라스미드를 PCR에 의해 양성 콜로니로부터 추출하고, 서열분석하고, 생성 벡터를 pYB66(서열번호 34)으로 표기하였다.Previously, a gRNA plasmid was targeted to the fatty acid synthase (FAS) gene locus, creating an auxotrophic requirement for palmitic acid. These auxotrophs masked the phenotype resulting from CRISPR-mediated carotene synthase inactivation. To better identify CRISPR-mediated phenotypic changes at loci involved in carotenoid biosynthesis, untargeted gRNA vectors were constructed. Plasmid pCL122 (SEQ ID NO: 37) was digested with BamHI-HF (NEB) and NdeI (NEB) to remove the reading frame encoding GFP, resulting in two fragments of 838 bp and 5773 bp in size. The 5773 bp fragment was separated on a 1% agarose gel, excised, and purified using the QIAquick gel extraction kit (Qiagen) (FIG. 13). The cassette encoding gRNA3 CS1 was prepared by PCR using plasmid pYB36 (SEQ ID NO: 9) as a template along with 2x KOD hot start master mix (EMD Millipore), primers pYB66 BamBgl F and pYB66 Nde R, and 5% final DMSO. amplified. The following cycling conditions were applied: 95° C. 2 min, [95° C. 20 sec, 59° C. 10 sec, 70° C. 5 sec] for 35 cycles, and 72° C. 2 min (described previously) ( FIG. 14 ). The resulting PCR fragment was column purified using a QIAquick PCR purification kit (Qiagen) and digested with BamHI-HF (NEB) and NdeI (NEB). Following cleavage, the resulting DNA fragment was purified again using a QIAquick PCR purification kit (FIG. 15). The cleaved pCL122 and gRNA fragments were ligated using a rapid DNA ligation kit (Roche). NEB10β chemically soluble cells were then transformed as part of the above ligation reaction and the resulting colonies were transformed using 2x GoTaq Green Master Mix (Promega), primers pYB66 EF1seq F and pCL122 OrfC R and 5% final volume of DMSO, in the following cycle: The conditions were selected by colony PCR: 95 °C 10 min, [95 °C 30 sec, 60 °C 30 sec, 72 °C 45 sec] for 35 cycles, and 72 °C 5 min (FIG. 16). The plasmid was extracted from the positive colonies by PCR and sequenced, and the resulting vector was designated pYB66 (SEQ ID NO: 34).

프라이머 명칭Primer name 서열번호SEQ ID NO: 프라이머 서열, 5'→ 3'Primer sequence, 5'→ 3' pYB66 BamBgl FpYB66 BamBgl F 3838 CAAGGGATCCAGATCTTCCGCACTGATGAGTCCAAGGGATCCAGATCTTCCGCACTGATGAGTC pYB66 Nde RpYB66 Nde R 3939 AACTCATATGGTCCCATTCGCCAAACTCATATGGTCCCATTCGCCA pYB66 EF1seq FpYB66 EF1seq F 4040 GAGAGGATAGTATCTTGCGTGCTTGGAGAGGATAGTATCTTGCGTGCTTG pCL122 OrfC RpCL122 OrfC R 4141 GCAAGGTTGGAACATTACGATCAAGGCAAGGTTGGAACATTACGATCAAG

실시예 7Example 7

Cas9 및 gRNA 벡터(pYB73)의 설계 및 형성Design and formation of Cas9 and gRNA vectors (pYB73)

단일의 우성 선택성 마커를 갖는 동일한 벡터상에 Cas9 및 gRNA 카세트를 모두 함유하는 벡터를 생성시켜 게놈 편집에 대한 상기와 같은 배열의 효율을 시험하였다. 플라스미드 pYB61을 PstI(NEB)로 절단하고 QIAquick PCR 정제 키트(퀴아겐)에 의해 정제하였다(도 17). 상기 gRNA 발현 카세트를 하기와 함께 PCR을 사용하여 pYB36으로부터 증폭시켰다: 2x KOD 핫 스타트 마스터 믹스(노바겐(Novagen)), 프라이머 pYB73 gRNA Pst Kpn IF F 및 pYB73 gRNA Xho Pst IF R, 및 5% 최종 DMSO. 순환 조건은 하기와 같이 사용되었다: 95 ℃ 2분, [95 ℃ 20초, 58 ℃ 10초, 70 ℃ 38초] 35주기 동안, 및 70 ℃ 2분(도 18). 상기 생성되는 PCR 단편을 QIAquick PCR 클린업 키트(퀴아겐)를 사용하여 컬럼 정제하였다. 정제된, 절단된 pYB61 단편을 제조사의 프로토콜에 따라 인퓨전 키트(클론테크)를 사용하여 상기 정제된, 절단된 gRNA PCR 단편에 결찰시켰다. NEB10β 화학적 수용성 세포를 상기 인퓨전 반응의 일부로 형질전환시키고 생성되는 세균 콜로니를 2x GoTaq 그린 마스터 믹스(프로메가), 프라이머 pYB73 seq F 및 pYB73 seq R 및 5% 최종 DMSO와 함께 PCR에 의해 선별하였다. 하기의 순환 조건을 적용하였다: 95 ℃ 10분, [95 ℃ 30초, 60 ℃ 30초, 72 ℃ 2분] 35주기 동안, 및 72 ℃ 5분(도 19). 플라스미드를 PCR에 의해 양성 콜로니로부터 추출하고, 서열분석하고, 생성 벡터를 pYB73(서열번호 35)으로 표기하였다. 플라스미드 pYB73은 임의의 특정한 유전자에 대한 표적화를 위해 설계되지 않는다.The efficiency of such an arrangement for genome editing was tested by generating vectors containing both Cas9 and gRNA cassettes on the same vector with a single dominant selectable marker. Plasmid pYB61 was digested with PstI (NEB) and purified by QIAquick PCR purification kit (Qiagen) (FIG. 17). The gRNA expression cassette was amplified from pYB36 using PCR with: 2x KOD hot start master mix (Novagen), primers pYB73 gRNA Pst Kpn IF F and pYB73 gRNA Xho Pst IF R, and 5% final DMSO. Cycling conditions were used as follows: 95° C. 2 min, [95° C. 20 sec, 58° C. 10 sec, 70° C. 38 sec] for 35 cycles, and 70° C. 2 min (FIG. 18). The resulting PCR fragment was column purified using the QIAquick PCR Cleanup Kit (Qiagen). The purified, cleaved pYB61 fragment was ligated to the purified, cleaved gRNA PCR fragment using an infusion kit (Clontech) according to the manufacturer's protocol. NEB10β chemically soluble cells were transformed as part of the above infusion reaction and the resulting bacterial colonies were selected by PCR with 2x GoTaq Green Master Mix (Promega), primers pYB73 seq F and pYB73 seq R and 5% final DMSO. The following cycling conditions were applied: 95° C. 10 min, [95° C. 30 sec, 60° C. 30 sec, 72° C. 2 min] for 35 cycles, and 72° C. 5 min ( FIG. 19 ). The plasmid was extracted from the positive colonies by PCR and sequenced, and the resulting vector was designated pYB73 (SEQ ID NO: 35). Plasmid pYB73 is not designed for targeting to any particular gene.

프라이머 명칭Primer name 서열번호SEQ ID NO: 프라이머 서열, 5'→ 3'Primer sequence, 5'→ 3' pYB73 gRNA Pst Kpn IF FpYB73 gRNA Pst Kpn IF F 4242 CATACATGGTCGACCTGCAGGGTACCTCTTATCTGCCTCGCCATACATGGTCGACCTGCAGGGTACCTCTTATCTGCCTCGC pYB73 gRNA Xho Pst IF RpYB73 gRNA Xho Pst IF R 4343 ATTAATGCAGGTTCCTGCAGCTCGAGAAGAATCTGAACTCACGTCATTAATGCAGGTTCCTGCAGCTCGAGAAGAATCTGAACTCACGTC pYB73 seq FpYB73 seq F 4444 CACCCCAACTTGTTTATTGCAGCACCCCAACTTGTTTATTGCAG pYB73 seq RpYB73 seq R 4545 GAGCGAGGAAGCGGAAGAGGAGCGAGGAAGCGGAAGAG

실시예 8Example 8

CarG 균주에 대한 설계, 형성 및 형질전환Design, Formation and Transformation of CarG Strains

CRIPSR-매개된 카로틴 신타제 유전자 불활성화를 갖는 형질전환체(백색 콜로니 표현형)와, CRIPSR-매개된 카로틴 신타제 유전자 불활성화를 갖지 않는 형질전환체(황색-오렌지색 콜로니 표현형)간의 구별을 개선하기 위해서, 뮤코르 시르시넬로이데스(Mucor circinelloides)로부터의 CarG 유전자(제라닐제라닐 피로포스페이트 신타제)를 스키조키트리움에서의 발현에 대해 코돈-최적화하였다. 상기 CarG 유전자(상기는 생성된 카로티노이드의 총량을 증가시키고 형질전환체를 더 오렌지색으로 만들 것이다)를 DNA2.0에 의해 합성하고, 블라스티시딘 선택 카세트를 함유하는 스키조키트리움 발현 벡터내로 클로닝하였다. 상기 생성 벡터를 pCL310(서열번호 36)이라 명명하였다.To improve the distinction between transformants with CRIPSR-mediated carotene synthase gene inactivation (white colony phenotype) and transformants without CRIPSR-mediated carotene synthase gene inactivation (yellow-orange colony phenotype) For this purpose, the CarG gene (geranylgeranyl pyrophosphate synthase) from Mucor circinelloides was codon-optimized for expression in Schizochytrium. The CarG gene (which would increase the total amount of carotenoids produced and make the transformants more orange) was synthesized by DNA2.0 and cloned into a Schizochytrium expression vector containing a blasticidin selection cassette. . The resulting vector was named pCL310 (SEQ ID NO: 36).

스키조키트리움 종 ATCC 20888의 야생형 균주를 입자 충격 방법(바이오 래드)을 통해 pCL310에 대한 형질전환에 사용하였다. ATCC 20888을 200 rpm에서 밤새 진탕시키면서 +27 ℃에서 250 ㎖ 편평바닥 삼각 플라스크 중의 25 ㎖의 M50-20 배지에서 증식시켰다. 이어서, 상기 20888 배양물을 250 ㎖ 진탕형 플라스크 중의 50 ㎖의 M2B 배지에 1/100 희석하고 앞서 사용된 조건하에서 밤새 증식시켰다. 상기 20888 배양물이 초기 로그기(0.6-2 OD 단위/㎖)에 있었을 때, 배양물을 3,220 x g에서 10분간 원심분리에 의해 수확하였다. 상등액을 경사분리하고, 펠릿을 M2B에 20 OD 단위/㎖의 최종 농도로 재현탁시켰다. 100 ㎕의 생성 세포 현탁액을 비-선택성 M2B 아가 플레이트(대략적으로 2 in 직경)의 대략 1/3상에 도말하였다. 5 ㎍의 플라스미드 DNA 및 5 ㎍의 pCL310을 50 ㎕의 2.5 M CaCl2, 20 ㎕의 0.1 M 스페르미딘 및 50 ㎕의 제조된 M10 텅스텐 비드(제조사의 프로토콜에 따라)와 혼합하고, 1분간 와동시키고, 이어서 실온에서 10분간 배양하여 상기 비드를 침전되게 하였다. DNA-코팅된 비드를 250 ㎕의 100% 에탄올로 1회 세척하고, 이어서 비드를 60 ㎕의 100% 에탄올에 재현탁시켰다. 각각의 제조된 거대담체(거대담체 조립 제조에 대한 제조사의 프로토콜에 따라)는, 후드로부터 최소의 진동을 유도하고 따라서 에탄올이 건조되게 하는 조건하에서 중심에 스폿팅된 에탄올 중의 코팅된 비드 10 ㎕를 가졌다. 파열판 홀더는 70% 아이소프로판올 중에서 짧은 멸균 후 1,100 psi 파열판을 가졌다. 조립된 거대담체 플랫폼을 꼭대기 선반 위치에 놓고 20888 세포 패치가 있는 M2B 아가 플레이트를 상기 꼭대기로부터 세 번째 선반상에 세포쪽을 위로 하여 놓았다. 진공이 상기 유전자총 챔버내부에서 약 27 인치의 수은에 도달했으면, 파열판이 파손될 때까지 헬륨을 발사하였다. 이어서 헬륨 흐름을 차단시키고, 상기 챔버를 대기로 배기시켰다. 이어서 충격판을 상기 챔버로부터 제거하였다. 상기 충격 과정을 모든 샘플 및 대조용에 대해 반복하였다. pCL310으로 충격된 20888 균주를 T212로 표기하였다. 충격판을 +27 ℃ 배양기에서 4시간 동안 배양하고, 그 후에 세포를 상기 플레이트로부터 약 1 ㎖의 M2B로 세척하고 100 ㎍/㎖ 블라스티시딘(써모피셔)을 함유하는 4개의 M2B 아가 플레이트들간에 균등하게 분배하였다. 세포를 3 ㎜ 멸균 유리 비드로 도말하였다. 비드 제거 후에, 플레이트를 싸고 +27 ℃에서 5 내지 8일 동안 배양하였다. 콜로니의 크기가 2 내지 4 밀리미터에 도달했을 때, 이들을 100 ㎍/㎖ 블라스티시딘을 함유하는 M2B 아가 플레이트상에 덧대었다. 블라스티시딘에 대해 내성인 것으로 확인된 콜로니를 고르고, 250 ㎖ 편평 바닥 삼각 플라스크 중의 50 ㎖의 M50-20내에 접종하고, +27 ℃ 및 200 rpm에서 48시간 동안 진탕기에 넣었다. 48시간 배양후에, 2 밀리리터의 배양물을 4,000xg에서 10분간 원심분리에 의해 수집하고, 상등액을 경사분리하고, 펠릿을 변형된 페놀-클로로포름 추출에 따른 게놈 DNA 단리에 사용하였다. 게놈 DNA를 추출하고 GoTaq 그린 마스터믹스(프로메가)와 함께 PCR 주형으로서 사용하고 프라이머 pYB13 YB1 seq F 및 pCL122 OrfC R 및 DMSO를 5% 최종 농도로 사용하여 Cas9 카세트의 존재를 확인하였다. 하기의 순환 조건을 적용하였다: 95 ℃ 2분, [95 ℃ 30초, 63 ℃ 30초, 72 ℃ 1분 45초] 35주기 동안, 및 72 ℃ 5분(도 20). PCR에 의해 측정시 pCL130 형질전환 DNA의 존재에 대해 양성이고 야생형 대조용에 비해 더 짙은 오렌지색을 나타내는 다수의 형질전환체를 UV-Vis 방법에 의해 총 카로티노이드에 대해 분석하였다. 상기 분석 중에서 2개의 균주, T212-3-1 및 T212-3-2가 보다 높은 총 카로티노이드를 갖는 것으로 확인되었다. T212-3-2는 게놈 편집 요소에 대한 후속 연구를 위해 선택된다.A wild-type strain of Schizochytrium sp. ATCC 20888 was used for transformation to pCL310 via the particle bombardment method (Bio Rad). ATCC 20888 was grown in 25 ml of M50-20 medium in a 250 ml flat bottom Erlenmeyer flask at +27° C. with shaking at 200 rpm overnight. The 20888 culture was then diluted 1/100 in 50 ml M2B medium in a 250 ml shake flask and grown overnight under the conditions previously used. When the 20888 culture was in the initial log phase (0.6-2 OD units/ml), the culture was harvested by centrifugation at 3,220×g for 10 minutes. The supernatant was decanted and the pellet resuspended in M2B to a final concentration of 20 OD units/ml. 100 μl of the resulting cell suspension was plated onto approximately 1/3 of a non-selective M2B agar plate (approximately 2 in diameter). 5 μg of plasmid DNA and 5 μg of pCL310 were mixed with 50 μl of 2.5 M CaCl 2 , 20 μl of 0.1 M spermidine and 50 μl of prepared M10 tungsten beads (according to manufacturer's protocol) and vortexed for 1 minute. and then incubated for 10 minutes at room temperature to allow the beads to precipitate. The DNA-coated beads were washed once with 250 μl of 100% ethanol, and then the beads were resuspended in 60 μl of 100% ethanol. Each prepared macrocarrier (according to the manufacturer's protocol for preparing the macrocarrier assembly) was subjected to 10 μl of the coated beads in ethanol that were centrally spotted under conditions that induce minimal vibration from the hood and thus allow the ethanol to dry. had The bursting disk holder had a 1,100 psi bursting disk after short sterilization in 70% isopropanol. The assembled macrocarrier platform was placed in the top shelf position and the M2B agar plate with the 20888 cell patch was placed cell side up on the third shelf from the top. Once the vacuum reached about 27 inches of mercury inside the gun chamber, helium was fired until the rupture disk was broken. The helium flow was then shut off and the chamber vented to atmosphere. The shock plate was then removed from the chamber. The shock procedure was repeated for all samples and controls. The 20888 strain shocked with pCL310 was designated T212. The shock plate was incubated for 4 hours in a +27° C. incubator, after which the cells were washed from the plate with about 1 ml of M2B and between 4 M2B agar plates containing 100 μg/ml blasticidin (ThermoFisher). evenly distributed in Cells were plated with 3 mm sterile glass beads. After bead removal, the plates were wrapped and incubated at +27 °C for 5-8 days. When the colonies reached 2-4 millimeters in size, they were overlaid on M2B agar plates containing 100 μg/ml blasticidin. Colonies found to be resistant to blasticidin were picked, inoculated into 50 ml of M50-20 in a 250 ml flat bottom Erlenmeyer flask, and placed on a shaker at +27° C. and 200 rpm for 48 hours. After 48 hours of incubation, 2 milliliters of culture were collected by centrifugation at 4,000×g for 10 minutes, the supernatant decanted and the pellet used for genomic DNA isolation following modified phenol-chloroform extraction. Genomic DNA was extracted and used as PCR template with GoTaq Green Mastermix (Promega) and primers pYB13 YB1 seq F and pCL122 OrfC R and DMSO at 5% final concentration to confirm the presence of the Cas9 cassette. The following cycling conditions were applied: 95° C. 2 min, [95° C. 30 sec, 63° C. 30 sec, 72° C. 1 min 45 sec] for 35 cycles, and 72° C. 5 min ( FIG. 20 ). A number of transformants positive for the presence of pCL130 transforming DNA as determined by PCR and exhibiting a darker orange color compared to the wild-type control were analyzed for total carotenoids by the UV-Vis method. In the above analysis, two strains, T212-3-1 and T212-3-2, were identified with higher total carotenoids. T212-3-2 is selected for subsequent studies on genome editing elements.

프라이머 명칭Primer name 서열번호SEQ ID NO: 프라이머 서열, 5'→ 3'Primer sequence, 5'→ 3' pYB13 pYB1 seq FpYB13 pYB1 seq F 4646 GAGAGGATAGTATCTTGCGTGCTTGGGAGAGGATAGTATTCTTGCGTGCTTGG pCL122 OrfC RpCL122 OrfC R 4141 GCAAGGTTGGAACATTACGATCAAGGCAAGGTTGGAACATTACGATCAAG

실시예 9Example 9

pYB61, pYB66 및 pYB73에 의한 T212 및 20888의 형질전환Transformation of T212 and 20888 with pYB61, pYB66 and pYB73

스키조키트리움에서 편집 효율에 대한 표적화되지 않은 벡터의 영향을 시험하기 위해서 다수의 형질전환을 상술한 입자 충격 방법을 사용하여 수행하였다. 상기 형질전환을 하기와 같이 셋업하였다: T280-T212-3-2 균주를 pYB61로 형질전환시키고, T285-ATCC 20888 야생형 균주를 pYB61로 형질전환시켰다. 상기 두 형질전환을 모두 수행하여 gRNA 시험 연구를 위해 Cas9를 사전-발현하는 수용 균주를 생성시켰다. 후속의 형질전환을 하기와 같이 셋업하였다: T281-T212-3-2 균주를 pYB61 및 pYB66으로 동시-형질전환시키고, T286-ATCC 20888 야생형 균주를 pYB61 및 pYB66으로 동시-형질전환시켰다. T281 및 T286 형질전환을 수행하여, 게놈 중 임의의 특정 유전자좌에 대한 표적화를 위해 설계되지 않은 플라스미드에 의한 편집 효율을 평가하였다. pYB73으로 형질전환된 T282-T212-3-2 균주, 및 pYB73으로 형질전환된 T287-ATCC 20888 야생형 균주를, 게놈 편집을 위해 상기 두 요소를 모두 갖는 단일 플라스미드에 의한 편집 효율을 평가하기 위해 완성시켰다. T280, T282, T285, T287 형질전환을 제오신(써모피셔) 50 ㎍/㎖가 보충된 M2B 플레이트상에서 선택하였다. T281 및 T286 형질전환을 제오신 50 ㎍/㎖ 및 파로모마이신(시그마) 100 ㎍/㎖이 보충된 M2B 플레이트상에서 선택하였다. 이들 형질전환 모두로부터 생성된 콜로니를 100 ㎍/㎖의 제오신 단독 또는 100 ㎍/㎖ 제오신과 100 ㎍/㎖ 파로모마이신을 함께 함유하는 선택성 M2B 플레이트상에 덧대었다. 상기 패치 플레이트상에서 확고하게 증식하고 적용 가능한 경우 백색 콜로니 표현형을 나타낸 패치들을 추가의 분석을 위해 선택하였다. 상기 균주들을 상기 패치 플레이트로부터 떼어내어 250 ㎖ 삼각 편평바닥 플라스크 중의 25 ㎖의 M50-20 배지에 접종하였다. 플라스크를 대략 24시간 동안 +27 ℃, 200 rpm에서 배양하였다. 2 ㎖의 각 접종물의 분액을 취하여 미세원심분리 튜브에 넣고, 7,500 x g에서 10분간 회전시켰다. 상등액을 경사분리시키고 펠릿을 변형된 페놀-클로로포름 추출 방법을 사용하여 게놈 DNA의 제조에 사용하였다. 생성되는 gDNA에, 상기 선택 카세트 및 관심 유전자의 존재에 대해서 PCR에 의한 분석을 가하였다. T280, T281, T285, T286에 대해서, 상기 선택 카세트의 종결자와 상기 Cas9 유전자의 시작간의 접합부를 하기를 사용하여 증폭시켰다: 2X GoTaq 그린 마스터 믹스, 프라이머 121 Tub seq F 및 pYB32/3 CR1, 5% 최종 농도로 DMSO, 및 gDNA. 하기의 조건들을 적용하였다: 95 ℃ 2분, [95 ℃ 30초, 58 ℃ 30초, 72 ℃ 1분 2초] 35주기 동안, 및 72 ℃ 5분(도 21). T281 및 T286에 대해서, 상기 gRNA 카세트의 존재를 하기와 같이 전체 gRNA 카세트의 PCR 증폭에 의해 평가하였다: 2X GoTaq 그린 마스터 믹스, 프라이머 pYB66 EF1 seq F 및 pCL122 OrfC R, 5% 최종 농도로 DMSO, 및 gDNA. 하기의 순환 조건들을 적용하였다: 95 ℃ 2분, [95 ℃ 30초, 60 ℃ 30초, 72 ℃ 47초] 35주기 동안, 및 72 ℃ 5분(도 22). T282 및 T287에 대해서, 상기 gRNA 카세트의 존재를 하기와 같이 PCR 증폭에 의해 평가하였다: 2X GoTaq 그린 마스터 믹스, 프라이머 pYB66 EF1 seq F 및 TT pYB73 HDV R, 5% 최종 농도로 DMSO, 및 gDNA. 하기의 순환 조건들을 적용하였다: 95 ℃ 2분, [95 ℃ 30초, 60.9 ℃ 30초, 72 ℃ 16초] 35주기 동안, 및 72 ℃ 5분(도 23). PCR 증폭산물을 서열분석하고 상기 유전자 편집 실험으로부터 생성된 삽입-결실의 유형을 관찰하기 위해서 Cas9 및 gRNA에 대해 PCR에 의해 양성으로 시험된 T281, T282, T286 및 T287 형질전환체에 PCR을 가하여, gRNA가 변화에 영향을 미친 CS 유전자의 부분들을 증폭시켰다. 상기 CS 유전자의 관련 부분을 증폭시키기 위한 PCR을 하기에 의해 준비하였다: 2X KOD 핫 스타 마스터 믹스, 프라이머 pYB36 CS1 F 및 pYB36 CS1 R, 5% 최종 농도로 DMSO, 및 gDNA. 하기의 순환 조건들을 적용하였다: 95 ℃ 2분, [95 ℃ 20초, 61 ℃ 10초, 70 ℃ 11초] 35주기 동안, 및 70 ℃ 5분(도 24). PCR 단편을 QIAquick PCR 정제 키트(퀴아겐)를 사용하여 컬럼 정제하고 pYB36 CS1 F 및 pYB36 CS1 R로 서열을 확인하였다.A number of transformations were performed using the particle bombardment method described above to test the effect of untargeted vectors on editing efficiency in Schizochytrium. The transformation was set up as follows: the T280-T212-3-2 strain was transformed with pYB61 and the T285-ATCC 20888 wild-type strain was transformed with pYB61. Both transformations were performed to generate recipient strains pre-expressing Cas9 for gRNA testing studies. Subsequent transformations were set up as follows: the T281-T212-3-2 strain was co-transformed with pYB61 and pYB66, and the T286-ATCC 20888 wild-type strain was co-transformed with pYB61 and pYB66. T281 and T286 transformations were performed to evaluate the efficiency of editing with plasmids not designed for targeting to any specific locus in the genome. The T282-T212-3-2 strain transformed with pYB73, and the T287-ATCC 20888 wild-type strain transformed with pYB73 were completed to evaluate the editing efficiency by a single plasmid with both elements for genome editing. . T280, T282, T285, T287 transformations were selected on M2B plates supplemented with 50 μg/ml of Zeocin (Thermo Fisher). T281 and T286 transformations were selected on M2B plates supplemented with 50 μg/ml zeocin and 100 μg/ml paromomycin (Sigma). Colonies resulting from both of these transformations were plated onto selective M2B plates containing either 100 μg/ml zeocin alone or 100 μg/ml zeocin and 100 μg/ml paromomycin. Patches that grew robustly on the patch plates and displayed a white colony phenotype where applicable were selected for further analysis. The strains were removed from the patch plate and inoculated into 25 ml of M50-20 medium in a 250 ml Erlenmeyer flat bottom flask. Flasks were incubated at +27° C., 200 rpm for approximately 24 hours. An aliquot of each inoculum of 2 ml was taken, placed in a microcentrifuge tube, and spun at 7,500 x g for 10 minutes. The supernatant was decanted and the pellet used for the preparation of genomic DNA using a modified phenol-chloroform extraction method. The resulting gDNA was subjected to analysis by PCR for the presence of the selection cassette and the gene of interest. For T280, T281, T285, T286, the junction between the terminator of the selection cassette and the start of the Cas9 gene was amplified using: 2X GoTaq Green Master Mix, primers 121 Tub seq F and pYB32/3 CR1, 5 DMSO, and gDNA as % final concentrations. The following conditions were applied: 95° C. 2 min, [95° C. 30 sec, 58° C. 30 sec, 72° C. 1 min 2 sec] for 35 cycles, and 72° C. 5 min ( FIG. 21 ). For T281 and T286, the presence of the gRNA cassette was assessed by PCR amplification of the entire gRNA cassette as follows: 2X GoTaq Green Master Mix, primers pYB66 EF1 seq F and pCL122 OrfC R, DMSO at 5% final concentration, and gDNA. The following cycling conditions were applied: 95° C. 2 min, [95° C. 30 sec, 60° C. 30 sec, 72° C. 47 sec] for 35 cycles, and 72° C. 5 min ( FIG. 22 ). For T282 and T287, the presence of the gRNA cassette was assessed by PCR amplification as follows: 2X GoTaq Green Master Mix, primers pYB66 EF1 seq F and TT pYB73 HDV R, DMSO at 5% final concentration, and gDNA. The following cycling conditions were applied: 95° C. 2 min, [95° C. 30 sec, 60.9° C. 30 sec, 72° C. 16 sec] for 35 cycles, and 72° C. 5 min ( FIG. 23 ). PCR was applied to T281, T282, T286 and T287 transformants that tested positive by PCR for Cas9 and gRNA to sequence the PCR amplicons and to observe the types of indels generated from the gene editing experiments, The gRNA amplified the parts of the CS gene that affected the change. PCR to amplify the relevant portion of the CS gene was prepared by: 2X KOD Hot Star Master Mix, primers pYB36 CS1 F and pYB36 CS1 R, DMSO at 5% final concentration, and gDNA. The following cycling conditions were applied: 95° C. 2 min, [95° C. 20 sec, 61° C. 10 sec, 70° C. 11 sec] for 35 cycles, and 70° C. 5 min ( FIG. 24 ). PCR fragments were column purified using QIAquick PCR purification kit (Qiagen) and sequenced with pYB36 CS1 F and pYB36 CS1 R.

프라이머 명칭Primer name 서열번호SEQ ID NO: 프라이머 서열, 5'→ 3'Primer sequence, 5'→ 3' pYB66 EF1 seq FpYB66 EF1 seq F 4040 GAGAGGATAGTATCTTGCGTGCTTGGAGAGGATAGTATCTTGCGTGCTTG pCL122 OrfC RpCL122 OrfC R 4141 GCAAGGTTGGAACATTACGATCAAGGCAAGGTTGGAACATTACGATCAAG TT pYB73 HDV RTT pYB73 HDV R 4747 GAAGCATGTTGCCCAGCCGAAGCATGTTGCCCAGCC pYB36 CS1 FpYB36 CS1 F 2727 GAGTCGAAGGAGACGTTGTCGGAGTCGAAGGAGACGTTGTCG pYB36 CS1 RpYB36 CS1 R 2828 GTCATTGCGAATGATGCGATATGGTCATTGCGAATGATGCGATATG

SEQUENCE LISTING <110> DSM IP Assets B.V. <120> A CRISPR-CAS SYSTEM FOR AN ALGAL HOST CELL <130> 32176-WO-PCT <140> PCT/US2017/041949 <141> 2017-07-13 <150> US 62/361,741 <151> 2016-07-13 <160> 47 <170> PatentIn version 3.5 <210> 1 <211> 9920 <212> DNA <213> Artificial Sequence <220> <223> pCL122-Cas9 vector <220> <221> misc_feature <222> (1)..(1064) <223> EF-1 alpha promoter <220> <221> misc_feature <222> (1070)..(5209) <223> Cas9 <220> <221> misc_feature <222> (5229)..(5868) <223> OrfC_terminator <220> <221> misc_feature <222> (7153)..(8013) <223> AmpR_gene <220> <221> misc_feature <222> (8403)..(8851) <223> alpha_tubulin_promoter <220> <221> misc_feature <222> (8852)..(9646) <223> paromomycin_resistance_gene <220> <221> misc_feature <222> (9647)..(9920) <223> SV40_terminator <400> 1 ctcttatctg cctcgcgccg ttgaccgccg cttgactctt ggcgcttgcc gctcgcatcc 60 tgcctcgctc gcgcaggcgg gcgggcgagt gggtgggtcc gcagccttcc gcgctcgccc 120 gctagctcgc tcgcgccgtg ctgcagccag cagggcagca ccgcacggca ggcaggtccc 180 ggcgcggatc gatcgatcca tcgatccatc gatccatcga tcgtgcggtc aaaaagaaag 240 gaagaagaaa ggaaaaagaa aggcgtgcgc acccgagtgc gcgctgagcg cccgctcgcg 300 gtcccgcgga gcctccgcgt tagtccccgc cccgcgccgc gcagtccccc gggaggcatc 360 gcgcacctct cgccgccccc tcgcgcctcg ccgattcccc gcctcccctt ttccgcttct 420 tcgccgcctc cgctcgcggc cgcgtcgccc gcgccccgct ccctatctgc tccccagggg 480 ggcactccgc accttttgcg cccgctgccg ccgccgcggc cgccccgccg ccctggtttc 540 ccccgcgagc gcggccgcgt cgccgcgcaa agactcgccg cgtgccgccc cgagcaacgg 600 gtggcggcgg cgcggcggcg ggcggggcgc ggcggcgcgt aggcggggct aggcgccggc 660 taggcgaaac gccgcccccg ggcgccgccg ccgcccgctc cagagcagtc gccgcgccag 720 accgccaacg cagagaccga gaccgaggta cgtcgcgccc gagcacgccg cgacgcgcgg 780 cagggacgag gagcacgacg ccgcgccgcg ccgcgcgggg ggggggaggg agaggcagga 840 cgcgggagcg agcgtgcatg tttccgcgcg agacgacgcc gcgcgcgctg gagaggagat 900 aaggcgcttg gatcgcgaga gggccagcca ggctggaggc gaaaatgggt ggagaggata 960 gtatcttgcg tgcttggacg aggagactga cgaggaggac ggatacgtcg atgatgatgt 1020 gcacagagaa gaagcagttc gaaagcgact actagcaagc aagggatcca tggataagaa 1080 gtactcgatc ggcctcgaca ttggcaccaa cagcgtcggc tgggccgtca ttactgatga 1140 gtacaaggtc ccgtcgaaga agtttaaggt cctcggcaac actgaccgcc actccatcaa 1200 gaagaacctc atcggtgccc tcctttttga ctccggcgag accgctgagg ccactcgcct 1260 caagcgcact gcccgccgcc gttacacccg ccgcaagaac cgcatctgct acctccagga 1320 gattttctcg aacgaaatgg ccaaggtcga tgactccttt ttccaccgtc tcgaagaatc 1380 gttcctcgtc gaggaggaca agaagcacga gcgccacccc atcttcggta acattgtcga 1440 tgaggttgcc taccacgaga agtacccgac catctaccac ctccgcaaga agctcgtcga 1500 ctccaccgac aaggccgatc tccgccttat ctacctcgcc ctcgcccaca tgatcaagtt 1560 ccgcggccac tttcttatcg agggtgatct caaccctgat aactctgacg tcgacaagct 1620 tttcatccag ctcgtccaga cttacaacca gctcttcgag gagaacccca tcaacgcttc 1680 cggcgtcgac gcgaaggcca ttctcagcgc ccgcctcagc aagtcccgcc gcctcgaaaa 1740 cctcattgcc cagcttcccg gcgagaagaa gaacggcctc ttcggcaacc tcattgccct 1800 cagccttggc ctcaccccta acttcaagtc gaactttgac ctcgccgagg acgccaagct 1860 ccagctttcc aaggacactt acgacgacga tctcgacaac ctcctcgctc agattggcga 1920 ccagtacgct gacctcttcc tcgccgccaa gaaccttagc gatgccatcc tcctctccga 1980 catccttcgt gttaacacgg aaatcacgaa ggctccgctc tccgcctcca tgatcaagcg 2040 ttacgacgag caccatcagg acctcaccct cctcaaggcc ctcgtccgcc agcagctccc 2100 cgagaagtac aaggagatct tcttcgacca gagcaagaac ggctacgccg gctacattga 2160 cggcggcgcg tcgcaggagg agttttacaa gtttatcaag cccattcttg agaagatgga 2220 cggcaccgag gagctcctcg tcaagctcaa ccgtgaggac cttctccgca agcagcgcac 2280 gttcgacaac ggctctattc cccatcagat ccacctcggt gagcttcacg cgattcttcg 2340 ccgccaggaa gacttttacc cgttcctcaa ggacaaccgc gagaagattg agaagatcct 2400 cacctttcgc attccctact acgtcggccc cctcgcccgc ggcaactcgc gctttgcttg 2460 gatgacccgc aagtccgagg agaccatcac cccgtggaac ttcgaagagg tcgtcgacaa 2520 gggcgcctcc gcgcagtctt tcatcgagcg catgactaac tttgacaaga acctcccgaa 2580 cgagaaggtc ctccccaagc acagcctcct ttacgaatac tttacggtgt acaacgagct 2640 cacgaaggtc aagtacgtca ctgagggcat gcgcaagccg gcgttccttt cgggcgagca 2700 gaagaaggct atcgtcgacc tccttttcaa gaccaaccgc aaggttaccg tcaagcagct 2760 caaggaggac tacttcaaga agatcgagtg ctttgactcg gtcgagattt cgggcgtgga 2820 ggaccgtttc aacgcctccc tcggcactta ccacgacctt ctcaagatca tcaaggacaa 2880 ggactttctc gacaacgagg agaacgagga cattctcgag gacatcgtcc tcacgctcac 2940 cctctttgag gaccgtgaga tgatcgagga gcgcctcaag acctacgccc atctctttga 3000 cgacaaggtc atgaagcagc tcaagcgccg ccgctacacc ggctggggcc gcctttcccg 3060 caagctcatc aacggcatcc gcgacaagca gtctggcaag accatccttg actttcttaa 3120 gtctgatggt ttcgccaacc gcaacttcat gcagctcatc cacgacgaca gcctcacttt 3180 caaggaggac attcagaagg cccaggtctc cggccagggt gactctctcc acgaacacat 3240 cgccaacctt gctggcagcc cggctattaa gaagggcatc ctccagaccg tcaaggtcgt 3300 cgacgagctc gtcaaggtta tgggccgcca caagcccgag aacatcgtca ttgagatggc 3360 tcgcgaaaac cagaccaccc agaagggtca gaagaactcc cgcgagcgca tgaagcgtat 3420 cgaggagggc atcaaggagc tcggcagcca gatcctcaag gagcacccgg tcgagaacac 3480 ccagctccag aacgaaaagc tctacctcta ctacctccag aacggccgtg acatgtacgt 3540 tgaccaggag ctcgacatta accgcctctc cgattacgac gtcgaccata ttgtccccca 3600 gagctttctc aaggacgaca gcatcgacaa caaggtcctc acccgctcgg acaagaaccg 3660 cggcaagtcc gacaacgtcc cttccgagga ggtcgtgaag aagatgaaga actactggcg 3720 ccagcttctc aacgctaagc ttattactca gcgcaagttc gataacctca ccaaggccga 3780 acgcggcggc ctctccgagc tcgacaaggc cggttttatc aagcgccagc tcgttgagac 3840 tcgccagatc accaagcacg tggcgcagat cctcgactcg cgcatgaaca cgaagtacga 3900 cgagaacgac aagctcatcc gcgaggtcaa ggtcatcacc cttaagtcga agctcgtgtc 3960 cgactttcgc aaggacttcc agttctacaa ggtccgtgaa attaacaact accaccacgc 4020 tcacgacgct tacctcaacg cggtcgtggg taccgcgctc atcaagaagt acccgaagct 4080 cgagtcggag tttgtctacg gcgactacaa ggtctacgac gtgcgcaaga tgatcgccaa 4140 gtccgagcag gagatcggca aggccacggc caagtacttt ttctactcca acattatgaa 4200 cttctttaag actgagatca cccttgccaa cggcgagatc cgcaagcgcc cccttatcga 4260 gaccaacggc gagaccggcg aaattgtgtg ggataagggt cgcgactttg ccaccgtccg 4320 caaggtcctc agcatgcccc aggtcaacat tgttaagaag accgaggtcc agacgggcgg 4380 ctttagcaag gagtctatcc tccccaagcg taacagcgac aagctcatcg cccgcaagaa 4440 ggactgggac cctaagaagt acggcggctt cgattcgcct acggtcgcct acagcgtcct 4500 cgtcgtcgcc aaggtcgaga agggcaagtc caagaagctc aagtccgtca aggagctcct 4560 cggcatcacg atcatggagc gctccagctt tgagaagaac cccattgact tcctcgaggc 4620 taagggttac aaggaggtca agaaggacct tatcatcaag ctccccaagt actccctctt 4680 tgagctcgaa aacggccgca agcgtatgct cgctagcgct ggcgaactcc agaagggcaa 4740 cgagctcgcc ctccccagca agtacgtcaa ctttctctac ctcgcctccc actacgagaa 4800 gctcaagggt agcccggagg ataacgagca gaagcagctt tttgtggagc agcacaagca 4860 ctaccttgac gagatcattg aacagatctc cgagttctcc aagcgtgtta ttcttgctga 4920 cgccaacctc gataaggtgc tctccgcgta caacaagcac cgcgacaagc ctatccgcga 4980 gcaggccgag aacatcatcc acctctttac cctcaccaac ctcggcgccc cggccgcctt 5040 taagtacttt gatacgacta tcgaccgcaa gcgctacact tcgactaagg aggtcctcga 5100 cgctaccctc attcaccagt ccattaccgg cctctacgag acccgcattg acctttcgca 5160 gctcggtggc gactcgcgtg cggaccctaa gaagaagcgc aaggtctaac atatgagtta 5220 tgagatccga aagtgaacct tgtcctaacc cgacagcgaa tggcgggagg gggcgggcta 5280 aaagatcgta ttacatagta tttttcccct actctttgtg tttgtctttt tttttttttt 5340 gaacgcattc aagccacttg tctgggttta cttgtttgtt tgcttgcttg cttgcttgct 5400 tgcctgcttc ttggtcagac ggcccaaaaa agggaaaaaa ttcattcatg gcacagataa 5460 gaaaaagaaa aagtttgtcg accaccgtca tcagaaagca agagaagaga aacactcgcg 5520 ctcacattct cgctcgcgta agaatcttag ccacgcatac gaagtaattt gtccatctgg 5580 cgaatcttta catgagcgtt ttcaagctgg agcgtgagat catacctttc ttgatcgtaa 5640 tgttccaacc ttgcataggc ctcgttgcga tccgctagca atgcgtcgta ctcccgttgc 5700 aactgcgcca tcgcctcatt gtgacgtgag ttcagattct tctcgagacc ttcgagcgct 5760 gctaatttcg cctgacgctc cttcttttgt gcttccatga cacgccgctt caccgtgcgt 5820 tccacttctt cctcagacat gcccttggct gcctcgacct gctcggtaaa acgggcccca 5880 gcacgtgcta cgagatttcg attccaccgc cgccttctat gaaaggttgg gcttcggaat 5940 cgttttccgg gacgccggct ggatgatcct ccagcgcggg gatctcatgc tggagttctt 6000 cgcccacccc aacttgttta ttgcagctta taatggttac aaataaagca atagcatcac 6060 aaatttcaca aataaagcat ttttttcact gcattctagt tgtggtttgt ccaaactcat 6120 caatgtatct tatcatacat ggtcgacctg caggaacctg cattaatgaa tcggccaacg 6180 cgcggggaga ggcggtttgc gtattgggcg ctcttccgct tcctcgctca ctgactcgct 6240 gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt 6300 atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc 6360 caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga 6420 gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata 6480 ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac 6540 cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg 6600 taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc 6660 cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag 6720 acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt 6780 aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta gaagaacagt 6840 atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg 6900 atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac 6960 gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca 7020 gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac 7080 ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac 7140 ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt 7200 tcgttcatcc atagttgcct gactccccgt cgtgtagata actacgatac gggagggctt 7260 accatctggc cccagtgctg caatgatacc gcgagaccca cgctcaccgg ctccagattt 7320 atcagcaata aaccagccag ccggaagggc cgagcgcaga agtggtcctg caactttatc 7380 cgcctccatc cagtctatta attgttgccg ggaagctaga gtaagtagtt cgccagttaa 7440 tagtttgcgc aacgttgttg ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg 7500 tatggcttca ttcagctccg gttcccaacg atcaaggcga gttacatgat cccccatgtt 7560 gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt gtcagaagta agttggccgc 7620 agtgttatca ctcatggtta tggcagcact gcataattct cttactgtca tgccatccgt 7680 aagatgcttt tctgtgactg gtgagtactc aaccaagtca ttctgagaat agtgtatgcg 7740 gcgaccgagt tgctcttgcc cggcgtcaat acgggataat accgcgccac atagcagaac 7800 tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga aaactctcaa ggatcttacc 7860 gctgttgaga tccagttcga tgtaacccac tcgtgcaccc aactgatctt cagcatcttt 7920 tactttcacc agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg 7980 aataagggcg acacggaaat gttgaatact catactcttc ctttttcaat attattgaag 8040 catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa 8100 acaaataggg gttccgcgca catttccccg aaaagtgcca cctgacgtct aagaaaccat 8160 tattatcatg acattaacct ataaaaatag gcgtatcacg aggccctttc gtctcgcgcg 8220 tttcggtgat gacggtgaaa acctctgaca catgcagctc ccggagacgg tcacagcttg 8280 tctgtaagcg gatgccggga gcagacaagc ccgtcagggc gcgtcagcgg gtgttggcgg 8340 gtgtcggggc tggcttaact atgcggcatc agagcagatt gtactgagag tgcaccaagc 8400 ttccaatttt aggcccccca ctgaccgagg tctgtcgata atccactttt ccattgattt 8460 tccaggtttc gttaactcat gccactgagc aaaacttcgg tctttcctaa caaaagctct 8520 cctcacaaag catggcgcgg caacggacgt gtcctcatac tccactgcca cacaaggtcg 8580 ataaactaag ctcctcacaa atagaggaga attccactga caactgaaaa caatgtatga 8640 gagacgatca ccactggagc ggcgcggcgg ttgggcgcgg aggtcggcag caaaaacaag 8700 cgactcgccg agcaaacccg aatcagcctt cagacggtcg tgcctaacaa cacgccgttc 8760 taccccgcct tcttcgcgcc ccttcgcgtc caagcatcct tcaagtttat ctctctagtt 8820 caacttcaag aagaacaaca ccaccaacac catgattgaa caagatggat tgcacgcagg 8880 ttctccggcc gcttgggtgg agaggctatt cggctatgac tgggcacaac agacaatcgg 8940 ctgctctgat gccgccgtgt tccggctgtc agcgcagggg cgcccggttc tttttgtcaa 9000 gaccgacctg tccggtgccc tgaatgaact gcaggacgag gcagcgcggc tatcgtggct 9060 ggccacgacg ggcgttcctt gcgcagctgt gctcgacgtt gtcactgaag cgggaaggga 9120 ctggctgcta ttgggcgaag tgccggggca ggatctcctg tcatctcacc ttgctcctgc 9180 cgagaaagta tccatcatgg ctgatgcaat gcggcggctg catacgcttg atccggctac 9240 ctgcccattc gaccaccaag cgaaacatcg catcgagcga gcacgtactc ggatggaagc 9300 cggtcttgtc gatcaggatg atctggacga agagcatcag gggctcgcgc cagccgaact 9360 gttcgccagg ctcaaggcgc gcatgcccga cggcgatgat ctcgtcgtga cccatggcga 9420 tgcctgcttg ccgaatatca tggtggaaaa tggccgcttt tctggattca tcgactgtgg 9480 ccggctgggt gtggcggacc gctatcagga catagcgttg gctacccgtg atattgctga 9540 agagcttggc ggcgaatggg ctgaccgctt cctcgtgctt tacggtatcg ccgctcccga 9600 ttcgcagcgc atcgccttct atcgccttct tgacgagttc ttctgacacg tgctacgaga 9660 tttcgattcc accgccgcct tctatgaaag gttgggcttc ggaatcgttt tccgggacgc 9720 cggctggatg atcctccagc gcggggatct catgctggag ttcttcgccc accccaactt 9780 gtttattgca gcttataatg gttacaaata aagcaatagc atcacaaatt tcacaaataa 9840 agcatttttt tcactgcatt ctagttgtgg tttgtccaaa ctcatcaatg tatcttatca 9900 tgtctgaatt cccggggtac 9920 <210> 2 <211> 7393 <212> DNA <213> Artificial Sequence <220> <223> pYB31 vector <220> <221> misc_feature <222> (2)..(450) <223> alpha_tubulin_promoter <220> <221> misc_feature <222> (457)..(549) <223> Sec1_signal_for_eGFP <220> <221> misc_feature <222> (550)..(1266) <223> eGFP <220> <221> misc_feature <222> (1294)..(2293) <223> OrfA_terminator <220> <221> misc_feature <222> (4971)..(5970) <223> OrfA_UP_homology <220> <221> misc_feature <222> (5974)..(6289) <223> OrfC_terminator <220> <221> misc_feature <222> (6296)..(6744) <223> alpha_tubulin_promoter <220> <221> misc_feature <222> (6745)..(7119) <223> Sh_ble_gene <220> <221> misc_feature <222> (7120)..(7393) <223> SV40_terminator <400> 2 cccaatttta ggccccccac tgaccgaggt ctgtcgataa tccacttttc cattgatttt 60 ccaggtttcg ttaactcatg ccactgagca aaacttcggt ctttcctaac aaaagctctc 120 ctcacaaagc atggcgcggc aacggacgtg tcctcatact ccactgccac acaaggtcga 180 taaactaagc tcctcacaaa tagaggagaa ttccactgac aactgaaaac aatgtatgag 240 agacgatcac cactggagcg gcgcggcggt tgggcgcgga ggtcggcagc aaaaacaagc 300 gactcgccga gcaaacccga atcagccttc agacggtcgt gcctaacaac acgccgttct 360 accccgcctt cttcgcgccc cttcgcgtcc aagcatcctt caagtttatc tctctagttc 420 aacttcaaga agaacaacac caccaacacc ggatccatga agttcgcgac ctcggtcgca 480 attttgcttg tggccaacat agccaccgcc ctcgcgcaga gcgatggctg cacccccacc 540 gaccagacga tggtgagcaa gggcgaggag ctgttcaccg gggtggtgcc catcctggtc 600 gagctggacg gcgacgtaaa cggccacaag ttcagcgtgt ccggcgaggg cgagggcgat 660 gccacctacg gcaagctgac cctgaagttc atctgcacca ccggcaagct gcccgtgccc 720 tggcccaccc tcgtgaccac cctgacctac ggcgtgcagt gcttcagccg ctaccccgac 780 cacatgaagc agcacgactt cttcaagtcc gccatgcccg aaggctacgt ccaggagcgc 840 accatcttct tcaaggacga cggcaactac aagacccgcg ccgaggtgaa gttcgagggc 900 gacaccctgg tgaaccgcat cgagctgaag ggcatcgact tcaaggagga cggcaacatc 960 ctgggacaca agctggagta caactacaac agccacaacg tctatatcat ggccgacaag 1020 cagaagaacg gcatcaaggt gaacttcaag atccgccaca acatcgagga cggcagcgtg 1080 cagctcgccg accactacca gcagaacacc cccatcggcg acggccccgt gctgctgccc 1140 gacaaccact acctgagcac ccagtccgcc ctgagcaaag accccaacga gaagcgcgat 1200 cacatggtcc tgctggagtt cgtgaccgcc gccgggatca ctctcggcat ggacgagctg 1260 tacaagcacc accatcacca ccactaacat atgatctaaa ctagtgaatc gcgacaagtt 1320 gtctttttgt tactctacgt actgctcttt ctaattttac gacgtatgct tctgctcttg 1380 acgacgacaa ccatggcaat aaaagtaagg caagaagtga gtgcgtgctc gctcacaagg 1440 tctaggccga aacgaggcgc cttaggattg gttgtctccg tcgtaagtca tgacggagcg 1500 taggacaccg acggcctgcc gcgcaaatat acgaaccgct gcacaattct tttcgttgag 1560 aacaacctcg aacggcctgc cttgctgagc tgccgacacg gtggaaggaa gcatagcggc 1620 caatcgaggg gatgctatta attaggcttg gcgctgctgc cgccgtgcct gaagatgtgc 1680 gcaagcgtgg caacaccgtc caccttgcca ttaaagtcat gcactccaaa tacctcgaac 1740 agagcctctc gcatgtttgg ccaagcacct tcgagagagt ctggactctc actccttcct 1800 caccgttgtc gccattctca gcgagctcat gcaatcacca ggacggagag acggccagct 1860 gcttttctgc tttccattgt tattagagaa acgcttctcg ctctcatcgt cttagtagac 1920 attccgatgg cttcgttcgc caatttgtca cctaagtaag ctagagtgtt aagtctaaat 1980 gcctttgacc cgcgtacggc gtcacgtaga tgcctgtcct tgccagcaaa cgctagttcg 2040 cggtgtgcgt aatttggccc gcattatgct ggctctcaaa atcaaccgcc accactcgcg 2100 gctgcacgat gattttcgtg cactcatgac atgagaaatg tgatactcaa actagtatag 2160 acctcctact cctactgctg cttttctcgt cagagctgtc tccggaaaag ttgacaagtt 2220 gttggccttc ttcttctctg ctagtaagta gatcatcatg gatgagacga tgatgataat 2280 gatgatgatg ataatttaaa tctcgagacc ttcgagcgct gctaatttcg cctgacgctc 2340 cttcttttgt gcttccatga cacgccgctt caccgtgcgt tccacttctt cctcagacat 2400 gcccttggct gcctcgacct gctcggtaaa acgggcccca gcacgtgcta cgagatttcg 2460 attccaccgc cgccttctat gaaaggttgg gcttcggaat cgttttccgg gacgccggct 2520 ggatgatcct ccagcgcggg gatctcatgc tggagttctt cgcccacccc aacttgttta 2580 ttgcagctta taatggttac aaataaagca atagcatcac aaatttcaca aataaagcat 2640 ttttttcact gcattctagt tgtggtttgt ccaaactcat caatgtatct tatcatacat 2700 ggtcgacctg caggaacctg cattaatgaa tcggccaacg cgcggggaga ggcggtttgc 2760 gtattgggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc 2820 ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata 2880 acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 2940 cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct 3000 caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa 3060 gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc 3120 tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt 3180 aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 3240 ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg 3300 cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct 3360 tgaagtggtg gcctaactac ggctacacta gaagaacagt atttggtatc tgcgctctgc 3420 tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg 3480 ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc 3540 aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt 3600 aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa 3660 aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat 3720 gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct 3780 gactccccgt cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg 3840 caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag 3900 ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc cagtctatta 3960 attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg 4020 ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg 4080 gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggttagct 4140 ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca ctcatggtta 4200 tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt tctgtgactg 4260 gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc 4320 cggcgtcaat acgggataat accgcgccac atagcagaac tttaaaagtg ctcatcattg 4380 gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga tccagttcga 4440 tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg 4500 ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat 4560 gttgaatact catactcttc ctttttcaat attattgaag catttatcag ggttattgtc 4620 tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca 4680 catttccccg aaaagtgcca cctgacgtct aagaaaccat tattatcatg acattaacct 4740 ataaaaatag gcgtatcacg aggccctttc gtctcgcgcg tttcggtgat gacggtgaaa 4800 acctctgaca catgcagctc ccggagacgg tcacagcttg tctgtaagcg gatgccggga 4860 gcagacaagc ccgtcagggc gcgtcagcgg gtgttggcgg gtgtcggggc tggcttaact 4920 atgcggcatc agagcagatt gtactgagag tgcaccaagc ttatttaaat ctttatcggt 4980 gtggcgcgcc ttgacggccg cctcggattc acttcgcagg gcacttctga cgcgctcaag 5040 cgtgcccagc gtggtgccat ctttggcctc tgcaagacca tcggcctcga gtggtccgag 5100 tctgacgtct tttcccgcgg cgtggacatt gctcagggca tgcaccccga ggatgccgcc 5160 gtggcgattg tgcgcgagat ggcgtgcgct gacattcgca ttcgcgaggt cggcattggc 5220 gcaaaccagc agcgctgcac gatccgtgcc gccaagctcg agaccggcaa cccgcagcgc 5280 cagatcgcca aggacgacgt gctgctcgtt tctggcggcg ctcgcggcat cacgcctctt 5340 tgcatccggg agatcacgcg ccagatcgcg ggcggcaagt acattctgct tggccgcagc 5400 aaggtctctg cgagcgaacc ggcatggtgc gctggcatca ctgacgagaa ggctgtgcaa 5460 aaggctgcta cccaggagct caagcgcgcc tttagcgctg gcgagggccc caagcccacg 5520 ccccgcgctg tcactaagct tgtgggctct gttcttggcg ctcgcgaggt gcgcagctct 5580 attgctgcga ttgaagcgct cggcggcaag gccatctact cgtcgtgcga cgtgaactct 5640 gccgccgacg tggccaaggc cgtgcgcgat gccgagtccc agctcggtgc ccgcgtctcg 5700 ggcatcgttc atgcctcggg cgtgctccgc gaccgtctca tcgagaagaa gctccccgac 5760 gagttcgacg ccgtctttgg caccaaggtc accggtctcg agaacctcct cgccgccgtc 5820 gaccgcgcca acctcaagca catggtcctc ttcagctcgc tcgccggctt ccacggcaac 5880 gtcggccagt ctgactacgc catggccaac gaggccctta acaagatggg cctcgagctc 5940 gccaaggacg tctcggtcaa gtcgatctgc taagaaagtg aaccttgtcc taacccgaca 6000 gcgaatggcg ggagggggcg ggctaaaaga tcgtattaca tagtattttc ccctactctt 6060 tgtgtttgtc tttttttttt ttgaacgcat tcaagccact tgtcttggtt tacttgtttg 6120 tttgcttgct tgcttgcttg cttgcctgct tcttggtcag acggacccaa aaaagggaaa 6180 aaattcattc atggcacaga taagaaaaag aaaaagtttg tcgaccaccg tcatcagaaa 6240 gcaagagaag agaaacactc gcgctcacat tctcgctcgc gtaagaatca agcttccaat 6300 tttaggcccc ccactgaccg aggtctgtcg ataatccact tttccattga ttttccaggt 6360 ttcgttaact catgccactg agcaaaactt cggtctttcc taacaaaagc tctcctcaca 6420 aagcatggcg cggcaacgga cgtgtcctca tactccactg ccacacaagg tcgataaact 6480 aagctcctca caaatagagg agaattccac tgacaactga aaacaatgta tgagagacga 6540 tcaccactgg agcggcgcgg cggttgggcg cggaggtcgg cagcaaaaac aagcgactcg 6600 ccgagcaaac ccgaatcagc cttcagacgg tcgtgcctaa caacacgccg ttctaccccg 6660 ccttcttcgc gccccttcgc gtccaagcat ccttcaagtt tatctctcta gttcaacttc 6720 aagaagaaca acaccaccaa caccatggcc aagttgacca gtgccgttcc ggtgctcacc 6780 gcgcgcgacg tcgccggagc ggtcgagttc tggaccgacc ggctcgggtt ctcccgggac 6840 ttcgtggagg acgacttcgc cggtgtggtc cgggacgacg tgaccctgtt catcagcgcg 6900 gtccaggacc aggtggtgcc ggacaacacc ctggcctggg tgtgggtgcg cggcctggac 6960 gagctgtacg ccgagtggtc ggaggtcgtg tccacgaact tccgggacgc ctccgggccg 7020 gccatgaccg agatcggcga gcagccgtgg gggcgggagt tcgccctgcg cgacccggcc 7080 ggcaactgcg tgcacttcgt ggccgaggag caggactgac acgtgctacg agatttcgat 7140 tccaccgccg ccttctatga aaggttgggc ttcggaatcg ttttccggga cgccggctgg 7200 atgatcctcc agcgcgggga tctcatgctg gagttcttcg cccaccccaa cttgtttatt 7260 gcagcttata atggttacaa ataaagcaat agcatcacaa atttcacaaa taaagcattt 7320 ttttcactgc attctagttg tggtttgtcc aaactcatca atgtatctta tcatgtctga 7380 attcccgggg tac 7393 <210> 3 <211> 10702 <212> DNA <213> Artificial Sequence <220> <223> pYB32 vector <220> <221> misc_feature <222> (2)..(450) <223> alpha tubulin promoter <220> <221> misc_feature <222> (457)..(4596) <223> Cas9 <220> <221> misc_feature <222> (4603)..(5602) <223> OrfA terminator <220> <221> misc_feature <222> (8280)..(9279) <223> OrfA UP homology <220> <221> misc_feature <222> (9283)..(9598) <223> OrfC terminator <220> <221> misc_feature <222> (9605)..(10053) <223> alpha tubulin promoter <220> <221> misc_feature <222> (10054)..(10428) <223> Sh ble gene <220> <221> misc_feature <222> (10429)..(10702) <223> SV40_terminator <400> 3 cccaatttta ggccccccac tgaccgaggt ctgtcgataa tccacttttc cattgatttt 60 ccaggtttcg ttaactcatg ccactgagca aaacttcggt ctttcctaac aaaagctctc 120 ctcacaaagc atggcgcggc aacggacgtg tcctcatact ccactgccac acaaggtcga 180 taaactaagc tcctcacaaa tagaggagaa ttccactgac aactgaaaac aatgtatgag 240 agacgatcac cactggagcg gcgcggcggt tgggcgcgga ggtcggcagc aaaaacaagc 300 gactcgccga gcaaacccga atcagccttc agacggtcgt gcctaacaac acgccgttct 360 accccgcctt cttcgcgccc cttcgcgtcc aagcatcctt caagtttatc tctctagttc 420 aacttcaaga agaacaacac caccaacacc ggatccatgg ataagaagta ctcgatcggc 480 ctcgacattg gcaccaacag cgtcggctgg gccgtcatta ctgatgagta caaggtcccg 540 tcgaagaagt ttaaggtcct cggcaacact gaccgccact ccatcaagaa gaacctcatc 600 ggtgccctcc tttttgactc cggcgagacc gctgaggcca ctcgcctcaa gcgcactgcc 660 cgccgccgtt acacccgccg caagaaccgc atctgctacc tccaggagat tttctcgaac 720 gaaatggcca aggtcgatga ctcctttttc caccgtctcg aagaatcgtt cctcgtcgag 780 gaggacaaga agcacgagcg ccaccccatc ttcggtaaca ttgtcgatga ggttgcctac 840 cacgagaagt acccgaccat ctaccacctc cgcaagaagc tcgtcgactc caccgacaag 900 gccgatctcc gccttatcta cctcgccctc gcccacatga tcaagttccg cggccacttt 960 cttatcgagg gtgatctcaa ccctgataac tctgacgtcg acaagctttt catccagctc 1020 gtccagactt acaaccagct cttcgaggag aaccccatca acgcttccgg cgtcgacgcg 1080 aaggccattc tcagcgcccg cctcagcaag tcccgccgcc tcgaaaacct cattgcccag 1140 cttcccggcg agaagaagaa cggcctcttc ggcaacctca ttgccctcag ccttggcctc 1200 acccctaact tcaagtcgaa ctttgacctc gccgaggacg ccaagctcca gctttccaag 1260 gacacttacg acgacgatct cgacaacctc ctcgctcaga ttggcgacca gtacgctgac 1320 ctcttcctcg ccgccaagaa ccttagcgat gccatcctcc tctccgacat ccttcgtgtt 1380 aacacggaaa tcacgaaggc tccgctctcc gcctccatga tcaagcgtta cgacgagcac 1440 catcaggacc tcaccctcct caaggccctc gtccgccagc agctccccga gaagtacaag 1500 gagatcttct tcgaccagag caagaacggc tacgccggct acattgacgg cggcgcgtcg 1560 caggaggagt tttacaagtt tatcaagccc attcttgaga agatggacgg caccgaggag 1620 ctcctcgtca agctcaaccg tgaggacctt ctccgcaagc agcgcacgtt cgacaacggc 1680 tctattcccc atcagatcca cctcggtgag cttcacgcga ttcttcgccg ccaggaagac 1740 ttttacccgt tcctcaagga caaccgcgag aagattgaga agatcctcac ctttcgcatt 1800 ccctactacg tcggccccct cgcccgcggc aactcgcgct ttgcttggat gacccgcaag 1860 tccgaggaga ccatcacccc gtggaacttc gaagaggtcg tcgacaaggg cgcctccgcg 1920 cagtctttca tcgagcgcat gactaacttt gacaagaacc tcccgaacga gaaggtcctc 1980 cccaagcaca gcctccttta cgaatacttt acggtgtaca acgagctcac gaaggtcaag 2040 tacgtcactg agggcatgcg caagccggcg ttcctttcgg gcgagcagaa gaaggctatc 2100 gtcgacctcc ttttcaagac caaccgcaag gttaccgtca agcagctcaa ggaggactac 2160 ttcaagaaga tcgagtgctt tgactcggtc gagatttcgg gcgtggagga ccgtttcaac 2220 gcctccctcg gcacttacca cgaccttctc aagatcatca aggacaagga ctttctcgac 2280 aacgaggaga acgaggacat tctcgaggac atcgtcctca cgctcaccct ctttgaggac 2340 cgtgagatga tcgaggagcg cctcaagacc tacgcccatc tctttgacga caaggtcatg 2400 aagcagctca agcgccgccg ctacaccggc tggggccgcc tttcccgcaa gctcatcaac 2460 ggcatccgcg acaagcagtc tggcaagacc atccttgact ttcttaagtc tgatggtttc 2520 gccaaccgca acttcatgca gctcatccac gacgacagcc tcactttcaa ggaggacatt 2580 cagaaggccc aggtctccgg ccagggtgac tctctccacg aacacatcgc caaccttgct 2640 ggcagcccgg ctattaagaa gggcatcctc cagaccgtca aggtcgtcga cgagctcgtc 2700 aaggttatgg gccgccacaa gcccgagaac atcgtcattg agatggctcg cgaaaaccag 2760 accacccaga agggtcagaa gaactcccgc gagcgcatga agcgtatcga ggagggcatc 2820 aaggagctcg gcagccagat cctcaaggag cacccggtcg agaacaccca gctccagaac 2880 gaaaagctct acctctacta cctccagaac ggccgtgaca tgtacgttga ccaggagctc 2940 gacattaacc gcctctccga ttacgacgtc gaccatattg tcccccagag ctttctcaag 3000 gacgacagca tcgacaacaa ggtcctcacc cgctcggaca agaaccgcgg caagtccgac 3060 aacgtccctt ccgaggaggt cgtgaagaag atgaagaact actggcgcca gcttctcaac 3120 gctaagctta ttactcagcg caagttcgat aacctcacca aggccgaacg cggcggcctc 3180 tccgagctcg acaaggccgg ttttatcaag cgccagctcg ttgagactcg ccagatcacc 3240 aagcacgtgg cgcagatcct cgactcgcgc atgaacacga agtacgacga gaacgacaag 3300 ctcatccgcg aggtcaaggt catcaccctt aagtcgaagc tcgtgtccga ctttcgcaag 3360 gacttccagt tctacaaggt ccgtgaaatt aacaactacc accacgctca cgacgcttac 3420 ctcaacgcgg tcgtgggtac cgcgctcatc aagaagtacc cgaagctcga gtcggagttt 3480 gtctacggcg actacaaggt ctacgacgtg cgcaagatga tcgccaagtc cgagcaggag 3540 atcggcaagg ccacggccaa gtactttttc tactccaaca ttatgaactt ctttaagact 3600 gagatcaccc ttgccaacgg cgagatccgc aagcgccccc ttatcgagac caacggcgag 3660 accggcgaaa ttgtgtggga taagggtcgc gactttgcca ccgtccgcaa ggtcctcagc 3720 atgccccagg tcaacattgt taagaagacc gaggtccaga cgggcggctt tagcaaggag 3780 tctatcctcc ccaagcgtaa cagcgacaag ctcatcgccc gcaagaagga ctgggaccct 3840 aagaagtacg gcggcttcga ttcgcctacg gtcgcctaca gcgtcctcgt cgtcgccaag 3900 gtcgagaagg gcaagtccaa gaagctcaag tccgtcaagg agctcctcgg catcacgatc 3960 atggagcgct ccagctttga gaagaacccc attgacttcc tcgaggctaa gggttacaag 4020 gaggtcaaga aggaccttat catcaagctc cccaagtact ccctctttga gctcgaaaac 4080 ggccgcaagc gtatgctcgc tagcgctggc gaactccaga agggcaacga gctcgccctc 4140 cccagcaagt acgtcaactt tctctacctc gcctcccact acgagaagct caagggtagc 4200 ccggaggata acgagcagaa gcagcttttt gtggagcagc acaagcacta ccttgacgag 4260 atcattgaac agatctccga gttctccaag cgtgttattc ttgctgacgc caacctcgat 4320 aaggtgctct ccgcgtacaa caagcaccgc gacaagccta tccgcgagca ggccgagaac 4380 atcatccacc tctttaccct caccaacctc ggcgccccgg ccgcctttaa gtactttgat 4440 acgactatcg accgcaagcg ctacacttcg actaaggagg tcctcgacgc taccctcatt 4500 caccagtcca ttaccggcct ctacgagacc cgcattgacc tttcgcagct cggtggcgac 4560 tcgcgtgcgg accctaagaa gaagcgcaag gtctaacata tgatctaaac tagtgaatcg 4620 cgacaagttg tctttttgtt actctacgta ctgctctttc taattttacg acgtatgctt 4680 ctgctcttga cgacgacaac catggcaata aaagtaaggc aagaagtgag tgcgtgctcg 4740 ctcacaaggt ctaggccgaa acgaggcgcc ttaggattgg ttgtctccgt cgtaagtcat 4800 gacggagcgt aggacaccga cggcctgccg cgcaaatata cgaaccgctg cacaattctt 4860 ttcgttgaga acaacctcga acggcctgcc ttgctgagct gccgacacgg tggaaggaag 4920 catagcggcc aatcgagggg atgctattaa ttaggcttgg cgctgctgcc gccgtgcctg 4980 aagatgtgcg caagcgtggc aacaccgtcc accttgccat taaagtcatg cactccaaat 5040 acctcgaaca gagcctctcg catgtttggc caagcacctt cgagagagtc tggactctca 5100 ctccttcctc accgttgtcg ccattctcag cgagctcatg caatcaccag gacggagaga 5160 cggccagctg cttttctgct ttccattgtt attagagaaa cgcttctcgc tctcatcgtc 5220 ttagtagaca ttccgatggc ttcgttcgcc aatttgtcac ctaagtaagc tagagtgtta 5280 agtctaaatg cctttgaccc gcgtacggcg tcacgtagat gcctgtcctt gccagcaaac 5340 gctagttcgc ggtgtgcgta atttggcccg cattatgctg gctctcaaaa tcaaccgcca 5400 ccactcgcgg ctgcacgatg attttcgtgc actcatgaca tgagaaatgt gatactcaaa 5460 ctagtataga cctcctactc ctactgctgc ttttctcgtc agagctgtct ccggaaaagt 5520 tgacaagttg ttggccttct tcttctctgc tagtaagtag atcatcatgg atgagacgat 5580 gatgataatg atgatgatga taatttaaat ctcgagacct tcgagcgctg ctaatttcgc 5640 ctgacgctcc ttcttttgtg cttccatgac acgccgcttc accgtgcgtt ccacttcttc 5700 ctcagacatg cccttggctg cctcgacctg ctcggtaaaa cgggccccag cacgtgctac 5760 gagatttcga ttccaccgcc gccttctatg aaaggttggg cttcggaatc gttttccggg 5820 acgccggctg gatgatcctc cagcgcgggg atctcatgct ggagttcttc gcccacccca 5880 acttgtttat tgcagcttat aatggttaca aataaagcaa tagcatcaca aatttcacaa 5940 ataaagcatt tttttcactg cattctagtt gtggtttgtc caaactcatc aatgtatctt 6000 atcatacatg gtcgacctgc aggaacctgc attaatgaat cggccaacgc gcggggagag 6060 gcggtttgcg tattgggcgc tcttccgctt cctcgctcac tgactcgctg cgctcggtcg 6120 ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta tccacagaat 6180 caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta 6240 aaaaggccgc gttgctggcg tttttccata ggctccgccc ccctgacgag catcacaaaa 6300 atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc 6360 cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc ggatacctgt 6420 ccgcctttct cccttcggga agcgtggcgc tttctcatag ctcacgctgt aggtatctca 6480 gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc gttcagcccg 6540 accgctgcgc cttatccggt aactatcgtc ttgagtccaa cccggtaaga cacgacttat 6600 cgccactggc agcagccact ggtaacagga ttagcagagc gaggtatgta ggcggtgcta 6660 cagagttctt gaagtggtgg cctaactacg gctacactag aagaacagta tttggtatct 6720 gcgctctgct gaagccagtt accttcggaa aaagagttgg tagctcttga tccggcaaac 6780 aaaccaccgc tggtagcggt ggtttttttg tttgcaagca gcagattacg cgcagaaaaa 6840 aaggatctca agaagatcct ttgatctttt ctacggggtc tgacgctcag tggaacgaaa 6900 actcacgtta agggattttg gtcatgagat tatcaaaaag gatcttcacc tagatccttt 6960 taaattaaaa atgaagtttt aaatcaatct aaagtatata tgagtaaact tggtctgaca 7020 gttaccaatg cttaatcagt gaggcaccta tctcagcgat ctgtctattt cgttcatcca 7080 tagttgcctg actccccgtc gtgtagataa ctacgatacg ggagggctta ccatctggcc 7140 ccagtgctgc aatgataccg cgagacccac gctcaccggc tccagattta tcagcaataa 7200 accagccagc cggaagggcc gagcgcagaa gtggtcctgc aactttatcc gcctccatcc 7260 agtctattaa ttgttgccgg gaagctagag taagtagttc gccagttaat agtttgcgca 7320 acgttgttgc cattgctaca ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat 7380 tcagctccgg ttcccaacga tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag 7440 cggttagctc cttcggtcct ccgatcgttg tcagaagtaa gttggccgca gtgttatcac 7500 tcatggttat ggcagcactg cataattctc ttactgtcat gccatccgta agatgctttt 7560 ctgtgactgg tgagtactca accaagtcat tctgagaata gtgtatgcgg cgaccgagtt 7620 gctcttgccc ggcgtcaata cgggataata ccgcgccaca tagcagaact ttaaaagtgc 7680 tcatcattgg aaaacgttct tcggggcgaa aactctcaag gatcttaccg ctgttgagat 7740 ccagttcgat gtaacccact cgtgcaccca actgatcttc agcatctttt actttcacca 7800 gcgtttctgg gtgagcaaaa acaggaaggc aaaatgccgc aaaaaaggga ataagggcga 7860 cacggaaatg ttgaatactc atactcttcc tttttcaata ttattgaagc atttatcagg 7920 gttattgtct catgagcgga tacatatttg aatgtattta gaaaaataaa caaatagggg 7980 ttccgcgcac atttccccga aaagtgccac ctgacgtcta agaaaccatt attatcatga 8040 cattaaccta taaaaatagg cgtatcacga ggccctttcg tctcgcgcgt ttcggtgatg 8100 acggtgaaaa cctctgacac atgcagctcc cggagacggt cacagcttgt ctgtaagcgg 8160 atgccgggag cagacaagcc cgtcagggcg cgtcagcggg tgttggcggg tgtcggggct 8220 ggcttaacta tgcggcatca gagcagattg tactgagagt gcaccaagct tatttaaatc 8280 tttatcggtg tggcgcgcct tgacggccgc ctcggattca cttcgcaggg cacttctgac 8340 gcgctcaagc gtgcccagcg tggtgccatc tttggcctct gcaagaccat cggcctcgag 8400 tggtccgagt ctgacgtctt ttcccgcggc gtggacattg ctcagggcat gcaccccgag 8460 gatgccgccg tggcgattgt gcgcgagatg gcgtgcgctg acattcgcat tcgcgaggtc 8520 ggcattggcg caaaccagca gcgctgcacg atccgtgccg ccaagctcga gaccggcaac 8580 ccgcagcgcc agatcgccaa ggacgacgtg ctgctcgttt ctggcggcgc tcgcggcatc 8640 acgcctcttt gcatccggga gatcacgcgc cagatcgcgg gcggcaagta cattctgctt 8700 ggccgcagca aggtctctgc gagcgaaccg gcatggtgcg ctggcatcac tgacgagaag 8760 gctgtgcaaa aggctgctac ccaggagctc aagcgcgcct ttagcgctgg cgagggcccc 8820 aagcccacgc cccgcgctgt cactaagctt gtgggctctg ttcttggcgc tcgcgaggtg 8880 cgcagctcta ttgctgcgat tgaagcgctc ggcggcaagg ccatctactc gtcgtgcgac 8940 gtgaactctg ccgccgacgt ggccaaggcc gtgcgcgatg ccgagtccca gctcggtgcc 9000 cgcgtctcgg gcatcgttca tgcctcgggc gtgctccgcg accgtctcat cgagaagaag 9060 ctccccgacg agttcgacgc cgtctttggc accaaggtca ccggtctcga gaacctcctc 9120 gccgccgtcg accgcgccaa cctcaagcac atggtcctct tcagctcgct cgccggcttc 9180 cacggcaacg tcggccagtc tgactacgcc atggccaacg aggcccttaa caagatgggc 9240 ctcgagctcg ccaaggacgt ctcggtcaag tcgatctgct aagaaagtga accttgtcct 9300 aacccgacag cgaatggcgg gagggggcgg gctaaaagat cgtattacat agtattttcc 9360 cctactcttt gtgtttgtct tttttttttt tgaacgcatt caagccactt gtcttggttt 9420 acttgtttgt ttgcttgctt gcttgcttgc ttgcctgctt cttggtcaga cggacccaaa 9480 aaagggaaaa aattcattca tggcacagat aagaaaaaga aaaagtttgt cgaccaccgt 9540 catcagaaag caagagaaga gaaacactcg cgctcacatt ctcgctcgcg taagaatcaa 9600 gcttccaatt ttaggccccc cactgaccga ggtctgtcga taatccactt ttccattgat 9660 tttccaggtt tcgttaactc atgccactga gcaaaacttc ggtctttcct aacaaaagct 9720 ctcctcacaa agcatggcgc ggcaacggac gtgtcctcat actccactgc cacacaaggt 9780 cgataaacta agctcctcac aaatagagga gaattccact gacaactgaa aacaatgtat 9840 gagagacgat caccactgga gcggcgcggc ggttgggcgc ggaggtcggc agcaaaaaca 9900 agcgactcgc cgagcaaacc cgaatcagcc ttcagacggt cgtgcctaac aacacgccgt 9960 tctaccccgc cttcttcgcg ccccttcgcg tccaagcatc cttcaagttt atctctctag 10020 ttcaacttca agaagaacaa caccaccaac accatggcca agttgaccag tgccgttccg 10080 gtgctcaccg cgcgcgacgt cgccggagcg gtcgagttct ggaccgaccg gctcgggttc 10140 tcccgggact tcgtggagga cgacttcgcc ggtgtggtcc gggacgacgt gaccctgttc 10200 atcagcgcgg tccaggacca ggtggtgccg gacaacaccc tggcctgggt gtgggtgcgc 10260 ggcctggacg agctgtacgc cgagtggtcg gaggtcgtgt ccacgaactt ccgggacgcc 10320 tccgggccgg ccatgaccga gatcggcgag cagccgtggg ggcgggagtt cgccctgcgc 10380 gacccggccg gcaactgcgt gcacttcgtg gccgaggagc aggactgaca cgtgctacga 10440 gatttcgatt ccaccgccgc cttctatgaa aggttgggct tcggaatcgt tttccgggac 10500 gccggctgga tgatcctcca gcgcggggat ctcatgctgg agttcttcgc ccaccccaac 10560 ttgtttattg cagcttataa tggttacaaa taaagcaata gcatcacaaa tttcacaaat 10620 aaagcatttt tttcactgca ttctagttgt ggtttgtcca aactcatcaa tgtatcttat 10680 catgtctgaa ttcccggggt ac 10702 <210> 4 <211> 11332 <212> DNA <213> Artificial Sequence <220> <223> pYB33 vector <220> <221> misc_feature <222> (2)..(1006) <223> CS_promoter <220> <221> misc_feature <222> (1013)..(5152) <223> Cas9 <220> <221> misc_feature <222> (5159)..(6158) <223> OrfA_terminator <220> <221> misc_feature <222> (8836)..(9835) <223> OrfA_UP_homology <220> <221> misc_feature <222> (9839)..(10154) <223> OrfC_terminator <220> <221> misc_feature <222> (10161)..(10609) <223> alpha_tubulin_promoter <220> <221> misc_feature <222> (10610)..(10984) <223> Sh_ble_gene <220> <221> misc_feature <222> (10985)..(11258) <223> SV40_terminator <400> 4 cgagcgggcg attccaccgt caactggtct tcgcctgtgc cttccagcgc agtgcgcctt 60 gccccgctcg gtcttattat tcttggcgct ctcgtcaatg tgatcatgat ctagtcgaac 120 gtgcaatcta gccaatgaaa aaagagtcca gttctatctg aatttttcac tttctaaatc 180 tcgcatcgac aatctacttt tcaaatctcg caacaaagct gatcttgttt ctccctcacc 240 cagttctatc tgaatttttc cttttctgaa gctcgcgtta acaatctact tttcgaatct 300 gtcaacaaag ctgatcttgt ttctcccccc tatccccttc cctcccccct tctttgggat 360 cttgttgtgc gtgtcgcccc ttcaacttct ttgttcgacg atgacctcca cctagcctgt 420 gaagctcatc gtctccgagt atttctggcc tgctccaatt cctctcttcc attctccatc 480 gcatacatgc atgttctttg gtctcactcc gagccatgct tcttcggtca ctacttcatc 540 tatttgacta ggcctctgtt cgagcgacga accctccgtg ttcgcgggtg ttcattctct 600 gcaaagtggt ccgtaaccgt gactaccgga cacctcgcgt acactacatt cgggacggac 660 gcggccgagc gcgacgtctc tgggcccggc ctgccgcccc cggggccgcg gcttcctcgc 720 gccgccagcc gcgtccaagt cgccagcgcg aggtcgcgcg agtcgaagga gacgttgtcg 780 atctcgaccc tcgccatgcg cgtgacgggt gaccgcctca ccggatcccg ccctccgcgc 840 gctgccttca ttccttcatt ccttcattcc ttcactcaat cctgcatcat ccatcgcccg 900 cccgcccgct cgcacgcacc agaggcgcgc attgcgggcc agggcgccgc ctgcagaccg 960 ccatcgcgcc cgccttctgc cgcgcctcgc tcgctcggag accgagggat ccatggataa 1020 gaagtactcg atcggcctcg acattggcac caacagcgtc ggctgggccg tcattactga 1080 tgagtacaag gtcccgtcga agaagtttaa ggtcctcggc aacactgacc gccactccat 1140 caagaagaac ctcatcggtg ccctcctttt tgactccggc gagaccgctg aggccactcg 1200 cctcaagcgc actgcccgcc gccgttacac ccgccgcaag aaccgcatct gctacctcca 1260 ggagattttc tcgaacgaaa tggccaaggt cgatgactcc tttttccacc gtctcgaaga 1320 atcgttcctc gtcgaggagg acaagaagca cgagcgccac cccatcttcg gtaacattgt 1380 cgatgaggtt gcctaccacg agaagtaccc gaccatctac cacctccgca agaagctcgt 1440 cgactccacc gacaaggccg atctccgcct tatctacctc gccctcgccc acatgatcaa 1500 gttccgcggc cactttctta tcgagggtga tctcaaccct gataactctg acgtcgacaa 1560 gcttttcatc cagctcgtcc agacttacaa ccagctcttc gaggagaacc ccatcaacgc 1620 ttccggcgtc gacgcgaagg ccattctcag cgcccgcctc agcaagtccc gccgcctcga 1680 aaacctcatt gcccagcttc ccggcgagaa gaagaacggc ctcttcggca acctcattgc 1740 cctcagcctt ggcctcaccc ctaacttcaa gtcgaacttt gacctcgccg aggacgccaa 1800 gctccagctt tccaaggaca cttacgacga cgatctcgac aacctcctcg ctcagattgg 1860 cgaccagtac gctgacctct tcctcgccgc caagaacctt agcgatgcca tcctcctctc 1920 cgacatcctt cgtgttaaca cggaaatcac gaaggctccg ctctccgcct ccatgatcaa 1980 gcgttacgac gagcaccatc aggacctcac cctcctcaag gccctcgtcc gccagcagct 2040 ccccgagaag tacaaggaga tcttcttcga ccagagcaag aacggctacg ccggctacat 2100 tgacggcggc gcgtcgcagg aggagtttta caagtttatc aagcccattc ttgagaagat 2160 ggacggcacc gaggagctcc tcgtcaagct caaccgtgag gaccttctcc gcaagcagcg 2220 cacgttcgac aacggctcta ttccccatca gatccacctc ggtgagcttc acgcgattct 2280 tcgccgccag gaagactttt acccgttcct caaggacaac cgcgagaaga ttgagaagat 2340 cctcaccttt cgcattccct actacgtcgg ccccctcgcc cgcggcaact cgcgctttgc 2400 ttggatgacc cgcaagtccg aggagaccat caccccgtgg aacttcgaag aggtcgtcga 2460 caagggcgcc tccgcgcagt ctttcatcga gcgcatgact aactttgaca agaacctccc 2520 gaacgagaag gtcctcccca agcacagcct cctttacgaa tactttacgg tgtacaacga 2580 gctcacgaag gtcaagtacg tcactgaggg catgcgcaag ccggcgttcc tttcgggcga 2640 gcagaagaag gctatcgtcg acctcctttt caagaccaac cgcaaggtta ccgtcaagca 2700 gctcaaggag gactacttca agaagatcga gtgctttgac tcggtcgaga tttcgggcgt 2760 ggaggaccgt ttcaacgcct ccctcggcac ttaccacgac cttctcaaga tcatcaagga 2820 caaggacttt ctcgacaacg aggagaacga ggacattctc gaggacatcg tcctcacgct 2880 caccctcttt gaggaccgtg agatgatcga ggagcgcctc aagacctacg cccatctctt 2940 tgacgacaag gtcatgaagc agctcaagcg ccgccgctac accggctggg gccgcctttc 3000 ccgcaagctc atcaacggca tccgcgacaa gcagtctggc aagaccatcc ttgactttct 3060 taagtctgat ggtttcgcca accgcaactt catgcagctc atccacgacg acagcctcac 3120 tttcaaggag gacattcaga aggcccaggt ctccggccag ggtgactctc tccacgaaca 3180 catcgccaac cttgctggca gcccggctat taagaagggc atcctccaga ccgtcaaggt 3240 cgtcgacgag ctcgtcaagg ttatgggccg ccacaagccc gagaacatcg tcattgagat 3300 ggctcgcgaa aaccagacca cccagaaggg tcagaagaac tcccgcgagc gcatgaagcg 3360 tatcgaggag ggcatcaagg agctcggcag ccagatcctc aaggagcacc cggtcgagaa 3420 cacccagctc cagaacgaaa agctctacct ctactacctc cagaacggcc gtgacatgta 3480 cgttgaccag gagctcgaca ttaaccgcct ctccgattac gacgtcgacc atattgtccc 3540 ccagagcttt ctcaaggacg acagcatcga caacaaggtc ctcacccgct cggacaagaa 3600 ccgcggcaag tccgacaacg tcccttccga ggaggtcgtg aagaagatga agaactactg 3660 gcgccagctt ctcaacgcta agcttattac tcagcgcaag ttcgataacc tcaccaaggc 3720 cgaacgcggc ggcctctccg agctcgacaa ggccggtttt atcaagcgcc agctcgttga 3780 gactcgccag atcaccaagc acgtggcgca gatcctcgac tcgcgcatga acacgaagta 3840 cgacgagaac gacaagctca tccgcgaggt caaggtcatc acccttaagt cgaagctcgt 3900 gtccgacttt cgcaaggact tccagttcta caaggtccgt gaaattaaca actaccacca 3960 cgctcacgac gcttacctca acgcggtcgt gggtaccgcg ctcatcaaga agtacccgaa 4020 gctcgagtcg gagtttgtct acggcgacta caaggtctac gacgtgcgca agatgatcgc 4080 caagtccgag caggagatcg gcaaggccac ggccaagtac tttttctact ccaacattat 4140 gaacttcttt aagactgaga tcacccttgc caacggcgag atccgcaagc gcccccttat 4200 cgagaccaac ggcgagaccg gcgaaattgt gtgggataag ggtcgcgact ttgccaccgt 4260 ccgcaaggtc ctcagcatgc cccaggtcaa cattgttaag aagaccgagg tccagacggg 4320 cggctttagc aaggagtcta tcctccccaa gcgtaacagc gacaagctca tcgcccgcaa 4380 gaaggactgg gaccctaaga agtacggcgg cttcgattcg cctacggtcg cctacagcgt 4440 cctcgtcgtc gccaaggtcg agaagggcaa gtccaagaag ctcaagtccg tcaaggagct 4500 cctcggcatc acgatcatgg agcgctccag ctttgagaag aaccccattg acttcctcga 4560 ggctaagggt tacaaggagg tcaagaagga ccttatcatc aagctcccca agtactccct 4620 ctttgagctc gaaaacggcc gcaagcgtat gctcgctagc gctggcgaac tccagaaggg 4680 caacgagctc gccctcccca gcaagtacgt caactttctc tacctcgcct cccactacga 4740 gaagctcaag ggtagcccgg aggataacga gcagaagcag ctttttgtgg agcagcacaa 4800 gcactacctt gacgagatca ttgaacagat ctccgagttc tccaagcgtg ttattcttgc 4860 tgacgccaac ctcgataagg tgctctccgc gtacaacaag caccgcgaca agcctatccg 4920 cgagcaggcc gagaacatca tccacctctt taccctcacc aacctcggcg ccccggccgc 4980 ctttaagtac tttgatacga ctatcgaccg caagcgctac acttcgacta aggaggtcct 5040 cgacgctacc ctcattcacc agtccattac cggcctctac gagacccgca ttgacctttc 5100 gcagctcggt ggcgactcgc gtgcggaccc taagaagaag cgcaaggtct aacatatgat 5160 ctaaactagt gaatcgcgac aagttgtctt tttgttactc tacgtactgc tctttctaat 5220 tttacgacgt atgcttctgc tcttgacgac gacaaccatg gcaataaaag taaggcaaga 5280 agtgagtgcg tgctcgctca caaggtctag gccgaaacga ggcgccttag gattggttgt 5340 ctccgtcgta agtcatgacg gagcgtagga caccgacggc ctgccgcgca aatatacgaa 5400 ccgctgcaca attcttttcg ttgagaacaa cctcgaacgg cctgccttgc tgagctgccg 5460 acacggtgga aggaagcata gcggccaatc gaggggatgc tattaattag gcttggcgct 5520 gctgccgccg tgcctgaaga tgtgcgcaag cgtggcaaca ccgtccacct tgccattaaa 5580 gtcatgcact ccaaatacct cgaacagagc ctctcgcatg tttggccaag caccttcgag 5640 agagtctgga ctctcactcc ttcctcaccg ttgtcgccat tctcagcgag ctcatgcaat 5700 caccaggacg gagagacggc cagctgcttt tctgctttcc attgttatta gagaaacgct 5760 tctcgctctc atcgtcttag tagacattcc gatggcttcg ttcgccaatt tgtcacctaa 5820 gtaagctaga gtgttaagtc taaatgcctt tgacccgcgt acggcgtcac gtagatgcct 5880 gtccttgcca gcaaacgcta gttcgcggtg tgcgtaattt ggcccgcatt atgctggctc 5940 tcaaaatcaa ccgccaccac tcgcggctgc acgatgattt tcgtgcactc atgacatgag 6000 aaatgtgata ctcaaactag tatagacctc ctactcctac tgctgctttt ctcgtcagag 6060 ctgtctccgg aaaagttgac aagttgttgg ccttcttctt ctctgctagt aagtagatca 6120 tcatggatga gacgatgatg ataatgatga tgatgataat ttaaatctcg agaccttcga 6180 gcgctgctaa tttcgcctga cgctccttct tttgtgcttc catgacacgc cgcttcaccg 6240 tgcgttccac ttcttcctca gacatgccct tggctgcctc gacctgctcg gtaaaacggg 6300 ccccagcacg tgctacgaga tttcgattcc accgccgcct tctatgaaag gttgggcttc 6360 ggaatcgttt tccgggacgc cggctggatg atcctccagc gcggggatct catgctggag 6420 ttcttcgccc accccaactt gtttattgca gcttataatg gttacaaata aagcaatagc 6480 atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg tttgtccaaa 6540 ctcatcaatg tatcttatca tacatggtcg acctgcagga acctgcatta atgaatcggc 6600 caacgcgcgg ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc gctcactgac 6660 tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata 6720 cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa 6780 aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct 6840 gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa 6900 agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg 6960 cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca 7020 cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa 7080 ccccccgttc agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg 7140 gtaagacacg acttatcgcc actggcagca gccactggta acaggattag cagagcgagg 7200 tatgtaggcg gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga 7260 acagtatttg gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc 7320 tcttgatccg gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag 7380 attacgcgca gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac 7440 gctcagtgga acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc 7500 ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag 7560 taaacttggt ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt 7620 ctatttcgtt catccatagt tgcctgactc cccgtcgtgt agataactac gatacgggag 7680 ggcttaccat ctggccccag tgctgcaatg ataccgcgag acccacgctc accggctcca 7740 gatttatcag caataaacca gccagccgga agggccgagc gcagaagtgg tcctgcaact 7800 ttatccgcct ccatccagtc tattaattgt tgccgggaag ctagagtaag tagttcgcca 7860 gttaatagtt tgcgcaacgt tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg 7920 tttggtatgg cttcattcag ctccggttcc caacgatcaa ggcgagttac atgatccccc 7980 atgttgtgca aaaaagcggt tagctccttc ggtcctccga tcgttgtcag aagtaagttg 8040 gccgcagtgt tatcactcat ggttatggca gcactgcata attctcttac tgtcatgcca 8100 tccgtaagat gcttttctgt gactggtgag tactcaacca agtcattctg agaatagtgt 8160 atgcggcgac cgagttgctc ttgcccggcg tcaatacggg ataataccgc gccacatagc 8220 agaactttaa aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc 8280 ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg cacccaactg atcttcagca 8340 tcttttactt tcaccagcgt ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa 8400 aagggaataa gggcgacacg gaaatgttga atactcatac tcttcctttt tcaatattat 8460 tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg tatttagaaa 8520 aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga cgtctaagaa 8580 accattatta tcatgacatt aacctataaa aataggcgta tcacgaggcc ctttcgtctc 8640 gcgcgtttcg gtgatgacgg tgaaaacctc tgacacatgc agctcccgga gacggtcaca 8700 gcttgtctgt aagcggatgc cgggagcaga caagcccgtc agggcgcgtc agcgggtgtt 8760 ggcgggtgtc ggggctggct taactatgcg gcatcagagc agattgtact gagagtgcac 8820 caagcttatt taaatcttta tcggtgtggc gcgccttgac ggccgcctcg gattcacttc 8880 gcagggcact tctgacgcgc tcaagcgtgc ccagcgtggt gccatctttg gcctctgcaa 8940 gaccatcggc ctcgagtggt ccgagtctga cgtcttttcc cgcggcgtgg acattgctca 9000 gggcatgcac cccgaggatg ccgccgtggc gattgtgcgc gagatggcgt gcgctgacat 9060 tcgcattcgc gaggtcggca ttggcgcaaa ccagcagcgc tgcacgatcc gtgccgccaa 9120 gctcgagacc ggcaacccgc agcgccagat cgccaaggac gacgtgctgc tcgtttctgg 9180 cggcgctcgc ggcatcacgc ctctttgcat ccgggagatc acgcgccaga tcgcgggcgg 9240 caagtacatt ctgcttggcc gcagcaaggt ctctgcgagc gaaccggcat ggtgcgctgg 9300 catcactgac gagaaggctg tgcaaaaggc tgctacccag gagctcaagc gcgcctttag 9360 cgctggcgag ggccccaagc ccacgccccg cgctgtcact aagcttgtgg gctctgttct 9420 tggcgctcgc gaggtgcgca gctctattgc tgcgattgaa gcgctcggcg gcaaggccat 9480 ctactcgtcg tgcgacgtga actctgccgc cgacgtggcc aaggccgtgc gcgatgccga 9540 gtcccagctc ggtgcccgcg tctcgggcat cgttcatgcc tcgggcgtgc tccgcgaccg 9600 tctcatcgag aagaagctcc ccgacgagtt cgacgccgtc tttggcacca aggtcaccgg 9660 tctcgagaac ctcctcgccg ccgtcgaccg cgccaacctc aagcacatgg tcctcttcag 9720 ctcgctcgcc ggcttccacg gcaacgtcgg ccagtctgac tacgccatgg ccaacgaggc 9780 ccttaacaag atgggcctcg agctcgccaa ggacgtctcg gtcaagtcga tctgctaaga 9840 aagtgaacct tgtcctaacc cgacagcgaa tggcgggagg gggcgggcta aaagatcgta 9900 ttacatagta ttttccccta ctctttgtgt ttgtcttttt tttttttgaa cgcattcaag 9960 ccacttgtct tggtttactt gtttgtttgc ttgcttgctt gcttgcttgc ctgcttcttg 10020 gtcagacgga cccaaaaaag ggaaaaaatt cattcatggc acagataaga aaaagaaaaa 10080 gtttgtcgac caccgtcatc agaaagcaag agaagagaaa cactcgcgct cacattctcg 10140 ctcgcgtaag aatcaagctt ccaattttag gccccccact gaccgaggtc tgtcgataat 10200 ccacttttcc attgattttc caggtttcgt taactcatgc cactgagcaa aacttcggtc 10260 tttcctaaca aaagctctcc tcacaaagca tggcgcggca acggacgtgt cctcatactc 10320 cactgccaca caaggtcgat aaactaagct cctcacaaat agaggagaat tccactgaca 10380 actgaaaaca atgtatgaga gacgatcacc actggagcgg cgcggcggtt gggcgcggag 10440 gtcggcagca aaaacaagcg actcgccgag caaacccgaa tcagccttca gacggtcgtg 10500 cctaacaaca cgccgttcta ccccgccttc ttcgcgcccc ttcgcgtcca agcatccttc 10560 aagtttatct ctctagttca acttcaagaa gaacaacacc accaacaccc caattttagg 10620 ccccccactg accgaggtct gtcgataatc cacttttcca ttgattttcc aggtttcgtt 10680 aactcatgcc actgagcaaa acttcggtct ttcctaacaa aagctctcct cacaaagcat 10740 ggcgcggcaa cggacgtgtc ctcatactcc actgccacac aaggtcgata aactaagctc 10800 ctcacaaata gaggagaatt ccactgacaa ctgaaaacaa tgtatgagag acgatcacca 10860 ctggagcggc gcggcggttg ggcgcggagg tcggcagcaa aaacaagcga ctcgccgagc 10920 aaacccgaat cagccttcag acggtcgtgc ctaacaacac gccgttctac cccgccttct 10980 tcgcgcccct tcgcgtccaa gcatccttca agtttatctc tctagttcaa cttcaagaag 11040 aacaacacca ccaacaccca cgtgctacga gatttcgatt ccaccgccgc cttctatgaa 11100 aggttgggct tcggaatcgt tttccgggac gccggctgga tgatcctcca gcgcggggat 11160 ctcatgctgg agttcttcgc ccaccccaac ttgtttattg cagcttataa tggttacaaa 11220 taaagcaata gcatcacaaa tttcacaaat aaagcatttt tttcactgca ttctagttgt 11280 ggtttgtcca aactcatcaa tgtatcttat catgtctgaa ttcccggggt ac 11332 <210> 5 <211> 4777 <212> DNA <213> Artificial Sequence <220> <223> pCL399 vector <220> <221> misc_feature <222> (1)..(2523) <223> FAS_fragment <220> <221> misc_feature <222> (2528)..(4777) <223> pSP73_fragment <400> 5 gtttaaactc gcggcgtctt cgccgtcgag gcgcgtcttt cgaggcgggc gaggtgtttt 60 tcttttcttt tcttctcgct gcagctgcgc cgcggcgaac gcagttcgcc gcggcggctg 120 cgacgcgcct gcgatgtcta tgcgcaggca aggcacgacg tcttgcggcg ccgcttcctg 180 cgccgccttg cgtcttggcc ccgccgacga cgcaagcagc ggcggcgccc ccccgccctc 240 ctccactgtg ggccgcagcc ctccttttcg cgcgcccgca gccgcgcggc gcgccccgcg 300 aacaaagagc cgccgcgccg gtccgcactg cgcgggccgc cccgcaagtg ccgcaaacgc 360 cggcccgaac cgccgcaaac gcgcccgcag ccgcgcccgc agccgcgcgc gacccgcggt 420 ggggacgcgc gccaagcgtc cccttccgcg ggatgacgta ggcggcggcc ccgcctatgc 480 aatacgggag gaaccaggaa ccgggagggg gggggggcgg cgcgcgcgcc gtccagtgcg 540 ggaccgatcg gcgccgggat gcccgggcgg gagggacaca gccaggcagt cagtcagtca 600 gccgcacaga gagagcgcgc ctgcgagtcc cgtctggtct cggaattgta tcccgcgcag 660 agctcagaat cgcaggtcga tcgatcgagc gatggatcca tcgctctatc cgtccatcga 720 tccatcgcat ccatcgcatc catcgcatcc atcgttgcat cgcttgcacc gcccgcttgc 780 atcgcgtgcg tgcgcaggcg ggcggcggcc acgacgcgac cgagagcggc ggcgagtgca 840 gacgccgccg gcgcccgcgg ctgcgtcgcc gcaggaagaa ggaggggggc gcgtgtttcc 900 cgcgggaggg aggagggagg gagggaggtg gttgggccaa aaagggcggc ctggacaggc 960 aggcaggccg gaagcgacgc cagcgagcga aggaagagga gagagccgcg cgggcggcca 1020 gcgcggggcg ggcggcagca aacccgctac tcagggtaaa agacagacag ctcttcgagc 1080 gagcttgttc acttcgcgga agcacgcgcg caggcacgca ggcacgcagg cagcatagcg 1140 agcagcagca gcatcgcgag cagcattgcg agaggaggcg ctgaccgccg gcctcgagca 1200 acaaaagaag cagcagcagc agcagcacga gcagcagcag cagcagcacg agcagcagca 1260 gcacgagcag cagcagcagc agcagcaaga tggcgcagcc cgagtcgacg acgccgacca 1320 tgacgcccga ggaaggccag atggaggggg cgccgcagca ggataatgcc caggtgaaga 1380 agcactgctt cgccgacgcc gatgtggcaa cctgcatcgc cgcctttggc ggtcagggaa 1440 gcgactggct cagtgagctg cgctccctcc aggaaaaggg ccagaccaac gtgcgagaaa 1500 ccatcgaact cgcactcgac aaactcgagg atcttgtcaa ggccgagccc tggtacgagg 1560 agcacggagg atgcgatatc cgcgcctggc tcgagagcga cgacaatgtc cccaacttcg 1620 acctcctccg ctacgcgccc gtctccttcc ccctcatctt cctcacccaa atgtgcaatt 1680 acatgcgtgt cctcgagaaa ctcggcacct gccatgaaga cgccctccaa aagggctggg 1740 tcaaggcctc gctcggacac agccagggcg tcgtctccgc cgccgtcgtt gccgcagcca 1800 acaccgaccg cgagctgcgc aacctcgtgg tctccggcct cgaatacatg tcaaaagtcg 1860 gcatcgccgc ccagcgcacg ctcgactacg agctcggacg ccgcaacgcc ggcccggaga 1920 ccccgatgct cgctgtacag ggaatggacg aaaaagtcct taccaaggcc ttcaaggccg 1980 ccgtctcgct ctccaacgag aagcaggcca tgatggccaa aatctcccca cggccgccgc 2040 cgccaccgcc gccccggccg ccgttagcga cgaagatcgc ttctccatcg ccctccgcaa 2100 cggccacgac gactttgtcg tctgcggcga gcccaaggac ctgcgcgtcc tccgcaaggt 2160 catcgagaaa cagagcgccg agcccggcaa ggaggcacag gcgcgcacgc ccttttccaa 2220 gcgcaagccc gtcacccaga ccaccttcct ccgcatgacg gccgtcttcc acagcgctct 2280 caacaaggac gccctcgccc agatcaacac atgggccccg gagtccgcct ttagcaaggc 2340 cttcgcccag gcctcgctcc gtgttcccgt ctttgacacc aagtctggcg ctaatctgca 2400 agatgttccc gccgccgatg ttgtcgccca tcttaccacc aacatgctca ctgagcgcgc 2460 cgacgttctc gtctccctcc gtgccgccga gaccaagacc gacgccagcc acctcctctg 2520 tttaaacgtt aacctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat 2580 tgggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg 2640 agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag gggataacgc 2700 aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt 2760 gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc gacgctcaag 2820 tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc 2880 cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg cctttctccc 2940 ttcgggaagc gtggcgcttt ctcaatgctc acgctgtagg tatctcagtt cggtgtaggt 3000 cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt 3060 atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc 3120 agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa 3180 gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg ctctgctgaa 3240 gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg 3300 tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga 3360 agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg 3420 gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg 3480 aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt 3540 aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact 3600 ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca gtgctgcaat 3660 gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc agccagccgg 3720 aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt ctattaattg 3780 ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat 3840 tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca gctccggttc 3900 ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg ttagctcctt 3960 cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca tggttatggc 4020 agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg tgactggtga 4080 gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct cttgcccggc 4140 gtcaatacgg gataataccg cgccacatag cagaacttta aaagtgctca tcattggaaa 4200 acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca gttcgatgta 4260 acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg tttctgggtg 4320 agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg 4380 aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt attgtctcat 4440 gagcggatac atatttgaat gtatttagaa aaataaacaa ataggggttc cgcgcacatt 4500 tccccgaaaa gtgccacctg acgtctaaga aaccattatt atcatgacat taacctataa 4560 aaataggcgt atcacgaggc cctttcgtct cgcgcgtttc ggtgatgacg gtgaaaacct 4620 ctgacacatg cagctcccgg agacggtcac agcttgtctg taagcggatg ccgggagcag 4680 acaagcccgt cagggcgcgt cagcgggtgt tggcgggtgt cggggctggc ttaactatgc 4740 ggcatcagag cagattgtac tgagagtgca ccatatg 4777 <210> 6 <211> 4775 <212> DNA <213> Artificial Sequence <220> <223> pCL400 vector <220> <221> misc_feature <222> (1)..(2528) <223> Fas_fragment <220> <221> misc_feature <222> (2533)..(4775) <223> pSP73_fragment <400> 6 ccatggttta aactcgcggc gtcttcgccg tcgaggcgcg tctttcgagg cgggcgaggt 60 gtttttcttt tcttttcttc tcgctgcagc tgcgccgcgg cgaacgcagt tcgccgcggc 120 ggctgcgacg cgcctgcgat gtctatgcgc aggcaaggca cgacgtcttg cggcgccgct 180 tcctgcgccg ccttgcgtct tggccccgcc gacgacgcaa gcagcggcgg cgcccccccg 240 ccctcctcca ctgtgggccg cagccctcct tttcgcgcgc ccgcagccgc gcggcgcgcc 300 ccgcgaacaa agagccgccg cgccggtccg cactgcgcgg gccgccccgc aagtgccgca 360 aacgccggcc cgaaccgccg caaacgcgcc cgcagccgcg cccgcagccg cgcgcgaccc 420 gcggtgggga cgcgcgccaa gcgtcccctt ccgcgggatg acgtaggcgg cggccccgcc 480 tatgcaatac gggaggaacc aggaaccggg aggggggggg ggcggcgcgc gcgccgtcca 540 gtgcgggacc gatcggcgcc gggatgcccg ggcgggaggg acacagccag gcagtcagtc 600 agtcagccgc acagagagag cgcgcctgcg agtcccgtct ggtctcggaa ttgtatcccg 660 cgcagagctc agaatcgcag gtcgatcgat cgagcgatgg atccatcgct ctatccgtcc 720 atcgatccat cgcatccatc gcatccatcg catccatcgt tgcatcgctt gcaccgcccg 780 cttgcatcgc gtgcgtgcgc aggcgggcgg cggccacgac gcgaccgaga gcggcggcga 840 gtgcagacgc cgccggcgcc cgcggctgcg tcgccgcagg aagaaggagg ggggcgcgtg 900 tttcccgcgg gagggaggag ggagggaggg aggtggttgg gccaaaaagg gcggcctgga 960 caggcaggca ggccggaagc gacgccagcg agcgaaggaa gaggagagag ccgcgcgggc 1020 ggccagcgcg gggcgggcgg cagcaaaccc gctactcagg gtaaaagaca gacagctctt 1080 cgagcgagct tgttcacttc gcggaagcac gcgcgcaggc acgcaggcac gcaggcagca 1140 tagcgagcag cagcagcatc gcgagcagca ttgcgagagg aggcgctgac cgccggcctc 1200 gagcaacaaa agaagcagca gcagcagcag cacgagcagc agcagcagca gcacgagcag 1260 cagcagcacg agcagcagca gcagcagcag caagatggcg cagcccgagt cgacgacgcc 1320 gaccatgacg cccgaggaag gccagatgga gggggcgccg cagcaggata atgcccaggt 1380 gaagaagcac tgcttcgccg acgccgatgt ggcaacctgc atcgccgcct ttggcggtca 1440 gggaagcgac tggctcagtg agctgcgctc cctccaggaa aagggccaga ccaacgtgcg 1500 agaaaccatc gaactcgcac tcgacaaact cgaggatctt gtcaaggccg agccctggta 1560 cgaggagcac ggaggatgcg atatccgcgc ctggctcgag agcgacgaca atgtccccaa 1620 cttcgacctc ctccgctacg cgcccgtctc cttccccctc atcttcctca cccaaatgtg 1680 caattacatg cgtgtcctcg agaaactcgg cacctgccat gaagacgccc tccaaaaggg 1740 ctgggtcaag gcctcgctcg gacacagcca gggcgtcgtc tccgccgccg tcgttgccgc 1800 agccaacacc gaccgcgagc tgcgcaacct cgtggtctcc ggcctcgaat acatgtcaaa 1860 agtcggcatc gccgcccagc gcacgctcga ctacgagctc ggacgccgca acgccggccc 1920 ggagaccccg atgctcgctg tacagggaat ggacgaaaaa gtccttacca aggccttcaa 1980 ggccgccgtc tcgctctcca acgagaagca ggccatgatg gccaaaatct ccccacggcc 2040 gccgccgcca ccgccgcccc ggccgccgtt agcgacgaag atcgcttctc catcgccctc 2100 cgcaacggcc acgacgactt tgtcgtctgc ggcgagccca aggacctgcg cgtcctccgc 2160 aaggtcatcg agaaacagag cgccgagccc ggcaaggagg cacaggcgcg cacgcccttt 2220 tccaagcgca agcccgtcac ccagaccacc ttcctccgca tgacggccgt cttccacagc 2280 gctctcaaca aggacgccct cgcccagatc aacacatggg ccccggagtc cgcctttagc 2340 aaggccttcg cccaggcctc gctccgtgtt cccgtctttg acaccaagtc tggcgctaat 2400 ctgcaagatg ttcccgccgc cgatgttgtc gcccatctta ccaccaacat gctcactgag 2460 cgcgccgacg ttctcgtctc cctccgtgcc gccgagacca agaccgacgc cagccacctc 2520 ctctgtttaa acgttaacct gcattaatga atcggccaac gcgcggggag aggcggtttg 2580 cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg 2640 cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat 2700 aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc 2760 gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc 2820 tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga 2880 agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt 2940 ctcccttcgg gaagcgtggc gctttctcaa tgctcacgct gtaggtatct cagttcggtg 3000 taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc 3060 gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg 3120 gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc 3180 ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat ctgcgctctg 3240 ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc 3300 gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct 3360 caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga aaactcacgt 3420 taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct tttaaattaa 3480 aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccaa 3540 tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc catagttgcc 3600 tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg ccccagtgct 3660 gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat aaaccagcca 3720 gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat ccagtctatt 3780 aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg caacgttgtt 3840 gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc 3900 ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa agcggttagc 3960 tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc actcatggtt 4020 atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt ttctgtgact 4080 ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc 4140 ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt gctcatcatt 4200 ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag atccagttcg 4260 atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac cagcgtttct 4320 gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa 4380 tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca gggttattgt 4440 ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc 4500 acatttcccc gaaaagtgcc acctgacgtc taagaaacca ttattatcat gacattaacc 4560 tataaaaata ggcgtatcac gaggcccttt cgtctcgcgc gtttcggtga tgacggtgaa 4620 aacctctgac acatgcagct cccggagacg gtcacagctt gtctgtaagc ggatgccggg 4680 agcagacaag cccgtcaggg cgcgtcagcg ggtgttggcg ggtgtcgggg ctggcttaac 4740 tatgcggcat cagagcagat tgtactgaga gtgca 4775 <210> 7 <211> 5991 <212> DNA <213> Artificial Sequence <220> <223> pCL401 vector <220> <221> misc_feature <222> (1)..(1063) <223> EF-1_alpha_promoter <220> <221> misc_feature <222> (1076)..(1112) <223> Hammerhead_ribozyme <220> <221> misc_feature <222> (1113)..(1132) <223> carotene_synthase_target_1 <220> <221> misc_feature <222> (1133)..(1212) <223> gRNA <220> <221> misc_feature <222> (1213)..(1280) <223> HDV_ribozyme <220> <221> misc_feature <222> (1300)..(1939) <223> OrfC_terminator <220> <221> misc_feature <222> (4474)..(4922) <223> alpha_tubulin_promoter <220> <221> misc_feature <222> (4923)..(5717) <223> paromomycin_resistance_gene <220> <221> misc_feature <222> (5718)..(5991) <223> SV40_terminator <400> 7 ctcttatctg cctcgcgccg ttgaccgccg cttgactctt ggcgcttgcc gctcgcatcc 60 tgcctcgctc gcgcaggcgg gcgggcgagt gggtgggtcc gcagccttcc gcgctcgccc 120 gctagctcgc tcgcgccgtg ctgcagccag cagggcagca ccgcacggca ggcaggtccc 180 ggcgcggatc gatcgatcca tcgatccatc gatccatcga tcgtgcggtc aaaaagaaag 240 gaagaagaaa ggaaaaagaa aggcgtgcgc acccgagtgc gcgctgagcg cccgctcgcg 300 gtcccgcgga gcctccgcgt tagtccccgc cccgcgccgc gcagtccccc gggaggcatc 360 gcgcacctct cgccgccccc tcgcgcctcg ccgattcccc gcctcccctt ttccgcttct 420 tcgccgcctc cgctcgcggc cgcgtcgccc gcgccccgct ccctatctgc tccccagggg 480 ggcactccgc accttttgcg cccgctgccg ccgccgcggc cgccccgccg ccctggtttc 540 ccccgcgagc gcggccgcgt cgccgcgcaa agactcgccg cgtgccgccc cgagcaacgg 600 gtggcggcgg cgcggcggcg ggcggggcgc ggcggcgcgt aggcggggct aggcgccggc 660 taggcgaaac gccgcccccg ggcgccgccg ccgcccgctc cagagcagtc gccgcgccag 720 accgccaacg cagagaccga gaccgaggta cgtcgcgccc gagcacgccg cgacgcgcgg 780 cagggacgag gagcacgacg ccgcgccgcg ccgcgcgggg ggggggaggg agaggcagga 840 cgcgggagcg agcgtgcatg tttccgcgcg agacgacgcc gcgcgcgctg gagaggagat 900 aaggcgcttg gatcgcgaga gggccagcca ggctggaggc gaaaatgggt ggagaggata 960 gtatcttgcg tgcttggacg aggagactga cgaggaggac ggatacgtcg atgatgatgt 1020 gcacagagaa gaagcagttc gaaagcgact actagcaagc aagagatcta gcacgctgat 1080 gagtccgtga ggacgaaacg agtaagctcg tctgcggacg tcgtggacgc gcgttttaga 1140 gctagaaata gcaagttaaa ataaggctag tccgttatca acttgaaaaa gtggcaccga 1200 gtcggtgctt ttggccggca tggtcccagc ctcctcgctg gcgccggctg ggcaacatgc 1260 ttcggcatgg cgaatgggac catatgagtt atgagatccg aaagtgaacc ttgtcctaac 1320 ccgacagcga atggcgggag ggggcgggct aaaagatcgt attacatagt atttttcccc 1380 tactctttgt gtttgtcttt tttttttttt tgaacgcatt caagccactt gtctgggttt 1440 acttgtttgt ttgcttgctt gcttgcttgc ttgcctgctt cttggtcaga cggcccaaaa 1500 aagggaaaaa attcattcat ggcacagata agaaaaagaa aaagtttgtc gaccaccgtc 1560 atcagaaagc aagagaagag aaacactcgc gctcacattc tcgctcgcgt aagaatctta 1620 gccacgcata cgaagtaatt tgtccatctg gcgaatcttt acatgagcgt tttcaagctg 1680 gagcgtgaga tcataccttt cttgatcgta atgttccaac cttgcatagg cctcgttgcg 1740 atccgctagc aatgcgtcgt actcccgttg caactgcgcc atcgcctcat tgtgacgtga 1800 gttcagattc ttctcgagac cttcgagcgc tgctaatttc gcctgacgct ccttcttttg 1860 tgcttccatg acacgccgct tcaccgtgcg ttccacttct tcctcagaca tgcccttggc 1920 tgcctcgacc tgctcggtaa aacgggcccc agcacgtgct acgagatttc gattccaccg 1980 ccgccttcta tgaaaggttg ggcttcggaa tcgttttccg ggacgccggc tggatgatcc 2040 tccagcgcgg ggatctcatg ctggagttct tcgcccaccc caacttgttt attgcagctt 2100 ataatggtta caaataaagc aatagcatca caaatttcac aaataaagca tttttttcac 2160 tgcattctag ttgtggtttg tccaaactca tcaatgtatc ttatcataca tggtcgacct 2220 gcaggaacct gcattaatga atcggccaac gcgcggggag aggcggtttg cgtattgggc 2280 gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg 2340 tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa 2400 agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg 2460 cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga 2520 ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg 2580 tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg 2640 gaagcgtggc gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc 2700 gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg 2760 gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca 2820 ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt 2880 ggcctaacta cggctacact agaagaacag tatttggtat ctgcgctctg ctgaagccag 2940 ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg 3000 gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc 3060 ctttgatctt ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt 3120 tggtcatgag attatcaaaa aggatcttca cctagatcct tttaaattaa aaatgaagtt 3180 ttaaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca 3240 gtgaggcacc tatctcagcg atctgtctat ttcgttcatc catagttgcc tgactccccg 3300 tcgtgtagat aactacgata cgggagggct taccatctgg ccccagtgct gcaatgatac 3360 cgcgagaccc acgctcaccg gctccagatt tatcagcaat aaaccagcca gccggaaggg 3420 ccgagcgcag aagtggtcct gcaactttat ccgcctccat ccagtctatt aattgttgcc 3480 gggaagctag agtaagtagt tcgccagtta atagtttgcg caacgttgtt gccattgcta 3540 caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc ggttcccaac 3600 gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc 3660 ctccgatcgt tgtcagaagt aagttggccg cagtgttatc actcatggtt atggcagcac 3720 tgcataattc tcttactgtc atgccatccg taagatgctt ttctgtgact ggtgagtact 3780 caaccaagtc attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa 3840 tacgggataa taccgcgcca catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt 3900 cttcggggcg aaaactctca aggatcttac cgctgttgag atccagttcg atgtaaccca 3960 ctcgtgcacc caactgatct tcagcatctt ttactttcac cagcgtttct gggtgagcaa 4020 aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa tgttgaatac 4080 tcatactctt cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg 4140 gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc acatttcccc 4200 gaaaagtgcc acctgacgtc taagaaacca ttattatcat gacattaacc tataaaaata 4260 ggcgtatcac gaggcccttt cgtctcgcgc gtttcggtga tgacggtgaa aacctctgac 4320 acatgcagct cccggagacg gtcacagctt gtctgtaagc ggatgccggg agcagacaag 4380 cccgtcaggg cgcgtcagcg ggtgttggcg ggtgtcgggg ctggcttaac tatgcggcat 4440 cagagcagat tgtactgaga gtgcaccaag cttccaattt taggcccccc actgaccgag 4500 gtctgtcgat aatccacttt tccattgatt ttccaggttt cgttaactca tgccactgag 4560 caaaacttcg gtctttccta acaaaagctc tcctcacaaa gcatggcgcg gcaacggacg 4620 tgtcctcata ctccactgcc acacaaggtc gataaactaa gctcctcaca aatagaggag 4680 aattccactg acaactgaaa acaatgtatg agagacgatc accactggag cggcgcggcg 4740 gttgggcgcg gaggtcggca gcaaaaacaa gcgactcgcc gagcaaaccc gaatcagcct 4800 tcagacggtc gtgcctaaca acacgccgtt ctaccccgcc ttcttcgcgc cccttcgcgt 4860 ccaagcatcc ttcaagttta tctctctagt tcaacttcaa gaagaacaac accaccaaca 4920 ccatgattga acaagatgga ttgcacgcag gttctccggc cgcttgggtg gagaggctat 4980 tcggctatga ctgggcacaa cagacaatcg gctgctctga tgccgccgtg ttccggctgt 5040 cagcgcaggg gcgcccggtt ctttttgtca agaccgacct gtccggtgcc ctgaatgaac 5100 tgcaggacga ggcagcgcgg ctatcgtggc tggccacgac gggcgttcct tgcgcagctg 5160 tgctcgacgt tgtcactgaa gcgggaaggg actggctgct attgggcgaa gtgccggggc 5220 aggatctcct gtcatctcac cttgctcctg ccgagaaagt atccatcatg gctgatgcaa 5280 tgcggcggct gcatacgctt gatccggcta cctgcccatt cgaccaccaa gcgaaacatc 5340 gcatcgagcg agcacgtact cggatggaag ccggtcttgt cgatcaggat gatctggacg 5400 aagagcatca ggggctcgcg ccagccgaac tgttcgccag gctcaaggcg cgcatgcccg 5460 acggcgatga tctcgtcgtg acccatggcg atgcctgctt gccgaatatc atggtggaaa 5520 atggccgctt ttctggattc atcgactgtg gccggctggg tgtggcggac cgctatcagg 5580 acatagcgtt ggctacccgt gatattgctg aagagcttgg cggcgaatgg gctgaccgct 5640 tcctcgtgct ttacggtatc gccgctcccg attcgcagcg catcgccttc tatcgccttc 5700 ttgacgagtt cttctgacac gtgctacgag atttcgattc caccgccgcc ttctatgaaa 5760 ggttgggctt cggaatcgtt ttccgggacg ccggctggat gatcctccag cgcggggatc 5820 tcatgctgga gttcttcgcc caccccaact tgtttattgc agcttataat ggttacaaat 5880 aaagcaatag catcacaaat ttcacaaata aagcattttt ttcactgcat tctagttgtg 5940 gtttgtccaa actcatcaat gtatcttatc atgtctgaat tcccggggta c 5991 <210> 8 <211> 7612 <212> DNA <213> Artificial Sequence <220> <223> pCL402 vector <220> <221> misc_feature <222> (5)..(1198) <223> FAS_upstream_fragment <220> <221> misc_feature <222> (1199)..(1647) <223> alpha_tubulin_promoter <220> <221> misc_feature <222> (1648)..(2442) <223> paromomycin_resistance_gene <220> <221> misc_feature <222> (2443)..(2716) <223> SV40_terminator <220> <221> misc_feature <222> (2717)..(3779) <223> EF-1_alpha_promoter <220> <221> misc_feature <222> (3792)..(3828) <223> Hammerhead_ribozyme <220> <221> misc_feature <222> (3829)..(3848) <223> carotene_synthase_target_1 <220> <221> misc_feature <222> (3849)..(3928) <223> gRNA <220> <221> misc_feature <222> (3929)..(3996) <223> HDV_ribozyme <220> <221> misc_feature <222> (4016)..(4534) <223> OrfC_terminator <220> <221> misc_feature <222> (4535)..(5360) <223> FAS_downstream_fragment <220> <221> misc_feature <222> (5365)..(7612) <223> pSP73 <400> 8 gtttaaactc gcggcgtctt cgccgtcgag gcgcgtcttt cgaggcgggc gaggtgtttt 60 tcttttcttt tcttctcgct gcagctgcgc cgcggcgaac gcagttcgcc gcggcggctg 120 cgacgcgcct gcgatgtcta tgcgcaggca aggcacgacg tcttgcggcg ccgcttcctg 180 cgccgccttg cgtcttggcc ccgccgacga cgcaagcagc ggcggcgccc ccccgccctc 240 ctccactgtg ggccgcagcc ctccttttcg cgcgcccgca gccgcgcggc gcgccccgcg 300 aacaaagagc cgccgcgccg gtccgcactg cgcgggccgc cccgcaagtg ccgcaaacgc 360 cggcccgaac cgccgcaaac gcgcccgcag ccgcgcccgc agccgcgcgc gacccgcggt 420 ggggacgcgc gccaagcgtc cccttccgcg ggatgacgta ggcggcggcc ccgcctatgc 480 aatacgggag gaaccaggaa ccgggagggg gggggggcgg cgcgcgcgcc gtccagtgcg 540 ggaccgatcg gcgccgggat gcccgggcgg gagggacaca gccaggcagt cagtcagtca 600 gccgcacaga gagagcgcgc ctgcgagtcc cgtctggtct cggaattgta tcccgcgcag 660 agctcagaat cgcaggtcga tcgatcgagc gatggatcca tcgctctatc cgtccatcga 720 tccatcgcat ccatcgcatc catcgcatcc atcgttgcat cgcttgcacc gcccgcttgc 780 atcgcgtgcg tgcgcaggcg ggcggcggcc acgacgcgac cgagagcggc ggcgagtgca 840 gacgccgccg gcgcccgcgg ctgcgtcgcc gcaggaagaa ggaggggggc gcgtgtttcc 900 cgcgggaggg aggagggagg gagggaggtg gttgggccaa aaagggcggc ctggacaggc 960 aggcaggccg gaagcgacgc cagcgagcga aggaagagga gagagccgcg cgggcggcca 1020 gcgcggggcg ggcggcagca aacccgctac tcagggtaaa agacagacag ctcttcgagc 1080 gagcttgttc acttcgcgga agcacgcgcg caggcacgca ggcacgcagg cagcatagcg 1140 agcagcagca gcatcgcgag cagcattgcg agaggaggcg ctgaccgccg gcctcgagcc 1200 aattttaggc cccccactga ccgaggtctg tcgataatcc acttttccat tgattttcca 1260 ggtttcgtta actcatgcca ctgagcaaaa cttcggtctt tcctaacaaa agctctcctc 1320 acaaagcatg gcgcggcaac ggacgtgtcc tcatactcca ctgccacaca aggtcgataa 1380 actaagctcc tcacaaatag aggagaattc cactgacaac tgaaaacaat gtatgagaga 1440 cgatcaccac tggagcggcg cggcggttgg gcgcggaggt cggcagcaaa aacaagcgac 1500 tcgccgagca aacccgaatc agccttcaga cggtcgtgcc taacaacacg ccgttctacc 1560 ccgccttctt cgcgcccctt cgcgtccaag catccttcaa gtttatctct ctagttcaac 1620 ttcaagaaga acaacaccac caacaccatg attgaacaag atggattgca cgcaggttct 1680 ccggccgctt gggtggagag gctattcggc tatgactggg cacaacagac aatcggctgc 1740 tctgatgccg ccgtgttccg gctgtcagcg caggggcgcc cggttctttt tgtcaagacc 1800 gacctgtccg gtgccctgaa tgaactgcag gacgaggcag cgcggctatc gtggctggcc 1860 acgacgggcg ttccttgcgc agctgtgctc gacgttgtca ctgaagcggg aagggactgg 1920 ctgctattgg gcgaagtgcc ggggcaggat ctcctgtcat ctcaccttgc tcctgccgag 1980 aaagtatcca tcatggctga tgcaatgcgg cggctgcata cgcttgatcc ggctacctgc 2040 ccattcgacc accaagcgaa acatcgcatc gagcgagcac gtactcggat ggaagccggt 2100 cttgtcgatc aggatgatct ggacgaagag catcaggggc tcgcgccagc cgaactgttc 2160 gccaggctca aggcgcgcat gcccgacggc gatgatctcg tcgtgaccca tggcgatgcc 2220 tgcttgccga atatcatggt ggaaaatggc cgcttttctg gattcatcga ctgtggccgg 2280 ctgggtgtgg cggaccgcta tcaggacata gcgttggcta cccgtgatat tgctgaagag 2340 cttggcggcg aatgggctga ccgcttcctc gtgctttacg gtatcgccgc tcccgattcg 2400 cagcgcatcg ccttctatcg ccttcttgac gagttcttct gacacgtgct acgagatttc 2460 gattccaccg ccgccttcta tgaaaggttg ggcttcggaa tcgttttccg ggacgccggc 2520 tggatgatcc tccagcgcgg ggatctcatg ctggagttct tcgcccaccc caacttgttt 2580 attgcagctt ataatggtta caaataaagc aatagcatca caaatttcac aaataaagca 2640 tttttttcac tgcattctag ttgtggtttg tccaaactca tcaatgtatc ttatcatgtc 2700 tgaattcccg gggtacctct tatctgcctc gcgccgttga ccgccgcttg actcttggcg 2760 cttgccgctc gcatcctgcc tcgctcgcgc aggcgggcgg gcgagtgggt gggtccgcag 2820 ccttccgcgc tcgcccgcta gctcgctcgc gccgtgctgc agccagcagg gcagcaccgc 2880 acggcaggca ggtcccggcg cggatcgatc gatccatcga tccatcgatc catcgatcgt 2940 gcggtcaaaa agaaaggaag aagaaaggaa aaagaaaggc gtgcgcaccc gagtgcgcgc 3000 tgagcgcccg ctcgcggtcc cgcggagcct ccgcgttagt ccccgccccg cgccgcgcag 3060 tcccccggga ggcatcgcgc acctctcgcc gccccctcgc gcctcgccga ttccccgcct 3120 ccccttttcc gcttcttcgc cgcctccgct cgcggccgcg tcgcccgcgc cccgctccct 3180 atctgctccc caggggggca ctccgcacct tttgcgcccg ctgccgccgc cgcggccgcc 3240 ccgccgccct ggtttccccc gcgagcgcgg ccgcgtcgcc gcgcaaagac tcgccgcgtg 3300 ccgccccgag caacgggtgg cggcggcgcg gcggcgggcg gggcgcggcg gcgcgtaggc 3360 ggggctaggc gccggctagg cgaaacgccg cccccgggcg ccgccgccgc ccgctccaga 3420 gcagtcgccg cgccagaccg ccaacgcaga gaccgagacc gaggtacgtc gcgcccgagc 3480 acgccgcgac gcgcggcagg gacgaggagc acgacgccgc gccgcgccgc gcgggggggg 3540 ggagggagag gcaggacgcg ggagcgagcg tgcatgtttc cgcgcgagac gacgccgcgc 3600 gcgctggaga ggagataagg cgcttggatc gcgagagggc cagccaggct ggaggcgaaa 3660 atgggtggag aggatagtat cttgcgtgct tggacgagga gactgacgag gaggacggat 3720 acgtcgatga tgatgtgcac agagaagaag cagttcgaaa gcgactacta gcaagcaaga 3780 gatctagcac gctgatgagt ccgtgaggac gaaacgagta agctcgtctg cggacgtcgt 3840 ggacgcgcgt tttagagcta gaaatagcaa gttaaaataa ggctagtccg ttatcaactt 3900 gaaaaagtgg caccgagtcg gtgcttttgg ccggcatggt cccagcctcc tcgctggcgc 3960 cggctgggca acatgcttcg gcatggcgaa tgggaccata tgagttatga gatccgaaag 4020 tgaaccttgt cctaacccga cagcgaatgg cgggaggggg cgggctaaaa gatcgtatta 4080 catagtattt ttcccctact ctttgtgttt gtcttttttt tttttttgaa cgcattcaag 4140 ccacttgtct gggtttactt gtttgtttgc ttgcttgctt gcttgcttgc ctgcttcttg 4200 gtcagacggc ccaaaaaagg gaaaaaattc attcatggca cagataagaa aaagaaaaag 4260 tttgtcgacc accgtcatca gaaagcaaga gaagagaaac actcgcgctc acattctcgc 4320 tcgcgtaaga atcttagcca cgcatacgaa gtaatttgtc catctggcga atctttacat 4380 gagcgttttc aagctggagc gtgagatcat acctttcttg atcgtaatgt tccaaccttg 4440 cataggcctc gttgcgatcc gctagcaatg cgtcgtactc ccgttgcaac tgcgccatcg 4500 cctcattgtg acgtgagttc agattcttct cgagaaactc ggcacctgcc atgaagacgc 4560 cctccaaaag ggctgggtca aggcctcgct cggacacagc cagggcgtcg tctccgccgc 4620 cgtcgttgcc gcagccaaca ccgaccgcga gctgcgcaac ctcgtggtct ccggcctcga 4680 atacatgtca aaagtcggca tcgccgccca gcgcacgctc gactacgagc tcggacgccg 4740 caacgccggc ccggagaccc cgatgctcgc tgtacaggga atggacgaaa aagtccttac 4800 caaggccttc aaggccgccg tctcgctctc caacgagaag caggccatga tggccaaaat 4860 ctccccacgg ccgccgccgc caccgccgcc ccggccgccg ttagcgacga agatcgcttc 4920 tccatcgccc tccgcaacgg ccacgacgac tttgtcgtct gcggcgagcc caaggacctg 4980 cgcgtcctcc gcaaggtcat cgagaaacag agcgccgagc ccggcaagga ggcacaggcg 5040 cgcacgccct tttccaagcg caagcccgtc acccagacca ccttcctccg catgacggcc 5100 gtcttccaca gcgctctcaa caaggacgcc ctcgcccaga tcaacacatg ggccccggag 5160 tccgccttta gcaaggcctt cgcccaggcc tcgctccgtg ttcccgtctt tgacaccaag 5220 tctggcgcta atctgcaaga tgttcccgcc gccgatgttg tcgcccatct taccaccaac 5280 atgctcactg agcgcgccga cgttctcgtc tccctccgtg ccgccgagac caagaccgac 5340 gccagccacc tcctctgttt aaacgttaac ctgcattaat gaatcggcca acgcgcgggg 5400 agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg 5460 gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca 5520 gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac 5580 cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac 5640 aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg 5700 tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac 5760 ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc aatgctcacg ctgtaggtat 5820 ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag 5880 cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac 5940 ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt 6000 gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt 6060 atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc 6120 aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga 6180 aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac 6240 gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc 6300 cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct 6360 gacagttacc aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca 6420 tccatagttg cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct 6480 ggccccagtg ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca 6540 ataaaccagc cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc 6600 atccagtcta ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg 6660 cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct 6720 tcattcagct ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa 6780 aaagcggtta gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta 6840 tcactcatgg ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc 6900 ttttctgtga ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg 6960 agttgctctt gcccggcgtc aatacgggat aataccgcgc cacatagcag aactttaaaa 7020 gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg 7080 agatccagtt cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc 7140 accagcgttt ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg 7200 gcgacacgga aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat 7260 cagggttatt gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata 7320 ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg tctaagaaac cattattatc 7380 atgacattaa cctataaaaa taggcgtatc acgaggccct ttcgtctcgc gcgtttcggt 7440 gatgacggtg aaaacctctg acacatgcag ctcccggaga cggtcacagc ttgtctgtaa 7500 gcggatgccg ggagcagaca agcccgtcag ggcgcgtcag cgggtgttgg cgggtgtcgg 7560 ggctggctta actatgcggc atcagagcag attgtactga gagtgcacca tg 7612 <210> 9 <211> 7609 <212> DNA <213> Artificial Sequence <220> <223> pYB36 vector <220> <221> misc_feature <222> (2)..(1192) <223> FAS_upstream_fragment <220> <221> misc_feature <222> (1200)..(1648) <223> alpha_tubulin_promoter <220> <221> misc_feature <222> (1679)..(2443) <223> paromomycin_resistance_gene <220> <221> misc_feature <222> (2444)..(2717) <223> SV40_terminator <220> <221> misc_feature <222> (2718)..(3780) <223> EF-1_alpha_promoter <220> <221> misc_feature <222> (3797)..(3829) <223> Hammerhead_ribozyme <220> <221> misc_feature <222> (3830)..(3849) <223> carotene_synthase_target_1 <220> <221> misc_feature <222> (3850)..(3929) <223> gRNA <220> <221> misc_feature <222> (3930)..(3997) <223> HDV_ribozyme <220> <221> misc_feature <222> (4017)..(4535) <223> OrfC_terminator <220> <221> misc_feature <222> (4536)..(5368) <223> FAS_downstream_fragment <220> <221> misc_feature <222> (6367)..(7227) <223> AmpR_gene <400> 9 gtttaaactc gcggcgtctt cgccgtcgag gcgcgtcttt cgaggcgggc gaggtgtttt 60 tcttttcttt tcttctcgct gcagctgcgc cgcggcgaac gcagttcgcc gcggcggctg 120 cgacgcgcct gcgatgtcta tgcgcaggca aggcacgacg tcttgcggcg ccgcttcctg 180 cgccgccttg cgtcttggcc ccgccgacga cgcaagcagc ggcggcgccc ccccgccctc 240 ctccactgtg ggccgcagcc ctccttttcg cgcgcccgca gccgcgcggc gcgccccgcg 300 aacaaagagc cgccgcgccg gtccgcactg cgcgggccgc cccgcaagtg ccgcaaacgc 360 cggcccgaac cgccgcaaac gcgcccgcag ccgcgcccgc agccgcgcgc gacccgcggt 420 ggggacgcgc gccaagcgtc cccttccgcg ggatgacgta ggcggcggcc ccgcctatgc 480 aatacgggag gaaccaggaa ccgggagggg gggggggcgg cgcgcgcgcc gtccagtgcg 540 ggaccgatcg gcgccgggat gcccgggcgg gagggacaca gccaggcagt cagtcagtca 600 gccgcacaga gagagcgcgc ctgcgagtcc cgtctggtct cggaattgta tcccgcgcag 660 agctcagaat cgcaggtcga tcgatcgagc gatggatcca tcgctctatc cgtccatcga 720 tccatcgcat ccatcgcatc catcgcatcc atcgttgcat cgcttgcacc gcccgcttgc 780 atcgcgtgcg tgcgcaggcg ggcggcggcc acgacgcgac cgagagcggc ggcgagtgca 840 gacgccgccg gcgcccgcgg ctgcgtcgcc gcaggaagaa ggaggggggc gcgtgtttcc 900 cgcgggaggg aggagggagg gagggaggtg gttgggccaa aaagggcggc ctggacaggc 960 aggcaggccg gaagcgacgc cagcgagcga aggaagagga gagagccgcg cgggcggcca 1020 gcgcggggcg ggcggcagca aacccgctac tcagggtaaa agacagacag ctcttcgagc 1080 gagcttgttc acttcgcgga agcacgcgcg caggcacgca ggcacgcagg cagcatagcg 1140 agcagcagca gcatcgcgag cagcattgcg agaggaggcg ctgaccgccg gccaagcttc 1200 caattttagg ccccccactg accgaggtct gtcgataatc cacttttcca ttgattttcc 1260 aggtttcgtt aactcatgcc actgagcaaa acttcggtct ttcctaacaa aagctctcct 1320 cacaaagcat ggcgcggcaa cggacgtgtc ctcatactcc actgccacac aaggtcgata 1380 aactaagctc ctcacaaata gaggagaatt ccactgacaa ctgaaaacaa tgtatgagag 1440 acgatcacca ctggagcggc gcggcggttg ggcgcggagg tcggcagcaa aaacaagcga 1500 ctcgccgagc aaacccgaat cagccttcag acggtcgtgc ctaacaacac gccgttctac 1560 cccgccttct tcgcgcccct tcgcgtccaa gcatccttca agtttatctc tctagttcaa 1620 cttcaagaag aacaacacca ccaacaccat gattgaacaa gatggattgc acgcaggttc 1680 tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga caatcggctg 1740 ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt ttgtcaagac 1800 cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat cgtggctggc 1860 cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg gaagggactg 1920 gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg ctcctgccga 1980 gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc cggctacctg 2040 cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga tggaagccgg 2100 tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag ccgaactgtt 2160 cgccaggctc aaggcgcgca tgcccgacgg cgatgatctc gtcgtgaccc atggcgatgc 2220 ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg actgtggccg 2280 gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata ttgctgaaga 2340 gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg ctcccgattc 2400 gcagcgcatc gccttctatc gccttcttga cgagttcttc tgacacgtgc tacgagattt 2460 cgattccacc gccgccttct atgaaaggtt gggcttcgga atcgttttcc gggacgccgg 2520 ctggatgatc ctccagcgcg gggatctcat gctggagttc ttcgcccacc ccaacttgtt 2580 tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc 2640 atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt 2700 ctgaattccc ggggtacctc ttatctgcct cgcgccgttg accgccgctt gactcttggc 2760 gcttgccgct cgcatcctgc ctcgctcgcg caggcgggcg ggcgagtggg tgggtccgca 2820 gccttccgcg ctcgcccgct agctcgctcg cgccgtgctg cagccagcag ggcagcaccg 2880 cacggcaggc aggtcccggc gcggatcgat cgatccatcg atccatcgat ccatcgatcg 2940 tgcggtcaaa aagaaaggaa gaagaaagga aaaagaaagg cgtgcgcacc cgagtgcgcg 3000 ctgagcgccc gctcgcggtc ccgcggagcc tccgcgttag tccccgcccc gcgccgcgca 3060 gtcccccggg aggcatcgcg cacctctcgc cgccccctcg cgcctcgccg attccccgcc 3120 tccccttttc cgcttcttcg ccgcctccgc tcgcggccgc gtcgcccgcg ccccgctccc 3180 tatctgctcc ccaggggggc actccgcacc ttttgcgccc gctgccgccg ccgcggccgc 3240 cccgccgccc tggtttcccc cgcgagcgcg gccgcgtcgc cgcgcaaaga ctcgccgcgt 3300 gccgccccga gcaacgggtg gcggcggcgc ggcggcgggc ggggcgcggc ggcgcgtagg 3360 cggggctagg cgccggctag gcgaaacgcc gcccccgggc gccgccgccg cccgctccag 3420 agcagtcgcc gcgccagacc gccaacgcag agaccgagac cgaggtacgt cgcgcccgag 3480 cacgccgcga cgcgcggcag ggacgaggag cacgacgccg cgccgcgccg cgcggggggg 3540 gggagggaga ggcaggacgc gggagcgagc gtgcatgttt ccgcgcgaga cgacgccgcg 3600 cgcgctggag aggagataag gcgcttggat cgcgagaggg ccagccaggc tggaggcgaa 3660 aatgggtgga gaggatagta tcttgcgtgc ttggacgagg agactgacga ggaggacgga 3720 tacgtcgatg atgatgtgca cagagaagaa gcagttcgaa agcgactact agcaagcaag 3780 agatcttccg cactgatgag tccgtgagga cgaaacgagt aagctcgtct gcggacgtcg 3840 tggacgcgcg ttttagagct agaaatagca agttaaaata aggctagtcc gttatcaact 3900 tgaaaaagtg gcaccgagtc ggtgcttttg gccggcatgg tcccagcctc ctcgctggcg 3960 ccggctgggc aacatgcttc ggcatggcga atgggaccat atgagttatg agatccgaaa 4020 gtgaaccttg tcctaacccg acagcgaatg gcgggagggg gcgggctaaa agatcgtatt 4080 acatagtatt tttcccctac tctttgtgtt tgtctttttt ttttttttga acgcattcaa 4140 gccacttgtc tgggtttact tgtttgtttg cttgcttgct tgcttgcttg cctgcttctt 4200 ggtcagacgg cccaaaaaag ggaaaaaatt cattcatggc acagataaga aaaagaaaaa 4260 gtttgtcgac caccgtcatc agaaagcaag agaagagaaa cactcgcgct cacattctcg 4320 ctcgcgtaag aatcttagcc acgcatacga agtaatttgt ccatctggcg aatctttaca 4380 tgagcgtttt caagctggag cgtgagatca tacctttctt gatcgtaatg ttccaacctt 4440 gcataggcct cgttgcgatc cgctagcaat gcgtcgtact cccgttgcaa ctgcgccatc 4500 gcctcattgt gacgtgagtt cagattcttc tcgagaaact cggcacctgc catgaagacg 4560 ccctccaaaa gggctgggtc aaggcctcgc tcggacacag ccagggcgtc gtctccgccg 4620 ccgtcgttgc cgcagccaac accgaccgcg agctgcgcaa cctcgtggtc tccggcctcg 4680 aatacatgtc aaaagtcggc atcgccgccc agcgcacgct cgactacgag ctcggacgcc 4740 gcaacgccgg cccggagacc ccgatgctcg ctgtacaggg aatggacgaa aaagtcctta 4800 ccaaggcctt caaggccgcc gtctcgctct ccaacgagaa gcaggccatg atggccaaaa 4860 tctccccacg gccgccgccg ccaccgccgc cccggccgcc gttagcgacg aagatcgctt 4920 ctccatcgcc ctccgcaacg gccacgacga ctttgtcgtc tgcggcgagc ccaaggacct 4980 gcgcgtcctc cgcaaggtca tcgagaaaca gagcgccgag cccggcaagg aggcacaggc 5040 gcgcacgccc ttttccaagc gcaagcccgt cacccagacc accttcctcc gcatgacggc 5100 cgtcttccac agcgctctca acaaggacgc cctcgcccag atcaacacat gggccccgga 5160 gtccgccttt agcaaggcct tcgcccaggc ctcgctccgt gttcccgtct ttgacaccaa 5220 gtctggcgct aatctgcaag atgttcccgc cgccgatgtt gtcgcccatc ttaccaccaa 5280 catgctcact gagcgcgccg acgttctcgt ctccctccgt gccgccgaga ccaagaccga 5340 cgccagccac ctcctctgtt taaacgttaa cctgcattaa tgaatcggcc aacgcgcggg 5400 gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 5460 ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac 5520 agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 5580 ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 5640 caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 5700 gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 5760 cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta 5820 tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 5880 gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 5940 cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 6000 tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagaa cagtatttgg 6060 tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 6120 caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 6180 aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 6240 cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 6300 ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 6360 tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 6420 atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc 6480 tggccccagt gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc 6540 aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc 6600 catccagtct attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt 6660 gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc 6720 ttcattcagc tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa 6780 aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt 6840 atcactcatg gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg 6900 cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc 6960 gagttgctct tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa 7020 agtgctcatc attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt 7080 gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt 7140 caccagcgtt tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag 7200 ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta 7260 tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat 7320 aggggttccg cgcacatttc cccgaaaagt gccacctgac gtctaagaaa ccattattat 7380 catgacatta acctataaaa ataggcgtat cacgaggccc tttcgtctcg cgcgtttcgg 7440 tgatgacggt gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta 7500 agcggatgcc gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg 7560 gggctggctt aactatgcgg catcagagca gattgtactg agagtgcac 7609 <210> 10 <211> 7609 <212> DNA <213> Artificial Sequence <220> <223> pYB37 vector <220> <221> misc_feature <222> (2)..(1192) <223> FAS_upstream_fragment <220> <221> misc_feature <222> (1200)..(1648) <223> alpha_tubulin_promoter <220> <221> misc_feature <222> (1679)..(2443) <223> paromomycin_resistance_gene <220> <221> misc_feature <222> (2444)..(2717) <223> SV40_terminator <220> <221> misc_feature <222> (2718)..(3780) <223> EF-1_alpha_promoter <220> <221> misc_feature <222> (3797)..(3829) <223> Hammerhead_ribozyme <220> <221> misc_feature <222> (3830)..(3849) <223> carotene_synthase_target_2 <220> <221> misc_feature <222> (3850)..(3929) <223> gRNA <220> <221> misc_feature <222> (3930)..(3997) <223> HDV_ribozyme <220> <221> misc_feature <222> (4017)..(4535) <223> OrfC_terminator <220> <221> misc_feature <222> (4536)..(5368) <223> FAS_downstream_fragment <220> <221> misc_feature <222> (6367)..(7227) <223> AmpR_gene <400> 10 gtttaaactc gcggcgtctt cgccgtcgag gcgcgtcttt cgaggcgggc gaggtgtttt 60 tcttttcttt tcttctcgct gcagctgcgc cgcggcgaac gcagttcgcc gcggcggctg 120 cgacgcgcct gcgatgtcta tgcgcaggca aggcacgacg tcttgcggcg ccgcttcctg 180 cgccgccttg cgtcttggcc ccgccgacga cgcaagcagc ggcggcgccc ccccgccctc 240 ctccactgtg ggccgcagcc ctccttttcg cgcgcccgca gccgcgcggc gcgccccgcg 300 aacaaagagc cgccgcgccg gtccgcactg cgcgggccgc cccgcaagtg ccgcaaacgc 360 cggcccgaac cgccgcaaac gcgcccgcag ccgcgcccgc agccgcgcgc gacccgcggt 420 ggggacgcgc gccaagcgtc cccttccgcg ggatgacgta ggcggcggcc ccgcctatgc 480 aatacgggag gaaccaggaa ccgggagggg gggggggcgg cgcgcgcgcc gtccagtgcg 540 ggaccgatcg gcgccgggat gcccgggcgg gagggacaca gccaggcagt cagtcagtca 600 gccgcacaga gagagcgcgc ctgcgagtcc cgtctggtct cggaattgta tcccgcgcag 660 agctcagaat cgcaggtcga tcgatcgagc gatggatcca tcgctctatc cgtccatcga 720 tccatcgcat ccatcgcatc catcgcatcc atcgttgcat cgcttgcacc gcccgcttgc 780 atcgcgtgcg tgcgcaggcg ggcggcggcc acgacgcgac cgagagcggc ggcgagtgca 840 gacgccgccg gcgcccgcgg ctgcgtcgcc gcaggaagaa ggaggggggc gcgtgtttcc 900 cgcgggaggg aggagggagg gagggaggtg gttgggccaa aaagggcggc ctggacaggc 960 aggcaggccg gaagcgacgc cagcgagcga aggaagagga gagagccgcg cgggcggcca 1020 gcgcggggcg ggcggcagca aacccgctac tcagggtaaa agacagacag ctcttcgagc 1080 gagcttgttc acttcgcgga agcacgcgcg caggcacgca ggcacgcagg cagcatagcg 1140 agcagcagca gcatcgcgag cagcattgcg agaggaggcg ctgaccgccg gccaagcttc 1200 caattttagg ccccccactg accgaggtct gtcgataatc cacttttcca ttgattttcc 1260 aggtttcgtt aactcatgcc actgagcaaa acttcggtct ttcctaacaa aagctctcct 1320 cacaaagcat ggcgcggcaa cggacgtgtc ctcatactcc actgccacac aaggtcgata 1380 aactaagctc ctcacaaata gaggagaatt ccactgacaa ctgaaaacaa tgtatgagag 1440 acgatcacca ctggagcggc gcggcggttg ggcgcggagg tcggcagcaa aaacaagcga 1500 ctcgccgagc aaacccgaat cagccttcag acggtcgtgc ctaacaacac gccgttctac 1560 cccgccttct tcgcgcccct tcgcgtccaa gcatccttca agtttatctc tctagttcaa 1620 cttcaagaag aacaacacca ccaacaccat gattgaacaa gatggattgc acgcaggttc 1680 tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga caatcggctg 1740 ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt ttgtcaagac 1800 cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat cgtggctggc 1860 cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg gaagggactg 1920 gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg ctcctgccga 1980 gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc cggctacctg 2040 cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga tggaagccgg 2100 tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag ccgaactgtt 2160 cgccaggctc aaggcgcgca tgcccgacgg cgatgatctc gtcgtgaccc atggcgatgc 2220 ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg actgtggccg 2280 gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata ttgctgaaga 2340 gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg ctcccgattc 2400 gcagcgcatc gccttctatc gccttcttga cgagttcttc tgacacgtgc tacgagattt 2460 cgattccacc gccgccttct atgaaaggtt gggcttcgga atcgttttcc gggacgccgg 2520 ctggatgatc ctccagcgcg gggatctcat gctggagttc ttcgcccacc ccaacttgtt 2580 tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc 2640 atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt 2700 ctgaattccc ggggtacctc ttatctgcct cgcgccgttg accgccgctt gactcttggc 2760 gcttgccgct cgcatcctgc ctcgctcgcg caggcgggcg ggcgagtggg tgggtccgca 2820 gccttccgcg ctcgcccgct agctcgctcg cgccgtgctg cagccagcag ggcagcaccg 2880 cacggcaggc aggtcccggc gcggatcgat cgatccatcg atccatcgat ccatcgatcg 2940 tgcggtcaaa aagaaaggaa gaagaaagga aaaagaaagg cgtgcgcacc cgagtgcgcg 3000 ctgagcgccc gctcgcggtc ccgcggagcc tccgcgttag tccccgcccc gcgccgcgca 3060 gtcccccggg aggcatcgcg cacctctcgc cgccccctcg cgcctcgccg attccccgcc 3120 tccccttttc cgcttcttcg ccgcctccgc tcgcggccgc gtcgcccgcg ccccgctccc 3180 tatctgctcc ccaggggggc actccgcacc ttttgcgccc gctgccgccg ccgcggccgc 3240 cccgccgccc tggtttcccc cgcgagcgcg gccgcgtcgc cgcgcaaaga ctcgccgcgt 3300 gccgccccga gcaacgggtg gcggcggcgc ggcggcgggc ggggcgcggc ggcgcgtagg 3360 cggggctagg cgccggctag gcgaaacgcc gcccccgggc gccgccgccg cccgctccag 3420 agcagtcgcc gcgccagacc gccaacgcag agaccgagac cgaggtacgt cgcgcccgag 3480 cacgccgcga cgcgcggcag ggacgaggag cacgacgccg cgccgcgccg cgcggggggg 3540 gggagggaga ggcaggacgc gggagcgagc gtgcatgttt ccgcgcgaga cgacgccgcg 3600 cgcgctggag aggagataag gcgcttggat cgcgagaggg ccagccaggc tggaggcgaa 3660 aatgggtgga gaggatagta tcttgcgtgc ttggacgagg agactgacga ggaggacgga 3720 tacgtcgatg atgatgtgca cagagaagaa gcagttcgaa agcgactact agcaagcaag 3780 agatcttccg cactgatgag tccgtgagga cgaaacgagt aagctcgtcg cagaccttaa 3840 gttcgacgcg ttttagagct agaaatagca agttaaaata aggctagtcc gttatcaact 3900 tgaaaaagtg gcaccgagtc ggtgcttttg gccggcatgg tcccagcctc ctcgctggcg 3960 ccggctgggc aacatgcttc ggcatggcga atgggaccat atgagttatg agatccgaaa 4020 gtgaaccttg tcctaacccg acagcgaatg gcgggagggg gcgggctaaa agatcgtatt 4080 acatagtatt tttcccctac tctttgtgtt tgtctttttt ttttttttga acgcattcaa 4140 gccacttgtc tgggtttact tgtttgtttg cttgcttgct tgcttgcttg cctgcttctt 4200 ggtcagacgg cccaaaaaag ggaaaaaatt cattcatggc acagataaga aaaagaaaaa 4260 gtttgtcgac caccgtcatc agaaagcaag agaagagaaa cactcgcgct cacattctcg 4320 ctcgcgtaag aatcttagcc acgcatacga agtaatttgt ccatctggcg aatctttaca 4380 tgagcgtttt caagctggag cgtgagatca tacctttctt gatcgtaatg ttccaacctt 4440 gcataggcct cgttgcgatc cgctagcaat gcgtcgtact cccgttgcaa ctgcgccatc 4500 gcctcattgt gacgtgagtt cagattcttc tcgagaaact cggcacctgc catgaagacg 4560 ccctccaaaa gggctgggtc aaggcctcgc tcggacacag ccagggcgtc gtctccgccg 4620 ccgtcgttgc cgcagccaac accgaccgcg agctgcgcaa cctcgtggtc tccggcctcg 4680 aatacatgtc aaaagtcggc atcgccgccc agcgcacgct cgactacgag ctcggacgcc 4740 gcaacgccgg cccggagacc ccgatgctcg ctgtacaggg aatggacgaa aaagtcctta 4800 ccaaggcctt caaggccgcc gtctcgctct ccaacgagaa gcaggccatg atggccaaaa 4860 tctccccacg gccgccgccg ccaccgccgc cccggccgcc gttagcgacg aagatcgctt 4920 ctccatcgcc ctccgcaacg gccacgacga ctttgtcgtc tgcggcgagc ccaaggacct 4980 gcgcgtcctc cgcaaggtca tcgagaaaca gagcgccgag cccggcaagg aggcacaggc 5040 gcgcacgccc ttttccaagc gcaagcccgt cacccagacc accttcctcc gcatgacggc 5100 cgtcttccac agcgctctca acaaggacgc cctcgcccag atcaacacat gggccccgga 5160 gtccgccttt agcaaggcct tcgcccaggc ctcgctccgt gttcccgtct ttgacaccaa 5220 gtctggcgct aatctgcaag atgttcccgc cgccgatgtt gtcgcccatc ttaccaccaa 5280 catgctcact gagcgcgccg acgttctcgt ctccctccgt gccgccgaga ccaagaccga 5340 cgccagccac ctcctctgtt taaacgttaa cctgcattaa tgaatcggcc aacgcgcggg 5400 gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 5460 ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac 5520 agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 5580 ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 5640 caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 5700 gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 5760 cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta 5820 tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 5880 gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 5940 cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 6000 tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagaa cagtatttgg 6060 tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 6120 caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 6180 aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 6240 cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 6300 ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 6360 tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 6420 atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc 6480 tggccccagt gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc 6540 aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc 6600 catccagtct attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt 6660 gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc 6720 ttcattcagc tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa 6780 aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt 6840 atcactcatg gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg 6900 cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc 6960 gagttgctct tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa 7020 agtgctcatc attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt 7080 gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt 7140 caccagcgtt tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag 7200 ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta 7260 tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat 7320 aggggttccg cgcacatttc cccgaaaagt gccacctgac gtctaagaaa ccattattat 7380 catgacatta acctataaaa ataggcgtat cacgaggccc tttcgtctcg cgcgtttcgg 7440 tgatgacggt gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta 7500 agcggatgcc gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg 7560 gggctggctt aactatgcgg catcagagca gattgtactg agagtgcac 7609 <210> 11 <211> 7609 <212> DNA <213> Artificial Sequence <220> <223> pYB38 vector <220> <221> misc_feature <222> (2)..(1192) <223> FAS_upstream_fragment <220> <221> misc_feature <222> (1200)..(1648) <223> alpha_tubulin_promoter <220> <221> misc_feature <222> (1679)..(2443) <223> paromomycin_resistance_gene <220> <221> misc_feature <222> (2444)..(2717) <223> SV40_terminator <220> <221> misc_feature <222> (2718)..(3780) <223> EF-1_alpha_promoter <220> <221> misc_feature <222> (3797)..(3829) <223> Hammerhead_ribozyme <220> <221> misc_feature <222> (3830)..(3849) <223> carotene_synthase_target_3 <220> <221> misc_feature <222> (3850)..(3929) <223> gRNA <220> <221> misc_feature <222> (3930)..(3997) <223> HDV_ribozyme <220> <221> misc_feature <222> (4017)..(4535) <223> OrfC_terminator <220> <221> misc_feature <222> (4536)..(5368) <223> FAS_downstream_fragment <220> <221> misc_feature <222> (6367)..(7227) <223> AmpR_gene <400> 11 gtttaaactc gcggcgtctt cgccgtcgag gcgcgtcttt cgaggcgggc gaggtgtttt 60 tcttttcttt tcttctcgct gcagctgcgc cgcggcgaac gcagttcgcc gcggcggctg 120 cgacgcgcct gcgatgtcta tgcgcaggca aggcacgacg tcttgcggcg ccgcttcctg 180 cgccgccttg cgtcttggcc ccgccgacga cgcaagcagc ggcggcgccc ccccgccctc 240 ctccactgtg ggccgcagcc ctccttttcg cgcgcccgca gccgcgcggc gcgccccgcg 300 aacaaagagc cgccgcgccg gtccgcactg cgcgggccgc cccgcaagtg ccgcaaacgc 360 cggcccgaac cgccgcaaac gcgcccgcag ccgcgcccgc agccgcgcgc gacccgcggt 420 ggggacgcgc gccaagcgtc cccttccgcg ggatgacgta ggcggcggcc ccgcctatgc 480 aatacgggag gaaccaggaa ccgggagggg gggggggcgg cgcgcgcgcc gtccagtgcg 540 ggaccgatcg gcgccgggat gcccgggcgg gagggacaca gccaggcagt cagtcagtca 600 gccgcacaga gagagcgcgc ctgcgagtcc cgtctggtct cggaattgta tcccgcgcag 660 agctcagaat cgcaggtcga tcgatcgagc gatggatcca tcgctctatc cgtccatcga 720 tccatcgcat ccatcgcatc catcgcatcc atcgttgcat cgcttgcacc gcccgcttgc 780 atcgcgtgcg tgcgcaggcg ggcggcggcc acgacgcgac cgagagcggc ggcgagtgca 840 gacgccgccg gcgcccgcgg ctgcgtcgcc gcaggaagaa ggaggggggc gcgtgtttcc 900 cgcgggaggg aggagggagg gagggaggtg gttgggccaa aaagggcggc ctggacaggc 960 aggcaggccg gaagcgacgc cagcgagcga aggaagagga gagagccgcg cgggcggcca 1020 gcgcggggcg ggcggcagca aacccgctac tcagggtaaa agacagacag ctcttcgagc 1080 gagcttgttc acttcgcgga agcacgcgcg caggcacgca ggcacgcagg cagcatagcg 1140 agcagcagca gcatcgcgag cagcattgcg agaggaggcg ctgaccgccg gccaagcttc 1200 caattttagg ccccccactg accgaggtct gtcgataatc cacttttcca ttgattttcc 1260 aggtttcgtt aactcatgcc actgagcaaa acttcggtct ttcctaacaa aagctctcct 1320 cacaaagcat ggcgcggcaa cggacgtgtc ctcatactcc actgccacac aaggtcgata 1380 aactaagctc ctcacaaata gaggagaatt ccactgacaa ctgaaaacaa tgtatgagag 1440 acgatcacca ctggagcggc gcggcggttg ggcgcggagg tcggcagcaa aaacaagcga 1500 ctcgccgagc aaacccgaat cagccttcag acggtcgtgc ctaacaacac gccgttctac 1560 cccgccttct tcgcgcccct tcgcgtccaa gcatccttca agtttatctc tctagttcaa 1620 cttcaagaag aacaacacca ccaacaccat gattgaacaa gatggattgc acgcaggttc 1680 tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga caatcggctg 1740 ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt ttgtcaagac 1800 cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat cgtggctggc 1860 cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg gaagggactg 1920 gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg ctcctgccga 1980 gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc cggctacctg 2040 cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga tggaagccgg 2100 tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag ccgaactgtt 2160 cgccaggctc aaggcgcgca tgcccgacgg cgatgatctc gtcgtgaccc atggcgatgc 2220 ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg actgtggccg 2280 gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata ttgctgaaga 2340 gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg ctcccgattc 2400 gcagcgcatc gccttctatc gccttcttga cgagttcttc tgacacgtgc tacgagattt 2460 cgattccacc gccgccttct atgaaaggtt gggcttcgga atcgttttcc gggacgccgg 2520 ctggatgatc ctccagcgcg gggatctcat gctggagttc ttcgcccacc ccaacttgtt 2580 tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc 2640 atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt 2700 ctgaattccc ggggtacctc ttatctgcct cgcgccgttg accgccgctt gactcttggc 2760 gcttgccgct cgcatcctgc ctcgctcgcg caggcgggcg ggcgagtggg tgggtccgca 2820 gccttccgcg ctcgcccgct agctcgctcg cgccgtgctg cagccagcag ggcagcaccg 2880 cacggcaggc aggtcccggc gcggatcgat cgatccatcg atccatcgat ccatcgatcg 2940 tgcggtcaaa aagaaaggaa gaagaaagga aaaagaaagg cgtgcgcacc cgagtgcgcg 3000 ctgagcgccc gctcgcggtc ccgcggagcc tccgcgttag tccccgcccc gcgccgcgca 3060 gtcccccggg aggcatcgcg cacctctcgc cgccccctcg cgcctcgccg attccccgcc 3120 tccccttttc cgcttcttcg ccgcctccgc tcgcggccgc gtcgcccgcg ccccgctccc 3180 tatctgctcc ccaggggggc actccgcacc ttttgcgccc gctgccgccg ccgcggccgc 3240 cccgccgccc tggtttcccc cgcgagcgcg gccgcgtcgc cgcgcaaaga ctcgccgcgt 3300 gccgccccga gcaacgggtg gcggcggcgc ggcggcgggc ggggcgcggc ggcgcgtagg 3360 cggggctagg cgccggctag gcgaaacgcc gcccccgggc gccgccgccg cccgctccag 3420 agcagtcgcc gcgccagacc gccaacgcag agaccgagac cgaggtacgt cgcgcccgag 3480 cacgccgcga cgcgcggcag ggacgaggag cacgacgccg cgccgcgccg cgcggggggg 3540 gggagggaga ggcaggacgc gggagcgagc gtgcatgttt ccgcgcgaga cgacgccgcg 3600 cgcgctggag aggagataag gcgcttggat cgcgagaggg ccagccaggc tggaggcgaa 3660 aatgggtgga gaggatagta tcttgcgtgc ttggacgagg agactgacga ggaggacgga 3720 tacgtcgatg atgatgtgca cagagaagaa gcagttcgaa agcgactact agcaagcaag 3780 agatcttccg cactgatgag tccgtgagga cgaaacgagt aagctcgtcc cgactcgtcg 3840 accgtctcag ttttagagct agaaatagca agttaaaata aggctagtcc gttatcaact 3900 tgaaaaagtg gcaccgagtc ggtgcttttg gccggcatgg tcccagcctc ctcgctggcg 3960 ccggctgggc aacatgcttc ggcatggcga atgggaccat atgagttatg agatccgaaa 4020 gtgaaccttg tcctaacccg acagcgaatg gcgggagggg gcgggctaaa agatcgtatt 4080 acatagtatt tttcccctac tctttgtgtt tgtctttttt ttttttttga acgcattcaa 4140 gccacttgtc tgggtttact tgtttgtttg cttgcttgct tgcttgcttg cctgcttctt 4200 ggtcagacgg cccaaaaaag ggaaaaaatt cattcatggc acagataaga aaaagaaaaa 4260 gtttgtcgac caccgtcatc agaaagcaag agaagagaaa cactcgcgct cacattctcg 4320 ctcgcgtaag aatcttagcc acgcatacga agtaatttgt ccatctggcg aatctttaca 4380 tgagcgtttt caagctggag cgtgagatca tacctttctt gatcgtaatg ttccaacctt 4440 gcataggcct cgttgcgatc cgctagcaat gcgtcgtact cccgttgcaa ctgcgccatc 4500 gcctcattgt gacgtgagtt cagattcttc tcgagaaact cggcacctgc catgaagacg 4560 ccctccaaaa gggctgggtc aaggcctcgc tcggacacag ccagggcgtc gtctccgccg 4620 ccgtcgttgc cgcagccaac accgaccgcg agctgcgcaa cctcgtggtc tccggcctcg 4680 aatacatgtc aaaagtcggc atcgccgccc agcgcacgct cgactacgag ctcggacgcc 4740 gcaacgccgg cccggagacc ccgatgctcg ctgtacaggg aatggacgaa aaagtcctta 4800 ccaaggcctt caaggccgcc gtctcgctct ccaacgagaa gcaggccatg atggccaaaa 4860 tctccccacg gccgccgccg ccaccgccgc cccggccgcc gttagcgacg aagatcgctt 4920 ctccatcgcc ctccgcaacg gccacgacga ctttgtcgtc tgcggcgagc ccaaggacct 4980 gcgcgtcctc cgcaaggtca tcgagaaaca gagcgccgag cccggcaagg aggcacaggc 5040 gcgcacgccc ttttccaagc gcaagcccgt cacccagacc accttcctcc gcatgacggc 5100 cgtcttccac agcgctctca acaaggacgc cctcgcccag atcaacacat gggccccgga 5160 gtccgccttt agcaaggcct tcgcccaggc ctcgctccgt gttcccgtct ttgacaccaa 5220 gtctggcgct aatctgcaag atgttcccgc cgccgatgtt gtcgcccatc ttaccaccaa 5280 catgctcact gagcgcgccg acgttctcgt ctccctccgt gccgccgaga ccaagaccga 5340 cgccagccac ctcctctgtt taaacgttaa cctgcattaa tgaatcggcc aacgcgcggg 5400 gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 5460 ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac 5520 agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 5580 ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 5640 caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 5700 gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 5760 cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta 5820 tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 5880 gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 5940 cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 6000 tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagaa cagtatttgg 6060 tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 6120 caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 6180 aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 6240 cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 6300 ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 6360 tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 6420 atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc 6480 tggccccagt gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc 6540 aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc 6600 catccagtct attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt 6660 gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc 6720 ttcattcagc tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa 6780 aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt 6840 atcactcatg gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg 6900 cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc 6960 gagttgctct tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa 7020 agtgctcatc attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt 7080 gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt 7140 caccagcgtt tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag 7200 ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta 7260 tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat 7320 aggggttccg cgcacatttc cccgaaaagt gccacctgac gtctaagaaa ccattattat 7380 catgacatta acctataaaa ataggcgtat cacgaggccc tttcgtctcg cgcgtttcgg 7440 tgatgacggt gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta 7500 agcggatgcc gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg 7560 gggctggctt aactatgcgg catcagagca gattgtactg agagtgcac 7609 <210> 12 <211> 7608 <212> DNA <213> Artificial Sequence <220> <223> pYB39 vector <220> <221> misc_feature <222> (2)..(1192) <223> FAS_upstream_fragment <220> <221> misc_feature <222> (1200)..(1648) <223> alpha_tubulin_promoter <220> <221> misc_feature <222> (1679)..(2443) <223> paromomycin_resistance_gene <220> <221> misc_feature <222> (2444)..(2717) <223> SV40_terminator <220> <221> misc_feature <222> (2718)..(3780) <223> EF-1_alpha_promoter <220> <221> misc_feature <222> (3792)..(3828) <223> Hammerhead_ribozyme <220> <221> misc_feature <222> (3829)..(3848) <223> carotene_synthase_target_4 <220> <221> misc_feature <222> (3848)..(3928) <223> gRNA <220> <221> misc_feature <222> (3929)..(3996) <223> HDV_ribozyme <220> <221> misc_feature <222> (4016)..(4534) <223> OrfC_terminator <220> <221> misc_feature <222> (4535)..(5367) <223> FAS_downstream_fragment <220> <221> misc_feature <222> (6366)..(7226) <223> AmpR_gene <400> 12 gtttaaactc gcggcgtctt cgccgtcgag gcgcgtcttt cgaggcgggc gaggtgtttt 60 tcttttcttt tcttctcgct gcagctgcgc cgcggcgaac gcagttcgcc gcggcggctg 120 cgacgcgcct gcgatgtcta tgcgcaggca aggcacgacg tcttgcggcg ccgcttcctg 180 cgccgccttg cgtcttggcc ccgccgacga cgcaagcagc ggcggcgccc ccccgccctc 240 ctccactgtg ggccgcagcc ctccttttcg cgcgcccgca gccgcgcggc gcgccccgcg 300 aacaaagagc cgccgcgccg gtccgcactg cgcgggccgc cccgcaagtg ccgcaaacgc 360 cggcccgaac cgccgcaaac gcgcccgcag ccgcgcccgc agccgcgcgc gacccgcggt 420 ggggacgcgc gccaagcgtc cccttccgcg ggatgacgta ggcggcggcc ccgcctatgc 480 aatacgggag gaaccaggaa ccgggagggg gggggggcgg cgcgcgcgcc gtccagtgcg 540 ggaccgatcg gcgccgggat gcccgggcgg gagggacaca gccaggcagt cagtcagtca 600 gccgcacaga gagagcgcgc ctgcgagtcc cgtctggtct cggaattgta tcccgcgcag 660 agctcagaat cgcaggtcga tcgatcgagc gatggatcca tcgctctatc cgtccatcga 720 tccatcgcat ccatcgcatc catcgcatcc atcgttgcat cgcttgcacc gcccgcttgc 780 atcgcgtgcg tgcgcaggcg ggcggcggcc acgacgcgac cgagagcggc ggcgagtgca 840 gacgccgccg gcgcccgcgg ctgcgtcgcc gcaggaagaa ggaggggggc gcgtgtttcc 900 cgcgggaggg aggagggagg gagggaggtg gttgggccaa aaagggcggc ctggacaggc 960 aggcaggccg gaagcgacgc cagcgagcga aggaagagga gagagccgcg cgggcggcca 1020 gcgcggggcg ggcggcagca aacccgctac tcagggtaaa agacagacag ctcttcgagc 1080 gagcttgttc acttcgcgga agcacgcgcg caggcacgca ggcacgcagg cagcatagcg 1140 agcagcagca gcatcgcgag cagcattgcg agaggaggcg ctgaccgccg gccaagcttc 1200 caattttagg ccccccactg accgaggtct gtcgataatc cacttttcca ttgattttcc 1260 aggtttcgtt aactcatgcc actgagcaaa acttcggtct ttcctaacaa aagctctcct 1320 cacaaagcat ggcgcggcaa cggacgtgtc ctcatactcc actgccacac aaggtcgata 1380 aactaagctc ctcacaaata gaggagaatt ccactgacaa ctgaaaacaa tgtatgagag 1440 acgatcacca ctggagcggc gcggcggttg ggcgcggagg tcggcagcaa aaacaagcga 1500 ctcgccgagc aaacccgaat cagccttcag acggtcgtgc ctaacaacac gccgttctac 1560 cccgccttct tcgcgcccct tcgcgtccaa gcatccttca agtttatctc tctagttcaa 1620 cttcaagaag aacaacacca ccaacaccat gattgaacaa gatggattgc acgcaggttc 1680 tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga caatcggctg 1740 ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt ttgtcaagac 1800 cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat cgtggctggc 1860 cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg gaagggactg 1920 gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg ctcctgccga 1980 gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc cggctacctg 2040 cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga tggaagccgg 2100 tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag ccgaactgtt 2160 cgccaggctc aaggcgcgca tgcccgacgg cgatgatctc gtcgtgaccc atggcgatgc 2220 ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg actgtggccg 2280 gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata ttgctgaaga 2340 gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg ctcccgattc 2400 gcagcgcatc gccttctatc gccttcttga cgagttcttc tgacacgtgc tacgagattt 2460 cgattccacc gccgccttct atgaaaggtt gggcttcgga atcgttttcc gggacgccgg 2520 ctggatgatc ctccagcgcg gggatctcat gctggagttc ttcgcccacc ccaacttgtt 2580 tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc 2640 atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt 2700 ctgaattccc ggggtacctc ttatctgcct cgcgccgttg accgccgctt gactcttggc 2760 gcttgccgct cgcatcctgc ctcgctcgcg caggcgggcg ggcgagtggg tgggtccgca 2820 gccttccgcg ctcgcccgct agctcgctcg cgccgtgctg cagccagcag ggcagcaccg 2880 cacggcaggc aggtcccggc gcggatcgat cgatccatcg atccatcgat ccatcgatcg 2940 tgcggtcaaa aagaaaggaa gaagaaagga aaaagaaagg cgtgcgcacc cgagtgcgcg 3000 ctgagcgccc gctcgcggtc ccgcggagcc tccgcgttag tccccgcccc gcgccgcgca 3060 gtcccccggg aggcatcgcg cacctctcgc cgccccctcg cgcctcgccg attccccgcc 3120 tccccttttc cgcttcttcg ccgcctccgc tcgcggccgc gtcgcccgcg ccccgctccc 3180 tatctgctcc ccaggggggc actccgcacc ttttgcgccc gctgccgccg ccgcggccgc 3240 cccgccgccc tggtttcccc cgcgagcgcg gccgcgtcgc cgcgcaaaga ctcgccgcgt 3300 gccgccccga gcaacgggtg gcggcggcgc ggcggcgggc ggggcgcggc ggcgcgtagg 3360 cggggctagg cgccggctag gcgaaacgcc gcccccgggc gccgccgccg cccgctccag 3420 agcagtcgcc gcgccagacc gccaacgcag agaccgagac cgaggtacgt cgcgcccgag 3480 cacgccgcga cgcgcggcag ggacgaggag cacgacgccg cgccgcgccg cgcggggggg 3540 gggagggaga ggcaggacgc gggagcgagc gtgcatgttt ccgcgcgaga cgacgccgcg 3600 cgcgctggag aggagataag gcgcttggat cgcgagaggg ccagccaggc tggaggcgaa 3660 aatgggtgga gaggatagta tcttgcgtgc ttggacgagg agactgacga ggaggacgga 3720 tacgtcgatg atgatgtgca cagagaagaa gcagttcgaa agcgactact agcaagcaag 3780 agatcttctt cctgatgagt ccgtgaggac gaaacgagta agctcgtcga agaacatgta 3840 ctcttcaagt tttagagcta gaaatagcaa gttaaaataa ggctagtccg ttatcaactt 3900 gaaaaagtgg caccgagtcg gtgcttttgg ccggcatggt cccagcctcc tcgctggcgc 3960 cggctgggca acatgcttcg gcatggcgaa tgggaccata tgagttatga gatccgaaag 4020 tgaaccttgt cctaacccga cagcgaatgg cgggaggggg cgggctaaaa gatcgtatta 4080 catagtattt ttcccctact ctttgtgttt gtcttttttt tttttttgaa cgcattcaag 4140 ccacttgtct gggtttactt gtttgtttgc ttgcttgctt gcttgcttgc ctgcttcttg 4200 gtcagacggc ccaaaaaagg gaaaaaattc attcatggca cagataagaa aaagaaaaag 4260 tttgtcgacc accgtcatca gaaagcaaga gaagagaaac actcgcgctc acattctcgc 4320 tcgcgtaaga atcttagcca cgcatacgaa gtaatttgtc catctggcga atctttacat 4380 gagcgttttc aagctggagc gtgagatcat acctttcttg atcgtaatgt tccaaccttg 4440 cataggcctc gttgcgatcc gctagcaatg cgtcgtactc ccgttgcaac tgcgccatcg 4500 cctcattgtg acgtgagttc agattcttct cgagaaactc ggcacctgcc atgaagacgc 4560 cctccaaaag ggctgggtca aggcctcgct cggacacagc cagggcgtcg tctccgccgc 4620 cgtcgttgcc gcagccaaca ccgaccgcga gctgcgcaac ctcgtggtct ccggcctcga 4680 atacatgtca aaagtcggca tcgccgccca gcgcacgctc gactacgagc tcggacgccg 4740 caacgccggc ccggagaccc cgatgctcgc tgtacaggga atggacgaaa aagtccttac 4800 caaggccttc aaggccgccg tctcgctctc caacgagaag caggccatga tggccaaaat 4860 ctccccacgg ccgccgccgc caccgccgcc ccggccgccg ttagcgacga agatcgcttc 4920 tccatcgccc tccgcaacgg ccacgacgac tttgtcgtct gcggcgagcc caaggacctg 4980 cgcgtcctcc gcaaggtcat cgagaaacag agcgccgagc ccggcaagga ggcacaggcg 5040 cgcacgccct tttccaagcg caagcccgtc acccagacca ccttcctccg catgacggcc 5100 gtcttccaca gcgctctcaa caaggacgcc ctcgcccaga tcaacacatg ggccccggag 5160 tccgccttta gcaaggcctt cgcccaggcc tcgctccgtg ttcccgtctt tgacaccaag 5220 tctggcgcta atctgcaaga tgttcccgcc gccgatgttg tcgcccatct taccaccaac 5280 atgctcactg agcgcgccga cgttctcgtc tccctccgtg ccgccgagac caagaccgac 5340 gccagccacc tcctctgttt aaacgttaac ctgcattaat gaatcggcca acgcgcgggg 5400 agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg 5460 gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca 5520 gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac 5580 cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac 5640 aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg 5700 tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac 5760 ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat 5820 ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag 5880 cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac 5940 ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt 6000 gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaagaac agtatttggt 6060 atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc 6120 aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga 6180 aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac 6240 gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc 6300 cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct 6360 gacagttacc aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca 6420 tccatagttg cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct 6480 ggccccagtg ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca 6540 ataaaccagc cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc 6600 atccagtcta ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg 6660 cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct 6720 tcattcagct ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa 6780 aaagcggtta gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta 6840 tcactcatgg ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc 6900 ttttctgtga ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg 6960 agttgctctt gcccggcgtc aatacgggat aataccgcgc cacatagcag aactttaaaa 7020 gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg 7080 agatccagtt cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc 7140 accagcgttt ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg 7200 gcgacacgga aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat 7260 cagggttatt gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata 7320 ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg tctaagaaac cattattatc 7380 atgacattaa cctataaaaa taggcgtatc acgaggccct ttcgtctcgc gcgtttcggt 7440 gatgacggtg aaaacctctg acacatgcag ctcccggaga cggtcacagc ttgtctgtaa 7500 gcggatgccg ggagcagaca agcccgtcag ggcgcgtcag cgggtgttgg cgggtgtcgg 7560 ggctggctta actatgcggc atcagagcag attgtactga gagtgcac 7608 <210> 13 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> 12 Tub seq F primer <400> 13 ggatctcatg ctggagttct tc 22 <210> 14 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> pYB32/3C R1 primer <400> 14 gtacttctcg tggtaggcaa cc 22 <210> 15 <211> 40 <212> DNA <213> Artificial Sequence <220> <223> CS pro Kpn IF F1 primer <400> 15 gtctgaattc ccggggtacc gagcgggcga ttccaccgtc 40 <210> 16 <211> 42 <212> DNA <213> Artificial Sequence <220> <223> CS pro BamH IF R1 primer <400> 16 gtacttctta tccatggatc cctcggtctc cgagcgagcg ag 42 <210> 17 <211> 41 <212> DNA <213> Artificial Sequence <220> <223> CS pro BamH IF F2 primer <400> 17 tcgctcgctc ggagaccgag ggatccatgg ataagaagta c 41 <210> 18 <211> 51 <212> DNA <213> Artificial Sequence <220> <223> CS pro Nde IF R2 primer <400> 18 gattcactag tttagatcat atgttagacc ttgcgcttct tcttagggtc c 51 <210> 19 <211> 19 <212> DNA <213> Artificial Sequence <220> <223> O A1-KO F primer <400> 19 ccaagttcgc caaggcttc 19 <210> 20 <211> 23 <212> DNA <213> Artificial Sequence <220> <223> pYB32/3 SV40 R1 primer <400> 20 gtggaatcga aatctcgtag cac 23 <210> 21 <211> 21 <212> DNA <213> Artificial Sequence <220> <223> O A1-KO R primer <400> 21 gctgttgcaa ctttgctcca c 21 <210> 22 <211> 23 <212> DNA <213> Artificial Sequence <220> <223> pYB32/3C F1 primer <400> 22 gttaagaaga ccgaggtcca gac 23 <210> 23 <211> 32 <212> DNA <213> Artificial Sequence <220> <223> 5' FAS PmeNde primer <400> 23 tagcatatgt ttaaactcgc ggcgtctttc gc 32 <210> 24 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> 3' FAS PmeHpa primer <400> 24 agttaacgtt taaacagagg aggtggctgg c 31 <210> 25 <211> 38 <212> DNA <213> Artificial Sequence <220> <223> pCL402 IF F primer <400> 25 gaggcgctga ccgccggcca agcttccaat tttaggcc 38 <210> 26 <211> 40 <212> DNA <213> Artificial Sequence <220> <223> pCL402 IF R primer <400> 26 gcaggtgccg agtttctcga gaagaatctg aactcacgtc 40 <210> 27 <211> 21 <212> DNA <213> Artificial Sequence <220> <223> pYB36 CS1 F primer <400> 27 gagtcgaagg agacgttgtc g 21 <210> 28 <211> 23 <212> DNA <213> Artificial Sequence <220> <223> pYB36 CS1 R primer <400> 28 gtcattgcga atgatgcgat atg 23 <210> 29 <211> 23 <212> DNA <213> Artificial Sequence <220> <223> pYB36 CS3 R primer <400> 29 ggtcatcatg gaatacaacg cag 23 <210> 30 <211> 26 <212> DNA <213> Artificial Sequence <220> <223> pYB36 CS4 F primer <400> 30 cgagctcatt tgtgctacac tctatg 26 <210> 31 <211> 23 <212> DNA <213> Artificial Sequence <220> <223> pYB36 CS4 R primer <400> 31 cacaagattt gcaggattga tgc 23 <210> 32 <211> 5562 <212> DNA <213> Artificial Sequence <220> <223> pYB30 vector (5562 bp) <220> <221> misc_feature <222> (2)..(450) <223> Alpha_Tubulin_promoter <220> <221> misc_feature <222> (457)..(549) <223> Sec1_secretion_sequence <220> <221> misc_feature <222> (550)..(1287) <223> eGFP <220> <221> misc_feature <222> (1307)..(1946) <223> OrfC_terminator <220> <221> misc_feature <222> (4481)..(4929) <223> Alpha_tubulin_promoter <220> <221> misc_feature <222> (4930)..(5304) <223> Sh_ble_gene <220> <221> misc_feature <222> (5305)..(5562) <223> SV40_terminator <400> 32 cccaatttta ggccccccac tgaccgaggt ctgtcgataa tccacttttc cattgatttt 60 ccaggtttcg ttaactcatg ccactgagca aaacttcggt ctttcctaac aaaagctctc 120 ctcacaaagc atggcgcggc aacggacgtg tcctcatact ccactgccac acaaggtcga 180 taaactaagc tcctcacaaa tagaggagaa ttccactgac aactgaaaac aatgtatgag 240 agacgatcac cactggagcg gcgcggcggt tgggcgcgga ggtcggcagc aaaaacaagc 300 gactcgccga gcaaacccga atcagccttc agacggtcgt gcctaacaac acgccgttct 360 accccgcctt cttcgcgccc cttcgcgtcc aagcatcctt caagtttatc tctctagttc 420 aacttcaaga agaacaacac caccaacacc ggatccatga agttcgcgac ctcggtcgca 480 attttgcttg tggccaacat agccaccgcc ctcgcgcaga gcgatggctg cacccccacc 540 gaccagacga tggtgagcaa gggcgaggag ctgttcaccg gggtggtgcc catcctggtc 600 gagctggacg gcgacgtaaa cggccacaag ttcagcgtgt ccggcgaggg cgagggcgat 660 gccacctacg gcaagctgac cctgaagttc atctgcacca ccggcaagct gcccgtgccc 720 tggcccaccc tcgtgaccac cctgacctac ggcgtgcagt gcttcagccg ctaccccgac 780 cacatgaagc agcacgactt cttcaagtcc gccatgcccg aaggctacgt ccaggagcgc 840 accatcttct tcaaggacga cggcaactac aagacccgcg ccgaggtgaa gttcgagggc 900 gacaccctgg tgaaccgcat cgagctgaag ggcatcgact tcaaggagga cggcaacatc 960 ctgggacaca agctggagta caactacaac agccacaacg tctatatcat ggccgacaag 1020 cagaagaacg gcatcaaggt gaacttcaag atccgccaca acatcgagga cggcagcgtg 1080 cagctcgccg accactacca gcagaacacc cccatcggcg acggccccgt gctgctgccc 1140 gacaaccact acctgagcac ccagtccgcc ctgagcaaag accccaacga gaagcgcgat 1200 cacatggtcc tgctggagtt cgtgaccgcc gccgggatca ctctcggcat ggacgagctg 1260 tacaagcacc accatcacca ccactaacat atgagttatg agatccgaaa gtgaaccttg 1320 tcctaacccg acagcgaatg gcgggagggg gcgggctaaa agatcgtatt acatagtatt 1380 tttcccctac tctttgtgtt tgtctttttt ttttttttga acgcattcaa gccacttgtc 1440 tgggtttact tgtttgtttg cttgcttgct tgcttgcttg cctgcttctt ggtcagacgg 1500 cccaaaaaag ggaaaaaatt cattcatggc acagataaga aaaagaaaaa gtttgtcgac 1560 caccgtcatc agaaagcaag agaagagaaa cactcgcgct cacattctcg ctcgcgtaag 1620 aatcttagcc acgcatacga agtaatttgt ccatctggcg aatctttaca tgagcgtttt 1680 caagctggag cgtgagatca tacctttctt gatcgtaatg ttccaacctt gcataggcct 1740 cgttgcgatc cgctagcaat gcgtcgtact cccgttgcaa ctgcgccatc gcctcattgt 1800 gacgtgagtt cagattcttc tcgagacctt cgagcgctgc taatttcgcc tgacgctcct 1860 tcttttgtgc ttccatgaca cgccgcttca ccgtgcgttc cacttcttcc tcagacatgc 1920 ccttggctgc ctcgacctgc tcggtaaaac gggccccagc acgtgctacg agatttcgat 1980 tccaccgccg ccttctatga aaggttgggc ttcggaatcg ttttccggga cgccggctgg 2040 atgatcctcc agcgcgggga tctcatgctg gagttcttcg cccaccccaa cttgtttatt 2100 gcagcttata atggttacaa ataaagcaat agcatcacaa atttcacaaa taaagcattt 2160 ttttcactgc attctagttg tggtttgtcc aaactcatca atgtatctta tcatacatgg 2220 tcgacctgca ggaacctgca ttaatgaatc ggccaacgcg cggggagagg cggtttgcgt 2280 attgggcgct cttccgcttc ctcgctcact gactcgctgc gctcggtcgt tcggctgcgg 2340 cgagcggtat cagctcactc aaaggcggta atacggttat ccacagaatc aggggataac 2400 gcaggaaaga acatgtgagc aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg 2460 ttgctggcgt ttttccatag gctccgcccc cctgacgagc atcacaaaaa tcgacgctca 2520 agtcagaggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc ccctggaagc 2580 tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg gatacctgtc cgcctttctc 2640 ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta ggtatctcag ttcggtgtag 2700 gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc 2760 ttatccggta actatcgtct tgagtccaac ccggtaagac acgacttatc gccactggca 2820 gcagccactg gtaacaggat tagcagagcg aggtatgtag gcggtgctac agagttcttg 2880 aagtggtggc ctaactacgg ctacactaga agaacagtat ttggtatctg cgctctgctg 2940 aagccagtta ccttcggaaa aagagttggt agctcttgat ccggcaaaca aaccaccgct 3000 ggtagcggtg gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa 3060 gaagatcctt tgatcttttc tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa 3120 gggattttgg tcatgagatt atcaaaaagg atcttcacct agatcctttt aaattaaaaa 3180 tgaagtttta aatcaatcta aagtatatat gagtaaactt ggtctgacag ttaccaatgc 3240 ttaatcagtg aggcacctat ctcagcgatc tgtctatttc gttcatccat agttgcctga 3300 ctccccgtcg tgtagataac tacgatacgg gagggcttac catctggccc cagtgctgca 3360 atgataccgc gagacccacg ctcaccggct ccagatttat cagcaataaa ccagccagcc 3420 ggaagggccg agcgcagaag tggtcctgca actttatccg cctccatcca gtctattaat 3480 tgttgccggg aagctagagt aagtagttcg ccagttaata gtttgcgcaa cgttgttgcc 3540 attgctacag gcatcgtggt gtcacgctcg tcgtttggta tggcttcatt cagctccggt 3600 tcccaacgat caaggcgagt tacatgatcc cccatgttgt gcaaaaaagc ggttagctcc 3660 ttcggtcctc cgatcgttgt cagaagtaag ttggccgcag tgttatcact catggttatg 3720 gcagcactgc ataattctct tactgtcatg ccatccgtaa gatgcttttc tgtgactggt 3780 gagtactcaa ccaagtcatt ctgagaatag tgtatgcggc gaccgagttg ctcttgcccg 3840 gcgtcaatac gggataatac cgcgccacat agcagaactt taaaagtgct catcattgga 3900 aaacgttctt cggggcgaaa actctcaagg atcttaccgc tgttgagatc cagttcgatg 3960 taacccactc gtgcacccaa ctgatcttca gcatctttta ctttcaccag cgtttctggg 4020 tgagcaaaaa caggaaggca aaatgccgca aaaaagggaa taagggcgac acggaaatgt 4080 tgaatactca tactcttcct ttttcaatat tattgaagca tttatcaggg ttattgtctc 4140 atgagcggat acatatttga atgtatttag aaaaataaac aaataggggt tccgcgcaca 4200 tttccccgaa aagtgccacc tgacgtctaa gaaaccatta ttatcatgac attaacctat 4260 aaaaataggc gtatcacgag gccctttcgt ctcgcgcgtt tcggtgatga cggtgaaaac 4320 ctctgacaca tgcagctccc ggagacggtc acagcttgtc tgtaagcgga tgccgggagc 4380 agacaagccc gtcagggcgc gtcagcgggt gttggcgggt gtcggggctg gcttaactat 4440 gcggcatcag agcagattgt actgagagtg caccaagctt ccaattttag gccccccact 4500 gaccgaggtc tgtcgataat ccacttttcc attgattttc caggtttcgt taactcatgc 4560 cactgagcaa aacttcggtc tttcctaaca aaagctctcc tcacaaagca tggcgcggca 4620 acggacgtgt cctcatactc cactgccaca caaggtcgat aaactaagct cctcacaaat 4680 agaggagaat tccactgaca actgaaaaca atgtatgaga gacgatcacc actggagcgg 4740 cgcggcggtt gggcgcggag gtcggcagca aaaacaagcg actcgccgag caaacccgaa 4800 tcagccttca gacggtcgtg cctaacaaca cgccgttcta ccccgccttc ttcgcgcccc 4860 ttcgcgtcca agcatccttc aagtttatct ctctagttca acttcaagaa gaacaacacc 4920 accaacacca tggccaagtt gaccagtgcc gttccggtgc tcaccgcgcg cgacgtcgcc 4980 ggagcggtcg agttctggac cgaccggctc gggttctccc gggacttcgt ggaggacgac 5040 ttcgccggtg tggtccggga cgacgtgacc ctgttcatca gcgcggtcca ggaccaggtg 5100 gtgccggaca acaccctggc ctgggtgtgg gtgcgcggcc tggacgagct gtacgccgag 5160 tggtcggagg tcgtgtccac gaacttccgg gacgcctccg ggccggccat gaccgagatc 5220 ggcgagcagc cgtgggggcg ggagttcgcc ctgcgcgacc cggccggcaa ctgcgtgcac 5280 ttcgtggccg aggagcagga ctgacacgtg ctacgagatt tcgattccac cgccgccttc 5340 tatgaaaggt tgggcttcgg aatcgttttc cgggacgccg gctggatgat cctccagcgc 5400 ggggatctca tgctggagtt cttcgcccac cccaacttgt ttattgcagc ttataatggt 5460 tacaaataaa gcaatagcat cacaaatttc acaaataaag catttttttc actgcattct 5520 agttgtggtt tgtccaaact catcaatgta tcttatcggt ac 5562 <210> 33 <211> 8887 <212> DNA <213> Artificial Sequence <220> <223> pYB61 vector (8887 bp) <220> <221> misc_feature <222> (2)..(450) <223> Alpha_tubulin_promoter <220> <221> misc_feature <222> (457)..(4596) <223> Cas9 <220> <221> misc_feature <222> (4616)..(5255) <223> OrfC_terminator <220> <221> misc_feature <222> (6540)..(7400) <223> Ampicillin_resistance_gene <220> <221> misc_feature <222> (7790)..(8238) <223> Alpha_tubulin_promoter <220> <221> misc_feature <222> (8239)..(8613) <223> Sh_ble_gene <220> <221> misc_feature <222> (8614)..(8887) <223> SV40_terminator <400> 33 cccaatttta ggccccccac tgaccgaggt ctgtcgataa tccacttttc cattgatttt 60 ccaggtttcg ttaactcatg ccactgagca aaacttcggt ctttcctaac aaaagctctc 120 ctcacaaagc atggcgcggc aacggacgtg tcctcatact ccactgccac acaaggtcga 180 taaactaagc tcctcacaaa tagaggagaa ttccactgac aactgaaaac aatgtatgag 240 agacgatcac cactggagcg gcgcggcggt tgggcgcgga ggtcggcagc aaaaacaagc 300 gactcgccga gcaaacccga atcagccttc agacggtcgt gcctaacaac acgccgttct 360 accccgcctt cttcgcgccc cttcgcgtcc aagcatcctt caagtttatc tctctagttc 420 aacttcaaga agaacaacac caccaacacc ggatccatgg ataagaagta ctcgatcggc 480 ctcgacattg gcaccaacag cgtcggctgg gccgtcatta ctgatgagta caaggtcccg 540 tcgaagaagt ttaaggtcct cggcaacact gaccgccact ccatcaagaa gaacctcatc 600 ggtgccctcc tttttgactc cggcgagacc gctgaggcca ctcgcctcaa gcgcactgcc 660 cgccgccgtt acacccgccg caagaaccgc atctgctacc tccaggagat tttctcgaac 720 gaaatggcca aggtcgatga ctcctttttc caccgtctcg aagaatcgtt cctcgtcgag 780 gaggacaaga agcacgagcg ccaccccatc ttcggtaaca ttgtcgatga ggttgcctac 840 cacgagaagt acccgaccat ctaccacctc cgcaagaagc tcgtcgactc caccgacaag 900 gccgatctcc gccttatcta cctcgccctc gcccacatga tcaagttccg cggccacttt 960 cttatcgagg gtgatctcaa ccctgataac tctgacgtcg acaagctttt catccagctc 1020 gtccagactt acaaccagct cttcgaggag aaccccatca acgcttccgg cgtcgacgcg 1080 aaggccattc tcagcgcccg cctcagcaag tcccgccgcc tcgaaaacct cattgcccag 1140 cttcccggcg agaagaagaa cggcctcttc ggcaacctca ttgccctcag ccttggcctc 1200 acccctaact tcaagtcgaa ctttgacctc gccgaggacg ccaagctcca gctttccaag 1260 gacacttacg acgacgatct cgacaacctc ctcgctcaga ttggcgacca gtacgctgac 1320 ctcttcctcg ccgccaagaa ccttagcgat gccatcctcc tctccgacat ccttcgtgtt 1380 aacacggaaa tcacgaaggc tccgctctcc gcctccatga tcaagcgtta cgacgagcac 1440 catcaggacc tcaccctcct caaggccctc gtccgccagc agctccccga gaagtacaag 1500 gagatcttct tcgaccagag caagaacggc tacgccggct acattgacgg cggcgcgtcg 1560 caggaggagt tttacaagtt tatcaagccc attcttgaga agatggacgg caccgaggag 1620 ctcctcgtca agctcaaccg tgaggacctt ctccgcaagc agcgcacgtt cgacaacggc 1680 tctattcccc atcagatcca cctcggtgag cttcacgcga ttcttcgccg ccaggaagac 1740 ttttacccgt tcctcaagga caaccgcgag aagattgaga agatcctcac ctttcgcatt 1800 ccctactacg tcggccccct cgcccgcggc aactcgcgct ttgcttggat gacccgcaag 1860 tccgaggaga ccatcacccc gtggaacttc gaagaggtcg tcgacaaggg cgcctccgcg 1920 cagtctttca tcgagcgcat gactaacttt gacaagaacc tcccgaacga gaaggtcctc 1980 cccaagcaca gcctccttta cgaatacttt acggtgtaca acgagctcac gaaggtcaag 2040 tacgtcactg agggcatgcg caagccggcg ttcctttcgg gcgagcagaa gaaggctatc 2100 gtcgacctcc ttttcaagac caaccgcaag gttaccgtca agcagctcaa ggaggactac 2160 ttcaagaaga tcgagtgctt tgactcggtc gagatttcgg gcgtggagga ccgtttcaac 2220 gcctccctcg gcacttacca cgaccttctc aagatcatca aggacaagga ctttctcgac 2280 aacgaggaga acgaggacat tctcgaggac atcgtcctca cgctcaccct ctttgaggac 2340 cgtgagatga tcgaggagcg cctcaagacc tacgcccatc tctttgacga caaggtcatg 2400 aagcagctca agcgccgccg ctacaccggc tggggccgcc tttcccgcaa gctcatcaac 2460 ggcatccgcg acaagcagtc tggcaagacc atccttgact ttcttaagtc tgatggtttc 2520 gccaaccgca acttcatgca gctcatccac gacgacagcc tcactttcaa ggaggacatt 2580 cagaaggccc aggtctccgg ccagggtgac tctctccacg aacacatcgc caaccttgct 2640 ggcagcccgg ctattaagaa gggcatcctc cagaccgtca aggtcgtcga cgagctcgtc 2700 aaggttatgg gccgccacaa gcccgagaac atcgtcattg agatggctcg cgaaaaccag 2760 accacccaga agggtcagaa gaactcccgc gagcgcatga agcgtatcga ggagggcatc 2820 aaggagctcg gcagccagat cctcaaggag cacccggtcg agaacaccca gctccagaac 2880 gaaaagctct acctctacta cctccagaac ggccgtgaca tgtacgttga ccaggagctc 2940 gacattaacc gcctctccga ttacgacgtc gaccatattg tcccccagag ctttctcaag 3000 gacgacagca tcgacaacaa ggtcctcacc cgctcggaca agaaccgcgg caagtccgac 3060 aacgtccctt ccgaggaggt cgtgaagaag atgaagaact actggcgcca gcttctcaac 3120 gctaagctta ttactcagcg caagttcgat aacctcacca aggccgaacg cggcggcctc 3180 tccgagctcg acaaggccgg ttttatcaag cgccagctcg ttgagactcg ccagatcacc 3240 aagcacgtgg cgcagatcct cgactcgcgc atgaacacga agtacgacga gaacgacaag 3300 ctcatccgcg aggtcaaggt catcaccctt aagtcgaagc tcgtgtccga ctttcgcaag 3360 gacttccagt tctacaaggt ccgtgaaatt aacaactacc accacgctca cgacgcttac 3420 ctcaacgcgg tcgtgggtac cgcgctcatc aagaagtacc cgaagctcga gtcggagttt 3480 gtctacggcg actacaaggt ctacgacgtg cgcaagatga tcgccaagtc cgagcaggag 3540 atcggcaagg ccacggccaa gtactttttc tactccaaca ttatgaactt ctttaagact 3600 gagatcaccc ttgccaacgg cgagatccgc aagcgccccc ttatcgagac caacggcgag 3660 accggcgaaa ttgtgtggga taagggtcgc gactttgcca ccgtccgcaa ggtcctcagc 3720 atgccccagg tcaacattgt taagaagacc gaggtccaga cgggcggctt tagcaaggag 3780 tctatcctcc ccaagcgtaa cagcgacaag ctcatcgccc gcaagaagga ctgggaccct 3840 aagaagtacg gcggcttcga ttcgcctacg gtcgcctaca gcgtcctcgt cgtcgccaag 3900 gtcgagaagg gcaagtccaa gaagctcaag tccgtcaagg agctcctcgg catcacgatc 3960 atggagcgct ccagctttga gaagaacccc attgacttcc tcgaggctaa gggttacaag 4020 gaggtcaaga aggaccttat catcaagctc cccaagtact ccctctttga gctcgaaaac 4080 ggccgcaagc gtatgctcgc tagcgctggc gaactccaga agggcaacga gctcgccctc 4140 cccagcaagt acgtcaactt tctctacctc gcctcccact acgagaagct caagggtagc 4200 ccggaggata acgagcagaa gcagcttttt gtggagcagc acaagcacta ccttgacgag 4260 atcattgaac agatctccga gttctccaag cgtgttattc ttgctgacgc caacctcgat 4320 aaggtgctct ccgcgtacaa caagcaccgc gacaagccta tccgcgagca ggccgagaac 4380 atcatccacc tctttaccct caccaacctc ggcgccccgg ccgcctttaa gtactttgat 4440 acgactatcg accgcaagcg ctacacttcg actaaggagg tcctcgacgc taccctcatt 4500 caccagtcca ttaccggcct ctacgagacc cgcattgacc tttcgcagct cggtggcgac 4560 tcgcgtgcgg accctaagaa gaagcgcaag gtctaacata tgagttatga gatccgaaag 4620 tgaaccttgt cctaacccga cagcgaatgg cgggaggggg cgggctaaaa gatcgtatta 4680 catagtattt ttcccctact ctttgtgttt gtcttttttt tttttttgaa cgcattcaag 4740 ccacttgtct gggtttactt gtttgtttgc ttgcttgctt gcttgcttgc ctgcttcttg 4800 gtcagacggc ccaaaaaagg gaaaaaattc attcatggca cagataagaa aaagaaaaag 4860 tttgtcgacc accgtcatca gaaagcaaga gaagagaaac actcgcgctc acattctcgc 4920 tcgcgtaaga atcttagcca cgcatacgaa gtaatttgtc catctggcga atctttacat 4980 gagcgttttc aagctggagc gtgagatcat acctttcttg atcgtaatgt tccaaccttg 5040 cataggcctc gttgcgatcc gctagcaatg cgtcgtactc ccgttgcaac tgcgccatcg 5100 cctcattgtg acgtgagttc agattcttct cgagaccttc gagcgctgct aatttcgcct 5160 gacgctcctt cttttgtgct tccatgacac gccgcttcac cgtgcgttcc acttcttcct 5220 cagacatgcc cttggctgcc tcgacctgct cggtaaaacg ggccccagca cgtgctacga 5280 gatttcgatt ccaccgccgc cttctatgaa aggttgggct tcggaatcgt tttccgggac 5340 gccggctgga tgatcctcca gcgcggggat ctcatgctgg agttcttcgc ccaccccaac 5400 ttgtttattg cagcttataa tggttacaaa taaagcaata gcatcacaaa tttcacaaat 5460 aaagcatttt tttcactgca ttctagttgt ggtttgtcca aactcatcaa tgtatcttat 5520 catacatggt cgacctgcag gaacctgcat taatgaatcg gccaacgcgc ggggagaggc 5580 ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt 5640 cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca 5700 ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa 5760 aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat 5820 cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc 5880 cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc 5940 gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt 6000 tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac 6060 cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg 6120 ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca 6180 gagttcttga agtggtggcc taactacggc tacactagaa gaacagtatt tggtatctgc 6240 gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa 6300 accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa 6360 ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac 6420 tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta 6480 aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt 6540 taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg ttcatccata 6600 gttgcctgac tccccgtcgt gtagataact acgatacggg agggcttacc atctggcccc 6660 agtgctgcaa tgataccgcg agacccacgc tcaccggctc cagatttatc agcaataaac 6720 cagccagccg gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc ctccatccag 6780 tctattaatt gttgccggga agctagagta agtagttcgc cagttaatag tttgcgcaac 6840 gttgttgcca ttgctacagg catcgtggtg tcacgctcgt cgtttggtat ggcttcattc 6900 agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg caaaaaagcg 6960 gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt gttatcactc 7020 atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag atgcttttct 7080 gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg accgagttgc 7140 tcttgcccgg cgtcaatacg ggataatacc gcgccacata gcagaacttt aaaagtgctc 7200 atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct gttgagatcc 7260 agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac tttcaccagc 7320 gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat aagggcgaca 7380 cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat ttatcagggt 7440 tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca aataggggtt 7500 ccgcgcacat ttccccgaaa agtgccacct gacgtctaag aaaccattat tatcatgaca 7560 ttaacctata aaaataggcg tatcacgagg ccctttcgtc tcgcgcgttt cggtgatgac 7620 ggtgaaaacc tctgacacat gcagctcccg gagacggtca cagcttgtct gtaagcggat 7680 gccgggagca gacaagcccg tcagggcgcg tcagcgggtg ttggcgggtg tcggggctgg 7740 cttaactatg cggcatcaga gcagattgta ctgagagtgc accaagcttc caattttagg 7800 ccccccactg accgaggtct gtcgataatc cacttttcca ttgattttcc aggtttcgtt 7860 aactcatgcc actgagcaaa acttcggtct ttcctaacaa aagctctcct cacaaagcat 7920 ggcgcggcaa cggacgtgtc ctcatactcc actgccacac aaggtcgata aactaagctc 7980 ctcacaaata gaggagaatt ccactgacaa ctgaaaacaa tgtatgagag acgatcacca 8040 ctggagcggc gcggcggttg ggcgcggagg tcggcagcaa aaacaagcga ctcgccgagc 8100 aaacccgaat cagccttcag acggtcgtgc ctaacaacac gccgttctac cccgccttct 8160 tcgcgcccct tcgcgtccaa gcatccttca agtttatctc tctagttcaa cttcaagaag 8220 aacaacacca ccaacaccat ggccaagttg accagtgccg ttccggtgct caccgcgcgc 8280 gacgtcgccg gagcggtcga gttctggacc gaccggctcg ggttctcccg ggacttcgtg 8340 gaggacgact tcgccggtgt ggtccgggac gacgtgaccc tgttcatcag cgcggtccag 8400 gaccaggtgg tgccggacaa caccctggcc tgggtgtggg tgcgcggcct ggacgagctg 8460 tacgccgagt ggtcggaggt cgtgtccacg aacttccggg acgcctccgg gccggccatg 8520 accgagatcg gcgagcagcc gtgggggcgg gagttcgccc tgcgcgaccc ggccggcaac 8580 tgcgtgcact tcgtggccga ggagcaggac tgacacgtgc tacgagattt cgattccacc 8640 gccgccttct atgaaaggtt gggcttcgga atcgttttcc gggacgccgg ctggatgatc 8700 ctccagcgcg gggatctcat gctggagttc ttcgcccacc ccaacttgtt tattgcagct 8760 tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc atttttttca 8820 ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt ctgaattccc 8880 ggggtac 8887 <210> 34 <211> 5997 <212> DNA <213> Artificial Sequence <220> <223> pYB66 vector (5997 bp) <220> <221> misc_feature <222> (1)..(1064) <223> EF-1alpha_promoter <220> <221> misc_feature <222> (1082)..(1118) <223> Hammerhead_ribozyme <220> <221> misc_feature <222> (1119)..(1138) <223> gRNA3_CS1_target_sequence <220> <221> misc_feature <222> (1139)..(1218) <223> gRNA_scaffold <220> <221> misc_feature <222> (1219)..(1286) <223> HDV_ribozyme <220> <221> misc_feature <222> (1306)..(1945) <223> OrfC_terminator <220> <221> misc_feature <222> (4480)..(4928) <223> Alpha_tubulin_promoter <220> <221> misc_feature <222> (4929)..(5723) <223> Neomycin_phosphotransferase_gene <220> <221> misc_feature <222> (5724)..(5997) <223> SV40_terminator <400> 34 ctcttatctg cctcgcgccg ttgaccgccg cttgactctt ggcgcttgcc gctcgcatcc 60 tgcctcgctc gcgcaggcgg gcgggcgagt gggtgggtcc gcagccttcc gcgctcgccc 120 gctagctcgc tcgcgccgtg ctgcagccag cagggcagca ccgcacggca ggcaggtccc 180 ggcgcggatc gatcgatcca tcgatccatc gatccatcga tcgtgcggtc aaaaagaaag 240 gaagaagaaa ggaaaaagaa aggcgtgcgc acccgagtgc gcgctgagcg cccgctcgcg 300 gtcccgcgga gcctccgcgt tagtccccgc cccgcgccgc gcagtccccc gggaggcatc 360 gcgcacctct cgccgccccc tcgcgcctcg ccgattcccc gcctcccctt ttccgcttct 420 tcgccgcctc cgctcgcggc cgcgtcgccc gcgccccgct ccctatctgc tccccagggg 480 ggcactccgc accttttgcg cccgctgccg ccgccgcggc cgccccgccg ccctggtttc 540 ccccgcgagc gcggccgcgt cgccgcgcaa agactcgccg cgtgccgccc cgagcaacgg 600 gtggcggcgg cgcggcggcg ggcggggcgc ggcggcgcgt aggcggggct aggcgccggc 660 taggcgaaac gccgcccccg ggcgccgccg ccgcccgctc cagagcagtc gccgcgccag 720 accgccaacg cagagaccga gaccgaggta cgtcgcgccc gagcacgccg cgacgcgcgg 780 cagggacgag gagcacgacg ccgcgccgcg ccgcgcgggg ggggggaggg agaggcagga 840 cgcgggagcg agcgtgcatg tttccgcgcg agacgacgcc gcgcgcgctg gagaggagat 900 aaggcgcttg gatcgcgaga gggccagcca ggctggaggc gaaaatgggt ggagaggata 960 gtatcttgcg tgcttggacg aggagactga cgaggaggac ggatacgtcg atgatgatgt 1020 gcacagagaa gaagcagttc gaaagcgact actagcaagc aagggatcca gatcttccgc 1080 actgatgagt ccgtgaggac gaaacgagta agctcgtctg cggacgtcgt ggacgcgcgt 1140 tttagagcta gaaatagcaa gttaaaataa ggctagtccg ttatcaactt gaaaaagtgg 1200 caccgagtcg gtgcttttgg ccggcatggt cccagcctcc tcgctggcgc cggctgggca 1260 acatgcttcg gcatggcgaa tgggaccata tgagttatga gatccgaaag tgaaccttgt 1320 cctaacccga cagcgaatgg cgggaggggg cgggctaaaa gatcgtatta catagtattt 1380 ttcccctact ctttgtgttt gtcttttttt tttttttgaa cgcattcaag ccacttgtct 1440 gggtttactt gtttgtttgc ttgcttgctt gcttgcttgc ctgcttcttg gtcagacggc 1500 ccaaaaaagg gaaaaaattc attcatggca cagataagaa aaagaaaaag tttgtcgacc 1560 accgtcatca gaaagcaaga gaagagaaac actcgcgctc acattctcgc tcgcgtaaga 1620 atcttagcca cgcatacgaa gtaatttgtc catctggcga atctttacat gagcgttttc 1680 aagctggagc gtgagatcat acctttcttg atcgtaatgt tccaaccttg cataggcctc 1740 gttgcgatcc gctagcaatg cgtcgtactc ccgttgcaac tgcgccatcg cctcattgtg 1800 acgtgagttc agattcttct cgagaccttc gagcgctgct aatttcgcct gacgctcctt 1860 cttttgtgct tccatgacac gccgcttcac cgtgcgttcc acttcttcct cagacatgcc 1920 cttggctgcc tcgacctgct cggtaaaacg ggccccagca cgtgctacga gatttcgatt 1980 ccaccgccgc cttctatgaa aggttgggct tcggaatcgt tttccgggac gccggctgga 2040 tgatcctcca gcgcggggat ctcatgctgg agttcttcgc ccaccccaac ttgtttattg 2100 cagcttataa tggttacaaa taaagcaata gcatcacaaa tttcacaaat aaagcatttt 2160 tttcactgca ttctagttgt ggtttgtcca aactcatcaa tgtatcttat catacatggt 2220 cgacctgcag gaacctgcat taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta 2280 ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc 2340 gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca ggggataacg 2400 caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt 2460 tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa 2520 gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc cctggaagct 2580 ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc 2640 cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt tcggtgtagg 2700 tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct 2760 tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg ccactggcag 2820 cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca gagttcttga 2880 agtggtggcc taactacggc tacactagaa gaacagtatt tggtatctgc gctctgctga 2940 agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa accaccgctg 3000 gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag 3060 aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac tcacgttaag 3120 ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta aattaaaaat 3180 gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt taccaatgct 3240 taatcagtga ggcacctatc tcagcgatct gtctatttcg ttcatccata gttgcctgac 3300 tccccgtcgt gtagataact acgatacggg agggcttacc atctggcccc agtgctgcaa 3360 tgataccgcg agacccacgc tcaccggctc cagatttatc agcaataaac cagccagccg 3420 gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc ctccatccag tctattaatt 3480 gttgccggga agctagagta agtagttcgc cagttaatag tttgcgcaac gttgttgcca 3540 ttgctacagg catcgtggtg tcacgctcgt cgtttggtat ggcttcattc agctccggtt 3600 cccaacgatc aaggcgagtt acatgatccc ccatgttgtg caaaaaagcg gttagctcct 3660 tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt gttatcactc atggttatgg 3720 cagcactgca taattctctt actgtcatgc catccgtaag atgcttttct gtgactggtg 3780 agtactcaac caagtcattc tgagaatagt gtatgcggcg accgagttgc tcttgcccgg 3840 cgtcaatacg ggataatacc gcgccacata gcagaacttt aaaagtgctc atcattggaa 3900 aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct gttgagatcc agttcgatgt 3960 aacccactcg tgcacccaac tgatcttcag catcttttac tttcaccagc gtttctgggt 4020 gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt 4080 gaatactcat actcttcctt tttcaatatt attgaagcat ttatcagggt tattgtctca 4140 tgagcggata catatttgaa tgtatttaga aaaataaaca aataggggtt ccgcgcacat 4200 ttccccgaaa agtgccacct gacgtctaag aaaccattat tatcatgaca ttaacctata 4260 aaaataggcg tatcacgagg ccctttcgtc tcgcgcgttt cggtgatgac ggtgaaaacc 4320 tctgacacat gcagctcccg gagacggtca cagcttgtct gtaagcggat gccgggagca 4380 gacaagcccg tcagggcgcg tcagcgggtg ttggcgggtg tcggggctgg cttaactatg 4440 cggcatcaga gcagattgta ctgagagtgc accaagcttc caattttagg ccccccactg 4500 accgaggtct gtcgataatc cacttttcca ttgattttcc aggtttcgtt aactcatgcc 4560 actgagcaaa acttcggtct ttcctaacaa aagctctcct cacaaagcat ggcgcggcaa 4620 cggacgtgtc ctcatactcc actgccacac aaggtcgata aactaagctc ctcacaaata 4680 gaggagaatt ccactgacaa ctgaaaacaa tgtatgagag acgatcacca ctggagcggc 4740 gcggcggttg ggcgcggagg tcggcagcaa aaacaagcga ctcgccgagc aaacccgaat 4800 cagccttcag acggtcgtgc ctaacaacac gccgttctac cccgccttct tcgcgcccct 4860 tcgcgtccaa gcatccttca agtttatctc tctagttcaa cttcaagaag aacaacacca 4920 ccaacaccat gattgaacaa gatggattgc acgcaggttc tccggccgct tgggtggaga 4980 ggctattcgg ctatgactgg gcacaacaga caatcggctg ctctgatgcc gccgtgttcc 5040 ggctgtcagc gcaggggcgc ccggttcttt ttgtcaagac cgacctgtcc ggtgccctga 5100 atgaactgca ggacgaggca gcgcggctat cgtggctggc cacgacgggc gttccttgcg 5160 cagctgtgct cgacgttgtc actgaagcgg gaagggactg gctgctattg ggcgaagtgc 5220 cggggcagga tctcctgtca tctcaccttg ctcctgccga gaaagtatcc atcatggctg 5280 atgcaatgcg gcggctgcat acgcttgatc cggctacctg cccattcgac caccaagcga 5340 aacatcgcat cgagcgagca cgtactcgga tggaagccgg tcttgtcgat caggatgatc 5400 tggacgaaga gcatcagggg ctcgcgccag ccgaactgtt cgccaggctc aaggcgcgca 5460 tgcccgacgg cgatgatctc gtcgtgaccc atggcgatgc ctgcttgccg aatatcatgg 5520 tggaaaatgg ccgcttttct ggattcatcg actgtggccg gctgggtgtg gcggaccgct 5580 atcaggacat agcgttggct acccgtgata ttgctgaaga gcttggcggc gaatgggctg 5640 accgcttcct cgtgctttac ggtatcgccg ctcccgattc gcagcgcatc gccttctatc 5700 gccttcttga cgagttcttc tgacacgtgc tacgagattt cgattccacc gccgccttct 5760 atgaaaggtt gggcttcgga atcgttttcc gggacgccgg ctggatgatc ctccagcgcg 5820 gggatctcat gctggagttc ttcgcccacc ccaacttgtt tattgcagct tataatggtt 5880 acaaataaag caatagcatc acaaatttca caaataaagc atttttttca ctgcattcta 5940 gttgtggttt gtccaaactc atcaatgtat cttatcatgt ctgaattccc ggggtac 5997 <210> 35 <211> 10700 <212> DNA <213> Artificial Sequence <220> <223> pYB73 vector (10700 bp) <220> <221> misc_feature <222> (2)..(450) <223> Alpha_tubulin_promoter <220> <221> misc_feature <222> (457)..(4596) <223> Cas9 <220> <221> misc_feature <222> (4616)..(5255) <223> OrfC_terminator <220> <221> misc_feature <222> (5546)..(6608) <223> EF_1alpha_promoter <220> <221> misc_feature <222> (6621)..(6657) <223> Hammerhead_ribozyme <220> <221> misc_feature <222> (6658)..(6677) <223> gRNA3_CS1_target_sequence <220> <221> misc_feature <222> (6678)..(6757) <223> gRNA_scaffold <220> <221> misc_feature <222> (6758)..(6825) <223> HDV_ribozyme <220> <221> misc_feature <222> (6845)..(7363) <223> OrfC_terminator <220> <221> misc_feature <222> (8369)..(9229) <223> Ampicillin_resistance_gene <220> <221> misc_feature <222> (9619)..(10067) <223> Alpha_tubulin_promoter <220> <221> misc_feature <222> (10068)..(10442) <223> Sh_ble_gene <220> <221> misc_feature <222> (10443)..(10700) <223> SV40_terminator <400> 35 cccaatttta ggccccccac tgaccgaggt ctgtcgataa tccacttttc cattgatttt 60 ccaggtttcg ttaactcatg ccactgagca aaacttcggt ctttcctaac aaaagctctc 120 ctcacaaagc atggcgcggc aacggacgtg tcctcatact ccactgccac acaaggtcga 180 taaactaagc tcctcacaaa tagaggagaa ttccactgac aactgaaaac aatgtatgag 240 agacgatcac cactggagcg gcgcggcggt tgggcgcgga ggtcggcagc aaaaacaagc 300 gactcgccga gcaaacccga atcagccttc agacggtcgt gcctaacaac acgccgttct 360 accccgcctt cttcgcgccc cttcgcgtcc aagcatcctt caagtttatc tctctagttc 420 aacttcaaga agaacaacac caccaacacc ggatccatgg ataagaagta ctcgatcggc 480 ctcgacattg gcaccaacag cgtcggctgg gccgtcatta ctgatgagta caaggtcccg 540 tcgaagaagt ttaaggtcct cggcaacact gaccgccact ccatcaagaa gaacctcatc 600 ggtgccctcc tttttgactc cggcgagacc gctgaggcca ctcgcctcaa gcgcactgcc 660 cgccgccgtt acacccgccg caagaaccgc atctgctacc tccaggagat tttctcgaac 720 gaaatggcca aggtcgatga ctcctttttc caccgtctcg aagaatcgtt cctcgtcgag 780 gaggacaaga agcacgagcg ccaccccatc ttcggtaaca ttgtcgatga ggttgcctac 840 cacgagaagt acccgaccat ctaccacctc cgcaagaagc tcgtcgactc caccgacaag 900 gccgatctcc gccttatcta cctcgccctc gcccacatga tcaagttccg cggccacttt 960 cttatcgagg gtgatctcaa ccctgataac tctgacgtcg acaagctttt catccagctc 1020 gtccagactt acaaccagct cttcgaggag aaccccatca acgcttccgg cgtcgacgcg 1080 aaggccattc tcagcgcccg cctcagcaag tcccgccgcc tcgaaaacct cattgcccag 1140 cttcccggcg agaagaagaa cggcctcttc ggcaacctca ttgccctcag ccttggcctc 1200 acccctaact tcaagtcgaa ctttgacctc gccgaggacg ccaagctcca gctttccaag 1260 gacacttacg acgacgatct cgacaacctc ctcgctcaga ttggcgacca gtacgctgac 1320 ctcttcctcg ccgccaagaa ccttagcgat gccatcctcc tctccgacat ccttcgtgtt 1380 aacacggaaa tcacgaaggc tccgctctcc gcctccatga tcaagcgtta cgacgagcac 1440 catcaggacc tcaccctcct caaggccctc gtccgccagc agctccccga gaagtacaag 1500 gagatcttct tcgaccagag caagaacggc tacgccggct acattgacgg cggcgcgtcg 1560 caggaggagt tttacaagtt tatcaagccc attcttgaga agatggacgg caccgaggag 1620 ctcctcgtca agctcaaccg tgaggacctt ctccgcaagc agcgcacgtt cgacaacggc 1680 tctattcccc atcagatcca cctcggtgag cttcacgcga ttcttcgccg ccaggaagac 1740 ttttacccgt tcctcaagga caaccgcgag aagattgaga agatcctcac ctttcgcatt 1800 ccctactacg tcggccccct cgcccgcggc aactcgcgct ttgcttggat gacccgcaag 1860 tccgaggaga ccatcacccc gtggaacttc gaagaggtcg tcgacaaggg cgcctccgcg 1920 cagtctttca tcgagcgcat gactaacttt gacaagaacc tcccgaacga gaaggtcctc 1980 cccaagcaca gcctccttta cgaatacttt acggtgtaca acgagctcac gaaggtcaag 2040 tacgtcactg agggcatgcg caagccggcg ttcctttcgg gcgagcagaa gaaggctatc 2100 gtcgacctcc ttttcaagac caaccgcaag gttaccgtca agcagctcaa ggaggactac 2160 ttcaagaaga tcgagtgctt tgactcggtc gagatttcgg gcgtggagga ccgtttcaac 2220 gcctccctcg gcacttacca cgaccttctc aagatcatca aggacaagga ctttctcgac 2280 aacgaggaga acgaggacat tctcgaggac atcgtcctca cgctcaccct ctttgaggac 2340 cgtgagatga tcgaggagcg cctcaagacc tacgcccatc tctttgacga caaggtcatg 2400 aagcagctca agcgccgccg ctacaccggc tggggccgcc tttcccgcaa gctcatcaac 2460 ggcatccgcg acaagcagtc tggcaagacc atccttgact ttcttaagtc tgatggtttc 2520 gccaaccgca acttcatgca gctcatccac gacgacagcc tcactttcaa ggaggacatt 2580 cagaaggccc aggtctccgg ccagggtgac tctctccacg aacacatcgc caaccttgct 2640 ggcagcccgg ctattaagaa gggcatcctc cagaccgtca aggtcgtcga cgagctcgtc 2700 aaggttatgg gccgccacaa gcccgagaac atcgtcattg agatggctcg cgaaaaccag 2760 accacccaga agggtcagaa gaactcccgc gagcgcatga agcgtatcga ggagggcatc 2820 aaggagctcg gcagccagat cctcaaggag cacccggtcg agaacaccca gctccagaac 2880 gaaaagctct acctctacta cctccagaac ggccgtgaca tgtacgttga ccaggagctc 2940 gacattaacc gcctctccga ttacgacgtc gaccatattg tcccccagag ctttctcaag 3000 gacgacagca tcgacaacaa ggtcctcacc cgctcggaca agaaccgcgg caagtccgac 3060 aacgtccctt ccgaggaggt cgtgaagaag atgaagaact actggcgcca gcttctcaac 3120 gctaagctta ttactcagcg caagttcgat aacctcacca aggccgaacg cggcggcctc 3180 tccgagctcg acaaggccgg ttttatcaag cgccagctcg ttgagactcg ccagatcacc 3240 aagcacgtgg cgcagatcct cgactcgcgc atgaacacga agtacgacga gaacgacaag 3300 ctcatccgcg aggtcaaggt catcaccctt aagtcgaagc tcgtgtccga ctttcgcaag 3360 gacttccagt tctacaaggt ccgtgaaatt aacaactacc accacgctca cgacgcttac 3420 ctcaacgcgg tcgtgggtac cgcgctcatc aagaagtacc cgaagctcga gtcggagttt 3480 gtctacggcg actacaaggt ctacgacgtg cgcaagatga tcgccaagtc cgagcaggag 3540 atcggcaagg ccacggccaa gtactttttc tactccaaca ttatgaactt ctttaagact 3600 gagatcaccc ttgccaacgg cgagatccgc aagcgccccc ttatcgagac caacggcgag 3660 accggcgaaa ttgtgtggga taagggtcgc gactttgcca ccgtccgcaa ggtcctcagc 3720 atgccccagg tcaacattgt taagaagacc gaggtccaga cgggcggctt tagcaaggag 3780 tctatcctcc ccaagcgtaa cagcgacaag ctcatcgccc gcaagaagga ctgggaccct 3840 aagaagtacg gcggcttcga ttcgcctacg gtcgcctaca gcgtcctcgt cgtcgccaag 3900 gtcgagaagg gcaagtccaa gaagctcaag tccgtcaagg agctcctcgg catcacgatc 3960 atggagcgct ccagctttga gaagaacccc attgacttcc tcgaggctaa gggttacaag 4020 gaggtcaaga aggaccttat catcaagctc cccaagtact ccctctttga gctcgaaaac 4080 ggccgcaagc gtatgctcgc tagcgctggc gaactccaga agggcaacga gctcgccctc 4140 cccagcaagt acgtcaactt tctctacctc gcctcccact acgagaagct caagggtagc 4200 ccggaggata acgagcagaa gcagcttttt gtggagcagc acaagcacta ccttgacgag 4260 atcattgaac agatctccga gttctccaag cgtgttattc ttgctgacgc caacctcgat 4320 aaggtgctct ccgcgtacaa caagcaccgc gacaagccta tccgcgagca ggccgagaac 4380 atcatccacc tctttaccct caccaacctc ggcgccccgg ccgcctttaa gtactttgat 4440 acgactatcg accgcaagcg ctacacttcg actaaggagg tcctcgacgc taccctcatt 4500 caccagtcca ttaccggcct ctacgagacc cgcattgacc tttcgcagct cggtggcgac 4560 tcgcgtgcgg accctaagaa gaagcgcaag gtctaacata tgagttatga gatccgaaag 4620 tgaaccttgt cctaacccga cagcgaatgg cgggaggggg cgggctaaaa gatcgtatta 4680 catagtattt ttcccctact ctttgtgttt gtcttttttt tttttttgaa cgcattcaag 4740 ccacttgtct gggtttactt gtttgtttgc ttgcttgctt gcttgcttgc ctgcttcttg 4800 gtcagacggc ccaaaaaagg gaaaaaattc attcatggca cagataagaa aaagaaaaag 4860 tttgtcgacc accgtcatca gaaagcaaga gaagagaaac actcgcgctc acattctcgc 4920 tcgcgtaaga atcttagcca cgcatacgaa gtaatttgtc catctggcga atctttacat 4980 gagcgttttc aagctggagc gtgagatcat acctttcttg atcgtaatgt tccaaccttg 5040 cataggcctc gttgcgatcc gctagcaatg cgtcgtactc ccgttgcaac tgcgccatcg 5100 cctcattgtg acgtgagttc agattcttct cgagaccttc gagcgctgct aatttcgcct 5160 gacgctcctt cttttgtgct tccatgacac gccgcttcac cgtgcgttcc acttcttcct 5220 cagacatgcc cttggctgcc tcgacctgct cggtaaaacg ggccccagca cgtgctacga 5280 gatttcgatt ccaccgccgc cttctatgaa aggttgggct tcggaatcgt tttccgggac 5340 gccggctgga tgatcctcca gcgcggggat ctcatgctgg agttcttcgc ccaccccaac 5400 ttgtttattg cagcttataa tggttacaaa taaagcaata gcatcacaaa tttcacaaat 5460 aaagcatttt tttcactgca ttctagttgt ggtttgtcca aactcatcaa tgtatcttat 5520 catacatggt cgacctgcag ggtacctctt atctgcctcg cgccgttgac cgccgcttga 5580 ctcttggcgc ttgccgctcg catcctgcct cgctcgcgca ggcgggcggg cgagtgggtg 5640 ggtccgcagc cttccgcgct cgcccgctag ctcgctcgcg ccgtgctgca gccagcaggg 5700 cagcaccgca cggcaggcag gtcccggcgc ggatcgatcg atccatcgat ccatcgatcc 5760 atcgatcgtg cggtcaaaaa gaaaggaaga agaaaggaaa aagaaaggcg tgcgcacccg 5820 agtgcgcgct gagcgcccgc tcgcggtccc gcggagcctc cgcgttagtc cccgccccgc 5880 gccgcgcagt cccccgggag gcatcgcgca cctctcgccg ccccctcgcg cctcgccgat 5940 tccccgcctc cccttttccg cttcttcgcc gcctccgctc gcggccgcgt cgcccgcgcc 6000 ccgctcccta tctgctcccc aggggggcac tccgcacctt ttgcgcccgc tgccgccgcc 6060 gcggccgccc cgccgccctg gtttcccccg cgagcgcggc cgcgtcgccg cgcaaagact 6120 cgccgcgtgc cgccccgagc aacgggtggc ggcggcgcgg cggcgggcgg ggcgcggcgg 6180 cgcgtaggcg gggctaggcg ccggctaggc gaaacgccgc ccccgggcgc cgccgccgcc 6240 cgctccagag cagtcgccgc gccagaccgc caacgcagag accgagaccg aggtacgtcg 6300 cgcccgagca cgccgcgacg cgcggcaggg acgaggagca cgacgccgcg ccgcgccgcg 6360 cggggggggg gagggagagg caggacgcgg gagcgagcgt gcatgtttcc gcgcgagacg 6420 acgccgcgcg cgctggagag gagataaggc gcttggatcg cgagagggcc agccaggctg 6480 gaggcgaaaa tgggtggaga ggatagtatc ttgcgtgctt ggacgaggag actgacgagg 6540 aggacggata cgtcgatgat gatgtgcaca gagaagaagc agttcgaaag cgactactag 6600 caagcaagag atcttccgca ctgatgagtc cgtgaggacg aaacgagtaa gctcgtctgc 6660 ggacgtcgtg gacgcgcgtt ttagagctag aaatagcaag ttaaaataag gctagtccgt 6720 tatcaacttg aaaaagtggc accgagtcgg tgcttttggc cggcatggtc ccagcctcct 6780 cgctggcgcc ggctgggcaa catgcttcgg catggcgaat gggaccatat gagttatgag 6840 atccgaaagt gaaccttgtc ctaacccgac agcgaatggc gggagggggc gggctaaaag 6900 atcgtattac atagtatttt tcccctactc tttgtgtttg tctttttttt ttttttgaac 6960 gcattcaagc cacttgtctg ggtttacttg tttgtttgct tgcttgcttg cttgcttgcc 7020 tgcttcttgg tcagacggcc caaaaaaggg aaaaaattca ttcatggcac agataagaaa 7080 aagaaaaagt ttgtcgacca ccgtcatcag aaagcaagag aagagaaaca ctcgcgctca 7140 cattctcgct cgcgtaagaa tcttagccac gcatacgaag taatttgtcc atctggcgaa 7200 tctttacatg agcgttttca agctggagcg tgagatcata cctttcttga tcgtaatgtt 7260 ccaaccttgc ataggcctcg ttgcgatccg ctagcaatgc gtcgtactcc cgttgcaact 7320 gcgccatcgc ctcattgtga cgtgagttca gattcttctc gagctgcagg aacctgcatt 7380 aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct tccgcttcct 7440 cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa 7500 aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa 7560 aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc 7620 tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga 7680 caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc 7740 cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt 7800 ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct 7860 gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg 7920 agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta 7980 gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct 8040 acactagaag aacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa 8100 gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt 8160 gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta 8220 cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat 8280 caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa 8340 gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct 8400 cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta 8460 cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct 8520 caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg 8580 gtcctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa 8640 gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt 8700 cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta 8760 catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca 8820 gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta 8880 ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct 8940 gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg 9000 cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac 9060 tctcaaggat cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact 9120 gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa 9180 atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt 9240 ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat 9300 gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg 9360 acgtctaaga aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc 9420 cctttcgtct cgcgcgtttc ggtgatgacg gtgaaaacct ctgacacatg cagctcccgg 9480 agacggtcac agcttgtctg taagcggatg ccgggagcag acaagcccgt cagggcgcgt 9540 cagcgggtgt tggcgggtgt cggggctggc ttaactatgc ggcatcagag cagattgtac 9600 tgagagtgca ccaagcttcc aattttaggc cccccactga ccgaggtctg tcgataatcc 9660 acttttccat tgattttcca ggtttcgtta actcatgcca ctgagcaaaa cttcggtctt 9720 tcctaacaaa agctctcctc acaaagcatg gcgcggcaac ggacgtgtcc tcatactcca 9780 ctgccacaca aggtcgataa actaagctcc tcacaaatag aggagaattc cactgacaac 9840 tgaaaacaat gtatgagaga cgatcaccac tggagcggcg cggcggttgg gcgcggaggt 9900 cggcagcaaa aacaagcgac tcgccgagca aacccgaatc agccttcaga cggtcgtgcc 9960 taacaacacg ccgttctacc ccgccttctt cgcgcccctt cgcgtccaag catccttcaa 10020 gtttatctct ctagttcaac ttcaagaaga acaacaccac caacaccatg gccaagttga 10080 ccagtgccgt tccggtgctc accgcgcgcg acgtcgccgg agcggtcgag ttctggaccg 10140 accggctcgg gttctcccgg gacttcgtgg aggacgactt cgccggtgtg gtccgggacg 10200 acgtgaccct gttcatcagc gcggtccagg accaggtggt gccggacaac accctggcct 10260 gggtgtgggt gcgcggcctg gacgagctgt acgccgagtg gtcggaggtc gtgtccacga 10320 acttccggga cgcctccggg ccggccatga ccgagatcgg cgagcagccg tgggggcggg 10380 agttcgccct gcgcgacccg gccggcaact gcgtgcactt cgtggccgag gagcaggact 10440 gacacgtgct acgagatttc gattccaccg ccgccttcta tgaaaggttg ggcttcggaa 10500 tcgttttccg ggacgccggc tggatgatcc tccagcgcgg ggatctcatg ctggagttct 10560 tcgcccaccc caacttgttt attgcagctt ataatggtta caaataaagc aatagcatca 10620 caaatttcac aaataaagca tttttttcac tgcattctag ttgtggtttg tccaaactca 10680 tcaatgtatc ttatcggtac 10700 <210> 36 <211> 6300 <212> DNA <213> Artificial Sequence <220> <223> pCL310 vector (6300 bp) <220> <221> misc_feature <222> (1)..(1064) <223> EF_1alpha_promoter <220> <221> misc_feature <222> (1070)..(1981) <223> CarG_gene <220> <221> misc_feature <222> (2001)..(2640) <223> OrfC_terminator <220> <221> misc_feature <222> (5175)..(5623) <223> Alpha_tubulin_promoter <220> <221> misc_feature <222> (5627)..(6019) <223> Bsd_gene <220> <221> misc_feature <222> (6020)..(6300) <223> SV40_terminator <400> 36 ctcttatctg cctcgcgccg ttgaccgccg cttgactctt ggcgcttgcc gctcgcatcc 60 tgcctcgctc gcgcaggcgg gcgggcgagt gggtgggtcc gcagccttcc gcgctcgccc 120 gctagctcgc tcgcgccgtg ctgcagccag cagggcagca ccgcacggca ggcaggtccc 180 ggcgcggatc gatcgatcca tcgatccatc gatccatcga tcgtgcggtc aaaaagaaag 240 gaagaagaaa ggaaaaagaa aggcgtgcgc acccgagtgc gcgctgagcg cccgctcgcg 300 gtcccgcgga gcctccgcgt tagtccccgc cccgcgccgc gcagtccccc gggaggcatc 360 gcgcacctct cgccgccccc tcgcgcctcg ccgattcccc gcctcccctt ttccgcttct 420 tcgccgcctc cgctcgcggc cgcgtcgccc gcgccccgct ccctatctgc tccccagggg 480 ggcactccgc accttttgcg cccgctgccg ccgccgcggc cgccccgccg ccctggtttc 540 ccccgcgagc gcggccgcgt cgccgcgcaa agactcgccg cgtgccgccc cgagcaacgg 600 gtggcggcgg cgcggcggcg ggcggggcgc ggcggcgcgt aggcggggct aggcgccggc 660 taggcgaaac gccgcccccg ggcgccgccg ccgcccgctc cagagcagtc gccgcgccag 720 accgccaacg cagagaccga gaccgaggta cgtcgcgccc gagcacgccg cgacgcgcgg 780 cagggacgag gagcacgacg ccgcgccgcg ccgcgcgggg ggggggaggg agaggcagga 840 cgcgggagcg agcgtgcatg tttccgcgcg agacgacgcc gcgcgcgctg gagaggagat 900 aaggcgcttg gatcgcgaga gggccagcca ggctggaggc gaaaatgggt ggagaggata 960 gtatcttgcg tgcttggacg aggagactga cgaggaggac ggatacgtcg atgatgatgt 1020 gcacagagaa gaagcagttc gaaagcgact actagcaagc aagggatcca tgctcaactc 1080 gcacaaccgc actgaggagc gctccactga ggacatcatc cttgagccct acacctacct 1140 catttcccag cccggcaagg acatccgcgc taagcttatt tccgctttcg acctctggct 1200 ccatgtcccc aaggacgtcc tctgcgtcat caacaagatt attggcatgc ttcacaacgc 1260 ctccctcatg atcgacgatg tccaggatga ctccgacctt cgccgtggcg tccccgtcgc 1320 ccaccatatc tacggcgtcc cccagacgat caacaccgct aactacgtca tttttctcgc 1380 tctccaggag gtcatgaagc tcaacatccc ctccatgatg caggtctgca ctgaggagct 1440 catcaacctt caccgcggcc agggtatcga gctctactgg cgcgattcgc ttacttgccc 1500 cactgaggag gagtacattg acatggtcaa caacaagacg tccggccttc tccgtcttgc 1560 cgtccgtctc atgcaggccg cctcggagtc cgacatcgac tacacccctc tcgtcaacat 1620 catcggtatt cactttcagg tccgcgacga ttacatgaac ctccagtcca ctagctacac 1680 gaacaacaag ggtttctgcg aggacctcac ggagggcaag ttttcgttcc ccatcatcca 1740 cgccattcgc aaggacccca gcaaccgcca gctccttaac attatctccc agaagcctac 1800 gtccattgag gttaagaagt acgcccttga ggttattcgc aaggccggca gctttgagta 1860 cgttcgcgag ttcctgcgcc agaaggaggc cgagtccctt aaggagatca agcgccttgg 1920 aggcaaccct ctcctcgaga agtacattga gaccatccgc gtcgaggcca cgaacgacta 1980 acatatgagt tatgagatcc gaaagtgaac cttgtcctaa cccgacagcg aatggcggga 2040 gggggcgggc taaaagatcg tattacatag tatttttccc ctactctttg tgtttgtctt 2100 tttttttttt ttgaacgcat tcaagccact tgtctgggtt tacttgtttg tttgcttgct 2160 tgcttgcttg cttgcctgct tcttggtcag acggcccaaa aaagggaaaa aattcattca 2220 tggcacagat aagaaaaaga aaaagtttgt cgaccaccgt catcagaaag caagagaaga 2280 gaaacactcg cgctcacatt ctcgctcgcg taagaatctt agccacgcat acgaagtaat 2340 ttgtccatct ggcgaatctt tacatgagcg ttttcaagct ggagcgtgag atcatacctt 2400 tcttgatcgt aatgttccaa ccttgcatag gcctcgttgc gatccgctag caatgcgtcg 2460 tactcccgtt gcaactgcgc catcgcctca ttgtgacgtg agttcagatt cttctcgaga 2520 ccttcgagcg ctgctaattt cgcctgacgc tccttctttt gtgcttccat gacacgccgc 2580 ttcaccgtgc gttccacttc ttcctcagac atgcccttgg ctgcctcgac ctgctcggta 2640 aaacgggccc cagcacgtgc tacgagattt cgattccacc gccgccttct atgaaaggtt 2700 gggcttcgga atcgttttcc gggacgccgg ctggatgatc ctccagcgcg gggatctcat 2760 gctggagttc ttcgcccacc ccaacttgtt tattgcagct tataatggtt acaaataaag 2820 caatagcatc acaaatttca caaataaagc atttttttca ctgcattcta gttgtggttt 2880 gtccaaactc atcaatgtat cttatcatac atggtcgacc tgcaggaacc tgcattaatg 2940 aatcggccaa cgcgcgggga gaggcggttt gcgtattggg cgctcttccg cttcctcgct 3000 cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc 3060 ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg 3120 ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg 3180 cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg 3240 actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac 3300 cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca 3360 tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt 3420 gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc 3480 caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag 3540 agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac 3600 tagaagaaca gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt 3660 tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa 3720 gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg 3780 gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa 3840 aaggatcttc acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat 3900 atatgagtaa acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc 3960 gatctgtcta tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga taactacgat 4020 acgggagggc ttaccatctg gccccagtgc tgcaatgata ccgcgagacc cacgctcacc 4080 ggctccagat ttatcagcaa taaaccagcc agccggaagg gccgagcgca gaagtggtcc 4140 tgcaacttta tccgcctcca tccagtctat taattgttgc cgggaagcta gagtaagtag 4200 ttcgccagtt aatagtttgc gcaacgttgt tgccattgct acaggcatcg tggtgtcacg 4260 ctcgtcgttt ggtatggctt cattcagctc cggttcccaa cgatcaaggc gagttacatg 4320 atcccccatg ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg ttgtcagaag 4380 taagttggcc gcagtgttat cactcatggt tatggcagca ctgcataatt ctcttactgt 4440 catgccatcc gtaagatgct tttctgtgac tggtgagtac tcaaccaagt cattctgaga 4500 atagtgtatg cggcgaccga gttgctcttg cccggcgtca atacgggata ataccgcgcc 4560 acatagcaga actttaaaag tgctcatcat tggaaaacgt tcttcggggc gaaaactctc 4620 aaggatctta ccgctgttga gatccagttc gatgtaaccc actcgtgcac ccaactgatc 4680 ttcagcatct tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc 4740 cgcaaaaaag ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca 4800 atattattga agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat 4860 ttagaaaaat aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt 4920 ctaagaaacc attattatca tgacattaac ctataaaaat aggcgtatca cgaggccctt 4980 tcgtctcgcg cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac 5040 ggtcacagct tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc 5100 gggtgttggc gggtgtcggg gctggcttaa ctatgcggca tcagagcaga ttgtactgag 5160 agtgcaccaa gcttccaatt ttaggccccc cactgaccga ggtctgtcga taatccactt 5220 ttccattgat tttccaggtt tcgttaactc atgccactga gcaaaacttc ggtctttcct 5280 aacaaaagct ctcctcacaa agcatggcgc ggcaacggac gtgtcctcat actccactgc 5340 cacacaaggt cgataaacta agctcctcac aaatagagga gaattccact gacaactgaa 5400 aacaatgtat gagagacgat caccactgga gcggcgcggc ggttgggcgc ggaggtcggc 5460 agcaaaaaca agcgactcgc cgagcaaacc cgaatcagcc ttcagacggt cgtgcctaac 5520 aacacgccgt tctaccccgc cttcttcgcg ccccttcgcg tccaagcatc cttcaagttt 5580 atctctctag ttcaacttca agaagaacaa caccaccaac accatgatgc ctttgtctca 5640 agaagaatcc accctcattg aaagagcaac ggctacaatc aacagcatcc ccatctctga 5700 agactacagc gtcgccagcg cagctctctc tagcgacggc cgcatcttca ctggtgtcaa 5760 tgtatatcat tttactgggg gaccttgtgc agaactcgtg gtgctgggca ctgctgctgc 5820 tgcggcagct ggcaacctga cttgtatcgt cgcgatcgga aatgagaaca ggggcatctt 5880 gagcccctgt ggacggtgcc gacaggtgct tctcgatctg catcctggga tcaaagccat 5940 agtgaaggac agtgatggac agccgacggc agttgggatt cgtgaattgc tgccctctgg 6000 ttatgtgtgg gagggctaac acgtgctccg tgctacgaga tttcgattcc accgccgcct 6060 tctatgaaag gttgggcttc ggaatcgttt tccgggacgc cggctggatg atcctccagc 6120 gcggggatct catgctggag ttcttcgccc accccaactt gtttattgca gcttataatg 6180 gttacaaata aagcaatagc atcacaaatt tcacaaataa agcatttttt tcactgcatt 6240 ctagttgtgg tttgtccaaa ctcatcaatg tatcttatca tgtctgaatt cccggggtac 6300 <210> 37 <211> 6611 <212> DNA <213> Artificial Sequence <220> <223> pCL122 vector (6611 bp) <220> <221> misc_feature <222> (1)..(1064) <223> EF_1alpha_promoter <220> <221> misc_feature <222> (1070)..(1162) <223> Sec1_secretion_signal <220> <221> misc_feature <222> (1163)..(1900) <223> eGFP_gene <220> <221> misc_feature <222> (1920)..(2559) <223> OrfC_terminator <220> <221> misc_feature <222> (5094)..(5542) <223> Alpha_tubulin_promoter <220> <221> misc_feature <222> (5543)..(6337) <223> NPT_gene <220> <221> misc_feature <222> (6338)..(6611) <223> SV40_terminator <400> 37 ctcttatctg cctcgcgccg ttgaccgccg cttgactctt ggcgcttgcc gctcgcatcc 60 tgcctcgctc gcgcaggcgg gcgggcgagt gggtgggtcc gcagccttcc gcgctcgccc 120 gctagctcgc tcgcgccgtg ctgcagccag cagggcagca ccgcacggca ggcaggtccc 180 ggcgcggatc gatcgatcca tcgatccatc gatccatcga tcgtgcggtc aaaaagaaag 240 gaagaagaaa ggaaaaagaa aggcgtgcgc acccgagtgc gcgctgagcg cccgctcgcg 300 gtcccgcgga gcctccgcgt tagtccccgc cccgcgccgc gcagtccccc gggaggcatc 360 gcgcacctct cgccgccccc tcgcgcctcg ccgattcccc gcctcccctt ttccgcttct 420 tcgccgcctc cgctcgcggc cgcgtcgccc gcgccccgct ccctatctgc tccccagggg 480 ggcactccgc accttttgcg cccgctgccg ccgccgcggc cgccccgccg ccctggtttc 540 ccccgcgagc gcggccgcgt cgccgcgcaa agactcgccg cgtgccgccc cgagcaacgg 600 gtggcggcgg cgcggcggcg ggcggggcgc ggcggcgcgt aggcggggct aggcgccggc 660 taggcgaaac gccgcccccg ggcgccgccg ccgcccgctc cagagcagtc gccgcgccag 720 accgccaacg cagagaccga gaccgaggta cgtcgcgccc gagcacgccg cgacgcgcgg 780 cagggacgag gagcacgacg ccgcgccgcg ccgcgcgggg ggggggaggg agaggcagga 840 cgcgggagcg agcgtgcatg tttccgcgcg agacgacgcc gcgcgcgctg gagaggagat 900 aaggcgcttg gatcgcgaga gggccagcca ggctggaggc gaaaatgggt ggagaggata 960 gtatcttgcg tgcttggacg aggagactga cgaggaggac ggatacgtcg atgatgatgt 1020 gcacagagaa gaagcagttc gaaagcgact actagcaagc aagggatcca tgaagttcgc 1080 gacctcggtc gcaattttgc ttgtggccaa catagccacc gccctcgcgc agagcgatgg 1140 ctgcaccccc accgaccaga cgatggtgag caagggcgag gagctgttca ccggggtggt 1200 gcccatcctg gtcgagctgg acggcgacgt aaacggccac aagttcagcg tgtccggcga 1260 gggcgagggc gatgccacct acggcaagct gaccctgaag ttcatctgca ccaccggcaa 1320 gctgcccgtg ccctggccca ccctcgtgac caccctgacc tacggcgtgc agtgcttcag 1380 ccgctacccc gaccacatga agcagcacga cttcttcaag tccgccatgc ccgaaggcta 1440 cgtccaggag cgcaccatct tcttcaagga cgacggcaac tacaagaccc gcgccgaggt 1500 gaagttcgag ggcgacaccc tggtgaaccg catcgagctg aagggcatcg acttcaagga 1560 ggacggcaac atcctgggac acaagctgga gtacaactac aacagccaca acgtctatat 1620 catggccgac aagcagaaga acggcatcaa ggtgaacttc aagatccgcc acaacatcga 1680 ggacggcagc gtgcagctcg ccgaccacta ccagcagaac acccccatcg gcgacggccc 1740 cgtgctgctg cccgacaacc actacctgag cacccagtcc gccctgagca aagaccccaa 1800 cgagaagcgc gatcacatgg tcctgctgga gttcgtgacc gccgccggga tcactctcgg 1860 catggacgag ctgtacaagc accaccatca ccaccactaa catatgagtt atgagatccg 1920 aaagtgaacc ttgtcctaac ccgacagcga atggcgggag ggggcgggct aaaagatcgt 1980 attacatagt atttttcccc tactctttgt gtttgtcttt tttttttttt tgaacgcatt 2040 caagccactt gtctgggttt acttgtttgt ttgcttgctt gcttgcttgc ttgcctgctt 2100 cttggtcaga cggcccaaaa aagggaaaaa attcattcat ggcacagata agaaaaagaa 2160 aaagtttgtc gaccaccgtc atcagaaagc aagagaagag aaacactcgc gctcacattc 2220 tcgctcgcgt aagaatctta gccacgcata cgaagtaatt tgtccatctg gcgaatcttt 2280 acatgagcgt tttcaagctg gagcgtgaga tcataccttt cttgatcgta atgttccaac 2340 cttgcatagg cctcgttgcg atccgctagc aatgcgtcgt actcccgttg caactgcgcc 2400 atcgcctcat tgtgacgtga gttcagattc ttctcgagac cttcgagcgc tgctaatttc 2460 gcctgacgct ccttcttttg tgcttccatg acacgccgct tcaccgtgcg ttccacttct 2520 tcctcagaca tgcccttggc tgcctcgacc tgctcggtaa aacgggcccc agcacgtgct 2580 acgagatttc gattccaccg ccgccttcta tgaaaggttg ggcttcggaa tcgttttccg 2640 ggacgccggc tggatgatcc tccagcgcgg ggatctcatg ctggagttct tcgcccaccc 2700 caacttgttt attgcagctt ataatggtta caaataaagc aatagcatca caaatttcac 2760 aaataaagca tttttttcac tgcattctag ttgtggtttg tccaaactca tcaatgtatc 2820 ttatcataca tggtcgacct gcaggaacct gcattaatga atcggccaac gcgcggggag 2880 aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 2940 cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 3000 atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 3060 taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 3120 aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 3180 tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 3240 gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 3300 cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 3360 cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 3420 atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 3480 tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat 3540 ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 3600 acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 3660 aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 3720 aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 3780 tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 3840 cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 3900 catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 3960 ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat 4020 aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat 4080 ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg 4140 caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 4200 attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 4260 agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 4320 actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 4380 ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 4440 ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt 4500 gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 4560 atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 4620 cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 4680 gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 4740 gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 4800 ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc taagaaacca ttattatcat 4860 gacattaacc tataaaaata ggcgtatcac gaggcccttt cgtctcgcgc gtttcggtga 4920 tgacggtgaa aacctctgac acatgcagct cccggagacg gtcacagctt gtctgtaagc 4980 ggatgccggg agcagacaag cccgtcaggg cgcgtcagcg ggtgttggcg ggtgtcgggg 5040 ctggcttaac tatgcggcat cagagcagat tgtactgaga gtgcaccaag cttccaattt 5100 taggcccccc actgaccgag gtctgtcgat aatccacttt tccattgatt ttccaggttt 5160 cgttaactca tgccactgag caaaacttcg gtctttccta acaaaagctc tcctcacaaa 5220 gcatggcgcg gcaacggacg tgtcctcata ctccactgcc acacaaggtc gataaactaa 5280 gctcctcaca aatagaggag aattccactg acaactgaaa acaatgtatg agagacgatc 5340 accactggag cggcgcggcg gttgggcgcg gaggtcggca gcaaaaacaa gcgactcgcc 5400 gagcaaaccc gaatcagcct tcagacggtc gtgcctaaca acacgccgtt ctaccccgcc 5460 ttcttcgcgc cccttcgcgt ccaagcatcc ttcaagttta tctctctagt tcaacttcaa 5520 gaagaacaac accaccaaca ccatgattga acaagatgga ttgcacgcag gttctccggc 5580 cgcttgggtg gagaggctat tcggctatga ctgggcacaa cagacaatcg gctgctctga 5640 tgccgccgtg ttccggctgt cagcgcaggg gcgcccggtt ctttttgtca agaccgacct 5700 gtccggtgcc ctgaatgaac tgcaggacga ggcagcgcgg ctatcgtggc tggccacgac 5760 gggcgttcct tgcgcagctg tgctcgacgt tgtcactgaa gcgggaaggg actggctgct 5820 attgggcgaa gtgccggggc aggatctcct gtcatctcac cttgctcctg ccgagaaagt 5880 atccatcatg gctgatgcaa tgcggcggct gcatacgctt gatccggcta cctgcccatt 5940 cgaccaccaa gcgaaacatc gcatcgagcg agcacgtact cggatggaag ccggtcttgt 6000 cgatcaggat gatctggacg aagagcatca ggggctcgcg ccagccgaac tgttcgccag 6060 gctcaaggcg cgcatgcccg acggcgatga tctcgtcgtg acccatggcg atgcctgctt 6120 gccgaatatc atggtggaaa atggccgctt ttctggattc atcgactgtg gccggctggg 6180 tgtggcggac cgctatcagg acatagcgtt ggctacccgt gatattgctg aagagcttgg 6240 cggcgaatgg gctgaccgct tcctcgtgct ttacggtatc gccgctcccg attcgcagcg 6300 catcgccttc tatcgccttc ttgacgagtt cttctgacac gtgctacgag atttcgattc 6360 caccgccgcc ttctatgaaa ggttgggctt cggaatcgtt ttccgggacg ccggctggat 6420 gatcctccag cgcggggatc tcatgctgga gttcttcgcc caccccaact tgtttattgc 6480 agcttataat ggttacaaat aaagcaatag catcacaaat ttcacaaata aagcattttt 6540 ttcactgcat tctagttgtg gtttgtccaa actcatcaat gtatcttatc atgtctgaat 6600 tcccggggta c 6611 <210> 38 <211> 32 <212> DNA <213> Artificial Sequence <220> <223> pYB66 Bambgl F primer <400> 38 caagggatcc agatcttccg cactgatgag tc 32 <210> 39 <211> 23 <212> DNA <213> Artificial Sequence <220> <223> pYB66 Nde R primer <400> 39 aactcatatg gtcccattcg cca 23 <210> 40 <211> 25 <212> DNA <213> Artificial Sequence <220> <223> pYB66 EF1seq F primer <400> 40 gagaggatag tatcttgcgt gcttg 25 <210> 41 <211> 25 <212> DNA <213> Artificial Sequence <220> <223> pCL122 OrfC R primer <400> 41 gcaaggttgg aacattacga tcaag 25 <210> 42 <211> 41 <212> DNA <213> Artificial Sequence <220> <223> pYB73 gRNA Pst Kpn IF F primer <400> 42 catacatggt cgacctgcag ggtacctctt atctgcctcg c 41 <210> 43 <211> 45 <212> DNA <213> Artificial Sequence <220> <223> pYB73 gRNA Xho Pst IF R primer <400> 43 attaatgcag gttcctgcag ctcgagaaga atctgaactc acgtc 45 <210> 44 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> pYB73 seq F primer <400> 44 caccccaact tgtttattgc ag 22 <210> 45 <211> 19 <212> DNA <213> Artificial Sequence <220> <223> pYB73 seq R primer <400> 45 gagcgaggaa gcggaagag 19 <210> 46 <211> 26 <212> DNA <213> Artificial Sequence <220> <223> pYB13 pYB1 seq F primer <400> 46 gagaggatag tatcttgcgt gcttgg 26 <210> 47 <211> 18 <212> DNA <213> Artificial Sequence <220> <223> TT pYB73 HDV R primer <400> 47 gaagcatgtt gcccagcc 18 SEQUENCE LISTING <110> DSM IP Assets B.V. <120> A CRISPR-CAS SYSTEM FOR AN ALGAL HOST CELL <130> 32176-WO-PCT <140> PCT/US2017/041949 <141> 2017-07-13 <150> US 62/361,741 <151> 2016-07-13 <160> 47 <170> PatentIn version 3.5 <210> 1 <211> 9920 <212> DNA <213> Artificial Sequence <220> <223> pCL122-Cas9 vector <220> <221> misc_feature <222> (1)..(1064) <223> EF-1 alpha promoter <220> <221> misc_feature <222> (1070)..(5209) <223> Cas9 <220> <221> misc_feature <222> (5229)..(5868) <223> OrfC_terminator <220> <221> misc_feature <222> (7153)..(8013) <223> AmpR_gene <220> <221> misc_feature <222> (8403)..(8851) <223> alpha_tubulin_promoter <220> <221> misc_feature <222> (8852)..(9646) <223> paromomycin_resistance_gene <220> <221> misc_feature <222> (9647)..(9920) <223> SV40_terminator <400> 1 ctcttatctg cctcgcgccg ttgaccgccg cttgactctt ggcgcttgcc gctcgcatcc 60 tgcctcgctc gcgcaggcgg gcgggcgagt gggtgggtcc gcagccttcc gcgctcgccc 120 gctagctcgc tcgcgccgtg ctgcagccag cagggcagca ccgcacggca ggcaggtccc 180 ggcgcggatc gatcgatcca tcgatccatc gatccatcga tcgtgcggtc aaaaagaaag 240 gaagaagaaa ggaaaaagaa aggcgtgcgc acccgagtgc gcgctgagcg cccgctcgcg 300 gtcccgcgga gcctccgcgt tagtccccgc cccgcgccgc gcagtccccc gggaggcatc 360 gcgcacctct cgccgccccc tcgcgcctcg ccgattcccc gcctcccctt ttccgcttct 420 tcgccgcctc cgctcgcggc cgcgtcgccc gcgccccgct ccctatctgc tccccagggg 480 ggcactccgc accttttgcg cccgctgccg ccgccgcggc cgccccgccg ccctggtttc 540 ccccgcgagc gcggccgcgt cgccgcgcaa agactcgccg cgtgccgccc cgagcaacgg 600 gtggcggcgg cgcggcggcg ggcggggcgc ggcggcgcgt aggcggggct aggcgccggc 660 taggcgaaac gccgcccccg ggcgccgccg ccgcccgctc cagagcagtc gccgcgccag 720 accgccaacg cagagaccga gaccgaggta cgtcgcgccc gagcacgccg cgacgcgcgg 780 cagggacgag gagcacgacg ccgcgccgcg ccgcgcgggg ggggggaggg agaggcagga 840 cgcgggagcg agcgtgcatg tttccgcgcg agacgacgcc gcgcgcgctg gagaggagat 900 aaggcgcttg gatcgcgaga gggccagcca ggctggaggc gaaaatgggt ggagaggata 960 gtatcttgcg tgcttggacg aggagactga cgaggaggac ggatacgtcg atgatgatgt 1020 gcacagagaa gaagcagttc gaaagcgact actagcaagc aagggatcca tggataagaa 1080 gtactcgatc ggcctcgaca ttggcaccaa cagcgtcggc tgggccgtca ttactgatga 1140 gtacaaggtc ccgtcgaaga agtttaaggt cctcggcaac actgaccgcc actccatcaa 1200 gaagaacctc atcggtgccc tcctttttga ctccggcgag accgctgagg ccactcgcct 1260 caagcgcact gcccgccgcc gttacacccg ccgcaagaac cgcatctgct acctccagga 1320 gattttctcg aacgaaatgg ccaaggtcga tgactccttt ttccaccgtc tcgaagaatc 1380 gttcctcgtc gaggaggaca agaagcacga gcgccacccc atcttcggta acatgtcga 1440 tgaggttgcc taccacgaga agtacccgac catctaccac ctccgcaaga agctcgtcga 1500 ctccaccgac aaggccgatc tccgccttat ctacctcgcc ctcgcccaca tgatcaagtt 1560 ccgcggccac tttcttatcg agggtgatct caaccctgat aactctgacg tcgacaagct 1620 tttcatccag ctcgtccaga cttacaacca gctcttcgag gagaacccca tcaacgcttc 1680 cggcgtcgac gcgaaggcca ttctcagcgc ccgcctcagc aagtcccgcc gcctcgaaaa 1740 cctcattgcc cagcttcccg gcgagaagaa gaacggcctc ttcggcaacc tcattgccct 1800 cagccttggc ctcaccccta acttcaagtc gaactttgac ctcgccgagg acgccaagct 1860 ccagctttcc aaggacactt acgacgacga tctcgacaac ctcctcgctc agattggcga 1920 ccagtacgct gacctcttcc tcgccgccaa gaaccttagc gatgccatcc tcctctccga 1980 catccttcgt gttaacacgg aaatcacgaa ggctccgctc tccgcctcca tgatcaagcg 2040 ttacgacgag caccatcagg acctcaccct cctcaaggcc ctcgtccgcc agcagctccc 2100 cgagaagtac aaggagatct tcttcgacca gagcaagaac ggctacgccg gctacattga 2160 cggcggcgcg tcgcaggagg agttttacaa gtttatcaag cccattcttg agaagatgga 2220 cggcaccgag gagctcctcg tcaagctcaa ccgtgaggac cttctccgca agcagcgcac 2280 gttcgacaac ggctctattc cccatcagat ccacctcggt gagcttcacg cgattcttcg 2340 ccgccaggaa gacttttacc cgttcctcaa ggacaaccgc gagaagattg agaagatcct 2400 cacctttcgc attccctact acgtcggccc cctcgcccgc ggcaactcgc gctttgcttg 2460 gatgacccgc aagtccgagg agaccatcac cccgtggaac ttcgaagagg tcgtcgacaa 2520 gggcgcctcc gcgcagtctt tcatcgagcg catgactaac tttgacaaga acctcccgaa 2580 cgagaaggtc ctccccaagc acagcctcct ttacgaatac tttacggtgt acaacgagct 2640 cacgaaggtc aagtacgtca ctgagggcat gcgcaagccg gcgttccttt cgggcgagca 2700 gaagaaggct atcgtcgacc tccttttcaa gaccaaccgc aaggttaccg tcaagcagct 2760 caaggaggac tacttcaaga agatcgagtg ctttgactcg gtcgagattt cgggcgtgga 2820 ggaccgtttc aacgcctccc tcggcactta ccacgacctt ctcaagatca tcaaggacaa 2880 ggactttctc gacaacgagg agaacgagga cattctcgag gacatcgtcc tcacgctcac 2940 cctctttgag gaccgtgaga tgatcgagga gcgcctcaag acctacgccc atctctttga 3000 cgacaaggtc atgaagcagc tcaagcgccg ccgctacacc ggctggggcc gcctttcccg 3060 caagctcatc aacggcatcc gcgacaagca gtctggcaag accatccttg actttcttaa 3120 gtctgatggt ttcgccaacc gcaacttcat gcagctcatc cacgacgaca gcctcacttt 3180 caaggaggac attcagaagg cccaggtctc cggccagggt gactctctcc acgaacacat 3240 cgccaacctt gctggcagcc cggctatta gaagggcatc ctccagaccg tcaaggtcgt 3300 cgacgagctc gtcaaggtta tgggccgcca caagcccgag aacatcgtca ttgagatggc 3360 tcgcgaaaac cagaccaccc agaagggtca gaagaactcc cgcgagcgca tgaagcgtat 3420 cgaggagggc atcaaggagc tcggcagcca gatcctcaag gagcacccgg tcgagaacac 3480 ccagctccag aacgaaaagc tctacctcta ctacctccag aacggccgtg acatgtacgt 3540 tgaccaggag ctcgacatta accgcctctc cgattacgac gtcgaccata ttgtccccca 3600 gagctttctc aaggacgaca gcatcgacaa caaggtcctc acccgctcgg acaagaaccg 3660 cggcaagtcc gacaacgtcc cttccgagga ggtcgtgaag aagatgaaga actactggcg 3720 ccagcttctc aacgctaagc ttattactca gcgcaagttc gataacctca ccaaggccga 3780 acgcggcggc ctctccgagc tcgacaaggc cggttttatc aagcgccagc tcgttgagac 3840 tcgccagatc accaagcacg tggcgcagat cctcgactcg cgcatgaaca cgaagtacga 3900 cgagaacgac aagctcatcc gcgaggtcaa ggtcatcacc cttaagtcga agctcgtgtc 3960 cgactttcgc aaggacttcc agttctacaa ggtccgtgaa attaacaact accaccacgc 4020 tcacgacgct tacctcaacg cggtcgtggg taccgcgctc atcaagaagt acccgaagct 4080 cgagtcggag tttgtctacg gcgactacaa ggtctacgac gtgcgcaaga tgatcgccaa 4140 gtccgagcag gagatcggca aggccacggc caagtacttt ttctactcca acattatgaa 4200 cttctttaag actgagatca cccttgccaa cggcgagatc cgcaagcgcc cccttatcga 4260 gaccaacggc gagaccggcg aaattgtgtg ggataagggt cgcgactttg ccaccgtccg 4320 caaggtcctc agcatgcccc aggtcaacat tgttaagaag accgaggtcc agacgggcgg 4380 ctttagcaag gagtctatcc tccccaagcg taacagcgac aagctcatcg cccgcaagaa 4440 ggactgggac cctaagaagt acggcggctt cgattcgcct acggtcgcct acagcgtcct 4500 cgtcgtcgcc aaggtcgaga agggcaagtc caagaagctc aagtccgtca aggagctcct 4560 cggcatcacg atcatggagc gctccagctt tgagaagaac cccattgact tcctcgaggc 4620 taagggttac aaggaggtca agaaggacct tatcatcaag ctccccaagt actccctctt 4680 tgagctcgaa aacggccgca agcgtatgct cgctagcgct ggcgaactcc agaagggcaa 4740 cgagctcgcc ctccccagca agtacgtcaa ctttctctac ctcgcctccc actacgagaa 4800 gctcaagggt agcccggagg ataacgagca gaagcagctt tttgtggagc agcacaagca 4860 ctaccttgac gagatcattg aacagatctc cgagttctcc aagcgtgtta ttcttgctga 4920 cgccaacctc gataaggtgc tctccgcgta caacaagcac cgcgacaagc ctatccgcga 4980 gcaggccgag aacatcatcc acctctttac cctcaccaac ctcggcgccc cggccgcctt 5040 taagtacttt gatacgacta tcgaccgcaa gcgctacact tcgactaagg aggtcctcga 5100 cgctaccctc attcaccagt ccattaccgg cctctacgag acccgcattg acctttcgca 5160 gctcggtggc gactcgcgtg cggaccctaa gaagaagcgc aaggtctaac atatgagtta 5220 tgagatccga aagtgaacct tgtcctaacc cgacagcgaa tggcgggagg gggcgggcta 5280 aaagatcgta ttacatagta tttttcccct actctttgtg tttgtctttt tttttttttt 5340 gaacgcattc aagccacttg tctgggttta cttgtttgtt tgcttgcttg cttgcttgct 5400 tgcctgcttc ttggtcagac ggcccaaaaa agggaaaaaa ttcattcatg gcacagataa 5460 gaaaaagaaa aagtttgtcg accaccgtca tcagaaagca agagaagaga aacactcgcg 5520 ctcacattct cgctcgcgta agaatcttag ccacgcatac gaagtaattt gtccatctgg 5580 cgaatcttta catgagcgtt ttcaagctgg agcgtgagat catacctttc ttgatcgtaa 5640 tgttccaacc ttgcataggc ctcgttgcga tccgctagca atgcgtcgta ctcccgttgc 5700 aactgcgcca tcgcctcatt gtgacgtgag ttcagattct tctcgagacc ttcgagcgct 5760 gctaatttcg cctgacgctc cttcttttgt gcttccatga cacgccgctt caccgtgcgt 5820 tccacttctt cctcagacat gcccttggct gcctcgacct gctcggtaaa acgggcccca 5880 gcacgtgcta cgagatttcg attccaccgc cgccttctat gaaaggttgg gcttcggaat 5940 cgttttccgg gacgccggct ggatgatcct ccagcgcggg gatctcatgc tggagttctt 6000 cgcccacccc aacttgttta ttgcagctta taatggttac aaataaagca atagcatcac 6060 aaatttcaca aataaagcat ttttttcact gcattctagt tgtggtttgt ccaaactcat 6120 caatgtatct tatcatacat ggtcgacctg caggaacctg cattaatgaa tcggccaacg 6180 cgcggggaga ggcggtttgc gtattgggcg ctcttccgct tcctcgctca ctgactcgct 6240 gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt 6300 atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc 6360 caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga 6420 gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata 6480 ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac 6540 cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg 6600 taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc 6660 cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag 6720 acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt 6780 aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta gaagaacagt 6840 atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg 6900 atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac 6960 gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca 7020 gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac 7080 ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac 7140 ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt 7200 tcgttcatcc atagttgcct gactccccgt cgtgtagata actacgatac gggagggctt 7260 accatctggc cccagtgctg caatgatacc gcgagaccca cgctcaccgg ctccagattt 7320 atcagcaata aaccagccag ccggaagggc cgagcgcaga agtggtcctg caactttatc 7380 cgcctccatc cagtctatta attgttgccg ggaagctaga gtaagtagtt cgccagttaa 7440 tagtttgcgc aacgttgttg ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg 7500 tatggcttca ttcagctccg gttcccaacg atcaaggcga gttacatgat cccccatgtt 7560 gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt gtcagaagta agttggccgc 7620 agtgttatca ctcatggtta tggcagcact gcataattct cttactgtca tgccatccgt 7680 aagatgcttt tctgtgactg gtgagtactc aaccaagtca ttctgagaat agtgtatgcg 7740 gcgaccgagt tgctcttgcc cggcgtcaat acgggataat accgcgccac atagcagaac 7800 tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga aaactctcaa ggatcttacc 7860 gctgttgaga tccagttcga tgtaacccac tcgtgcaccc aactgatctt cagcatcttt 7920 tactttcacc agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg 7980 aataagggcg acacggaaat gttgaatact catactcttc ctttttcaat attattgaag 8040 catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa 8100 acaaataggg gttccgcgca catttccccg aaaagtgcca cctgacgtct aagaaaccat 8160 tattatcatg acattaacct ataaaaatag gcgtatcacg aggccctttc gtctcgcgcg 8220 tttcggtgat gacggtgaaa acctctgaca catgcagctc ccggagacgg tcacagcttg 8280 tctgtaagcg gatgccggga gcagacaagc ccgtcagggc gcgtcagcgg gtgttggcgg 8340 gtgtcggggc tggcttaact atgcggcatc agagcagatt gtactgagag tgcaccaagc 8400 ttccaatttt aggcccccca ctgaccgagg tctgtcgata atccactttt ccattgattt 8460 tccaggtttc gttaactcat gccactgagc aaaacttcgg tctttcctaa caaaagctct 8520 cctcacaaag catggcgcgg caacggacgt gtcctcatac tccactgcca cacaaggtcg 8580 ataaactaag ctcctcacaa atagaggaga attccactga caactgaaaa caatgtatga 8640 gagacgatca ccactggagc ggcgcggcgg ttgggcgcgg aggtcggcag caaaaacaag 8700 cgactcgccg agcaaacccg aatcagcctt cagacggtcg tgcctaacaa cacgccgttc 8760 taccccgcct tcttcgcgcc ccttcgcgtc caagcatcct tcaagtttat ctctctagtt 8820 caacttcaag aagaacaaca ccaccaacac catgattgaa caagatggat tgcacgcagg 8880 ttctccggcc gcttgggtgg agaggctatt cggctatgac tgggcacaac agacaatcgg 8940 ctgctctgat gccgccgtgt tccggctgtc agcgcagggg cgcccggttc tttttgtcaa 9000 gaccgacctg tccggtgccc tgaatgaact gcaggacgag gcagcgcggc tatcgtggct 9060 ggccacgacg ggcgttcctt gcgcagctgt gctcgacgtt gtcactgaag cgggaaggga 9120 ctggctgcta ttgggcgaag tgccggggca ggatctcctg tcatctcacc ttgctcctgc 9180 cgagaaagta tccatcatgg ctgatgcaat gcggcggctg catacgcttg atccggctac 9240 ctgcccattc gaccaccaag cgaaacatcg catcgagcga gcacgtactc ggatggaagc 9300 cggtcttgtc gatcaggatg atctggacga agagcatcag gggctcgcgc cagccgaact 9360 gttcgccagg ctcaaggcgc gcatgcccga cggcgatgat ctcgtcgtga cccatggcga 9420 tgcctgcttg ccgaatatca tggtggaaaa tggccgcttt tctggattca tcgactgtgg 9480 ccggctgggt gtggcggacc gctatcagga catagcgttg gctacccgtg atattgctga 9540 agagcttggc ggcgaatggg ctgaccgctt cctcgtgctt tacggtatcg ccgctcccga 9600 ttcgcagcgc atcgccttct atcgccttct tgacgagttc ttctgacacg tgctacgaga 9660 tttcgattcc accgccgcct tctatgaaag gttgggcttc ggaatcgttt tccgggacgc 9720 cggctggatg atcctccagc gcggggatct catgctggag ttcttcgccc accccaactt 9780 gtttattgca gcttataatg gttacaaata aagcaatagc atcacaaatt tcacaaataa 9840 agcatttttt tcactgcatt ctagttgtgg tttgtccaaa ctcatcaatg tatcttatca 9900 tgtctgaatt cccggggtac 9920 <210> 2 <211> 7393 <212> DNA <213> Artificial Sequence <220> <223> pYB31 vector <220> <221> misc_feature <222> (2)..(450) <223> alpha_tubulin_promoter <220> <221> misc_feature <222> (457)..(549) <223> Sec1_signal_for_eGFP <220> <221> misc_feature <222> (550)..(1266) <223> eGFP <220> <221> misc_feature <222> (1294)..(2293) <223> OrfA_terminator <220> <221> misc_feature <222> (4971)..(5970) <223> OrfA_UP_homology <220> <221> misc_feature <222> (5974)..(6289) <223> OrfC_terminator <220> <221> misc_feature <222> (6296)..(6744) <223> alpha_tubulin_promoter <220> <221> misc_feature <222> (6745)..(7119) <223> Sh_ble_gene <220> <221> misc_feature <222> (7120)..(7393) <223> SV40_terminator <400> 2 cccaatttta ggccccccac tgaccgaggt ctgtcgataa tccacttttc cattgatttt 60 ccaggtttcg ttaactcatg ccactgagca aaacttcggt ctttcctaac aaaagctctc 120 ctcacaaagc atggcgcggc aacggacgtg tcctcatact ccactgccac acaaggtcga 180 taaactaagc tcctcacaaa tagaggagaa ttccactgac aactgaaaac aatgtatgag 240 agacgatcac cactggagcg gcgcggcggt tgggcgcgga ggtcggcagc aaaaacaagc 300 gactcgccga gcaaacccga atcagccttc agacggtcgt gcctaacaac acgccgttct 360 accccgcctt cttcgcgccc cttcgcgtcc aagcatcctt caagtttatc tctctagttc 420 aacttcaaga agaacaacac caccaacacc ggatccatga agttcgcgac ctcggtcgca 480 attttgcttg tggccaacat agccaccgcc ctcgcgcaga gcgatggctg cacccccacc 540 gaccagacga tggtgagcaa gggcgaggag ctgttcaccg gggtggtgcc catcctggtc 600 gagctggacg gcgacgtaaa cggccacaag ttcagcgtgt ccggcgaggg cgagggcgat 660 gccacctacg gcaagctgac cctgaagttc atctgcacca ccggcaagct gcccgtgccc 720 tggcccaccc tcgtgaccac cctgacctac ggcgtgcagt gcttcagccg ctaccccgac 780 cacatgaagc agcacgactt cttcaagtcc gccatgcccg aaggctacgt ccaggagcgc 840 accatcttct tcaaggacga cggcaactac aagacccgcg ccgaggtgaa gttcgagggc 900 gacaccctgg tgaaccgcat cgagctgaag ggcatcgact tcaaggagga cggcaacatc 960 ctgggacaca agctggagta caactacaac agccacaacg tctatatcat ggccgacaag 1020 cagaagaacg gcatcaaggt gaacttcaag atccgccaca acatcgagga cggcagcgtg 1080 cagctcgccg accactacca gcagaacacc cccatcggcg acggccccgt gctgctgccc 1140 gacaaccact acctgagcac ccagtccgcc ctgagcaaag accccaacga gaagcgcgat 1200 cacatggtcc tgctggagtt cgtgaccgcc gccgggatca ctctcggcat ggacgagctg 1260 tacaagcacc accatcacca ccactaacat atgatctaaa ctagtgaatc gcgacaagtt 1320 gtctttttgt tactctacgt actgctcttt ctaattttac gacgtatgct tctgctcttg 1380 acgacgacaa ccatggcaat aaaagtaagg caagaagtga gtgcgtgctc gctcacaagg 1440 tctaggccga aacgaggcgc cttaggattg gttgtctccg tcgtaagtca tgacggagcg 1500 taggacaccg acggcctgcc gcgcaaatat acgaaccgct gcacaattct tttcgttgag 1560 aacaacctcg aacggcctgc cttgctgagc tgccgacacg gtggaaggaa gcatagcggc 1620 caatcgaggg gatgctatta attaggcttg gcgctgctgc cgccgtgcct gaagatgtgc 1680 gcaagcgtgg caacaccgtc caccttgcca ttaaagtcat gcactccaaa tacctcgaac 1740 agagcctctc gcatgtttgg ccaagcacct tcgaagagagt ctggactctc actccttcct 1800 caccgttgtc gccattctca gcgagctcat gcaatcacca ggacggagag acggccagct 1860 gcttttctgc tttccattgt tattagagaa acgcttctcg ctctcatcgt cttagtagac 1920 attccgatgg cttcgttcgc caatttgtca cctaagtaag ctagagtgtt aagtctaaat 1980 gcctttgacc cgcgtacggc gtcacgtaga tgcctgtcct tgccagcaaa cgctagttcg 2040 cggtgtgcgt aatttggccc gcattatgct ggctctcaaa atcaaccgcc accactcgcg 2100 gctgcacgat gattttcgtg cactcatgac atgagaaatg tgatactcaa actagtatag 2160 acctcctact cctactgctg cttttctcgt cagagctgtc tccggaaaag ttgacaagtt 2220 gttggccttc ttcttctctg ctagtaagta gatcatcatg gatgagacga tgatgataat 2280 gatgatgatg ataatttaaa tctcgagacc ttcgagcgct gctaatttcg cctgacgctc 2340 cttcttttgt gcttccatga cacgccgctt caccgtgcgt tccacttctt cctcagacat 2400 gcccttggct gcctcgacct gctcggtaaa acgggcccca gcacgtgcta cgagatttcg 2460 attccaccgc cgccttctat gaaaggttgg gcttcggaat cgttttccgg gacgccggct 2520 ggatgatcct ccagcgcggg gatctcatgc tggagttctt cgcccacccc aacttgttta 2580 ttgcagctta taatggttac aaataaagca atagcatcac aaatttcaca aataaagcat 2640 ttttttcact gcattctagt tgtggtttgt ccaaactcat caatgtatct tatcatacat 2700 ggtcgacctg caggaacctg cattaatgaa tcggccaacg cgcggggaga ggcggtttgc 2760 gtattgggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc 2820 ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata 2880 acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 2940 cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct 3000 caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa 3060 gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc 3120 tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt 3180 aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 3240 ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg 3300 cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct 3360 tgaagtggtg gcctaactac ggctacacta gaagaacagt atttggtatc tgcgctctgc 3420 tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg 3480 ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc 3540 aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt 3600 aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa 3660 aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat 3720 gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct 3780 gactccccgt cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg 3840 caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag 3900 ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc cagtctatta 3960 attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg 4020 ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg 4080 gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggttagct 4140 ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca ctcatggtta 4200 tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt tctgtgactg 4260 gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc 4320 cggcgtcaat acgggataat accgcgccac atagcagaac tttaaaagtg ctcatcattg 4380 gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga tccagttcga 4440 tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg 4500 ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat 4560 gttgaatact catactcttc ctttttcaat attattgaag catttatcag ggttattgtc 4620 tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca 4680 catttccccg aaaagtgcca cctgacgtct aagaaaccat tattatcatg acattaacct 4740 ataaaaatag gcgtatcacg aggccctttc gtctcgcgcg tttcggtgat gacggtgaaa 4800 acctctgaca catgcagctc ccggagacgg tcacagcttg tctgtaagcg gatgccggga 4860 gcagacaagc ccgtcagggc gcgtcagcgg gtgttggcgg gtgtcggggc tggcttaact 4920 atgcggcatc agagcagatt gtactgagag tgcaccaagc ttatttaaat ctttatcggt 4980 gtggcgcgcc ttgacggccg cctcggattc acttcgcagg gcacttctga cgcgctcaag 5040 cgtgcccagc gtggtgccat ctttggcctc tgcaagacca tcggcctcga gtggtccgag 5100 tctgacgtct tttcccgcgg cgtggacatt gctcagggca tgcaccccga ggatgccgcc 5160 gtggcgattg tgcgcgagat ggcgtgcgct gacattcgca ttcgcgaggt cggcattggc 5220 gcaaaccagc agcgctgcac gatccgtgcc gccaagctcg agaccggcaa cccgcagcgc 5280 cagatcgcca aggacgacgt gctgctcgtt tctggcggcg ctcgcggcat cacgcctctt 5340 tgcatccggg agatcacgcg ccagatcgcg ggcggcaagt acattctgct tggccgcagc 5400 aaggtctctg cgagcgaacc ggcatggtgc gctggcatca ctgacgagaa ggctgtgcaa 5460 aaggctgcta cccaggagct caagcgcgcc tttagcgctg gcgagggccc caagcccacg 5520 ccccgcgctg tcactaagct tgtgggctct gttcttggcg ctcgcgaggt gcgcagctct 5580 attgctgcga ttgaagcgct cggcggcaag gccatctact cgtcgtgcga cgtgaactct 5640 gccgccgacg tggccaaggc cgtgcgcgat gccgagtccc agctcggtgc ccgcgtctcg 5700 ggcatcgttc atgcctcggg cgtgctccgc gaccgtctca tcgagaagaa gctccccgac 5760 gagttcgacg ccgtctttgg caccaaggtc accggtctcg agaacctcct cgccgccgtc 5820 gaccgcgcca acctcaagca catggtcctc ttcagctcgc tcgccggctt ccacggcaac 5880 gtcggccagt ctgactacgc catggccaac gaggccctta acaagatggg cctcgagctc 5940 gccaaggacg tctcggtcaa gtcgatctgc taagaaagtg aaccttgtcc taacccgaca 6000 gcgaatggcg ggagggggcg ggctaaaaga tcgtattaca tagtattttc ccctactctt 6060 tgtgtttgtc tttttttttt ttgaacgcat tcaagccact tgtcttggtt tacttgtttg 6120 tttgcttgct tgcttgcttg cttgcctgct tcttggtcag acggacccaa aaaagggaaa 6180 aaattcattc atggcacaga taagaaaaag aaaaagtttg tcgaccaccg tcatcagaaa 6240 gcaagagaag agaaacactc gcgctcacat tctcgctcgc gtaagaatca agcttccaat 6300 tttaggcccc ccactgaccg aggtctgtcg ataatccact tttccattga ttttccaggt 6360 ttcgttaact catgccactg agcaaaactt cggtctttcc taacaaaagc tctcctcaca 6420 aagcatggcg cggcaacgga cgtgtcctca tactccactg ccacacaagg tcgataaact 6480 aagctcctca caaatagagg agaattccac tgacaactga aaacaatgta tgagagacga 6540 tcaccactgg agcggcgcgg cggttgggcg cggaggtcgg cagcaaaaac aagcgactcg 6600 ccgagcaaac ccgaatcagc cttcagacgg tcgtgcctaa caacacgccg ttctaccccg 6660 ccttcttcgc gccccttcgc gtccaagcat ccttcaagtt tatctctcta gttcaacttc 6720 aagaagaaca acaccaccaa caccatggcc aagttgacca gtgccgttcc ggtgctcacc 6780 gcgcgcgacg tcgccggagc ggtcgagttc tggaccgacc ggctcgggtt ctcccgggac 6840 ttcgtggagg acgacttcgc cggtgtggtc cgggacgacg tgaccctgtt catcagcgcg 6900 gtccaggacc aggtggtgcc ggacaacacc ctggcctggg tgtgggtgcg cggcctggac 6960 gagctgtacg ccgagtggtc ggaggtcgtg tccacgaact tccgggacgc ctccgggccg 7020 gccatgaccg agatcggcga gcagccgtgg gggcgggagt tcgccctgcg cgacccggcc 7080 ggcaactgcg tgcacttcgt ggccgaggag caggactgac acgtgctacg agatttcgat 7140 tccaccgccg ccttctatga aaggttgggc ttcggaatcg ttttccggga cgccggctgg 7200 atgatcctcc agcgcgggga tctcatgctg gagttcttcg cccaccccaa cttgtttatt 7260 gcagcttata atggttacaa ataaagcaat agcatcacaa atttcacaaa taaagcattt 7320 ttttcactgc attctagttg tggtttgtcc aaactcatca atgtatctta tcatgtctga 7380 attcccgggg tac 7393 <210> 3 <211> 10702 <212> DNA <213> Artificial Sequence <220> <223> pYB32 vector <220> <221> misc_feature <222> (2)..(450) <223> alpha tubulin promoter <220> <221> misc_feature <222> (457)..(4596) <223> Cas9 <220> <221> misc_feature <222> (4603)..(5602) <223> OrfA terminator <220> <221> misc_feature <222> (8280)..(9279) <223> OrfA UP homology <220> <221> misc_feature <222> (9283)..(9598) <223> OrfC terminator <220> <221> misc_feature <222> (9605)..(10053) <223> alpha tubulin promoter <220> <221> misc_feature <222> (10054)..(10428) <223> Sh ble gene <220> <221> misc_feature <222> (10429)..(10702) <223> SV40_terminator <400> 3 cccaatttta ggccccccac tgaccgaggt ctgtcgataa tccacttttc cattgatttt 60 ccaggtttcg ttaactcatg ccactgagca aaacttcggt ctttcctaac aaaagctctc 120 ctcacaaagc atggcgcggc aacggacgtg tcctcatact ccactgccac acaaggtcga 180 taaactaagc tcctcacaaa tagaggagaa ttccactgac aactgaaaac aatgtatgag 240 agacgatcac cactggagcg gcgcggcggt tgggcgcgga ggtcggcagc aaaaacaagc 300 gactcgccga gcaaacccga atcagccttc agacggtcgt gcctaacaac acgccgttct 360 accccgcctt cttcgcgccc cttcgcgtcc aagcatcctt caagtttatc tctctagttc 420 aacttcaaga agaacaacac caccaacacc ggatccatgg ataagaagta ctcgatcggc 480 ctcgacattg gcaccaacag cgtcggctgg gccgtcatta ctgatgagta caaggtcccg 540 tcgaagaagt ttaaggtcct cggcaacact gaccgccact ccatcaagaa gaacctcatc 600 ggtgccctcc tttttgactc cggcgagacc gctgaggcca ctcgcctcaa gcgcactgcc 660 cgccgccgtt acacccgccg caagaaccgc atctgctacc tccaggagat tttctcgaac 720 gaaatggcca aggtcgatga ctcctttttc caccgtctcg aagaatcgtt cctcgtcgag 780 gaggacaaga agcacgagcg ccaccccatc ttcggtaaca ttgtcgatga ggttgcctac 840 cacgagaagt acccgaccat ctaccacctc cgcaagaagc tcgtcgactc caccgacaag 900 gccgatctcc gccttatcta cctcgccctc gcccacatga tcaagttccg cggccacttt 960 cttatcgagg gtgatctcaa ccctgataac tctgacgtcg acaagctttt catccagctc 1020 gtccagactt acaaccagct cttcgaggag aaccccatca acgcttccgg cgtcgacgcg 1080 aaggccattc tcagcgcccg cctcagcaag tcccgccgcc tcgaaaacct cattgcccag 1140 cttcccggcg agaagaagaa cggcctcttc ggcaacctca ttgccctcag ccttggcctc 1200 acccctaact tcaagtcgaa ctttgacctc gccgaggacg ccaagctcca gctttccaag 1260 gacacttacg acgacgatct cgacaacctc ctcgctcaga ttggcgacca gtacgctgac 1320 ctcttcctcg ccgccaagaa ccttagcgat gccatcctcc tctccgacat ccttcgtgtt 1380 aacacggaaa tcacgaaggc tccgctctcc gcctccatga tcaagcgtta cgacgagcac 1440 catcaggacc tcaccctcct caaggccctc gtccgccagc agctccccga gaagtacaag 1500 gagatcttct tcgaccagag caagaacggc tacgccggct acatgacgg cggcgcgtcg 1560 caggaggagt tttacaagtt tatcaagccc attcttgaga agatggacgg caccgaggag 1620 ctcctcgtca agctcaaccg tgaggacctt ctccgcaagc agcgcacgtt cgacaacggc 1680 tctattcccc atcagatcca cctcggtgag cttcacgcga ttcttcgccg ccaggaagac 1740 ttttacccgt tcctcaagga caaccgcgag aagattgaga agatcctcac ctttcgcatt 1800 ccctactacg tcggccccct cgcccgcggc aactcgcgct ttgcttggat gacccgcaag 1860 tccgaggaga ccatcacccc gtggaacttc gaagaggtcg tcgacaaggg cgcctccgcg 1920 cagtctttca tcgagcgcat gactaacttt gacaagaacc tcccgaacga gaaggtcctc 1980 cccaagcaca gcctccttta cgaatacttt acggtgtaca acgagctcac gaaggtcaag 2040 tacgtcactg agggcatgcg caagccggcg ttcctttcgg gcgagcagaa gaaggctatc 2100 gtcgacctcc ttttcaagac caaccgcaag gttaccgtca agcagctcaa ggaggactac 2160 ttcaagaaga tcgagtgctt tgactcggtc gagatttcgg gcgtggagga ccgtttcaac 2220 gcctccctcg gcacttacca cgaccttctc aagatcatca aggacaagga ctttctcgac 2280 aacgaggaga acgaggacat tctcgaggac atcgtcctca cgctcaccct ctttgaggac 2340 cgtgagatga tcgaggagcg cctcaagacc tacgcccatc tctttgacga caaggtcatg 2400 aagcagctca agcgccgccg ctacaccggc tggggccgcc tttcccgcaa gctcatcaac 2460 ggcatccgcg acaagcagtc tggcaagacc atccttgact ttcttaagtc tgatggtttc 2520 gccaaccgca acttcatgca gctcatccac gaggacagcc tcactttcaa ggaggacatt 2580 cagaaggccc aggtctccgg ccagggtgac tctctccacg aacacatcgc caaccttgct 2640 ggcagcccgg ctattaagaa gggcatcctc cagaccgtca aggtcgtcga cgagctcgtc 2700 aaggttatgg gccgccacaa gcccgagaac atcgtcattg agatggctcg cgaaaaccag 2760 accacccaga agggtcagaa gaactcccgc gagcgcatga agcgtatcga ggagggcatc 2820 aaggagctcg gcagccagat cctcaaggag cacccggtcg agaacaccca gctccagaac 2880 gaaaagctct acctctacta cctccagaac ggccgtgaca tgtacgttga ccaggagctc 2940 gacattaacc gcctctccga ttacgacgtc gaccatattg tcccccagag ctttctcaag 3000 gacgacagca tcgacaacaa ggtcctcacc cgctcggaca agaaccgcgg caagtccgac 3060 aacgtccctt ccgaggaggt cgtgaagaag atgaagaact actggcgcca gcttctcaac 3120 gctaagctta ttactcagcg caagttcgat aacctcacca aggccgaacg cggcggcctc 3180 tccgagctcg acaaggccgg ttttatcaag cgccagctcg ttgagactcg ccagatcacc 3240 aagcacgtgg cgcagatcct cgactcgcgc atgaacacga agtacgacga gaacgacaag 3300 ctcatccgcg aggtcaaggt catcaccctt aagtcgaagc tcgtgtccga ctttcgcaag 3360 gacttccagt tctacaaggt ccgtgaaatt aacaactacc accacgctca cgacgcttac 3420 ctcaacgcgg tcgtgggtac cgcgctcatc aagaagtacc cgaagctcga gtcggagttt 3480 gtctacggcg actacaaggt ctacgacgtg cgcaagatga tcgccaagtc cgagcaggag 3540 atcggcaagg ccacggccaa gtactttttc tactccaaca ttatgaactt ctttaagact 3600 gagatcaccc ttgccaacgg cgagatccgc aagcgccccc ttatcgagac caacggcgag 3660 accggcgaaa ttgtgtggga taagggtcgc gactttgcca ccgtccgcaa ggtcctcagc 3720 atgccccagg tcaacattgt taagaagacc gaggtccaga cgggcggctt tagcaaggag 3780 tctatcctcc ccaagcgtaa cagcgacaag ctcatcgccc gcaagaagga ctgggaccct 3840 aagaagtacg gcggcttcga ttcgcctacg gtcgcctaca gcgtcctcgt cgtcgccaag 3900 gtcgagaagg gcaagtccaa gaagctcaag tccgtcaagg agctcctcgg catcacgatc 3960 atggagcgct ccagctttga gaagaacccc attgacttcc tcgaggctaa gggttacaag 4020 gaggtcaaga aggaccttat catcaagctc cccaagtact ccctctttga gctcgaaaac 4080 ggccgcaagc gtatgctcgc tagcgctggc gaactccaga agggcaacga gctcgccctc 4140 cccagcaagt acgtcaactt tctctacctc gcctcccact acgagaagct caagggtagc 4200 ccggaggata acgagcagaa gcagcttttt gtggagcagc acaagcacta ccttgacgag 4260 atcattgaac agatctccga gttctccaag cgtgttattc ttgctgacgc caacctcgat 4320 aaggtgctct ccgcgtacaa caagcaccgc gacaagccta tccgcgagca ggccgagaac 4380 atcatccacc tctttaccct caccaacctc ggcgccccgg ccgcctttaa gtactttgat 4440 acgactatcg accgcaagcg ctacacttcg actaaggagg tcctcgacgc taccctcatt 4500 caccagtcca ttaccggcct ctacgagacc cgcattgacc tttcgcagct cggtggcgac 4560 tcgcgtgcgg accctaagaa gaagcgcaag gtctaacata tgatctaaac tagtgaatcg 4620 cgacaagttg tctttttgtt actctacgta ctgctctttc taattttacg acgtatgctt 4680 ctgctcttga cgacgacaac catggcaata aaagtaaggc aagaagtgag tgcgtgctcg 4740 ctcacaaggt ctaggccgaa acgaggcgcc ttaggattgg ttgtctccgt cgtaagtcat 4800 gacggagcgt aggacaccga cggcctgccg cgcaaatata cgaaccgctg cacaattctt 4860 ttcgttgaga acaacctcga acggcctgcc ttgctgagct gccgacacgg tggaaggaag 4920 catagcggcc aatcgagggg atgctattaa ttaggcttgg cgctgctgcc gccgtgcctg 4980 aagatgtgcg caagcgtggc aacaccgtcc accttgccat taaagtcatg cactccaaat 5040 acctcgaaca gagcctctcg catgtttggc caagcacctt cgagagagtc tggactctca 5100 ctccttcctc accgttgtcg ccattctcag cgagctcatg caatcaccag gacggagaga 5160 cggccagctg cttttctgct ttccattgtt attagagaaa cgcttctcgc tctcatcgtc 5220 ttagtagaca ttccgatggc ttcgttcgcc aatttgtcac ctaagtaagc tagagtgtta 5280 agtctaaatg cctttgaccc gcgtacggcg tcacgtagat gcctgtcctt gccagcaaac 5340 gctagttcgc ggtgtgcgta atttggcccg cattatgctg gctctcaaaa tcaaccgcca 5400 ccactcgcgg ctgcacgatg attttcgtgc actcatgaca tgagaaatgt gatactcaaa 5460 ctagtataga cctcctactc ctactgctgc ttttctcgtc agagctgtct ccggaaaagt 5520 tgacaagttg ttggccttct tcttctctgc tagtaagtag atcatcatgg atgagacgat 5580 gatgataatg atgatgatga taatttaaat ctcgagacct tcgagcgctg ctaatttcgc 5640 ctgacgctcc ttcttttgtg cttccatgac acgccgcttc accgtgcgtt ccacttcttc 5700 ctcagacatg cccttggctg cctcgacctg ctcggtaaaa cgggccccag cacgtgctac 5760 gagattcga ttccaccgcc gccttctatg aaaggttggg cttcggaatc gttttccggg 5820 acgccggctg gatgatcctc cagcgcgggg atctcatgct ggagttcttc gcccacccca 5880 acttgtttat tgcagcttat aatggttaca aataaagcaa tagcatcaca aatttcacaa 5940 ataaagcatt tttttcactg cattctagtt gtggtttgtc caaactcatc aatgtatctt 6000 atcatacatg gtcgacctgc aggaacctgc attaatgaat cggccaacgc gcggggagag 6060 gcggtttgcg tattgggcgc tcttccgctt cctcgctcac tgactcgctg cgctcggtcg 6120 ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta tccacagaat 6180 caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta 6240 aaaaggccgc gttgctggcg tttttccata ggctccgccc ccctgacgag catcacaaaa 6300 atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc 6360 cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc ggatacctgt 6420 ccgcctttct cccttcggga agcgtggcgc tttctcatag ctcacgctgt aggtatctca 6480 gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc gttcagcccg 6540 accgctgcgc cttatccggt aactatcgtc ttgagtccaa cccggtaaga cacgacttat 6600 cgccactggc agcagccact ggtaacagga ttagcagagc gaggtatgta ggcggtgcta 6660 cagagttctt gaagtggtgg cctaactacg gctacactag aagaacagta tttggtatct 6720 gcgctctgct gaagccagtt accttcggaa aaagagttgg tagctcttga tccggcaaac 6780 aaaccaccgc tggtagcggt ggtttttttg tttgcaagca gcagattacg cgcagaaaaa 6840 aaggatctca agaagatcct ttgatctttt ctacggggtc tgacgctcag tggaacgaaa 6900 actcacgtta agggattttg gtcatgagat tatcaaaaag gatcttcacc tagatccttt 6960 taaattaaaa atgaagtttt aaatcaatct aaagtatata tgagtaaact tggtctgaca 7020 gttaccaatg cttaatcagt gaggcaccta tctcagcgat ctgtctattt cgttcatcca 7080 tagttgcctg actccccgtc gtgtagataa ctacgatacg ggagggctta ccatctggcc 7140 ccagtgctgc aatgataccg cgagacccac gctcaccggc tccagattta tcagcaataa 7200 accagccagc cggaagggcc gagcgcagaa gtggtcctgc aactttatcc gcctccatcc 7260 agtctatta ttgttgccgg gaagctagag taagtagttc gccagttaat agtttgcgca 7320 acgttgttgc cattgctaca ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat 7380 tcagctccgg ttcccaacga tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag 7440 cggttagctc cttcggtcct ccgatcgttg tcagaagtaa gttggccgca gtgttatcac 7500 tcatggttat ggcagcactg cataattctc ttactgtcat gccatccgta agatgctttt 7560 ctgtgactgg tgagtactca accaagtcat tctgagaata gtgtatgcgg cgaccgagtt 7620 gctcttgccc ggcgtcaata cgggataata ccgcgccaca tagcagaact ttaaaagtgc 7680 tcatcattgg aaaacgttct tcggggcgaa aactctcaag gatcttaccg ctgttgagat 7740 ccagttcgat gtaacccact cgtgcaccca actgatcttc agcatctttt actttcacca 7800 gcgtttctgg gtgagcaaaa acaggaaggc aaaatgccgc aaaaaaggga ataagggcga 7860 cacggaaatg ttgaatactc atactcttcc tttttcaata ttattgaagc atttatcagg 7920 gttattgtct catgagcgga tacatatttg aatgtattta gaaaaataaa caaatagggg 7980 ttccgcgcac atttccccga aaagtgccac ctgacgtcta agaaaccatt attatcatga 8040 cattaaccta taaaaatagg cgtatcacga ggccctttcg tctcgcgcgt ttcggtgatg 8100 acggtgaaaa cctctgacac atgcagctcc cggagacggt cacagcttgt ctgtaagcgg 8160 atgccgggag cagacaagcc cgtcagggcg cgtcagcggg tgttggcggg tgtcggggct 8220 ggcttaacta tgcggcatca gagcagattg tactgagagt gcaccaagct tatttaaatc 8280 tttatcggtg tggcgcgcct tgacggccgc ctcggattca cttcgcaggg cacttctgac 8340 gcgctcaagc gtgcccagcg tggtgccatc tttggcctct gcaagaccat cggcctcgag 8400 tggtccgagt ctgacgtctt ttcccgcggc gtggacattg ctcagggcat gcaccccgag 8460 gatgccgccg tggcgattgt gcgcgagatg gcgtgcgctg acattcgcat tcgcgaggtc 8520 ggcattggcg caaaccagca gcgctgcacg atccgtgccg ccaagctcga gaccggcaac 8580 ccgcagcgcc agatcgccaa ggacgacgtg ctgctcgttt ctggcggcgc tcgcggcatc 8640 acgcctcttt gcatccggga gatcacgcgc cagatcgcgg gcggcaagta cattctgctt 8700 ggccgcagca aggtctctgc gagcgaaccg gcatggtgcg ctggcatcac tgacgagaag 8760 gctgtgcaaa aggctgctac ccaggagctc aagcgcgcct ttagcgctgg cgagggcccc 8820 aagcccacgc cccgcgctgt cactaagctt gtgggctctg ttcttggcgc tcgcgaggtg 8880 cgcagctcta ttgctgcgat tgaagcgctc ggcggcaagg ccatctactc gtcgtgcgac 8940 gtgaactctg ccgccgacgt ggccaaggcc gtgcgcgatg ccgagtccca gctcggtgcc 9000 cgcgtctcgg gcatcgttca tgcctcgggc gtgctccgcg accgtctcat cgagaagaag 9060 ctccccgacg agttcgacgc cgtctttggc accaaggtca ccggtctcga gaacctcctc 9120 gccgccgtcg accgcgccaa cctcaagcac atggtcctct tcagctcgct cgccggcttc 9180 cacggcaacg tcggccagtc tgactacgcc atggccaacg aggcccttaa caagatgggc 9240 ctcgagctcg ccaaggacgt ctcggtcaag tcgatctgct aagaaagtga accttgtcct 9300 aacccgacag cgaatggcgg gagggggcgg gctaaaagat cgtattacat agtattttcc 9360 cctactcttt gtgtttgtct tttttttttt tgaacgcatt caagccactt gtcttggttt 9420 acttgtttgt ttgcttgctt gcttgcttgc ttgcctgctt cttggtcaga cggacccaaa 9480 aaagggaaaa aattcattca tggcacagat aagaaaaaga aaaagtttgt cgaccaccgt 9540 catcagaaag caagagaaga gaaacactcg cgctcacatt ctcgctcgcg taagaatcaa 9600 gcttccaatt ttaggccccc cactgaccga ggtctgtcga taatccactt ttccattgat 9660 tttccaggtt tcgttaactc atgccactga gcaaaacttc ggtctttcct aacaaaagct 9720 ctcctcacaa agcatggcgc ggcaacggac gtgtcctcat actccactgc cacacaaggt 9780 cgataaacta agctcctcac aaatagagga gaattccact gacaactgaa aacaatgtat 9840 gagagacgat caccactgga gcggcgcggc ggttgggcgc ggaggtcggc agcaaaaaca 9900 agcgactcgc cgagcaaacc cgaatcagcc ttcagacggt cgtgcctaac aacacgccgt 9960 tctaccccgc cttcttcgcg ccccttcgcg tccaagcatc cttcaagttt atctctctag 10020 ttcaacttca agaagaacaa caccaccaac accatggcca agttgaccag tgccgttccg 10080 gtgctcaccg cgcgcgacgt cgccggagcg gtcgagttct ggaccgaccg gctcgggttc 10140 tcccgggact tcgtggagga cgacttcgcc ggtgtggtcc gggacgacgt gaccctgttc 10200 atcagcgcgg tccaggacca ggtggtgccg gacaacaccc tggcctgggt gtgggtgcgc 10260 ggcctggacg agctgtacgc cgagtggtcg gaggtcgtgt ccacgaactt ccgggacgcc 10320 tccgggccgg ccatgaccga gatcggcgag cagccgtggg ggcgggagtt cgccctgcgc 10380 gacccggccg gcaactgcgt gcacttcgtg gccgaggagc aggactgaca cgtgctacga 10440 gatttcgatt ccaccgccgc cttctatgaa aggttgggct tcggaatcgt tttccgggac 10500 gccggctgga tgatcctcca gcgcggggat ctcatgctgg agttcttcgc ccaccccaac 10560 ttgtttattg cagcttataa tggttacaaa taaagcaata gcatcacaaa tttcacaaat 10620 aaagcatttt tttcactgca ttctagttgt ggtttgtcca aactcatcaa tgtatcttat 10680 catgtctgaa ttcccggggt ac 10702 <210> 4 <211> 11332 <212> DNA <213> Artificial Sequence <220> <223> pYB33 vector <220> <221> misc_feature <222> (2)..(1006) <223> CS_promoter <220> <221> misc_feature <222> (1013)..(5152) <223> Cas9 <220> <221> misc_feature <222> (5159)..(6158) <223> OrfA_terminator <220> <221> misc_feature <222> (8836)..(9835) <223> OrfA_UP_homology <220> <221> misc_feature <222> (9839)..(10154) <223> OrfC_terminator <220> <221> misc_feature <222> (10161)..(10609) <223> alpha_tubulin_promoter <220> <221> misc_feature <222> (10610)..(10984) <223> Sh_ble_gene <220> <221> misc_feature <222> (10985)..(11258) <223> SV40_terminator <400> 4 cgagcgggcg attccaccgt caactggtct tcgcctgtgc cttccagcgc agtgcgcctt 60 gccccgctcg gtcttattat tcttggcgct ctcgtcaatg tgatcatgat ctagtcgaac 120 gtgcaatcta gccaatgaaa aaagagtcca gttctatctg aatttttcac tttctaaatc 180 tcgcatcgac aatctacttt tcaaatctcg caacaaagct gatcttgttt ctccctcacc 240 cagttctatc tgaatttttc cttttctgaa gctcgcgtta acaatctact tttcgaatct 300 gtcaacaaag ctgatcttgt ttctcccccc tatccccttc cctcccccct tctttgggat 360 cttgttgtgc gtgtcgcccc ttcaacttct ttgttcgacg atgacctcca cctagcctgt 420 gaagctcatc gtctccgagt atttctggcc tgctccaatt cctctcttcc attctccatc 480 gcatacatgc atgttctttg gtctcactcc gagccatgct tcttcggtca ctacttcatc 540 tatttgacta ggcctctgtt cgagcgacga accctccgtg ttcgcgggtg ttcattctct 600 gcaaagtggt ccgtaaccgt gactaccgga cacctcgcgt acactacatt cgggacggac 660 gcggccgagc gcgacgtctc tgggcccggc ctgccgcccc cggggccgcg gcttcctcgc 720 gccgccagcc gcgtccaagt cgccagcgcg aggtcgcgcg agtcgaagga gacgttgtcg 780 atctcgaccc tcgccatgcg cgtgacgggt gaccgcctca ccggatcccg ccctccgcgc 840 gctgccttca ttccttcatt ccttcattcc ttcactcaat cctgcatcat ccatcgcccg 900 cccgcccgct cgcacgcacc agaggcgcgc attgcgggcc agggcgccgc ctgcagaccg 960 ccatcgcgcc cgccttctgc cgcgcctcgc tcgctcggag accgagggat ccatggataa 1020 gaagtactcg atcggcctcg acattggcac caacagcgtc ggctgggccg tcattactga 1080 tgagtacaag gtcccgtcga agaagtttaa ggtcctcggc aacactgacc gccactccat 1140 caagaagaac ctcatcggtg ccctcctttt tgactccggc gagaccgctg aggccactcg 1200 cctcaagcgc actgcccgcc gccgttacac ccgccgcaag aaccgcatct gctacctcca 1260 ggagattttc tcgaacgaaa tggccaaggt cgatgactcc tttttccacc gtctcgaaga 1320 atcgttcctc gtcgaggagg acaagaagca cgagcgccac cccatcttcg gtaacattgt 1380 cgatgaggtt gcctaccacg agaagtaccc gaccatctac cacctccgca agaagctcgt 1440 cgactccacc gacaaggccg atctccgcct tatctacctc gccctcgccc acatgatcaa 1500 gttccgcggc cactttctta tcgagggtga tctcaaccct gataactctg acgtcgacaa 1560 gcttttcatc cagctcgtcc agacttacaa ccagctcttc gaggagaacc ccatcaacgc 1620 ttccggcgtc gacgcgaagg ccattctcag cgcccgcctc agcaagtccc gccgcctcga 1680 aaacctcatt gcccagcttc ccggcgagaa gaagaacggc ctcttcggca acctcattgc 1740 cctcagcctt ggcctcaccc ctaacttcaa gtcgaacttt gacctcgccg aggacgccaa 1800 gctccagctt tccaaggaca cttacgacga cgatctcgac aacctcctcg ctcagattgg 1860 cgaccagtac gctgacctct tcctcgccgc caagaacctt agcgatgcca tcctcctctc 1920 cgacatcctt cgtgttaaca cggaaatcac gaaggctccg ctctccgcct ccatgatcaa 1980 gcgttacgac gagcaccatc aggacctcac cctcctcaag gccctcgtcc gccagcagct 2040 ccccgagaag tacaaggaga tcttcttcga ccagagcaag aacggctacg ccggctacat 2100 tgacggcggc gcgtcgcagg aggagtttta caagtttatc aagcccattc ttgagaagat 2160 ggacggcacc gaggagctcc tcgtcaagct caaccgtgag gaccttctcc gcaagcagcg 2220 cacgttcgac aacggctcta ttccccatca gatccacctc ggtgagcttc acgcgattct 2280 tcgccgccag gaagactttt acccgttcct caaggacaac cgcgagaaga ttgagaagat 2340 cctcaccttt cgcattccct actacgtcgg ccccctcgcc cgcggcaact cgcgctttgc 2400 ttggatgacc cgcaagtccg aggagaccat caccccgtgg aacttcgaag aggtcgtcga 2460 caagggcgcc tccgcgcagt ctttcatcga gcgcatgact aactttgaca agaacctccc 2520 gaacgagaag gtcctcccca agcacagcct cctttacgaa tactttacgg tgtacaacga 2580 gctcacgaag gtcaagtacg tcactgaggg catgcgcaag ccggcgttcc tttcgggcga 2640 gcagaagaag gctatcgtcg acctcctttt caagaccaac cgcaaggtta ccgtcaagca 2700 gctcaaggag gactacttca agaagatcga gtgctttgac tcggtcgaga tttcgggcgt 2760 ggaggaccgt ttcaacgcct ccctcggcac ttaccacgac cttctcaaga tcatcaagga 2820 caaggacttt ctcgacaacg aggagaacga ggacattctc gaggacatcg tcctcacgct 2880 caccctcttt gaggaccgtg agatgatcga ggagcgcctc aagacctacg cccatctctt 2940 tgacgacaag gtcatgaagc agctcaagcg ccgccgctac accggctggg gccgcctttc 3000 ccgcaagctc atcaacggca tccgcgacaa gcagtctggc aagaccatcc ttgactttct 3060 taagtctgat ggtttcgcca accgcaactt catgcagctc atccacgacg acagcctcac 3120 tttcaaggag gacattcaga aggcccaggt ctccggccag ggtgactctc tccacgaaca 3180 catcgccaac cttgctggca gcccggctat taagaagggc atcctccaga ccgtcaaggt 3240 cgtcgacgag ctcgtcaagg ttatgggccg ccacaagccc gagaacatcg tcattgagat 3300 ggctcgcgaa aaccagacca cccagaaggg tcagaagaac tcccgcgagc gcatgaagcg 3360 tatcgaggag ggcatcaagg agctcggcag ccagatcctc aaggagcacc cggtcgagaa 3420 cacccagctc cagaacgaaa agctctacct ctactacctc cagaacggcc gtgacatgta 3480 cgttgaccag gagctcgaca ttaaccgcct ctccgattac gacgtcgacc atattgtccc 3540 ccagagcttt ctcaaggacg acagcatcga caacaaggtc ctcacccgct cggacaagaa 3600 ccgcggcaag tccgacaacg tcccttccga ggaggtcgtg aagaagatga agaactactg 3660 gcgccagctt ctcaacgcta agcttattac tcagcgcaag ttcgataacc tcaccaaggc 3720 cgaacgcggc ggcctctccg agctcgacaa ggccggtttt atcaagcgcc agctcgttga 3780 gactcgccag atcaccaagc acgtggcgca gatcctcgac tcgcgcatga acacgaagta 3840 cgacgagaac gacaagctca tccgcgaggt caaggtcatc acccttaagt cgaagctcgt 3900 gtccgacttt cgcaaggact tccagttcta caaggtccgt gaaattaaca actaccacca 3960 cgctcacgac gcttacctca acgcggtcgt gggtaccgcg ctcatcaaga agtacccgaa 4020 gctcgagtcg gagtttgtct acggcgacta caaggtctac gacgtgcgca agatgatcgc 4080 caagtccgag caggagatcg gcaaggccac ggccaagtac tttttctact ccaacattat 4140 gaacttcttt aagactgaga tcacccttgc caacggcgag atccgcaagc gccccccttat 4200 cgagaccaac ggcgagaccg gcgaaattgt gtgggataag ggtcgcgact ttgccaccgt 4260 ccgcaaggtc ctcagcatgc cccaggtcaa cattgttaag aagaccgagg tccagacggg 4320 cggctttagc aaggagtcta tcctccccaa gcgtaacagc gacaagctca tcgcccgcaa 4380 gaaggactgg gaccctaaga agtacggcgg cttcgattcg cctacggtcg cctacagcgt 4440 cctcgtcgtc gccaaggtcg agaagggcaa gtccaagaag ctcaagtccg tcaaggagct 4500 cctcggcatc acgatcatgg agcgctccag ctttgagaag aaccccattg acttcctcga 4560 ggctaagggt tacaaggagg tcaagaagga ccttatcatc aagctcccca agtactccct 4620 ctttgagctc gaaaacggcc gcaagcgtat gctcgctagc gctggcgaac tccagaaggg 4680 caacgagctc gccctcccca gcaagtacgt caactttctc tacctcgcct cccactacga 4740 gaagctcaag ggtagcccgg aggataacga gcagaagcag ctttttgtgg agcagcacaa 4800 gcactacctt gacgagatca ttgaacagat ctccgagttc tccaagcgtg ttattcttgc 4860 tgacgccaac ctcgataagg tgctctccgc gtacaacaag caccgcgaca agcctatccg 4920 cgagcaggcc gagaacatca tccacctctt taccctcacc aacctcggcg ccccggccgc 4980 ctttaagtac tttgatacga ctatcgaccg caagcgctac acttcgacta aggaggtcct 5040 cgacgctacc ctcattcacc agtccattac cggcctctac gagacccgca ttgacctttc 5100 gcagctcggt ggcgactcgc gtgcggaccc taagaagaag cgcaaggtct aacatatgat 5160 ctaaactagt gaatcgcgac aagttgtctt tttgttactc tacgtactgc tctttctaat 5220 tttacgacgt atgcttctgc tcttgacgac gacaaccatg gcaataaaag taaggcaaga 5280 agtgagtgcg tgctcgctca caaggtctag gccgaaacga ggcgccttag gattggttgt 5340 ctccgtcgta agtcatgacg gagcgtagga caccgacggc ctgccgcgca aatatacgaa 5400 ccgctgcaca attcttttcg ttgagaacaa cctcgaacgg cctgccttgc tgagctgccg 5460 acacggtgga aggaagcata gcggccaatc gaggggatgc tattaattag gcttggcgct 5520 gctgccgccg tgcctgaaga tgtgcgcaag cgtggcaaca ccgtccacct tgccattaaa 5580 gtcatgcact ccaaatacct cgaacagagc ctctcgcatg tttggccaag caccttcgag 5640 agagtctgga ctctcactcc ttcctcaccg ttgtcgccat tctcagcgag ctcatgcaat 5700 caccaggacg gagagacggc cagctgcttt tctgctttcc attgttatta gagaaacgct 5760 tctcgctctc atcgtcttag tagacattcc gatggcttcg ttcgccaatt tgtcacctaa 5820 gtaagctaga gtgttaagtc taaatgcctt tgacccgcgt acggcgtcac gtagatgcct 5880 gtccttgcca gcaaacgcta gttcgcggtg tgcgtaattt ggcccgcatt atgctggctc 5940 tcaaaatcaa ccgccaccac tcgcggctgc acgatgattt tcgtgcactc atgacatgag 6000 aaatgtgata ctcaaactag tatagacctc ctactcctac tgctgctttt ctcgtcagag 6060 ctgtctccgg aaaagttgac aagttgttgg ccttcttctt ctctgctagt aagtagatca 6120 tcatggatga gacgatgatg ataatgatga tgatgataat ttaaatctcg agaccttcga 6180 gcgctgctaa tttcgcctga cgctccttct tttgtgcttc catgacacgc cgcttcaccg 6240 tgcgttccac ttcttcctca gacatgccct tggctgcctc gacctgctcg gtaaaacggg 6300 ccccagcacg tgctacgaga tttcgattcc accgccgcct tctatgaaag gttgggcttc 6360 ggaatcgttt tccgggacgc cggctggatg atcctccagc gcggggatct catgctggag 6420 ttcttcgccc accccaactt gtttattgca gcttataatg gttacaaata aagcaatagc 6480 atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg tttgtccaaa 6540 ctcatcaatg tatcttatca tacatggtcg acctgcagga acctgcatta atgaatcggc 6600 caacgcgcgg ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc gctcactgac 6660 tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata 6720 cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa 6780 aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct 6840 gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa 6900 agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg 6960 cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca 7020 cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa 7080 ccccccgttc agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg 7140 gtaagacacg acttatcgcc actggcagca gccactggta acaggattag cagagcgagg 7200 tatgtaggcg gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga 7260 acagtatttg gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc 7320 tcttgatccg gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag 7380 attacgcgca gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac 7440 gctcagtgga acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc 7500 ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag 7560 taaacttggt ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt 7620 ctatttcgtt catccatagt tgcctgactc cccgtcgtgt agataactac gatacgggag 7680 ggcttaccat ctggccccag tgctgcaatg ataccgcgag acccacgctc accggctcca 7740 gattatcag caataaacca gccagccgga agggccgagc gcagaagtgg tcctgcaact 7800 ttatccgcct ccatccagtc tattaattgt tgccgggaag ctagagtaag tagttcgcca 7860 gttaatagtt tgcgcaacgt tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg 7920 tttggtatgg cttcattcag ctccggttcc caacgatcaa ggcgagttac atgatccccc 7980 atgttgtgca aaaaagcggt tagctccttc ggtcctccga tcgttgtcag aagtaagttg 8040 gccgcagtgt tattactcat ggttatggca gcactgcata attctcttac tgtcatgcca 8100 tccgtaagat gcttttctgt gactggtgag tactcaacca agtcattctg agaatagtgt 8160 atgcggcgac cgagttgctc ttgcccggcg tcaatacggg ataataccgc gccacatagc 8220 agaactttaa aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc 8280 ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg cacccaactg atcttcagca 8340 tcttttactt tcaccagcgt ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa 8400 aagggaataa gggcgacacg gaaatgttga atactcatac tcttcctttt tcaatattat 8460 tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg tatttagaaa 8520 aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga cgtctaagaa 8580 accattata tcatgacatt aacctataaa aataggcgta tcacgaggcc ctttcgtctc 8640 gcgcgtttcg gtgatgacgg tgaaaacctc tgacacatgc agctcccgga gacggtcaca 8700 gcttgtctgt aagcggatgc cgggagcaga caagcccgtc agggcgcgtc agcgggtgtt 8760 ggcgggtgtc ggggctggct taactatgcg gcatcagagc agattgtact gagagtgcac 8820 caagcttatt taaatcttta tcggtgtggc gcgccttgac ggccgcctcg gattcacttc 8880 gcagggcact tctgacgcgc tcaagcgtgc ccagcgtggt gccatctttg gcctctgcaa 8940 gaccatcggc ctcgagtggt ccgagtctga cgtcttttcc cgcggcgtgg acattgctca 9000 gggcatgcac cccgaggatg ccgccgtggc gattgtgcgc gagatggcgt gcgctgacat 9060 tcgcattcgc gaggtcggca ttggcgcaaa ccagcagcgc tgcacgatcc gtgccgccaa 9120 gctcgagacc ggcaacccgc agcgccagat cgccaaggac gacgtgctgc tcgtttctgg 9180 cggcgctcgc ggcatcacgc ctctttgcat ccgggagatc acgcgccaga tcgcgggcgg 9240 caagtacatt ctgcttggcc gcagcaaggt ctctgcgagc gaaccggcat ggtgcgctgg 9300 catcactgac gagaaggctg tgcaaaaggc tgctacccag gagctcaagc gcgcctttag 9360 cgctggcgag ggccccaagc ccacgccccg cgctgtcact aagcttgtgg gctctgttct 9420 tggcgctcgc gaggtgcgca gctctattgc tgcgattgaa gcgctcggcg gcaaggccat 9480 ctactcgtcg tgcgacgtga actctgccgc cgacgtggcc aaggccgtgc gcgatgccga 9540 gtcccagctc ggtgcccgcg tctcgggcat cgttcatgcc tcgggcgtgc tccgcgaccg 9600 tctcatcgag aagaagctcc ccgacgagtt cgacgccgtc tttggcacca aggtcaccgg 9660 tctcgagaac ctcctcgccg ccgtcgaccg cgccaacctc aagcacatgg tcctcttcag 9720 ctcgctcgcc ggcttccacg gcaacgtcgg ccagtctgac tacgccatgg ccaacgaggc 9780 ccttaacaag atgggcctcg agctcgccaa ggacgtctcg gtcaagtcga tctgctaaga 9840 aagtgaacct tgtcctaacc cgacagcgaa tggcgggagg gggcgggcta aaagatcgta 9900 ttacatagta ttttccccta ctctttgtgt ttgtcttttt tttttttgaa cgcattcaag 9960 ccacttgtct tggtttactt gtttgtttgc ttgcttgctt gcttgcttgc ctgcttcttg 10020 gtcagacgga cccaaaaaag ggaaaaaatt cattcatggc acagataaga aaaagaaaaa 10080 gtttgtcgac caccgtcatc agaaagcaag agaagagaaa cactcgcgct cacattctcg 10140 ctcgcgtaag aatcaagctt ccaattttag gccccccact gaccgaggtc tgtcgataat 10200 ccacttttcc attgattttc caggtttcgt taactcatgc cactgagcaa aacttcggtc 10260 tttcctaaca aaagctctcc tcacaaagca tggcgcggca acggacgtgt cctcatactc 10320 cactgccaca caaggtcgat aaactaagct cctcacaaat agaggagaat tccactgaca 10380 actgaaaaca atgtatgaga gacgatcacc actggagcgg cgcggcggtt gggcgcggag 10440 gtcggcagca aaaacaagcg actcgccgag caaacccgaa tcagccttca gacggtcgtg 10500 cctaacaaca cgccgttcta ccccgccttc ttcgcgcccc ttcgcgtcca agcatccttc 10560 aagtttatct ctctagttca acttcaagaa gaacaacacc accaacaccc caattttagg 10620 ccccccactg accgaggtct gtcgataatc cacttttcca ttgattttcc aggtttcgtt 10680 aactcatgcc actgagcaaa acttcggtct ttcctaacaa aagctctcct cacaaagcat 10740 ggcgcggcaa cggacgtgtc ctcatactcc actgccacac aaggtcgata aactaagctc 10800 ctcacaaata gaggagaatt ccactgacaa ctgaaaacaa tgtatgagag acgatcacca 10860 ctggagcggc gcggcggttg ggcgcggagg tcggcagcaa aaacaagcga ctcgccgagc 10920 aaacccgaat cagccttcag acggtcgtgc ctaacaacac gccgttctac cccgccttct 10980 tcgcgcccct tcgcgtccaa gcatccttca agtttatctc tctagttcaa cttcaagaag 11040 aacaacacca ccaacaccca cgtgctacga gatttcgatt ccaccgccgc cttctatgaa 11100 aggttgggct tcggaatcgt tttccgggac gccggctgga tgatcctcca gcgcggggat 11160 ctcatgctgg agttcttcgc ccaccccaac ttgtttattg cagcttataa tggttacaaa 11220 taaagcaata gcatcacaaa tttcacaaat aaagcatttt tttcactgca ttctagttgt 11280 ggtttgtcca aactcatcaa tgtatcttat catgtctgaa ttcccggggt ac 11332 <210> 5 <211> 4777 <212> DNA <213> Artificial Sequence <220> <223> pCL399 vector <220> <221> misc_feature <222> (1)..(2523) <223> FAS_fragment <220> <221> misc_feature <222> (2528)..(4777) <223> pSP73_fragment <400> 5 gtttaaactc gcggcgtctt cgccgtcgag gcgcgtcttt cgaggcgggc gaggtgtttt 60 tcttttcttt tcttctcgct gcagctgcgc cgcggcgaac gcagttcgcc gcggcggctg 120 cgacgcgcct gcgatgtcta tgcgcaggca aggcacgacg tcttgcggcg ccgcttcctg 180 cgccgccttg cgtcttggcc ccgccgacga cgcaagcagc ggcggcgccc ccccgccctc 240 ctccactgtg ggccgcagcc ctccttttcg cgcgcccgca gccgcgcggc gcgccccgcg 300 aacaaagagc cgccgcgccg gtccgcactg cgcgggccgc cccgcaagtg ccgcaaacgc 360 cggcccgaac cgccgcaaac gcgcccgcag ccgcgcccgc agccgcgcgc gacccgcggt 420 ggggacgcgc gccaagcgtc cccttccgcg ggatgacgta ggcggcggcc ccgcctatgc 480 aatacgggag gaaccaggaa ccgggagggg gggggggcgg cgcgcgcgcc gtccagtgcg 540 ggaccgatcg gcgccgggat gcccgggcgg gagggacaca gccaggcagt cagtcagtca 600 gccgcacaga gagagcgcgc ctgcgagtcc cgtctggtct cggaattgta tcccgcgcag 660 agctcagaat cgcaggtcga tcgatcgagc gatggatcca tcgctctatc cgtccatcga 720 tccatcgcat ccatcgcatc catcgcatcc atcgttgcat cgcttgcacc gcccgcttgc 780 atcgcgtgcg tgcgcaggcg ggcggcggcc acgacgcgac cgagagcggc ggcgagtgca 840 gacgccgccg gcgcccgcgg ctgcgtcgcc gcaggaagaa ggaggggggc gcgtgtttcc 900 cgcgggaggg aggagggagg gagggaggtg gttgggccaa aaagggcggc ctggacaggc 960 aggcaggccg gaagcgacgc cagcgagcga aggaagagga gagagccgcg cgggcggcca 1020 gcgcggggcg ggcggcagca aacccgctac tcagggtaaa agacagacag ctcttcgagc 1080 gagcttgttc acttcgcgga agcacgcgcg caggcacgca ggcacgcagg cagcatagcg 1140 agcagcagca gcatcgcgag cagcattgcg agaggaggcg ctgaccgccg gcctcgagca 1200 acaaaagaag cagcagcagc agcagcacga gcagcagcag cagcagcacg agcagcagca 1260 gcacgagcag cagcagcagc agcagcaaga tggcgcagcc cgagtcgacg acgccgacca 1320 tgacgcccga ggaaggccag atggaggggg cgccgcagca ggataatgcc caggtgaaga 1380 agcactgctt cgccgacgcc gatgtggcaa cctgcatcgc cgcctttggc ggtcagggaa 1440 gcgactggct cagtgagctg cgctccctcc aggaaaaggg ccagaccaac gtgcgagaaa 1500 ccatcgaact cgcactcgac aaactcgagg atcttgtcaa ggccgagccc tggtacgagg 1560 agcacggagg atgcgatatc cgcgcctggc tcgagagcga cgacaatgtc cccaacttcg 1620 acctcctccg ctacgcgccc gtctccttcc ccctcatctt cctcacccaa atgtgcaatt 1680 acatgcgtgt cctcgagaaa ctcggcacct gccatgaaga cgccctccaa aagggctggg 1740 tcaaggcctc gctcggacac agccagggcg tcgtctccgc cgccgtcgtt gccgcagcca 1800 acaccgaccg cgagctgcgc aacctcgtgg tctccggcct cgaatacatg tcaaaagtcg 1860 gcatcgccgc ccagcgcacg ctcgactacg agctcggacg ccgcaacgcc ggcccggaga 1920 ccccgatgct cgctgtacag ggaatggacg aaaaagtcct taccaaggcc ttcaaggccg 1980 ccgtctcgct ctccaacgag aagcaggcca tgatggccaa aatctcccca cggccgccgc 2040 cgccaccgcc gccccggccg ccgttagcga cgaagatcgc ttctccatcg ccctccgcaa 2100 cggccacgac gactttgtcg tctgcggcga gcccaaggac ctgcgcgtcc tccgcaaggt 2160 catcgagaaa cagagcgccg agcccggcaa ggaggcacag gcgcgcacgc ccttttccaa 2220 gcgcaagccc gtcacccaga ccaccttcct ccgcatgacg gccgtcttcc acagcgctct 2280 caacaaggac gccctcgccc agatcaacac atgggccccg gagtccgcct ttagcaaggc 2340 cttcgcccag gcctcgctcc gtgttcccgt ctttgacacc aagtctggcg ctaatctgca 2400 agatgttccc gccgccgatg ttgtcgccca tcttaccacc aacatgctca ctgagcgcgc 2460 cgacgttctc gtctccctcc gtgccgccga gaccaagacc gacgccagcc acctcctctg 2520 tttaaacgtt aacctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat 2580 tgggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg 2640 agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag gggataacgc 2700 aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt 2760 gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc gacgctcaag 2820 tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc 2880 cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg cctttctccc 2940 ttcgggaagc gtggcgcttt ctcaatgctc acgctgtagg tatctcagtt cggtgtaggt 3000 cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt 3060 atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc 3120 agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa 3180 gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg ctctgctgaa 3240 gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg 3300 tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga 3360 agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg 3420 gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg 3480 aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt 3540 aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact 3600 ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca gtgctgcaat 3660 gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc agccagccgg 3720 aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt ctattaattg 3780 ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat 3840 tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca gctccggttc 3900 ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg ttagctcctt 3960 cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca tggttatggc 4020 agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg tgactggtga 4080 gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct cttgcccggc 4140 gtcaatacgg gataataccg cgccacatag cagaacttta aaagtgctca tcattggaaa 4200 acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca gttcgatgta 4260 acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg tttctgggtg 4320 agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg 4380 aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt attgtctcat 4440 gagcggatac atatttgaat gtatttagaa aaataaacaa ataggggttc cgcgcacatt 4500 tccccgaaaa gtgccacctg acgtctaaga aaccattatt atcatgacat taacctataa 4560 aaataggcgt atcacgaggc cctttcgtct cgcgcgtttc ggtgatgacg gtgaaaacct 4620 ctgacacatg cagctcccgg agacggtcac agcttgtctg taagcggatg ccgggagcag 4680 acaagcccgt cagggcgcgt cagcgggtgt tggcgggtgt cggggctggc ttaactatgc 4740 ggcatcagag cagattgtac tgagagtgca ccatatg 4777 <210> 6 <211> 4775 <212> DNA <213> Artificial Sequence <220> <223> pCL400 vector <220> <221> misc_feature <222> (1)..(2528) <223> Fas_fragment <220> <221> misc_feature <222> (2533)..(4775) <223> pSP73_fragment <400> 6 ccatggttta aactcgcggc gtcttcgccg tcgaggcgcg tctttcgagg cgggcgaggt 60 gtttttcttt tcttttcttc tcgctgcagc tgcgccgcgg cgaacgcagt tcgccgcggc 120 ggctgcgacg cgcctgcgat gtctatgcgc aggcaaggca cgacgtcttg cggcgccgct 180 tcctgcgccg ccttgcgtct tggccccgcc gacgacgcaa gcagcggcgg cgcccccccg 240 ccctcctcca ctgtgggccg cagccctcct tttcgcgcgc ccgcagccgc gcggcgcgcc 300 ccgcgaacaa agagccgccg cgccggtccg cactgcgcgg gccgccccgc aagtgccgca 360 aacgccggcc cgaaccgccg caaacgcgcc cgcagccgcg cccgcagccg cgcgcgaccc 420 gcggtgggga cgcgcgccaa gcgtcccctt ccgcgggatg acgtaggcgg cggccccgcc 480 tatgcaatac gggaggaacc aggaaccggg aggggggggg ggcggcgcgc gcgccgtcca 540 gtgcgggacc gatcggcgcc gggatgcccg ggcgggaggg acacagccag gcagtcagtc 600 agtcagccgc acagagagag cgcgcctgcg agtcccgtct ggtctcggaa ttgtatcccg 660 cgcagagctc agaatcgcag gtcgatcgat cgagcgatgg atccatcgct ctatccgtcc 720 atcgatccat cgcatccatc gcatccatcg catccatcgt tgcatcgctt gcaccgcccg 780 cttgcatcgc gtgcgtgcgc aggcgggcgg cggccacgac gcgaccgaga gcggcggcga 840 gtgcagacgc cgccggcgcc cgcggctgcg tcgccgcagg aagaaggagg ggggcgcgtg 900 tttcccgcgg gagggaggag ggagggaggg aggtggttgg gccaaaaagg gcggcctgga 960 caggcaggca ggccggaagc gacgccagcg agcgaaggaa gaggagagag ccgcgcgggc 1020 ggccagcgcg gggcgggcgg cagcaaaccc gctactcagg gtaaaagaca gacagctctt 1080 cgagcgagct tgttcacttc gcggaagcac gcgcgcaggc acgcaggcac gcaggcagca 1140 tagcgagcag cagcagcatc gcgagcagca ttgcgagagg aggcgctgac cgccggcctc 1200 gagcaacaaa agaagcagca gcagcagcag cacgagcagc agcagcagca gcacgagcag 1260 cagcagcacg agcagcagca gcagcagcag caagatggcg cagcccgagt cgacgacgcc 1320 gaccatgacg cccgaggaag gccagatgga gggggcgccg cagcaggata atgcccaggt 1380 gaagaagcac tgcttcgccg acgccgatgt ggcaacctgc atcgccgcct ttggcggtca 1440 gggaagcgac tggctcagtg agctgcgctc cctccaggaa aagggccaga ccaacgtgcg 1500 agaaaccatc gaactcgcac tcgacaaact cgaggatctt gtcaaggccg agccctggta 1560 cgaggagcac ggaggatgcg atatccgcgc ctggctcgag agcgacgaca atgtccccaa 1620 cttcgacctc ctccgctacg cgcccgtctc cttccccctc atcttcctca cccaaatgtg 1680 caattacatg cgtgtcctcg agaaactcgg cacctgccat gaagacgccc tccaaaaggg 1740 ctgggtcaag gcctcgctcg gacacagcca gggcgtcgtc tccgccgccg tcgttgccgc 1800 agccaacacc gaccgcgagc tgcgcaacct cgtggtctcc ggcctcgaat acatgtcaaa 1860 agtcggcatc gccgcccagc gcacgctcga ctacgagctc ggacgccgca acgccggccc 1920 ggagaccccg atgctcgctg tacagggaat ggacgaaaaa gtccttacca aggccttcaa 1980 ggccgccgtc tcgctctcca acgagaagca ggccatgatg gccaaaatct ccccacggcc 2040 gccgccgcca ccgccgcccc ggccgccgtt agcgacgaag atcgcttctc catcgccctc 2100 cgcaacggcc acgacgactt tgtcgtctgc ggcgagccca aggacctgcg cgtcctccgc 2160 aaggtcatcg agaaacagag cgccgagccc ggcaaggagg cacaggcgcg cacgcccttt 2220 tccaagcgca agcccgtcac ccagaccacc ttcctccgca tgacggccgt cttccacagc 2280 gctctcaaca aggacgccct cgcccagatc aacacatggg ccccggagtc cgcctttagc 2340 aaggccttcg cccaggcctc gctccgtgtt cccgtctttg acaccaagtc tggcgctaat 2400 ctgcaagatg ttcccgccgc cgatgttgtc gcccatctta ccaccaacat gctcactgag 2460 cgcgccgacg ttctcgtctc cctccgtgcc gccgagacca agaccgacgc cagccacctc 2520 ctctgtttaa acgttaacct gcattaatga atcggccaac gcgcggggag aggcggtttg 2580 cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg 2640 cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat 2700 aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc 2760 gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc 2820 tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga 2880 agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt 2940 ctcccttcgg gaagcgtggc gctttctcaa tgctcacgct gtaggtatct cagttcggtg 3000 taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc 3060 gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg 3120 gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc 3180 ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat ctgcgctctg 3240 ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc 3300 gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct 3360 caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga aaactcacgt 3420 taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct tttaaattaa 3480 aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccaa 3540 tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc catagttgcc 3600 tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg ccccagtgct 3660 gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat aaaccagcca 3720 gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat ccagtctatt 3780 aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg caacgttgtt 3840 gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc 3900 ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa agcggttagc 3960 tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc actcatggtt 4020 atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt ttctgtgact 4080 ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc 4140 ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt gctcatcatt 4200 ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag atccagttcg 4260 atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac cagcgtttct 4320 gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa 4380 tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca gggttattgt 4440 ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc 4500 acatttcccc gaaaagtgcc acctgacgtc taagaaacca ttattatcat gacattaacc 4560 tataaaaata ggcgtatcac gaggcccttt cgtctcgcgc gtttcggtga tgacggtgaa 4620 aacctctgac acatgcagct cccggagacg gtcacagctt gtctgtaagc ggatgccggg 4680 agcagacaag cccgtcaggg cgcgtcagcg ggtgttggcg ggtgtcgggg ctggcttaac 4740 tatgcggcat cagagcagat tgtactgaga gtgca 4775 <210> 7 <211> 5991 <212> DNA <213> Artificial Sequence <220> <223> pCL401 vector <220> <221> misc_feature <222> (1)..(1063) <223> EF-1_alpha_promoter <220> <221> misc_feature <222> (1076)..(1112) <223> Hammerhead_ribozyme <220> <221> misc_feature <222> (1113)..(1132) <223> carotene_synthase_target_1 <220> <221> misc_feature <222> (1133)..(1212) <223> gRNA <220> <221> misc_feature <222> (1213)..(1280) <223> HDV_ribozyme <220> <221> misc_feature <222> (1300)..(1939) <223> OrfC_terminator <220> <221> misc_feature <222> (4474)..(4922) <223> alpha_tubulin_promoter <220> <221> misc_feature <222> (4923)..(5717) <223> paromomycin_resistance_gene <220> <221> misc_feature <222> (5718)..(5991) <223> SV40_terminator <400> 7 ctcttatctg cctcgcgccg ttgaccgccg cttgactctt ggcgcttgcc gctcgcatcc 60 tgcctcgctc gcgcaggcgg gcgggcgagt gggtgggtcc gcagccttcc gcgctcgccc 120 gctagctcgc tcgcgccgtg ctgcagccag cagggcagca ccgcacggca ggcaggtccc 180 ggcgcggatc gatcgatcca tcgatccatc gatccatcga tcgtgcggtc aaaaagaaag 240 gaagaagaaa ggaaaaagaa aggcgtgcgc acccgagtgc gcgctgagcg cccgctcgcg 300 gtcccgcgga gcctccgcgt tagtccccgc cccgcgccgc gcagtccccc gggaggcatc 360 gcgcacctct cgccgccccc tcgcgcctcg ccgattcccc gcctcccctt ttccgcttct 420 tcgccgcctc cgctcgcggc cgcgtcgccc gcgccccgct ccctatctgc tccccagggg 480 ggcactccgc accttttgcg cccgctgccg ccgccgcggc cgccccgccg ccctggtttc 540 ccccgcgagc gcggccgcgt cgccgcgcaa agactcgccg cgtgccgccc cgagcaacgg 600 gtggcggcgg cgcggcggcg ggcggggcgc ggcggcgcgt aggcggggct aggcgccggc 660 taggcgaaac gccgcccccg ggcgccgccg ccgcccgctc cagagcagtc gccgcgccag 720 accgccaacg cagagaccga gaccgaggta cgtcgcgccc gagcacgccg cgacgcgcgg 780 cagggacgag gagcacgacg ccgcgccgcg ccgcgcgggg ggggggaggg agaggcagga 840 cgcgggagcg agcgtgcatg tttccgcgcg agacgacgcc gcgcgcgctg gagaggagat 900 aaggcgcttg gatcgcgaga gggccagcca ggctggaggc gaaaatgggt ggagaggata 960 gtatcttgcg tgcttggacg aggagactga cgaggaggac ggatacgtcg atgatgatgt 1020 gcacagagaa gaagcagttc gaaagcgact actagcaagc aagagatcta gcacgctgat 1080 gagtccgtga ggacgaaacg agtaagctcg tctgcggacg tcgtggacgc gcgttttaga 1140 gctagaaata gcaagttaaa ataaggctag tccgttatca acttgaaaaa gtggcaccga 1200 gtcggtgctt ttggccggca tggtcccagc ctcctcgctg gcgccggctg ggcaacatgc 1260 ttcggcatgg cgaatgggac catatgagtt atgagatccg aaagtgaacc ttgtcctaac 1320 ccgacagcga atggcgggag ggggcgggct aaaagatcgt attacatagt atttttcccc 1380 tactctttgt gtttgtcttt tttttttttt tgaacgcatt caagccactt gtctgggttt 1440 acttgtttgt ttgcttgctt gcttgcttgc ttgcctgctt cttggtcaga cggcccaaaa 1500 aagggaaaaa attcattcat ggcacagata agaaaaagaa aaagtttgtc gaccaccgtc 1560 atcagaaagc aagagaagag aaacactcgc gctcacattc tcgctcgcgt aagaatctta 1620 gccacgcata cgaagtaatt tgtccatctg gcgaatcttt acatgagcgt tttcaagctg 1680 gagcgtgaga tcataccttt cttgatcgta atgttccaac cttgcatagg cctcgttgcg 1740 atccgctagc aatgcgtcgt actcccgttg caactgcgcc atcgcctcat tgtgacgtga 1800 gttcagattc ttctcgagac cttcgagcgc tgctaatttc gcctgacgct ccttcttttg 1860 tgcttccatg acacgccgct tcaccgtgcg ttccacttct tcctcagaca tgcccttggc 1920 tgcctcgacc tgctcggtaa aacgggcccc agcacgtgct acgagatttc gattccaccg 1980 ccgccttcta tgaaaggttg ggcttcggaa tcgttttccg ggacgccggc tggatgatcc 2040 tccagcgcgg ggatctcatg ctggagttct tcgcccaccc caacttgttt attgcagctt 2100 ataatggtta caaataaagc aatagcatca caaatttcac aaataaagca tttttttcac 2160 tgcattctag ttgtggtttg tccaaactca tcaatgtatc ttatcataca tggtcgacct 2220 gcaggaacct gcattaatga atcggccaac gcgcggggag aggcggtttg cgtattgggc 2280 gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg 2340 tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa 2400 agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg 2460 cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga 2520 ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg 2580 tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg 2640 gaagcgtggc gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc 2700 gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg 2760 gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca 2820 ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt 2880 ggcctaacta cggctacact agaagaacag tatttggtat ctgcgctctg ctgaagccag 2940 ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg 3000 gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc 3060 ctttgatctt ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt 3120 tggtcatgag attatcaaaa aggatcttca cctagatcct tttaaattaa aaatgaagtt 3180 ttaaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca 3240 gtgaggcacc tatctcagcg atctgtctat ttcgttcatc catagttgcc tgactccccg 3300 tcgtgtagat aactacgata cgggagggct taccatctgg ccccagtgct gcaatgatac 3360 cgcgagaccc acgctcaccg gctccagatt tatcagcaat aaaccagcca gccggaaggg 3420 ccgagcgcag aagtggtcct gcaactttat ccgcctccat ccagtctatt aattgttgcc 3480 gggaagctag agtaagtagt tcgccagtta atagtttgcg caacgttgtt gccattgcta 3540 caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc ggttcccaac 3600 gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc 3660 ctccgatcgt tgtcagaagt aagttggccg cagtgttatc actcatggtt atggcagcac 3720 tgcataattc tcttactgtc atgccatccg taagatgctt ttctgtgact ggtgagtact 3780 caaccaagtc attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa 3840 tacgggataa taccgcgcca catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt 3900 cttcggggcg aaaactctca aggatcttac cgctgttgag atccagttcg atgtaaccca 3960 ctcgtgcacc caactgatct tcagcatctt ttactttcac cagcgtttct gggtgagcaa 4020 aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa tgttgaatac 4080 tcatactctt cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg 4140 gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc acatttcccc 4200 gaaaagtgcc acctgacgtc taagaaacca ttattatcat gacattaacc tataaaaata 4260 ggcgtatcac gaggcccttt cgtctcgcgc gtttcggtga tgacggtgaa aacctctgac 4320 acatgcagct cccggagacg gtcacagctt gtctgtaagc ggatgccggg agcagacaag 4380 cccgtcaggg cgcgtcagcg ggtgttggcg ggtgtcgggg ctggcttaac tatgcggcat 4440 cagagcagat tgtactgaga gtgcaccaag cttccaattt taggcccccc actgaccgag 4500 gtctgtcgat aatccacttt tccattgatt ttccaggttt cgttaactca tgccactgag 4560 caaaacttcg gtctttccta acaaaagctc tcctcacaaa gcatggcgcg gcaacggacg 4620 tgtcctcata ctccactgcc acacaaggtc gataaactaa gctcctcaca aatagaggag 4680 aattccactg acaactgaaa acaatgtatg agagacgatc accactggag cggcgcggcg 4740 gttgggcgcg gaggtcggca gcaaaaacaa gcgactcgcc gagcaaaccc gaatcagcct 4800 tcagacggtc gtgcctaaca acacgccgtt ctaccccgcc ttcttcgcgc cccttcgcgt 4860 ccaagcatcc ttcaagttta tctctctagt tcaacttcaa gaagaacaac accaccaaca 4920 ccatgattga acaagatgga ttgcaccgcag gttctccggc cgcttgggtg gagaggctat 4980 tcggctatga ctgggcacaa cagacaatcg gctgctctga tgccgccgtg ttccggctgt 5040 cagcgcaggg gcgcccggtt ctttttgtca agaccgacct gtccggtgcc ctgaatgaac 5100 tgcaggacga ggcagcgcgg ctatcgtggc tggccacgac gggcgttcct tgcgcagctg 5160 tgctcgacgt tgtcactgaa gcgggaaggg actggctgct attgggcgaa gtgccggggc 5220 aggatctcct gtcatctcac cttgctcctg ccgagaaagt atccatcatg gctgatgcaa 5280 tgcggcggct gcatacgctt gatccggcta cctgcccatt cgaccaccaa gcgaaacatc 5340 gcatcgagcg agcacgtact cggatggaag ccggtcttgt cgatcaggat gatctggacg 5400 aagagcatca ggggctcgcg ccagccgaac tgttcgccag gctcaaggcg cgcatgcccg 5460 acggcgatga tctcgtcgtg acccatggcg atgcctgctt gccgaatatc atggtggaaa 5520 atggccgctt ttctggattc atcgactgtg gccggctggg tgtggcggac cgctatcagg 5580 acatagcgtt ggctacccgt gatattgctg aagagcttgg cggcgaatgg gctgaccgct 5640 tcctcgtgct ttacggtatc gccgctcccg attcgcagcg catcgccttc tatcgccttc 5700 ttgacgagtt cttctgacac gtgctacgag atttcgattc caccgccgcc ttctatgaaa 5760 ggttgggctt cggaatcgtt ttccgggacg ccggctggat gatcctccag cgcggggatc 5820 tcatgctgga gttcttcgcc caccccaact tgtttattgc agcttataat ggttacaaat 5880 aaagcaatag catcacaaat ttcacaaata aagcattttt ttcactgcat tctagttgtg 5940 gtttgtccaa actcatcaat gtatcttatc atgtctgaat tcccggggta c 5991 <210> 8 <211> 7612 <212> DNA <213> Artificial Sequence <220> <223> pCL402 vector <220> <221> misc_feature <222> (5)..(1198) <223> FAS_upstream_fragment <220> <221> misc_feature <222> (1199)..(1647) <223> alpha_tubulin_promoter <220> <221> misc_feature <222> (1648)..(2442) <223> paromomycin_resistance_gene <220> <221> misc_feature <222> (2443)..(2716) <223> SV40_terminator <220> <221> misc_feature <222> (2717)..(3779) <223> EF-1_alpha_promoter <220> <221> misc_feature <222> (3792)..(3828) <223> Hammerhead_ribozyme <220> <221> misc_feature <222> (3829)..(3848) <223> carotene_synthase_target_1 <220> <221> misc_feature <222> (3849)..(3928) <223> gRNA <220> <221> misc_feature <222> (3929)..(3996) <223> HDV_ribozyme <220> <221> misc_feature <222> (4016)..(4534) <223> OrfC_terminator <220> <221> misc_feature <222> (4535)..(5360) <223> FAS_downstream_fragment <220> <221> misc_feature <222> (5365)..(7612) <223> pSP73 <400> 8 gtttaaactc gcggcgtctt cgccgtcgag gcgcgtcttt cgaggcgggc gaggtgtttt 60 tcttttcttt tcttctcgct gcagctgcgc cgcggcgaac gcagttcgcc gcggcggctg 120 cgacgcgcct gcgatgtcta tgcgcaggca aggcacgacg tcttgcggcg ccgcttcctg 180 cgccgccttg cgtcttggcc ccgccgacga cgcaagcagc ggcggcgccc ccccgccctc 240 ctccactgtg ggccgcagcc ctccttttcg cgcgcccgca gccgcgcggc gcgccccgcg 300 aacaaagagc cgccgcgccg gtccgcactg cgcgggccgc cccgcaagtg ccgcaaacgc 360 cggcccgaac cgccgcaaac gcgcccgcag ccgcgcccgc agccgcgcgc gacccgcggt 420 ggggacgcgc gccaagcgtc cccttccgcg ggatgacgta ggcggcggcc ccgcctatgc 480 aatacgggag gaaccaggaa ccgggagggg gggggggcgg cgcgcgcgcc gtccagtgcg 540 ggaccgatcg gcgccgggat gcccgggcgg gagggacaca gccaggcagt cagtcagtca 600 gccgcacaga gagagcgcgc ctgcgagtcc cgtctggtct cggaattgta tcccgcgcag 660 agctcagaat cgcaggtcga tcgatcgagc gatggatcca tcgctctatc cgtccatcga 720 tccatcgcat ccatcgcatc catcgcatcc atcgttgcat cgcttgcacc gcccgcttgc 780 atcgcgtgcg tgcgcaggcg ggcggcggcc acgacgcgac cgagagcggc ggcgagtgca 840 gacgccgccg gcgcccgcgg ctgcgtcgcc gcaggaagaa ggaggggggc gcgtgtttcc 900 cgcgggaggg aggagggagg gagggaggtg gttgggccaa aaagggcggc ctggacaggc 960 aggcaggccg gaagcgacgc cagcgagcga aggaagagga gagagccgcg cgggcggcca 1020 gcgcggggcg ggcggcagca aacccgctac tcagggtaaa agacagacag ctcttcgagc 1080 gagcttgttc acttcgcgga agcacgcgcg caggcacgca ggcacgcagg cagcatagcg 1140 agcagcagca gcatcgcgag cagcattgcg agaggaggcg ctgaccgccg gcctcgagcc 1200 aattttaggc cccccactga ccgaggtctg tcgataatcc acttttccat tgattttcca 1260 ggtttcgtta actcatgcca ctgagcaaaa cttcggtctt tcctaacaaa agctctcctc 1320 acaaagcatg gcgcggcaac ggacgtgtcc tcatactcca ctgccacaca aggtcgataa 1380 actaagctcc tcacaaatag aggagaattc cactgacaac tgaaaacaat gtatgagaga 1440 cgatcaccac tggagcggcg cggcggttgg gcgcggaggt cggcagcaaa aacaagcgac 1500 tcgccgagca aacccgaatc agccttcaga cggtcgtgcc taacaacacg ccgttctacc 1560 ccgccttctt cgcgcccctt cgcgtccaag catccttcaa gtttatctct ctagttcaac 1620 ttcaagaaga acaacaccac caacaccatg attgaacaag atggattgca cgcaggttct 1680 ccggccgctt gggtggagag gctattcggc tatgactggg cacaacagac aatcggctgc 1740 tctgatgccg ccgtgttccg gctgtcagcg caggggcgcc cggttctttt tgtcaagacc 1800 gacctgtccg gtgccctgaa tgaactgcag gacgaggcag cgcggctatc gtggctggcc 1860 acgacgggcg ttccttgcgc agctgtgctc gacgttgtca ctgaagcggg aagggactgg 1920 ctgctattgg gcgaagtgcc ggggcaggat ctcctgtcat ctcaccttgc tcctgccgag 1980 aaagtatcca tcatggctga tgcaatgcgg cggctgcata cgcttgatcc ggctacctgc 2040 ccattcgacc accaagcgaa acatcgcatc gagcgagcac gtactcggat ggaagccggt 2100 cttgtcgatc aggatgatct ggacgaagag catcaggggc tcgcgccagc cgaactgttc 2160 gccaggctca aggcgcgcat gcccgacggc gatgatctcg tcgtgaccca tggcgatgcc 2220 tgcttgccga atatcatggt ggaaaatggc cgcttttctg gattcatcga ctgtggccgg 2280 ctgggtgtgg cggaccgcta tcaggacata gcgttggcta cccgtgatat tgctgaagag 2340 cttggcggcg aatgggctga ccgcttcctc gtgctttacg gtatcgccgc tcccgattcg 2400 cagcgcatcg ccttctatcg ccttcttgac gagttcttct gacacgtgct acgagatttc 2460 gattccaccg ccgccttcta tgaaaggttg ggcttcggaa tcgttttccg ggacgccggc 2520 tggatgatcc tccagcgcgg ggatctcatg ctggagttct tcgcccaccc caacttgttt 2580 attgcagctt ataatggtta caaataaagc aatagcatca caaatttcac aaataaagca 2640 tttttttcac tgcattctag ttgtggtttg tccaaactca tcaatgtatc ttatcatgtc 2700 tgaattcccg gggtacctct tatctgcctc gcgccgttga ccgccgcttg actcttggcg 2760 cttgccgctc gcatcctgcc tcgctcgcgc aggcgggcgg gcgagtgggt gggtccgcag 2820 ccttccgcgc tcgcccgcta gctcgctcgc gccgtgctgc agccagcagg gcagcaccgc 2880 acggcaggca ggtcccggcg cggatcgatc gatccatcga tccatcgatc catcgatcgt 2940 gcggtcaaaa agaaaggaag aagaaaggaa aaagaaaggc gtgcgcaccc gagtgcgcgc 3000 tgagcgcccg ctcgcggtcc cgcggagcct ccgcgttagt ccccgccccg cgccgcgcag 3060 tccccccggga ggcatcgcgc acctctcgcc gccccctcgc gcctcgccga ttccccgcct 3120 ccccttttcc gcttcttcgc cgcctccgct cgcggccgcg tcgcccgcgc cccgctccct 3180 atctgctccc caggggggca ctccgcacct tttgcgcccg ctgccgccgc cgcggccgcc 3240 ccgccgccct ggtttccccc gcgagcgcgg ccgcgtcgcc gcgcaaagac tcgccgcgtg 3300 ccgccccgag caacgggtgg cggcggcgcg gcggcgggcg gggcgcggcg gcgcgtaggc 3360 ggggctaggc gccggctagg cgaaacgccg cccccgggcg ccgccgccgc ccgctccaga 3420 gcagtcgccg cgccagaccg ccaacgcaga gaccgagacc gaggtacgtc gcgcccgagc 3480 acgccgcgac gcgcggcagg gacgaggagc acgacgccgc gccgcgccgc gcgggggggg 3540 ggagggagag gcaggacgcg ggagcgagcg tgcatgtttc cgcgcgagac gacgccgcgc 3600 gcgctggaga ggagataagg cgcttggatc gcgagagggc cagccaggct ggaggcgaaa 3660 atgggtggag aggatagtat cttgcgtgct tggacgagga gactgacgag gaggacggat 3720 acgtcgatga tgatgtgcac agagaagaag cagttcgaaa gcgactacta gcaagcaaga 3780 gatctagcac gctgatgagt ccgtgaggac gaaacgagta agctcgtctg cggacgtcgt 3840 ggacgcgcgt tttagagcta gaaatagcaa gttaaaataa ggctagtccg ttatcaactt 3900 gaaaaagtgg caccgagtcg gtgcttttgg ccggcatggt cccagcctcc tcgctggcgc 3960 cggctgggca acatgcttcg gcatggcgaa tgggaccata tgagttatga gatccgaaag 4020 tgaaccttgt cctaacccga cagcgaatgg cgggagggg cgggctaaaa gatcgtatta 4080 catagtattt ttcccctact ctttgtgttt gtcttttttt tttttttgaa cgcattcaag 4140 ccacttgtct gggtttactt gtttgtttgc ttgcttgctt gcttgcttgc ctgcttcttg 4200 gtcagacggc ccaaaaaagg gaaaaaattc attcatggca cagataagaa aaagaaaaag 4260 tttgtcgacc accgtcatca gaaagcaaga gaagagaaac actcgcgctc acattctcgc 4320 tcgcgtaaga atcttagcca cgcatacgaa gtaatttgtc catctggcga atctttacat 4380 gagcgttttc aagctggagc gtgagatcat acctttcttg atcgtaatgt tccaaccttg 4440 cataggcctc gttgcgatcc gctagcaatg cgtcgtactc ccgttgcaac tgcgccatcg 4500 cctcattgtg acgtgagttc agattcttct cgagaaactc ggcacctgcc atgaagacgc 4560 cctccaaaag ggctgggtca aggcctcgct cggacacagc cagggcgtcg tctccgccgc 4620 cgtcgttgcc gcagccaaca ccgaccgcga gctgcgcaac ctcgtggtct ccggcctcga 4680 atacatgtca aaagtcggca tcgccgccca gcgcacgctc gactacgagc tcggacgccg 4740 caacgccggc ccggagaccc cgatgctcgc tgtacaggga atggacgaaa aagtccttac 4800 caaggccttc aaggccgccg tctcgctctc caacgagaag caggccatga tggccaaaat 4860 ctccccacgg ccgccgccgc caccgccgcc ccggccgccg ttagcgacga agatcgcttc 4920 tccatcgccc tccgcaacgg ccacgacgac tttgtcgtct gcggcgagcc caaggacctg 4980 cgcgtcctcc gcaaggtcat cgagaaacag agcgccgagc ccggcaagga ggcacaggcg 5040 cgcacgccct tttccaagcg caagcccgtc acccagacca ccttcctccg catgacggcc 5100 gtcttccaca gcgctctcaa caaggacgcc ctcgcccaga tcaacacatg ggccccggag 5160 tccgccttta gcaaggcctt cgcccaggcc tcgctccgtg ttcccgtctt tgacaccaag 5220 tctggcgcta atctgcaaga tgttcccgcc gccgatgttg tcgcccatct taccaccaac 5280 atgctcactg agcgcgccga cgttctcgtc tccctccgtg ccgccgagac caagaccgac 5340 gccagccacc tcctctgttt aaacgttaac ctgcattaat gaatcggcca acgcgcgggg 5400 agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg 5460 gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca 5520 gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac 5580 cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac 5640 aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg 5700 tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac 5760 ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc aatgctcacg ctgtaggtat 5820 ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag 5880 cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac 5940 ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt 6000 gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt 6060 atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc 6120 aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga 6180 aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac 6240 gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc 6300 cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct 6360 gacagttacc aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca 6420 tccatagttg cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct 6480 ggccccagtg ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca 6540 ataaaccagc cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc 6600 atccagtcta ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg 6660 cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct 6720 tcattcagct ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa 6780 aaagcggtta gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta 6840 tcactcatgg ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc 6900 ttttctgtga ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg 6960 agttgctctt gcccggcgtc aatacgggat aataccgcgc cacatagcag aactttaaaa 7020 gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg 7080 agatccagtt cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc 7140 accagcgttt ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg 7200 gcgacacgga aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat 7260 cagggttatt gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata 7320 ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg tctaagaaac cattattatc 7380 atgacattaa cctataaaaa taggcgtatc acgaggccct ttcgtctcgc gcgtttcggt 7440 gatgacggtg aaaacctctg acacatgcag ctcccggaga cggtcacagc ttgtctgtaa 7500 gcggatgccg ggagcagaca agcccgtcag ggcgcgtcag cgggtgttgg cgggtgtcgg 7560 ggctggctta actatgcggc atcagagcag attgtactga gagtgcacca tg 7612 <210> 9 <211> 7609 <212> DNA <213> Artificial Sequence <220> <223> pYB36 vector <220> <221> misc_feature <222> (2)..(1192) <223> FAS_upstream_fragment <220> <221> misc_feature <222> (1200)..(1648) <223> alpha_tubulin_promoter <220> <221> misc_feature <222> (1679)..(2443) <223> paromomycin_resistance_gene <220> <221> misc_feature <222> (2444)..(2717) <223> SV40_terminator <220> <221> misc_feature <222> (2718)..(3780) <223> EF-1_alpha_promoter <220> <221> misc_feature <222> (3797)..(3829) <223> Hammerhead_ribozyme <220> <221> misc_feature <222> (3830)..(3849) <223> carotene_synthase_target_1 <220> <221> misc_feature <222> (3850)..(3929) <223> gRNA <220> <221> misc_feature <222> (3930)..(3997) <223> HDV_ribozyme <220> <221> misc_feature <222> (4017)..(4535) <223> OrfC_terminator <220> <221> misc_feature <222> (4536)..(5368) <223> FAS_downstream_fragment <220> <221> misc_feature <222> (6367)..(7227) <223> AmpR_gene <400> 9 gtttaaactc gcggcgtctt cgccgtcgag gcgcgtcttt cgaggcgggc gaggtgtttt 60 tcttttcttt tcttctcgct gcagctgcgc cgcggcgaac gcagttcgcc gcggcggctg 120 cgacgcgcct gcgatgtcta tgcgcaggca aggcacgacg tcttgcggcg ccgcttcctg 180 cgccgccttg cgtcttggcc ccgccgacga cgcaagcagc ggcggcgccc ccccgccctc 240 ctccactgtg ggccgcagcc ctccttttcg cgcgcccgca gccgcgcggc gcgccccgcg 300 aacaaagagc cgccgcgccg gtccgcactg cgcgggccgc cccgcaagtg ccgcaaacgc 360 cggcccgaac cgccgcaaac gcgcccgcag ccgcgcccgc agccgcgcgc gacccgcggt 420 ggggacgcgc gccaagcgtc cccttccgcg ggatgacgta ggcggcggcc ccgcctatgc 480 aatacgggag gaaccaggaa ccgggagggg gggggggcgg cgcgcgcgcc gtccagtgcg 540 ggaccgatcg gcgccgggat gcccgggcgg gagggacaca gccaggcagt cagtcagtca 600 gccgcacaga gagagcgcgc ctgcgagtcc cgtctggtct cggaattgta tcccgcgcag 660 agctcagaat cgcaggtcga tcgatcgagc gatggatcca tcgctctatc cgtccatcga 720 tccatcgcat ccatcgcatc catcgcatcc atcgttgcat cgcttgcacc gcccgcttgc 780 atcgcgtgcg tgcgcaggcg ggcggcggcc acgacgcgac cgagagcggc ggcgagtgca 840 gacgccgccg gcgcccgcgg ctgcgtcgcc gcaggaagaa ggaggggggc gcgtgtttcc 900 cgcgggaggg aggagggagg gagggaggtg gttgggccaa aaagggcggc ctggacaggc 960 aggcaggccg gaagcgacgc cagcgagcga aggaagagga gagagccgcg cgggcggcca 1020 gcgcggggcg ggcggcagca aacccgctac tcagggtaaa agacagacag ctcttcgagc 1080 gagcttgttc acttcgcgga agcacgcgcg caggcacgca ggcacgcagg cagcatagcg 1140 agcagcagca gcatcgcgag cagcattgcg agaggaggcg ctgaccgccg gccaagcttc 1200 caattttagg ccccccactg accgaggtct gtcgataatc cacttttcca ttgattttcc 1260 aggtttcgtt aactcatgcc actgagcaaa acttcggtct ttcctaacaa aagctctcct 1320 cacaaagcat ggcgcggcaa cggacgtgtc ctcatactcc actgccacac aaggtcgata 1380 aactaagctc ctcacaaata gaggagaatt ccactgacaa ctgaaaacaa tgtatgagag 1440 acgatcacca ctggagcggc gcggcggttg ggcgcggagg tcggcagcaa aaacaagcga 1500 ctcgccgagc aaacccgaat cagccttcag acggtcgtgc ctaacaacac gccgttctac 1560 cccgccttct tcgcgcccct tcgcgtccaa gcatccttca agtttatctc tctagttcaa 1620 cttcaagaag aacaacacca ccaacaccat gattgaacaa gatggattgc acgcaggttc 1680 tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga caatcggctg 1740 ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt ttgtcaagac 1800 cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat cgtggctggc 1860 cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg gaagggactg 1920 gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg ctcctgccga 1980 gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc cggctacctg 2040 cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga tggaagccgg 2100 tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag ccgaactgtt 2160 cgccaggctc aaggcgcgca tgcccgacgg cgatgatctc gtcgtgaccc atggcgatgc 2220 ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg actgtggccg 2280 gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata ttgctgaaga 2340 gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg ctcccgattc 2400 gcagcgcatc gccttctatc gccttcttga cgagttcttc tgacacgtgc tacgagattt 2460 cgattccacc gccgccttct atgaaaggtt gggcttcgga atcgttttcc gggacgccgg 2520 ctggatgatc ctccagcgcg gggatctcat gctggagttc ttcgcccacc ccaacttgtt 2580 tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc 2640 atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt 2700 ctgaattccc ggggtacctc ttatctgcct cgcgccgttg accgccgctt gactcttggc 2760 gcttgccgct cgcatcctgc ctcgctcgcg caggcgggcg ggcgagtggg tgggtccgca 2820 gccttccgcg ctcgcccgct agctcgctcg cgccgtgctg cagccagcag ggcagcaccg 2880 cacggcaggc aggtcccggc gcggatcgat cgatccatcg atccatcgat ccatcgatcg 2940 tgcggtcaaa aagaaaggaa gaagaaagga aaaagaaagg cgtgcgcacc cgagtgcgcg 3000 ctgagcgccc gctcgcggtc ccgcggagcc tccgcgttag tccccgcccc gcgccgcgca 3060 gtcccccggg aggcatcgcg cacctctcgc cgccccctcg cgcctcgccg attccccgcc 3120 tccccttttc cgcttcttcg ccgcctccgc tcgcggccgc gtcgcccgcg ccccgctccc 3180 tatctgctcc ccaggggggc actccgcacc ttttgcgccc gctgccgccg ccgcggccgc 3240 cccgccgccc tggtttcccc cgcgagcgcg gccgcgtcgc cgcgcaaaga ctcgccgcgt 3300 gccgccccga gcaacgggtg gcggcggcgc ggcggcgggc ggggcgcggc ggcgcgtagg 3360 cggggctagg cgccggctag gcgaaacgcc gcccccgggc gccgccgccg cccgctccag 3420 agcagtcgcc gcgccagacc gccaacgcag agaccgagac cgaggtacgt cgcgcccgag 3480 cacgccgcga cgcgcggcag ggacgaggag cacgacgccg cgccgcgccg cgcggggggg 3540 gggagggaga ggcaggacgc gggagcgagc gtgcatgttt ccgcgcgaga cgacgccgcg 3600 cgcgctggag aggagataag gcgcttggat cgcgagaggg ccagccaggc tggaggcgaa 3660 aatgggtgga gaggatagta tcttgcgtgc ttggacgagg agactgacga ggaggacgga 3720 tacgtcgatg atgatgtgca cagagaagaa gcagttcgaa agcgactact agcaagcaag 3780 agatcttccg cactgatgag tccgtgagga cgaaacgagt aagctcgtct gcggacgtcg 3840 tggacgcgcg ttttagagct agaaatagca agttaaaata aggctagtcc gttatcaact 3900 tgaaaaagtg gcaccgagtc ggtgcttttg gccggcatgg tcccagcctc ctcgctggcg 3960 ccggctgggc aacatgcttc ggcatggcga atgggaccat atgagttatg agatccgaaa 4020 gtgaaccttg tcctaacccg acagcgaatg gcgggagggg gcgggctaaa agatcgtatt 4080 acatagtatt tttcccctac tctttgtgtt tgtctttttt ttttttttga acgcattcaa 4140 gccacttgtc tgggtttact tgtttgtttg cttgcttgct tgcttgcttg cctgcttctt 4200 ggtcagacgg cccaaaaaag ggaaaaaatt cattcatggc acagataaga aaaagaaaaa 4260 gtttgtcgac caccgtcatc agaaagcaag agaagagaaa cactcgcgct cacattctcg 4320 ctcgcgtaag aatcttagcc acgcatacga agtaatttgt ccatctggcg aatctttaca 4380 tgagcgtttt caagctggag cgtgagatca tacctttctt gatcgtaatg ttccaacctt 4440 gcataggcct cgttgcgatc cgctagcaat gcgtcgtact cccgttgcaa ctgcgccatc 4500 gcctcattgt gacgtgagtt cagattcttc tcgagaaact cggcacctgc catgaagacg 4560 ccctccaaaa gggctgggtc aaggcctcgc tcggacacag ccagggcgtc gtctccgccg 4620 ccgtcgttgc cgcagccaac accgaccgcg agctgcgcaa cctcgtggtc tccggcctcg 4680 aatacatgtc aaaagtcggc atcgccgccc agcgcacgct cgactacgag ctcggacgcc 4740 gcaacgccgg cccggagacc ccgatgctcg ctgtacaggg aatggacgaa aaagtcctta 4800 ccaaggcctt caaggccgcc gtctcgctct ccaacgagaa gcaggccatg atggccaaaa 4860 tctccccacg gccgccgccg ccaccgccgc cccggccgcc gttagcgacg aagatcgctt 4920 ctccatcgcc ctccgcaacg gccacgacga ctttgtcgtc tgcggcgagc ccaaggacct 4980 gcgcgtcctc cgcaaggtca tcgagaaaca gagcgccgag cccggcaagg aggcacaggc 5040 gcgcacgccc ttttccaagc gcaagcccgt cacccagacc accttcctcc gcatgacggc 5100 cgtcttccac agcgctctca acaaggacgc cctcgcccag atcaacacat gggccccgga 5160 gtccgccttt agcaaggcct tcgcccaggc ctcgctccgt gttcccgtct ttgacaccaa 5220 gtctggcgct aatctgcaag atgttcccgc cgccgatgtt gtcgcccatc ttaccaccaa 5280 catgctcact gagcgcgccg acgttctcgt ctccctccgt gccgccgaga ccaagaccga 5340 cgccagccac ctcctctgtt taaacgttaa cctgcattaa tgaatcggcc aacgcgcggg 5400 gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 5460 ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac 5520 agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 5580 ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 5640 caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 5700 gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 5760 cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta 5820 tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 5880 gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 5940 cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 6000 tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagaa cagtatttgg 6060 tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 6120 caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 6180 aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 6240 cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 6300 ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 6360 tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 6420 atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc 6480 tggccccagt gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc 6540 aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc 6600 catccagtct attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt 6660 gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc 6720 ttcattcagc tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa 6780 aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt 6840 atcactcatg gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg 6900 cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc 6960 gagttgctct tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa 7020 agtgctcatc attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt 7080 gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt 7140 caccagcgtt tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag 7200 ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta 7260 tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat 7320 aggggttccg cgcacatttc cccgaaaagt gccacctgac gtctaagaaa ccattattat 7380 catgacatta acctataaaa ataggcgtat cacgaggccc tttcgtctcg cgcgtttcgg 7440 tgatgacggt gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta 7500 agcggatgcc gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg 7560 gggctggctt aactatgcgg catcagagca gattgtactg agagtgcac 7609 <210> 10 <211> 7609 <212> DNA <213> Artificial Sequence <220> <223> pYB37 vector <220> <221> misc_feature <222> (2)..(1192) <223> FAS_upstream_fragment <220> <221> misc_feature <222> (1200)..(1648) <223> alpha_tubulin_promoter <220> <221> misc_feature <222> (1679)..(2443) <223> paromomycin_resistance_gene <220> <221> misc_feature <222> (2444)..(2717) <223> SV40_terminator <220> <221> misc_feature <222> (2718)..(3780) <223> EF-1_alpha_promoter <220> <221> misc_feature <222> (3797)..(3829) <223> Hammerhead_ribozyme <220> <221> misc_feature <222> (3830)..(3849) <223> carotene_synthase_target_2 <220> <221> misc_feature <222> (3850)..(3929) <223> gRNA <220> <221> misc_feature <222> (3930)..(3997) <223> HDV_ribozyme <220> <221> misc_feature <222> (4017)..(4535) <223> OrfC_terminator <220> <221> misc_feature <222> (4536)..(5368) <223> FAS_downstream_fragment <220> <221> misc_feature <222> (6367)..(7227) <223> AmpR_gene <400> 10 gtttaaactc gcggcgtctt cgccgtcgag gcgcgtcttt cgaggcgggc gaggtgtttt 60 tcttttcttt tcttctcgct gcagctgcgc cgcggcgaac gcagttcgcc gcggcggctg 120 cgacgcgcct gcgatgtcta tgcgcaggca aggcacgacg tcttgcggcg ccgcttcctg 180 cgccgccttg cgtcttggcc ccgccgacga cgcaagcagc ggcggcgccc ccccgccctc 240 ctccactgtg ggccgcagcc ctccttttcg cgcgcccgca gccgcgcggc gcgccccgcg 300 aacaaagagc cgccgcgccg gtccgcactg cgcgggccgc cccgcaagtg ccgcaaacgc 360 cggcccgaac cgccgcaaac gcgcccgcag ccgcgcccgc agccgcgcgc gacccgcggt 420 ggggacgcgc gccaagcgtc cccttccgcg ggatgacgta ggcggcggcc ccgcctatgc 480 aatacgggag gaaccaggaa ccgggagggg gggggggcgg cgcgcgcgcc gtccagtgcg 540 ggaccgatcg gcgccgggat gcccgggcgg gagggacaca gccaggcagt cagtcagtca 600 gccgcacaga gagagcgcgc ctgcgagtcc cgtctggtct cggaattgta tcccgcgcag 660 agctcagaat cgcaggtcga tcgatcgagc gatggatcca tcgctctatc cgtccatcga 720 tccatcgcat ccatcgcatc catcgcatcc atcgttgcat cgcttgcacc gcccgcttgc 780 atcgcgtgcg tgcgcaggcg ggcggcggcc acgacgcgac cgagagcggc ggcgagtgca 840 gacgccgccg gcgcccgcgg ctgcgtcgcc gcaggaagaa ggaggggggc gcgtgtttcc 900 cgcgggaggg aggagggagg gagggaggtg gttgggccaa aaagggcggc ctggacaggc 960 aggcaggccg gaagcgacgc cagcgagcga aggaagagga gagagccgcg cgggcggcca 1020 gcgcggggcg ggcggcagca aacccgctac tcagggtaaa agacagacag ctcttcgagc 1080 gagcttgttc acttcgcgga agcacgcgcg caggcacgca ggcacgcagg cagcatagcg 1140 agcagcagca gcatcgcgag cagcattgcg agaggaggcg ctgaccgccg gccaagcttc 1200 caattttagg ccccccactg accgaggtct gtcgataatc cacttttcca ttgattttcc 1260 aggtttcgtt aactcatgcc actgagcaaa acttcggtct ttcctaacaa aagctctcct 1320 cacaaagcat ggcgcggcaa cggacgtgtc ctcatactcc actgccacac aaggtcgata 1380 aactaagctc ctcacaaata gaggagaatt ccactgacaa ctgaaaacaa tgtatgagag 1440 acgatcacca ctggagcggc gcggcggttg ggcgcggagg tcggcagcaa aaacaagcga 1500 ctcgccgagc aaacccgaat cagccttcag acggtcgtgc ctaacaacac gccgttctac 1560 cccgccttct tcgcgcccct tcgcgtccaa gcatccttca agtttatctc tctagttcaa 1620 cttcaagaag aacaacacca ccaacaccat gattgaacaa gatggattgc acgcaggttc 1680 tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga caatcggctg 1740 ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt ttgtcaagac 1800 cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat cgtggctggc 1860 cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg gaagggactg 1920 gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg ctcctgccga 1980 gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc cggctacctg 2040 cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga tggaagccgg 2100 tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag ccgaactgtt 2160 cgccaggctc aaggcgcgca tgcccgacgg cgatgatctc gtcgtgaccc atggcgatgc 2220 ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg actgtggccg 2280 gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata ttgctgaaga 2340 gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg ctcccgattc 2400 gcagcgcatc gccttctatc gccttcttga cgagttcttc tgacacgtgc tacgagattt 2460 cgattccacc gccgccttct atgaaaggtt gggcttcgga atcgttttcc gggacgccgg 2520 ctggatgatc ctccagcgcg gggatctcat gctggagttc ttcgcccacc ccaacttgtt 2580 tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc 2640 atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt 2700 ctgaattccc ggggtacctc ttatctgcct cgcgccgttg accgccgctt gactcttggc 2760 gcttgccgct cgcatcctgc ctcgctcgcg caggcgggcg ggcgagtggg tgggtccgca 2820 gccttccgcg ctcgcccgct agctcgctcg cgccgtgctg cagccagcag ggcagcaccg 2880 cacggcaggc aggtcccggc gcggatcgat cgatccatcg atccatcgat ccatcgatcg 2940 tgcggtcaaa aagaaaggaa gaagaaagga aaaagaaagg cgtgcgcacc cgagtgcgcg 3000 ctgagcgccc gctcgcggtc ccgcggagcc tccgcgttag tccccgcccc gcgccgcgca 3060 gtcccccggg aggcatcgcg cacctctcgc cgccccctcg cgcctcgccg attccccgcc 3120 tccccttttc cgcttcttcg ccgcctccgc tcgcggccgc gtcgcccgcg ccccgctccc 3180 tatctgctcc ccaggggggc actccgcacc ttttgcgccc gctgccgccg ccgcggccgc 3240 cccgccgccc tggtttcccc cgcgagcgcg gccgcgtcgc cgcgcaaaga ctcgccgcgt 3300 gccgccccga gcaacgggtg gcggcggcgc ggcggcgggc ggggcgcggc ggcgcgtagg 3360 cggggctagg cgccggctag gcgaaacgcc gcccccgggc gccgccgccg cccgctccag 3420 agcagtcgcc gcgccagacc gccaacgcag agaccgagac cgaggtacgt cgcgcccgag 3480 cacgccgcga cgcgcggcag ggacgaggag cacgacgccg cgccgcgccg cgcggggggg 3540 gggagggaga ggcaggacgc gggagcgagc gtgcatgttt ccgcgcgaga cgacgccgcg 3600 cgcgctggag aggagataag gcgcttggat cgcgagaggg ccagccaggc tggaggcgaa 3660 aatgggtgga gaggatagta tcttgcgtgc ttggacgagg agactgacga ggaggacgga 3720 tacgtcgatg atgatgtgca cagagaagaa gcagttcgaa agcgactact agcaagcaag 3780 agatcttccg cactgatgag tccgtgagga cgaaacgagt aagctcgtcg cagaccttaa 3840 gttcgacgcg ttttagagct agaaatagca agttaaaata aggctagtcc gttatcaact 3900 tgaaaaagtg gcaccgagtc ggtgcttttg gccggcatgg tcccagcctc ctcgctggcg 3960 ccggctgggc aacatgcttc ggcatggcga atgggaccat atgagttatg agatccgaaa 4020 gtgaaccttg tcctaacccg acagcgaatg gcgggagggg gcgggctaaa agatcgtatt 4080 acatagtatt tttcccctac tctttgtgtt tgtctttttt ttttttttga acgcattcaa 4140 gccacttgtc tgggtttact tgtttgtttg cttgcttgct tgcttgcttg cctgcttctt 4200 ggtcagacgg cccaaaaaag ggaaaaaatt cattcatggc acagataaga aaaagaaaaa 4260 gtttgtcgac caccgtcatc agaaagcaag agaagagaaa cactcgcgct cacattctcg 4320 ctcgcgtaag aatcttagcc acgcatacga agtaatttgt ccatctggcg aatctttaca 4380 tgagcgtttt caagctggag cgtgagatca tacctttctt gatcgtaatg ttccaacctt 4440 gcataggcct cgttgcgatc cgctagcaat gcgtcgtact cccgttgcaa ctgcgccatc 4500 gcctcattgt gacgtgagtt cagattcttc tcgagaaact cggcacctgc catgaagacg 4560 ccctccaaaa gggctgggtc aaggcctcgc tcggacacag ccagggcgtc gtctccgccg 4620 ccgtcgttgc cgcagccaac accgaccgcg agctgcgcaa cctcgtggtc tccggcctcg 4680 aatacatgtc aaaagtcggc atcgccgccc agcgcacgct cgactacgag ctcggacgcc 4740 gcaacgccgg cccggagacc ccgatgctcg ctgtacaggg aatggacgaa aaagtcctta 4800 ccaaggcctt caaggccgcc gtctcgctct ccaacgagaa gcaggccatg atggccaaaa 4860 tctccccacg gccgccgccg ccaccgccgc cccggccgcc gttagcgacg aagatcgctt 4920 ctccatcgcc ctccgcaacg gccacgacga ctttgtcgtc tgcggcgagc ccaaggacct 4980 gcgcgtcctc cgcaaggtca tcgagaaaca gagcgccgag cccggcaagg aggcacaggc 5040 gcgcacgccc ttttccaagc gcaagcccgt cacccagacc accttcctcc gcatgacggc 5100 cgtcttccac agcgctctca acaaggacgc cctcgcccag atcaacacat gggccccgga 5160 gtccgccttt agcaaggcct tcgcccaggc ctcgctccgt gttcccgtct ttgacaccaa 5220 gtctggcgct aatctgcaag atgttcccgc cgccgatgtt gtcgcccatc ttaccaccaa 5280 catgctcact gagcgcgccg acgttctcgt ctccctccgt gccgccgaga ccaagaccga 5340 cgccagccac ctcctctgtt taaacgttaa cctgcattaa tgaatcggcc aacgcgcggg 5400 gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 5460 ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac 5520 agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 5580 ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 5640 caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 5700 gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 5760 cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta 5820 tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 5880 gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 5940 cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 6000 tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagaa cagtatttgg 6060 tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 6120 caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 6180 aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 6240 cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 6300 ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 6360 tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 6420 atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc 6480 tggccccagt gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc 6540 aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc 6600 catccagtct attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt 6660 gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc 6720 ttcattcagc tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa 6780 aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt 6840 atcactcatg gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg 6900 cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc 6960 gagttgctct tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa 7020 agtgctcatc attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt 7080 gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt 7140 caccagcgtt tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag 7200 ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta 7260 tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat 7320 aggggttccg cgcacatttc cccgaaaagt gccacctgac gtctaagaaa ccattattat 7380 catgacatta acctataaaa ataggcgtat cacgaggccc tttcgtctcg cgcgtttcgg 7440 tgatgacggt gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta 7500 agcggatgcc gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg 7560 gggctggctt aactatgcgg catcagagca gattgtactg agagtgcac 7609 <210> 11 <211> 7609 <212> DNA <213> Artificial Sequence <220> <223> pYB38 vector <220> <221> misc_feature <222> (2)..(1192) <223> FAS_upstream_fragment <220> <221> misc_feature <222> (1200)..(1648) <223> alpha_tubulin_promoter <220> <221> misc_feature <222> (1679)..(2443) <223> paromomycin_resistance_gene <220> <221> misc_feature <222> (2444)..(2717) <223> SV40_terminator <220> <221> misc_feature <222> (2718)..(3780) <223> EF-1_alpha_promoter <220> <221> misc_feature <222> (3797)..(3829) <223> Hammerhead_ribozyme <220> <221> misc_feature <222> (3830)..(3849) <223> carotene_synthase_target_3 <220> <221> misc_feature <222> (3850)..(3929) <223> gRNA <220> <221> misc_feature <222> (3930)..(3997) <223> HDV_ribozyme <220> <221> misc_feature <222> (4017)..(4535) <223> OrfC_terminator <220> <221> misc_feature <222> (4536)..(5368) <223> FAS_downstream_fragment <220> <221> misc_feature <222> (6367)..(7227) <223> AmpR_gene <400> 11 gtttaaactc gcggcgtctt cgccgtcgag gcgcgtcttt cgaggcgggc gaggtgtttt 60 tcttttcttt tcttctcgct gcagctgcgc cgcggcgaac gcagttcgcc gcggcggctg 120 cgacgcgcct gcgatgtcta tgcgcaggca aggcacgacg tcttgcggcg ccgcttcctg 180 cgccgccttg cgtcttggcc ccgccgacga cgcaagcagc ggcggcgccc ccccgccctc 240 ctccactgtg ggccgcagcc ctccttttcg cgcgcccgca gccgcgcggc gcgccccgcg 300 aacaaagagc cgccgcgccg gtccgcactg cgcgggccgc cccgcaagtg ccgcaaacgc 360 cggcccgaac cgccgcaaac gcgcccgcag ccgcgcccgc agccgcgcgc gacccgcggt 420 ggggacgcgc gccaagcgtc cccttccgcg ggatgacgta ggcggcggcc ccgcctatgc 480 aatacgggag gaaccaggaa ccgggagggg gggggggcgg cgcgcgcgcc gtccagtgcg 540 ggaccgatcg gcgccgggat gcccgggcgg gagggacaca gccaggcagt cagtcagtca 600 gccgcacaga gagagcgcgc ctgcgagtcc cgtctggtct cggaattgta tcccgcgcag 660 agctcagaat cgcaggtcga tcgatcgagc gatggatcca tcgctctatc cgtccatcga 720 tccatcgcat ccatcgcatc catcgcatcc atcgttgcat cgcttgcacc gcccgcttgc 780 atcgcgtgcg tgcgcaggcg ggcggcggcc acgacgcgac cgagagcggc ggcgagtgca 840 gacgccgccg gcgcccgcgg ctgcgtcgcc gcaggaagaa ggaggggggc gcgtgtttcc 900 cgcgggaggg aggagggagg gagggaggtg gttgggccaa aaagggcggc ctggacaggc 960 aggcaggccg gaagcgacgc cagcgagcga aggaagagga gagagccgcg cgggcggcca 1020 gcgcggggcg ggcggcagca aacccgctac tcagggtaaa agacagacag ctcttcgagc 1080 gagcttgttc acttcgcgga agcacgcgcg caggcacgca ggcacgcagg cagcatagcg 1140 agcagcagca gcatcgcgag cagcattgcg agaggaggcg ctgaccgccg gccaagcttc 1200 caattttagg ccccccactg accgaggtct gtcgataatc cacttttcca ttgattttcc 1260 aggtttcgtt aactcatgcc actgagcaaa acttcggtct ttcctaacaa aagctctcct 1320 cacaaagcat ggcgcggcaa cggacgtgtc ctcatactcc actgccacac aaggtcgata 1380 aactaagctc ctcacaaata gaggagaatt ccactgacaa ctgaaaacaa tgtatgagag 1440 acgatcacca ctggagcggc gcggcggttg ggcgcggagg tcggcagcaa aaacaagcga 1500 ctcgccgagc aaacccgaat cagccttcag acggtcgtgc ctaacaacac gccgttctac 1560 cccgccttct tcgcgcccct tcgcgtccaa gcatccttca agtttatctc tctagttcaa 1620 cttcaagaag aacaacacca ccaacaccat gattgaacaa gatggattgc acgcaggttc 1680 tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga caatcggctg 1740 ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt ttgtcaagac 1800 cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat cgtggctggc 1860 cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg gaagggactg 1920 gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg ctcctgccga 1980 gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc cggctacctg 2040 cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga tggaagccgg 2100 tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag ccgaactgtt 2160 cgccaggctc aaggcgcgca tgcccgacgg cgatgatctc gtcgtgaccc atggcgatgc 2220 ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg actgtggccg 2280 gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata ttgctgaaga 2340 gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg ctcccgattc 2400 gcagcgcatc gccttctatc gccttcttga cgagttcttc tgacacgtgc tacgagattt 2460 cgattccacc gccgccttct atgaaaggtt gggcttcgga atcgttttcc gggacgccgg 2520 ctggatgatc ctccagcgcg gggatctcat gctggagttc ttcgcccacc ccaacttgtt 2580 tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc 2640 atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt 2700 ctgaattccc ggggtacctc ttatctgcct cgcgccgttg accgccgctt gactcttggc 2760 gcttgccgct cgcatcctgc ctcgctcgcg caggcgggcg ggcgagtggg tgggtccgca 2820 gccttccgcg ctcgcccgct agctcgctcg cgccgtgctg cagccagcag ggcagcaccg 2880 cacggcaggc aggtcccggc gcggatcgat cgatccatcg atccatcgat ccatcgatcg 2940 tgcggtcaaa aagaaaggaa gaagaaagga aaaagaaagg cgtgcgcacc cgagtgcgcg 3000 ctgagcgccc gctcgcggtc ccgcggagcc tccgcgttag tccccgcccc gcgccgcgca 3060 gtcccccggg aggcatcgcg cacctctcgc cgccccctcg cgcctcgccg attccccgcc 3120 tccccttttc cgcttcttcg ccgcctccgc tcgcggccgc gtcgcccgcg ccccgctccc 3180 tatctgctcc ccaggggggc actccgcacc ttttgcgccc gctgccgccg ccgcggccgc 3240 cccgccgccc tggtttcccc cgcgagcgcg gccgcgtcgc cgcgcaaaga ctcgccgcgt 3300 gccgccccga gcaacgggtg gcggcggcgc ggcggcgggc ggggcgcggc ggcgcgtagg 3360 cggggctagg cgccggctag gcgaaacgcc gcccccgggc gccgccgccg cccgctccag 3420 agcagtcgcc gcgccagacc gccaacgcag agaccgagac cgaggtacgt cgcgcccgag 3480 cacgccgcga cgcgcggcag ggacgaggag cacgacgccg cgccgcgccg cgcggggggg 3540 gggagggaga ggcaggacgc gggagcgagc gtgcatgttt ccgcgcgaga cgacgccgcg 3600 cgcgctggag aggagataag gcgcttggat cgcgagaggg ccagccaggc tggaggcgaa 3660 aatgggtgga gaggatagta tcttgcgtgc ttggacgagg agactgacga ggaggacgga 3720 tacgtcgatg atgatgtgca cagagaagaa gcagttcgaa agcgactact agcaagcaag 3780 agatcttccg cactgatgag tccgtgagga cgaaacgagt aagctcgtcc cgactcgtcg 3840 accgtctcag ttttagagct agaaatagca agttaaaata aggctagtcc gttatcaact 3900 tgaaaaagtg gcaccgagtc ggtgcttttg gccggcatgg tcccagcctc ctcgctggcg 3960 ccggctgggc aacatgcttc ggcatggcga atgggaccat atgagttatg agatccgaaa 4020 gtgaaccttg tcctaacccg acagcgaatg gcgggagggg gcgggctaaa agatcgtatt 4080 acatagtatt tttcccctac tctttgtgtt tgtctttttt ttttttttga acgcattcaa 4140 gccacttgtc tgggtttact tgtttgtttg cttgcttgct tgcttgcttg cctgcttctt 4200 ggtcagacgg cccaaaaaag ggaaaaaatt cattcatggc acagataaga aaaagaaaaa 4260 gtttgtcgac caccgtcatc agaaagcaag agaagagaaa cactcgcgct cacattctcg 4320 ctcgcgtaag aatcttagcc acgcatacga agtaatttgt ccatctggcg aatctttaca 4380 tgagcgtttt caagctggag cgtgagatca tacctttctt gatcgtaatg ttccaacctt 4440 gcataggcct cgttgcgatc cgctagcaat gcgtcgtact cccgttgcaa ctgcgccatc 4500 gcctcattgt gacgtgagtt cagattcttc tcgagaaact cggcacctgc catgaagacg 4560 ccctccaaaa gggctgggtc aaggcctcgc tcggacacag ccagggcgtc gtctccgccg 4620 ccgtcgttgc cgcagccaac accgaccgcg agctgcgcaa cctcgtggtc tccggcctcg 4680 aatacatgtc aaaagtcggc atcgccgccc agcgcacgct cgactacgag ctcggacgcc 4740 gcaacgccgg cccggagacc ccgatgctcg ctgtacaggg aatggacgaa aaagtcctta 4800 ccaaggcctt caaggccgcc gtctcgctct ccaacgagaa gcaggccatg atggccaaaa 4860 tctccccacg gccgccgccg ccaccgccgc cccggccgcc gttagcgacg aagatcgctt 4920 ctccatcgcc ctccgcaacg gccacgacga ctttgtcgtc tgcggcgagc ccaaggacct 4980 gcgcgtcctc cgcaaggtca tcgagaaaca gagcgccgag cccggcaagg aggcacaggc 5040 gcgcacgccc ttttccaagc gcaagcccgt cacccagacc accttcctcc gcatgacggc 5100 cgtcttccac agcgctctca acaaggacgc cctcgcccag atcaacacat gggccccgga 5160 gtccgccttt agcaaggcct tcgcccaggc ctcgctccgt gttcccgtct ttgacaccaa 5220 gtctggcgct aatctgcaag atgttcccgc cgccgatgtt gtcgcccatc ttaccaccaa 5280 catgctcact gagcgcgccg acgttctcgt ctccctccgt gccgccgaga ccaagaccga 5340 cgccagccac ctcctctgtt taaacgttaa cctgcattaa tgaatcggcc aacgcgcggg 5400 gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 5460 ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac 5520 agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 5580 ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 5640 caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 5700 gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 5760 cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta 5820 tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 5880 gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 5940 cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 6000 tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagaa cagtatttgg 6060 tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 6120 caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 6180 aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 6240 cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 6300 ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 6360 tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 6420 atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc 6480 tggccccagt gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc 6540 aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc 6600 catccagtct attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt 6660 gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc 6720 ttcattcagc tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa 6780 aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt 6840 atcactcatg gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg 6900 cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc 6960 gagttgctct tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa 7020 agtgctcatc attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt 7080 gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt 7140 caccagcgtt tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag 7200 ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta 7260 tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat 7320 aggggttccg cgcacatttc cccgaaaagt gccacctgac gtctaagaaa ccattattat 7380 catgacatta acctataaaa ataggcgtat cacgaggccc tttcgtctcg cgcgtttcgg 7440 tgatgacggt gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta 7500 agcggatgcc gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg 7560 gggctggctt aactatgcgg catcagagca gattgtactg agagtgcac 7609 <210> 12 <211> 7608 <212> DNA <213> Artificial Sequence <220> <223> pYB39 vector <220> <221> misc_feature <222> (2)..(1192) <223> FAS_upstream_fragment <220> <221> misc_feature <222> (1200)..(1648) <223> alpha_tubulin_promoter <220> <221> misc_feature <222> (1679)..(2443) <223> paromomycin_resistance_gene <220> <221> misc_feature <222> (2444)..(2717) <223> SV40_terminator <220> <221> misc_feature <222> (2718)..(3780) <223> EF-1_alpha_promoter <220> <221> misc_feature <222> (3792)..(3828) <223> Hammerhead_ribozyme <220> <221> misc_feature <222> (3829)..(3848) <223> carotene_synthase_target_4 <220> <221> misc_feature <222> (3848)..(3928) <223> gRNA <220> <221> misc_feature <222> (3929)..(3996) <223> HDV_ribozyme <220> <221> misc_feature <222> (4016)..(4534) <223> OrfC_terminator <220> <221> misc_feature <222> (4535)..(5367) <223> FAS_downstream_fragment <220> <221> misc_feature <222> (6366)..(7226) <223> AmpR_gene <400> 12 gtttaaactc gcggcgtctt cgccgtcgag gcgcgtcttt cgaggcgggc gaggtgtttt 60 tcttttcttt tcttctcgct gcagctgcgc cgcggcgaac gcagttcgcc gcggcggctg 120 cgacgcgcct gcgatgtcta tgcgcaggca aggcacgacg tcttgcggcg ccgcttcctg 180 cgccgccttg cgtcttggcc ccgccgacga cgcaagcagc ggcggcgccc ccccgccctc 240 ctccactgtg ggccgcagcc ctccttttcg cgcgcccgca gccgcgcggc gcgccccgcg 300 aacaaagagc cgccgcgccg gtccgcactg cgcgggccgc cccgcaagtg ccgcaaacgc 360 cggcccgaac cgccgcaaac gcgcccgcag ccgcgcccgc agccgcgcgc gacccgcggt 420 ggggacgcgc gccaagcgtc cccttccgcg ggatgacgta ggcggcggcc ccgcctatgc 480 aatacgggag gaaccaggaa ccgggagggg gggggggcgg cgcgcgcgcc gtccagtgcg 540 ggaccgatcg gcgccgggat gcccgggcgg gagggacaca gccaggcagt cagtcagtca 600 gccgcacaga gagagcgcgc ctgcgagtcc cgtctggtct cggaattgta tcccgcgcag 660 agctcagaat cgcaggtcga tcgatcgagc gatggatcca tcgctctatc cgtccatcga 720 tccatcgcat ccatcgcatc catcgcatcc atcgttgcat cgcttgcacc gcccgcttgc 780 atcgcgtgcg tgcgcaggcg ggcggcggcc acgacgcgac cgagagcggc ggcgagtgca 840 gacgccgccg gcgcccgcgg ctgcgtcgcc gcaggaagaa ggaggggggc gcgtgtttcc 900 cgcgggaggg aggagggagg gagggaggtg gttgggccaa aaagggcggc ctggacaggc 960 aggcaggccg gaagcgacgc cagcgagcga aggaagagga gagagccgcg cgggcggcca 1020 gcgcggggcg ggcggcagca aacccgctac tcagggtaaa agacagacag ctcttcgagc 1080 gagcttgttc acttcgcgga agcacgcgcg caggcacgca ggcacgcagg cagcatagcg 1140 agcagcagca gcatcgcgag cagcattgcg agaggaggcg ctgaccgccg gccaagcttc 1200 caattttagg ccccccactg accgaggtct gtcgataatc cacttttcca ttgattttcc 1260 aggtttcgtt aactcatgcc actgagcaaa acttcggtct ttcctaacaa aagctctcct 1320 cacaaagcat ggcgcggcaa cggacgtgtc ctcatactcc actgccacac aaggtcgata 1380 aactaagctc ctcacaaata gaggagaatt ccactgacaa ctgaaaacaa tgtatgagag 1440 acgatcacca ctggagcggc gcggcggttg ggcgcggagg tcggcagcaa aaacaagcga 1500 ctcgccgagc aaacccgaat cagccttcag acggtcgtgc ctaacaacac gccgttctac 1560 cccgccttct tcgcgcccct tcgcgtccaa gcatccttca agtttatctc tctagttcaa 1620 cttcaagaag aacaacacca ccaacaccat gattgaacaa gatggattgc acgcaggttc 1680 tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga caatcggctg 1740 ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt ttgtcaagac 1800 cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat cgtggctggc 1860 cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg gaagggactg 1920 gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg ctcctgccga 1980 gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc cggctacctg 2040 cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga tggaagccgg 2100 tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag ccgaactgtt 2160 cgccaggctc aaggcgcgca tgcccgacgg cgatgatctc gtcgtgaccc atggcgatgc 2220 ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg actgtggccg 2280 gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata ttgctgaaga 2340 gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg ctcccgattc 2400 gcagcgcatc gccttctatc gccttcttga cgagttcttc tgacacgtgc tacgagattt 2460 cgattccacc gccgccttct atgaaaggtt gggcttcgga atcgttttcc gggacgccgg 2520 ctggatgatc ctccagcgcg gggatctcat gctggagttc ttcgcccacc ccaacttgtt 2580 tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc 2640 atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt 2700 ctgaattccc ggggtacctc ttatctgcct cgcgccgttg accgccgctt gactcttggc 2760 gcttgccgct cgcatcctgc ctcgctcgcg caggcgggcg ggcgagtggg tgggtccgca 2820 gccttccgcg ctcgcccgct agctcgctcg cgccgtgctg cagccagcag ggcagcaccg 2880 cacggcaggc aggtcccggc gcggatcgat cgatccatcg atccatcgat ccatcgatcg 2940 tgcggtcaaa aagaaaggaa gaagaaagga aaaagaaagg cgtgcgcacc cgagtgcgcg 3000 ctgagcgccc gctcgcggtc ccgcggagcc tccgcgttag tccccgcccc gcgccgcgca 3060 gtcccccggg aggcatcgcg cacctctcgc cgccccctcg cgcctcgccg attccccgcc 3120 tccccttttc cgcttcttcg ccgcctccgc tcgcggccgc gtcgcccgcg ccccgctccc 3180 tatctgctcc ccaggggggc actccgcacc ttttgcgccc gctgccgccg ccgcggccgc 3240 cccgccgccc tggtttcccc cgcgagcgcg gccgcgtcgc cgcgcaaaga ctcgccgcgt 3300 gccgccccga gcaacgggtg gcggcggcgc ggcggcgggc ggggcgcggc ggcgcgtagg 3360 cggggctagg cgccggctag gcgaaacgcc gcccccgggc gccgccgccg cccgctccag 3420 agcagtcgcc gcgccagacc gccaacgcag agaccgagac cgaggtacgt cgcgcccgag 3480 cacgccgcga cgcgcggcag ggacgaggag cacgacgccg cgccgcgccg cgcggggggg 3540 gggagggaga ggcaggacgc gggagcgagc gtgcatgttt ccgcgcgaga cgacgccgcg 3600 cgcgctggag aggagataag gcgcttggat cgcgagaggg ccagccaggc tggaggcgaa 3660 aatgggtgga gaggatagta tcttgcgtgc ttggacgagg agactgacga ggaggacgga 3720 tacgtcgatg atgatgtgca cagagaagaa gcagttcgaa agcgactact agcaagcaag 3780 agatcttctt cctgatgagt ccgtgaggac gaaacgagta agctcgtcga agaacatgta 3840 ctcttcaagt tttagagcta gaaatagcaa gttaaaataa ggctagtccg ttatcaactt 3900 gaaaaagtgg caccgagtcg gtgcttttgg ccggcatggt cccagcctcc tcgctggcgc 3960 cggctgggca acatgcttcg gcatggcgaa tgggaccata tgagttatga gatccgaaag 4020 tgaaccttgt cctaacccga cagcgaatgg cgggagggg cgggctaaaa gatcgtatta 4080 catagtattt ttcccctact ctttgtgttt gtcttttttt tttttttgaa cgcattcaag 4140 ccacttgtct gggtttactt gtttgtttgc ttgcttgctt gcttgcttgc ctgcttcttg 4200 gtcagacggc ccaaaaaagg gaaaaaattc attcatggca cagataagaa aaagaaaaag 4260 tttgtcgacc accgtcatca gaaagcaaga gaagagaaac actcgcgctc acattctcgc 4320 tcgcgtaaga atcttagcca cgcatacgaa gtaatttgtc catctggcga atctttacat 4380 gagcgttttc aagctggagc gtgagatcat acctttcttg atcgtaatgt tccaaccttg 4440 cataggcctc gttgcgatcc gctagcaatg cgtcgtactc ccgttgcaac tgcgccatcg 4500 cctcattgtg acgtgagttc agattcttct cgagaaactc ggcacctgcc atgaagacgc 4560 cctccaaaag ggctgggtca aggcctcgct cggacacagc cagggcgtcg tctccgccgc 4620 cgtcgttgcc gcagccaaca ccgaccgcga gctgcgcaac ctcgtggtct ccggcctcga 4680 atacatgtca aaagtcggca tcgccgccca gcgcacgctc gactacgagc tcggacgccg 4740 caacgccggc ccggagaccc cgatgctcgc tgtacaggga atggacgaaa aagtccttac 4800 caaggccttc aaggccgccg tctcgctctc caacgagaag caggccatga tggccaaaat 4860 ctccccacgg ccgccgccgc caccgccgcc ccggccgccg ttagcgacga agatcgcttc 4920 tccatcgccc tccgcaacgg ccacgacgac tttgtcgtct gcggcgagcc caaggacctg 4980 cgcgtcctcc gcaaggtcat cgagaaacag agcgccgagc ccggcaagga ggcacaggcg 5040 cgcacgccct tttccaagcg caagcccgtc acccagacca ccttcctccg catgacggcc 5100 gtcttccaca gcgctctcaa caaggacgcc ctcgcccaga tcaacacatg ggccccggag 5160 tccgccttta gcaaggcctt cgcccaggcc tcgctccgtg ttcccgtctt tgacaccaag 5220 tctggcgcta atctgcaaga tgttcccgcc gccgatgttg tcgcccatct taccaccaac 5280 atgctcactg agcgcgccga cgttctcgtc tccctccgtg ccgccgagac caagaccgac 5340 gccagccacc tcctctgttt aaacgttaac ctgcattaat gaatcggcca acgcgcgggg 5400 agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg 5460 gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca 5520 gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac 5580 cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac 5640 aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg 5700 tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac 5760 ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat 5820 ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag 5880 cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac 5940 ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt 6000 gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaagaac agtatttggt 6060 atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc 6120 aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga 6180 aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac 6240 gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc 6300 cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct 6360 gacagttacc aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca 6420 tccatagttg cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct 6480 ggccccagtg ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca 6540 ataaaccagc cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc 6600 atccagtcta ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg 6660 cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct 6720 tcattcagct ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa 6780 aaagcggtta gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta 6840 tcactcatgg ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc 6900 ttttctgtga ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg 6960 agttgctctt gcccggcgtc aatacgggat aataccgcgc cacatagcag aactttaaaa 7020 gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg 7080 agatccagtt cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc 7140 accagcgttt ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg 7200 gcgacacgga aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat 7260 cagggttatt gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata 7320 ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg tctaagaaac cattattatc 7380 atgacattaa cctataaaaa taggcgtatc acgaggccct ttcgtctcgc gcgtttcggt 7440 gatgacggtg aaaacctctg acacatgcag ctcccggaga cggtcacagc ttgtctgtaa 7500 gcggatgccg ggagcagaca agcccgtcag ggcgcgtcag cgggtgttgg cgggtgtcgg 7560 ggctggctta actatgcggc atcagagcag attgtactga gagtgcac 7608 <210> 13 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> 12 Tub seq F primer <400> 13 ggatctcatg ctggagttct tc 22 <210> 14 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> pYB32/3C R1 primer <400> 14 gtacttctcg tggtaggcaa cc 22 <210> 15 <211> 40 <212> DNA <213> Artificial Sequence <220> <223> CS pro Kpn IF F1 primer <400> 15 gtctgaattc ccggggtacc gagcgggcga ttccaccgtc 40 <210> 16 <211> 42 <212> DNA <213> Artificial Sequence <220> <223> CS pro BamH IF R1 primer <400> 16 gtacttctta tccatggatc cctcggtctc cgagcgagcg ag 42 <210> 17 <211> 41 <212> DNA <213> Artificial Sequence <220> <223> CS pro BamH IF F2 primer <400> 17 tcgctcgctc ggagaccgag ggatccatgg ataagaagta c 41 <210> 18 <211> 51 <212> DNA <213> Artificial Sequence <220> <223> CS pro Nde IF R2 primer <400> 18 gattcactag tttagatcat atgttagacc ttgcgcttct tcttagggtc c 51 <210> 19 <211> 19 <212> DNA <213> Artificial Sequence <220> <223> O A1-KO F primer <400> 19 ccaagttcgc caaggcttc 19 <210> 20 <211> 23 <212> DNA <213> Artificial Sequence <220> <223> pYB32/3 SV40 R1 primer <400> 20 gtggaatcga aatctcgtag cac 23 <210> 21 <211> 21 <212> DNA <213> Artificial Sequence <220> <223> O A1-KO R primer <400> 21 gctgttgcaa ctttgctcca c 21 <210> 22 <211> 23 <212> DNA <213> Artificial Sequence <220> <223> pYB32/3C F1 primer <400> 22 gttaagaaga ccgaggtcca gac 23 <210> 23 <211> 32 <212> DNA <213> Artificial Sequence <220> <223> 5' FAS PmeNde primer <400> 23 tagcatatgt ttaaactcgc ggcgtctttc gc 32 <210> 24 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> 3' FAS PmeHpa primer <400> 24 agttaacgtt taaacagagg aggtggctgg c 31 <210> 25 <211> 38 <212> DNA <213> Artificial Sequence <220> <223> pCL402 IF F primer <400> 25 gaggcgctga ccgccggcca agcttccaat tttaggcc 38 <210> 26 <211> 40 <212> DNA <213> Artificial Sequence <220> <223> pCL402 IF R primer <400> 26 gcaggtgccg agtttctcga gaagaatctg aactcacgtc 40 <210> 27 <211> 21 <212> DNA <213> Artificial Sequence <220> <223> pYB36 CS1 F primer <400> 27 gagtcgaagg agacgttgtc g 21 <210> 28 <211> 23 <212> DNA <213> Artificial Sequence <220> <223> pYB36 CS1 R primer <400> 28 gtcattgcga atgatgcgat atg 23 <210> 29 <211> 23 <212> DNA <213> Artificial Sequence <220> <223> pYB36 CS3 R primer <400> 29 ggtcatcatg gaatacaacg cag 23 <210> 30 <211> 26 <212> DNA <213> Artificial Sequence <220> <223> pYB36 CS4 F primer <400> 30 cgagctcatt tgtgctacac tctatg 26 <210> 31 <211> 23 <212> DNA <213> Artificial Sequence <220> <223> pYB36 CS4 R primer <400> 31 cacaagatt gcaggattga tgc 23 <210> 32 <211> 5562 <212> DNA <213> Artificial Sequence <220> <223> pYB30 vector (5562 bp) <220> <221> misc_feature <222> (2)..(450) <223> Alpha_Tubulin_promoter <220> <221> misc_feature <222> (457)..(549) <223> Sec1_secretion_sequence <220> <221> misc_feature <222> (550)..(1287) <223> eGFP <220> <221> misc_feature <222> (1307)..(1946) <223> OrfC_terminator <220> <221> misc_feature <222> (4481)..(4929) <223> Alpha_tubulin_promoter <220> <221> misc_feature <222> (4930)..(5304) <223> Sh_ble_gene <220> <221> misc_feature <222> (5305)..(5562) <223> SV40_terminator <400> 32 cccaatttta ggccccccac tgaccgaggt ctgtcgataa tccacttttc cattgatttt 60 ccaggtttcg ttaactcatg ccactgagca aaacttcggt ctttcctaac aaaagctctc 120 ctcacaaagc atggcgcggc aacggacgtg tcctcatact ccactgccac acaaggtcga 180 taaactaagc tcctcacaaa tagaggagaa ttccactgac aactgaaaac aatgtatgag 240 agacgatcac cactggagcg gcgcggcggt tgggcgcgga ggtcggcagc aaaaacaagc 300 gactcgccga gcaaacccga atcagccttc agacggtcgt gcctaacaac acgccgttct 360 accccgcctt cttcgcgccc cttcgcgtcc aagcatcctt caagtttatc tctctagttc 420 aacttcaaga agaacaacac caccaacacc ggatccatga agttcgcgac ctcggtcgca 480 attttgcttg tggccaacat agccaccgcc ctcgcgcaga gcgatggctg cacccccacc 540 gaccagacga tggtgagcaa gggcgaggag ctgttcaccg gggtggtgcc catcctggtc 600 gagctggacg gcgacgtaaa cggccacaag ttcagcgtgt ccggcgaggg cgagggcgat 660 gccacctacg gcaagctgac cctgaagttc atctgcacca ccggcaagct gcccgtgccc 720 tggcccaccc tcgtgaccac cctgacctac ggcgtgcagt gcttcagccg ctaccccgac 780 cacatgaagc agcacgactt cttcaagtcc gccatgcccg aaggctacgt ccaggagcgc 840 accatcttct tcaaggacga cggcaactac aagacccgcg ccgaggtgaa gttcgagggc 900 gacaccctgg tgaaccgcat cgagctgaag ggcatcgact tcaaggagga cggcaacatc 960 ctgggacaca agctggagta caactacaac agccacaacg tctatatcat ggccgacaag 1020 cagaagaacg gcatcaaggt gaacttcaag atccgccaca acatcgagga cggcagcgtg 1080 cagctcgccg accactacca gcagaacacc cccatcggcg acggccccgt gctgctgccc 1140 gacaaccact acctgagcac ccagtccgcc ctgagcaaag accccaacga gaagcgcgat 1200 cacatggtcc tgctggagtt cgtgaccgcc gccgggatca ctctcggcat ggacgagctg 1260 tacaagcacc accatcacca ccactaacat atgagttatg agatccgaaa gtgaaccttg 1320 tcctaacccg acagcgaatg gcgggagggg gcgggctaaa agatcgtatt acatagtatt 1380 tttcccctac tctttgtgtt tgtctttttt ttttttttga acgcattcaa gccacttgtc 1440 tgggtttact tgtttgtttg cttgcttgct tgcttgcttg cctgcttctt ggtcagacgg 1500 cccaaaaaag ggaaaaaatt cattcatggc acagataaga aaaagaaaaa gtttgtcgac 1560 caccgtcatc agaaagcaag agaagagaaa cactcgcgct cacatctcg ctcgcgtaag 1620 aatcttagcc acgcatacga agtaatttgt ccatctggcg aatctttaca tgagcgtttt 1680 caagctggag cgtgagatca tacctttctt gatcgtaatg ttccaacctt gcataggcct 1740 cgttgcgatc cgctagcaat gcgtcgtact cccgttgcaa ctgcgccatc gcctcattgt 1800 gacgtgagtt cagattcttc tcgagacctt cgagcgctgc taatttcgcc tgacgctcct 1860 tcttttgtgc ttccatgaca cgccgcttca ccgtgcgttc cacttcttcc tcagacatgc 1920 ccttggctgc ctcgacctgc tcggtaaaac gggccccagc acgtgctacg agatttcgat 1980 tccaccgccg ccttctatga aaggttgggc ttcggaatcg ttttccggga cgccggctgg 2040 atgatcctcc agcgcgggga tctcatgctg gagttcttcg cccaccccaa cttgtttatt 2100 gcagcttata atggttacaa ataaagcaat agcatcacaa atttcacaaa taaagcattt 2160 ttttcactgc attctagttg tggtttgtcc aaactcatca atgtatctta tcatacatgg 2220 tcgacctgca ggaacctgca ttaatgaatc ggccaacgcg cggggagagg cggtttgcgt 2280 attgggcgct cttccgcttc ctcgctcact gactcgctgc gctcggtcgt tcggctgcgg 2340 cgagcggtat cagctcactc aaaggcggta atacggttat ccacagaatc aggggataac 2400 gcaggaaaga acatgtgagc aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg 2460 ttgctggcgt ttttccatag gctccgcccc cctgacgagc atcacaaaaa tcgacgctca 2520 agtcagaggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc ccctggaagc 2580 tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg gatacctgtc cgcctttctc 2640 ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta ggtatctcag ttcggtgtag 2700 gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc 2760 ttatccggta actatcgtct tgagtccaac ccggtaagac acgacttatc gccactggca 2820 gcagccactg gtaacaggat tagcagagcg aggtatgtag gcggtgctac agagttcttg 2880 aagtggtggc ctaactacgg ctacactaga agaacagtat ttggtatctg cgctctgctg 2940 aagccagtta ccttcggaaa aagagttggt agctcttgat ccggcaaaca aaccaccgct 3000 ggtagcggtg gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa 3060 gaagatcctt tgatcttttc tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa 3120 gggattttgg tcatgagatt atcaaaaagg atcttcacct agatcctttt aaattaaaaa 3180 tgaagtttta aatcaatcta aagtatatat gagtaaactt ggtctgacag ttaccaatgc 3240 ttaatcagtg aggcacctat ctcagcgatc tgtctatttc gttcatccat agttgcctga 3300 ctccccgtcg tgtagataac tacgatacgg gagggcttac catctggccc cagtgctgca 3360 atgataccgc gagacccacg ctcaccggct ccagatttat cagcaataaa ccagccagcc 3420 ggaagggccg agcgcagaag tggtcctgca actttatccg cctccatcca gtctattaat 3480 tgttgccggg aagctagagt aagtagttcg ccagttaata gtttgcgcaa cgttgttgcc 3540 attgctacag gcatcgtggt gtcacgctcg tcgtttggta tggcttcatt cagctccggt 3600 tcccaacgat caaggcgagt tacatgatcc cccatgttgt gcaaaaaagc ggttagctcc 3660 ttcggtcctc cgatcgttgt cagaagtaag ttggccgcag tgttatcact catggttatg 3720 gcagcactgc ataattctct tactgtcatg ccatccgtaa gatgcttttc tgtgactggt 3780 gagtactcaa ccaagtcatt ctgagaatag tgtatgcggc gaccgagttg ctcttgcccg 3840 gcgtcaatac gggataatac cgcgccacat agcagaactt taaaagtgct catcattgga 3900 aaacgttctt cggggcgaaa actctcaagg atcttaccgc tgttgagatc cagttcgatg 3960 taacccactc gtgcacccaa ctgatcttca gcatctttta ctttcaccag cgtttctggg 4020 tgagcaaaaa caggaaggca aaatgccgca aaaaagggaa taagggcgac acggaaatgt 4080 tgaatactca tactcttcct ttttcaatat tattgaagca tttatcaggg ttattgtctc 4140 atgagcggat acatatttga atgtatttag aaaaataaac aaataggggt tccgcgcaca 4200 tttccccgaa aagtgccacc tgacgtctaa gaaaccatta ttatcatgac attaacctat 4260 aaaaataggc gtatcacgag gccctttcgt ctcgcgcgtt tcggtgatga cggtgaaaac 4320 ctctgacaca tgcagctccc ggagacggtc acagcttgtc tgtaagcgga tgccgggagc 4380 agacaagccc gtcagggcgc gtcagcgggt gttggcgggt gtcggggctg gcttaactat 4440 gcggcatcag agcagattgt actgagagtg caccaagctt ccaattttag gccccccact 4500 gaccgaggtc tgtcgataat ccacttttcc attgattttc caggtttcgt taactcatgc 4560 cactgagcaa aacttcggtc tttcctaaca aaagctctcc tcacaaagca tggcgcggca 4620 acggacgtgt cctcatactc cactgccaca caaggtcgat aaactaagct cctcacaaat 4680 agaggagaat tccactgaca actgaaaaca atgtatgaga gacgatcacc actggagcgg 4740 cgcggcggtt gggcgcggag gtcggcagca aaaacaagcg actcgccgag caaacccgaa 4800 tcagccttca gacggtcgtg cctaacaaca cgccgttcta ccccgccttc ttcgcgcccc 4860 ttcgcgtcca agcatccttc aagtttatct ctctagttca acttcaagaa gaacaacacc 4920 accaacacca tggccaagtt gaccagtgcc gttccggtgc tcaccgcgcg cgacgtcgcc 4980 ggagcggtcg agttctggac cgaccggctc gggttctccc gggacttcgt ggaggacgac 5040 ttcgccggtg tggtccggga cgacgtgacc ctgttcatca gcgcggtcca ggaccaggtg 5100 gtgccggaca acaccctggc ctgggtgtgg gtgcgcggcc tggacgagct gtacgccgag 5160 tggtcggagg tcgtgtccac gaacttccgg gacgcctccg ggccggccat gaccgagatc 5220 ggcgagcagc cgtgggggcg ggagttcgcc ctgcgcgacc cggccggcaa ctgcgtgcac 5280 ttcgtggccg aggagcagga ctgacacgtg ctacgagatt tcgattccac cgccgccttc 5340 tatgaaaggt tgggcttcgg aatcgttttc cgggacgccg gctggatgat cctccagcgc 5400 ggggatctca tgctggagtt cttcgcccac cccaacttgt ttattgcagc ttataatggt 5460 tacaaataaa gcaatagcat cacaaatttc acaaataaag catttttttc actgcattct 5520 agttgtggtt tgtccaaact catcaatgta tcttatcggt ac 5562 <210> 33 <211> 8887 <212> DNA <213> Artificial Sequence <220> <223> pYB61 vector (8887 bp) <220> <221> misc_feature <222> (2)..(450) <223> Alpha_tubulin_promoter <220> <221> misc_feature <222> (457)..(4596) <223> Cas9 <220> <221> misc_feature <222> (4616)..(5255) <223> OrfC_terminator <220> <221> misc_feature <222> (6540)..(7400) <223> Ampicillin_resistance_gene <220> <221> misc_feature <222> (7790)..(8238) <223> Alpha_tubulin_promoter <220> <221> misc_feature <222> (8239)..(8613) <223> Sh_ble_gene <220> <221> misc_feature <222> (8614)..(8887) <223> SV40_terminator <400> 33 cccaatttta ggccccccac tgaccgaggt ctgtcgataa tccacttttc cattgatttt 60 ccaggtttcg ttaactcatg ccactgagca aaacttcggt ctttcctaac aaaagctctc 120 ctcacaaagc atggcgcggc aacggacgtg tcctcatact ccactgccac acaaggtcga 180 taaactaagc tcctcacaaa tagaggagaa ttccactgac aactgaaaac aatgtatgag 240 agacgatcac cactggagcg gcgcggcggt tgggcgcgga ggtcggcagc aaaaacaagc 300 gactcgccga gcaaacccga atcagccttc agacggtcgt gcctaacaac acgccgttct 360 accccgcctt cttcgcgccc cttcgcgtcc aagcatcctt caagtttatc tctctagttc 420 aacttcaaga agaacaacac caccaacacc ggatccatgg ataagaagta ctcgatcggc 480 ctcgacattg gcaccaacag cgtcggctgg gccgtcatta ctgatgagta caaggtcccg 540 tcgaagaagt ttaaggtcct cggcaacact gaccgccact ccatcaagaa gaacctcatc 600 ggtgccctcc tttttgactc cggcgagacc gctgaggcca ctcgcctcaa gcgcactgcc 660 cgccgccgtt acacccgccg caagaaccgc atctgctacc tccaggagat tttctcgaac 720 gaaatggcca aggtcgatga ctcctttttc caccgtctcg aagaatcgtt cctcgtcgag 780 gaggacaaga agcacgagcg ccaccccatc ttcggtaaca ttgtcgatga ggttgcctac 840 cacgagaagt acccgaccat ctaccacctc cgcaagaagc tcgtcgactc caccgacaag 900 gccgatctcc gccttatcta cctcgccctc gcccacatga tcaagttccg cggccacttt 960 cttatcgagg gtgatctcaa ccctgataac tctgacgtcg acaagctttt catccagctc 1020 gtccagactt acaaccagct cttcgaggag aaccccatca acgcttccgg cgtcgacgcg 1080 aaggccattc tcagcgcccg cctcagcaag tcccgccgcc tcgaaaacct cattgcccag 1140 cttcccggcg agaagaagaa cggcctcttc ggcaacctca ttgccctcag ccttggcctc 1200 acccctaact tcaagtcgaa ctttgacctc gccgaggacg ccaagctcca gctttccaag 1260 gacacttacg acgacgatct cgacaacctc ctcgctcaga ttggcgacca gtacgctgac 1320 ctcttcctcg ccgccaagaa ccttagcgat gccatcctcc tctccgacat ccttcgtgtt 1380 aacacggaaa tcacgaaggc tccgctctcc gcctccatga tcaagcgtta cgacgagcac 1440 catcaggacc tcaccctcct caaggccctc gtccgccagc agctccccga gaagtacaag 1500 gagatcttct tcgaccagag caagaacggc tacgccggct acatgacgg cggcgcgtcg 1560 caggaggagt tttacaagtt tatcaagccc attcttgaga agatggacgg caccgaggag 1620 ctcctcgtca agctcaaccg tgaggacctt ctccgcaagc agcgcacgtt cgacaacggc 1680 tctattcccc atcagatcca cctcggtgag cttcacgcga ttcttcgccg ccaggaagac 1740 ttttacccgt tcctcaagga caaccgcgag aagattgaga agatcctcac ctttcgcatt 1800 ccctactacg tcggccccct cgcccgcggc aactcgcgct ttgcttggat gacccgcaag 1860 tccgaggaga ccatcacccc gtggaacttc gaagaggtcg tcgacaaggg cgcctccgcg 1920 cagtctttca tcgagcgcat gactaacttt gacaagaacc tcccgaacga gaaggtcctc 1980 cccaagcaca gcctccttta cgaatacttt acggtgtaca acgagctcac gaaggtcaag 2040 tacgtcactg agggcatgcg caagccggcg ttcctttcgg gcgagcagaa gaaggctatc 2100 gtcgacctcc ttttcaagac caaccgcaag gttaccgtca agcagctcaa ggaggactac 2160 ttcaagaaga tcgagtgctt tgactcggtc gagatttcgg gcgtggagga ccgtttcaac 2220 gcctccctcg gcacttacca cgaccttctc aagatcatca aggacaagga ctttctcgac 2280 aacgaggaga acgaggacat tctcgaggac atcgtcctca cgctcaccct ctttgaggac 2340 cgtgagatga tcgaggagcg cctcaagacc tacgcccatc tctttgacga caaggtcatg 2400 aagcagctca agcgccgccg ctacaccggc tggggccgcc tttcccgcaa gctcatcaac 2460 ggcatccgcg acaagcagtc tggcaagacc atccttgact ttcttaagtc tgatggtttc 2520 gccaaccgca acttcatgca gctcatccac gaggacagcc tcactttcaa ggaggacatt 2580 cagaaggccc aggtctccgg ccagggtgac tctctccacg aacacatcgc caaccttgct 2640 ggcagcccgg ctattaagaa gggcatcctc cagaccgtca aggtcgtcga cgagctcgtc 2700 aaggttatgg gccgccacaa gcccgagaac atcgtcattg agatggctcg cgaaaaccag 2760 accacccaga agggtcagaa gaactcccgc gagcgcatga agcgtatcga ggagggcatc 2820 aaggagctcg gcagccagat cctcaaggag cacccggtcg agaacaccca gctccagaac 2880 gaaaagctct acctctacta cctccagaac ggccgtgaca tgtacgttga ccaggagctc 2940 gacattaacc gcctctccga ttacgacgtc gaccatattg tcccccagag ctttctcaag 3000 gacgacagca tcgacaacaa ggtcctcacc cgctcggaca agaaccgcgg caagtccgac 3060 aacgtccctt ccgaggaggt cgtgaagaag atgaagaact actggcgcca gcttctcaac 3120 gctaagctta ttactcagcg caagttcgat aacctcacca aggccgaacg cggcggcctc 3180 tccgagctcg acaaggccgg ttttatcaag cgccagctcg ttgagactcg ccagatcacc 3240 aagcacgtgg cgcagatcct cgactcgcgc atgaacacga agtacgacga gaacgacaag 3300 ctcatccgcg aggtcaaggt catcaccctt aagtcgaagc tcgtgtccga ctttcgcaag 3360 gacttccagt tctacaaggt ccgtgaaatt aacaactacc accacgctca cgacgcttac 3420 ctcaacgcgg tcgtgggtac cgcgctcatc aagaagtacc cgaagctcga gtcggagttt 3480 gtctacggcg actacaaggt ctacgacgtg cgcaagatga tcgccaagtc cgagcaggag 3540 atcggcaagg ccacggccaa gtactttttc tactccaaca ttatgaactt ctttaagact 3600 gagatcaccc ttgccaacgg cgagatccgc aagcgccccc ttatcgagac caacggcgag 3660 accggcgaaa ttgtgtggga taagggtcgc gactttgcca ccgtccgcaa ggtcctcagc 3720 atgccccagg tcaacattgt taagaagacc gaggtccaga cgggcggctt tagcaaggag 3780 tctatcctcc ccaagcgtaa cagcgacaag ctcatcgccc gcaagaagga ctgggaccct 3840 aagaagtacg gcggcttcga ttcgcctacg gtcgcctaca gcgtcctcgt cgtcgccaag 3900 gtcgagaagg gcaagtccaa gaagctcaag tccgtcaagg agctcctcgg catcacgatc 3960 atggagcgct ccagctttga gaagaacccc attgacttcc tcgaggctaa gggttacaag 4020 gaggtcaaga aggaccttat catcaagctc cccaagtact ccctctttga gctcgaaaac 4080 ggccgcaagc gtatgctcgc tagcgctggc gaactccaga agggcaacga gctcgccctc 4140 cccagcaagt acgtcaactt tctctacctc gcctcccact acgagaagct caagggtagc 4200 ccggaggata acgagcagaa gcagcttttt gtggagcagc acaagcacta ccttgacgag 4260 atcattgaac agatctccga gttctccaag cgtgttattc ttgctgacgc caacctcgat 4320 aaggtgctct ccgcgtacaa caagcaccgc gacaagccta tccgcgagca ggccgagaac 4380 atcatccacc tctttaccct caccaacctc ggcgccccgg ccgcctttaa gtactttgat 4440 acgactatcg accgcaagcg ctacacttcg actaaggagg tcctcgacgc taccctcatt 4500 caccagtcca ttaccggcct ctacgagacc cgcattgacc tttcgcagct cggtggcgac 4560 tcgcgtgcgg accctaagaa gaagcgcaag gtctaacata tgagttatga gatccgaaag 4620 tgaaccttgt cctaacccga cagcgaatgg cgggaggggg cgggctaaaa gatcgtatta 4680 catagtattt ttcccctact ctttgtgttt gtcttttttt tttttttgaa cgcattcaag 4740 ccacttgtct gggtttactt gtttgtttgc ttgcttgctt gcttgcttgc ctgcttcttg 4800 gtcagacggc ccaaaaaagg gaaaaaattc attcatggca cagataagaa aaagaaaaag 4860 tttgtcgacc accgtcatca gaaagcaaga gaagagaaac actcgcgctc acattctcgc 4920 tcgcgtaaga atcttagcca cgcatacgaa gtaatttgtc catctggcga atctttacat 4980 gagcgttttc aagctggagc gtgagatcat acctttcttg atcgtaatgt tccaaccttg 5040 cataggcctc gttgcgatcc gctagcaatg cgtcgtactc ccgttgcaac tgcgccatcg 5100 cctcattgtg acgtgagttc agattcttct cgagaccttc gagcgctgct aatttcgcct 5160 gacgctcctt cttttgtgct tccatgacac gccgcttcac cgtgcgttcc acttcttcct 5220 cagacatgcc cttggctgcc tcgacctgct cggtaaaacg ggccccagca cgtgctacga 5280 gatttcgatt ccaccgccgc cttctatgaa aggttgggct tcggaatcgt tttccgggac 5340 gccggctgga tgatcctcca gcgcggggat ctcatgctgg agttcttcgc ccaccccaac 5400 ttgtttattg cagcttataa tggttacaaa taaagcaata gcatcacaaa tttcacaaat 5460 aaagcatttt tttcactgca ttctagttgt ggtttgtcca aactcatcaa tgtatcttat 5520 catacatggt cgacctgcag gaacctgcat taatgaatcg gccaacgcgc ggggagaggc 5580 ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt 5640 cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca 5700 ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa 5760 aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat 5820 cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc 5880 cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc 5940 gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt 6000 tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac 6060 cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg 6120 ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca 6180 gagttcttga agtggtggcc taactacggc tacactagaa gaacagtatt tggtatctgc 6240 gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa 6300 accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa 6360 ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac 6420 tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta 6480 aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt 6540 taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg ttcatccata 6600 gttgcctgac tccccgtcgt gtagataact acgatacggg agggcttacc atctggcccc 6660 agtgctgcaa tgataccgcg agacccacgc tcaccggctc cagatttatc agcaataaac 6720 cagccagccg gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc ctccatccag 6780 tctattaatt gttgccggga agctagagta agtagttcgc cagttaatag tttgcgcaac 6840 gttgttgcca ttgctacagg catcgtggtg tcacgctcgt cgtttggtat ggcttcattc 6900 agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg caaaaaagcg 6960 gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt gttatcactc 7020 atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag atgcttttct 7080 gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg accgagttgc 7140 tcttgcccgg cgtcaatacg ggataatacc gcgccacata gcagaacttt aaaagtgctc 7200 atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct gttgagatcc 7260 agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac tttcaccagc 7320 gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat aagggcgaca 7380 cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat ttatcagggt 7440 tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca aataggggtt 7500 ccgcgcacat ttccccgaaa agtgccacct gacgtctaag aaaccattat tatcatgaca 7560 ttaacctata aaaataggcg tatcacgagg ccctttcgtc tcgcgcgttt cggtgatgac 7620 ggtgaaaacc tctgacacat gcagctcccg gagacggtca cagcttgtct gtaagcggat 7680 gccgggagca gacaagcccg tcagggcgcg tcagcgggtg ttggcgggtg tcggggctgg 7740 cttaactatg cggcatcaga gcagattgta ctgagagtgc accaagcttc caattttagg 7800 ccccccactg accgaggtct gtcgataatc cacttttcca ttgattttcc aggtttcgtt 7860 aactcatgcc actgagcaaa acttcggtct ttcctaacaa aagctctcct cacaaagcat 7920 ggcgcggcaa cggacgtgtc ctcatactcc actgccacac aaggtcgata aactaagctc 7980 ctcacaaata gaggagaatt ccactgacaa ctgaaaacaa tgtatgagag acgatcacca 8040 ctggagcggc gcggcggttg ggcgcggagg tcggcagcaa aaacaagcga ctcgccgagc 8100 aaacccgaat cagccttcag acggtcgtgc ctaacaacac gccgttctac cccgccttct 8160 tcgcgcccct tcgcgtccaa gcatccttca agtttatctc tctagttcaa cttcaagaag 8220 aacaacacca ccaacaccat ggccaagttg accagtgccg ttccggtgct caccgcgcgc 8280 gacgtcgccg gagcggtcga gttctggacc gaccggctcg ggttctcccg ggacttcgtg 8340 gaggacgact tcgccggtgt ggtccgggac gacgtgaccc tgttcatcag cgcggtccag 8400 gaccaggtgg tgccggacaa caccctggcc tgggtgtggg tgcgcggcct ggacgagctg 8460 tacgccgagt ggtcggaggt cgtgtccacg aacttccggg acgcctccgg gccggccatg 8520 accgagatcg gcgagcagcc gtgggggcgg gagttcgccc tgcgcgaccc ggccggcaac 8580 tgcgtgcact tcgtggccga ggagcaggac tgacacgtgc tacgagattt cgattccacc 8640 gccgccttct atgaaaggtt gggcttcgga atcgttttcc gggacgccgg ctggatgatc 8700 ctccagcgcg gggatctcat gctggagttc ttcgcccacc ccaacttgtt tattgcagct 8760 tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc atttttttca 8820 ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt ctgaattccc 8880 ggggtac 8887 <210> 34 <211> 5997 <212> DNA <213> Artificial Sequence <220> <223> pYB66 vector (5997 bp) <220> <221> misc_feature <222> (1)..(1064) <223> EF-1alpha_promoter <220> <221> misc_feature <222> (1082)..(1118) <223> Hammerhead_ribozyme <220> <221> misc_feature <222> (1119)..(1138) <223> gRNA3_CS1_target_sequence <220> <221> misc_feature <222> (1139)..(1218) <223> gRNA_scaffold <220> <221> misc_feature <222> (1219)..(1286) <223> HDV_ribozyme <220> <221> misc_feature <222> (1306)..(1945) <223> OrfC_terminator <220> <221> misc_feature <222> (4480)..(4928) <223> Alpha_tubulin_promoter <220> <221> misc_feature <222> (4929)..(5723) <223> Neomycin_phosphotransferase_gene <220> <221> misc_feature <222> (5724)..(5997) <223> SV40_terminator <400> 34 ctcttatctg cctcgcgccg ttgaccgccg cttgactctt ggcgcttgcc gctcgcatcc 60 tgcctcgctc gcgcaggcgg gcgggcgagt gggtgggtcc gcagccttcc gcgctcgccc 120 gctagctcgc tcgcgccgtg ctgcagccag cagggcagca ccgcacggca ggcaggtccc 180 ggcgcggatc gatcgatcca tcgatccatc gatccatcga tcgtgcggtc aaaaagaaag 240 gaagaagaaa ggaaaaagaa aggcgtgcgc acccgagtgc gcgctgagcg cccgctcgcg 300 gtcccgcgga gcctccgcgt tagtccccgc cccgcgccgc gcagtccccc gggaggcatc 360 gcgcacctct cgccgccccc tcgcgcctcg ccgattcccc gcctcccctt ttccgcttct 420 tcgccgcctc cgctcgcggc cgcgtcgccc gcgccccgct ccctatctgc tccccagggg 480 ggcactccgc accttttgcg cccgctgccg ccgccgcggc cgccccgccg ccctggtttc 540 ccccgcgagc gcggccgcgt cgccgcgcaa agactcgccg cgtgccgccc cgagcaacgg 600 gtggcggcgg cgcggcggcg ggcggggcgc ggcggcgcgt aggcggggct aggcgccggc 660 taggcgaaac gccgcccccg ggcgccgccg ccgcccgctc cagagcagtc gccgcgccag 720 accgccaacg cagagaccga gaccgaggta cgtcgcgccc gagcacgccg cgacgcgcgg 780 cagggacgag gagcacgacg ccgcgccgcg ccgcgcgggg ggggggaggg agaggcagga 840 cgcgggagcg agcgtgcatg tttccgcgcg agacgacgcc gcgcgcgctg gagaggagat 900 aaggcgcttg gatcgcgaga gggccagcca ggctggaggc gaaaatgggt ggagaggata 960 gtatcttgcg tgcttggacg aggagactga cgaggaggac ggatacgtcg atgatgatgt 1020 gcacagagaa gaagcagttc gaaagcgact actagcaagc aagggatcca gatcttccgc 1080 actgatgagt ccgtgaggac gaaacgagta agctcgtctg cggacgtcgt ggacgcgcgt 1140 tttagagcta gaaatagcaa gttaaaataa ggctagtccg ttatcaactt gaaaaagtgg 1200 caccgagtcg gtgcttttgg ccggcatggt cccagcctcc tcgctggcgc cggctgggca 1260 acatgcttcg gcatggcgaa tgggaccata tgagttatga gatccgaaag tgaaccttgt 1320 cctaacccga cagcgaatgg cgggaggggg cgggctaaaa gatcgtatta catagtattt 1380 ttcccctact ctttgtgttt gtcttttttt tttttttgaa cgcattcaag ccacttgtct 1440 gggtttactt gtttgtttgc ttgcttgctt gcttgcttgc ctgcttcttg gtcagacggc 1500 ccaaaaaagg gaaaaaattc attcatggca cagataagaa aaagaaaaag tttgtcgacc 1560 accgtcatca gaaagcaaga gaagagaaac actcgcgctc acattctcgc tcgcgtaaga 1620 atcttagcca cgcatacgaa gtaatttgtc catctggcga atctttacat gagcgttttc 1680 aagctggagc gtgagatcat acctttcttg atcgtaatgt tccaaccttg cataggcctc 1740 gttgcgatcc gctagcaatg cgtcgtactc ccgttgcaac tgcgccatcg cctcattgtg 1800 acgtgagttc agattcttct cgagaccttc gagcgctgct aatttcgcct gacgctcctt 1860 cttttgtgct tccatgacac gccgcttcac cgtgcgttcc acttcttcct cagacatgcc 1920 cttggctgcc tcgacctgct cggtaaaacg ggccccagca cgtgctacga gatttcgatt 1980 ccaccgccgc cttctatgaa aggttgggct tcggaatcgt tttccgggac gccggctgga 2040 tgatcctcca gcgcggggat ctcatgctgg agttcttcgc ccaccccaac ttgtttattg 2100 cagcttataa tggttacaaa taaagcaata gcatcacaaa tttcacaaat aaagcatttt 2160 tttcactgca ttctagttgt ggtttgtcca aactcatcaa tgtatcttat catacatggt 2220 cgacctgcag gaacctgcat taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta 2280 ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc 2340 gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca ggggataacg 2400 caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt 2460 tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa 2520 gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc cctggaagct 2580 ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc 2640 cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt tcggtgtagg 2700 tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct 2760 tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg ccactggcag 2820 cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca gagttcttga 2880 agtggtggcc taactacggc tacactagaa gaacagtatt tggtatctgc gctctgctga 2940 agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa accaccgctg 3000 gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag 3060 aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac tcacgttaag 3120 ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta aattaaaaat 3180 gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt taccaatgct 3240 taatcagtga ggcacctatc tcagcgatct gtctatttcg ttcatccata gttgcctgac 3300 tccccgtcgt gtagataact acgatacggg agggcttacc atctggcccc agtgctgcaa 3360 tgataccgcg agacccacgc tcaccggctc cagatttatc agcaataaac cagccagccg 3420 gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc ctccatccag tctattaatt 3480 gttgccggga agctagagta agtagttcgc cagttaatag tttgcgcaac gttgttgcca 3540 ttgctacagg catcgtggtg tcacgctcgt cgtttggtat ggcttcattc agctccggtt 3600 cccaacgatc aaggcgagtt acatgatccc ccatgttgtg caaaaaagcg gttagctcct 3660 tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt gttatcactc atggttatgg 3720 cagcactgca taattctctt actgtcatgc catccgtaag atgcttttct gtgactggtg 3780 agtactcaac caagtcattc tgagaatagt gtatgcggcg accgagttgc tcttgcccgg 3840 cgtcaatacg ggataatacc gcgccacata gcagaacttt aaaagtgctc atcattggaa 3900 aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct gttgagatcc agttcgatgt 3960 aacccactcg tgcacccaac tgatcttcag catcttttac tttcaccagc gtttctgggt 4020 gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt 4080 gaatactcat actcttcctt tttcaatatt attgaagcat ttatcagggt tattgtctca 4140 tgagcggata catatttgaa tgtatttaga aaaataaaca aataggggtt ccgcgcacat 4200 ttccccgaaa agtgccacct gacgtctaag aaaccattat tatcatgaca ttaacctata 4260 aaaataggcg tatcacgagg ccctttcgtc tcgcgcgttt cggtgatgac ggtgaaaacc 4320 tctgacacat gcagctcccg gagacggtca cagcttgtct gtaagcggat gccgggagca 4380 gacaagcccg tcagggcgcg tcagcgggtg ttggcgggtg tcggggctgg cttaactatg 4440 cggcatcaga gcagattgta ctgagagtgc accaagcttc caattttagg ccccccactg 4500 accgaggtct gtcgataatc cacttttcca ttgattttcc aggtttcgtt aactcatgcc 4560 actgagcaaa acttcggtct ttcctaacaa aagctctcct cacaaagcat ggcgcggcaa 4620 cggacgtgtc ctcatactcc actgccacac aaggtcgata aactaagctc ctcacaaata 4680 gaggagaatt ccactgacaa ctgaaaacaa tgtatgagag acgatcacca ctggagcggc 4740 gcggcggttg ggcgcggagg tcggcagcaa aaacaagcga ctcgccgagc aaacccgaat 4800 cagccttcag acggtcgtgc ctaacaacac gccgttctac cccgccttct tcgcgcccct 4860 tcgcgtccaa gcatccttca agtttatctc tctagttcaa cttcaagaag aacaacacca 4920 ccaacaccat gattgaacaa gatggattgc acgcaggttc tccggccgct tgggtggaga 4980 ggctattcgg ctatgactgg gcacaacaga caatcggctg ctctgatgcc gccgtgttcc 5040 ggctgtcagc gcaggggcgc ccggttcttt ttgtcaagac cgacctgtcc ggtgccctga 5100 atgaactgca ggacgaggca gcgcggctat cgtggctggc cacgacgggc gttccttgcg 5160 cagctgtgct cgacgttgtc actgaagcgg gaagggactg gctgctattg ggcgaagtgc 5220 cggggcagga tctcctgtca tctcaccttg ctcctgccga gaaagtatcc atcatggctg 5280 atgcaatgcg gcggctgcat acgcttgatc cggctacctg cccattcgac caccaagcga 5340 aacatcgcat cgagcgagca cgtactcgga tggaagccgg tcttgtcgat caggatgatc 5400 tggacgaaga gcatcagggg ctcgcgccag ccgaactgtt cgccaggctc aaggcgcgca 5460 tgcccgacgg cgatgatctc gtcgtgaccc atggcgatgc ctgcttgccg aatatcatgg 5520 tggaaaatgg ccgcttttct ggattcatcg actgtggccg gctgggtgtg gcggaccgct 5580 atcaggacat agcgttggct acccgtgata ttgctgaaga gcttggcggc gaatgggctg 5640 accgcttcct cgtgctttac ggtatcgccg ctcccgattc gcagcgcatc gccttctatc 5700 gccttcttga cgagttcttc tgacacgtgc tacgagattt cgattccacc gccgccttct 5760 atgaaaggtt gggcttcgga atcgttttcc gggacgccgg ctggatgatc ctccagcgcg 5820 gggatctcat gctggagttc ttcgcccacc ccaacttgtt tattgcagct tataatggtt 5880 acaaataaag caatagcatc acaaatttca caaataaagc atttttttca ctgcattcta 5940 gttgtggttt gtccaaactc atcaatgtat cttatcatgt ctgaattccc ggggtac 5997 <210> 35 <211> 10700 <212> DNA <213> Artificial Sequence <220> <223> pYB73 vector (10700 bp) <220> <221> misc_feature <222> (2)..(450) <223> Alpha_tubulin_promoter <220> <221> misc_feature <222> (457)..(4596) <223> Cas9 <220> <221> misc_feature <222> (4616)..(5255) <223> OrfC_terminator <220> <221> misc_feature <222> (5546)..(6608) <223> EF_1alpha_promoter <220> <221> misc_feature <222> (6621)..(6657) <223> Hammerhead_ribozyme <220> <221> misc_feature <222> (6658)..(6677) <223> gRNA3_CS1_target_sequence <220> <221> misc_feature <222> (6678)..(6757) <223> gRNA_scaffold <220> <221> misc_feature <222> (6758)..(6825) <223> HDV_ribozyme <220> <221> misc_feature <222> (6845)..(7363) <223> OrfC_terminator <220> <221> misc_feature <222> (8369)..(9229) <223> Ampicillin_resistance_gene <220> <221> misc_feature <222> (9619)..(10067) <223> Alpha_tubulin_promoter <220> <221> misc_feature <222> (10068)..(10442) <223> Sh_ble_gene <220> <221> misc_feature <222> (10443)..(10700) <223> SV40_terminator <400> 35 cccaatttta ggccccccac tgaccgaggt ctgtcgataa tccacttttc cattgatttt 60 ccaggtttcg ttaactcatg ccactgagca aaacttcggt ctttcctaac aaaagctctc 120 ctcacaaagc atggcgcggc aacggacgtg tcctcatact ccactgccac acaaggtcga 180 taaactaagc tcctcacaaa tagaggagaa ttccactgac aactgaaaac aatgtatgag 240 agacgatcac cactggagcg gcgcggcggt tgggcgcgga ggtcggcagc aaaaacaagc 300 gactcgccga gcaaacccga atcagccttc agacggtcgt gcctaacaac acgccgttct 360 accccgcctt cttcgcgccc cttcgcgtcc aagcatcctt caagtttatc tctctagttc 420 aacttcaaga agaacaacac caccaacacc ggatccatgg ataagaagta ctcgatcggc 480 ctcgacattg gcaccaacag cgtcggctgg gccgtcatta ctgatgagta caaggtcccg 540 tcgaagaagt ttaaggtcct cggcaacact gaccgccact ccatcaagaa gaacctcatc 600 ggtgccctcc tttttgactc cggcgagacc gctgaggcca ctcgcctcaa gcgcactgcc 660 cgccgccgtt acacccgccg caagaaccgc atctgctacc tccaggagat tttctcgaac 720 gaaatggcca aggtcgatga ctcctttttc caccgtctcg aagaatcgtt cctcgtcgag 780 gaggacaaga agcacgagcg ccaccccatc ttcggtaaca ttgtcgatga ggttgcctac 840 cacgagaagt acccgaccat ctaccacctc cgcaagaagc tcgtcgactc caccgacaag 900 gccgatctcc gccttatcta cctcgccctc gcccacatga tcaagttccg cggccacttt 960 cttatcgagg gtgatctcaa ccctgataac tctgacgtcg acaagctttt catccagctc 1020 gtccagactt acaaccagct cttcgaggag aaccccatca acgcttccgg cgtcgacgcg 1080 aaggccattc tcagcgcccg cctcagcaag tcccgccgcc tcgaaaacct cattgcccag 1140 cttcccggcg agaagaagaa cggcctcttc ggcaacctca ttgccctcag ccttggcctc 1200 acccctaact tcaagtcgaa ctttgacctc gccgaggacg ccaagctcca gctttccaag 1260 gacacttacg acgacgatct cgacaacctc ctcgctcaga ttggcgacca gtacgctgac 1320 ctcttcctcg ccgccaagaa ccttagcgat gccatcctcc tctccgacat ccttcgtgtt 1380 aacacggaaa tcacgaaggc tccgctctcc gcctccatga tcaagcgtta cgacgagcac 1440 catcaggacc tcaccctcct caaggccctc gtccgccagc agctccccga gaagtacaag 1500 gagatcttct tcgaccagag caagaacggc tacgccggct acatgacgg cggcgcgtcg 1560 caggaggagt tttacaagtt tatcaagccc attcttgaga agatggacgg caccgaggag 1620 ctcctcgtca agctcaaccg tgaggacctt ctccgcaagc agcgcacgtt cgacaacggc 1680 tctattcccc atcagatcca cctcggtgag cttcacgcga ttcttcgccg ccaggaagac 1740 ttttacccgt tcctcaagga caaccgcgag aagattgaga agatcctcac ctttcgcatt 1800 ccctactacg tcggccccct cgcccgcggc aactcgcgct ttgcttggat gacccgcaag 1860 tccgaggaga ccatcacccc gtggaacttc gaagaggtcg tcgacaaggg cgcctccgcg 1920 cagtctttca tcgagcgcat gactaacttt gacaagaacc tcccgaacga gaaggtcctc 1980 cccaagcaca gcctccttta cgaatacttt acggtgtaca acgagctcac gaaggtcaag 2040 tacgtcactg agggcatgcg caagccggcg ttcctttcgg gcgagcagaa gaaggctatc 2100 gtcgacctcc ttttcaagac caaccgcaag gttaccgtca agcagctcaa ggaggactac 2160 ttcaagaaga tcgagtgctt tgactcggtc gagatttcgg gcgtggagga ccgtttcaac 2220 gcctccctcg gcacttacca cgaccttctc aagatcatca aggacaagga ctttctcgac 2280 aacgaggaga acgaggacat tctcgaggac atcgtcctca cgctcaccct ctttgaggac 2340 cgtgagatga tcgaggagcg cctcaagacc tacgcccatc tctttgacga caaggtcatg 2400 aagcagctca agcgccgccg ctacaccggc tggggccgcc tttcccgcaa gctcatcaac 2460 ggcatccgcg acaagcagtc tggcaagacc atccttgact ttcttaagtc tgatggtttc 2520 gccaaccgca acttcatgca gctcatccac gaggacagcc tcactttcaa ggaggacatt 2580 cagaaggccc aggtctccgg ccagggtgac tctctccacg aacacatcgc caaccttgct 2640 ggcagcccgg ctattaagaa gggcatcctc cagaccgtca aggtcgtcga cgagctcgtc 2700 aaggttatgg gccgccacaa gcccgagaac atcgtcattg agatggctcg cgaaaaccag 2760 accacccaga agggtcagaa gaactcccgc gagcgcatga agcgtatcga ggagggcatc 2820 aaggagctcg gcagccagat cctcaaggag cacccggtcg agaacaccca gctccagaac 2880 gaaaagctct acctctacta cctccagaac ggccgtgaca tgtacgttga ccaggagctc 2940 gacattaacc gcctctccga ttacgacgtc gaccatattg tcccccagag ctttctcaag 3000 gacgacagca tcgacaacaa ggtcctcacc cgctcggaca agaaccgcgg caagtccgac 3060 aacgtccctt ccgaggaggt cgtgaagaag atgaagaact actggcgcca gcttctcaac 3120 gctaagctta ttactcagcg caagttcgat aacctcacca aggccgaacg cggcggcctc 3180 tccgagctcg acaaggccgg ttttatcaag cgccagctcg ttgagactcg ccagatcacc 3240 aagcacgtgg cgcagatcct cgactcgcgc atgaacacga agtacgacga gaacgacaag 3300 ctcatccgcg aggtcaaggt catcaccctt aagtcgaagc tcgtgtccga ctttcgcaag 3360 gacttccagt tctacaaggt ccgtgaaatt aacaactacc accacgctca cgacgcttac 3420 ctcaacgcgg tcgtgggtac cgcgctcatc aagaagtacc cgaagctcga gtcggagttt 3480 gtctacggcg actacaaggt ctacgacgtg cgcaagatga tcgccaagtc cgagcaggag 3540 atcggcaagg ccacggccaa gtactttttc tactccaaca ttatgaactt ctttaagact 3600 gagatcaccc ttgccaacgg cgagatccgc aagcgccccc ttatcgagac caacggcgag 3660 accggcgaaa ttgtgtggga taagggtcgc gactttgcca ccgtccgcaa ggtcctcagc 3720 atgccccagg tcaacattgt taagaagacc gaggtccaga cgggcggctt tagcaaggag 3780 tctatcctcc ccaagcgtaa cagcgacaag ctcatcgccc gcaagaagga ctgggaccct 3840 aagaagtacg gcggcttcga ttcgcctacg gtcgcctaca gcgtcctcgt cgtcgccaag 3900 gtcgagaagg gcaagtccaa gaagctcaag tccgtcaagg agctcctcgg catcacgatc 3960 atggagcgct ccagctttga gaagaacccc attgacttcc tcgaggctaa gggttacaag 4020 gaggtcaaga aggaccttat catcaagctc cccaagtact ccctctttga gctcgaaaac 4080 ggccgcaagc gtatgctcgc tagcgctggc gaactccaga agggcaacga gctcgccctc 4140 cccagcaagt acgtcaactt tctctacctc gcctcccact acgagaagct caagggtagc 4200 ccggaggata acgagcagaa gcagcttttt gtggagcagc acaagcacta ccttgacgag 4260 atcattgaac agatctccga gttctccaag cgtgttattc ttgctgacgc caacctcgat 4320 aaggtgctct ccgcgtacaa caagcaccgc gacaagccta tccgcgagca ggccgagaac 4380 atcatccacc tctttaccct caccaacctc ggcgccccgg ccgcctttaa gtactttgat 4440 acgactatcg accgcaagcg ctacacttcg actaaggagg tcctcgacgc taccctcatt 4500 caccagtcca ttaccggcct ctacgagacc cgcattgacc tttcgcagct cggtggcgac 4560 tcgcgtgcgg accctaagaa gaagcgcaag gtctaacata tgagttatga gatccgaaag 4620 tgaaccttgt cctaacccga cagcgaatgg cgggaggggg cgggctaaaa gatcgtatta 4680 catagtattt ttcccctact ctttgtgttt gtcttttttt tttttttgaa cgcattcaag 4740 ccacttgtct gggtttactt gtttgtttgc ttgcttgctt gcttgcttgc ctgcttcttg 4800 gtcagacggc ccaaaaaagg gaaaaaattc attcatggca cagataagaa aaagaaaaag 4860 tttgtcgacc accgtcatca gaaagcaaga gaagagaaac actcgcgctc acattctcgc 4920 tcgcgtaaga atcttagcca cgcatacgaa gtaatttgtc catctggcga atctttacat 4980 gagcgttttc aagctggagc gtgagatcat acctttcttg atcgtaatgt tccaaccttg 5040 cataggcctc gttgcgatcc gctagcaatg cgtcgtactc ccgttgcaac tgcgccatcg 5100 cctcattgtg acgtgagttc agattcttct cgagaccttc gagcgctgct aatttcgcct 5160 gacgctcctt cttttgtgct tccatgacac gccgcttcac cgtgcgttcc acttcttcct 5220 cagacatgcc cttggctgcc tcgacctgct cggtaaaacg ggccccagca cgtgctacga 5280 gatttcgatt ccaccgccgc cttctatgaa aggttgggct tcggaatcgt tttccgggac 5340 gccggctgga tgatcctcca gcgcggggat ctcatgctgg agttcttcgc ccaccccaac 5400 ttgtttattg cagcttataa tggttacaaa taaagcaata gcatcacaaa tttcacaaat 5460 aaagcatttt tttcactgca ttctagttgt ggtttgtcca aactcatcaa tgtatcttat 5520 catacatggt cgacctgcag ggtacctctt atctgcctcg cgccgttgac cgccgcttga 5580 ctcttggcgc ttgccgctcg catcctgcct cgctcgcgca ggcgggcggg cgagtgggtg 5640 ggtccgcagc cttccgcgct cgcccgctag ctcgctcgcg ccgtgctgca gccagcaggg 5700 cagcaccgca cggcaggcag gtcccggcgc ggatcgatcg atccatcgat ccatcgatcc 5760 atcgatcgtg cggtcaaaaa gaaaggaaga agaaaggaaa aagaaaggcg tgcgcacccg 5820 agtgcgcgct gagcgcccgc tcgcggtccc gcggagcctc cgcgttagtc cccgccccgc 5880 gccgcgcagt cccccgggag gcatcgcgca cctctcgccg ccccctcgcg cctcgccgat 5940 tccccgcctc cccttttccg cttcttcgcc gcctccgctc gcggccgcgt cgcccgcgcc 6000 ccgctcccta tctgctcccc aggggggcac tccgcacctt ttgcgcccgc tgccgccgcc 6060 gcggccgccc cgccgccctg gtttcccccg cgagcgcggc cgcgtcgccg cgcaaagact 6120 cgccgcgtgc cgccccgagc aacgggtggc ggcggcgcgg cggcgggcgg ggcgcggcgg 6180 cgcgtaggcg gggctaggcg ccggctaggc gaaacgccgc ccccgggcgc cgccgccgcc 6240 cgctccagag cagtcgccgc gccagaccgc caacgcagag accgagaccg aggtacgtcg 6300 cgcccgagca cgccgcgacg cgcggcaggg acgaggagca cgacgccgcg ccgcgccgcg 6360 cgggggggggg gagggagagg caggacgcgg gagcgagcgt gcatgtttcc gcgcgagacg 6420 acgccgcgcg cgctggagag gagataaggc gcttggatcg cgagagggcc agccaggctg 6480 gaggcgaaaa tgggtggaga ggatagtatc ttgcgtgctt ggacgaggag actgacgagg 6540 aggacggata cgtcgatgat gatgtgcaca gagaagaagc agttcgaaag cgactactag 6600 caagcaagag atcttccgca ctgatgagtc cgtgaggacg aaacgagtaa gctcgtctgc 6660 ggacgtcgtg gacgcgcgtt ttagagctag aaatagcaag ttaaaataag gctagtccgt 6720 tatcaacttg aaaaagtggc accgagtcgg tgcttttggc cggcatggtc ccagcctcct 6780 cgctggcgcc ggctgggcaa catgcttcgg catggcgaat gggaccatat gagttatgag 6840 atccgaaagt gaaccttgtc ctaacccgac agcgaatggc gggagggggc gggctaaaag 6900 atcgtattac atagtatttt tcccctactc tttgtgtttg tctttttttt ttttttgaac 6960 gcattcaagc cacttgtctg ggtttacttg tttgtttgct tgcttgcttg cttgcttgcc 7020 tgcttcttgg tcagacggcc caaaaaaggg aaaaaattca ttcatggcac agataagaaa 7080 aagaaaaagt ttgtcgacca ccgtcatcag aaagcaagag aagagaaaca ctcgcgctca 7140 cattctcgct cgcgtaagaa tcttagccac gcatacgaag taatttgtcc atctggcgaa 7200 tctttacatg agcgttttca agctggagcg tgagatcata cctttcttga tcgtaatgtt 7260 ccaaccttgc ataggcctcg ttgcgatccg ctagcaatgc gtcgtactcc cgttgcaact 7320 gcgccatcgc ctcattgtga cgtgagttca gattcttctc gagctgcagg aacctgcatt 7380 aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct tccgcttcct 7440 cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa 7500 aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa 7560 aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc 7620 tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga 7680 caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc 7740 cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt 7800 ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct 7860 gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg 7920 agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta 7980 gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct 8040 acactagaag aacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa 8100 gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt 8160 gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta 8220 cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat 8280 caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa 8340 gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct 8400 cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta 8460 cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct 8520 caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg 8580 gtcctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa 8640 gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt 8700 cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta 8760 catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca 8820 gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta 8880 ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct 8940 gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg 9000 cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac 9060 tctcaaggat cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact 9120 gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa 9180 atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt 9240 ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat 9300 gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg 9360 acgtctaaga aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc 9420 cctttcgtct cgcgcgtttc ggtgatgacg gtgaaaacct ctgacacat cagctcccgg 9480 agacggtcac agcttgtctg taagcggatg ccgggagcag acaagcccgt cagggcgcgt 9540 cagcgggtgt tggcgggtgt cggggctggc ttaactatgc ggcatcagag cagattgtac 9600 tgagagtgca ccaagcttcc aattttaggc cccccactga ccgaggtctg tcgataatcc 9660 acttttccat tgattttcca ggtttcgtta actcatgcca ctgagcaaaa cttcggtctt 9720 tcctaacaaa agctctcctc acaaagcatg gcgcggcaac ggacgtgtcc tcatactcca 9780 ctgccacaca aggtcgataa actaagctcc tcacaaatag aggagaattc cactgacaac 9840 tgaaaacaat gtatgagaga cgatcaccac tggagcggcg cggcggttgg gcgcggaggt 9900 cggcagcaaa aacaagcgac tcgccgagca aacccgaatc agccttcaga cggtcgtgcc 9960 taacaacacg ccgttctacc ccgccttctt cgcgcccctt cgcgtccaag catccttcaa 10020 gtttatctct ctagttcaac ttcaagaaga acaacaccac caacaccatg gccaagttga 10080 ccagtgccgt tccggtgctc accgcgcgcg acgtcgccgg agcggtcgag ttctggaccg 10140 accggctcgg gttctcccgg gacttcgtgg aggacgactt cgccggtgtg gtccgggacg 10200 acgtgaccct gttcatcagc gcggtccagg accaggtggt gccggacaac accctggcct 10260 gggtgtgggt gcgcggcctg gacgagctgt acgccgagtg gtcggaggtc gtgtccacga 10320 acttccggga cgcctccggg ccggccatga ccgagatcgg cgagcagccg tgggggcggg 10380 agttcgccct gcgcgacccg gccggcaact gcgtgcactt cgtggccgag gagcaggact 10440 gacacgtgct acgagatttc gattccaccg ccgccttcta tgaaaggttg ggcttcggaa 10500 tcgttttccg ggacgccggc tggatgatcc tccagcgcgg ggatctcatg ctggagttct 10560 tcgcccaccc caacttgttt attgcagctt ataatggtta caaataaagc aatagcatca 10620 caaatttcac aaataaagca tttttttcac tgcattctag ttgtggtttg tccaaactca 10680 tcaatgtatc ttatcggtac 10700 <210> 36 <211> 6300 <212> DNA <213> Artificial Sequence <220> <223> pCL310 vector (6300 bp) <220> <221> misc_feature <222> (1)..(1064) <223> EF_1alpha_promoter <220> <221> misc_feature <222> (1070)..(1981) <223> CarG_gene <220> <221> misc_feature <222> (2001)..(2640) <223> OrfC_terminator <220> <221> misc_feature <222> (5175)..(5623) <223> Alpha_tubulin_promoter <220> <221> misc_feature <222> (5627)..(6019) <223> Bsd_gene <220> <221> misc_feature <222> (6020)..(6300) <223> SV40_terminator <400> 36 ctcttatctg cctcgcgccg ttgaccgccg cttgactctt ggcgcttgcc gctcgcatcc 60 tgcctcgctc gcgcaggcgg gcgggcgagt gggtgggtcc gcagccttcc gcgctcgccc 120 gctagctcgc tcgcgccgtg ctgcagccag cagggcagca ccgcacggca ggcaggtccc 180 ggcgcggatc gatcgatcca tcgatccatc gatccatcga tcgtgcggtc aaaaagaaag 240 gaagaagaaa ggaaaaagaa aggcgtgcgc acccgagtgc gcgctgagcg cccgctcgcg 300 gtcccgcgga gcctccgcgt tagtccccgc cccgcgccgc gcagtccccc gggaggcatc 360 gcgcacctct cgccgccccc tcgcgcctcg ccgattcccc gcctcccctt ttccgcttct 420 tcgccgcctc cgctcgcggc cgcgtcgccc gcgccccgct ccctatctgc tccccagggg 480 ggcactccgc accttttgcg cccgctgccg ccgccgcggc cgccccgccg ccctggtttc 540 ccccgcgagc gcggccgcgt cgccgcgcaa agactcgccg cgtgccgccc cgagcaacgg 600 gtggcggcgg cgcggcggcg ggcggggcgc ggcggcgcgt aggcggggct aggcgccggc 660 taggcgaaac gccgcccccg ggcgccgccg ccgcccgctc cagagcagtc gccgcgccag 720 accgccaacg cagagaccga gaccgaggta cgtcgcgccc gagcacgccg cgacgcgcgg 780 cagggacgag gagcacgacg ccgcgccgcg ccgcgcgggg ggggggaggg agaggcagga 840 cgcgggagcg agcgtgcatg tttccgcgcg agacgacgcc gcgcgcgctg gagaggagat 900 aaggcgcttg gatcgcgaga gggccagcca ggctggaggc gaaaatgggt ggagaggata 960 gtatcttgcg tgcttggacg aggagactga cgaggaggac ggatacgtcg atgatgatgt 1020 gcacagagaa gaagcagttc gaaagcgact actagcaagc aagggatcca tgctcaactc 1080 gcacaaccgc actgaggagc gctccactga ggacatcatc cttgagccct acacctacct 1140 catttcccag cccggcaagg acatccgcgc taagcttatt tccgctttcg acctctggct 1200 ccatgtcccc aaggacgtcc tctgcgtcat caacaagatt attggcatgc ttcacaacgc 1260 ctccctcatg atcgacgatg tccaggatga ctccgacctt cgccgtggcg tccccgtcgc 1320 ccaccatatc tacggcgtcc cccagacgat caacaccgct aactacgtca tttttctcgc 1380 tctccaggag gtcatgaagc tcaacatccc ctccatgatg caggtctgca ctgaggagct 1440 catcaacctt caccgcggcc agggtatcga gctctactgg cgcgattcgc ttacttgccc 1500 cactgaggag gagtacattg acatggtcaa caacaagacg tccggccttc tccgtcttgc 1560 cgtccgtctc atgcaggccg cctcggagtc cgacatcgac tacacccctc tcgtcaacat 1620 catcggtatt cactttcagg tccgcgacga ttacatgaac ctccagtcca ctagctacac 1680 gaacaacaag ggtttctgcg aggacctcac ggagggcaag ttttcgttcc ccatcatcca 1740 cgccattcgc aaggacccca gcaaccgcca gctccttaac attatctccc agaagcctac 1800 gtccattgag gttaagaagt acgcccttga ggttattcgc aaggccggca gctttgagta 1860 cgttcgcgag ttcctgcgcc agaaggaggc cgagtccctt aaggagatca agcgccttgg 1920 aggcaaccct ctcctcgaga agtacattga gaccatccgc gtcgaggcca cgaacgacta 1980 acatatgagt tatgagatcc gaaagtgaac cttgtcctaa cccgacagcg aatggcggga 2040 gggggcgggc taaaagatcg tattacatag tatttttccc ctactctttg tgtttgtctt 2100 tttttttttt ttgaacgcat tcaagccact tgtctgggtt tacttgtttg tttgcttgct 2160 tgcttgcttg cttgcctgct tcttggtcag acggcccaaa aaagggaaaa aattcattca 2220 tggcacagat aagaaaaaga aaaagtttgt cgaccaccgt catcagaaag caagagaaga 2280 gaaacactcg cgctcacatt ctcgctcgcg taagaatctt agccacgcat acgaagtaat 2340 ttgtccatct ggcgaatctt tacatgagcg ttttcaagct ggagcgtgag atcatacctt 2400 tcttgatcgt aatgttccaa ccttgcatag gcctcgttgc gatccgctag caatgcgtcg 2460 tactcccgtt gcaactgcgc catcgcctca ttgtgacgtg agttcagatt cttctcgaga 2520 ccttcgagcg ctgctaattt cgcctgacgc tccttctttt gtgcttccat gacacgccgc 2580 ttcaccgtgc gttccacttc ttcctcagac atgcccttgg ctgcctcgac ctgctcggta 2640 aaacgggccc cagcacgtgc tacgagattt cgattccacc gccgccttct atgaaaggtt 2700 gggcttcgga atcgttttcc gggacgccgg ctggatgatc ctccagcgcg gggatctcat 2760 gctggagttc ttcgcccacc ccaacttgtt tattgcagct tataatggtt acaaataaag 2820 caatagcatc acaaatttca caaataaagc atttttttca ctgcattcta gttgtggttt 2880 gtccaaactc atcaatgtat cttatcatac atggtcgacc tgcaggaacc tgcattaatg 2940 aatcggccaa cgcgcgggga gaggcggttt gcgtattggg cgctcttccg cttcctcgct 3000 cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc 3060 ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg 3120 ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg 3180 cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg 3240 actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac 3300 cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca 3360 tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt 3420 gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc 3480 caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag 3540 agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac 3600 tagaagaaca gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt 3660 tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa 3720 gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg 3780 gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa 3840 aaggatcttc acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat 3900 atatgagtaa acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc 3960 gatctgtcta tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga taactacgat 4020 acgggagggc ttaccatctg gccccagtgc tgcaatgata ccgcgagacc cacgctcacc 4080 ggctccagat ttatcagcaa taaaccagcc agccggaagg gccgagcgca gaagtggtcc 4140 tgcaacttta tccgcctcca tccagtctat taattgttgc cgggaagcta gagtaagtag 4200 ttcgccagtt aatagtttgc gcaacgttgt tgccattgct acaggcatcg tggtgtcacg 4260 ctcgtcgttt ggtatggctt cattcagctc cggttcccaa cgatcaaggc gagttacatg 4320 atcccccatg ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg ttgtcagaag 4380 taagttggcc gcagtgttat cactcatggt tatggcagca ctgcataatt ctcttactgt 4440 catgccatcc gtaagatgct tttctgtgac tggtgagtac tcaaccaagt cattctgaga 4500 atagtgtatg cggcgaccga gttgctcttg cccggcgtca atacgggata ataccgcgcc 4560 acatagcaga actttaaaag tgctcatcat tggaaaacgt tcttcggggc gaaaactctc 4620 aaggatctta ccgctgttga gatccagttc gatgtaaccc actcgtgcac ccaactgatc 4680 ttcagcatct tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc 4740 cgcaaaaaag ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca 4800 atattattga agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat 4860 ttagaaaaat aaacaaatag gggttccgcg cacattccc cgaaaagtgc cacctgacgt 4920 ctaagaaacc attattatca tgacattaac ctataaaaat aggcgtatca cgaggccctt 4980 tcgtctcgcg cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac 5040 ggtcacagct tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc 5100 gggtgttggc gggtgtcggg gctggcttaa ctatgcggca tcagagcaga ttgtactgag 5160 agtgcaccaa gcttccaatt ttaggccccc cactgaccga ggtctgtcga taatccactt 5220 ttccattgat tttccaggtt tcgttaactc atgccactga gcaaaacttc ggtctttcct 5280 aacaaaagct ctcctcacaa agcatggcgc ggcaacggac gtgtcctcat actccactgc 5340 cacacaaggt cgataaacta agctcctcac aaatagagga gaattccact gacaactgaa 5400 aacaatgtat gagagacgat caccactgga gcggcgcggc ggttgggcgc ggaggtcggc 5460 agcaaaaaca agcgactcgc cgagcaaacc cgaatcagcc ttcagacggt cgtgcctaac 5520 aacacgccgt tctaccccgc cttcttcgcg ccccttcgcg tccaagcatc cttcaagttt 5580 atctctctag ttcaacttca agaagaacaa caccaccaac accatgatgc ctttgtctca 5640 agaagaatcc accctcattg aaagagcaac ggctacaatc aacagcatcc ccatctctga 5700 agactacagc gtcgccagcg cagctctctc tagcgacggc cgcatcttca ctggtgtcaa 5760 tgtatatcat tttactgggg gaccttgtgc agaactcgtg gtgctgggca ctgctgctgc 5820 tgcggcagct ggcaacctga cttgtatcgt cgcgatcgga aatgagaaca ggggcatctt 5880 gagcccctgt ggacggtgcc gacaggtgct tctcgatctg catcctggga tcaaagccat 5940 agtgaaggac agtgatggac agccgacggc agttgggatt cgtgaattgc tgccctctgg 6000 ttatgtgtgg gagggctaac acgtgctccg tgctacgaga tttcgattcc accgccgcct 6060 tctatgaaag gttgggcttc ggaatcgttt tccgggacgc cggctggatg atcctccagc 6120 gcggggatct catgctggag ttcttcgccc accccaactt gtttattgca gcttataatg 6180 gttacaaata aagcaatagc atcacaaatt tcacaaataa agcatttttt tcactgcatt 6240 ctagttgtgg tttgtccaaa ctcatcaatg tatcttatca tgtctgaatt cccggggtac 6300 <210> 37 <211> 6611 <212> DNA <213> Artificial Sequence <220> <223> pCL122 vector (6611 bp) <220> <221> misc_feature <222> (1)..(1064) <223> EF_1alpha_promoter <220> <221> misc_feature <222> (1070)..(1162) <223> Sec1_secretion_signal <220> <221> misc_feature <222> (1163)..(1900) <223> eGFP_gene <220> <221> misc_feature <222> (1920)..(2559) <223> OrfC_terminator <220> <221> misc_feature <222> (5094)..(5542) <223> Alpha_tubulin_promoter <220> <221> misc_feature <222> (5543)..(6337) <223> NPT_gene <220> <221> misc_feature <222> (6338)..(6611) <223> SV40_terminator <400> 37 ctcttatctg cctcgcgccg ttgaccgccg cttgactctt ggcgcttgcc gctcgcatcc 60 tgcctcgctc gcgcaggcgg gcgggcgagt gggtgggtcc gcagccttcc gcgctcgccc 120 gctagctcgc tcgcgccgtg ctgcagccag cagggcagca ccgcacggca ggcaggtccc 180 ggcgcggatc gatcgatcca tcgatccatc gatccatcga tcgtgcggtc aaaaagaaag 240 gaagaagaaa ggaaaaagaa aggcgtgcgc acccgagtgc gcgctgagcg cccgctcgcg 300 gtcccgcgga gcctccgcgt tagtccccgc cccgcgccgc gcagtccccc gggaggcatc 360 gcgcacctct cgccgccccc tcgcgcctcg ccgattcccc gcctcccctt ttccgcttct 420 tcgccgcctc cgctcgcggc cgcgtcgccc gcgccccgct ccctatctgc tccccagggg 480 ggcactccgc accttttgcg cccgctgccg ccgccgcggc cgccccgccg ccctggtttc 540 ccccgcgagc gcggccgcgt cgccgcgcaa agactcgccg cgtgccgccc cgagcaacgg 600 gtggcggcgg cgcggcggcg ggcggggcgc ggcggcgcgt aggcggggct aggcgccggc 660 taggcgaaac gccgcccccg ggcgccgccg ccgcccgctc cagagcagtc gccgcgccag 720 accgccaacg cagagaccga gaccgaggta cgtcgcgccc gagcacgccg cgacgcgcgg 780 cagggacgag gagcacgacg ccgcgccgcg ccgcgcgggg ggggggaggg agaggcagga 840 cgcgggagcg agcgtgcatg tttccgcgcg agacgacgcc gcgcgcgctg gagaggagat 900 aaggcgcttg gatcgcgaga gggccagcca ggctggaggc gaaaatgggt ggagaggata 960 gtatcttgcg tgcttggacg aggagactga cgaggaggac ggatacgtcg atgatgatgt 1020 gcacagagaa gaagcagttc gaaagcgact actagcaagc aagggatcca tgaagttcgc 1080 gacctcggtc gcaattttgc ttgtggccaa catagccacc gccctcgcgc agagcgatgg 1140 ctgcaccccc accgaccaga cgatggtgag caagggcgag gagctgttca ccggggtggt 1200 gcccatcctg gtcgagctgg acggcgacgt aaacggccac aagttcagcg tgtccggcga 1260 gggcgagggc gatgccacct acggcaagct gaccctgaag ttcatctgca ccaccggcaa 1320 gctgcccgtg ccctggccca ccctcgtgac caccctgacc tacggcgtgc agtgcttcag 1380 ccgctacccc gaccacatga agcagcacga cttcttcaag tccgccatgc ccgaaggcta 1440 cgtccaggag cgcaccatct tcttcaagga cgacggcaac tacaagaccc gcgccgaggt 1500 gaagttcgag ggcgacaccc tggtgaaccg catcgagctg aagggcatcg acttcaagga 1560 ggacggcaac atcctgggac acaagctgga gtacaactac aacagccaca acgtctatat 1620 catggccgac aagcagaaga acggcatcaa ggtgaacttc aagatccgcc acaacatcga 1680 ggacggcagc gtgcagctcg ccgaccacta ccagcagaac acccccatcg gcgacggccc 1740 cgtgctgctg cccgacaacc actacctgag cacccagtcc gccctgagca aagaccccaa 1800 cgagaagcgc gatcacatgg tcctgctgga gttcgtgacc gccgccggga tcactctcgg 1860 catggacgag ctgtacaagc accaccatca ccaccactaa catatgagtt atgagatccg 1920 aaagtgaacc ttgtcctaac ccgacagcga atggcgggag ggggcgggct aaaagatcgt 1980 attacatagt atttttcccc tactctttgt gtttgtcttt tttttttttt tgaacgcatt 2040 caagccactt gtctgggttt acttgtttgt ttgcttgctt gcttgcttgc ttgcctgctt 2100 cttggtcaga cggcccaaaa aagggaaaaa attcattcat ggcacagata agaaaaagaa 2160 aaagtttgtc gaccaccgtc atcagaaagc aagagaagag aaacactcgc gctcacattc 2220 tcgctcgcgt aagaatctta gccacgcata cgaagtaatt tgtccatctg gcgaatcttt 2280 acatgagcgt tttcaagctg gagcgtgaga tcataccttt cttgatcgta atgttccaac 2340 cttgcatagg cctcgttgcg atccgctagc aatgcgtcgt actcccgttg caactgcgcc 2400 atcgcctcat tgtgacgtga gttcagattc ttctcgagac cttcgagcgc tgctaatttc 2460 gcctgacgct ccttcttttg tgcttccatg acacgccgct tcaccgtgcg ttccacttct 2520 tcctcagaca tgcccttggc tgcctcgacc tgctcggtaa aacgggcccc agcacgtgct 2580 acgagatttc gattccaccg ccgccttcta tgaaaggttg ggcttcggaa tcgttttccg 2640 ggacgccggc tggatgatcc tccagcgcgg ggatctcatg ctggagttct tcgcccaccc 2700 caacttgttt attgcagctt ataatggtta caaataaagc aatagcatca caaatttcac 2760 aaataaagca tttttttcac tgcattctag ttgtggtttg tccaaactca tcaatgtatc 2820 ttatcataca tggtcgacct gcaggaacct gcattaatga atcggccaac gcgcggggag 2880 aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 2940 cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 3000 atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 3060 taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 3120 aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 3180 tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 3240 gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 3300 cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 3360 cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 3420 atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 3480 tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat 3540 ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 3600 acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 3660 aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 3720 aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 3780 tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 3840 cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 3900 catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 3960 ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat 4020 aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat 4080 ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg 4140 caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 4200 attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 4260 agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 4320 actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 4380 ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 4440 ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt 4500 gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 4560 atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 4620 cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 4680 gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 4740 gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 4800 ggttccgcgc acattcccc gaaaagtgcc acctgacgtc taagaaacca ttattatcat 4860 gacattaacc tataaaaata ggcgtatcac gaggcccttt cgtctcgcgc gtttcggtga 4920 tgacggtgaa aacctctgac acatgcagct cccggagacg gtcacagctt gtctgtaagc 4980 ggatgccggg agcagacaag cccgtcaggg cgcgtcagcg ggtgttggcg ggtgtcgggg 5040 ctggcttaac tatgcggcat cagagcagat tgtactgaga gtgcaccaag cttccaattt 5100 taggcccccc actgaccgag gtctgtcgat aatccacttt tccattgatt ttccaggttt 5160 cgttaactca tgccactgag caaaacttcg gtctttccta acaaaagctc tcctcacaaa 5220 gcatggcgcg gcaacggacg tgtcctcata ctccactgcc acacaaggtc gataaactaa 5280 gctcctcaca aatagaggag aattccactg acaactgaaa acaatgtatg agagacgatc 5340 accactggag cggcgcggcg gttgggcgcg gaggtcggca gcaaaaacaa gcgactcgcc 5400 gagcaaaccc gaatcagcct tcagacggtc gtgcctaaca acacgccgtt ctaccccgcc 5460 ttcttcgcgc cccttcgcgt ccaagcatcc ttcaagttta tctctctagt tcaacttcaa 5520 gaagaacaac accaccaaca ccatgattga acaagatgga ttgcaccgcag gttctccggc 5580 cgcttgggtg gagaggctat tcggctatga ctgggcacaa cagacaatcg gctgctctga 5640 tgccgccgtg ttccggctgt cagcgcaggg gcgcccggtt ctttttgtca agaccgacct 5700 gtccggtgcc ctgaatgaac tgcaggacga ggcagcgcgg ctatcgtggc tggccacgac 5760 gggcgttcct tgcgcagctg tgctcgacgt tgtcactgaa gcgggaaggg actggctgct 5820 attgggcgaa gtgccggggc aggatctcct gtcatctcac cttgctcctg ccgagaaagt 5880 atccatcatg gctgatgcaa tgcggcggct gcatacgctt gatccggcta cctgcccatt 5940 cgaccaccaa gcgaaacatc gcatcgagcg agcacgtact cggatggaag ccggtcttgt 6000 cgatcaggat gatctggacg aagagcatca ggggctcgcg ccagccgaac tgttcgccag 6060 gctcaaggcg cgcatgcccg acggcgatga tctcgtcgtg acccatggcg atgcctgctt 6120 gccgaatatc atggtggaaa atggccgctt ttctggattc atcgactgtg gccggctggg 6180 tgtggcggac cgctatcagg acatagcgtt ggctacccgt gatattgctg aagagcttgg 6240 cggcgaatgg gctgaccgct tcctcgtgct ttacggtatc gccgctcccg attcgcagcg 6300 catcgccttc tatcgccttc ttgacgagtt cttctgacac gtgctacgag atttcgattc 6360 caccgccgcc ttctatgaaa ggttgggctt cggaatcgtt ttccgggacg ccggctggat 6420 gatcctccag cgcggggatc tcatgctgga gttcttcgcc caccccaact tgtttattgc 6480 agcttataat ggttacaaat aaagcaatag catcacaaat ttcacaaata aagcattttt 6540 ttcactgcat tctagttgtg gtttgtccaa actcatcaat gtatcttatc atgtctgaat 6600 tcccggggta c 6611 <210> 38 <211> 32 <212> DNA <213> Artificial Sequence <220> <223> pYB66 Bambgl F primer <400> 38 caagggatcc agatcttccg cactgatgag tc 32 <210> 39 <211> 23 <212> DNA <213> Artificial Sequence <220> <223> pYB66 Nde R primer <400> 39 aactcatatg gtcccattcg cca 23 <210> 40 <211> 25 <212> DNA <213> Artificial Sequence <220> <223> pYB66 EF1seq F primer <400> 40 gagaggatag tatcttgcgt gcttg 25 <210> 41 <211> 25 <212> DNA <213> Artificial Sequence <220> <223> pCL122 OrfC R primer <400> 41 gcaaggttgg aacattacga tcaag 25 <210> 42 <211> 41 <212> DNA <213> Artificial Sequence <220> <223> pYB73 gRNA Pst Kpn IF F primer <400> 42 catacatggt cgacctgcag ggtacctctt atctgcctcg c 41 <210> 43 <211> 45 <212> DNA <213> Artificial Sequence <220> <223> pYB73 gRNA Xho Pst IF R primer <400> 43 attaatgcag gttcctgcag ctcgagaaga atctgaactc acgtc 45 <210> 44 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> pYB73 seq F primer <400> 44 caccccaact tgtttattgc ag 22 <210> 45 <211> 19 <212> DNA <213> Artificial Sequence <220> <223> pYB73 seq R primer <400> 45 gagcgaggaa gcggaagag 19 <210> 46 <211> 26 <212> DNA <213> Artificial Sequence <220> <223> pYB13 pYB1 seq F primer <400> 46 gagaggatag tatcttgcgt gcttgg 26 <210> 47 <211> 18 <212> DNA <213> Artificial Sequence <220> <223> TT pYB73 HDV R primer <400> 47 gaagcatgtt gcccagcc 18

Claims (25)

안내-폴리뉴클레오타이드 및 Cas 단백질을 포함하는 CRISPR-Cas 시스템의 소스를 포함하는 비-천연 또는 조작된 조성물로서, 상기 안내-폴리뉴클레오타이드가 근본적으로 숙주 세포 중의 표적-폴리뉴클레오타이드의 역 보체인 안내-서열을 포함하고, 상기 안내-폴리뉴클레오타이드가 상기 숙주 세포 중의 표적-폴리뉴클레오타이드에서 상기 Cas 단백질의 결합을 지시하여 CRISPR-Cas 복합체를 형성시킬 수 있으며, 상기 안내-서열이 근본적으로 상기 숙주 세포의 게놈 중 5'-(N)yPAM-3' 폴리뉴클레오타이드 서열 표적의 (N)y 부분의 역 보체이고, y가 8 내지 30의 정수이며, PAM이 프로토스페이서 인접 모티프이고, 상기 숙주 세포가 라비린툴로마이세트(Labyrinthulomycete)이며, PAM이 5'-XGG-3', 5'-XGGXG-3', 5'-XXAGAAW-3', 5'-XXXXGATT-3', 5'-XXAGAA-3', 5'-XAAAAC-3'로 이루어지는 그룹 중에서 선택된 서열이고, X가 A, C, G 또는 T 또는 그의 유사체이며; W가 A 또는 T이고, 상기 Cas 단백질이 벡터 중에 포함된 폴리뉴클레오타이드에 의해 암호화되고, 상기 안내-폴리뉴클레오타이드가 또 다른 벡터 중에 포함된 또 다른 폴리뉴클레오타이드에 의해 암호화되거나 상기 또 다른 폴리뉴클레오타이드상에 존재하고, 상기 Cas 단백질을 암호화하는 벡터가 중간 강도 프로모터에 의해 구동되고, 상기 안내-폴리뉴클레오타이드를 암호화하는 벡터가 고강도 프로모터에 의해 구동되는
조성물.
A non-natural or engineered composition comprising a source of a CRISPR-Cas system comprising a guide-polynucleotide and a Cas protein, wherein the guide-polynucleotide is essentially a guide-sequence that is the reverse complement of a target-polynucleotide in a host cell. wherein the guide-polynucleotide is capable of directing binding of the Cas protein at a target-polynucleotide in the host cell to form a CRISPR-Cas complex, wherein the guide-sequence is essentially in the genome of the host cell 5'-(N)yPAM-3' is the reverse complement of the (N)y portion of the polynucleotide sequence target, y is an integer from 8 to 30, PAM is a protospacer adjacent motif, and wherein the host cell is Labyrinthulomycete, with PAM of 5'-XGG-3', 5'-XGGXG-3', 5'-XXAGAAW-3', 5'-XXXXGATT-3', 5'-XXAGAA-3', 5 a sequence selected from the group consisting of '-XAAAAC-3', and X is A, C, G or T or an analog thereof; W is A or T, the Cas protein is encoded by a polynucleotide contained in a vector, and the guide-polynucleotide is encoded by or on another polynucleotide contained in another vector and the vector encoding the Cas protein is driven by a medium-strength promoter, and the vector encoding the guide-polynucleotide is driven by a high-strength promoter.
composition.
삭제delete 삭제delete 제 1 항에 있어서,
안내-폴리뉴클레오타이드가, 전사되어 실제 안내-폴리뉴클레오타이드를 제공하는 폴리뉴클레오타이드에 의해 암호화되는 조성물.
The method of claim 1,
A composition wherein the guide-polynucleotide is encoded by a polynucleotide that is transcribed to provide the actual guide-polynucleotide.
삭제delete 제 4 항에 있어서,
Cas 단백질을 암호화하는 벡터가 저강도 프로모터에 의해 구동되는 조성물.
5. The method of claim 4,
A composition wherein a vector encoding a Cas protein is driven by a low-strength promoter.
제 4 항에 있어서,
하나 이상의 또는 모든 벡터가 선택성 마커를 포함하는 조성물.
5. The method of claim 4,
A composition wherein one or more or all vectors comprise a selectable marker.
제 7 항에 있어서,
하나 이상의 외인성 폴리뉴클레오타이드가 안내-폴리뉴클레오타이드에 작동적으로 연결되는 조성물.
8. The method of claim 7,
A composition wherein one or more exogenous polynucleotides are operatively linked to an intra-polynucleotide.
제 8 항에 있어서,
Cas 단백질이 하나 이상의 핵 국소화 서열을 포함하는 조성물.
9. The method of claim 8,
A composition wherein the Cas protein comprises one or more nuclear localization sequences.
제 9 항에 있어서,
Cas 단백질이 표적-서열의 위치에서 2개의 폴리뉴클레오타이드 가닥 모두의 절단을 지시하는 활성을 갖는 조성물.
10. The method of claim 9,
A composition wherein the Cas protein has the activity to direct cleavage of both polynucleotide strands at the position of the target-sequence.
제 1 항에 있어서,
Cas 단백질 암호화 폴리뉴클레오타이드가 숙주 세포에 대해 코돈 최적화된 조성물.
The method of claim 1,
A composition wherein the Cas protein encoding polynucleotide is codon-optimized for the host cell.
제 8 항에 있어서,
고강도 프로모터가 하기 프로모터 중 하나인 조성물:
라비린툴로마이세트 EF-1 프로모터, 아르기나제 프로모터, 피루베이트 키나제 프로모터, 열 충격 단백질 프로모터 및 글리세르알데하이드 3-포스페이트 데하이드로게나제 프로모터.
9. The method of claim 8,
A composition wherein the high-strength promoter is one of the following promoters:
Labyrinthulomyset EF-1 promoter, arginase promoter, pyruvate kinase promoter, heat shock protein promoter and glyceraldehyde 3-phosphate dehydrogenase promoter.
제 1 항, 제 4 항 및 제 7 항 내지 제 12 항 중 어느 한 항에 있어서,
중강도 프로모터가 하기 프로모터 중 하나인 조성물:
알파 튜불린 프로모터, 아이소시트레이트 라이아제 프로모터, 아코니테이트 하이드라타제 2 프로모터, 말레이트 데하이드로게나제 프로모터 및 액포 ATP 신타제 서브유닛 D 프로모터.
13. The method according to any one of claims 1, 4 and 7 to 12,
A composition wherein the medium-strength promoter is one of the following promoters:
Alpha tubulin promoter, isocitrate lyase promoter, aconitate hydratase 2 promoter, malate dehydrogenase promoter and vacuolar ATP synthase subunit D promoter.
세포에서 폴리뉴클레오타이드의 발현을 조절하는 방법으로서, 숙주 세포를 제 1 항에 따른 조성물과 접촉시킴을 포함하며, 안내-폴리뉴클레오타이드가 상기 숙주 세포 중의 표적-폴리뉴클레오타이드에서 Cas 단백질의 결합을 지시하여 CRISPR-Cas 복합체를 형성시키는 방법.A method of modulating the expression of a polynucleotide in a cell, comprising contacting a host cell with a composition according to claim 1 , wherein the guide-polynucleotide directs binding of a Cas protein at a target-polynucleotide in said host cell to CRISPR How to form the -Cas complex. 제 14 항에 있어서,
숙주 세포가 관심 화합물을 암호화하는 폴리뉴클레오타이드를 포함하는 방법.
15. The method of claim 14,
A method wherein the host cell comprises a polynucleotide encoding a compound of interest.
제 14 항에 있어서,
숙주 세포가 재조합 숙주 세포인 방법.
15. The method of claim 14,
A method wherein the host cell is a recombinant host cell.
제 1 항, 제 4 항 및 제 7 항 내지 제 12 항 중 어느 한 항에 따른 조성물을 포함하는 숙주 세포.13. A host cell comprising a composition according to any one of claims 1, 4 and 7 to 12. 숙주 세포의 생산 방법으로서, 숙주 세포를 제 1 항에 따른 조성물과 접촉시킴을 포함하며, 안내-폴리뉴클레오타이드가 상기 숙주 세포 중의 표적-폴리뉴클레오타이드에서 Cas 단백질의 결합을 지시하여 CRISPR-Cas 복합체를 형성시키는 방법.A method of producing a host cell, comprising contacting the host cell with a composition according to claim 1 , wherein the guide-polynucleotide directs binding of a Cas protein at a target-polynucleotide in said host cell to form a CRISPR-Cas complex how to do it. 제 18 항에 있어서,
숙주 세포를 먼저 Cas 단백질의 소스와 접촉시키고 후속적으로 안내-폴리뉴클레오타이드의 소스와 접촉시키는 방법.
19. The method of claim 18,
A method of contacting a host cell first with a source of Cas protein and subsequently with a source of guide-polynucleotides.
제 19 항에 있어서,
숙주 세포가 관심 화합물을 암호화하는 폴리뉴클레오타이드를 포함하는, 방법.
20. The method of claim 19,
A method, wherein the host cell comprises a polynucleotide encoding a compound of interest.
제 17 항에 있어서,
숙주 세포가 관심 화합물을 암호화하는 폴리뉴클레오타이드를 포함하는, 숙주 세포.
18. The method of claim 17,
A host cell comprising a polynucleotide encoding a compound of interest.
관심 화합물의 생성 방법으로서, 상기 관심 화합물의 생성에 도움이 되는 조건하에서 제 1 항, 제 4 항 및 제 7 항 내지 제 12 항 중 어느 한 항에 따른 조성물을 포함하는 숙주 세포를 배양함을 포함하는 방법.13. A method for producing a compound of interest, comprising culturing a host cell comprising a composition according to any one of claims 1, 4 and 7-12 under conditions conducive to the production of said compound of interest. How to. 제 14 항 내지 제 16 항 및 제 18 항 내지 제 20 항 중 어느 한 항에 있어서,
Cas 단백질이 Cas9 단백질인 방법.
21. The method according to any one of claims 14 to 16 and 18 to 20,
A method wherein the Cas protein is a Cas9 protein.
삭제delete 삭제delete
KR1020197000858A 2016-07-13 2017-07-13 CRISPR-CAS system for avian host cells KR102319845B1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201662361741P 2016-07-13 2016-07-13
US62/361,741 2016-07-13
PCT/US2017/041949 WO2018013821A1 (en) 2016-07-13 2017-07-13 A crispr-cas system for an algal host cell

Publications (2)

Publication Number Publication Date
KR20190025910A KR20190025910A (en) 2019-03-12
KR102319845B1 true KR102319845B1 (en) 2021-11-01

Family

ID=60952752

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020197000858A KR102319845B1 (en) 2016-07-13 2017-07-13 CRISPR-CAS system for avian host cells

Country Status (7)

Country Link
US (2) US11466269B2 (en)
EP (1) EP3484994A4 (en)
JP (2) JP2019520069A (en)
KR (1) KR102319845B1 (en)
CN (1) CN109689856A (en)
BR (1) BR112019000430A2 (en)
WO (1) WO2018013821A1 (en)

Families Citing this family (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10323236B2 (en) 2011-07-22 2019-06-18 President And Fellows Of Harvard College Evaluation and improvement of nuclease cleavage specificity
US9163284B2 (en) 2013-08-09 2015-10-20 President And Fellows Of Harvard College Methods for identifying a target site of a Cas9 nuclease
US9359599B2 (en) 2013-08-22 2016-06-07 President And Fellows Of Harvard College Engineered transcription activator-like effector (TALE) domains and uses thereof
US9340799B2 (en) 2013-09-06 2016-05-17 President And Fellows Of Harvard College MRNA-sensing switchable gRNAs
US9737604B2 (en) 2013-09-06 2017-08-22 President And Fellows Of Harvard College Use of cationic lipids to deliver CAS9
US9322037B2 (en) 2013-09-06 2016-04-26 President And Fellows Of Harvard College Cas9-FokI fusion proteins and uses thereof
US9068179B1 (en) 2013-12-12 2015-06-30 President And Fellows Of Harvard College Methods for correcting presenilin point mutations
AU2015298571B2 (en) 2014-07-30 2020-09-03 President And Fellows Of Harvard College Cas9 proteins including ligand-dependent inteins
WO2016110512A1 (en) * 2015-01-06 2016-07-14 Dsm Ip Assets B.V. A crispr-cas system for a yeast host cell
IL258821B (en) 2015-10-23 2022-07-01 Harvard College Nucleobase editors and uses thereof
IL264565B1 (en) 2016-08-03 2024-03-01 Harvard College Adenosine nucleobase editors and uses thereof
CA3033327A1 (en) 2016-08-09 2018-02-15 President And Fellows Of Harvard College Programmable cas9-recombinase fusion proteins and uses thereof
WO2018039438A1 (en) 2016-08-24 2018-03-01 President And Fellows Of Harvard College Incorporation of unnatural amino acids into proteins using base editing
JP2019530464A (en) 2016-10-14 2019-10-24 プレジデント アンド フェローズ オブ ハーバード カレッジ Nucleobase editor AAV delivery
US10745677B2 (en) 2016-12-23 2020-08-18 President And Fellows Of Harvard College Editing of CCR5 receptor gene to protect against HIV infection
US11898179B2 (en) 2017-03-09 2024-02-13 President And Fellows Of Harvard College Suppression of pain by gene editing
CN110914310A (en) 2017-03-10 2020-03-24 哈佛大学的校长及成员们 Cytosine to guanine base editor
WO2018176009A1 (en) 2017-03-23 2018-09-27 President And Fellows Of Harvard College Nucleobase editors comprising nucleic acid programmable dna binding proteins
US11560566B2 (en) 2017-05-12 2023-01-24 President And Fellows Of Harvard College Aptazyme-embedded guide RNAs for use with CRISPR-Cas9 in genome editing and transcriptional activation
WO2019023680A1 (en) 2017-07-28 2019-01-31 President And Fellows Of Harvard College Methods and compositions for evolving base editors using phage-assisted continuous evolution (pace)
WO2019139645A2 (en) 2017-08-30 2019-07-18 President And Fellows Of Harvard College High efficiency base editors comprising gam
WO2019079347A1 (en) 2017-10-16 2019-04-25 The Broad Institute, Inc. Uses of adenosine base editors
WO2020191239A1 (en) 2019-03-19 2020-09-24 The Broad Institute, Inc. Methods and compositions for editing nucleotide sequences
CN110747223A (en) * 2019-11-29 2020-02-04 中国科学院海洋研究所 Method for silencing functional genes of laver and application thereof
DE112021002672T5 (en) 2020-05-08 2023-04-13 President And Fellows Of Harvard College METHODS AND COMPOSITIONS FOR EDIT BOTH STRANDS SIMULTANEOUSLY OF A DOUBLE STRANDED NUCLEOTIDE TARGET SEQUENCE

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016109840A2 (en) * 2014-12-31 2016-07-07 Synthetic Genomics, Inc. Compositions and methods for high efficiency in vivo genome editing

Family Cites Families (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DK0979294T3 (en) 1997-04-11 2015-08-31 Dsm Ip Assets Bv Gene conversion AS A TOOL FOR CONSTRUCTION OF RECOMBINANT INDUSTRIAL filamentous fungi
US8003772B2 (en) 1999-01-14 2011-08-23 Martek Biosciences Corporation Chimeric PUFA polyketide synthase systems and uses thereof
TWI324181B (en) 2001-04-16 2010-05-01 Martek Biosciences Corp Product and process for transformation of thraustochytriales microorganisms
EP2410048B1 (en) 2005-01-24 2016-08-10 DSM IP Assets B.V. Method for producing a compound of interest in a filamentous fungal cell
JP4796786B2 (en) * 2005-04-28 2011-10-19 富士フイルム株式会社 A vector capable of transforming Labyrinthula
CN101490262B (en) 2006-06-29 2012-09-26 帝斯曼知识产权资产管理有限公司 A method for achieving improved polypeptide expression
CN102741421B (en) 2009-03-10 2015-03-25 帝斯曼知识产权资产管理有限公司 Method for improving the yield of a polypeptide
AU2010225930B2 (en) 2009-03-16 2017-01-12 Dsm Ip Assets B.V. Protein production in microorganisms of the phylum Labyrinthulomycota
CN104988078B (en) 2009-03-19 2021-07-30 帝斯曼知识产权资产管理有限公司 Thraustochytrid, fatty acid composition and preparation method and application thereof
CA2755639C (en) 2009-03-19 2018-09-25 Martek Biosciences Corporation Polyunsaturated fatty acid synthase nucleic acid molecules and polypeptides, compositions, and methods of making and uses thereof
CA2758404A1 (en) 2009-04-22 2010-10-28 Dsm Ip Assets B.V. Process for the production of a recombinant polypeptide of interest
US8889394B2 (en) 2009-09-07 2014-11-18 Empire Technology Development Llc Multiple domain proteins
JP6517143B2 (en) * 2012-10-23 2019-05-22 ツールゲン インコーポレイテッド Composition for cleaving target DNA comprising guide RNA specific for target DNA and CAS protein encoding nucleic acid or CAS protein, and use thereof
US9914931B2 (en) 2012-12-07 2018-03-13 Synthetic Genomics, Inc. Nannochloropsis spliced leader sequences and uses therefor
JP6670743B2 (en) 2013-05-29 2020-03-25 セレクティスCellectis Novel compact CAS9 scaffold in type II CRISPR system
SG11201600115SA (en) 2013-07-10 2016-02-26 Novartis Ag Multiple proteases deficient filamentous fungal cells and methods of use thereof
EP3760719A1 (en) * 2013-11-18 2021-01-06 CRISPR Therapeutics AG Crispr-cas system materials and methods
WO2015086798A2 (en) * 2013-12-13 2015-06-18 Cellectis New method of selection of algal-transformed cells using nuclease
AU2014363476A1 (en) * 2013-12-13 2016-06-23 Cellectis Cas9 nuclease platform for microalgae genome engineering
AU2015263881B2 (en) * 2014-05-22 2020-06-18 Synthetic Genomics, Inc. Labyrinthulomycete strains for producing docosahexaenoic acid
CA2966731C (en) * 2014-11-06 2023-01-31 E. I. Du Pont De Nemours And Company Peptide-mediated delivery of rna-guided endonuclease into cells
WO2016197136A2 (en) 2015-06-04 2016-12-08 Nmc, Inc. Improved productivity and bioproduct formation in phototropin knock/out mutants in microalgae
JP6937740B2 (en) * 2015-07-28 2021-09-22 ダニスコ・ユーエス・インク Genome editing system and usage
CN105238806B (en) 2015-11-02 2018-11-27 中国科学院天津工业生物技术研究所 A kind of building and its application of the CRISPR/Cas9 gene editing carrier for microorganism
EP3412765B1 (en) * 2016-02-04 2023-01-18 Kao Corporation Method for producing mutant filamentous fungi

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016109840A2 (en) * 2014-12-31 2016-07-07 Synthetic Genomics, Inc. Compositions and methods for high efficiency in vivo genome editing

Also Published As

Publication number Publication date
EP3484994A1 (en) 2019-05-22
CN109689856A (en) 2019-04-26
BR112019000430A2 (en) 2019-07-09
WO2018013821A1 (en) 2018-01-18
JP2022116027A (en) 2022-08-09
US20220380759A1 (en) 2022-12-01
KR20190025910A (en) 2019-03-12
US20200131511A1 (en) 2020-04-30
JP2019520069A (en) 2019-07-18
US11466269B2 (en) 2022-10-11
EP3484994A4 (en) 2020-01-22

Similar Documents

Publication Publication Date Title
KR102319845B1 (en) CRISPR-CAS system for avian host cells
AU2019203955C1 (en) Multipartite signaling proteins and uses thereof
RU2763170C2 (en) Production of human milk oligosaccharides in host microorganisms with modified import/export
DK2087105T3 (en) DELTA 17 DESATURASE AND ITS USE IN THE MANUFACTURE OF MULTI-Saturated FAT ACIDS
AU2018229561A1 (en) Recombinant adenoviruses and use thereof
KR102494564B1 (en) Malaria vaccine
US20030119104A1 (en) Chromosome-based platforms
KR20180048743A (en) 2A &amp;lt; / RTI &amp;gt; peptide.
CN101646766B (en) Detal 17 desaturases and use thereof in making polyunsaturated fatty acids
KR20200022486A (en) Engineered and fully-functional custom glycoproteins
CA3109035A1 (en) Microorganisms engineered to use unconventional sources of nitrogen
KR20210108423A (en) Adeno-associated virus (AAV) producer cell lines and related methods
CN109996874A (en) The heterologous of 10-methylstearic acid generates
CN115927299A (en) Methods and compositions for increasing double-stranded RNA production
KR20210105382A (en) RNA encoding protein
CN111094569A (en) Light-controlled viral protein, gene thereof, and viral vector containing same
CN109843909B (en) Cells and methods for producing rhamnolipids using alternative glucose transporters
CN115698297A (en) Preparation method of multi-module biosynthetic enzyme gene combined library
CN114026242A (en) AAV vector having myelin protein zero promoter and use thereof for treating Schwann cell-related diseases such as CHARCOT-MARIE-TOOTH disease
DK2935601T3 (en) RECOMBINANT MICROBELL CELLS PRODUCING AT LEAST 28% EICOSAPENTAIC ACID AS DRY WEIGHT
AU2017252409A1 (en) Compositions and methods for nucleic acid expression and protein secretion in bacteroides
KR20240021906A (en) Expression vectors, bacterial sequence-free vectors, and methods of making and using the same
KR20230054840A (en) Stabilized cell lines for directed production of rAAV virions
KR20220116485A (en) Improved genome editing using paired nickases
CN112209883B (en) Fluorescein dye specifically combined with RNA and application thereof

Legal Events

Date Code Title Description
A201 Request for examination
E902 Notification of reason for refusal
E701 Decision to grant or registration of patent right
GRNT Written decision to grant