KR20210023646A - A novel nicotine biosynthetic gene and use thereof - Google Patents

A novel nicotine biosynthetic gene and use thereof

Info

Publication number
KR20210023646A
Authority
KR
South Korea
Prior art keywords
gene
pmt1b
nucleotide sequence
seq
sequence
Prior art date
Application number
KR1020190175480A
Other languages
Korean (ko)
Other versions
KR102389358B1 (en)
Inventor
서효석
이영기
이정헌
김광철
나웅현
오경환
전은영
Original Assignee
주식회사 케이티앤지
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 주식회사 케이티앤지
Publication of KR20210023646A
Application granted
Publication of KR102389358B1

Classifications

    • A HUMAN NECESSITIES
    • A24 TOBACCO; CIGARS; CIGARETTES; SIMULATED SMOKING DEVICES; SMOKERS' REQUISITES
    • A24B MANUFACTURE OR PREPARATION OF TOBACCO FOR SMOKING OR CHEWING; TOBACCO; SNUFF
    • A24B15/00 Chemical features or treatment of tobacco; Tobacco substitutes, e.g. in liquid form
    • A24B15/18 Treatment of tobacco products or tobacco substitutes
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00 Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09 Recombinant DNA-technology
    • C12N15/63 Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79 Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82 Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241 Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8242 Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
    • C12N15/8243 Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00 Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10 Transferases (2.)
    • C12N9/1003 Transferases (2.) transferring one-carbon groups (2.1)
    • C12N9/1007 Methyltransferases (general) (2.1.1.)
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Y ENZYMES
    • C12Y201/00 Transferases transferring one-carbon groups (2.1)
    • C12Y201/01 Methyltransferases (2.1.1)
    • C12Y201/01053 Putrescine N-methyltransferase (2.1.1.53)

Abstract

The present invention relates to a novel putrescine N-methyltransferase 1b (PMT1b) gene consisting of the nucleotide sequence of SEQ ID NO: 1, and uses thereof. When expression of this gene is suppressed together with the PMT1a, PMT2, PMT3, and PMT4 genes, nicotine biosynthesis in plants can be inhibited, so plant cells with little or no nicotine content can be produced.

Description

A NOVEL NICOTINE BIOSYNTHETIC GENE AND USE THEREOF

The present disclosure relates to a novel nicotine biosynthetic gene and uses thereof.

In Korea, the male smoking rate has declined from 68.9% in 1992 to 44.1% in 2006 and 40.7% in 2016, but it remains well above the 23.9% average of major developed and OECD countries. The government is pursuing active tobacco control policies such as raising tobacco taxes and expanding non-smoking areas. In 2017 the US Food and Drug Administration (FDA) gave advance notice of a legislative proposal centered on "lowering the nicotine content of cigarettes to reduce or eliminate their addictiveness". In addition, reflecting the management measures of the WHO Framework Convention on Tobacco Control as a means of ensuring and promoting health toward achieving the UN Sustainable Development Goals by 2030, the development of low-nicotine tobacco varieties was needed.

Accordingly, methods that lower nicotine content by using gene editing technology to suppress the function of genes involved in nicotine biosynthesis in tobacco plants have gained importance. However, among the known nicotine biosynthetic PMT genes, the nucleotide sequences of PMT1a, PMT2, PMT3, and PMT4 had been determined, whereas the nucleotide sequence of the PMT1b gene had not been fully determined. As a result, the function of the genes could not be suppressed 100%, and trace amounts of nicotine were still detected.

To ensure that not even a trace amount of nicotine is detected, the nucleotide sequence of the PMT1b gene must be determined in addition to those of the PMT1a, PMT2, PMT3, and PMT4 genes among the nicotine biosynthetic genes. On this basis, the present invention was completed.

Korean Patent Application No. 10-2013-7031875

One aspect provides a PMT1b (putrescine N-methyltransferase 1b) gene consisting of the nucleotide sequence of SEQ ID NO: 1.

Another aspect provides a plant cell genetically engineered so that the expression or activity of the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1, or of the protein expressed by the PMT1b gene, is reduced compared to a parent cell.

Another aspect provides a plant comprising the plant cell.

Another aspect provides a method for inhibiting nicotine biosynthesis in a plant, comprising genetically engineering the plant so that the expression or activity of the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1, or of the protein expressed by the PMT1b gene, is reduced compared to a parent cell.

Another aspect provides a method for producing a plant cell in which nicotine biosynthesis is suppressed, comprising genetically engineering the cell so that the expression or activity of the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1, or of the protein expressed by the PMT1b gene, is reduced compared to a parent cell.

One aspect relates to a PMT1b (putrescine N-methyltransferase 1b) gene consisting of the nucleotide sequence of SEQ ID NO: 1.

Nicotine, the representative alkaloid of tobacco, is biosynthesized in the roots, as are tropane alkaloids. Putrescine, biosynthesized from ornithine and arginine, is converted to N-methylputrescine, oxidized by diamine oxidase, and passes through 4-methylaminobutanol to become the 1-methyl-Δ1-pyrrolinium cation; nicotine is then biosynthesized via nicotinic acid (see FIG. 1). Among the enzymes involved, "putrescine N-methyltransferase (hereinafter PMT)" is a very important enzyme in nicotine biosynthesis and can be regarded as the rate-limiting enzyme of the pathway. In one embodiment, the gene may be a nicotine biosynthetic gene.
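The pathway described above can be pictured as a simple ordered chain. The following Python sketch is only an illustration under simplifying assumptions: it treats the pathway as strictly linear and treats a blocked enzyme as a complete block. The names `STEPS` and `products_formed` are hypothetical, and enzymes that the text does not name are marked "unspecified" rather than guessed.

```python
# Linear steps as listed in the paragraph above. Enzyme names that the text
# does not give are marked "unspecified" instead of being filled in.
STEPS = [
    ("ornithine / arginine", "putrescine", "unspecified"),
    ("putrescine", "N-methylputrescine", "PMT"),
    ("N-methylputrescine", "4-methylaminobutanol", "diamine oxidase"),
    ("4-methylaminobutanol", "1-methyl-Δ1-pyrrolinium cation", "unspecified"),
    ("1-methyl-Δ1-pyrrolinium cation", "nicotine", "unspecified"),
]


def products_formed(blocked_enzymes):
    """Return the products formed before the first blocked step.

    Assumption: a blocked enzyme stops the chain completely, which is a
    qualitative simplification of a real metabolic pathway.
    """
    formed = []
    for _substrate, product, enzyme in STEPS:
        if enzyme in blocked_enzymes:
            break
        formed.append(product)
    return formed


if __name__ == "__main__":
    print(products_formed(set()))      # full chain, ends with nicotine
    print(products_formed({"PMT"}))    # chain stops after putrescine
```

In this toy model, blocking PMT leaves putrescine as the last product formed, which mirrors why the text singles out PMT as the step to suppress.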

The PMT1b gene, together with PMT1a, PMT2, PMT3, and PMT4, which were previously identified as PMT genes, is a gene that expresses part of the PMT protein. Its specific sequence is disclosed for the first time in the present specification, and it may specifically be a PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1.

Another aspect relates to a plant cell genetically engineered so that the expression or activity of the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1, or of the protein expressed by the PMT1b gene, is reduced compared to a parent cell.

The "parent cell" is a cell that has not been artificially manipulated to reduce the expression or activity of the PMT1b gene, or of the protein expressed by the PMT1b gene, according to one aspect, and refers to a cell freshly isolated from the human body and a cell cultured therefrom.

In addition, the terms "genetic engineering" and "genetically engineered" as used herein refer to the act of introducing one or more genetic modifications into a cell, or to a cell produced thereby.

The genetic manipulation may be induced as a modification in the nucleic acid sequence of the PMT1b gene by a physical method. The physical method may be, for example, X-ray irradiation or gamma-ray irradiation.

The genetic manipulation may also be induced, by a chemical method, as a modification in the nucleic acid sequence of the PMT1b gene or as a change in expression of the gene. The chemical method may be, for example, ethyl methanesulfonate treatment or dimethyl sulfate treatment.

The genetic manipulation may also be induced as a modification in the nucleic acid sequence of the PMT1b gene by a gene editing system. The gene editing system may be, for example, a meganuclease system, a zinc finger nuclease system, a TALEN (transcription activator-like effector nuclease) system, or a CRISPR/Cas9 system.

The genetic manipulation may also be induced as a change in gene expression by an RNA interference (RNAi) system that binds to mRNA transcribed from the PMT1b gene.

The genetic manipulation may be performed by at least one selected from the group consisting of an RNA interference (RNAi) system, a meganuclease system, a zinc finger nuclease system, a TALEN (transcription activator-like effector nuclease) system, a CRISPR/Cas9 system, X-ray irradiation, gamma-ray irradiation, ethyl methanesulfonate treatment, and dimethyl sulfate treatment, and specifically by a CRISPR/Cas9 system. The genetic manipulation may be induced as a modification in the nucleic acid sequence of the PMT1b gene.

Accordingly, in the plant cell according to one aspect, the CRISPR/Cas9 system is delivered into the plant cell to inactivate the PMT1b gene, which participates in nicotine biosynthesis, so that a modified PMT1b protein is expressed and nicotine biosynthesis can be inhibited. The biosynthesis of nornicotine and anatabine can also be inhibited.

The terms "reduced expression or activity" of the PMT1b gene or of the protein expressed by the PMT1b gene, and "inactivation" of the PMT1b gene, mean that in the genetically engineered plant cell the PMT1b gene or the protein it expresses shows a lower level of expression or activity than that measured in a comparable plant cell of the same species or in its parent cell, or shows no expression or activity. That is, the expression or activity of the PMT1b gene, or of the protein expressed by the PMT1b gene, in the plant cell may be reduced by about 20% or more, about 30% or more, about 40% or more, about 50% or more, about 55% or more, about 60% or more, about 70% or more, about 75% or more, about 80% or more, about 85% or more, about 90% or more, about 95% or more, or about 100% relative to that of the original unmanipulated plant cell. Genetically engineered plant cells in which the expression or activity of the PMT1b gene, or of the protein it expresses, is reduced can be identified using any method known in the art. The term "inactivation" may mean that the gene is not expressed at all, or that a protein without activity is produced even if the gene is expressed. The term "depression" may mean that the PMT1b gene is expressed at a lower level than in unmanipulated plant cells, or that the protein expressed by the PMT1b gene has low activity even if it is expressed.
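The percentage thresholds above amount to a simple relative comparison against the parent cell. The sketch below shows one way that arithmetic could be written in Python. It assumes that expression or activity has already been measured as a non-negative number in the same units for both cells; the function names are hypothetical, and the word "about" in the text is not modeled.

```python
# Reduction levels (percent) listed in the paragraph above.
REDUCTION_THRESHOLDS = (20, 30, 40, 50, 55, 60, 70, 75, 80, 85, 90, 95, 100)


def percent_reduction(parent_level: float, engineered_level: float) -> float:
    """Percent decrease of the engineered cell relative to the parent cell."""
    if parent_level <= 0:
        raise ValueError("parent_level must be positive")
    return (parent_level - engineered_level) / parent_level * 100.0


def highest_threshold_met(reduction: float):
    """Largest listed threshold that the reduction reaches, or None."""
    met = [t for t in REDUCTION_THRESHOLDS if reduction >= t]
    return max(met) if met else None


if __name__ == "__main__":
    r = percent_reduction(parent_level=100.0, engineered_level=12.0)
    print(round(r, 1), highest_threshold_met(r))
```

A measured value that falls to zero corresponds to the "about 100%" case, which the text associates with inactivation.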

The reduction in expression or activity of the PMT1b gene or PMT1b protein may be due to mutation, substitution, or deletion of part or all of the gene encoding PMT1b, or to insertion of one or more bases into the gene, and may be achieved by a PMT1b gene editing means.

In one embodiment, the PMT1b gene may be the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1 together with at least one gene selected from the group consisting of the PMT1a gene consisting of the nucleotide sequence of SEQ ID NO: 2, the PMT2 gene consisting of the nucleotide sequence of SEQ ID NO: 3, the PMT3 gene consisting of the nucleotide sequence of SEQ ID NO: 4, and the PMT4 gene consisting of the nucleotide sequence of SEQ ID NO: 5. More specifically, it may be the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1, the PMT1a gene consisting of the nucleotide sequence of SEQ ID NO: 2, the PMT2 gene consisting of the nucleotide sequence of SEQ ID NO: 3, the PMT3 gene consisting of the nucleotide sequence of SEQ ID NO: 4, and the PMT4 gene consisting of the nucleotide sequence of SEQ ID NO: 5.

When the expression or activity of not only the PMT1b gene but also all of the PMT1a, PMT2, PMT3, and PMT4 genes is reduced, nicotine biosynthesis can be maximally suppressed, so that nicotine may be absent from the plant cell. Furthermore, the biosynthesis of nornicotine and anatabine can also be maximally suppressed.
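The requirement that all five PMT genes be suppressed can be expressed as a set comparison. The short Python sketch below is illustrative only; the gene names come from the text, while `PMT_FAMILY` and `untargeted_genes` are hypothetical names introduced here.

```python
# The five PMT genes named in the text.
PMT_FAMILY = frozenset({"PMT1a", "PMT1b", "PMT2", "PMT3", "PMT4"})


def untargeted_genes(suppressed):
    """Return the PMT genes that are NOT in the suppressed set, sorted."""
    return sorted(PMT_FAMILY - set(suppressed))


if __name__ == "__main__":
    # Suppressing only the four previously sequenced genes leaves PMT1b.
    print(untargeted_genes(["PMT1a", "PMT2", "PMT3", "PMT4"]))
    # Suppressing all five leaves nothing untargeted.
    print(untargeted_genes(PMT_FAMILY))
```

An empty result corresponds to the case the text describes as maximal suppression of nicotine biosynthesis.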

The CRISPR/Cas9 system may include a single guide RNA (sgRNA) comprising a crRNA (CRISPR RNA) and a tracrRNA (trans-activating crRNA).

In addition, the CRISPR/Cas9 system may include a Cas9 (CRISPR-associated protein 9) protein or a gene encoding the Cas9 protein, and an NLS (nuclear localization signal) protein or a gene encoding the NLS protein.

In general, the "CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) system", a widely known gene editing means, refers collectively to transcripts and other elements involved in the expression of, or directing the activity of, CRISPR-associated (hereinafter Cas) genes. These include sequences encoding a Cas gene, a tracr (trans-activating CRISPR) sequence (for example, tracrRNA or an active partial tracrRNA), a tracr-mate sequence (encompassing a "direct repeat" and a tracrRNA-processed partial direct repeat in the context of an endogenous CRISPR system), a guide sequence (also referred to as a "spacer" in the context of an endogenous CRISPR system), a guide RNA, and other sequences and transcripts from a CRISPR locus. In some embodiments, one or more elements of the CRISPR system are derived from a type I, type II, or type III CRISPR system. In some embodiments, one or more elements of the CRISPR system are derived from a particular organism comprising an endogenous CRISPR system, for example Streptococcus pyogenes. In general, a CRISPR system is characterized by elements that promote the formation of a CRISPR complex at the site of a target sequence (also referred to as a protospacer in the context of an endogenous CRISPR system). In the context of CRISPR complex formation, "target sequence" or "target gene" refers to a sequence to which a guide sequence is designed to be complementary, where hybridization between the target sequence and the guide sequence promotes the formation of the CRISPR complex. Complete complementarity is not necessarily required, provided there is sufficient complementarity to cause hybridization and promote the formation of the CRISPR complex. The target sequence may comprise any polynucleotide, for example a DNA or RNA polynucleotide. In some embodiments, the target sequence is located in the nucleus or cytoplasm of a cell. In some embodiments, the target sequence may be within an organelle of a eukaryotic cell, for example a mitochondrion or chloroplast.

The Cas protein forms an active endonuclease or nickase when it forms a complex with two RNAs called CRISPR RNA (crRNA) and trans-activating crRNA (tracrRNA). Non-limiting examples of the Cas protein include Cas1, Cas1B, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csn1 and Csx12), Cas10, Csy1, Csy2, Csy3, Cse1, Cse2, Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csx1, Csx15, Csf1, Csf2, Csf3, Csf4, homologs thereof, and modified versions thereof. These enzymes are known; for example, the amino acid sequence of the Streptococcus pyogenes Cas9 protein can be obtained from the SwissProt database under accession number Q99ZW2. In some embodiments, the unmodified CRISPR enzyme, for example Cas9, has DNA cleavage activity.

In some embodiments, the CRISPR enzyme is a Cas9 protein, and the Cas9 protein may be at least one Cas9 protein selected from the group consisting of a Cas9 protein derived from Streptococcus pyogenes, a Cas9 protein derived from Campylobacter jejuni, a Cas9 protein derived from Streptococcus thermophilus, a Cas9 protein derived from Staphylococcus aureus, and a Cas9 protein derived from Neisseria meningitidis, and specifically may be a Cas9 protein derived from Streptococcus pyogenes. In some embodiments, the Cas9 protein is codon-optimized for expression in eukaryotic cells. When the Cas9 protein derived from Streptococcus pyogenes is used, the expression or activity of the PMT1b gene, or of the protein expressed by the PMT1b gene, can be maximally reduced.

In some embodiments, the Cas9 protein may include an NLS (nuclear localization sequence or signal) at the 5' end, the 3' end, or both ends of the Cas9 protein so that it localizes to the nucleus of a eukaryotic cell, and one or more NLSs may be present.

As used herein, the term "nuclear localization sequence or signal (NLS)" means an amino acid sequence that serves to transport a specific substance (for example, a protein) into the cell nucleus, generally through the nuclear pore. The nuclear localization sequence is not required for CRISPR complex activity in eukaryotes, but including such a sequence is believed to enhance the activity of the system, particularly for targeting nucleic acid molecules in the nucleus.

In addition, the RNA-guided CRISPR (clustered regularly interspaced short palindromic repeats)-associated nuclease Cas9 provides a breakthrough technology for knockout, transcriptional activation, and repression of target genes using a single guide RNA (sgRNA) (that is, a crRNA-tracrRNA fusion transcript), and this technology is known to target numerous gene loci.

The Cas9 (or Cpf1) protein refers to an essential protein component of the CRISPR/Cas9 system. Information on the Cas9 (or Cpf1) gene and protein can be obtained from GenBank of the National Center for Biotechnology Information (NCBI), but is not limited thereto. About 40 or more different Cas (or Cpf1) protein families are known to exist among the CRISPR-associated genes encoding Cas (or Cpf1) proteins, and eight CRISPR subtypes (Ecoli, Ypest, Nmeni, Dvulg, Tneap, Hmari, Apern, and Mtube) can be defined according to the specific combination of cas genes and repeat structure. Accordingly, each of the CRISPR subtypes may form a repeating unit to form a polyribonucleotide-protein complex.

A plant cell according to one aspect is a plant cell in which a specific target site of the PMT1b gene has been cleaved so that the gene is knocked out. With the knockout, putrescine biosynthesized from ornithine and arginine in the nicotine biosynthesis pathway of the plant cell is not converted to N-methylputrescine, so nicotine biosynthesis can be inhibited. To produce the PMT1b knockout plant cell according to one aspect, a genome editing technique using a rare-cutting endonuclease, which cleaves rare gene sequences that occur at very low frequency in the genome, may be employed.

The gene knockout may mean regulation of gene activity, for example inactivation, by deletion or substitution of all or part of the gene (for example, one or more nucleotides) and/or insertion of one or more nucleotides. Gene inactivation means suppression or reduction (downregulation) of gene expression, or modification so that the gene encodes a protein that has lost its original function. Gene regulation may also mean a change in gene function resulting from structural modification of the protein obtained by deleting an exon region through simultaneous targeting of both intron regions flanking one or more exons of the target gene, expression of a dominant-negative form of the protein, expression of a competitive inhibitor secreted in soluble form, and the like.
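One common reason a small deletion or insertion can yield a protein that has lost its function is a shift of the reading frame. The sketch below illustrates that general molecular-biology rule in Python. It is not taken from the patent, it assumes the edit lies inside a protein-coding region, and `classify_edit` is a hypothetical helper name.

```python
def classify_edit(deleted: int, inserted: int) -> str:
    """Classify an edit by its effect on the reading frame.

    deleted / inserted: number of nucleotides removed / added at the site.
    Assumption: the edit falls within a coding exon, so a net length change
    that is not a multiple of 3 shifts the downstream reading frame.
    """
    if deleted < 0 or inserted < 0:
        raise ValueError("nucleotide counts must be non-negative")
    if deleted == 0 and inserted == 0:
        return "no change"
    net = inserted - deleted
    return "frameshift" if net % 3 != 0 else "in-frame"


if __name__ == "__main__":
    print(classify_edit(deleted=1, inserted=0))  # 1-nt deletion
    print(classify_edit(deleted=3, inserted=0))  # 3-nt deletion
    print(classify_edit(deleted=0, inserted=4))  # 4-nt insertion
```

An in-frame edit can still inactivate a protein, for example by removing a critical residue, so this check is only a first-pass indicator.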

The genetic manipulation artificially performed to reduce the expression or activity of the PMT1b gene, or of the protein expressed by the PMT1b gene, may be induced by a Cas9 protein or a Cpf1 protein. The genetic manipulation may be induced using at least one Cas9 protein selected from the group consisting of a Cas9 protein derived from Streptococcus pyogenes, a Cas9 protein derived from Campylobacter jejuni, a Cas9 protein derived from Streptococcus thermophilus, a Cas9 protein derived from Staphylococcus aureus, and a Cas9 protein derived from Neisseria meningitidis, and more specifically using a Cas9 protein derived from Streptococcus pyogenes.

When the Cas9 is encoded as DNA and delivered to an individual or cell, the DNA may generally (but not necessarily) include a regulatory element (e.g., a promoter) operable in the target cell. The promoter for Cas9 expression may be, for example, a CMV, EF-1α, EFS, MSCV, PGK, or CAG promoter. The promoter for gRNA expression may be, for example, an H1, EF-1α, tRNA, or U6 promoter. The sequence of the gene encoding Cas9 may include a nuclear localization signal (NLS) (e.g., the SV40 NLS). In one example, the promoter may be tissue-specific or cell-specific.

The genetic manipulation artificially performed to reduce the expression or activity of the PMT1b gene, or of the protein expressed by the PMT1b gene, may take place in a contiguous nucleotide sequence region of 1 bp to 50 bp located within a PAM (protospacer-adjacent motif) sequence in the nucleic acid sequence constituting the PMT1b gene, or adjacent to the 5' end or 3' end of the PAM sequence. Within that region, the manipulation may be a deletion of all of the region or of one or more of its nucleotides, a substitution with a nucleotide different from that of the wild-type gene, an insertion of 1 to 23 nucleotides each independently selected from A, T, C, and G, or a combination of these modifications.

In one embodiment, the guide RNA may bind to the PMT1b gene of at least one allele in the plant cell, and specifically to the PMT1b gene of all alleles. When the guide RNA binds to the PMT1b gene of all alleles and knocks out the PMT1b gene, nicotine biosynthesis may be suppressed in plant cells not only of the same generation but also of subsequent generations.

In one embodiment, the target sequence used to knock out the PMT1b gene in the plant cell may be, for example, at least one site within the region consisting of exons 1 to 3 of the PMT1b gene.

The genetic manipulation artificially performed to reduce the expression or activity of the PMT1b gene, or of the protein expressed by the PMT1b gene, may prevent the protein encoded by the PMT1b gene from being expressed in a form that retains its original function. The manipulation of the gene may be induced by one or more of the following:

1) deletion of all or part of the PMT1b gene, for example deletion of 1 bp or more of the PMT1b gene, such as 1 to 30, 1 to 27, 1 to 25, 1 to 23, 1 to 20, 1 to 15, 1 to 10, 1 to 5, 1 to 3, or 1 nucleotide;

2) substitution of 1 bp or more of the PMT1b gene, such as 1 to 30, 1 to 27, 1 to 25, 1 to 23, 1 to 20, 1 to 15, 1 to 10, 1 to 5, 1 to 3, or 1 nucleotide, with a nucleotide different from the original (wild-type) nucleotide;

3) insertion of one or more nucleotides, such as 1 to 30, 1 to 27, 1 to 25, 1 to 23, 1 to 20, 1 to 15, 1 to 10, 1 to 5, 1 to 3, or 1 nucleotide (each independently selected from A, T, C, and G), at any position in the target gene; and

4) a combination of two or more selected from 1) to 3) above.

The modified part of the PMT1b gene (the "target site") may be a contiguous nucleotide sequence region of the gene of 1 bp or more, 3 bp or more, 5 bp or more, 7 bp or more, 10 bp or more, 12 bp or more, 15 bp or more, 17 bp or more, or 20 bp or more, for example, 1 bp to 30 bp, 3 bp to 30 bp, 5 bp to 30 bp, 7 bp to 30 bp, 10 bp to 30 bp, 12 bp to 30 bp, 15 bp to 30 bp, 17 bp to 30 bp, 20 bp to 30 bp, 1 bp to 27 bp, 3 bp to 27 bp, 5 bp to 27 bp, 7 bp to 27 bp, 10 bp to 27 bp, 12 bp to 27 bp, 15 bp to 27 bp, 17 bp to 27 bp, 20 bp to 27 bp, 1 bp to 25 bp, 3 bp to 25 bp, 5 bp to 25 bp, 7 bp to 25 bp, 10 bp to 25 bp, 12 bp to 25 bp, 15 bp to 25 bp, 17 bp to 25 bp, 20 bp to 25 bp, 1 bp to 23 bp, 3 bp to 23 bp, 5 bp to 23 bp, 7 bp to 23 bp, 10 bp to 23 bp, 12 bp to 23 bp, 15 bp to 23 bp, 17 bp to 23 bp, 20 bp to 23 bp, 1 bp to 20 bp, 3 bp to 20 bp, 5 bp to 20 bp, 7 bp to 20 bp, 10 bp to 20 bp, 12 bp to 20 bp, 15 bp to 20 bp, 17 bp to 20 bp, 21 bp to 25 bp, 18 bp to 22 bp, or 21 bp to 23 bp.

In one example, the gene knockout may reduce expression of the targeted PMT1b gene by catalyzing single-strand or double-strand cleavage at a specific site within the targeted gene using a genome editing system that includes a rare-cutting endonuclease. Nucleic acid strand breaks catalyzed by the rare-cutting endonuclease can be repaired through mechanisms such as homologous recombination or non-homologous end joining (NHEJ). When repair proceeds by NHEJ, the DNA sequence at the cleavage site is changed, and the gene may thereby be inactivated. Repair through NHEJ causes substitutions, insertions, or deletions of short gene fragments and can be used to induce knockout of the corresponding gene. The modification may be a substitution, deletion, and/or insertion of one or more nucleotides, for example, 1 to 30 bp, 1 to 27 bp, 1 to 25 bp, 1 to 23 bp, 1 to 20 bp, 1 to 15 bp, 1 to 10 bp, 1 to 5 bp, 1 to 3 bp, or 1 bp.
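The effect of a short NHEJ-induced insertion or deletion on a coding sequence can be illustrated with a small sketch. This is only an illustration under stated assumptions: the function names, the toy coding sequence, and the cut position are hypothetical and are not taken from the specification. It relies on the general fact that a net length change that is not a multiple of three shifts the reading frame downstream of the edit.

```python
def classify_indel(inserted: int = 0, deleted: int = 0) -> str:
    """Classify the net effect of an indel on the reading frame.

    `inserted` and `deleted` are numbers of nucleotides (hypothetical inputs).
    """
    if inserted < 0 or deleted < 0:
        raise ValueError("lengths must be non-negative")
    net = inserted - deleted
    if net == 0:
        return "no length change"
    # A net change divisible by 3 keeps the downstream codons in frame.
    return "in-frame" if net % 3 == 0 else "frameshift"


def apply_indel(seq: str, cut: int, deleted: int = 0, inserted: str = "") -> str:
    """Remove `deleted` bases starting at index `cut`, then insert `inserted` there."""
    if not 0 <= cut <= len(seq):
        raise ValueError("cut position outside sequence")
    return seq[:cut] + inserted + seq[cut + deleted:]


if __name__ == "__main__":
    toy_cds = "ATGGCTGGTAAGTAA"  # hypothetical toy coding sequence, not a PMT sequence
    print(apply_indel(toy_cds, cut=6, inserted="A"), classify_indel(inserted=1))
    print(apply_indel(toy_cds, cut=3, deleted=3), classify_indel(deleted=3))
```

In this sketch a single inserted base is reported as a frameshift, while a three-base deletion is reported as in-frame; whether a given edit actually abolishes protein function still has to be confirmed experimentally.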

The rare-cutting endonuclease may be one or more selected from the group consisting of a meganuclease, a zinc finger nuclease, CRISPR/Cas9 (Cas9 protein), CRISPR-Cpf1 (Cpf1 protein), and a TALE nuclease. In one embodiment, the rare-cutting endonuclease may be a Cas9 protein or a Cpf1 protein.

The terms "chimeric RNA", "chimeric guide RNA", "guide RNA", "single guide RNA (sgRNA)", and "synthetic guide RNA" are used interchangeably and refer to a polynucleotide sequence comprising a guide sequence, a tracr sequence, and/or a tracr mate sequence. The term "guide sequence" refers to the sequence of about 20 bp within the guide RNA that specifies the target site, and may be used interchangeably with the terms "guide" or "spacer". The term "tracr mate sequence" may be used interchangeably with the term "direct repeat(s)". The guide RNA may consist of two RNAs, namely a CRISPR RNA (crRNA) and a transactivating crRNA (tracrRNA), or may be a single-chain RNA (sgRNA) that includes portions of the crRNA and the tracrRNA and hybridizes with the target DNA.

In general, a guide sequence is any polynucleotide sequence having sufficient complementarity with a target polynucleotide sequence to hybridize with the target sequence and direct sequence-specific binding of a CRISPR complex to the target sequence. Any nucleotide sequence that can be used for genetic manipulation to reduce the expression or activity of the PMT1b gene, or of the protein expressed by the PMT1b gene, may be used as the guide RNA without limitation; for example, the nucleotide sequence may be one capable of hybridizing with the PMT1b gene. A portion of the guide RNA nucleotide sequence may also be modified in order to modify or enhance the function of the guide RNA. In some embodiments, the degree of complementarity between the guide sequence and its corresponding target sequence, when optimally aligned using a suitable alignment algorithm, is about 50%, 60%, 75%, 80%, 85%, 90%, 95%, 97.5%, 99%, or more. Optimal alignment may be determined using any algorithm suitable for aligning sequences, non-limiting examples of which include the Smith-Waterman algorithm, the Needleman-Wunsch algorithm, algorithms based on the Burrows-Wheeler Transform (e.g., the Burrows-Wheeler Aligner), ClustalW, Clustal X, BLAT, Novoalign (Novocraft Technologies), ELAND (Illumina, San Diego, California, USA), SOAP (available at soap.genomics.org.cn), and Maq (available at maq.sourceforge.net). In some embodiments, the guide sequence is, for example, about 5, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 75, or more nucleotides in length. In some embodiments, the guide sequence is about 75, 50, 45, 40, 35, 30, 25, 20, 15, 12, or fewer nucleotides in length. The ability of a guide sequence to direct sequence-specific binding of a CRISPR complex to a target sequence may be assessed by any suitable assay. For example, the components of a CRISPR system sufficient to form a CRISPR complex, including the guide sequence to be tested, may be provided to a host cell having the corresponding target sequence, such as by transfection with vectors encoding the components of the CRISPR sequence, followed by an assessment of preferential cleavage within the target sequence, such as by the Surveyor assay as described herein. Similarly, cleavage of a target polynucleotide sequence may be evaluated in a test tube by providing the target sequence and the components of a CRISPR complex, including the guide sequence to be tested and a control guide sequence different from the test guide sequence, and comparing binding or the rate of cleavage at the target sequence between the test and control guide sequence reactions. Other assays are possible and will be readily apparent to those skilled in the art.
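As an illustration of how a degree of complementarity could be computed after optimal alignment, the following is a minimal sketch of the Needleman-Wunsch global alignment named above. The scoring values (match +1, mismatch -1, gap -1), the function names, and the example sequences are assumptions chosen for the illustration, not parameters from the specification. The guide is written as DNA and compared with the protospacer strand, so identity on that strand corresponds to complementarity with the opposite strand.

```python
def needleman_wunsch(a: str, b: str, match: int = 1, mismatch: int = -1, gap: int = -1):
    """Return one optimal global alignment of a and b as two gapped strings."""
    n, m = len(a), len(b)
    # score[i][j] = best score for aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a: str, b: str) -> float:
    """Percentage of aligned columns in which both sequences carry the same base."""
    al_a, al_b = needleman_wunsch(a.upper(), b.upper())
    if not al_a:
        return 0.0
    matches = sum(x == y and x != "-" for x, y in zip(al_a, al_b))
    return 100.0 * matches / len(al_a)


if __name__ == "__main__":
    # Hypothetical 20-nt guide and a protospacer differing at one position.
    print(percent_identity("GATTACAGATTACAGATTAC", "GATTACAGATTACAGATTAG"))
```

Dedicated aligners such as those listed above handle scoring schemes and large genomes far better; the sketch only makes the percentage in the text concrete.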

A guide sequence may be selected to target any target sequence. In some embodiments, the target sequence is a sequence within the genome of a cell. Exemplary target sequences include those that are unique in the target genome. For example, for Cas9 derived from Streptococcus pyogenes, a unique target sequence in the genome may include a Cas9 target site of the form MMMMMMMMNNNNNNNNNNNNXGG, where NNNNNNNNNNNNXGG (N is A, G, T, or C; X may be any nucleotide) occurs only once in the genome. A unique target sequence in the genome may also include a Streptococcus pyogenes Cas9 target site of the form MMMMMMMMMNNNNNNNNNNNXGG, where NNNNNNNNNNNXGG (N is A, G, T, or C; X may be any nucleotide) occurs only once in the genome. For Streptococcus thermophilus CRISPR1 Cas9, a unique target sequence in the genome may include a Cas9 target site of the form MMMMMMMMNNNNNNNNNNNNXXAGAAW, where NNNNNNNNNNNNXXAGAAW (N is A, G, T, or C; X may be any nucleotide; W is A or T) occurs only once in the genome. A unique target sequence in the genome may also include a Streptococcus thermophilus CRISPR1 Cas9 target site of the form MMMMMMMMMNNNNNNNNNNNXXAGAAW, where NNNNNNNNNNNXXAGAAW (N is A, G, T, or C; X may be any nucleotide; W is A or T) occurs only once in the genome. For Streptococcus pyogenes Cas9, a unique target sequence in the genome may include a Cas9 target site of the form MMMMMMMMNNNNNNNNNNNNXGGXG, where NNNNNNNNNNNNXGGXG (N is A, G, T, or C; X may be any nucleotide) occurs only once in the genome. A unique target sequence in the genome may also include a Streptococcus pyogenes Cas9 target site of the form MMMMMMMMMNNNNNNNNNNNXGGXG, where NNNNNNNNNNNXGGXG (N is A, G, T, or C; X may be any nucleotide) occurs only once in the genome. In each of these sequences, "M" may be A, G, T, or C.
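The first Streptococcus pyogenes form above (20 nucleotides followed by XGG, with the 12 PAM-proximal nucleotides plus XGG required to be unique) can be searched for with a regular expression. The following is a minimal sketch under stated assumptions: the function names and the toy sequence are hypothetical, only the A/C/G/T alphabet is handled, and a real uniqueness check would scan the whole target genome rather than one short string.

```python
import re

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an A/C/G/T string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_spcas9_sites(seq: str, protospacer_len: int = 20):
    """List (strand, start, protospacer, PAM) for every N20 + XGG site.

    `start` is the 0-based index on the strand that was scanned.
    A lookahead is used so that overlapping sites are all reported.
    """
    seq = seq.upper()
    pattern = re.compile(r"(?=([ACGT]{%d})([ACGT]GG))" % protospacer_len)
    sites = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for m in pattern.finditer(s):
            sites.append((strand, m.start(), m.group(1), m.group(2)))
    return sites


def seed_pam_count(seq: str, protospacer: str, seed_len: int = 12) -> int:
    """Count occurrences of the PAM-proximal seed followed by XGG on both strands."""
    seed = protospacer.upper()[-seed_len:]
    pattern = re.compile(r"(?=%s[ACGT]GG)" % re.escape(seed))
    seq = seq.upper()
    return sum(len(pattern.findall(s)) for s in (seq, reverse_complement(seq)))


if __name__ == "__main__":
    toy = "TT" + "ACGT" * 5 + "AGG" + "TT"  # hypothetical toy sequence
    for site in find_spcas9_sites(toy):
        print(site, "seed+PAM occurrences:", seed_pam_count(toy, site[2]))
```

A site whose seed-plus-PAM count is 1 in the scanned sequence corresponds to the "occurs only once" condition described above, restricted here to the sequence supplied.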

In one embodiment, the gene encoding the guide RNA that binds to PMT1b may have a polynucleotide consisting of the nucleotide sequence of SEQ ID NO: 19 linked to its 5'-end and five consecutive thymines linked to its 3'-end, and the guide RNA that binds to PMT1b may have, linked to its 5'-end, the polynucleotide transcribed from the polynucleotide consisting of the nucleotide sequence of SEQ ID NO: 19 and, linked to its 3'-end, five consecutive uracils.

When the gene encoding the guide RNA that binds to PMT1b has the polynucleotide consisting of the nucleotide sequence of SEQ ID NO: 19 (P_U6) linked to its 5'-end and five consecutive thymines linked to its 3'-end, or when the guide RNA that binds to PMT1b has, linked to its 5'-end, the polynucleotide transcribed from the polynucleotide consisting of the nucleotide sequence of SEQ ID NO: 19 and, linked to its 3'-end, five consecutive uracils, an otherwise unstable guide RNA can be stabilized. The polynucleotide transcribed from the polynucleotide consisting of the nucleotide sequence of SEQ ID NO: 19 corresponds to the U6 promoter (P_U6).
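The arrangement described above (SEQ ID NO: 19 at the 5'-end of the guide RNA gene and five consecutive thymines at its 3'-end) can be pictured as simple string assembly. This is only a sketch: SEQ ID NO: 19 is not reproduced in this section, so `P_U6_PLACEHOLDER` and `GUIDE_CODING_PLACEHOLDER` are hypothetical stand-ins, and the helper names are invented for the illustration. The sketch models the 3'-end only as the T-to-U correspondence between the DNA tail and the RNA tail.

```python
P_U6_PLACEHOLDER = "GGGAAACCC"          # hypothetical stand-in for SEQ ID NO: 19
GUIDE_CODING_PLACEHOLDER = "GATTACAGATTACAGATTAC"  # hypothetical guide-coding DNA


def build_grna_gene(promoter: str, guide_coding: str, tail_len: int = 5) -> str:
    """Assemble 5'-promoter + guide-coding sequence + run of thymines-3' (DNA)."""
    if tail_len < 0:
        raise ValueError("tail_len must be non-negative")
    return promoter.upper() + guide_coding.upper() + "T" * tail_len


def to_rna(dna: str) -> str:
    """Write a DNA coding-strand string as RNA by replacing T with U."""
    return dna.upper().replace("T", "U")


if __name__ == "__main__":
    gene = build_grna_gene(P_U6_PLACEHOLDER, GUIDE_CODING_PLACEHOLDER)
    print(gene)
    # The five thymines at the 3'-end of the gene correspond to five uracils in RNA.
    print(to_rna(GUIDE_CODING_PLACEHOLDER + "T" * 5))
```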

In one embodiment, the manipulation may be carried out by transformation with a vector that includes a gene encoding the guide RNA that binds to PMT1b, a gene encoding the Cas9 protein, and a gene encoding the NLS.

The term "vector" refers to a means for expressing a gene of interest in a host cell. Examples include plasmid vectors, cosmid vectors, and viral vectors such as bacteriophage vectors, adenovirus vectors, retroviral vectors, and adeno-associated viral vectors. A vector usable as the recombinant vector may be constructed by engineering a plasmid commonly used in the art (e.g., V1k_GE, pSC101, pGV1106, pACYC177, ColE1, pKT230, pME290, pBR322, pUC8/9, pUC6, pBD9, pHC79, pIJ61, pLAFR1, pHV14, the pGEX series, the pET series, and pUC19), a phage, or a virus (e.g., SV40).

In the vector, the gene encoding the guide RNA that binds to PMT1b, the gene encoding the Cas9 protein, and the gene encoding the NLS may be operably linked to a promoter. The term "operatively linked" refers to a functional linkage between a nucleotide expression control sequence (e.g., a promoter sequence) and another nucleotide sequence. By being operatively linked, the control sequence can regulate transcription and/or translation of the other nucleotide sequence.

The vector may typically be constructed as a vector for cloning or as a vector for expression. The expression vector may be a conventional one used in the art to express foreign proteins in plants, animals, or microorganisms. The vector may be constructed by various methods known in the art.

The vector may be constructed with a prokaryotic cell or a eukaryotic cell as the host. For example, when the vector is an expression vector and a prokaryotic cell is the host, the vector generally includes a strong promoter capable of driving transcription (e.g., a CMV promoter, trp promoter, lac promoter, tac promoter, or T7 promoter), a ribosome binding site for initiation of translation, and a transcription/translation termination sequence. When a eukaryotic cell is the host, origins of replication operable in eukaryotic cells that may be included in the vector include, but are not limited to, the f1 origin of replication, SV40 origin of replication, pMB1 origin of replication, adeno origin of replication, AAV origin of replication, and BBV origin of replication. In addition, a promoter derived from the genome of a mammalian cell (e.g., the metallothionein promoter) or a promoter derived from a mammalian virus (e.g., the adenovirus late promoter, vaccinia virus 7.5K promoter, SV40 promoter, cytomegalovirus promoter, or tk promoter of HSV) may be used, and the vector generally has a polyadenylation sequence as the transcription termination sequence.

The transformation is not particularly limited as long as it is a transformation method well known in the art that is capable of transforming a rice gene. Specific examples include one selected from the group consisting of Agrobacterium-mediated transformation, polyethylene glycol (PEG)-mediated protoplast transformation, the gene gun method, electrode transformation, vacuum infiltration transformation, and silicon carbide fiber-mediated transformation; considering the characteristics of rice and the transformation rate, Agrobacterium-mediated transformation may be used.

In one embodiment, the plant may be cultivated tobacco (Nicotiana tabacum), specifically a flue-cured (yellow), Burley, native, dark, or Oriental variety, and more specifically a Burley or flue-cured (yellow) variety.

Another aspect relates to a plant comprising the plant cell.

The plant may be one in which nicotine biosynthesis is suppressed not only in the same generation but also in subsequent generations, so that plants with reduced nicotine production can be obtained continuously.

The plant cell may be a plant cell within the scope described above.

Yet another aspect relates to a method for suppressing nicotine biosynthesis in a plant, comprising the step of genetically engineering a cell so that the expression or activity of the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1, or of the protein expressed by the PMT1b gene, is reduced compared with the parent cell.

In one embodiment, the genes manipulated may be the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1 together with at least one gene selected from the group consisting of the PMT1a gene consisting of the nucleotide sequence of SEQ ID NO: 2, the PMT2 gene consisting of the nucleotide sequence of SEQ ID NO: 3, the PMT3 gene consisting of the nucleotide sequence of SEQ ID NO: 4, and the PMT4 gene consisting of the nucleotide sequence of SEQ ID NO: 5. More specifically, they may be the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1, the PMT1a gene consisting of the nucleotide sequence of SEQ ID NO: 2, the PMT2 gene consisting of the nucleotide sequence of SEQ ID NO: 3, the PMT3 gene consisting of the nucleotide sequence of SEQ ID NO: 4, and the PMT4 gene consisting of the nucleotide sequence of SEQ ID NO: 5.

When the expression or activity of not only the PMT1b gene but also all of the PMT1a, PMT2, PMT3, and PMT4 genes is reduced, nicotine biosynthesis can be suppressed to the greatest extent, so that nicotine may be absent from the plant cell. The biosynthesis of nornicotine and anatabine can likewise be suppressed to the greatest extent.
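When several PMT genes are to be reduced together, it can be useful to check which of them carry a given guide target followed by an XGG PAM. The following is a minimal sketch and not the procedure of the examples: the gene names, the toy sequences, and the helper names are hypothetical, and only exact matches to a Streptococcus pyogenes style target (guide followed by XGG) are considered.

```python
import re

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Reverse complement of an A/C/G/T string."""
    return seq.translate(_COMPLEMENT)[::-1]


def genes_hit(guide: str, genes: dict) -> list:
    """Return the names of genes containing guide + XGG on either strand."""
    pattern = re.compile(re.escape(guide.upper()) + "[ACGT]GG")
    hits = []
    for name, seq in genes.items():
        seq = seq.upper()
        if pattern.search(seq) or pattern.search(revcomp(seq)):
            hits.append(name)
    return hits


if __name__ == "__main__":
    guide = "GATTACAGATTACAGATTAC"  # hypothetical 20-nt guide
    toy_genes = {                    # hypothetical toy sequences, not SEQ ID NOs
        "gene_A": "CC" + guide + "TGG" + "AA",           # target with PAM, plus strand
        "gene_B": "CC" + guide + "TGA" + "AA",           # target without a PAM
        "gene_C": revcomp("CC" + guide + "TGG" + "AA"),  # target with PAM, minus strand
    }
    print(genes_hit(guide, toy_genes))
```

A gene missing from the returned list would need its own guide, or a guide placed in a region conserved across all of the genes of interest.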

The terms "parent cell", "nicotine", "PMT1b", "genetically engineered", and the like are as described above.

Yet another aspect relates to a method for producing a plant cell in which nicotine biosynthesis is suppressed, comprising the step of genetically engineering a cell so that the expression or activity of the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1, or of the protein expressed by the PMT1b gene, is reduced compared with the parent cell.

In one embodiment, the genes manipulated may be the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1 together with at least one gene selected from the group consisting of the PMT1a gene consisting of the nucleotide sequence of SEQ ID NO: 2, the PMT2 gene consisting of the nucleotide sequence of SEQ ID NO: 3, the PMT3 gene consisting of the nucleotide sequence of SEQ ID NO: 4, and the PMT4 gene consisting of the nucleotide sequence of SEQ ID NO: 5. More specifically, they may be the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1, the PMT1a gene consisting of the nucleotide sequence of SEQ ID NO: 2, the PMT2 gene consisting of the nucleotide sequence of SEQ ID NO: 3, the PMT3 gene consisting of the nucleotide sequence of SEQ ID NO: 4, and the PMT4 gene consisting of the nucleotide sequence of SEQ ID NO: 5.

When the expression or activity of not only the PMT1b gene but also all of the PMT1a, PMT2, PMT3, and PMT4 genes is reduced, nicotine biosynthesis can be suppressed to the greatest extent, so that a plant cell free of nicotine can be produced. A plant cell in which the biosynthesis of nornicotine and anatabine is also suppressed to the greatest extent can likewise be produced.

The terms "parent cell", "nicotine", "PMT1b", "genetically engineered", and the like are as described above.

Overlapping content is omitted in view of the complexity of this specification, and terms not otherwise defined herein have the meanings commonly used in the technical field to which the present invention belongs.

By identifying the specific nucleotide sequence of the PMT1b gene, a previously unknown member of the PMT family of nicotine biosynthesis genes, the present invention makes it possible to fully control the nicotine that previously remained in plants in trace amounts even after gene editing.

FIG. 1 shows the nicotine biosynthesis pathway in tobacco plants.
FIG. 2 compares the nucleotide sequences of the (A) sg1, (B) sg2, and (C) sg3 sites selected as cleavage positions within the CDS regions of the NtPMT1a, NtPMT2, NtPMT3, and NtPMT4 genes.
FIGS. 3 to 5 show the carrier and expression blocks used to construct the gene carrier: FIG. 3(A) shows GE_block, FIG. 3(B) shows the vector V1k_GE, FIG. 4(A) shows Cas9_block, FIG. 4(B) shows the sgRNA_PMT block, and FIG. 5 shows the final gene-scissors expression block.
FIG. 6 shows photographs of plant tissue culture at each stage: (A) cutting leaf tissue and co-culturing it with Agrobacterium for transformation, (B) inducing callus and shoot differentiation, (C) inducing root differentiation, and (D) a plantlet after differentiation is complete.
FIG. 7 is an electrophoresis photograph of the PMT genes amplified from the gDNA of four tissue-cultured individuals in order to check for the presence and pattern of mutations. Lane M1: 100 bp DNA marker; Lanes 1 to 4: NtPMT1a amplification results; Lanes 5 to 8: NtPMT2 amplification results; Lanes 9 to 12: NtPMT3 amplification results; Lanes 13 to 16: NtPMT4 amplification results; Lane M2: 1 kb DNA marker.
FIG. 8 shows the nucleotide sequence analysis profiles of the PMT1a gene of the wild type (A) and mutant No. 21 (B); the position where a single base is inserted is marked with a red rectangle.
FIGS. 9 to 12 compare the PMT amino acid sequences of wild-type tobacco and the finally selected F1 plant (108_PMTm_F1), which was obtained from a tissue culture with mutations confirmed through the CRISPR/Cas9 system and from which the gene block containing the CRISPR/Cas9 system had been removed. FIG. 9 compares the amino acid sequences of PMT1a, FIG. 10 those of PMT2, FIG. 11 those of PMT3, and FIG. 12 those of PMT4.

Hereinafter, the present invention is described in more detail through examples. These examples are given for illustration only, and the scope of the present invention is not limited to them.

Examples

1. Production of low-nicotine tobacco using a CRISPR/Cas9 system targeting the PMT1a, PMT2, PMT3 and PMT4 genes (excluding the PMT1b gene)

(1) Construction of the gene carrier

1) PMT (putrescine N-methyltransferase) gene

The nicotine biosynthesis pathway of cultivated tobacco (Nicotiana tabacum) is known to involve PMT among its biosynthetic genes. Knocking out the PMT genes was therefore expected to yield cultivated tobacco with reduced nicotine content (FIG. 1).

2) Confirmation of the nucleotide sequences of the nicotine biosynthesis genes (NtPMT)

Genomic DNA from the Burley cultivar KB108 was amplified by polymerase chain reaction (PCR) using the primers listed in Table 1 below, and the products were submitted to a sequencing service.

Table 1

| Primer | Primer sequence | Amplicon size (bp) | Annealing temp. (°C) | Elongation time (sec) | Cycles | SEQ ID NO |
|---|---|---|---|---|---|---|
| F_PMT1a | GCATAGATGTTTATTGGGAGTGTACAGCAA | 724 | 74~68 (step down) | 30 | 40 | 6 |
| R_PMT1a | AGTTGCTGACTGTACGTGCACA | | | | | 7 |
| F_PMT2 | CATATGTGTCTATTGGGAGTGTACATCAA | 685 | 68 (2 step) | 30 | 40 | 8 |
| R_PMT2 | AAGTTGCTGACTGTACGTACACAT | | | | | 9 |
| F_PMT3 | CATAGATATGTTTATTGGGAGTGTACATCAA | 783 | 62 (3 step) | 30 | 40 | 10 |
| R_PMT3 | CCACTTTAAATATAATCATACATGAGCTTGAC | | | | | 11 |
| F_PMT4 | GCGAGCTTTCATAAAGTACAAATCA | 812 | 62 (3 step) | 30 | 40 | 12 |
| R_PMT4 | AGTATTTATTCATATTAAAGCATGTGTATTAGA | | | | | 13 |

The gDNA region spanning exon 1 to exon 3 of the KB108 cultivar was amplified and sequenced. The result was 100% identical to the PMT gene sequences of cultivated tobacco (Nicotiana tabacum) published in the NCBI (National Center for Biotechnology Information) database (Table 2).

Table 2

| Gene | GenBank Accession No. | SEQ ID NO |
|---|---|---|
| NtPMT1a | AF126810 (gDNA, CDS) | 2 |
| NtPMT2 | AF126809 (gDNA, CDS) | 3 |
| NtPMT3 | AF126811 (gDNA, CDS) | 4 |
| NtPMT4 | AF126812 (gDNA, CDS) | 5 |
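As a rough illustration of how a primer pair such as those in Table 1 delimits an amplicon, the following Python sketch locates a forward primer on a template strand, then the reverse complement of the reverse primer downstream of it, and reports the product length. The template and primers below are short made-up strings, not sequences from this document, and the function names are hypothetical. Exact matching is assumed; real primer evaluation would also consider mismatches and melting temperature.

```python
# Complement table for upper-case DNA; inputs are upper-cased before use.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def amplicon_length(template: str, forward: str, reverse: str):
    """Length of the product bounded by the two primers, or None if either is absent.

    The forward primer is searched on the given strand; the reverse primer is
    searched as its reverse complement, downstream of the forward primer.
    """
    top = template.upper()
    start = top.find(forward.upper())
    if start == -1:
        return None
    site = reverse_complement(reverse)
    end = top.find(site, start + len(forward))
    if end == -1:
        return None
    return end + len(site) - start


# Toy template (hypothetical): AA + forward site + 8 nt insert + reverse site + TT
toy_template = "AAACGTTGCCCCCCCCGATTCATT"
print(amplicon_length(toy_template, "ACGTTG", "TGAATC"))
```

Applied to a full genomic sequence, the same search gives the expected product size for a primer pair, which can then be compared with the band observed on a gel.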

3) Design of the gene scissors block and vector recombination

Cleavage sites were selected within exon 1 to exon 3 so that translation of the polypeptide from the PMT gene would be terminated at an early stage, and sites common to PMT1a, PMT2, PMT3 and PMT4 were chosen so that all four genes could be cut (Table 3 and FIG. 2).

Table 3

| sgRNA | Nucleotide sequence | Binding site in PMT1a | Binding site in PMT2 | Binding site in PMT3 | Binding site in PMT4 | SEQ ID NO |
|---|---|---|---|---|---|---|
| sg1 | AAGTCATATCTACCAACACAAA TGG | Exon 1 (all four genes) | | | | 14 |
| sg2 | TTTTCAGAGTTTAGCGCATTA TGG | Exon 2 | Exon 1 | Exon 1 | Exon 1 | 15 |
| sg3 | CTCAAAGAGCATGACATCT TGG | Exon 3 | Exon 2 | Exon 2 | Exon 2 | 16 |
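The target-site selection above can be sketched in Python as a search for a protospacer that is immediately followed by a PAM. This is a minimal sketch under stated assumptions: the trailing TGG of each Table 3 sequence is treated as an NGG PAM (the source does not name the Cas9 variant), only the given strand is searched, matching is exact, and the function name `find_cas9_sites` is hypothetical. The test sequence is the first 60 nt of SEQ ID NO: 1 (PMT1b) from the sequence listing.

```python
import re


def find_cas9_sites(sequence: str, protospacer: str):
    """Return (start, PAM) for each exact protospacer match followed by NGG.

    A lookahead is used so that overlapping matches are also reported.
    Only the strand as given is searched.
    """
    seq = sequence.upper()
    pattern = re.compile("(?=" + re.escape(protospacer.upper()) + "([ACGT]GG))")
    return [(m.start(), m.group(1)) for m in pattern.finditer(seq)]


# First 60 nt of SEQ ID NO: 1 (PMT1b), copied from the sequence listing.
pmt1b_start = "atggaagtcatatctaccaacacaaatggctctaccatcttcaagaatggtaccattccc"

# sg1 protospacer from Table 3, without its trailing TGG.
print(find_cas9_sites(pmt1b_start, "AAGTCATATCTACCAACACAAA"))
```

Running the same search against each PMT gene sequence is one way to check that a candidate site is shared by all intended targets.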

In addition, the gene scissors block was cloned into the plant expression vector (V1k_GE, SEQ ID NO: 17) to complete the final gene carrier (V1k_GE_PMT, SEQ ID NO: 18) (FIGS. 3 to 5).

pBI121 is a binary vector that replicates in both E. coli and Agrobacterium and is widely used for plant transformation. pBI121 was digested with HindIII and EcoRI so that the GE_block required for the CRISPR/Cas9 system could be cloned into it (FIG. 3A). GE_block consists, in order, of the CaMV 35S promoter with dual enhancer, the TEV leader sequence, a multiple cloning site (MCS) for cloning the Cas9 block, the CaMV 35S terminator, a linker sequence, and an MCS for cloning the sgRNA block, with HindIII and EcoRI recognition sequences added at the two ends. The individual blocks of GE_block were prepared by DNA synthesis and assembled by sequential cloning. V1k_GE was produced by ligating GE_block to pBI121 digested with HindIII and EcoRI (FIG. 3B).

Cas9_block consists of an N-terminal nuclear localization sequence (NLS), the Cas9 coding sequence (CDS), and a C-terminal NLS. BamHI and SacI recognition sequences are added at the two ends (FIG. 4A).

The block from which an sgRNA is expressed consists of the U6 promoter (P_U6, SEQ ID NO: 19), the sgRNA, and a poly T tract. Three sgRNA blocks capable of binding specifically to the PMT genes were joined into one continuous DNA by overlap extension PCR to complete sgRNA_PMT. SalI and SpeI recognition sequences are present at the two ends of sgRNA_PMT (FIG. 4B).

V1k_GE and Cas9_block, each digested with BamHI and SacI, were joined by ligation. The product was then digested with SalI and SpeI, and sgRNA_PMT was inserted to produce V1k_GE_PMT (FIG. 5).

4) Introduction of the recombinant vector into the gene-transfer microorganism (Agrobacterium)

The plant vector was transformed into Agrobacterium strain LBA4404 by the freeze-thaw method.

Specifically, Agrobacterium was inoculated into YEP liquid medium (yeast extract 10 g, bacto peptone 10 g, NaCl 5 g) and cultured with shaking at 28°C and 250 rpm for 16 hours. The culture was centrifuged at 3,000 × g and 4°C for 20 minutes to collect the cells, which were suspended in 20 mM CaCl2 to make competent cells. To 100 μL of competent cells, 5 μL of plasmid DNA (the plant vector) was added, and the mixture was held in liquid nitrogen for 5 minutes and then at 37°C for 5 minutes. After addition of 1 mL of YEP liquid medium, the cells were cultured with shaking at 28°C and 250 rpm for 2 hours. A 100 μL aliquot of the culture was spread on YEP solid medium containing 100 mg/L kanamycin and incubated at 28°C for 3 days. Single colonies were each subcultured, and transformation with the plasmid DNA was confirmed by PCR.

(2) Plant tissue culture

1) Plant transformation

Agrobacterium was cultured in YEP liquid medium (containing 70 mg/L kanamycin and 70 mg/L streptomycin) at 28°C for 24 hours.

Leaves of plants one month after germination were sterilized with 70% ethanol and bleach and cut into 3 mm × 3 mm explants. The explants were placed in a petri dish containing 5 mL of MS liquid medium, and 1 mL of the Agrobacterium culture was sprinkled evenly over them to prepare the tobacco leaf explants. The explants were then incubated at 25°C in the dark for 48 hours.

2) Plant tissue culture

For washing and placement on selection medium, the leaf explants were washed four times in sterile distilled water (containing 200 μg/mL cefotaxime), placed on shooting medium (MS medium containing 2 mg/L BA, 0.1 mg/L NAA, 200 mg/L cefotaxime and 100 mg/L kanamycin), and cultured at 25°C under a 16 h/8 h photoperiod, with subculture onto fresh medium every 2 weeks.

For placement on rooting medium, shoots that had differentiated from the leaf explants were excised, placed on rooting medium (MS medium containing 200 mg/L cefotaxime), and cultured at 25°C under a 16 h/8 h photoperiod.

After tobacco leaf tissue was transformed by Agrobacterium-mediated transformation, callus, leaf and root differentiation were confirmed to proceed in turn, and 50 tissue-culture plantlets with leaves, stems and roots were obtained (FIG. 6).

(3) Selection of mutants

1) Confirmation of the occurrence and pattern of mutations in the target genes

A 100 mg sample of healthy leaf tissue was ground uniformly, and genomic DNA was extracted and purified with a commercial silica-column kit (e.g., Nucleospin 96 plant II, Macherey Nagel, Germany).

After gDNA was extracted and purified from the leaf tissue, the target gene regions were amplified by PCR and sequenced.

The PMT1a, PMT2, PMT3 and PMT4 genes of the tissue-culture plants were each amplified and sequenced, which showed that mutations were frequently induced in the sequence at the sg3 sgRNA binding site (FIG. 7).

Of the three sgRNAs (sg1, sg2, sg3), the mutation rate at the sg3 binding site reached 50.25%, whereas the rate at the sg1 binding site was 2.75% and no mutations were found at the sg2 binding site (Table 4).

Table 4

| sgRNA | Number of mutations | Number of target genes | Mutation rate (%) |
|---|---|---|---|
| sg1 | 11 | 400 | 2.75 |
| sg2 | 0 | 400 | 0 |
| sg3 | 201 | 400 | 50.25 |

In addition, dual mutations were observed only at the sg3 sgRNA binding site, where they occurred at a rate of 31% (Table 5).

Table 5

| sgRNA | Dual mutation (M + M), count | Dual mutation, ratio (%) | Half mutation (M + W), count | Half mutation, ratio (%) | Wild type (W), count | Wild type, ratio (%) | Sum, count | Sum, ratio (%) |
|---|---|---|---|---|---|---|---|---|
| sg1 | 0 | 0 | 11 | 5.5 | 189 | 94.5 | 200 | 100 |
| sg2 | 0 | 0 | 0 | 0 | 200 | 100 | 200 | 100 |
| sg3 | 62 | 31 | 77 | 38.5 | 61 | 30.5 | 200 | 100 |
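The per-locus genotype counts in Table 5 and the per-allele mutation counts in Table 4 are linked by simple arithmetic, which the following Python sketch reproduces. It assumes that each locus carries two alleles, so that a dual (M + M) locus contributes two mutated alleles and a half (M + W) locus one; the reading that 200 loci correspond to 50 plantlets × 4 genes is an interpretation, not stated in the source. The function name is hypothetical.

```python
# Genotype counts per sgRNA binding site, taken from Table 5.
genotype_counts = {
    "sg1": {"dual": 0, "half": 11, "wild": 189},
    "sg2": {"dual": 0, "half": 0, "wild": 200},
    "sg3": {"dual": 62, "half": 77, "wild": 61},
}


def allele_mutation_rate(counts):
    """Return (mutated alleles, total alleles, rate in %) for one sgRNA site.

    Assumes two alleles per locus: dual loci count twice, half loci once.
    """
    loci = counts["dual"] + counts["half"] + counts["wild"]
    mutated = 2 * counts["dual"] + counts["half"]
    total = 2 * loci
    return mutated, total, 100 * mutated / total


for name, counts in genotype_counts.items():
    print(name, allele_mutation_rate(counts))
```

Under this assumption the computed values agree with the 11, 0 and 201 mutations out of 400 target genes reported in Table 4.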

Subsequently, 12 tissue-culture plants in which mutations had been induced in all alleles of the four genes PMT1a, PMT2, PMT3 and PMT4 were selected (Table 6 and FIG. 8).

Table 6

| No. | Individual No. | PMT1a | PMT2 | PMT3 | PMT4 |
|---|---|---|---|---|---|
| 1 | 1 | Dual | Dual | Dual | Dual |
| 2 | 2 | Dual | Dual (T ins, G ins) | Dual | Dual |
| 3 | 3 | Dual (G ins, A ins) | Dual | Homo (T ins) | Dual |
| 4 | 4 | Dual (G ins, A ins) | Homo (T del) | Homo (A ins) | Homo (T del) |
| 5 | 5 | Dual | Homo (T ins) | Homo (T ins) | Homo (35 bp del) |
| 6 | 8 | Dual | Homo (23 bp del) | Dual | Homo (TGACTACCAAGATGTC del) |
| 7 | 10 | Dual | Homo (TG del) | Homo (A ins) | Homo (TG del) |
| 8 | 11 | Dual | Dual | Homo (A ins) | Homo (T del) |
| 9 | 13 | Homo (GA del) | Dual | Dual | Homo (T del) |
| 10 | 18 | Dual | Homo (T del) | Dual | Homo (T ins) |
| 11 | 21 | Dual (G ins, A ins) | Homo (T del) | Homo (T del) | Homo (T del) |
| 12 | 24 | Dual (G ins, A ins) | Homo (T del) | Homo (T del) | Homo (T del) |

※ Dual indicates that a different type of mutation was generated on each of the two alleles (for example, in the PMT1a gene of individual No. 24, a G insertion occurred on allele A and an A insertion on allele B).

※※ Homo indicates that the same type of mutation was generated on both alleles (for example, in the PMT2 gene of individual No. 24, the same A deletion occurred on allele A and allele B).
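The Dual and Homo labels defined above, together with the Half and Wild type categories of Table 5, can be expressed as a small classification rule. The Python sketch below is illustrative only: the function name is hypothetical, each allele is represented by a free-text mutation label, and `None` is used here to stand for an unmutated allele.

```python
def classify_genotype(allele_a, allele_b):
    """Classify a locus from its two allele calls.

    Each argument is a mutation label such as "T del", or None for a
    wild-type allele (a representation chosen for this sketch).
    """
    mutated = [a for a in (allele_a, allele_b) if a is not None]
    if not mutated:
        return "Wild type"   # W: neither allele mutated
    if len(mutated) == 1:
        return "Half"        # M + W: one allele mutated
    if allele_a == allele_b:
        return "Homo"        # same mutation on both alleles
    return "Dual"            # different mutations on the two alleles


# PMT1a of individual No. 24 as described in the note above.
print(classify_genotype("G ins", "A ins"))
# PMT3 of individual No. 24 as listed in Table 6.
print(classify_genotype("T del", "T del"))
```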

2) Plant acclimatization

The 12 tissue-culture plants with confirmed mutations were transplanted into pots containing potting soil and grown in a greenhouse.

3) Securing F1-generation seeds and confirming removal of the introduced gene

To remove the gene block introduced for CRISPR/Cas9 expression, F1-generation seeds were obtained by self-fertilization. The F1 seeds were sown in 128-cell trays and grown for 30 days. Leaves were then collected and ground uniformly, and genomic DNA was extracted and purified with a commercial silica-column kit (e.g., Nucleospin 96 plant II, Macherey Nagel, Germany). PCR was performed with primers that specifically amplify the 35S promoter and the Cas9 DNA block (Table 7 below), and plants in which neither the 35S promoter nor the Cas9 DNA block was detected were selected.

Table 7

| Name | Sequence | PCR length | Note | SEQ ID NO |
|---|---|---|---|---|
| F_C9 | GACCATCCTGGACTTCCTGAAGAGC | 420 bp | Cas9 detection | 20 |
| R_C9 | TGCAGGTAGTACAGGTACAGCTTCTCG | | | 21 |
| F_35S | GCTCCTACAAATGCCATCA | 195 bp | 35S promoter detection | 22 |
| R_35S | GATAGTGGGATTGTGCGTCA | | | 23 |

4) PMT genotype of the finally selected plant

The finally selected F1 plant (108_PMTm_F1) contained no introduced gene and carried a homozygous mutation in each of the PMT genes. When the DNA sequences of the four PMT genes of 108_PMTm_F1 were translated into amino acid sequences and compared with the PMT amino acid sequences of wild-type tobacco, premature stop codons were found to have been generated (Table 8 and FIGS. 9 to 12).

Table 8

| Gene | PMT1 | PMT2 | PMT3 | PMT4 |
|---|---|---|---|---|
| Mutation pattern (108_PMTm_F1) | A ins | T del | T del | T del |
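Why a single-base insertion or deletion such as those in Table 8 leads to a premature stop codon can be illustrated with a short Python sketch. The coding sequence below is a made-up toy example, not a PMT sequence, and the function name is hypothetical. The standard genetic code is assumed, and translation simply stops at the first in-frame stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def translate(cds: str) -> str:
    """Translate from the first base until the first stop codon (not included)."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Toy wild-type CDS (hypothetical): ATG GCA TTG ACC GGT AAA GCT GCT GCT TAA
wild_type = "ATGGCATTGACCGGTAAAGCTGCTGCTTAA"
# Single-base insertion (T) after the fourth base shifts the reading frame.
mutant = wild_type[:4] + "T" + wild_type[4:]

print(translate(wild_type))
print(translate(mutant))
```

In the toy example the shifted frame runs into a TAA codon after five residues, so the mutant product is both altered in sequence and truncated relative to the wild type.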

(4) Analysis of nicotine content in the F1 generation

1) Preparation of samples for nicotine analysis

Plants were grown in a greenhouse for 60 days, the flower stalks were cut off, and all leaves were harvested 2 weeks later. The harvested leaves were dried in a dry oven at 65°C for 48 hours, placed in a container with glass beads, and ground using a gyro-shaker.

2) Nicotine content analysis

Nicotine content was analyzed by GC/MS.

Quantitative GC/MS analysis of KB108 (wild type) and the mutant showed that the nicotine content across all KB108 leaves averaged 7.1 mg/g on a dry leaf weight basis, whereas no nicotine was detected in the mutant samples.

A relative comparison of the nornicotine and anatabine contents of the mutant with those of the control was also attempted, but neither compound was detected in the mutant (Table 9).

Table 9

| Experimental group | Number of individuals | Nicotine (mg/g) 1) | Nornicotine (ratio) | Anatabine (ratio) |
|---|---|---|---|---|
| KB108 (control) | 10 | 7.1 | 0.024 | 0.022 |
| 108_PMTm | 12 | ND 2) | ND | ND |

1) Nicotine content (mg) per gram of leaf dry weight
2) ND (Not Detected): below the detection limit (0.0002 mg/g)

In summary, a CRISPR/Cas9 system was designed to induce mutations specifically in the PMT genes, which play an important role in nicotine biosynthesis.

Three sgRNA binding sites capable of binding specifically to the four kinds of PMT genes present in cultivated tobacco (Nicotiana tabacum) were selected. In the experiment, the cleavage efficiency of sgRNA3 (sg3) was markedly higher than that of the other two.

Plants were selected in which mutations had been induced in all of the PMT1a, PMT2, PMT3 and PMT4 genes and in which, as a result of these mutations, functional PMT enzyme was not expected to be produced.

When the (average) contents of nicotine, nornicotine and anatabine were measured in leaf samples of the mutant tobacco plants (108_PMTm) and the control (KB108), no nicotine was detected in the mutant, as shown in Table 9 above.

2. Production of low-nicotine tobacco using a CRISPR/Cas9 system targeting the PMT1a, PMT1b, PMT2, PMT3 and PMT4 genes, including the novel PMT1b gene

(1) Importance of the NtPMT1b gene

It was found that zero nicotine can be achieved only when PMT gene editing inactivates not only the NtPMT1a, NtPMT2, NtPMT3 and NtPMT4 genes but also the NtPMT1b gene (SEQ ID NO: 1), and the gDNA nucleotide sequence of NtPMT1b containing the complete CDS was determined for the first time.

To date, no nucleotide entry named NtPMT1b has existed in GenBank, and subsequent PMT-related studies have not addressed the PMT1b gene. The sequence determined in this experiment is a gDNA sequence containing the entire CDS.

(2) Comparison of nicotine content according to whether the NtPMT1b gene is inactivated

Mutations were induced in the exon 2 region of the NtPMT genes using CRISPR/Cas9, and two plants were selected: one (7_23_1S) in which NtPMT1a, NtPMT2, NtPMT3 and NtPMT4 were all mutated and NtPMT1b was mutated as well, and one (7_23_4S) in which the same four genes were mutated but NtPMT1b was not. No nicotine was detected in the leaves of 7_23_1S, whereas 0.5 mg/g of nicotine was detected in 7_23_4S (Table 10).

Table 10

| Plant | 7_23_1S | 7_23_4S |
|---|---|---|
| NtPMT1a | T del | AGAT del |
| NtPMT1b | T del | Wild type |
| NtPMT2 | T del | T del |
| NtPMT3 | T del | T del |
| NtPMT4 | T ins | T ins |
| Nicotine content in dried leaves | Not detected (detection limit: 0.0002 mg/g) | 0.5 mg/g |

These results indicate that, although the sgRNAs for 108_PMTm prepared in Example 1 were designed to target only PMT1a, PMT2, PMT3 and PMT4, the sg3 sgRNA as designed in fact also induced mutation of PMT1b. The present experiment further shows that nicotine can be completely controlled only when PMT1b is inactivated as well.
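To put the figures above in proportion, the following Python sketch expresses a measured nicotine content as a percentage of the KB108 wild-type mean from Table 9. It assumes that the plants of Table 10 and the control of Table 9 were grown and assayed under comparable conditions, which the source does not state; the function and constant names are hypothetical.

```python
WILD_TYPE_MG_PER_G = 7.1        # KB108 mean nicotine content (Table 9)
DETECTION_LIMIT_MG_PER_G = 0.0002  # GC/MS detection limit (Tables 9 and 10)


def percent_of_wild_type(value_mg_per_g, reference=WILD_TYPE_MG_PER_G):
    """Nicotine content as a percentage of the wild-type reference."""
    return 100 * value_mg_per_g / reference


# 7_23_4S (PMT1b intact): 0.5 mg/g nicotine.
print(round(percent_of_wild_type(0.5), 1))
# Upper bound for a "not detected" sample such as 7_23_1S.
print(percent_of_wild_type(DETECTION_LIMIT_MG_PER_G))
```

Under that assumption, the plant with an intact PMT1b retains roughly 7% of the wild-type nicotine level, while a sample below the detection limit is below about 0.003% of it.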

(3) PMT (putrescine N-methyltransferase) gene

The enzyme encoded by the PMT gene catalyzes the S-adenosylmethionine-dependent N-methylation of putrescine, which leads to the production of nicotine and tropane alkaloids.

Cultivated tobacco (Nicotiana tabacum) has five genes with PMT function (NtPMT1a, NtPMT1b, NtPMT2, NtPMT3, NtPMT4).

Of these, the NtPMT2, NtPMT3 and NtPMT4 genes are presumed to be derived from the NsPMT1, NsPMT2 and NsPMT3 genes of N. sylvestris, respectively (based on comparison of the number of repeat sequences within exon 1 and of the intron sequences).

NtPMT1 is divided into NtPMT1a and NtPMT1b; in general, NtPMT1 refers to NtPMT1a.

NtPMT1a is presumed to be derived from N. tomentosiformis, and NtPMT1b from N. tomentosiformis and N. otophora.

<110> KT&G CORPORATION <120> A NOVEL NICOTINE BIOSYNTHETIC GENE AND USE THEREOF <130> P20190961KR(PN130282KR) <150> KR 10-2019-0103310 <151> 2019-08-22 <160> 23 <170> KoPatentIn 3.0 <210> 1 <211> 2211 <212> DNA <213> Nicotiana tabacum <220> <221> gene <222> (1)..(2211) <223> PMT1b <400> 1 atggaagtca tatctaccaa cacaaatggc tctaccatct tcaagaatgg taccattccc 60 atgaacggcc accaaaatgg ctcttccgaa cacctcaacg gctaccagaa tggcatttcc 120 aaacaccaaa acgggcacca gaatggcact tccgaacatc ggaacggcca ccagaatggg 180 acatccgaac aacagaacgg gacaatcagc catgacaatg gcaacgagct actgggaagc 240 tccaactcta ttaagcctgg ttggttttca gagtttagcg cattatggcc aggttagtat 300 taagaaagaa actcaaatgc atcgtactct tgtattttgg tttgtgtata atttataata 360 tggataaatt atgacgaagt aaactcgcat acattaattt gaaataatct aattttgcag 420 gtgaagcatt ctcacttaag gtcgagaagt tactattcca ggggaaatct gattaccaag 480 atgtcatgct ctttgaggta attaatattc tgatacacat gctttaatct aaagtgatac 540 ttttaattta cttttagttt attgcatgtg cacgtacagt cagcaactta tgggaaggtt 600 ctgactttgg atggagcaat tcaacataca gagaatggtg gatttccata cactgaaatg 660 attgttcatc taccacttgg ttccatccca aacccaaaaa aggttttgat catcggcgga 720 ggaattggtt ttacattatt cgaaatgctt cgttatcctt caatcgaaaa aattgacatt 780 gttgagatcg atgacgtggt agttgatgta agtcaaactt cttttaccca cataaagaaa 840 atgatttaga ttgcaattct ttttattttt ctaaaagaat aaatatattc tctctttttt 900 ttttttaaaa caaaattctc tttcttacag gtatccagaa aatttttccc ttatctggca 960 gctaatttta acgatcctcg tgtaacccta gttctcggag atggtgcgta tatgatagtc 1020 tcgttttata ttttatttca cttgattttt accttttttt gtggttaatt aatcatctac 1080 cattggttct ctttaccttc aggagctgca tttgtaaagg ctgcacaagc gggatattat 1140 gatgctatta tagtggactc ttctgatccc attggtacgc tattactatt taataccaag 1200 actattctta ttatataagc tactaagaaa ctaattgaat aattaataaa cgtaactgta 1260 attgatttct aaaataatat atataatttc aggtccagca aaagatttgt ttgagaggcc 1320 attctttgag gcagtagcca aagcccttag gccaggagga gttgtatgca cacaggctga 1380 aagcatttgg cttcatatgc atattattaa gcaaatcatt gctaactgtc gtcaagtctt 1440 taagggttct gtcaactatg 
cttggacaac cgttccaaca tatcccacgt attctttttc 1500 tctctctctt cctgtctttt tcgatgcaat gtaaatttat aaaattggaa gtccgtttta 1560 cttttctata gacgtagatc ctaaaattgt caagaaatgg agaattgact tacaagaaaa 1620 atcaacttct tttcatttac tattcttttt ggtgacaaac tttacttatt atttcgttct 1680 aaaatgaaaa tttattttta tattttaaaa taatttagct ttaaactttt aattttactt 1740 gttatatttt taataaaaaa gatttattgt caaataaatg ttgtggccat acaataagtt 1800 tcaaattatg tcacaaaaat tgaaacagag tgagcaaatt tttatttcaa gtaaactttc 1860 aaggaattgt gtttaagttt tctcaactga tactgatcaa tgatgtctta aatattttga 1920 tttcagcggt gtgattggtt atatgctctg ctctactgaa gggccagaag ttaacttcaa 1980 gaatccagta aatccaattg acaaagagac aactcaagtc aagtccaaat taggacctct 2040 caagttctac aactctgatg taacttcata tctcacaatt tctttttccg ttttactgta 2100 tgttcttcat caaattttat aactaactct tttcatattg tctttttttt tcagattcac 2160 aaagcagcat tcattttgcc atctttcgcc cgaagtatga tcgagtctta a 2211 <210> 2 <211> 3129 <212> DNA <213> Nicotiana tabacum <220> <221> gene <222> (1)..(3129) <223> PMT1a <400> 2 gaattcaatg gagaaggaaa atatttccag tgtaaacaca agtgaatgaa gagaagccaa 60 aataatctct atcattcaag ccttaggtgg agattaaaaa aattatttac tttcttatca 120 aagtaatagg tgatcaacag ctttcgtaaa acgtcattag gagaatatta taatctcttt 180 tatgctgaag aacccacata aggaagatca taaaatacat gactttcaga tgacttcttg 240 gagctttatt tttaaagagt ggctagctgg tcagcaaaga ggtgctcgtc agatatcata 300 aaattttact attatttgtt ttaagaggga gatggggcac acatgcttgt gacaaaagta 360 agaggaagaa aggagacaga agaggaaata gatttggggg gggggggggg ggtttcacaa 420 tcaaagaaaa tttttaaaat ggagagagaa atgagcacac acatatacta acaaaatttt 480 actaataatt gcaccgagac aaacttatat tttagttcca aaatgtcagt ctaaccctgc 540 acgttgtaat gaatttttaa ctattatatt atatcgagtt gcgccctcca ctcctcggtg 600 tccaaattgt atttaaatgc atagatgttt attgggagtg tacagcaagc tttcggaaaa 660 tacaaaccat aatactttct cttcttcaat ttgtttagtt taattttgaa aatggaagtc 720 atatctacca acacaaatgg ctctaccatc ttcaagaatg gtgccattcc catgaacggc 780 caccaaaatg gcacttctga acacctcaac ggctaccaga atggcacttc caaacaccaa 840 aacgggcacc 
agaatggcac tttcgaacat cggaacggcc accagaatgg gacatccgaa 900 caacagaacg ggacaatcag ccatgacaat ggcaacgagc tactgggaag ctccgactct 960 attaagcctg gctggttttc agagtttagc gcattatggc caggttagta ctaagaaagc 1020 aactcaaatg catcggcctc ttgttgctac taaatataga gagctatcat acttttaggg 1080 actaactaaa aaggaaagat tatcacaggg acgaagtgag cagttaactt cgcatattat 1140 cagacgcatt aatttgaaat aatcgaattt tgcaggtgaa gcattctcac ttaaggttga 1200 gaagttacta ttccagggga agtctgatta ccaagatgtc atgctctttg aggtaattaa 1260 tattctaata cacatgcttt aatttaaagt gatactttta atttactttt agtttattgc 1320 atgtgcacgt acagtcagca acttatggga aggttctgac tttggatgga gcaattcaac 1380 atacagagaa tggtggattt ccatacactg aaatgattgt tcatctacca cttggttcca 1440 tcccaaaccc aaaaaaggtt ttgatcatcg gcggaggaat tggttttaca ttattcgaaa 1500 tgcttcgtta tccttcaatc gaaaaaattg acattgttga gatcgatgac gtggtagttg 1560 atgtaagtca aacttctttt acccacataa agaaaatgat ttagattgca attcttttta 1620 tttttctaaa agaataaata tattctcttt ttttttttta aaacaaaatt ctctttctta 1680 caggtatcca gaaaattttt cccttatctg gcagctaatt ttaacgatcc tcgtgtaacc 1740 ctagttctcg gagatggtgc gtatatgata gtctcgtttt atattttatt tcacttgatt 1800 tttacctttt tttgtggtta attaatcatc taccattggt tctctttacc ttcaggagct 1860 gcatttgtaa aggctgcaca agcgggatat tatgatgcta ttatagtgga ctcttctgat 1920 cccattggta cgctattact atttaatacc aagactattc ttattaaata agctactaag 1980 aaactaattg aataattaat aaacgtaact gtaattgatt tctaaaataa tatatataat 2040 ttcaggtcca gcaaaagatt tgtttgagag gccattcttt gaggcagtag ccaaagccct 2100 taggccagga ggagttgtat gcacacaggc tgaaagcatt tggcttcata tgcatattat 2160 taagcaaatc attgctaact gtcgtcaagt ctttaagggt tctgtcaact atgcttggac 2220 aaccgttcca acatatccca cgtattcttt ttctctctct ctcttcctgt ctttttcgat 2280 gcaatgtaaa tttataaaat tggaagtccg ttttactttt ctatagacgt agatcctaaa 2340 attgtcaaga aatggagaat tgacttacaa gaaaaatcaa cttcttttca tttactattc 2400 tttttggtga caaactttac ttattatttc gttctaaaat gaaaatttat ttttatattt 2460 taaaataatt tagctttaaa cttttaattt tacttgttat atttttaata aaaaagattt 2520 atagtcaaat aaatgttgtg 
accatataaa aacctccgca tttttaagat cataagtttc 2580 agagtcaaac gagttaattt atttttagta tgccggtgcg gagtcaaatt atgtcataaa 2640 aattgaaacg gagtgagaac atttttattt cgagtaaact ttcaaggtat tgtgtttaat 2700 ttcaagtgat actgatcaat gatgtcttaa atattttgat ttcagcggtg tgatcggtta 2760 tatgctctgc tctactgaag ggccagaagt tgacttcaag aatccagtaa atccaattga 2820 caaagagaca actcaagtca agtccaaatt aggacctctc aagttctaca actctgatgt 2880 aacttcatat ctcacaattt ctttttccgt tttactgtat gttcttcgtc aaattttata 2940 actaactctt ttcatattgt cttttttttc agattcacaa agcagcattc attttaccat 3000 ctttcgccag aagtatgatc gagtcttaat caagtgaata atgaacactg gtagtacaat 3060 cattggacca agatcgagtc ttaatcaagt gaataaataa gtgaaatgcg acgtattgta 3120 ggagaattc 3129 <210> 3 <211> 3091 <212> DNA <213> Nicotiana tabacum <220> <221> gene <222> (1)..(3091) <223> PMT2 <400> 3 ctgagttgac aagaacaatt cctggtgaat cagatggatg aagataatag aggtgggtgg 60 aatctataac caaagcagct ggttgagtga ctgtgcgagt tgcagaaaca attgaagggt 120 catttgtgga atttggggcc atttcaaagg aaaaagaaaa gatgacttag cattaataaa 180 tcaaattaaa ataaggctta gcgttaaaat caaaggaaat ggcaagcctg gctcctggag 240 caatgcttct gaggacagta gtaaaaacaa tatcagacaa aaagtaaagt tgtattattt 300 agcttgagga taaagtatgt cattagtttt gtgagagatt tggtgtcctc tacaatgatt 360 gttgaagtcc ctatttatag ctatacacag gaaacaaaat cctaggatca agcccctctt 420 aaatgacaat aatggggtta atgatgaata tgtagcggca tgacatgaat gccaaaattc 480 tccgcaacga ctatttattt aatattgagg aatatttttt attaaatact atctggtgac 540 aagcattcgt ttgcttccgt tgattacgtt gattttggga tctactctat accaaccgaa 600 gccgttgtcc ttgatcttcg ctttcattta attcatcttc cgtctgcctc cgatttcaca 660 agtcatgcac ccattcaatt atttaatgga aaccaatttt accctataca aatggtacat 720 cattcgtcaa atactttact tggatataaa caattttgcc cgaggagtaa acagatgcga 780 agaaagaaag cagacgatta aagaaatttt taaaaaagga gagagaaatg aacacacaca 840 tgtactaata aaattagggt actactttac taataattgg acagagacta aattcatatt 900 ttagttccaa aatgtctcgg gcagtccaac catgcacgtt gtaatgattt tttaactcta 960 ttatatcgag ttgcgccctc cactcctcgg tgtccaaatt gtatataaat gcatatgtgt 1020 
ctattgggag tgtacatcaa gctttcataa agtacaaatc gtaatacttg ttgaaacata 1080 atactttctc ttctccaatt tgtttagttt aattttgaaa atggaagtca tatctaccaa 1140 cacaaatggc tctaccatct tcaagagtgg tgccattccc atgaatggcc accataatgg 1200 cacttccaaa caccaaaacg gccacaagaa tgggacttcc gaacaacaga acgggacaat 1260 cagccttgat aatggcaacg agctactggg aaactccaat tgtattaagc ctggttggtt 1320 ttcagagttt agcgcattat ggccaggtta gtactgagaa agaaactcaa atgcatattt 1380 aaagttaaaa ttgttaggct aatataagga gttgatattc ttttagtgat taattaaaaa 1440 ggaaaaagta tcaaataaat tcaaaaaatg gatagtaact tcgcatatta ctctacacat 1500 taatttgaaa taaatcgaat tttgcaggtg aagcattctc acttaaggtt gagaagttac 1560 tgttccaggg gaagtctgac taccaagatg tcatgctctt tgaggtaaat aatattttaa 1620 tacacatgct tccatttaaa ttgatacttt taatttactt ttactttatt gcatgtgtac 1680 gtacagtcag caacttatgg gaaggttctg actttggatg gagcaattca acacacagag 1740 aatggtggat ttccatacac tgaaatgatt gttcatcttc cacttggttc catcccaaac 1800 ccaaaaaagg ttttgatcat cggcggagga attggtttta cattattcga aatgcttcgt 1860 tatcctacaa tcgaaaaaat tgacattgtt gagatcgatg acgtggtagt tgatgtaagt 1920 caaacttctt ttactcacat aaaaaaatgg tttagattgc ttcttgttat ttttctaaaa 1980 gaatactatt tttttaaaac aaaattttct tttttacagg tatctagaaa atttttccct 2040 tatctcgctg ctaattttaa cgatcctcgt gtaaccctag tccttggaga tggtgcgtat 2100 ttgataatct cgcttttgtt ttatctttta tttttattgc atttaatttt taccttttgg 2160 tgtgtggtta attcacctgc cattggttct ctttcatttc aggggctgca tttgtaaagg 2220 ctgcacaagc agaatattat gatgctatta tagtggactc ttctgatccc attggtactc 2280 tattacttct taataccaag actaatctta ttgaataagc tactaataaa cggtaattga 2340 tttctaaaac aatataattt caggtccagc aaaagatttg tttgagaggc cattctttga 2400 ggcagtagct aaagccctaa ggccaggagg agttgtatgc acacaggctg aaagcatttg 2460 gcttcatatg catattatta agcaaatcat tgctaactgt cgtcaagtct ttaagggctc 2520 tgtcaactat gcttggacta ctgttccaac atatccaacg tatttttctc tctctctctc 2580 ttcctataaa attggaagtt ttgattctat aattgtcaag aaatggagaa tcagttccaa 2640 gaaaaaccaa cttcttttct tttactcttc aaggtattgt gtttaatttt ttttcaactg 2700 atatgatcaa 
ttattttgat ttcagcggtg tgattggtta tatgctctgc tctactgaag 2760 gaccagaaat tgacttcaag aatccagtaa atccaattga caaagagaca gctcaagtca 2820 agtccaaatt agcacctctc aagttctaca actctgatgt aacttcatat ctcacaattt 2880 cttttttcct attgtacttt atgttcttcg tcaaatttta taattaactc ttttcaaatt 2940 gtcttttttt ttttcagatt cacaaagcag cattcatttt gccatctttc gccagaagta 3000 tgatcgagtc ttaatcaact gattaatgaa tactggtggt acaatcattg gaccaagatc 3060 aataagtgaa agacgtattg tatgagaatt c 3091 <210> 4 <211> 3269 <212> DNA <213> Nicotiana tabacum <220> <221> gene <222> (1)..(3269) <223> PMT3 <400> 4 gctgtacaaa aggatgtctc aaatcatttg gaatattaat tctgcaatca acaagaaata 60 ccccactatt aagacccatt atcactggca caaaaattat gagatcatta aacatcttaa 120 acctgtccct atttggaaga gtgtggtatg ggagatgcct cccagggagt acctaaagct 180 gaatactgat ggaagtttta acaaacaaat tgggaaagca gggattggag ggattctcag 240 agatgaagag ggaggctttg tcatggcttt ttcgatgcct ataatctata ataacatcag 300 tgaagcagaa ttgaaagcca tcaagtatgg gtgtgaatgg tgcaaataca aaggaatatc 360 aaacttcatt gtggaaactg actcgaggat gatctatgac atactacaga ccaaaaatct 420 aagcaacaac aagttgaaac aagagaccga gaaattaatg gagattctgg acacctgcag 480 gacacctgtt acccattgcc ttcgcgaagc aaatcaagtg gcagactggt ttgctaaaga 540 ggccaccaga gctaacgaag gtatcactca tacagatttt agacaggtat caaaagcggc 600 caagggccct ttcttcatgg atatgtggca ggtcccttat tttagaatta gatatgaaaa 660 atctaatttt tttttgtaag ttaattctgt gtatagtgag aggaaatcgt ctaatatgta 720 tttttgccca tagactcttc ctctccttag gtaaaaaggt agctccgagg taaggtttat 780 gttcccctca gtgtaacctt tttttgttta tataatagac atggtatggg tccagctaaa 840 cccccaacac cacaggggat agatacctgg gtgattggtt tattttttaa aaaaaaaaac 900 tttactaata attgcacgga gacaaaactt atattttagt tccaaaatga cagtccaacc 960 atgcacgttg taatgatttt ttaactctat tatatcgagt tccgccctcc actcctcggt 1020 gtccaaattg tatttaaatg catagatatg tttattggga gtgtacatca agctttcaga 1080 aaatacaaac cataatactt tctcttctcc aatttgctta gtttaatttg gaaaatggaa 1140 gtcatatcta ccaacacaaa tggctctact atcttcaaga atggtgccat tcccatgaac 1200 ggttaccaga atggcacttc caaacaccaa 
aacggccacc agaatggcac ttccgaacat 1260 cggaacggcc accagaatgg gatttccgaa caccaaaacg gccaccagaa tggcacttcc 1320 gagcatcaga acggccatca gaatgggaca atcagccatg acaacggcaa cgagctacag 1380 ctactgggaa gctccaactc tattaagcct ggttggtttt cagagtttag cgcattatgg 1440 ccaggttagt actaagaaag aaactcaaat gcatcgtact cttgtattct gctttgcgta 1500 taatttagat gatggtgttt gactaagcac tgagtttaaa aataaaaagt ttaaagttaa 1560 attgttacta tagagagcta tatctttagg aactaactaa aaaggaaaaa ttatcacata 1620 aaattgggat gaagtaagca gttaacttcg catattattc gacacattaa tttgaaataa 1680 atcgaatttt gcaggtgaag cattctcact taaggttgag aagttactat tccaggggaa 1740 gtctgattac caagatgtca tgctctttga ggtaattaat taatactaat agtcaagctc 1800 atgtatgatt atatttaaag tggtattttt cgtttatttt taatttattg cacgtgtacg 1860 tacagtcagc aacatatggg aaggttctga ctttggatgg agcaattcaa cacacagaga 1920 atggtggatt tccatacact gaaatgattg ttcatcttcc acttggttcc atcccaaacc 1980 ctaaaaaggt tttgatcatc ggcggaggaa ttggttttac attattcgaa atgcttcgtt 2040 atcctacaat cgaaaaaatt gacattgttg agatcgatga cgtggtagtt gatgtaagtc 2100 aaacttcttt tactcacata aaaaaatgat ttagattctt atttttctaa aagaattaaa 2160 acaaaatttt ccgttttaca ggtatctaga aaatttttcc cttatcttgc tgctaatttt 2220 agcgatcctc gtgtaaccct agtccttgga gatggtgcgt atttgataat ctcgttttta 2280 ttttatcttt tacttttatt ttatttaatt tttacctttt tgtgtgtggt taattcacct 2340 gccattggtt ctttttattt caggggctgc atttgtaaag gccgcacaag caggatatta 2400 tgatgctatt atagtggact cttctgatcc cattggtact ctattactac ttaataccaa 2460 gactattctt attaaataag ctactaataa acgtaactct gatagttttc taaaataata 2520 taatttcagg tccagcaaaa gacttgtttg agaggccatt ctttgaggca gtagccaaag 2580 ccctaaggcc aggaggagtt gtatgcacac aggctgaaag catttggctt catatgcata 2640 ttattaagca aatcattgct aactgtcgtc aagtctttaa gggctctgtc aactatgctt 2700 ggactactgt tccaacatat ccaacgtatt tttctctctc tcttcctata aaattggaag 2760 ttttgattct ataattgtca agaaatggag aatcagttcc aagaaaaacc aaattctttt 2820 cttttactct tcaaggtgtg tttaagtttt ttaaactgat actgatcaat tattttgatt 2880 tcagcggtgt gattggttat atgctctgtt ctactgaagg 
accagaagtt gacttcaaga 2940 atccagtaaa tccaattgac aaagagacaa ctcaagtcaa gtccaaatta gcacctctca 3000 agttctacaa ctctgatgta acttcatatc tcaatttctt ttttcttatt gtactttatg 3060 ttcttagtca aattttataa ttaactcttt tcaaattgtc tttttttttc agattcacaa 3120 agcagcattc attttgccat ctttcgccag aagtatgatc gagtcttaat caagtgacta 3180 atgaatactg gcggtacaat cattggacca agatcgagtc ttaatcaagt gaataaataa 3240 gtgaaatgcg acgtattgta taagaattc 3269 <210> 5 <211> 3001 <212> DNA <213> Nicotiana tabacum <220> <221> gene <222> (1)..(3001) <223> PMT4 <400> 5 gtcgacctct gattccacaa gtcatgcacc cattcaatta tttaatggaa accaatttta 60 ccctgtacaa atggtacaaa tactttcctt ggataaaaac aattttgcct aaggagtaaa 120 cagatgcgaa gtaagaaagc agacgactaa agaaaatttt aaaaaaggag agagaaatga 180 gcacacacac gtactaataa aattagggta ctactttact aataattgga cagagactaa 240 attcatattt tagttccaaa atgtctcggg cagtccaacc atgcacgttg taatgagttt 300 ttaactctat tatctcgagt tgcgccctcc actcctctgt gtccaagttg tatataaatg 360 catatatgtc tattgggagt gtacagcgag ctttcataaa gtacaaatca taatacttgt 420 tgaaacataa tactttctct tctccaattt gtttagttta attttgaaaa tggaagtcat 480 atctaccaac acaaatggct cgaccatctt caagaatggt gccattccca tgaatggcca 540 ccagagtggc acttccaaac acctcaacgg ctaccagaac ggcacttcca aacaccaaaa 600 cggccaccat aatggcactt ccgaacatcg gaacggccac cagaatggga tttccgaaca 660 ccaaaacggc caccagaatg ggacttccga acatcggaac ggccaccaga atgggatttc 720 cgaacaccaa aacggccacc agaatgggac ttccgaacac caaaacggcc accagaatgg 780 gacttccgaa caacagaacg ggacaatcag ccatgacaat ggcaacgagc tactgggaaa 840 ctccaactct attaagcttg gttggttttc agagtttagc gcattatggc caggttagta 900 ctgagaaaga aactcaaatt catatttaaa gttaaaattg ttaggctaat ataagaagtt 960 gattttcttt tagtgattaa ttaaaaaagg aaagagtatc aaataaattc caaaaaatga 1020 ccagtaactt cgcatattat tctacacatt aatttgaaat aaatcgaatt ttgcaggtga 1080 agcattctcc cttaaggttg agaagttact atttcagggg aagtctgact accaagatgt 1140 catgctcttt gaggtaaata atattctaat acacatgctt taatatgaat aaatactttt 1200 aatttacttt tagtttattg cacgtgtacg tacagtcagc aacatatggg aaggttttga 1260 
ctttggatgg agcaattcaa cacacagaga atggtggatt tccatacact gaaatgattg 1320 ttcatcttcc acttggttcc atcccaaacc caaaaaaggt tttgatcatc ggcggaggaa 1380 ttggttttac attattcgaa atgcttcgtt atcctacaat cgaaaaaatt gacattgttg 1440 aaatcgatga cgtggtagtt gatgtaagtc aaatttcttt tactcacata aaaaaatgat 1500 ttagattgct tctttttatt tttctaaaag aataaatata ttctctctta gttttaaaca 1560 aaattctctt tcttacaggt atctagaaaa tctttccctt atctcgcagc taattttaat 1620 gatcctcgtg taaccctcgt tctcggagat ggtgcgtatt tataatctcg tttttgtttt 1680 atcttttatt tttatttcat ttaatttacc tttttgtgtg tggttaattt acccgtcatt 1740 ggttctcttt catttcaggg gctgcatttg taaaggctgc acaagcagga tattatgatg 1800 ctattatagt ggactcttct gatcccattg gtactctatt actacttaat accaagacta 1860 atcttattga ataagctact aataaactgt aattgatttc taaaataata taatttcagg 1920 tccagcaaaa gatttgtttg agaggccatt ctttgaggca gtagccaaag ccctaaggcc 1980 aggaggagtt gtatgcacac aggccgaaag catttggctt catatgcata ttattaagca 2040 aatcattgct aactgtcgtc aagtctttaa gggctctgtc aactacgctt ggactactgt 2100 tccaacatat cccacgtatt ttctctctct ctctcttcat ctttgaaaat tgaaaatcct 2160 gactactttc cttcctttga ttcctcggtt aaaggggcgt agatcataag attttcaaga 2220 aatagataat gacgtccaag aaaaactaac ttcttttcat ttactattct ttttggtgac 2280 aaactttatt tattatttcg ttctaaagag aaaatttatt tttatatttt aaaataattt 2340 tgttttaaac ttttattttt acttattata tctttaataa aaaaattata gtcaaataaa 2400 tattatggcc acactaaaca tccaagtttt tgaaaccata agttttagag ccaaatgagt 2460 taatttgttt ttggtatgcg ggtgcggagt caaattatgt cacaaaaatt gtaatggagt 2520 gagcaaattt ttatttcgag taaactttca aggtattgtg ttaaagtttt ttcaactgat 2580 actaatcaat tatgtctcaa ccattttgat ttcagtggtg taattgggta tatgctctgc 2640 tctactgaag ggccagaagt tgacttcaag aatccaataa atccaattga caaagagaca 2700 actcaagtca agtccaaatt agcacctctc aagttttaca attctgatgt aacttcatat 2760 ctaacaattt ctttttctgt tttactgtat cttcattgtc aaaattttat aattaactct 2820 tctcaaatta tctttttttt tagattcaca aagcagcatt cattttgcca tctttcgcca 2880 gaagtatgat cgagtcttaa tcaagtgaat aatgaacact ggtggtgcaa tcattggacc 2940 aagatcgagt 
cttaatcaag tgaataaata agtgaaatgc cgacgtattg tatgagaatt 3000 c 3001
<210> 6 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> F_PMT1a <400> 6 gcatagatgt ttattgggag tgtacagcaa 30
<210> 7 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> R_PMT1a <400> 7 agttgctgac tgtacgtgca ca 22
<210> 8 <211> 29 <212> DNA <213> Artificial Sequence <220> <223> F_PMT2 <400> 8 catatgtgtc tattgggagt gtacatcaa 29
<210> 9 <211> 24 <212> DNA <213> Artificial Sequence <220> <223> R_PMT2 <400> 9 aagttgctga ctgtacgtac acat 24
<210> 10 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> F_PMT3 <400> 10 catagatatg tttattggga gtgtacatca a 31
<210> 11 <211> 32 <212> DNA <213> Artificial Sequence <220> <223> R_PMT3 <400> 11 ccactttaaa tataatcata catgagcttg ac 32
<210> 12 <211> 25 <212> DNA <213> Artificial Sequence <220> <223> F_PMT4 <400> 12 gcgagctttc ataaagtaca aatca 25
<210> 13 <211> 33 <212> DNA <213> Artificial Sequence <220> <223> R_PMT4 <400> 13 agtatttatt catattaaag catgtgtatt aga 33
<210> 14 <211> 25 <212> DNA <213> Artificial Sequence <220> <223> sgRNA_sg1 <400> 14 aagtcatatc taccaacaca aatgg 25
<210> 15 <211> 24 <212> DNA <213> Artificial Sequence <220> <223> sgRNA_sg2 <400> 15 ttttcagagt ttagcgcatt atgg 24
<210> 16 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> sgRNA_sg3 <400> 16 ctcaaagagc atgacatctt gg 22
<210> 17 <211> 17283 <212> DNA <213> Artificial Sequence <220> <223> V1k_GE <400> 17 tgagcgtcgc aaaggcgctc ggtcttgcct tgctcgtcgg tgatgtactt caccagctcc 60 gcgaagtcgc tcttcttgat ggagcgcatg gggacgtgct tggcaatcac gcgcaccccc 120 cggccgtttt agcggctaaa aaagtcatgg ctctgccctc gggcggacca cgcccatcat 180 gaccttgcca agctcgtcct gcttctcttc gatcttcgcc agcagggcga ggatcgtggc 240 atcaccgaac cgcgccgtgc gcgggtcgtc ggtgagccag agtttcagca ggccgcccag 300 gcggcccagg tcgccattga tgcgggccag ctcgcggacg tgctcatagt ccacgacgcc 360 cgtgattttg tagccctggc cgacggccag caggtaggcc gacaggctca tgccggccgc 420 cgccgccttt tcctcaatcg ctcttcgttc gtctggaagg cagtacacct tgataggtgg 480 gctgcccttc 
ctggttggct tggtttcatc agccatccgc ttgccctcat ctgttacgcc 540 ggcggtagcc ggccagcctc gcagagcagg attcccgttg agcaccgcca ggtgcgaata 600 agggacagtg aagaaggaac acccgctcgc gggtgggcct acttcaccta tcctgcccgg 660 ctgacgccgt tggatacacc aaggaaagtc tacacgaacc ctttggcaaa atcctgtata 720 tcgtgcgaaa aaggatggat ataccgaaaa aatcgctata atgaccccga agcagggtta 780 tgcagcggaa aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 840 gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 900 atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 960 gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1020 gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1080 ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1140 cagtgagcga ggaagcggaa gagcgccaga aggccgccag agaggccgag cgcggccgtg 1200 aggcttggac gctagggcag ggcatgaaaa agcccgtagc gggctgctac gggcgtctga 1260 cgcggtggaa agggggaggg gatgttgtct acatggctct gctgtagtga gtgggttgcg 1320 ctccggcagc ggtcctgatc aatcgtcacc ctttctcggt ccttcaacgt tcctgacaac 1380 gagcctcctt ttcgccaatc catcgacaat caccgcgagt ccctgctcga acgctgcgtc 1440 cggaccggct tcgtcgaagg cgtctatcgc ggcccgcaac agcggcgaga gcggagcctg 1500 ttcaacggtg ccgccgcgct cgccggcatc gctgtcgccg gcctgctcct caagcacggc 1560 cccaacagtg aagtagctga ttgtcatcag cgcattgacg gcgtccccgg ccgaaaaacc 1620 cgcctcgcag aggaagcgaa gctgcgcgtc ggccgtttcc atctgcggtg cgcccggtcg 1680 cgtgccggca tggatgcgcg cgccatcgcg gtaggcgagc agcgcctgcc tgaagctgcg 1740 ggcattcccg atcagaaatg agcgccagtc gtcgtcggct ctcggcaccg aatgcgtatg 1800 attctccgcc agcatggctt cggccagtgc gtcgagcagc gcccgcttgt tcctgaagtg 1860 ccagtaaagc gccggctgct gaacccccaa ccgttccgcc agtttgcgtg tcgtcagacc 1920 gtctacgccg acctcgttca acaggtccag ggcggcacgg atcactgtat tcggctgcaa 1980 ctttgtcatg cttgacactt tatcactgat aaacataata tgtccaccaa cttatcagtg 2040 ataaagaatc cgcgcgttca atcggaccag cggaggctgg tccggaggcc agacgtgaaa 2100 cccaacatac ccctgatcgt aattctgagc actgtcgcgc tcgacgctgt cggcatcggc 2160 ctgattatgc cggtgctgcc 
gggcctcctg cgcgatctgg ttcactcgaa cgacgtcacc 2220 gcccactatg gcattctgct ggcgctgtat gcgttggtgc aatttgcctg cgcacctgtg 2280 ctgggcgcgc tgtcggatcg tttcgggcgg cggccaatct tgctcgtctc gctggccggc 2340 gccagatctg gggaaccctg tggttggcat gcacatacaa atggacgaac ggataaacct 2400 tttcacgccc ttttaaatat ccgattattc taataaacgc tcttttctct taggtttacc 2460 cgccaatata tcctgtcaaa cactgatagt ttaaactgaa ggcgggaaac gacaatctga 2520 tcatgagcgg agaattaagg gagtcacgtt atgacccccg ccgatgacgc gggacaagcc 2580 gttttacgtt tggaactgac agaaccgcaa cgttgaagga gccactcagc cgcgggtttc 2640 tggagtttaa tgagctaagc acatacgtca gaaaccatta ttgcgcgttc aaaagtcgcc 2700 taaggtcact atcagctagc aaatatttct tgtcaaaaat gctccactga cgttccataa 2760 attcccctcg gtatccaatt agagtctcat attcactctc aatccaaata atctgcaccg 2820 gatctggatc gtttcgcatg attgaacaag atggattgca cgcaggttct ccggccgctt 2880 gggtggagag gctattcggc tatgactggg cacaacagac aatcggctgc tctgatgccg 2940 ccgtgttccg gctgtcagcg caggggcgcc cggttctttt tgtcaagacc gacctgtccg 3000 gtgccctgaa tgaactgcag gacgaggcag cgcggctatc gtggctggcc acgacgggcg 3060 ttccttgcgc agctgtgctc gacgttgtca ctgaagcggg aagggactgg ctgctattgg 3120 gcgaagtgcc ggggcaggat ctcctgtcat ctcaccttgc tcctgccgag aaagtatcca 3180 tcatggctga tgcaatgcgg cggctgcata cgcttgatcc ggctacctgc ccattcgacc 3240 accaagcgaa acatcgcatc gagcgagcac gtactcggat ggaagccggt cttgtcgatc 3300 aggatgatct ggacgaagag catcaggggc tcgcgccagc cgaactgttc gccaggctca 3360 aggcgcgcat gcccgacggc gatgatctcg tcgtgaccca tggcgatgcc tgcttgccga 3420 atatcatggt ggaaaatggc cgcttttctg gattcatcga ctgtggccgg ctgggtgtgg 3480 cggaccgcta tcaggacata gcgttggcta cccgtgatat tgctgaagag cttggcggcg 3540 aatgggctga ccgcttcctc gtgctttacg gtatcgccgc tcccgattcg cagcgcatcg 3600 ccttctatcg ccttcttgac gagttcttct gagcgggact ctggggttcg aaatgaccga 3660 ccaagcgacg cccaacctgc catcacgaga tttcgattcc accgccgcct tctatgaaag 3720 gttgggcttc ggaatcgttt tccgggacgc cggctggatg atcctccagc gcggggatct 3780 catgctggag ttcttcgccc acgggatctc tgcggaacag gcggtcgaag gtgccgatat 3840 cattacgaca gcaacggccg acaagcacaa 
cgccacgatc ctgagcgaca atatgatcgg 3900 gcccggcgtc cacatcaacg gcgtcggcgg cgactgccca ggcaagaccg agatgcaccg 3960 cgatatcttg ctgcgttcgg atattttcgt ggagttcccg ccacagaccc ggatgatccc 4020 cgatcgttca aacatttggc aataaagttt cttaagattg aatcctgttg ccggtcttgc 4080 gatgattatc atataatttc tgttgaatta cgttaagcat gtaataatta acatgtaatg 4140 catgacgtta tttatgagat gggtttttat gattagagtc ccgcaattat acatttaata 4200 cgcgatagaa aacaaaatat agcgcgcaaa ctaggataaa ttatcgcgcg cggtgtcatc 4260 tatgttacta gatcgggcct cctgtcaatg ctggcggcgg ctctggtggt ggttctggtg 4320 gcggctctga gggtggtggc tctgagggtg gcggttctga gggtggcggc tctgagggag 4380 gcggttccgg tggtggctct ggttccggtg attttgatta tgaaaagatg gcaaacgcta 4440 ataagggggc tatgaccgaa aatgccgatg aaaacgcgct acagtctgac gctaaaggca 4500 aacttgattc tgtcgctact gattacggtg ctgctatcga tggtttcatt ggtgacgttt 4560 ccggccttgc taatggtaat ggtgctactg gtgattttgc tggctctaat tcccaaatgg 4620 ctcaagtcgg tgacggtgat aattcacctt taatgaataa tttccgtcaa tatttacctt 4680 ccctccctca atcggttgaa tgtcgccctt ttgtctttgg cccaatacgc aaaccgcctc 4740 tccccgcgcg ttggccgatt cattaatgca gctggcacga caggtttccc gactggaaag 4800 cgggcagtga gcgcaacgca attaatgtga gttagctcac tcattaggca ccccaggctt 4860 tacactttat gcttccggct cgtatgttgt gtggaattgt gagcggataa caatttcaca 4920 caggaaacag ctatgaccat gattacgcca agcttcacgt gaacatggtg gagcacgaca 4980 cacttgtcta ctccaaaaat atcaaagata cagtctcaga agaccaaagg gcaattgaga 5040 cttttcaaca aagggtaata tccggaaacc tcctcggatt ccattgccca gctatctgtc 5100 actttattgt gaagatagtg gaaaaggaag gtggctccta caaatgccat cattgcgata 5160 aaggaaaggc catcgttgaa gatgcctctg ccgacagtgg tcccaaagat ggacccccac 5220 ccacgaggag catcgtggaa aaagaagacg ttccaaccac gtcttcaaag caagtggatt 5280 gatgtgataa catggtggag cacgacacac ttgtctactc caaaaatatc aaagatacag 5340 tctcagaaga ccaaagggca attgagactt ttcaacaaag ggtaatatcc ggaaacctcc 5400 tcggattcca ttgcccagct atctgtcact ttattgtgaa gatagtggaa aaggaaggtg 5460 gctcctacaa atgccatcat tgcgataaag gaaaggccat cgttgaagat gcctctgccg 5520 acagtggtcc caaagatgga cccccaccca cgaggagcat 
cgtggaaaaa gaagacgttc 5580 caaccacgtc ttcaaagcaa gtggattgat gtgatatctc cactgacgta agggatgacg 5640 cacaatccca ctatccttcg caagaccctt cctctatata aggaagttca tttcatttgg 5700 agaggacctc gaaattctca acacaacata tacaaaacaa acgaatctca agcaatcaag 5760 cattctactt ctattgcagc aatttaaatc atttctttta aagcaaaagc aattttctga 5820 aaattttcac catttacgaa cgatagggat ccgagaccca agcttgccac catgccgaag 5880 aagaagcgca aggtggacaa gaagtacagc atcggcctgg acatcggtac caacagcgtg 5940 ggctgggccg tgatcaccga cgagtacaag gtgcccagca agaagttcaa ggtgctgggc 6000 aacaccgacc gccacagcat caagaagaac ctgatcggcg ccctgctgtt cgacagcggc 6060 gagaccgccg aggccacccg cctgaagcgc accgcccgcc gccgctacac ccgccgcaag 6120 aaccgcatct gctacctgca ggagatcttc agcaacgaga tggccaaggt ggacgacagc 6180 ttcttccacc gcctggagga gagcttcctg gtggaggagg acaagaagca cgagcgccac 6240 cccatcttcg gcaacatcgt ggacgaggtg gcctaccacg agaagtaccc caccatctac 6300 cacctgcgca agaagctggt ggacagcacc gacaaggccg acctgcgcct gatctacctg 6360 gccctggccc acatgatcaa gttccgcggc cacttcctga tcgagggcga cctgaacccc 6420 gacaacagcg acgtggacaa gctgttcatc cagctggtgc agacctacaa ccagctgttc 6480 gaggagaacc ccatcaacgc cagcggcgtg gacgccaagg ccatcctgag cgcccgcctg 6540 agcaagagcc gccgcctgga gaacctgatc gcccagctgc ccggcgagaa gaagaacggc 6600 ctgttcggca acctgatcgc cctgagcctg ggcctgaccc ccaacttcaa gagcaacttc 6660 gacctggccg aggacgccaa gctgcagctg agcaaggaca cctacgacga cgacctggac 6720 aacctgctgg cccagatcgg cgaccagtac gccgacctgt tcctggccgc caagaacctg 6780 agcgacgcca tcctgctgag cgacatcctg cgcgtgaaca ccgagatcac caaggccccc 6840 ctgagcgcca gcatgatcaa gcgctacgac gagcaccacc aggacctgac cctgctgaag 6900 gccctggtgc gccagcagct gcccgagaag tacaaggaga tcttcttcga ccagagcaag 6960 aacggctacg ccggctacat cgacggcggc gccagccagg aggagttcta caagttcatc 7020 aagcccatcc tggagaagat ggacggcacc gaggagctgc tggtgaagct gaaccgcgag 7080 gacctgctgc gcaagcagcg caccttcgac aacggcagca tcccccacca gatccacctg 7140 ggcgagctgc acgccatcct gcgccgccag gaggacttct accccttcct gaaggacaac 7200 cgcgagaaga tcgagaagat cctgaccttc cgcatcccct actacgtggg 
ccccctggcc 7260 cgcggcaaca gccgcttcgc ctggatgacc cgcaagagcg aggagaccat caccccctgg 7320 aacttcgagg aggtggtgga caagggcgcc agcgcccaga gcttcatcga gcgcatgacc 7380 aacttcgaca agaacctgcc caacgagaag gtgctgccca agcacagcct gctgtacgag 7440 tacttcaccg tgtacaacga gctgaccaag gtgaagtacg tgaccgaggg catgcgcaag 7500 cccgccttcc tgagcggcga gcagaagaag gccatcgtgg acctgctgtt caagaccaac 7560 cgcaaggtga ccgtgaagca gctgaaggag gactacttca agaagatcga gtgcttcgac 7620 agcgtggaga tcagcggcgt ggaggaccgc ttcaacgcca gcctgggcac ctaccacgac 7680 ctgctgaaga tcatcaagga caaggacttc ctggacaacg aggagaacga ggacatcctg 7740 gaggacatcg tgctgaccct gaccctgttc gaggaccgcg agatgatcga ggagcgcctg 7800 aagacctacg cccacctgtt cgacgacaag gtgatgaagc agctgaagcg ccgccgctac 7860 accggctggg gccgcctgag ccgcaagctt atcaacggca tccgcgacaa gcagagcggc 7920 aagaccatcc tggacttcct gaagagcgac ggcttcgcca accgcaactt catgcagctg 7980 atccacgacg acagcctgac cttcaaggag gacatccaga aggcccaggt gagcggccag 8040 ggcgacagcc tgcacgagca catcgccaac ctggccggca gccccgccat caagaagggc 8100 atcctgcaga ccgtgaaggt ggtggacgag ctggtgaagg tgatgggccg ccacaagccc 8160 gagaacatcg tgatcgagat ggcccgcgag aaccagacca cccagaaggg ccagaagaac 8220 agccgcgagc gcatgaagcg catcgaggag ggcatcaagg agctgggcag ccagatcctg 8280 aaggagcacc ccgtggagaa cacccagctg cagaacgaga agctgtacct gtactacctg 8340 cagaacggcc gcgacatgta cgtggaccag gagctggaca tcaaccgcct gagcgactac 8400 gacgtggacc acatcgtgcc ccagagcttc ctgaaggacg acagcatcga caacaaggtg 8460 ctgacccgca gcgacaagaa ccgcggcaag agcgacaacg tgcccagcga ggaggtggtg 8520 aagaagatga agaactactg gcgccagctg ctgaacgcca agctgatcac ccagcgcaag 8580 ttcgacaacc tgaccaaggc cgagcgcggc ggcctgagcg agctggacaa ggccggcttc 8640 atcaagcgcc agctggtgga gacccgccag atcaccaagc acgtggccca gatcctggac 8700 agccgcatga acaccaagta cgacgagaac gacaagctga tccgcgaggt gaaggtgatc 8760 accctgaaga gcaagctggt gagcgacttc cgcaaggact tccagttcta caaggtgcgc 8820 gagatcaaca actaccacca cgcccacgac gcctacctga acgccgtggt gggcaccgcc 8880 ctgatcaaga agtaccccaa gctggagagc gagttcgtgt acggcgacta caaggtgtac 
8940 gacgtgcgca agatgatcgc caagagcgag caggagatcg gcaaggccac cgccaagtac 9000 ttcttctaca gcaacatcat gaacttcttc aagaccgaga tcaccctggc caacggcgag 9060 atccgcaagc gccccctgat cgagaccaac ggcgagaccg gcgagatcgt gtgggacaag 9120 ggccgcgact tcgccaccgt gcgcaaggtg ctgagcatgc cccaggtgaa catcgtgaag 9180 aagaccgagg tgcagaccgg cggcttcagc aaggagagca tcctgcccaa gcgcaacagc 9240 gacaagctga tcgcccgcaa gaaggactgg gaccccaaga agtacggcgg cttcgacagc 9300 cccaccgtgg cctacagcgt gctggtggtg gccaaggtgg agaagggcaa gagcaagaag 9360 ctgaagagcg tgaaggagct gctgggcatc accatcatgg agcgcagcag cttcgagaag 9420 aaccccatcg acttcctgga ggccaagggc tacaaggagg tgaagaagga cctgatcatc 9480 aagctgccca agtacagcct gttcgagctg gagaacggcc gcaagcgcat gctggccagc 9540 gccggcgagc tgcagaaggg caacgagctg gccctgccca gcaagtacgt gaacttcctg 9600 tacctggcca gccactacga gaagctgaag ggcagccccg aggacaacga gcagaagcag 9660 ctgttcgtgg agcagcacaa gcactacctg gacgagatca tcgagcagat cagcgagttc 9720 agcaagcgcg tgatcctggc cgacgccaac ctggacaagg tgctgagcgc ctacaacaag 9780 caccgcgaca agcccatccg cgagcaggcc gagaacatca tccacctgtt caccctgacc 9840 aacctgggcg cccccgccgc cttcaagtac ttcgacacca ccatcgaccg caagcgctac 9900 accagcacca aggaggtgct ggacgccacc ctgatccacc agagcatcac cggtctgtac 9960 gagacccgca tcgacctgag ccagctgggc ggcgacggcg gctccggaaa gagaccagct 10020 gctaccaaga aggctggaca ggctaagaag aagaagtaag agctcgtccg caaaaatcac 10080 cagtctctct ctacaaatct atctctctct atttttctcc agaataatgt gtgagtagtt 10140 cccagataag ggaattaggg ttcttatagg gtttcgctca tgtgttgagc atataagaaa 10200 cccttagtat gtatttgtat ttgtaaaata cttctatcaa taaaatttct aattcctaaa 10260 accaaaatcc agtgactata attagccttc ctcataagtt gcactacttc agcgtcccaa 10320 ttgcaccctt accacgaaga caggtttgtc caatcccata ttgcgacctt ggcagggggt 10380 tcgcaagtcc cacccgaaac gttgctgaag gctcaggttt ctgagcgaca aatgctttaa 10440 acgcgagttc ccgctcataa cctggaccga atgcgggtcg accccgggac tagtctcgag 10500 cacgtggaat tcactggccg tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac 10560 ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata gcgaagaggc 
10620 ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgcccgct cctttcgctt 10680 tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc 10740 tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgatttgg 10800 gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg 10860 agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct 10920 cgggctattc ttttgattta taagggattt tgccgatttc ggaaccacca tcaaacagga 10980 ttttcgcctg ctggggcaaa ccagcgtgga ccgcttgctg caactctctc agggccaggc 11040 ggtgaagggc aatcagctgt tgcccgtctc actggtgaaa agaaaaacca ccccagtaca 11100 ttaaaaacgt ccgcaatgtg ttattaagtt gtctaagcgt caatttgttt acaccacaat 11160 atatcctgcc accagccagc caacagctcc ccgaccggca gctcggcaca aaatcaccac 11220 tcgatacagg cagcccatca gtccgggacg gcgtcagcgg gagagccgtt gtaaggcggc 11280 agactttgct catgttaccg atgctattcg gaagaacggc aactaagctg ccgggtttga 11340 aacacggatg atctcgcgga gggtagcatg ttgattgtaa cgatgacaga gcgttgctgc 11400 ctgtgatcaa atatcatctc cctcgcagag atccgaatta tcagccttct tattcatttc 11460 tcgcttaacc gtgacaggct gtcgatcttg agaactatgc cgacataata ggaaatcgct 11520 ggataaagcc gctgaggaag ctgagtggcg ctatttcttt agaagtgaac gttgacgata 11580 tcaactcccc tatccattgc tcaccgaatg gtacaggtcg gggacccgaa gttccgactg 11640 tcggcctgat gcatccccgg ctgatcgacc ccagatctgg ggctgagaaa gcccagtaag 11700 gaaacaactg taggttcgag tcgcgagatc ccccggaacc aaaggaagta ggttaaaccc 11760 gctccgatca ggccgagcca cgccaggccg agaacattgg ttcctgtagg catcgggatt 11820 ggcggatcaa acactaaagc tactggaacg agcagaagtc ctccggccgc cagttgccag 11880 gcggtaaagg tgagcagagg cacgggaggt tgccacttgc gggtcagcac ggttccgaac 11940 gccatggaaa ccgcccccgc caggcccgct gcgacgccga caggatctag cgctgcgttt 12000 ggtgtcaaca ccaacagcgc cacgcccgca gttccgcaaa tagcccccag gaccgccatc 12060 aatcgtatcg ggctacctag cagagcggca gagatgaaca cgaccatcag cggctgcaca 12120 gcgcctaccg tcgccgcgac cccgcccggc aggcggtaga ccgaaataaa caacaagctc 12180 cagaatagcg aaatattaag tgcgccgagg atgaagatgc gcatccacca gattcccgtt 12240 ggaatctgtc ggacgatcat cacgagcaat aaacccgccg 
gcaacgcccg cagcagcata 12300 ccggcgaccc ctcggcctcg ctgttcgggc tccacgaaaa cgccggacag atgcgccttg 12360 tgagcgtcct tggggccgtc ctcctgtttg aagaccgaca gcccaatgat ctcgccgtcg 12420 atgtaggcgc cgaatgccac ggcatctcgc aaccgttcag cgaacgcctc catgggcttt 12480 ttctcctcgt gctcgtaaac ggacccgaac atctctggag ctttcttcag ggccgacaat 12540 cggatctcgc ggaaatcctg cacgtcggcc gctccaagcc gtcgaatctg agccttaatc 12600 acaattgtca attttaatcc tctgtttatc ggcagttcgt agagcgcgcc gtgcgtcccg 12660 agcgatactg agcgaagcaa gtgcgtcgag cagtgcccgc ttgttcctga aatgccagta 12720 aagcgctggc tgctgaaccc ccagccggaa ctgaccccac aaggccctag cgtttgcaat 12780 gcaccaggtc atcattgacc caggcgtgtt ccaccaggcc gctgcctcgc aactcttcgc 12840 aggcttcgcc gacctgctcg cgccacttct tcacgcgggt ggaatccgat ccgcacatga 12900 ggcggaaggt ttccagcttg agcgggtacg gctcccggtg cgagctgaaa tagtcgaaca 12960 tccgtcgggc cgtcggcgac agcttgcggt acttctccca tatgaatttc gtgtagtggt 13020 cgccagcaaa cagcacgacg atttcctcgt cgatcaggac ctggcaacgg gacgttttct 13080 tgccacggtc caggacgcgg aagcggtgca gcagcgacac cgattccagg tgcccaacgc 13140 ggtcggacgt gaagcccatc gccgtcgcct gtaggcgcga caggcattcc tcggccttcg 13200 tgtaataccg gccattgatc gaccagccca ggtcctggca aagctcgtag aacgtgaagg 13260 tgatcggctc gccgataggg gtgcgcttcg cgtactccaa cacctgctgc cacaccagtt 13320 cgtcatcgtc ggcccgcagc tcgacgccgg tgtaggtgat cttcacgtcc ttgttgacgt 13380 ggaaaatgac cttgttttgc agcgcctcgc gcgggatttt cttgttgcgc gtggtgaaca 13440 gggcagagcg ggccgtgtcg tttggcatcg ctcgcatcgt gtccggccac ggcgcaatat 13500 cgaacaagga aagctgcatt tccttgatct gctgcttcgt gtgtttcagc aacgcggcct 13560 gcttggcctc gctgacctgt tttgccaggt cctcgccggc ggtttttcgc ttcttggtcg 13620 tcatagttcc tcgcgtgtcg atggtcatcg acttcgccaa acctgccgcc tcctgttcga 13680 gacgacgcga acgctccacg gcggccgatg gcgcgggcag ggcaggggga gccagttgca 13740 cgctgtcgcg ctcgatcttg gccgtagctt gctggaccat cgagccgacg gactggaagg 13800 tttcgcgggg cgcacgcatg acggtgcggc ttgcgatggt ttcggcatcc tcggcggaaa 13860 accccgcgtc gatcagttct tgcctgtatg ccttccggtc aaacgtccga ttcattcacc 13920 ctccttgcgg gattgccccg 
actcacgccg gggcaatgtg cccttattcc tgatttgacc 13980 cgcctggtgc cttggtgtcc agataatcca ccttatcggc aatgaagtcg gtcccgtaga 14040 ccgtctggcc gtccttctcg tacttggtat tccgaatctt gccctgcacg aataccagcg 14100 accccttgcc caaatacttg ccgtgggcct cggcctgaga gccaaaacac ttgatgcgga 14160 agaagtcggt gcgctcctgc ttgtcgccgg catcgttgcg ccacatctag gtactaaaac 14220 aattcatcca gtaaaatata atattttatt ttctcccaat caggcttgat ccccagtaag 14280 tcaaaaaata gctcgacata ctgttcttcc ccgatatcct ccctgatcga ccggacgcag 14340 aaggcaatgt cataccactt gtccgccctg ccgcttctcc caagatcaat aaagccactt 14400 actttgccat ctttcacaaa gatgttgctg tctcccaggt cgccgtggga aaagacaagt 14460 tcctcttcgg gcttttccgt ctttaaaaaa tcatacagct cgcgcggatc tttaaatgga 14520 gtgtcttctt cccagttttc gcaatccaca tcggccagat cgttattcag taagtaatcc 14580 aattcggcta agcggctgtc taagctattc gtatagggac aatccgatat gtcgatggag 14640 tgaaagagcc tgatgcactc cgcatacagc tcgataatct tttcagggct ttgttcatct 14700 tcatactctt ccgagcaaag gacgccatcg gcctcactca tgagcagatt gctccagcca 14760 tcatgccgtt caaagtgcag gacctttgga acaggcagct ttccttccag ccatagcatc 14820 atgtcctttt cccgttccac atcataggtg gtccctttat accggctgtc cgtcattttt 14880 aaatataggt tttcattttc tcccaccagc ttatatacct tagcaggaga cattccttcc 14940 gtatctttta cgcagcggta tttttcgatc agttttttca attccggtga tattctcatt 15000 ttagccattt attatttcct tcctcttttc tacagtattt aaagataccc caagaagcta 15060 attataacaa gacgaactcc aattcactgt tccttgcatt ctaaaacctt aaataccaga 15120 aaacagcttt ttcaaagttg ttttcaaagt tggcgtataa catagtatcg acggagccga 15180 ttttgaaacc acaattatgg gtgatgctgc caacttactg atttagtgta tgatggtgtt 15240 tttgaggtgc tccagtggct tctgtgtcta tcagctgtcc ctcctgttca gctactgacg 15300 gggtggtgcg taacggcaaa agcaccgccg gacatcagcg ctatctctgc tctcactgcc 15360 gtaaaacatg gcaactgcag ttcacttaca ccgcttctca acccggtacg caccagaaaa 15420 tcattgatat ggccatgaat ggcgttggat gccgggcaac agcccgcatt atgggcgttg 15480 gcctcaacac gattttacgt cacttaaaaa actcaggccg cagtcggtaa cctcgcgcat 15540 acagccgggc agtgacgtca tcgtctgcgc ggaaatggac gaacagtggg gctatgtcgg 15600 
ggctaaatcg cgccagcgct ggctgtttta cgcgtatgac agtctccgga agacggttgt 15660 tgcgcacgta ttcggtgaac gcactatggc gacgctgggg cgtcttatga gcctgctgtc 15720 accctttgac gtggtgatat ggatgacgga tggctggccg ctgtatgaat cccgcctgaa 15780 gggaaagctg cacgtaatca gcaagcgata tacgcagcga attgagcggc ataacctgaa 15840 tctgaggcag cacctggcac ggctgggacg gaagtcgctg tcgttctcaa aatcggtgga 15900 gctgcatgac aaagtcatcg ggcattatct gaacataaaa cactatcaat aagttggagt 15960 cattacccaa ttatgataga atttacaagc tataaggtta ttgtcctggg tttcaagcat 16020 tagtccatgc aagtttttat gctttgccca ttctatagat atattgataa gcgcgctgcc 16080 tatgccttgc cccctgaaat ccttacatac ggcgatatct tctatataaa agatatatta 16140 tcttatcagt attgtcaata tattcaaggc aatctgcctc ctcatcctct tcatcctctt 16200 cgtcttggta gctttttaaa tatggcgctt catagagtaa ttctgtaaag gtccaattct 16260 cgttttcata cctcggtata atcttaccta tcacctcaaa tggttcgctg ggtttatcgc 16320 acccccgaac acgagcacgg cacccgcgac cactatgcca agaatgccca aggtaaaaat 16380 tgccggcccc gccatgaagt ccgtgaatgc cccgacggcc gaagtgaagg gcaggccgcc 16440 acccaggccg ccgccctcac tgcccggcac ctggtcgctg aatgtcgatg ccagcacctg 16500 cggcacgtca atgcttccgg gcgtcgcgct cgggctgatc gcccatcccg ttactgcccc 16560 gatcccggca atggcaagga ctgccagcgc tgccattttt ggggtgaggc cgttcgcggc 16620 cgaggggcgc agcccctggg gggatgggag gcccgcgtta gcgggccggg agggttcgag 16680 aagggggggc accccccttc ggcgtgcgcg gtcacgcgca cagggcgcag ccctggttaa 16740 aaacaaggtt tataaatatt ggtttaaaag caggttaaaa gacaggttag cggtggccga 16800 aaaacgggcg gaaacccttg caaatgctgg attttctgcc tgtggacagc ccctcaaatg 16860 tcaataggtg cgcccctcat ctgtcagcac tctgcccctc aagtgtcaag gatcgcgccc 16920 ctcatctgtc agtagtcgcg cccctcaagt gtcaataccg cagggcactt atccccaggc 16980 ttgtccacat catctgtggg aaactcgcgt aaaatcaggc gttttcgccg atttgcgagg 17040 ctggccagct ccacgtcgcc ggccgaaatc gagcctgccc ctcatctgtc aacgccgcgc 17100 cgggtgagtc ggcccctcaa gtgtcaacgt ccgcccctca tctgtcagtg agggccaagt 17160 tttccgcgag gtatccacaa cgccggcggc cgcggtgtct cgcacacggc ttcgacggcg 17220 tttctggcgc gtttgcaggg ccatagacgg ccgccagccc agcggcgagg 
gcaaccagcc 17280 cgg 17283 <210> 18 <211> 18700 <212> DNA <213> Artificial Sequence <220> <223> V1k_GE_PMT <400> 18 tgagcgtcgc aaaggcgctc ggtcttgcct tgctcgtcgg tgatgtactt caccagctcc 60 gcgaagtcgc tcttcttgat ggagcgcatg gggacgtgct tggcaatcac gcgcaccccc 120 cggccgtttt agcggctaaa aaagtcatgg ctctgccctc gggcggacca cgcccatcat 180 gaccttgcca agctcgtcct gcttctcttc gatcttcgcc agcagggcga ggatcgtggc 240 atcaccgaac cgcgccgtgc gcgggtcgtc ggtgagccag agtttcagca ggccgcccag 300 gcggcccagg tcgccattga tgcgggccag ctcgcggacg tgctcatagt ccacgacgcc 360 cgtgattttg tagccctggc cgacggccag caggtaggcc gacaggctca tgccggccgc 420 cgccgccttt tcctcaatcg ctcttcgttc gtctggaagg cagtacacct tgataggtgg 480 gctgcccttc ctggttggct tggtttcatc agccatccgc ttgccctcat ctgttacgcc 540 ggcggtagcc ggccagcctc gcagagcagg attcccgttg agcaccgcca ggtgcgaata 600 agggacagtg aagaaggaac acccgctcgc gggtgggcct acttcaccta tcctgcccgg 660 ctgacgccgt tggatacacc aaggaaagtc tacacgaacc ctttggcaaa atcctgtata 720 tcgtgcgaaa aaggatggat ataccgaaaa aatcgctata atgaccccga agcagggtta 780 tgcagcggaa aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 840 gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 900 atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 960 gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1020 gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1080 ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1140 cagtgagcga ggaagcggaa gagcgccaga aggccgccag agaggccgag cgcggccgtg 1200 aggcttggac gctagggcag ggcatgaaaa agcccgtagc gggctgctac gggcgtctga 1260 cgcggtggaa agggggaggg gatgttgtct acatggctct gctgtagtga gtgggttgcg 1320 ctccggcagc ggtcctgatc aatcgtcacc ctttctcggt ccttcaacgt tcctgacaac 1380 gagcctcctt ttcgccaatc catcgacaat caccgcgagt ccctgctcga acgctgcgtc 1440 cggaccggct tcgtcgaagg cgtctatcgc ggcccgcaac agcggcgaga gcggagcctg 1500 ttcaacggtg ccgccgcgct cgccggcatc gctgtcgccg gcctgctcct caagcacggc 1560 cccaacagtg aagtagctga ttgtcatcag cgcattgacg gcgtccccgg 
ccgaaaaacc 1620 cgcctcgcag aggaagcgaa gctgcgcgtc ggccgtttcc atctgcggtg cgcccggtcg 1680 cgtgccggca tggatgcgcg cgccatcgcg gtaggcgagc agcgcctgcc tgaagctgcg 1740 ggcattcccg atcagaaatg agcgccagtc gtcgtcggct ctcggcaccg aatgcgtatg 1800 attctccgcc agcatggctt cggccagtgc gtcgagcagc gcccgcttgt tcctgaagtg 1860 ccagtaaagc gccggctgct gaacccccaa ccgttccgcc agtttgcgtg tcgtcagacc 1920 gtctacgccg acctcgttca acaggtccag ggcggcacgg atcactgtat tcggctgcaa 1980 ctttgtcatg cttgacactt tatcactgat aaacataata tgtccaccaa cttatcagtg 2040 ataaagaatc cgcgcgttca atcggaccag cggaggctgg tccggaggcc agacgtgaaa 2100 cccaacatac ccctgatcgt aattctgagc actgtcgcgc tcgacgctgt cggcatcggc 2160 ctgattatgc cggtgctgcc gggcctcctg cgcgatctgg ttcactcgaa cgacgtcacc 2220 gcccactatg gcattctgct ggcgctgtat gcgttggtgc aatttgcctg cgcacctgtg 2280 ctgggcgcgc tgtcggatcg tttcgggcgg cggccaatct tgctcgtctc gctggccggc 2340 gccagatctg gggaaccctg tggttggcat gcacatacaa atggacgaac ggataaacct 2400 tttcacgccc ttttaaatat ccgattattc taataaacgc tcttttctct taggtttacc 2460 cgccaatata tcctgtcaaa cactgatagt ttaaactgaa ggcgggaaac gacaatctga 2520 tcatgagcgg agaattaagg gagtcacgtt atgacccccg ccgatgacgc gggacaagcc 2580 gttttacgtt tggaactgac agaaccgcaa cgttgaagga gccactcagc cgcgggtttc 2640 tggagtttaa tgagctaagc acatacgtca gaaaccatta ttgcgcgttc aaaagtcgcc 2700 taaggtcact atcagctagc aaatatttct tgtcaaaaat gctccactga cgttccataa 2760 attcccctcg gtatccaatt agagtctcat attcactctc aatccaaata atctgcaccg 2820 gatctggatc gtttcgcatg attgaacaag atggattgca cgcaggttct ccggccgctt 2880 gggtggagag gctattcggc tatgactggg cacaacagac aatcggctgc tctgatgccg 2940 ccgtgttccg gctgtcagcg caggggcgcc cggttctttt tgtcaagacc gacctgtccg 3000 gtgccctgaa tgaactgcag gacgaggcag cgcggctatc gtggctggcc acgacgggcg 3060 ttccttgcgc agctgtgctc gacgttgtca ctgaagcggg aagggactgg ctgctattgg 3120 gcgaagtgcc ggggcaggat ctcctgtcat ctcaccttgc tcctgccgag aaagtatcca 3180 tcatggctga tgcaatgcgg cggctgcata cgcttgatcc ggctacctgc ccattcgacc 3240 accaagcgaa acatcgcatc gagcgagcac gtactcggat ggaagccggt cttgtcgatc 
3300 aggatgatct ggacgaagag catcaggggc tcgcgccagc cgaactgttc gccaggctca 3360 aggcgcgcat gcccgacggc gatgatctcg tcgtgaccca tggcgatgcc tgcttgccga 3420 atatcatggt ggaaaatggc cgcttttctg gattcatcga ctgtggccgg ctgggtgtgg 3480 cggaccgcta tcaggacata gcgttggcta cccgtgatat tgctgaagag cttggcggcg 3540 aatgggctga ccgcttcctc gtgctttacg gtatcgccgc tcccgattcg cagcgcatcg 3600 ccttctatcg ccttcttgac gagttcttct gagcgggact ctggggttcg aaatgaccga 3660 ccaagcgacg cccaacctgc catcacgaga tttcgattcc accgccgcct tctatgaaag 3720 gttgggcttc ggaatcgttt tccgggacgc cggctggatg atcctccagc gcggggatct 3780 catgctggag ttcttcgccc acgggatctc tgcggaacag gcggtcgaag gtgccgatat 3840 cattacgaca gcaacggccg acaagcacaa cgccacgatc ctgagcgaca atatgatcgg 3900 gcccggcgtc cacatcaacg gcgtcggcgg cgactgccca ggcaagaccg agatgcaccg 3960 cgatatcttg ctgcgttcgg atattttcgt ggagttcccg ccacagaccc ggatgatccc 4020 cgatcgttca aacatttggc aataaagttt cttaagattg aatcctgttg ccggtcttgc 4080 gatgattatc atataatttc tgttgaatta cgttaagcat gtaataatta acatgtaatg 4140 catgacgtta tttatgagat gggtttttat gattagagtc ccgcaattat acatttaata 4200 cgcgatagaa aacaaaatat agcgcgcaaa ctaggataaa ttatcgcgcg cggtgtcatc 4260 tatgttacta gatcgggcct cctgtcaatg ctggcggcgg ctctggtggt ggttctggtg 4320 gcggctctga gggtggtggc tctgagggtg gcggttctga gggtggcggc tctgagggag 4380 gcggttccgg tggtggctct ggttccggtg attttgatta tgaaaagatg gcaaacgcta 4440 ataagggggc tatgaccgaa aatgccgatg aaaacgcgct acagtctgac gctaaaggca 4500 aacttgattc tgtcgctact gattacggtg ctgctatcga tggtttcatt ggtgacgttt 4560 ccggccttgc taatggtaat ggtgctactg gtgattttgc tggctctaat tcccaaatgg 4620 ctcaagtcgg tgacggtgat aattcacctt taatgaataa tttccgtcaa tatttacctt 4680 ccctccctca atcggttgaa tgtcgccctt ttgtctttgg cccaatacgc aaaccgcctc 4740 tccccgcgcg ttggccgatt cattaatgca gctggcacga caggtttccc gactggaaag 4800 cgggcagtga gcgcaacgca attaatgtga gttagctcac tcattaggca ccccaggctt 4860 tacactttat gcttccggct cgtatgttgt gtggaattgt gagcggataa caatttcaca 4920 caggaaacag ctatgaccat gattacgcca agcttcacgt gaacatggtg gagcacgaca 4980 
cacttgtcta ctccaaaaat atcaaagata cagtctcaga agaccaaagg gcaattgaga 5040 cttttcaaca aagggtaata tccggaaacc tcctcggatt ccattgccca gctatctgtc 5100 actttattgt gaagatagtg gaaaaggaag gtggctccta caaatgccat cattgcgata 5160 aaggaaaggc catcgttgaa gatgcctctg ccgacagtgg tcccaaagat ggacccccac 5220 ccacgaggag catcgtggaa aaagaagacg ttccaaccac gtcttcaaag caagtggatt 5280 gatgtgataa catggtggag cacgacacac ttgtctactc caaaaatatc aaagatacag 5340 tctcagaaga ccaaagggca attgagactt ttcaacaaag ggtaatatcc ggaaacctcc 5400 tcggattcca ttgcccagct atctgtcact ttattgtgaa gatagtggaa aaggaaggtg 5460 gctcctacaa atgccatcat tgcgataaag gaaaggccat cgttgaagat gcctctgccg 5520 acagtggtcc caaagatgga cccccaccca cgaggagcat cgtggaaaaa gaagacgttc 5580 caaccacgtc ttcaaagcaa gtggattgat gtgatatctc cactgacgta agggatgacg 5640 cacaatccca ctatccttcg caagaccctt cctctatata aggaagttca tttcatttgg 5700 agaggacctc gaaattctca acacaacata tacaaaacaa acgaatctca agcaatcaag 5760 cattctactt ctattgcagc aatttaaatc atttctttta aagcaaaagc aattttctga 5820 aaattttcac catttacgaa cgatagggat ccgagaccca agcttgccac catgccgaag 5880 aagaagcgca aggtggacaa gaagtacagc atcggcctgg acatcggtac caacagcgtg 5940 ggctgggccg tgatcaccga cgagtacaag gtgcccagca agaagttcaa ggtgctgggc 6000 aacaccgacc gccacagcat caagaagaac ctgatcggcg ccctgctgtt cgacagcggc 6060 gagaccgccg aggccacccg cctgaagcgc accgcccgcc gccgctacac ccgccgcaag 6120 aaccgcatct gctacctgca ggagatcttc agcaacgaga tggccaaggt ggacgacagc 6180 ttcttccacc gcctggagga gagcttcctg gtggaggagg acaagaagca cgagcgccac 6240 cccatcttcg gcaacatcgt ggacgaggtg gcctaccacg agaagtaccc caccatctac 6300 cacctgcgca agaagctggt ggacagcacc gacaaggccg acctgcgcct gatctacctg 6360 gccctggccc acatgatcaa gttccgcggc cacttcctga tcgagggcga cctgaacccc 6420 gacaacagcg acgtggacaa gctgttcatc cagctggtgc agacctacaa ccagctgttc 6480 gaggagaacc ccatcaacgc cagcggcgtg gacgccaagg ccatcctgag cgcccgcctg 6540 agcaagagcc gccgcctgga gaacctgatc gcccagctgc ccggcgagaa gaagaacggc 6600 ctgttcggca acctgatcgc cctgagcctg ggcctgaccc ccaacttcaa gagcaacttc 6660 gacctggccg 
aggacgccaa gctgcagctg agcaaggaca cctacgacga cgacctggac 6720 aacctgctgg cccagatcgg cgaccagtac gccgacctgt tcctggccgc caagaacctg 6780 agcgacgcca tcctgctgag cgacatcctg cgcgtgaaca ccgagatcac caaggccccc 6840 ctgagcgcca gcatgatcaa gcgctacgac gagcaccacc aggacctgac cctgctgaag 6900 gccctggtgc gccagcagct gcccgagaag tacaaggaga tcttcttcga ccagagcaag 6960 aacggctacg ccggctacat cgacggcggc gccagccagg aggagttcta caagttcatc 7020 aagcccatcc tggagaagat ggacggcacc gaggagctgc tggtgaagct gaaccgcgag 7080 gacctgctgc gcaagcagcg caccttcgac aacggcagca tcccccacca gatccacctg 7140 ggcgagctgc acgccatcct gcgccgccag gaggacttct accccttcct gaaggacaac 7200 cgcgagaaga tcgagaagat cctgaccttc cgcatcccct actacgtggg ccccctggcc 7260 cgcggcaaca gccgcttcgc ctggatgacc cgcaagagcg aggagaccat caccccctgg 7320 aacttcgagg aggtggtgga caagggcgcc agcgcccaga gcttcatcga gcgcatgacc 7380 aacttcgaca agaacctgcc caacgagaag gtgctgccca agcacagcct gctgtacgag 7440 tacttcaccg tgtacaacga gctgaccaag gtgaagtacg tgaccgaggg catgcgcaag 7500 cccgccttcc tgagcggcga gcagaagaag gccatcgtgg acctgctgtt caagaccaac 7560 cgcaaggtga ccgtgaagca gctgaaggag gactacttca agaagatcga gtgcttcgac 7620 agcgtggaga tcagcggcgt ggaggaccgc ttcaacgcca gcctgggcac ctaccacgac 7680 ctgctgaaga tcatcaagga caaggacttc ctggacaacg aggagaacga ggacatcctg 7740 gaggacatcg tgctgaccct gaccctgttc gaggaccgcg agatgatcga ggagcgcctg 7800 aagacctacg cccacctgtt cgacgacaag gtgatgaagc agctgaagcg ccgccgctac 7860 accggctggg gccgcctgag ccgcaagctt atcaacggca tccgcgacaa gcagagcggc 7920 aagaccatcc tggacttcct gaagagcgac ggcttcgcca accgcaactt catgcagctg 7980 atccacgacg acagcctgac cttcaaggag gacatccaga aggcccaggt gagcggccag 8040 ggcgacagcc tgcacgagca catcgccaac ctggccggca gccccgccat caagaagggc 8100 atcctgcaga ccgtgaaggt ggtggacgag ctggtgaagg tgatgggccg ccacaagccc 8160 gagaacatcg tgatcgagat ggcccgcgag aaccagacca cccagaaggg ccagaagaac 8220 agccgcgagc gcatgaagcg catcgaggag ggcatcaagg agctgggcag ccagatcctg 8280 aaggagcacc ccgtggagaa cacccagctg cagaacgaga agctgtacct gtactacctg 8340 cagaacggcc gcgacatgta 
cgtggaccag gagctggaca tcaaccgcct gagcgactac 8400 gacgtggacc acatcgtgcc ccagagcttc ctgaaggacg acagcatcga caacaaggtg 8460 ctgacccgca gcgacaagaa ccgcggcaag agcgacaacg tgcccagcga ggaggtggtg 8520 aagaagatga agaactactg gcgccagctg ctgaacgcca agctgatcac ccagcgcaag 8580 ttcgacaacc tgaccaaggc cgagcgcggc ggcctgagcg agctggacaa ggccggcttc 8640 atcaagcgcc agctggtgga gacccgccag atcaccaagc acgtggccca gatcctggac 8700 agccgcatga acaccaagta cgacgagaac gacaagctga tccgcgaggt gaaggtgatc 8760 accctgaaga gcaagctggt gagcgacttc cgcaaggact tccagttcta caaggtgcgc 8820 gagatcaaca actaccacca cgcccacgac gcctacctga acgccgtggt gggcaccgcc 8880 ctgatcaaga agtaccccaa gctggagagc gagttcgtgt acggcgacta caaggtgtac 8940 gacgtgcgca agatgatcgc caagagcgag caggagatcg gcaaggccac cgccaagtac 9000 ttcttctaca gcaacatcat gaacttcttc aagaccgaga tcaccctggc caacggcgag 9060 atccgcaagc gccccctgat cgagaccaac ggcgagaccg gcgagatcgt gtgggacaag 9120 ggccgcgact tcgccaccgt gcgcaaggtg ctgagcatgc cccaggtgaa catcgtgaag 9180 aagaccgagg tgcagaccgg cggcttcagc aaggagagca tcctgcccaa gcgcaacagc 9240 gacaagctga tcgcccgcaa gaaggactgg gaccccaaga agtacggcgg cttcgacagc 9300 cccaccgtgg cctacagcgt gctggtggtg gccaaggtgg agaagggcaa gagcaagaag 9360 ctgaagagcg tgaaggagct gctgggcatc accatcatgg agcgcagcag cttcgagaag 9420 aaccccatcg acttcctgga ggccaagggc tacaaggagg tgaagaagga cctgatcatc 9480 aagctgccca agtacagcct gttcgagctg gagaacggcc gcaagcgcat gctggccagc 9540 gccggcgagc tgcagaaggg caacgagctg gccctgccca gcaagtacgt gaacttcctg 9600 tacctggcca gccactacga gaagctgaag ggcagccccg aggacaacga gcagaagcag 9660 ctgttcgtgg agcagcacaa gcactacctg gacgagatca tcgagcagat cagcgagttc 9720 agcaagcgcg tgatcctggc cgacgccaac ctggacaagg tgctgagcgc ctacaacaag 9780 caccgcgaca agcccatccg cgagcaggcc gagaacatca tccacctgtt caccctgacc 9840 aacctgggcg cccccgccgc cttcaagtac ttcgacacca ccatcgaccg caagcgctac 9900 accagcacca aggaggtgct ggacgccacc ctgatccacc agagcatcac cggtctgtac 9960 gagacccgca tcgacctgag ccagctgggc ggcgacggcg gctccggaaa gagaccagct 10020 gctaccaaga aggctggaca ggctaagaag 
aagaagtaag agctcgtccg caaaaatcac 10080 cagtctctct ctacaaatct atctctctct atttttctcc agaataatgt gtgagtagtt 10140 cccagataag ggaattaggg ttcttatagg gtttcgctca tgtgttgagc atataagaaa 10200 cccttagtat gtatttgtat ttgtaaaata cttctatcaa taaaatttct aattcctaaa 10260 accaaaatcc agtgactata attagccttc ctcataagtt gcactacttc agcgtcccaa 10320 ttgcaccctt accacgaaga caggtttgtc caatcccata ttgcgacctt ggcagggggt 10380 tcgcaagtcc cacccgaaac gttgctgaag gctcaggttt ctgagcgaca aatgctttaa 10440 acgcgagttc ccgctcataa cctggaccga atgcgggtcg acagaaatct caaaattccg 10500 gcagaacaat tttgaatctc gatccgtaga aacgagacgg tcattgtttt agttccacca 10560 cgattatatt tgaaatttac gtgagtgtga gtgagacttg cataagaaaa taaaatcttt 10620 agttgggaaa aaattcaata atataaatgg gcttgagaag gaagcgaggg ataggccttt 10680 ttctaaaata ggcccattta agctattaac aatcttcaaa agtaccacag cgcttaggta 10740 aagaaagcag ctgagtttat atatggttag agacgaagta gtgattgaag tcatatctac 10800 caacacaaag ttttagagct agaaatagca agttaaaata aggctagtcc gttatcaact 10860 tgaaaaagtg gcaccgagtc ggtgcttttt tcaaatgaca attaaccacc gtgtattcgt 10920 tataacatca ggcagtttaa gtcgggacaa taggagccgc aatacacagt ttaccgcatc 10980 ttgacctaac tagaaatctc aaaattccgg cagaacaatt ttgaatctcg atccgtagaa 11040 acgagacggt cattgtttta gttccaccac gattatattt gaaatttacg tgagtgtgag 11100 tgagacttgc ataagaaaat aaaatcttta gttgggaaaa aattcaataa tataaatggg 11160 cttgagaagg aagcgaggga taggcctttt tctaaaatag gcccatttaa gctattaaca 11220 atcttcaaaa gtaccacagc gcttaggtaa agaaagcagc tgagtttata tatggttaga 11280 gacgaagtag tgattgtttt cagagtttag cgcattagtt ttagagctag aaatagcaag 11340 ttaaaataag gctagtccgt tatcaacttg aaaaagtggc accgagtcgg tgcttttttc 11400 ctgcgcttcg atcttagatc aaggttgcct ggagtgtctc ttgaactgta cagagttgag 11460 ctagtcgcga agcagatcag caacacattt ctgcttgcga gaaatctcaa aattccggca 11520 gaacaatttt gaatctcgat ccgtagaaac gagacggtca ttgttttagt tccaccacga 11580 ttatatttga aatttacgtg agtgtgagtg agacttgcat aagaaaataa aatctttagt 11640 tgggaaaaaa ttcaataata taaatgggct tgagaaggaa gcgagggata ggcctttttc 11700 taaaataggc 
ccatttaagc tattaacaat cttcaaaagt accacagcgc ttaggtaaag 11760 aaagcagctg agtttatata tggttagaga cgaagtagtg attgctcaaa gagcatgaca 11820 tctgttttag agctagaaat agcaagttaa aataaggcta gtccgttatc aacttgaaaa 11880 agtggcaccg agtcggtgct tttttactag tctcgagcac gtggaattca ctggccgtcg 11940 ttttacaacg tcgtgactgg gaaaaccctg gcgttaccca acttaatcgc cttgcagcac 12000 atcccccttt cgccagctgg cgtaatagcg aagaggcccg caccgatcgc ccttcccaac 12060 agttgcgcag cctgaatggc gcccgctcct ttcgctttct tcccttcctt tctcgccacg 12120 ttcgccggct ttccccgtca agctctaaat cgggggctcc ctttagggtt ccgatttagt 12180 gctttacggc acctcgaccc caaaaaactt gatttgggtg atggttcacg tagtgggcca 12240 tcgccctgat agacggtttt tcgccctttg acgttggagt ccacgttctt taatagtgga 12300 ctcttgttcc aaactggaac aacactcaac cctatctcgg gctattcttt tgatttataa 12360 gggattttgc cgatttcgga accaccatca aacaggattt tcgcctgctg gggcaaacca 12420 gcgtggaccg cttgctgcaa ctctctcagg gccaggcggt gaagggcaat cagctgttgc 12480 ccgtctcact ggtgaaaaga aaaaccaccc cagtacatta aaaacgtccg caatgtgtta 12540 ttaagttgtc taagcgtcaa tttgtttaca ccacaatata tcctgccacc agccagccaa 12600 cagctccccg accggcagct cggcacaaaa tcaccactcg atacaggcag cccatcagtc 12660 cgggacggcg tcagcgggag agccgttgta aggcggcaga ctttgctcat gttaccgatg 12720 ctattcggaa gaacggcaac taagctgccg ggtttgaaac acggatgatc tcgcggaggg 12780 tagcatgttg attgtaacga tgacagagcg ttgctgcctg tgatcaaata tcatctccct 12840 cgcagagatc cgaattatca gccttcttat tcatttctcg cttaaccgtg acaggctgtc 12900 gatcttgaga actatgccga cataatagga aatcgctgga taaagccgct gaggaagctg 12960 agtggcgcta tttctttaga agtgaacgtt gacgatatca actcccctat ccattgctca 13020 ccgaatggta caggtcgggg acccgaagtt ccgactgtcg gcctgatgca tccccggctg 13080 atcgacccca gatctggggc tgagaaagcc cagtaaggaa acaactgtag gttcgagtcg 13140 cgagatcccc cggaaccaaa ggaagtaggt taaacccgct ccgatcaggc cgagccacgc 13200 caggccgaga acattggttc ctgtaggcat cgggattggc ggatcaaaca ctaaagctac 13260 tggaacgagc agaagtcctc cggccgccag ttgccaggcg gtaaaggtga gcagaggcac 13320 gggaggttgc cacttgcggg tcagcacggt tccgaacgcc atggaaaccg cccccgccag 
13380 gcccgctgcg acgccgacag gatctagcgc tgcgtttggt gtcaacacca acagcgccac 13440 gcccgcagtt ccgcaaatag cccccaggac cgccatcaat cgtatcgggc tacctagcag 13500 agcggcagag atgaacacga ccatcagcgg ctgcacagcg cctaccgtcg ccgcgacccc 13560 gcccggcagg cggtagaccg aaataaacaa caagctccag aatagcgaaa tattaagtgc 13620 gccgaggatg aagatgcgca tccaccagat tcccgttgga atctgtcgga cgatcatcac 13680 gagcaataaa cccgccggca acgcccgcag cagcataccg gcgacccctc ggcctcgctg 13740 ttcgggctcc acgaaaacgc cggacagatg cgccttgtga gcgtccttgg ggccgtcctc 13800 ctgtttgaag accgacagcc caatgatctc gccgtcgatg taggcgccga atgccacggc 13860 atctcgcaac cgttcagcga acgcctccat gggctttttc tcctcgtgct cgtaaacgga 13920 cccgaacatc tctggagctt tcttcagggc cgacaatcgg atctcgcgga aatcctgcac 13980 gtcggccgct ccaagccgtc gaatctgagc cttaatcaca attgtcaatt ttaatcctct 14040 gtttatcggc agttcgtaga gcgcgccgtg cgtcccgagc gatactgagc gaagcaagtg 14100 cgtcgagcag tgcccgcttg ttcctgaaat gccagtaaag cgctggctgc tgaaccccca 14160 gccggaactg accccacaag gccctagcgt ttgcaatgca ccaggtcatc attgacccag 14220 gcgtgttcca ccaggccgct gcctcgcaac tcttcgcagg cttcgccgac ctgctcgcgc 14280 cacttcttca cgcgggtgga atccgatccg cacatgaggc ggaaggtttc cagcttgagc 14340 gggtacggct cccggtgcga gctgaaatag tcgaacatcc gtcgggccgt cggcgacagc 14400 ttgcggtact tctcccatat gaatttcgtg tagtggtcgc cagcaaacag cacgacgatt 14460 tcctcgtcga tcaggacctg gcaacgggac gttttcttgc cacggtccag gacgcggaag 14520 cggtgcagca gcgacaccga ttccaggtgc ccaacgcggt cggacgtgaa gcccatcgcc 14580 gtcgcctgta ggcgcgacag gcattcctcg gccttcgtgt aataccggcc attgatcgac 14640 cagcccaggt cctggcaaag ctcgtagaac gtgaaggtga tcggctcgcc gataggggtg 14700 cgcttcgcgt actccaacac ctgctgccac accagttcgt catcgtcggc ccgcagctcg 14760 acgccggtgt aggtgatctt cacgtccttg ttgacgtgga aaatgacctt gttttgcagc 14820 gcctcgcgcg ggattttctt gttgcgcgtg gtgaacaggg cagagcgggc cgtgtcgttt 14880 ggcatcgctc gcatcgtgtc cggccacggc gcaatatcga acaaggaaag ctgcatttcc 14940 ttgatctgct gcttcgtgtg tttcagcaac gcggcctgct tggcctcgct gacctgtttt 15000 gccaggtcct cgccggcggt ttttcgcttc ttggtcgtca 
tagttcctcg cgtgtcgatg 15060 gtcatcgact tcgccaaacc tgccgcctcc tgttcgagac gacgcgaacg ctccacggcg 15120 gccgatggcg cgggcagggc agggggagcc agttgcacgc tgtcgcgctc gatcttggcc 15180 gtagcttgct ggaccatcga gccgacggac tggaaggttt cgcggggcgc acgcatgacg 15240 gtgcggcttg cgatggtttc ggcatcctcg gcggaaaacc ccgcgtcgat cagttcttgc 15300 ctgtatgcct tccggtcaaa cgtccgattc attcaccctc cttgcgggat tgccccgact 15360 cacgccgggg caatgtgccc ttattcctga tttgacccgc ctggtgcctt ggtgtccaga 15420 taatccacct tatcggcaat gaagtcggtc ccgtagaccg tctggccgtc cttctcgtac 15480 ttggtattcc gaatcttgcc ctgcacgaat accagcgacc ccttgcccaa atacttgccg 15540 tgggcctcgg cctgagagcc aaaacacttg atgcggaaga agtcggtgcg ctcctgcttg 15600 tcgccggcat cgttgcgcca catctaggta ctaaaacaat tcatccagta aaatataata 15660 ttttattttc tcccaatcag gcttgatccc cagtaagtca aaaaatagct cgacatactg 15720 ttcttccccg atatcctccc tgatcgaccg gacgcagaag gcaatgtcat accacttgtc 15780 cgccctgccg cttctcccaa gatcaataaa gccacttact ttgccatctt tcacaaagat 15840 gttgctgtct cccaggtcgc cgtgggaaaa gacaagttcc tcttcgggct tttccgtctt 15900 taaaaaatca tacagctcgc gcggatcttt aaatggagtg tcttcttccc agttttcgca 15960 atccacatcg gccagatcgt tattcagtaa gtaatccaat tcggctaagc ggctgtctaa 16020 gctattcgta tagggacaat ccgatatgtc gatggagtga aagagcctga tgcactccgc 16080 atacagctcg ataatctttt cagggctttg ttcatcttca tactcttccg agcaaaggac 16140 gccatcggcc tcactcatga gcagattgct ccagccatca tgccgttcaa agtgcaggac 16200 ctttggaaca ggcagctttc cttccagcca tagcatcatg tccttttccc gttccacatc 16260 ataggtggtc cctttatacc ggctgtccgt catttttaaa tataggtttt cattttctcc 16320 caccagctta tataccttag caggagacat tccttccgta tcttttacgc agcggtattt 16380 ttcgatcagt tttttcaatt ccggtgatat tctcatttta gccatttatt atttccttcc 16440 tcttttctac agtatttaaa gataccccaa gaagctaatt ataacaagac gaactccaat 16500 tcactgttcc ttgcattcta aaaccttaaa taccagaaaa cagctttttc aaagttgttt 16560 tcaaagttgg cgtataacat agtatcgacg gagccgattt tgaaaccaca attatgggtg 16620 atgctgccaa cttactgatt tagtgtatga tggtgttttt gaggtgctcc agtggcttct 16680 gtgtctatca gctgtccctc 
ctgttcagct actgacgggg tggtgcgtaa cggcaaaagc 16740 accgccggac atcagcgcta tctctgctct cactgccgta aaacatggca actgcagttc 16800 acttacaccg cttctcaacc cggtacgcac cagaaaatca ttgatatggc catgaatggc 16860 gttggatgcc gggcaacagc ccgcattatg ggcgttggcc tcaacacgat tttacgtcac 16920 ttaaaaaact caggccgcag tcggtaacct cgcgcataca gccgggcagt gacgtcatcg 16980 tctgcgcgga aatggacgaa cagtggggct atgtcggggc taaatcgcgc cagcgctggc 17040 tgttttacgc gtatgacagt ctccggaaga cggttgttgc gcacgtattc ggtgaacgca 17100 ctatggcgac gctggggcgt cttatgagcc tgctgtcacc ctttgacgtg gtgatatgga 17160 tgacggatgg ctggccgctg tatgaatccc gcctgaaggg aaagctgcac gtaatcagca 17220 agcgatatac gcagcgaatt gagcggcata acctgaatct gaggcagcac ctggcacggc 17280 tgggacggaa gtcgctgtcg ttctcaaaat cggtggagct gcatgacaaa gtcatcgggc 17340 attatctgaa cataaaacac tatcaataag ttggagtcat tacccaatta tgatagaatt 17400 tacaagctat aaggttattg tcctgggttt caagcattag tccatgcaag tttttatgct 17460 ttgcccattc tatagatata ttgataagcg cgctgcctat gccttgcccc ctgaaatcct 17520 tacatacggc gatatcttct atataaaaga tatattatct tatcagtatt gtcaatatat 17580 tcaaggcaat ctgcctcctc atcctcttca tcctcttcgt cttggtagct ttttaaatat 17640 ggcgcttcat agagtaattc tgtaaaggtc caattctcgt tttcatacct cggtataatc 17700 ttacctatca cctcaaatgg ttcgctgggt ttatcgcacc cccgaacacg agcacggcac 17760 ccgcgaccac tatgccaaga atgcccaagg taaaaattgc cggccccgcc atgaagtccg 17820 tgaatgcccc gacggccgaa gtgaagggca ggccgccacc caggccgccg ccctcactgc 17880 ccggcacctg gtcgctgaat gtcgatgcca gcacctgcgg cacgtcaatg cttccgggcg 17940 tcgcgctcgg gctgatcgcc catcccgtta ctgccccgat cccggcaatg gcaaggactg 18000 ccagcgctgc catttttggg gtgaggccgt tcgcggccga ggggcgcagc ccctgggggg 18060 atgggaggcc cgcgttagcg ggccgggagg gttcgagaag ggggggcacc ccccttcggc 18120 gtgcgcggtc acgcgcacag ggcgcagccc tggttaaaaa caaggtttat aaatattggt 18180 ttaaaagcag gttaaaagac aggttagcgg tggccgaaaa acgggcggaa acccttgcaa 18240 atgctggatt ttctgcctgt ggacagcccc tcaaatgtca ataggtgcgc ccctcatctg 18300 tcagcactct gcccctcaag tgtcaaggat cgcgcccctc atctgtcagt agtcgcgccc 18360 
ctcaagtgtc aataccgcag ggcacttatc cccaggcttg tccacatcat ctgtgggaaa 18420 ctcgcgtaaa atcaggcgtt ttcgccgatt tgcgaggctg gccagctcca cgtcgccggc 18480 cgaaatcgag cctgcccctc atctgtcaac gccgcgccgg gtgagtcggc ccctcaagtg 18540 tcaacgtccg cccctcatct gtcagtgagg gccaagtttt ccgcgaggta tccacaacgc 18600 cggcggccgc ggtgtctcgc acacggcttc gacggcgttt ctggcgcgtt tgcagggcca 18660 tagacggccg ccagcccagc ggcgagggca accagcccgg 18700 <210> 19 <211> 304 <212> DNA <213> Artificial Sequence <220> <223> P_U6 <400> 19 agaaatctca aaattccggc agaacaattt tgaatctcga tccgtagaaa cgagacggtc 60 attgttttag ttccaccacg attatatttg aaatttacgt gagtgtgagt gagacttgca 120 taagaaaata aaatctttag ttgggaaaaa attcaataat ataaatgggc ttgagaagga 180 agcgagggat aggccttttt ctaaaatagg cccatttaag ctattaacaa tcttcaaaag 240 taccacagcg cttaggtaaa gaaagcagct gagtttatat atggttagag acgaagtagt 300 gatt 304 <210> 20 <211> 25 <212> DNA <213> Artificial Sequence <220> <223> F_C9 <400> 20 gaccatcctg gacttcctga agagc 25 <210> 21 <211> 27 <212> DNA <213> Artificial Sequence <220> <223> R_C9 <400> 21 tgcaggtagt acaggtacag cttctcg 27 <210> 22 <211> 19 <212> DNA <213> Artificial Sequence <220> <223> F_35S <400> 22 gctcctacaa atgccatca 19 <210> 23 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> R_35S <400> 23 gatagtggga ttgtgcgtca 20 <110> KT&G CORPORATION <120> A NOVEL NICOTINE BIOSYNTHETIC GENE AND USE THEREOF <130> P20190961KR(PN130282KR) <150> KR 10-2019-0103310 <151> 2019-08-22 <160> 23 <170> KoPatentIn 3.0 <210> 1 <211> 2211 <212> DNA <213> Nicotiana tabacum <220> <221> gene <222> (1)..(2211) <223> PMT1b <400> 1 atggaagtca tatctaccaa cacaaatggc tctaccatct tcaagaatgg taccattccc 60 atgaacggcc accaaaatgg ctcttccgaa cacctcaacg gctaccagaa tggcatttcc 120 aaacaccaaa acgggcacca gaatggcact tccgaacatc ggaacggcca ccagaatggg 180 acatccgaac aacagaacgg gacaatcagc catgacaatg gcaacgagct actgggaagc 240 tccaactcta ttaagcctgg ttggttttca gagtttagcg cattatggcc aggttagtat 300 taagaaagaa actcaaatgc atcgtactct tgtattttgg tttgtgtata atttataata 360 
tggataaatt atgacgaagt aaactcgcat acattaattt gaaataatct aattttgcag 420 gtgaagcatt ctcacttaag gtcgagaagt tactattcca ggggaaatct gattaccaag 480 atgtcatgct ctttgaggta attaatattc tgatacacat gctttaatct aaagtgatac 540 ttttaattta cttttagttt attgcatgtg cacgtacagt cagcaactta tgggaaggtt 600 ctgactttgg atggagcaat tcaacataca gagaatggtg gatttccata cactgaaatg 660 attgttcatc taccacttgg ttccatccca aacccaaaaa aggttttgat catcggcgga 720 ggaattggtt ttacattatt cgaaatgctt cgttatcctt caatcgaaaa aattgacatt 780 gttgagatcg atgacgtggt agttgatgta agtcaaactt cttttaccca cataaagaaa 840 atgatttaga ttgcaattct ttttattttt ctaaaagaat aaatatattc tctctttttt 900 ttttttaaaa caaaattctc tttcttacag gtatccagaa aatttttccc ttatctggca 960 gctaatttta acgatcctcg tgtaacccta gttctcggag atggtgcgta tatgatagtc 1020 tcgttttata ttttatttca cttgattttt accttttttt gtggttaatt aatcatctac 1080 cattggttct ctttaccttc aggagctgca tttgtaaagg ctgcacaagc gggatattat 1140 gatgctatta tagtggactc ttctgatccc attggtacgc tattactatt taataccaag 1200 actattctta ttatataagc tactaagaaa ctaattgaat aattaataaa cgtaactgta 1260 attgatttct aaaataatat atataatttc aggtccagca aaagatttgt ttgagaggcc 1320 attctttgag gcagtagcca aagcccttag gccaggagga gttgtatgca cacaggctga 1380 aagcatttgg cttcatatgc atattattaa gcaaatcatt gctaactgtc gtcaagtctt 1440 taagggttct gtcaactatg cttggacaac cgttccaaca tatcccacgt attctttttc 1500 tctctctctt cctgtctttt tcgatgcaat gtaaatttat aaaattggaa gtccgtttta 1560 cttttctata gacgtagatc ctaaaattgt caagaaatgg agaattgact tacaagaaaa 1620 atcaacttct tttcatttac tattcttttt ggtgacaaac tttacttatt atttcgttct 1680 aaaatgaaaa tttattttta tattttaaaa taatttagct ttaaactttt aattttactt 1740 gttatatttt taataaaaaa gatttattgt caaataaatg ttgtggccat acaataagtt 1800 tcaaattatg tcacaaaaat tgaaacagag tgagcaaatt tttatttcaa gtaaactttc 1860 aaggaattgt gtttaagttt tctcaactga tactgatcaa tgatgtctta aatattttga 1920 tttcagcggt gtgattggtt atatgctctg ctctactgaa gggccagaag ttaacttcaa 1980 gaatccagta aatccaattg acaaagagac aactcaagtc aagtccaaat taggacctct 2040 caagttctac aactctgatg 
taacttcata tctcacaatt tctttttccg ttttactgta 2100 tgttcttcat caaattttat aactaactct tttcatattg tctttttttt tcagattcac 2160 aaagcagcat tcattttgcc atctttcgcc cgaagtatga tcgagtctta a 2211 <210> 2 <211> 3129 <212> DNA <213> Nicotiana tabacum <220> <221> gene <222> (1)..(3129) <223> PMT1a <400> 2 gaattcaatg gagaaggaaa atatttccag tgtaaacaca agtgaatgaa gagaagccaa 60 aataatctct atcattcaag ccttaggtgg agattaaaaa aattatttac tttcttatca 120 aagtaatagg tgatcaacag ctttcgtaaa acgtcattag gagaatatta taatctcttt 180 tatgctgaag aacccacata aggaagatca taaaatacat gactttcaga tgacttcttg 240 gagctttatt tttaaagagt ggctagctgg tcagcaaaga ggtgctcgtc agatatcata 300 aaattttact attatttgtt ttaagaggga gatggggcac acatgcttgt gacaaaagta 360 agaggaagaa aggagacaga agaggaaata gatttggggg gggggggggg ggtttcacaa 420 tcaaagaaaa tttttaaaat ggagagagaa atgagcacac acatatacta acaaaatttt 480 actaataatt gcaccgagac aaacttatat tttagttcca aaatgtcagt ctaaccctgc 540 acgttgtaat gaatttttaa ctattatatt atatcgagtt gcgccctcca ctcctcggtg 600 tccaaattgt atttaaatgc atagatgttt attgggagtg tacagcaagc tttcggaaaa 660 tacaaaccat aatactttct cttcttcaat ttgtttagtt taattttgaa aatggaagtc 720 atatctacca acacaaatgg ctctaccatc ttcaagaatg gtgccattcc catgaacggc 780 caccaaaatg gcacttctga acacctcaac ggctaccaga atggcacttc caaacaccaa 840 aacgggcacc agaatggcac tttcgaacat cggaacggcc accagaatgg gacatccgaa 900 caacagaacg ggacaatcag ccatgacaat ggcaacgagc tactgggaag ctccgactct 960 attaagcctg gctggttttc agagtttagc gcattatggc caggttagta ctaagaaagc 1020 aactcaaatg catcggcctc ttgttgctac taaatataga gagctatcat acttttaggg 1080 actaactaaa aaggaaagat tatcacaggg acgaagtgag cagttaactt cgcatattat 1140 cagacgcatt aatttgaaat aatcgaattt tgcaggtgaa gcattctcac ttaaggttga 1200 gaagttacta ttccagggga agtctgatta ccaagatgtc atgctctttg aggtaattaa 1260 tattctaata cacatgcttt aatttaaagt gatactttta atttactttt agtttattgc 1320 atgtgcacgt acagtcagca acttatggga aggttctgac tttggatgga gcaattcaac 1380 atacagagaa tggtggattt ccatacactg aaatgattgt tcatctacca cttggttcca 1440 tcccaaaccc aaaaaaggtt 
ttgatcatcg gcggaggaat tggttttaca ttattcgaaa 1500 tgcttcgtta tccttcaatc gaaaaaattg acattgttga gatcgatgac gtggtagttg 1560 atgtaagtca aacttctttt acccacataa agaaaatgat ttagattgca attcttttta 1620 tttttctaaa agaataaata tattctcttt ttttttttta aaacaaaatt ctctttctta 1680 caggtatcca gaaaattttt cccttatctg gcagctaatt ttaacgatcc tcgtgtaacc 1740 ctagttctcg gagatggtgc gtatatgata gtctcgtttt atattttatt tcacttgatt 1800 tttacctttt tttgtggtta attaatcatc taccattggt tctctttacc ttcaggagct 1860 gcatttgtaa aggctgcaca agcgggatat tatgatgcta ttatagtgga ctcttctgat 1920 cccattggta cgctattact atttaatacc aagactattc ttattaaata agctactaag 1980 aaactaattg aataattaat aaacgtaact gtaattgatt tctaaaataa tatatataat 2040 ttcaggtcca gcaaaagatt tgtttgagag gccattcttt gaggcagtag ccaaagccct 2100 taggccagga ggagttgtat gcacacaggc tgaaagcatt tggcttcata tgcatattat 2160 taagcaaatc attgctaact gtcgtcaagt ctttaagggt tctgtcaact atgcttggac 2220 aaccgttcca acatatccca cgtattcttt ttctctctct ctcttcctgt ctttttcgat 2280 gcaatgtaaa tttataaaat tggaagtccg ttttactttt ctatagacgt agatcctaaa 2340 attgtcaaga aatggagaat tgacttacaa gaaaaatcaa cttcttttca tttactattc 2400 tttttggtga caaactttac ttattatttc gttctaaaat gaaaatttat ttttatattt 2460 taaaataatt tagctttaaa cttttaattt tacttgttat atttttaata aaaaagattt 2520 atagtcaaat aaatgttgtg accatataaa aacctccgca tttttaagat cataagtttc 2580 agagtcaaac gagttaattt atttttagta tgccggtgcg gagtcaaatt atgtcataaa 2640 aattgaaacg gagtgagaac atttttattt cgagtaaact ttcaaggtat tgtgtttaat 2700 ttcaagtgat actgatcaat gatgtcttaa atattttgat ttcagcggtg tgatcggtta 2760 tatgctctgc tctactgaag ggccagaagt tgacttcaag aatccagtaa atccaattga 2820 caaagagaca actcaagtca agtccaaatt aggacctctc aagttctaca actctgatgt 2880 aacttcatat ctcacaattt ctttttccgt tttactgtat gttcttcgtc aaattttata 2940 actaactctt ttcatattgt cttttttttc agattcacaa agcagcattc attttaccat 3000 ctttcgccag aagtatgatc gagtcttaat caagtgaata atgaacactg gtagtacaat 3060 cattggacca agatcgagtc ttaatcaagt gaataaataa gtgaaatgcg acgtattgta 3120 ggagaattc 3129 <210> 3 <211> 3091 
<212> DNA <213> Nicotiana tabacum <220> <221> gene <222> (1)..(3091) <223> PMT2 <400> 3 ctgagttgac aagaacaatt cctggtgaat cagatggatg aagataatag aggtgggtgg 60 aatctataac caaagcagct ggttgagtga ctgtgcgagt tgcagaaaca attgaagggt 120 catttgtgga atttggggcc atttcaaagg aaaaagaaaa gatgacttag cattaataaa 180 tcaaattaaa ataaggctta gcgttaaaat caaaggaaat ggcaagcctg gctcctggag 240 caatgcttct gaggacagta gtaaaaacaa tatcagacaa aaagtaaagt tgtattattt 300 agcttgagga taaagtatgt cattagtttt gtgagagatt tggtgtcctc tacaatgatt 360 gttgaagtcc ctatttatag ctatacacag gaaacaaaat cctaggatca agcccctctt 420 aaatgacaat aatggggtta atgatgaata tgtagcggca tgacatgaat gccaaaattc 480 tccgcaacga ctatttattt aatattgagg aatatttttt attaaatact atctggtgac 540 aagcattcgt ttgcttccgt tgattacgtt gattttggga tctactctat accaaccgaa 600 gccgttgtcc ttgatcttcg ctttcattta attcatcttc cgtctgcctc cgatttcaca 660 agtcatgcac ccattcaatt atttaatgga aaccaatttt accctataca aatggtacat 720 cattcgtcaa atactttact tggatataaa caattttgcc cgaggagtaa acagatgcga 780 agaaagaaag cagacgatta aagaaatttt taaaaaagga gagagaaatg aacacacaca 840 tgtactaata aaattagggt actactttac taataattgg acagagacta aattcatatt 900 ttagttccaa aatgtctcgg gcagtccaac catgcacgtt gtaatgattt tttaactcta 960 ttatatcgag ttgcgccctc cactcctcgg tgtccaaatt gtatataaat gcatatgtgt 1020 ctattgggag tgtacatcaa gctttcataa agtacaaatc gtaatacttg ttgaaacata 1080 atactttctc ttctccaatt tgtttagttt aattttgaaa atggaagtca tatctaccaa 1140 cacaaatggc tctaccatct tcaagagtgg tgccattccc atgaatggcc accataatgg 1200 cacttccaaa caccaaaacg gccacaagaa tgggacttcc gaacaacaga acgggacaat 1260 cagccttgat aatggcaacg agctactggg aaactccaat tgtattaagc ctggttggtt 1320 ttcagagttt agcgcattat ggccaggtta gtactgagaa agaaactcaa atgcatattt 1380 aaagttaaaa ttgttaggct aatataagga gttgatattc ttttagtgat taattaaaaa 1440 ggaaaaagta tcaaataaat tcaaaaaatg gatagtaact tcgcatatta ctctacacat 1500 taatttgaaa taaatcgaat tttgcaggtg aagcattctc acttaaggtt gagaagttac 1560 tgttccaggg gaagtctgac taccaagatg tcatgctctt tgaggtaaat aatattttaa 1620 tacacatgct 
tccatttaaa ttgatacttt taatttactt ttactttatt gcatgtgtac 1680 gtacagtcag caacttatgg gaaggttctg actttggatg gagcaattca acacacagag 1740 aatggtggat ttccatacac tgaaatgatt gttcatcttc cacttggttc catcccaaac 1800 ccaaaaaagg ttttgatcat cggcggagga attggtttta cattattcga aatgcttcgt 1860 tatcctacaa tcgaaaaaat tgacattgtt gagatcgatg acgtggtagt tgatgtaagt 1920 caaacttctt ttactcacat aaaaaaatgg tttagattgc ttcttgttat ttttctaaaa 1980 gaatactatt tttttaaaac aaaattttct tttttacagg tatctagaaa atttttccct 2040 tatctcgctg ctaattttaa cgatcctcgt gtaaccctag tccttggaga tggtgcgtat 2100 ttgataatct cgcttttgtt ttatctttta tttttattgc atttaatttt taccttttgg 2160 tgtgtggtta attcacctgc cattggttct ctttcatttc aggggctgca tttgtaaagg 2220 ctgcacaagc agaatattat gatgctatta tagtggactc ttctgatccc attggtactc 2280 tattacttct taataccaag actaatctta ttgaataagc tactaataaa cggtaattga 2340 tttctaaaac aatataattt caggtccagc aaaagatttg tttgagaggc cattctttga 2400 ggcagtagct aaagccctaa ggccaggagg agttgtatgc acacaggctg aaagcatttg 2460 gcttcatatg catattatta agcaaatcat tgctaactgt cgtcaagtct ttaagggctc 2520 tgtcaactat gcttggacta ctgttccaac atatccaacg tatttttctc tctctctctc 2580 ttcctataaa attggaagtt ttgattctat aattgtcaag aaatggagaa tcagttccaa 2640 gaaaaaccaa cttcttttct tttactcttc aaggtattgt gtttaatttt ttttcaactg 2700 atatgatcaa ttattttgat ttcagcggtg tgattggtta tatgctctgc tctactgaag 2760 gaccagaaat tgacttcaag aatccagtaa atccaattga caaagagaca gctcaagtca 2820 agtccaaatt agcacctctc aagttctaca actctgatgt aacttcatat ctcacaattt 2880 cttttttcct attgtacttt atgttcttcg tcaaatttta taattaactc ttttcaaatt 2940 gtcttttttt ttttcagatt cacaaagcag cattcatttt gccatctttc gccagaagta 3000 tgatcgagtc ttaatcaact gattaatgaa tactggtggt acaatcattg gaccaagatc 3060 aataagtgaa agacgtattg tatgagaatt c 3091 <210> 4 <211> 3269 <212> DNA <213> Nicotiana tabacum <220> <221> gene <222> (1)..(3269) <223> PMT3 <400> 4 gctgtacaaa aggatgtctc aaatcatttg gaatattaat tctgcaatca acaagaaata 60 ccccactatt aagacccatt atcactggca caaaaattat gagatcatta aacatcttaa 120 acctgtccct 
atttggaaga gtgtggtatg ggagatgcct cccagggagt acctaaagct 180 gaatactgat ggaagtttta acaaacaaat tgggaaagca gggattggag ggattctcag 240 agatgaagag ggaggctttg tcatggcttt ttcgatgcct ataatctata ataacatcag 300 tgaagcagaa ttgaaagcca tcaagtatgg gtgtgaatgg tgcaaataca aaggaatatc 360 aaacttcatt gtggaaactg actcgaggat gatctatgac atactacaga ccaaaaatct 420 aagcaacaac aagttgaaac aagagaccga gaaattaatg gagattctgg acacctgcag 480 gacacctgtt acccattgcc ttcgcgaagc aaatcaagtg gcagactggt ttgctaaaga 540 ggccaccaga gctaacgaag gtatcactca tacagatttt agacaggtat caaaagcggc 600 caagggccct ttcttcatgg atatgtggca ggtcccttat tttagaatta gatatgaaaa 660 atctaatttt tttttgtaag ttaattctgt gtatagtgag aggaaatcgt ctaatatgta 720 tttttgccca tagactcttc ctctccttag gtaaaaaggt agctccgagg taaggtttat 780 gttcccctca gtgtaacctt tttttgttta tataatagac atggtatggg tccagctaaa 840 cccccaacac cacaggggat agatacctgg gtgattggtt tattttttaa aaaaaaaaac 900 tttactaata attgcacgga gacaaaactt atattttagt tccaaaatga cagtccaacc 960 atgcacgttg taatgatttt ttaactctat tatatcgagt tccgccctcc actcctcggt 1020 gtccaaattg tatttaaatg catagatatg tttattggga gtgtacatca agctttcaga 1080 aaatacaaac cataatactt tctcttctcc aatttgctta gtttaatttg gaaaatggaa 1140 gtcatatcta ccaacacaaa tggctctact atcttcaaga atggtgccat tcccatgaac 1200 ggttaccaga atggcacttc caaacaccaa aacggccacc agaatggcac ttccgaacat 1260 cggaacggcc accagaatgg gatttccgaa caccaaaacg gccaccagaa tggcacttcc 1320 gagcatcaga acggccatca gaatgggaca atcagccatg acaacggcaa cgagctacag 1380 ctactgggaa gctccaactc tattaagcct ggttggtttt cagagtttag cgcattatgg 1440 ccaggttagt actaagaaag aaactcaaat gcatcgtact cttgtattct gctttgcgta 1500 taatttagat gatggtgttt gactaagcac tgagtttaaa aataaaaagt ttaaagttaa 1560 attgttacta tagagagcta tatctttagg aactaactaa aaaggaaaaa ttatcacata 1620 aaattgggat gaagtaagca gttaacttcg catattattc gacacattaa tttgaaataa 1680 atcgaatttt gcaggtgaag cattctcact taaggttgag aagttactat tccaggggaa 1740 gtctgattac caagatgtca tgctctttga ggtaattaat taatactaat agtcaagctc 1800 atgtatgatt atatttaaag tggtattttt 
cgtttatttt taatttattg cacgtgtacg 1860 tacagtcagc aacatatggg aaggttctga ctttggatgg agcaattcaa cacacagaga 1920 atggtggatt tccatacact gaaatgattg ttcatcttcc acttggttcc atcccaaacc 1980 ctaaaaaggt tttgatcatc ggcggaggaa ttggttttac attattcgaa atgcttcgtt 2040 atcctacaat cgaaaaaatt gacattgttg agatcgatga cgtggtagtt gatgtaagtc 2100 aaacttcttt tactcacata aaaaaatgat ttagattctt atttttctaa aagaattaaa 2160 acaaaatttt ccgttttaca ggtatctaga aaatttttcc cttatcttgc tgctaatttt 2220 agcgatcctc gtgtaaccct agtccttgga gatggtgcgt atttgataat ctcgttttta 2280 ttttatcttt tacttttatt ttatttaatt tttacctttt tgtgtgtggt taattcacct 2340 gccattggtt ctttttattt caggggctgc atttgtaaag gccgcacaag caggatatta 2400 tgatgctatt atagtggact cttctgatcc cattggtact ctattactac ttaataccaa 2460 gactattctt attaaataag ctactaataa acgtaactct gatagttttc taaaataata 2520 taatttcagg tccagcaaaa gacttgtttg agaggccatt ctttgaggca gtagccaaag 2580 ccctaaggcc aggaggagtt gtatgcacac aggctgaaag catttggctt catatgcata 2640 ttattaagca aatcattgct aactgtcgtc aagtctttaa gggctctgtc aactatgctt 2700 ggactactgt tccaacatat ccaacgtatt tttctctctc tcttcctata aaattggaag 2760 ttttgattct ataattgtca agaaatggag aatcagttcc aagaaaaacc aaattctttt 2820 cttttactct tcaaggtgtg tttaagtttt ttaaactgat actgatcaat tattttgatt 2880 tcagcggtgt gattggttat atgctctgtt ctactgaagg accagaagtt gacttcaaga 2940 atccagtaaa tccaattgac aaagagacaa ctcaagtcaa gtccaaatta gcacctctca 3000 agttctacaa ctctgatgta acttcatatc tcaatttctt ttttcttatt gtactttatg 3060 ttcttagtca aattttataa ttaactcttt tcaaattgtc tttttttttc agattcacaa 3120 agcagcattc attttgccat ctttcgccag aagtatgatc gagtcttaat caagtgacta 3180 atgaatactg gcggtacaat cattggacca agatcgagtc ttaatcaagt gaataaataa 3240 gtgaaatgcg acgtattgta taagaattc 3269 <210> 5 <211> 3001 <212> DNA <213> Nicotiana tabacum <220> <221> gene <222> (1)..(3001) <223> PMT4 <400> 5 gtcgacctct gattccacaa gtcatgcacc cattcaatta tttaatggaa accaatttta 60 ccctgtacaa atggtacaaa tactttcctt ggataaaaac aattttgcct aaggagtaaa 120 cagatgcgaa gtaagaaagc agacgactaa agaaaatttt 
aaaaaaggag agagaaatga 180 gcacacacac gtactaataa aattagggta ctactttact aataattgga cagagactaa 240 attcatattt tagttccaaa atgtctcggg cagtccaacc atgcacgttg taatgagttt 300 ttaactctat tatctcgagt tgcgccctcc actcctctgt gtccaagttg tatataaatg 360 catatatgtc tattgggagt gtacagcgag ctttcataaa gtacaaatca taatacttgt 420 tgaaacataa tactttctct tctccaattt gtttagttta attttgaaaa tggaagtcat 480 atctaccaac acaaatggct cgaccatctt caagaatggt gccattccca tgaatggcca 540 ccagagtggc acttccaaac acctcaacgg ctaccagaac ggcacttcca aacaccaaaa 600 cggccaccat aatggcactt ccgaacatcg gaacggccac cagaatggga tttccgaaca 660 ccaaaacggc caccagaatg ggacttccga acatcggaac ggccaccaga atgggatttc 720 cgaacaccaa aacggccacc agaatgggac ttccgaacac caaaacggcc accagaatgg 780 gacttccgaa caacagaacg ggacaatcag ccatgacaat ggcaacgagc tactgggaaa 840 ctccaactct attaagcttg gttggttttc agagtttagc gcattatggc caggttagta 900 ctgagaaaga aactcaaatt catatttaaa gttaaaattg ttaggctaat ataagaagtt 960 gattttcttt tagtgattaa ttaaaaaagg aaagagtatc aaataaattc caaaaaatga 1020 ccagtaactt cgcatattat tctacacatt aatttgaaat aaatcgaatt ttgcaggtga 1080 agcattctcc cttaaggttg agaagttact atttcagggg aagtctgact accaagatgt 1140 catgctcttt gaggtaaata atattctaat acacatgctt taatatgaat aaatactttt 1200 aatttacttt tagtttattg cacgtgtacg tacagtcagc aacatatggg aaggttttga 1260 ctttggatgg agcaattcaa cacacagaga atggtggatt tccatacact gaaatgattg 1320 ttcatcttcc acttggttcc atcccaaacc caaaaaaggt tttgatcatc ggcggaggaa 1380 ttggttttac attattcgaa atgcttcgtt atcctacaat cgaaaaaatt gacattgttg 1440 aaatcgatga cgtggtagtt gatgtaagtc aaatttcttt tactcacata aaaaaatgat 1500 ttagattgct tctttttatt tttctaaaag aataaatata ttctctctta gttttaaaca 1560 aaattctctt tcttacaggt atctagaaaa tctttccctt atctcgcagc taattttaat 1620 gatcctcgtg taaccctcgt tctcggagat ggtgcgtatt tataatctcg tttttgtttt 1680 atcttttatt tttatttcat ttaatttacc tttttgtgtg tggttaattt acccgtcatt 1740 ggttctcttt catttcaggg gctgcatttg taaaggctgc acaagcagga tattatgatg 1800 ctattatagt ggactcttct gatcccattg gtactctatt actacttaat accaagacta 
1860 atcttattga ataagctact aataaactgt aattgatttc taaaataata taatttcagg 1920 tccagcaaaa gatttgtttg agaggccatt ctttgaggca gtagccaaag ccctaaggcc 1980 aggaggagtt gtatgcacac aggccgaaag catttggctt catatgcata ttattaagca 2040 aatcattgct aactgtcgtc aagtctttaa gggctctgtc aactacgctt ggactactgt 2100 tccaacatat cccacgtatt ttctctctct ctctcttcat ctttgaaaat tgaaaatcct 2160 gactactttc cttcctttga ttcctcggtt aaaggggcgt agatcataag attttcaaga 2220 aatagataat gacgtccaag aaaaactaac ttcttttcat ttactattct ttttggtgac 2280 aaactttatt tattatttcg ttctaaagag aaaatttatt tttatatttt aaaataattt 2340 tgttttaaac ttttattttt acttattata tctttaataa aaaaattata gtcaaataaa 2400 tattatggcc acactaaaca tccaagtttt tgaaaccata agttttagag ccaaatgagt 2460 taatttgttt ttggtatgcg ggtgcggagt caaattatgt cacaaaaatt gtaatggagt 2520 gagcaaattt ttatttcgag taaactttca aggtattgtg ttaaagtttt ttcaactgat 2580 actaatcaat tatgtctcaa ccattttgat ttcagtggtg taattgggta tatgctctgc 2640 tctactgaag ggccagaagt tgacttcaag aatccaataa atccaattga caaagagaca 2700 actcaagtca agtccaaatt agcacctctc aagttttaca attctgatgt aacttcatat 2760 ctaacaattt ctttttctgt tttactgtat cttcattgtc aaaattttat aattaactct 2820 tctcaaatta tctttttttt tagattcaca aagcagcatt cattttgcca tctttcgcca 2880 gaagtatgat cgagtcttaa tcaagtgaat aatgaacact ggtggtgcaa tcattggacc 2940 aagatcgagt cttaatcaag tgaataaata agtgaaatgc cgacgtattg tatgagaatt 3000 c 3001 <210> 6 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> F_PMT1a <400> 6 gcatagatgt ttattgggag tgtacagcaa 30 <210> 7 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> R_PMT1a <400> 7 agttgctgac tgtacgtgca ca 22 <210> 8 <211> 29 <212> DNA <213> Artificial Sequence <220> <223> F_PMT2 <400> 8 catatgtgtc tattgggagt gtacatcaa 29 <210> 9 <211> 24 <212> DNA <213> Artificial Sequence <220> <223> R_PMT2 <400> 9 aagttgctga ctgtacgtac acat 24 <210> 10 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> F_PMT3 <400> 10 catagatatg tttattggga gtgtacatca a 31 <210> 11 <211> 32 <212> DNA <213> Artificial Sequence <220> <223> 
R_PMT3 <400> 11 ccactttaaa tataatcata catgagcttg ac 32 <210> 12 <211> 25 <212> DNA <213> Artificial Sequence <220> <223> F_PMT4 <400> 12 gcgagctttc ataaagtaca aatca 25 <210> 13 <211> 33 <212> DNA <213> Artificial Sequence <220> <223> R_PMT4 <400> 13 agtatttatt catattaaag catgtgtatt aga 33 <210> 14 <211> 25 <212> DNA <213> Artificial Sequence <220> <223> sgRNA_sg1 <400> 14 aagtcatatc taccaacaca aatgg 25 <210> 15 <211> 24 <212> DNA <213> Artificial Sequence <220> <223> sgRNA_sg2 <400> 15 ttttcagagt ttagcgcatt atgg 24 <210> 16 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> sgRNA_sg3 <400> 16 ctcaaagagc atgacatctt gg 22 <210> 17 <211> 17283 <212> DNA <213> Artificial Sequence <220> <223> V1k_GE <400> 17 tgagcgtcgc aaaggcgctc ggtcttgcct tgctcgtcgg tgatgtactt caccagctcc 60 gcgaagtcgc tcttcttgat ggagcgcatg gggacgtgct tggcaatcac gcgcaccccc 120 cggccgtttt agcggctaaa aaagtcatgg ctctgccctc gggcggacca cgcccatcat 180 gaccttgcca agctcgtcct gcttctcttc gatcttcgcc agcagggcga ggatcgtggc 240 atcaccgaac cgcgccgtgc gcgggtcgtc ggtgagccag agtttcagca ggccgcccag 300 gcggcccagg tcgccattga tgcgggccag ctcgcggacg tgctcatagt ccacgacgcc 360 cgtgattttg tagccctggc cgacggccag caggtaggcc gacaggctca tgccggccgc 420 cgccgccttt tcctcaatcg ctcttcgttc gtctggaagg cagtacacct tgataggtgg 480 gctgcccttc ctggttggct tggtttcatc agccatccgc ttgccctcat ctgttacgcc 540 ggcggtagcc ggccagcctc gcagagcagg attcccgttg agcaccgcca ggtgcgaata 600 agggacagtg aagaaggaac acccgctcgc gggtgggcct acttcaccta tcctgcccgg 660 ctgacgccgt tggatacacc aaggaaagtc tacacgaacc ctttggcaaa atcctgtata 720 tcgtgcgaaa aaggatggat ataccgaaaa aatcgctata atgaccccga agcagggtta 780 tgcagcggaa aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 840 gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 900 atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 960 gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1020 gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1080 ttaccgcctt 
tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1140 cagtgagcga ggaagcggaa gagcgccaga aggccgccag agaggccgag cgcggccgtg 1200 aggcttggac gctagggcag ggcatgaaaa agcccgtagc gggctgctac gggcgtctga 1260 cgcggtggaa agggggaggg gatgttgtct acatggctct gctgtagtga gtgggttgcg 1320 ctccggcagc ggtcctgatc aatcgtcacc ctttctcggt ccttcaacgt tcctgacaac 1380 gagcctcctt ttcgccaatc catcgacaat caccgcgagt ccctgctcga acgctgcgtc 1440 cggaccggct tcgtcgaagg cgtctatcgc ggcccgcaac agcggcgaga gcggagcctg 1500 ttcaacggtg ccgccgcgct cgccggcatc gctgtcgccg gcctgctcct caagcacggc 1560 cccaacagtg aagtagctga ttgtcatcag cgcattgacg gcgtccccgg ccgaaaaacc 1620 cgcctcgcag aggaagcgaa gctgcgcgtc ggccgtttcc atctgcggtg cgcccggtcg 1680 cgtgccggca tggatgcgcg cgccatcgcg gtaggcgagc agcgcctgcc tgaagctgcg 1740 ggcattcccg atcagaaatg agcgccagtc gtcgtcggct ctcggcaccg aatgcgtatg 1800 attctccgcc agcatggctt cggccagtgc gtcgagcagc gcccgcttgt tcctgaagtg 1860 ccagtaaagc gccggctgct gaacccccaa ccgttccgcc agtttgcgtg tcgtcagacc 1920 gtctacgccg acctcgttca acaggtccag ggcggcacgg atcactgtat tcggctgcaa 1980 ctttgtcatg cttgacactt tatcactgat aaacataata tgtccaccaa cttatcagtg 2040 ataaagaatc cgcgcgttca atcggaccag cggaggctgg tccggaggcc agacgtgaaa 2100 cccaacatac ccctgatcgt aattctgagc actgtcgcgc tcgacgctgt cggcatcggc 2160 ctgattatgc cggtgctgcc gggcctcctg cgcgatctgg ttcactcgaa cgacgtcacc 2220 gcccactatg gcattctgct ggcgctgtat gcgttggtgc aatttgcctg cgcacctgtg 2280 ctgggcgcgc tgtcggatcg tttcgggcgg cggccaatct tgctcgtctc gctggccggc 2340 gccagatctg gggaaccctg tggttggcat gcacatacaa atggacgaac ggataaacct 2400 tttcacgccc ttttaaatat ccgattattc taataaacgc tcttttctct taggtttacc 2460 cgccaatata tcctgtcaaa cactgatagt ttaaactgaa ggcgggaaac gacaatctga 2520 tcatgagcgg agaattaagg gagtcacgtt atgacccccg ccgatgacgc gggacaagcc 2580 gttttacgtt tggaactgac agaaccgcaa cgttgaagga gccactcagc cgcgggtttc 2640 tggagtttaa tgagctaagc acatacgtca gaaaccatta ttgcgcgttc aaaagtcgcc 2700 taaggtcact atcagctagc aaatatttct tgtcaaaaat gctccactga cgttccataa 2760 attcccctcg gtatccaatt 
agagtctcat attcactctc aatccaaata atctgcaccg 2820 gatctggatc gtttcgcatg attgaacaag atggattgca cgcaggttct ccggccgctt 2880 gggtggagag gctattcggc tatgactggg cacaacagac aatcggctgc tctgatgccg 2940 ccgtgttccg gctgtcagcg caggggcgcc cggttctttt tgtcaagacc gacctgtccg 3000 gtgccctgaa tgaactgcag gacgaggcag cgcggctatc gtggctggcc acgacgggcg 3060 ttccttgcgc agctgtgctc gacgttgtca ctgaagcggg aagggactgg ctgctattgg 3120 gcgaagtgcc ggggcaggat ctcctgtcat ctcaccttgc tcctgccgag aaagtatcca 3180 tcatggctga tgcaatgcgg cggctgcata cgcttgatcc ggctacctgc ccattcgacc 3240 accaagcgaa acatcgcatc gagcgagcac gtactcggat ggaagccggt cttgtcgatc 3300 aggatgatct ggacgaagag catcaggggc tcgcgccagc cgaactgttc gccaggctca 3360 aggcgcgcat gcccgacggc gatgatctcg tcgtgaccca tggcgatgcc tgcttgccga 3420 atatcatggt ggaaaatggc cgcttttctg gattcatcga ctgtggccgg ctgggtgtgg 3480 cggaccgcta tcaggacata gcgttggcta cccgtgatat tgctgaagag cttggcggcg 3540 aatgggctga ccgcttcctc gtgctttacg gtatcgccgc tcccgattcg cagcgcatcg 3600 ccttctatcg ccttcttgac gagttcttct gagcgggact ctggggttcg aaatgaccga 3660 ccaagcgacg cccaacctgc catcacgaga tttcgattcc accgccgcct tctatgaaag 3720 gttgggcttc ggaatcgttt tccgggacgc cggctggatg atcctccagc gcggggatct 3780 catgctggag ttcttcgccc acgggatctc tgcggaacag gcggtcgaag gtgccgatat 3840 cattacgaca gcaacggccg acaagcacaa cgccacgatc ctgagcgaca atatgatcgg 3900 gcccggcgtc cacatcaacg gcgtcggcgg cgactgccca ggcaagaccg agatgcaccg 3960 cgatatcttg ctgcgttcgg atattttcgt ggagttcccg ccacagaccc ggatgatccc 4020 cgatcgttca aacatttggc aataaagttt cttaagattg aatcctgttg ccggtcttgc 4080 gatgattatc atataatttc tgttgaatta cgttaagcat gtaataatta acatgtaatg 4140 catgacgtta tttatgagat gggtttttat gattagagtc ccgcaattat acatttaata 4200 cgcgatagaa aacaaaatat agcgcgcaaa ctaggataaa ttatcgcgcg cggtgtcatc 4260 tatgttacta gatcgggcct cctgtcaatg ctggcggcgg ctctggtggt ggttctggtg 4320 gcggctctga gggtggtggc tctgagggtg gcggttctga gggtggcggc tctgagggag 4380 gcggttccgg tggtggctct ggttccggtg attttgatta tgaaaagatg gcaaacgcta 4440 ataagggggc tatgaccgaa aatgccgatg 
aaaacgcgct acagtctgac gctaaaggca 4500 aacttgattc tgtcgctact gattacggtg ctgctatcga tggtttcatt ggtgacgttt 4560 ccggccttgc taatggtaat ggtgctactg gtgattttgc tggctctaat tcccaaatgg 4620 ctcaagtcgg tgacggtgat aattcacctt taatgaataa tttccgtcaa tatttacctt 4680 ccctccctca atcggttgaa tgtcgccctt ttgtctttgg cccaatacgc aaaccgcctc 4740 tccccgcgcg ttggccgatt cattaatgca gctggcacga caggtttccc gactggaaag 4800 cgggcagtga gcgcaacgca attaatgtga gttagctcac tcattaggca ccccaggctt 4860 tacactttat gcttccggct cgtatgttgt gtggaattgt gagcggataa caatttcaca 4920 caggaaacag ctatgaccat gattacgcca agcttcacgt gaacatggtg gagcacgaca 4980 cacttgtcta ctccaaaaat atcaaagata cagtctcaga agaccaaagg gcaattgaga 5040 cttttcaaca aagggtaata tccggaaacc tcctcggatt ccattgccca gctatctgtc 5100 actttattgt gaagatagtg gaaaaggaag gtggctccta caaatgccat cattgcgata 5160 aaggaaaggc catcgttgaa gatgcctctg ccgacagtgg tcccaaagat ggacccccac 5220 ccacgaggag catcgtggaa aaagaagacg ttccaaccac gtcttcaaag caagtggatt 5280 gatgtgataa catggtggag cacgacacac ttgtctactc caaaaatatc aaagatacag 5340 tctcagaaga ccaaagggca attgagactt ttcaacaaag ggtaatatcc ggaaacctcc 5400 tcggattcca ttgcccagct atctgtcact ttattgtgaa gatagtggaa aaggaaggtg 5460 gctcctacaa atgccatcat tgcgataaag gaaaggccat cgttgaagat gcctctgccg 5520 acagtggtcc caaagatgga cccccaccca cgaggagcat cgtggaaaaa gaagacgttc 5580 caaccacgtc ttcaaagcaa gtggattgat gtgatatctc cactgacgta agggatgacg 5640 cacaatccca ctatccttcg caagaccctt cctctatata aggaagttca tttcatttgg 5700 agaggacctc gaaattctca acacaacata tacaaaacaa acgaatctca agcaatcaag 5760 cattctactt ctattgcagc aatttaaatc atttctttta aagcaaaagc aattttctga 5820 aaattttcac catttacgaa cgatagggat ccgagaccca agcttgccac catgccgaag 5880 aagaagcgca aggtggacaa gaagtacagc atcggcctgg acatcggtac caacagcgtg 5940 ggctgggccg tgatcaccga cgagtacaag gtgcccagca agaagttcaa ggtgctgggc 6000 aacaccgacc gccacagcat caagaagaac ctgatcggcg ccctgctgtt cgacagcggc 6060 gagaccgccg aggccacccg cctgaagcgc accgcccgcc gccgctacac ccgccgcaag 6120 aaccgcatct gctacctgca ggagatcttc agcaacgaga 
tggccaaggt ggacgacagc 6180 ttcttccacc gcctggagga gagcttcctg gtggaggagg acaagaagca cgagcgccac 6240 cccatcttcg gcaacatcgt ggacgaggtg gcctaccacg agaagtaccc caccatctac 6300 cacctgcgca agaagctggt ggacagcacc gacaaggccg acctgcgcct gatctacctg 6360 gccctggccc acatgatcaa gttccgcggc cacttcctga tcgagggcga cctgaacccc 6420 gacaacagcg acgtggacaa gctgttcatc cagctggtgc agacctacaa ccagctgttc 6480 gaggagaacc ccatcaacgc cagcggcgtg gacgccaagg ccatcctgag cgcccgcctg 6540 agcaagagcc gccgcctgga gaacctgatc gcccagctgc ccggcgagaa gaagaacggc 6600 ctgttcggca acctgatcgc cctgagcctg ggcctgaccc ccaacttcaa gagcaacttc 6660 gacctggccg aggacgccaa gctgcagctg agcaaggaca cctacgacga cgacctggac 6720 aacctgctgg cccagatcgg cgaccagtac gccgacctgt tcctggccgc caagaacctg 6780 agcgacgcca tcctgctgag cgacatcctg cgcgtgaaca ccgagatcac caaggccccc 6840 ctgagcgcca gcatgatcaa gcgctacgac gagcaccacc aggacctgac cctgctgaag 6900 gccctggtgc gccagcagct gcccgagaag tacaaggaga tcttcttcga ccagagcaag 6960 aacggctacg ccggctacat cgacggcggc gccagccagg aggagttcta caagttcatc 7020 aagcccatcc tggagaagat ggacggcacc gaggagctgc tggtgaagct gaaccgcgag 7080 gacctgctgc gcaagcagcg caccttcgac aacggcagca tcccccacca gatccacctg 7140 ggcgagctgc acgccatcct gcgccgccag gaggacttct accccttcct gaaggacaac 7200 cgcgagaaga tcgagaagat cctgaccttc cgcatcccct actacgtggg ccccctggcc 7260 cgcggcaaca gccgcttcgc ctggatgacc cgcaagagcg aggagaccat caccccctgg 7320 aacttcgagg aggtggtgga caagggcgcc agcgcccaga gcttcatcga gcgcatgacc 7380 aacttcgaca agaacctgcc caacgagaag gtgctgccca agcacagcct gctgtacgag 7440 tacttcaccg tgtacaacga gctgaccaag gtgaagtacg tgaccgaggg catgcgcaag 7500 cccgccttcc tgagcggcga gcagaagaag gccatcgtgg acctgctgtt caagaccaac 7560 cgcaaggtga ccgtgaagca gctgaaggag gactacttca agaagatcga gtgcttcgac 7620 agcgtggaga tcagcggcgt ggaggaccgc ttcaacgcca gcctgggcac ctaccacgac 7680 ctgctgaaga tcatcaagga caaggacttc ctggacaacg aggagaacga ggacatcctg 7740 gaggacatcg tgctgaccct gaccctgttc gaggaccgcg agatgatcga ggagcgcctg 7800 aagacctacg cccacctgtt cgacgacaag gtgatgaagc agctgaagcg 
ccgccgctac 7860 accggctggg gccgcctgag ccgcaagctt atcaacggca tccgcgacaa gcagagcggc 7920 aagaccatcc tggacttcct gaagagcgac ggcttcgcca accgcaactt catgcagctg 7980 atccacgacg acagcctgac cttcaaggag gacatccaga aggcccaggt gagcggccag 8040 ggcgacagcc tgcacgagca catcgccaac ctggccggca gccccgccat caagaagggc 8100 atcctgcaga ccgtgaaggt ggtggacgag ctggtgaagg tgatgggccg ccacaagccc 8160 gagaacatcg tgatcgagat ggcccgcgag aaccagacca cccagaaggg ccagaagaac 8220 agccgcgagc gcatgaagcg catcgaggag ggcatcaagg agctgggcag ccagatcctg 8280 aaggagcacc ccgtggagaa cacccagctg cagaacgaga agctgtacct gtactacctg 8340 cagaacggcc gcgacatgta cgtggaccag gagctggaca tcaaccgcct gagcgactac 8400 gacgtggacc acatcgtgcc ccagagcttc ctgaaggacg acagcatcga caacaaggtg 8460 ctgacccgca gcgacaagaa ccgcggcaag agcgacaacg tgcccagcga ggaggtggtg 8520 aagaagatga agaactactg gcgccagctg ctgaacgcca agctgatcac ccagcgcaag 8580 ttcgacaacc tgaccaaggc cgagcgcggc ggcctgagcg agctggacaa ggccggcttc 8640 atcaagcgcc agctggtgga gacccgccag atcaccaagc acgtggccca gatcctggac 8700 agccgcatga acaccaagta cgacgagaac gacaagctga tccgcgaggt gaaggtgatc 8760 accctgaaga gcaagctggt gagcgacttc cgcaaggact tccagttcta caaggtgcgc 8820 gagatcaaca actaccacca cgcccacgac gcctacctga acgccgtggt gggcaccgcc 8880 ctgatcaaga agtaccccaa gctggagagc gagttcgtgt acggcgacta caaggtgtac 8940 gacgtgcgca agatgatcgc caagagcgag caggagatcg gcaaggccac cgccaagtac 9000 ttcttctaca gcaacatcat gaacttcttc aagaccgaga tcaccctggc caacggcgag 9060 atccgcaagc gccccctgat cgagaccaac ggcgagaccg gcgagatcgt gtgggacaag 9120 ggccgcgact tcgccaccgt gcgcaaggtg ctgagcatgc cccaggtgaa catcgtgaag 9180 aagaccgagg tgcagaccgg cggcttcagc aaggagagca tcctgcccaa gcgcaacagc 9240 gacaagctga tcgcccgcaa gaaggactgg gaccccaaga agtacggcgg cttcgacagc 9300 cccaccgtgg cctacagcgt gctggtggtg gccaaggtgg agaagggcaa gagcaagaag 9360 ctgaagagcg tgaaggagct gctgggcatc accatcatgg agcgcagcag cttcgagaag 9420 aaccccatcg acttcctgga ggccaagggc tacaaggagg tgaagaagga cctgatcatc 9480 aagctgccca agtacagcct gttcgagctg gagaacggcc gcaagcgcat gctggccagc 
9540 gccggcgagc tgcagaaggg caacgagctg gccctgccca gcaagtacgt gaacttcctg 9600 tacctggcca gccactacga gaagctgaag ggcagccccg aggacaacga gcagaagcag 9660 ctgttcgtgg agcagcacaa gcactacctg gacgagatca tcgagcagat cagcgagttc 9720 agcaagcgcg tgatcctggc cgacgccaac ctggacaagg tgctgagcgc ctacaacaag 9780 caccgcgaca agcccatccg cgagcaggcc gagaacatca tccacctgtt caccctgacc 9840 aacctgggcg cccccgccgc cttcaagtac ttcgacacca ccatcgaccg caagcgctac 9900 accagcacca aggaggtgct ggacgccacc ctgatccacc agagcatcac cggtctgtac 9960 gagacccgca tcgacctgag ccagctgggc ggcgacggcg gctccggaaa gagaccagct 10020 gctaccaaga aggctggaca ggctaagaag aagaagtaag agctcgtccg caaaaatcac 10080 cagtctctct ctacaaatct atctctctct atttttctcc agaataatgt gtgagtagtt 10140 cccagataag ggaattaggg ttcttatagg gtttcgctca tgtgttgagc atataagaaa 10200 cccttagtat gtatttgtat ttgtaaaata cttctatcaa taaaatttct aattcctaaa 10260 accaaaatcc agtgactata attagccttc ctcataagtt gcactacttc agcgtcccaa 10320 ttgcaccctt accacgaaga caggtttgtc caatcccata ttgcgacctt ggcagggggt 10380 tcgcaagtcc cacccgaaac gttgctgaag gctcaggttt ctgagcgaca aatgctttaa 10440 acgcgagttc ccgctcataa cctggaccga atgcgggtcg accccgggac tagtctcgag 10500 cacgtggaat tcactggccg tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac 10560 ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata gcgaagaggc 10620 ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgcccgct cctttcgctt 10680 tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc 10740 tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgatttgg 10800 gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg 10860 agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct 10920 cgggctattc ttttgattta taagggattt tgccgatttc ggaaccacca tcaaacagga 10980 ttttcgcctg ctggggcaaa ccagcgtgga ccgcttgctg caactctctc agggccaggc 11040 ggtgaagggc aatcagctgt tgcccgtctc actggtgaaa agaaaaacca ccccagtaca 11100 ttaaaaacgt ccgcaatgtg ttattaagtt gtctaagcgt caatttgttt acaccacaat 11160 atatcctgcc accagccagc caacagctcc ccgaccggca gctcggcaca 
aaatcaccac 11220 tcgatacagg cagcccatca gtccgggacg gcgtcagcgg gagagccgtt gtaaggcggc 11280 agactttgct catgttaccg atgctattcg gaagaacggc aactaagctg ccgggtttga 11340 aacacggatg atctcgcgga gggtagcatg ttgattgtaa cgatgacaga gcgttgctgc 11400 ctgtgatcaa atatcatctc cctcgcagag atccgaatta tcagccttct tattcatttc 11460 tcgcttaacc gtgacaggct gtcgatcttg agaactatgc cgacataata ggaaatcgct 11520 ggataaagcc gctgaggaag ctgagtggcg ctatttcttt agaagtgaac gttgacgata 11580 tcaactcccc tatccattgc tcaccgaatg gtacaggtcg gggacccgaa gttccgactg 11640 tcggcctgat gcatccccgg ctgatcgacc ccagatctgg ggctgagaaa gcccagtaag 11700 gaaacaactg taggttcgag tcgcgagatc ccccggaacc aaaggaagta ggttaaaccc 11760 gctccgatca ggccgagcca cgccaggccg agaacattgg ttcctgtagg catcgggatt 11820 ggcggatcaa acactaaagc tactggaacg agcagaagtc ctccggccgc cagttgccag 11880 gcggtaaagg tgagcagagg cacgggaggt tgccacttgc gggtcagcac ggttccgaac 11940 gccatggaaa ccgcccccgc caggcccgct gcgacgccga caggatctag cgctgcgttt 12000 ggtgtcaaca ccaacagcgc cacgcccgca gttccgcaaa tagcccccag gaccgccatc 12060 aatcgtatcg ggctacctag cagagcggca gagatgaaca cgaccatcag cggctgcaca 12120 gcgcctaccg tcgccgcgac cccgcccggc aggcggtaga ccgaaataaa caacaagctc 12180 cagaatagcg aaatattaag tgcgccgagg atgaagatgc gcatccacca gattcccgtt 12240 ggaatctgtc ggacgatcat cacgagcaat aaacccgccg gcaacgcccg cagcagcata 12300 ccggcgaccc ctcggcctcg ctgttcgggc tccacgaaaa cgccggacag atgcgccttg 12360 tgagcgtcct tggggccgtc ctcctgtttg aagaccgaca gcccaatgat ctcgccgtcg 12420 atgtaggcgc cgaatgccac ggcatctcgc aaccgttcag cgaacgcctc catgggcttt 12480 ttctcctcgt gctcgtaaac ggacccgaac atctctggag ctttcttcag ggccgacaat 12540 cggatctcgc ggaaatcctg cacgtcggcc gctccaagcc gtcgaatctg agccttaatc 12600 acaattgtca attttaatcc tctgtttatc ggcagttcgt agagcgcgcc gtgcgtcccg 12660 agcgatactg agcgaagcaa gtgcgtcgag cagtgcccgc ttgttcctga aatgccagta 12720 aagcgctggc tgctgaaccc ccagccggaa ctgaccccac aaggccctag cgtttgcaat 12780 gcaccaggtc atcattgacc caggcgtgtt ccaccaggcc gctgcctcgc aactcttcgc 12840 aggcttcgcc gacctgctcg cgccacttct 
tcacgcgggt ggaatccgat ccgcacatga 12900 ggcggaaggt ttccagcttg agcgggtacg gctcccggtg cgagctgaaa tagtcgaaca 12960 tccgtcgggc cgtcggcgac agcttgcggt acttctccca tatgaatttc gtgtagtggt 13020 cgccagcaaa cagcacgacg atttcctcgt cgatcaggac ctggcaacgg gacgttttct 13080 tgccacggtc caggacgcgg aagcggtgca gcagcgacac cgattccagg tgcccaacgc 13140 ggtcggacgt gaagcccatc gccgtcgcct gtaggcgcga caggcattcc tcggccttcg 13200 tgtaataccg gccattgatc gaccagccca ggtcctggca aagctcgtag aacgtgaagg 13260 tgatcggctc gccgataggg gtgcgcttcg cgtactccaa cacctgctgc cacaccagtt 13320 cgtcatcgtc ggcccgcagc tcgacgccgg tgtaggtgat cttcacgtcc ttgttgacgt 13380 ggaaaatgac cttgttttgc agcgcctcgc gcgggatttt cttgttgcgc gtggtgaaca 13440 gggcagagcg ggccgtgtcg tttggcatcg ctcgcatcgt gtccggccac ggcgcaatat 13500 cgaacaagga aagctgcatt tccttgatct gctgcttcgt gtgtttcagc aacgcggcct 13560 gcttggcctc gctgacctgt tttgccaggt cctcgccggc ggtttttcgc ttcttggtcg 13620 tcatagttcc tcgcgtgtcg atggtcatcg acttcgccaa acctgccgcc tcctgttcga 13680 gacgacgcga acgctccacg gcggccgatg gcgcgggcag ggcaggggga gccagttgca 13740 cgctgtcgcg ctcgatcttg gccgtagctt gctggaccat cgagccgacg gactggaagg 13800 tttcgcgggg cgcacgcatg acggtgcggc ttgcgatggt ttcggcatcc tcggcggaaa 13860 accccgcgtc gatcagttct tgcctgtatg ccttccggtc aaacgtccga ttcattcacc 13920 ctccttgcgg gattgccccg actcacgccg gggcaatgtg cccttattcc tgatttgacc 13980 cgcctggtgc cttggtgtcc agataatcca ccttatcggc aatgaagtcg gtcccgtaga 14040 ccgtctggcc gtccttctcg tacttggtat tccgaatctt gccctgcacg aataccagcg 14100 accccttgcc caaatacttg ccgtgggcct cggcctgaga gccaaaacac ttgatgcgga 14160 agaagtcggt gcgctcctgc ttgtcgccgg catcgttgcg ccacatctag gtactaaaac 14220 aattcatcca gtaaaatata atattttatt ttctcccaat caggcttgat ccccagtaag 14280 tcaaaaaata gctcgacata ctgttcttcc ccgatatcct ccctgatcga ccggacgcag 14340 aaggcaatgt cataccactt gtccgccctg ccgcttctcc caagatcaat aaagccactt 14400 actttgccat ctttcacaaa gatgttgctg tctcccaggt cgccgtggga aaagacaagt 14460 tcctcttcgg gcttttccgt ctttaaaaaa tcatacagct cgcgcggatc tttaaatgga 14520 gtgtcttctt
cccagttttc gcaatccaca tcggccagat cgttattcag taagtaatcc 14580 aattcggcta agcggctgtc taagctattc gtatagggac aatccgatat gtcgatggag 14640 tgaaagagcc tgatgcactc cgcatacagc tcgataatct tttcagggct ttgttcatct 14700 tcatactctt ccgagcaaag gacgccatcg gcctcactca tgagcagatt gctccagcca 14760 tcatgccgtt caaagtgcag gacctttgga acaggcagct ttccttccag ccatagcatc 14820 atgtcctttt cccgttccac atcataggtg gtccctttat accggctgtc cgtcattttt 14880 aaatataggt tttcattttc tcccaccagc ttatatacct tagcaggaga cattccttcc 14940 gtatctttta cgcagcggta tttttcgatc agttttttca attccggtga tattctcatt 15000 ttagccattt attatttcct tcctcttttc tacagtattt aaagataccc caagaagcta 15060 attataacaa gacgaactcc aattcactgt tccttgcatt ctaaaacctt aaataccaga 15120 aaacagcttt ttcaaagttg ttttcaaagt tggcgtataa catagtatcg acggagccga 15180 ttttgaaacc acaattatgg gtgatgctgc caacttactg atttagtgta tgatggtgtt 15240 tttgaggtgc tccagtggct tctgtgtcta tcagctgtcc ctcctgttca gctactgacg 15300 gggtggtgcg taacggcaaa agcaccgccg gacatcagcg ctatctctgc tctcactgcc 15360 gtaaaacatg gcaactgcag ttcacttaca ccgcttctca acccggtacg caccagaaaa 15420 tcattgatat ggccatgaat ggcgttggat gccgggcaac agcccgcatt atgggcgttg 15480 gcctcaacac gattttacgt cacttaaaaa actcaggccg cagtcggtaa cctcgcgcat 15540 acagccgggc agtgacgtca tcgtctgcgc ggaaatggac gaacagtggg gctatgtcgg 15600 ggctaaatcg cgccagcgct ggctgtttta cgcgtatgac agtctccgga agacggttgt 15660 tgcgcacgta ttcggtgaac gcactatggc gacgctgggg cgtcttatga gcctgctgtc 15720 accctttgac gtggtgatat ggatgacgga tggctggccg ctgtatgaat cccgcctgaa 15780 gggaaagctg cacgtaatca gcaagcgata tacgcagcga attgagcggc ataacctgaa 15840 tctgaggcag cacctggcac ggctgggacg gaagtcgctg tcgttctcaa aatcggtgga 15900 gctgcatgac aaagtcatcg ggcattatct gaacataaaa cactatcaat aagttggagt 15960 cattacccaa ttatgataga atttacaagc tataaggtta ttgtcctggg tttcaagcat 16020 tagtccatgc aagtttttat gctttgccca ttctatagat atattgataa gcgcgctgcc 16080 tatgccttgc cccctgaaat ccttacatac ggcgatatct tctatataaa agatatatta 16140 tcttatcagt attgtcaata tattcaaggc aatctgcctc ctcatcctct tcatcctctt 
16200 cgtcttggta gctttttaaa tatggcgctt catagagtaa ttctgtaaag gtccaattct 16260 cgttttcata cctcggtata atcttaccta tcacctcaaa tggttcgctg ggtttatcgc 16320 acccccgaac acgagcacgg cacccgcgac cactatgcca agaatgccca aggtaaaaat 16380 tgccggcccc gccatgaagt ccgtgaatgc cccgacggcc gaagtgaagg gcaggccgcc 16440 acccaggccg ccgccctcac tgcccggcac ctggtcgctg aatgtcgatg ccagcacctg 16500 cggcacgtca atgcttccgg gcgtcgcgct cgggctgatc gcccatcccg ttactgcccc 16560 gatcccggca atggcaagga ctgccagcgc tgccattttt ggggtgaggc cgttcgcggc 16620 cgaggggcgc agcccctggg gggatgggag gcccgcgtta gcgggccggg agggttcgag 16680 aagggggggc accccccttc ggcgtgcgcg gtcacgcgca cagggcgcag ccctggttaa 16740 aaacaaggtt tataaatatt ggtttaaaag caggttaaaa gacaggttag cggtggccga 16800 aaaacgggcg gaaacccttg caaatgctgg attttctgcc tgtggacagc ccctcaaatg 16860 tcaataggtg cgcccctcat ctgtcagcac tctgcccctc aagtgtcaag gatcgcgccc 16920 ctcatctgtc agtagtcgcg cccctcaagt gtcaataccg cagggcactt atccccaggc 16980 ttgtccacat catctgtggg aaactcgcgt aaaatcaggc gttttcgccg atttgcgagg 17040 ctggccagct ccacgtcgcc ggccgaaatc gagcctgccc ctcatctgtc aacgccgcgc 17100 cgggtgagtc ggcccctcaa gtgtcaacgt ccgcccctca tctgtcagtg agggccaagt 17160 tttccgcgag gtatccacaa cgccggcggc cgcggtgtct cgcacacggc ttcgacggcg 17220 tttctggcgc gtttgcaggg ccatagacgg ccgccagccc agcggcgagg gcaaccagcc 17280 cgg 17283 <210> 18 <211> 18700 <212> DNA <213> Artificial Sequence <220> <223> V1k_GE_PMT <400> 18 tgagcgtcgc aaaggcgctc ggtcttgcct tgctcgtcgg tgatgtactt caccagctcc 60 gcgaagtcgc tcttcttgat ggagcgcatg gggacgtgct tggcaatcac gcgcaccccc 120 cggccgtttt agcggctaaa aaagtcatgg ctctgccctc gggcggacca cgcccatcat 180 gaccttgcca agctcgtcct gcttctcttc gatcttcgcc agcagggcga ggatcgtggc 240 atcaccgaac cgcgccgtgc gcgggtcgtc ggtgagccag agtttcagca ggccgcccag 300 gcggcccagg tcgccattga tgcgggccag ctcgcggacg tgctcatagt ccacgacgcc 360 cgtgattttg tagccctggc cgacggccag caggtaggcc gacaggctca tgccggccgc 420 cgccgccttt tcctcaatcg ctcttcgttc gtctggaagg cagtacacct tgataggtgg 480 gctgcccttc ctggttggct tggtttcatc 
agccatccgc ttgccctcat ctgttacgcc 540 ggcggtagcc ggccagcctc gcagagcagg attcccgttg agcaccgcca ggtgcgaata 600 agggacagtg aagaaggaac acccgctcgc gggtgggcct acttcaccta tcctgcccgg 660 ctgacgccgt tggatacacc aaggaaagtc tacacgaacc ctttggcaaa atcctgtata 720 tcgtgcgaaa aaggatggat ataccgaaaa aatcgctata atgaccccga agcagggtta 780 tgcagcggaa aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 840 gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 900 atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 960 gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1020 gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1080 ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1140 cagtgagcga ggaagcggaa gagcgccaga aggccgccag agaggccgag cgcggccgtg 1200 aggcttggac gctagggcag ggcatgaaaa agcccgtagc gggctgctac gggcgtctga 1260 cgcggtggaa agggggaggg gatgttgtct acatggctct gctgtagtga gtgggttgcg 1320 ctccggcagc ggtcctgatc aatcgtcacc ctttctcggt ccttcaacgt tcctgacaac 1380 gagcctcctt ttcgccaatc catcgacaat caccgcgagt ccctgctcga acgctgcgtc 1440 cggaccggct tcgtcgaagg cgtctatcgc ggcccgcaac agcggcgaga gcggagcctg 1500 ttcaacggtg ccgccgcgct cgccggcatc gctgtcgccg gcctgctcct caagcacggc 1560 cccaacagtg aagtagctga ttgtcatcag cgcattgacg gcgtccccgg ccgaaaaacc 1620 cgcctcgcag aggaagcgaa gctgcgcgtc ggccgtttcc atctgcggtg cgcccggtcg 1680 cgtgccggca tggatgcgcg cgccatcgcg gtaggcgagc agcgcctgcc tgaagctgcg 1740 ggcattcccg atcagaaatg agcgccagtc gtcgtcggct ctcggcaccg aatgcgtatg 1800 attctccgcc agcatggctt cggccagtgc gtcgagcagc gcccgcttgt tcctgaagtg 1860 ccagtaaagc gccggctgct gaacccccaa ccgttccgcc agtttgcgtg tcgtcagacc 1920 gtctacgccg acctcgttca acaggtccag ggcggcacgg atcactgtat tcggctgcaa 1980 ctttgtcatg cttgacactt tatcactgat aaacataata tgtccaccaa cttatcagtg 2040 ataaagaatc cgcgcgttca atcggaccag cggaggctgg tccggaggcc agacgtgaaa 2100 cccaacatac ccctgatcgt aattctgagc actgtcgcgc tcgacgctgt cggcatcggc 2160 ctgattatgc cggtgctgcc gggcctcctg cgcgatctgg 
ttcactcgaa cgacgtcacc 2220 gcccactatg gcattctgct ggcgctgtat gcgttggtgc aatttgcctg cgcacctgtg 2280 ctgggcgcgc tgtcggatcg tttcgggcgg cggccaatct tgctcgtctc gctggccggc 2340 gccagatctg gggaaccctg tggttggcat gcacatacaa atggacgaac ggataaacct 2400 tttcacgccc ttttaaatat ccgattattc taataaacgc tcttttctct taggtttacc 2460 cgccaatata tcctgtcaaa cactgatagt ttaaactgaa ggcgggaaac gacaatctga 2520 tcatgagcgg agaattaagg gagtcacgtt atgacccccg ccgatgacgc gggacaagcc 2580 gttttacgtt tggaactgac agaaccgcaa cgttgaagga gccactcagc cgcgggtttc 2640 tggagtttaa tgagctaagc acatacgtca gaaaccatta ttgcgcgttc aaaagtcgcc 2700 taaggtcact atcagctagc aaatatttct tgtcaaaaat gctccactga cgttccataa 2760 attcccctcg gtatccaatt agagtctcat attcactctc aatccaaata atctgcaccg 2820 gatctggatc gtttcgcatg attgaacaag atggattgca cgcaggttct ccggccgctt 2880 gggtggagag gctattcggc tatgactggg cacaacagac aatcggctgc tctgatgccg 2940 ccgtgttccg gctgtcagcg caggggcgcc cggttctttt tgtcaagacc gacctgtccg 3000 gtgccctgaa tgaactgcag gacgaggcag cgcggctatc gtggctggcc acgacgggcg 3060 ttccttgcgc agctgtgctc gacgttgtca ctgaagcggg aagggactgg ctgctattgg 3120 gcgaagtgcc ggggcaggat ctcctgtcat ctcaccttgc tcctgccgag aaagtatcca 3180 tcatggctga tgcaatgcgg cggctgcata cgcttgatcc ggctacctgc ccattcgacc 3240 accaagcgaa acatcgcatc gagcgagcac gtactcggat ggaagccggt cttgtcgatc 3300 aggatgatct ggacgaagag catcaggggc tcgcgccagc cgaactgttc gccaggctca 3360 aggcgcgcat gcccgacggc gatgatctcg tcgtgaccca tggcgatgcc tgcttgccga 3420 atatcatggt ggaaaatggc cgcttttctg gattcatcga ctgtggccgg ctgggtgtgg 3480 cggaccgcta tcaggacata gcgttggcta cccgtgatat tgctgaagag cttggcggcg 3540 aatgggctga ccgcttcctc gtgctttacg gtatcgccgc tcccgattcg cagcgcatcg 3600 ccttctatcg ccttcttgac gagttcttct gagcgggact ctggggttcg aaatgaccga 3660 ccaagcgacg cccaacctgc catcacgaga tttcgattcc accgccgcct tctatgaaag 3720 gttgggcttc ggaatcgttt tccgggacgc cggctggatg atcctccagc gcggggatct 3780 catgctggag ttcttcgccc acgggatctc tgcggaacag gcggtcgaag gtgccgatat 3840 cattacgaca gcaacggccg acaagcacaa cgccacgatc ctgagcgaca 
atatgatcgg 3900 gcccggcgtc cacatcaacg gcgtcggcgg cgactgccca ggcaagaccg agatgcaccg 3960 cgatatcttg ctgcgttcgg atattttcgt ggagttcccg ccacagaccc ggatgatccc 4020 cgatcgttca aacatttggc aataaagttt cttaagattg aatcctgttg ccggtcttgc 4080 gatgattatc atataatttc tgttgaatta cgttaagcat gtaataatta acatgtaatg 4140 catgacgtta tttatgagat gggtttttat gattagagtc ccgcaattat acatttaata 4200 cgcgatagaa aacaaaatat agcgcgcaaa ctaggataaa ttatcgcgcg cggtgtcatc 4260 tatgttacta gatcgggcct cctgtcaatg ctggcggcgg ctctggtggt ggttctggtg 4320 gcggctctga gggtggtggc tctgagggtg gcggttctga gggtggcggc tctgagggag 4380 gcggttccgg tggtggctct ggttccggtg attttgatta tgaaaagatg gcaaacgcta 4440 ataagggggc tatgaccgaa aatgccgatg aaaacgcgct acagtctgac gctaaaggca 4500 aacttgattc tgtcgctact gattacggtg ctgctatcga tggtttcatt ggtgacgttt 4560 ccggccttgc taatggtaat ggtgctactg gtgattttgc tggctctaat tcccaaatgg 4620 ctcaagtcgg tgacggtgat aattcacctt taatgaataa tttccgtcaa tatttacctt 4680 ccctccctca atcggttgaa tgtcgccctt ttgtctttgg cccaatacgc aaaccgcctc 4740 tccccgcgcg ttggccgatt cattaatgca gctggcacga caggtttccc gactggaaag 4800 cgggcagtga gcgcaacgca attaatgtga gttagctcac tcattaggca ccccaggctt 4860 tacactttat gcttccggct cgtatgttgt gtggaattgt gagcggataa caatttcaca 4920 caggaaacag ctatgaccat gattacgcca agcttcacgt gaacatggtg gagcacgaca 4980 cacttgtcta ctccaaaaat atcaaagata cagtctcaga agaccaaagg gcaattgaga 5040 cttttcaaca aagggtaata tccggaaacc tcctcggatt ccattgccca gctatctgtc 5100 actttattgt gaagatagtg gaaaaggaag gtggctccta caaatgccat cattgcgata 5160 aaggaaaggc catcgttgaa gatgcctctg ccgacagtgg tcccaaagat ggacccccac 5220 ccacgaggag catcgtggaa aaagaagacg ttccaaccac gtcttcaaag caagtggatt 5280 gatgtgataa catggtggag cacgacacac ttgtctactc caaaaatatc aaagatacag 5340 tctcagaaga ccaaagggca attgagactt ttcaacaaag ggtaatatcc ggaaacctcc 5400 tcggattcca ttgcccagct atctgtcact ttattgtgaa gatagtggaa aaggaaggtg 5460 gctcctacaa atgccatcat tgcgataaag gaaaggccat cgttgaagat gcctctgccg 5520 acagtggtcc caaagatgga cccccaccca cgaggagcat cgtggaaaaa gaagacgttc 
5580 caaccacgtc ttcaaagcaa gtggattgat gtgatatctc cactgacgta agggatgacg 5640 cacaatccca ctatccttcg caagaccctt cctctatata aggaagttca tttcatttgg 5700 agaggacctc gaaattctca acacaacata tacaaaacaa acgaatctca agcaatcaag 5760 cattctactt ctattgcagc aatttaaatc atttctttta aagcaaaagc aattttctga 5820 aaattttcac catttacgaa cgatagggat ccgagaccca agcttgccac catgccgaag 5880 aagaagcgca aggtggacaa gaagtacagc atcggcctgg acatcggtac caacagcgtg 5940 ggctgggccg tgatcaccga cgagtacaag gtgcccagca agaagttcaa ggtgctgggc 6000 aacaccgacc gccacagcat caagaagaac ctgatcggcg ccctgctgtt cgacagcggc 6060 gagaccgccg aggccacccg cctgaagcgc accgcccgcc gccgctacac ccgccgcaag 6120 aaccgcatct gctacctgca ggagatcttc agcaacgaga tggccaaggt ggacgacagc 6180 ttcttccacc gcctggagga gagcttcctg gtggaggagg acaagaagca cgagcgccac 6240 cccatcttcg gcaacatcgt ggacgaggtg gcctaccacg agaagtaccc caccatctac 6300 cacctgcgca agaagctggt ggacagcacc gacaaggccg acctgcgcct gatctacctg 6360 gccctggccc acatgatcaa gttccgcggc cacttcctga tcgagggcga cctgaacccc 6420 gacaacagcg acgtggacaa gctgttcatc cagctggtgc agacctacaa ccagctgttc 6480 gaggagaacc ccatcaacgc cagcggcgtg gacgccaagg ccatcctgag cgcccgcctg 6540 agcaagagcc gccgcctgga gaacctgatc gcccagctgc ccggcgagaa gaagaacggc 6600 ctgttcggca acctgatcgc cctgagcctg ggcctgaccc ccaacttcaa gagcaacttc 6660 gacctggccg aggacgccaa gctgcagctg agcaaggaca cctacgacga cgacctggac 6720 aacctgctgg cccagatcgg cgaccagtac gccgacctgt tcctggccgc caagaacctg 6780 agcgacgcca tcctgctgag cgacatcctg cgcgtgaaca ccgagatcac caaggccccc 6840 ctgagcgcca gcatgatcaa gcgctacgac gagcaccacc aggacctgac cctgctgaag 6900 gccctggtgc gccagcagct gcccgagaag tacaaggaga tcttcttcga ccagagcaag 6960 aacggctacg ccggctacat cgacggcggc gccagccagg aggagttcta caagttcatc 7020 aagcccatcc tggagaagat ggacggcacc gaggagctgc tggtgaagct gaaccgcgag 7080 gacctgctgc gcaagcagcg caccttcgac aacggcagca tcccccacca gatccacctg 7140 ggcgagctgc acgccatcct gcgccgccag gaggacttct accccttcct gaaggacaac 7200 cgcgagaaga tcgagaagat cctgaccttc cgcatcccct actacgtggg ccccctggcc 7260 
cgcggcaaca gccgcttcgc ctggatgacc cgcaagagcg aggagaccat caccccctgg 7320 aacttcgagg aggtggtgga caagggcgcc agcgcccaga gcttcatcga gcgcatgacc 7380 aacttcgaca agaacctgcc caacgagaag gtgctgccca agcacagcct gctgtacgag 7440 tacttcaccg tgtacaacga gctgaccaag gtgaagtacg tgaccgaggg catgcgcaag 7500 cccgccttcc tgagcggcga gcagaagaag gccatcgtgg acctgctgtt caagaccaac 7560 cgcaaggtga ccgtgaagca gctgaaggag gactacttca agaagatcga gtgcttcgac 7620 agcgtggaga tcagcggcgt ggaggaccgc ttcaacgcca gcctgggcac ctaccacgac 7680 ctgctgaaga tcatcaagga caaggacttc ctggacaacg aggagaacga ggacatcctg 7740 gaggacatcg tgctgaccct gaccctgttc gaggaccgcg agatgatcga ggagcgcctg 7800 aagacctacg cccacctgtt cgacgacaag gtgatgaagc agctgaagcg ccgccgctac 7860 accggctggg gccgcctgag ccgcaagctt atcaacggca tccgcgacaa gcagagcggc 7920 aagaccatcc tggacttcct gaagagcgac ggcttcgcca accgcaactt catgcagctg 7980 atccacgacg acagcctgac cttcaaggag gacatccaga aggcccaggt gagcggccag 8040 ggcgacagcc tgcacgagca catcgccaac ctggccggca gccccgccat caagaagggc 8100 atcctgcaga ccgtgaaggt ggtggacgag ctggtgaagg tgatgggccg ccacaagccc 8160 gagaacatcg tgatcgagat ggcccgcgag aaccagacca cccagaaggg ccagaagaac 8220 agccgcgagc gcatgaagcg catcgaggag ggcatcaagg agctgggcag ccagatcctg 8280 aaggagcacc ccgtggagaa cacccagctg cagaacgaga agctgtacct gtactacctg 8340 cagaacggcc gcgacatgta cgtggaccag gagctggaca tcaaccgcct gagcgactac 8400 gacgtggacc acatcgtgcc ccagagcttc ctgaaggacg acagcatcga caacaaggtg 8460 ctgacccgca gcgacaagaa ccgcggcaag agcgacaacg tgcccagcga ggaggtggtg 8520 aagaagatga agaactactg gcgccagctg ctgaacgcca agctgatcac ccagcgcaag 8580 ttcgacaacc tgaccaaggc cgagcgcggc ggcctgagcg agctggacaa ggccggcttc 8640 atcaagcgcc agctggtgga gacccgccag atcaccaagc acgtggccca gatcctggac 8700 agccgcatga acaccaagta cgacgagaac gacaagctga tccgcgaggt gaaggtgatc 8760 accctgaaga gcaagctggt gagcgacttc cgcaaggact tccagttcta caaggtgcgc 8820 gagatcaaca actaccacca cgcccacgac gcctacctga acgccgtggt gggcaccgcc 8880 ctgatcaaga agtaccccaa gctggagagc gagttcgtgt acggcgacta caaggtgtac 8940 gacgtgcgca 
agatgatcgc caagagcgag caggagatcg gcaaggccac cgccaagtac 9000 ttcttctaca gcaacatcat gaacttcttc aagaccgaga tcaccctggc caacggcgag 9060 atccgcaagc gccccctgat cgagaccaac ggcgagaccg gcgagatcgt gtgggacaag 9120 ggccgcgact tcgccaccgt gcgcaaggtg ctgagcatgc cccaggtgaa catcgtgaag 9180 aagaccgagg tgcagaccgg cggcttcagc aaggagagca tcctgcccaa gcgcaacagc 9240 gacaagctga tcgcccgcaa gaaggactgg gaccccaaga agtacggcgg cttcgacagc 9300 cccaccgtgg cctacagcgt gctggtggtg gccaaggtgg agaagggcaa gagcaagaag 9360 ctgaagagcg tgaaggagct gctgggcatc accatcatgg agcgcagcag cttcgagaag 9420 aaccccatcg acttcctgga ggccaagggc tacaaggagg tgaagaagga cctgatcatc 9480 aagctgccca agtacagcct gttcgagctg gagaacggcc gcaagcgcat gctggccagc 9540 gccggcgagc tgcagaaggg caacgagctg gccctgccca gcaagtacgt gaacttcctg 9600 tacctggcca gccactacga gaagctgaag ggcagccccg aggacaacga gcagaagcag 9660 ctgttcgtgg agcagcacaa gcactacctg gacgagatca tcgagcagat cagcgagttc 9720 agcaagcgcg tgatcctggc cgacgccaac ctggacaagg tgctgagcgc ctacaacaag 9780 caccgcgaca agcccatccg cgagcaggcc gagaacatca tccacctgtt caccctgacc 9840 aacctgggcg cccccgccgc cttcaagtac ttcgacacca ccatcgaccg caagcgctac 9900 accagcacca aggaggtgct ggacgccacc ctgatccacc agagcatcac cggtctgtac 9960 gagacccgca tcgacctgag ccagctgggc ggcgacggcg gctccggaaa gagaccagct 10020 gctaccaaga aggctggaca ggctaagaag aagaagtaag agctcgtccg caaaaatcac 10080 cagtctctct ctacaaatct atctctctct atttttctcc agaataatgt gtgagtagtt 10140 cccagataag ggaattaggg ttcttatagg gtttcgctca tgtgttgagc atataagaaa 10200 cccttagtat gtatttgtat ttgtaaaata cttctatcaa taaaatttct aattcctaaa 10260 accaaaatcc agtgactata attagccttc ctcataagtt gcactacttc agcgtcccaa 10320 ttgcaccctt accacgaaga caggtttgtc caatcccata ttgcgacctt ggcagggggt 10380 tcgcaagtcc cacccgaaac gttgctgaag gctcaggttt ctgagcgaca aatgctttaa 10440 acgcgagttc ccgctcataa cctggaccga atgcgggtcg acagaaatct caaaattccg 10500 gcagaacaat tttgaatctc gatccgtaga aacgagacgg tcattgtttt agttccacca 10560 cgattatatt tgaaatttac gtgagtgtga gtgagacttg cataagaaaa taaaatcttt 10620 agttgggaaa 
aaattcaata atataaatgg gcttgagaag gaagcgaggg ataggccttt 10680 ttctaaaata ggcccattta agctattaac aatcttcaaa agtaccacag cgcttaggta 10740 aagaaagcag ctgagtttat atatggttag agacgaagta gtgattgaag tcatatctac 10800 caacacaaag ttttagagct agaaatagca agttaaaata aggctagtcc gttatcaact 10860 tgaaaaagtg gcaccgagtc ggtgcttttt tcaaatgaca attaaccacc gtgtattcgt 10920 tataacatca ggcagtttaa gtcgggacaa taggagccgc aatacacagt ttaccgcatc 10980 ttgacctaac tagaaatctc aaaattccgg cagaacaatt ttgaatctcg atccgtagaa 11040 acgagacggt cattgtttta gttccaccac gattatattt gaaatttacg tgagtgtgag 11100 tgagacttgc ataagaaaat aaaatcttta gttgggaaaa aattcaataa tataaatggg 11160 cttgagaagg aagcgaggga taggcctttt tctaaaatag gcccatttaa gctattaaca 11220 atcttcaaaa gtaccacagc gcttaggtaa agaaagcagc tgagtttata tatggttaga 11280 gacgaagtag tgattgtttt cagagtttag cgcattagtt ttagagctag aaatagcaag 11340 ttaaaataag gctagtccgt tatcaacttg aaaaagtggc accgagtcgg tgcttttttc 11400 ctgcgcttcg atcttagatc aaggttgcct ggagtgtctc ttgaactgta cagagttgag 11460 ctagtcgcga agcagatcag caacacattt ctgcttgcga gaaatctcaa aattccggca 11520 gaacaatttt gaatctcgat ccgtagaaac gagacggtca ttgttttagt tccaccacga 11580 ttatatttga aatttacgtg agtgtgagtg agacttgcat aagaaaataa aatctttagt 11640 tgggaaaaaa ttcaataata taaatgggct tgagaaggaa gcgagggata ggcctttttc 11700 taaaataggc ccatttaagc tattaacaat cttcaaaagt accacagcgc ttaggtaaag 11760 aaagcagctg agtttatata tggttagaga cgaagtagtg attgctcaaa gagcatgaca 11820 tctgttttag agctagaaat agcaagttaa aataaggcta gtccgttatc aacttgaaaa 11880 agtggcaccg agtcggtgct tttttactag tctcgagcac gtggaattca ctggccgtcg 11940 ttttacaacg tcgtgactgg gaaaaccctg gcgttaccca acttaatcgc cttgcagcac 12000 atcccccttt cgccagctgg cgtaatagcg aagaggcccg caccgatcgc ccttcccaac 12060 agttgcgcag cctgaatggc gcccgctcct ttcgctttct tcccttcctt tctcgccacg 12120 ttcgccggct ttccccgtca agctctaaat cgggggctcc ctttagggtt ccgatttagt 12180 gctttacggc acctcgaccc caaaaaactt gatttgggtg atggttcacg tagtgggcca 12240 tcgccctgat agacggtttt tcgccctttg acgttggagt ccacgttctt taatagtgga 
12300 ctcttgttcc aaactggaac aacactcaac cctatctcgg gctattcttt tgatttataa 12360 gggattttgc cgatttcgga accaccatca aacaggattt tcgcctgctg gggcaaacca 12420 gcgtggaccg cttgctgcaa ctctctcagg gccaggcggt gaagggcaat cagctgttgc 12480 ccgtctcact ggtgaaaaga aaaaccaccc cagtacatta aaaacgtccg caatgtgtta 12540 ttaagttgtc taagcgtcaa tttgtttaca ccacaatata tcctgccacc agccagccaa 12600 cagctccccg accggcagct cggcacaaaa tcaccactcg atacaggcag cccatcagtc 12660 cgggacggcg tcagcgggag agccgttgta aggcggcaga ctttgctcat gttaccgatg 12720 ctattcggaa gaacggcaac taagctgccg ggtttgaaac acggatgatc tcgcggaggg 12780 tagcatgttg attgtaacga tgacagagcg ttgctgcctg tgatcaaata tcatctccct 12840 cgcagagatc cgaattatca gccttcttat tcatttctcg cttaaccgtg acaggctgtc 12900 gatcttgaga actatgccga cataatagga aatcgctgga taaagccgct gaggaagctg 12960 agtggcgcta tttctttaga agtgaacgtt gacgatatca actcccctat ccattgctca 13020 ccgaatggta caggtcgggg acccgaagtt ccgactgtcg gcctgatgca tccccggctg 13080 atcgacccca gatctggggc tgagaaagcc cagtaaggaa acaactgtag gttcgagtcg 13140 cgagatcccc cggaaccaaa ggaagtaggt taaacccgct ccgatcaggc cgagccacgc 13200 caggccgaga acattggttc ctgtaggcat cgggattggc ggatcaaaca ctaaagctac 13260 tggaacgagc agaagtcctc cggccgccag ttgccaggcg gtaaaggtga gcagaggcac 13320 gggaggttgc cacttgcggg tcagcacggt tccgaacgcc atggaaaccg cccccgccag 13380 gcccgctgcg acgccgacag gatctagcgc tgcgtttggt gtcaacacca acagcgccac 13440 gcccgcagtt ccgcaaatag cccccaggac cgccatcaat cgtatcgggc tacctagcag 13500 agcggcagag atgaacacga ccatcagcgg ctgcacagcg cctaccgtcg ccgcgacccc 13560 gcccggcagg cggtagaccg aaataaacaa caagctccag aatagcgaaa tattaagtgc 13620 gccgaggatg aagatgcgca tccaccagat tcccgttgga atctgtcgga cgatcatcac 13680 gagcaataaa cccgccggca acgcccgcag cagcataccg gcgacccctc ggcctcgctg 13740 ttcgggctcc acgaaaacgc cggacagatg cgccttgtga gcgtccttgg ggccgtcctc 13800 ctgtttgaag accgacagcc caatgatctc gccgtcgatg taggcgccga atgccacggc 13860 atctcgcaac cgttcagcga acgcctccat gggctttttc tcctcgtgct cgtaaacgga 13920 cccgaacatc tctggagctt tcttcagggc cgacaatcgg 
atctcgcgga aatcctgcac 13980 gtcggccgct ccaagccgtc gaatctgagc cttaatcaca attgtcaatt ttaatcctct 14040 gtttatcggc agttcgtaga gcgcgccgtg cgtcccgagc gatactgagc gaagcaagtg 14100 cgtcgagcag tgcccgcttg ttcctgaaat gccagtaaag cgctggctgc tgaaccccca 14160 gccggaactg accccacaag gccctagcgt ttgcaatgca ccaggtcatc attgacccag 14220 gcgtgttcca ccaggccgct gcctcgcaac tcttcgcagg cttcgccgac ctgctcgcgc 14280 cacttcttca cgcgggtgga atccgatccg cacatgaggc ggaaggtttc cagcttgagc 14340 gggtacggct cccggtgcga gctgaaatag tcgaacatcc gtcgggccgt cggcgacagc 14400 ttgcggtact tctcccatat gaatttcgtg tagtggtcgc cagcaaacag cacgacgatt 14460 tcctcgtcga tcaggacctg gcaacgggac gttttcttgc cacggtccag gacgcggaag 14520 cggtgcagca gcgacaccga ttccaggtgc ccaacgcggt cggacgtgaa gcccatcgcc 14580 gtcgcctgta ggcgcgacag gcattcctcg gccttcgtgt aataccggcc attgatcgac 14640 cagcccaggt cctggcaaag ctcgtagaac gtgaaggtga tcggctcgcc gataggggtg 14700 cgcttcgcgt actccaacac ctgctgccac accagttcgt catcgtcggc ccgcagctcg 14760 acgccggtgt aggtgatctt cacgtccttg ttgacgtgga aaatgacctt gttttgcagc 14820 gcctcgcgcg ggattttctt gttgcgcgtg gtgaacaggg cagagcgggc cgtgtcgttt 14880 ggcatcgctc gcatcgtgtc cggccacggc gcaatatcga acaaggaaag ctgcatttcc 14940 ttgatctgct gcttcgtgtg tttcagcaac gcggcctgct tggcctcgct gacctgtttt 15000 gccaggtcct cgccggcggt ttttcgcttc ttggtcgtca tagttcctcg cgtgtcgatg 15060 gtcatcgact tcgccaaacc tgccgcctcc tgttcgagac gacgcgaacg ctccacggcg 15120 gccgatggcg cgggcagggc agggggagcc agttgcacgc tgtcgcgctc gatcttggcc 15180 gtagcttgct ggaccatcga gccgacggac tggaaggttt cgcggggcgc acgcatgacg 15240 gtgcggcttg cgatggtttc ggcatcctcg gcggaaaacc ccgcgtcgat cagttcttgc 15300 ctgtatgcct tccggtcaaa cgtccgattc attcaccctc cttgcgggat tgccccgact 15360 cacgccgggg caatgtgccc ttattcctga tttgacccgc ctggtgcctt ggtgtccaga 15420 taatccacct tatcggcaat gaagtcggtc ccgtagaccg tctggccgtc cttctcgtac 15480 ttggtattcc gaatcttgcc ctgcacgaat accagcgacc ccttgcccaa atacttgccg 15540 tgggcctcgg cctgagagcc aaaacacttg atgcggaaga agtcggtgcg ctcctgcttg 15600 tcgccggcat cgttgcgcca 
catctaggta ctaaaacaat tcatccagta aaatataata 15660 ttttattttc tcccaatcag gcttgatccc cagtaagtca aaaaatagct cgacatactg 15720 ttcttccccg atatcctccc tgatcgaccg gacgcagaag gcaatgtcat accacttgtc 15780 cgccctgccg cttctcccaa gatcaataaa gccacttact ttgccatctt tcacaaagat 15840 gttgctgtct cccaggtcgc cgtgggaaaa gacaagttcc tcttcgggct tttccgtctt 15900 taaaaaatca tacagctcgc gcggatcttt aaatggagtg tcttcttccc agttttcgca 15960 atccacatcg gccagatcgt tattcagtaa gtaatccaat tcggctaagc ggctgtctaa 16020 gctattcgta tagggacaat ccgatatgtc gatggagtga aagagcctga tgcactccgc 16080 atacagctcg ataatctttt cagggctttg ttcatcttca tactcttccg agcaaaggac 16140 gccatcggcc tcactcatga gcagattgct ccagccatca tgccgttcaa agtgcaggac 16200 ctttggaaca ggcagctttc cttccagcca tagcatcatg tccttttccc gttccacatc 16260 ataggtggtc cctttatacc ggctgtccgt catttttaaa tataggtttt cattttctcc 16320 caccagctta tataccttag caggagacat tccttccgta tcttttacgc agcggtattt 16380 ttcgatcagt tttttcaatt ccggtgatat tctcatttta gccatttatt atttccttcc 16440 tcttttctac agtatttaaa gataccccaa gaagctaatt ataacaagac gaactccaat 16500 tcactgttcc ttgcattcta aaaccttaaa taccagaaaa cagctttttc aaagttgttt 16560 tcaaagttgg cgtataacat agtatcgacg gagccgattt tgaaaccaca attatgggtg 16620 atgctgccaa cttactgatt tagtgtatga tggtgttttt gaggtgctcc agtggcttct 16680 gtgtctatca gctgtccctc ctgttcagct actgacgggg tggtgcgtaa cggcaaaagc 16740 accgccggac atcagcgcta tctctgctct cactgccgta aaacatggca actgcagttc 16800 acttacaccg cttctcaacc cggtacgcac cagaaaatca ttgatatggc catgaatggc 16860 gttggatgcc gggcaacagc ccgcattatg ggcgttggcc tcaacacgat tttacgtcac 16920 ttaaaaaact caggccgcag tcggtaacct cgcgcataca gccgggcagt gacgtcatcg 16980 tctgcgcgga aatggacgaa cagtggggct atgtcggggc taaatcgcgc cagcgctggc 17040 tgttttacgc gtatgacagt ctccggaaga cggttgttgc gcacgtattc ggtgaacgca 17100 ctatggcgac gctggggcgt cttatgagcc tgctgtcacc ctttgacgtg gtgatatgga 17160 tgacggatgg ctggccgctg tatgaatccc gcctgaaggg aaagctgcac gtaatcagca 17220 agcgatatac gcagcgaatt gagcggcata acctgaatct gaggcagcac ctggcacggc 17280 
tgggacggaa gtcgctgtcg ttctcaaaat cggtggagct gcatgacaaa gtcatcgggc 17340 attatctgaa cataaaacac tatcaataag ttggagtcat tacccaatta tgatagaatt 17400 tacaagctat aaggttattg tcctgggttt caagcattag tccatgcaag tttttatgct 17460 ttgcccattc tatagatata ttgataagcg cgctgcctat gccttgcccc ctgaaatcct 17520 tacatacggc gatatcttct atataaaaga tatattatct tatcagtatt gtcaatatat 17580 tcaaggcaat ctgcctcctc atcctcttca tcctcttcgt cttggtagct ttttaaatat 17640 ggcgcttcat agagtaattc tgtaaaggtc caattctcgt tttcatacct cggtataatc 17700 ttacctatca cctcaaatgg ttcgctgggt ttatcgcacc cccgaacacg agcacggcac 17760 ccgcgaccac tatgccaaga atgcccaagg taaaaattgc cggccccgcc atgaagtccg 17820 tgaatgcccc gacggccgaa gtgaagggca ggccgccacc caggccgccg ccctcactgc 17880 ccggcacctg gtcgctgaat gtcgatgcca gcacctgcgg cacgtcaatg cttccgggcg 17940 tcgcgctcgg gctgatcgcc catcccgtta ctgccccgat cccggcaatg gcaaggactg 18000 ccagcgctgc catttttggg gtgaggccgt tcgcggccga ggggcgcagc ccctgggggg 18060 atgggaggcc cgcgttagcg ggccgggagg gttcgagaag ggggggcacc ccccttcggc 18120 gtgcgcggtc acgcgcacag ggcgcagccc tggttaaaaa caaggtttat aaatattggt 18180 ttaaaagcag gttaaaagac aggttagcgg tggccgaaaa acgggcggaa acccttgcaa 18240 atgctggatt ttctgcctgt ggacagcccc tcaaatgtca ataggtgcgc ccctcatctg 18300 tcagcactct gcccctcaag tgtcaaggat cgcgcccctc atctgtcagt agtcgcgccc 18360 ctcaagtgtc aataccgcag ggcacttatc cccaggcttg tccacatcat ctgtgggaaa 18420 ctcgcgtaaa atcaggcgtt ttcgccgatt tgcgaggctg gccagctcca cgtcgccggc 18480 cgaaatcgag cctgcccctc atctgtcaac gccgcgccgg gtgagtcggc ccctcaagtg 18540 tcaacgtccg cccctcatct gtcagtgagg gccaagtttt ccgcgaggta tccacaacgc 18600 cggcggccgc ggtgtctcgc acacggcttc gacggcgttt ctggcgcgtt tgcagggcca 18660 tagacggccg ccagcccagc ggcgagggca accagcccgg 18700 <210> 19 <211> 304 <212> DNA <213> Artificial Sequence <220> <223> P_U6 <400> 19 agaaatctca aaattccggc agaacaattt tgaatctcga tccgtagaaa cgagacggtc 60 attgttttag ttccaccacg attatatttg aaatttacgt gagtgtgagt gagacttgca 120 taagaaaata aaatctttag ttgggaaaaa attcaataat ataaatgggc ttgagaagga 180 
agcgagggat aggccttttt ctaaaatagg cccatttaag ctattaacaa tcttcaaaag 240
taccacagcg cttaggtaaa gaaagcagct gagtttatat atggttagag acgaagtagt 300
gatt 304

<210> 20
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<223> F_C9
<400> 20
gaccatcctg gacttcctga agagc 25

<210> 21
<211> 27
<212> DNA
<213> Artificial Sequence
<220>
<223> R_C9
<400> 21
tgcaggtagt acaggtacag cttctcg 27

<210> 22
<211> 19
<212> DNA
<213> Artificial Sequence
<220>
<223> F_35S
<400> 22
gctcctacaa atgccatca 19

<210> 23
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> R_35S
<400> 23
gatagtggga ttgtgcgtca 20
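The primer entries SEQ ID NOs: 20 to 23 each declare a length under <211>, so the printed sequences can be checked against those values. The following is a minimal sketch in Python, not part of the patent text: the names `PRIMERS`, `clean`, `reverse_complement`, and `length_report` are illustrative assumptions, and the sequences are copied from the listing above.

```python
# Primer entries copied from SEQ ID NOs: 20-23 of the sequence listing.
# name -> (length declared under <211>, sequence as printed, with spaces)
PRIMERS = {
    "F_C9": (25, "gaccatcctg gacttcctga agagc"),
    "R_C9": (27, "tgcaggtagt acaggtacag cttctcg"),
    "F_35S": (19, "gctcctacaa atgccatca"),
    "R_35S": (20, "gatagtggga ttgtgcgtca"),
}

# Translation table for Watson-Crick complements of lowercase DNA bases.
_COMPLEMENT = str.maketrans("acgt", "tgca")


def clean(seq: str) -> str:
    """Remove the block spacing used in the listing and lowercase the bases."""
    return "".join(seq.split()).lower()


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (a, c, g, t only)."""
    return clean(seq).translate(_COMPLEMENT)[::-1]


def length_report(primers=PRIMERS) -> dict:
    """Map each primer name to True when its length equals the <211> value."""
    return {
        name: len(clean(seq)) == declared
        for name, (declared, seq) in primers.items()
    }


if __name__ == "__main__":
    for name, ok in length_report().items():
        print(name, "length matches <211>" if ok else "length mismatch")
    print("R_C9 reverse complement:", reverse_complement(PRIMERS["R_C9"][1]))
```

A reverse primer anneals to the strand opposite the one printed in a listing, so it is its reverse complement, rather than the primer itself, that one would search for in a listed template sequence.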

Claims (11)

1. A PMT1b (Putrescine N-methyltransferase 1b) gene consisting of the nucleotide sequence of SEQ ID NO: 1.

2. The gene of claim 1, wherein the gene is a nicotine biosynthesis gene.

3. A plant cell genetically engineered so that the expression or activity of the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1, or of the protein expressed by the PMT1b gene, is reduced compared to the parent cell.

4. The plant cell of claim 3, wherein the PMT1b gene is at least one gene selected from the group consisting of the PMT1a gene consisting of the nucleotide sequence of SEQ ID NO: 2, the PMT2 gene consisting of the nucleotide sequence of SEQ ID NO: 3, the PMT3 gene consisting of the nucleotide sequence of SEQ ID NO: 4, and the PMT4 gene consisting of the nucleotide sequence of SEQ ID NO: 5, together with the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1.

5. The plant cell of claim 3, wherein the genetic engineering is performed by at least one selected from the group consisting of an RNA interference (RNAi) system, a meganuclease system, a zinc finger nuclease system, a TALEN (Transcription Activator-Like Effector Nuclease) system, a CRISPR/Cas9 system, X-ray irradiation, gamma-ray irradiation, ethyl methanesulfonate treatment, and dimethyl sulfate treatment.

6. The plant cell of claim 3, wherein the plant is cultivated tobacco (Nicotiana tabacum).

7. A plant comprising the plant cell of any one of claims 3 to 6.

8. A method for inhibiting nicotine biosynthesis in a plant, comprising genetically engineering the plant so that the expression or activity of the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1, or of the protein expressed by the PMT1b gene, is reduced compared to the parent cell.

9. The method of claim 8, wherein the PMT1b gene is at least one gene selected from the group consisting of the PMT1a gene consisting of the nucleotide sequence of SEQ ID NO: 2, the PMT2 gene consisting of the nucleotide sequence of SEQ ID NO: 3, the PMT3 gene consisting of the nucleotide sequence of SEQ ID NO: 4, and the PMT4 gene consisting of the nucleotide sequence of SEQ ID NO: 5, together with the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1.

10. A method for producing a plant cell in which nicotine biosynthesis is suppressed, comprising genetically engineering the cell so that the expression or activity of the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1, or of the protein expressed by the PMT1b gene, is reduced compared to the parent cell.

11. The method of claim 10, wherein the PMT1b gene is at least one gene selected from the group consisting of the PMT1a gene consisting of the nucleotide sequence of SEQ ID NO: 2, the PMT2 gene consisting of the nucleotide sequence of SEQ ID NO: 3, the PMT3 gene consisting of the nucleotide sequence of SEQ ID NO: 4, and the PMT4 gene consisting of the nucleotide sequence of SEQ ID NO: 5, together with the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1.

KR1020190175480A 2019-08-22 2019-12-26 A novel nicotine biosynthetic gene and use thereof KR102389358B1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR20190103310 2019-08-22
KR1020190103310 2019-08-22

Publications (2)

Publication Number Publication Date
KR20210023646A (en) 2021-03-04
KR102389358B1 (en) 2022-04-22

Family

ID=75174660

Family Applications (2)

Application Number Title Priority Date Filing Date
KR1020190175479A KR102395185B1 (en) 2019-08-22 2019-12-26 Genetically engineered plant cell and using method of the same
KR1020190175480A KR102389358B1 (en) 2019-08-22 2019-12-26 A novel nicotine biosynthetic gene and use thereof

Family Applications Before (1)

Application Number Title Priority Date Filing Date
KR1020190175479A KR102395185B1 (en) 2019-08-22 2019-12-26 Genetically engineered plant cell and using method of the same

Country Status (1)

Country Link
KR (2) KR102395185B1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022270788A1 (en) * 2021-06-21 2022-12-29 주식회사 케이티앤지 Plant cells having engineered qpt gene and method for using same

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100899993B1 (en) * 2007-03-07 2009-05-28 한국생명공학연구원 A Low Nicotine Transgenic Tabacco and A method of producing thereof
WO2018222667A1 (en) * 2017-05-31 2018-12-06 22Nd Century Limited, Llc Genome editing methods for producing low-nicotine tobacco products
US20200029522A1 (en) * 2018-07-26 2020-01-30 Altria Client Services Llc Compositions and Methods Based on PMT Engineering for Producing Tobacco Plants and Products Having Altered Alkaloid Levels

Also Published As

Publication number Publication date
KR20210023645A (en) 2021-03-04
KR102389358B1 (en) 2022-04-22
KR102395185B1 (en) 2022-05-10

Similar Documents

Publication Publication Date Title
Čermák et al. High-frequency, precise modification of the tomato genome
AU2013254857B2 (en) Targeted genome engineering in plants
WO2018103686A1 (en) Chloroplast genome editing method
CN110157726B (en) Method for site-directed substitution of plant genome
US6734019B1 (en) Isolated DNA that encodes an Arabidopsis thaliana MSH3 protein involved in DNA mismatch repair and a method of modifying the mismatch repair system in a plant transformed with the isolated DNA
JP2015500648A (en) Compositions and methods for modifying a given target nucleic acid sequence
JPS61502166A (en) Improved methods and vectors for transformation of plant cells
US20190352653A1 (en) Conferring resistance to geminiviruses in plants in alternative manner to gene drive, using crispr/cas systems
CN110892074A (en) Compositions and methods for increasing the shelf life of bananas
CN116157144A (en) Uracil stabilizing proteins and active fragments and variants thereof and methods of use
AU2010257316A1 (en) Transformation Vectors
KR102389358B1 (en) A novel nicotine biosynthetic gene and use thereof
CN113846075A (en) MAD7-NLS fusion protein, nucleic acid construct for site-directed editing of plant genome and application thereof
CN112779282A (en) Method for creating multiple fruit color materials from tomato background materials by gene editing technology
CN110791525A (en) Method for knocking out rice tillering number regulation gene OsFWL4 to increase rice tillering number and yield
KR102516522B1 (en) pPLAⅡη gene inducing haploid plant and uses thereof
KR20180128864A (en) Gene editing composition comprising sgRNAs with matched 5&#39; nucleotide and gene editing method using the same
CN114438056A (en) CasF2 protein, CRISPR/Cas gene editing system and application thereof in plant gene editing
CN110079535B (en) Maize ZmPIF3s mutant protein, coding gene thereof and application thereof in breeding
EP4130262A1 (en) Plant cells having engineered qpt gene and method for using same
KR102522125B1 (en) Plant with reduced heavy metals and methods for preparation thereof
JP2018113967A (en) Means for creating adenovirus vector for cloning large nucleic acids
KR102550308B1 (en) Method for producing genome-edited tomato plant with increased salt tolerance by SlHKT1;2 gene editing and genome-edited tomato plant with increased salt tolerance produced by the same method
CN114540366B (en) Rice fertility regulating gene GMS3, mutant and application thereof
WO2001064865A1 (en) Improved promoters and utilization thereof

Legal Events

Date Code Title Description
E902 Notification of reason for refusal
AMND Amendment
E601 Decision to refuse application
X091 Application refused [patent]
AMND Amendment
X701 Decision to grant (after re-examination)
GRNT Written decision to grant